community[minor]: Firecrawl.dev integration (#20364)

Added the [FireCrawl](https://firecrawl.dev) document loader. Firecrawl
crawls and converts any website into LLM-ready data. It crawls all
accessible subpages and gives you clean markdown for each.

    - **Description:** Adds FireCrawl data loader
    - **Dependencies:** firecrawl-py
    - **Twitter handle:** @mendableai 

ccing contributors: (@ericciarla @nickscamara)
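
A minimal usage sketch for the loader described above. It assumes `FireCrawlLoader` is importable from `langchain_community.document_loaders` and accepts `url`, `api_key`, and `mode` keyword arguments, with `"crawl"` (all accessible subpages) and `"scrape"` (a single page) as the modes. The helper names `firecrawl_loader_kwargs` and `load_site` are hypothetical and exist only for illustration.

```python
from typing import Any, Dict, List, Optional

# Assumed set of modes: "crawl" follows subpages, "scrape" fetches one page.
VALID_MODES = ("crawl", "scrape")


def firecrawl_loader_kwargs(
    url: str, mode: str = "crawl", api_key: Optional[str] = None
) -> Dict[str, Any]:
    """Validate the mode and build keyword arguments for the loader."""
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {VALID_MODES}, got {mode!r}")
    return {"url": url, "mode": mode, "api_key": api_key}


def load_site(url: str, mode: str = "crawl", api_key: Optional[str] = None) -> List[Any]:
    """Load a site as documents (requires firecrawl-py and a Firecrawl API key)."""
    # Imported lazily so the helper above works without the optional dependency.
    from langchain_community.document_loaders import FireCrawlLoader

    loader = FireCrawlLoader(**firecrawl_loader_kwargs(url, mode, api_key))
    return loader.load()
```

Calling `load_site("https://firecrawl.dev", mode="scrape", api_key="...")` would then be expected to return one document per fetched page, each holding that page's markdown.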

---------

Co-authored-by: Bagatur <baskaryan@gmail.com>
Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>
Nicolas
2024-04-12 15:13:48 -04:00
committed by GitHub
parent a1b105ac00
commit ad04585e30
6 changed files with 295 additions and 2 deletions

@@ -65,6 +65,7 @@ EXPECTED_ALL = [
     "FaunaLoader",
     "FigmaFileLoader",
     "FileSystemBlobLoader",
+    "FireCrawlLoader",
     "GCSDirectoryLoader",
     "GCSFileLoader",
     "GeoDataFrameLoader",