..
blob_loaders
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
parsers
community: bytes as a source to AzureAIDocumentIntelligenceLoader
( #26618 )
2024-11-07 03:40:21 +00:00
__init__.py
[community] Added PebbloTextLoader for loading text data in PebbloSafeLoader ( #26582 )
2024-09-19 09:59:04 -04:00
acreom.py
community[patch]: Add missing annotations ( #24890 )
2024-07-31 18:13:44 +00:00
airbyte_json.py
airbyte.py
airtable.py
docs: fix kwargs docstring ( #25010 )
2024-08-02 19:54:54 -07:00
apify_dataset.py
multiple: pydantic 2 compatibility, v0.3 ( #26443 )
2024-09-13 14:38:45 -07:00
arcgis_loader.py
arxiv.py
docs: Arxiv docs update ( #23871 )
2024-07-05 11:43:51 -04:00
assemblyai.py
astradb.py
multiple: update removal targets ( #25361 )
2024-08-14 09:50:39 -04:00
async_html.py
community[patch]: Release 0.2.11 ( #24989 )
2024-08-02 20:08:44 +00:00
athena.py
community: make AthenaLoader profile_name optional and fix type hint ( #24958 )
2024-08-05 14:28:58 +00:00
azlyrics.py
azure_ai_data.py
azure_blob_storage_container.py
azure_blob_storage_file.py
baiducloud_bos_directory.py
baiducloud_bos_file.py
base_o365.py
community: Allow other than default parsers in SharePointLoader and OneDriveLoader ( #27716 )
2024-11-06 17:44:34 -05:00
base.py
bibtex.py
bigquery.py
multiple: update removal targets ( #25361 )
2024-08-14 09:50:39 -04:00
bilibili.py
blackboard.py
community: add flag to toggle progress bar ( #24463 )
2024-07-20 13:18:02 +00:00
blockchain.py
community: add supported blockchains to Blockchain Document Loader ( #25428 )
2024-08-23 14:39:42 +00:00
brave_search.py
browserbase.py
browserless.py
cassandra.py
chatgpt.py
chm.py
chromium.py
community[minor]: add user agent for web scraping loaders ( #22480 )
2024-06-05 15:20:34 +00:00
college_confidential.py
concurrent.py
confluence.py
community[minor]: Fix missing 'keep_newlines' parameter forward-pass to 'process_pages' function in confluence loader ( #20086 ) ( #20087 )
2024-08-23 12:59:38 +00:00
conllu.py
couchbase.py
csv_loader.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
cube_semantic.py
datadog_logs.py
dataframe.py
dedoc.py
community[minor]: added new document loaders based on dedoc library ( #24303 )
2024-07-23 02:04:53 +00:00
diffbot.py
directory.py
community: glob multiple patterns when using DirectoryLoader ( #22852 )
2024-06-18 09:24:50 -07:00
discord.py
doc_intelligence.py
community: bytes as a source to AzureAIDocumentIntelligenceLoader
( #26618 )
2024-11-07 03:40:21 +00:00
docugami.py
multiple: pydantic 2 compatibility, v0.3 ( #26443 )
2024-09-13 14:38:45 -07:00
docusaurus.py
infra: update mypy 1.10, ruff 0.5 ( #23721 )
2024-07-03 10:33:27 -07:00
dropbox.py
multiple: pydantic 2 compatibility, v0.3 ( #26443 )
2024-09-13 14:38:45 -07:00
duckdb_loader.py
email.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
epub.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
etherscan.py
evernote.py
infra: update mypy 1.10, ruff 0.5 ( #23721 )
2024-07-03 10:33:27 -07:00
excel.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
facebook_chat.py
fauna.py
figma.py
firecrawl.py
Community: Updated Firecrawl Document Loader to v1 ( #26548 )
2024-10-15 13:13:28 +00:00
gcs_directory.py
multiple: update removal targets ( #25361 )
2024-08-14 09:50:39 -04:00
gcs_file.py
multiple: update removal targets ( #25361 )
2024-08-14 09:50:39 -04:00
generic.py
geodataframe.py
git.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
gitbook.py
community: add flag to toggle progress bar ( #24463 )
2024-07-20 13:18:02 +00:00
github.py
multiple: pydantic 2 compatibility, v0.3 ( #26443 )
2024-09-13 14:38:45 -07:00
glue_catalog.py
google_speech_to_text.py
multiple: update removal targets ( #25361 )
2024-08-14 09:50:39 -04:00
googledrive.py
multiple: pydantic 2 compatibility, v0.3 ( #26443 )
2024-09-13 14:38:45 -07:00
gutenberg.py
helpers.py
hn.py
html_bs.py
multiple: pydantic 2 compatibility, v0.3 ( #26443 )
2024-09-13 14:38:45 -07:00
html.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
hugging_face_dataset.py
hugging_face_model.py
community[patch]: Add missing annotations ( #24890 )
2024-07-31 18:13:44 +00:00
ifixit.py
image_captions.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
image.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
imsdb.py
iugu.py
joplin.py
json_loader.py
community: Update file_path
type in JSONLoader.__init__()
signature ( #27535 )
2024-10-22 11:18:36 -07:00
kinetica_loader.py
community[patch]: Kinetica Integrations handled error in querying; quotes in table names; updated gpudb API ( #22724 )
2024-06-11 10:01:26 -04:00
lakefs.py
larksuite.py
llmsherpa.py
markdown.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
mastodon.py
max_compute.py
mediawikidump.py
merge.py
mhtml.py
mintbase.py
modern_treasury.py
mongodb.py
community: Enhance MongoDBLoader with flexible metadata and optimized field extraction ( #23376 )
2024-09-17 10:23:17 -04:00
news.py
infra: update mypy 1.10, ruff 0.5 ( #23721 )
2024-07-03 10:33:27 -07:00
notebook.py
infra: update mypy 1.10, ruff 0.5 ( #23721 )
2024-07-03 10:33:27 -07:00
notion.py
notiondb.py
community: Fix KeyError in NotionDB loader when 'name' is missing ( #24224 )
2024-08-01 13:55:40 +00:00
nuclia.py
obs_directory.py
obs_file.py
obsidian.py
community[patch]: Add missing annotations ( #24890 )
2024-07-31 18:13:44 +00:00
odt.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
onedrive_file.py
multiple: pydantic 2 compatibility, v0.3 ( #26443 )
2024-09-13 14:38:45 -07:00
onedrive.py
community: Allow other than default parsers in SharePointLoader and OneDriveLoader ( #27716 )
2024-11-06 17:44:34 -05:00
onenote.py
community[patch]: Fix validation error in SettingsConfigDict across multiple Langchain modules ( #26852 )
2024-09-25 10:02:14 -04:00
open_city_data.py
oracleadb_loader.py
community: Add support for clob datatype in oracle database ( #27330 )
2024-10-16 02:19:20 +00:00
oracleai.py
org_mode.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
pdf.py
community: ZeroxPDFLoader ( #27800 )
2024-11-07 03:14:57 +00:00
pebblo.py
community[minor]: [Pebblo] Enhance PebbloSafeLoader to take anonymize flag ( #26812 )
2024-09-25 09:33:06 -04:00
polars_dataframe.py
powerpoint.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
psychic.py
pubmed.py
pyspark_dataframe.py
python.py
quip.py
readthedocs.py
recursive_url_loader.py
community[minor]: add proxy support to RecursiveUrlLoader ( #27364 )
2024-10-16 16:29:59 +00:00
reddit.py
roam.py
rocksetdb.py
rspace.py
rss.py
rst.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
rtf.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
s3_directory.py
s3_file.py
scrapfly.py
infra: update mypy 1.10, ruff 0.5 ( #23721 )
2024-07-03 10:33:27 -07:00
scrapingant.py
community[minor]: Add ScrapingAnt Loader Community Integration ( #24514 )
2024-07-24 21:11:43 -04:00
sharepoint.py
community: Allow other than default parsers in SharePointLoader and OneDriveLoader ( #27716 )
2024-11-06 17:44:34 -05:00
sitemap.py
community[patch]: SitemapLoader restrict depth of parsing sitemap (CVE-2024-2965) ( #22903 )
2024-06-14 13:04:40 -04:00
slack_directory.py
snowflake_loader.py
spider.py
spreedly.py
sql_database.py
community[patch]: restore compatibility with SQLAlchemy 1.x ( #22546 )
2024-06-19 17:58:57 +00:00
srt.py
stripe.py
surrealdb.py
telegram.py
tencent_cos_directory.py
tencent_cos_file.py
tensorflow_datasets.py
infra: update mypy 1.10, ruff 0.5 ( #23721 )
2024-07-03 10:33:27 -07:00
text.py
tidb.py
tomarkdown.py
community[patch]: Update URL to the 2markdown API ( #24546 )
2024-07-23 14:27:55 +00:00
toml.py
trello.py
tsv.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
twitter.py
unstructured.py
multiple: update removal targets ( #25361 )
2024-08-14 09:50:39 -04:00
url_playwright.py
infra: update mypy 1.10, ruff 0.5 ( #23721 )
2024-07-03 10:33:27 -07:00
url_selenium.py
infra: update mypy 1.10, ruff 0.5 ( #23721 )
2024-07-03 10:33:27 -07:00
url.py
infra: update mypy 1.10, ruff 0.5 ( #23721 )
2024-07-03 10:33:27 -07:00
vsdx.py
weather.py
infra: update mypy 1.10, ruff 0.5 ( #23721 )
2024-07-03 10:33:27 -07:00
web_base.py
[docs]: standardize doc loader doc strings ( #25325 )
2024-08-13 23:18:56 +00:00
whatsapp_chat.py
wikipedia.py
word_document.py
Update word_document.py | Fixed metadata["source"] for web paths ( #27220 )
2024-10-31 18:37:41 +00:00
xml.py
all: test 3.13 ci ( #27197 )
2024-10-25 12:56:58 -07:00
xorbits.py
youtube.py
multiple: pydantic 2 compatibility, v0.3 ( #26443 )
2024-09-13 14:38:45 -07:00
yuque.py