langchain/examples at 0e21463f07123506061246076302ffc86d02c80b

github/langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-09-29 07:19:59 +00:00

Tim Asp 23231d65a9 Add PyMuPDF PDF loader (#1426)

Different PDF libraries have different strengths and weaknesses. PyMuPDF
does a good job of extracting as much content as possible from a document,
regardless of source quality, and it is extremely fast (especially compared to
Unstructured).

https://pymupdf.readthedocs.io/en/latest/index.html

2023-03-03 20:59:28 -08:00

..

add CoNLL-U document loader (#1297)

2023-02-26 17:27:00 -08:00

airbyte_json.ipynb

Added initial capital letter to bullet points that had it missing (#1000)

2023-02-11 20:31:34 -08:00

azlyrics.ipynb

adding webpage loading logic (#942)

2023-02-09 07:52:50 -08:00

college_confidential.ipynb

adding webpage loading logic (#942)

2023-02-09 07:52:50 -08:00

CoNLL-U.ipynb

add CoNLL-U document loader (#1297)

2023-02-26 17:27:00 -08:00

copypaste.ipynb

copy paste loader (#1302)

2023-02-26 17:26:37 -08:00

directory_loader.ipynb

directory loader improvements (#1162)

2023-02-19 20:47:08 -08:00

email.ipynb

Harrison/unstructured structured (#1004)

2023-02-12 07:36:11 -08:00

evernote.ipynb

Harrison/evernote nb (#1078)

2023-02-15 22:47:30 -08:00

facebook_chat.ipynb

Harrison/fb loader (#1277)

2023-02-24 07:22:48 -08:00

gcs_directory.ipynb

Harrison/add roam loader (#939)

2023-02-08 00:35:33 -08:00

gcs_file.ipynb

Harrison/add roam loader (#939)

2023-02-08 00:35:33 -08:00

gitbook.ipynb

Harrison/updating docs (#1196)

2023-02-20 22:54:26 -08:00

googledrive.ipynb

add GoogleDriveLoader (#914)

2023-02-06 21:44:35 -08:00

gutenberg.ipynb

gutenberg books (#946)

2023-02-08 12:00:47 -08:00

hn.ipynb

Harrison/hn loader (#1130)

2023-02-17 15:15:02 -08:00

html.ipynb

add unstructured examples (#913)

2023-02-06 18:13:46 -08:00

ifixit.ipynb

Add new iFixit document loader (#1333)

2023-02-27 20:40:20 -08:00

image.ipynb

feat: document loader for image files (#1330)

2023-02-27 14:43:32 -08:00

imsdb.ipynb

adding webpage loading logic (#942)

2023-02-09 07:52:50 -08:00

microsoft_word.ipynb

Harrison/unstructured structured (#1004)

2023-02-12 07:36:11 -08:00

notebook.ipynb

cleanup (#1274)

2023-02-24 07:38:24 -08:00

notion.ipynb

update docs (#905)

2023-02-06 00:26:20 -08:00

obsidian.ipynb

Harrison/obsidian (#920)

2023-02-06 22:21:16 -08:00

online_pdf.ipynb

fix docs (#1133)

2023-02-18 08:13:54 -08:00

pdf.ipynb

Add PyMuPDF PDF loader (#1426)

2023-03-03 20:59:28 -08:00

powerpoint.ipynb

Harrison/unstructured structured (#1004)

2023-02-12 07:36:11 -08:00

readthedocs_documentation.ipynb

Harrison/unstructured support (#903)

2023-02-05 23:02:07 -08:00

roam.ipynb

Harrison/add roam loader (#939)

2023-02-08 00:35:33 -08:00

s3_directory.ipynb

Harrison/add roam loader (#939)

2023-02-08 00:35:33 -08:00

s3_file.ipynb

Harrison/add roam loader (#939)

2023-02-08 00:35:33 -08:00

srt.ipynb

add srt loader (#1140)

2023-02-18 10:58:39 -08:00

telegram.ipynb

Harrison/telegram loader (#1080)

2023-02-15 23:24:32 -08:00

unstructured_file.ipynb

docs: add quotes to unstructured[local-inference] install instructions (#1208)

2023-02-21 08:06:43 -08:00

url.ipynb

feat: adds UnstructuredURLLoader for loading data from urls (#979)

2023-02-10 10:18:38 -08:00

web_base.ipynb

adding webpage loading logic (#942)

2023-02-09 07:52:50 -08:00

word_document.ipynb

feat: document loader for MS Word documents (#1282)

2023-02-24 08:26:19 -08:00

youtube.ipynb

Harrison/youtube fixes (#955)

2023-02-09 08:12:22 -08:00