langchain/docs/modules/document_loaders/examples
Matt Robinson 07a407d89a
feat: adds UnstructuredURLLoader for loading data from urls (#979)
### Summary

Adds a `UnstructuredURLLoader` that supports loading data from a list of
URLs.


### Testing

```python
from langchain.document_loaders import UnstructuredURLLoader

urls = [
    "https://www.understandingwar.org/backgrounder/russian-offensive-campaign-assessment-february-8-2023",
    "https://www.understandingwar.org/backgrounder/russian-offensive-campaign-assessment-february-9-2023"
]
loader = UnstructuredURLLoader(urls=urls)
raw_documents = loader.load()
```
2023-02-10 10:18:38 -08:00
..
example_data Harrison/everynote (#974) 2023-02-10 08:02:35 -08:00
azlyrics.ipynb adding webpage loading logic (#942) 2023-02-09 07:52:50 -08:00
college_confidential.ipynb adding webpage loading logic (#942) 2023-02-09 07:52:50 -08:00
directory_loader.ipynb Harrison/unstructured support (#903) 2023-02-05 23:02:07 -08:00
email.ipynb Harrison/obsidian (#920) 2023-02-06 22:21:16 -08:00
everynote.ipynb Harrison/everynote (#974) 2023-02-10 08:02:35 -08:00
gcs_directory.ipynb Harrison/add roam loader (#939) 2023-02-08 00:35:33 -08:00
gcs_file.ipynb Harrison/add roam loader (#939) 2023-02-08 00:35:33 -08:00
googledrive.ipynb add GoogleDriveLoader (#914) 2023-02-06 21:44:35 -08:00
gutenberg.ipynb gutenberg books (#946) 2023-02-08 12:00:47 -08:00
html.ipynb add unstructured examples (#913) 2023-02-06 18:13:46 -08:00
imsdb.ipynb adding webpage loading logic (#942) 2023-02-09 07:52:50 -08:00
microsoft_word.ipynb Harrison/obsidian (#920) 2023-02-06 22:21:16 -08:00
notion.ipynb update docs (#905) 2023-02-06 00:26:20 -08:00
obsidian.ipynb Harrison/obsidian (#920) 2023-02-06 22:21:16 -08:00
pdf.ipynb Harrison/format agent instructions (#973) 2023-02-10 10:07:26 -08:00
powerpoint.ipynb add unstructured examples (#913) 2023-02-06 18:13:46 -08:00
readthedocs_documentation.ipynb Harrison/unstructured support (#903) 2023-02-05 23:02:07 -08:00
roam.ipynb Harrison/add roam loader (#939) 2023-02-08 00:35:33 -08:00
s3_directory.ipynb Harrison/add roam loader (#939) 2023-02-08 00:35:33 -08:00
s3_file.ipynb Harrison/add roam loader (#939) 2023-02-08 00:35:33 -08:00
unstructured_file.ipynb Harrison/unstructured support (#903) 2023-02-05 23:02:07 -08:00
url.ipynb feat: adds UnstructuredURLLoader for loading data from urls (#979) 2023-02-10 10:18:38 -08:00
web_base.ipynb adding webpage loading logic (#942) 2023-02-09 07:52:50 -08:00
youtube.ipynb Harrison/youtube fixes (#955) 2023-02-09 08:12:22 -08:00