langchain/docs
Matt Robinson 2f15c11b87
feat: document loader for MS Word documents (#1282)
### Summary

Adds a document loader for MS Word documents. Works with both `.docx`
and `.doc` files as long as the user has installed
`unstructured>=0.4.11`.
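
To confirm the version requirement before calling the loader, a check along these lines may help. This is a minimal sketch and not part of this PR: the helper names `parse_release` and `unstructured_is_new_enough` are hypothetical, and it compares only the numeric release segments rather than doing full PEP 440 parsing.

```python
from importlib.metadata import PackageNotFoundError, version

# Minimum version stated in this PR description.
MIN_UNSTRUCTURED = (0, 4, 11)


def parse_release(version_string):
    """Turn '0.4.11' into (0, 4, 11); stop at the first non-numeric part."""
    parts = []
    for piece in version_string.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def unstructured_is_new_enough(installed=None):
    """Return True if the installed (or supplied) version is >= 0.4.11.

    Hypothetical helper: when `installed` is None, the version is looked up
    from the current environment, and a missing package counts as too old.
    """
    if installed is None:
        try:
            installed = version("unstructured")
        except PackageNotFoundError:
            return False
    return parse_release(installed) >= MIN_UNSTRUCTURED
```

Calling `unstructured_is_new_enough()` with no argument inspects the current environment. Passing a version string makes the comparison easy to try without installing anything.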

### Testing

The following workflow tests the loader for both `.doc` and `.docx` files
using example docs from the `unstructured` repo.

#### `.docx`

```python
from langchain.document_loaders import UnstructuredWordDocumentLoader

filename = "../unstructured/example-docs/fake.docx"
loader = UnstructuredWordDocumentLoader(filename)
loader.load()
```

#### `.doc`

```python
from langchain.document_loaders import UnstructuredWordDocumentLoader

filename = "../unstructured/example-docs/fake.doc"
loader = UnstructuredWordDocumentLoader(filename)
loader.load()
```
2023-02-24 08:26:19 -08:00

| Name | Last commit | Date |
| --- | --- | --- |
| _static | docs: increase width (#1049) | 2023-02-15 23:07:01 -08:00 |
| ecosystem | Add Writer, Banana, Modal, StochasticAI (#1270) | 2023-02-24 06:58:58 -08:00 |
| getting_started | add reqs (#918) | 2023-02-06 20:30:03 -08:00 |
| modules | feat: document loader for MS Word documents (#1282) | 2023-02-24 08:26:19 -08:00 |
| reference | Add Support for OpenSearch Vector database (#1191) | 2023-02-20 18:39:34 -08:00 |
| tracing | Make Tools own model, add ToolKit Concept (#1095) | 2023-02-18 13:40:43 -08:00 |
| use_cases | Harrison/updating docs (#1196) | 2023-02-20 22:54:26 -08:00 |
| conf.py | improve css (#615) | 2023-01-14 07:39:29 -08:00 |
| deployments.md | add docs for steamship deployment (#949) | 2023-02-08 16:01:19 -08:00 |
| ecosystem.rst | Docs refactor (#480) | 2023-01-02 08:24:09 -08:00 |
| gallery.rst | update gallery with slack bot (#1177) | 2023-02-20 08:21:00 -08:00 |
| glossary.md | Feature: linkcheck-action (#534) (#542) | 2023-01-04 21:39:50 -08:00 |
| index.rst | improve docs for indexes (#1146) | 2023-02-19 23:14:50 -08:00 |
| make.bat | initial commit | 2022-10-24 14:51:15 -07:00 |
| Makefile | Feature: linkcheck-action (#534) (#542) | 2023-01-04 21:39:50 -08:00 |
| reference.rst | Feature: linkcheck-action (#534) (#542) | 2023-01-04 21:39:50 -08:00 |
| requirements.txt | Docs refactor (#480) | 2023-01-02 08:24:09 -08:00 |
| tracing.md | Harrison/tracing docs (#806) | 2023-01-29 20:49:35 -08:00 |