Files
langchain/docs/modules/document_loaders/examples
Matt Robinson 2f15c11b87 feat: document loader for MS Word documents (#1282)
### Summary

Adds a document loader for MS Word Documents. Works with both `.docx`
and `.doc` files as longer as the user has installed
`unstructured>=0.4.11`.

### Testing

The follow workflow test the loader for both `.doc` and `.docx` files
using example docs from the `unstructured` repo.

#### `.docx`

```python
from langchain.document_loaders import UnstructuredWordDocumentLoader

filename = "../unstructured/example-docs/fake.docx"
loader = UnstructuredWordDocumentLoader(filename)
loader.load()
```

#### `.doc`

```python
from langchain.document_loaders import UnstructuredWordDocumentLoader

filename = "../unstructured/example-docs/fake.doc"
loader = UnstructuredWordDocumentLoader(filename)
loader.load()
```
2023-02-24 08:26:19 -08:00
..
2023-02-24 07:22:48 -08:00
2023-02-15 22:47:30 -08:00
2023-02-24 07:22:48 -08:00
2023-02-08 00:35:33 -08:00
2023-02-20 22:54:26 -08:00
2023-02-06 21:44:35 -08:00
2023-02-08 12:00:47 -08:00
2023-02-17 15:15:02 -08:00
2023-02-06 18:13:46 -08:00
2023-02-24 07:38:24 -08:00
2023-02-06 00:26:20 -08:00
2023-02-06 22:21:16 -08:00
2023-02-18 08:13:54 -08:00
2023-02-12 23:02:01 -08:00
2023-02-08 00:35:33 -08:00
2023-02-08 00:35:33 -08:00
2023-02-18 10:58:39 -08:00
2023-02-09 08:12:22 -08:00