Files
langchain/docs/ecosystem
Eugene Yurtsev 3c490b5ba3 Docugami DataLoader (#4727)
### Adds a document loader for Docugami

Specifically:

1. Adds a data loader that talks to the [Docugami](http://docugami.com)
API to download processed documents as semantic XML
2. Parses the semantic XML into chunks, with additional metadata
capturing chunk semantics
3. Adds a detailed notebook showing how you can use additional metadata
returned by Docugami for techniques like the [self-querying
retriever](https://python.langchain.com/en/latest/modules/indexes/retrievers/examples/self_query_retriever.html)
4. Adds an integration test, and related documentation

Here is an example of a result that is not possible without the
capabilities added by Docugami (from the notebook):

<img width="1585" alt="image"
src="https://github.com/hwchase17/langchain/assets/749277/bb6c1ce3-13dc-4349-a53b-de16681fdd5b">

---------

Co-authored-by: Taqi Jaffri <tjaffri@docugami.com>
Co-authored-by: Taqi Jaffri <tjaffri@gmail.com>
2023-05-15 10:53:00 -04:00
..
2023-03-30 20:58:14 -07:00
2023-03-26 19:49:46 -07:00
2023-03-26 19:49:46 -07:00
2023-04-10 18:49:47 -07:00
2023-04-09 12:55:22 -07:00
2023-05-15 10:53:00 -04:00
2023-03-26 19:49:46 -07:00
2023-04-04 07:21:50 -07:00
2023-04-30 11:14:09 -07:00
2023-04-06 12:42:01 -07:00
2023-04-27 08:14:36 -07:00
cr
2023-04-04 07:25:28 -07:00
2023-04-26 15:57:48 -07:00
2023-03-26 19:49:46 -07:00
2023-04-22 09:17:38 -07:00
2023-03-26 19:49:46 -07:00
2023-03-26 19:49:46 -07:00
2023-03-26 19:49:46 -07:00
2023-03-26 19:49:46 -07:00
2023-04-28 20:47:18 -07:00
2023-04-06 14:41:06 -07:00
2023-03-26 19:49:46 -07:00
2023-04-28 21:25:33 -07:00
2023-03-26 19:49:46 -07:00
2023-04-20 15:20:21 -07:00