langchain/docs/modules/indexes
Lance Martin b023f0c0f2
Text splitter for Markdown files by header (#5860)
This creates a new kind of text splitter for markdown files.

The user can supply a set of headers that they want to split the file
on.

We define a new text splitter class, `MarkdownHeaderTextSplitter`, that
does a few things:

(1) For each line, it determines the associated set of user-specified
headers
(2) It groups lines with common headers into splits

See notebook for example usage and test cases.
2023-06-12 15:46:42 -07:00
..
document_loaders/examples feat: Add UnstructuredXMLLoader for .xml files (#5955) 2023-06-10 16:24:42 -07:00
retrievers/examples Fixed typo missing "use" (#5991) 2023-06-10 14:31:58 -07:00
text_splitters Text splitter for Markdown files by header (#5860) 2023-06-12 15:46:42 -07:00
vectorstores comment out 2023-06-12 10:57:31 -07:00
document_loaders.rst Create Airtable loader (#5958) 2023-06-10 15:43:18 -07:00
getting_started.ipynb Update getting_started.ipynb (#4850) 2023-05-17 13:19:14 -07:00
retrievers.rst big docs refactor (#1978) 2023-03-26 19:49:46 -07:00
text_splitters.rst code splitter docs (#5480) 2023-05-31 07:11:53 -07:00
vectorstores.rst big docs refactor (#1978) 2023-03-26 19:49:46 -07:00