langchain/docs/modules/indexes/text_splitters
Lance Martin b023f0c0f2
Text splitter for Markdown files by header (#5860)
This creates a new kind of text splitter for markdown files.

The user can supply a set of headers that they want to split the file
on.

We define a new text splitter class, `MarkdownHeaderTextSplitter`, that
does a few things:

(1) For each line, it determines the associated set of user-specified
headers
(2) It groups lines with common headers into splits

See notebook for example usage and test cases.
2023-06-12 15:46:42 -07:00
..
examples Text splitter for Markdown files by header (#5860) 2023-06-12 15:46:42 -07:00
getting_started.ipynb Add start index to metadata in TextSplitter (#5912) 2023-06-08 23:09:32 -07:00