Files
langchain/langchain
Lance Martin b023f0c0f2 Text splitter for Markdown files by header (#5860)
This creates a new kind of text splitter for markdown files.

The user can supply a set of headers that they want to split the file
on.

We define a new text splitter class, `MarkdownHeaderTextSplitter`, that
does a few things:

(1) For each line, it determines the associated set of user-specified
headers
(2) It groups lines with common headers into splits

See notebook for example usage and test cases.
2023-06-12 15:46:42 -07:00
..
2023-06-11 17:59:09 -07:00
2023-06-11 15:51:28 -07:00
2023-06-11 15:51:28 -07:00
2023-06-06 06:51:05 -07:00
2023-06-11 10:09:22 -07:00
2023-06-11 15:51:28 -07:00
2023-06-11 15:51:28 -07:00
2023-06-11 15:51:28 -07:00
2023-06-11 15:45:12 -07:00
2023-06-11 09:48:09 -07:00
2023-05-17 01:28:43 +00:00
2023-06-11 15:51:28 -07:00
2023-06-08 21:15:14 -07:00
2023-06-03 16:48:48 -07:00
2022-11-12 11:22:32 -08:00
2023-06-11 15:51:28 -07:00
2023-02-20 21:15:45 -08:00
2023-05-16 13:26:43 +00:00