langchain/docs/modules/indexes
Jens Madsen 8d9e9e013c
refactor: extract token text splitter function (#5179)
# Token text splitter for sentence transformers

The current TokenTextSplitter only works with OpenAI models via the
`tiktoken` package. This is not clear from the name `TokenTextSplitter`.
In this (first) PR, a token-based text splitter for sentence transformer
models is added. In the future, I think we should work towards injecting
a tokenizer into the TokenTextSplitter to make it more flexible.
Could perhaps be reviewed by @dev2049
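
To illustrate the idea of injecting a tokenizer, here is a minimal sketch rather than the code from this PR. The names `split_text_on_tokens` and `WhitespaceTokenizer`, and the `tokens_per_chunk` and `chunk_overlap` parameters, are assumptions made for this example. The toy tokenizer stands in for `tiktoken` or a sentence transformer tokenizer so the sketch runs without extra dependencies.

```python
from typing import Callable, Dict, List


def split_text_on_tokens(
    text: str,
    encode: Callable[[str], List[int]],
    decode: Callable[[List[int]], str],
    tokens_per_chunk: int,
    chunk_overlap: int = 0,
) -> List[str]:
    """Split text into chunks of at most tokens_per_chunk tokens.

    The tokenizer is injected as a pair of encode/decode callables
    (hypothetical interface), so any tokenizer can be plugged in.
    """
    if tokens_per_chunk <= 0:
        raise ValueError("tokens_per_chunk must be positive")
    if not 0 <= chunk_overlap < tokens_per_chunk:
        raise ValueError("chunk_overlap must be in [0, tokens_per_chunk)")

    ids = encode(text)
    step = tokens_per_chunk - chunk_overlap
    chunks: List[str] = []
    for start in range(0, len(ids), step):
        chunks.append(decode(ids[start:start + tokens_per_chunk]))
        # Stop once this window has reached the end of the token list.
        if start + tokens_per_chunk >= len(ids):
            break
    return chunks


class WhitespaceTokenizer:
    """Toy tokenizer for illustration only: one token id per distinct word."""

    def __init__(self) -> None:
        self._word_to_id: Dict[str, int] = {}
        self._id_to_word: List[str] = []

    def encode(self, text: str) -> List[int]:
        ids = []
        for word in text.split():
            if word not in self._word_to_id:
                self._word_to_id[word] = len(self._id_to_word)
                self._id_to_word.append(word)
            ids.append(self._word_to_id[word])
        return ids

    def decode(self, ids: List[int]) -> str:
        return " ".join(self._id_to_word[i] for i in ids)


tokenizer = WhitespaceTokenizer()
chunks = split_text_on_tokens(
    "a b c d e", tokenizer.encode, tokenizer.decode, tokens_per_chunk=2
)
```

Because the splitter depends only on the two callables, swapping in a different tokenizer would not require changes to the splitting logic.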

---------

Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>
2023-06-04 14:41:44 -07:00
| Name | Last commit | Date |
| --- | --- | --- |
| document_loaders/examples | docs ecosystem/integrations update 4 (#5590) | 2023-06-03 15:29:03 -07:00 |
| retrievers/examples | Harrison/pubmed integration (#5664) | 2023-06-03 16:25:28 -07:00 |
| text_splitters | refactor: extract token text splitter function (#5179) | 2023-06-04 14:41:44 -07:00 |
| vectorstores | Created fix for 5475 (#5659) | 2023-06-03 16:47:52 -07:00 |
| document_loaders.rst | Documentation fixes (linting and broken links) (#5563) | 2023-06-01 13:06:17 -07:00 |
| getting_started.ipynb | Update getting_started.ipynb (#4850) | 2023-05-17 13:19:14 -07:00 |
| retrievers.rst | big docs refactor (#1978) | 2023-03-26 19:49:46 -07:00 |
| text_splitters.rst | code splitter docs (#5480) | 2023-05-31 07:11:53 -07:00 |
| vectorstores.rst | big docs refactor (#1978) | 2023-03-26 19:49:46 -07:00 |