Alan Cha
2023-04-28 16:01:09 -04:00
committed by GitHub
parent 5042bd40d3
commit e3b7a20454


@@ -6,7 +6,7 @@
"metadata": {},
"source": [
"# tiktoken (OpenAI) Length Function\n",
-"You can also use tiktoken, a open source tokenizer package from OpenAI to estimate tokens used. Will probably be more accurate for their models.\n",
+"You can also use tiktoken, an open source tokenizer package from OpenAI to estimate tokens used. Will probably be more accurate for their models.\n",
"\n",
"1. How the text is split: by character passed in\n",
"2. How the chunk size is measured: by `tiktoken` tokenizer"
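The two points in the notebook cell above (split on a given character, measure chunk size with a tokenizer) can be illustrated with a minimal sketch. This is not the notebook's or the library's implementation: `split_by_separator` and `whitespace_token_count` are hypothetical names, and the whitespace counter is only a stand-in so the example runs without extra dependencies. With `tiktoken` installed, a length function such as `lambda t: len(tiktoken.get_encoding("cl100k_base").encode(t))` could presumably be passed in instead.

```python
from typing import Callable, List


def whitespace_token_count(text: str) -> int:
    # Stand-in length function (an assumption for this sketch):
    # counts whitespace-separated words instead of real tokenizer tokens.
    return len(text.split())


def split_by_separator(
    text: str,
    separator: str,
    chunk_size: int,
    length_function: Callable[[str], int],
) -> List[str]:
    # Hypothetical helper: split on `separator`, then greedily merge
    # neighbouring pieces while the merged chunk stays within `chunk_size`
    # as measured by `length_function`.
    pieces = [p for p in text.split(separator) if p]
    chunks: List[str] = []
    current: List[str] = []
    for piece in pieces:
        candidate = separator.join(current + [piece])
        if current and length_function(candidate) > chunk_size:
            # Adding this piece would exceed the budget: close the chunk.
            chunks.append(separator.join(current))
            current = [piece]
        else:
            current.append(piece)
    if current:
        chunks.append(separator.join(current))
    return chunks


sample = "one two three\n\nfour five\n\nsix seven eight nine\n\nten"
chunks = split_by_separator(sample, "\n\n", 5, whitespace_token_count)
for chunk in chunks:
    print(whitespace_token_count(chunk), repr(chunk))
```

Keeping the length function as a parameter is the point of the sketch: the splitting rule stays character-based, while the size budget is expressed in whatever unit the length function counts. A single piece that is already over the budget is kept whole here rather than split further.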