langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 10:17:00 +00:00

Author	SHA1	Message	Date
ccurme	328ba36601	chore(openai): skip Azure text completions tests (#34021 )	2025-11-19 09:29:12 -05:00
Mason Daugherty	52b1516d44	style(langchain): fix some middleware ref syntax (#33988 )	2025-11-16 00:33:17 -05:00
Mason Daugherty	8a3bb73c05	release(openai): 1.0.3 (#33981 ) - Respect 300k token limit for embeddings API requests #33668 - fix create_agent / response_format for Responses API #33939 - fix response.incomplete event is not handled when using stream_mode=['messages'] #33871	2025-11-14 19:18:50 -05:00
Mason Daugherty	099c042395	refactor(openai): embedding utils and calculations (#33982 ) Now returns (`_iter`, `tokens`, `indices`, token_counts`). The `token_counts` are calculated directly during tokenization, which is more accurate and efficient than splitting strings later.	2025-11-14 19:18:37 -05:00
Kaparthy Reddy	2d4f00a451	fix(openai): Respect 300k token limit for embeddings API requests (#33668 ) ## Description Fixes #31227 - Resolves the issue where `OpenAIEmbeddings` exceeds OpenAI's 300,000 token per request limit, causing 400 BadRequest errors. ## Problem When embedding large document sets, LangChain would send batches containing more than 300,000 tokens in a single API request, causing this error: ``` openai.BadRequestError: Error code: 400 - {'error': {'message': 'Requested 673477 tokens, max 300000 tokens per request'}} ``` The issue occurred because: - The code chunks texts by `embedding_ctx_length` (8191 tokens per chunk) - Then batches chunks by `chunk_size` (default 1000 chunks per request) - But didn't check: Total tokens per batch against OpenAI's 300k limit - Result: `1000 chunks × 8191 tokens = 8,191,000 tokens` → Exceeds limit! ## Solution This PR implements dynamic batching that respects the 300k token limit: 1. Added constant: `MAX_TOKENS_PER_REQUEST = 300000` 2. Track token counts: Calculate actual tokens for each chunk 3. Dynamic batching: Instead of fixed `chunk_size` batches, accumulate chunks until approaching the 300k limit 4. Applied to both sync and async: Fixed both `_get_len_safe_embeddings` and `_aget_len_safe_embeddings` ## Changes - Modified `langchain_openai/embeddings/base.py`: - Added `MAX_TOKENS_PER_REQUEST` constant - Replaced fixed-size batching with token-aware dynamic batching - Applied to both sync (line ~478) and async (line ~527) methods - Added test in `tests/unit_tests/embeddings/test_base.py`: - `test_embeddings_respects_token_limit()` - Verifies large document sets are properly batched ## Testing All existing tests pass (280 passed, 4 xfailed, 1 xpassed). New test verifies: - Large document sets (500 texts × 1000 tokens = 500k tokens) are split into multiple API calls - Each API call respects the 300k token limit ## Usage After this fix, users can embed large document sets without errors: ```python from langchain_openai import OpenAIEmbeddings from langchain_chroma import Chroma from langchain_text_splitters import CharacterTextSplitter # This will now work without exceeding token limits embeddings = OpenAIEmbeddings() documents = CharacterTextSplitter().split_documents(large_documents) Chroma.from_documents(documents, embeddings) ``` Resolves #31227 --------- Co-authored-by: Kaparthy Reddy <kaparthyreddy@Kaparthys-MacBook-Air.local> Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-14 18:12:07 -05:00
ccurme	3d415441e8	fix(langchain, openai): backward compat for response_format (#33945 )	2025-11-13 11:11:35 -05:00
ccurme	74385e0ebd	fix(langchain, openai): fix create_agent / response_format for Responses API (#33939 )	2025-11-13 10:18:15 -05:00
Mason Daugherty	3dfea96ec1	chore: update `README.md` files (#33919 )	2025-11-10 22:51:35 -05:00
Mason Daugherty	69c7d1b01b	test(groq,openai): add retries for flaky tests (#33914 )	2025-11-10 10:36:11 -05:00
riunyfir	1b77a191f4	feat: The response.incomplete event is not handled when using stream_mode=['messages'] (#33871 )	2025-11-07 09:46:11 -05:00
Mason Daugherty	e023201d42	style: some cleanup (#33857 )	2025-11-06 23:50:46 -05:00
Mason Daugherty	d40e340479	chore: attribute package change versions (#33854 ) Needed to disambiguate for within inherited docs	2025-11-06 16:57:30 -05:00
Mason Daugherty	dfb05a7fa0	style: refs pass (#33813 )	2025-11-03 22:11:10 -05:00
ccurme	81c4f21b52	fix(standard-tests): update multimodal tests (#33781 )	2025-11-01 16:38:20 -04:00
ccurme	61196a8280	release(openai): 1.0.2 (#33769 )	2025-10-31 14:21:32 -04:00
Mason Daugherty	dc5b7dace8	test(openai): mark tests flaky (#33750 ) see: https://github.com/langchain-ai/langchain/actions/runs/18921929210/job/54020065079#step:10:560	2025-10-30 16:07:58 -04:00
Shagun Gupta	75fff151e8	fix(openai): replace pytest.warns(None) with warnings.catch_warnings in ChatOpenAI test to resolve TypeError . Resolves issue #33705 (#33741 )	2025-10-30 09:22:34 -04:00
ccurme	d218936763	fix(openai): update model used in test (#33733 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-10-29 17:09:18 -04:00
Mason Daugherty	123e29dc26	style: more refs fixes (#33730 )	2025-10-29 16:34:46 -04:00
Mason Daugherty	a2a9a02ecb	style(core): more cleanup all around (#33711 )	2025-10-28 22:58:19 -04:00
Mason Daugherty	f94108b4bc	fix: links (#33691 ) * X-ref to new docs * Formatting updates	2025-10-27 19:04:29 -04:00
ccurme	6ab0476676	fix(openai): update test (#33659 )	2025-10-24 11:04:33 -04:00
Ali Ismail	5acd34ae92	feat(openai): add unit test for streaming error in `_generate` (#33134 )	2025-10-21 15:08:37 -04:00
ccurme	2222470f69	release(openai): 1.0.1 (#33624 )	2025-10-21 11:37:47 -04:00
Marlene	78175fcb96	feat(openai): add callable support for openai_api_key parameter (#33532 )	2025-10-21 11:16:02 -04:00
Mason Daugherty	64e6798a39	chore: update `pyproject.toml` url entries (#33587 )	2025-10-17 17:16:55 -04:00
ccurme	4d623133a5	release(openai): 1.0.0 (#33578 )	2025-10-17 11:25:25 -04:00
Mason Daugherty	241a382fba	docs: fix Anthropic, OpenAI docstrings (#33566 ) minor	2025-10-17 11:18:32 -04:00
ccurme	3152d25811	fix: support python 3.14 in various projects (#33575 ) Co-authored-by: cbornet <cbornet@hotmail.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-10-17 11:06:23 -04:00
Mason Daugherty	1d2273597a	docs: more fixes for refs (#33554 )	2025-10-16 22:54:16 -04:00
Mason Daugherty	15db024811	chore: more sweeping (#33533 ) more fixes for refs	2025-10-16 15:44:56 -04:00
Jacob Lee	6d73003b17	feat(openai): Populate OpenAI service tier token details (#32721 )	2025-10-16 15:14:57 -04:00
Mason Daugherty	26e0a00c4c	style: more work for refs (#33508 ) Largely: - Remove explicit `"Default is x"` since new refs show default inferred from sig - Inline code (useful for eventual parsing) - Fix code block rendering (indentations)	2025-10-15 18:46:55 -04:00
Nuno Campos	0788461abd	feat(openai): Add openai moderation middleware (#33492 )	2025-10-15 13:59:49 -04:00
Mason Daugherty	79200cf3c2	docs: update package READMEs (#33488 )	2025-10-15 10:49:35 -04:00
Chenyang Li	6e25e185f6	fix(docs): Fix several typos and grammar (#33487 ) Just typo changes Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-10-14 20:04:14 -04:00
ccurme	78903ac285	fix(openai): conditionally skip test (#33431 )	2025-10-10 21:04:18 +00:00
Mason Daugherty	291a9fcea1	style: `llm` -> `model` (#33423 )	2025-10-10 13:19:13 -04:00
Mason Daugherty	6fc21afbc9	style: `.. code-block::` admonition translations (#33400 ) biiiiiiiiiiiiiiiigggggggg pass	2025-10-09 16:52:58 -04:00
Mason Daugherty	d8a680ee57	style: address Sphinx double-backtick snippet syntax (#33389 )	2025-10-09 13:35:51 -04:00
Mason Daugherty	3576e690fa	chore: update Sphinx links to markdown (#33386 )	2025-10-09 11:54:14 -04:00
ccurme	c27271f3ae	fix(openai): update file index key name (#33350 )	2025-10-09 13:15:27 +00:00
Mason Daugherty	b6132fc23e	style: remove more `Optional` syntax (#33371 )	2025-10-08 23:28:43 -04:00
Mason Daugherty	31eeb50ce0	chore: drop UP045 (#33362 ) Python 3.9 EOL	2025-10-08 21:17:53 -04:00
Mason Daugherty	d13823043d	style: monorepo pass for refs (#33359 ) * Delete some double backticks previously used by Sphinx (not done everywhere yet) * Fix some code blocks / dropdowns Ignoring CLI CI for now	2025-10-08 18:41:39 -04:00
Mason Daugherty	6b9b177b89	chore(openai): `base.py` ref pass (#33355 )	2025-10-08 16:08:52 -04:00
Mason Daugherty	cda336295f	chore: enrich `pyproject.toml` files with links to new references, others (#33343 )	2025-10-07 16:17:14 -04:00
Mason Daugherty	8bcdfbb24e	chore: clean up `pyproject.toml` files, use core a7 (#33334 )	2025-10-07 10:49:04 -04:00
ccurme	aa442bc52f	release(openai): 1.0.0a4 (#33316 )	2025-10-07 09:25:05 -04:00
ccurme	d0f5a1cc96	fix(standard-tests,openai): minor fix for Responses API tests (#33315 ) Following https://github.com/langchain-ai/langchain/pull/33301	2025-10-06 16:46:41 -04:00

1 2 3 4 5 ...

509 Commits