langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 10:17:00 +00:00

Author	SHA1	Message	Date
Mason Daugherty	099c042395	refactor(openai): embedding utils and calculations (#33982 ) Now returns (`_iter`, `tokens`, `indices`, token_counts`). The `token_counts` are calculated directly during tokenization, which is more accurate and efficient than splitting strings later.	2025-11-14 19:18:37 -05:00
Kaparthy Reddy	2d4f00a451	fix(openai): Respect 300k token limit for embeddings API requests (#33668 ) ## Description Fixes #31227 - Resolves the issue where `OpenAIEmbeddings` exceeds OpenAI's 300,000 token per request limit, causing 400 BadRequest errors. ## Problem When embedding large document sets, LangChain would send batches containing more than 300,000 tokens in a single API request, causing this error: ``` openai.BadRequestError: Error code: 400 - {'error': {'message': 'Requested 673477 tokens, max 300000 tokens per request'}} ``` The issue occurred because: - The code chunks texts by `embedding_ctx_length` (8191 tokens per chunk) - Then batches chunks by `chunk_size` (default 1000 chunks per request) - But didn't check: Total tokens per batch against OpenAI's 300k limit - Result: `1000 chunks × 8191 tokens = 8,191,000 tokens` → Exceeds limit! ## Solution This PR implements dynamic batching that respects the 300k token limit: 1. Added constant: `MAX_TOKENS_PER_REQUEST = 300000` 2. Track token counts: Calculate actual tokens for each chunk 3. Dynamic batching: Instead of fixed `chunk_size` batches, accumulate chunks until approaching the 300k limit 4. Applied to both sync and async: Fixed both `_get_len_safe_embeddings` and `_aget_len_safe_embeddings` ## Changes - Modified `langchain_openai/embeddings/base.py`: - Added `MAX_TOKENS_PER_REQUEST` constant - Replaced fixed-size batching with token-aware dynamic batching - Applied to both sync (line ~478) and async (line ~527) methods - Added test in `tests/unit_tests/embeddings/test_base.py`: - `test_embeddings_respects_token_limit()` - Verifies large document sets are properly batched ## Testing All existing tests pass (280 passed, 4 xfailed, 1 xpassed). New test verifies: - Large document sets (500 texts × 1000 tokens = 500k tokens) are split into multiple API calls - Each API call respects the 300k token limit ## Usage After this fix, users can embed large document sets without errors: ```python from langchain_openai import OpenAIEmbeddings from langchain_chroma import Chroma from langchain_text_splitters import CharacterTextSplitter # This will now work without exceeding token limits embeddings = OpenAIEmbeddings() documents = CharacterTextSplitter().split_documents(large_documents) Chroma.from_documents(documents, embeddings) ``` Resolves #31227 --------- Co-authored-by: Kaparthy Reddy <kaparthyreddy@Kaparthys-MacBook-Air.local> Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-14 18:12:07 -05:00
ccurme	74385e0ebd	fix(langchain, openai): fix create_agent / response_format for Responses API (#33939 )	2025-11-13 10:18:15 -05:00
Mason Daugherty	69c7d1b01b	test(groq,openai): add retries for flaky tests (#33914 )	2025-11-10 10:36:11 -05:00
riunyfir	1b77a191f4	feat: The response.incomplete event is not handled when using stream_mode=['messages'] (#33871 )	2025-11-07 09:46:11 -05:00
ccurme	81c4f21b52	fix(standard-tests): update multimodal tests (#33781 )	2025-11-01 16:38:20 -04:00
Mason Daugherty	dc5b7dace8	test(openai): mark tests flaky (#33750 ) see: https://github.com/langchain-ai/langchain/actions/runs/18921929210/job/54020065079#step:10:560	2025-10-30 16:07:58 -04:00
Shagun Gupta	75fff151e8	fix(openai): replace pytest.warns(None) with warnings.catch_warnings in ChatOpenAI test to resolve TypeError . Resolves issue #33705 (#33741 )	2025-10-30 09:22:34 -04:00
ccurme	d218936763	fix(openai): update model used in test (#33733 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-10-29 17:09:18 -04:00
Mason Daugherty	f94108b4bc	fix: links (#33691 ) * X-ref to new docs * Formatting updates	2025-10-27 19:04:29 -04:00
ccurme	6ab0476676	fix(openai): update test (#33659 )	2025-10-24 11:04:33 -04:00
Ali Ismail	5acd34ae92	feat(openai): add unit test for streaming error in `_generate` (#33134 )	2025-10-21 15:08:37 -04:00
Marlene	78175fcb96	feat(openai): add callable support for openai_api_key parameter (#33532 )	2025-10-21 11:16:02 -04:00
Jacob Lee	6d73003b17	feat(openai): Populate OpenAI service tier token details (#32721 )	2025-10-16 15:14:57 -04:00
Nuno Campos	0788461abd	feat(openai): Add openai moderation middleware (#33492 )	2025-10-15 13:59:49 -04:00
Chenyang Li	6e25e185f6	fix(docs): Fix several typos and grammar (#33487 ) Just typo changes Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-10-14 20:04:14 -04:00
ccurme	78903ac285	fix(openai): conditionally skip test (#33431 )	2025-10-10 21:04:18 +00:00
ccurme	c27271f3ae	fix(openai): update file index key name (#33350 )	2025-10-09 13:15:27 +00:00
Mason Daugherty	31eeb50ce0	chore: drop UP045 (#33362 ) Python 3.9 EOL	2025-10-08 21:17:53 -04:00
Mason Daugherty	d13823043d	style: monorepo pass for refs (#33359 ) * Delete some double backticks previously used by Sphinx (not done everywhere yet) * Fix some code blocks / dropdowns Ignoring CLI CI for now	2025-10-08 18:41:39 -04:00
ccurme	d0f5a1cc96	fix(standard-tests,openai): minor fix for Responses API tests (#33315 ) Following https://github.com/langchain-ai/langchain/pull/33301	2025-10-06 16:46:41 -04:00
ccurme	de48e102c4	fix(core,openai,anthropic): delegate to core implementation on invoke when streaming=True (#33308 )	2025-10-06 15:54:55 -04:00
ccurme	4e50ec4b98	feat(openai): enable stream_usage when using default base URL and client (#33205 )	2025-10-06 08:56:38 -04:00
ccurme	010ed5d096	fix(anthropic,openai): fix tests (#33257 ) following https://github.com/langchain-ai/langchain/pull/33192	2025-10-03 13:41:37 -04:00
Mason Daugherty	5a016de53f	chore: delete deprecated items (#33192 ) Removed: - `libs/core/langchain_core/chat_history.py`: `add_user_message` and `add_ai_message` in favor of `add_messages` and `aadd_messages` - `libs/core/langchain_core/language_models/base.py`: `predict`, `predict_messages`, and async versions in favor of `invoke`. removed `_all_required_field_names` since it was a wrapper on `get_pydantic_field_names` - `libs/core/langchain_core/language_models/chat_models.py`: `callback_manager` param in favor of `callbacks`. `__call__` and `call_as_llm` method in favor of `invoke` - `libs/core/langchain_core/language_models/llms.py`: `callback_manager` param in favor of `callbacks`. `__call__`, `predict`, `apredict`, and `apredict_messages` methods in favor of `invoke` - `libs/core/langchain_core/prompts/chat.py`: `from_role_strings` and `from_strings` in favor of `from_messages` - `libs/core/langchain_core/prompts/pipeline.py`: removed `PipelinePromptTemplate` - `libs/core/langchain_core/prompts/prompt.py`: `input_variables` param on `from_file` as it wasn't used - `libs/core/langchain_core/tools/base.py`: `callback_manager` param in favor of `callbacks` - `libs/core/langchain_core/tracers/context.py`: `tracing_enabled` in favor of `tracing_enabled_v2` - `libs/core/langchain_core/tracers/langchain_v1.py`: entire module - `libs/core/langchain_core/utils/loading.py`: entire module, `try_load_from_hub` - `libs/core/langchain_core/vectorstores/in_memory.py`: `upsert` in favor of `add_documents` - `libs/standard-tests/langchain_tests/integration_tests/chat_models.py` and `libs/standard-tests/langchain_tests/unit_tests/chat_models.py`: `tool_choice_value` as models should accept `tool_choice="any"` - `langchain` will consequently no longer expose these items if it was previously --------- Co-authored-by: Mohammad Mohtashim <45242107+keenborder786@users.noreply.github.com> Co-authored-by: Caspar Broekhuizen <caspar@langchain.dev> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Christophe Bornet <cbornet@hotmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Sadra Barikbin <sadraqazvin1@yahoo.com> Co-authored-by: Vadym Barda <vadim.barda@gmail.com>	2025-10-03 03:33:24 +00:00
Mason Daugherty	eaa6dcce9e	release: v1.0.0 (#32567 ) Co-authored-by: Mohammad Mohtashim <45242107+keenborder786@users.noreply.github.com> Co-authored-by: Caspar Broekhuizen <caspar@langchain.dev> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Christophe Bornet <cbornet@hotmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Sadra Barikbin <sadraqazvin1@yahoo.com> Co-authored-by: Vadym Barda <vadim.barda@gmail.com>	2025-10-02 10:49:42 -04:00
ccurme	002d623f2d	feat: (core, standard-tests) support PDF inputs in ToolMessages (#33183 )	2025-10-01 10:16:16 -04:00
ccurme	64141072a3	feat(openai): support openai sdk 2.0 (#33168 )	2025-09-30 16:34:00 -04:00
ccurme	839a18e112	fix(openai): remove __future__.annotations import from test files (#33144 ) Breaks schema conversion in places.	2025-09-29 16:23:32 +00:00
Mason Daugherty	986302322f	docs: more standardization (#33124 )	2025-09-25 20:46:20 -04:00
Mason Daugherty	12daba63ff	test(openai): raise token limit for o1 test (#33118 ) `test_o1[False-False]` was sometimes failing because the OpenAI o1 model was hitting a token limit with only 100 tokens	2025-09-25 12:57:33 -04:00
Mason Daugherty	043a7560a5	test: use `.get()` for safe `ls_params` access (#33034 )	2025-09-20 23:46:37 -04:00
Mason Daugherty	9f6431924f	feat(openai): add `max_tokens` to `AzureChatOpenAI` (#32959 ) Fixes #32949 This pattern is [present in `ChatOpenAI`](https://github.com/langchain-ai/langchain/blob/master/libs/partners/openai/langchain_openai/chat_models/base.py#L2821) but wasn't carried over to Azure. [CI](https://github.com/langchain-ai/langchain/actions/runs/17741751797/job/50417180998)	2025-09-15 14:09:20 -04:00
Matthew Lapointe	b1f08467cd	feat(core): allow overriding `ls_model_name` from kwargs (#32541 )	2025-09-11 16:18:06 -04:00
Mason Daugherty	4c6af2d1b2	fix(openai): structured output (#32551 )	2025-09-09 11:37:50 -04:00
Sadiq Khan	228fbac3a6	fix(openai): handle `AIMessage`s without `response_id` in `_get_last_messages` (#32824 )	2025-09-08 10:12:50 -04:00
JunHyungKang	6ea06ca972	fix(openai): Fix Azure OpenAI Responses API model field issue (#32649 )	2025-09-08 10:08:35 -04:00
Jacob Lee	1459d4f4ce	fix(openai): Always add raw response object to OpenAI client errors for invoke (#32655 )	2025-08-26 09:59:25 -04:00
Alex Naidis	21f7a9a9e5	fix(openai): allow temperature parameter for gpt-5-chat models (#32624 )	2025-08-21 16:40:10 -04:00
sa411022	61bc1bf9cc	fix(openai): construct responses api input (#32557 )	2025-08-21 15:56:29 -04:00
Mason Daugherty	262c83763f	release(openai): 0.3.30 (#32515 )	2025-08-12 16:06:17 +00:00
Mason Daugherty	0024dffa68	feat(openai): officially support `verbosity` (#32470 )	2025-08-12 16:00:30 +00:00
Mason Daugherty	ee4c2510eb	feat: port various nit changes from `wip-v0.4` (#32506 ) Lots of work that wasn't directly related to core improvements/messages/testing functionality	2025-08-11 15:09:08 -04:00
Mason Daugherty	c31236264e	chore: formatting across codebase (#32466 )	2025-08-08 10:20:10 -04:00
ccurme	02001212b0	fix(openai): revert some changes (#32462 ) Keep coverage on `output_version="v0"` (increasing coverage is being managed in v0.4 branch).	2025-08-08 08:51:18 -04:00
Mason Daugherty	00244122bd	feat(openai): `minimal` and `verbosity` (#32455 )	2025-08-08 02:24:21 +00:00
ccurme	ec2b34a02d	feat(openai): custom tools (#32449 )	2025-08-07 16:30:01 -04:00
Mason Daugherty	145d38f7dd	test(openai): add tests for `prompt_cache_key` parameter and update docs (#32363 ) Introduce tests to validate the behavior and inclusion of the `prompt_cache_key` parameter in request payloads for the `ChatOpenAI` model.	2025-08-07 15:29:47 -04:00
ccurme	a9e52ca605	chore(openai): bump openai sdk (#32322 )	2025-07-30 10:58:18 -04:00
Mason Daugherty	e79e0bd6b4	fix(openai): add `max_retries` parameter to ChatOpenAI for handling 503 capacity errors (#32286 ) Some integration tests were failing	2025-07-28 13:58:23 -04:00

1 2 3 4 5

217 Commits