langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 10:17:00 +00:00

Author	SHA1	Message	Date
Mason Daugherty	099c042395	refactor(openai): embedding utils and calculations (#33982 ) Now returns (`_iter`, `tokens`, `indices`, `token_counts`). The `token_counts` are calculated directly during tokenization, which is more accurate and efficient than splitting strings later.	2025-11-14 19:18:37 -05:00
Kaparthy Reddy	2d4f00a451	fix(openai): Respect 300k token limit for embeddings API requests (#33668 )	2025-11-14 18:12:07 -05:00

## Description

Fixes #31227 - Resolves the issue where `OpenAIEmbeddings` exceeds OpenAI's 300,000 token per request limit, causing 400 BadRequest errors.

## Problem

When embedding large document sets, LangChain would send batches containing more than 300,000 tokens in a single API request, causing this error:

```
openai.BadRequestError: Error code: 400 - {'error': {'message': 'Requested 673477 tokens, max 300000 tokens per request'}}
```

The issue occurred because:

- The code chunks texts by `embedding_ctx_length` (8191 tokens per chunk)
- Then batches chunks by `chunk_size` (default 1000 chunks per request)
- But didn't check: Total tokens per batch against OpenAI's 300k limit
- Result: `1000 chunks × 8191 tokens = 8,191,000 tokens` → Exceeds limit!

## Solution

This PR implements dynamic batching that respects the 300k token limit:

1. Added constant: `MAX_TOKENS_PER_REQUEST = 300000`
2. Track token counts: Calculate actual tokens for each chunk
3. Dynamic batching: Instead of fixed `chunk_size` batches, accumulate chunks until approaching the 300k limit
4. Applied to both sync and async: Fixed both `_get_len_safe_embeddings` and `_aget_len_safe_embeddings`

## Changes

- Modified `langchain_openai/embeddings/base.py`:
  - Added `MAX_TOKENS_PER_REQUEST` constant
  - Replaced fixed-size batching with token-aware dynamic batching
  - Applied to both sync (line ~478) and async (line ~527) methods
- Added test in `tests/unit_tests/embeddings/test_base.py`:
  - `test_embeddings_respects_token_limit()` - Verifies large document sets are properly batched

## Testing

All existing tests pass (280 passed, 4 xfailed, 1 xpassed).

New test verifies:

- Large document sets (500 texts × 1000 tokens = 500k tokens) are split into multiple API calls
- Each API call respects the 300k token limit

## Usage

After this fix, users can embed large document sets without errors:

```python
from langchain_openai import OpenAIEmbeddings
from langchain_chroma import Chroma
from langchain_text_splitters import CharacterTextSplitter

# This will now work without exceeding token limits
embeddings = OpenAIEmbeddings()
documents = CharacterTextSplitter().split_documents(large_documents)
Chroma.from_documents(documents, embeddings)
```

Resolves #31227

---------

Co-authored-by: Kaparthy Reddy <kaparthyreddy@Kaparthys-MacBook-Air.local>
Co-authored-by: Chester Curme <chester.curme@gmail.com>
Co-authored-by: Mason Daugherty <mason@langchain.dev>
Co-authored-by: Mason Daugherty <github@mdrxy.com>

ccurme	3d415441e8	fix(langchain, openai): backward compat for response_format (#33945 )	2025-11-13 11:11:35 -05:00
ccurme	74385e0ebd	fix(langchain, openai): fix create_agent / response_format for Responses API (#33939 )	2025-11-13 10:18:15 -05:00
riunyfir	1b77a191f4	feat: The response.incomplete event is not handled when using stream_mode=['messages'] (#33871 )	2025-11-07 09:46:11 -05:00
Mason Daugherty	e023201d42	style: some cleanup (#33857 )	2025-11-06 23:50:46 -05:00
Mason Daugherty	d40e340479	chore: attribute package change versions (#33854 ) Needed to disambiguate for within inherited docs	2025-11-06 16:57:30 -05:00
Mason Daugherty	dfb05a7fa0	style: refs pass (#33813 )	2025-11-03 22:11:10 -05:00
Mason Daugherty	123e29dc26	style: more refs fixes (#33730 )	2025-10-29 16:34:46 -04:00
Mason Daugherty	a2a9a02ecb	style(core): more cleanup all around (#33711 )	2025-10-28 22:58:19 -04:00
Mason Daugherty	f94108b4bc	fix: links (#33691 ) * X-ref to new docs * Formatting updates	2025-10-27 19:04:29 -04:00
Marlene	78175fcb96	feat(openai): add callable support for openai_api_key parameter (#33532 )	2025-10-21 11:16:02 -04:00
Mason Daugherty	241a382fba	docs: fix Anthropic, OpenAI docstrings (#33566 ) minor	2025-10-17 11:18:32 -04:00
Mason Daugherty	1d2273597a	docs: more fixes for refs (#33554 )	2025-10-16 22:54:16 -04:00
Mason Daugherty	15db024811	chore: more sweeping (#33533 ) more fixes for refs	2025-10-16 15:44:56 -04:00
Jacob Lee	6d73003b17	feat(openai): Populate OpenAI service tier token details (#32721 )	2025-10-16 15:14:57 -04:00
Mason Daugherty	26e0a00c4c	style: more work for refs (#33508 ) Largely: - Remove explicit `"Default is x"` since new refs show default inferred from sig - Inline code (useful for eventual parsing) - Fix code block rendering (indentations)	2025-10-15 18:46:55 -04:00
Nuno Campos	0788461abd	feat(openai): Add openai moderation middleware (#33492 )	2025-10-15 13:59:49 -04:00
Mason Daugherty	291a9fcea1	style: `llm` -> `model` (#33423 )	2025-10-10 13:19:13 -04:00
Mason Daugherty	6fc21afbc9	style: `.. code-block::` admonition translations (#33400 ) biiiiiiiiiiiiiiiigggggggg pass	2025-10-09 16:52:58 -04:00
Mason Daugherty	d8a680ee57	style: address Sphinx double-backtick snippet syntax (#33389 )	2025-10-09 13:35:51 -04:00
Mason Daugherty	3576e690fa	chore: update Sphinx links to markdown (#33386 )	2025-10-09 11:54:14 -04:00
ccurme	c27271f3ae	fix(openai): update file index key name (#33350 )	2025-10-09 13:15:27 +00:00
Mason Daugherty	b6132fc23e	style: remove more `Optional` syntax (#33371 )	2025-10-08 23:28:43 -04:00
Mason Daugherty	31eeb50ce0	chore: drop UP045 (#33362 ) Python 3.9 EOL	2025-10-08 21:17:53 -04:00
Mason Daugherty	d13823043d	style: monorepo pass for refs (#33359 ) * Delete some double backticks previously used by Sphinx (not done everywhere yet) * Fix some code blocks / dropdowns Ignoring CLI CI for now	2025-10-08 18:41:39 -04:00
Mason Daugherty	6b9b177b89	chore(openai): `base.py` ref pass (#33355 )	2025-10-08 16:08:52 -04:00
ccurme	de48e102c4	fix(core,openai,anthropic): delegate to core implementation on invoke when streaming=True (#33308 )	2025-10-06 15:54:55 -04:00
ccurme	95a451ef2c	fix(openai): disable stream_usage in chat completions if OPENAI_BASE_URL is set (#33298 ) This env var is used internally by the OpenAI client.	2025-10-06 10:14:43 -04:00
ccurme	c8636a626a	chore(openai): (v1) fix sort order of mcp call keys (#33295 )	2025-10-06 09:29:41 -04:00
ccurme	4e50ec4b98	feat(openai): enable stream_usage when using default base URL and client (#33205 )	2025-10-06 08:56:38 -04:00
Mason Daugherty	8e7cd85431	style: drop `target-version = "py39"` for OpenAI, Anthropic, HuggingFace (#33287 )	2025-10-06 03:29:34 +00:00
Mason Daugherty	ae5b105d11	docs: v1 docs updates (#33173 ) Co-authored-by: Mohammad Mohtashim <45242107+keenborder786@users.noreply.github.com> Co-authored-by: Caspar Broekhuizen <caspar@langchain.dev> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Christophe Bornet <cbornet@hotmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Sadra Barikbin <sadraqazvin1@yahoo.com> Co-authored-by: Vadym Barda <vadim.barda@gmail.com>	2025-10-02 18:46:26 -04:00
Mason Daugherty	eaa6dcce9e	release: v1.0.0 (#32567 ) Co-authored-by: Mohammad Mohtashim <45242107+keenborder786@users.noreply.github.com> Co-authored-by: Caspar Broekhuizen <caspar@langchain.dev> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Christophe Bornet <cbornet@hotmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Sadra Barikbin <sadraqazvin1@yahoo.com> Co-authored-by: Vadym Barda <vadim.barda@gmail.com>	2025-10-02 10:49:42 -04:00
ccurme	64141072a3	feat(openai): support openai sdk 2.0 (#33168 )	2025-09-30 16:34:00 -04:00
Mason Daugherty	986302322f	docs: more standardization (#33124 )	2025-09-25 20:46:20 -04:00
Mason Daugherty	5bea28393d	docs: standardize `.. code-block` directive usage (#33122 ) and fix typos	2025-09-25 16:49:56 -04:00
Mason Daugherty	9f6431924f	feat(openai): add `max_tokens` to `AzureChatOpenAI` (#32959 ) Fixes #32949 This pattern is [present in `ChatOpenAI`](https://github.com/langchain-ai/langchain/blob/master/libs/partners/openai/langchain_openai/chat_models/base.py#L2821) but wasn't carried over to Azure. [CI](https://github.com/langchain-ai/langchain/actions/runs/17741751797/job/50417180998)	2025-09-15 14:09:20 -04:00
Matthew Lapointe	b1f08467cd	feat(core): allow overriding `ls_model_name` from kwargs (#32541 )	2025-09-11 16:18:06 -04:00
Aasish	9c7d262ff4	fix(openai): update `AzureOpenAIEmbeddings` validation logic for `openai_api_base` (#31782 )	2025-09-10 14:53:30 -04:00
Mason Daugherty	4c6af2d1b2	fix(openai): structured output (#32551 )	2025-09-09 11:37:50 -04:00
Sadiq Khan	228fbac3a6	fix(openai): handle `AIMessage`s without `response_id` in `_get_last_messages` (#32824 )	2025-09-08 10:12:50 -04:00
JunHyungKang	6ea06ca972	fix(openai): Fix Azure OpenAI Responses API model field issue (#32649 )	2025-09-08 10:08:35 -04:00
ccurme	5b0a55ad35	chore(openai): apply formatting changes to AzureChatOpenAI (#32848 )	2025-09-08 09:54:20 -04:00
Shahroz Ahmad	4828a85ab0	feat(core): add `web_search` in OpenAI tools list (#32738 )	2025-09-02 21:57:25 +00:00
Ravirajsingh Sodha	b42dac5fe6	docs: standardize `OllamaLLM` and `BaseOpenAI` docstrings (#32758 ) - Add comprehensive docstring following LangChain standards - Include Setup, Key init args, Instantiate, Invoke, Stream, and Async sections - Provide detailed parameter descriptions and code examples - Fix linting issues for code formatting compliance Contributes to #24803 --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-08-31 17:45:56 -05:00
Jacob Lee	1459d4f4ce	fix(openai): Always add raw response object to OpenAI client errors for invoke (#32655 )	2025-08-26 09:59:25 -04:00
Alex Naidis	21f7a9a9e5	fix(openai): allow temperature parameter for gpt-5-chat models (#32624 )	2025-08-21 16:40:10 -04:00
sa411022	61bc1bf9cc	fix(openai): construct responses api input (#32557 )	2025-08-21 15:56:29 -04:00
Shahrukh Shaik	4ba222148d	fix(openai): Chat Message `Annotations` defaults to `[ ]` if not list or None (#32614 )	2025-08-21 15:30:12 -04:00
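
The token-aware batching described in commit 2d4f00a451 (accumulate chunks until the next one would push the request past the 300k token limit) can be illustrated with a minimal sketch. This is not the code in `langchain_openai/embeddings/base.py`: the function name `batch_by_token_budget` and its `max_items` parameter are hypothetical, and it assumes per-chunk token counts have already been computed during tokenization, as commit 099c042395 describes.

```python
from typing import Iterator

# Limit quoted in the commit message for 2d4f00a451.
MAX_TOKENS_PER_REQUEST = 300_000


def batch_by_token_budget(
    token_counts: list[int],
    max_tokens: int = MAX_TOKENS_PER_REQUEST,
    max_items: int = 1000,  # hypothetical stand-in for the `chunk_size` cap
) -> Iterator[tuple[int, int]]:
    """Yield (start, end) index ranges, end exclusive, one per API request.

    A batch is closed as soon as adding the next chunk would exceed
    `max_tokens` or the batch already holds `max_items` chunks.
    """
    start = 0
    running = 0
    for i, count in enumerate(token_counts):
        would_overflow = running + count > max_tokens
        batch_full = i - start >= max_items
        # Never emit an empty batch: a single oversized chunk still gets
        # its own request (chunks are assumed to be length-capped upstream).
        if i > start and (would_overflow or batch_full):
            yield start, i
            start, running = i, 0
        running += count
    if start < len(token_counts):
        yield start, len(token_counts)


if __name__ == "__main__":
    # Scenario from the commit's test description: 500 texts x 1000 tokens.
    counts = [1000] * 500
    for s, e in batch_by_token_budget(counts):
        print(s, e, sum(counts[s:e]))
```

With the 500 × 1000-token scenario from the commit message, the sketch produces two requests rather than one 500k-token request. Returning index ranges instead of copied sublists keeps the helper independent of how the chunks themselves are stored.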


288 Commits