langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 18:50:33 +00:00

Author	SHA1	Message	Date
ccurme	46dbb3967e	chore(anthropic): update test_tool_search cassette (#34297 )	2025-12-11 10:53:52 -05:00
Towseef Altaf	d27fb0c432	feat(langchain,openai): add strict flag to ProviderStrategy structured output (#34149 )	2025-12-10 15:35:23 -05:00
ccurme	69dd39c461	fix(anthropic): ignore null values of caller on tool_use blocks (#34286 )	2025-12-10 13:13:02 -05:00
ccurme	5350967ddc	feat(anthropic): support mcp_toolset in bind_tools (#34284 )	2025-12-10 14:39:35 +00:00
Mason Daugherty	7542278997	feat(core,anthropic): `extras` on `BaseTool` (#34120 )	2025-12-10 09:37:14 -05:00
Mason Daugherty	ff6e3558d7	docs(fireworks,groq,huggingface,mistralai,ollama,openai): x-ref `convert_to_openai_tool` (#34276 )	2025-12-09 19:51:04 -05:00
Mason Daugherty	dff229d018	fix(openai): add missing `tools` param to `ChatOpenAI` `with_structured_output` (#34075 )	2025-12-08 15:47:31 -05:00
Mason Daugherty	2faed37ff1	feat(anthropic): document and test fine grained tool streaming (#34118 ) https://platform.claude.com/docs/en/agents-and-tools/tool-use/fine-grained-tool-streaming	2025-12-08 15:34:56 -05:00
Mason Daugherty	91d5ca275d	feat(anthropic): use model profile for max output tokens (#34163 ) Need(?) to adjust tests to also pull from model profile? currently hardcoded	2025-12-08 15:31:16 -05:00
Mason Daugherty	dcb670f395	feat(anthropic): auto append relevant beta headers for computer use (#34117 ) in addition to documenting it https://platform.claude.com/docs/en/agents-and-tools/tool-use/computer-use-tool	2025-12-08 15:25:36 -05:00
Mason Daugherty	8a5f46322b	feat(anthropic): tool search support (#34119 )	2025-12-08 10:46:37 -05:00
ccurme	b5efafe80c	release(openai): 1.1.1 (#34252 )	2025-12-08 09:23:13 -05:00
Marlene	ff3353f02f	fix(openai): Fixing error that comes up using the Responses API with built-in tools and custom tools (#34136 )	2025-12-08 09:10:44 -05:00
Mason Daugherty	3ace4e3680	docs(core,groq,openai): nits for ref docs (#34243 )	2025-12-07 19:45:38 -05:00
Mason Daugherty	4a42158e6c	feat(anthropic): add `effort` support (#34116 )	2025-12-05 13:44:42 -05:00
Mason Daugherty	7ba3e80057	test(openai): mark `test_structured_output_and_tools` flaky (#34223 ) Often raises `KeyError: 'explanation'`	2025-12-05 11:26:17 -05:00
Sydney Runkle	78c10f8790	chore: update core dep in lockfiles (#34216 )	2025-12-04 15:30:42 -05:00
Mason Daugherty	b7091d391d	feat(anthropic): auto append relevant beta headers (#34113 )	2025-12-01 12:20:41 -05:00
ccurme	7549845d82	chore(anthropic): vcr integration test (#34160 )	2025-12-01 15:28:28 +00:00
Mason Daugherty	0a6d01e61d	docs(anthropic,core,langchain): updates (#34106 )	2025-11-25 17:58:09 -05:00
Mason Daugherty	c6f8b0875a	style(core,langchain,qdrant): fix some docstrings for refs (#34105 )	2025-11-25 13:58:53 -05:00
ccurme	880652b713	release: (integration packages): 1.1 (#34088 )	2025-11-24 10:00:06 -05:00
Sydney Runkle	4ab94579ad	feat(langchain): support `SystemMessage` in `create_agent`'s `system_prompt` (#34055 ) * `create_agent`'s `system_prompt` allows `str \| SystemMessage` * added `system_message: SystemMessage` on `ModelRequest` * `ModelRequest.system_prompt` is a function of `system_message.text`, now deprecated * disallow setting `system_prompt` and `system_message` * `ModelRequest.system_prompt` can still be set (w/ custom setattr) for custom backwards compat, but the updates just get propogated to the `ModelRequest.system_message` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-11-24 14:53:57 +00:00
ccurme	eb0545a173	release: (integration packages) 1.1 (#34087 )	2025-11-24 09:13:01 -05:00
ccurme	a2e389de9f	release(fireworks): 1.1 (#34086 )	2025-11-24 09:05:43 -05:00
Abhinav	2ba3ce81a6	fix(openai): make GPT-5 temperature validation case-insensitive (#34012 ) Fixed a bug where GPT-5 temperature validation was case-sensitive, causing issues when users specified Azure deployment names or model names in uppercase (e.g., `"GPT-5-2025-01-01"`, `"GPT-5-NANO"`). The validation now correctly handles model names regardless of case. Changes made: - Updated `validate_temperature()` method in `BaseChatOpenAI` to perform case-insensitive model name comparisons - Updated `_get_encoding_model()` method to use case-insensitive checks for tiktoken encoder selection - Added comprehensive unit tests to verify case-insensitive behavior with various case combinations Issue: Fixes #34003 Dependencies: None Test Coverage: - All existing tests pass - New test `test_gpt_5_temperature_case_insensitive` covers uppercase, lowercase, and mixed-case model names - Tests verify both non-chat GPT-5 models (temperature removed) and chat models (temperature preserved) - Lint and format checks pass (`make lint`, `make format`) --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-23 20:17:03 -05:00
Mason Daugherty	cbaea351b2	style(core,langchain-classic,openai): fix griffe warnings (#34074 )	2025-11-23 01:06:46 -05:00
ccurme	0915682c12	chore(fireworks): update tested models (#34070 )	2025-11-22 16:50:49 -05:00
Mason Daugherty	47b79c30c0	chore(docs): fix a few refs syntax errors (#34044 ) missing whitespace for some admonitions	2025-11-22 00:58:21 -05:00
ccurme	33e5d01f7c	feat(model-profiles): distribute data across packages (#34024 )	2025-11-21 15:47:05 -05:00
Sydney Runkle	b7d1831f9d	fix: deprecate `setattr` on `ModelCallRequest` (#34022 ) * one alternative considered was setting `frozen=True` on the dataclass, but this is breaking, so a deprecation is a nicer approach	2025-11-19 11:08:55 -05:00
ccurme	328ba36601	chore(openai): skip Azure text completions tests (#34021 )	2025-11-19 09:29:12 -05:00
ccurme	990e346c46	release(anthropic): 1.1 (#33997 )	2025-11-17 16:24:29 -05:00
ccurme	9b7792631d	feat(anthropic): support native structured output feature and strict tool calling (#33980 )	2025-11-17 16:14:20 -05:00
Mason Daugherty	52b1516d44	style(langchain): fix some middleware ref syntax (#33988 )	2025-11-16 00:33:17 -05:00
Mason Daugherty	8a3bb73c05	release(openai): 1.0.3 (#33981 ) - Respect 300k token limit for embeddings API requests #33668 - fix create_agent / response_format for Responses API #33939 - fix response.incomplete event is not handled when using stream_mode=['messages'] #33871	2025-11-14 19:18:50 -05:00
Mason Daugherty	099c042395	refactor(openai): embedding utils and calculations (#33982 ) Now returns (`_iter`, `tokens`, `indices`, token_counts`). The `token_counts` are calculated directly during tokenization, which is more accurate and efficient than splitting strings later.	2025-11-14 19:18:37 -05:00
Kaparthy Reddy	2d4f00a451	fix(openai): Respect 300k token limit for embeddings API requests (#33668 ) ## Description Fixes #31227 - Resolves the issue where `OpenAIEmbeddings` exceeds OpenAI's 300,000 token per request limit, causing 400 BadRequest errors. ## Problem When embedding large document sets, LangChain would send batches containing more than 300,000 tokens in a single API request, causing this error: ``` openai.BadRequestError: Error code: 400 - {'error': {'message': 'Requested 673477 tokens, max 300000 tokens per request'}} ``` The issue occurred because: - The code chunks texts by `embedding_ctx_length` (8191 tokens per chunk) - Then batches chunks by `chunk_size` (default 1000 chunks per request) - But didn't check: Total tokens per batch against OpenAI's 300k limit - Result: `1000 chunks × 8191 tokens = 8,191,000 tokens` → Exceeds limit! ## Solution This PR implements dynamic batching that respects the 300k token limit: 1. Added constant: `MAX_TOKENS_PER_REQUEST = 300000` 2. Track token counts: Calculate actual tokens for each chunk 3. Dynamic batching: Instead of fixed `chunk_size` batches, accumulate chunks until approaching the 300k limit 4. Applied to both sync and async: Fixed both `_get_len_safe_embeddings` and `_aget_len_safe_embeddings` ## Changes - Modified `langchain_openai/embeddings/base.py`: - Added `MAX_TOKENS_PER_REQUEST` constant - Replaced fixed-size batching with token-aware dynamic batching - Applied to both sync (line ~478) and async (line ~527) methods - Added test in `tests/unit_tests/embeddings/test_base.py`: - `test_embeddings_respects_token_limit()` - Verifies large document sets are properly batched ## Testing All existing tests pass (280 passed, 4 xfailed, 1 xpassed). New test verifies: - Large document sets (500 texts × 1000 tokens = 500k tokens) are split into multiple API calls - Each API call respects the 300k token limit ## Usage After this fix, users can embed large document sets without errors: ```python from langchain_openai import OpenAIEmbeddings from langchain_chroma import Chroma from langchain_text_splitters import CharacterTextSplitter # This will now work without exceeding token limits embeddings = OpenAIEmbeddings() documents = CharacterTextSplitter().split_documents(large_documents) Chroma.from_documents(documents, embeddings) ``` Resolves #31227 --------- Co-authored-by: Kaparthy Reddy <kaparthyreddy@Kaparthys-MacBook-Air.local> Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-14 18:12:07 -05:00
Sydney Runkle	1bc88028e6	fix(anthropic): execute bash + file tools via tool node (#33960 ) * use `override` instead of directly patching things on `ModelRequest` * rely on `ToolNode` for execution of tools related to said middleware, using `wrap_model_call` to inject the relevant claude tool specs + allowing tool node to forward them along to corresponding langchain tool implementations * making the same change for the native shell tool middleware * allowing shell tool middleware to specify a name for the shell tool (negative diff then for claude bash middleware) long term I think the solution might be to attach metadata to a tool to map the provider spec to a langchain implementation, which we could also take some lessons from on the MCP front.	2025-11-14 13:17:01 -05:00
Sydney Runkle	83c078f363	fix: adding missing async hooks (#33957 ) * filling in missing async gaps * using recommended tool runtime injection instead of injected state * updating tests to use helper function as well	2025-11-14 09:13:39 -05:00
Mason Daugherty	ee19a30dde	fix(groq): bump min ver for `core` dep (#33949 ) Due to issue with unit tests and docs URL for exceptions	2025-11-13 11:46:54 -05:00
Mason Daugherty	5d799b3174	release(nomic): 1.0.1 (#33948 ) support Python 3.14 #33655	2025-11-13 11:25:39 -05:00
Mason Daugherty	8f33a985a2	release(groq): 1.0.1 (#33947 ) - fix: handle tool calls with no args #33896 - add prompt caching token usage details #33708	2025-11-13 11:25:00 -05:00
Mason Daugherty	78eeccef0e	release(deepseek): 1.0.1 (#33946 ) - support strict beta structured output #32727	2025-11-13 11:24:39 -05:00
ccurme	3d415441e8	fix(langchain, openai): backward compat for response_format (#33945 )	2025-11-13 11:11:35 -05:00
ccurme	74385e0ebd	fix(langchain, openai): fix create_agent / response_format for Responses API (#33939 )	2025-11-13 10:18:15 -05:00
ccurme	fbe32c8e89	release(anthropic): 1.0.3 (#33935 )	2025-11-12 10:55:28 -05:00
Mohammad Mohtashim	2511c28f92	feat(anthropic): support code_execution_20250825 (#33925 )	2025-11-12 10:44:51 -05:00
Mason Daugherty	3dfea96ec1	chore: update `README.md` files (#33919 )	2025-11-10 22:51:35 -05:00
Mason Daugherty	69c7d1b01b	test(groq,openai): add retries for flaky tests (#33914 )	2025-11-10 10:36:11 -05:00

1 2 3 4 5 ...

1783 Commits