langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 18:50:33 +00:00

Author	SHA1	Message	Date
Towseef Altaf	d27fb0c432	feat(langchain,openai): add strict flag to ProviderStrategy structured output (#34149 )	2025-12-10 15:35:23 -05:00
ccurme	69dd39c461	fix(anthropic): ignore null values of caller on tool_use blocks (#34286 )	2025-12-10 13:13:02 -05:00
ccurme	5350967ddc	feat(anthropic): support mcp_toolset in bind_tools (#34284 )	2025-12-10 14:39:35 +00:00
Mason Daugherty	7542278997	feat(core,anthropic): `extras` on `BaseTool` (#34120 )	2025-12-10 09:37:14 -05:00
Mason Daugherty	ff6e3558d7	docs(fireworks,groq,huggingface,mistralai,ollama,openai): x-ref `convert_to_openai_tool` (#34276 )	2025-12-09 19:51:04 -05:00
Mason Daugherty	dff229d018	fix(openai): add missing `tools` param to `ChatOpenAI` `with_structured_output` (#34075 )	2025-12-08 15:47:31 -05:00
Mason Daugherty	2faed37ff1	feat(anthropic): document and test fine grained tool streaming (#34118 ) https://platform.claude.com/docs/en/agents-and-tools/tool-use/fine-grained-tool-streaming	2025-12-08 15:34:56 -05:00
Mason Daugherty	91d5ca275d	feat(anthropic): use model profile for max output tokens (#34163 ) Need(?) to adjust tests to also pull from model profile? currently hardcoded	2025-12-08 15:31:16 -05:00
Mason Daugherty	dcb670f395	feat(anthropic): auto append relevant beta headers for computer use (#34117 ) in addition to documenting it https://platform.claude.com/docs/en/agents-and-tools/tool-use/computer-use-tool	2025-12-08 15:25:36 -05:00
Mason Daugherty	8a5f46322b	feat(anthropic): tool search support (#34119 )	2025-12-08 10:46:37 -05:00
ccurme	b5efafe80c	release(openai): 1.1.1 (#34252 )	2025-12-08 09:23:13 -05:00
Marlene	ff3353f02f	fix(openai): Fixing error that comes up using the Responses API with built-in tools and custom tools (#34136 )	2025-12-08 09:10:44 -05:00
Mason Daugherty	3ace4e3680	docs(core,groq,openai): nits for ref docs (#34243 )	2025-12-07 19:45:38 -05:00
Mason Daugherty	4a42158e6c	feat(anthropic): add `effort` support (#34116 )	2025-12-05 13:44:42 -05:00
Mason Daugherty	7ba3e80057	test(openai): mark `test_structured_output_and_tools` flaky (#34223 ) Often raises `KeyError: 'explanation'`	2025-12-05 11:26:17 -05:00
Sydney Runkle	78c10f8790	chore: update core dep in lockfiles (#34216 )	2025-12-04 15:30:42 -05:00
Mason Daugherty	b7091d391d	feat(anthropic): auto append relevant beta headers (#34113 )	2025-12-01 12:20:41 -05:00
ccurme	7549845d82	chore(anthropic): vcr integration test (#34160 )	2025-12-01 15:28:28 +00:00
Mason Daugherty	0a6d01e61d	docs(anthropic,core,langchain): updates (#34106 )	2025-11-25 17:58:09 -05:00
Mason Daugherty	c6f8b0875a	style(core,langchain,qdrant): fix some docstrings for refs (#34105 )	2025-11-25 13:58:53 -05:00
ccurme	880652b713	release: (integration packages): 1.1 (#34088 )	2025-11-24 10:00:06 -05:00
Sydney Runkle	4ab94579ad	feat(langchain): support `SystemMessage` in `create_agent`'s `system_prompt` (#34055 ) * `create_agent`'s `system_prompt` allows `str \| SystemMessage` * added `system_message: SystemMessage` on `ModelRequest` * `ModelRequest.system_prompt` is a function of `system_message.text`, now deprecated * disallow setting `system_prompt` and `system_message` * `ModelRequest.system_prompt` can still be set (w/ custom setattr) for custom backwards compat, but the updates just get propogated to the `ModelRequest.system_message` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-11-24 14:53:57 +00:00
ccurme	eb0545a173	release: (integration packages) 1.1 (#34087 )	2025-11-24 09:13:01 -05:00
ccurme	a2e389de9f	release(fireworks): 1.1 (#34086 )	2025-11-24 09:05:43 -05:00
Abhinav	2ba3ce81a6	fix(openai): make GPT-5 temperature validation case-insensitive (#34012 ) Fixed a bug where GPT-5 temperature validation was case-sensitive, causing issues when users specified Azure deployment names or model names in uppercase (e.g., `"GPT-5-2025-01-01"`, `"GPT-5-NANO"`). The validation now correctly handles model names regardless of case. Changes made: - Updated `validate_temperature()` method in `BaseChatOpenAI` to perform case-insensitive model name comparisons - Updated `_get_encoding_model()` method to use case-insensitive checks for tiktoken encoder selection - Added comprehensive unit tests to verify case-insensitive behavior with various case combinations Issue: Fixes #34003 Dependencies: None Test Coverage: - All existing tests pass - New test `test_gpt_5_temperature_case_insensitive` covers uppercase, lowercase, and mixed-case model names - Tests verify both non-chat GPT-5 models (temperature removed) and chat models (temperature preserved) - Lint and format checks pass (`make lint`, `make format`) --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-23 20:17:03 -05:00
Mason Daugherty	cbaea351b2	style(core,langchain-classic,openai): fix griffe warnings (#34074 )	2025-11-23 01:06:46 -05:00
ccurme	0915682c12	chore(fireworks): update tested models (#34070 )	2025-11-22 16:50:49 -05:00
Mason Daugherty	47b79c30c0	chore(docs): fix a few refs syntax errors (#34044 ) missing whitespace for some admonitions	2025-11-22 00:58:21 -05:00
ccurme	33e5d01f7c	feat(model-profiles): distribute data across packages (#34024 )	2025-11-21 15:47:05 -05:00
Sydney Runkle	b7d1831f9d	fix: deprecate `setattr` on `ModelCallRequest` (#34022 ) * one alternative considered was setting `frozen=True` on the dataclass, but this is breaking, so a deprecation is a nicer approach	2025-11-19 11:08:55 -05:00
ccurme	328ba36601	chore(openai): skip Azure text completions tests (#34021 )	2025-11-19 09:29:12 -05:00
ccurme	990e346c46	release(anthropic): 1.1 (#33997 )	2025-11-17 16:24:29 -05:00
ccurme	9b7792631d	feat(anthropic): support native structured output feature and strict tool calling (#33980 )	2025-11-17 16:14:20 -05:00
Mason Daugherty	52b1516d44	style(langchain): fix some middleware ref syntax (#33988 )	2025-11-16 00:33:17 -05:00
Mason Daugherty	8a3bb73c05	release(openai): 1.0.3 (#33981 ) - Respect 300k token limit for embeddings API requests #33668 - fix create_agent / response_format for Responses API #33939 - fix response.incomplete event is not handled when using stream_mode=['messages'] #33871	2025-11-14 19:18:50 -05:00
Mason Daugherty	099c042395	refactor(openai): embedding utils and calculations (#33982 ) Now returns (`_iter`, `tokens`, `indices`, token_counts`). The `token_counts` are calculated directly during tokenization, which is more accurate and efficient than splitting strings later.	2025-11-14 19:18:37 -05:00
Kaparthy Reddy	2d4f00a451	fix(openai): Respect 300k token limit for embeddings API requests (#33668 ) ## Description Fixes #31227 - Resolves the issue where `OpenAIEmbeddings` exceeds OpenAI's 300,000 token per request limit, causing 400 BadRequest errors. ## Problem When embedding large document sets, LangChain would send batches containing more than 300,000 tokens in a single API request, causing this error: ``` openai.BadRequestError: Error code: 400 - {'error': {'message': 'Requested 673477 tokens, max 300000 tokens per request'}} ``` The issue occurred because: - The code chunks texts by `embedding_ctx_length` (8191 tokens per chunk) - Then batches chunks by `chunk_size` (default 1000 chunks per request) - But didn't check: Total tokens per batch against OpenAI's 300k limit - Result: `1000 chunks × 8191 tokens = 8,191,000 tokens` → Exceeds limit! ## Solution This PR implements dynamic batching that respects the 300k token limit: 1. Added constant: `MAX_TOKENS_PER_REQUEST = 300000` 2. Track token counts: Calculate actual tokens for each chunk 3. Dynamic batching: Instead of fixed `chunk_size` batches, accumulate chunks until approaching the 300k limit 4. Applied to both sync and async: Fixed both `_get_len_safe_embeddings` and `_aget_len_safe_embeddings` ## Changes - Modified `langchain_openai/embeddings/base.py`: - Added `MAX_TOKENS_PER_REQUEST` constant - Replaced fixed-size batching with token-aware dynamic batching - Applied to both sync (line ~478) and async (line ~527) methods - Added test in `tests/unit_tests/embeddings/test_base.py`: - `test_embeddings_respects_token_limit()` - Verifies large document sets are properly batched ## Testing All existing tests pass (280 passed, 4 xfailed, 1 xpassed). New test verifies: - Large document sets (500 texts × 1000 tokens = 500k tokens) are split into multiple API calls - Each API call respects the 300k token limit ## Usage After this fix, users can embed large document sets without errors: ```python from langchain_openai import OpenAIEmbeddings from langchain_chroma import Chroma from langchain_text_splitters import CharacterTextSplitter # This will now work without exceeding token limits embeddings = OpenAIEmbeddings() documents = CharacterTextSplitter().split_documents(large_documents) Chroma.from_documents(documents, embeddings) ``` Resolves #31227 --------- Co-authored-by: Kaparthy Reddy <kaparthyreddy@Kaparthys-MacBook-Air.local> Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-14 18:12:07 -05:00
Sydney Runkle	1bc88028e6	fix(anthropic): execute bash + file tools via tool node (#33960 ) * use `override` instead of directly patching things on `ModelRequest` * rely on `ToolNode` for execution of tools related to said middleware, using `wrap_model_call` to inject the relevant claude tool specs + allowing tool node to forward them along to corresponding langchain tool implementations * making the same change for the native shell tool middleware * allowing shell tool middleware to specify a name for the shell tool (negative diff then for claude bash middleware) long term I think the solution might be to attach metadata to a tool to map the provider spec to a langchain implementation, which we could also take some lessons from on the MCP front.	2025-11-14 13:17:01 -05:00
Sydney Runkle	83c078f363	fix: adding missing async hooks (#33957 ) * filling in missing async gaps * using recommended tool runtime injection instead of injected state * updating tests to use helper function as well	2025-11-14 09:13:39 -05:00
Mason Daugherty	ee19a30dde	fix(groq): bump min ver for `core` dep (#33949 ) Due to issue with unit tests and docs URL for exceptions	2025-11-13 11:46:54 -05:00
Mason Daugherty	5d799b3174	release(nomic): 1.0.1 (#33948 ) support Python 3.14 #33655	2025-11-13 11:25:39 -05:00
Mason Daugherty	8f33a985a2	release(groq): 1.0.1 (#33947 ) - fix: handle tool calls with no args #33896 - add prompt caching token usage details #33708	2025-11-13 11:25:00 -05:00
Mason Daugherty	78eeccef0e	release(deepseek): 1.0.1 (#33946 ) - support strict beta structured output #32727	2025-11-13 11:24:39 -05:00
ccurme	3d415441e8	fix(langchain, openai): backward compat for response_format (#33945 )	2025-11-13 11:11:35 -05:00
ccurme	74385e0ebd	fix(langchain, openai): fix create_agent / response_format for Responses API (#33939 )	2025-11-13 10:18:15 -05:00
ccurme	fbe32c8e89	release(anthropic): 1.0.3 (#33935 )	2025-11-12 10:55:28 -05:00
Mohammad Mohtashim	2511c28f92	feat(anthropic): support code_execution_20250825 (#33925 )	2025-11-12 10:44:51 -05:00
Mason Daugherty	3dfea96ec1	chore: update `README.md` files (#33919 )	2025-11-10 22:51:35 -05:00
Mason Daugherty	69c7d1b01b	test(groq,openai): add retries for flaky tests (#33914 )	2025-11-10 10:36:11 -05:00
Shahroz Ahmad	31b5e4810c	feat(deepseek): support `strict` beta structured output (#32727 ) Description: This PR adds support for DeepSeek's beta strict mode feature for structured outputs and tool calling. It overrides `bind_tools()` and `with_structured_output()` to automatically use DeepSeek's beta endpoint (https://api.deepseek.com/beta) when `strict=True`. Both methods need overriding because they're independent entry points and user can call either directly. When DeepSeek's strict mode graduates from beta, we can just remove both overriden methods. You can read more about the beta feature here: https://api-docs.deepseek.com/guides/function_calling#strict-mode-beta Issue: Implements #32670 Dependencies: None Sample Code ```python from langchain_deepseek import ChatDeepSeek from pydantic import BaseModel, Field from typing import Optional import os # Enter your DeepSeek API Key here API_KEY = "YOUR_API_KEY" # location, temperature, condition are required fields # humidity is optional field with default value class WeatherInfo(BaseModel): location: str = Field(description="City name") temperature: int = Field(description="Temperature in Celsius") condition: str = Field(description="Weather condition (sunny, cloudy, rainy)") humidity: Optional[int] = Field(default=None, description="Humidity percentage") llm = ChatDeepSeek( model="deepseek-chat", api_key=API_KEY, ) # just to confirm that a new instance will use the default base url (instead of beta) print(f"Default API base: {llm.api_base}") # Test 1: bind_tools with strict=True shoud list all the tools calls print("\nTest 1: bind_tools with strict=True") llm_with_tools = llm.bind_tools([WeatherInfo], strict=True) response = llm_with_tools.invoke("Tell me the weather in New York. It's 22 degrees, sunny.") print(response.tool_calls) # Test 2: with_structured_output with strict=True print("\nTest 2: with_structured_output with strict=True") structured_llm = llm.with_structured_output(WeatherInfo, strict=True) result = structured_llm.invoke("Tell me the weather in New York.") print(f" Result: {result}") assert isinstance(result, WeatherInfo), "Result should be a WeatherInfo instance" ``` --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-09 22:24:33 -05:00

1 2 3 4 5 ...

1682 Commits