langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-03-18 19:18:48 +00:00

Author	SHA1	Message	Date
Mason Daugherty	b7091d391d	feat(anthropic): auto append relevant beta headers (#34113 )	2025-12-01 12:20:41 -05:00
ccurme	7549845d82	chore(anthropic): vcr integration test (#34160 )	2025-12-01 15:28:28 +00:00
Mason Daugherty	0a6d01e61d	docs(anthropic,core,langchain): updates (#34106 )	2025-11-25 17:58:09 -05:00
Mason Daugherty	c6f8b0875a	style(core,langchain,qdrant): fix some docstrings for refs (#34105 )	2025-11-25 13:58:53 -05:00
ccurme	880652b713	release: (integration packages): 1.1 (#34088 )	2025-11-24 10:00:06 -05:00
Sydney Runkle	4ab94579ad	feat(langchain): support `SystemMessage` in `create_agent`'s `system_prompt` (#34055 ) * `create_agent`'s `system_prompt` allows `str | SystemMessage` * added `system_message: SystemMessage` on `ModelRequest` * `ModelRequest.system_prompt` is a function of `system_message.text`, now deprecated * disallow setting `system_prompt` and `system_message` * `ModelRequest.system_prompt` can still be set (w/ custom setattr) for custom backwards compat, but the updates just get propagated to the `ModelRequest.system_message` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-11-24 14:53:57 +00:00
ccurme	eb0545a173	release: (integration packages) 1.1 (#34087 )	2025-11-24 09:13:01 -05:00
ccurme	a2e389de9f	release(fireworks): 1.1 (#34086 )	2025-11-24 09:05:43 -05:00
Abhinav	2ba3ce81a6	fix(openai): make GPT-5 temperature validation case-insensitive (#34012 ) Fixed a bug where GPT-5 temperature validation was case-sensitive, causing issues when users specified Azure deployment names or model names in uppercase (e.g., `"GPT-5-2025-01-01"`, `"GPT-5-NANO"`). The validation now correctly handles model names regardless of case. Changes made: - Updated `validate_temperature()` method in `BaseChatOpenAI` to perform case-insensitive model name comparisons - Updated `_get_encoding_model()` method to use case-insensitive checks for tiktoken encoder selection - Added comprehensive unit tests to verify case-insensitive behavior with various case combinations Issue: Fixes #34003 Dependencies: None Test Coverage: - All existing tests pass - New test `test_gpt_5_temperature_case_insensitive` covers uppercase, lowercase, and mixed-case model names - Tests verify both non-chat GPT-5 models (temperature removed) and chat models (temperature preserved) - Lint and format checks pass (`make lint`, `make format`) --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-23 20:17:03 -05:00
Mason Daugherty	cbaea351b2	style(core,langchain-classic,openai): fix griffe warnings (#34074 )	2025-11-23 01:06:46 -05:00
ccurme	0915682c12	chore(fireworks): update tested models (#34070 )	2025-11-22 16:50:49 -05:00
Mason Daugherty	47b79c30c0	chore(docs): fix a few refs syntax errors (#34044 ) missing whitespace for some admonitions	2025-11-22 00:58:21 -05:00
ccurme	33e5d01f7c	feat(model-profiles): distribute data across packages (#34024 )	2025-11-21 15:47:05 -05:00
Sydney Runkle	b7d1831f9d	fix: deprecate `setattr` on `ModelCallRequest` (#34022 ) * one alternative considered was setting `frozen=True` on the dataclass, but this is breaking, so a deprecation is a nicer approach	2025-11-19 11:08:55 -05:00
ccurme	328ba36601	chore(openai): skip Azure text completions tests (#34021 )	2025-11-19 09:29:12 -05:00
ccurme	990e346c46	release(anthropic): 1.1 (#33997 )	2025-11-17 16:24:29 -05:00
ccurme	9b7792631d	feat(anthropic): support native structured output feature and strict tool calling (#33980 )	2025-11-17 16:14:20 -05:00
Mason Daugherty	52b1516d44	style(langchain): fix some middleware ref syntax (#33988 )	2025-11-16 00:33:17 -05:00
Mason Daugherty	8a3bb73c05	release(openai): 1.0.3 (#33981 ) - Respect 300k token limit for embeddings API requests #33668 - fix create_agent / response_format for Responses API #33939 - fix response.incomplete event is not handled when using stream_mode=['messages'] #33871	2025-11-14 19:18:50 -05:00
Mason Daugherty	099c042395	refactor(openai): embedding utils and calculations (#33982 ) Now returns (`_iter`, `tokens`, `indices`, `token_counts`). The `token_counts` are calculated directly during tokenization, which is more accurate and efficient than splitting strings later.	2025-11-14 19:18:37 -05:00
Kaparthy Reddy	2d4f00a451	fix(openai): Respect 300k token limit for embeddings API requests (#33668 ) ## Description Fixes #31227 - Resolves the issue where `OpenAIEmbeddings` exceeds OpenAI's 300,000 token per request limit, causing 400 BadRequest errors. ## Problem When embedding large document sets, LangChain would send batches containing more than 300,000 tokens in a single API request, causing this error: ``` openai.BadRequestError: Error code: 400 - {'error': {'message': 'Requested 673477 tokens, max 300000 tokens per request'}} ``` The issue occurred because: - The code chunks texts by `embedding_ctx_length` (8191 tokens per chunk) - Then batches chunks by `chunk_size` (default 1000 chunks per request) - But didn't check: Total tokens per batch against OpenAI's 300k limit - Result: `1000 chunks × 8191 tokens = 8,191,000 tokens` → Exceeds limit! ## Solution This PR implements dynamic batching that respects the 300k token limit: 1. Added constant: `MAX_TOKENS_PER_REQUEST = 300000` 2. Track token counts: Calculate actual tokens for each chunk 3. Dynamic batching: Instead of fixed `chunk_size` batches, accumulate chunks until approaching the 300k limit 4. Applied to both sync and async: Fixed both `_get_len_safe_embeddings` and `_aget_len_safe_embeddings` ## Changes - Modified `langchain_openai/embeddings/base.py`: - Added `MAX_TOKENS_PER_REQUEST` constant - Replaced fixed-size batching with token-aware dynamic batching - Applied to both sync (line ~478) and async (line ~527) methods - Added test in `tests/unit_tests/embeddings/test_base.py`: - `test_embeddings_respects_token_limit()` - Verifies large document sets are properly batched ## Testing All existing tests pass (280 passed, 4 xfailed, 1 xpassed). 
New test verifies: - Large document sets (500 texts × 1000 tokens = 500k tokens) are split into multiple API calls - Each API call respects the 300k token limit ## Usage After this fix, users can embed large document sets without errors: ```python from langchain_openai import OpenAIEmbeddings from langchain_chroma import Chroma from langchain_text_splitters import CharacterTextSplitter # This will now work without exceeding token limits embeddings = OpenAIEmbeddings() documents = CharacterTextSplitter().split_documents(large_documents) Chroma.from_documents(documents, embeddings) ``` Resolves #31227 --------- Co-authored-by: Kaparthy Reddy <kaparthyreddy@Kaparthys-MacBook-Air.local> Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-14 18:12:07 -05:00
Sydney Runkle	1bc88028e6	fix(anthropic): execute bash + file tools via tool node (#33960 ) * use `override` instead of directly patching things on `ModelRequest` * rely on `ToolNode` for execution of tools related to said middleware, using `wrap_model_call` to inject the relevant claude tool specs + allowing tool node to forward them along to corresponding langchain tool implementations * making the same change for the native shell tool middleware * allowing shell tool middleware to specify a name for the shell tool (negative diff then for claude bash middleware) long term I think the solution might be to attach metadata to a tool to map the provider spec to a langchain implementation, which we could also take some lessons from on the MCP front.	2025-11-14 13:17:01 -05:00
Sydney Runkle	83c078f363	fix: adding missing async hooks (#33957 ) * filling in missing async gaps * using recommended tool runtime injection instead of injected state * updating tests to use helper function as well	2025-11-14 09:13:39 -05:00
Mason Daugherty	ee19a30dde	fix(groq): bump min ver for `core` dep (#33949 ) Due to issue with unit tests and docs URL for exceptions	2025-11-13 11:46:54 -05:00
Mason Daugherty	5d799b3174	release(nomic): 1.0.1 (#33948 ) support Python 3.14 #33655	2025-11-13 11:25:39 -05:00
Mason Daugherty	8f33a985a2	release(groq): 1.0.1 (#33947 ) - fix: handle tool calls with no args #33896 - add prompt caching token usage details #33708	2025-11-13 11:25:00 -05:00
Mason Daugherty	78eeccef0e	release(deepseek): 1.0.1 (#33946 ) - support strict beta structured output #32727	2025-11-13 11:24:39 -05:00
ccurme	3d415441e8	fix(langchain, openai): backward compat for response_format (#33945 )	2025-11-13 11:11:35 -05:00
ccurme	74385e0ebd	fix(langchain, openai): fix create_agent / response_format for Responses API (#33939 )	2025-11-13 10:18:15 -05:00
ccurme	fbe32c8e89	release(anthropic): 1.0.3 (#33935 )	2025-11-12 10:55:28 -05:00
Mohammad Mohtashim	2511c28f92	feat(anthropic): support code_execution_20250825 (#33925 )	2025-11-12 10:44:51 -05:00
Mason Daugherty	3dfea96ec1	chore: update `README.md` files (#33919 )	2025-11-10 22:51:35 -05:00
Mason Daugherty	69c7d1b01b	test(groq,openai): add retries for flaky tests (#33914 )	2025-11-10 10:36:11 -05:00
Shahroz Ahmad	31b5e4810c	feat(deepseek): support `strict` beta structured output (#32727 ) Description: This PR adds support for DeepSeek's beta strict mode feature for structured outputs and tool calling. It overrides `bind_tools()` and `with_structured_output()` to automatically use DeepSeek's beta endpoint (https://api.deepseek.com/beta) when `strict=True`. Both methods need overriding because they're independent entry points and users can call either directly. When DeepSeek's strict mode graduates from beta, we can just remove both overridden methods. You can read more about the beta feature here: https://api-docs.deepseek.com/guides/function_calling#strict-mode-beta Issue: Implements #32670 Dependencies: None Sample Code ```python from langchain_deepseek import ChatDeepSeek from pydantic import BaseModel, Field from typing import Optional import os # Enter your DeepSeek API Key here API_KEY = "YOUR_API_KEY" # location, temperature, condition are required fields # humidity is optional field with default value class WeatherInfo(BaseModel): location: str = Field(description="City name") temperature: int = Field(description="Temperature in Celsius") condition: str = Field(description="Weather condition (sunny, cloudy, rainy)") humidity: Optional[int] = Field(default=None, description="Humidity percentage") llm = ChatDeepSeek( model="deepseek-chat", api_key=API_KEY, ) # just to confirm that a new instance will use the default base url (instead of beta) print(f"Default API base: {llm.api_base}") # Test 1: bind_tools with strict=True should list all the tool calls print("\nTest 1: bind_tools with strict=True") llm_with_tools = llm.bind_tools([WeatherInfo], strict=True) response = llm_with_tools.invoke("Tell me the weather in New York. It's 22 degrees, sunny.") print(response.tool_calls) # Test 2: with_structured_output with strict=True print("\nTest 2: with_structured_output with strict=True") structured_llm = llm.with_structured_output(WeatherInfo, strict=True) result = structured_llm.invoke("Tell me the weather in New York.") print(f" Result: {result}") assert isinstance(result, WeatherInfo), "Result should be a WeatherInfo instance" ``` --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-09 22:24:33 -05:00
AmazingcatAndrew	1b563067f8	fix(chroma): resolve OpenCLIP + Chroma image embedding test regression (#33899 ) Description: Fixes the OpenCLIP × Chroma regression that caused nested embedding errors when adding or searching image data. The test case `test_openclip_chroma_embed_no_nesting_error` has been restored and verified to work correctly with the current LangChain core dependencies. Functional validation confirms that `similarity_search_by_image` now returns correct, metadata‑preserving results. Issue: Fixes #33851 Dependencies: No new dependencies introduced. Testing: All tests under ```bash uv run --group test pytest tests/unit_tests ``` result: ``` 30 passed in 91.26s (0:01:31) ``` have passed successfully using Python 3.13.9 and uv‑managed environment. This confirms that the regression has been fixed. Running ```bash make test ``` still produces cleanup‑time `AttributeError: 'ProactorEventLoop' object has no attribute '_ssock'` on Windows (Python 3.13+). This is a benign asyncio teardown message rather than a functional failure. `uv run pytest` closes event loops immediately after tests, while `make test` invokes pytest through a secondary process layer that leaves a background loop alive at interpreter shutdown. This difference in teardown behavior explains the extra messages seen only when using `make test`. Summary: - Verified the OpenCLIP + Chroma image pipeline works correctly. - `uv run --group test pytest` fully passes; the fix is complete. - The residual `_ssock` warnings occur only during Windows asyncio cleanup and are not related to this code change. This is my first time contributing code, please contact me with any questions --- --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-09 21:24:33 -05:00
Mason Daugherty	ab0677c6f1	fix(groq): handle tool calls with no args (#33896 ) When Groq returns tool calls with no arguments, it sends arguments: `'null'` (JSON null), but LangChain's core parsing expects either a dict or converts null to Python None, which fails the `isinstance(args_, dict)` check and incorrectly marks the tool call as invalid. Related to #32017	2025-11-08 22:30:44 -05:00
Mshari	9383b78be1	feat(groq): add prompt caching token usage details (#33708 ) Description: Adds support for prompt caching usage metadata in ChatGroq. The integration now captures cached token information from the Groq API response and includes it in the `input_token_details` field of the `usage_metadata`. Changes: - Created new `_create_usage_metadata()` helper function to centralize usage metadata creation logic - Extracts `cached_tokens` from `prompt_tokens_details` in API responses and maps to `input_token_details.cache_read` - Integrated the helper function in both streaming (`_convert_chunk_to_message_chunk`) and non-streaming (`_create_chat_result`) code paths - Added comprehensive unit tests to verify caching metadata handling and backward compatibility This enables users to monitor prompt caching effectiveness when using Groq models with prompt caching enabled. Issue: N/A Dependencies: None --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-11-07 17:05:22 -05:00
ccurme	3c492571ab	release(anthropic): 1.0.2 (#33888 )	2025-11-07 16:47:25 -05:00
Azibek	d8b94007c1	fix(huggingface): pass llm params to `ChatHuggingFace` (#32368 ) This PR fixes #32234 and improves HuggingFace chat model integration by: Ensuring ChatHuggingFace inherits key parameters (temperature, max_tokens, top_p, streaming, etc.) from the underlying LLM when not explicitly set. Adding and updating unit tests to verify property inheritance. No breaking changes; these updates enhance reliability and maintainability. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-07 14:29:15 -05:00
Abhinav	0861cba04b	fix(chroma): pydantic validation error when using `retriever.invoke()` (#31377 )	2025-11-07 10:59:16 -05:00
Lê Nam Khánh	9d98c1b669	docs: fix typos in libs/partners/groq/langchain_groq/chat_models.py (#33878 )	2025-11-07 10:31:35 -05:00
Mohammad Mohtashim	65716cf590	feat(perplexity): Created Dedicated Output Parser to Support Reasoning Model Output for perplexity (#33670 )	2025-11-07 10:17:35 -05:00
riunyfir	1b77a191f4	feat: The response.incomplete event is not handled when using stream_mode=['messages'] (#33871 )	2025-11-07 09:46:11 -05:00
Mason Daugherty	e023201d42	style: some cleanup (#33857 )	2025-11-06 23:50:46 -05:00
Mason Daugherty	d40e340479	chore: attribute package change versions (#33854 ) Needed to disambiguate for within inherited docs	2025-11-06 16:57:30 -05:00
Mason Daugherty	dfb05a7fa0	style: refs pass (#33813 )	2025-11-03 22:11:10 -05:00
ccurme	2f67f9ddcb	release(huggingface): 1.0.1 (#33803 )	2025-11-03 14:49:52 -05:00
Hyejeong Jo	0e36185933	fix(huggingface): add `stream_usage` support for `ChatHuggingFace` invoke/stream (#32708 )	2025-11-03 14:44:32 -05:00
Mason Daugherty	0a442644e3	test(anthropic): add vcr to `test_search_result_tool_message` (#33793 ) To fix nondeterministic results causing integration testing to sometimes fail Also speeds up from 10s to 0.5 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-11-03 15:13:30 +00:00
ccurme	81c4f21b52	fix(standard-tests): update multimodal tests (#33781 )	2025-11-01 16:38:20 -04:00
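
The message for commit 2d4f00a451 above describes replacing fixed-size batches with token-aware batching: chunks are accumulated until the next one would push the request past `MAX_TOKENS_PER_REQUEST = 300000`. The sketch below illustrates that idea in isolation. It is not the code from `langchain_openai/embeddings/base.py`; the function name `batch_by_token_budget` and the `max_chunks` parameter (standing in for `chunk_size`) are hypothetical, and token counts are assumed to be precomputed.

```python
from typing import Iterator

# Limit quoted in the commit message; the helper below is illustrative only.
MAX_TOKENS_PER_REQUEST = 300_000


def batch_by_token_budget(
    token_counts: list[int],
    max_tokens: int = MAX_TOKENS_PER_REQUEST,
    max_chunks: int = 1000,
) -> Iterator[list[int]]:
    """Yield lists of chunk indices whose summed token counts fit max_tokens.

    A single chunk larger than max_tokens is still emitted, alone in its batch;
    this sketch does not split or reject oversized chunks.
    """
    batch: list[int] = []
    batch_tokens = 0
    for index, count in enumerate(token_counts):
        # Close the current batch if this chunk would exceed either limit.
        over_tokens = batch_tokens + count > max_tokens
        over_chunks = len(batch) >= max_chunks
        if batch and (over_tokens or over_chunks):
            yield batch
            batch, batch_tokens = [], 0
        batch.append(index)
        batch_tokens += count
    if batch:
        yield batch


# Scenario from the commit's test description: 500 texts of 1000 tokens each.
batches = list(batch_by_token_budget([1000] * 500))
print([len(b) for b in batches])
```

With the 500 × 1000-token scenario the commit mentions, the 500k tokens are split across more than one request, each within the 300k budget.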

1666 Commits