langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-07-13 12:14:06 +00:00

Author	SHA1	Message	Date
Mason Daugherty	3108b14164	docs(standard-tests): fix `supports_json_mode` docstring (#34181 )	2025-12-03 00:12:57 -05:00
Mason Daugherty	1922adc092	docs(standard-tests): fix formatting bug, rearrange admonition (#34180 )	2025-12-02 23:40:11 -05:00
Mason Daugherty	4a242a8a4f	docs(standard-tests): enrich doc to indicate missing default values (#34179 )	2025-12-02 23:32:21 -05:00
Mason Daugherty	064b37f90e	docs(standard-tests): improve doc for `structured_output_kwargs` and `supports_json_mode` (#34178 )	2025-12-02 23:18:53 -05:00
Mason Daugherty	062678fa18	fix(standard-tests): fix broken links (#34175 )	2025-12-02 20:52:27 -05:00
Mason Daugherty	5d3e3d3f31	fix(standard-tests): remove broken code block docstring title (#34173 )	2025-12-02 20:18:31 -05:00
Mason Daugherty	5a7cf87626	style(standard-tests): some fencing (#34171 )	2025-12-02 14:42:26 -05:00
ccurme	c63f23d233	revert(model-profiles): update docs link (#34162 )	2025-12-01 17:29:45 +00:00
Mason Daugherty	b7091d391d	feat(anthropic): auto append relevant beta headers (#34113 )	2025-12-01 12:20:41 -05:00
ccurme	7a2952210e	fix(langchain): (SummarizationMiddleware) adjust token counts based on model (#34161 )	2025-12-01 16:22:44 +00:00
ccurme	7549845d82	chore(anthropic): vcr integration test (#34160 )	2025-12-01 15:28:28 +00:00
Mason Daugherty	878f033ed7	docs(langchain): docstrings for summariziation middleware types (#34158 ) improving devx :)	2025-12-01 09:39:33 -05:00
Steffen Hausmann	4065106c2e	fix(langchain): add types to human_in_the_loop middleware (#34137 ) The `HumanInTheLoopMiddleware` is missing a type annotation for the context schema. Without the fix in this PR, the following code does not type check: ``` graph = create_agent( "gpt-5", tools=[send_email_tool, read_email_tool], middleware=[ HumanInTheLoopMiddleware( interrupt_on={ # Require approval or rejection for sending emails "send_email_tool": { "allowed_decisions": ["approve", "reject"], }, # Auto-approve reading emails "read_email_tool": False, } ), ], context_schema=ContextSchema, ) ``` ``` Argument of type "list[HumanInTheLoopMiddleware]" cannot be assigned to parameter "middleware" of type "Sequence[AgentMiddleware[StateT_co@create_agent, ContextT@create_agent]]" in function "create_agent" "HumanInTheLoopMiddleware" is not assignable to "AgentMiddleware[AgentState[Unknown], ContextSchema \| None]" Type parameter "ContextT@AgentMiddleware" is invariant, but "None" is not the same as "ContextSchema \| None" ```	2025-12-01 08:46:38 -05:00
Mason Daugherty	12df938ace	docs(core): update docstrings in `RunnableConfig`, `dereference_refs` (#34131 )	2025-11-28 03:55:37 -05:00
Mason Daugherty	fe7c000fc1	fix(model-profiles): update docs link (#34127 )	2025-11-28 00:19:36 +00:00
Mason Daugherty	0a6d01e61d	docs(anthropic,core,langchain): updates (#34106 )	2025-11-25 17:58:09 -05:00
Mason Daugherty	c6f8b0875a	style(core,langchain,qdrant): fix some docstrings for refs (#34105 )	2025-11-25 13:58:53 -05:00
Bagatur	c375732396	fix(core): handle missing `StructuredPrompt` schema (#34096 ) - Description: if you dont pass in schema= or schema_= to StrucutredPrompt(...) today you get a confusing KeyError. Raise a more readable ValueError instead. - Issue: na - Dependencies: na	2025-11-24 18:39:29 -05:00
ccurme	9c21f83e82	release(langchain): 1.1 (#34090 )	2025-11-24 10:27:13 -05:00
ccurme	880652b713	release: (integration packages): 1.1 (#34088 )	2025-11-24 10:00:06 -05:00
Sydney Runkle	4ab94579ad	feat(langchain): support `SystemMessage` in `create_agent`'s `system_prompt` (#34055 ) * `create_agent`'s `system_prompt` allows `str \| SystemMessage` * added `system_message: SystemMessage` on `ModelRequest` * `ModelRequest.system_prompt` is a function of `system_message.text`, now deprecated * disallow setting `system_prompt` and `system_message` * `ModelRequest.system_prompt` can still be set (w/ custom setattr) for custom backwards compat, but the updates just get propogated to the `ModelRequest.system_message` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-11-24 14:53:57 +00:00
ccurme	eb0545a173	release: (integration packages) 1.1 (#34087 )	2025-11-24 09:13:01 -05:00
ccurme	a2e389de9f	release(fireworks): 1.1 (#34086 )	2025-11-24 09:05:43 -05:00
Alex Kondratev	01573c1375	fix(core): `ensure_ascii=False` in `PydanticOutputParser` exception formatting (#34006 ) - Description: When formatting an error, `PydanticOutputParser` dumps json with default `ensure_ascii=True` - Issue: Fixes #34005 - Dependencies: None - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. We will not consider a PR unless these three are passing in CI. See [contribution guidelines](https://docs.langchain.com/oss/python/contributing) for more. Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-11-23 20:22:50 -05:00
Abhinav	2ba3ce81a6	fix(openai): make GPT-5 temperature validation case-insensitive (#34012 ) Fixed a bug where GPT-5 temperature validation was case-sensitive, causing issues when users specified Azure deployment names or model names in uppercase (e.g., `"GPT-5-2025-01-01"`, `"GPT-5-NANO"`). The validation now correctly handles model names regardless of case. Changes made: - Updated `validate_temperature()` method in `BaseChatOpenAI` to perform case-insensitive model name comparisons - Updated `_get_encoding_model()` method to use case-insensitive checks for tiktoken encoder selection - Added comprehensive unit tests to verify case-insensitive behavior with various case combinations Issue: Fixes #34003 Dependencies: None Test Coverage: - All existing tests pass - New test `test_gpt_5_temperature_case_insensitive` covers uppercase, lowercase, and mixed-case model names - Tests verify both non-chat GPT-5 models (temperature removed) and chat models (temperature preserved) - Lint and format checks pass (`make lint`, `make format`) --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-23 20:17:03 -05:00
Mason Daugherty	2a863727f9	fix(infra,core): nits (#34079 ) * Add missing `nits` to allowed PR linting scopes * Ensure `MAJOR.MINOR.PATCH` consistency in admonitions * Ensure valid spacing in admonitions	2025-11-23 20:00:07 -05:00
dumko2001	30e2260e26	fix(core): Decouple provider prefix from model name in init_chat_mode… (#34046 ) :…l logic Addresses Issue #34007. Fixes a bug where aliases like 'mistral:' were inferred correctly as a provider but the prefix was not stripped from the model name, causing API 400 errors. Added logic to strip prefix when inference succeeds. Description This PR resolves a logic error in `init_chat_model` where inferred provider aliases (specifically `mistral:`) were correctly identified but not stripped from the model string. The Problem When passing a string like `mistral:ministral-8b-latest`, the factory logic correctly inferred the provider as `mistralai` but failed to enter the string-splitting block because the alias `mistral` was not in the hardcoded `_SUPPORTED_PROVIDERS` list. This caused the raw string `mistral:ministral-8b-latest` to be passed to the `ChatMistralAI` constructor, resulting in a 400 API error. The Fix I updated `_parse_model` in `libs/langchain/langchain/chat_models/base.py`. The logic now attempts to infer the provider from the prefix before determining whether to split the string. This ensures that valid aliases trigger the stripping logic, passing only the clean `model_name` to the integration class. Issue Fixes #34007 Dependencies None. Verification Validated locally with a reproduction script: - Input: `mistral:ministral-8b-latest` - Result: Successfully instantiates `ChatMistralAI` with `model="ministral-8b-latest"`. - Validated that standard inputs (e.g., `gpt-4o`) remain unaffected. Co-authored-by: ioop <ioop@Sidharths-MacBook-Air.local>	2025-11-23 19:52:24 -05:00
Mason Daugherty	cbaea351b2	style(core,langchain-classic,openai): fix griffe warnings (#34074 )	2025-11-23 01:06:46 -05:00
ccurme	f070217c3b	release(standard-tests): 1.0.2 (#34071 ) Resolves https://github.com/langchain-ai/langchain/issues/34069	2025-11-22 18:35:09 -05:00
ccurme	0915682c12	chore(fireworks): update tested models (#34070 )	2025-11-22 16:50:49 -05:00
Sydney Runkle	68ab9a1e56	fix: don't reorder tool calls in HITL middleware (#34023 )	2025-11-22 05:10:32 -05:00
Mason Daugherty	47b79c30c0	chore(docs): fix a few refs syntax errors (#34044 ) missing whitespace for some admonitions	2025-11-22 00:58:21 -05:00
ccurme	5899f980aa	release(model-profiles): 0.0.5 (#34064 )	2025-11-21 16:12:00 -05:00
ccurme	b0bf4afe81	release(core): 1.1.0 (#34063 )	2025-11-21 15:57:25 -05:00
ccurme	33e5d01f7c	feat(model-profiles): distribute data across packages (#34024 )	2025-11-21 15:47:05 -05:00
Sydney Runkle	ee3373afc2	chore: add more robust test for runtime injection w/ explicit `args_schema` (#34051 )	2025-11-20 16:51:37 +00:00
Sydney Runkle	b296f103a9	feat: `ModelRetryMiddleware` (#34027 ) Closes https://github.com/langchain-ai/langchain/issues/33983 * Adds `ModelRetryMiddleware` modeled after `ToolRetryMiddleware` * Uses `on_failure` modes of `error` and `continue` to match the `exit_behavior` modes of model + tool call limit middleware * In a backwards compatible manner, aligns the API of `ToolRetryMiddleware`'s `on_failure` with the above * Centralize common "retry" utils across these middlewares	2025-11-20 11:42:33 -05:00
Eugene Yurtsev	525d5c0169	release(core): 1.0.7 (#34036 ) Release core 1.0.7	2025-11-19 21:17:31 +00:00
Eugene Yurtsev	c4b6ba254e	fix(core): fix validation for input variables in f-string templates, restrict functionality supported by jinja2, mustache templates (#34035 ) * Fix validation for input variables in f-string templates * Restrict functionality of features supported by jinja2 and mustache templates	2025-11-19 16:09:46 -05:00
Sydney Runkle	b7d1831f9d	fix: deprecate `setattr` on `ModelCallRequest` (#34022 ) * one alternative considered was setting `frozen=True` on the dataclass, but this is breaking, so a deprecation is a nicer approach	2025-11-19 11:08:55 -05:00
ccurme	328ba36601	chore(openai): skip Azure text completions tests (#34021 )	2025-11-19 09:29:12 -05:00
Sydney Runkle	d47d41cbd3	release: langchain-core 1.0.6 (#34018 )	2025-11-19 08:16:34 -05:00
William FH	32bbe99efc	chore: Support tool runtime injection when custom args schema is prov… (#33999 ) Support injection of injected args (like `InjectedToolCallId`, `ToolRuntime`) when an `args_schema` is specified that doesn't contain said args. This allows for pydantic validation of other args while retaining the ability to inject langchain specific arguments. fixes https://github.com/langchain-ai/langchain/issues/33646 fixes https://github.com/langchain-ai/langchain/issues/31688 Taking a deep dive here reminded me that we definitely need to revisit our internal tooling logic, but I don't think we should do that in this PR. --------- Co-authored-by: Sydney Runkle <54324534+sydney-runkle@users.noreply.github.com> Co-authored-by: Sydney Runkle <sydneymarierunkle@gmail.com>	2025-11-18 17:09:59 +00:00
ccurme	990e346c46	release(anthropic): 1.1 (#33997 )	2025-11-17 16:24:29 -05:00
ccurme	9b7792631d	feat(anthropic): support native structured output feature and strict tool calling (#33980 )	2025-11-17 16:14:20 -05:00
CKLogic	558a8fe25b	feat(core): add proxy support for mermaid png rendering (#32400 ) ### Description This PR adds support for configuring HTTP/HTTPS proxies when rendering Mermaid diagrams as PNG images using the remote Mermaid.INK API. This enhancement allows users in restricted network environments to access the API via a proxy, making the remote rendering feature more robust and accessible. The changes include: - Added optional `proxies` parameter to `draw_mermaid_png` and `_render_mermaid_using_api` functions - Updated `Graph.draw_mermaid_png` method to support and pass through proxy configuration - Enhanced docstrings with usage examples for the new parameter - Maintained full backward compatibility with existing code ### Usage Example ```python proxies = { "http": "http://127.0.0.1:7890", "https": "http://127.0.0.1:7890" } display(Image(chain.get_graph().draw_mermaid_png(proxies=proxies))) ``` ### Dependencies No new dependencies required. Uses existing `requests` library for HTTP requests. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-17 12:45:17 -06:00
Mason Daugherty	52b1516d44	style(langchain): fix some middleware ref syntax (#33988 )	2025-11-16 00:33:17 -05:00
Mason Daugherty	8a3bb73c05	release(openai): 1.0.3 (#33981 ) - Respect 300k token limit for embeddings API requests #33668 - fix create_agent / response_format for Responses API #33939 - fix response.incomplete event is not handled when using stream_mode=['messages'] #33871	2025-11-14 19:18:50 -05:00
Mason Daugherty	099c042395	refactor(openai): embedding utils and calculations (#33982 ) Now returns (`_iter`, `tokens`, `indices`, token_counts`). The `token_counts` are calculated directly during tokenization, which is more accurate and efficient than splitting strings later.	2025-11-14 19:18:37 -05:00
Kaparthy Reddy	2d4f00a451	fix(openai): Respect 300k token limit for embeddings API requests (#33668 ) ## Description Fixes #31227 - Resolves the issue where `OpenAIEmbeddings` exceeds OpenAI's 300,000 token per request limit, causing 400 BadRequest errors. ## Problem When embedding large document sets, LangChain would send batches containing more than 300,000 tokens in a single API request, causing this error: ``` openai.BadRequestError: Error code: 400 - {'error': {'message': 'Requested 673477 tokens, max 300000 tokens per request'}} ``` The issue occurred because: - The code chunks texts by `embedding_ctx_length` (8191 tokens per chunk) - Then batches chunks by `chunk_size` (default 1000 chunks per request) - But didn't check: Total tokens per batch against OpenAI's 300k limit - Result: `1000 chunks × 8191 tokens = 8,191,000 tokens` → Exceeds limit! ## Solution This PR implements dynamic batching that respects the 300k token limit: 1. Added constant: `MAX_TOKENS_PER_REQUEST = 300000` 2. Track token counts: Calculate actual tokens for each chunk 3. Dynamic batching: Instead of fixed `chunk_size` batches, accumulate chunks until approaching the 300k limit 4. Applied to both sync and async: Fixed both `_get_len_safe_embeddings` and `_aget_len_safe_embeddings` ## Changes - Modified `langchain_openai/embeddings/base.py`: - Added `MAX_TOKENS_PER_REQUEST` constant - Replaced fixed-size batching with token-aware dynamic batching - Applied to both sync (line ~478) and async (line ~527) methods - Added test in `tests/unit_tests/embeddings/test_base.py`: - `test_embeddings_respects_token_limit()` - Verifies large document sets are properly batched ## Testing All existing tests pass (280 passed, 4 xfailed, 1 xpassed). New test verifies: - Large document sets (500 texts × 1000 tokens = 500k tokens) are split into multiple API calls - Each API call respects the 300k token limit ## Usage After this fix, users can embed large document sets without errors: ```python from langchain_openai import OpenAIEmbeddings from langchain_chroma import Chroma from langchain_text_splitters import CharacterTextSplitter # This will now work without exceeding token limits embeddings = OpenAIEmbeddings() documents = CharacterTextSplitter().split_documents(large_documents) Chroma.from_documents(documents, embeddings) ``` Resolves #31227 --------- Co-authored-by: Kaparthy Reddy <kaparthyreddy@Kaparthys-MacBook-Air.local> Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-14 18:12:07 -05:00

1 2 3 4 5 ...

8083 Commits