langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-03-18 02:53:16 +00:00

Author	SHA1	Message	Date
Eugene Yurtsev	b0add1f989	fix(langchain): backport patch ReDoS vulnerability in MRKL and ReAct action regex (CVE-2024-58340) (#35603 ) Backport of #35598 to the v0.3 branch. Patches the ReDoS vulnerability in MRKL and ReAct agent action regex patterns (CVE-2024-58340). Created with [Deep Agents CLI](https://docs.langchain.com/oss/python/deepagents/cli/overview).	2026-03-06 16:22:23 -05:00
Angus Jelinek	4dab5fafc0	feat(core,langchain,text-splitters): (v0.3) use uuid7 for run ids (#34732 ) Backports #34172 --------- Co-authored-by: William FH <13333726+hinthornw@users.noreply.github.com> Co-authored-by: Sydney Runkle <54324534+sydney-runkle@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2026-01-12 20:09:58 -05:00
ccurme	3a465d635b	feat(openai): enable stream_usage when using default base URL and client (#33296 )	2025-10-06 10:22:36 -04:00
ccurme	d7cce2f469	feat(langchain_v1): update messages namespace (#33207 )	2025-10-02 10:35:00 -04:00
Sydney Runkle	a336afaecd	feat(langchain): use decorators for jumps instead (#33179 ) The old `before_model_jump_to` classvar approach was quite clunky, this is nicer imo and easier to document. Also moving from `jump_to` to `can_jump_to` which is more idiomatic. Before: ```py class MyMiddleware(AgentMiddleware): before_model_jump_to: ClassVar[list[JumpTo]] = ["end"] def before_model(state, runtime) -> dict[str, Any]: return {"jump_to": "end"} ``` After ```py class MyMiddleware(AgentMiddleware): @hook_config(can_jump_to=["end"]) def before_model(state, runtime) -> dict[str, Any]: return {"jump_to": "end"} ```	2025-10-01 16:49:27 -07:00
Sydney Runkle	a10e880c00	feat(langchain_v1): add `async` support for `create_agent` (#33175 ) This makes branching much more simple internally and helps greatly w/ type safety for users. It just allows for one signature on hooks instead of multiple. Opened after https://github.com/langchain-ai/langchain/pull/33164 ballooned more than expected, w/ branching for: * sync vs async * runtime vs no runtime (this is self imposed) This also removes support for nodes w/o `runtime` in the signature. We can always go back and add support for nodes w/o `runtime`. I think @christian-bromann's idea to re-export `runtime` from langchain's agents might make sense due to the abundance of imports here. Check out the value of the change based on this diff: https://github.com/langchain-ai/langchain/pull/33176	2025-10-01 19:15:39 +00:00
Eugene Yurtsev	7b5e839be3	chore(langchain_v1): use list[str] for modifyModelRequest (#33166 ) Update model request to return tools by name. This will decrease the odds of misusing the API. We'll need to extend the type for built-in tools later.	2025-10-01 14:46:19 -04:00
Mohammad Mohtashim	34f8031bd9	feat(langchain): Using Structured Response as Key in Output Schema for Middleware Agent (#33159 ) - Description: Changing the key from `response` to `structured_response` for middleware agent to keep it sync with agent without middleware. This a breaking change. - Issue: #33154	2025-10-01 03:24:59 +00:00
Eugene Yurtsev	9c97597175	chore(langchain_v1): expose middleware decorators and selected messages (#33163 ) * Make it easy to improve the middleware shortcuts * Export the messages that we're confident we'll expose	2025-09-30 14:14:57 -04:00
Sydney Runkle	eed0f6c289	feat(langchain): todo middleware (#33152 ) Porting the [planning middleware](`39c0138d0f/src/deepagents/middleware.py (L21)`) over from deepagents. Also adding the ability to configure: * System prompt * Tool description ```py from langchain.agents.middleware.planning import PlanningMiddleware from langchain.agents import create_agent agent = create_agent("openai:gpt-4o", middleware=[PlanningMiddleware()]) result = await agent.invoke({"messages": [HumanMessage("Help me refactor my codebase")]}) print(result["todos"]) # Array of todo items with status tracking ```	2025-09-30 02:23:26 +00:00
Mason Daugherty	3325196be1	fix(langchain): handle `gpt-5` model name in `init_chat_model` (#33148 ) expand to match any `gpt-*` model to openai	2025-09-29 16:16:17 -04:00
Mason Daugherty	f402fdcea3	fix(langchain): add `context_management` to Anthropic chat model init (#33150 )	2025-09-29 16:13:47 -04:00
nhuang-lc	c456c8ae51	fix(langchain): fix response action for HITL (#33131 ) Multiple improvements to HITL flow: * On a `response` type resume, we should still append the tool call to the last AIMessage (otherwise we have a ToolResult without a corresponding ToolCall) * When all interrupts have `response` types (so there's no pending tool calls), we should jump back to the first node (instead of end) as we enforced in the previous `post_model_hook_router` * Added comments to `model_to_tools` router so clarify all of the potential exit conditions Additionally: * Lockfile update to use latest LG alpha release * Added test for `jump_to` behaving ephemerally, this was fixed in LG but surfaced as a bug w/ `jump_to`. * Bump version to v1.0.0a10 to prep for alpha release --------- Co-authored-by: Sydney Runkle <sydneymarierunkle@gmail.com> Co-authored-by: Sydney Runkle <54324534+sydney-runkle@users.noreply.github.com>	2025-09-29 13:08:18 +00:00
Eugene Yurtsev	54ea62050b	chore(langchain_v1): move tool node to tools namespace (#33132 ) * Move ToolNode to tools namespace * Expose injected variable as well in tools namespace * Update doc-strings throughout	2025-09-26 15:23:57 -04:00
Mason Daugherty	986302322f	docs: more standardization (#33124 )	2025-09-25 20:46:20 -04:00
Mason Daugherty	5bea28393d	docs: standardize `.. code-block` directive usage (#33122 ) and fix typos	2025-09-25 16:49:56 -04:00
Mason Daugherty	c3fed20940	docs: correct ported over directives (#33121 ) Match rest of repo	2025-09-25 15:54:54 -04:00
Christophe Bornet	eaf8dce7c2	chore: bump ruff version to 0.13 (#33043 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-25 12:27:39 -04:00
Mason Daugherty	f82de1a8d7	chore: bump locks (#33114 )	2025-09-25 01:46:01 -04:00
Sydney Runkle	f015526e42	release(langchain): v1.0.0a9 (#33098 )	2025-09-24 21:02:53 +00:00
Sydney Runkle	57d931532f	fix(langchain): extra arg for anthropic caching, `__end__` -> `end` for `jump_to` (#33097 ) Also updating `jump_to` to use `end` instead of `__end__`	2025-09-24 17:00:40 -04:00
Mason Daugherty	33f06875cb	fix(langchain_v1): version equality check (#33095 )	2025-09-24 16:27:55 -04:00
Sydney Runkle	dd81e1c3fb	release(langchain): 1.0.0a8 (#33090 )	2025-09-24 15:31:29 -04:00
Sydney Runkle	135a5b97e6	feat(langchain): improvements to anthropic prompt caching (#33058 ) Adding an `unsupported_model_behavior` arg that can be `'ignore'`, `'warn'`, or `'raise'`. Defaults to `'warn'`.	2025-09-24 15:28:49 -04:00
Mason Daugherty	b92b394804	style: repo linting pass (#33089 ) enable docstring-code-format	2025-09-24 15:25:55 -04:00
Sydney Runkle	083bb3cdd7	fix(langchain): need to inject all state for tools registered by middleware (#33087 ) Type hints matter for conditional edges!	2025-09-24 15:25:51 -04:00
Sydney Runkle	4f8a76b571	chore(langchain): renaming for HITL (#33067 )	2025-09-24 07:19:44 -04:00
Sydney Runkle	b5720ff17a	chore(langchain): simplifying HITL condition (#33065 ) Simplifying condition	2025-09-23 21:24:14 +00:00
nhuang-lc	48b05224ad	fix(langchain_v1): only interrupt if at least one ToolConfig value is True (#33064 ) Description: Right now, we interrupt even if the provided ToolConfig has all false values. We should ignore ToolConfigs which do not have at least one value marked as true (just as we would if tool_name: False was passed into the dict).	2025-09-23 17:20:34 -04:00
Sydney Runkle	89079ad411	feat(langchain): new decorator pattern for dynamically generated middleware (#33053 ) # Main Changes 1. Adding decorator utilities for dynamically defining middleware with single hook functions (see an example below for dynamic system prompt) 2. Adding better conditional edge drawing with jump configuration attached to middleware. Can be registered w/ the decorator new decorator! ## Decorator Utilities ```py from langchain.agents.middleware_agent import create_agent, AgentState, ModelRequest from langchain.agents.middleware.types import modify_model_request from langchain_core.messages import HumanMessage from langgraph.checkpoint.memory import InMemorySaver @modify_model_request def modify_system_prompt(request: ModelRequest, state: AgentState) -> ModelRequest: request.system_prompt = ( "You are a helpful assistant." f"Please record the number of previous messages in your response: {len(state['messages'])}" ) return request agent = create_agent( model="openai:gpt-4o-mini", middleware=[modify_system_prompt] ).compile(checkpointer=InMemorySaver()) ``` ## Visualization and Routing improvements We now require that middlewares define the valid jumps for each hook. If using the new decorator syntax, this can be done with: ```py @before_model(jump_to=["__end__"]) @after_model(jump_to=["tools", "__end__"]) ``` If using the subclassing syntax, you can use these two class vars: ```py class MyMiddlewareAgentMiddleware): before_model_jump_to = ["__end__"] after_model_jump_to = ["tools", "__end__"] ``` Open for debate if we want to bundle these in a single jump map / config for a middleware. Easy to migrate later if we decide to add more hooks. We will need to really clearly document that these must be explicitly set in order to enable conditional edges. Notice for the below case, `Middleware2` does actually enable jumps. <table> <thead> <tr> <th>Before (broken), adding conditional edges unconditionally</th> <th>After (fixed), adding conditional edges sparingly</th> </tr> </thead> <tbody> <tr> <td> <img width="619" height="508" alt="Screenshot 2025-09-23 at 10 23 23 AM" src="https://github.com/user-attachments/assets/bba2d098-a839-4335-8e8c-b50dd8090959" /> </td> <td> <img width="469" height="490" alt="Screenshot 2025-09-23 at 10 23 13 AM" src="https://github.com/user-attachments/assets/717abf0b-fc73-4d5f-9313-b81247d8fe26" /> </td> </tr> </tbody> </table> <details> <summary>Snippet for the above</summary> ```py from typing import Any from langchain.agents.tool_node import InjectedState from langgraph.runtime import Runtime from langchain.agents.middleware.types import AgentMiddleware, AgentState from langchain.agents.middleware_agent import create_agent from langchain_core.tools import tool from typing import Annotated from langchain_core.messages import HumanMessage from typing_extensions import NotRequired @tool def simple_tool(input: str) -> str: """A simple tool.""" return "successful tool call" class Middleware1(AgentMiddleware): """Custom middleware that adds a simple tool.""" tools = [simple_tool] def before_model(self, state: AgentState, runtime: Runtime) -> None: return None def after_model(self, state: AgentState, runtime: Runtime) -> None: return None class Middleware2(AgentMiddleware): before_model_jump_to = ["tools", "__end__"] def before_model(self, state: AgentState, runtime: Runtime) -> None: return None def after_model(self, state: AgentState, runtime: Runtime) -> None: return None class Middleware3(AgentMiddleware): def before_model(self, state: AgentState, runtime: Runtime) -> None: return None def after_model(self, state: AgentState, runtime: Runtime) -> None: return None builder = create_agent( model="openai:gpt-4o-mini", middleware=[Middleware1(), Middleware2(), Middleware3()], system_prompt="You are a helpful assistant.", ) agent = builder.compile() ``` </details> ## More Examples ### Guardrails `after_model` <img width="379" height="335" alt="Screenshot 2025-09-23 at 10 40 09 AM" src="https://github.com/user-attachments/assets/45bac7dd-398e-45d1-ae58-6ecfa27dfc87" /> <details> <summary>Code</summary> ```py from langchain.agents.middleware_agent import create_agent, AgentState, ModelRequest from langchain.agents.middleware.types import after_model from langchain_core.messages import HumanMessage, AIMessage from langgraph.checkpoint.memory import InMemorySaver from typing import cast, Any @after_model(jump_to=["model", "__end__"]) def after_model_hook(state: AgentState) -> dict[str, Any]: """Check the last AI message for safety violations.""" last_message_content = cast(AIMessage, state["messages"][-1]).content.lower() print(last_message_content) unsafe_keywords = ["pineapple"] if any(keyword in last_message_content for keyword in unsafe_keywords): # Jump back to model to regenerate response return {"jump_to": "model", "messages": [HumanMessage("Please regenerate your response, and don't talk about pineapples. You can talk about apples instead.")]} return {"jump_to": "__end__"} # Create agent with guardrails middleware agent = create_agent( model="openai:gpt-4o-mini", middleware=[after_model_hook], system_prompt="Keep your responses to one sentence please!" ).compile() # Test with potentially unsafe input result = agent.invoke( {"messages": [HumanMessage("Tell me something about pineapples")]}, ) for msg in result["messages"]: print(msg.pretty_print()) """ ================================ Human Message ================================= Tell me something about pineapples None ================================== Ai Message ================================== Pineapples are tropical fruits known for their sweet, tangy flavor and distinctive spiky exterior. None ================================ Human Message ================================= Please regenerate your response, and don't talk about pineapples. You can talk about apples instead. None ================================== Ai Message ================================== Apples are popular fruits that come in various varieties, known for their crisp texture and sweetness, and are often used in cooking and baking. None """ ``` </details>	2025-09-23 13:25:55 -04:00
Sydney Runkle	c3be45bf14	fix(langchain): HITL bug causing dupe interrupt (#33052 ) Need to find last AI msg (not first). Getting too creative w/ generators.	2025-09-22 20:09:12 -04:00
Mason Daugherty	781db9d892	chore: update `pyproject.toml` files, remove codespell (#33028 ) - Removes Codespell from deps, docs, and `Makefile`s - Python version requirements in all `pyproject.toml` files now use the `~=` (compatible release) specifier - All dependency groups and main dependencies now use explicit lower and upper bounds, reducing potential for breaking changes	2025-09-20 22:09:33 -04:00
Sydney Runkle	f2b0afd0b7	release(langchain): 1.0.0a6 (#33024 ) w/ improvements to HITL, state schema merging, dynamic system prompt	2025-09-19 18:47:41 +00:00
Sydney Runkle	c3654202a3	fix(langchain): use state schema as input schema to middleware nodes (#33023 ) We want state schema as the input schema to middleware nodes because the conditional edges after these nodes need access to the full state. Also, we just generally want all state passed to middleware nodes, so we should be specifying this explicitly. If we don't, the state annotations used by users in their node signatures are used (so they might be missing fields).	2025-09-19 18:43:33 +00:00
Sydney Runkle	4d118777bc	feat(langchain): dynamic system prompt middleware (#33006 ) # Changes ## Adds support for `DynamicSystemPromptMiddleware` ```py from langchain.agents.middleware import DynamicSystemPromptMiddleware from langgraph.runtime import Runtime from typing_extensions import TypedDict class Context(TypedDict): user_name: str def system_prompt(state: AgentState, runtime: Runtime[Context]) -> str: user_name = runtime.context.get("user_name", "n/a") return f"You are a helpful assistant. Always address the user by their name: {user_name}" middleware = DynamicSystemPromptMiddleware(system_prompt) ``` ## Adds support for `runtime` in middleware hooks ```py class AgentMiddleware(Generic[StateT, ContextT]): def modify_model_request( self, request: ModelRequest, state: StateT, runtime: Runtime[ContextT], # Optional runtime parameter ) -> ModelRequest: # upgrade model if runtime.context.subscription is `top-tier` or whatever ``` ## Adds support for omitting state attributes from input / output schemas ```py from typing import Annotated, NotRequired from langchain.agents.middleware.types import PrivateStateAttr, OmitFromInput, OmitFromOutput class CustomState(AgentState): # Private field - not in input or output schemas internal_counter: NotRequired[Annotated[int, PrivateStateAttr]] # Input-only field - not in output schema user_input: NotRequired[Annotated[str, OmitFromOutput]] # Output-only field - not in input schema computed_result: NotRequired[Annotated[str, OmitFromInput]] ``` ## Additionally * Removes filtering of state before passing into middleware hooks Typing is not foolproof here, still need to figure out some of the generics stuff w/ state and context schema extensions for middleware. TODO: * More docs for middleware, should hold off on this until other prios like MCP and deepagents are met --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-09-18 16:07:16 -04:00
Sydney Runkle	d5ba5d3511	feat(langchain): improved HITL patterns (#32996 ) # Main changes / new features ## Better support for parallel tool calls 1. Support for multiple tool calls requiring human input 2. Support for combination of tool calls requiring human input + those that are auto-approved 3. Support structured output w/ tool calls requiring human input 4. Support structured output w/ standard tool calls ## Shortcut for allowed actions Adds a shortcut where tool config can be specified as a `bool`, meaning "all actions allowed" ```py HumanInTheLoopMiddleware(tool_configs={"expensive_tool": True}) ``` ## A few design decisions here * We only raise one interrupt w/ all `HumanInterrupt`s, currently we won't be able to execute all tools until all of these are resolved. This isn't super blocking bc we can't re-invoke the model until all tools have finished execution. That being said, if you have a long running auto-approved tool, this could slow things down. ## TODOs * Ideally, we would rename `accept` -> `approve` * Ideally, we would rename `respond` -> `reject` * Docs update (@sydney-runkle to own) * In another PR I'd like to refactor testing to have one file for each prebuilt middleware :) Fast follow to https://github.com/langchain-ai/langchain/pull/32962 which was deemed as too breaking	2025-09-17 16:53:01 -04:00
Mason Daugherty	8180020b93	chore: restore commented out optional deps (#32971 ) langchain & langchain_v1	2025-09-16 10:10:49 -04:00
Christophe Bornet	cbaf97ada4	chore: bump mypy version to 1.18 (#32914 )	2025-09-12 09:19:23 -04:00
Sydney Runkle	dc2da95ac0	release(langchain): v1.0.0a5 (#32917 )	2025-09-12 08:36:44 -04:00
Sydney Runkle	9e78ff19ab	fix(langchain): use messages from model request (#32908 ) Oversight when moving back to basic function call for `modify_model_request` rather than implementation as its own node. Basic test right now failing on main, passing on this branch Revealed a gap in testing. Will write up a more robust test suite for basic middleware features.	2025-09-12 08:18:02 -04:00
Caspar Broekhuizen	15d558ff16	fix(core): resolve mermaid node id collisions when special chars are used (#32857 ) ### Description * Replace the Mermaid graph node label escaping logic (`_escape_node_label`) with `_to_safe_id`, which converts a string into a unique, Mermaid-compatible node id. Ensures nodes with special characters always render correctly. Before * Invalid characters (e.g. `开`) replaced with `_`. Causes collisions between nodes with names that are the same length and contain all non-safe characters: ```python _escape_node_label("开") # '_' _escape_node_label("始") # '_' same as above, but different character passed in. not a unique mapping. ``` After ```python _to_safe_id("开") # \5f00 _to_safe_id("始") # \59cb unique! ``` ### Tests * Rename `test_graph_mermaid_escape_node_label()` to `test_graph_mermaid_to_safe_id()` and update function logic to use `_to_safe_id` * Add `test_graph_mermaid_special_chars()` ### Issue Fixes langchain-ai/langgraph#6036	2025-09-11 14:15:17 -07:00
Mason Daugherty	7a158c7f1c	revert: "chore: remove ruff target-version" (#32895 ) Reverts langchain-ai/langchain#32880 Not needed at the moment, will do when finishing v1	2025-09-10 20:56:48 -04:00
Christophe Bornet	b274416441	chore: remove ruff target-version (#32880 ) This is not needed anymore since `requires-python` was added when moving to `uv`.	2025-09-10 11:12:30 -04:00
Mason Daugherty	c124e67325	chore(docs): update package `README`s (#32869 ) - Fix badges - Focus on agents - Cut down fluff	2025-09-09 14:50:32 +00:00
Christophe Bornet	e36e25fe2f	feat(langchain): support PEP604 ( `\|` union) in tool node error handlers (#32861 ) This allows to use PEP604 syntax for `ToolNode` error handlers ```python def error_handler(e: ValueError \| ToolException) -> str: return "error" ToolNode(my_tool, handle_tool_errors=error_handler).invoke(...) ``` Without this change, this fails with `AttributeError: 'types.UnionType' object has no attribute '__mro__'`	2025-09-09 10:11:12 -04:00
Christophe Bornet	017348b27c	chore(langchain): add ruff rule E501 in `langchain_v1` (#32812 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-08 17:28:14 -04:00
Christophe Bornet	fe6c415c9f	chore(langchain): add ruff rule UP007 in `langchain_v1` (#32811 ) Done by autofix	2025-09-08 17:26:00 -04:00
Christophe Bornet	54c2419a4e	chore(langchain): enable ruff docstring-code-format in langchain_v1 (#32855 )	2025-09-08 16:51:18 -04:00
Christophe Bornet	5840dad40b	chore(core): enable ruff docstring-code-format (#32834 ) See https://docs.astral.sh/ruff/settings/#format_docstring-code-format --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-08 15:13:50 +00:00
Sydney Runkle	6e2f46d04c	feat(langchain): middleware support in `create_agent` (#32828 ) ## Overview Adding new `AgentMiddleware` primitive that supports `before_model`, `after_model`, and `prepare_model_request` hooks. This is very exciting! It makes our `create_agent` prebuilt much more extensible + capable. Still in alpha and subject to change. This is different than the initial [implementation](https://github.com/langchain-ai/langgraph/tree/nc/25aug/agent) in that it: * Fills in gaps w/ missing features, for ex -- new structured output, optionality of tools + system prompt, sync and async model requests, provider builtin tools * Exposes private state extensions for middleware, enabling things like model call tracking, etc * Middleware can register tools * Uses a `TypedDict` for `AgentState` -- dataclass subclassing is tricky w/ required values + required decorators * Addition of `model_settings` to `ModelRequest` so that we can pass through things to bind (like cache kwargs for anthropic middleware) ## TODOs ### top prio - [x] add middleware support to existing agent - [x] top prio middlewares - [x] summarization node - [x] HITL - [x] prompt caching other ones - [x] model call limits - [x] tool calling limits - [ ] usage (requires output state) ### secondary prio - [x] improve typing for state updates from middleware (not working right now w/ simple `AgentUpdate` and `AgentJump`, at least in Python) - [ ] add support for public state (input / output modifications via pregel channel mods) -- to be tackled in another PR - [x] testing! ### docs See https://github.com/langchain-ai/docs/pull/390 - [x] high level docs about middleware - [x] summarization node - [x] HITL - [x] prompt caching ## open questions Lots of open questions right now, many of them inlined as comments for the short term, will catalog some more significant ones here. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2025-09-08 01:10:57 +00:00

1 2

78 Commits