langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-03-18 19:18:48 +00:00

Author	SHA1	Message	Date
Eugene Yurtsev	edeab12e95	qxq	2025-09-25 16:20:47 -04:00
Christophe Bornet	eaf8dce7c2	chore: bump ruff version to 0.13 (#33043 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-25 12:27:39 -04:00
Mason Daugherty	f82de1a8d7	chore: bump locks (#33114 )	2025-09-25 01:46:01 -04:00
Mason Daugherty	e3efd1e891	test(text-splitters): capture beta warnings (#33113 )	2025-09-25 01:30:20 -04:00
Mason Daugherty	d6769cf032	test(text-splitters): resolve pytest marker warning (#33112 ) #33111	2025-09-25 01:29:42 -04:00
Mason Daugherty	7ab2e0dd3b	test(core): resolve pytest marker warning (#33111 ) Remove redundant/outdated `@pytest.mark.requires("jinja2")` decorator Pytest marks (like `@pytest.mark.requires(...)`) applied directly to fixtures have no effect and are deprecated.	2025-09-25 01:08:54 -04:00
Mason Daugherty	81319ad3f0	test(core): resolve `pydantic_v1` deprecation warning (#33110 ) Excluded pydantic_v1 module from import testing Acceptable since this pydantic_v1 is explicitly deprecated. Testing its importability at this stage serves little purpose since users should migrate away from it.	2025-09-25 01:08:03 -04:00
Mason Daugherty	e3f3c13b75	refactor(core): use `aadd_documents` in vectorstores unit tests (#33109 ) Don't use the deprecated `upsert()` and `aupsert()` Instead use the recommended alternatives	2025-09-25 00:57:08 -04:00
Mason Daugherty	c30844fce4	fix(core): use version agnostic `get_fields` (#33108 ) Resolves a warning	2025-09-25 00:54:29 -04:00
Mason Daugherty	c9eb3bdb2d	test(core): use secure hash algorithm in indexing test to eliminate SHA-1 warning (#33107 ) Finish work from #33101	2025-09-25 00:49:11 -04:00
Mason Daugherty	e97baeb9a6	test(core): suppress `pydantic_v1` deprecation warnings during import tests (#33106 ) We intentionally import these. Hide warnings to reduce testing noise.	2025-09-25 00:37:40 -04:00
Mason Daugherty	3a6046b157	test(core): don't use deprecated `input_variables` param in `from_file` (#33105 ) finish #33104	2025-09-25 04:29:31 +00:00
Mason Daugherty	8fdc619f75	refactor(core): don't use deprecated `input_variables` param in `from_file` (#33104 ) Missed awhile back; causes warnings during tests	2025-09-25 00:14:17 -04:00
Ali Ismail	729bfe8369	test(core): enhance stringify_value test coverage for nested structures (#33099 ) ## Summary Adds test coverage for the `stringify_value` utility function to handle complex nested data structures that weren't previously tested. ## Changes - Added `test_stringify_value_nested_structures()` to `test_strings.py` - Tests nested dictionaries within lists - Tests mixed-type lists with various data types - Verifies proper stringification of complex nested structures ## Why This Matters - Fills a gap in test coverage for edge cases - Ensures `stringify_value` handles complex data structures correctly - Improves confidence in string utility functions used throughout the codebase - Low risk addition that strengthens existing test suite ## Testing ```bash uv run --group test pytest libs/core/tests/unit_tests/utils/test_strings.py::test_stringify_value_nested_structures -v ``` This test addition follows the project's testing patterns and adds meaningful coverage without introducing any breaking changes. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-25 00:04:47 -04:00
Mason Daugherty	9b624a79b2	test(core): suppress deprecation warnings in `PipelinePromptTemplate` (#33102 ) We're intentionally testing this still so as not to regress. Reduce warning noise.	2025-09-25 04:03:27 +00:00
Mason Daugherty	c60c5a91cb	fix(core): use secure hash algorithm in indexing test to eliminate SHA-1 warning (#33101 ) Use SHA-256 (collision-resistant) instead of the default SHA-1. No functional changes to test behavior.	2025-09-25 00:02:11 -04:00
Sydney Runkle	f015526e42	release(langchain): v1.0.0a9 (#33098 )	2025-09-24 21:02:53 +00:00
Sydney Runkle	57d931532f	fix(langchain): extra arg for anthropic caching, `__end__` -> `end` for `jump_to` (#33097 ) Also updating `jump_to` to use `end` instead of `__end__`	2025-09-24 17:00:40 -04:00
Mason Daugherty	33f06875cb	fix(langchain_v1): version equality check (#33095 )	2025-09-24 16:27:55 -04:00
Mason Daugherty	ee4d84de7c	style(core): typo/docs lint pass (#33093 )	2025-09-24 16:11:21 -04:00
Sydney Runkle	dd81e1c3fb	release(langchain): 1.0.0a8 (#33090 )	2025-09-24 15:31:29 -04:00
Sydney Runkle	135a5b97e6	feat(langchain): improvements to anthropic prompt caching (#33058 ) Adding an `unsupported_model_behavior` arg that can be `'ignore'`, `'warn'`, or `'raise'`. Defaults to `'warn'`.	2025-09-24 15:28:49 -04:00
Mason Daugherty	b92b394804	style: repo linting pass (#33089 ) enable docstring-code-format	2025-09-24 15:25:55 -04:00
Sydney Runkle	083bb3cdd7	fix(langchain): need to inject all state for tools registered by middleware (#33087 ) Type hints matter for conditional edges!	2025-09-24 15:25:51 -04:00
Mason Daugherty	2e9291cdd7	fix: lift openai version constraints across packages (#33088 ) re: #33038 and https://github.com/openai/openai-python/issues/2644	2025-09-24 15:25:10 -04:00
Sydney Runkle	4f8a76b571	chore(langchain): renaming for HITL (#33067 )	2025-09-24 07:19:44 -04:00
Mason Daugherty	05ba941230	style(cli): linting pass (#33078 )	2025-09-24 01:24:52 -04:00
Sydney Runkle	b5720ff17a	chore(langchain): simplifying HITL condition (#33065 ) Simplifying condition	2025-09-23 21:24:14 +00:00
nhuang-lc	48b05224ad	fix(langchain_v1): only interrupt if at least one ToolConfig value is True (#33064 ) Description: Right now, we interrupt even if the provided ToolConfig has all false values. We should ignore ToolConfigs which do not have at least one value marked as true (just as we would if tool_name: False was passed into the dict).	2025-09-23 17:20:34 -04:00
Sydney Runkle	89079ad411	feat(langchain): new decorator pattern for dynamically generated middleware (#33053 ) # Main Changes 1. Adding decorator utilities for dynamically defining middleware with single hook functions (see an example below for dynamic system prompt) 2. Adding better conditional edge drawing with jump configuration attached to middleware. Can be registered w/ the decorator new decorator! ## Decorator Utilities ```py from langchain.agents.middleware_agent import create_agent, AgentState, ModelRequest from langchain.agents.middleware.types import modify_model_request from langchain_core.messages import HumanMessage from langgraph.checkpoint.memory import InMemorySaver @modify_model_request def modify_system_prompt(request: ModelRequest, state: AgentState) -> ModelRequest: request.system_prompt = ( "You are a helpful assistant." f"Please record the number of previous messages in your response: {len(state['messages'])}" ) return request agent = create_agent( model="openai:gpt-4o-mini", middleware=[modify_system_prompt] ).compile(checkpointer=InMemorySaver()) ``` ## Visualization and Routing improvements We now require that middlewares define the valid jumps for each hook. If using the new decorator syntax, this can be done with: ```py @before_model(jump_to=["__end__"]) @after_model(jump_to=["tools", "__end__"]) ``` If using the subclassing syntax, you can use these two class vars: ```py class MyMiddlewareAgentMiddleware): before_model_jump_to = ["__end__"] after_model_jump_to = ["tools", "__end__"] ``` Open for debate if we want to bundle these in a single jump map / config for a middleware. Easy to migrate later if we decide to add more hooks. We will need to really clearly document that these must be explicitly set in order to enable conditional edges. Notice for the below case, `Middleware2` does actually enable jumps. <table> <thead> <tr> <th>Before (broken), adding conditional edges unconditionally</th> <th>After (fixed), adding conditional edges sparingly</th> </tr> </thead> <tbody> <tr> <td> <img width="619" height="508" alt="Screenshot 2025-09-23 at 10 23 23 AM" src="https://github.com/user-attachments/assets/bba2d098-a839-4335-8e8c-b50dd8090959" /> </td> <td> <img width="469" height="490" alt="Screenshot 2025-09-23 at 10 23 13 AM" src="https://github.com/user-attachments/assets/717abf0b-fc73-4d5f-9313-b81247d8fe26" /> </td> </tr> </tbody> </table> <details> <summary>Snippet for the above</summary> ```py from typing import Any from langchain.agents.tool_node import InjectedState from langgraph.runtime import Runtime from langchain.agents.middleware.types import AgentMiddleware, AgentState from langchain.agents.middleware_agent import create_agent from langchain_core.tools import tool from typing import Annotated from langchain_core.messages import HumanMessage from typing_extensions import NotRequired @tool def simple_tool(input: str) -> str: """A simple tool.""" return "successful tool call" class Middleware1(AgentMiddleware): """Custom middleware that adds a simple tool.""" tools = [simple_tool] def before_model(self, state: AgentState, runtime: Runtime) -> None: return None def after_model(self, state: AgentState, runtime: Runtime) -> None: return None class Middleware2(AgentMiddleware): before_model_jump_to = ["tools", "__end__"] def before_model(self, state: AgentState, runtime: Runtime) -> None: return None def after_model(self, state: AgentState, runtime: Runtime) -> None: return None class Middleware3(AgentMiddleware): def before_model(self, state: AgentState, runtime: Runtime) -> None: return None def after_model(self, state: AgentState, runtime: Runtime) -> None: return None builder = create_agent( model="openai:gpt-4o-mini", middleware=[Middleware1(), Middleware2(), Middleware3()], system_prompt="You are a helpful assistant.", ) agent = builder.compile() ``` </details> ## More Examples ### Guardrails `after_model` <img width="379" height="335" alt="Screenshot 2025-09-23 at 10 40 09 AM" src="https://github.com/user-attachments/assets/45bac7dd-398e-45d1-ae58-6ecfa27dfc87" /> <details> <summary>Code</summary> ```py from langchain.agents.middleware_agent import create_agent, AgentState, ModelRequest from langchain.agents.middleware.types import after_model from langchain_core.messages import HumanMessage, AIMessage from langgraph.checkpoint.memory import InMemorySaver from typing import cast, Any @after_model(jump_to=["model", "__end__"]) def after_model_hook(state: AgentState) -> dict[str, Any]: """Check the last AI message for safety violations.""" last_message_content = cast(AIMessage, state["messages"][-1]).content.lower() print(last_message_content) unsafe_keywords = ["pineapple"] if any(keyword in last_message_content for keyword in unsafe_keywords): # Jump back to model to regenerate response return {"jump_to": "model", "messages": [HumanMessage("Please regenerate your response, and don't talk about pineapples. You can talk about apples instead.")]} return {"jump_to": "__end__"} # Create agent with guardrails middleware agent = create_agent( model="openai:gpt-4o-mini", middleware=[after_model_hook], system_prompt="Keep your responses to one sentence please!" ).compile() # Test with potentially unsafe input result = agent.invoke( {"messages": [HumanMessage("Tell me something about pineapples")]}, ) for msg in result["messages"]: print(msg.pretty_print()) """ ================================ Human Message ================================= Tell me something about pineapples None ================================== Ai Message ================================== Pineapples are tropical fruits known for their sweet, tangy flavor and distinctive spiky exterior. None ================================ Human Message ================================= Please regenerate your response, and don't talk about pineapples. You can talk about apples instead. None ================================== Ai Message ================================== Apples are popular fruits that come in various varieties, known for their crisp texture and sweetness, and are often used in cooking and baking. None """ ``` </details>	2025-09-23 13:25:55 -04:00
Sydney Runkle	c3be45bf14	fix(langchain): HITL bug causing dupe interrupt (#33052 ) Need to find last AI msg (not first). Getting too creative w/ generators.	2025-09-22 20:09:12 -04:00
Mason Daugherty	7ddc798f95	fix(openai): pin upper bound to prevent Pydantic 2.7.0 issues (#33038 ) https://github.com/openai/openai-python/issues/2644	2025-09-21 00:27:03 -04:00
Mason Daugherty	7dcf6a515e	fix: update method calls from dict to `model_dump` in `Chain` (#33035 )	2025-09-20 23:47:44 -04:00
Mason Daugherty	043a7560a5	test: use `.get()` for safe `ls_params` access (#33034 )	2025-09-20 23:46:37 -04:00
Mason Daugherty	781db9d892	chore: update `pyproject.toml` files, remove codespell (#33028 ) - Removes Codespell from deps, docs, and `Makefile`s - Python version requirements in all `pyproject.toml` files now use the `~=` (compatible release) specifier - All dependency groups and main dependencies now use explicit lower and upper bounds, reducing potential for breaking changes	2025-09-20 22:09:33 -04:00
Sydney Runkle	f2b0afd0b7	release(langchain): 1.0.0a6 (#33024 ) w/ improvements to HITL, state schema merging, dynamic system prompt	2025-09-19 18:47:41 +00:00
Sydney Runkle	c3654202a3	fix(langchain): use state schema as input schema to middleware nodes (#33023 ) We want state schema as the input schema to middleware nodes because the conditional edges after these nodes need access to the full state. Also, we just generally want all state passed to middleware nodes, so we should be specifying this explicitly. If we don't, the state annotations used by users in their node signatures are used (so they might be missing fields).	2025-09-19 18:43:33 +00:00
Sydney Runkle	4d118777bc	feat(langchain): dynamic system prompt middleware (#33006 ) # Changes ## Adds support for `DynamicSystemPromptMiddleware` ```py from langchain.agents.middleware import DynamicSystemPromptMiddleware from langgraph.runtime import Runtime from typing_extensions import TypedDict class Context(TypedDict): user_name: str def system_prompt(state: AgentState, runtime: Runtime[Context]) -> str: user_name = runtime.context.get("user_name", "n/a") return f"You are a helpful assistant. Always address the user by their name: {user_name}" middleware = DynamicSystemPromptMiddleware(system_prompt) ``` ## Adds support for `runtime` in middleware hooks ```py class AgentMiddleware(Generic[StateT, ContextT]): def modify_model_request( self, request: ModelRequest, state: StateT, runtime: Runtime[ContextT], # Optional runtime parameter ) -> ModelRequest: # upgrade model if runtime.context.subscription is `top-tier` or whatever ``` ## Adds support for omitting state attributes from input / output schemas ```py from typing import Annotated, NotRequired from langchain.agents.middleware.types import PrivateStateAttr, OmitFromInput, OmitFromOutput class CustomState(AgentState): # Private field - not in input or output schemas internal_counter: NotRequired[Annotated[int, PrivateStateAttr]] # Input-only field - not in output schema user_input: NotRequired[Annotated[str, OmitFromOutput]] # Output-only field - not in input schema computed_result: NotRequired[Annotated[str, OmitFromInput]] ``` ## Additionally * Removes filtering of state before passing into middleware hooks Typing is not foolproof here, still need to figure out some of the generics stuff w/ state and context schema extensions for middleware. TODO: * More docs for middleware, should hold off on this until other prios like MCP and deepagents are met --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-09-18 16:07:16 -04:00
Mason Daugherty	f158cea1e8	release(mistralai): 0.2.12 (#33008 )	2025-09-18 11:42:11 -04:00
Sadiq Khan	90280d1f58	docs(core): fix bugs and improve example code in `chat_history.py` (#32994 ) ## Summary This PR fixes several bugs and improves the example code in `BaseChatMessageHistory` docstring that would prevent it from working correctly. ### Bugs Fixed - Critical bug: Fixed `json.dump(messages, f)` → `json.dump(serialized, f)` - was using wrong variable - NameError: Fixed bare variable references to use `self.storage_path` and `self.session_id` - Missing imports: Added required imports (`json`, `os`, message converters) to make example runnable ### Improvements - Added missing type hints following project standards (`messages() -> list[BaseMessage]`, `clear() -> None`) - Added robust error handling with `FileNotFoundError` exception handling - Added directory creation with `os.makedirs(exist_ok=True)` to prevent path errors - Improved performance: `json.load(f)` instead of `json.loads(f.read())` - Added explicit UTF-8 encoding to all file operations - Updated stores.py to use modern union syntax (`int \| None` vs `Optional[int]`) ### Test Plan - [x] Code passes linting (`ruff check`) - [x] Example code now has all required imports and proper syntax - [x] Fixed variable references prevent runtime errors - [x] Follows project's type annotation standards The example code in the docstring is now fully functional and follows LangChain's coding standards. --------- Co-authored-by: sadiqkhzn <sadiqkhzn@users.noreply.github.com>	2025-09-18 09:34:19 -04:00
Sydney Runkle	d5ba5d3511	feat(langchain): improved HITL patterns (#32996 ) # Main changes / new features ## Better support for parallel tool calls 1. Support for multiple tool calls requiring human input 2. Support for combination of tool calls requiring human input + those that are auto-approved 3. Support structured output w/ tool calls requiring human input 4. Support structured output w/ standard tool calls ## Shortcut for allowed actions Adds a shortcut where tool config can be specified as a `bool`, meaning "all actions allowed" ```py HumanInTheLoopMiddleware(tool_configs={"expensive_tool": True}) ``` ## A few design decisions here * We only raise one interrupt w/ all `HumanInterrupt`s, currently we won't be able to execute all tools until all of these are resolved. This isn't super blocking bc we can't re-invoke the model until all tools have finished execution. That being said, if you have a long running auto-approved tool, this could slow things down. ## TODOs * Ideally, we would rename `accept` -> `approve` * Ideally, we would rename `respond` -> `reject` * Docs update (@sydney-runkle to own) * In another PR I'd like to refactor testing to have one file for each prebuilt middleware :) Fast follow to https://github.com/langchain-ai/langchain/pull/32962 which was deemed as too breaking	2025-09-17 16:53:01 -04:00
Mason Daugherty	54a9556f5c	chore(cli): update lock (#32986 )	2025-09-17 02:08:20 +00:00
Mason Daugherty	66041a2778	refactor(cli): target ruff 310 (#32985 ) Use union types for optional parameters	2025-09-16 22:04:28 -04:00
Chase Lean	543d90e108	docs: add langchain-scraperapi (#31973 ) Adds documentation for the integration langchain-scraperapi, which contains 3 tools using the ScraperAPI service. The tools give AI agents the ability to Scrape the web and return HTML/text/markdown Perform Google search and return json output Perform Amazon search and return json output For reference, here is the official repo for langchain_scraperapi: https://github.com/scraperapi/langchain-scraperapi	2025-09-16 21:46:20 -04:00
ccurme	e63c1d7171	chore(langchain): drop cap on python version (#32974 )	2025-09-16 10:44:21 -04:00
Mason Daugherty	8180020b93	chore: restore commented out optional deps (#32971 ) langchain & langchain_v1	2025-09-16 10:10:49 -04:00
Mason Daugherty	244c699551	refactor(cli): drop Python 3.9 (#32964 )	2025-09-15 19:22:53 -04:00
Ali Ismail	4ebce80fbb	docs(langchain): add docstring for _load_map_reduce_chain (#32961 ) Description: Add a docstring to _load_map_reduce_chain in chains/summarize/ to explain the purpose of the prompt argument and document function parameters. This addresses an existing TODO in the codebase. Issue: N/A (documentation improvement only) Dependencies: None	2025-09-15 17:19:20 -04:00
Mason Daugherty	8670b24c8e	test(groq): xfail tool integration test (#32960 ) Groq models have known issues with tool calling consistency, [particularly with complex tools derived from runnables](https://github.com/langchain-ai/langchain/discussions/19990). [(more)](https://github.com/langchain-ai/langchain/discussions/24309) xfail until we can dedicate time to wrangling their API/model handling	2025-09-15 14:23:22 -04:00
Mason Daugherty	9f6431924f	feat(openai): add `max_tokens` to `AzureChatOpenAI` (#32959 ) Fixes #32949 This pattern is [present in `ChatOpenAI`](https://github.com/langchain-ai/langchain/blob/master/libs/partners/openai/langchain_openai/chat_models/base.py#L2821) but wasn't carried over to Azure. [CI](https://github.com/langchain-ai/langchain/actions/runs/17741751797/job/50417180998)	2025-09-15 14:09:20 -04:00

1 2 3 4 5 ...

7620 Commits