langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 18:50:33 +00:00

Author	SHA1	Message	Date
Nick Hollon	da380bccf8	chore(infra): merge v1.4 into master (#37350 )	2026-05-11 11:39:25 -07:00
Nick Hollon	c979c6187b	fix(core, langchain): harden `load()` against untrusted manifests (#37197 )	2026-05-05 14:36:58 -04:00
Mason Daugherty	a1f336fdc7	fix(core): preserve structured `inputs` on tool runs in tracers (#37108 ) Tool runs in `_TracerCore._create_tool_run` were discarding the structured `inputs` dict that `BaseTool.run` passes to `on_tool_start`, replacing it with `{"input": str(filtered_tool_input)}`. Consequently, every multi-arg tool (e.g. ones in `deepagents` like `execute`, `edit_file`, `write_file`, `grep`, ...) appeared in LangSmith with a stringified, escaped dump of its arguments — multi-line bash commands rendered with `\n` and were effectively unreadable. Chain runs already preserved dicts via `_get_chain_inputs`; tool runs are now symmetric. ## Changes - Preserve `inputs` when it is already a `dict` in the `original` / `original+chat` branch of `_TracerCore._create_tool_run`, falling back to `{"input": input_str}` only when no structured payload was provided - Add regression tests in the sync and async base-tracer suites that pass a structured `inputs` to `on_tool_start` and assert the dict survives onto the resulting `Run` ## Breaking change Custom `BaseTracer` subclasses that parsed `Run.inputs["input"]` as a stringified dict for tool runs will need to read the structured fields directly. The shape now matches what `on_tool_start(inputs=...)` has always received — introduced alongside `_schema_format` in the `astream_events` work — and what `streaming_events` consumers already see.	2026-04-30 14:56:14 -04:00
Nick Hollon	9ce72eba9f	feat(core): add content-block-centric streaming (v2) (#36834 )	2026-04-24 11:36:17 -04:00
Jacob Lee	40026a7282	feat(core): Update inheritance behavior for tracer metadata for special keys (#36900 ) JS equivalent: https://github.com/langchain-ai/langchainjs/pull/10733	2026-04-20 14:58:01 -07:00
Jacob Lee	a6eb829701	fix(core): Use reference counting for storing inherited run trees to support garbage collection (#36660 ) When a langsmith `@traceable` function invokes a LangChain Runnable or LangGraph subgraph, the callback manager's `_configure` function injects the `@traceable` RunTree into the `LangChainTracer`'s `run_map` so that child runs can resolve their parent for trace nesting. However, since the RunTree was created outside the tracer's callback lifecycle, `_end_trace` never removes it. The entry persists in `run_map` indefinitely, retaining the full RunTree and its entire child tree. In applications with nested subgraph invocations (e.g. an outer investigation graph delegating to skill agent subgraphs, each compiled as their own `StateGraph`), this causes RunTree objects to accumulate linearly with every call. Fix: Track which `run_map` entries were injected externally via a shared `_external_run_ids` refcount dict on `_TracerCore`. When `_start_trace` adds a child under an external parent, it increments the count. When `_end_trace` finishes a child, it decrements — and evicts the external parent from `run_map` once the last child completes. The refcount (rather than a simple set) is necessary because a single external parent may have multiple sibling children in the callback chain (e.g. a `prompt \| llm` `RunnableSequence`). Only truly external runs are tracked — the `_configure` guard `if run_id_str not in handler.run_map` prevents tracer-managed runs from being misclassified.	2026-04-13 09:50:37 -04:00
Eugene Yurtsev	af4d711a2f	chore(core): reduce streaming metadata / perf (#36588 ) - looking into reducing streaming metadata / perfm --------- Co-authored-by: William Fu-Hinthorn <13333726+hinthornw@users.noreply.github.com>	2026-04-10 10:47:54 -04:00
Mason Daugherty	61fd90a2f3	fix(core): extract usage metadata from serialized tracer message outputs (#35526 ) Fixes missing `run.metadata.usage_metadata` population in `LangChainTracer` for real LLM/chat traces following #34414 - Fix extraction to read usage from serialized tracer message shape: `outputs.generations[][].message.kwargs.usage_metadata` - Remove non-serialized direct message shape handling (`message.usage_metadata`) from extractor to match real tracer output path - Clarify tracer docstrings around chat callback naming (`on_chat_model_start` + shared `on_llm_end`) to reduce ambiguity ## Why #34414 introduced usage duplication into `run.metadata.usage_metadata`, but the extractor read `message.usage_metadata`. In real tracer flow, messages are serialized with `dumpd(...)` during run completion, so usage metadata lives under `message.kwargs.usage_metadata`. Because of this mismatch, duplication did not trigger in real traces.	2026-03-02 17:43:33 -05:00
Shivangi Sharma	f7dbdab5ba	docs: fix docstring inaccuracies and update outdated LangSmith URLs (#35283 ) Fix several docstring inaccuracies in langchain-core and update outdated LangSmith URLs across three README files. Docstring fixes (libs/core): - `tap_output_iter`: docstring says "async iterator" but method accepts sync `Iterator` - `agenerate_from_stream`: docstring says "Iterator" but method accepts `AsyncIterator` - `BaseLLM.OutputType`: docstring says "input type" but property returns output type - Grammar: "or deprecated" → "or be deprecated", "relies" → "rely", "whose the" → "whose" URL fixes (libs/core, libs/langchain, libs/langchain_v1): - Updated `smith.langchain.com` → `www.langchain.com/langsmith` (root README already uses the correct URL) Verified with `make lint` and `make format` in libs/core — no new issues introduced. Changes are docs-only with no code logic impact. This PR was created with assistance from an AI coding tool.	2026-02-17 11:22:18 -05:00
Luka Aladashvili	97ee14c179	fix(core): replace bare except with Exception in tracer (#35138 ) ## Description This PR replaces a bare `except:` clause with `except Exception:` in `libs/core/langchain_core/tracers/core.py`. The previous implementation caught `BaseException`, which includes `SystemExit` and `KeyboardInterrupt`. This meant that if a user tried to interrupt the program (Ctrl+C) during a traceback formatting error, the signal would be suppressed, potentially making the process un-killable. This change ensures that standard runtime errors are still caught and logged, but system control signals are allowed to propagate correctly. ## Verification - Verified via code inspection. - This is a standard safety fix for exception handling patterns in Python to avoid suppressing system exit signals.	2026-02-10 12:12:46 -05:00
Mason Daugherty	11df1bedc3	style(core): lint (#34862 ) it looks scary but i promise it is not improving documentation consistency across core. primarily update docstrings and comments for better formatting, readability, and accuracy, as well as add minor clarifications and formatting improvements to user-facing documentation.	2026-01-23 23:07:48 -05:00
Shreyansh Singh Gautam	2ef23882d2	fix(core): add `tool_call_id` to `on_tool_error` event data (#33731 ) # Add `tool_call_id` to `on_tool_error` event data ## Summary This PR addresses issue #33597 by adding `tool_call_id` to the `on_tool_error` callback event data. This enables users to link tool errors to specific tool calls in stateless agent implementations, which is essential for building OpenAI-compatible APIs and tracking tool execution flows. ## Problem When streaming events using `astream_events` with `version="v2"`, the `on_tool_error` event only included the error and input data, but lacked the `tool_call_id`. This made it difficult to: - Link errors to specific tool calls in stateless agent scenarios - Implement OpenAI-compatible APIs that require tool call tracking - Track tool execution flows when using `run_id` is not sufficient ## Solution The fix adds `tool_call_id` propagation through the callback chain: 1. Pass `tool_call_id` to callbacks: Updated `BaseTool.run()` and `BaseTool.arun()` to pass `tool_call_id` to both `on_tool_start` and `on_tool_error` callbacks 2. Store in event stream handler: Modified `_AstreamEventsCallbackHandler` to store `tool_call_id` in run info during `on_tool_start` 3. Include in error events: Updated `on_tool_error` handler to extract and include `tool_call_id` in the event data ## Changes - `libs/core/langchain_core/tools/base.py`: - Pass `tool_call_id` to `on_tool_start` in both sync and async methods - Pass `tool_call_id` to `on_tool_error` when errors occur - `libs/core/langchain_core/tracers/event_stream.py`: - Store `tool_call_id` in run info during `on_tool_start` - Extract `tool_call_id` from kwargs or run info in `on_tool_error` - Include `tool_call_id` in the `on_tool_error` event data ## Testing The fix was verified by: 1. Direct tool invocation: Confirmed `tool_call_id` appears in `on_tool_error` event data when calling tools directly 2. Agent integration: Tested with `create_agent` to ensure `tool_call_id` is present in error events during agent execution ```python # Example verification async for event in agent.astream_events( {"messages": "Please demonstrate a tool error"}, version="v2", ): if event["event"] == "on_tool_error": assert "tool_call_id" in event["data"] # ✓ Now passes print(event["data"]["tool_call_id"]) ``` ## Backward Compatibility - ✅ Fully backward compatible: `tool_call_id` is optional (can be `None`) - ✅ No breaking changes: All changes are additive - ✅ Existing code continues to work without modification ## Related Issues Fixes #33597 --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2026-01-10 02:35:13 -05:00
Christophe Bornet	8e3c6b109f	style(core): fix some noqa escapes (#34675 ) Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2026-01-09 17:36:08 -05:00
Angus Jelinek	458a186540	chore(core): Update LangChainTracer to use Pydantic v2 methods (#34541 )	2026-01-02 16:02:13 -05:00
Christophe Bornet	03ae39747b	refactor(core): fix some missing generic types (#31658 ) See https://mypy.readthedocs.io/en/stable/config_file.html#confval-disallow_any_generics --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-12-27 16:53:08 -06:00
Christophe Bornet	a92c032ff6	style(core): fix mypy no-any-return violations (#34204 ) * FIxed where possible * Used `cast` when not possible to fix --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-26 21:35:27 -06:00
Christophe Bornet	1f403cf612	style(core): add ruff rules TC (#34476 ) * Fixed a few TC * Added a few Pydantic classes to `flake8-type-checking.runtime-evaluated-base-classes` (not as much as I would have imagined) * Added a few `noqa: TC` * Activated TC rules	2025-12-25 21:23:31 -06:00
ccurme	5ec0fa69de	fix(core): serialization patch (#34455 ) - `allowed_objects` kwarg in `load` - escape lc-ser formatted dicts on `dump` - fix for jinja2 --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-12-22 17:33:31 -06:00
Mason Daugherty	04ec6cacaf	fix(core): ensure `tool_call_count` is never null (#34431 ) add truthiness check to guard against `None`	2025-12-19 21:04:01 -06:00
Mason Daugherty	ed9bd6e3ad	feat(core): automatically count and store meta for tool call count (#33756 ) Adds automatic tool call counting to tracing by means of a new `store_tool_call_count_in_run()`, which calls on newly added `count_tool_calls_in_run()`. Runs on successful LLM completion. Does not run on errored runs.	2025-12-19 20:41:57 -06:00
Christophe Bornet	72f1d79022	chore(core): fix some ruff preview rules (#34425 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-19 14:33:42 -06:00
Hunter Lovell	7902fa3238	feat(core): add `usage_metadata` to metadata in `LangChainTracer` (#34414 ) Adds `usage_metadata` (token counts, etc.) to the run metadata in `LangChainTracer`. When an LLM run ends, usage metadata is extracted from all generations and aggregated using the existing `add_usage` helper, then stored in `run.extra["metadata"]["usage_metadata"]`. The original data in outputs remains unchanged. Also, see #34415 --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-19 12:59:52 -06:00
Hunter Lovell	9225bff326	fix(core): defer persisting traces for iterator inputs (#34416 ) ref https://github.com/langchain-ai/langchainjs/pull/9665 Fixes trace persistence for iterator/generator inputs (like `RunnableGenerator`) where the full input isn't available at chain start. Instead of POSTing a run with incomplete inputs on start and PATCHing later, this defers the POST until chain end when inputs are fully realized. --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-12-19 12:45:22 -06:00
Christophe Bornet	8bca31f8c4	chore(core): fix some docstrings (#34426 )	2025-12-19 13:08:10 -05:00
Christophe Bornet	bb71f53585	chore(core): use anext and deprecate py_anext (#34211 ) LangChain uses Python 3.10+ so `py_anext` isn't needed anymore. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-08 09:50:40 -05:00
William FH	1867521d1a	feat: Use uuid7 for run ids (#34172 ) Co-authored-by: Sydney Runkle <54324534+sydney-runkle@users.noreply.github.com> Co-authored-by: Sydney Runkle <sydneymarierunkle@gmail.com>	2025-12-03 10:09:10 -08:00
Christophe Bornet	2bfbc29ccc	chore(core): fix some ruff TC rules (#33929 ) fix some ruff TC rules but still don't enforce them as Pydantic model fields use type annotations at runtime.	2025-11-12 14:07:19 -05:00
Mason Daugherty	a2a9a02ecb	style(core): more cleanup all around (#33711 )	2025-10-28 22:58:19 -04:00
Christophe Bornet	2d5efd7b29	fix(core): support for Python 3.14 (#33461 ) * Fix detection of support of context in `asyncio.create_task` * Fix: in Python 3.14 `asyncio.get_event_loop()` raises an exception if there's no running loop * Bump pydantic to version 2.12 * Skips tests with pydantic v1 models as they are not supported with Python 3.14 * Run core tests with Python 3.14 in CI. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Sydney Runkle <54324534+sydney-runkle@users.noreply.github.com>	2025-10-17 05:27:34 -04:00
Mason Daugherty	26e0a00c4c	style: more work for refs (#33508 ) Largely: - Remove explicit `"Default is x"` since new refs show default inferred from sig - Inline code (useful for eventual parsing) - Fix code block rendering (indentations)	2025-10-15 18:46:55 -04:00
Mason Daugherty	53e9f00804	chore(core): delete items marked for removal in `schemas.py` (#33375 )	2025-10-15 09:56:27 -04:00
Christophe Bornet	dd994b9d7f	chore(langchain): remove arg types from docstrings (#33413 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-10-10 11:51:00 -04:00
Mason Daugherty	5f9e3e33cd	style: remove `Defaults to None` (#33404 )	2025-10-09 17:27:35 -04:00
Mason Daugherty	6fc21afbc9	style: `.. code-block::` admonition translations (#33400 ) biiiiiiiiiiiiiiiigggggggg pass	2025-10-09 16:52:58 -04:00
Mason Daugherty	d8a680ee57	style: address Sphinx double-backtick snippet syntax (#33389 )	2025-10-09 13:35:51 -04:00
Christophe Bornet	f405a2c57d	chore(core): remove arg types from docstrings (#33388 ) * Remove types args * Remove types from Returns * Remove types from Yield * Replace `kwargs` by `**kwargs` when needed	2025-10-09 13:13:23 -04:00
Mason Daugherty	b6132fc23e	style: remove more `Optional` syntax (#33371 )	2025-10-08 23:28:43 -04:00
Mason Daugherty	d13823043d	style: monorepo pass for refs (#33359 ) * Delete some double backticks previously used by Sphinx (not done everywhere yet) * Fix some code blocks / dropdowns Ignoring CLI CI for now	2025-10-08 18:41:39 -04:00
Mason Daugherty	6ea03ab46c	style(core): drop python `39` linting target for 3.10 (#33286 )	2025-10-05 23:22:34 -04:00
Mason Daugherty	5a016de53f	chore: delete deprecated items (#33192 ) Removed: - `libs/core/langchain_core/chat_history.py`: `add_user_message` and `add_ai_message` in favor of `add_messages` and `aadd_messages` - `libs/core/langchain_core/language_models/base.py`: `predict`, `predict_messages`, and async versions in favor of `invoke`. removed `_all_required_field_names` since it was a wrapper on `get_pydantic_field_names` - `libs/core/langchain_core/language_models/chat_models.py`: `callback_manager` param in favor of `callbacks`. `__call__` and `call_as_llm` method in favor of `invoke` - `libs/core/langchain_core/language_models/llms.py`: `callback_manager` param in favor of `callbacks`. `__call__`, `predict`, `apredict`, and `apredict_messages` methods in favor of `invoke` - `libs/core/langchain_core/prompts/chat.py`: `from_role_strings` and `from_strings` in favor of `from_messages` - `libs/core/langchain_core/prompts/pipeline.py`: removed `PipelinePromptTemplate` - `libs/core/langchain_core/prompts/prompt.py`: `input_variables` param on `from_file` as it wasn't used - `libs/core/langchain_core/tools/base.py`: `callback_manager` param in favor of `callbacks` - `libs/core/langchain_core/tracers/context.py`: `tracing_enabled` in favor of `tracing_enabled_v2` - `libs/core/langchain_core/tracers/langchain_v1.py`: entire module - `libs/core/langchain_core/utils/loading.py`: entire module, `try_load_from_hub` - `libs/core/langchain_core/vectorstores/in_memory.py`: `upsert` in favor of `add_documents` - `libs/standard-tests/langchain_tests/integration_tests/chat_models.py` and `libs/standard-tests/langchain_tests/unit_tests/chat_models.py`: `tool_choice_value` as models should accept `tool_choice="any"` - `langchain` will consequently no longer expose these items if it was previously --------- Co-authored-by: Mohammad Mohtashim <45242107+keenborder786@users.noreply.github.com> Co-authored-by: Caspar Broekhuizen <caspar@langchain.dev> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Christophe Bornet <cbornet@hotmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Sadra Barikbin <sadraqazvin1@yahoo.com> Co-authored-by: Vadym Barda <vadim.barda@gmail.com>	2025-10-03 03:33:24 +00:00
Mason Daugherty	eaa6dcce9e	release: v1.0.0 (#32567 ) Co-authored-by: Mohammad Mohtashim <45242107+keenborder786@users.noreply.github.com> Co-authored-by: Caspar Broekhuizen <caspar@langchain.dev> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Christophe Bornet <cbornet@hotmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Sadra Barikbin <sadraqazvin1@yahoo.com> Co-authored-by: Vadym Barda <vadim.barda@gmail.com>	2025-10-02 10:49:42 -04:00
Mason Daugherty	8e213c9f1a	fix(core): `AsyncCallbackHandler` docstring cleanup (#32897 ) plus IDE warning fixes	2025-09-10 21:31:45 -04:00
William FH	f1d44d0f9d	fix(core): honor `enabled=false` in nested tracing (#31986 ) Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-09-09 13:12:17 -07:00
Christophe Bornet	cc98fb9bee	chore(core): add ruff rule PLC0415 (#32351 ) See https://docs.astral.sh/ruff/rules/import-outside-top-level/ Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-08 14:15:04 -04:00
Christophe Bornet	16420cad71	chore(core): fix some pydocs to use google-style (#32764 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-08 17:52:17 +00:00
Christophe Bornet	01fdeede50	chore(core): fix some ruff preview rules (#32785 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-08 15:55:20 +00:00
Christophe Bornet	f4e83e0ad8	chore(core): fix some docstrings (from DOC preview rule) (#32833 ) * Add `Raises` sections * Add `Returns` sections * Add `Yields` sections --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-08 15:44:15 +00:00
Maitrey Talware	622337a297	docs(docs): fixed typos in documentations (#32661 ) Minor typo fixes. (Not linked to current open issues)	2025-08-25 10:02:53 -04:00
William FH	b470c79f1d	refactor(core): Use duck typing for `_StreamingCallbackHandler` (#32535 ) It's used in langgraph and maybe elsewhere, so would be preferable if it could just be duck-typed	2025-08-19 05:41:07 -07:00
Mason Daugherty	ee4c2510eb	feat: port various nit changes from `wip-v0.4` (#32506 ) Lots of work that wasn't directly related to core improvements/messages/testing functionality	2025-08-11 15:09:08 -04:00

1 2 3 4

170 Commits