langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 18:50:33 +00:00

Author	SHA1	Message	Date
Mason Daugherty	764db7afab	docs(core): note override for `_get_ls_params` (#37503 )	2026-05-18 14:00:26 -05:00
Mason Daugherty	6c091564ac	chore(core,langchain,openai): refresh stale OpenAI model references (#37487 )	2026-05-18 01:06:42 -05:00
Nick Hollon	3802938f1c	fix(core): accept `Serializable` constructor-envelope wire shape in `_convert_to_message` (#37456 )	2026-05-15 15:34:04 -07:00
Nick Hollon	f42d80ca1c	fix(core): preserve chunk `additional_kwargs` across v3 stream assembly (#37435 ) The v3 streaming path drops `additional_kwargs` from per-chunk `AIMessageChunk`s during assembly: `chunks_to_events` emits no event field for them, and `ChatModelStream._assemble_message` constructs the final `AIMessage` without an `additional_kwargs` argument. Non-streaming `ainvoke` returns the provider message unchanged, so streaming and non-streaming diverge for any provider that uses `additional_kwargs` to carry data outside the typed protocol blocks. ## How this surfaces The concrete failure mode is Gemini's `__gemini_function_call_thought_signatures__` — a per-tool-call signature blob the Google GenAI integration places in `additional_kwargs`, keyed by `tool_call_id`. Gemini requires that signature on follow-up turns to replay the prior thought trace; without it, multi-turn streaming flows lose thought continuity (and may regenerate thinking, charging additional reasoning tokens, or in some cases refuse). Other providers that use `additional_kwargs` (e.g. older `function_call` accumulators, custom routing metadata) hit the same gap; the fix is intentionally provider-agnostic. ## Fix Provider-agnostic, two seams: - `_compat_bridge` accumulates `msg.additional_kwargs` across chunks with `merge_dicts` (matching `AIMessageChunk`'s own merge semantics for fields that accumulate, like `function_call`) and emits the merged dict on the `message-finish` event as an off-spec extension. The bridge already uses one such extension (`metadata` on `MessageFinishData`); this PR follows the same pattern for `additional_kwargs`. - `ChatModelStream._finish` reads the new field; `_assemble_message` threads it onto the final `AIMessage` only when non-empty, preserving today's behavior of leaving `additional_kwargs` empty when no provider data needs to ride on it.	2026-05-14 11:19:45 -07:00
Nick Hollon	649d82f206	fix(core): preserve reasoning blocks alongside tool_call in v3 stream (#37434 ) Closes #37420 --- `stream_events(version="v3")` (and the `astream_events` async twin) silently dropped reasoning content from the final assembled `AIMessage` whenever the same message also produced a tool_call. The bug reproduces against Gemini 2.5 Pro with `include_thoughts=True`: reasoning streams correctly through `ChatModelStream.reasoning`, but the persisted message in the final graph state carries only the `tool_call` block. ## Root cause `_iter_protocol_blocks` in the compat bridge groups per-chunk content blocks by source-side identifier. When a provider doesn't supply an `index` field on its content blocks — which the Google GenAI translator does not for either `reasoning` or `tool_call` blocks — the bridge falls back to positional `i` as the bucket key. Because Gemini typically emits one block per chunk, every reasoning chunk and the later tool_call chunk all key to `0`, and the type mismatch trips `_accumulate`'s self-contained `else` branch. That branch clears accumulated reasoning state and replaces it with the incoming tool_call, so reasoning never reaches `content-block-finish`. ## Fix When a block has no source-side `index`, key it by `("__lc_no_index__", block_type, positional_i)` instead of bare `i`. Same-type chunks at the same position still share a bucket and merge cleanly (streaming text and reasoning unchanged); different-type chunks at the same position now occupy distinct wire blocks and both reach `content-block-finish`. Providers that supply explicit indices (Anthropic, OpenAI Responses) are unaffected. ## Verification Unit-tested at the compat-bridge layer for both sync (`chunks_to_events`) and async (`achunks_to_events`) paths. Verified live against Gemini 2.5 Pro `gemini-2.5-pro` with `thinking_budget=2048`, `include_thoughts=True`, and a single `get_weather` tool. Pre-fix: `final_state.messages[tool_calling_ai_message].content == [{type: tool_call, ...}]`. Post-fix: `[..., {type: reasoning, reasoning: "..."}, {type: tool_call, ...}]`, matching the shape `ainvoke` returns on the same input.	2026-05-14 11:11:30 -07:00
Nick Hollon	da380bccf8	chore(infra): merge v1.4 into master (#37350 )	2026-05-11 11:39:25 -07:00
Mason Daugherty	8b21400627	fix(core): avoid eager `pydantic.v1` import in `@deprecated` (#37308 ) `langchain_core._api.deprecation` previously did `from pydantic.v1.fields import FieldInfo as FieldInfoV1` at module scope, which triggers Pydantic's `UserWarning("Core Pydantic V1 functionality isn't compatible with Python 3.14 or greater.")` on every `langchain_core` import under 3.14+. The v1 symbol is only needed inside one runtime branch of `@deprecated`, so it's now resolved lazily. ## Changes - Replace the top-level v1 `FieldInfo` import with `_is_pydantic_v1_field_info`, which probes `sys.modules.get("pydantic.v1.fields")` instead of forcing the import. The reconstruction inside `deprecated`'s `finalize` closure imports `FieldInfoV1` lazily, gated by the predicate — so the warning only fires if a caller has already loaded `pydantic.v1` themselves. - Add a subprocess-based regression test asserting that importing `langchain_core._api.deprecation` does not pull any `pydantic.v1*` module into `sys.modules`. Verified to fail when the eager import is reintroduced. - Add a v1 `FieldInfo` decoration test — the v1 branch of `@deprecated` previously had zero direct coverage. - Update the stale `# Last Any should be FieldInfoV1 but this leads to circular imports` comment on `T`'s bound, which no longer reflects the real reason (it's about the 3.14 warning, not circularity).	2026-05-09 20:35:17 -04:00
Nick Hollon	5039dfec1f	release(core): 1.3.3 (#37198 )	2026-05-05 15:00:01 -04:00
Nick Hollon	55a7707837	fix(core): set deprecation `since` to 1.3.3 to match release (#37200 )	2026-05-05 14:59:47 -04:00
Nick Hollon	c979c6187b	fix(core, langchain): harden `load()` against untrusted manifests (#37197 )	2026-05-05 14:36:58 -04:00
Mason Daugherty	a1f336fdc7	fix(core): preserve structured `inputs` on tool runs in tracers (#37108 ) Tool runs in `_TracerCore._create_tool_run` were discarding the structured `inputs` dict that `BaseTool.run` passes to `on_tool_start`, replacing it with `{"input": str(filtered_tool_input)}`. Consequently, every multi-arg tool (e.g. ones in `deepagents` like `execute`, `edit_file`, `write_file`, `grep`, ...) appeared in LangSmith with a stringified, escaped dump of its arguments — multi-line bash commands rendered with `\n` and were effectively unreadable. Chain runs already preserved dicts via `_get_chain_inputs`; tool runs are now symmetric. ## Changes - Preserve `inputs` when it is already a `dict` in the `original` / `original+chat` branch of `_TracerCore._create_tool_run`, falling back to `{"input": input_str}` only when no structured payload was provided - Add regression tests in the sync and async base-tracer suites that pass a structured `inputs` to `on_tool_start` and assert the dict survives onto the resulting `Run` ## Breaking change Custom `BaseTracer` subclasses that parsed `Run.inputs["input"]` as a stringified dict for tool runs will need to read the structured fields directly. The shape now matches what `on_tool_start(inputs=...)` has always received — introduced alongside `_schema_format` in the `astream_events` work — and what `streaming_events` consumers already see.	2026-04-30 14:56:14 -04:00
Mason Daugherty	37be34be82	fix(core): make `removal` optional in `warn_deprecated` (#37056 ) Drop the `NotImplementedError` branch in `warn_deprecated` so callers can pass `pending=False` without specifying a `removal` version. The previous behavior contradicted the docstring (which claimed an empty default would auto-compute a removal version) — no such computation existed; the function just raised a placeholder "Need to determine which default deprecation schedule to use" error.	2026-04-28 11:05:31 -04:00
Sharvil Saxena	78546e9242	fix(core): validate batch_size in _batch and _abatch to prevent infinite loop (#36663 )	2026-04-26 15:13:20 -04:00
Nick Hollon	c4498ccaf9	chore(core): mark stream_v2/astream_v2 as beta (#36992 )	2026-04-24 13:27:38 -04:00
Nick Hollon	fa0f0d8efa	release(core): 1.3.2 (#36990 )	2026-04-24 11:46:25 -04:00
Nick Hollon	9ce72eba9f	feat(core): add content-block-centric streaming (v2) (#36834 )	2026-04-24 11:36:17 -04:00
ccurme	3f382a9e20	release(core): 1.3.1 (#36972 )	2026-04-23 14:50:43 -04:00
Hunter Lovell	9a671d7919	feat(core): allow _format_output to pass through list of ToolOutputMixin instances (#36963 )	2026-04-23 13:49:46 -04:00
Jacob Lee	40026a7282	feat(core): Update inheritance behavior for tracer metadata for special keys (#36900 ) JS equivalent: https://github.com/langchain-ai/langchainjs/pull/10733	2026-04-20 14:58:01 -07:00
Eugene Yurtsev	c87cd04927	release(core): release 1.3.0 (#36851 ) xRelease 1.3.0	2026-04-17 14:42:01 +00:00
Eugene Yurtsev	af0e174ef7	release(core): 1.3.0a3 (#36829 ) release 1.3.0a3	2026-04-16 15:37:28 -04:00
Eugene Yurtsev	b00646d882	chore(core): keep checkpoint_ns behavior in streaming metadata for backwards compat (#36828 ) minor buglet	2026-04-16 15:17:20 -04:00
Jacob Lee	c04e05feb1	feat(core): Add chat model and LLM invocation params to traceable metadata (#36771 ) Equivalent to: https://github.com/langchain-ai/langchainjs/pull/10711/ --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2026-04-16 18:30:54 +00:00
ccurme	338aa8131a	fix(core): restore cloud metadata IPs and link-local range in SSRF policy (#36816 )	2026-04-16 09:15:42 -04:00
ccurme	7d601dc2c6	chore(core): harden private SSRF utilities (#36768 )	2026-04-15 16:13:20 -04:00
William FH	885f2c2c2d	fix(openai): handle content blocks without type key in responses api conversion (#36725 )	2026-04-14 15:13:40 -04:00
Eugene Yurtsev	8182d6302d	release(core): 1.3.0.a2 (#36698 ) release 1.3.0a2	2026-04-13 10:13:48 -04:00
Jacob Lee	a6eb829701	fix(core): Use reference counting for storing inherited run trees to support garbage collection (#36660 ) When a langsmith `@traceable` function invokes a LangChain Runnable or LangGraph subgraph, the callback manager's `_configure` function injects the `@traceable` RunTree into the `LangChainTracer`'s `run_map` so that child runs can resolve their parent for trace nesting. However, since the RunTree was created outside the tracer's callback lifecycle, `_end_trace` never removes it. The entry persists in `run_map` indefinitely, retaining the full RunTree and its entire child tree. In applications with nested subgraph invocations (e.g. an outer investigation graph delegating to skill agent subgraphs, each compiled as their own `StateGraph`), this causes RunTree objects to accumulate linearly with every call. Fix: Track which `run_map` entries were injected externally via a shared `_external_run_ids` refcount dict on `_TracerCore`. When `_start_trace` adds a child under an external parent, it increments the count. When `_end_trace` finishes a child, it decrements — and evicts the external parent from `run_map` once the last child completes. The refcount (rather than a simple set) is necessary because a single external parent may have multiple sibling children in the callback chain (e.g. a `prompt \| llm` `RunnableSequence`). Only truly external runs are tracked — the `_configure` guard `if run_id_str not in handler.run_map` prevents tracer-managed runs from being misclassified.	2026-04-13 09:50:37 -04:00
Mason Daugherty	cfb16f634f	docs(core): nit (#36685 )	2026-04-12 12:56:02 -05:00
Eugene Yurtsev	9ee4617fba	release(core): 1.3.0a1 (#36656 ) 1.3.0a1 release	2026-04-10 11:58:34 -04:00
Eugene Yurtsev	af4d711a2f	chore(core): reduce streaming metadata / perf (#36588 ) - looking into reducing streaming metadata / perfm --------- Co-authored-by: William Fu-Hinthorn <13333726+hinthornw@users.noreply.github.com>	2026-04-10 10:47:54 -04:00
Eugene Yurtsev	dd7c3eb3a4	release(core): release 1.2.28 (#36614 ) release 1.27.8	2026-04-08 14:15:50 -04:00
Eugene Yurtsev	af2ed47c6f	fix(core): add more sanitization to templates (#36612 ) add more sanitization to templates	2026-04-08 14:10:10 -04:00
ccurme	6486404116	release(core): 1.2.27 (#36586 )	2026-04-07 10:52:46 -04:00
ccurme	7629c74726	fix(core): handle symlinks in deprecated prompt save path (#36585 ) Resolve symlinks before validating file extensions in the deprecated `save()` method on prompt classes. Credit to Jeff Ponte (@JDP-Security) for reporting the symlink resolution issue.	2026-04-07 10:45:42 -04:00
Mason Daugherty	0a1d290ac2	release(core): 1.2.26 (#36511 )	2026-04-03 19:27:36 -04:00
Michael Chin	ebecdddb1b	fix(core): add init validator and serialization mappings for Bedrock models (#34510 ) Adds serialization mappings for `ChatBedrockConverse` and `BedrockLLM` to unblock standard tests on `langchain-core>=1.2.5` (context: [langchain-aws#821](https://github.com/langchain-ai/langchain-aws/pull/821)). Also introduces a class-specific validator system in `langchain_core.load` that blocks deserialization of AWS Bedrock models when `endpoint_url` or `base_url` parameters are present, preventing SSRF attacks via crafted serialized payloads. Closes #34645 ## Changes - Add `ChatBedrockConverse` and `BedrockLLM` entries to `SERIALIZABLE_MAPPING` in `mapping.py`, mapping legacy paths to their `langchain_aws` import locations - Add `validators.py` with `_bedrock_validator` — rejects deserialization kwargs containing `endpoint_url` or `base_url` for all Bedrock-related classes (`ChatBedrock`, `BedrockChat`, `ChatBedrockConverse`, `ChatAnthropicBedrock`, `BedrockLLM`, `Bedrock`) - `CLASS_INIT_VALIDATORS` registry covers both serialized (legacy) keys and resolved import paths from `ALL_SERIALIZABLE_MAPPINGS`, preventing bypass via direct-path payloads - Move kwargs extraction and all validator checks (`CLASS_INIT_VALIDATORS` + `init_validator`) in `Reviver.__call__` to run before `importlib.import_module()` — fail fast on security violations before executing third-party code - Class-specific validators are independent of `init_validator` and cannot be disabled by passing `init_validator=None` ## Testing - `test_validator_registry_keys_in_serializable_mapping` — structural invariant test ensuring every `CLASS_INIT_VALIDATORS` key exists in `ALL_SERIALIZABLE_MAPPINGS` - 10 end-to-end `load()` tests covering all Bedrock class paths (legacy aliases, resolved import paths, `ChatAnthropicBedrock`, `init_validator=None` bypass attempt) - Unit tests for `_bedrock_validator` covering `endpoint_url`, `base_url`, both params, and safe kwargs --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2026-04-03 19:22:39 -04:00
Mason Daugherty	e94cd41fee	feat(core): add `ChatBaseten` to serializable mapping (#36510 ) Register `ChatBaseten` from `langchain_baseten` in the core serialization mapping so it can round-trip through `loads`/`dumps`. Without this entry, serialized `ChatBaseten` objects fail to deserialize.	2026-04-03 18:46:58 -04:00
Mason Daugherty	aec6d42d10	chore(core): drop `gpt-3.5-turbo` from docstrings (#36497 )	2026-04-03 10:53:33 -04:00
Ujjwal Reddy K S	d1529dd0bc	fix(core): correct parameter names in filter_messages docstring example (#36462 )	2026-04-03 09:10:17 -04:00
ccurme	e89afedfec	release(core): 1.2.25 (#36473 )	2026-04-02 18:36:14 -04:00
ccurme	0b5f2c08ee	fix(core): harden check for txt files in deprecated prompt loading functions (#36471 )	2026-04-02 16:42:48 -04:00
jasiecky	c9f51aef85	fix(core): fixed typos in the documentation (#36459 ) Fixes #36458 Fixed typos in the documentation in the core module.	2026-04-02 11:32:12 -04:00
ccurme	b3dff4a04c	release(core): 1.2.24 (#36434 )	2026-04-01 15:57:16 -04:00
ccurme	bdfd4462ac	feat(core): impute placeholder filenames for OpenAI file inputs (#36433 )	2026-04-01 14:41:53 -04:00
Weiguang Li	e6c1b29e80	fix(core): add "computer" to _WellKnownOpenAITools (#36261 )	2026-03-29 08:54:42 -04:00
ccurme	d48364130d	release(core): 1.2.23 (#36323 )	2026-03-27 19:25:21 -04:00
Jacob Lee	389f7ad1bc	revert: Revert "fix(core): trace invocation params in metadata" (#36322 )	2026-03-27 19:14:02 -04:00
ccurme	d22df94537	release(core): 1.2.22 (#36201 )	2026-03-24 14:45:30 -04:00
ccurme	27add91347	fix(core): validate paths in `prompt.save` and `load_prompt`, deprecate methods (#36200 )	2026-03-24 14:27:14 -04:00

1 2 3 4 5 ...

1335 Commits