langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-07-02 07:07:48 +00:00

Author	SHA1	Message	Date
Alvin Tang	95fe150ad2	fix(core): `_parse_google_docstring` mishandling continuation lines with colons (#35680 ) ## Description `_parse_google_docstring` incorrectly parses multi-line argument descriptions when a continuation line contains a colon. The continuation line is treated as a new argument definition instead of being appended to the current argument's description. ### Example ```python def search(query: str, top_k: int = 5) -> str: """Search the knowledge base. Args: query: The search query to use for finding things: important ones top_k: Number of results to return """ ``` Before (broken): The parser creates 3 args: `query`, `for finding things`, `top_k` After (fixed): The parser correctly creates 2 args: `query` (with full description including "for finding things: important ones"), `top_k` ### Root Cause The parser used `if ":" in line` to detect new argument lines without considering indentation. In Google-style docstrings, continuation lines have deeper indentation than argument definition lines. ### Fix Detect the base indentation level from the first argument line and treat any line with deeper indentation as a continuation of the current argument's description, regardless of whether it contains a colon. ## Issue Fixes #35679 ## Dependencies None. ## Testing Added 4 unit tests in `test_function_calling.py::TestParseGoogleDocstring`: - `test_continuation_line_with_colon` — the core bug scenario - `test_simple_args_still_work` — regression check for basic args - `test_continuation_line_without_colon` — multi-line descriptions without colons - `test_multiple_continuation_lines_with_colons` — multiple continuation lines each containing colons All tests pass locally with Python 3.12. --------- Co-authored-by: gambletan <ethanchang32@gmail.com> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2026-06-23 00:34:02 -04:00
Christophe Bornet	9ac8882a2c	refactor(langchain-classic): remove code for Python < 3.10 (#38194 )	2026-06-18 13:15:32 -04:00
Christophe Bornet	62f255980d	chore(core): add mypy `warn_unreachable` (#38109 ) Enables mypy's `warn_unreachable` rule for `langchain-core`, bringing it in line with the other strict libraries in the monorepo. Previously this rule was intentionally disabled by a code comment, because under mypy 2.x it false-flags intentional defensive runtime checks — most notably the SSRF / IP-policy guards in `langchain_core/_security/` — as unreachable. This PR resolves all of those warnings without deleting or blanket-ignoring the defensive guards, so contributors get unreachable-code coverage going forward and accidental dead code is caught in CI. The bulk of the change is mechanical: a targeted `# type: ignore[unreachable]` on each defensive `else`/error branch that mypy considers unreachable but that we deliberately keep as a runtime guard against unexpected input. A few changes are more substantive and worth a closer look: - `coro_with_context` (`runnables/utils.py`) — behavior change on Python < 3.11. The pre-3.11 path is rewritten to always route through `context.run(asyncio.create_task, coro)`, so the supplied context is reliably propagated to the task. Previously, on 3.10 the helper returned the bare coroutine (run in the caller's context) when `create_task=False`, and dropped the context entirely when `create_task=True`. The new behavior matches 3.11+. The `create_task` parameter is now inert but retained for signature compatibility. All callers `await` the result, so returning a `Task` rather than a coroutine is transparent. - `_create_template_from_message_type` (`prompts/chat.py`) — signature widening. This private helper's `template` parameter now accepts `bool` inside the list, accurately reflecting the existing `["{var}", is_optional]` placeholder form. No public-API impact. - `PydanticOutputFunctionsParser` (`output_parsers/openai_functions.py`). The `pydantic_schema` field is typed as `TypeBaseModel` (which covers both v1 and v2 model classes, unlike the prior annotation), and the `args_only` parse path now dispatches explicitly on `BaseModel` vs `BaseModelV1` rather than duck-typing via `hasattr`. This also yields clearer errors for unsupported / dict schemas. - `_security/_policy.py`. Loop variables are renamed so mypy can narrow their types, which lets the old `# type: ignore[assignment]` comments be dropped. The IP-blocklist logic is unchanged. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2026-06-14 17:05:48 -04:00
Christophe Bornet	0392b6bae4	fix(core): fix Pydantic v1 support in tools/runnable (#33698 ) `BaseTool.args_schema` is documented as accepting a Pydantic v1 model, but several code paths assumed v2 and raised when handed a v1 schema (e.g. an `AttributeError` from calling `model_json_schema()`/`model_fields` on a v1 model). This affected anyone using a v1 `args_schema`, and anyone composing runnables whose input/output schema is a v1 model. This PR makes the tool/runnable schema-derivation code version-agnostic. ## Type contract `TypeBaseModel` (and `PydanticBaseModel`) now include `pydantic.v1.BaseModel`, so the type honestly reflects what tools and runnables already accept at runtime. The public schema accessors (`Runnable.get_input_schema`/`get_output_schema` and the `input_schema`/`output_schema` properties) return `TypeBaseModel`. ## Version-agnostic helpers Added to `langchain_core.utils.pydantic`, each dispatching on the model's Pydantic version so callers don't have to: - `model_json_schema(model)` — JSON schema for either version. - `model_validate(model, obj)` — validation for either version. - `get_fields(model)` — field map for either version (existing helper, now used consistently). Internally, direct `.model_json_schema()` / `.model_fields` calls are replaced with these helpers (or with `get_input_jsonschema()` / `get_output_jsonschema()`). ## Behavior change worth a close look When deriving a schema from a v1 model (in `RunnableParallel`, `RunnableAssign`, and `RunnableSequence` output schemas), a required v1 field is now correctly carried over as required. Previously the v1 path read the field's `default` — which is `None` for a required v1 field — and silently turned required fields into optional/nullable ones; `default_factory` fields were dropped entirely. The new `_get_schema_field_definition` helper translates a v1 `ModelField` faithfully (required → `...`, factory preserved) and dispatches explicitly on the field type. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2026-06-12 00:18:49 -04:00
Christophe Bornet	a063ec26dd	chore(core): fix some `any` generics (#34545 ) Co-authored-by: Mason Daugherty <github@mdrxy.com>	2026-06-10 15:32:14 -04:00
Nidhi Rajani	0f45b2c285	feat(openai): support `apply_patch` built-in tool (#37157 ) [Docs](https://github.com/langchain-ai/docs/pull/4370) Fixes #37031 Adds support for OpenAI Responses API `apply_patch` built-in tool. This PR: - Adds `apply_patch` to the OpenAI well-known tools list so `bind_tools([{"type": "apply_patch"}])` works. - Preserves `apply_patch_call` and `apply_patch_call_output` items when converting OpenAI Responses API outputs into LangChain `AIMessage.content`. - Preserves the same item types in streaming `AIMessageChunk` conversion. - Supports round-trip input conversion for `apply_patch_call` and `apply_patch_call_output`. - Adds unit tests for core tool passthrough, non-streaming conversion, streaming conversion, and round-trip input conversion. ## Testing - `cd libs/core && uv run --group test pytest tests/unit_tests/utils/test_function_calling.py -k "apply_patch" -vv` - `cd libs/partners/openai && uv run --group test pytest tests/unit_tests/chat_models/test_base.py -k "apply_patch" -vv` - `cd libs/core && uv run --all-groups ruff check langchain_core/utils/function_calling.py tests/unit_tests/utils/test_function_calling.py` - `cd libs/partners/openai && uv run --all-groups ruff check langchain_openai/chat_models/base.py tests/unit_tests/chat_models/test_base.py` --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2026-06-09 16:13:37 -04:00
Weiguang Li	e6c1b29e80	fix(core): add "computer" to _WellKnownOpenAITools (#36261 )	2026-03-29 08:54:42 -04:00
Mason Daugherty	9b22f9c450	chore: housekeeping (#35850 )	2026-03-13 16:24:35 -04:00
Mohammad Mohtashim	b21c0a8062	fix(core): preserve default_factory when generating tool call schema (#35550 )	2026-03-08 15:34:21 -04:00
ccurme	fbfe4b812d	feat(openai): support tool search (#35582 )	2026-03-08 08:53:13 -04:00
Guofang.Tang	78678534f9	fix(core): treat empty tool chunk ids as missing in merge (#35414 )	2026-02-24 18:12:49 -05:00
Tanzim Hossain Romel	2d1492a864	fix(core): improve error message for non-JSON-serializable tool schemas (#34376 )	2026-02-22 17:32:00 -05:00
yaowubarbara	5053436dcf	fix(core): fix merge_lists incorrectly merging parallel tool calls (#35281 )	2026-02-18 20:33:17 -05:00
Mason Daugherty	ba3ad67328	fix(core): preserve index and timestamp fields when merging (#34731 ) Porting https://github.com/langchain-ai/langchainjs/pull/9781	2026-02-17 11:29:41 -05:00
Varun Chawla	a5f22e7cb1	chore(core): clean up docstring mismatch and redundant logic in langchain-core (#35064 ) ## Description Fixes #35046 Two minor cleanups in `langchain-core`: 1. Fix docstring mismatch in `mustache.render()`: The docstring incorrectly documented `partials_path` and `partials_ext` parameters that do not exist in the function signature. These were likely carried over from the original [chevron](https://github.com/noahmorrison/chevron) library but were never part of this adapted implementation. 2. Remove redundant logic in `Blob.from_path()`: The expression `mimetypes.guess_type(path)[0] if guess_type else None` had a redundant `if guess_type` ternary since the outer condition `if mime_type is None and guess_type:` already guarantees `guess_type` is `True` at that point. Simplified to just `mimetypes.guess_type(path)[0]`. ## AI Disclaimer An AI coding assistant was used to help identify and implement these changes.	2026-02-10 12:25:50 -05:00
Louis Auneau	f5252b438e	fix(core): google docstring parsing with no arguments/reserved arguments (#34861 )	2026-01-30 22:48:58 -05:00
Mason Daugherty	11df1bedc3	style(core): lint (#34862 ) it looks scary but i promise it is not improving documentation consistency across core. primarily update docstrings and comments for better formatting, readability, and accuracy, as well as add minor clarifications and formatting improvements to user-facing documentation.	2026-01-23 23:07:48 -05:00
David Fernandez	5b401fa414	refactor(core): generalize `comma_list` utility to support any `Iterable` (#34714 ) Updates `comma_list` in `libs/core/langchain_core/utils/strings.py` to accept `Iterable[Any]` instead of `list[Any]`, making the utility more flexible. --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2026-01-12 20:26:59 -05:00
Bhavesh Sharma	e261924030	fix(core): improve error message for missing title in JSON schema functions (#34683 ) Changes Created I have fixed the issue where a generic and misleading error message was displayed when a JSON schema was missing the top-level title key. [Fix: Improve error message for missing title in JSON schema functions](https://github.com/Bhavesh007Sharma/langchain/tree/fix-json-schema-title-error) File Modified: libs/core/langchain_core/utils/function_calling.py I updated the convert_to_openai_function validation logic to specifically check for dict inputs that look like schemas ( type or properties keys present) but are missing the title key. # Before (Generic Error) raise ValueError( f"Unsupported function\n\n{function}\n\nFunctions must be passed in" " as Dict, pydantic.BaseModel, or Callable. If they're a dict they must" " either be in OpenAI function format or valid JSON schema with top-level" " 'title' and 'description' keys." ) # After (Specific Error) if isinstance(function, dict) and ("type" in function or "properties" in function): msg = ( "Unsupported function\n\nTo use a JSON schema as a function, " "it must have a top-level 'title' key to be used as the function name." ) raise ValueError(msg) Verification Results Automated Tests I created a reproduction script reproduce_issue.py to confirm the behavior. Before Fix: The script would have raised the generic "Unsupported function" error claiming description was also required. After Fix: The script now confirms that the new, specific error message is raised when title is missing. (Note: Verification was performed by inspecting the code logic and running a lightweight reproduction script locally, as full suite verification had environment dependency issues.) --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2026-01-09 23:10:09 -05:00
Christophe Bornet	8e3c6b109f	style(core): fix some noqa escapes (#34675 ) Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2026-01-09 17:36:08 -05:00
Aman Gupta	2847814c70	feat(core): add more file extensions to ignore in HTML link extraction (#34552 ) # feat(core): add more file extensions to ignore in HTML link extraction ## Description This PR enhances the HTML link extraction utility in `libs/core/langchain_core/utils/html.py` by expanding the `SUFFIXES_TO_IGNORE` list to include additional common binary file extensions: - `.webp` - `.pdf` - `.docx` - `.xlsx` - `.pptx` - `.pptm` These file types are non-HTML, non-crawlable resources. Ignoring them prevents `find_all_links` and `extract_sub_links` from mistakenly treating such binary assets as navigable links. This improves link filtering, reduces unnecessary crawling, and aligns behavior with typical web scraping expectations. ## Summary of Changes - Updated `libs/core/langchain_core/utils/html.py`: Added `.webp`, `.pdf`, `.docx`, `.xlsx`, `.pptx`, `.pptm` to `SUFFIXES_TO_IGNORE`. ## Related Issues N/A ## Verification - `ruff check libs/core/langchain_core/utils/html.py`: Passed - `mypy libs/core/langchain_core/utils/html.py`: Passed - `pytest libs/core/tests/unit_tests/utils/test_html.py`: Passed (11 tests) --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2026-01-08 14:40:22 -05:00
Aman Gupta	50c5bb5607	refactor(core): improve docstrings for HTML link extraction utilities (#34550 ) # refactor(core): improve docstrings for HTML link extraction utilities ## Description This PR updates and clarifies the docstrings for `find_all_links` and `extract_sub_links` in `libs/core/langchain_core/utils/html.py`. The previous return-value descriptions were vague (e.g., "all links", "sub links"). They have now been revised to clearly describe the behavior and output of each function: - find_all_links → “A list of all links found in the HTML.” - extract_sub_links → “A list of absolute paths to sub links.” These improvements make the utilities more understandable and developer-friendly without altering functionality. ## Verification - `ruff check libs/core/langchain_core/utils/html.py`: Passed - `pytest libs/core/tests/unit_tests/utils/test_html.py`: Passed ## Checklists - PR title follows the required format: `TYPE(SCOPE): DESCRIPTION` - Changes are limited to the `langchain-core` package - `make format`, `make lint`, and `make test` pass	2026-01-08 10:21:17 -05:00
Mohammad Mohtashim	e6a9694f5d	fix(core): fix strict schema generation for functions with optional args (#34599 )	2026-01-07 15:13:18 -05:00
ゆり	be2c7f1aa8	test(core): add tests for formatting utils and merge functions (#34511 ) ## Summary Add comprehensive test coverage for previously untested utilities in `langchain-core`. ## Changes ### New file: `test_formatting.py` (18 tests) Tests for `StrictFormatter` class: - `test_vformat_with_keyword_args` - basic functionality - `test_vformat_with_multiple_keyword_args` - multiple placeholders - `test_vformat_with_empty_string` - edge case - `test_vformat_with_no_placeholders` - literal strings - `test_vformat_raises_on_positional_args` - error handling - `test_vformat_raises_on_multiple_positional_args` - error handling - `test_vformat_with_special_characters` - newlines, tabs - `test_vformat_with_unicode` - emoji, CJK characters - `test_vformat_with_format_spec` - format specifications - `test_vformat_with_nested_braces` - escaped braces Tests for `validate_input_variables`: - `test_validate_input_variables_success` - valid input - `test_validate_input_variables_with_extra_variables` - extra vars allowed - `test_validate_input_variables_with_missing_variable` - KeyError - `test_validate_input_variables_empty_format` - edge case - `test_validate_input_variables_no_placeholders` - edge case Tests for `formatter` singleton: - `test_formatter_is_strict_formatter` - type check - `test_formatter_format_works` - functionality - `test_formatter_rejects_positional_args` - error handling ### Extended `test_utils.py` (14 new tests) Tests for `merge_lists`: - Parametrized tests covering None handling, simple merge, empty lists, index-based merging - `test_merge_lists_multiple_others` - merging 3+ lists - `test_merge_lists_all_none` - all None inputs Tests for `merge_obj`: - Parametrized tests for None, strings, dicts, lists, equal values - `test_merge_obj_type_mismatch` - TypeError on type mismatch - `test_merge_obj_unmergeable_values` - ValueError on different values - `test_merge_obj_tuple_raises` - ValueError for tuples ## Test plan - [x] Tests follow existing patterns in the codebase - [x] All tests are unit tests (no network calls) - [x] Tests cover happy paths and error conditions - [x] Tests verify no mutation of input data ## AI Disclosure This contribution was developed with AI assistance (Claude Code). 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: yurekami <yurekami@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2026-01-05 14:20:11 -05:00
weiii668	5517ef37fb	docs(core): add docstrings to internal helper functions (#34525 ) Co-authored-by: weiii668 <your-email@example.com> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-30 21:58:00 -06:00
Christophe Bornet	a92c032ff6	style(core): fix mypy no-any-return violations (#34204 ) * FIxed where possible * Used `cast` when not possible to fix --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-26 21:35:27 -06:00
Rudra Tiwari	75e237643a	perf(core): move origin type map to module level in `function_calling.py` (#34481 ) Moves `_ORIGIN_MAP` dict from inside `_py_38_safe_origin()` to module level constant. This avoids dict allocation on every function call, reducing garbage collection pressure during frequent tool conversions. The function is called during typed dict to pydantic model conversion which happens during tool binding and invocation - a hot path in LangChain. Testing: `make lint` passes --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-12-25 21:29:31 -06:00
Christophe Bornet	1f403cf612	style(core): add ruff rules TC (#34476 ) * Fixed a few TC * Added a few Pydantic classes to `flake8-type-checking.runtime-evaluated-base-classes` (not as much as I would have imagined) * Added a few `noqa: TC` * Activated TC rules	2025-12-25 21:23:31 -06:00
Christophe Bornet	72f1d79022	chore(core): fix some ruff preview rules (#34425 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-19 14:33:42 -06:00
Christophe Bornet	8bca31f8c4	chore(core): fix some docstrings (#34426 )	2025-12-19 13:08:10 -05:00
Mason Daugherty	516d74b6df	fix(core): use `get_type_hints` for Python 3.14 `TypedDict` compatibility (#34390 ) Replace direct `__annotations__` access with `get_type_hints()` in `_convert_any_typed_dicts_to_pydantic` to handle [PEP 649](https://peps.python.org/pep-0649/) deferred annotations in Python 3.14: > [`Changed in version 3.14: Annotations are now lazily evaluated by default`](https://docs.python.org/3/reference/compound_stmts.html#annotations) Before: ```python class MyTool(TypedDict): name: str MyTool.__annotations__ # {'name': 'str'} - string, not type issubclass('str', ...) # TypeError: arg 1 must be a class ``` After: ```python get_type_hints(MyTool) # {'name': <class 'str'>} - actual type ``` Fixes #34291	2025-12-16 14:08:01 -05:00
Mason Daugherty	75d365418b	style(core): docs nit (#34312 )	2025-12-12 10:33:14 -05:00
Mason Daugherty	10377a7373	fix(core): widen openai tool/function conversion input type to `Mapping` (#34304 ) Motivated by changes to accept `TypedDict` tool types (e.g. in case of Anthropic/Claude built-in tools)	2025-12-11 16:33:53 -05:00
Christophe Bornet	bb71f53585	chore(core): use anext and deprecate py_anext (#34211 ) LangChain uses Python 3.10+ so `py_anext` isn't needed anymore. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-08 09:50:40 -05:00
William FH	1867521d1a	feat: Use uuid7 for run ids (#34172 ) Co-authored-by: Sydney Runkle <54324534+sydney-runkle@users.noreply.github.com> Co-authored-by: Sydney Runkle <sydneymarierunkle@gmail.com>	2025-12-03 10:09:10 -08:00
Mason Daugherty	12df938ace	docs(core): update docstrings in `RunnableConfig`, `dereference_refs` (#34131 )	2025-11-28 03:55:37 -05:00
Mason Daugherty	47b79c30c0	chore(docs): fix a few refs syntax errors (#34044 ) missing whitespace for some admonitions	2025-11-22 00:58:21 -05:00
Eugene Yurtsev	c4b6ba254e	fix(core): fix validation for input variables in f-string templates, restrict functionality supported by jinja2, mustache templates (#34035 ) * Fix validation for input variables in f-string templates * Restrict functionality of features supported by jinja2 and mustache templates	2025-11-19 16:09:46 -05:00
Christophe Bornet	2bfbc29ccc	chore(core): fix some ruff TC rules (#33929 ) fix some ruff TC rules but still don't enforce them as Pydantic model fields use type annotations at runtime.	2025-11-12 14:07:19 -05:00
Lê Nam Khánh	88246f45b3	docs: fix typos in libs/core/langchain_core/utils/function_calling.py (#33873 )	2025-11-07 10:34:28 -05:00
Mason Daugherty	d40e340479	chore: attribute package change versions (#33854 ) Needed to disambiguate for within inherited docs	2025-11-06 16:57:30 -05:00
Christophe Bornet	915c446c48	chore(core): add ruff rule `PLR2004` (#33706 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-11-04 13:33:37 -05:00
Michael Li	6617865440	fix(core): add no colors check (#33780 ) Patch edge case in get_color_mapping	2025-11-03 13:23:23 -05:00
Mason Daugherty	f94108b4bc	fix: links (#33691 ) * X-ref to new docs * Formatting updates	2025-10-27 19:04:29 -04:00
Arun Prasad	86ac39e11f	refactor(core): Minor refactor for code readability (#33674 )	2025-10-27 11:39:36 -04:00
ccurme	f1742954ab	fix(core): make handling of schemas more defensive (#33660 )	2025-10-24 11:10:06 -04:00
Yu Zhong	df46c82ae2	feat(core): automatic set required to include all properties in strict mode (#32930 )	2025-10-22 11:31:08 -04:00
Mason Daugherty	a47386f6dc	style: more refs polishing (#33601 )	2025-10-20 00:52:52 -04:00
Mason Daugherty	26e0a00c4c	style: more work for refs (#33508 ) Largely: - Remove explicit `"Default is x"` since new refs show default inferred from sig - Inline code (useful for eventual parsing) - Fix code block rendering (indentations)	2025-10-15 18:46:55 -04:00
Mason Daugherty	68ceeb64f6	chore(core): delete `function_calling.py` utils marked for removal (#33376 )	2025-10-14 16:13:19 -04:00

1 2 3 4 5

236 Commits