langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-10-28 06:10:30 +00:00

Author	SHA1	Message	Date
Christophe Bornet	cc98fb9bee	chore(core): add ruff rule PLC0415 (#32351 ) See https://docs.astral.sh/ruff/rules/import-outside-top-level/ Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-08 14:15:04 -04:00
Christophe Bornet	16420cad71	chore(core): fix some pydocs to use google-style (#32764 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-08 17:52:17 +00:00
Christophe Bornet	01fdeede50	chore(core): fix some ruff preview rules (#32785 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-08 15:55:20 +00:00
Christophe Bornet	f4e83e0ad8	chore(core): fix some docstrings (from DOC preview rule) (#32833 ) * Add `Raises` sections * Add `Returns` sections * Add `Yields` sections --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-08 15:44:15 +00:00
Christophe Bornet	5840dad40b	chore(core): enable ruff docstring-code-format (#32834 ) See https://docs.astral.sh/ruff/settings/#format_docstring-code-format --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-08 15:13:50 +00:00
Christophe Bornet	e3b6c9bb66	chore(core): fix some mypy `warn_unreachable` issues (#32560 ) Found by setting `warn_unreachable: true` in mypy. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-09-08 15:02:08 +00:00
Mason Daugherty	a0331285d7	fix(core): Support no-args tools by defaulting args to empty dict (#32530 ) Supersedes #32408 Description: This PR ensures that tool calls without explicitly provided `args` will default to an empty dictionary (`{}`), allowing tools with no parameters (e.g. `def foo() -> str`) to be registered and invoked without validation errors. This change improves compatibility with agent frameworks that may omit the `args` field when generating tool calls. Issue: See [langgraph#5722](https://github.com/langchain-ai/langgraph/issues/5722) – LangGraph currently emits tool calls without `args`, which leads to validation errors when tools with no parameters are invoked. This PR ensures compatibility by defaulting `args` to `{}` when missing. Dependencies: None --------- Thank you for contributing to LangChain! Follow these steps to mark your pull request as ready for review. If any of these steps are not completed, your PR will not be considered for review. - [ ] PR title: Follows the format: {TYPE}({SCOPE}): {DESCRIPTION} - Examples: - feat(core): add multi-tenant support - fix(cli): resolve flag parsing error - docs(openai): update API usage examples - Allowed `{TYPE}` values: - feat, fix, docs, style, refactor, perf, test, build, ci, chore, revert, release - Allowed `{SCOPE}` values (optional): - core, cli, langchain, standard-tests, docs, anthropic, chroma, deepseek, exa, fireworks, groq, huggingface, mistralai, nomic, ollama, openai, perplexity, prompty, qdrant, xai - Note: the `{DESCRIPTION}` must not start with an uppercase letter. - Once you've written the title, please delete this checklist item; do not include it in the PR. - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change. Include a [closing keyword](https://docs.github.com/en/issues/tracking-your-work-with-issues/using-issues/linking-a-pull-request-to-an-issue#linking-a-pull-request-to-an-issue-using-a-keyword) if applicable to a relevant issue. - Issue: the issue # it fixes, if applicable (e.g. Fixes #123) - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, you must include: 1. A test for the integration, preferably unit tests that do not rely on network access, 2. An example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. We will not consider a PR unless these three are passing in CI. See [contribution guidelines](https://python.langchain.com/docs/contributing/) for more. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to `pyproject.toml` files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. --------- Signed-off-by: jitokim <pigberger70@gmail.com> Co-authored-by: jito <pigberger70@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-08-14 20:28:36 +00:00
Mason Daugherty	ee4c2510eb	feat: port various nit changes from `wip-v0.4` (#32506 ) Lots of work that wasn't directly related to core improvements/messages/testing functionality	2025-08-11 15:09:08 -04:00
Mason Daugherty	c31236264e	chore: formatting across codebase (#32466 )	2025-08-08 10:20:10 -04:00
Mason Daugherty	96cbd90cba	fix: formatting issues in docstrings (#32265 ) Ensures proper reStructuredText formatting by adding the required blank line before closing docstring quotes, which resolves the "Block quote ends without a blank line; unexpected unindent" warning.	2025-07-27 23:37:47 -04:00
niceg	0d6f915442	fix: LLM mimicking Unicode responses due to forced Unicode conversion of non-ASCII characters. (#32222 ) fix: Fix LLM mimicking Unicode responses due to forced Unicode conversion of non-ASCII characters. - Description: This PR fixes an issue where the LLM would mimic Unicode responses due to forced Unicode conversion of non-ASCII characters in tool calls. The fix involves disabling the `ensure_ascii` flag in `json.dumps()` when converting tool calls to OpenAI format. - Issue: Fixes ↓↓↓ input： ```json {'role': 'assistant', 'tool_calls': [{'type': 'function', 'id': 'call_nv9trcehdpihr21zj9po19vq', 'function': {'name': 'create_customer', 'arguments': '{"customer_name": "你好啊集团"}'}}]} ``` output: ```json {'role': 'assistant', 'tool_calls': [{'type': 'function', 'id': 'call_nv9trcehdpihr21zj9po19vq', 'function': {'name': 'create_customer', 'arguments': '{"customer_name": "\\u4f60\\u597d\\u554a\\u96c6\\u56e2"}'}}]} ``` then: llm will mimic outputting unicode. Unicode's vast number of symbols can lengthen LLM responses, leading to slower performance. <img width="686" height="277" alt="image" src="https://github.com/user-attachments/assets/28f3b007-3964-4455-bee2-68f86ac1906d" /> --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-24 17:01:31 -04:00
Christophe Bornet	03e8327e01	core: Ruff preview fixes (#31877 ) Auto-fixes from `uv run ruff check --fix --unsafe-fixes --preview` --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-07 13:02:40 -04:00
Christophe Bornet	8aed3b61a9	core: Bump ruff version to 0.12 (#31846 )	2025-07-07 10:02:51 -04:00
Mason Daugherty	6d71b6b6ee	standard-tests: refactoring and fixes (#31703 ) - `libs/core/langchain_core/messages/base.py`: add model name to examples [per docs](https://python.langchain.com/api_reference/standard_tests/integration_tests/langchain_tests.integration_tests.chat_models.ChatModelIntegrationTests.html#langchain_tests.integration_tests.chat_models.ChatModelIntegrationTests.test_usage_metadata) ("0.3.17: Additionally check for the presence of model_name in the response metadata, which is needed for usage tracking in callback handlers") - `libs/core/langchain_core/utils/function_calling.py`: correct typo - `libs/standard-tests/langchain_tests/integration_tests/chat_models.py`: - `magic_function(input)` -> `magic_function(_input)` to prevent warning about redefining built in `input` - relocate a few tests for better grouping and narrative flow - suppress some type hint warnings following suit from similar tests - fix a few more typos - validate not only that `model_name` is defined, but that it is not empty (test_usage_metadata)	2025-06-23 23:22:31 +00:00
ccurme	ee83993b91	docs: document Anthropic cache TTL count details (#31708 )	2025-06-23 20:16:42 +00:00
Mikhail	6105a5841b	core: fix `get_buffer_string` output for structured message content (#31600 )	2025-06-20 23:21:50 +00:00
Christophe Bornet	c982573f1e	core: Add ruff rules A (builtins shadowing) (#29312 ) See https://docs.astral.sh/ruff/rules/#flake8-builtins-a * Renamed vars where possible * Added `noqa` where backward compatibility was needed * Added `@override` when applicable	2025-05-16 15:19:37 -04:00
Christophe Bornet	a8f2ddee31	core: Add ruff rules RUF (#29353 ) See https://docs.astral.sh/ruff/rules/#ruff-specific-rules-ruf Mostly: * [RUF022](https://docs.astral.sh/ruff/rules/unsorted-dunder-all/) (unsorted `__all__`) * [RUF100](https://docs.astral.sh/ruff/rules/unused-noqa/) (unused noqa) * [RUF021](https://docs.astral.sh/ruff/rules/parenthesize-chained-operators/) (parenthesize-chained-operators) * [RUF015](https://docs.astral.sh/ruff/rules/unnecessary-iterable-allocation-for-first-element/) (unnecessary-iterable-allocation-for-first-element) * [RUF005](https://docs.astral.sh/ruff/rules/collection-literal-concatenation/) (collection-literal-concatenation) * [RUF046](https://docs.astral.sh/ruff/rules/unnecessary-cast-to-int/) (unnecessary-cast-to-int) --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-05-15 15:43:57 -04:00
Sydney Runkle	7263011b24	perf[core]: remove unnecessary model validators (#31238 ) * Remove unnecessary cast of id -> str (can do with a field setting) * Remove unnecessary `set_text` model validator (can be done with a computed field - though we had to make some changes to the `Generation` class to make this possible Before: ~2.4s Blue circles represent time spent in custom validators :( <img width="1337" alt="Screenshot 2025-05-14 at 10 10 12 AM" src="https://github.com/user-attachments/assets/bb4f477f-4ee3-4870-ae93-14ca7f197d55" /> After: ~2.2s <img width="1344" alt="Screenshot 2025-05-14 at 10 11 03 AM" src="https://github.com/user-attachments/assets/99f97d80-49de-462f-856f-9e7e8662adbc" /> We still want to optimize the backwards compatible tool calls model validator, though I think this might involve breaking changes, so wanted to separate that into a different PR. This is circled in green.	2025-05-14 10:20:22 -07:00
Jacob Lee	66d1ed6099	fix(core): Permit OpenAI style blocks to be passed into convert_to_openai_messages (#31140 ) Should effectively be a noop, just shouldn't throw CC @madams0013 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-05-07 10:57:37 -04:00
ccurme	26ad239669	core, openai[patch]: prefer provider-assigned IDs when aggregating message chunks (#31080 ) When aggregating AIMessageChunks in a stream, core prefers the leftmost non-null ID. This is problematic because: - Core assigns IDs when they are null to `f"run-{run_manager.run_id}"` - The desired meaningful ID might not be available until midway through the stream, as is the case for the OpenAI Responses API. For the OpenAI Responses API, we assign message IDs to the top-level `AIMessage.id`. This works in `.(a)invoke`, but during `.(a)stream` the IDs get overwritten by the defaults assigned in langchain-core. These IDs [must](https://community.openai.com/t/how-to-solve-badrequesterror-400-item-rs-of-type-reasoning-was-provided-without-its-required-following-item-error-in-responses-api/1151686/9) be available on the AIMessage object to support passing reasoning items back to the API (e.g., if not using OpenAI's `previous_response_id` feature). We could add them elsewhere, but seeing as we've already made the decision to store them in `.id` during `.(a)invoke`, addressing the issue in core lets us fix the problem with no interface changes.	2025-05-02 11:18:18 -04:00
Jacob Lee	6b0b317cb5	feat(core): Autogenerate filenames for when converting file content blocks to OpenAI format (#30984 ) CC @ccurme --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-24 13:36:31 +00:00
ccurme	4bc70766b5	core, openai: support standard multi-modal blocks in convert_to_openai_messages (#30968 )	2025-04-23 11:20:44 -04:00
Sydney Runkle	75e50a3efd	core[patch]: Raise `AttributeError` (instead of `ModuleNotFoundError`) in custom `__getattr__` (#30905 ) Follow up to https://github.com/langchain-ai/langchain/pull/30769, fixing the regression reported [here](https://github.com/langchain-ai/langchain/pull/30769#issuecomment-2807483610), thanks @krassowski for the report! Fix inspired by https://github.com/PrefectHQ/prefect/pull/16172/files Other changes: * Using tuples for `__all__`, except in `output_parsers` bc of a list namespace conflict * Using a helper function for imports due to repeated logic across `__init__.py` files becoming hard to maintain. Co-authored-by: Michał Krassowski < krassowski 5832902+krassowski@users.noreply.github.com>"	2025-04-17 14:15:28 -04:00
ccurme	86d51f6be6	multiple: permit optional fields on multimodal content blocks (#30887 ) Instead of stuffing provider-specific fields in `metadata`, they can go directly on the content block.	2025-04-17 12:48:46 +00:00
Sydney Runkle	88fce67724	core: Removing unnecessary `pydantic` core schema rebuilds (#30848 ) We only need to rebuild model schemas if type annotation information isn't available during declaration - that shouldn't be the case for these types corrected here. Need to do more thorough testing to make sure these structures have complete schemas, but hopefully this boosts startup / import time.	2025-04-16 12:00:08 -04:00
ccurme	9cfe6bcacd	multiple: multi-modal content blocks (#30746 ) Introduces standard content block format for images, audio, and files. ## Examples Image from url: ``` { "type": "image", "source_type": "url", "url": "https://path.to.image.png", } ``` Image, in-line data: ``` { "type": "image", "source_type": "base64", "data": "<base64 string>", "mime_type": "image/png", } ``` PDF, in-line data: ``` { "type": "file", "source_type": "base64", "data": "<base64 string>", "mime_type": "application/pdf", } ``` File from ID: ``` { "type": "file", "source_type": "id", "id": "file-abc123", } ``` Plain-text file: ``` { "type": "file", "source_type": "text", "text": "foo bar", } ```	2025-04-15 09:48:06 -04:00
Sydney Runkle	edb6a23aea	core[lint]: fix issue with unused ignore in `__init__.py` files (#30825 ) Fixing a race condition between https://github.com/langchain-ai/langchain/pull/30769 and https://github.com/langchain-ai/langchain/pull/30737	2025-04-14 17:57:00 +00:00
Sydney Runkle	4f69094b51	core[performance]: use custom `__getattr__` in `__init__.py` files for lazy imports (#30769 ) Most easily reviewed with the "hide whitespace" option toggled. Seeing 10-50% speed ups in import time for common structures 🚀 The general purpose of this PR is to lazily import structures within `langchain_core.XXX_module.__init__.py` so that we're not eagerly importing expensive dependencies (`pydantic`, `requests`, etc). Analysis of flamegraphs generated with `importtime` motivated these changes. For example, the one below demonstrates that importing `HumanMessage` accidentally triggered imports for `importlib.metadata`, `requests`, etc. There's still much more to do on this front, and we can start digging into our own internal code for optimizations now that we're less concerned about external imports. <img width="1210" alt="Screenshot 2025-04-11 at 1 10 54 PM" src="https://github.com/user-attachments/assets/112a3fe7-24a9-4294-92c1-d5ae64df839e" /> I've tracked the improvements with some local benchmarks: ## `pytest-benchmark` results \| Name \| Before (s) \| After (s) \| Delta (s) \| % Change \| \|-----------------------------\|------------\|-----------\|-----------\|----------\| \| Document \| 2.8683 \| 1.2775 \| -1.5908 \| -55.46% \| \| HumanMessage \| 2.2358 \| 1.1673 \| -1.0685 \| -47.79% \| \| ChatPromptTemplate \| 5.5235 \| 2.9709 \| -2.5526 \| -46.22% \| \| Runnable \| 2.9423 \| 1.7793 \| -1.163 \| -39.53% \| \| InMemoryVectorStore \| 3.1180 \| 1.8417 \| -1.2763 \| -40.93% \| \| RunnableLambda \| 2.7385 \| 1.8745 \| -0.864 \| -31.55% \| \| tool \| 5.1231 \| 4.0771 \| -1.046 \| -20.42% \| \| CallbackManager \| 4.2263 \| 3.4099 \| -0.8164 \| -19.32% \| \| LangChainTracer \| 3.8394 \| 3.3101 \| -0.5293 \| -13.79% \| \| BaseChatModel \| 4.3317 \| 3.8806 \| -0.4511 \| -10.41% \| \| PydanticOutputParser \| 3.2036 \| 3.2995 \| 0.0959 \| 2.99% \| \| InMemoryRateLimiter \| 0.5311 \| 0.5995 \| 0.0684 \| 12.88% \| Note the lack of change for `InMemoryRateLimiter` and `PydanticOutputParser` is just random noise, I'm getting comparable numbers locally. ## Local CodSpeed results We're still working on configuring CodSpeed on CI. The local usage produced similar results.	2025-04-14 08:57:54 -04:00
Christophe Bornet	42944f3499	core: Improve mypy config (#30737 ) * Cleanup mypy config * Add mypy `strict` rules except `disallow_any_generics`, `warn_return_any` and `strict_equality` (TODO) * Add mypy `strict_byte` rule * Add mypy support for PEP702 `@deprecated` decorator * Bump mypy version to 1.15 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-04-11 16:35:13 -04:00
Christophe Bornet	913c896598	core: Add ruff rules FBT001 and FBT002 (#30695 ) Add ruff rules [FBT001](https://docs.astral.sh/ruff/rules/boolean-type-hint-positional-argument/) and [FBT002](https://docs.astral.sh/ruff/rules/boolean-default-value-positional-argument/). Mostly `noqa`s to not introduce breaking changes and possible non-breaking fixes have already been done in a [previous PR](https://github.com/langchain-ai/langchain/pull/29424). These rules will prevent new violations to happen.	2025-04-11 16:26:33 -04:00
Sydney Runkle	fdc2b4bcac	core[lint]: Use 3.9 formatting for docs and tests (#30780 ) Looks like `pyupgrade` was already used here but missed some docs and tests. This helps to keep our docs looking professional and up to date. Eventually, we should lint / format our inline docs.	2025-04-11 10:39:25 -04:00
Christophe Bornet	dc19d42d37	core: Specify code when ignoring type issue (ruff PGH003) (#30675 ) See https://docs.astral.sh/ruff/rules/blanket-type-ignore/	2025-04-10 22:23:52 -04:00
Christophe Bornet	4cc7bc6c93	core: Add ruff rules PLR (#30696 ) Add ruff rules [PLR](https://docs.astral.sh/ruff/rules/#refactor-plr) Except PLR09xxx and PLR2004. Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-04-09 15:15:38 -04:00
Christophe Bornet	f241fd5c11	core: Add ruff rules RET (#29384 ) See https://docs.astral.sh/ruff/rules/#flake8-return-ret All auto-fixes	2025-04-02 16:59:56 -04:00
Christophe Bornet	ccc3d32ec8	core: Add ruff rules for Pylint PLC (Convention) and PLE (Errors) (#29286 ) See https://docs.astral.sh/ruff/rules/#pylint-pl	2025-04-02 10:58:03 -04:00
Christophe Bornet	4f8ea13cea	core: Add ruff rules PERF (#29375 ) See https://docs.astral.sh/ruff/rules/#perflint-perf	2025-04-01 13:34:56 -04:00
Christophe Bornet	768e4f695a	core: Add ruff rules S110 and S112 (#30599 )	2025-04-01 13:17:22 -04:00
Christophe Bornet	88b4233fa1	core: Add ruff rules D (docstring) (#29406 ) This ensures that the code is properly documented: https://docs.astral.sh/ruff/rules/#pydocstyle-d Related to #21983	2025-04-01 13:15:45 -04:00
Christophe Bornet	e181d43214	core: Bump ruff version to 0.11 (#30519 ) Changes are from the new TC006 rule: https://docs.astral.sh/ruff/rules/runtime-cast-value/ TC006 is auto-fixed.	2025-03-27 13:01:49 -04:00
Christophe Bornet	b28a474e79	core[patch]: Add ruff rules for PLW (Pylint Warnings) (#29288 ) See https://docs.astral.sh/ruff/rules/#warning-w_1 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-03-27 10:26:12 +00:00
Vadym Barda	97dec30eea	docs[patch]: update trim_messages doc (#30462 )	2025-03-24 18:50:48 +00:00
Adrián Panella	b75573e858	core: add tool_call exclusion in filter_message (#30289 ) Extend functionallity to allow to filter pairs of tool calls (ai + tool). --------- Co-authored-by: vbarda <vadym@langchain.dev>	2025-03-21 23:05:29 +00:00
Vadym Barda	673ec00030	docs[patch]: add warning to token counter docstring (#30426 )	2025-03-21 18:59:40 -04:00
Vadym Barda	07823cd41c	core[patch]: optimize trim_messages (#30327 ) Refactored w/ Claude Up to 20x speedup! (with theoretical max improvement of `O(n / log n)`)	2025-03-21 17:08:26 -04:00
Vadym Barda	37190881d3	core[patch]: add util for approximate token counting (#30373 )	2025-03-19 17:48:38 +00:00
ccurme	cd1ea8e94d	openai[patch]: support Responses API (#30231 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2025-03-12 12:25:46 -04:00
ccurme	52b0570bec	core, openai, standard-tests: improve OpenAI compatibility with Anthropic content blocks (#30128 ) - Support thinking blocks in core's `convert_to_openai_messages` (pass through instead of error) - Ignore thinking blocks in ChatOpenAI (instead of error) - Support Anthropic-style image blocks in ChatOpenAI --- Standard integration tests include a `supports_anthropic_inputs` property which is currently enabled only for tests on `ChatAnthropic`. This test enforces compatibility with message histories of the form: ``` - system message - human message - AI message with tool calls specified only through `tool_use` content blocks - human message containing `tool_result` and an additional `text` block ``` It additionally checks support for Anthropic-style image inputs if `supports_image_inputs` is enabled. Here we change this test, such that if you enable `supports_anthropic_inputs`: - You support AI messages with text and `tool_use` content blocks - You support Anthropic-style image inputs (if `supports_image_inputs` is enabled) - You support thinking content blocks. That is, we add a test case for thinking content blocks, but we also remove the requirement of handling tool results within HumanMessages (motivated by existing agent abstractions, which should all return ToolMessage). We move that requirement to a ChatAnthropic-specific test.	2025-03-06 09:53:14 -05:00
ZhangShenao	8575d7491f	[Doc] Improve api doc (#30073 ) - Update api_doc for `BaseMessage` - add static method decorator for `retry_runnable`	2025-03-04 09:39:07 -05:00
Christophe Bornet	b3885c124f	core: Add ruff rules TC (#29268 ) See https://docs.astral.sh/ruff/rules/#flake8-type-checking-tc Some fixes done for TC001,TC002 and TC003 but these rules are excluded since they don't play well with Pydantic. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-26 19:39:05 +00:00

1 2 3

143 Commits