langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-06-21 14:18:52 +00:00

Author	SHA1	Message	Date
Mikhail	6105a5841b	core: fix `get_buffer_string` output for structured message content (#31600 )	2025-06-20 23:21:50 +00:00
Bagatur	5271fd76f1	core[patch]: check before removing tags (#31691 )	2025-06-20 17:46:50 -04:00
ccurme	39a8a1121a	core: release 0.3.66 (#31690 )	2025-06-20 17:45:03 -04:00
Mohammad Mohtashim	7ff405077d	core[patch]: Returning always 2D Array for _cosine_similarity (#31528 ) - Description: Very simple change in `_cosine_similarity` which always 2D array. - Issue: #31497	2025-06-20 11:25:02 -04:00
Eugene Yurtsev	2842e0c8c1	core[patch]: Add doc-strings to tools/base.py (#31684 ) Add doc-strings	2025-06-20 11:16:57 -04:00
Christophe Bornet	7e046ea848	core: Cleanup Pydantic models and handle deprecation warnings (#30799 ) * Simplified Pydantic handling since Pydantic v1 is not supported anymore. * Replace use of deprecated v1 methods by corresponding v2 methods. * Remove use of other deprecated methods. * Activate mypy errors on deprecated methods use. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-06-20 10:42:52 -04:00
Nuno Campos	ddc850ca72	core: In LangChainTracer, send only the first token event (#31591 ) - only the first one is used for analytics	2025-06-12 14:04:23 -07:00
ccurme	b0f100af7e	core: release 0.3.65 (#31557 )	2025-06-10 19:39:50 +00:00
Sydney Runkle	5b165effcd	core(fix): revert `set_text` optimization (#31555 ) Revert serialization regression introduced in https://github.com/langchain-ai/langchain/pull/31238 Fixes https://github.com/langchain-ai/langchain/issues/31486	2025-06-10 13:36:55 -04:00
lc-arjun	35ae5eab4f	core: use run tree post/patch (#31500 ) Use run post/patch	2025-06-05 14:05:57 -07:00
Mohammad Mohtashim	ae3551c96b	core[patch]: Correct type casting of annotations in _infer_arg_descriptions (#31181 ) - Description: - In _infer_arg_descriptions, the annotations dictionary contains string representations of types instead of actual typing objects. This causes _is_annotated_type to fail, preventing the correct description from being generated. - This is a simple fix using the get_type_hints method, which resolves the annotations properly and is supported across all Python versions. - Issue: #31051 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-06-05 11:58:36 -04:00
ccurme	741bb1ffa1	core[patch]: revert change to stream type hint (#31501 ) https://github.com/langchain-ai/langchain/pull/31286 included an update to the return type for `BaseChatModel.(a)stream`, from `Iterator[BaseMessageChunk]` to `Iterator[BaseMessage]`. This change is correct, because when streaming is disabled, the stream methods return an iterator of `BaseMessage`, and the inheritance is such that an `BaseMessage` is not a `BaseMessageChunk` (but the reverse is true). However, LangChain includes a pattern throughout its docs of [summing BaseMessageChunks](https://python.langchain.com/docs/how_to/streaming/#llms-and-chat-models) to accumulate a chat model stream. This pattern is implemented in tests for most integration packages and appears in application code. So https://github.com/langchain-ai/langchain/pull/31286 introduces mypy errors throughout the ecosystem (or maybe more accurately, it reveals that this pattern does not account for use of the `.stream` method when streaming is disabled). Here we revert just the change to the stream return type to unblock things. A fix for this should address docs + integration packages (or if we elect to just force people to update code, be explicit about that).	2025-06-05 11:20:06 -04:00
Christophe Bornet	539e5b6936	core: Add mypy strict-equality rule (#31286 )	2025-06-02 18:24:35 +00:00
Sam Zhang	2c4e0ab3bc	fix: module 'defusedxml' has no attribute 'ElementTree' (#31429 ) (#31431 ) Co-authored-by: Eugene Yurtsev <eugene@langchain.dev> Co-authored-by: Christophe Bornet <cbornet@hotmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-06-02 18:09:22 +00:00
Eugene Yurtsev	19f2a92609	core: release 0.3.63 (#31419 ) Release core 0.3.63 Small update just to expand the list of well known tools. This is necessary while the logic lives in langchain-core. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-05-29 14:48:18 -04:00
Eugene Yurtsev	e6633a7efb	langchain-core: Add image_generation tool to list of known openai tools (#31396 ) Add image generation tool to the list of well known tools. This is needed for changes in the ChatOpenAI client. TODO: Some of this logic needs to be moved from core directly into the client as changes in core should not be required to add a new tool to the openai chat client.	2025-05-29 13:13:21 -04:00
ccurme	930aa6073e	core: release 0.3.62 (#31376 )	2025-05-27 16:52:09 +00:00
ccurme	580986b260	anthropic: support for code execution, MCP connector, files API features (#31340 ) Support for the new [batch of beta features](https://www.anthropic.com/news/agent-capabilities-api) released yesterday: - [Code execution](https://docs.anthropic.com/en/docs/agents-and-tools/tool-use/code-execution-tool) - [MCP connector](https://docs.anthropic.com/en/docs/agents-and-tools/mcp-connector) - [Files API](https://docs.anthropic.com/en/docs/build-with-claude/files) Also verified support for [prompt cache TTL](https://docs.anthropic.com/en/docs/build-with-claude/prompt-caching#1-hour-cache-duration-beta).	2025-05-27 12:45:45 -04:00
ccurme	71c074d28f	core: release 0.3.61 (#31317 )	2025-05-22 11:54:28 -04:00
ccurme	053a1246da	openai[patch]: support built-in code interpreter and remote MCP tools (#31304 )	2025-05-22 11:47:57 -04:00
Christophe Bornet	17c5a1621f	core: Improve Runnable `__or__` method typing annotations (#31273 ) * It is possible to chain a `Runnable` with an `AsyncIterator` as seen in `test_runnable.py`. * Iterator and AsyncIterator Input/Output of Callables must be put before `Callable[[Other], Any]` otherwise the pattern matching picks the latter.	2025-05-19 09:32:31 -04:00
OysterMax	eb25d7472d	core: support `Union` type args in strict mode of OpenAI function calling / structured output (#30971 ) Issue:[ #309070](https://github.com/langchain-ai/langchain/issues/30970) Cause Arg type in python code ``` arg: Union[SubSchema1, SubSchema2] ``` is translated to `anyOf` in json schema ``` "anyOf" : [{sub schema 1 ...}, {sub schema 1 ...}] ``` The value of anyOf is a list sub schemas. The bug is caused since the sub schemas inside `anyOf` list is not taken care of. The location where the issue happens is `convert_to_openai_function` function -> `_recursive_set_additional_properties_false` function, that recursively adds `"additionalProperties": false` to json schema which is [required by OpenAI's strict function calling](https://platform.openai.com/docs/guides/structured-outputs?api-mode=responses#additionalproperties-false-must-always-be-set-in-objects) Solution: This PR fixes this issue by iterating each sub schema inside `anyOf` list. A unit test is added. Twitter handle: shengboma If no one reviews your PR within a few days, please @-mention one of baskaryan, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-05-16 16:20:32 -04:00
Christophe Bornet	c982573f1e	core: Add ruff rules A (builtins shadowing) (#29312 ) See https://docs.astral.sh/ruff/rules/#flake8-builtins-a * Renamed vars where possible * Added `noqa` where backward compatibility was needed * Added `@override` when applicable	2025-05-16 15:19:37 -04:00
Shkarupa Alex	671e4fd114	langchain[patch]: Allow async indexing code to work for vectorstores that only defined sync delete (#30869 ) `aindex` function should check not only `adelete` method, but `delete` method too PR title: "core: fix async indexing issue with adelete/delete checking" PR message: Currently `langchain.indexes.aindex` checks if vector store has overrided adelete method. But due to `adelete` default implementation store can have just `delete` overrided to make `adelete` working. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-05-16 15:10:25 -04:00
Christophe Bornet	a8f2ddee31	core: Add ruff rules RUF (#29353 ) See https://docs.astral.sh/ruff/rules/#ruff-specific-rules-ruf Mostly: * [RUF022](https://docs.astral.sh/ruff/rules/unsorted-dunder-all/) (unsorted `__all__`) * [RUF100](https://docs.astral.sh/ruff/rules/unused-noqa/) (unused noqa) * [RUF021](https://docs.astral.sh/ruff/rules/parenthesize-chained-operators/) (parenthesize-chained-operators) * [RUF015](https://docs.astral.sh/ruff/rules/unnecessary-iterable-allocation-for-first-element/) (unnecessary-iterable-allocation-for-first-element) * [RUF005](https://docs.astral.sh/ruff/rules/collection-literal-concatenation/) (collection-literal-concatenation) * [RUF046](https://docs.astral.sh/ruff/rules/unnecessary-cast-to-int/) (unnecessary-cast-to-int) --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-05-15 15:43:57 -04:00
ccurme	672339f3c6	core: release 0.3.60 (#31249 )	2025-05-15 11:14:04 -04:00
Christophe Bornet	921573e2b7	core: Add ruff rules SLF (#30666 ) Add ruff rules SLF: https://docs.astral.sh/ruff/rules/#flake8-self-slf --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-05-14 18:42:39 +00:00
Sydney Runkle	7263011b24	perf[core]: remove unnecessary model validators (#31238 ) * Remove unnecessary cast of id -> str (can do with a field setting) * Remove unnecessary `set_text` model validator (can be done with a computed field - though we had to make some changes to the `Generation` class to make this possible Before: ~2.4s Blue circles represent time spent in custom validators :( <img width="1337" alt="Screenshot 2025-05-14 at 10 10 12 AM" src="https://github.com/user-attachments/assets/bb4f477f-4ee3-4870-ae93-14ca7f197d55" /> After: ~2.2s <img width="1344" alt="Screenshot 2025-05-14 at 10 11 03 AM" src="https://github.com/user-attachments/assets/99f97d80-49de-462f-856f-9e7e8662adbc" /> We still want to optimize the backwards compatible tool calls model validator, though I think this might involve breaking changes, so wanted to separate that into a different PR. This is circled in green.	2025-05-14 10:20:22 -07:00
Lope Ramos	b8ae2de169	langchain-core[patch]: `Incremental` record manager deletion should be batched (#31206 ) Description: Before this commit, if one record is batched in more than 32k rows for sqlite3 >= 3.32 or more than 999 rows for sqlite3 < 3.31, the `record_manager.delete_keys()` will fail, as we are creating a query with too many variables. This commit ensures that we are batching the delete operation leveraging the `cleanup_batch_size` as it is already done for `full` cleanup. Added unit tests for incremental mode as well on different deleting batch size.	2025-05-14 11:38:21 -04:00
Sydney Runkle	263c215112	perf[core]: remove generations summation from hot loop (#31231 ) 1. Removes summation of `ChatGenerationChunk` from hot loops in `stream` and `astream` 2. Removes run id gen from loop as well (minor impact) Again, benchmarking on processing ~200k chunks (a poem about broccoli). Before: ~4.2s Blue circle is all the time spent adding up gen chunks <img width="1345" alt="Screenshot 2025-05-14 at 7 48 33 AM" src="https://github.com/user-attachments/assets/08a59d78-134d-4cd3-9d54-214de689df51" /> After: ~2.3s Blue circle is remaining time spent on adding chunks, which can be minimized in a future PR by optimizing the `merge_content`, `merge_dicts`, and `merge_lists` utilities. <img width="1353" alt="Screenshot 2025-05-14 at 7 50 08 AM" src="https://github.com/user-attachments/assets/df6b3506-929e-4b6d-b198-7c4e992c6d34" />	2025-05-14 08:13:05 -07:00
Sydney Runkle	17b799860f	perf[core]: remove costly async helpers for non-end event handlers (#31230 ) 1. Remove `shielded` decorator from non-end event handlers 2. Exit early with a `self.handlers` check instead of doing unnecessary asyncio work Using a benchmark that processes ~200k chunks (a poem about broccoli). Before: ~15s Circled in blue is unnecessary event handling time. This is addressed by point 2 above <img width="1347" alt="Screenshot 2025-05-14 at 7 37 53 AM" src="https://github.com/user-attachments/assets/675e0fed-8f37-46c0-90b3-bef3cb9a1e86" /> After: ~4.2s The total time is largely reduced by the removal of the `shielded` decorator, which holds little significance for non-end handlers. <img width="1348" alt="Screenshot 2025-05-14 at 7 37 22 AM" src="https://github.com/user-attachments/assets/54be8a3e-5827-4136-a87b-54b0d40fe331" />	2025-05-14 07:42:56 -07:00
Christophe Bornet	83d006190d	core: Fix some private member accesses (#30912 ) See https://github.com/langchain-ai/langchain/pull/30666 --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2025-05-12 17:42:26 +00:00
CtrlMj	1e56c66f86	core: Fix issue 31035 alias fields in base tool langchain core (#31112 ) Description: The 'inspect' package in python skips over the aliases set in the schema of a pydantic model. This is a workound to include the aliases from the original input. issue: #31035 Cc: @ccurme @eyurtsev --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-05-12 11:04:13 -04:00
ccurme	f70b263ff3	core: release 0.3.59 (#31150 )	2025-05-07 17:36:59 +00:00
Jacob Lee	66d1ed6099	fix(core): Permit OpenAI style blocks to be passed into convert_to_openai_messages (#31140 ) Should effectively be a noop, just shouldn't throw CC @madams0013 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-05-07 10:57:37 -04:00
ccurme	ff41f47e91	core: release 0.3.58 (#31099 )	2025-05-02 12:46:32 -04:00
ccurme	26ad239669	core, openai[patch]: prefer provider-assigned IDs when aggregating message chunks (#31080 ) When aggregating AIMessageChunks in a stream, core prefers the leftmost non-null ID. This is problematic because: - Core assigns IDs when they are null to `f"run-{run_manager.run_id}"` - The desired meaningful ID might not be available until midway through the stream, as is the case for the OpenAI Responses API. For the OpenAI Responses API, we assign message IDs to the top-level `AIMessage.id`. This works in `.(a)invoke`, but during `.(a)stream` the IDs get overwritten by the defaults assigned in langchain-core. These IDs [must](https://community.openai.com/t/how-to-solve-badrequesterror-400-item-rs-of-type-reasoning-was-provided-without-its-required-following-item-error-in-responses-api/1151686/9) be available on the AIMessage object to support passing reasoning items back to the API (e.g., if not using OpenAI's `previous_response_id` feature). We could add them elsewhere, but seeing as we've already made the decision to store them in `.id` during `.(a)invoke`, addressing the issue in core lets us fix the problem with no interface changes.	2025-05-02 11:18:18 -04:00
William FH	b5bf2d6218	0.3.57 (#31095 )	2025-05-01 23:42:26 -07:00
William FH	167afa5102	Enable run mutation (#31090 ) This lets you more easily modify a run in-flight	2025-05-01 17:00:51 -07:00
ccurme	403fae8eec	core: release 0.3.56 (#31000 )	2025-04-24 13:22:31 -04:00
ccurme	8fc7a723b9	core: release 0.3.56rc1 (#30998 )	2025-04-24 15:09:44 +00:00
ccurme	f4863f82e2	core[patch]: fix edge cases for _is_openai_data_block (#30997 )	2025-04-24 10:48:52 -04:00
Jacob Lee	6b0b317cb5	feat(core): Autogenerate filenames for when converting file content blocks to OpenAI format (#30984 ) CC @ccurme --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-24 13:36:31 +00:00
ccurme	faef3e5d50	core, standard-tests: support PDF and audio input in Chat Completions format (#30979 ) Chat models currently implement support for: - images in OpenAI Chat Completions format - other multimodal types (e.g., PDF and audio) in a cross-provider [standard format](https://python.langchain.com/docs/how_to/multimodal_inputs/) Here we update core to extend support to PDF and audio input in Chat Completions format. If an OAI-format PDF or audio content block is passed into any chat model, it will be transformed to the LangChain standard format. We assume that any chat model supporting OAI-format PDF or audio has implemented support for the standard format.	2025-04-23 18:32:51 +00:00
Bagatur	d4fc734250	core[patch]: update dict prompt template (#30967 ) Align with JS changes made in https://github.com/langchain-ai/langchainjs/pull/8043	2025-04-23 10:04:50 -07:00
ccurme	4bc70766b5	core, openai: support standard multi-modal blocks in convert_to_openai_messages (#30968 )	2025-04-23 11:20:44 -04:00
ccurme	8574442c57	core[patch]: release 0.3.55 (#30952 )	2025-04-21 17:56:24 +00:00
Nuno Campos	27296bdb0c	core: Make Graph.Node.data optional (#30943 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, eyurtsev, ccurme, vbarda, hwchase17.	2025-04-21 07:18:36 -07:00
Ahmed Tammaa	de56c31672	core: Improve OutputParser error messaging when model output is truncated (max_tokens) (#30936 ) Addresses #30158 When using the output parser—either in a chain or standalone—hitting max_tokens triggers a misleading “missing variable” error instead of indicating the output was truncated. This subtle bug often surfaces with Anthropic models. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-21 10:06:18 -04:00
ccurme	096f0e5966	core[patch]: de-beta usage callback (#30928 )	2025-04-18 15:45:09 +00:00

1 2 3 4 5 ...

912 Commits