langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-21 10:26:57 +00:00

Author	SHA1	Message	Date
zhurou603	1df3ee91e7	partners: (langchain-openai) total_tokens should not add 'Nonetype' t… (#31146 ) partners: (langchain-openai) total_tokens should not add 'Nonetype' t… # PR Description ## Description Fixed an issue in `langchain-openai` where `total_tokens` was incorrectly adding `None` to an integer, causing a TypeError. The fix ensures proper type checking before adding token counts. ## Issue Fixes the TypeError traceback shown in the image where `'NoneType'` cannot be added to an integer. ## Dependencies None ## Twitter handle None ![image](https://github.com/user-attachments/assets/9683a795-a003-455a-ada9-fe277245e2b2) Co-authored-by: qiulijie <qiulijie@yuaiweiwu.com>	2025-05-07 11:09:50 -04:00
Collier King	19041dcc95	docs: update langchain-cloudflare repo/path on packages.yaml (#31138 ) Library Repo Path Update : "langchain-cloudflare" We recently changed our `langchain-cloudflare` repo to allow for future libraries. Created a `libs` folder to hold `langchain-cloudflare` python package. https://github.com/cloudflare/langchain-cloudflare/tree/main/libs/langchain-cloudflare On `langchain`, updating `packages.yaml` to point to new `libs/langchain-cloudflare` library folder.	2025-05-07 11:01:25 -04:00
Jacob Lee	66d1ed6099	fix(core): Permit OpenAI style blocks to be passed into convert_to_openai_messages (#31140 ) Should effectively be a noop, just shouldn't throw CC @madams0013 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-05-07 10:57:37 -04:00
唐小鸭	50fa524a6d	partners: (langchain-deepseek) fix deepseek-r1 always returns an empty `reasoning_content` when reasoning (#31065 ) ## Description deepseek-r1 always returns an empty string `reasoning_content` to the first chunk when thinking, and sets `reasoning_content` to None when thinking is over, to determine when to switch to normal output. Therefore, whether the reasoning_content field exists should be judged as None. ## Demo deepseek-r1 reasoning output: ``` {'delta': {'content': None, 'function_call': None, 'refusal': None, 'role': 'assistant', 'tool_calls': None, 'reasoning_content': ''}, 'finish_reason': None, 'index': 0, 'logprobs': None} {'delta': {'content': None, 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': '好的'}, 'finish_reason': None, 'index': 0, 'logprobs': None} {'delta': {'content': None, 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': '，'}, 'finish_reason': None, 'index': 0, 'logprobs': None} {'delta': {'content': None, 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': '用户'}, 'finish_reason': None, 'index': 0, 'logprobs': None} ... ``` deepseek-r1 first normal output ``` ... {'delta': {'content': ' main', 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': None}, 'finish_reason': None, 'index': 0, 'logprobs': None} {'delta': {'content': '\n\nimport', 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': None}, 'finish_reason': None, 'index': 0, 'logprobs': None} ... ``` --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-05-05 22:31:58 +00:00
Stefano Lottini	325f729a92	docs: improvements to Astra DB pages, especially modernize Vector DB example notebook (#30961 ) This PR brings several improvements and modernizations to the documentation around the Astra DB partner package. - language alignment for better matching with the terms used in the Astra DB docs - updated several links to pages on said documentation - for the `AstraDBVectorStore`, added mentions of the new features in the overall `astra.mdx` - for the vector store, rewritten/upgraded most of the usage example notebook for a more straightforward experience able to highlight the main usage patterns (including new ones such as the newly-introduced "autodetect feature") --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-05-03 14:26:52 -04:00
Asif Mehmood	00ac49dd3e	Replace deprecated .dict() with .model_dump() for Pydantic v2 compatibility (#31107 ) What does this PR do? This PR replaces deprecated usages of ```.dict()``` with ```.model_dump()``` to ensure compatibility with Pydantic v2 and prepare for v3, addressing the deprecation warning ```PydanticDeprecatedSince20``` as required in [Issue# 31103](https://github.com/langchain-ai/langchain/issues/31103). Changes made: * Replaced ```.dict()``` with ```.model_dump()``` in multiple locations * Ensured consistency with Pydantic v2 migration guidelines * Verified compatibility across affected modules Notes * This is a code maintenance and compatibility update * Tested locally with Pydantic v2.11 * No functional logic changes; only internal method replacements to prevent deprecation issues	2025-05-03 13:40:54 -04:00
ccurme	6268ae8db0	langchain: release 0.3.25 (#31101 )	2025-05-02 17:42:32 +00:00
ccurme	77ecf47f6d	openai: release 0.3.16 (#31100 )	2025-05-02 13:14:46 -04:00
ccurme	ff41f47e91	core: release 0.3.58 (#31099 )	2025-05-02 12:46:32 -04:00
Eugene Yurtsev	4da525bc63	langchain[patch]: Remove beta decorator from init_embeddings (#31098 ) Remove beta decorator from init_embeddings.	2025-05-02 11:52:50 -04:00
ccurme	94139ffcd3	openai[patch]: format system content blocks for Responses API (#31096 ) ```python from langchain_core.messages import HumanMessage, SystemMessage from langchain_openai import ChatOpenAI llm = ChatOpenAI(model="gpt-4.1", use_responses_api=True) messages = [ SystemMessage("test"), # Works HumanMessage("test"), # Works SystemMessage([{"type": "text", "text": "test"}]), # Bug in this case HumanMessage([{"type": "text", "text": "test"}]), # Works SystemMessage([{"type": "input_text", "text": "test"}]) # Works ] llm._get_request_payload(messages) ```	2025-05-02 15:22:30 +00:00
ccurme	26ad239669	core, openai[patch]: prefer provider-assigned IDs when aggregating message chunks (#31080 ) When aggregating AIMessageChunks in a stream, core prefers the leftmost non-null ID. This is problematic because: - Core assigns IDs when they are null to `f"run-{run_manager.run_id}"` - The desired meaningful ID might not be available until midway through the stream, as is the case for the OpenAI Responses API. For the OpenAI Responses API, we assign message IDs to the top-level `AIMessage.id`. This works in `.(a)invoke`, but during `.(a)stream` the IDs get overwritten by the defaults assigned in langchain-core. These IDs [must](https://community.openai.com/t/how-to-solve-badrequesterror-400-item-rs-of-type-reasoning-was-provided-without-its-required-following-item-error-in-responses-api/1151686/9) be available on the AIMessage object to support passing reasoning items back to the API (e.g., if not using OpenAI's `previous_response_id` feature). We could add them elsewhere, but seeing as we've already made the decision to store them in `.id` during `.(a)invoke`, addressing the issue in core lets us fix the problem with no interface changes.	2025-05-02 11:18:18 -04:00
William FH	b5bf2d6218	0.3.57 (#31095 )	2025-05-01 23:42:26 -07:00
William FH	167afa5102	Enable run mutation (#31090 ) This lets you more easily modify a run in-flight	2025-05-01 17:00:51 -07:00
ccurme	c51eadd54f	openai[patch]: propagate service_tier to response metadata (#31089 )	2025-05-01 13:50:48 -04:00
ccurme	6110c3ffc5	openai[patch]: release 0.3.15 (#31087 )	2025-05-01 09:22:30 -04:00
Ben Gladwell	da59eb7eb4	anthropic: Allow kwargs to pass through when counting tokens (#31082 ) - Description: `ChatAnthropic.get_num_tokens_from_messages` does not currently receive `kwargs` and pass those on to `self._client.beta.messages.count_tokens`. This is a problem if you need to pass specific options to `count_tokens`, such as the `thinking` option. This PR fixes that. - Issue: N/A - Dependencies: None - Twitter handle: @bengladwell Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-30 17:56:22 -04:00
Really Him	918c950737	DOCS: `partners/chroma`: Fix documentation around `chroma` query filter syntax (#31058 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" Description: * Starting to put together some PR's to fix the typing around `langchain-chroma` `filter` and `where_document` query filtering, as mentioned: https://github.com/langchain-ai/langchain/issues/30879 https://github.com/langchain-ai/langchain/issues/30507 The typing of `dict[str, str]` is on the one hand too restrictive (marks valid filter expressions as ill-typed) and also too permissive (allows illegal filter expressions). That's not what this PR addresses though. This PR just removes from the documentation some examples of filters that are illegal, and also syntactically incorrect: (a) dictionaries with keys like `$contains` but the key is missing quotation marks; (b) dictionaries with multiple entries - this is illegal in Chroma filter syntax and will raise an exception. (`{"foo": "bar", "qux": "baz"}`). Filter dictionaries in Chroma must have one and one key only. Again this is just the documentation issue, which is the lowest hanging fruit. I also think we need to update the types for `filter` and `where_document` to be (at the very least `dict[str, Any]`), or, since we have access to Chroma's types, they should be `Where` and `WhereDocument` types. This has a wider blast radius though, so I'm starting small. This PR does not fix the issues mentioned above, it's just starting to get the ball rolling, and cleaning up the documentation. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Really Him <hesereallyhim@proton.me>	2025-04-30 17:51:07 -04:00
yberber-sap	952a0b7b40	Docs: Fix SAP HANA Cloud docs - remove pip output, update vectorstore link, rename provider (#31077 ) This PR includes the following documentation fixes for the SAP HANA Cloud vector store integration: - Removed stale output from the `%pip install` code cell. - Replaced an unrelated vectorstore documentation link on the provider overview page. - Renamed the provider from "SAP HANA" to "SAP HANA Cloud"	2025-04-30 08:57:40 -04:00
ccurme	bdb7c4a8b3	huggingface: fix embeddings return type (#31072 ) Integration tests failing cc @hanouticelina	2025-04-29 18:45:04 +00:00
célina	868f07f8f4	partners: (langchain-huggingface) Chat Models - Integrate Hugging Face Inference Providers and remove deprecated code (#30733 ) Hi there, I'm Célina from 🤗, This PR introduces support for Hugging Face's serverless Inference Providers (documentation [here](https://huggingface.co/docs/inference-providers/index)), allowing users to specify different providers for chat completion and text generation tasks. This PR also removes the usage of `InferenceClient.post()` method in `HuggingFaceEndpoint`, in favor of the task-specific `text_generation` method. `InferenceClient.post()` is deprecated and will be removed in `huggingface_hub v0.31.0`. --- ## Changes made - bumped the minimum required version of the `huggingface-hub` package to ensure compatibility with the latest API usage. - added a `provider` field to `HuggingFaceEndpoint`, enabling users to select the inference provider (e.g., 'cerebras', 'together', 'fireworks-ai'). Defaults to `hf-inference` (HF Inference API). - replaced the deprecated `InferenceClient.post()` call in `HuggingFaceEndpoint` with the task-specific `text_generation` method for future-proofing, `post()` will be removed in huggingface-hub v0.31.0. - updated the `ChatHuggingFace` component: - added async and streaming support. - added support for tool calling. - exposed underlying chat completion parameters for more granular control. - Added integration tests for `ChatHuggingFace` and updated the corresponding unit tests. ✅ All changes are backward compatible. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-29 09:53:14 -04:00
ccurme	3072e4610a	community: move to separate repo (continued) (#31069 ) Missed these after merging	2025-04-29 09:25:32 -04:00
ccurme	9ff5b5d282	community: move to separate repo (#31060 ) langchain-community is moving to https://github.com/langchain-ai/langchain-community	2025-04-29 09:22:04 -04:00
Sydney Runkle	7e926520d5	packaging: remove Python upper bound for langchain and co libs (#31025 ) Follow up to https://github.com/langchain-ai/langsmith-sdk/pull/1696, I've bumped the `langsmith` version where applicable in `uv.lock`. Type checking problems here because deps have been updated in `pyproject.toml` and `uv lock` hasn't been run - we should enforce that in the future - goes with the other dependabot todos :).	2025-04-28 14:44:28 -04:00
Sydney Runkle	d614842d23	ci: temporarily run chroma on 3.12 for CI (#31056 ) Waiting on a fix for https://github.com/chroma-core/chroma/issues/4382	2025-04-28 13:20:37 -04:00
Christophe Bornet	aee7988a94	community: add mypy warn_unused_ignores rule (#30816 )	2025-04-28 11:54:12 -04:00
Bae-ChangHyun	a2863f8757	community: add 'get_col_comments' option for retrieve database columns comments (#30646 ) ## Description Added support for retrieving column comments in the SQL Database utility. This feature allows users to see comments associated with database columns when querying table information. Column comments provide valuable metadata that helps LLMs better understand the semantics and purpose of database columns. A new optional parameter `get_col_comments` was added to the `get_table_info` method, defaulting to `False` for backward compatibility. When set to `True`, it retrieves and formats column comments for each table. Currently, this feature is supported on PostgreSQL, MySQL, and Oracle databases. ## Implementation You should create Table with column comments before. ```python db = SQLDatabase.from_uri("YOUR_DB_URI") print(db.get_table_info(get_col_comments=True)) ``` ## Result ``` CREATE TABLE test_table ( name VARCHAR school VARCHAR) /* Column Comments: {'name': person name, 'school":school_name} / / 3 rows from test_table: name a b c */ ``` ## Benefits 1. Enhances LLM's understanding of database schema semantics 2. Preserves valuable domain knowledge embedded in database design 3. Improves accuracy of SQL query generation 4. Provides more context for data interpretation Tests are available in `langchain/libs/community/tests/test_sql_get_table_info.py`. --------- Co-authored-by: chbae <chbae@gcsc.co.kr> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-28 15:19:46 +00:00
yberber-sap	3fb0a55122	Deprecate HanaDB, HanaTranslator and update example notebook to use new implementation (#30896 ) - Description: This PR marks the `HanaDB` vector store (and related utilities) in `langchain_community` as deprecated using the `@deprecated` annotation. - Set `since="0.1.0"` and `removal="1.0"` - Added a clear migration path and a link to the SAP-maintained replacement in the [`langchain_hana`](https://github.com/SAP/langchain-integration-for-sap-hana-cloud) package. Additionally, the example notebook has been updated to use the new `HanaDB` class from `langchain_hana`, ensuring users follow the recommended integration moving forward. - Issue: None - Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-27 16:37:35 -04:00
湛露先生	5fb8fd863a	langchain_openai: clean duplicate code for openai embedding. (#30872 ) The `_chunk_size` has not changed by method `self._tokenize`, So i think these is duplicate code. Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>	2025-04-27 15:07:41 -04:00
ccurme	ba2518995d	standard-tests: add condition for image tool message test (#31041 ) Require support for [standard format](https://python.langchain.com/docs/how_to/multimodal_inputs/).	2025-04-27 17:24:43 +00:00
ccurme	04a899ebe3	infra: support third-party integration packages in API ref build (#31021 )	2025-04-25 16:02:27 -04:00
ccurme	a60fd06784	docs: document OpenAI flex processing (#31023 ) Following https://github.com/langchain-ai/langchain/pull/31005	2025-04-25 15:10:25 -04:00
ccurme	629b7a5a43	openai[patch]: add explicit attribute for service tier (#31005 )	2025-04-25 18:38:23 +00:00
ccurme	ab871a7b39	docs: enable milvus in API ref build (#31016 ) Reverts langchain-ai/langchain#30996 Should be fixed following https://github.com/langchain-ai/langchain-milvus/pull/68	2025-04-25 12:48:10 +00:00
Georgi Stefanov	d30c56a8c1	langchain: return attachments in _get_response (#30853 ) This is a PR to return the message attachments in _get_response, as when files are generated these attachments are not returned thus generated files cannot be retrieved Fixes issue: https://github.com/langchain-ai/langchain/issues/30851	2025-04-24 21:39:11 -04:00
ccurme	a7903280dd	openai[patch]: delete redundant tests (#31004 ) These are covered by standard tests.	2025-04-24 17:56:32 +00:00
Kyle Jeong	d0f0d1f966	[docs/community]: langchain docs + browserbaseloader fix (#30973 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" community: fix browserbase integration docs: update docs - [ ] PR message: *Delete this entire checklist* and replace with - Description: Updated BrowserbaseLoader to use the new python sdk. - Issue: update browserbase integration with langchain - Dependencies: n/a - Twitter handle: @kylejeong21 - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2025-04-24 13:38:49 -04:00
ccurme	403fae8eec	core: release 0.3.56 (#31000 )	2025-04-24 13:22:31 -04:00
ccurme	10a9c24dae	openai: fix streaming reasoning without summaries (#30999 ) Following https://github.com/langchain-ai/langchain/pull/30909: need to retain "empty" reasoning output when streaming, e.g., ```python {'id': 'rs_...', 'summary': [], 'type': 'reasoning'} ``` Tested by existing integration tests, which are currently failing.	2025-04-24 16:01:45 +00:00
ccurme	8fc7a723b9	core: release 0.3.56rc1 (#30998 )	2025-04-24 15:09:44 +00:00
ccurme	f4863f82e2	core[patch]: fix edge cases for _is_openai_data_block (#30997 )	2025-04-24 10:48:52 -04:00
Jacob Lee	6b0b317cb5	feat(core): Autogenerate filenames for when converting file content blocks to OpenAI format (#30984 ) CC @ccurme --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-24 13:36:31 +00:00
ccurme	21962e2201	docs: temporarily disable milvus in API ref build (#30996 )	2025-04-24 09:31:23 -04:00
Behrad Hemati	1eb0bdadfa	community: add indexname to other functions in opensearch (#30987 ) - [x] PR title: "community: add indexname to other functions in opensearch" - [x] PR message: - Description: add ability to over-ride index-name if provided in the kwargs of sub-functions. When used in WSGI application it's crucial to be able to dynamically change parameters. - [ ] Add tests and docs: - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2025-04-24 08:59:33 -04:00
Nicky Parseghian	7ecdac5240	community: Strip URLs from sitemap. (#30830 ) Fixes #30829 - Description: Simply strips the loc value when building the element. - Issue: Fixes #30829	2025-04-23 18:18:42 -04:00
ccurme	faef3e5d50	core, standard-tests: support PDF and audio input in Chat Completions format (#30979 ) Chat models currently implement support for: - images in OpenAI Chat Completions format - other multimodal types (e.g., PDF and audio) in a cross-provider [standard format](https://python.langchain.com/docs/how_to/multimodal_inputs/) Here we update core to extend support to PDF and audio input in Chat Completions format. If an OAI-format PDF or audio content block is passed into any chat model, it will be transformed to the LangChain standard format. We assume that any chat model supporting OAI-format PDF or audio has implemented support for the standard format.	2025-04-23 18:32:51 +00:00
Bagatur	d4fc734250	core[patch]: update dict prompt template (#30967 ) Align with JS changes made in https://github.com/langchain-ai/langchainjs/pull/8043	2025-04-23 10:04:50 -07:00
ccurme	4bc70766b5	core, openai: support standard multi-modal blocks in convert_to_openai_messages (#30968 )	2025-04-23 11:20:44 -04:00
ccurme	e4877e5ef1	fireworks: release 0.3.0 (#30977 )	2025-04-23 10:08:38 -04:00
Christophe Bornet	8c5ae108dd	text-splitters: Set strict mypy rules (#30900 ) * Add strict mypy rules * Fix mypy violations * Add error codes to all type ignores * Add ruff rule PGH003 * Bump mypy version to 1.15	2025-04-22 20:41:24 -07:00
ccurme	eedda164c6	fireworks[minor]: remove default model and temperature (#30965 ) `mixtral-8x-7b-instruct` was recently retired from Fireworks Serverless. Here we remove the default model altogether, so that the model must be explicitly specified on init: ```python ChatFireworks(model="accounts/fireworks/models/llama-v3p1-70b-instruct") # for example ``` We also set a null default for `temperature`, which previously defaulted to 0.0. This parameter will no longer be included in request payloads unless it is explicitly provided.	2025-04-22 15:58:58 -04:00
CLOVA Studio 개발	577cb53a00	community: update Naver integration to use langchain-naver package and improve documentation (#30956 ) ## Description: This PR was requested after the `langchain-naver` partner-managed packages were completed. We build our package as requested in [this comment](https://github.com/langchain-ai/langchain/pull/29243#issuecomment-2595222791) and the initial version is now uploaded to [pypi](https://pypi.org/project/langchain-naver/). So we've updated some our documents with the additional changed features and how to download our partner-managed package. ## Dependencies: https://github.com/langchain-ai/langchain/pull/29243#issuecomment-2595222791 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-22 12:00:10 -04:00
ccurme	a7c1bccd6a	openai[patch]: remove xfails from image token counting tests (#30963 ) These appear to be passing again.	2025-04-22 15:55:33 +00:00
ccurme	25d77aa8b4	community: release 0.3.22 (#30962 )	2025-04-22 15:34:47 +00:00
ccurme	59fd4cb4c0	docs: update package registry sort order (#30960 )	2025-04-22 15:27:32 +00:00
ccurme	b8c454b42b	langchain: release 0.3.24 (#30959 )	2025-04-22 11:23:34 -04:00
Dmitrii Rashchenko	a43df006de	Support of openai reasoning summary streaming (#30909 ) langchain_openai: Support of reasoning summary streaming Description: OpenAI API now supports streaming reasoning summaries for reasoning models (o1, o3, o3-mini, o4-mini). More info about it: https://platform.openai.com/docs/guides/reasoning#reasoning-summaries It is supported only in Responses API (not Completion API), so you need to create LangChain Open AI model as follows to support reasoning summaries streaming: ``` llm = ChatOpenAI( model="o4-mini", # also o1, o3, o3-mini support reasoning streaming use_responses_api=True, # reasoning streaming works only with responses api, not completion api model_kwargs={ "reasoning": { "effort": "high", # also "low" and "medium" supported "summary": "auto" # some models support "concise" summary, some "detailed", but auto will always work } } ) ``` Now, if you stream events from llm: ``` async for event in llm.astream_events(prompt, version="v2"): print(event) ``` or ``` for chunk in llm.stream(prompt): print (chunk) ``` OpenAI API will send you new types of events: `response.reasoning_summary_text.added` `response.reasoning_summary_text.delta` `response.reasoning_summary_text.done` These events are new, so they were ignored. So I have added support of these events in function `_convert_responses_chunk_to_generation_chunk`, so reasoning chunks or full reasoning added to the chunk additional_kwargs. Example of how this reasoning summary may be printed: ``` async for event in llm.astream_events(prompt, version="v2"): if event["event"] == "on_chat_model_stream": chunk: AIMessageChunk = event["data"]["chunk"] if "reasoning_summary_chunk" in chunk.additional_kwargs: print(chunk.additional_kwargs["reasoning_summary_chunk"], end="") elif "reasoning_summary" in chunk.additional_kwargs: print("\n\nFull reasoning step summary:", chunk.additional_kwargs["reasoning_summary"]) elif chunk.content and chunk.content[0]["type"] == "text": print(chunk.content[0]["text"], end="") ``` or ``` for chunk in llm.stream(prompt): if "reasoning_summary_chunk" in chunk.additional_kwargs: print(chunk.additional_kwargs["reasoning_summary_chunk"], end="") elif "reasoning_summary" in chunk.additional_kwargs: print("\n\nFull reasoning step summary:", chunk.additional_kwargs["reasoning_summary"]) elif chunk.content and chunk.content[0]["type"] == "text": print(chunk.content[0]["text"], end="") ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-22 14:51:13 +00:00
Alexander Ng	0f6fa34372	Community: Valyu Integration docs (#30926 ) PR title: docs: add Valyu integration documentation Description: This PR adds documentation and example notebooks for the Valyu integration, including retriever and tool usage. Issue: N/A Dependencies: No new dependencies. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-21 17:43:00 -04:00
ccurme	8574442c57	core[patch]: release 0.3.55 (#30952 )	2025-04-21 17:56:24 +00:00
ccurme	920d504e47	fireworks[patch]: update model in LLM integration tests (#30951 ) `mixtral-8x7b-instruct` has been retired.	2025-04-21 17:53:27 +00:00
Anton Masalovich	1f3054502e	community: fix cost calculations for 4.1 and o4 in OpenAI callback (#30899 ) Issue: #30898	2025-04-21 10:59:47 -04:00
Ahmed Tammaa	589bc19890	anthropic[patch]: make description optional on AnthropicTool (#30935 ) PR Summary This change adds a fallback in ChatAnthropic.with_structured_output() to handle Pydantic models that don’t include a docstring. Without it, calling: ```py from pydantic import BaseModel from langchain_anthropic import ChatAnthropic class SampleModel(BaseModel): sample_field: str llm = ChatAnthropic( model="claude-3-7-sonnet-latest" ).with_structured_output(SampleModel.model_json_schema()) llm.invoke("test") ``` will raise a ``` KeyError: 'description' ``` because Pydantic omits the description field when no docstring is present. This issue doesn’t occur when using ChatOpenAI or if you add a docstring to the model: ```py from pydantic import BaseModel from langchain_openai import ChatOpenAI class SampleModel(BaseModel): """Schema for sample_field output.""" sample_field: str llm = ChatOpenAI( model="gpt-4o-mini" ).with_structured_output(SampleModel.model_json_schema()) llm.invoke("test") ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-21 10:44:39 -04:00
Nuno Campos	27296bdb0c	core: Make Graph.Node.data optional (#30943 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, eyurtsev, ccurme, vbarda, hwchase17.	2025-04-21 07:18:36 -07:00
Ahmed Tammaa	de56c31672	core: Improve OutputParser error messaging when model output is truncated (max_tokens) (#30936 ) Addresses #30158 When using the output parser—either in a chain or standalone—hitting max_tokens triggers a misleading “missing variable” error instead of indicating the output was truncated. This subtle bug often surfaces with Anthropic models. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-21 10:06:18 -04:00
xsai9101	335f089d6a	Community: Add bind variable support for oracle adb docloader (#30937 ) PR title: Community: Add bind variable support for oracle adb docloader Description: This PR adds support of using bind variable to oracle adb doc loader class, including minor document change. Issue: N/A Dependencies: No new dependencies.	2025-04-21 08:47:33 -04:00
Aubrey Ford	23f701b08e	langchain_community: OpenAIEmbeddings not respecting chunk_size argument (#30946 ) This is a follow-on PR to go with the identical changes that were made in parters/openai. Previous PR: https://github.com/langchain-ai/langchain/pull/30757 When calling embed_documents and providing a chunk_size argument, that argument is ignored when OpenAIEmbeddings is instantiated with its default configuration (where check_embedding_ctx_length=True). _get_len_safe_embeddings specifies a chunk_size parameter but it's not being passed through in embed_documents, which is its only caller. This appears to be an oversight, especially given that the _get_len_safe_embeddings docstring states it should respect "the set embedding context length and chunk size." Developers typically expect method parameters to take effect (also, take precedence) when explicitly provided, especially when instantiating using defaults. I was confused as to why my API calls were being rejected regardless of the chunk size I provided.	2025-04-21 08:39:07 -04:00
Aubrey Ford	b344f34635	partners/openai: OpenAIEmbeddings not respecting chunk_size argument (#30757 ) When calling `embed_documents` and providing a `chunk_size` argument, that argument is ignored when `OpenAIEmbeddings` is instantiated with its default configuration (where `check_embedding_ctx_length=True`). `_get_len_safe_embeddings` specifies a `chunk_size` parameter but it's not being passed through in `embed_documents`, which is its only caller. This appears to be an oversight, especially given that the `_get_len_safe_embeddings` docstring states it should respect "the set embedding context length and chunk size." Developers typically expect method parameters to take effect (also, take precedence) when explicitly provided, especially when instantiating using defaults. I was confused as to why my API calls were being rejected regardless of the chunk size I provided. This bug also exists in langchain_community package. I can add that to this PR if requested otherwise I will create a new one once this passes.	2025-04-18 15:27:27 -04:00
Konsti-s	017c8079e1	partners: ChatAnthropic supports urls (#30809 ) Description: partners-anthropic: ChatAnthropic supports b64 and urls in the part[image_url][url] message variable Issue: ChatAnthropic right now only supports b64 encoded images in the part[image_url][url] message variable. This PR enables ChatAnthropic to also accept image urls in said variable and makes it compatible with OpenAI messages to make model switching easier. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-18 15:15:45 -04:00
Volodymyr Tkachuk	d0cd115356	community: Add deprecation decorator to SingleStore community integrations (#30846 ) SingleStore integration now has its package `langchain-singlestore', so the community implementation will no longer be maintained. Added `deprecated` decorator to `SingleStoreDBChatMessageHistory`, `SingleStoreDBSemanticCache`, and `SingleStoreDB` classes in the community package. Dependencies: https://github.com/langchain-ai/langchain/pull/30841 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-18 12:58:39 -04:00
Alejandro Rodríguez	34ddfba76b	community: support usage_metadata for litellm streaming calls (#30683 ) Support "usage_metadata" for LiteLLM streaming calls. This is a follow-up to https://github.com/langchain-ai/langchain/pull/30625, which tackled non-streaming calls. If no one reviews your PR within a few days, please @-mention one of baskaryan, eyurtsev, ccurme, vbarda, hwchase17.	2025-04-18 12:50:32 -04:00
Volodymyr Tkachuk	5ffcd01c41	docs: Register langchain-singlestore integration (#30841 ) I created and published `langchain-singlestoe` integration package that should replace SingleStoreDB community implementation.	2025-04-18 12:11:33 -04:00
ccurme	096f0e5966	core[patch]: de-beta usage callback (#30928 )	2025-04-18 15:45:09 +00:00
Behrad Hemati	d624a475e4	community: change metadata in opensearch mmr (#30921 ) - [ ] PR message: - Description: including metadata_field in max_marginal_relevance_search() would result in error, changed the logic to be similar to how it's handled in similarity_search, where it can be any field or simply a "*" to include every field	2025-04-18 10:10:23 -04:00
rylativity	dbf9986d44	langchain-ollama (partners) / langchain-core: allow passing ChatMessages to Ollama (including arbitrary roles) (#30411 ) Replacement for PR #30191 (@ccurme) Description: currently, ChatOllama [will raise a value error if a ChatMessage is passed to it](https://github.com/langchain-ai/langchain/blob/master/libs/partners/ollama/langchain_ollama/chat_models.py#L514), as described https://github.com/langchain-ai/langchain/pull/30147#issuecomment-2708932481. Furthermore, ollama-python is removing the limitations on valid roles that can be passed through chat messages to a model in ollama - https://github.com/ollama/ollama-python/pull/462#event-16917810634. This PR removes the role limitations imposed by langchain and enables passing langchain ChatMessages with arbitrary 'role' values through the langchain ChatOllama class to the underlying ollama-python Client. As this PR relies on [merged but unreleased functionality in ollama-python]( https://github.com/ollama/ollama-python/pull/462#event-16917810634), I have temporarily pointed the ollama package source to the main branch of the ollama-python github repo. Format, lint, and tests of new functionality passing. Need to resolve issue with recently added ChatOllama tests. (Now resolved) Issue: resolves #30122 (related to ollama issue https://github.com/ollama/ollama/issues/8955) Dependencies: no new dependencies [x] PR title [x] PR message [x] Lint and test: format, lint, and test all running successfully and passing --------- Co-authored-by: Ryan Stewart <ryanstewart@Ryans-MacBook-Pro.local> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-18 10:07:07 -04:00
Christophe Bornet	0c723af4b0	langchain[lint]: fix mypy type ignores (#30894 ) * Remove unused ignores * Add type ignore codes * Add mypy rule `warn_unused_ignores` * Add ruff rule PGH003 NB: some `type: ignore[unused-ignore]` are added because the ignores are needed when `extended_testing_deps.txt` deps are installed.	2025-04-17 17:54:34 -04:00
Sydney Runkle	98c357b3d7	core: release 0.3.54 (#30911 )	2025-04-17 14:27:06 -04:00
Vadym Barda	d2cbfa379f	core[patch]: add retries and better messages to draw_mermaid_png (#30881 )	2025-04-17 18:25:37 +00:00
Sydney Runkle	75e50a3efd	core[patch]: Raise `AttributeError` (instead of `ModuleNotFoundError`) in custom `__getattr__` (#30905 ) Follow up to https://github.com/langchain-ai/langchain/pull/30769, fixing the regression reported [here](https://github.com/langchain-ai/langchain/pull/30769#issuecomment-2807483610), thanks @krassowski for the report! Fix inspired by https://github.com/PrefectHQ/prefect/pull/16172/files Other changes: * Using tuples for `__all__`, except in `output_parsers` bc of a list namespace conflict * Using a helper function for imports due to repeated logic across `__init__.py` files becoming hard to maintain. Co-authored-by: Michał Krassowski < krassowski 5832902+krassowski@users.noreply.github.com>"	2025-04-17 14:15:28 -04:00
ccurme	61d2dc011e	openai: release 0.3.14 (#30908 )	2025-04-17 10:49:14 -04:00
ccurme	f0f90c4d88	anthropic: release 0.3.12 (#30907 )	2025-04-17 14:45:12 +00:00
ccurme	f01b89df56	standard-tests: release 0.3.19 (#30906 )	2025-04-17 10:37:44 -04:00
ccurme	add6a78f98	standard-tests, openai[patch]: add support standard audio inputs (#30904 )	2025-04-17 10:30:57 -04:00
ccurme	2c2db1ab69	core: release 0.3.53 (#30901 )	2025-04-17 13:10:32 +00:00
ccurme	86d51f6be6	multiple: permit optional fields on multimodal content blocks (#30887 ) Instead of stuffing provider-specific fields in `metadata`, they can go directly on the content block.	2025-04-17 12:48:46 +00:00
湛露先生	ff2930c119	partners: bug fix check_imports.py exit code. (#30897 ) Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>	2025-04-17 08:02:23 -04:00
ccurme	fa362189a1	docs: document OpenAI reasoning summaries (#30882 )	2025-04-16 19:21:14 +00:00
Sydney Runkle	88fce67724	core: Removing unnecessary `pydantic` core schema rebuilds (#30848 ) We only need to rebuild model schemas if type annotation information isn't available during declaration - that shouldn't be the case for these types corrected here. Need to do more thorough testing to make sure these structures have complete schemas, but hopefully this boosts startup / import time.	2025-04-16 12:00:08 -04:00
rrozanski-smabbler	60d8ade078	Galaxia integration (#30792 ) - [ ] PR title: "docs: adding Smabbler's Galaxia integration" - [ ] PR message: Twitter handle: @Galaxia_graph I'm adding docs here + added the package to the packages.yml. I didn't add a unit test, because this integration is just a thin wrapper on top of our API. There isn't much left to test if you mock it away. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-16 10:39:04 -04:00
ccurme	ca39680d2a	ollama: release 0.3.2 (#30865 )	2025-04-16 09:14:57 -04:00
milosz-l	4ff576e37d	langchain: infer Perplexity provider for sonar model prefix (#30861 ) Description: This PR adds provider inference logic to `init_chat_model` for Perplexity models that use the "sonar..." prefix (`sonar`, `sonar-pro`, `sonar-reasoning`, `sonar-reasoning-pro` or `sonar-deep-research`). This allows users to initialize these models by simply passing the model name, without needing to explicitly set `model_provider="perplexity"`. The docstring for `init_chat_model` has also been updated to reflect this new inference rule.	2025-04-15 18:17:21 -04:00
ccurme	085baef926	ollama[patch]: support standard image format (#30864 ) Following https://github.com/langchain-ai/langchain/pull/30746	2025-04-15 22:14:50 +00:00
ccurme	47ded80b64	ollama[patch]: fix generation info (#30863 ) https://github.com/langchain-ai/langchain/pull/30778 (not released) broke all invocation modes of ChatOllama (intent was to remove `"message"` from `generation_info`, but we turned `generation_info` into `stream_resp["message"]`), resulting in validation errors.	2025-04-15 19:22:58 +00:00
Sydney Runkle	cf2697ec53	chroma: release 0.2.3 (#30860 )	2025-04-15 14:11:23 -04:00
ccurme	8e9569cbc8	perplexity: release 0.1.1 (#30859 )	2025-04-15 18:02:15 +00:00
ccurme	dd5f5902e3	openai: release 0.3.13 (#30858 )	2025-04-15 17:58:12 +00:00
ccurme	3382ee8f57	anthropic: release 0.3.11 (#30857 )	2025-04-15 17:57:00 +00:00
Sydney Runkle	ef5aff3b6c	core[fix]: Fix `__dir__` in `__init__.py` for `output_parsers` module (#30856 ) We have a `list.py` file which causes a namespace conflict with `list` from stdlib, unfortunately. `__all__` is already a list, so no need to coerce.	2025-04-15 13:09:13 -04:00
Christophe Bornet	a4ca1fe0ed	core: Remove some noqa (#30855 )	2025-04-15 13:08:40 -04:00
ccurme	6baf5c05a6	standard-tests: release 0.3.18 (#30854 )	2025-04-15 16:56:54 +00:00
Sydney Runkle	1f5e207379	core[fix]: remove `load` from dynamic imports dict (#30849 )	2025-04-15 12:02:46 -04:00
ccurme	7240458619	core: release 0.3.52 (#30850 )	2025-04-15 15:28:31 +00:00
Sydney Runkle	6aa5494a75	Fix `from langchain_core.load.load import load` import (#30843 ) TL;DR: you can't optimize imports with a lazy `__getattr__` if there is a namespace conflict with a module name and an attribute name. We should avoid introducing conflicts like this in the future. This PR fixes a bug introduced by my lazy imports PR: https://github.com/langchain-ai/langchain/pull/30769. In `langchain_core`, we have utilities for loading and dumping data. Unfortunately, one of those utilities is a `load` function, located in `langchain_core/load/load.py`. To make this function more visible, we make it accessible at the top level `langchain_core.load` module via importing the function in `langchain_core/load/__init__.py`. So, either of these imports should work: ```py from langchain_core.load import load from langchain_core.load.load import load ``` As you can tell, this is already a bit confusing. You'd think that the first import would produce the module `load`, but because of the `__init__.py` shortcut, both produce the function `load`. <details> More on why the lazy imports PR broke this support... All was well, except when the absolute import was run first, see the last snippet: ``` >>> from langchain_core.load import load >>> load <function load at 0x101c320c0> ``` ``` >>> from langchain_core.load.load import load >>> load <function load at 0x1069360c0> ``` ``` >>> from langchain_core.load import load >>> load <function load at 0x10692e0c0> >>> from langchain_core.load.load import load >>> load <function load at 0x10692e0c0> ``` ``` >>> from langchain_core.load.load import load >>> load <function load at 0x101e2e0c0> >>> from langchain_core.load import load >>> load <module 'langchain_core.load.load' from '/Users/sydney_runkle/oss/langchain/libs/core/langchain_core/load/load.py'> ``` In this case, the function `load` wasn't stored in the globals cache for the `langchain_core.load` module (by the lazy import logic), so Python defers to a module import. </details> New `langchain` tongue twister 😜: we've created a problem for ourselves because you have to load the load function from the load file in the load module 😨.	2025-04-15 11:06:13 -04:00
Bagatur	7262de4217	core[patch]: dict chat prompt template support (#25674 ) - Support passing dicts as templates to chat prompt template - Support making any attribute on a message a runtime variable - Significantly simpler than trying to update our existing prompt template classes ```python template = ChatPromptTemplate( [ { "role": "assistant", "content": [ { "type": "text", "text": "{text1}", "cache_control": {"type": "ephemeral"}, }, {"type": "image_url", "image_url": {"path": "{local_image_path}"}}, ], "name": "{name1}", "tool_calls": [ { "name": "{tool_name1}", "args": {"arg1": "{tool_arg1}"}, "id": "1", "type": "tool_call", } ], }, { "role": "tool", "content": "{tool_content2}", "tool_call_id": "1", "name": "{tool_name1}", }, ] ) ``` will likely close #25514 if we like this idea and update to use this logic --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-15 11:00:49 -04:00
ccurme	9cfe6bcacd	multiple: multi-modal content blocks (#30746 ) Introduces standard content block format for images, audio, and files. ## Examples Image from url: ``` { "type": "image", "source_type": "url", "url": "https://path.to.image.png", } ``` Image, in-line data: ``` { "type": "image", "source_type": "base64", "data": "<base64 string>", "mime_type": "image/png", } ``` PDF, in-line data: ``` { "type": "file", "source_type": "base64", "data": "<base64 string>", "mime_type": "application/pdf", } ``` File from ID: ``` { "type": "file", "source_type": "id", "id": "file-abc123", } ``` Plain-text file: ``` { "type": "file", "source_type": "text", "text": "foo bar", } ```	2025-04-15 09:48:06 -04:00
Sydney Runkle	59f2c9e737	Tinkering with CodSpeed (#30824 ) Fix CI to trigger benchmarks on `run-codspeed-benchmarks` label addition Reduce scope of async benchmark to save time on CI Waiting to merge this PR until we figure out how to use walltime on local runners.	2025-04-15 08:49:09 -04:00
William FH	ed5c4805f6	Consistent docstring indentation (#30834 ) Should be 4 spaces instead of 3.	2025-04-14 19:04:35 -07:00
ccurme	f7c4965fb6	openai[patch]: update imports in test (#30828 ) Quick fix to unblock CI, will need to address in core separately.	2025-04-14 19:33:38 +00:00
Sydney Runkle	edb6a23aea	core[lint]: fix issue with unused ignore in `__init__.py` files (#30825 ) Fixing a race condition between https://github.com/langchain-ai/langchain/pull/30769 and https://github.com/langchain-ai/langchain/pull/30737	2025-04-14 17:57:00 +00:00
湛露先生	3a64c7195f	community: redis tool typos fix (#30811 )	2025-04-14 09:01:36 -04:00
Sydney Runkle	4f69094b51	core[performance]: use custom `__getattr__` in `__init__.py` files for lazy imports (#30769 ) Most easily reviewed with the "hide whitespace" option toggled. Seeing 10-50% speed ups in import time for common structures 🚀 The general purpose of this PR is to lazily import structures within `langchain_core.XXX_module.__init__.py` so that we're not eagerly importing expensive dependencies (`pydantic`, `requests`, etc). Analysis of flamegraphs generated with `importtime` motivated these changes. For example, the one below demonstrates that importing `HumanMessage` accidentally triggered imports for `importlib.metadata`, `requests`, etc. There's still much more to do on this front, and we can start digging into our own internal code for optimizations now that we're less concerned about external imports. <img width="1210" alt="Screenshot 2025-04-11 at 1 10 54 PM" src="https://github.com/user-attachments/assets/112a3fe7-24a9-4294-92c1-d5ae64df839e" /> I've tracked the improvements with some local benchmarks: ## `pytest-benchmark` results \| Name \| Before (s) \| After (s) \| Delta (s) \| % Change \| \|-----------------------------\|------------\|-----------\|-----------\|----------\| \| Document \| 2.8683 \| 1.2775 \| -1.5908 \| -55.46% \| \| HumanMessage \| 2.2358 \| 1.1673 \| -1.0685 \| -47.79% \| \| ChatPromptTemplate \| 5.5235 \| 2.9709 \| -2.5526 \| -46.22% \| \| Runnable \| 2.9423 \| 1.7793 \| -1.163 \| -39.53% \| \| InMemoryVectorStore \| 3.1180 \| 1.8417 \| -1.2763 \| -40.93% \| \| RunnableLambda \| 2.7385 \| 1.8745 \| -0.864 \| -31.55% \| \| tool \| 5.1231 \| 4.0771 \| -1.046 \| -20.42% \| \| CallbackManager \| 4.2263 \| 3.4099 \| -0.8164 \| -19.32% \| \| LangChainTracer \| 3.8394 \| 3.3101 \| -0.5293 \| -13.79% \| \| BaseChatModel \| 4.3317 \| 3.8806 \| -0.4511 \| -10.41% \| \| PydanticOutputParser \| 3.2036 \| 3.2995 \| 0.0959 \| 2.99% \| \| InMemoryRateLimiter \| 0.5311 \| 0.5995 \| 0.0684 \| 12.88% \| Note the lack of change for `InMemoryRateLimiter` and `PydanticOutputParser` is just random noise, I'm getting comparable numbers locally. ## Local CodSpeed results We're still working on configuring CodSpeed on CI. The local usage produced similar results.	2025-04-14 08:57:54 -04:00
Christophe Bornet	ada740b5b9	community: Add ruff rule PGH003 (#30812 ) See https://docs.astral.sh/ruff/rules/blanket-type-ignore/ --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-14 02:32:13 +00:00
ccurme	f005988e31	community[patch]: fix cost calculations for o3 in OpenAI callback (#30807 ) Resolves https://github.com/langchain-ai/langchain/issues/30795	2025-04-13 15:20:46 +00:00
Marina Gómez	afd457d8e1	perplexity[patch]: Fix #30767 : Handle missing citations attribute in ChatPerplexity (#30805 ) This PR fixes an issue where ChatPerplexity would raise an AttributeError when the citations attribute was missing from the model response (e.g., when using offline models like r1-1776). The fix checks for the presence of citations, images, and related_questions before attempting to access them, avoiding crashes in models that don't provide these fields. Tested locally with models that omit citations, and the fix works as expected.	2025-04-13 09:24:05 -04:00
Christophe Bornet	42944f3499	core: Improve mypy config (#30737 ) * Cleanup mypy config * Add mypy `strict` rules except `disallow_any_generics`, `warn_return_any` and `strict_equality` (TODO) * Add mypy `strict_byte` rule * Add mypy support for PEP702 `@deprecated` decorator * Bump mypy version to 1.15 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-04-11 16:35:13 -04:00
Christophe Bornet	913c896598	core: Add ruff rules FBT001 and FBT002 (#30695 ) Add ruff rules [FBT001](https://docs.astral.sh/ruff/rules/boolean-type-hint-positional-argument/) and [FBT002](https://docs.astral.sh/ruff/rules/boolean-default-value-positional-argument/). Mostly `noqa`s to not introduce breaking changes and possible non-breaking fixes have already been done in a [previous PR](https://github.com/langchain-ai/langchain/pull/29424). These rules will prevent new violations to happen.	2025-04-11 16:26:33 -04:00
William FH	2803a48661	core[patch]: Share executor for async callbacks run in sync context (#30779 ) To avoid having to create ephemeral threads, grab the thread lock, etc.	2025-04-11 10:34:43 -07:00
Sydney Runkle	fdc2b4bcac	core[lint]: Use 3.9 formatting for docs and tests (#30780 ) Looks like `pyupgrade` was already used here but missed some docs and tests. This helps to keep our docs looking professional and up to date. Eventually, we should lint / format our inline docs.	2025-04-11 10:39:25 -04:00
Sydney Runkle	48affc498b	langchain[lint]: use `pyupgrade` to get to 3.9 standards (#30782 )	2025-04-11 10:33:26 -04:00
ccurme	d9b628e764	xai: release 0.2.3 (#30790 )	2025-04-11 14:05:11 +00:00
ccurme	9cfb95e621	xai[patch]: support reasoning content (#30758 ) https://docs.x.ai/docs/guides/reasoning ```python from langchain.chat_models import init_chat_model llm = init_chat_model( "xai:grok-3-mini-beta", reasoning_effort="low" ) response = llm.invoke("Hello, world!") ```	2025-04-11 14:00:27 +00:00
Christophe Bornet	89f28a24d3	core[lint]: Fix typing in `test_async_callbacks` (#30788 )	2025-04-11 07:26:38 -04:00
Sydney Runkle	8c6734325b	partners[lint]: run `pyupgrade` to get code in line with 3.9 standards (#30781 ) Using `pyupgrade` to get all `partners` code up to 3.9 standards (mostly, fixing old `typing` imports).	2025-04-11 07:18:44 -04:00
Jacob Lee	e72f3c26a0	fix(ollama): Remove redundant message from response_metadata (#30778 )	2025-04-10 23:12:57 -07:00
Christophe Bornet	dc19d42d37	core: Specify code when ignoring type issue (ruff PGH003) (#30675 ) See https://docs.astral.sh/ruff/rules/blanket-type-ignore/	2025-04-10 22:23:52 -04:00
Paul Czarkowski	68d16d8a07	Community: Add Managed Identity support for Azure AI Search (#30730 ) Add Managed Identity support for Azure AI Search --------- Signed-off-by: Paul Czarkowski <username.taken@gmail.com>	2025-04-10 22:22:58 -04:00
Eugene Yurtsev	e42b3d285a	langchain: remove langchain-server script (#30755 ) Has been replaced by langsmith a long long time ago	2025-04-10 22:11:42 -04:00
Pol de Font-Réaulx	48cf7c838d	feat(community): add oauth2 support for Jira toolkit (#30684 ) Description: add support for oauth2 in Jira tool by adding the possibility to pass a dictionary with oauth parameters. I also adapted the documentation to show this new behavior	2025-04-10 22:04:09 -04:00
Oleg Ovcharuk	b6fe7e8c10	docs: YDB Vector Store docs (#30636 ) This PR adds docs about how to use YDB as a vector store [YDB](https://ydb.tech/) is a versatile open-source distributed SQL database. It supports [vector search](https://ydb.tech/docs/en/yql/reference/udf/list/knn) which means it can be used as a vector store with langchain. YDB vectore store comes with [langchain-ydb](https://pypi.org/project/langchain-ydb/) pypi package. Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-10 21:33:56 -04:00
湛露先生	7a4ae6fbff	community[patch]: simplify cache logic (#30760 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, eyurtsev, ccurme, vbarda, hwchase17. Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>	2025-04-10 19:20:57 -04:00
ccurme	8e053ac9d2	core[patch]: support customization of backoff parameters in `with_retries` (#30773 ) Co-authored-by: Sydney Runkle <54324534+sydney-runkle@users.noreply.github.com>	2025-04-10 19:18:36 -04:00
William FH	70532a65f8	Async callback benchmark (#30777 )	2025-04-10 15:47:19 -07:00
Sydney Runkle	8f8fea2d7e	[performance]: Use hard coded `langchain-core` version to avoid `importlib` import (#30744 ) This PR aims to reduce import time of `langchain-core` tools by removing the `importlib.metadata` import previously used in `__init__.py`. This is the first in a sequence of PRs to reduce import time delays for `langchain-core` features and structures 🚀. Because we're now hard coding the version, we need to make sure `version.py` and `pyproject.toml` stay in sync, so I've added a new CI job that runs whenever either of those files are modified. [This run](https://github.com/langchain-ai/langchain/actions/runs/14358012706/job/40251952044?pr=30744) demonstrates the failure that occurs whenever the version gets out of sync (thus blocking a PR). Before, note the ~15% of time spent on the `importlib.metadata` /related imports <img width="1081" alt="Screenshot 2025-04-09 at 9 06 15 AM" src="https://github.com/user-attachments/assets/59f405ec-ee8d-4473-89ff-45dea5befa31" /> After (note, lack of `importlib.metadata` time sink): <img width="1245" alt="Screenshot 2025-04-09 at 9 01 23 AM" src="https://github.com/user-attachments/assets/9c32e77c-27ce-485e-9b88-e365193ed58d" />	2025-04-10 14:15:02 -04:00
Sydney Runkle	cd6a83117c	Adding more import time benchmarks for `langchain-core` (#30770 ) Plus minor typo fix in `ChatPromptTemplate` case id.	2025-04-10 11:50:12 -04:00
amohan	44b83460b2	docs: Add Cloudflare integrations (#30749 ) Description: This PR adds documentation for the langchain-cloudflare integration package. Issue: N/A Dependencies: No new dependencies are required. Tests and Docs: Added an example notebook demonstrating the usage of the langchain-cloudflare package, located in docs/docs/integrations. Added a new package to libs/packages.yml. Lint and Format: Successfully ran make format and make lint. --------- Co-authored-by: Collier King <collier@cloudflare.com> Co-authored-by: Collier King <collierking99@gmail.com>	2025-04-10 09:27:23 -04:00
ccurme	63c16f5ca8	community: deprecate AzureCosmosDBNoSqlVectorSearch in favor of langchain-azure-ai implementation (#30756 )	2025-04-09 21:04:16 +00:00
Christophe Bornet	4cc7bc6c93	core: Add ruff rules PLR (#30696 ) Add ruff rules [PLR](https://docs.astral.sh/ruff/rules/#refactor-plr) Except PLR09xxx and PLR2004. Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-04-09 15:15:38 -04:00
célina	68361f9c2d	partners: (langchain-huggingface) Embeddings - Integrate Inference Providers and remove deprecated code (#30735 ) Hi there, This is a complementary PR to #30733. This PR introduces support for Hugging Face's serverless Inference Providers (documentation [here](https://huggingface.co/docs/inference-providers/index)), allowing users to specify different providers This PR also removes the usage of `InferenceClient.post()` method in `HuggingFaceEndpointEmbeddings`, in favor of the task-specific `feature_extraction` method. `InferenceClient.post()` is deprecated and will be removed in `huggingface_hub` v0.31.0. ## Changes made - bumped the minimum required version of the `huggingface_hub` package to ensure compatibility with the latest API usage. - added a provider field to `HuggingFaceEndpointEmbeddings`, enabling users to select the inference provider. - replaced the deprecated `InferenceClient.post()` call in `HuggingFaceEndpointEmbeddings` with the task-specific `feature_extraction` method for future-proofing, `post()` will be removed in `huggingface-hub` v0.31.0. ✅ All changes are backward compatible. --------- Co-authored-by: Lucain <lucainp@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-09 19:05:43 +00:00
Christophe Bornet	98f0016fc2	core: Add ruff rules ARG (#30732 ) See https://docs.astral.sh/ruff/rules/#flake8-unused-arguments-arg	2025-04-09 14:39:36 -04:00
Sydney Runkle	78ec7d886d	[performance]: Adding benchmarks for common `langchain-core` imports (#30747 ) The first in a sequence of PRs focusing on improving performance in core. We're starting with reducing import times for common structures, hence the benchmarks here. The benchmark looks a little bit complicated - we have to use a process so that we don't suffer from Python's import caching system. I tried doing manual modification of `sys.modules` between runs, but that's pretty tricky / hacky to get right, hence the subprocess approach. Motivated by extremely slow baseline for common imports (we're talking 2-5 seconds): <img width="633" alt="Screenshot 2025-04-09 at 12 48 12 PM" src="https://github.com/user-attachments/assets/994616fe-1798-404d-bcbe-48ad0eb8a9a0" /> Also added a `make benchmark` command to make local runs easy :). Currently using walltimes so that we can track total time despite using a manual proces.	2025-04-09 13:00:15 -04:00
German Molina	5fb261ce27	community: Google Vertex AI Search now returns the website title as part of the document metadata (#30688 ) Google vertex ai search will now return the title of the found website as part of the document metadata, if available. Thank you for contributing to LangChain! - Description: Vertex AI Search can be used to index websites and then develop chatbots that use these websites to answer questions. At present, the document metadata includes an `id` and `source` (which is the URL). While the URL is enough to create a link, the ID is not descriptive enough to show users. Therefore, I propose we return `title` as well, when available (e.g., it will not be available in `.txt` documents found during the website indexing). - Issue: No bug in particular, but it would be better if this was here. - Dependencies: None - I do not use twitter. Format, Lint and Test seem to be all good.	2025-04-09 08:54:06 -04:00
Sydney Runkle	4556b81b1d	Clean up `numpy` dependencies and speed up 3.13 CI with `numpy>=2.1.0` (#30714 ) Generally, this PR is CI performance focused + aims to clean up some dependencies at the same time. 1. Unpins upper bounds for `numpy` in all `pyproject.toml` files where `numpy` is specified 2. Requires `numpy >= 2.1.0` for Python 3.13 and `numpy > v1.26.0` for Python 3.12, plus a `numpy` min version bump for `chroma` 3. Speeds up CI by minutes - linting on Python 3.13, installing `numpy < 2.1.0` was taking [~3 minutes](https://github.com/langchain-ai/langchain/actions/runs/14316342925/job/40123305868?pr=30713), now the entire env setup takes a few seconds 4. Deleted the `numpy` test dependency from partners where that was not used, specifically `huggingface`, `voyageai`, `xai`, and `nomic`. It's a bit unfortunate that `langchain-community` depends on `numpy`, we might want to try to fix that in the future... Closes https://github.com/langchain-ai/langchain/issues/26026 Fixes https://github.com/langchain-ai/langchain/issues/30555	2025-04-08 09:45:07 -04:00
湛露先生	9cbe91896e	Fix deepseek release tag, as it is update name. (#30717 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, eyurtsev, ccurme, vbarda, hwchase17. Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>	2025-04-08 08:43:16 -04:00
Nithish Raghunandanan	893942651b	docs: Update couchbase vector store docs (#30710 ) - Update LangChain-Couchbase documentation - Rename `CouchbaseVectorStore` in favor of `CouchbaseSearchVectorStore` - [x] Lint and test	2025-04-07 18:45:14 -04:00
ccurme	a2bec5f2e5	ollama: release 0.3.1 (#30716 )	2025-04-07 20:31:25 +00:00
ccurme	e3f15f0a47	ollama[patch]: add model_name to response metadata (#30706 ) Fixes [this standard test](https://python.langchain.com/api_reference/standard_tests/integration_tests/langchain_tests.integration_tests.chat_models.ChatModelIntegrationTests.html#langchain_tests.integration_tests.chat_models.ChatModelIntegrationTests.test_usage_metadata).	2025-04-07 16:27:58 -04:00
ccurme	e106e9602f	groq[patch]: add retries to integration tests (#30707 ) Tool-calling tests started intermittently failing with > groq.APIError: Failed to call a function. Please adjust your prompt. See 'failed_generation' for more details.	2025-04-07 12:45:53 -04:00
Mohammad Mohtashim	e935da0b12	ChatTongyi reasoning_content fix (#30694 ) - Description: Small fix for `reasoning_content` key - Issue: #30689	2025-04-07 09:27:33 -04:00
Tin Lai	4d03ba4686	langchain_qdrant: fix showing the missing sparse vector name (#30701 ) Description: The error message was supposed to display the missing vector name, but instead, it includes only the existing collection configs. This simple PR just includes the correct variable name, so that the user knows the requested vector does not exist in the collection. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, eyurtsev, ccurme, vbarda, hwchase17. Signed-off-by: Tin Lai <tin@tinyiu.com>	2025-04-07 09:19:08 -04:00
Christophe Bornet	6650b94627	core: Add ruff rules PYI (#29335 ) See https://docs.astral.sh/ruff/rules/#flake8-pyi-pyi --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-04-04 19:59:44 +00:00
Philippe PRADOS	d8e3b7667f	community[patch]: Fix empty producer in PDF Parsers (#30620 ) Fix an issue where if a pdf file doesn't have a “producer” in metadata, it generates an exception.	2025-04-04 15:53:49 -04:00
Christophe Bornet	f0159c7125	core: Add ruff rules PGH (except PGH003) (#30656 ) Add ruff rules PGH: https://docs.astral.sh/ruff/rules/#pygrep-hooks-pgh Except PGH003 which will be dealt in a dedicated PR. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2025-04-04 19:53:27 +00:00
Armaanjeet Singh Sandhu	7c2468f36b	core: Fix handler removal in BaseCallbackManager (Fixes #30640 ) (#30659 ) Description: Fixed a bug in `BaseCallbackManager.remove_handler()` that caused a `ValueError` when removing a handler added via the constructor's `handlers` parameter. The issue occurred because handlers passed to the constructor were added only to the `handlers` list and not automatically to `inheritable_handlers` unless explicitly specified. However, `remove_handler()` attempted to remove the handler from both lists unconditionally, triggering a `ValueError` when it wasn't in `inheritable_handlers`. The fix ensures the method checks for the handler’s presence in each list before attempting removal, making it more robust while preserving its original behavior. Issue: Fixes #30640 Dependencies: None --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-04-04 15:45:15 -04:00
Mohammad Mohtashim	bff56c5fa6	community[patch]: `Redundant` Parser checker for Webbaseloader (#30632 ) - Description: We do not need to set parser in `scrape` since it is already been done in `_scrape` - Issue: #30629, not directly related but makes sure xml parser is used	2025-04-04 14:11:26 -04:00
Christophe Bornet	150ac0cb79	core: Add ruff rules DTZ (#30657 ) Add ruff rules DTZ: https://docs.astral.sh/ruff/rules/#flake8-datetimez-dtz	2025-04-04 13:43:47 -04:00
Christophe Bornet	5e418c2666	core: Rework pydantic version checks (#30653 ) This pull request includes various changes to the `langchain_core` library, focusing on improving compatibility with different versions of Pydantic. The primary change involves replacing checks for Pydantic major versions with boolean flags, which simplifies the code and improves readability. This also solves ruff rule checks for [RUF048](https://docs.astral.sh/ruff/rules/map-int-version-parsing/) and [PLR2004](https://docs.astral.sh/ruff/rules/magic-value-comparison/). Key changes include: ### Compatibility Improvements: * [`libs/core/langchain_core/output_parsers/json.py`](diffhunk://#diff-5add0cf7134636ae4198a1e0df49ee332ae0c9123c3a2395101e02687c717646L22-R24): Replaced `PYDANTIC_MAJOR_VERSION` with `IS_PYDANTIC_V1` to check for Pydantic version 1. * [`libs/core/langchain_core/output_parsers/pydantic.py`](diffhunk://#diff-2364b5b4aee01c462aa5dbda5dc3a877dcd20f29df173ad540dc8adf8b192361L14-R14): Updated version checks from `PYDANTIC_MAJOR_VERSION` to `IS_PYDANTIC_V2` in the `PydanticOutputParser` class. [[1]](diffhunk://#diff-2364b5b4aee01c462aa5dbda5dc3a877dcd20f29df173ad540dc8adf8b192361L14-R14) [[2]](diffhunk://#diff-2364b5b4aee01c462aa5dbda5dc3a877dcd20f29df173ad540dc8adf8b192361L27-R27) ### Utility Enhancements: * [`libs/core/langchain_core/utils/pydantic.py`](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896R23): Introduced `IS_PYDANTIC_V1` and `IS_PYDANTIC_V2` flags and deprecated the `get_pydantic_major_version` function. Updated various functions to use these flags instead of version numbers. [[1]](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896R23) [[2]](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896R42-R78) [[3]](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896L90-R89) [[4]](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896L104-R101) [[5]](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896L120-R122) [[6]](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896L135-R132) [[7]](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896L149-R151) [[8]](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896L164-R161) [[9]](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896L248-R250) [[10]](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896L330-R335) [[11]](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896L356-R357) [[12]](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896L393-R390) [[13]](diffhunk://#diff-ff28020c5f1073a8b63bcd9d8b756a187fd682cb81935295120c63b207071896L403-R400) ### Test Updates: * [`libs/core/tests/unit_tests/output_parsers/test_openai_tools.py`](diffhunk://#diff-694cc0318edbd6bbca34f53304934062ad59ba9f5a788252ce6c5f5452489d67L19-R22): Updated tests to use `IS_PYDANTIC_V1` and `IS_PYDANTIC_V2` for version checks. [[1]](diffhunk://#diff-694cc0318edbd6bbca34f53304934062ad59ba9f5a788252ce6c5f5452489d67L19-R22) [[2]](diffhunk://#diff-694cc0318edbd6bbca34f53304934062ad59ba9f5a788252ce6c5f5452489d67L532-R535) [[3]](diffhunk://#diff-694cc0318edbd6bbca34f53304934062ad59ba9f5a788252ce6c5f5452489d67L567-R570) [[4]](diffhunk://#diff-694cc0318edbd6bbca34f53304934062ad59ba9f5a788252ce6c5f5452489d67L602-R605) * [`libs/core/tests/unit_tests/prompts/test_chat.py`](diffhunk://#diff-3e60e744842086a4f3c4b21bc83e819c3435720eab210078e77e2430fb8c7e84R7): Replaced version tuple checks with `PYDANTIC_VERSION` comparisons. [[1]](diffhunk://#diff-3e60e744842086a4f3c4b21bc83e819c3435720eab210078e77e2430fb8c7e84R7) [[2]](diffhunk://#diff-3e60e744842086a4f3c4b21bc83e819c3435720eab210078e77e2430fb8c7e84L35-R38) [[3]](diffhunk://#diff-3e60e744842086a4f3c4b21bc83e819c3435720eab210078e77e2430fb8c7e84L924-R927) [[4]](diffhunk://#diff-3e60e744842086a4f3c4b21bc83e819c3435720eab210078e77e2430fb8c7e84L935-R938) * [`libs/core/tests/unit_tests/runnables/test_graph.py`](diffhunk://#diff-99a290330ef40103d0ce02e52e21310d6fadea142bfdea13c94d23fc81c0bb5dR3): Simplified version checks using `PYDANTIC_VERSION`. [[1]](diffhunk://#diff-99a290330ef40103d0ce02e52e21310d6fadea142bfdea13c94d23fc81c0bb5dR3) [[2]](diffhunk://#diff-99a290330ef40103d0ce02e52e21310d6fadea142bfdea13c94d23fc81c0bb5dL15-R18) [[3]](diffhunk://#diff-99a290330ef40103d0ce02e52e21310d6fadea142bfdea13c94d23fc81c0bb5dL234-L239) * [`libs/core/tests/unit_tests/runnables/test_runnable.py`](diffhunk://#diff-06bed920c0dad0cfd41d57a8d9e47a7b56832409649c10151061a791860d5bb5L18-R20): Introduced `PYDANTIC_VERSION_AT_LEAST_29` and `PYDANTIC_VERSION_AT_LEAST_210` for more readable version checks. [[1]](diffhunk://#diff-06bed920c0dad0cfd41d57a8d9e47a7b56832409649c10151061a791860d5bb5L18-R20) [[2]](diffhunk://#diff-06bed920c0dad0cfd41d57a8d9e47a7b56832409649c10151061a791860d5bb5L92-R99) [[3]](diffhunk://#diff-06bed920c0dad0cfd41d57a8d9e47a7b56832409649c10151061a791860d5bb5L230-R233) [[4]](diffhunk://#diff-06bed920c0dad0cfd41d57a8d9e47a7b56832409649c10151061a791860d5bb5L652-R655)	2025-04-04 13:42:30 -04:00
Christophe Bornet	43b5dc7191	core: Add ruff rules TD and FIX (#30654 ) Add ruff rules: * FIX: https://docs.astral.sh/ruff/rules/#flake8-fixme-fix * TD: https://docs.astral.sh/ruff/rules/#flake8-todos-td Code cleanup: * [`libs/core/langchain_core/outputs/chat_generation.py`](diffhunk://#diff-a1017ee46f58fa4005b110ffd4f8e1fb08f6a2a11d6ca4c78ff8be641cbb89e5L56-R56): Removed the "HACK" prefix from a comment in the `set_text` method. Configuration adjustments: * [`libs/core/pyproject.toml`](diffhunk://#diff-06baaee12b22a370fef9f170c9ed13e2727e377d3b32f5018430f4f0a39d3537R85-R93): Added new rules `FIX002`, `TD002`, and `TD003` to the ignore list. * [`libs/core/pyproject.toml`](diffhunk://#diff-06baaee12b22a370fef9f170c9ed13e2727e377d3b32f5018430f4f0a39d3537L102-L108): Removed the `FIX` and `TD` rules from the ignore list. Test refinement: * [`libs/core/tests/unit_tests/runnables/test_runnable.py`](diffhunk://#diff-06bed920c0dad0cfd41d57a8d9e47a7b56832409649c10151061a791860d5bb5L3231-R3232): Updated a TODO comment to improve clarity in the `test_map_stream` function.	2025-04-04 13:40:42 -04:00
ccurme	a007c57285	docs: update package registry sort order (#30677 )	2025-04-04 13:12:39 -04:00
Sydney Runkle	33ed7c31da	docs: fix perplexity install instructions in `ChatPerplexity` docstring (#30676 ) * `openai` install no longer needs to be done manually	2025-04-04 12:58:18 -04:00
Dhruvajyoti Sarma	f9bb5ec5d0	feature: removed pandas dataframe dependency for similary_search when using DuckDB as vector store (#30445 ) - [ ] PR title: "community: Removes pandas dependency for using DuckDB for similarity search" - [ ] PR message: - Description: Removes pandas dependency for using DuckDB for similarity search. The old function still exists as `similarity_search_pd`, while the new one is at `similarity_search` and requires no code changes. Return format remains the same. - Issue: Issue #29933 and update on PR #30435 - Dependencies: No dependencies	2025-04-04 12:19:18 -04:00
Akshay Dongare	f79473b752	Solved issue `Implement langchain-litellm` #30368 (#30637 ) PR title: - [x] 1. docs: docs/docs/integrations/providers/LiteLLM.md - [x] 2. docs: docs/docs/integrations/chat/litellm.ipynb - [x] 3. libs: libs/packages.yml - [x] PR message: - Description: Implement langchain-litellm - Issue: the issue #30368 - Twitter handle: akshay_d02 - LinkedIn Handle https://linkedin.com/in/akshay-dongare - [x] Add tests and docs: Done - [x] Lint and test: Done --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-04 16:12:10 +00:00
Yiğit Bekir Kaya, PhD	87e82fe1e8	Added langchain-qwq package documentation (Alibaba Cloud) (#30628 ) LangChain QwQ allows non-Tongyi users to access thinking models with extra capabilities which serve as an extension to Alibaba Cloud. Hi @ccurme I'm back with the updated PR this time with documentation and a finished package. - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - Description: adds documentation of `langchain-qwq` integration package. Also adds it to Alibaba Cloud provider - Issue: #30580 #30317 #30579 - Dependencies: openai, json-repair - Twitter handle: YigitBekir - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, eyurtsev, ccurme, vbarda, hwchase17.	2025-04-04 11:47:14 -04:00
Andrew Benton	4e7a9a7014	community: Add support for custom runtimes to Riza tools (#30664 ) Description: Adds support for Riza custom runtimes to the two Riza code interpreter tools, allowing users to run LLM-generated code that depends on libraries outside stdlib. Issue: N/A Dependencies: None Twitter handle: @rizaio	2025-04-04 11:03:14 -04:00
diego dupin	aa37893c00	MariaDB vector store documentation addition (#30229 ) ### New Feature Since version 11.7.1, MariaDB support vector. This is a super fast implementation (see [some perf blog](https://smalldatum.blogspot.com/2025/01/evaluating-vector-indexes-in-mariadb.html) The goal is to support MariaDB with langchain Implementation is done in https://github.com/mariadb-corporation/langchain-mariadb, published in https://pypi.org/project/langchain-mariadb/ This concerns the doc addition (initial PR https://github.com/langchain-ai/langchain/pull/29989) --------- Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Oskar Stark <oskarstark@googlemail.com>	2025-04-04 14:56:25 +00:00
Sydney Runkle	1cdea6ab07	langchain-community: release 0.3.21 (#30673 )	2025-04-04 14:14:50 +00:00
Sydney Runkle	901dffe06b	langchain: release 0.3.23 (#30670 ) * Bump `text-splitters` min version * Bump `langchain-core` min version * Bump `langchain` version 🚀	2025-04-04 10:06:29 -04:00
ccurme	0c2c8c36c1	text-splitters: release 0.3.8 (#30671 )	2025-04-04 09:58:45 -04:00
ccurme	59d508a2ee	openai[patch]: make computer test more reliable (#30672 )	2025-04-04 13:53:59 +00:00
Sydney Runkle	c235328b39	Revert "update langchain version and bump min core v" This reverts commit `d0f154dbaa`.	2025-04-04 09:31:51 -04:00
Sydney Runkle	d0f154dbaa	update langchain version and bump min core v	2025-04-04 09:27:49 -04:00
Sydney Runkle	32cd70d7d2	release: bump core to `v0.3.51` (#30668 )	2025-04-04 13:23:09 +00:00
Max Forsey	18cf457eec	langchain-runpod integration (#30648 ) ## Description: This PR adds the necessary documentation for the `langchain-runpod` partner package integration. It includes: * A provider page (`docs/docs/integrations/providers/runpod.ipynb`) explaining the overall setup. * An LLM component page (`docs/docs/integrations/llms/runpod.ipynb`) detailing the `RunPod` class usage. * A Chat Model component page (`docs/docs/integrations/chat/runpod.ipynb`) detailing the `ChatRunPod` class usage, including a feature support table. These documentation files reflect the latest features of the `langchain-runpod` package (v0.2.0+) such as async support and API polling logic. This work also addresses the review feedback provided on the previous attempt in PR #30246 by: * Removing all TODOs from documentation. * Adding the required links between provider and component pages. * Completing the feature support table in the chat documentation. * Linking to the source code on GitHub for API reference. Finally, it registers the `langchain-runpod` package in `libs/packages.yml`. ## Dependencies: None added to the core LangChain repository by these documentation changes. The required dependency (`langchain-runpod`) is managed as a separate package. ## Twitter handle: @runpod_io --------- Co-authored-by: Max Forsey <maxpod@maxpod.local> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-03 23:57:06 +00:00
Sydney Runkle	af66ab098e	Adding `Perplexity` extra and deprecating the community version of `ChatPerplexity` (#30649 ) Plus, some accompanying docs updates Some compelling usage: ```py from langchain_perplexity import ChatPerplexity chat = ChatPerplexity(model="llama-3.1-sonar-small-128k-online") response = chat.invoke( "What were the most significant newsworthy events that occurred in the US recently?", extra_body={"search_recency_filter": "week"}, ) print(response.content) # > Here are the top significant newsworthy events in the US recently: ... ``` Also, some confirmation of structured outputs: ```py from langchain_perplexity import ChatPerplexity from pydantic import BaseModel class AnswerFormat(BaseModel): first_name: str last_name: str year_of_birth: int num_seasons_in_nba: int messages = [ {"role": "system", "content": "Be precise and concise."}, { "role": "user", "content": ( "Tell me about Michael Jordan. " "Please output a JSON object containing the following fields: " "first_name, last_name, year_of_birth, num_seasons_in_nba. " ), }, ] llm = ChatPerplexity(model="llama-3.1-sonar-small-128k-online") structured_llm = llm.with_structured_output(AnswerFormat) response = structured_llm.invoke(messages) print(repr(response)) #> AnswerFormat(first_name='Michael', last_name='Jordan', year_of_birth=1963, num_seasons_in_nba=15) ```	2025-04-03 14:29:17 -04:00
ccurme	374769e8fe	core[patch]: log information from certain errors (#30626 ) Some exceptions raised by SDKs include information in httpx responses (see for example [OpenAI](https://github.com/openai/openai-python/blob/main/src/openai/_exceptions.py)). Here we trace information from those exceptions. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2025-04-03 16:45:19 +00:00
Sydney Runkle	17a9cd61e9	Bump `langchain-core` version in perplexity's `pyproject.toml` (#30647 ) Blocking v0.1.0 release of `langchain-perplexity`	2025-04-03 16:19:10 +00:00
Sydney Runkle	3814bd1ea7	partners: Add Perplexity Chat Integration (#30618 ) Perplexity's importance in the space has been growing, so we think it's time to add an official integration! Note: following the release of `langchain-perplexity` to `pypi`, we should be able to add `perplexity` as an extra in `libs/langchain/pyproject.toml`, but we're blocked by a circular import for now. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-03 16:09:14 +00:00
Alejandro Rodríguez	884125e129	community: support usage_metadata for litellm (#30625 ) Support "usage_metadata" for LiteLLM. If no one reviews your PR within a few days, please @-mention one of baskaryan, eyurtsev, ccurme, vbarda, hwchase17.	2025-04-02 19:45:15 -04:00
Christophe Bornet	f241fd5c11	core: Add ruff rules RET (#29384 ) See https://docs.astral.sh/ruff/rules/#flake8-return-ret All auto-fixes	2025-04-02 16:59:56 -04:00
Eugene Yurtsev	9ae792f56c	core: 0.3.50 release (#30623 ) 0.3.50 release	2025-04-02 14:46:23 -04:00
Christophe Bornet	ccc3d32ec8	core: Add ruff rules for Pylint PLC (Convention) and PLE (Errors) (#29286 ) See https://docs.astral.sh/ruff/rules/#pylint-pl	2025-04-02 10:58:03 -04:00
ccurme	fe0fd9dd70	openai[patch]: upgrade tiktoken and fix test (#30621 ) Related to https://github.com/langchain-ai/langchain/issues/30344 https://github.com/langchain-ai/langchain/pull/30542 introduced an erroneous test for token counts for o-series models. tiktoken==0.8 does not support o-series models in `tiktoken.encoding_for_model(model_name)`, and this is the version of tiktoken we had in the lock file. So we would default to `cl100k_base` for o-series, which is the wrong encoding model. The test tested against this wrong encoding (so it passed with tiktoken 0.8). Here we update tiktoken to 0.9 in the lock file, and fix the expected counts in the test. Verified that we are pulling [o200k_base](https://github.com/openai/tiktoken/blob/main/tiktoken/model.py#L8), as expected.	2025-04-02 10:44:48 -04:00
oxy-tg	38807871ec	docs: Add Oxylabs integration (#30591 ) Description: This PR adds documentation for the langchain-oxylabs integration package. The documentation includes instructions for configuring Oxylabs credentials and provides example code demonstrating how to use the package. Issue: N/A Dependencies: No new dependencies are required. Tests and Docs: Added an example notebook demonstrating the usage of the Langchain-Oxylabs package, located in docs/docs/integrations. Added a provider page in docs/docs/providers. Added a new package to libs/packages.yml. Lint and Test: Successfully ran make format, make lint, and make test.	2025-04-02 14:40:32 +00:00
ccurme	816492e1d3	openai: release 0.3.12 (#30616 )	2025-04-02 13:20:15 +00:00
Bagatur	111dd90a46	openai[patch]: support structured output and tools (#30581 ) Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-02 09:14:02 -04:00
Mahir Shah	9d3262c7aa	core: Propagate config_factories in RunnableBinding (#30603 ) - Description: Propagates config_factories when calling decoration methods for RunnableBinding--e.g. bind, with_config, with_types, with_retry, and with_listeners. This ensures that configs attached to the original RunnableBinding are kept when creating the new RunnableBinding and the configs are merged during invocation. Picks up where #30551 left off. - Issue: #30531 Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-01 18:03:58 -04:00
ccurme	8a69de5c24	openai[patch]: ignore file blocks when counting tokens (#30601 ) OpenAI does not appear to document how it transforms PDF pages to images, which determines how tokens are counted: https://platform.openai.com/docs/guides/pdf-files?api-mode=chat#usage-considerations Currently these block types raise ValueError inside `get_num_tokens_from_messages`. Here we update to generate a warning and continue.	2025-04-01 15:29:33 -04:00
Christophe Bornet	558191198f	core: Add ruff rule FBT003 (boolean-trap) (#29424 ) See https://docs.astral.sh/ruff/rules/boolean-positional-value-in-call/#boolean-positional-value-in-call-fbt003 This PR also fixes some FBT001/002 in private methods but does not enforce these rules globally atm. Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-04-01 17:40:12 +00:00
Christophe Bornet	4f8ea13cea	core: Add ruff rules PERF (#29375 ) See https://docs.astral.sh/ruff/rules/#perflint-perf	2025-04-01 13:34:56 -04:00
Christophe Bornet	8a33402016	core: Add ruff rules PT (pytest) (#29381 ) See https://docs.astral.sh/ruff/rules/#flake8-pytest-style-pt	2025-04-01 13:31:07 -04:00
Christophe Bornet	768e4f695a	core: Add ruff rules S110 and S112 (#30599 )	2025-04-01 13:17:22 -04:00
Christophe Bornet	88b4233fa1	core: Add ruff rules D (docstring) (#29406 ) This ensures that the code is properly documented: https://docs.astral.sh/ruff/rules/#pydocstyle-d Related to #21983	2025-04-01 13:15:45 -04:00
Andras L Ferenczi	64df60e690	community[minor]: Add custom sitemap URL parameter to GitbookLoader (#30549 ) ## Description This PR adds a new `sitemap_url` parameter to the `GitbookLoader` class that allows users to specify a custom sitemap URL when loading content from a GitBook site. This is particularly useful for GitBook sites that use non-standard sitemap file names like `sitemap-pages.xml` instead of the default `sitemap.xml`. The standard `GitbookLoader` assumes that the sitemap is located at `/sitemap.xml`, but some GitBook instances (including GitBook's own documentation) use different paths for their sitemaps. This parameter makes the loader more flexible and helps users extract content from a wider range of GitBook sites. ## Issue Fixes bug [30473](https://github.com/langchain-ai/langchain/issues/30473) where the `GitbookLoader` would fail to find pages on GitBook sites that use custom sitemap URLs. ## Dependencies No new dependencies required. I've added: * Unit tests to verify the parameter works correctly * Integration tests to confirm the parameter is properly used with real GitBook sites * Updated docstrings with parameter documentation The changes are fully backward compatible, as the parameter is optional with a sensible default. --------- Co-authored-by: andrasfe <andrasf94@gmail.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2025-04-01 16:17:21 +00:00
Christophe Bornet	fdda1aaea1	core: Accept ALL ruff rules with exclusions (#30595 ) This pull request updates the `pyproject.toml` configuration file to modify the linting rules and ignored warnings for the project. The most important changes include switching to a more comprehensive selection of linting rules and updating the list of ignored rules to better align with the project's requirements. Linting rules update: * Changed the `select` option to include all available linting rules by setting it to `["ALL"]`. Ignored rules update: * Updated the `ignore` option to include specific rules that interfere with the formatter, are incompatible with Pydantic, or are temporarily excluded due to project constraints.	2025-04-01 11:17:51 -04:00
Kacper Włodarczyk	26a3256fc6	community[major]: DynamoDBChatMessageHistory bulk add messages, raise errors (#30572 ) This PR addresses two key issues: - Prevent history errors from failing silently: Previously, errors in message history were only logged and not raised, which can lead to inconsistent state and downstream failures (e.g., ValidationError from Bedrock due to malformed message history). This change ensures that such errors are raised explicitly, making them easier to detect and debug. (Side note: I’m using AWS Lambda Powertools Logger but hadn’t configured it properly with the standard Python logger—my bad. If the error had been raised, I would’ve seen it in the logs 😄) This is a BREAKING CHANGE - Add messages in bulk instead of iteratively: This introduces a custom add_messages method to add all messages at once. The previous approach failed silently when individual messages were too large, resulting in partial history updates and inconsistent state. With this change, either all messages are added successfully, or none are—helping avoid obscure history-related errors from Bedrock. --------- Co-authored-by: Kacper Wlodarczyk <kacper.wlodarczyk@chaosgears.com>	2025-04-01 11:13:32 -04:00
Armaanjeet Singh Sandhu	4bbc249b13	community: Fix attribute access for transcript text in YoutubeLoader (Fixes #30309 ) (#30582 ) Description: Fixes a bug in the YoutubeLoader where FetchedTranscript objects were not properly processed. The loader was only extracting the 'text' attribute from FetchedTranscriptSnippet objects while ignoring 'start' and 'duration' attributes. This would cause a TypeError when the code later tried to access these missing keys, particularly when using the CHUNKS format or any code path that needed timestamp information. This PR modifies the conversion of FetchedTranscriptSnippet objects to include all necessary attributes, ensuring that the loader works correctly with all transcript formats. Issue: Fixes #30309 Dependencies: None Testing: - Tested the fix with multiple YouTube videos to confirm it resolves the issue - Verified that both regular loading and CHUNKS format work correctly	2025-04-01 07:13:06 -04:00
Ivan Brko	ecff055096	community[minor]: Improve Brave Search Tool, allow api key in env var (#30364 ) - Description: - Make Brave Search Tool consistent with other tools and allow reading its api key from `BRAVE_SEARCH_API_KEY` instead of having to pass the api key manually (no breaking changes) - Improve Brave Search Tool by storing api key in `SecretStr` instead of plain `str`. - Add unit test for `BraveSearchWrapper` - Reflect the changes in the documentation - Issue: N/A - Dependencies: N/A - Twitter handle: ivan_brko	2025-03-31 14:48:52 -04:00
ccurme	0c623045b5	core[patch]: pydantic 2.11 compat (#30554 ) Release notes: https://pydantic.dev/articles/pydantic-v2-11-release Covered here: - We no longer access `model_fields` on class instances (that is now deprecated); - Update schema normalization for Pydantic version testing to reflect changes to generated JSON schema (addition of `"additionalProperties": True` for dict types with value Any or object). ## Considerations: ### Changes to JSON schema generation #### Tool-calling / structured outputs This may impact tool-calling + structured outputs for some providers, but schema generation only changes if you have parameters of the form `dict`, `dict[str, Any]`, `dict[str, object]`, etc. If dict parameters are typed my understanding is there are no changes. For OpenAI for example, untyped dicts work for structured outputs with default settings before and after updating Pydantic, and error both before/after if `strict=True`. ### Use of `model_fields` There is one spot where we previously accessed `super(cls, self).model_fields`, where `cls` is an object in the MRO. This was done for the purpose of tracking aliases in secrets. I've updated this to always be `type(self).model_fields`-- see comment in-line for detail. --------- Co-authored-by: Sydney Runkle <54324534+sydney-runkle@users.noreply.github.com>	2025-03-31 14:22:57 -04:00
keshavshrikant	e8be3cca5c	fix huggingface tokenizer default length function (#30185 ) #30184	2025-03-31 11:54:30 -04:00
Wenqi Li	64f97e707e	ollama[patch]: Support seed param for OllamaLLM (#30553 ) Description: a description of the change add the seed param for OllamaLLM client reproducibility Issue: the issue # it fixes, if applicable follow up of a similar issue https://github.com/langchain-ai/langchain/issues/24703 see also https://github.com/langchain-ai/langchain/pull/24782 Dependencies: any dependencies required for this change n/a	2025-03-31 11:28:49 -04:00
Christophe Bornet	8395abbb42	core: Fix test_stream_error_callback (#30228 ) Fixes #29436 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-03-31 10:37:22 -04:00
Christophe Bornet	026de908eb	core: Add ruff rules G, FA, INP, AIR and ISC (#29334 ) Fixes mostly for rules G. See https://docs.astral.sh/ruff/rules/#flake8-logging-format-g	2025-03-31 10:05:23 -04:00

... 2 3 4 5 6 ...

7149 Commits