langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-12 06:13:36 +00:00

Author	SHA1	Message	Date
Christophe Bornet	6cd1aadf60	langchain: use mypy strict checking with exemptions (#31018 ) * Use strict checking and exclude some rules as TODOs * Fix imports not exposed in `__all__` Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-05-15 11:37:18 -04:00
Christophe Bornet	eab8484a80	text-splitters[patch]: fix some import-untyped errors (#31030 )	2025-05-15 11:34:22 -04:00
ccurme	672339f3c6	core: release 0.3.60 (#31249 )	2025-05-15 11:14:04 -04:00
ccurme	8b145d5dc3	openai: release 0.3.17 (#31246 )	2025-05-15 09:18:22 -04:00
Christophe Bornet	921573e2b7	core: Add ruff rules SLF (#30666 ) Add ruff rules SLF: https://docs.astral.sh/ruff/rules/#flake8-self-slf --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-05-14 18:42:39 +00:00
Sydney Runkle	7263011b24	perf[core]: remove unnecessary model validators (#31238 ) * Remove unnecessary cast of id -> str (can do with a field setting) * Remove unnecessary `set_text` model validator (can be done with a computed field, though we had to make some changes to the `Generation` class to make this possible) Before: ~2.4s (profiler screenshot: blue circles represent time spent in custom validators). After: ~2.2s (profiler screenshot). We still want to optimize the backwards compatible tool calls model validator (circled in green in the screenshot), though I think this might involve breaking changes, so wanted to separate that into a different PR.	2025-05-14 10:20:22 -07:00
Sydney Runkle	1523602196	packaging[core]: bump min pydantic version (#31239 ) Bumping to a version that's a year old, so seems like a reasonable bump.	2025-05-14 10:01:24 -07:00
Lope Ramos	b8ae2de169	langchain-core[patch]: `Incremental` record manager deletion should be batched (#31206 ) Description: Before this commit, if a single delete involved more than 32k rows on sqlite3 >= 3.32, or more than 999 rows on sqlite3 < 3.32, `record_manager.delete_keys()` failed because the generated query contained too many variables. This commit batches the delete operation using `cleanup_batch_size`, as is already done for `full` cleanup. Unit tests were also added for incremental mode with different deletion batch sizes.	2025-05-14 11:38:21 -04:00
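The batching idea in the commit above can be sketched with the standard-library `sqlite3` module. This is a minimal illustration, not the record manager's actual code: the `records` table, the `key` column, and the helper names `batched` and `delete_keys_batched` are assumptions made for the example; only the `cleanup_batch_size` name comes from the commit message.

```python
import sqlite3
from itertools import islice
from typing import Iterable, Iterator


def batched(keys: Iterable[str], size: int) -> Iterator[list[str]]:
    """Yield successive lists of at most `size` keys."""
    it = iter(keys)
    while chunk := list(islice(it, size)):
        yield chunk


def delete_keys_batched(
    conn: sqlite3.Connection, keys: Iterable[str], cleanup_batch_size: int = 500
) -> int:
    """Delete keys in batches so no single query exceeds SQLite's variable limit.

    Hypothetical schema: a table `records` with a text column `key`.
    """
    deleted = 0
    for chunk in batched(keys, cleanup_batch_size):
        # One "?" placeholder per key; the batch size bounds the variable count.
        placeholders = ",".join("?" for _ in chunk)
        cur = conn.execute(
            f"DELETE FROM records WHERE key IN ({placeholders})", chunk
        )
        deleted += cur.rowcount
    conn.commit()
    return deleted
```

Keeping the batch size below 999 makes the query safe on older SQLite builds as well as newer ones.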
Sydney Runkle	263c215112	perf[core]: remove generations summation from hot loop (#31231 ) 1. Removes summation of `ChatGenerationChunk` from hot loops in `stream` and `astream` 2. Removes run id gen from loop as well (minor impact) Again, benchmarking on processing ~200k chunks (a poem about broccoli). Before: ~4.2s (profiler screenshot: the blue circle is all the time spent adding up gen chunks). After: ~2.3s (profiler screenshot: the blue circle is the remaining time spent on adding chunks, which can be minimized in a future PR by optimizing the `merge_content`, `merge_dicts`, and `merge_lists` utilities).	2025-05-14 08:13:05 -07:00
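A rough sketch of the pattern described in the commit above: instead of adding each chunk to a running total inside the streaming loop, collect the chunks and merge them once after the loop ends. The `Chunk` class, `stream_deferred`, and the `on_end` callback are hypothetical stand-ins for illustration, not the types or functions used in langchain-core.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator


@dataclass
class Chunk:
    """Hypothetical stand-in for a streamed generation chunk."""
    text: str


def stream_deferred(
    chunks: Iterable[Chunk], on_end: Callable[[Chunk], None]
) -> Iterator[Chunk]:
    """Yield chunks as they arrive and build the aggregate only once at the end."""
    seen: list[Chunk] = []
    for chunk in chunks:
        seen.append(chunk)  # cheap append instead of a per-chunk merge
        yield chunk
    # Single merge outside the hot loop.
    on_end(Chunk("".join(c.text for c in seen)))
```

The caller still receives every chunk immediately; only the bookkeeping for the final aggregate moves out of the loop.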
Sydney Runkle	17b799860f	perf[core]: remove costly async helpers for non-end event handlers (#31230 ) 1. Remove `shielded` decorator from non-end event handlers 2. Exit early with a `self.handlers` check instead of doing unnecessary asyncio work Using a benchmark that processes ~200k chunks (a poem about broccoli). Before: ~15s (profiler screenshot: circled in blue is unnecessary event handling time, which is addressed by point 2 above). After: ~4.2s (profiler screenshot). The total time is largely reduced by the removal of the `shielded` decorator, which holds little significance for non-end handlers.	2025-05-14 07:42:56 -07:00
ccurme	0b8837a0cc	openai: support runtime kwargs in embeddings (#31195 )	2025-05-14 09:14:40 -04:00
ccurme	868cfc4a8f	openai: ignore function_calls if tool_calls are present (#31198 ) Some providers include (legacy) function calls in `additional_kwargs` in addition to tool calls. We currently unpack both function calls and tool calls if present, but OpenAI will raise 400 in this case. This can come up if providers are mixed in a tool-calling loop. Example:
```python
from langchain.chat_models import init_chat_model
from langchain_core.messages import HumanMessage
from langchain_core.tools import tool


@tool
def get_weather(location: str) -> str:
    """Get weather at a location."""
    return "It's sunny."


gemini = init_chat_model("google_genai:gemini-2.0-flash-001").bind_tools([get_weather])
openai = init_chat_model("openai:gpt-4.1-mini").bind_tools([get_weather])

input_message = HumanMessage("What's the weather in Boston?")
tool_call_message = gemini.invoke([input_message])

assert len(tool_call_message.tool_calls) == 1
tool_call = tool_call_message.tool_calls[0]
tool_message = get_weather.invoke(tool_call)

response = openai.invoke(  # currently raises 400 / BadRequestError
    [input_message, tool_call_message, tool_message]
)
```
Here we ignore function calls if tool calls are present.	2025-05-12 13:50:56 -04:00
Christophe Bornet	83d006190d	core: Fix some private member accesses (#30912 ) See https://github.com/langchain-ai/langchain/pull/30666 --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2025-05-12 17:42:26 +00:00
CtrlMj	1e56c66f86	core: Fix issue 31035 alias fields in base tool langchain core (#31112 ) Description: The 'inspect' package in Python skips over the aliases set in the schema of a pydantic model. This is a workaround to include the aliases from the original input. issue: #31035 Cc: @ccurme @eyurtsev --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-05-12 11:04:13 -04:00
meirk-brd	e6147ce5d2	docs: Add Brightdata integration documentation (#31114 ) Description: Integrated the Bright Data package to enable LangChain users to seamlessly incorporate Bright Data into their agents. Dependencies: None LinkedIn handle: [Bright Data](https://www.linkedin.com/company/bright-data) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-05-11 16:07:21 +00:00
ccurme	ff9183fd3c	docs: add Gel integration (#31186 ) Continued from https://github.com/langchain-ai/langchain/pull/31050 --------- Co-authored-by: deepbuzin <contactbuzin@gmail.com>	2025-05-11 10:17:18 -04:00
ccurme	77d3f04e0a	docs: add Aerospike to package registry (#31185 ) Missed as part of https://github.com/langchain-ai/langchain/pull/31156	2025-05-11 09:33:58 -04:00
Sumin Shin	683da2c9e9	text-splitters: Fix regex separator merge bug in CharacterTextSplitter (#31137 ) Description: Fix the merge logic in `CharacterTextSplitter.split_text` so that when using a regex lookahead separator (`is_separator_regex=True`) with `keep_separator=False`, the raw pattern is not re-inserted between chunks. Issue: Fixes #31136 Dependencies: None Twitter handle: None Since this is my first open-source PR, please feel free to point out any mistakes, and I'll be eager to make corrections.	2025-05-10 15:42:03 -04:00
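To illustrate the bug class described in the commit above: a regex lookahead matches a position rather than characters, so the split pieces already contain all of the original text and nothing needs to be put back when they are merged. The sketch below is a simplified stand-in, not `CharacterTextSplitter` itself; the function name `split_and_merge` and the rule "join with an empty string for any regex separator" are assumptions made for the example.

```python
import re


def split_and_merge(
    text: str, separator: str, chunk_size: int, is_separator_regex: bool = False
) -> list[str]:
    """Split on a separator, then greedily merge pieces up to chunk_size."""
    pattern = separator if is_separator_regex else re.escape(separator)
    pieces = [p for p in re.split(pattern, text) if p]
    # A raw regex pattern is never valid chunk text, so only a literal
    # separator is re-inserted between merged pieces (sketch assumption).
    joiner = "" if is_separator_regex else separator

    chunks: list[str] = []
    current = ""
    for piece in pieces:
        candidate = piece if not current else current + joiner + piece
        if current and len(candidate) > chunk_size:
            chunks.append(current)
            current = piece
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

With a lookahead such as `(?=#)`, concatenating the resulting chunks reproduces the input exactly, and the pattern text never appears in any chunk.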
ccurme	e9e597be8e	docs: update sort order in integrations table (#31171 )	2025-05-08 20:44:21 +00:00
ccurme	9aac8923a3	docs: add web search to anthropic docs (#31169 )	2025-05-08 16:20:11 -04:00
ccurme	2d202f9762	anthropic[patch]: split test into two (#31167 )	2025-05-08 09:23:36 -04:00
ccurme	d4555ac924	anthropic: release 0.3.13 (#31162 )	2025-05-08 03:13:15 +00:00
ccurme	e34f9fd6f7	anthropic: update streaming usage metadata (#31158 ) Anthropic updated how they report token counts during streaming today. See changes to `MessageDeltaUsage` in [this commit](`2da00f26c5 (diff-1a396eba0cd9cd8952dcdb58049d3b13f6b7768ead1411888d66e28211f7bfc5)`). It's clean and simple to grab these fields from the final `message_delta` event. However, some of them are typed as Optional, and language [here](`e42451ab3f/src/anthropic/lib/streaming/_messages.py (L462)`) suggests they may not always be present. So here we take the required field from the `message_delta` event as we were doing previously, and ignore the rest.	2025-05-07 23:09:56 -04:00
ccurme	682f338c17	anthropic[patch]: support web search (#31157 )	2025-05-07 18:04:06 -04:00
ccurme	d7e016c5fc	huggingface: release 0.2 (#31153 )	2025-05-07 15:33:07 -04:00
ccurme	4b11cbeb47	huggingface[patch]: update lockfile (#31152 )	2025-05-07 15:17:33 -04:00
ccurme	b5b90b5929	anthropic[patch]: be robust to null fields when translating usage metadata (#31151 )	2025-05-07 18:30:21 +00:00
ccurme	f70b263ff3	core: release 0.3.59 (#31150 )	2025-05-07 17:36:59 +00:00
ccurme	bb69d4c42e	docs: specify js support for tavily (#31149 )	2025-05-07 11:30:04 -04:00
zhurou603	1df3ee91e7	partners: (langchain-openai) total_tokens should not add 'Nonetype' t… (#31146 ) Description: Fixed an issue in `langchain-openai` where computing `total_tokens` added `None` to an integer, causing a TypeError. The fix checks the types of the token counts before adding them. Issue: Fixes the TypeError where `'NoneType'` cannot be added to an integer (the traceback is shown in a screenshot attached to the PR). Dependencies: None Twitter handle: None Co-authored-by: qiulijie <qiulijie@yuaiweiwu.com>	2025-05-07 11:09:50 -04:00
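A minimal sketch of the kind of guard described in the commit above, assuming the token counts may arrive as `None`. The function name `safe_total_tokens` and its parameters are hypothetical and do not reflect the actual code in `langchain-openai`.

```python
from typing import Optional


def safe_total_tokens(
    input_tokens: Optional[int], output_tokens: Optional[int]
) -> int:
    """Add two token counts, treating a missing (None) count as zero."""
    return (input_tokens or 0) + (output_tokens or 0)
```

Without the `or 0` fallback, `None + 5` raises `TypeError: unsupported operand type(s) for +: 'NoneType' and 'int'`.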
Collier King	19041dcc95	docs: update langchain-cloudflare repo/path on packages.yaml (#31138 ) Library Repo Path Update : "langchain-cloudflare" We recently changed our `langchain-cloudflare` repo to allow for future libraries. Created a `libs` folder to hold `langchain-cloudflare` python package. https://github.com/cloudflare/langchain-cloudflare/tree/main/libs/langchain-cloudflare On `langchain`, updating `packages.yaml` to point to new `libs/langchain-cloudflare` library folder.	2025-05-07 11:01:25 -04:00
Jacob Lee	66d1ed6099	fix(core): Permit OpenAI style blocks to be passed into convert_to_openai_messages (#31140 ) Should effectively be a noop, just shouldn't throw CC @madams0013 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-05-07 10:57:37 -04:00
唐小鸭	50fa524a6d	partners: (langchain-deepseek) fix deepseek-r1 always returns an empty `reasoning_content` when reasoning (#31065 ) ## Description While thinking, deepseek-r1 returns an empty string as `reasoning_content` in the first chunk; once thinking is over, it sets `reasoning_content` to None, which signals the switch to normal output. Whether `reasoning_content` is present should therefore be decided by checking for None. ## Demo deepseek-r1 reasoning output:
```
{'delta': {'content': None, 'function_call': None, 'refusal': None, 'role': 'assistant', 'tool_calls': None, 'reasoning_content': ''}, 'finish_reason': None, 'index': 0, 'logprobs': None}
{'delta': {'content': None, 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': '好的'}, 'finish_reason': None, 'index': 0, 'logprobs': None}
{'delta': {'content': None, 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': '，'}, 'finish_reason': None, 'index': 0, 'logprobs': None}
{'delta': {'content': None, 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': '用户'}, 'finish_reason': None, 'index': 0, 'logprobs': None}
...
```
deepseek-r1 first normal output
```
...
{'delta': {'content': ' main', 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': None}, 'finish_reason': None, 'index': 0, 'logprobs': None}
{'delta': {'content': '\n\nimport', 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': None}, 'finish_reason': None, 'index': 0, 'logprobs': None}
...
```
--------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-05-05 22:31:58 +00:00
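The distinction matters because an empty string is falsy in Python, so a plain truthiness test would treat the first reasoning chunk as if reasoning had not started. A small sketch, using the delta shape from the demo output above; the helper name `is_reasoning_chunk` is hypothetical and not part of `langchain-deepseek`.

```python
from typing import Any


def is_reasoning_chunk(delta: dict[str, Any]) -> bool:
    """True while the model is still reasoning.

    Compares against None explicitly: '' (the first reasoning chunk)
    must still count as reasoning, and only None marks normal output.
    """
    return delta.get("reasoning_content") is not None
```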
Stefano Lottini	325f729a92	docs: improvements to Astra DB pages, especially modernize Vector DB example notebook (#30961 ) This PR brings several improvements and modernizations to the documentation around the Astra DB partner package. - language alignment for better matching with the terms used in the Astra DB docs - updated several links to pages on said documentation - for the `AstraDBVectorStore`, added mentions of the new features in the overall `astra.mdx` - for the vector store, rewritten/upgraded most of the usage example notebook for a more straightforward experience able to highlight the main usage patterns (including new ones such as the newly-introduced "autodetect feature") --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-05-03 14:26:52 -04:00
Asif Mehmood	00ac49dd3e	Replace deprecated .dict() with .model_dump() for Pydantic v2 compatibility (#31107 ) What does this PR do? This PR replaces deprecated usages of `.dict()` with `.model_dump()` to ensure compatibility with Pydantic v2 and prepare for v3, addressing the deprecation warning `PydanticDeprecatedSince20` as required in [Issue #31103](https://github.com/langchain-ai/langchain/issues/31103). Changes made: * Replaced `.dict()` with `.model_dump()` in multiple locations * Ensured consistency with Pydantic v2 migration guidelines * Verified compatibility across affected modules Notes * This is a code maintenance and compatibility update * Tested locally with Pydantic v2.11 * No functional logic changes; only internal method replacements to prevent deprecation issues	2025-05-03 13:40:54 -04:00
ccurme	6268ae8db0	langchain: release 0.3.25 (#31101 )	2025-05-02 17:42:32 +00:00
ccurme	77ecf47f6d	openai: release 0.3.16 (#31100 )	2025-05-02 13:14:46 -04:00
ccurme	ff41f47e91	core: release 0.3.58 (#31099 )	2025-05-02 12:46:32 -04:00
Eugene Yurtsev	4da525bc63	langchain[patch]: Remove beta decorator from init_embeddings (#31098 ) Remove beta decorator from init_embeddings.	2025-05-02 11:52:50 -04:00
ccurme	94139ffcd3	openai[patch]: format system content blocks for Responses API (#31096 )
```python
from langchain_core.messages import HumanMessage, SystemMessage
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4.1", use_responses_api=True)

messages = [
    SystemMessage("test"),  # Works
    HumanMessage("test"),  # Works
    SystemMessage([{"type": "text", "text": "test"}]),  # Bug in this case
    HumanMessage([{"type": "text", "text": "test"}]),  # Works
    SystemMessage([{"type": "input_text", "text": "test"}])  # Works
]

llm._get_request_payload(messages)
```
	2025-05-02 15:22:30 +00:00
ccurme	26ad239669	core, openai[patch]: prefer provider-assigned IDs when aggregating message chunks (#31080 ) When aggregating AIMessageChunks in a stream, core prefers the leftmost non-null ID. This is problematic because: - Core assigns IDs when they are null to `f"run-{run_manager.run_id}"` - The desired meaningful ID might not be available until midway through the stream, as is the case for the OpenAI Responses API. For the OpenAI Responses API, we assign message IDs to the top-level `AIMessage.id`. This works in `.(a)invoke`, but during `.(a)stream` the IDs get overwritten by the defaults assigned in langchain-core. These IDs [must](https://community.openai.com/t/how-to-solve-badrequesterror-400-item-rs-of-type-reasoning-was-provided-without-its-required-following-item-error-in-responses-api/1151686/9) be available on the AIMessage object to support passing reasoning items back to the API (e.g., if not using OpenAI's `previous_response_id` feature). We could add them elsewhere, but seeing as we've already made the decision to store them in `.id` during `.(a)invoke`, addressing the issue in core lets us fix the problem with no interface changes.	2025-05-02 11:18:18 -04:00
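One way to picture the preference described in the commit above: when merging the IDs seen across a stream, skip the `run-` defaults that core assigns and take the first ID that looks provider-assigned, falling back to a default only if nothing else appears. This is a sketch under that reading of the commit message; the function `pick_message_id` is hypothetical and is not the merge logic in langchain-core.

```python
from typing import Optional, Sequence

# Per the commit message, core assigns default IDs of the form f"run-{run_id}".
DEFAULT_ID_PREFIX = "run-"


def pick_message_id(ids: Sequence[Optional[str]]) -> Optional[str]:
    """Prefer the first provider-assigned ID; fall back to the first default."""
    first_default: Optional[str] = None
    for candidate in ids:
        if not candidate:
            continue  # chunk carried no ID
        if not candidate.startswith(DEFAULT_ID_PREFIX):
            return candidate  # provider-assigned, even if it arrives mid-stream
        if first_default is None:
            first_default = candidate
    return first_default
```

Under a "leftmost non-null wins" rule, the first example in the tests below would instead resolve to the `run-` default.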
William FH	b5bf2d6218	0.3.57 (#31095 )	2025-05-01 23:42:26 -07:00
William FH	167afa5102	Enable run mutation (#31090 ) This lets you more easily modify a run in-flight	2025-05-01 17:00:51 -07:00
ccurme	c51eadd54f	openai[patch]: propagate service_tier to response metadata (#31089 )	2025-05-01 13:50:48 -04:00
ccurme	6110c3ffc5	openai[patch]: release 0.3.15 (#31087 )	2025-05-01 09:22:30 -04:00
Ben Gladwell	da59eb7eb4	anthropic: Allow kwargs to pass through when counting tokens (#31082 ) - Description: `ChatAnthropic.get_num_tokens_from_messages` does not currently receive `kwargs` and pass those on to `self._client.beta.messages.count_tokens`. This is a problem if you need to pass specific options to `count_tokens`, such as the `thinking` option. This PR fixes that. - Issue: N/A - Dependencies: None - Twitter handle: @bengladwell Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-30 17:56:22 -04:00
Really Him	918c950737	DOCS: `partners/chroma`: Fix documentation around `chroma` query filter syntax (#31058 ) Description: Starting to put together some PRs to fix the typing around `langchain-chroma` `filter` and `where_document` query filtering, as mentioned in https://github.com/langchain-ai/langchain/issues/30879 and https://github.com/langchain-ai/langchain/issues/30507. The typing of `dict[str, str]` is on the one hand too restrictive (it marks valid filter expressions as ill-typed) and on the other too permissive (it allows illegal filter expressions). That's not what this PR addresses, though. This PR just removes from the documentation some examples of filters that are illegal or syntactically incorrect: (a) dictionaries with keys like `$contains` where the key is missing quotation marks; (b) dictionaries with multiple entries, such as `{"foo": "bar", "qux": "baz"}`, which are illegal in Chroma filter syntax and will raise an exception. Filter dictionaries in Chroma must have one and only one key. Again, this is just the documentation issue, which is the lowest-hanging fruit. I also think we need to update the types for `filter` and `where_document` to be at the very least `dict[str, Any]`, or, since we have access to Chroma's types, `Where` and `WhereDocument`. This has a wider blast radius, though, so I'm starting small. This PR does not fix the issues mentioned above; it just starts to get the ball rolling and cleans up the documentation. --------- Co-authored-by: Really Him <hesereallyhim@proton.me>	2025-04-30 17:51:07 -04:00
yberber-sap	952a0b7b40	Docs: Fix SAP HANA Cloud docs - remove pip output, update vectorstore link, rename provider (#31077 ) This PR includes the following documentation fixes for the SAP HANA Cloud vector store integration: - Removed stale output from the `%pip install` code cell. - Replaced an unrelated vectorstore documentation link on the provider overview page. - Renamed the provider from "SAP HANA" to "SAP HANA Cloud"	2025-04-30 08:57:40 -04:00
ccurme	bdb7c4a8b3	huggingface: fix embeddings return type (#31072 ) Integration tests failing cc @hanouticelina	2025-04-29 18:45:04 +00:00
célina	868f07f8f4	partners: (langchain-huggingface) Chat Models - Integrate Hugging Face Inference Providers and remove deprecated code (#30733 ) Hi there, I'm Célina from 🤗, This PR introduces support for Hugging Face's serverless Inference Providers (documentation [here](https://huggingface.co/docs/inference-providers/index)), allowing users to specify different providers for chat completion and text generation tasks. This PR also removes the usage of `InferenceClient.post()` method in `HuggingFaceEndpoint`, in favor of the task-specific `text_generation` method. `InferenceClient.post()` is deprecated and will be removed in `huggingface_hub v0.31.0`. --- ## Changes made - bumped the minimum required version of the `huggingface-hub` package to ensure compatibility with the latest API usage. - added a `provider` field to `HuggingFaceEndpoint`, enabling users to select the inference provider (e.g., 'cerebras', 'together', 'fireworks-ai'). Defaults to `hf-inference` (HF Inference API). - replaced the deprecated `InferenceClient.post()` call in `HuggingFaceEndpoint` with the task-specific `text_generation` method for future-proofing, `post()` will be removed in huggingface-hub v0.31.0. - updated the `ChatHuggingFace` component: - added async and streaming support. - added support for tool calling. - exposed underlying chat completion parameters for more granular control. - Added integration tests for `ChatHuggingFace` and updated the corresponding unit tests. ✅ All changes are backward compatible. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-29 09:53:14 -04:00
ccurme	3072e4610a	community: move to separate repo (continued) (#31069 ) Missed these after merging	2025-04-29 09:25:32 -04:00
ccurme	9ff5b5d282	community: move to separate repo (#31060 ) langchain-community is moving to https://github.com/langchain-ai/langchain-community	2025-04-29 09:22:04 -04:00
Sydney Runkle	7e926520d5	packaging: remove Python upper bound for langchain and co libs (#31025 ) Follow up to https://github.com/langchain-ai/langsmith-sdk/pull/1696, I've bumped the `langsmith` version where applicable in `uv.lock`. Type checking problems here because deps have been updated in `pyproject.toml` and `uv lock` hasn't been run - we should enforce that in the future - goes with the other dependabot todos :).	2025-04-28 14:44:28 -04:00
Sydney Runkle	d614842d23	ci: temporarily run chroma on 3.12 for CI (#31056 ) Waiting on a fix for https://github.com/chroma-core/chroma/issues/4382	2025-04-28 13:20:37 -04:00
Christophe Bornet	aee7988a94	community: add mypy warn_unused_ignores rule (#30816 )	2025-04-28 11:54:12 -04:00
Bae-ChangHyun	a2863f8757	community: add 'get_col_comments' option for retrieve database columns comments (#30646 ) ## Description Added support for retrieving column comments in the SQL Database utility. This feature allows users to see comments associated with database columns when querying table information. Column comments provide valuable metadata that helps LLMs better understand the semantics and purpose of database columns. A new optional parameter `get_col_comments` was added to the `get_table_info` method, defaulting to `False` for backward compatibility. When set to `True`, it retrieves and formats column comments for each table. Currently, this feature is supported on PostgreSQL, MySQL, and Oracle databases. ## Implementation Create the table with column comments beforehand.
```python
db = SQLDatabase.from_uri("YOUR_DB_URI")
print(db.get_table_info(get_col_comments=True))
```
## Result
```
CREATE TABLE test_table ( name VARCHAR school VARCHAR) /* Column Comments: {'name': person name, 'school":school_name} / / 3 rows from test_table: name a b c */
```
## Benefits 1. Enhances LLM's understanding of database schema semantics 2. Preserves valuable domain knowledge embedded in database design 3. Improves accuracy of SQL query generation 4. Provides more context for data interpretation Tests are available in `langchain/libs/community/tests/test_sql_get_table_info.py`. --------- Co-authored-by: chbae <chbae@gcsc.co.kr> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-28 15:19:46 +00:00
yberber-sap	3fb0a55122	Deprecate HanaDB, HanaTranslator and update example notebook to use new implementation (#30896 ) - Description: This PR marks the `HanaDB` vector store (and related utilities) in `langchain_community` as deprecated using the `@deprecated` annotation. - Set `since="0.1.0"` and `removal="1.0"` - Added a clear migration path and a link to the SAP-maintained replacement in the [`langchain_hana`](https://github.com/SAP/langchain-integration-for-sap-hana-cloud) package. Additionally, the example notebook has been updated to use the new `HanaDB` class from `langchain_hana`, ensuring users follow the recommended integration moving forward. - Issue: None - Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-27 16:37:35 -04:00
湛露先生	5fb8fd863a	langchain_openai: clean duplicate code for openai embedding. (#30872 ) `_chunk_size` is not changed by the method `self._tokenize`, so I think this is duplicate code. Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>	2025-04-27 15:07:41 -04:00
ccurme	ba2518995d	standard-tests: add condition for image tool message test (#31041 ) Require support for [standard format](https://python.langchain.com/docs/how_to/multimodal_inputs/).	2025-04-27 17:24:43 +00:00
ccurme	04a899ebe3	infra: support third-party integration packages in API ref build (#31021 )	2025-04-25 16:02:27 -04:00
ccurme	a60fd06784	docs: document OpenAI flex processing (#31023 ) Following https://github.com/langchain-ai/langchain/pull/31005	2025-04-25 15:10:25 -04:00
ccurme	629b7a5a43	openai[patch]: add explicit attribute for service tier (#31005 )	2025-04-25 18:38:23 +00:00
ccurme	ab871a7b39	docs: enable milvus in API ref build (#31016 ) Reverts langchain-ai/langchain#30996 Should be fixed following https://github.com/langchain-ai/langchain-milvus/pull/68	2025-04-25 12:48:10 +00:00
Georgi Stefanov	d30c56a8c1	langchain: return attachments in _get_response (#30853 ) This PR returns the message attachments in _get_response. Previously, when files were generated, these attachments were not returned, so the generated files could not be retrieved. Fixes issue: https://github.com/langchain-ai/langchain/issues/30851	2025-04-24 21:39:11 -04:00
ccurme	a7903280dd	openai[patch]: delete redundant tests (#31004 ) These are covered by standard tests.	2025-04-24 17:56:32 +00:00
Kyle Jeong	d0f0d1f966	[docs/community]: langchain docs + browserbaseloader fix (#30973 ) community: fix browserbase integration docs: update docs - Description: Updated BrowserbaseLoader to use the new Python SDK. - Issue: update browserbase integration with langchain - Dependencies: n/a - Twitter handle: @kylejeong21	2025-04-24 13:38:49 -04:00
ccurme	403fae8eec	core: release 0.3.56 (#31000 )	2025-04-24 13:22:31 -04:00
ccurme	10a9c24dae	openai: fix streaming reasoning without summaries (#30999 ) Following https://github.com/langchain-ai/langchain/pull/30909: need to retain "empty" reasoning output when streaming, e.g., ```python {'id': 'rs_...', 'summary': [], 'type': 'reasoning'} ``` Tested by existing integration tests, which are currently failing.	2025-04-24 16:01:45 +00:00
ccurme	8fc7a723b9	core: release 0.3.56rc1 (#30998 )	2025-04-24 15:09:44 +00:00
ccurme	f4863f82e2	core[patch]: fix edge cases for _is_openai_data_block (#30997 )	2025-04-24 10:48:52 -04:00
Jacob Lee	6b0b317cb5	feat(core): Autogenerate filenames for when converting file content blocks to OpenAI format (#30984 ) CC @ccurme --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-24 13:36:31 +00:00
ccurme	21962e2201	docs: temporarily disable milvus in API ref build (#30996 )	2025-04-24 09:31:23 -04:00
Behrad Hemati	1eb0bdadfa	community: add indexname to other functions in opensearch (#30987 ) Description: add the ability to override the index name if it is provided in the kwargs of sub-functions. When used in a WSGI application, it's crucial to be able to change parameters dynamically.	2025-04-24 08:59:33 -04:00
Nicky Parseghian	7ecdac5240	community: Strip URLs from sitemap. (#30830 ) Fixes #30829 - Description: Simply strips the loc value when building the element. - Issue: Fixes #30829	2025-04-23 18:18:42 -04:00
ccurme	faef3e5d50	core, standard-tests: support PDF and audio input in Chat Completions format (#30979 ) Chat models currently implement support for: - images in OpenAI Chat Completions format - other multimodal types (e.g., PDF and audio) in a cross-provider [standard format](https://python.langchain.com/docs/how_to/multimodal_inputs/) Here we update core to extend support to PDF and audio input in Chat Completions format. If an OAI-format PDF or audio content block is passed into any chat model, it will be transformed to the LangChain standard format. We assume that any chat model supporting OAI-format PDF or audio has implemented support for the standard format.	2025-04-23 18:32:51 +00:00
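The transformation described in the commit above can be pictured roughly as follows. The block shapes used here are assumptions made for illustration (an OpenAI-style `file` block carrying a base64 data URI in `file_data`, an `input_audio` block carrying `data` and `format`, and a standard block with `source_type`, `data`, and `mime_type` keys); the function `to_standard_block` is hypothetical and is not the converter in langchain-core.

```python
import re
from typing import Any

# Matches data URIs such as "data:application/pdf;base64,JVBERi0x..."
_DATA_URI = re.compile(r"^data:(?P<mime>[^;]+);base64,(?P<data>.+)$")


def to_standard_block(block: dict[str, Any]) -> dict[str, Any]:
    """Convert an assumed OpenAI-style PDF/audio block to an assumed standard shape.

    Blocks that are not recognized are returned unchanged.
    """
    if block.get("type") == "input_audio":
        audio = block["input_audio"]
        return {
            "type": "audio",
            "source_type": "base64",
            "data": audio["data"],
            "mime_type": f"audio/{audio['format']}",
        }
    if block.get("type") == "file":
        match = _DATA_URI.match(block.get("file", {}).get("file_data", ""))
        if match:
            return {
                "type": "file",
                "source_type": "base64",
                "data": match["data"],
                "mime_type": match["mime"],
            }
    return block
```

Returning unrecognized blocks untouched keeps the conversion a no-op for content that is already in the target format.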
Bagatur	d4fc734250	core[patch]: update dict prompt template (#30967 ) Align with JS changes made in https://github.com/langchain-ai/langchainjs/pull/8043	2025-04-23 10:04:50 -07:00
ccurme	4bc70766b5	core, openai: support standard multi-modal blocks in convert_to_openai_messages (#30968 )	2025-04-23 11:20:44 -04:00
ccurme	e4877e5ef1	fireworks: release 0.3.0 (#30977 )	2025-04-23 10:08:38 -04:00
Christophe Bornet	8c5ae108dd	text-splitters: Set strict mypy rules (#30900 ) * Add strict mypy rules * Fix mypy violations * Add error codes to all type ignores * Add ruff rule PGH003 * Bump mypy version to 1.15	2025-04-22 20:41:24 -07:00
ccurme	eedda164c6	fireworks[minor]: remove default model and temperature (#30965 ) `mixtral-8x-7b-instruct` was recently retired from Fireworks Serverless. Here we remove the default model altogether, so that the model must be explicitly specified on init: ```python ChatFireworks(model="accounts/fireworks/models/llama-v3p1-70b-instruct") # for example ``` We also set a null default for `temperature`, which previously defaulted to 0.0. This parameter will no longer be included in request payloads unless it is explicitly provided.	2025-04-22 15:58:58 -04:00
CLOVA Studio 개발	577cb53a00	community: update Naver integration to use langchain-naver package and improve documentation (#30956 ) ## Description: This PR was requested after the `langchain-naver` partner-managed packages were completed. We built our package as requested in [this comment](https://github.com/langchain-ai/langchain/pull/29243#issuecomment-2595222791) and the initial version is now uploaded to [pypi](https://pypi.org/project/langchain-naver/). So we've updated some of our documents to cover the changed features and how to download our partner-managed package. ## Dependencies: https://github.com/langchain-ai/langchain/pull/29243#issuecomment-2595222791 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-22 12:00:10 -04:00
ccurme	a7c1bccd6a	openai[patch]: remove xfails from image token counting tests (#30963 ) These appear to be passing again.	2025-04-22 15:55:33 +00:00
ccurme	25d77aa8b4	community: release 0.3.22 (#30962 )	2025-04-22 15:34:47 +00:00
ccurme	59fd4cb4c0	docs: update package registry sort order (#30960 )	2025-04-22 15:27:32 +00:00
ccurme	b8c454b42b	langchain: release 0.3.24 (#30959 )	2025-04-22 11:23:34 -04:00
Dmitrii Rashchenko	a43df006de	Support of openai reasoning summary streaming (#30909 ) langchain_openai: Support of reasoning summary streaming Description: The OpenAI API now supports streaming reasoning summaries for reasoning models (o1, o3, o3-mini, o4-mini). More info about it: https://platform.openai.com/docs/guides/reasoning#reasoning-summaries This is supported only in the Responses API (not the Chat Completions API), so the LangChain OpenAI model must be created as follows to stream reasoning summaries: ``` llm = ChatOpenAI( model="o4-mini", # also o1, o3, o3-mini support reasoning streaming use_responses_api=True, # reasoning streaming works only with responses api, not completion api model_kwargs={ "reasoning": { "effort": "high", # also "low" and "medium" supported "summary": "auto" # some models support "concise" summary, some "detailed", but auto will always work } } ) ``` Now, if you stream events from llm: ``` async for event in llm.astream_events(prompt, version="v2"): print(event) ``` or ``` for chunk in llm.stream(prompt): print(chunk) ``` the OpenAI API will send new types of events: `response.reasoning_summary_text.added` `response.reasoning_summary_text.delta` `response.reasoning_summary_text.done` These events are new, so they were previously ignored. This PR adds support for them in the function `_convert_responses_chunk_to_generation_chunk`, so that reasoning chunks or the full reasoning summary are added to the chunk's additional_kwargs.
Example of how this reasoning summary may be printed: ``` async for event in llm.astream_events(prompt, version="v2"): if event["event"] == "on_chat_model_stream": chunk: AIMessageChunk = event["data"]["chunk"] if "reasoning_summary_chunk" in chunk.additional_kwargs: print(chunk.additional_kwargs["reasoning_summary_chunk"], end="") elif "reasoning_summary" in chunk.additional_kwargs: print("\n\nFull reasoning step summary:", chunk.additional_kwargs["reasoning_summary"]) elif chunk.content and chunk.content[0]["type"] == "text": print(chunk.content[0]["text"], end="") ``` or ``` for chunk in llm.stream(prompt): if "reasoning_summary_chunk" in chunk.additional_kwargs: print(chunk.additional_kwargs["reasoning_summary_chunk"], end="") elif "reasoning_summary" in chunk.additional_kwargs: print("\n\nFull reasoning step summary:", chunk.additional_kwargs["reasoning_summary"]) elif chunk.content and chunk.content[0]["type"] == "text": print(chunk.content[0]["text"], end="") ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-22 14:51:13 +00:00
Alexander Ng	0f6fa34372	Community: Valyu Integration docs (#30926 ) PR title: docs: add Valyu integration documentation Description: This PR adds documentation and example notebooks for the Valyu integration, including retriever and tool usage. Issue: N/A Dependencies: No new dependencies. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-21 17:43:00 -04:00
ccurme	8574442c57	core[patch]: release 0.3.55 (#30952 )	2025-04-21 17:56:24 +00:00
ccurme	920d504e47	fireworks[patch]: update model in LLM integration tests (#30951 ) `mixtral-8x7b-instruct` has been retired.	2025-04-21 17:53:27 +00:00
Anton Masalovich	1f3054502e	community: fix cost calculations for 4.1 and o4 in OpenAI callback (#30899 ) Issue: #30898	2025-04-21 10:59:47 -04:00
Ahmed Tammaa	589bc19890	anthropic[patch]: make description optional on AnthropicTool (#30935 ) PR Summary This change adds a fallback in ChatAnthropic.with_structured_output() to handle Pydantic models that don’t include a docstring. Without it, calling: ```py from pydantic import BaseModel from langchain_anthropic import ChatAnthropic class SampleModel(BaseModel): sample_field: str llm = ChatAnthropic( model="claude-3-7-sonnet-latest" ).with_structured_output(SampleModel.model_json_schema()) llm.invoke("test") ``` will raise a ``` KeyError: 'description' ``` because Pydantic omits the description field when no docstring is present. This issue doesn’t occur when using ChatOpenAI or if you add a docstring to the model: ```py from pydantic import BaseModel from langchain_openai import ChatOpenAI class SampleModel(BaseModel): """Schema for sample_field output.""" sample_field: str llm = ChatOpenAI( model="gpt-4o-mini" ).with_structured_output(SampleModel.model_json_schema()) llm.invoke("test") ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-21 10:44:39 -04:00
Nuno Campos	27296bdb0c	core: Make Graph.Node.data optional (#30943 )	2025-04-21 07:18:36 -07:00
Ahmed Tammaa	de56c31672	core: Improve OutputParser error messaging when model output is truncated (max_tokens) (#30936 ) Addresses #30158 When using the output parser—either in a chain or standalone—hitting max_tokens triggers a misleading “missing variable” error instead of indicating the output was truncated. This subtle bug often surfaces with Anthropic models. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-21 10:06:18 -04:00
xsai9101	335f089d6a	Community: Add bind variable support for oracle adb docloader (#30937 ) PR title: Community: Add bind variable support for oracle adb docloader Description: This PR adds support for bind variables to the Oracle ADB document loader class, along with a minor documentation change. Issue: N/A Dependencies: No new dependencies.	2025-04-21 08:47:33 -04:00
Aubrey Ford	23f701b08e	langchain_community: OpenAIEmbeddings not respecting chunk_size argument (#30946 ) This is a follow-on PR to go with the identical changes that were made in partners/openai. Previous PR: https://github.com/langchain-ai/langchain/pull/30757 When calling embed_documents and providing a chunk_size argument, that argument is ignored when OpenAIEmbeddings is instantiated with its default configuration (where check_embedding_ctx_length=True). _get_len_safe_embeddings specifies a chunk_size parameter but it's not being passed through in embed_documents, which is its only caller. This appears to be an oversight, especially given that the _get_len_safe_embeddings docstring states it should respect "the set embedding context length and chunk size." Developers typically expect method parameters to take effect (also, take precedence) when explicitly provided, especially when instantiating using defaults. I was confused as to why my API calls were being rejected regardless of the chunk size I provided.	2025-04-21 08:39:07 -04:00
Aubrey Ford	b344f34635	partners/openai: OpenAIEmbeddings not respecting chunk_size argument (#30757 ) When calling `embed_documents` and providing a `chunk_size` argument, that argument is ignored when `OpenAIEmbeddings` is instantiated with its default configuration (where `check_embedding_ctx_length=True`). `_get_len_safe_embeddings` specifies a `chunk_size` parameter but it's not being passed through in `embed_documents`, which is its only caller. This appears to be an oversight, especially given that the `_get_len_safe_embeddings` docstring states it should respect "the set embedding context length and chunk size." Developers typically expect method parameters to take effect (also, take precedence) when explicitly provided, especially when instantiating using defaults. I was confused as to why my API calls were being rejected regardless of the chunk size I provided. This bug also exists in langchain_community package. I can add that to this PR if requested otherwise I will create a new one once this passes.	2025-04-18 15:27:27 -04:00
Konsti-s	017c8079e1	partners: ChatAnthropic supports urls (#30809 ) Description: partners-anthropic: ChatAnthropic supports b64 and urls in the part[image_url][url] message variable Issue: ChatAnthropic right now only supports b64 encoded images in the part[image_url][url] message variable. This PR enables ChatAnthropic to also accept image urls in said variable and makes it compatible with OpenAI messages to make model switching easier. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-18 15:15:45 -04:00
Volodymyr Tkachuk	d0cd115356	community: Add deprecation decorator to SingleStore community integrations (#30846 ) The SingleStore integration now has its own package, `langchain-singlestore`, so the community implementation will no longer be maintained. Added the `deprecated` decorator to the `SingleStoreDBChatMessageHistory`, `SingleStoreDBSemanticCache`, and `SingleStoreDB` classes in the community package. Dependencies: https://github.com/langchain-ai/langchain/pull/30841 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-18 12:58:39 -04:00
Alejandro Rodríguez	34ddfba76b	community: support usage_metadata for litellm streaming calls (#30683 ) Support "usage_metadata" for LiteLLM streaming calls. This is a follow-up to https://github.com/langchain-ai/langchain/pull/30625, which tackled non-streaming calls.	2025-04-18 12:50:32 -04:00
Volodymyr Tkachuk	5ffcd01c41	docs: Register langchain-singlestore integration (#30841 ) I created and published the `langchain-singlestore` integration package, which should replace the SingleStoreDB community implementation.	2025-04-18 12:11:33 -04:00
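
The `chunk_size` fix described in #30757 and #30946 comes down to forwarding an explicitly provided argument to the helper that does the batching. The sketch below illustrates that pattern only. `ToyEmbeddings`, `_embed_batch`, and `batch_sizes` are hypothetical stand-ins and not the real `OpenAIEmbeddings` code; only the method names `embed_documents` and `_get_len_safe_embeddings` come from the commit messages, and their signatures here are assumptions.

```python
from typing import List, Optional


class ToyEmbeddings:
    """Hypothetical stand-in for an embeddings class; makes no network calls."""

    def __init__(self, chunk_size: int = 1000) -> None:
        self.chunk_size = chunk_size
        # Records the size of each simulated API request, for inspection.
        self.batch_sizes: List[int] = []

    def _embed_batch(self, batch: List[str]) -> List[List[float]]:
        # Simulated API call: one fake 1-dimensional vector per text.
        self.batch_sizes.append(len(batch))
        return [[float(len(text))] for text in batch]

    def _get_len_safe_embeddings(
        self, texts: List[str], chunk_size: Optional[int] = None
    ) -> List[List[float]]:
        # An explicit chunk_size takes precedence over the instance default.
        size = chunk_size or self.chunk_size
        vectors: List[List[float]] = []
        for start in range(0, len(texts), size):
            vectors.extend(self._embed_batch(texts[start : start + size]))
        return vectors

    def embed_documents(
        self, texts: List[str], chunk_size: Optional[int] = None
    ) -> List[List[float]]:
        # The pattern from the fix: pass chunk_size through instead of dropping it.
        return self._get_len_safe_embeddings(texts, chunk_size=chunk_size)


emb = ToyEmbeddings(chunk_size=1000)
vectors = emb.embed_documents(["a", "bb", "ccc", "dddd", "eeeee"], chunk_size=2)
print(emb.batch_sizes)
```

If `embed_documents` did not forward the argument, all five texts would go out in a single request sized by the instance default, which matches the behavior the commit author reports.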

7078 Commits