langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-10 21:35:08 +00:00

Author	SHA1	Message	Date
Hugh Gao	aa6dae4a5b	community: Remove the system message count limit for ChatTongyi. (#30192 ) ## Description The models in DashScope support multiple SystemMessage. Here is the [Doc](https://bailian.console.aliyun.com/model_experience_center/text#/model-market/detail/qwen-long?tabKey=sdk), and the example code on the document page: ```python import os from openai import OpenAI client = OpenAI( api_key=os.getenv("DASHSCOPE_API_KEY"), # 如果您没有配置环境变量，请在此处替换您的API-KEY base_url="https://dashscope.aliyuncs.com/compatible-mode/v1", # 填写DashScope服务base_url ) # 初始化messages列表 completion = client.chat.completions.create( model="qwen-long", messages=[ {'role': 'system', 'content': 'You are a helpful assistant.'}, # 请将 'file-fe-xxx'替换为您实际对话场景所使用的 file-id。 {'role': 'system', 'content': 'fileid://file-fe-xxx'}, {'role': 'user', 'content': '这篇文章讲了什么？'} ], stream=True, stream_options={"include_usage": True} ) full_content = "" for chunk in completion: if chunk.choices and chunk.choices[0].delta.content: # 拼接输出内容 full_content += chunk.choices[0].delta.content print(chunk.model_dump()) print({full_content}) ``` Tip: The example code is for OpenAI, but the document said that it also supports the DataScope API, and I tested it, and it works. ``` Is the Dashscope SDK invocation method compatible? Yes, the Dashscope SDK remains compatible for model invocation. However, file uploads and file-ID retrieval are currently only supported via the OpenAI SDK. The file-ID obtained through this method is also compatible with Dashscope for model invocation. ```	2025-03-10 08:58:40 -04:00
ccurme	67aff1648b	community: Add OpenGradient integration (Toolkit) (#30190 ) Commandeering https://github.com/langchain-ai/langchain/pull/30135 --------- Co-authored-by: kylexqian <kylexqian@gmail.com>	2025-03-09 18:08:07 -04:00
ccurme	b209d46eb3	mistral[patch]: set global ssl context (#30189 )	2025-03-09 21:27:41 +00:00
Vijay Selvaraj	df459d0d5e	community: add Valthera integration (#30105 ) ```markdown Description: This PR integrates Valthera into LangChain, introducing an framework designed to send highly personalized nudges by an LLM agent. This is modeled after Dr. BJ Fogg's Behavior Model. This integration includes: - Custom data connectors for HubSpot, PostHog, and Snowflake. - A unified data aggregator that consolidates user data. - Scoring configurations to compute motivation and ability scores. - A reasoning engine that determines the appropriate user action. - A trigger generator to create personalized messages for user engagement. Issue: N/A Dependencies: N/A Twitter handle: - `@vselvarajijay` Tests and Docs: - `docs/docs/integrations/tools/valthera` - `https://github.com/valthera/langchain-valthera/tree/main/tests` ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-09 21:19:08 +00:00
ccurme	3823daa0b9	cli: update integration doc template for tools (#30188 ) Chain example -> langgraph agent	2025-03-09 21:14:43 +00:00
Jonathan Feng	911accf733	docs: add contextualai documentation (#30050 ) Thank you for contributing to LangChain! Description: adds ContextualAI's `langchain-contextual` package's documentation If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-09 02:43:13 +00:00
Bharat	b9746a6910	fixes#30182: update tool names to match OpenAI function name pattern (#30183 ) The OpenAI API requires function names to match the pattern '^[a-zA-Z0-9_-]+$'. This updates the JIRA toolkit's tool names to use underscores instead of spaces to comply with this requirement and prevent BadRequestError when using the tools with OpenAI functions. Error fixed: ``` File "langgraph-bug-fix/.venv/lib/python3.13/site-packages/openai/_base_client.py", line 1023, in _request raise self._make_status_error_from_response(err.response) from None openai.BadRequestError: Error code: 400 - {'error': {'message': "Invalid 'tools[0].function.name': string does not match pattern. Expected a string that matches the pattern '^[a-zA-Z0-9_-]+$'.", 'type': 'invalid_request_error', 'param': 'tools[0].function.name', 'code': 'invalid_value'}} During task with name 'agent' and id 'aedd7537-e8d5-6678-d0c5-98129586d3ac' ``` Issue:#30182	2025-03-08 20:48:25 -05:00
ccurme	cee0fecb08	docs: update package registry counts (#30181 )	2025-03-08 20:37:59 -05:00
William FH	bac3a28e70	Flush (#30157 )	2025-03-07 16:32:15 -08:00
ccurme	a7ab5e8372	community[patch]: ChatPerplexity: track usage metadata (#30175 )	2025-03-07 23:25:05 +00:00
ccurme	1c993b921c	core[patch]: release 0.3.43 (#30173 )	2025-03-07 21:56:00 +00:00
ccurme	9893e5cb80	core[patch]: catch structured_output_format (#30172 ) Change to `ls_structured_output_format` was not backward-compatible with older versions of integration packages.	2025-03-07 16:50:06 -05:00
ccurme	33a3510243	core[patch]: export ArgsSchema (#30169 ) This is needed for type hints see: https://github.com/langchain-ai/langchain/pull/30167	2025-03-07 20:43:05 +00:00
ccurme	17507c9ba6	groq[patch]: release 0.2.5 (#30168 )	2025-03-07 20:25:51 +00:00
andyzhou1982	9e863c89d2	add JiebaLinkExtractor for chinese doc extracting (#30150 ) Thank you for contributing to LangChain! - [ ] PR title: "community: chinese doc extracting" - [ ] PR message: - Description: add jieba_link_extractor.py for chinese doc extracting - Dependencies: jieba - [ ] Add tests and docs: If you're adding a new integration, please include /doc/doc/integrations/providers/jieba.md /doc/doc/integrations/vectorstores/jieba_link_extractor.ipynb /libs/packages.yml --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-07 20:21:46 +00:00
ccurme	74e7772a5f	groq[patch]: warn if model is not specified (#30161 ) Groq is retiring `mixtral-8x7b-32768`, which is currently the default model for ChatGroq, on March 20. Here we emit a warning if the model is not specified explicitly. A version 0.3.0 will be released ahead of March 20 that removes the default altogether.	2025-03-07 15:21:13 -05:00
Ioannis Bakagiannis	3444e587ee	docs: Integration Update - ADS4GPTs (#30153 ) docs: New integration for LangChain - ads4gpts-langchain Description: Tools and Toolkit for Agentic integration natively within LangChain with ADS4GPTs, in order to help applications monetize with advertising. Twitter handle: @ads4gpts Co-authored-by: knitlydevaccount <loom+github@knitly.app>	2025-03-07 14:35:44 -05:00
ccurme	3c258194ae	tests[patch]: release 0.3.14 (#30165 )	2025-03-07 18:34:05 +00:00
ccurme	34638ccfae	openai[patch]: release 0.3.8 (#30164 )	2025-03-07 18:26:40 +00:00
ccurme	4e5058f29c	core[patch]: release 0.3.42 (#30163 )	2025-03-07 18:14:45 +00:00
Eugene Yurtsev	894fd63a61	cli: release 0.0.36 (#30159 ) Bump for 0.0.36	2025-03-07 13:05:40 -05:00
ccurme	806211475a	core[patch]: update structured output tracing (#30123 ) - Trace JSON schema in `options` - Rename to `ls_structured_output_format`	2025-03-07 13:05:25 -05:00
ccurme	230876a7c5	anthropic[patch]: add PDF input example to API reference (#30156 )	2025-03-07 14:19:08 +00:00
joeconstantino	022ff9eead	Tableau docs for new datasource qa tool (#30125 ) - Description: a notebook showing langchain and langraph agents using the new langchain_tableau tool - Twitter handle: @joe_constantin0 --------- Co-authored-by: Joe Constantino <joe@constantino.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-06 14:58:56 +00:00
ccurme	52b0570bec	core, openai, standard-tests: improve OpenAI compatibility with Anthropic content blocks (#30128 ) - Support thinking blocks in core's `convert_to_openai_messages` (pass through instead of error) - Ignore thinking blocks in ChatOpenAI (instead of error) - Support Anthropic-style image blocks in ChatOpenAI --- Standard integration tests include a `supports_anthropic_inputs` property which is currently enabled only for tests on `ChatAnthropic`. This test enforces compatibility with message histories of the form: ``` - system message - human message - AI message with tool calls specified only through `tool_use` content blocks - human message containing `tool_result` and an additional `text` block ``` It additionally checks support for Anthropic-style image inputs if `supports_image_inputs` is enabled. Here we change this test, such that if you enable `supports_anthropic_inputs`: - You support AI messages with text and `tool_use` content blocks - You support Anthropic-style image inputs (if `supports_image_inputs` is enabled) - You support thinking content blocks. That is, we add a test case for thinking content blocks, but we also remove the requirement of handling tool results within HumanMessages (motivated by existing agent abstractions, which should all return ToolMessage). We move that requirement to a ChatAnthropic-specific test.	2025-03-06 09:53:14 -05:00
Pat Patterson	b3dc66f7a3	community: fix AttributeError when creating LanceDB vectorstore (#30127 ) Description: This PR adds a call to `guard_import()` to fix an AttributeError raised when creating LanceDB vectorstore instance with an existing LanceDB table. Issue: This PR fixes issue #30124. Dependencies: No additional dependencies. Twitter handle: [@metadaddy](https://x.com/metadaddy), but I spend more time at [@metadaddy.net](https://bsky.app/profile/metadaddy.net) these days. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-05 23:04:38 +00:00
Hugh Gao	9b7b8e4a1a	community: make DashScope models support Partial Mode for text continuation. (#30108 ) ## Description make DashScope models support Partial Mode for text continuation. For text continuation in ChatTongYi, it supports text continuation with a prefix by adding a "partial" argument in AIMessage. The document is [Partial Mode ](https://help.aliyun.com/zh/model-studio/user-guide/partial-mode?spm=a2c4g.11186623.help-menu-2400256.d_1_0_0_8.211e5b77KMH5Pn&scm=20140722.H_2862210._.OR_help-T_cn~zh-V_1). The API example is: ```py import os import dashscope messages = [{ "role": "user", "content": "请对“春天来了，大地”这句话进行续写，来表达春天的美好和作者的喜悦之情" }, { "role": "assistant", "content": "春天来了，大地", "partial": True }] response = dashscope.Generation.call( api_key=os.getenv("DASHSCOPE_API_KEY"), model='qwen-plus', messages=messages, result_format='message', ) print(response.output.choices[0].message.content) ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-05 16:22:14 +00:00
黑牛	f0153414d5	Add request_id field to improve request tracking and debugging (for Tongyi model) (#30110 ) - Description: Added the request_id field to the check_response function to improve request tracking and debugging, applicable for the Tongyi model. - Issue: None - Dependencies: None - Twitter handle: None - Add tests and docs: None - Lint and test: Ran `make format`, `make lint`, and `make test` to ensure the code meets formatting and testing requirements.	2025-03-05 11:03:47 -05:00
Manthan Surkar	1ee8aceaee	community: fix Jira API wrapper failing initialization with cloud param (#30117 ) ### Description Converts the boolean `jira_cloud` parameter in the Jira API Wrapper to a string before initializing the Jira Client. Also adds tests for the same. ### Issue [Jira API Wrapper Bug](`8abb65e138/libs/community/langchain_community/utilities/jira.py (L47)`) ```python jira_cloud_str = get_from_dict_or_env(values, "jira_cloud", "JIRA_CLOUD") jira_cloud = jira_cloud_str.lower() == "true" ``` The above code has a bug where the value of `"jira_cloud"` is a boolean. If it is passed, calling `.lower()` on a boolean raises an error. Additionally, `False` cannot be passed explicitly since `get_from_dict_or_env` falls back to environment variables. Relevant code in `langchain_core`: [Source](https://github.com/thesmallstar/langchain/blob/master/.venv/lib/python3.13/site-packages/langchain_core/utils/env.py#L46) ```python if isinstance(key, str) and key in data and data[key]: # Here, data[key] is False ``` This PR fixes both issues. ### Twitter Handle [Manthan Surkar](https://x.com/manthan_surkar)	2025-03-05 10:49:25 -05:00
Adrián Panella	c599ba47d5	core(mermaid): fix error when 3+ subgraph levels (#29970 )	2025-03-04 13:27:49 -05:00
Alexander Henlein	417efa30a6	docs: add Taiga Tool integration docs (#30042 ) This PR adds documentation for the langchain-taiga Tool integration, including an example notebook at 'docs/docs/integrations/tools/taiga.ipynb' and updates to 'libs/packages.yml' to track the new package. Issue: N/A Dependencies: None Twitter handle: N/A --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-03-04 17:51:20 +00:00
Mathias Marciano	5f0102242a	Fixed an issue with the OpenAI Assistant's 'retrieval' tool and adding support for the 'attachments' parameter (#30006 ) PR Title: langchain: add attachments support in OpenAIAssistantRunnable PR Description: This PR fixes an issue with the "retrieval" tool (internally named "file_search") in the OpenAI Assistant by adding support for the "attachments" parameter in the invoke method. This change allows files to be linked to messages when they are inserted into threads, which is essential for utilizing OpenAI's Retrieval Augmented Generation (RAG) feature. Issue: N/A Dependencies: None Twitter handle: N/A --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-04 17:34:11 +00:00
Philippe PRADOS	4710c1fa8c	community[minor]: Fix regular expression in visualize and outlines modules. (#30002 ) Fix invalid escape characteres	2025-03-04 12:23:48 -05:00
ccurme	577c0d0715	community[patch]: release 0.3.19 (#30104 )	2025-03-04 16:12:03 +00:00
ccurme	ba5ddb218f	anthropic[patch]: release 0.3.9 (#30103 )	2025-03-04 10:53:55 -05:00
ccurme	9383a0536a	tests[patch]: release 0.3.13 (#30102 )	2025-03-04 10:53:43 -05:00
ccurme	fb16c25920	langchain[patch]: release 0.3.20 (#30101 )	2025-03-04 15:47:27 +00:00
ccurme	692a68bf1c	core[patch]: release 0.3.41 (#30100 )	2025-03-04 15:08:57 +00:00
ccurme	484d945500	community[patch]: remove numpy cap for python < 3.12 (#30084 )	2025-03-04 09:46:41 -05:00
ZhangShenao	8575d7491f	[Doc] Improve api doc (#30073 ) - Update api_doc for `BaseMessage` - add static method decorator for `retry_runnable`	2025-03-04 09:39:07 -05:00
Samuel Dion-Girardeau	ccb64e9f4f	docs: Fix typo in code samples for max_tokens_for_prompt (#30088 ) - Description: Fix typo in code samples for max_tokens_for_prompt. Code blocks had singular "token" but the method has plural "tokens". - Issue: N/A - Dependencies: N/A - Twitter handle: N/A	2025-03-04 09:11:21 -05:00
ArrayPD	c671d54c6f	core: make with_alisteners() example workable. (#30059 ) Description: 5 fix of example from function with_alisteners() in libs/core/langchain_core/runnables/base.py Replace incoherent example output with workable example's output. 1. SyntaxError: unterminated string literal print(f"on start callback starts at {format_t(time.time())} correct as print(f"on start callback starts at {format_t(time.time())}") 2. SyntaxError: unterminated string literal print(f"on end callback starts at {format_t(time.time())} correct as print(f"on end callback starts at {format_t(time.time())}") 3. NameError: name 'Runnable' is not defined Fix as from langchain_core.runnables import Runnable 4. NameError: name 'asyncio' is not defined Fix as import asyncio 5. NameError: name 'format_t' is not defined. Implement format_t() as from datetime import datetime, timezone def format_t(timestamp: float) -> str: return datetime.fromtimestamp(timestamp, tz=timezone.utc).isoformat()	2025-03-01 15:39:02 -05:00
cold-eye	7c175e3fda	Update ascend.py (#30060 ) add batch_size to fix oom when embed large amount texts Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2025-03-01 14:10:41 -05:00
ccurme	3b066dc005	anthropic[patch]: allow structured output when thinking is enabled (#30047 ) Structured output will currently always raise a BadRequestError when Claude 3.7 Sonnet's `thinking` is enabled, because we rely on forced tool use for structured output and this feature is not supported when `thinking` is enabled. Here we: - Emit a warning if `with_structured_output` is called when `thinking` is enabled. - Raise `OutputParserException` if no tool calls are generated. This is arguably preferable to raising an error in all cases. ```python from langchain_anthropic import ChatAnthropic from pydantic import BaseModel class Person(BaseModel): name: str age: int llm = ChatAnthropic( model="claude-3-7-sonnet-latest", max_tokens=5_000, thinking={"type": "enabled", "budget_tokens": 2_000}, ) structured_llm = llm.with_structured_output(Person) # <-- this generates a warning ``` ```python structured_llm.invoke("Alice is 30.") # <-- works ``` ```python structured_llm.invoke("Hello!") # <-- raises OutputParserException ```	2025-02-28 14:44:11 -05:00
ccurme	f8ed5007ea	anthropic, mistral: return `model_name` in response metadata (#30048 ) Took a "census" of models supported by init_chat_model-- of those that return model names in response metadata, these were the only two that had it keyed under `"model"` instead of `"model_name"`.	2025-02-28 18:56:05 +00:00
Christophe Bornet	9e6ffd1264	core: Add ruff rules PTH (pathlib) (#29338 ) See https://docs.astral.sh/ruff/rules/#flake8-use-pathlib-pth Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-28 13:22:20 -05:00
TheSongg	86b364de3b	Add asynchronous generate interface (#30001 ) - [ ] PR title: [langchain_community.llms.xinference]: Add asynchronous generate interface - [ ] PR message: The asynchronous generate interface support stream data and non-stream data. chain = prompt \| llm async for chunk in chain.astream(input=user_input): yield chunk - [ ] Add tests and docs: from langchain_community.llms import Xinference from langchain.prompts import PromptTemplate llm = Xinference( server_url="http://0.0.0.0:9997", # replace your xinference server url model_uid={model_uid} # replace model_uid with the model UID return from launching the model stream = True ) prompt = PromptTemplate(input=['country'], template="Q: where can we visit in the capital of {country}? A:") chain = prompt \| llm async for chunk in chain.astream(input=user_input): yield chunk	2025-02-28 12:32:44 -05:00
Fakai Zhao	f07338d2bf	Implementing the MMR algorithm for OLAP vector storage (#30033 ) Thank you for contributing to LangChain! - Implementing the MMR algorithm for OLAP vector storage: - Support Apache Doris and StarRocks OLAP database. - Example: "vectorstore.as_retriever(search_type="mmr", search_kwargs={"k": 10})" - Implementing the MMR algorithm for OLAP vector storage: - Apache Doris - StarRocks - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - Add tests and docs: - Example: "vectorstore.as_retriever(search_type="mmr", search_kwargs={"k": 10})" - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: fakzhao <fakzhao@cisco.com>	2025-02-28 08:50:22 -05:00
Daniel Rauber	186cd7f1a1	community: PlaywrightURLLoader should wait for page load event before attempting to extract data (#30043 ) ## Description The PlaywrightURLLoader should wait for a page to be loaded before attempting to extract data.	2025-02-28 08:45:51 -05:00
ccurme	0dbcc1d099	docs: document anthropic features (#30030 ) Update integrations page with extended thinking feature. Update API reference with extended thinking and citations.	2025-02-27 19:37:04 -05:00
ccurme	6c7c8a164f	openai[patch]: add unit test (#30022 ) Test `max_completion_tokens` is propagated to payload for AzureChatOpenAI.	2025-02-27 11:09:17 -05:00
DamonXue	156a60013a	docs: fix tavily_search code-block format. (#30012 ) This pull request includes a change to the `TavilySearchResults` class in the `tool.py` file, which updates the code block format in the documentation. Documentation update: * [`libs/community/langchain_community/tools/tavily_search/tool.py`](diffhunk://#diff-e3b6a980979268b639c6a86e9b182756b0f7c7e9e5605e613bc0a72ea6aa5301L54-R59): Changed the code block format from Python to JSON in the example provided in the docstring.Thank you for contributing to LangChain!	2025-02-27 10:55:15 -05:00
kawamou	8977ac5ab0	community[fix]: Handle None value in raw_content from Tavily API response (#30021 ) ## Description: When using the Tavily retriever with include_raw_content=True, the retriever occasionally fails with a Pydantic ValidationError because raw_content can be None. The Document model in langchain_core/documents/base.py requires page_content to be a non-None value, but the Tavily API sometimes returns None for raw_content. This PR fixes the issue by ensuring that even when raw_content is None, an empty string is used instead: ```python page_content=result.get("content", "") if not self.include_raw_content else (result.get("raw_content") or ""),	2025-02-27 10:53:53 -05:00
Lakindu Boteju	f69deee1bd	community: Add cost data for aws bedrock anthropic.claude-3-7 model (#30016 ) This pull request includes updates to the `libs/community/langchain_community/callbacks/bedrock_anthropic_callback.py` file to add a new model version to the list of supported models. Updates to supported models: * Added support for the `anthropic.claude-3-7-sonnet-20250219-v1:0` model with a rate of `0.003` for 1000 input tokens. * Added support for the `anthropic.claude-3-7-sonnet-20250219-v1:0` model with a rate of `0.015` for 1000 output tokens. AWS Bedrock pricing reference : https://aws.amazon.com/bedrock/pricing	2025-02-27 09:51:52 -05:00
Lakindu Boteju	e0e9e560b3	PyMuPDF4LLM integration to LangChain (#29953 ) ## PyMuPDF4LLM integration to LangChain for PDF content extraction in Markdown format ### Description [PyMuPDF4LLM](https://github.com/pymupdf/RAG) makes it easier to extract PDF content in Markdown format, needed for LLM & RAG applications. (License: GNU Affero General Public License v3.0) [langchain-pymupdf4llm](https://github.com/lakinduboteju/langchain-pymupdf4llm) integrates PyMuPDF4LLM to LangChain as a Document Loader. (License: MIT License) This pull request introduces the integration of [PyMuPDF4LLM](https://pymupdf.readthedocs.io/en/latest/pymupdf4llm) into the LangChain project as an integration package: [`langchain-pymupdf4llm`](https://github.com/lakinduboteju/langchain-pymupdf4llm). The most important changes include adding new Jupyter notebooks to document the integration and updating the package configuration file to include the new package. ### Documentation: * `docs/docs/integrations/providers/pymupdf4llm.ipynb`: Added a new Jupyter notebook to document the integration of `PyMuPDF4LLM` with LangChain, including installation instructions and class imports. * `docs/docs/integrations/document_loaders/pymupdf4llm.ipynb`: Added a new Jupyter notebook to document the usage of `langchain-pymupdf4llm` as a LangChain integration package in detail. ### Package registration: * `libs/packages.yml`: Updated the package configuration file to include the `langchain-pymupdf4llm` package. ### Additional information * Related to: https://github.com/langchain-ai/langchain/pull/29848 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-26 15:59:12 -05:00
Dan Mirsky	d98c3f76c2	core[patch]: Fix FileCallbackHandler name resolution, Fixes #29941 (#29942 ) - Description: Same changes as #26593 but for FileCallbackHandler - Issue: Fixes #29941 - Dependencies: None - Twitter handle: None - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2025-02-26 14:54:24 -05:00
Christophe Bornet	b3885c124f	core: Add ruff rules TC (#29268 ) See https://docs.astral.sh/ruff/rules/#flake8-type-checking-tc Some fixes done for TC001,TC002 and TC003 but these rules are excluded since they don't play well with Pydantic. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-26 19:39:05 +00:00
talos	9cd20080fc	community: Update SQLiteVec table trigger (#29914 ) Issue: This trigger can only be used by the first table created. Cannot create additional triggers for other tables. fixed: Update the trigger name so that it can be used for new tables. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-26 15:10:13 +00:00
ccurme	7562677f3f	langchain[patch]: delete erroneous lock file (#30007 ) Picked up during merge.	2025-02-26 15:01:05 +00:00
Erick Friis	3c96012f5e	langchain: make numpy optional (#29182 ) Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-26 14:35:24 +00:00
Artem Yankov	6177b9f9ab	community: add title, score and raw_content to tavily search results (#29995 ) Description: Tavily search results returned from API include useful information like title, score and (optionally) raw_content that is missed in wrapper although it's documented there properly. Add this data to the result structure. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-25 23:27:21 +00:00
Eugene Yurtsev	b525226531	core[patch]: version 0.3.40 (#29997 ) Version 0.3.40 release	2025-02-25 23:09:40 +00:00
Vadym Barda	0fc50b82a0	core[patch]: allow passing description to @tool decorator (#29976 )	2025-02-25 17:45:36 -05:00
Naveen SK	21bfc95e14	docs: Correct grammatical typos in various documentation files (#29983 ) Description: Fixed grammatical typos in various documentation files Issue: N/A Dependencies: N/A Twitter handle: @MrNaveenSK Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-25 19:13:31 +00:00
ccurme	1158d3134d	langchain[patch]: remove aiohttp (#29991 ) My guess is this was left over from when `community` was in langchain.	2025-02-25 11:43:00 -05:00
ccurme	afd7888392	langchain[patch]: remove explicit dependency on tenacity (#29990 ) Not used anywhere in `langchain`, already a dependency of langchain-core.	2025-02-25 11:31:55 -05:00
ccurme	32704f0ad8	langchain: update extended test (#29988 )	2025-02-25 14:58:20 +00:00
Yan	47e1a384f7	Writer partners integration docs (#29961 ) Documentation of Writer provider and additional features * [PyPi langchain-writer web-page](https://pypi.org/project/langchain-writer/) * [GitHub langchain-writer repo](https://github.com/writer/langchain-writer) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-24 19:30:09 -05:00
ccurme	79f5bbfb26	anthropic[patch]: release 0.3.8 (#29973 )	2025-02-24 15:24:35 -05:00
ccurme	ded886f622	anthropic[patch]: support claude 3.7 sonnet (#29971 )	2025-02-24 15:17:47 -05:00
Bagatur	d00d645829	docs[patch]: update disable_streaming docstring (#29968 )	2025-02-24 18:40:31 +00:00
ccurme	b7a1705052	openai[patch]: release 0.3.7 (#29967 )	2025-02-24 11:59:28 -05:00
ccurme	5437ee385b	core[patch]: release 0.3.39 (#29966 )	2025-02-24 11:47:01 -05:00
ccurme	291a232fb8	openai[patch]: set global ssl context (#29932 ) We set ```python global_ssl_context = ssl.create_default_context(cafile=certifi.where()) ``` at the module-level and share it among httpx clients.	2025-02-24 11:25:16 -05:00
ccurme	9ce07980b7	core[patch]: pydantic 2.11 compat (#29963 ) Resolves https://github.com/langchain-ai/langchain/issues/29951 Was able to reproduce the issue with Anthropic installing from pydantic `main` and correct it with the fix recommended in the issue. Thanks very much @Viicos for finding the bug and the detailed writeup!	2025-02-24 11:11:25 -05:00
ccurme	0d3a3b99fc	core[patch]: release 0.3.38 (#29962 )	2025-02-24 15:04:53 +00:00
ccurme	b1a7f4e106	core, openai[patch]: support serialization of pydantic models in messages (#29940 ) Resolves https://github.com/langchain-ai/langchain/issues/29003, https://github.com/langchain-ai/langchain/issues/27264 Related: https://github.com/langchain-ai/langchain-redis/issues/52 ```python from langchain.chat_models import init_chat_model from langchain.globals import set_llm_cache from langchain_community.cache import SQLiteCache from pydantic import BaseModel cache = SQLiteCache() set_llm_cache(cache) class Temperature(BaseModel): value: int city: str llm = init_chat_model("openai:gpt-4o-mini") structured_llm = llm.with_structured_output(Temperature) ``` ```python # 681 ms response = structured_llm.invoke("What is the average temperature of Rome in May?") ``` ```python # 6.98 ms response = structured_llm.invoke("What is the average temperature of Rome in May?") ```	2025-02-24 09:34:27 -05:00
ccurme	927ec20b69	openai[patch]: update system role to developer for o-series models (#29785 ) Some o-series models will raise a 400 error for `"role": "system"` (`o1-mini` and `o1-preview` will raise, `o1` and `o3-mini` will not). Here we update `ChatOpenAI` to update the role to `"developer"` for all model names matching `^o\d`. We only make this change on the ChatOpenAI class (not BaseChatOpenAI).	2025-02-24 08:59:46 -05:00
Ahmed Tammaa	8b511a3a78	[Exception Handling] DeepSeek JSONDecodeError (#29758 ) For Context please check #29626 The Deepseek is using langchain_openai. The error happens that it show `json decode error`. I added a handler for this to give a more sensible error message which is DeepSeek API returned empty/invalid json. Reproducing the issue is a bit challenging as it is inconsistent, sometimes DeepSeek returns valid data and in other times it returns invalid data which triggers the JSON Decode Error. This PR is an exception handling, but not an ultimate fix for the issue. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-23 15:00:32 -05:00
Julien Elkaim	e586bffe51	community: Repair embeddings/llamacpp's embed_query method (#29935 ) Description: As commented on the commit [`41b6a86`](`41b6a86bbe`) it introduced a bug for when we do an embedding request and the model returns a non-nested list. Typically it's the case for model _nomic-embed-text_. - I added the unit test, and ran `make format`, `make lint` and `make test` from the `community` package. - No new dependency. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-23 19:32:17 +00:00
Saraswathy Kalaiselvan	5ca4933b9d	docs: updated ChatLiteLLM model_kwargs description (#29937 ) - [x] PR title: docs: (community) update ChatLiteLLM - [x] PR message: - Description: updated description of model_kwargs parameter which was wrongly describing for temperature. - Issue: #29862 - Dependencies: N/A - [x] Add tests and docs: N/A - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-23 19:27:13 +00:00
ccurme	512eb1b764	anthropic[patch]: update models for integration tests (#29938 )	2025-02-23 14:23:48 -05:00
Christophe Bornet	f6d4fec4d5	core: Add ruff rules ANN (type annotations) (#29271 ) See https://docs.astral.sh/ruff/rules/#flake8-annotations-ann The interest compared to only mypy is that ruff is very fast at detecting missing annotations. ANN101 and ANN102 are deprecated so we ignore them ANN401 (no Any type) ignored to be in sync with mypy config --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-22 17:46:28 -05:00
Bagatur	979a991dc2	core[patch]: dont deep copy merge_message_runs (#28454 ) afaict no need to deep copy here, if we merge messages then we convert them to chunks first anyways	2025-02-22 21:56:45 +00:00
Mohammad Mohtashim	afa94e5bf7	`_wait_for_run` calling fix for `OpenAIAssistantRunnable` (#29927 ) - Description: Fixed the `OpenAIAssistantRunnable` call of `_wait_for_run` - Issue: #29923	2025-02-22 00:27:24 +00:00
Vadym Barda	437fe6d216	core[patch]: return ToolMessage from tools when tool call ID is empty string (#29921 )	2025-02-21 11:53:15 -05:00
Taofiq Aiyelabegan	5ee8a8f063	[Integration]: Langchain-Permit (#29867 ) ## Which area of LangChain is being modified? - This PR adds a new "Permit" integration to the `docs/integrations/` folder. - Introduces two new Tools (`LangchainJWTValidationTool` and `LangchainPermissionsCheckTool`) - Introduces two new Retrievers (`PermitSelfQueryRetriever` and `PermitEnsembleRetriever`) - Adds demo scripts in `examples/` showcasing usage. ## Description of Changes - Created `langchain_permit/tools.py` for JWT validation and permission checks with Permit. - Created `langchain_permit/retrievers.py` for custom Permit-based retrievers. - Added documentation in `docs/integrations/providers/permit.ipynb` (or `.mdx`) to explain setup, usage, and examples. - Provided sample scripts in `examples/demo_scripts/` to illustrate usage of these tools and retrievers. - Ensured all code is linted and tested locally. Thank you again for reviewing! --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-21 10:59:00 -05:00
Jean-Philippe Dournel	ebe38baaf9	community/mlx_pipeline: fix crash at mlx call (#29915 ) - Description: Since mlx_lm 0.20, all calls to mlx crash due to deprecation of the way parameters are passed to methods generate and generate_step. Parameters top_p, temp, repetition_penalty and repetition_context_size are not passed directly to those method anymore but wrapped into "sampler" and "logit_processor". - Dependencies: mlx_lm (optional) - Tests: I've had a new test to existing test file: tests/integration_tests/llms/test_mlx_pipeline.py --------- Co-authored-by: Jean-Philippe Dournel <jp@insightkeeper.io>	2025-02-21 09:14:53 -05:00
ccurme	1fa9f6bc20	docs: build mongo in api ref (#29908 )	2025-02-20 19:58:35 -05:00
Chaunte W. Lacewell	d972c6d6ea	partners: add langchain-vdms (#29857 ) Description: Deprecate vdms in community, add integration langchain-vdms, and update any related files Issue: n/a Dependencies: langchain-vdms Twitter handle: n/a --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-20 19:48:46 -05:00
Mohammad Mohtashim	8293142fa0	mistral[patch]: support model_kwargs (#29838 ) - Description: Frequency_penalty added as a client parameter - Issue: #29803 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-20 18:47:39 -05:00
ccurme	924d9b1b33	cli[patch]: fix retriever template (#29907 ) Chat model tabs don't render correctly in .ipynb template.	2025-02-20 17:51:19 +00:00
Brayden Zhong	a70f31de5f	Community: RankLLMRerank AttributeError (Handle list-based rerank results) (#29840 ) # community: Fix AttributeError in RankLLMRerank (`list` object has no attribute `candidates`) ## Description This PR fixes an issue in `RankLLMRerank` where reranking fails with the following error: ``` AttributeError: 'list' object has no attribute 'candidates' ``` The issue arises because `rerank_batch()` returns a `List[Result]` instead of an object containing `.candidates`. ### Changes Introduced - Adjusted `compress_documents()` to support both: - Old API format: `rerank_results.candidates` - New API format: `rerank_results` as a list - Also fix wrong .txt location parsing while I was at it. --- ## Issue Fixes AttributeError in `RankLLMRerank` when using `compression_retriever.invoke()`. The issue is observed when `rerank_batch()` returns a list instead of an object with `.candidates`. Relevant log: ``` AttributeError: 'list' object has no attribute 'candidates' ``` ## Dependencies - No additional dependencies introduced. --- ## Checklist - [x] Backward compatible with previous API versions - [x] Tested locally with different RankLLM models - [x] No new dependencies introduced - [x] Linted with `make format && make lint` - [x] Ready for review --- ## Testing - Ran `compression_retriever.invoke(query)` ## Reviewers If no review within a few days, please @mention one of: - @baskaryan - @efriis - @eyurtsev - @ccurme - @vbarda - @hwchase17	2025-02-20 12:38:31 -05:00
Levon Ghukasyan	ec403c442a	Separate deepale vector store (#29902 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-20 17:37:19 +00:00
Jorge Piedrahita Ortiz	3acf842e35	core: add sambanova chat models to load module mapping (#29855 ) - Description: add sambanova integration package chat models to load module mapping, to allow serialization and deserialization	2025-02-20 12:30:50 -05:00
ccurme	d227e4a08e	mistralai[patch]: release 0.2.7 (#29906 )	2025-02-20 17:27:12 +00:00
Hande	d8bab89e6e	community: add cognee retriever (#29878 ) This PR adds a new cognee integration, knowledge graph based retrieval enabling developers to ingest documents into cognee’s knowledge graph, process them, and then retrieve context via CogneeRetriever. It includes: - langchain_cognee package with a CogneeRetriever class - a test for the integration, demonstrating how to create, process, and retrieve with cognee - an example notebook showing its use. It lives in `docs/docs/integrations` directory. Followed additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. Thank you for the review! --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-20 17:15:23 +00:00
dokato	92b415a9f6	community: Made some Jira fields optional for agent to work correctly (#29876 ) Description: Two small changes have been proposed here: (1) Previous code assumes that every issue has a priority field. If an issue lacks this field, the code will raise a KeyError. Now, the code checks if priority exists before accessing it. If priority is missing, it assigns None instead of crashing. This prevents runtime errors when processing issues without a priority. (2) Also If the "style" field is missing, the code throws a KeyError. `.get("style", None)` safely retrieves the value if present. Issue: #29875 Dependencies: N/A	2025-02-20 12:10:11 -05:00
am-kinetica	ca7eccba1f	Handled a bug around empty query results differently (#29877 ) Thank you for contributing to LangChain! - [ ] Handled query records properly: "community: vectorstores/kinetica" - [ ] Bugfix for empty query results handling: - Description: checked for the number of records returned by a query before processing further - Issue: resulted in an `AttributeError` earlier which has now been fixed @efriis	2025-02-20 12:07:49 -05:00
Antonio Pisani	2c403a3ea9	docs: Add langchain-prolog documentation (#29788 ) I want to add documentation for a new integration with SWI-Prolog. @hwchase17 check this out: https://github.com/apisani1/langchain-prolog/tree/main/examples/travel_agent --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-20 11:50:28 -05:00
Marlene	be7fa920fa	Partner: Azure AI Langchain Docs and Package Registry (#29879 ) This PR adds documentation for the Azure AI package in Langchain to the main mono-repo No issue connected or updated dependencies. Utilises existing tests and makes updates to the docs --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-20 14:35:26 +00:00
Hankyeol Kyung	2dd0ce3077	openai: Update reasoning_effort arg documentation (#29897 ) Description: Update docstring for `reasoning_effort` argument to specify that it applies to reasoning models only (e.g., OpenAI o1 and o3-mini), clarifying its supported models. Issue: None Dependencies: None	2025-02-20 09:03:42 -05:00
ccurme	ed3c2bd557	core[patch]: set version="v2" as default in astream_events (#29894 )	2025-02-19 23:21:37 +00:00
Fabian Blatz	a2d05a376c	community: ConfluenceLoader: add a filter method for attachments (#29882 ) Adds a `attachment_filter_func` parameter to the ConfluenceLoader class which can be used to determine which files are indexed. This is useful if you are interested in excluding files based on their media type or other metadata.	2025-02-19 18:20:45 -05:00
ccurme	9ed47a4d63	community[patch]: release 0.3.18 (#29896 )	2025-02-19 20:13:00 +00:00
ccurme	92889edafd	core[patch]: release 0.3.37 (#29895 )	2025-02-19 20:04:35 +00:00
ccurme	ffd6194060	core[patch]: de-beta rate limiters (#29891 )	2025-02-19 19:19:59 +00:00
ccurme	fb4c8423f0	docs: fix builds (#29890 ) Missed in https://github.com/langchain-ai/langchain/pull/29889	2025-02-19 13:35:59 -05:00
ccurme	68b13e5172	pinecone: delete from monorepo (#29889 ) This now lives in https://github.com/langchain-ai/langchain-pinecone	2025-02-19 12:55:15 -05:00
Erick Friis	6c1e21d128	core: basemessage.text() (#29078 )	2025-02-18 17:45:44 -08:00
Eugene Yurtsev	8e5074d82d	core: release 0.3.36 (#29869 ) Release 0.3.36	2025-02-18 19:51:43 +00:00
Vadym Barda	d04fa1ae50	core[patch]: allow passing JSON schema as args_schema to tools (#29812 )	2025-02-18 14:44:31 -05:00
ccurme	5034a8dc5c	xai[patch]: release 0.2.1 (#29854 )	2025-02-17 14:30:41 -05:00
ccurme	83dcef234d	xai[patch]: support dedicated structured output feature (#29853 ) https://docs.x.ai/docs/guides/structured-outputs Interface appears identical to OpenAI's. ```python from langchain.chat_models import init_chat_model from pydantic import BaseModel class Joke(BaseModel): setup: str punchline: str llm = init_chat_model("xai:grok-2").with_structured_output( Joke, method="json_schema" ) llm.invoke("Tell me a joke about cats.") ```	2025-02-17 14:19:51 -05:00
ccurme	9d6fcd0bfb	infra: add xai to scheduled testing (#29852 )	2025-02-17 18:59:45 +00:00
ccurme	8a3b05ae69	langchain[patch]: release 0.3.19 (#29851 )	2025-02-17 13:36:23 -05:00
ccurme	c9061162a1	langchain[patch]: add xai to extras (#29850 )	2025-02-17 17:49:34 +00:00
Bagatur	1acf57e9bd	langchain[patch]: init_chat_model xai support (#29849 )	2025-02-17 09:45:39 -08:00
hsm207	037b129b86	weaviate: Add-deprecation-warning (#29757 ) - Description: add deprecation warning when using weaviate from langchain_community - Issue: NA - Dependencies: NA - Twitter handle: NA --------- Signed-off-by: hsm207 <hsm207@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-16 21:42:18 -05:00
Đỗ Quang Minh	cd198ac9ed	community: add custom model for OpenAIWhisperParser (#29831 ) Add `model` properties for OpenAIWhisperParser. Defaulted to `whisper-1` (previous value). Please help me update the docs and other related components of this repo.	2025-02-16 21:26:07 -05:00
Cole McIntosh	6874c9c1d0	docs: add notebook for langchain-salesforce package (#29800 ) Description: This PR adds a Jupyter notebook that explains the features, installation, and usage of the [`langchain-salesforce`](https://github.com/colesmcintosh/langchain-salesforce) package. The notebook includes: - Setup instructions for configuring Salesforce credentials - Example code demonstrating common operations such as querying, describing objects, creating, updating, and deleting records Issue: N/A Dependencies: No new dependencies are required. Tests and Docs: - Added an example notebook demonstrating the usage of the `langchain-salesforce` package, located in `docs/docs/integrations`. Lint and Test: - Ran `make format`, `make lint`, and `make test` successfully. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-16 08:34:57 -05:00
Jan Heimes	60f58df5b3	community: add top_k as param to Needle Retriever (#29821 ) Thank you for contributing to LangChain! - [X] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: This PR adds top_k as a param to the Needle Retriever. By default we use top 10. - [X] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2025-02-16 08:30:52 -05:00
Jesus Fernandez Bes	1dfac909d8	community: Adding IN Operator to AzureCosmosDBNoSQLVectorStore (#29805 ) - Description: I have added a new operator in the operator map with key `$in` and value `IN`, so that you can define filters using lists as values. This was already contemplated but as IN operator was not in the map they cannot be used. - Issue: Fixes #29804. - Dependencies: No extra.	2025-02-15 21:44:54 -05:00
Wahed Hemati	8901b113c3	docs: add Discord integration docs (#29822 ) This PR adds documentation for the `langchain-discord-shikenso` integration, including an example notebook at `docs/docs/integrations/tools/discord.ipynb` and updates to `libs/packages.yml` to track the new package. Issue: N/A Dependencies: None Twitter handle: N/A --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-15 21:43:45 -05:00
Krishna Kulkarni	a98c5f1c4b	langchain_community: add image support to DuckDuckGoSearchAPIWrapper (#29816 ) - [ ] PR title: langchain_community: add image support to DuckDuckGoSearchAPIWrapper - Description: This PR enhances the DuckDuckGoSearchAPIWrapper within the langchain_community package by introducing support for image searches. The enhancement includes: - Adding a new method _ddgs_images to handle image search queries. - Updating the run and results methods to process and return image search results appropriately. - Modifying the source parameter to accept "images" as a valid option, alongside "text" and "news". - Dependencies: No additional dependencies are required for this change.	2025-02-15 21:32:14 -05:00
Iris Liu	0d9f0b4215	docs: updates Chroma integration API ref docs (#29826 ) - Description: updates Chroma integration API ref docs - Issue: #29817 - Dependencies: N/A - Twitter handle: @irieliu Co-authored-by: “Iris <“liuirisny@gmail.com”>	2025-02-15 21:05:21 -05:00
ccurme	3fe7c07394	openai[patch]: release 0.3.6 (#29824 )	2025-02-15 13:53:35 -05:00
ccurme	65a6dce428	openai[patch]: enable streaming for o1 (#29823 ) Verified streaming works for the `o1-2024-12-17` snapshot as well.	2025-02-15 12:42:05 -05:00
Christophe Bornet	3dffee3d0b	all: Bump blockbuster version to 1.5.18 (#29806 ) Has fixes for running on Windows and non-CPython runtimes.	2025-02-14 07:55:38 -08:00
ccurme	d9a069c414	tests[patch]: release 0.3.12 (#29797 )	2025-02-13 23:57:44 +00:00
ccurme	e4f106ea62	groq[patch]: remove xfails (#29794 ) These appear to pass.	2025-02-13 15:49:50 -08:00
Erick Friis	f34e62ef42	packages: add langchain-xai (#29795 ) wasn't registered per the contribution guide: https://python.langchain.com/docs/contributing/how_to/integrations/	2025-02-13 15:36:41 -08:00
ccurme	49cc6106f7	tests[patch]: fix query for test_tool_calling_with_no_arguments (#29793 )	2025-02-13 23:15:52 +00:00
Erick Friis	1a225fad03	multiple: fix uv path deps (#29790 ) file:// format wasn't working with updates - it doesn't install as an editable dep move to tool.uv.sources with path= instead	2025-02-13 21:32:34 +00:00
Erick Friis	ff13384eb6	packages: update counts, add command (#29789 )	2025-02-13 20:45:25 +00:00
HackHuang	76d32754ff	core : update the class docs of InMemoryVectorStore in in_memory.py (#29781 ) - Description: Add the new introduction about checking `store` in in_memory.py, It’s necessary and useful for beginners. ```python Check Documents: .. code-block:: python for doc in vector_store.store.values(): print(doc) ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-13 16:41:47 +00:00
Mohammad Mohtashim	96ad09fa2d	(Community): Added API Key for Jina Search API Wrapper (#29622 ) - Description: Simple change for adding the API Key for Jina Search API Wrapper - Issue: #29596	2025-02-12 20:12:07 -08:00
ccurme	f1c66a3040	docs: minor fix to provider table (#29771 ) Langfair renders as LangfAIr	2025-02-13 04:06:58 +00:00
Jakub Kopecký	c8cb7c25bf	docs: update apify integration (#29553 ) Description: Fixed and updated Apify integration documentation to use the new [langchain-apify](https://github.com/apify/langchain-apify) package. Twitter handle: @apify	2025-02-12 20:02:55 -08:00
ccurme	16fb1f5371	chroma[patch]: release 0.2.2 (#29769 ) Resolves https://github.com/langchain-ai/langchain/issues/29765	2025-02-13 02:39:16 +00:00
Mohammad Mohtashim	2310847c0f	(Chroma): Small Fix in `add_texts` when checking for embeddings (#29766 ) - Description: Small fix in `add_texts` to make embedding nullability is checked properly. - Issue: #29765 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-13 02:26:13 +00:00
Eric Pinzur	716fd89d8e	docs: contributed `Graph RAG` Retriever integration (#29744 ) Description: This adds the `Graph RAG` Retriever integration documentation, per https://python.langchain.com/docs/contributing/how_to/integrations/. * The integration exists in this public repository: https://github.com/datastax/graph-rag * We've implemented the standard langchain tests for retrievers: https://github.com/datastax/graph-rag/blob/main/packages/langchain-graph-retriever/tests/test_langchain.py * Our integration is published to PyPi: https://pypi.org/project/langchain-graph-retriever/	2025-02-12 18:25:48 -08:00
Sunish Sheth	f42dafa809	Deprecating sql_database access for creating UC functions for agent tools (#29745 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-13 02:24:44 +00:00
Thor 雷神 Schaeff	a0970d8d7e	[WIP] chore: update ElevenLabs tool. (#29722 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-13 01:54:34 +00:00
Chaymae El Aattabi	4b08a7e8e8	Fix #29759 : Use local chunk_size_ for looping in embed_documents (#29761 ) This fix ensures that the chunk size is correctly determined when processing text embeddings. Previously, the code did not properly handle cases where chunk_size was None, potentially leading to incorrect chunking behavior. Now, chunk_size_ is explicitly set to either the provided chunk_size or the default self.chunk_size, ensuring consistent chunking. This update improves reliability when processing large text inputs in batches and prevents unintended behavior when chunk_size is not specified. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-13 01:28:26 +00:00
Sunish Sheth	043d78d85d	Deprecate langhchain community ucfunctiontoolkit in favor for databricks_langchain (#29746 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2025-02-12 15:50:35 -08:00
Hugues Chocart	e4eec9e9aa	community: add langchain-abso documentation (#29739 ) Add the documentation for the community package `langchain-abso`. It provides a new Chat Model class, that uses https://abso.ai --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2025-02-12 19:57:33 +00:00
ccurme	e61f463745	core[patch]: release 0.3.35 (#29764 )	2025-02-12 18:13:10 +00:00
Nuno Campos	fe59f2cc88	core: Fix output of convert_messages when called with BaseMessage.model_dump() (#29763 ) - additional_kwargs was being nested twice - example, response_metadata was placed inside additional_kwargs	2025-02-12 10:05:33 -08:00
Jacob Lee	f4e3e86fbb	feat(langchain): Infer o3 modelstrings passed to init_chat_model as OpenAI (#29743 )	2025-02-11 16:51:41 -08:00
Mohammad Mohtashim	9f3bcee30a	(Community): Adding Structured Support for ChatPerplexity (#29361 ) - Description: Adding Structured Support for ChatPerplexity - Issue: #29357 - This is implemented as per the Perplexity official docs: https://docs.perplexity.ai/guides/structured-outputs --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-11 15:51:18 -08:00
Jawahar S	994c5465e0	feat: add support for IBM WatsonX AI chat models (#29688 ) Description: Updated init_chat_model to support Granite models deployed on IBM WatsonX Dependencies: [langchain-ibm](https://github.com/langchain-ai/langchain-ibm) Tagging @baskaryan @efriis for review when you get a chance.	2025-02-11 15:34:29 -08:00
Shailendra Mishra	c7d74eb7a3	Oraclevs integration (#29723 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" community: langchain_community/vectorstore/oraclevs.py - [ ] PR message: *Delete this entire checklist* and replace with - Description: Refactored code to allow a connection or a connection pool. - Issue: Normally an idel connection is terminated by the server side listener at timeout. A user thus has to re-instantiate the vector store. The timeout in case of connection is not configurable. The solution is to use a connection pool where a user can specify a user defined timeout and the connections are managed by the pool. - Dependencies: None - Twitter handle: - [ ] Add tests and docs: This is not a new integration. A user can pass either a connection or a connection pool. The determination of what is passed is made at run time. Everything should work as before. - [ ] Lint and test: Already done. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-11 14:56:55 -08:00
ccurme	42ebf6ae0c	deepseek[patch]: release 0.1.2 (#29742 )	2025-02-11 11:53:43 -08:00
ccurme	ec55553807	pinecone[patch]: release 0.2.3 (#29741 )	2025-02-11 19:27:39 +00:00
ccurme	001cf99253	pinecone[patch]: add support for python 3.13 (#29737 )	2025-02-11 11:20:21 -08:00
ccurme	ba8f752bf5	openai[patch]: release 0.3.5 (#29740 )	2025-02-11 19:20:11 +00:00
ccurme	9477f49409	openai, deepseek: make _convert_chunk_to_generation_chunk an instance method (#29731 ) 1. Make `_convert_chunk_to_generation_chunk` an instance method on BaseChatOpenAI 2. Override on ChatDeepSeek to add `"reasoning_content"` to message additional_kwargs. Resolves https://github.com/langchain-ai/langchain/issues/29513	2025-02-11 11:13:23 -08:00
ccurme	d0c2dc06d5	mongodb[patch]: fix link in readme (#29738 )	2025-02-11 18:19:59 +00:00
zzaebok	3b3d52206f	community: change wikidata rest api version from v0 to v1 (#29708 ) Description: According to the [wikidata documentation](https://www.wikidata.org/wiki/Wikidata_talk:REST_API), Wikibase REST API version 1 (stable) is released from November 11, 2024. Their guide is to use the new v1 API and, it just requires replacing v0 in the routes with v1 in almost all cases. So I replaced WIKIDATA_REST_API_URL from v0 to v1 for stable usage. Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-10 17:12:38 -08:00
ccurme	4a389ef4c6	community: fix extended testing (#29715 ) v0.3.100 of premai sdk appears to break on import: `89d9276cbf/premai/api/__init__.py (L230)`	2025-02-10 16:57:34 -08:00
Bhav Sardana	624216aa64	community:Fix for Pydantic model validator of GoogleApiYoutubeLoader (#29694 ) - Description: Community: bugfix for pedantic model validator for GoogleApiYoutubeLoader - Issue: #29165, #27432 Fix is similar to #29346	2025-02-10 08:57:58 -05:00
Changyong Um	60740c44c5	community: Add configurable text key for indexing and the retriever in Pinecone Hybrid Search (#29697 ) issue In Langchain, the original content is generally stored under the `text` key. However, the `PineconeHybridSearchRetriever` searches the `context` field in the metadata and cannot change this key. To address this, I have modified the code to allow changing the key to something other than context. In my opinion, following Langchain's conventions, the `text` key seems more appropriate than `context`. However, since I wasn't sure about the author's intent, I have left the default value as `context`.	2025-02-10 08:56:37 -05:00
manukychen	3de445d521	using getattr and default value to prevent 'OpenSearchVectorSearch' has no attribute 'bulk_size' (#29682 ) - Description: Adding getattr methods and set default value 500 to cls.bulk_size, it can prevent the error below: Error: type object 'OpenSearchVectorSearch' has no attribute 'bulk_size' - Issue: https://github.com/langchain-ai/langchain/issues/29071	2025-02-08 14:39:57 -05:00
Yao Tianjia	5d581ba22c	langchain: support the situation when action_input is null in json output_parser (#29680 ) Description: This PR fixes handling of null action_input in [langchain.agents.output_parser]. Previously, passing null to action_input could cause OutputParserException with unclear error message which cause LLM don't know how to modify the action. The changes include: Added null-check validation before processing action_input Implemented proper fallback behavior with default values Maintained backward compatibility with existing implementations Error Examples: ``` { "action":"some action", "action_input":null } ``` Issue: None Dependencies: None	2025-02-07 22:01:01 -05:00
Philippe PRADOS	beb75b2150	community[minor]: 05 - Refactoring PyPDFium2 parser (#29625 ) This is one part of a larger Pull Request (PR) that is too large to be submitted all at once. This specific part focuses on updating the PyPDFium2 parser. For more details, see https://github.com/langchain-ai/langchain/pull/28970.	2025-02-07 21:31:12 -05:00
Christophe Bornet	723031d548	community: Bump ruff version to 0.9 (#29206 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 01:21:10 +00:00
Christophe Bornet	30f6c9f5c8	community: Use Blockbuster to detect blocking calls in asyncio during tests (#29609 ) Same as https://github.com/langchain-ai/langchain/pull/29043 for langchain-community. Dependencies: - blockbuster (test) Twitter handle: cbornet_ Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 01:10:39 +00:00
Christophe Bornet	3a57a28daa	langchain: Use Blockbuster to detect blocking calls in asyncio during tests (#29616 ) Same as https://github.com/langchain-ai/langchain/pull/29043 for the langchain package. Dependencies: - blockbuster (test) Twitter handle: cbornet_ --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 01:08:15 +00:00
Keenan Pepper	c67d473397	core: Make abatch_as_completed respect max_concurrency (#29426 ) - Description: Add tests for respecting max_concurrency and implement it for abatch_as_completed so that test passes - Issue: #29425 - Dependencies: none - Twitter handle: keenanpepper	2025-02-07 16:51:22 -08:00
Aaron V	dcfaae85d2	Core: Fix __add__ for concatting two BaseMessageChunk's (#29531 ) Description: The change allows you to use the overloaded `+` operator correctly when `+`ing two BaseMessageChunk subclasses. Without this you must instantiate a subclass for it to work. Which feels... wrong. Base classes should be decoupled from sub classes and should have in no way a dependency on them. Issue: You can't `+` a BaseMessageChunk with a BaseMessageChunk e.g. this will explode ```py from langchain_core.outputs import ( ChatGenerationChunk, ) from langchain_core.messages import BaseMessageChunk chunk1 = ChatGenerationChunk( message=BaseMessageChunk( type="customChunk", content="HI", ), ) chunk2 = ChatGenerationChunk( message=BaseMessageChunk( type="customChunk", content="HI", ), ) # this will throw new_chunk = chunk1 + chunk2 ``` In case anyone ran into this issue themselves, it's probably best to use the AIMessageChunk: a la ```py from langchain_core.outputs import ( ChatGenerationChunk, ) from langchain_core.messages import AIMessageChunk chunk1 = ChatGenerationChunk( message=AIMessageChunk( content="HI", ), ) chunk2 = ChatGenerationChunk( message=AIMessageChunk( content="HI", ), ) # No explosion! new_chunk = chunk1 + chunk2 ``` Dependencies: None! Twitter handle: `aaron_vogler` Keeping these for later if need be: ``` baskaryan efriis eyurtsev ccurme vbarda hwchase17 baskaryan efriis ``` Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 00:43:36 +00:00
Marlene	4fa3ef0d55	Community/Partner: Adding Azure community and partner user agent to better track usage in Python (#29561 ) - This pull request includes various changes to add a `user_agent` parameter to Azure OpenAI, Azure Search and Whisper in the Community and Partner packages. This helps in identifying the source of API requests so we can better track usage and help support the community better. I will also be adding the user_agent to the new `langchain-azure` repo as well. - No issue connected or updated dependencies. - Utilises existing tests and docs --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 23:28:30 +00:00
Ella Charlaix	c401254770	huggingface: Add ipex support to HuggingFaceEmbeddings (#29386 ) ONNX and OpenVINO models are available by specifying the `backend` argument (the model is loaded using `optimum` https://github.com/huggingface/optimum) ```python from langchain_huggingface import HuggingFaceEmbeddings embedding = HuggingFaceEmbeddings( model_name=model_id, model_kwargs={"backend": "onnx"}, ) ``` With this PR we also enable the IPEX backend ```python from langchain_huggingface import HuggingFaceEmbeddings embedding = HuggingFaceEmbeddings( model_name=model_id, model_kwargs={"backend": "ipex"}, ) ```	2025-02-07 15:21:09 -08:00
Bruno Alvisio	3eaf561561	core: Handle unterminated escape character when parsing partial JSON (#29065 ) Description Currently, when parsing a partial JSON, if a string ends with the escape character, the whole key/value is removed. For example: ``` >>> from langchain_core.utils.json import parse_partial_json >>> my_str = '{"foo": "bar", "baz": "qux\\' >>> >>> parse_partial_json(my_str) {'foo': 'bar'} ``` My expectation (and with this fix) would be for `parse_partial_json()` to return: ``` >>> from langchain_core.utils.json import parse_partial_json >>> >>> my_str = '{"foo": "bar", "baz": "qux\\' >>> parse_partial_json(my_str) {'foo': 'bar', 'baz': 'qux'} ``` Notes: 1. It could be argued that current behavior is still desired. 2. I have experienced this issue when the streaming output from an LLM and the chunk happens to end with `\\` 3. I haven't included tests. Will do if change is accepted. 4. This is specially troublesome when this function is used by `187131c55c/libs/core/langchain_core/output_parsers/transform.py (L111)` since what happens is that, for example, if the received sequence of chunks are: `{"foo": "b` , `ar\\` : Then, the result of calling `self.parse_result()` is: ``` {"foo": "b"} ``` and the second time: ``` {} ``` Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 23:18:21 +00:00
Viren	252cf0af10	docs: add LangFair as a provider (#29390 ) Description: - Add `docs/docs/providers/langfair.mdx` - Register langfair in `libs/packages.yml` Twitter handle: @LangFair Tests and docs 1. Integration tests not needed as this PR only adds a .mdx file to docs. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Dylan Bouchard <dylan.bouchard@cvshealth.com> Co-authored-by: Dylan Bouchard <109233938+dylanbouchard@users.noreply.github.com> Co-authored-by: Erick Friis <erickfriis@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 21:27:37 +00:00
Erick Friis	eb9eddae0c	docs: use init_chat_model (#29623 )	2025-02-07 12:39:27 -08:00
ccurme	bff25b552c	community: release 0.3.17 (#29676 )	2025-02-07 19:41:44 +00:00
ccurme	01314c51fa	langchain: release 0.3.18 (#29654 )	2025-02-07 13:40:26 -05:00
ccurme	92e2239414	openai[patch]: make parallel_tool_calls explicit kwarg of bind_tools (#29669 ) Improves discoverability and documentation. cc @vbarda	2025-02-07 13:34:32 -05:00
Marc Ammann	5690575f13	openai: Removed tool_calls from completion chunk after other chunks have already been sent. (#29649 ) - Description: Before sending a completion chunk at the end of an OpenAI stream, removing the tool_calls as those have already been sent as chunks. - Issue: - - Dependencies: - - Twitter handle: - @ccurme as mentioned in another PR --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-07 10:15:52 -05:00
Ikko Eltociear Ashimine	0d45ad57c1	community: update base_o365.py (#29657 ) extention -> extension	2025-02-07 08:43:29 -05:00
Vincent Emonet	3645181d0e	qdrant: Add `similarity_search_with_score_by_vector()` function to the `QdrantVectorStore` (#29641 ) Added `similarity_search_with_score_by_vector()` function to the `QdrantVectorStore` class. It is required when we want to query multiple time with the same embeddings. It was present in the now deprecated original `Qdrant` vectorstore implementation, but was absent from the new one. It is also implemented in a number of others `VectorStore` implementations I have added tests for this new function Note that I also argued in this discussion that it should be part of the general `VectorStore` https://github.com/langchain-ai/langchain/discussions/29638 Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 00:55:58 +00:00
ccurme	488cb4a739	anthropic: release 0.3.7 (#29653 )	2025-02-06 17:05:57 -05:00
ccurme	ab09490c20	openai: release 0.3.4 (#29652 )	2025-02-06 17:02:21 -05:00
ccurme	29a0c38cc3	openai[patch]: add test for message.name (#29651 )	2025-02-06 16:49:28 -05:00
ccurme	91cca827c0	tests: release 0.3.11 (#29648 )	2025-02-06 21:48:09 +00:00
Sunish Sheth	25ce1e211a	docs: Updating the imports for langchain-databricks to databricks-langchain (#29646 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2025-02-06 13:28:07 -08:00
ccurme	e1b593ae77	text-splitters[patch]: release 0.3.6 (#29647 )	2025-02-06 16:16:05 -05:00
ccurme	a91e58bc10	core: release 0.3.34 (#29644 )	2025-02-06 15:53:56 -05:00
Vincent Emonet	08b9eaaa6f	community: improve FastEmbedEmbeddings support for ONNX execution provider (e.g. GPU) (#29645 ) I made a change to how was implemented the support for GPU in `FastEmbedEmbeddings` to be more consistent with the existing implementation `langchain-qdrant` sparse embeddings implementation It is directly enabling to provide the list of ONNX execution providers: https://github.com/langchain-ai/langchain/blob/master/libs/partners/qdrant/langchain_qdrant/fastembed_sparse.py#L15 It is a bit less clear to a user that just wants to enable GPU, but gives more capabilities to work with other execution providers that are not the `CUDAExecutionProvider`, and is more future proof Sorry for the disturbance @ccurme > Nice to see you just moved to `uv`! It is so much nicer to run format/lint/test! No need to manually rerun the `poetry install` with all required extras now	2025-02-06 15:31:23 -05:00
ccurme	3450bfc806	infra: add UV_FROZEN to makefiles (#29642 ) These are set in Github workflows, but forgot to add them to most makefiles for convenience when developing locally. `uv run` will automatically sync the lock file. Because many of our development dependencies are local installs, it will pick up version changes and update the lock file. Passing `--frozen` or setting this environment variable disables the behavior.	2025-02-06 14:36:54 -05:00
ccurme	d172984c91	infra: migrate to uv (#29566 )	2025-02-06 13:36:26 -05:00
ccurme	9da06e6e94	standard-tests[patch]: use `has_structured_output` property to engage structured output tests (#29635 ) Motivation: dedicated structured output features are becoming more common, such that integrations can support structured output without supporting tool calling. Here we make two changes: 1. Update the `has_structured_output` method to default to True if a model supports tool calling (in addition to defaulting to True if `with_structured_output` is overridden). 2. Update structured output tests to engage if `has_structured_output` is True.	2025-02-06 10:09:06 -08:00
Vincent Emonet	db8201d4da	community: fix typo in the module imported when using GPU with FastEmbedEmbeddings (#29631 ) Made a mistake in the module to import (the module stay the same only the installed package changes), fixed it and tested it https://github.com/langchain-ai/langchain/pull/29627 @ccurme	2025-02-06 10:26:08 -05:00
Mohammed Abbadi	f8fd65dea2	community: Update deeplake.py (#29633 ) Deep Lake recently released version 4, which introduces significant architectural changes, including a new on-disk storage format, enhanced indexing mechanisms, and improved concurrency. However, LangChain's vector store integration currently does not support Deep Lake v4 due to breaking API changes. Previously, the installation command was: `pip install deeplake[enterprise]` This installs the latest available version, which now defaults to Deep Lake v4. Since LangChain's vector store integration is still dependent on v3, this can lead to compatibility issues when using Deep Lake as a vector database within LangChain. To ensure compatibility, the installation command has been updated to: `pip install deeplake[enterprise]<4.0.0` This constraint ensures that pip installs the latest available version of Deep Lake within the v3 series while avoiding the incompatible v4 update.	2025-02-06 10:25:13 -05:00
Vincent Emonet	0ac5536f04	community: add support for using GPUs with FastEmbedEmbeddings (#29627 ) - Description: add a `gpu: bool = False` field to the `FastEmbedEmbeddings` class which enables to use GPU (through ONNX CUDA provider) when generating embeddings with any fastembed model. It just requires the user to install a different dependency and we use a different provider when instantiating `fastembed.TextEmbedding` - Issue: when generating embeddings for a really large amount of documents this drastically increase performance (honestly that is a must have in some situations, you can't just use CPU it is way too slow) - Dependencies: no direct change to dependencies, but internally the users will need to install `fastembed-gpu` instead of `fastembed`, I made all the changes to the init function to properly let the user know which dependency they should install depending on if they enabled `gpu` or not cf. fastembed docs about GPU for more details: https://qdrant.github.io/fastembed/examples/FastEmbed_GPU/ I did not added test because it would require access to a GPU in the testing environment	2025-02-06 08:04:19 -05:00
Dmitrii Rashchenko	0ceda557aa	add o1 and o3-mini to pricing (#29628 ) ### PR Title: community: add latest OpenAI models pricing ### Description: This PR updates the OpenAI model cost calculation mapping by adding the latest OpenAI models, o1 (non-preview) and o3-mini, based on the pricing listed on the [OpenAI pricing page](https://platform.openai.com/docs/pricing). ### Changes: - Added pricing for `o1`, `o1-2024-12-17`, `o1-cached`, and `o1-2024-12-17-cached` for input tokens. - Added pricing for `o1-completion` and `o1-2024-12-17-completion` for output tokens. - Added pricing for `o3-mini`, `o3-mini-2025-01-31`, `o3-mini-cached`, and `o3-mini-2025-01-31-cached` for input tokens. - Added pricing for `o3-mini-completion` and `o3-mini-2025-01-31-completion` for output tokens. ### Issue: N/A ### Dependencies: None ### Testing & Validation: - No functional changes outside of updating the cost mapping. - No tests were added or modified.	2025-02-06 08:02:20 -05:00
ZhangShenao	ac53977dbc	[MistralAI] Improve MistralAIEmbeddings (#29242 ) - Add static method decorator for method. - Add expected exception for retry decorator #29125	2025-02-05 21:31:54 -05:00
Andrew Wason	22aa5e07ed	standard-tests: Fix ToolsIntegrationTests to correctly handle "content_and_artifact" tools (#29391 ) Description: The response from `tool.invoke()` is always a ToolMessage, with content and artifact fields, not a tuple. The tuple is converted to a ToolMessage here `b6ae7ca91d/libs/core/langchain_core/tools/base.py (L726)` Issue: Currently `ToolsIntegrationTests` requires `invoke()` to return a tuple and so standard tests fail for "content_and_artifact" tools. This fixes that to check the returned ToolMessage. This PR also adds a test that now passes.	2025-02-05 21:27:09 -05:00
Mohammad Anash	f849305a56	fixed Bug in PreFilter of AzureCosmosDBNoSqlVectorSearch (#29613 ) Description: Fixes PreFilter value handling in Azure Cosmos DB NoSQL vectorstore. The current implementation fails to handle numeric values in filter conditions, causing an undefined value variable error. This PR adds support for numeric, boolean, and NULL values while maintaining the existing string and list handling. Changes: Added handling for numeric types (int/float) Added boolean value support Added NULL value handling Added type validation for unsupported values Fixed scope of value variable initialization Issue: Fixes #29610 Implementation Notes: No changes to public API Backwards compatible Maintains consistent behavior with existing MongoDB-style filtering Preserves SQL injection prevention through proper value handling --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-06 02:20:26 +00:00

... 2 3 4 5 6 ...

6803 Commits