langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-06-19 21:33:51 +00:00

Author	SHA1	Message	Date
am-kinetica	ca7eccba1f	Handled a bug around empty query results differently (#29877 ) Thank you for contributing to LangChain! - [ ] Handled query records properly: "community: vectorstores/kinetica" - [ ] Bugfix for empty query results handling: - Description: checked for the number of records returned by a query before processing further - Issue: resulted in an `AttributeError` earlier which has now been fixed @efriis	2025-02-20 12:07:49 -05:00
Antonio Pisani	2c403a3ea9	docs: Add langchain-prolog documentation (#29788 ) I want to add documentation for a new integration with SWI-Prolog. @hwchase17 check this out: https://github.com/apisani1/langchain-prolog/tree/main/examples/travel_agent --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-20 11:50:28 -05:00
Marlene	be7fa920fa	Partner: Azure AI Langchain Docs and Package Registry (#29879 ) This PR adds documentation for the Azure AI package in Langchain to the main mono-repo No issue connected or updated dependencies. Utilises existing tests and makes updates to the docs --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-20 14:35:26 +00:00
Hankyeol Kyung	2dd0ce3077	openai: Update reasoning_effort arg documentation (#29897 ) Description: Update docstring for `reasoning_effort` argument to specify that it applies to reasoning models only (e.g., OpenAI o1 and o3-mini), clarifying its supported models. Issue: None Dependencies: None	2025-02-20 09:03:42 -05:00
ccurme	ed3c2bd557	core[patch]: set version="v2" as default in astream_events (#29894 )	2025-02-19 23:21:37 +00:00
Fabian Blatz	a2d05a376c	community: ConfluenceLoader: add a filter method for attachments (#29882 ) Adds a `attachment_filter_func` parameter to the ConfluenceLoader class which can be used to determine which files are indexed. This is useful if you are interested in excluding files based on their media type or other metadata.	2025-02-19 18:20:45 -05:00
ccurme	9ed47a4d63	community[patch]: release 0.3.18 (#29896 )	2025-02-19 20:13:00 +00:00
ccurme	92889edafd	core[patch]: release 0.3.37 (#29895 )	2025-02-19 20:04:35 +00:00
ccurme	ffd6194060	core[patch]: de-beta rate limiters (#29891 )	2025-02-19 19:19:59 +00:00
ccurme	fb4c8423f0	docs: fix builds (#29890 ) Missed in https://github.com/langchain-ai/langchain/pull/29889	2025-02-19 13:35:59 -05:00
ccurme	68b13e5172	pinecone: delete from monorepo (#29889 ) This now lives in https://github.com/langchain-ai/langchain-pinecone	2025-02-19 12:55:15 -05:00
Erick Friis	6c1e21d128	core: basemessage.text() (#29078 )	2025-02-18 17:45:44 -08:00
Eugene Yurtsev	8e5074d82d	core: release 0.3.36 (#29869 ) Release 0.3.36	2025-02-18 19:51:43 +00:00
Vadym Barda	d04fa1ae50	core[patch]: allow passing JSON schema as args_schema to tools (#29812 )	2025-02-18 14:44:31 -05:00
ccurme	5034a8dc5c	xai[patch]: release 0.2.1 (#29854 )	2025-02-17 14:30:41 -05:00
ccurme	83dcef234d	xai[patch]: support dedicated structured output feature (#29853 ) https://docs.x.ai/docs/guides/structured-outputs Interface appears identical to OpenAI's. ```python from langchain.chat_models import init_chat_model from pydantic import BaseModel class Joke(BaseModel): setup: str punchline: str llm = init_chat_model("xai:grok-2").with_structured_output( Joke, method="json_schema" ) llm.invoke("Tell me a joke about cats.") ```	2025-02-17 14:19:51 -05:00
ccurme	9d6fcd0bfb	infra: add xai to scheduled testing (#29852 )	2025-02-17 18:59:45 +00:00
ccurme	8a3b05ae69	langchain[patch]: release 0.3.19 (#29851 )	2025-02-17 13:36:23 -05:00
ccurme	c9061162a1	langchain[patch]: add xai to extras (#29850 )	2025-02-17 17:49:34 +00:00
Bagatur	1acf57e9bd	langchain[patch]: init_chat_model xai support (#29849 )	2025-02-17 09:45:39 -08:00
hsm207	037b129b86	weaviate: Add-deprecation-warning (#29757 ) - Description: add deprecation warning when using weaviate from langchain_community - Issue: NA - Dependencies: NA - Twitter handle: NA --------- Signed-off-by: hsm207 <hsm207@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-16 21:42:18 -05:00
Đỗ Quang Minh	cd198ac9ed	community: add custom model for OpenAIWhisperParser (#29831 ) Add `model` properties for OpenAIWhisperParser. Defaulted to `whisper-1` (previous value). Please help me update the docs and other related components of this repo.	2025-02-16 21:26:07 -05:00
Cole McIntosh	6874c9c1d0	docs: add notebook for langchain-salesforce package (#29800 ) Description: This PR adds a Jupyter notebook that explains the features, installation, and usage of the [`langchain-salesforce`](https://github.com/colesmcintosh/langchain-salesforce) package. The notebook includes: - Setup instructions for configuring Salesforce credentials - Example code demonstrating common operations such as querying, describing objects, creating, updating, and deleting records Issue: N/A Dependencies: No new dependencies are required. Tests and Docs: - Added an example notebook demonstrating the usage of the `langchain-salesforce` package, located in `docs/docs/integrations`. Lint and Test: - Ran `make format`, `make lint`, and `make test` successfully. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-16 08:34:57 -05:00
Jan Heimes	60f58df5b3	community: add top_k as param to Needle Retriever (#29821 ) Thank you for contributing to LangChain! - [X] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: This PR adds top_k as a param to the Needle Retriever. By default we use top 10. - [X] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2025-02-16 08:30:52 -05:00
Jesus Fernandez Bes	1dfac909d8	community: Adding IN Operator to AzureCosmosDBNoSQLVectorStore (#29805 ) - Description: I have added a new operator in the operator map with key `$in` and value `IN`, so that you can define filters using lists as values. This was already contemplated but as IN operator was not in the map they cannot be used. - Issue: Fixes #29804. - Dependencies: No extra.	2025-02-15 21:44:54 -05:00
Wahed Hemati	8901b113c3	docs: add Discord integration docs (#29822 ) This PR adds documentation for the `langchain-discord-shikenso` integration, including an example notebook at `docs/docs/integrations/tools/discord.ipynb` and updates to `libs/packages.yml` to track the new package. Issue: N/A Dependencies: None Twitter handle: N/A --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-15 21:43:45 -05:00
Krishna Kulkarni	a98c5f1c4b	langchain_community: add image support to DuckDuckGoSearchAPIWrapper (#29816 ) - [ ] PR title: langchain_community: add image support to DuckDuckGoSearchAPIWrapper - Description: This PR enhances the DuckDuckGoSearchAPIWrapper within the langchain_community package by introducing support for image searches. The enhancement includes: - Adding a new method _ddgs_images to handle image search queries. - Updating the run and results methods to process and return image search results appropriately. - Modifying the source parameter to accept "images" as a valid option, alongside "text" and "news". - Dependencies: No additional dependencies are required for this change.	2025-02-15 21:32:14 -05:00
Iris Liu	0d9f0b4215	docs: updates Chroma integration API ref docs (#29826 ) - Description: updates Chroma integration API ref docs - Issue: #29817 - Dependencies: N/A - Twitter handle: @irieliu Co-authored-by: “Iris <“liuirisny@gmail.com”>	2025-02-15 21:05:21 -05:00
ccurme	3fe7c07394	openai[patch]: release 0.3.6 (#29824 )	2025-02-15 13:53:35 -05:00
ccurme	65a6dce428	openai[patch]: enable streaming for o1 (#29823 ) Verified streaming works for the `o1-2024-12-17` snapshot as well.	2025-02-15 12:42:05 -05:00
Christophe Bornet	3dffee3d0b	all: Bump blockbuster version to 1.5.18 (#29806 ) Has fixes for running on Windows and non-CPython runtimes.	2025-02-14 07:55:38 -08:00
ccurme	d9a069c414	tests[patch]: release 0.3.12 (#29797 )	2025-02-13 23:57:44 +00:00
ccurme	e4f106ea62	groq[patch]: remove xfails (#29794 ) These appear to pass.	2025-02-13 15:49:50 -08:00
Erick Friis	f34e62ef42	packages: add langchain-xai (#29795 ) wasn't registered per the contribution guide: https://python.langchain.com/docs/contributing/how_to/integrations/	2025-02-13 15:36:41 -08:00
ccurme	49cc6106f7	tests[patch]: fix query for test_tool_calling_with_no_arguments (#29793 )	2025-02-13 23:15:52 +00:00
Erick Friis	1a225fad03	multiple: fix uv path deps (#29790 ) file:// format wasn't working with updates - it doesn't install as an editable dep move to tool.uv.sources with path= instead	2025-02-13 21:32:34 +00:00
Erick Friis	ff13384eb6	packages: update counts, add command (#29789 )	2025-02-13 20:45:25 +00:00
HackHuang	76d32754ff	core : update the class docs of InMemoryVectorStore in in_memory.py (#29781 ) - Description: Add the new introduction about checking `store` in in_memory.py, It’s necessary and useful for beginners. ```python Check Documents: .. code-block:: python for doc in vector_store.store.values(): print(doc) ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-13 16:41:47 +00:00
Mohammad Mohtashim	96ad09fa2d	(Community): Added API Key for Jina Search API Wrapper (#29622 ) - Description: Simple change for adding the API Key for Jina Search API Wrapper - Issue: #29596	2025-02-12 20:12:07 -08:00
ccurme	f1c66a3040	docs: minor fix to provider table (#29771 ) Langfair renders as LangfAIr	2025-02-13 04:06:58 +00:00
Jakub Kopecký	c8cb7c25bf	docs: update apify integration (#29553 ) Description: Fixed and updated Apify integration documentation to use the new [langchain-apify](https://github.com/apify/langchain-apify) package. Twitter handle: @apify	2025-02-12 20:02:55 -08:00
ccurme	16fb1f5371	chroma[patch]: release 0.2.2 (#29769 ) Resolves https://github.com/langchain-ai/langchain/issues/29765	2025-02-13 02:39:16 +00:00
Mohammad Mohtashim	2310847c0f	(Chroma): Small Fix in `add_texts` when checking for embeddings (#29766 ) - Description: Small fix in `add_texts` to make embedding nullability is checked properly. - Issue: #29765 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-13 02:26:13 +00:00
Eric Pinzur	716fd89d8e	docs: contributed `Graph RAG` Retriever integration (#29744 ) Description: This adds the `Graph RAG` Retriever integration documentation, per https://python.langchain.com/docs/contributing/how_to/integrations/. * The integration exists in this public repository: https://github.com/datastax/graph-rag * We've implemented the standard langchain tests for retrievers: https://github.com/datastax/graph-rag/blob/main/packages/langchain-graph-retriever/tests/test_langchain.py * Our integration is published to PyPi: https://pypi.org/project/langchain-graph-retriever/	2025-02-12 18:25:48 -08:00
Sunish Sheth	f42dafa809	Deprecating sql_database access for creating UC functions for agent tools (#29745 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-13 02:24:44 +00:00
Thor 雷神 Schaeff	a0970d8d7e	[WIP] chore: update ElevenLabs tool. (#29722 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-13 01:54:34 +00:00
Chaymae El Aattabi	4b08a7e8e8	Fix #29759 : Use local chunk_size_ for looping in embed_documents (#29761 ) This fix ensures that the chunk size is correctly determined when processing text embeddings. Previously, the code did not properly handle cases where chunk_size was None, potentially leading to incorrect chunking behavior. Now, chunk_size_ is explicitly set to either the provided chunk_size or the default self.chunk_size, ensuring consistent chunking. This update improves reliability when processing large text inputs in batches and prevents unintended behavior when chunk_size is not specified. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-13 01:28:26 +00:00
Sunish Sheth	043d78d85d	Deprecate langhchain community ucfunctiontoolkit in favor for databricks_langchain (#29746 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2025-02-12 15:50:35 -08:00
Hugues Chocart	e4eec9e9aa	community: add langchain-abso documentation (#29739 ) Add the documentation for the community package `langchain-abso`. It provides a new Chat Model class, that uses https://abso.ai --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2025-02-12 19:57:33 +00:00
ccurme	e61f463745	core[patch]: release 0.3.35 (#29764 )	2025-02-12 18:13:10 +00:00
Nuno Campos	fe59f2cc88	core: Fix output of convert_messages when called with BaseMessage.model_dump() (#29763 ) - additional_kwargs was being nested twice - example, response_metadata was placed inside additional_kwargs	2025-02-12 10:05:33 -08:00
Jacob Lee	f4e3e86fbb	feat(langchain): Infer o3 modelstrings passed to init_chat_model as OpenAI (#29743 )	2025-02-11 16:51:41 -08:00
Mohammad Mohtashim	9f3bcee30a	(Community): Adding Structured Support for ChatPerplexity (#29361 ) - Description: Adding Structured Support for ChatPerplexity - Issue: #29357 - This is implemented as per the Perplexity official docs: https://docs.perplexity.ai/guides/structured-outputs --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-11 15:51:18 -08:00
Jawahar S	994c5465e0	feat: add support for IBM WatsonX AI chat models (#29688 ) Description: Updated init_chat_model to support Granite models deployed on IBM WatsonX Dependencies: [langchain-ibm](https://github.com/langchain-ai/langchain-ibm) Tagging @baskaryan @efriis for review when you get a chance.	2025-02-11 15:34:29 -08:00
Shailendra Mishra	c7d74eb7a3	Oraclevs integration (#29723 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" community: langchain_community/vectorstore/oraclevs.py - [ ] PR message: *Delete this entire checklist* and replace with - Description: Refactored code to allow a connection or a connection pool. - Issue: Normally an idel connection is terminated by the server side listener at timeout. A user thus has to re-instantiate the vector store. The timeout in case of connection is not configurable. The solution is to use a connection pool where a user can specify a user defined timeout and the connections are managed by the pool. - Dependencies: None - Twitter handle: - [ ] Add tests and docs: This is not a new integration. A user can pass either a connection or a connection pool. The determination of what is passed is made at run time. Everything should work as before. - [ ] Lint and test: Already done. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-11 14:56:55 -08:00
ccurme	42ebf6ae0c	deepseek[patch]: release 0.1.2 (#29742 )	2025-02-11 11:53:43 -08:00
ccurme	ec55553807	pinecone[patch]: release 0.2.3 (#29741 )	2025-02-11 19:27:39 +00:00
ccurme	001cf99253	pinecone[patch]: add support for python 3.13 (#29737 )	2025-02-11 11:20:21 -08:00
ccurme	ba8f752bf5	openai[patch]: release 0.3.5 (#29740 )	2025-02-11 19:20:11 +00:00
ccurme	9477f49409	openai, deepseek: make _convert_chunk_to_generation_chunk an instance method (#29731 ) 1. Make `_convert_chunk_to_generation_chunk` an instance method on BaseChatOpenAI 2. Override on ChatDeepSeek to add `"reasoning_content"` to message additional_kwargs. Resolves https://github.com/langchain-ai/langchain/issues/29513	2025-02-11 11:13:23 -08:00
ccurme	d0c2dc06d5	mongodb[patch]: fix link in readme (#29738 )	2025-02-11 18:19:59 +00:00
zzaebok	3b3d52206f	community: change wikidata rest api version from v0 to v1 (#29708 ) Description: According to the [wikidata documentation](https://www.wikidata.org/wiki/Wikidata_talk:REST_API), Wikibase REST API version 1 (stable) is released from November 11, 2024. Their guide is to use the new v1 API and, it just requires replacing v0 in the routes with v1 in almost all cases. So I replaced WIKIDATA_REST_API_URL from v0 to v1 for stable usage. Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-10 17:12:38 -08:00
ccurme	4a389ef4c6	community: fix extended testing (#29715 ) v0.3.100 of premai sdk appears to break on import: `89d9276cbf/premai/api/__init__.py (L230)`	2025-02-10 16:57:34 -08:00
Bhav Sardana	624216aa64	community:Fix for Pydantic model validator of GoogleApiYoutubeLoader (#29694 ) - Description: Community: bugfix for pedantic model validator for GoogleApiYoutubeLoader - Issue: #29165, #27432 Fix is similar to #29346	2025-02-10 08:57:58 -05:00
Changyong Um	60740c44c5	community: Add configurable text key for indexing and the retriever in Pinecone Hybrid Search (#29697 ) issue In Langchain, the original content is generally stored under the `text` key. However, the `PineconeHybridSearchRetriever` searches the `context` field in the metadata and cannot change this key. To address this, I have modified the code to allow changing the key to something other than context. In my opinion, following Langchain's conventions, the `text` key seems more appropriate than `context`. However, since I wasn't sure about the author's intent, I have left the default value as `context`.	2025-02-10 08:56:37 -05:00
manukychen	3de445d521	using getattr and default value to prevent 'OpenSearchVectorSearch' has no attribute 'bulk_size' (#29682 ) - Description: Adding getattr methods and set default value 500 to cls.bulk_size, it can prevent the error below: Error: type object 'OpenSearchVectorSearch' has no attribute 'bulk_size' - Issue: https://github.com/langchain-ai/langchain/issues/29071	2025-02-08 14:39:57 -05:00
Yao Tianjia	5d581ba22c	langchain: support the situation when action_input is null in json output_parser (#29680 ) Description: This PR fixes handling of null action_input in [langchain.agents.output_parser]. Previously, passing null to action_input could cause OutputParserException with unclear error message which cause LLM don't know how to modify the action. The changes include: Added null-check validation before processing action_input Implemented proper fallback behavior with default values Maintained backward compatibility with existing implementations Error Examples: ``` { "action":"some action", "action_input":null } ``` Issue: None Dependencies: None	2025-02-07 22:01:01 -05:00
Philippe PRADOS	beb75b2150	community[minor]: 05 - Refactoring PyPDFium2 parser (#29625 ) This is one part of a larger Pull Request (PR) that is too large to be submitted all at once. This specific part focuses on updating the PyPDFium2 parser. For more details, see https://github.com/langchain-ai/langchain/pull/28970.	2025-02-07 21:31:12 -05:00
Christophe Bornet	723031d548	community: Bump ruff version to 0.9 (#29206 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 01:21:10 +00:00
Christophe Bornet	30f6c9f5c8	community: Use Blockbuster to detect blocking calls in asyncio during tests (#29609 ) Same as https://github.com/langchain-ai/langchain/pull/29043 for langchain-community. Dependencies: - blockbuster (test) Twitter handle: cbornet_ Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 01:10:39 +00:00
Christophe Bornet	3a57a28daa	langchain: Use Blockbuster to detect blocking calls in asyncio during tests (#29616 ) Same as https://github.com/langchain-ai/langchain/pull/29043 for the langchain package. Dependencies: - blockbuster (test) Twitter handle: cbornet_ --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 01:08:15 +00:00
Keenan Pepper	c67d473397	core: Make abatch_as_completed respect max_concurrency (#29426 ) - Description: Add tests for respecting max_concurrency and implement it for abatch_as_completed so that test passes - Issue: #29425 - Dependencies: none - Twitter handle: keenanpepper	2025-02-07 16:51:22 -08:00
Aaron V	dcfaae85d2	Core: Fix __add__ for concatting two BaseMessageChunk's (#29531 ) Description: The change allows you to use the overloaded `+` operator correctly when `+`ing two BaseMessageChunk subclasses. Without this you must instantiate a subclass for it to work. Which feels... wrong. Base classes should be decoupled from sub classes and should have in no way a dependency on them. Issue: You can't `+` a BaseMessageChunk with a BaseMessageChunk e.g. this will explode ```py from langchain_core.outputs import ( ChatGenerationChunk, ) from langchain_core.messages import BaseMessageChunk chunk1 = ChatGenerationChunk( message=BaseMessageChunk( type="customChunk", content="HI", ), ) chunk2 = ChatGenerationChunk( message=BaseMessageChunk( type="customChunk", content="HI", ), ) # this will throw new_chunk = chunk1 + chunk2 ``` In case anyone ran into this issue themselves, it's probably best to use the AIMessageChunk: a la ```py from langchain_core.outputs import ( ChatGenerationChunk, ) from langchain_core.messages import AIMessageChunk chunk1 = ChatGenerationChunk( message=AIMessageChunk( content="HI", ), ) chunk2 = ChatGenerationChunk( message=AIMessageChunk( content="HI", ), ) # No explosion! new_chunk = chunk1 + chunk2 ``` Dependencies: None! Twitter handle: `aaron_vogler` Keeping these for later if need be: ``` baskaryan efriis eyurtsev ccurme vbarda hwchase17 baskaryan efriis ``` Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 00:43:36 +00:00
Marlene	4fa3ef0d55	Community/Partner: Adding Azure community and partner user agent to better track usage in Python (#29561 ) - This pull request includes various changes to add a `user_agent` parameter to Azure OpenAI, Azure Search and Whisper in the Community and Partner packages. This helps in identifying the source of API requests so we can better track usage and help support the community better. I will also be adding the user_agent to the new `langchain-azure` repo as well. - No issue connected or updated dependencies. - Utilises existing tests and docs --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 23:28:30 +00:00
Ella Charlaix	c401254770	huggingface: Add ipex support to HuggingFaceEmbeddings (#29386 ) ONNX and OpenVINO models are available by specifying the `backend` argument (the model is loaded using `optimum` https://github.com/huggingface/optimum) ```python from langchain_huggingface import HuggingFaceEmbeddings embedding = HuggingFaceEmbeddings( model_name=model_id, model_kwargs={"backend": "onnx"}, ) ``` With this PR we also enable the IPEX backend ```python from langchain_huggingface import HuggingFaceEmbeddings embedding = HuggingFaceEmbeddings( model_name=model_id, model_kwargs={"backend": "ipex"}, ) ```	2025-02-07 15:21:09 -08:00
Bruno Alvisio	3eaf561561	core: Handle unterminated escape character when parsing partial JSON (#29065 ) Description Currently, when parsing a partial JSON, if a string ends with the escape character, the whole key/value is removed. For example: ``` >>> from langchain_core.utils.json import parse_partial_json >>> my_str = '{"foo": "bar", "baz": "qux\\' >>> >>> parse_partial_json(my_str) {'foo': 'bar'} ``` My expectation (and with this fix) would be for `parse_partial_json()` to return: ``` >>> from langchain_core.utils.json import parse_partial_json >>> >>> my_str = '{"foo": "bar", "baz": "qux\\' >>> parse_partial_json(my_str) {'foo': 'bar', 'baz': 'qux'} ``` Notes: 1. It could be argued that current behavior is still desired. 2. I have experienced this issue when the streaming output from an LLM and the chunk happens to end with `\\` 3. I haven't included tests. Will do if change is accepted. 4. This is specially troublesome when this function is used by `187131c55c/libs/core/langchain_core/output_parsers/transform.py (L111)` since what happens is that, for example, if the received sequence of chunks are: `{"foo": "b` , `ar\\` : Then, the result of calling `self.parse_result()` is: ``` {"foo": "b"} ``` and the second time: ``` {} ``` Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 23:18:21 +00:00
Viren	252cf0af10	docs: add LangFair as a provider (#29390 ) Description: - Add `docs/docs/providers/langfair.mdx` - Register langfair in `libs/packages.yml` Twitter handle: @LangFair Tests and docs 1. Integration tests not needed as this PR only adds a .mdx file to docs. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Dylan Bouchard <dylan.bouchard@cvshealth.com> Co-authored-by: Dylan Bouchard <109233938+dylanbouchard@users.noreply.github.com> Co-authored-by: Erick Friis <erickfriis@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 21:27:37 +00:00
Erick Friis	eb9eddae0c	docs: use init_chat_model (#29623 )	2025-02-07 12:39:27 -08:00
ccurme	bff25b552c	community: release 0.3.17 (#29676 )	2025-02-07 19:41:44 +00:00
ccurme	01314c51fa	langchain: release 0.3.18 (#29654 )	2025-02-07 13:40:26 -05:00
ccurme	92e2239414	openai[patch]: make parallel_tool_calls explicit kwarg of bind_tools (#29669 ) Improves discoverability and documentation. cc @vbarda	2025-02-07 13:34:32 -05:00
Marc Ammann	5690575f13	openai: Removed tool_calls from completion chunk after other chunks have already been sent. (#29649 ) - Description: Before sending a completion chunk at the end of an OpenAI stream, removing the tool_calls as those have already been sent as chunks. - Issue: - - Dependencies: - - Twitter handle: - @ccurme as mentioned in another PR --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-07 10:15:52 -05:00
Ikko Eltociear Ashimine	0d45ad57c1	community: update base_o365.py (#29657 ) extention -> extension	2025-02-07 08:43:29 -05:00
Vincent Emonet	3645181d0e	qdrant: Add `similarity_search_with_score_by_vector()` function to the `QdrantVectorStore` (#29641 ) Added `similarity_search_with_score_by_vector()` function to the `QdrantVectorStore` class. It is required when we want to query multiple time with the same embeddings. It was present in the now deprecated original `Qdrant` vectorstore implementation, but was absent from the new one. It is also implemented in a number of others `VectorStore` implementations I have added tests for this new function Note that I also argued in this discussion that it should be part of the general `VectorStore` https://github.com/langchain-ai/langchain/discussions/29638 Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 00:55:58 +00:00
ccurme	488cb4a739	anthropic: release 0.3.7 (#29653 )	2025-02-06 17:05:57 -05:00
ccurme	ab09490c20	openai: release 0.3.4 (#29652 )	2025-02-06 17:02:21 -05:00
ccurme	29a0c38cc3	openai[patch]: add test for message.name (#29651 )	2025-02-06 16:49:28 -05:00
ccurme	91cca827c0	tests: release 0.3.11 (#29648 )	2025-02-06 21:48:09 +00:00
Sunish Sheth	25ce1e211a	docs: Updating the imports for langchain-databricks to databricks-langchain (#29646 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2025-02-06 13:28:07 -08:00
ccurme	e1b593ae77	text-splitters[patch]: release 0.3.6 (#29647 )	2025-02-06 16:16:05 -05:00
ccurme	a91e58bc10	core: release 0.3.34 (#29644 )	2025-02-06 15:53:56 -05:00
Vincent Emonet	08b9eaaa6f	community: improve FastEmbedEmbeddings support for ONNX execution provider (e.g. GPU) (#29645 ) I made a change to how was implemented the support for GPU in `FastEmbedEmbeddings` to be more consistent with the existing implementation `langchain-qdrant` sparse embeddings implementation It is directly enabling to provide the list of ONNX execution providers: https://github.com/langchain-ai/langchain/blob/master/libs/partners/qdrant/langchain_qdrant/fastembed_sparse.py#L15 It is a bit less clear to a user that just wants to enable GPU, but gives more capabilities to work with other execution providers that are not the `CUDAExecutionProvider`, and is more future proof Sorry for the disturbance @ccurme > Nice to see you just moved to `uv`! It is so much nicer to run format/lint/test! No need to manually rerun the `poetry install` with all required extras now	2025-02-06 15:31:23 -05:00
ccurme	3450bfc806	infra: add UV_FROZEN to makefiles (#29642 ) These are set in Github workflows, but forgot to add them to most makefiles for convenience when developing locally. `uv run` will automatically sync the lock file. Because many of our development dependencies are local installs, it will pick up version changes and update the lock file. Passing `--frozen` or setting this environment variable disables the behavior.	2025-02-06 14:36:54 -05:00
ccurme	d172984c91	infra: migrate to uv (#29566 )	2025-02-06 13:36:26 -05:00
ccurme	9da06e6e94	standard-tests[patch]: use `has_structured_output` property to engage structured output tests (#29635 ) Motivation: dedicated structured output features are becoming more common, such that integrations can support structured output without supporting tool calling. Here we make two changes: 1. Update the `has_structured_output` method to default to True if a model supports tool calling (in addition to defaulting to True if `with_structured_output` is overridden). 2. Update structured output tests to engage if `has_structured_output` is True.	2025-02-06 10:09:06 -08:00
Vincent Emonet	db8201d4da	community: fix typo in the module imported when using GPU with FastEmbedEmbeddings (#29631 ) Made a mistake in the module to import (the module stay the same only the installed package changes), fixed it and tested it https://github.com/langchain-ai/langchain/pull/29627 @ccurme	2025-02-06 10:26:08 -05:00
Mohammed Abbadi	f8fd65dea2	community: Update deeplake.py (#29633 ) Deep Lake recently released version 4, which introduces significant architectural changes, including a new on-disk storage format, enhanced indexing mechanisms, and improved concurrency. However, LangChain's vector store integration currently does not support Deep Lake v4 due to breaking API changes. Previously, the installation command was: `pip install deeplake[enterprise]` This installs the latest available version, which now defaults to Deep Lake v4. Since LangChain's vector store integration is still dependent on v3, this can lead to compatibility issues when using Deep Lake as a vector database within LangChain. To ensure compatibility, the installation command has been updated to: `pip install deeplake[enterprise]<4.0.0` This constraint ensures that pip installs the latest available version of Deep Lake within the v3 series while avoiding the incompatible v4 update.	2025-02-06 10:25:13 -05:00
Vincent Emonet	0ac5536f04	community: add support for using GPUs with FastEmbedEmbeddings (#29627 ) - Description: add a `gpu: bool = False` field to the `FastEmbedEmbeddings` class which enables to use GPU (through ONNX CUDA provider) when generating embeddings with any fastembed model. It just requires the user to install a different dependency and we use a different provider when instantiating `fastembed.TextEmbedding` - Issue: when generating embeddings for a really large amount of documents this drastically increase performance (honestly that is a must have in some situations, you can't just use CPU it is way too slow) - Dependencies: no direct change to dependencies, but internally the users will need to install `fastembed-gpu` instead of `fastembed`, I made all the changes to the init function to properly let the user know which dependency they should install depending on if they enabled `gpu` or not cf. fastembed docs about GPU for more details: https://qdrant.github.io/fastembed/examples/FastEmbed_GPU/ I did not added test because it would require access to a GPU in the testing environment	2025-02-06 08:04:19 -05:00
Dmitrii Rashchenko	0ceda557aa	add o1 and o3-mini to pricing (#29628 ) ### PR Title: community: add latest OpenAI models pricing ### Description: This PR updates the OpenAI model cost calculation mapping by adding the latest OpenAI models, o1 (non-preview) and o3-mini, based on the pricing listed on the [OpenAI pricing page](https://platform.openai.com/docs/pricing). ### Changes: - Added pricing for `o1`, `o1-2024-12-17`, `o1-cached`, and `o1-2024-12-17-cached` for input tokens. - Added pricing for `o1-completion` and `o1-2024-12-17-completion` for output tokens. - Added pricing for `o3-mini`, `o3-mini-2025-01-31`, `o3-mini-cached`, and `o3-mini-2025-01-31-cached` for input tokens. - Added pricing for `o3-mini-completion` and `o3-mini-2025-01-31-completion` for output tokens. ### Issue: N/A ### Dependencies: None ### Testing & Validation: - No functional changes outside of updating the cost mapping. - No tests were added or modified.	2025-02-06 08:02:20 -05:00
ZhangShenao	ac53977dbc	[MistralAI] Improve MistralAIEmbeddings (#29242 ) - Add static method decorator for method. - Add expected exception for retry decorator #29125	2025-02-05 21:31:54 -05:00

1 2 3 4 5 ...

6605 Commits