langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-09-04 12:39:32 +00:00

Author	SHA1	Message	Date
Brayden Zhong	a70f31de5f	Community: RankLLMRerank AttributeError (Handle list-based rerank results) (#29840 ) # community: Fix AttributeError in RankLLMRerank (`list` object has no attribute `candidates`) ## Description This PR fixes an issue in `RankLLMRerank` where reranking fails with the following error: ``` AttributeError: 'list' object has no attribute 'candidates' ``` The issue arises because `rerank_batch()` returns a `List[Result]` instead of an object containing `.candidates`. ### Changes Introduced - Adjusted `compress_documents()` to support both: - Old API format: `rerank_results.candidates` - New API format: `rerank_results` as a list - Also fix wrong .txt location parsing while I was at it. --- ## Issue Fixes AttributeError in `RankLLMRerank` when using `compression_retriever.invoke()`. The issue is observed when `rerank_batch()` returns a list instead of an object with `.candidates`. Relevant log: ``` AttributeError: 'list' object has no attribute 'candidates' ``` ## Dependencies - No additional dependencies introduced. --- ## Checklist - [x] Backward compatible with previous API versions - [x] Tested locally with different RankLLM models - [x] No new dependencies introduced - [x] Linted with `make format && make lint` - [x] Ready for review --- ## Testing - Ran `compression_retriever.invoke(query)` ## Reviewers If no review within a few days, please @mention one of: - @baskaryan - @efriis - @eyurtsev - @ccurme - @vbarda - @hwchase17	2025-02-20 12:38:31 -05:00
Levon Ghukasyan	ec403c442a	Separate deepale vector store (#29902 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-20 17:37:19 +00:00
dokato	92b415a9f6	community: Made some Jira fields optional for agent to work correctly (#29876 ) Description: Two small changes have been proposed here: (1) Previous code assumes that every issue has a priority field. If an issue lacks this field, the code will raise a KeyError. Now, the code checks if priority exists before accessing it. If priority is missing, it assigns None instead of crashing. This prevents runtime errors when processing issues without a priority. (2) Also If the "style" field is missing, the code throws a KeyError. `.get("style", None)` safely retrieves the value if present. Issue: #29875 Dependencies: N/A	2025-02-20 12:10:11 -05:00
am-kinetica	ca7eccba1f	Handled a bug around empty query results differently (#29877 ) Thank you for contributing to LangChain! - [ ] Handled query records properly: "community: vectorstores/kinetica" - [ ] Bugfix for empty query results handling: - Description: checked for the number of records returned by a query before processing further - Issue: resulted in an `AttributeError` earlier which has now been fixed @efriis	2025-02-20 12:07:49 -05:00
Fabian Blatz	a2d05a376c	community: ConfluenceLoader: add a filter method for attachments (#29882 ) Adds a `attachment_filter_func` parameter to the ConfluenceLoader class which can be used to determine which files are indexed. This is useful if you are interested in excluding files based on their media type or other metadata.	2025-02-19 18:20:45 -05:00
hsm207	037b129b86	weaviate: Add-deprecation-warning (#29757 ) - Description: add deprecation warning when using weaviate from langchain_community - Issue: NA - Dependencies: NA - Twitter handle: NA --------- Signed-off-by: hsm207 <hsm207@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-16 21:42:18 -05:00
Đỗ Quang Minh	cd198ac9ed	community: add custom model for OpenAIWhisperParser (#29831 ) Add `model` properties for OpenAIWhisperParser. Defaulted to `whisper-1` (previous value). Please help me update the docs and other related components of this repo.	2025-02-16 21:26:07 -05:00
Jan Heimes	60f58df5b3	community: add top_k as param to Needle Retriever (#29821 ) Thank you for contributing to LangChain! - [X] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: This PR adds top_k as a param to the Needle Retriever. By default we use top 10. - [X] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2025-02-16 08:30:52 -05:00
Jesus Fernandez Bes	1dfac909d8	community: Adding IN Operator to AzureCosmosDBNoSQLVectorStore (#29805 ) - Description: I have added a new operator in the operator map with key `$in` and value `IN`, so that you can define filters using lists as values. This was already contemplated but as IN operator was not in the map they cannot be used. - Issue: Fixes #29804. - Dependencies: No extra.	2025-02-15 21:44:54 -05:00
Krishna Kulkarni	a98c5f1c4b	langchain_community: add image support to DuckDuckGoSearchAPIWrapper (#29816 ) - [ ] PR title: langchain_community: add image support to DuckDuckGoSearchAPIWrapper - Description: This PR enhances the DuckDuckGoSearchAPIWrapper within the langchain_community package by introducing support for image searches. The enhancement includes: - Adding a new method _ddgs_images to handle image search queries. - Updating the run and results methods to process and return image search results appropriately. - Modifying the source parameter to accept "images" as a valid option, alongside "text" and "news". - Dependencies: No additional dependencies are required for this change.	2025-02-15 21:32:14 -05:00
Mohammad Mohtashim	96ad09fa2d	(Community): Added API Key for Jina Search API Wrapper (#29622 ) - Description: Simple change for adding the API Key for Jina Search API Wrapper - Issue: #29596	2025-02-12 20:12:07 -08:00
Jakub Kopecký	c8cb7c25bf	docs: update apify integration (#29553 ) Description: Fixed and updated Apify integration documentation to use the new [langchain-apify](https://github.com/apify/langchain-apify) package. Twitter handle: @apify	2025-02-12 20:02:55 -08:00
Sunish Sheth	f42dafa809	Deprecating sql_database access for creating UC functions for agent tools (#29745 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-13 02:24:44 +00:00
Thor 雷神 Schaeff	a0970d8d7e	[WIP] chore: update ElevenLabs tool. (#29722 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-13 01:54:34 +00:00
Sunish Sheth	043d78d85d	Deprecate langhchain community ucfunctiontoolkit in favor for databricks_langchain (#29746 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2025-02-12 15:50:35 -08:00
Mohammad Mohtashim	9f3bcee30a	(Community): Adding Structured Support for ChatPerplexity (#29361 ) - Description: Adding Structured Support for ChatPerplexity - Issue: #29357 - This is implemented as per the Perplexity official docs: https://docs.perplexity.ai/guides/structured-outputs --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-11 15:51:18 -08:00
Shailendra Mishra	c7d74eb7a3	Oraclevs integration (#29723 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" community: langchain_community/vectorstore/oraclevs.py - [ ] PR message: *Delete this entire checklist* and replace with - Description: Refactored code to allow a connection or a connection pool. - Issue: Normally an idel connection is terminated by the server side listener at timeout. A user thus has to re-instantiate the vector store. The timeout in case of connection is not configurable. The solution is to use a connection pool where a user can specify a user defined timeout and the connections are managed by the pool. - Dependencies: None - Twitter handle: - [ ] Add tests and docs: This is not a new integration. A user can pass either a connection or a connection pool. The determination of what is passed is made at run time. Everything should work as before. - [ ] Lint and test: Already done. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-11 14:56:55 -08:00
zzaebok	3b3d52206f	community: change wikidata rest api version from v0 to v1 (#29708 ) Description: According to the [wikidata documentation](https://www.wikidata.org/wiki/Wikidata_talk:REST_API), Wikibase REST API version 1 (stable) is released from November 11, 2024. Their guide is to use the new v1 API and, it just requires replacing v0 in the routes with v1 in almost all cases. So I replaced WIKIDATA_REST_API_URL from v0 to v1 for stable usage. Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-10 17:12:38 -08:00
Bhav Sardana	624216aa64	community:Fix for Pydantic model validator of GoogleApiYoutubeLoader (#29694 ) - Description: Community: bugfix for pedantic model validator for GoogleApiYoutubeLoader - Issue: #29165, #27432 Fix is similar to #29346	2025-02-10 08:57:58 -05:00
Changyong Um	60740c44c5	community: Add configurable text key for indexing and the retriever in Pinecone Hybrid Search (#29697 ) issue In Langchain, the original content is generally stored under the `text` key. However, the `PineconeHybridSearchRetriever` searches the `context` field in the metadata and cannot change this key. To address this, I have modified the code to allow changing the key to something other than context. In my opinion, following Langchain's conventions, the `text` key seems more appropriate than `context`. However, since I wasn't sure about the author's intent, I have left the default value as `context`.	2025-02-10 08:56:37 -05:00
manukychen	3de445d521	using getattr and default value to prevent 'OpenSearchVectorSearch' has no attribute 'bulk_size' (#29682 ) - Description: Adding getattr methods and set default value 500 to cls.bulk_size, it can prevent the error below: Error: type object 'OpenSearchVectorSearch' has no attribute 'bulk_size' - Issue: https://github.com/langchain-ai/langchain/issues/29071	2025-02-08 14:39:57 -05:00
Philippe PRADOS	beb75b2150	community[minor]: 05 - Refactoring PyPDFium2 parser (#29625 ) This is one part of a larger Pull Request (PR) that is too large to be submitted all at once. This specific part focuses on updating the PyPDFium2 parser. For more details, see https://github.com/langchain-ai/langchain/pull/28970.	2025-02-07 21:31:12 -05:00
Christophe Bornet	723031d548	community: Bump ruff version to 0.9 (#29206 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 01:21:10 +00:00
Christophe Bornet	30f6c9f5c8	community: Use Blockbuster to detect blocking calls in asyncio during tests (#29609 ) Same as https://github.com/langchain-ai/langchain/pull/29043 for langchain-community. Dependencies: - blockbuster (test) Twitter handle: cbornet_ Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 01:10:39 +00:00
Marlene	4fa3ef0d55	Community/Partner: Adding Azure community and partner user agent to better track usage in Python (#29561 ) - This pull request includes various changes to add a `user_agent` parameter to Azure OpenAI, Azure Search and Whisper in the Community and Partner packages. This helps in identifying the source of API requests so we can better track usage and help support the community better. I will also be adding the user_agent to the new `langchain-azure` repo as well. - No issue connected or updated dependencies. - Utilises existing tests and docs --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 23:28:30 +00:00
Ikko Eltociear Ashimine	0d45ad57c1	community: update base_o365.py (#29657 ) extention -> extension	2025-02-07 08:43:29 -05:00
Sunish Sheth	25ce1e211a	docs: Updating the imports for langchain-databricks to databricks-langchain (#29646 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2025-02-06 13:28:07 -08:00
Vincent Emonet	08b9eaaa6f	community: improve FastEmbedEmbeddings support for ONNX execution provider (e.g. GPU) (#29645 ) I made a change to how was implemented the support for GPU in `FastEmbedEmbeddings` to be more consistent with the existing implementation `langchain-qdrant` sparse embeddings implementation It is directly enabling to provide the list of ONNX execution providers: https://github.com/langchain-ai/langchain/blob/master/libs/partners/qdrant/langchain_qdrant/fastembed_sparse.py#L15 It is a bit less clear to a user that just wants to enable GPU, but gives more capabilities to work with other execution providers that are not the `CUDAExecutionProvider`, and is more future proof Sorry for the disturbance @ccurme > Nice to see you just moved to `uv`! It is so much nicer to run format/lint/test! No need to manually rerun the `poetry install` with all required extras now	2025-02-06 15:31:23 -05:00
ccurme	d172984c91	infra: migrate to uv (#29566 )	2025-02-06 13:36:26 -05:00
Vincent Emonet	db8201d4da	community: fix typo in the module imported when using GPU with FastEmbedEmbeddings (#29631 ) Made a mistake in the module to import (the module stay the same only the installed package changes), fixed it and tested it https://github.com/langchain-ai/langchain/pull/29627 @ccurme	2025-02-06 10:26:08 -05:00
Mohammed Abbadi	f8fd65dea2	community: Update deeplake.py (#29633 ) Deep Lake recently released version 4, which introduces significant architectural changes, including a new on-disk storage format, enhanced indexing mechanisms, and improved concurrency. However, LangChain's vector store integration currently does not support Deep Lake v4 due to breaking API changes. Previously, the installation command was: `pip install deeplake[enterprise]` This installs the latest available version, which now defaults to Deep Lake v4. Since LangChain's vector store integration is still dependent on v3, this can lead to compatibility issues when using Deep Lake as a vector database within LangChain. To ensure compatibility, the installation command has been updated to: `pip install deeplake[enterprise]<4.0.0` This constraint ensures that pip installs the latest available version of Deep Lake within the v3 series while avoiding the incompatible v4 update.	2025-02-06 10:25:13 -05:00
Vincent Emonet	0ac5536f04	community: add support for using GPUs with FastEmbedEmbeddings (#29627 ) - Description: add a `gpu: bool = False` field to the `FastEmbedEmbeddings` class which enables to use GPU (through ONNX CUDA provider) when generating embeddings with any fastembed model. It just requires the user to install a different dependency and we use a different provider when instantiating `fastembed.TextEmbedding` - Issue: when generating embeddings for a really large amount of documents this drastically increase performance (honestly that is a must have in some situations, you can't just use CPU it is way too slow) - Dependencies: no direct change to dependencies, but internally the users will need to install `fastembed-gpu` instead of `fastembed`, I made all the changes to the init function to properly let the user know which dependency they should install depending on if they enabled `gpu` or not cf. fastembed docs about GPU for more details: https://qdrant.github.io/fastembed/examples/FastEmbed_GPU/ I did not added test because it would require access to a GPU in the testing environment	2025-02-06 08:04:19 -05:00
Dmitrii Rashchenko	0ceda557aa	add o1 and o3-mini to pricing (#29628 ) ### PR Title: community: add latest OpenAI models pricing ### Description: This PR updates the OpenAI model cost calculation mapping by adding the latest OpenAI models, o1 (non-preview) and o3-mini, based on the pricing listed on the [OpenAI pricing page](https://platform.openai.com/docs/pricing). ### Changes: - Added pricing for `o1`, `o1-2024-12-17`, `o1-cached`, and `o1-2024-12-17-cached` for input tokens. - Added pricing for `o1-completion` and `o1-2024-12-17-completion` for output tokens. - Added pricing for `o3-mini`, `o3-mini-2025-01-31`, `o3-mini-cached`, and `o3-mini-2025-01-31-cached` for input tokens. - Added pricing for `o3-mini-completion` and `o3-mini-2025-01-31-completion` for output tokens. ### Issue: N/A ### Dependencies: None ### Testing & Validation: - No functional changes outside of updating the cost mapping. - No tests were added or modified.	2025-02-06 08:02:20 -05:00
Mohammad Anash	f849305a56	fixed Bug in PreFilter of AzureCosmosDBNoSqlVectorSearch (#29613 ) Description: Fixes PreFilter value handling in Azure Cosmos DB NoSQL vectorstore. The current implementation fails to handle numeric values in filter conditions, causing an undefined value variable error. This PR adds support for numeric, boolean, and NULL values while maintaining the existing string and list handling. Changes: Added handling for numeric types (int/float) Added boolean value support Added NULL value handling Added type validation for unsupported values Fixed scope of value variable initialization Issue: Fixes #29610 Implementation Notes: No changes to public API Backwards compatible Maintains consistent behavior with existing MongoDB-style filtering Preserves SQL injection prevention through proper value handling --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-06 02:20:26 +00:00
Philippe PRADOS	6ff0d5c807	community[minor]: 04 - Refactoring PDFMiner parser (#29526 ) This is one part of a larger Pull Request (PR) that is too large to be submitted all at once. This specific part focuses on updating the XXX parser. For more details, see [PR 28970](https://github.com/langchain-ai/langchain/pull/28970). --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-02-05 21:08:27 -05:00
Philippe PRADOS	5771e561fb	[Bugfix langchain_community] Fix PyMuPDFLoader (#29550 ) - Description: add legacy properties - Issue: #29470 - Twitter handle: pprados	2025-02-04 09:24:40 -05:00
Ashutosh Kumar	65b404a2d1	[oci_generative_ai] Option to pass auth_file_location (#29481 ) PR title: "community: Option to pass auth_file_location for oci_generative_ai" Description: Option to pass auth_file_location, to overwrite config file default location "~/.oci/config" where profile name configs present. This is not fixing any issues. Just added optional parameter called "auth_file_location", which internally supported by any OCI client including GenerativeAiInferenceClient.	2025-02-03 21:44:13 -05:00
Hemant Rawat	db1693aa70	community: fix issue #29429 in age_graph.py (#29506 ) ## Description: This PR addresses issue #29429 by fixing the _wrap_query method in langchain_community/graphs/age_graph.py. The method now correctly handles Cypher queries with UNION and EXCEPT operators, ensuring that the fields in the SQL query are ordered as they appear in the Cypher query. Additionally, the method now properly handles cases where RETURN * is not supported. ### Issue: #29429 ### Dependencies: None ### Add tests and docs: Added unit tests in tests/unit_tests/graphs/test_age_graph.py to validate the changes. No new integrations were added, so no example notebook is necessary. Lint and test: Ran make format, make lint, and make test to ensure code quality and functionality.	2025-02-01 21:24:45 -05:00
Philippe PRADOS	ceda8bc050	community[minor]: 03 - Refactoring PyPDF parser (#29330 ) This is one part of a larger Pull Request (PR) that is too large to be submitted all at once. This specific part focuses on updating the PyPDF parser. For more details, see [PR 28970](https://github.com/langchain-ai/langchain/pull/28970).	2025-01-31 10:05:07 -05:00
Julian Castro Pulgarin	b7e3e337b1	community: Fix YahooFinanceNewsTool to handle updated yfinance data structure (#29498 ) Description:* Updates the YahooFinanceNewsTool to handle the current yfinance news data structure. The tool was failing with a KeyError due to changes in the yfinance API's response format. This PR updates the code to correctly extract news URLs from the new structure. Issue: #29495 Dependencies: No new dependencies required. Works with existing yfinance package. The changes maintain backwards compatibility while fixing the KeyError that users were experiencing. The modified code properly handles the new data structure where: - News type is now at `content.contentType` - News URL is now at `content.canonicalUrl.url` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-01-31 02:31:44 +00:00
Mohammad Anash	12bcc85927	added operator filter for supabase (#29475 ) Description This PR adds support for MongoDB-style $in operator filtering in the Supabase vectorstore implementation. Currently, filtering with $in operators returns no results, even when matching documents exist. This change properly translates MongoDB-style filters to PostgreSQL syntax, enabling efficient multi-document filtering. Changes Modified similarity_search_by_vector_with_relevance_scores to handle MongoDB-style $in operators Added automatic conversion of $in filters to PostgreSQL IN clauses Preserved original vector type handling and numpy array conversion Maintained compatibility with existing postgrest filters Added support for the same filtering in similarity_search_by_vector_returning_embeddings Issue Closes #27932 Implementation Notes No changes to public API or function signatures Backwards compatible - behavior unchanged for non-$in filters More efficient than multiple individual queries for multi-ID searches Preserves all existing functionality including numpy array conversion for vector types Dependencies None Additional Notes The implementation handles proper SQL escaping for filter values Maintains consistent behavior with other vectorstore implementations that support MongoDB-style operators Future extensions could support additional MongoDB-style operators ($gt, $lt, etc.) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-01-29 14:24:18 +00:00
Michael Chin	e120378695	community: Additional AWS deprecations (#29447 ) Added deprecation warnings for a few more classes that weremoved to `langchain-aws` package: - [SageMaker Endpoint LLM](https://python.langchain.com/api_reference/aws/retrievers/langchain_aws.retrievers.bedrock.AmazonKnowledgeBasesRetriever.html) - [Amazon Kendra retriever](https://python.langchain.com/api_reference/aws/retrievers/langchain_aws.retrievers.kendra.AmazonKendraRetriever.html) - [Amazon Bedrock Knowledge Bases retriever](https://python.langchain.com/api_reference/aws/retrievers/langchain_aws.retrievers.bedrock.AmazonKnowledgeBasesRetriever.html)	2025-01-28 09:50:14 -05:00
Adrián Panella	1551d9750c	community(doc_loaders): allow any credential type in AzureAIDocumentI… (#29289 ) allow any credential type in AzureAIDocumentInteligence, not only `api_key`. This allows to use any of the credentials types integrated with AD. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-01-27 20:56:30 +00:00
Jorge Piedrahita Ortiz	3b886cdbb2	libs: add sambanova-lagchain integration package (#29417 ) - Description:: Add sambanova-langchain integration package as suggested in previous PRs --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-01-27 20:34:55 +00:00
Mohammad Anash	aba1fd0bd4	fixed similarity search with score error #29407 (#29413 ) Description: Fix TypeError in AzureSearch similarity_search_with_score by removing search_type from kwargs before passing to underlying requests. This resolves issue #29407 where search_type was being incorrectly passed through to Session.request(). Issue: #29407 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-01-27 20:34:42 +00:00
Teruaki Ishizaki	3fce78994e	community: Fixed the procedure of initializing pad_token_id (#29434 ) - Description: Add to check pad_token_id and eos_token_id of model config. It seems that this is the same bug as the HuggingFace TGI bug. In addition, the source code of libs/partners/huggingface/langchain_huggingface/llms/huggingface_pipeline.py also requires similar changes. - Issue: #29431 - Dependencies: none - Twitter handle: tell14	2025-01-27 14:54:54 -05:00
Loris Alexandre	e4921239a6	community: missing mandatory parameter partition_key for AzureCosmosDBNoSqlVectorSearch (#29382 ) - Description: the `delete` function of AzureCosmosDBNoSqlVectorSearch is using `self._container.delete_item(document_id)` which miss a mandatory parameter `partition_key` We use the class function `delete_document_by_id` to provide a default `partition_key` - Issue: #29372 - Dependencies: None - Twitter handle: None Co-authored-by: Loris Alexandre <loris.alexandre@boursorama.fr>	2025-01-23 10:05:10 -05:00
Terry Tan	ec0ebb76f2	community: fix Google Scholar tool errors (#29371 ) Resolve https://github.com/langchain-ai/langchain/issues/27557	2025-01-23 10:03:01 -05:00
江同学呀	a1e62070d0	community: Fix the problem of error reporting when OCR extracts text from PDF. (#29378 ) - Description: The issue has been fixed where images could not be recognized from ```xObject[obj]["/Filter"]``` (whose value can be either a string or a list of strings) in the ```_extract_images_from_page()``` method. It also resolves the bug where vectorization by Faiss fails due to the failure of image extraction from a PDF containing only images```IndexError: list index out of range```. ![69a60f3f6bd474641b9126d74bb18f7e](https://github.com/user-attachments/assets/dc9e098d-2862-49f7-93b0-00f1056727dc) - Issue: Fix the following issues: [#15227 ](https://github.com/langchain-ai/langchain/issues/15227) [#22892 ](https://github.com/langchain-ai/langchain/issues/22892) [#26652 ](https://github.com/langchain-ai/langchain/issues/26652) [#27153 ](https://github.com/langchain-ai/langchain/issues/27153) Related issues: [#7067 ](https://github.com/langchain-ai/langchain/issues/7067) - Dependencies: None - Twitter handle: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-01-23 15:01:52 +00:00
Tim Mallezie	a13faab6b7	community; allow to set gitlab url in gitlab tool in constrictor (#29380 ) This pr, expands the gitlab url so it can also be set in a constructor, instead of only through env variables. This allows to do something like this. ``` # Create the GitLab API wrapper gitlab_api = GitLabAPIWrapper( gitlab_url=self.gitlab_url, gitlab_personal_access_token=self.gitlab_personal_access_token, gitlab_repository=self.gitlab_repository, gitlab_branch=self.gitlab_branch, gitlab_base_branch=self.gitlab_base_branch, ) ``` Where before you could not set the url in the constructor. Co-authored-by: Tim Mallezie <tim.mallezie@dropsolid.com>	2025-01-23 09:36:27 -05:00

1 2 3 4 5 ...

1783 Commits