langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-07 12:06:43 +00:00

Author	SHA1	Message	Date
Nuno Campos	fe59f2cc88	core: Fix output of convert_messages when called with BaseMessage.model_dump() (#29763 ) - additional_kwargs was being nested twice - example, response_metadata was placed inside additional_kwargs	2025-02-12 10:05:33 -08:00
Jacob Lee	f4e3e86fbb	feat(langchain): Infer o3 modelstrings passed to init_chat_model as OpenAI (#29743 )	2025-02-11 16:51:41 -08:00
Mohammad Mohtashim	9f3bcee30a	(Community): Adding Structured Support for ChatPerplexity (#29361 ) - Description: Adding Structured Support for ChatPerplexity - Issue: #29357 - This is implemented as per the Perplexity official docs: https://docs.perplexity.ai/guides/structured-outputs --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-11 15:51:18 -08:00
Jawahar S	994c5465e0	feat: add support for IBM WatsonX AI chat models (#29688 ) Description: Updated init_chat_model to support Granite models deployed on IBM WatsonX Dependencies: [langchain-ibm](https://github.com/langchain-ai/langchain-ibm) Tagging @baskaryan @efriis for review when you get a chance.	2025-02-11 15:34:29 -08:00
Shailendra Mishra	c7d74eb7a3	Oraclevs integration (#29723 ) community: langchain_community/vectorstore/oraclevs.py - Description: Refactored code to allow a connection or a connection pool. - Issue: Normally an idle connection is terminated by the server-side listener at timeout, so a user has to re-instantiate the vector store. The timeout for a single connection is not configurable. The solution is to use a connection pool, where a user can specify a timeout and the connections are managed by the pool. - Dependencies: None - Tests and docs: This is not a new integration. A user can pass either a connection or a connection pool; which one was passed is determined at run time. Everything should work as before. - Lint and test: Already done. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-11 14:56:55 -08:00
ccurme	42ebf6ae0c	deepseek[patch]: release 0.1.2 (#29742 )	2025-02-11 11:53:43 -08:00
ccurme	ec55553807	pinecone[patch]: release 0.2.3 (#29741 )	2025-02-11 19:27:39 +00:00
ccurme	001cf99253	pinecone[patch]: add support for python 3.13 (#29737 )	2025-02-11 11:20:21 -08:00
ccurme	ba8f752bf5	openai[patch]: release 0.3.5 (#29740 )	2025-02-11 19:20:11 +00:00
ccurme	9477f49409	openai, deepseek: make _convert_chunk_to_generation_chunk an instance method (#29731 ) 1. Make `_convert_chunk_to_generation_chunk` an instance method on BaseChatOpenAI 2. Override on ChatDeepSeek to add `"reasoning_content"` to message additional_kwargs. Resolves https://github.com/langchain-ai/langchain/issues/29513	2025-02-11 11:13:23 -08:00
Christopher Menon	1edd27d860	docs: fix SQL-based metadata filter syntax, add link to BigQuery docs (#29736 ) Fix the syntax for SQL-based metadata filtering in the [Google BigQuery Vector Search docs](https://python.langchain.com/docs/integrations/vectorstores/google_bigquery_vector_search/#searching-documents-with-metadata-filters). Also add a link to learn more about BigQuery operators that can be used here. I have been using this library, and have found that this is the correct syntax to use for the SQL-based filters. Issue: no open issue. Dependencies: none. Twitter handle: none. No tests as this is only a change to the documentation.	2025-02-11 11:10:12 -08:00
ccurme	d0c2dc06d5	mongodb[patch]: fix link in readme (#29738 )	2025-02-11 18:19:59 +00:00
zzaebok	3b3d52206f	community: change wikidata rest api version from v0 to v1 (#29708 ) Description: According to the [wikidata documentation](https://www.wikidata.org/wiki/Wikidata_talk:REST_API), Wikibase REST API version 1 (stable) has been available since November 11, 2024. Their guidance is to use the new v1 API, which in almost all cases just requires replacing v0 in the routes with v1. So I changed WIKIDATA_REST_API_URL from v0 to v1 for stable usage. Co-authored-by: ccurme <chester.curme@gmail.com>	2025-02-10 17:12:38 -08:00
ccurme	4a389ef4c6	community: fix extended testing (#29715 ) v0.3.100 of premai sdk appears to break on import: `89d9276cbf/premai/api/__init__.py (L230)`	2025-02-10 16:57:34 -08:00
Yoav Levy	af3f759073	docs: fixed nimble's provider page and retriever (#29695 ) ## Description: - Added information about the retriever that Nimble's provider exposes. - Fixed the authentication explanation on the retriever page.	2025-02-10 15:30:40 -08:00
Bhav Sardana	624216aa64	community:Fix for Pydantic model validator of GoogleApiYoutubeLoader (#29694 ) - Description: Community: bugfix for Pydantic model validator for GoogleApiYoutubeLoader - Issue: #29165, #27432 Fix is similar to #29346	2025-02-10 08:57:58 -05:00
Changyong Um	60740c44c5	community: Add configurable text key for indexing and the retriever in Pinecone Hybrid Search (#29697 ) issue In Langchain, the original content is generally stored under the `text` key. However, the `PineconeHybridSearchRetriever` searches the `context` field in the metadata and cannot change this key. To address this, I have modified the code to allow changing the key to something other than context. In my opinion, following Langchain's conventions, the `text` key seems more appropriate than `context`. However, since I wasn't sure about the author's intent, I have left the default value as `context`.	2025-02-10 08:56:37 -05:00
Jun He	894b0cac3c	docs: Remove redundant line (#29698 ) If I understand it correctly, chain1 is never used.	2025-02-10 08:53:21 -05:00
Tiest van Gool	6655246504	Classification Tutorial: Replaced .dict() with .model_dump() method (#29701 ) The .dict() method is deprecated in Pydantic V2.0; use the `model_dump` method instead.	2025-02-10 08:38:15 -05:00
Edmond Wang	c36e6d4371	docs: Add Comments and Supplementary Example Code to Vearch Vector Dat… (#29706 ) - Description: Added some comments to the example code in the Vearch vector database documentation and included commonly used sample code. - Issue: None - Dependencies: None --------- Co-authored-by: wangchuxiong <wangchuxiong@jd.com>	2025-02-10 08:35:38 -05:00
Akmal Ali Jasmin	bc5fafa20e	[DOC] Fix #29685 : HuggingFaceEndpoint missing task argument in documentation (#29686 ) ## Description This PR updates the LangChain documentation to address an issue where the `HuggingFaceEndpoint` example does not specify the required `task` argument. Without this argument, users on `huggingface_hub == 0.28.1` encounter the following error: ``` ValueError: Task unknown has no recommended model. Please specify a model explicitly. ``` --- ## Issue Fixes #29685 --- ## Changes Made ✅ Updated `HuggingFaceEndpoint` documentation to explicitly define `task="text-generation"`: ```python llm = HuggingFaceEndpoint( repo_id=GEN_MODEL_ID, huggingfacehub_api_token=HF_TOKEN, task="text-generation" # Explicitly specify task ) ``` ✅ Added a deprecation warning note and recommended using `InferenceClient`: ```python from huggingface_hub import InferenceClient from langchain.llms.huggingface_hub import HuggingFaceHub client = InferenceClient(model=GEN_MODEL_ID, token=HF_TOKEN) llm = HuggingFaceHub( repo_id=GEN_MODEL_ID, huggingfacehub_api_token=HF_TOKEN, client=client, ) ``` --- ## Dependencies - No new dependencies introduced. - Change only affects documentation. --- ## Testing - ✅ Verified that adding `task="text-generation"` resolves the issue. - ✅ Tested the alternative approach with `InferenceClient` in Google Colab. --- ## Twitter Handle (Optional) If this PR gets announced, a shout-out to @AkmalJasmin would be great! 🚀 --- ## Reviewers 📌 @langchain-maintainers Please review this PR. Let me know if further changes are needed. 🚀 This fix improves developer onboarding and ensures the LangChain documentation remains up to date! 🚀	2025-02-08 14:41:02 -05:00
manukychen	3de445d521	using getattr and default value to prevent 'OpenSearchVectorSearch' has no attribute 'bulk_size' (#29682 ) - Description: Use getattr with a default value of 500 for cls.bulk_size, which prevents the error below: Error: type object 'OpenSearchVectorSearch' has no attribute 'bulk_size' - Issue: https://github.com/langchain-ai/langchain/issues/29071	2025-02-08 14:39:57 -05:00
Yao Tianjia	5d581ba22c	langchain: support the situation when action_input is null in json output_parser (#29680 ) Description: This PR fixes handling of null action_input in [langchain.agents.output_parser]. Previously, passing null to action_input could raise an OutputParserException with an unclear error message, leaving the LLM unable to tell how to correct the action. The changes include: - Added null-check validation before processing action_input - Implemented proper fallback behavior with default values - Maintained backward compatibility with existing implementations Error Examples: ``` { "action":"some action", "action_input":null } ``` Issue: None Dependencies: None	2025-02-07 22:01:01 -05:00
Philippe PRADOS	beb75b2150	community[minor]: 05 - Refactoring PyPDFium2 parser (#29625 ) This is one part of a larger Pull Request (PR) that is too large to be submitted all at once. This specific part focuses on updating the PyPDFium2 parser. For more details, see https://github.com/langchain-ai/langchain/pull/28970.	2025-02-07 21:31:12 -05:00
Christophe Bornet	723031d548	community: Bump ruff version to 0.9 (#29206 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 01:21:10 +00:00
Christophe Bornet	30f6c9f5c8	community: Use Blockbuster to detect blocking calls in asyncio during tests (#29609 ) Same as https://github.com/langchain-ai/langchain/pull/29043 for langchain-community. Dependencies: - blockbuster (test) Twitter handle: cbornet_ Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 01:10:39 +00:00
Christophe Bornet	3a57a28daa	langchain: Use Blockbuster to detect blocking calls in asyncio during tests (#29616 ) Same as https://github.com/langchain-ai/langchain/pull/29043 for the langchain package. Dependencies: - blockbuster (test) Twitter handle: cbornet_ --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 01:08:15 +00:00
Keenan Pepper	c67d473397	core: Make abatch_as_completed respect max_concurrency (#29426 ) - Description: Add tests for respecting max_concurrency and implement it for abatch_as_completed so that test passes - Issue: #29425 - Dependencies: none - Twitter handle: keenanpepper	2025-02-07 16:51:22 -08:00
Aaron V	dcfaae85d2	Core: Fix __add__ for concatting two BaseMessageChunk's (#29531 ) Description: The change allows you to use the overloaded `+` operator correctly when `+`ing two BaseMessageChunk subclasses. Without this you must instantiate a subclass for it to work. Which feels... wrong. Base classes should be decoupled from sub classes and should have in no way a dependency on them. Issue: You can't `+` a BaseMessageChunk with a BaseMessageChunk e.g. this will explode ```py from langchain_core.outputs import ( ChatGenerationChunk, ) from langchain_core.messages import BaseMessageChunk chunk1 = ChatGenerationChunk( message=BaseMessageChunk( type="customChunk", content="HI", ), ) chunk2 = ChatGenerationChunk( message=BaseMessageChunk( type="customChunk", content="HI", ), ) # this will throw new_chunk = chunk1 + chunk2 ``` In case anyone ran into this issue themselves, it's probably best to use the AIMessageChunk: a la ```py from langchain_core.outputs import ( ChatGenerationChunk, ) from langchain_core.messages import AIMessageChunk chunk1 = ChatGenerationChunk( message=AIMessageChunk( content="HI", ), ) chunk2 = ChatGenerationChunk( message=AIMessageChunk( content="HI", ), ) # No explosion! new_chunk = chunk1 + chunk2 ``` Dependencies: None! Twitter handle: `aaron_vogler` Keeping these for later if need be: ``` baskaryan efriis eyurtsev ccurme vbarda hwchase17 baskaryan efriis ``` Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-08 00:43:36 +00:00
Marlene	4fa3ef0d55	Community/Partner: Adding Azure community and partner user agent to better track usage in Python (#29561 ) - This pull request includes various changes to add a `user_agent` parameter to Azure OpenAI, Azure Search and Whisper in the Community and Partner packages. This helps in identifying the source of API requests so we can better track usage and help support the community better. I will also be adding the user_agent to the new `langchain-azure` repo as well. - No issue connected or updated dependencies. - Utilises existing tests and docs --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 23:28:30 +00:00
Ella Charlaix	c401254770	huggingface: Add ipex support to HuggingFaceEmbeddings (#29386 ) ONNX and OpenVINO models are available by specifying the `backend` argument (the model is loaded using `optimum` https://github.com/huggingface/optimum) ```python from langchain_huggingface import HuggingFaceEmbeddings embedding = HuggingFaceEmbeddings( model_name=model_id, model_kwargs={"backend": "onnx"}, ) ``` With this PR we also enable the IPEX backend ```python from langchain_huggingface import HuggingFaceEmbeddings embedding = HuggingFaceEmbeddings( model_name=model_id, model_kwargs={"backend": "ipex"}, ) ```	2025-02-07 15:21:09 -08:00
Bruno Alvisio	3eaf561561	core: Handle unterminated escape character when parsing partial JSON (#29065 ) Description Currently, when parsing a partial JSON, if a string ends with the escape character, the whole key/value is removed. For example: ``` >>> from langchain_core.utils.json import parse_partial_json >>> my_str = '{"foo": "bar", "baz": "qux\\' >>> >>> parse_partial_json(my_str) {'foo': 'bar'} ``` My expectation (and with this fix) would be for `parse_partial_json()` to return: ``` >>> from langchain_core.utils.json import parse_partial_json >>> >>> my_str = '{"foo": "bar", "baz": "qux\\' >>> parse_partial_json(my_str) {'foo': 'bar', 'baz': 'qux'} ``` Notes: 1. It could be argued that current behavior is still desired. 2. I have experienced this issue when the streaming output from an LLM and the chunk happens to end with `\\` 3. I haven't included tests. Will do if change is accepted. 4. This is specially troublesome when this function is used by `187131c55c/libs/core/langchain_core/output_parsers/transform.py (L111)` since what happens is that, for example, if the received sequence of chunks are: `{"foo": "b` , `ar\\` : Then, the result of calling `self.parse_result()` is: ``` {"foo": "b"} ``` and the second time: ``` {} ``` Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 23:18:21 +00:00
ccurme	0040d93b09	docs: showcase extras in chat model tabs (#29677 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 18:16:44 -05:00
Viren	252cf0af10	docs: add LangFair as a provider (#29390 ) Description: - Add `docs/docs/providers/langfair.mdx` - Register langfair in `libs/packages.yml` Twitter handle: @LangFair Tests and docs 1. Integration tests not needed as this PR only adds a .mdx file to docs. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Dylan Bouchard <dylan.bouchard@cvshealth.com> Co-authored-by: Dylan Bouchard <109233938+dylanbouchard@users.noreply.github.com> Co-authored-by: Erick Friis <erickfriis@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 21:27:37 +00:00
Erick Friis	eb9eddae0c	docs: use init_chat_model (#29623 )	2025-02-07 12:39:27 -08:00
ccurme	bff25b552c	community: release 0.3.17 (#29676 )	2025-02-07 19:41:44 +00:00
ccurme	01314c51fa	langchain: release 0.3.18 (#29654 )	2025-02-07 13:40:26 -05:00
ccurme	92e2239414	openai[patch]: make parallel_tool_calls explicit kwarg of bind_tools (#29669 ) Improves discoverability and documentation. cc @vbarda	2025-02-07 13:34:32 -05:00
ccurme	2a243df7bb	infra: add UV_NO_SYNC to monorepo makefile (#29670 ) Helpful for running `api_docs_quick_preview` locally.	2025-02-07 17:17:05 +00:00
Marc Ammann	5690575f13	openai: Removed tool_calls from completion chunk after other chunks have already been sent. (#29649 ) - Description: Before sending a completion chunk at the end of an OpenAI stream, removing the tool_calls as those have already been sent as chunks. - Issue: - - Dependencies: - - Twitter handle: - @ccurme as mentioned in another PR --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-07 10:15:52 -05:00
Ikko Eltociear Ashimine	0d45ad57c1	community: update base_o365.py (#29657 ) extention -> extension	2025-02-07 08:43:29 -05:00
weeix	1b064e198f	docs: Fix llama.cpp GPU Installation in llamacpp.ipynb (Deprecated Env Variable) (#29659 ) - Description: The llamacpp.ipynb notebook used a deprecated environment variable, LLAMA_CUBLAS, for llama.cpp installation with GPU support. This commit updates the notebook to use the correct GGML_CUDA variable, fixing the installation error. - Issue: none - Dependencies: none	2025-02-07 08:43:09 -05:00
Vincent Emonet	3645181d0e	qdrant: Add `similarity_search_with_score_by_vector()` function to the `QdrantVectorStore` (#29641 ) Added `similarity_search_with_score_by_vector()` function to the `QdrantVectorStore` class. It is required when we want to query multiple times with the same embeddings. It was present in the now deprecated original `Qdrant` vectorstore implementation, but was absent from the new one. It is also implemented in a number of other `VectorStore` implementations. I have added tests for this new function. Note that I also argued in this discussion that it should be part of the general `VectorStore`: https://github.com/langchain-ai/langchain/discussions/29638 Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 00:55:58 +00:00
ccurme	488cb4a739	anthropic: release 0.3.7 (#29653 )	2025-02-06 17:05:57 -05:00
ccurme	ab09490c20	openai: release 0.3.4 (#29652 )	2025-02-06 17:02:21 -05:00
ccurme	29a0c38cc3	openai[patch]: add test for message.name (#29651 )	2025-02-06 16:49:28 -05:00
ccurme	91cca827c0	tests: release 0.3.11 (#29648 )	2025-02-06 21:48:09 +00:00
Sunish Sheth	25ce1e211a	docs: Updating the imports for langchain-databricks to databricks-langchain (#29646 )	2025-02-06 13:28:07 -08:00
ccurme	e1b593ae77	text-splitters[patch]: release 0.3.6 (#29647 )	2025-02-06 16:16:05 -05:00
ccurme	a91e58bc10	core: release 0.3.34 (#29644 )	2025-02-06 15:53:56 -05:00
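
Commit c67d473397 in the list above makes `abatch_as_completed` respect `max_concurrency`. As a rough illustration of the general idea only, and not LangChain's actual implementation, the sketch below uses a plain `asyncio.Semaphore` to cap how many coroutines run at once while still collecting results in completion order. The names `bounded_as_completed` and `work` are hypothetical and exist only for this example.

```python
import asyncio


async def bounded_as_completed(func, inputs, max_concurrency):
    """Run func over inputs with at most max_concurrency calls in flight.

    Returns (index, result) pairs in the order the calls finished.
    """
    semaphore = asyncio.Semaphore(max_concurrency)

    async def run_one(index, item):
        # Each call waits here until one of the slots is free.
        async with semaphore:
            return index, await func(item)

    tasks = [asyncio.ensure_future(run_one(i, x)) for i, x in enumerate(inputs)]
    results = []
    for finished in asyncio.as_completed(tasks):
        results.append(await finished)
    return results


async def _demo():
    running = 0
    peak = 0

    async def work(x):
        # Hypothetical workload that records the peak number of concurrent calls.
        nonlocal running, peak
        running += 1
        peak = max(peak, running)
        await asyncio.sleep(0.01)
        running -= 1
        return x * 2

    pairs = await bounded_as_completed(work, range(6), max_concurrency=2)
    return peak, sorted(pairs)


peak, results = asyncio.run(_demo())
print(peak, results)
```

Returning the input index with each result lets a caller match results to inputs even though they arrive out of order.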


12790 Commits