langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-07-14 22:27:16 +00:00

Author	SHA1	Message	Date
Chester Curme	9d23627fcd	handle refusals	2025-02-07 11:41:01 -05:00
Chester Curme	33e7d91f1a	refactor	2025-02-07 11:16:45 -05:00
Marc Ammann	5690575f13	openai: Removed tool_calls from completion chunk after other chunks have already been sent. (#29649 ) - Description: Before sending a completion chunk at the end of an OpenAI stream, removing the tool_calls as those have already been sent as chunks. - Issue: - - Dependencies: - - Twitter handle: - @ccurme as mentioned in another PR --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-07 10:15:52 -05:00
Ikko Eltociear Ashimine	0d45ad57c1	community: update base_o365.py (#29657 ) extention -> extension	2025-02-07 08:43:29 -05:00
Vincent Emonet	3645181d0e	qdrant: Add `similarity_search_with_score_by_vector()` function to the `QdrantVectorStore` (#29641 ) Added `similarity_search_with_score_by_vector()` function to the `QdrantVectorStore` class. It is required when we want to query multiple time with the same embeddings. It was present in the now deprecated original `Qdrant` vectorstore implementation, but was absent from the new one. It is also implemented in a number of others `VectorStore` implementations I have added tests for this new function Note that I also argued in this discussion that it should be part of the general `VectorStore` https://github.com/langchain-ai/langchain/discussions/29638 Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 00:55:58 +00:00
ccurme	488cb4a739	anthropic: release 0.3.7 (#29653 )	2025-02-06 17:05:57 -05:00
ccurme	ab09490c20	openai: release 0.3.4 (#29652 )	2025-02-06 17:02:21 -05:00
ccurme	29a0c38cc3	openai[patch]: add test for message.name (#29651 )	2025-02-06 16:49:28 -05:00
ccurme	91cca827c0	tests: release 0.3.11 (#29648 )	2025-02-06 21:48:09 +00:00
Sunish Sheth	25ce1e211a	docs: Updating the imports for langchain-databricks to databricks-langchain (#29646 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2025-02-06 13:28:07 -08:00
ccurme	e1b593ae77	text-splitters[patch]: release 0.3.6 (#29647 )	2025-02-06 16:16:05 -05:00
ccurme	a91e58bc10	core: release 0.3.34 (#29644 )	2025-02-06 15:53:56 -05:00
Vincent Emonet	08b9eaaa6f	community: improve FastEmbedEmbeddings support for ONNX execution provider (e.g. GPU) (#29645 ) I made a change to how was implemented the support for GPU in `FastEmbedEmbeddings` to be more consistent with the existing implementation `langchain-qdrant` sparse embeddings implementation It is directly enabling to provide the list of ONNX execution providers: https://github.com/langchain-ai/langchain/blob/master/libs/partners/qdrant/langchain_qdrant/fastembed_sparse.py#L15 It is a bit less clear to a user that just wants to enable GPU, but gives more capabilities to work with other execution providers that are not the `CUDAExecutionProvider`, and is more future proof Sorry for the disturbance @ccurme > Nice to see you just moved to `uv`! It is so much nicer to run format/lint/test! No need to manually rerun the `poetry install` with all required extras now	2025-02-06 15:31:23 -05:00
ccurme	3450bfc806	infra: add UV_FROZEN to makefiles (#29642 ) These are set in Github workflows, but forgot to add them to most makefiles for convenience when developing locally. `uv run` will automatically sync the lock file. Because many of our development dependencies are local installs, it will pick up version changes and update the lock file. Passing `--frozen` or setting this environment variable disables the behavior.	2025-02-06 14:36:54 -05:00
ccurme	d172984c91	infra: migrate to uv (#29566 )	2025-02-06 13:36:26 -05:00
ccurme	9da06e6e94	standard-tests[patch]: use `has_structured_output` property to engage structured output tests (#29635 ) Motivation: dedicated structured output features are becoming more common, such that integrations can support structured output without supporting tool calling. Here we make two changes: 1. Update the `has_structured_output` method to default to True if a model supports tool calling (in addition to defaulting to True if `with_structured_output` is overridden). 2. Update structured output tests to engage if `has_structured_output` is True.	2025-02-06 10:09:06 -08:00
Vincent Emonet	db8201d4da	community: fix typo in the module imported when using GPU with FastEmbedEmbeddings (#29631 ) Made a mistake in the module to import (the module stay the same only the installed package changes), fixed it and tested it https://github.com/langchain-ai/langchain/pull/29627 @ccurme	2025-02-06 10:26:08 -05:00
Mohammed Abbadi	f8fd65dea2	community: Update deeplake.py (#29633 ) Deep Lake recently released version 4, which introduces significant architectural changes, including a new on-disk storage format, enhanced indexing mechanisms, and improved concurrency. However, LangChain's vector store integration currently does not support Deep Lake v4 due to breaking API changes. Previously, the installation command was: `pip install deeplake[enterprise]` This installs the latest available version, which now defaults to Deep Lake v4. Since LangChain's vector store integration is still dependent on v3, this can lead to compatibility issues when using Deep Lake as a vector database within LangChain. To ensure compatibility, the installation command has been updated to: `pip install deeplake[enterprise]<4.0.0` This constraint ensures that pip installs the latest available version of Deep Lake within the v3 series while avoiding the incompatible v4 update.	2025-02-06 10:25:13 -05:00
Vincent Emonet	0ac5536f04	community: add support for using GPUs with FastEmbedEmbeddings (#29627 ) - Description: add a `gpu: bool = False` field to the `FastEmbedEmbeddings` class which enables to use GPU (through ONNX CUDA provider) when generating embeddings with any fastembed model. It just requires the user to install a different dependency and we use a different provider when instantiating `fastembed.TextEmbedding` - Issue: when generating embeddings for a really large amount of documents this drastically increase performance (honestly that is a must have in some situations, you can't just use CPU it is way too slow) - Dependencies: no direct change to dependencies, but internally the users will need to install `fastembed-gpu` instead of `fastembed`, I made all the changes to the init function to properly let the user know which dependency they should install depending on if they enabled `gpu` or not cf. fastembed docs about GPU for more details: https://qdrant.github.io/fastembed/examples/FastEmbed_GPU/ I did not added test because it would require access to a GPU in the testing environment	2025-02-06 08:04:19 -05:00
Dmitrii Rashchenko	0ceda557aa	add o1 and o3-mini to pricing (#29628 ) ### PR Title: community: add latest OpenAI models pricing ### Description: This PR updates the OpenAI model cost calculation mapping by adding the latest OpenAI models, o1 (non-preview) and o3-mini, based on the pricing listed on the [OpenAI pricing page](https://platform.openai.com/docs/pricing). ### Changes: - Added pricing for `o1`, `o1-2024-12-17`, `o1-cached`, and `o1-2024-12-17-cached` for input tokens. - Added pricing for `o1-completion` and `o1-2024-12-17-completion` for output tokens. - Added pricing for `o3-mini`, `o3-mini-2025-01-31`, `o3-mini-cached`, and `o3-mini-2025-01-31-cached` for input tokens. - Added pricing for `o3-mini-completion` and `o3-mini-2025-01-31-completion` for output tokens. ### Issue: N/A ### Dependencies: None ### Testing & Validation: - No functional changes outside of updating the cost mapping. - No tests were added or modified.	2025-02-06 08:02:20 -05:00
ZhangShenao	ac53977dbc	[MistralAI] Improve MistralAIEmbeddings (#29242 ) - Add static method decorator for method. - Add expected exception for retry decorator #29125	2025-02-05 21:31:54 -05:00
Andrew Wason	22aa5e07ed	standard-tests: Fix ToolsIntegrationTests to correctly handle "content_and_artifact" tools (#29391 ) Description: The response from `tool.invoke()` is always a ToolMessage, with content and artifact fields, not a tuple. The tuple is converted to a ToolMessage here `b6ae7ca91d/libs/core/langchain_core/tools/base.py (L726)` Issue: Currently `ToolsIntegrationTests` requires `invoke()` to return a tuple and so standard tests fail for "content_and_artifact" tools. This fixes that to check the returned ToolMessage. This PR also adds a test that now passes.	2025-02-05 21:27:09 -05:00
Mohammad Anash	f849305a56	fixed Bug in PreFilter of AzureCosmosDBNoSqlVectorSearch (#29613 ) Description: Fixes PreFilter value handling in Azure Cosmos DB NoSQL vectorstore. The current implementation fails to handle numeric values in filter conditions, causing an undefined value variable error. This PR adds support for numeric, boolean, and NULL values while maintaining the existing string and list handling. Changes: Added handling for numeric types (int/float) Added boolean value support Added NULL value handling Added type validation for unsupported values Fixed scope of value variable initialization Issue: Fixes #29610 Implementation Notes: No changes to public API Backwards compatible Maintains consistent behavior with existing MongoDB-style filtering Preserves SQL injection prevention through proper value handling --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-06 02:20:26 +00:00
Philippe PRADOS	6ff0d5c807	community[minor]: 04 - Refactoring PDFMiner parser (#29526 ) This is one part of a larger Pull Request (PR) that is too large to be submitted all at once. This specific part focuses on updating the XXX parser. For more details, see [PR 28970](https://github.com/langchain-ai/langchain/pull/28970). --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-02-05 21:08:27 -05:00
Isaac Francisco	91ffd7caad	core: allow passing message dicts into ChatPromptTemplate (#29363 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-05 09:45:52 -08:00
ccurme	69595b0914	docs: fix builds (#29607 ) Failing with: > ValueError: Provider page not found for databricks-langchain. Please add one at docs/integrations/providers/databricks-langchain.{mdx,ipynb}	2025-02-05 14:24:53 +00:00
ccurme	91a33a9211	anthropic[patch]: release 0.3.6 (#29606 )	2025-02-05 14:18:02 +00:00
ccurme	5cbe6aba8f	anthropic[patch]: support citations in streaming (#29591 )	2025-02-05 09:12:07 -05:00
William FH	5ae4ed791d	Drop duplicate inputs (#29589 )	2025-02-04 18:06:10 -08:00
Erick Friis	65f0deb81a	packages: databricks-langchain (#29593 )	2025-02-05 01:53:34 +00:00
Yoav Levy	621bba7e26	docs: add nimble as a provider (#29579 ) ## Description: - Add docs/docs/providers/nimbleway.ipynb - Add docs/docs/integrations/retrievers/nimbleway.ipynb - Register nimbleway in libs/packages.yml - X (twitter) handle: @urielkn / @LevyNorbit8	2025-02-04 16:47:03 -08:00
Erick Friis	50d61eafa2	partners/deepseek: release 0.1.1 (#29592 )	2025-02-04 23:46:38 +00:00
Erick Friis	7edfcbb090	docs: rename to langchain-deepseek in docs (#29587 )	2025-02-04 14:22:17 -08:00
Erick Friis	df8fa882b2	deepseek: bump core (#29584 )	2025-02-04 10:25:46 -08:00
Erick Friis	455f65947a	deepseek: rename to langchain-deepseek from langchain-deepseek-official (#29583 )	2025-02-04 17:57:25 +00:00
Philippe PRADOS	5771e561fb	[Bugfix langchain_community] Fix PyMuPDFLoader (#29550 ) - Description: add legacy properties - Issue: #29470 - Twitter handle: pprados	2025-02-04 09:24:40 -05:00
Ashutosh Kumar	65b404a2d1	[oci_generative_ai] Option to pass auth_file_location (#29481 ) PR title: "community: Option to pass auth_file_location for oci_generative_ai" Description: Option to pass auth_file_location, to overwrite config file default location "~/.oci/config" where profile name configs present. This is not fixing any issues. Just added optional parameter called "auth_file_location", which internally supported by any OCI client including GenerativeAiInferenceClient.	2025-02-03 21:44:13 -05:00
Teruaki Ishizaki	aeb42dc900	partners: Fixed the procedure of initializing pad_token_id (#29500 ) - Description: Add to check pad_token_id and eos_token_id of model config. It seems that this is the same bug as the HuggingFace TGI bug. It's same bug as #29434 - Issue: #29431 - Dependencies: none - Twitter handle: tell14 Example code is followings: ```python from langchain_huggingface.llms import HuggingFacePipeline hf = HuggingFacePipeline.from_model_id( model_id="meta-llama/Llama-3.2-3B-Instruct", task="text-generation", pipeline_kwargs={"max_new_tokens": 10}, ) from langchain_core.prompts import PromptTemplate template = """Question: {question} Answer: Let's think step by step.""" prompt = PromptTemplate.from_template(template) chain = prompt \| hf question = "What is electroencephalography?" print(chain.invoke({"question": question})) ```	2025-02-03 21:40:33 -05:00
AmirPoursaberi	a6efd22ba1	Fix a tiny typo in `create_retrieval_chain` docstring (#29552 ) Hi there! To fix a tiny typo in `create_retrieval_chain` docstring.	2025-02-03 10:54:49 -05:00
Hemant Rawat	db1693aa70	community: fix issue #29429 in age_graph.py (#29506 ) ## Description: This PR addresses issue #29429 by fixing the _wrap_query method in langchain_community/graphs/age_graph.py. The method now correctly handles Cypher queries with UNION and EXCEPT operators, ensuring that the fields in the SQL query are ordered as they appear in the Cypher query. Additionally, the method now properly handles cases where RETURN * is not supported. ### Issue: #29429 ### Dependencies: None ### Add tests and docs: Added unit tests in tests/unit_tests/graphs/test_age_graph.py to validate the changes. No new integrations were added, so no example notebook is necessary. Lint and test: Ran make format, make lint, and make test to ensure code quality and functionality.	2025-02-01 21:24:45 -05:00
Keenan Pepper	2f97916dea	docs: Add goodfire notebook and add to packages.yml (#29512 ) - Description: Add Goodfire ipynb notebook and add langchain-goodfire package to packages.yml - Issue: n/a - Dependencies: docs only - Twitter handle: keenanpepper --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-01 19:43:20 -05:00
ccurme	a3c5e4d070	deepseek[patch]: bump langchain-openai and add to scheduled testing (#29535 )	2025-02-01 18:40:59 -05:00
ccurme	16a422f3fa	community: add standard tests for Perplexity (#29534 )	2025-02-01 17:02:57 -05:00
Amit Ghadge	0c405245c4	[Integrations][Tool] Added Jenkins tools support (#29516 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-01-31 12:50:10 -05:00
Christophe Bornet	aab2e42169	core[patch]: Use Blockbuster to detect blocking calls in asyncio during tests (#29043 ) This PR uses the [blockbuster](https://github.com/cbornet/blockbuster) library in langchain-core to detect blocking calls made in the asyncio event loop during unit tests. Avoiding blocking calls is hard as these can be deeply buried in the code or made in 3rd party libraries. Blockbuster makes it easier to detect them by raising an exception when a call is made to a known blocking function (eg: `time.sleep`). Adding blockbuster allowed to find a blocking call in `aconfig_with_context` (it ends up calling `get_function_nonlocals` which loads function code). Dependencies: - blockbuster (test) Twitter handle: cbornet_	2025-01-31 10:06:34 -05:00
Philippe PRADOS	ceda8bc050	community[minor]: 03 - Refactoring PyPDF parser (#29330 ) This is one part of a larger Pull Request (PR) that is too large to be submitted all at once. This specific part focuses on updating the PyPDF parser. For more details, see [PR 28970](https://github.com/langchain-ai/langchain/pull/28970).	2025-01-31 10:05:07 -05:00
Julian Castro Pulgarin	b7e3e337b1	community: Fix YahooFinanceNewsTool to handle updated yfinance data structure (#29498 ) Description:* Updates the YahooFinanceNewsTool to handle the current yfinance news data structure. The tool was failing with a KeyError due to changes in the yfinance API's response format. This PR updates the code to correctly extract news URLs from the new structure. Issue: #29495 Dependencies: No new dependencies required. Works with existing yfinance package. The changes maintain backwards compatibility while fixing the KeyError that users were experiencing. The modified code properly handles the new data structure where: - News type is now at `content.contentType` - News URL is now at `content.canonicalUrl.url` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-01-31 02:31:44 +00:00
Erick Friis	332e303858	partners/mistralai: release 0.2.6 (#29491 )	2025-01-29 22:23:14 +00:00
Erick Friis	2c795f5628	partners/openai: release 0.3.3 (#29490 )	2025-01-29 22:23:03 +00:00
Erick Friis	f307b3cc5f	langchain: release 0.3.17 (#29485 )	2025-01-29 22:22:49 +00:00

1 2 3 4 5 ...

6476 Commits