langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-31 10:23:18 +00:00

Author	SHA1	Message	Date
Eric Pinzur	01ded5e2f9	community: add metadata filter to CassandraGraphVectorStore (#25663 ) - Description: - Added metadata filtering support to `langchain_community.graph_vectorstores.cassandra.CassandraGraphVectorStore` - Also fixed type conversion issues highlighted by mypy. - Dependencies: - `ragstack-ai-knowledge-store 0.2.0` (released July 23, 2024) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 14:27:16 -04:00
mschoenb97IL	e499caa9cd	community: Give more context on DeepInfra 500 errors (#25671 ) Description: DeepInfra 500 errors have useful information in the text field that isn't being exposed to the user. I updated the error message to fix this. As an example, this code ``` from langchain_community.chat_models import ChatDeepInfra from langchain_core.messages import HumanMessage model = "meta-llama/Meta-Llama-3-70B-Instruct" deepinfra_api_token = "..." model = ChatDeepInfra(model=model, deepinfra_api_token=deepinfra_api_token) messages = [HumanMessage("All work and no play makes Jack a dull boy\n" * 9000)] response = model.invoke(messages) ``` Currently gives this error: ``` langchain_community.chat_models.deepinfra.ChatDeepInfraException: DeepInfra Server: Error 500 ``` This change would give the following error: ``` langchain_community.chat_models.deepinfra.ChatDeepInfraException: DeepInfra Server error status 500: {"error":{"message":"Requested input length 99009 exceeds maximum input length 8192"}} ```	2024-08-22 10:10:51 -07:00
Rajendra Kadam	4ff2f4499e	community: Refactor PebbloRetrievalQA (#25583 ) Refactor PebbloRetrievalQA - Created `APIWrapper` and moved API logic into it. - Created smaller functions/methods for better readability. - Properly read environment variables. - Removed unused code. - Updated models Issue: NA Dependencies: NA tests: NA	2024-08-22 11:51:21 -04:00
Rajendra Kadam	1f1679e960	community: Refactor PebbloSafeLoader (#25582 ) Refactor PebbloSafeLoader - Created `APIWrapper` and moved API logic into it. - Moved helper functions to the utility file. - Created smaller functions and methods for better readability. - Properly read environment variables. - Removed unused code. Issue: NA Dependencies: NA tests: Updated	2024-08-22 11:46:52 -04:00
maang-h	5e3a321f71	docs: Add ChatZhipuAI tool calling and structured output docstring (#25669 ) - Description: Add `ChatZhipuAI` tool calling and structured output docstring.	2024-08-22 10:34:41 -04:00
Noah Mayerhofer	0091947efd	community: add retry for session expired exception in neo4j (#25660 ) Description: The neo4j driver can raise a SessionExpired error, which is considered a retriable error. If a query fails with a SessionExpired error, this change retries every query once. This change will make the neo4j integration less flaky. Twitter handle: noahmay_	2024-08-22 13:07:36 +00:00
Dristy Srivastava	b002702af6	[Community][minor]: Updating metadata with full_path in SharePoint loader (#25593 ) - Description: Updating metadata for sharepoint loader with full path i.e., webUrl - Issue: NA - Dependencies: NA - Tests: NA - Docs NA Co-authored-by: dristy.cd <dristy@clouddefense.io> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-21 13:10:14 +00:00
Jabir	12e490ea56	Update azuresearch.py (#25577 ) This will allow complextype metadata to be returned. the current implementation throws error when dealing with nested metadata Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-20 12:53:30 +00:00
Erick Friis	e01c6789c4	core,community: add beta decorator to missed GraphVectorStore extensions (#25562 )	2024-08-19 17:29:09 -07:00
maang-h	015ab91b83	community[patch]: Add ToolMessage for ChatZhipuAI (#25547 ) - Description: Add ToolMessage for `ChatZhipuAI` to solve the issue #25490	2024-08-19 11:26:38 -04:00
Mohammad Mohtashim	75c3c81b8c	[Community]: Fix - Open AI Whisper `client.audio.transcriptions` returning Text Object which raises error (#25271 ) - Description: The following [line](`fd546196ef/libs/community/langchain_community/document_loaders/parsers/audio.py (L117)`) in `OpenAIWhisperParser` returns a text object for some odd reason despite the official documentation saying it should return `Transcript` Instance which should have the text attribute. But for the example given in the issue and even when I tried running on my own, I was directly getting the text. The small PR accounts for that. - Issue: : #25218 I was able to replicate the error even without the GenericLoader as shown below and the issue was with `OpenAIWhisperParser` ```python parser = OpenAIWhisperParser(api_key="sk-fxxxxxxxxx", response_format="srt", temperature=0) list(parser.lazy_parse(Blob.from_path('path_to_file.m4a'))) ```	2024-08-19 09:36:42 -04:00
maang-h	32f5147523	docs: Fix QianfanLLMEndpoint and Tongyi input text (#25529 ) - Description: Fix `QianfanLLMEndpoint` and `Tongyi` input text.	2024-08-19 09:23:09 -04:00
ZhangShenao	4255a30f20	Improvement[Community] Improve api doc for `SingleFileFacebookMessengerChatLoader` (#25536 ) Delete redundant args in api doc	2024-08-19 09:00:21 -04:00
ccurme	b83f1eb0d5	core, partners: implement standard tracing params for LLMs (#25410 )	2024-08-16 13:18:09 -04:00
Bagatur	253ceca76a	docs: fix mimetype parser docstring (#25463 )	2024-08-15 16:16:52 -07:00
ccurme	8afbab4cf6	langchain[patch]: deprecate various chains (#25310 ) - [x] NatbotChain: move to community, deprecate langchain version. Update to use `prompt \| llm \| output_parser` instead of LLMChain. - [x] LLMMathChain: deprecate + add langgraph replacement example to API ref - [x] HypotheticalDocumentEmbedder (retriever): update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] FlareChain: update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] ConstitutionalChain: deprecate + add langgraph replacement example to API ref - [x] LLMChainExtractor (document compressor): update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] LLMChainFilter (document compressor): update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] RePhraseQueryRetriever (retriever): update to use `prompt \| llm \| output_parser` instead of LLMChain	2024-08-15 10:49:26 -04:00
ccurme	ba167dc158	community[patch]: update connection string in azure cosmos integration test (#25438 )	2024-08-15 14:07:54 +00:00
Isaac Francisco	966b408634	[docs]: doc loader changes (#25417 )	2024-08-14 19:46:33 -07:00
Werner van der Merwe	1d3f7231b8	fix: typo where github should be gitlab (#25397 ) PR title: "GitLabToolkit: fix typo" - Description: fix typo where GitHub should have been GitLab - Dependencies: None	2024-08-14 18:36:25 +00:00
Bagatur	493e474063	docs: udpated api reference (#25172 ) - Move the API reference into the vercel build - Update api reference organization and styling	2024-08-14 07:00:17 -07:00
ccurme	27690506d0	multiple: update removal targets (#25361 )	2024-08-14 09:50:39 -04:00
Harrison Chase	967b6f21f6	docs: improve document loaders index (#25365 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-14 01:48:48 +00:00
Isaac Francisco	f4ffd692a3	[docs]: standardize doc loader doc strings (#25325 )	2024-08-13 23:18:56 +00:00
Isaac Francisco	e0bbb81d04	[docs]: standardize tool docstrings (#25351 )	2024-08-13 16:10:00 -07:00
thedavgar	9d08369442	community: fix AzureSearch vectorstore asyncronous methods (#24921 ) Description Fix the asyncronous methods to retrieve documents from AzureSearch VectorStore. The previous changes from [this commit](`ffe6ca986e`) create a similar code for the syncronous methods and the asyncronous ones but the asyncronous client return an asyncronous iterator "AsyncSearchItemPaged" as said in the issue #24740. To solve this issue, the syncronous iterators in asyncronous methods where changed to asyncronous iterators. @chrislrobert said in [this comment](https://github.com/langchain-ai/langchain/issues/24740#issuecomment-2254168302) that there was a still a flaw due to `with` blocks that close the client after each call. I removed this `with` blocks in the `async_client` following the same pattern as the sync `client`. In order to close up the connections, a __del__ method is included to gently close up clients once the vectorstore object is destroyed. Issue: #24740 and #24064 Dependencies: No new dependencies for this change Example notebook: I created a notebook just to test the changes work and gives the same results as the syncronous methods for vector and hybrid search. With these changes, the asyncronous methods in the retriever work as well. ![image](https://github.com/user-attachments/assets/697e431b-9d7f-4d0d-b205-59d051ac2b67) Lint and test: Passes the tests and the linter	2024-08-13 14:20:51 -07:00
Fedor Nikolaev	2b15518c5f	community: add args_schema to SearxSearchResults tool (#25350 ) This adds `args_schema` member to `SearxSearchResults` tool. This member is already present in the `SearxSearchRun` tool in the same file. I was having `TypeError: Type is not JSON serializable: AsyncCallbackManagerForToolRun` being thrown in langserve playground when I was using `SearxSearchResults` tool as a part of chain there. This fixes the issue, so the error is not raised anymore. This is a example langserve app that was giving me the error, but it works properly after the proposed fix: ```python #!/usr/bin/env python from fastapi import FastAPI from langchain_core.prompts import ChatPromptTemplate from langchain_core.output_parsers import StrOutputParser from langchain_core.runnables import RunnablePassthrough from langchain_openai import ChatOpenAI from langchain_community.utilities import SearxSearchWrapper from langchain_community.tools.searx_search.tool import SearxSearchResults from langserve import add_routes template = """Answer the question based only on the following context: {context} Question: {question} """ prompt = ChatPromptTemplate.from_template(template) model = ChatOpenAI() s = SearxSearchWrapper(searx_host="http://localhost:8080") search = SearxSearchResults(wrapper=s) search_chain = ( {"context": search, "question": RunnablePassthrough()} \| prompt \| model \| StrOutputParser() ) app = FastAPI() add_routes( app, search_chain, path="/chain", ) if __name__ == "__main__": import uvicorn uvicorn.run(app, host="localhost", port=8000) ```	2024-08-13 18:26:09 +00:00
maang-h	089f5e6cad	Standardize SparkLLM (#25239 ) - Description: Standardize SparkLLM, include: - docs, the issue #24803 - to support stream - update api url - model init arg names, the issue #20085	2024-08-13 09:50:12 -04:00
Chen Xiabin	24155aa1ac	qianfan generate/agenerate with usage_metadata (#25332 )	2024-08-13 09:24:41 -04:00
Erick Friis	2907ab2297	community: release 0.2.12 (#25324 )	2024-08-12 23:30:27 +00:00
Ben Chambers	1adc161642	community: kwargs for CassandraGraphVectorStore (#25300 ) - Description: pass kwargs from CassandraGraphVectorStore to underlying store Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-12 18:01:29 +00:00
ccurme	e77eeee6ee	core[patch]: add standard tracing params for retrievers (#25240 )	2024-08-12 14:51:59 +00:00
Mohammad Mohtashim	9927a4866d	[Community] - Added bind_tools and with_structured_output for ChatZhipuAI (#23887 ) - Description: This PR implements the `bind_tool` functionality for ChatZhipuAI as requested by the user. ChatZhipuAI models support tool calling according to the `OpenAI` tool format, as outlined in their official documentation [here](https://open.bigmodel.cn/dev/api#glm-4). - Issue: ##23868 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-12 14:11:43 +00:00
maang-h	bc60cddc1b	docs: Fix ChatBaichuan, QianfanChatEndpoint, ChatSparkLLM, ChatZhipuAI docs (#25265 ) - Description: Fix some chat models docs, include: - ChatBaichuan - QianfanChatEndpoint - ChatSparkLLM - ChatZhipuAI	2024-08-11 16:23:55 -04:00
ZhangShenao	43deed2a95	Improvement[Embeddings] Add dimension support to `ZhipuAIEmbeddings` (#25274 ) - In the in ` embedding-3 ` and later models of Zhipu AI, it is supported to specify the dimensions parameter of Embedding. Ref: https://bigmodel.cn/dev/api#text_embedding-3 . - Add test case for `embedding-3` model by assigning dimensions.	2024-08-11 16:20:37 -04:00
Eugene Yurtsev	6dd9f053e3	core[patch]: Deprecating beta upsert APIs in vectorstore (#25069 ) This PR deprecates the beta upsert APIs in vectorstore. We'll introduce them in a V2 abstraction instead to keep the existing vectorstore implementations lighter weight. The main problem with the existing APIs is that it's a bit more challenging to implement the correct behavior w/ respect to IDs since ID can be present in both the function signature and as an optional attribute on the document object. But VectorStores that pass the standard tests should have implemented the semantics properly! --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-09 17:17:36 -04:00
Eugene Yurtsev	b6f0174bb9	community[patch],core[patch]: Update EdenaiTool root_validator and add unit test in core (#25233 ) This PR gets rid `root_validators(allow_reuse=True)` logic used in EdenAI Tool in preparation for pydantic 2 upgrade. - add another test to secret_from_env_factory	2024-08-09 15:59:27 +00:00
Eugene Yurtsev	bd6c31617e	community[patch]: Remove more @allow_reuse=True validators (#25236 ) Remove some additional allow_reuse=True usage in @root_validators.	2024-08-09 11:10:27 -04:00
Eugene Yurtsev	6e57aa7c36	community[patch]: Remove usage of @root_validator(allow_reuse=True) (#25235 ) Remove usage of @root_validator(allow_reuse=True)	2024-08-09 10:57:42 -04:00
thiswillbeyourgithub	a2b4c33bd6	community[patch]: FAISS: ValueError mentions normalize_score_fn isntead of relevance_score_fn (#25225 ) Thank you for contributing to LangChain! - [X] PR title: "community: fix valueerror mentions wrong argument missing" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [X] PR message: *Delete this entire checklist* and replace with - Description: when faiss.py has a None relevance_score_fn it raises a ValueError that says a normalize_fn_score argument is needed. Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-09 14:40:29 +00:00
Shivendra Soni	66b7206ab6	community: Add llm-extraction option to FireCrawl Document Loader (#25231 ) Description: This minor PR aims to add `llm_extraction` to Firecrawl loader. This feature is supported on API and PythonSDK, but the langchain loader omits adding this to the response. Twitter handle: [scalable_pizza](https://x.com/scalablepizza) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-09 13:59:10 +00:00
ccurme	3b7437d184	docs: update integration api refs (#25195 ) - [x] toolkits - [x] retrievers (in this repo)	2024-08-09 12:27:32 +00:00
Eugene Yurtsev	98779797fe	community[patch]: Use get_fields adapter for pydantic (#25191 ) Change all usages of __fields__ with get_fields adapter merged into langchain_core. Code mod generated using the following grit pattern: ``` engine marzano(0.1) language python `$X.__fields__` => `get_fields($X)` where { add_import(source="langchain_core.utils.pydantic", name="get_fields") } ```	2024-08-08 14:43:09 -04:00
Rajendra Kadam	663638d6a8	community[minor]: [SharePointLoader] Load extended metadata for the root folder (#24872 ) - Title: [SharePointLoader] Load extended metadata for the root folder - Description: - Ensure extended metadata loads correctly for the root folder. - Cleanup: Refactor SharePointLoader to remove unused fields(`file_id` & `site_id`). - Dependencies: NA - Add tests and docs: NA	2024-08-08 14:39:16 -04:00
Eugene Yurtsev	bf5193bb99	community[patch]: Upgrade pydantic extra (#25185 ) Upgrade to using a literal for specifying the extra which is the recommended approach in pydantic 2. This works correctly also in pydantic v1. ```python from pydantic.v1 import BaseModel class Foo(BaseModel, extra="forbid"): x: int Foo(x=5, y=1) ``` And ```python from pydantic.v1 import BaseModel class Foo(BaseModel): x: int class Config: extra = "forbid" Foo(x=5, y=1) ``` ## Enum -> literal using grit pattern: ``` engine marzano(0.1) language python or { `extra=Extra.allow` => `extra="allow"`, `extra=Extra.forbid` => `extra="forbid"`, `extra=Extra.ignore` => `extra="ignore"` } ``` Resorted attributes in config and removed doc-string in case we will need to deal with going back and forth between pydantic v1 and v2 during the 0.3 release. (This will reduce merge conflicts.) ## Sort attributes in Config: ``` engine marzano(0.1) language python function sort($values) js { return $values.text.split(',').sort().join("\n"); } class_definition($name, $body) as $C where { $name <: `Config`, $body <: block($statements), $values = [], $statements <: some bubble($values) assignment() as $A where { $values += $A }, $body => sort($values), } ```	2024-08-08 17:20:39 +00:00
ololand	249945a572	Update polygon.py for business subscription (#25085 ) For business subscription the status is STOCKSBUSINESS not OK Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-08 15:28:41 +00:00
ogawa	d895db11d6	community[patch]: gpt-4o-2024-08-06 costs (#25164 ) - Description: updated OpenAI cost definitions according to the following: - https://openai.com/api/pricing/ - Twitter handle: `@ogawa65a` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-08 13:22:11 +00:00
maang-h	0ba125c3cd	docs: Standardize QianfanLLMEndpoint LLM (#25139 ) - Description: Standardize QianfanLLMEndpoint LLM，include: - docs, the issue #24803 - model init arg names, the issue #20085	2024-08-07 10:57:27 -04:00
Pat Patterson	7e7fcf5b1f	community: Fix ValidationError on creating GPT4AllEmbeddings with no gpt4all_kwargs (#25124 ) - Description: Instantiating `GPT4AllEmbeddings` with no `gpt4all_kwargs` argument raised a `ValidationError`. Root cause: #21238 added the capability to pass `gpt4all_kwargs` through to the `GPT4All` instance via `Embed4All`, but broke code that did not specify a `gpt4all_kwargs` argument. - Issue: #25119 - Dependencies: None - Twitter handle: [`@metadaddy`](https://twitter.com/metadaddy)	2024-08-07 13:34:01 +00:00
Virat Singh	264ab96980	community: Add stock market tools from financialdatasets.ai (#25025 ) Description: In this PR, I am adding three stock market tools from financialdatasets.ai (my API!): - get balance sheets - get cash flow statements - get income statements Twitter handle: [@virattt](https://twitter.com/virattt) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-06 18:28:12 +00:00
Naval Chand	71c0698ee4	Added bedrock 3-5 sonnet cost detials for BedrockAnthropicTokenUsageCallbackHandler (#25104 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Example: "community: Added bedrock 3-5 sonnet cost detials for BedrockAnthropicTokenUsageCallbackHandler" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Co-authored-by: Naval Chand <navalchand@192.168.1.36>	2024-08-06 17:28:47 +00:00

... 10 11 12 13 14 ...

2009 Commits