langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-07 12:06:43 +00:00

Author	SHA1	Message	Date
Christophe Bornet	ee98da4f4e	core[patch]: Add UP(upgrade) ruff rules (#25358 )	2024-08-22 16:29:22 -07:00
William FH	294f7fcb38	core[patch]: Remove different parent run id warning (#25683 )	2024-08-22 16:10:35 -07:00
Vadym Barda	46d344c33d	core[patch]: support drawing nested subgraphs in draw_mermaid (#25581 ) Previously the code was able to only handle a single level of nesting for subgraphs in mermaid. This change adds support for arbitrary nesting of subgraphs.	2024-08-22 16:08:49 -07:00
Manuel Jaiczay	1c31234eed	community: fix HuggingFacePipeline pipeline_kwargs (#19920 ) Fix handling of pipeline_kwargs to prioritize class attribute defaults. #19770 Co-authored-by: jaizo <manuel.jaiczay@polygons.at> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-08-22 18:29:46 -04:00
Nobuhiko Otoba	4b63a217c2	"community: Fix GithubFileLoader source code", "docs: Fix GithubFileLoader code sample" (#19943 ) This PR adds tiny improvements to the `GithubFileLoader` document loader and its code sample, addressing the following issues: 1. Currently, the `file_extension` argument of `GithubFileLoader` does not change its behavior at all. 1. The `GithubFileLoader` sample code in `docs/docs/integrations/document_loaders/github.ipynb` does not work as it stands. The respective solutions I propose are the following: 1. Remove `file_extension` argument from `GithubFileLoader`. 1. Specify the branch as `master` (not the default `main`) and rename `documents` as `document`. --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-08-22 18:24:57 -04:00
Bagatur	cf9c484715	standard-tests[patch]: test Message.name (#25677 ) Tests: https://github.com/langchain-ai/langchain/actions/runs/10516092584	2024-08-22 14:47:31 -07:00
Nada Amin	ac7b71e0d7	langchain_community.graphs: Neo4JGraph: prop min_size might be None (#23944 ) When I used the Neo4JGraph enhanced_schema=True option, I ran into an error because a prop min_size of None was compared numerically with an int. The fix I applied is similar to the pattern of skipping embeddings elsewhere in the file. Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-22 20:29:52 +00:00
CastaChick	7d13a2f958	core[patch]: add option to specify the chunk separator in `merge_message_runs` (#24783 ) Description: LLM will stop generating text even in the middle of a sentence if `finish_reason` is `length` (for OpenAI) or `stop_reason` is `max_tokens` (for Anthropic). To obtain longer outputs from LLM, we should call the message generation API multiple times and merge the results into the text to circumvent the API's output token limit. The extra line breaks forced by the `merge_message_runs` function when seamlessly merging messages can be annoying, so I added the option to specify the chunk separator. Issue: No corresponding issues. Dependencies: No dependencies required. Twitter handle: @hanama_chem https://x.com/hanama_chem --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-22 19:46:25 +00:00
basirsedighi	0f3fe44e44	parsed_json is expected to be a list of dictionaries, but it seems to… (#24018 ) parsed_json is expected to be a list of dictionaries, but it seems to… be a single dictionary instead. This is at libs/experimental/langchain_experimental/graph_transformers/llm.py process process_response Thank you for contributing to LangChain! - [ ] Bugfix: "experimental: bugfix" --------- Co-authored-by: based <basir.sedighi@nris.no> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 19:09:43 +00:00
ZhangShenao	8bde04079b	patch[experimental] Fix start_index in `SemanticChunker` (#24761 ) - Cause chunks are joined by space, so they can't be found in text, and the final `start_index` is very possibility to be -1. - The simplest way is to use the natural index of the chunk as `start_index`.	2024-08-22 14:59:40 -04:00
William FH	fad6fc866a	Rm DeepInfra Breakpoint Comment (#25206 ) tbh should rm the print staement too	2024-08-22 14:43:44 -04:00
yahya-mouman	e5bb4cb646	lagchain-pinecone: add id to similarity documents results (#25630 ) - Description: This change adds the ID field that's required in Pinecone to the result documents of the similarity search method. - Issue: Lack of document metadata namely the ID field - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 18:33:26 +00:00
Eric Pinzur	01ded5e2f9	community: add metadata filter to CassandraGraphVectorStore (#25663 ) - Description: - Added metadata filtering support to `langchain_community.graph_vectorstores.cassandra.CassandraGraphVectorStore` - Also fixed type conversion issues highlighted by mypy. - Dependencies: - `ragstack-ai-knowledge-store 0.2.0` (released July 23, 2024) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 14:27:16 -04:00
Ivan	5b9290a449	Fix UnionType type var replacement (#25566 ) [langchain_core] Fix UnionType type var replacement - Added types.UnionType to typing.Union mapping Type replacement cause `TypeError: 'type' object is not subscriptable` if any of union type comes as function `_py_38_safe_origin` return `types.UnionType` instead of `typing.Union` ```python >>> from types import UnionType >>> from typing import Union, get_origin >>> type_ = get_origin(str \| None) >>> type_ <class 'types.UnionType'> >>> UnionType[(str, None)] Traceback (most recent call last): File "<stdin>", line 1, in <module> TypeError: 'type' object is not subscriptable >>> Union[(str, None)] typing.Optional[str] ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 14:22:09 -04:00
William FH	8230ba47f3	core[patch]: Improve some error messages and add another test for checking RunnableWithMessageHistory (#25209 ) Also add more useful error messages. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-08-22 18:14:27 +00:00
Hasan Kumar	b4fcda7657	langchain: Fix type warnings when passing Runnable as agent to AgentExecutor (#24750 ) Fix for https://github.com/langchain-ai/langchain/issues/13075 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 14:02:02 -04:00
Erick Friis	9447925d94	cli: release 0.0.30 (#25672 )	2024-08-22 10:21:19 -07:00
mschoenb97IL	e499caa9cd	community: Give more context on DeepInfra 500 errors (#25671 ) Description: DeepInfra 500 errors have useful information in the text field that isn't being exposed to the user. I updated the error message to fix this. As an example, this code ``` from langchain_community.chat_models import ChatDeepInfra from langchain_core.messages import HumanMessage model = "meta-llama/Meta-Llama-3-70B-Instruct" deepinfra_api_token = "..." model = ChatDeepInfra(model=model, deepinfra_api_token=deepinfra_api_token) messages = [HumanMessage("All work and no play makes Jack a dull boy\n" * 9000)] response = model.invoke(messages) ``` Currently gives this error: ``` langchain_community.chat_models.deepinfra.ChatDeepInfraException: DeepInfra Server: Error 500 ``` This change would give the following error: ``` langchain_community.chat_models.deepinfra.ChatDeepInfraException: DeepInfra Server error status 500: {"error":{"message":"Requested input length 99009 exceeds maximum input length 8192"}} ```	2024-08-22 10:10:51 -07:00
Rajendra Kadam	4ff2f4499e	community: Refactor PebbloRetrievalQA (#25583 ) Refactor PebbloRetrievalQA - Created `APIWrapper` and moved API logic into it. - Created smaller functions/methods for better readability. - Properly read environment variables. - Removed unused code. - Updated models Issue: NA Dependencies: NA tests: NA	2024-08-22 11:51:21 -04:00
Rajendra Kadam	1f1679e960	community: Refactor PebbloSafeLoader (#25582 ) Refactor PebbloSafeLoader - Created `APIWrapper` and moved API logic into it. - Moved helper functions to the utility file. - Created smaller functions and methods for better readability. - Properly read environment variables. - Removed unused code. Issue: NA Dependencies: NA tests: Updated	2024-08-22 11:46:52 -04:00
maang-h	5e3a321f71	docs: Add ChatZhipuAI tool calling and structured output docstring (#25669 ) - Description: Add `ChatZhipuAI` tool calling and structured output docstring.	2024-08-22 10:34:41 -04:00
Krishna Kulkarni	820da64983	limit the most recent documents to fetch from MongoDB database. (#25435 ) limit the most recent documents to fetch from MongoDB database. Thank you for contributing to LangChain! - [ ] limit the most recent documents to fetch from MongoDB database.: "langchain_mongodb: limit the most recent documents to fetch from MongoDB database." - [ ] PR message: *Delete this entire checklist* and replace with - Description: Added a doc_limit parameter which enables the limit for the documents to fetch from MongoDB database - Issue: - Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 10:33:45 -04:00
Noah Mayerhofer	0091947efd	community: add retry for session expired exception in neo4j (#25660 ) Description: The neo4j driver can raise a SessionExpired error, which is considered a retriable error. If a query fails with a SessionExpired error, this change retries every query once. This change will make the neo4j integration less flaky. Twitter handle: noahmay_	2024-08-22 13:07:36 +00:00
Yuki Watanabe	3981d736df	databricks: Add partner package directory and ChatDatabricks implementation (#25430 ) ### Summary Create `langchain-databricks` as a new partner packages. This PR does not migrate all existing Databricks integration, but the package will eventually contain: * `ChatDatabricks` (implemented in this PR) * `DatabricksVectorSearch` * `DatabricksEmbeddings` * ~`UCFunctionToolkit`~ (will be done after UC SDK work which drastically simplify implementation) Also, this PR does not add integration tests yet. This will be added once the Databricks test workspace is ready. Tagging @efriis as POC ### Tracker [✍️] Create a package and imgrate ChatDatabricks [ ] Migrate DatabricksVectorSearch, DatabricksEmbeddings, and their docs ~[ ] Migrate UCFunctionToolkit and its doc~ [ ] Add provider document and update README.md [ ] Add integration tests and set up secrets (after moved to an external package) [ ] Add deprecation note to the community implementations. --------- Signed-off-by: B-Step62 <yuki.watanabe@databricks.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-21 17:19:28 -07:00
Scott Hurrey	fb1d67edf6	box: add retrievers and fix docs (#25633 ) Thank you for contributing to LangChain! Description: Adding `BoxRetriever` for langchain_box. This retriever handles two use cases: * Retrieve all documents that match a full-text search * Retrieve the answer to a Box AI prompt as a Document Twitter handle: @BoxPlatform - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-21 22:40:40 +00:00
Erick Friis	766b650fdc	chroma: add back fastapi optional dep (#25641 )	2024-08-21 20:00:47 +00:00
Bagatur	9daff60698	docs: fix openai api ref (#25639 )	2024-08-21 12:55:17 -07:00
Erick Friis	c8be0a9f70	partners/unstructured: release 0.1.2 (#25637 )	2024-08-21 12:53:55 -07:00
Christophe Bornet	b71ae52e65	[unstructured][security] Bump unstructured version (#25364 ) This ensures version 0.15.7+ is pulled. This version of unstructured uses a version of NLTK >= 3.8.2 that has a fix for a critical CVE: https://github.com/advisories/GHSA-cgvx-9447-vcch	2024-08-21 12:25:24 -07:00
Bagatur	39c44817ae	infra: test convert_message (#25632 )	2024-08-21 18:24:06 +00:00
Bagatur	628574b9c2	core[patch]: Release 0.2.34 (#25622 )	2024-08-21 16:26:51 +00:00
Bagatur	0bc3845e1e	core[patch]: support oai dicts as messages (#25621 ) and update langsmtih example selector docs	2024-08-21 16:13:15 +00:00
ccurme	10a2ce2a26	together[patch]: use mixtral in standard integration tests (#25619 ) Mistral 7B occasionally fails tool-calling tests. Updating to Mixtral appears to improve this.	2024-08-21 14:26:25 +00:00
Dristy Srivastava	b002702af6	[Community][minor]: Updating metadata with full_path in SharePoint loader (#25593 ) - Description: Updating metadata for sharepoint loader with full path i.e., webUrl - Issue: NA - Dependencies: NA - Tests: NA - Docs NA Co-authored-by: dristy.cd <dristy@clouddefense.io> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-21 13:10:14 +00:00
ZhangShenao	34d0417eb5	Improvement[Doc] Improve api doc in of `PineconeVectorStore` (#25605 ) Complete missing arguments in api doc of `PineconeVectorStore`.	2024-08-21 08:58:00 -04:00
Scott Hurrey	55fd2e2158	box: add langchain box package and DocumentLoader (#25506 ) Thank you for contributing to LangChain! -Description: Adding new package: `langchain-box`: * `langchain_box.document_loaders.BoxLoader` — DocumentLoader functionality * `langchain_box.utilities.BoxAPIWrapper` — Box-specific code * `langchain_box.utilities.BoxAuth` — Helper class for Box authentication * `langchain_box.utilities.BoxAuthType` — enum used by BoxAuth class - Twitter handle: @boxplatform - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Erick Friis <erickfriis@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-21 02:23:43 +00:00
Erick Friis	f878df404f	partners/chroma: release 0.1.3 (#25599 )	2024-08-20 23:24:32 +00:00
Erick Friis	60cf49a618	chroma: ban chromadb sdk versions 0.5.4 and 0.5.5 due to pydantic bug (#25586 ) also remove some unused dependencies (fastapi) and unused test/lint/dev dependencies (community, openai, textsplitters) chromadb 0.5.4 introduced usage of `model_fields` which is pydantic v2 specific. also released in 0.5.5	2024-08-20 23:21:38 +00:00
Erick Friis	e37caa9b9a	core: fix fallback context overwriting (#25550 ) fixes #25337	2024-08-20 16:07:12 -07:00
Bagatur	8a71f1b41b	core[minor]: add langsmith document loader (#25493 ) needs tests	2024-08-20 10:22:14 -07:00
Jabir	12e490ea56	Update azuresearch.py (#25577 ) This will allow complextype metadata to be returned. the current implementation throws error when dealing with nested metadata Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-20 12:53:30 +00:00
Bagatur	4bd005adb6	core[patch]: Allow bound models as token_counter in trim_messages (#25563 )	2024-08-20 00:21:22 -07:00
Erick Friis	e01c6789c4	core,community: add beta decorator to missed GraphVectorStore extensions (#25562 )	2024-08-19 17:29:09 -07:00
Bagatur	6b98207eda	infra: test chat prompt ser/des (#25557 )	2024-08-19 15:27:36 -07:00
ccurme	c5bf114c0f	together, standard-tests: specify tool_choice in standard tests (#25548 ) Here we allow standard tests to specify a value for `tool_choice` via a `tool_choice_value` property, which defaults to None. Chat models [available in Together](https://docs.together.ai/docs/chat-models) have issues passing standard tool calling tests: - llama 3.1 models currently [appear to rely on user-side parsing](https://docs.together.ai/docs/llama-3-function-calling) in Together; - Mixtral-8x7B and Mistral-7B (currently tested) consistently do not call tools in some tests. Specifying tool_choice also lets us remove an existing `xfail` and use a smaller model in Groq tests.	2024-08-19 16:37:36 -04:00
maang-h	015ab91b83	community[patch]: Add ToolMessage for ChatZhipuAI (#25547 ) - Description: Add ToolMessage for `ChatZhipuAI` to solve the issue #25490	2024-08-19 11:26:38 -04:00
ccurme	5a3aaae6dc	groq[patch]: update model used for llama tests (#25542 ) `llama-3.1-8b-instant` often fails some of the tool calling standard tests. Here we update to `llama-3.1-70b-versatile`.	2024-08-19 13:58:06 +00:00
Mohammad Mohtashim	75c3c81b8c	[Community]: Fix - Open AI Whisper `client.audio.transcriptions` returning Text Object which raises error (#25271 ) - Description: The following [line](`fd546196ef/libs/community/langchain_community/document_loaders/parsers/audio.py (L117)`) in `OpenAIWhisperParser` returns a text object for some odd reason despite the official documentation saying it should return `Transcript` Instance which should have the text attribute. But for the example given in the issue and even when I tried running on my own, I was directly getting the text. The small PR accounts for that. - Issue: : #25218 I was able to replicate the error even without the GenericLoader as shown below and the issue was with `OpenAIWhisperParser` ```python parser = OpenAIWhisperParser(api_key="sk-fxxxxxxxxx", response_format="srt", temperature=0) list(parser.lazy_parse(Blob.from_path('path_to_file.m4a'))) ```	2024-08-19 09:36:42 -04:00
Thin red line 未来产品经理	0f7b8adddf	fix issue: cannot use document_variable_name to override context in create_stuff_documents_chain (#25531 ) …he prompt in the create_stuff_documents_chain Thank you for contributing to LangChain! - [ ] PR title: "langchain:add document_variable_name in the function _validate_prompt in create_stuff_documents_chain" - [ ] PR message: - Description: add document_variable_name in the function _validate_prompt in create_stuff_documents_chain - Issue: according to the description of create_stuff_documents_chain function, the parameter document_variable_name can be used to override the "context" in the prompt, but in the function, _validate_prompt it still use DOCUMENTS_KEY to check if it is a valid prompt, the value of DOCUMENTS_KEY is always "context", so even through the user use document_variable_name to override it, the code still tries to check if "context" is in the prompt, and finally it reports error. so I use document_variable_name to replace DOCUMENTS_KEY, the default value of document_variable_name is "context" which is same as DOCUMENTS_KEY, but it can be override by users. - Dependencies: none - Twitter handle: https://x.com/xjr199703 - [ ] Add tests and docs: none - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-19 13:33:19 +00:00
maang-h	32f5147523	docs: Fix QianfanLLMEndpoint and Tongyi input text (#25529 ) - Description: Fix `QianfanLLMEndpoint` and `Tongyi` input text.	2024-08-19 09:23:09 -04:00
ZhangShenao	4255a30f20	Improvement[Community] Improve api doc for `SingleFileFacebookMessengerChatLoader` (#25536 ) Delete redundant args in api doc	2024-08-19 09:00:21 -04:00
Bagatur	49dea06af1	docs: fix Agent deprecation msg (#25464 )	2024-08-18 19:15:52 +00:00
Hassan El Mghari	937b3904eb	together[patch]: update base url (#25524 ) Updated the Together base URL from `.ai` to `.xyz` since some customers have reported problems with `.ai`.	2024-08-18 10:48:30 -07:00
Isaac Francisco	a2e90a5a43	add embeddings integration tests (#25508 )	2024-08-16 13:20:37 -07:00
Bagatur	a06818a654	openai[patch]: update core dep (#25502 )	2024-08-16 18:30:17 +00:00
Bagatur	df98552b6f	core[patch]: Release 0.2.33 (#25498 )	2024-08-16 11:18:54 -07:00
ccurme	b83f1eb0d5	core, partners: implement standard tracing params for LLMs (#25410 )	2024-08-16 13:18:09 -04:00
Bagatur	9f0c76bf89	openai[patch]: Release 0.1.22 (#25496 )	2024-08-16 16:53:04 +00:00
ccurme	01ecd0acba	openai[patch]: fix json mode for Azure (#25488 ) https://github.com/langchain-ai/langchain/issues/25479 https://github.com/langchain-ai/langchain/issues/25485 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-16 09:50:50 -07:00
Bagatur	253ceca76a	docs: fix mimetype parser docstring (#25463 )	2024-08-15 16:16:52 -07:00
Eugene Yurtsev	e18511bb22	core[minor], anthropic[patch]: Upgrade @root_validator usage to be consistent with pydantic 2 (#25457 ) anthropic: Upgrade `@root_validator` usage to be consistent with pydantic 2 core: support looking up multiple keys from env in from_env factory	2024-08-15 20:09:34 +00:00
Eugene Yurtsev	34da8be60b	pinecone[patch]: Upgrade @root_validators to be consistent with pydantic 2 (#25453 ) Upgrade root validators for pydantic 2 migration	2024-08-15 19:45:14 +00:00
Eugene Yurtsev	b297af5482	voyageai[patch]: Upgrade root validators for pydantic 2 (#25455 ) Update @root_validators to be consistent with pydantic 2 semantics	2024-08-15 15:30:41 -04:00
Eugene Yurtsev	4cdaca67dc	ai21[patch]: Upgrade @root_validators for pydantic 2 migration (#25454 ) Upgrade @root_validators usage to match pydantic 2 semantics	2024-08-15 14:54:08 -04:00
Eugene Yurtsev	d72a08a60d	groq[patch]: Update root validators for pydantic 2 migration (#25402 )	2024-08-15 18:46:52 +00:00
Isaac Francisco	5150ec3a04	[experimental]: minor fix to open assistants code (#24682 )	2024-08-15 17:50:57 +00:00
Bagatur	2b4fbcb4b4	docs: format oai embeddings docstring (#25448 )	2024-08-15 16:57:54 +00:00
Eugene Yurtsev	eb3870e9d8	fireworks[patch]: Upgrade @root_validators to be pydantic 2 compliant (#25443 ) Update @root_validators to be pydantic 2 compliant	2024-08-15 16:56:48 +00:00
William FH	75ae585deb	Merge support for group manager (#25360 )	2024-08-15 09:56:31 -07:00
Eugene Yurtsev	2ef9d12372	mistralai[patch]: Update more @root_validators for pydantic 2 compatibility (#25446 ) Update @root_validators in mistralai integration for pydantic 2 compatibility	2024-08-15 12:44:42 -04:00
Eugene Yurtsev	831708beb7	together[patch]: Update @root_validator for pydantic 2 compatibility (#25423 ) This PR updates usage of @root_validator to be compatible with pydantic 2.	2024-08-15 11:27:42 -04:00
Eugene Yurtsev	a114255b82	ai21[patch]: Update @root_validators for pydantic2 migration (#25401 ) Update @root_validators for pydantic 2 migration.	2024-08-15 11:26:44 -04:00
Eugene Yurtsev	6f68c8d6ab	mistralai[patch]: Update root validator for compatibility with pydantic 2 (#25403 )	2024-08-15 11:26:24 -04:00
ccurme	8afbab4cf6	langchain[patch]: deprecate various chains (#25310 ) - [x] NatbotChain: move to community, deprecate langchain version. Update to use `prompt \| llm \| output_parser` instead of LLMChain. - [x] LLMMathChain: deprecate + add langgraph replacement example to API ref - [x] HypotheticalDocumentEmbedder (retriever): update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] FlareChain: update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] ConstitutionalChain: deprecate + add langgraph replacement example to API ref - [x] LLMChainExtractor (document compressor): update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] LLMChainFilter (document compressor): update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] RePhraseQueryRetriever (retriever): update to use `prompt \| llm \| output_parser` instead of LLMChain	2024-08-15 10:49:26 -04:00
Luke	66e30efa61	experimental: Fix divide by 0 error (#25439 ) Within the semantic chunker, when calling `_threshold_from_clusters` there is the possibility for a divide by 0 error if the `number_of_chunks` is equal to the length of `distances`. Fix simply implements a check if these values match to prevent the error and enable chunking to continue.	2024-08-15 14:46:30 +00:00
ccurme	ba167dc158	community[patch]: update connection string in azure cosmos integration test (#25438 )	2024-08-15 14:07:54 +00:00
Isaac Francisco	966b408634	[docs]: doc loader changes (#25417 )	2024-08-14 19:46:33 -07:00
ccurme	bd261456f6	langchain: bump core to 0.2.32 (#25421 )	2024-08-15 00:00:42 +00:00
Bagatur	ec8ffc8f40	core[patch]: Release 0.2.32 (#25420 )	2024-08-14 15:56:56 -07:00
Bagatur	2494cecabf	core[patch]: tool import fix (#25419 )	2024-08-14 22:54:13 +00:00
ccurme	df632b8cde	langchain: bump min core version (#25418 )	2024-08-14 22:51:35 +00:00
ccurme	1050e890c6	langchain: release 0.2.14 (#25416 ) Fixes https://github.com/langchain-ai/langchain/issues/25413	2024-08-14 22:29:39 +00:00
gbaian10	0a99935794	docs: remove the extra period in docstring (#25414 ) Remove the period after the hyperlink in the docstring of BaseChatOpenAI.with_structured_output. I have repeatedly copied the extra period at the end of the hyperlink, which results in a "Page not found" page when pasted into the browser.	2024-08-14 18:07:15 -04:00
Werner van der Merwe	1d3f7231b8	fix: typo where github should be gitlab (#25397 ) PR title: "GitLabToolkit: fix typo" - Description: fix typo where GitHub should have been GitLab - Dependencies: None	2024-08-14 18:36:25 +00:00
Bagatur	a58d4ba340	core[patch]: Release 0.2.31 (#25388 )	2024-08-14 11:26:49 -07:00
Bagatur	414154fa59	experimental[patch]: refactor rl chain structure (#25398 ) can't have a class and function with same name but different capitalization in same file for api reference building	2024-08-14 17:09:43 +00:00
Bagatur	63c483ea01	standard-tests: import fix (#25395 )	2024-08-14 09:13:56 -07:00
Bagatur	eec7bb4f51	anthropic[patch]: Release 0.1.23 (#25394 )	2024-08-14 09:03:39 -07:00
Eugene Yurtsev	f4196f1fb8	ollama[patch]: Update extra in ollama package (#25383 ) Backwards compatible change that converts pydantic extras to literals which is consistent with pydantic 2 usage.	2024-08-14 10:30:01 -04:00
Chengyu Yan	d0ad713937	core: fix issue#24660, slove error messages about `ValueError` when use model with history (#25183 ) - Description: This PR will slove error messages about `ValueError` when use model with history. Detail in #24660. #22933 causes that `langchain_core.runnables.history.RunnableWithMessageHistory._get_output_messages` miss type check of `output_val` if `output_val` is `False`. After running `RunnableWithMessageHistory._is_not_async`, `output` is `False`. `249945a572/libs/core/langchain_core/runnables/history.py (L323-L334)` `15a36dd0a2/libs/core/langchain_core/runnables/history.py (L461-L471)` ~~I suggest that `_get_output_messages` return empty list when `output_val == False`.~~ - Issue: - #24660 - Dependencies:: No Change. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-08-14 14:26:22 +00:00
Bagatur	493e474063	docs: udpated api reference (#25172 ) - Move the API reference into the vercel build - Update api reference organization and styling	2024-08-14 07:00:17 -07:00
Eugene Yurtsev	5f5e8c9a60	huggingface[patch], pinecone[patch], fireworks[patch], mistralai[patch], voyageai[patch], togetherai[path]: convert Pydantic extras to literals (#25384 ) Backwards compatible change that converts pydantic extras to literals which is consistent with pydantic 2 usage. - fireworks - voyage ai - mistralai - mistral ai - together ai - huggigng face - pinecone	2024-08-14 09:55:30 -04:00
Eugene Yurtsev	d00176e523	openai[patch]: Update extra to match pydantic 2 (#25382 ) Backwards compatible change that converts pydantic extras to literals which is consistent with pydantic 2 usage.	2024-08-14 09:55:18 -04:00
Eugene Yurtsev	dc51cc5690	core[minor]: Prevent PydanticOutputParser from encoding schema as ASCII (#25386 ) This allows users to provide parameter descriptions in the pydantic models in other languages. Continuing this PR: https://github.com/langchain-ai/langchain/pull/24809	2024-08-14 13:54:31 +00:00
ccurme	27690506d0	multiple: update removal targets (#25361 )	2024-08-14 09:50:39 -04:00
Harrison Chase	967b6f21f6	docs: improve document loaders index (#25365 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-14 01:48:48 +00:00
Isaac Francisco	f4ffd692a3	[docs]: standardize doc loader doc strings (#25325 )	2024-08-13 23:18:56 +00:00
Isaac Francisco	e0bbb81d04	[docs]: standardize tool docstrings (#25351 )	2024-08-13 16:10:00 -07:00
Erick Friis	d5b548b4ce	docs: index pages, sidebars (#25316 )	2024-08-13 15:52:51 -07:00
Isaac Francisco	0478f7f5e4	[docs]: LLM integration pages (#25005 )	2024-08-13 14:50:45 -07:00
thedavgar	9d08369442	community: fix AzureSearch vectorstore asyncronous methods (#24921 ) Description Fix the asyncronous methods to retrieve documents from AzureSearch VectorStore. The previous changes from [this commit](`ffe6ca986e`) create a similar code for the syncronous methods and the asyncronous ones but the asyncronous client return an asyncronous iterator "AsyncSearchItemPaged" as said in the issue #24740. To solve this issue, the syncronous iterators in asyncronous methods where changed to asyncronous iterators. @chrislrobert said in [this comment](https://github.com/langchain-ai/langchain/issues/24740#issuecomment-2254168302) that there was a still a flaw due to `with` blocks that close the client after each call. I removed this `with` blocks in the `async_client` following the same pattern as the sync `client`. In order to close up the connections, a __del__ method is included to gently close up clients once the vectorstore object is destroyed. Issue: #24740 and #24064 Dependencies: No new dependencies for this change Example notebook: I created a notebook just to test the changes work and gives the same results as the syncronous methods for vector and hybrid search. With these changes, the asyncronous methods in the retriever work as well. ![image](https://github.com/user-attachments/assets/697e431b-9d7f-4d0d-b205-59d051ac2b67) Lint and test: Passes the tests and the linter	2024-08-13 14:20:51 -07:00
Fedor Nikolaev	2b15518c5f	community: add args_schema to SearxSearchResults tool (#25350 ) This adds `args_schema` member to `SearxSearchResults` tool. This member is already present in the `SearxSearchRun` tool in the same file. I was having `TypeError: Type is not JSON serializable: AsyncCallbackManagerForToolRun` being thrown in langserve playground when I was using `SearxSearchResults` tool as a part of chain there. This fixes the issue, so the error is not raised anymore. This is a example langserve app that was giving me the error, but it works properly after the proposed fix: ```python #!/usr/bin/env python from fastapi import FastAPI from langchain_core.prompts import ChatPromptTemplate from langchain_core.output_parsers import StrOutputParser from langchain_core.runnables import RunnablePassthrough from langchain_openai import ChatOpenAI from langchain_community.utilities import SearxSearchWrapper from langchain_community.tools.searx_search.tool import SearxSearchResults from langserve import add_routes template = """Answer the question based only on the following context: {context} Question: {question} """ prompt = ChatPromptTemplate.from_template(template) model = ChatOpenAI() s = SearxSearchWrapper(searx_host="http://localhost:8080") search = SearxSearchResults(wrapper=s) search_chain = ( {"context": search, "question": RunnablePassthrough()} \| prompt \| model \| StrOutputParser() ) app = FastAPI() add_routes( app, search_chain, path="/chain", ) if __name__ == "__main__": import uvicorn uvicorn.run(app, host="localhost", port=8000) ```	2024-08-13 18:26:09 +00:00
maang-h	089f5e6cad	Standardize SparkLLM (#25239 ) - Description: Standardize SparkLLM, include: - docs, the issue #24803 - to support stream - update api url - model init arg names, the issue #20085	2024-08-13 09:50:12 -04:00
Chen Xiabin	24155aa1ac	qianfan generate/agenerate with usage_metadata (#25332 )	2024-08-13 09:24:41 -04:00
Christophe Bornet	ebbe609193	Add README for astradb package (#25345 ) Similar to https://github.com/langchain-ai/langchain/blob/master/libs/partners/ibm/README.md	2024-08-13 09:17:23 -04:00
Eugene Yurtsev	f679ed72ca	ollama[patch]: Update API Reference for ollama embeddings (#25315 ) Update API reference for OllamaEmbeddings Issue: https://github.com/langchain-ai/langchain/issues/24856	2024-08-12 21:31:48 -04:00
Erick Friis	2907ab2297	community: release 0.2.12 (#25324 )	2024-08-12 23:30:27 +00:00
Erick Friis	06f8bd9946	langchain: release 0.2.13 (#25323 )	2024-08-12 22:24:06 +00:00
Erick Friis	252f0877d1	core: release 0.2.30 (#25321 )	2024-08-12 22:01:24 +00:00
Eugene Yurtsev	217a915b29	openai: Update API Reference docs for AzureOpenAI Embeddings (#25312 ) Update AzureOpenAI Embeddings docs	2024-08-12 19:41:18 +00:00
Eugene Yurtsev	056c7c2983	core[patch]: Update API reference for fake embeddings (#25313 ) Issue: https://github.com/langchain-ai/langchain/issues/24856 Using the same template for the fake embeddings in langchain_core as used in the integrations.	2024-08-12 19:40:05 +00:00
Ben Chambers	1adc161642	community: kwargs for CassandraGraphVectorStore (#25300 ) - Description: pass kwargs from CassandraGraphVectorStore to underlying store Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-12 18:01:29 +00:00
Eugene Yurtsev	1c9917dfa2	fireworks[patch]: Fix doc-string for API Referenmce (#25304 )	2024-08-12 17:16:13 +00:00
Eugene Yurtsev	ccff1ba8b8	ai21[patch]: Update API reference documentation (#25302 ) Issue: https://github.com/langchain-ai/langchain/issues/24856	2024-08-12 13:15:27 -04:00
Eugene Yurtsev	53ee5770d3	fireworks: Add APIReference for the FireworksEmbeddings model (#25292 ) Add API Reference documentation for the FireworksEmbedding model. Issue: https://github.com/langchain-ai/langchain/issues/24856	2024-08-12 13:13:43 -04:00
Eugene Yurtsev	8626abf8b5	togetherai[patch]: Update API Reference for together AI embeddings model (#25295 ) Issue: https://github.com/langchain-ai/langchain/issues/24856	2024-08-12 17:12:28 +00:00
Eugene Yurtsev	1af8456a2c	mistralai[patch]: Docs Update APIReference for MistralAIEmbeddings (#25294 ) Update API Reference for MistralAI embeddings Issue: https://github.com/langchain-ai/langchain/issues/24856	2024-08-12 15:25:37 +00:00
Eugene Yurtsev	0a3500808d	openai[patch]: Docs fix RST formatting in OpenAIEmbeddings (#25293 )	2024-08-12 11:24:35 -04:00
Eugene Yurtsev	ee8a585791	openai[patch]: Add API Reference docs to OpenAIEmbeddings (#25290 ) Issue: [24856](https://github.com/langchain-ai/langchain/issues/24856)	2024-08-12 14:53:51 +00:00
ccurme	e77eeee6ee	core[patch]: add standard tracing params for retrievers (#25240 )	2024-08-12 14:51:59 +00:00
Mohammad Mohtashim	9927a4866d	[Community] - Added bind_tools and with_structured_output for ChatZhipuAI (#23887 ) - Description: This PR implements the `bind_tool` functionality for ChatZhipuAI as requested by the user. ChatZhipuAI models support tool calling according to the `OpenAI` tool format, as outlined in their official documentation [here](https://open.bigmodel.cn/dev/api#glm-4). - Issue: ##23868 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-12 14:11:43 +00:00
Anush	472527166f	qdrant: Update API reference link and install command (#25245 ) ## Description As the title goes. The current API reference links to the deprecated class.	2024-08-11 16:54:14 -04:00
gbaian10	aa2722cbe2	docs: update numbering of items in docstring (#25267 ) A problem similar to #25093 . Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-11 20:50:24 +00:00
Maddy Adams	a82c0533f2	langchain: default to langsmith sdk for pulling prompts, fallback to langchainhub (#24156 ) Description: Deprecating langchainhub, replacing with langsmith sdk	2024-08-11 13:30:52 -07:00
maang-h	bc60cddc1b	docs: Fix ChatBaichuan, QianfanChatEndpoint, ChatSparkLLM, ChatZhipuAI docs (#25265 ) - Description: Fix some chat models docs, include: - ChatBaichuan - QianfanChatEndpoint - ChatSparkLLM - ChatZhipuAI	2024-08-11 16:23:55 -04:00
ZhangShenao	43deed2a95	Improvement[Embeddings] Add dimension support to `ZhipuAIEmbeddings` (#25274 ) - In the in ` embedding-3 ` and later models of Zhipu AI, it is supported to specify the dimensions parameter of Embedding. Ref: https://bigmodel.cn/dev/api#text_embedding-3 . - Add test case for `embedding-3` model by assigning dimensions.	2024-08-11 16:20:37 -04:00
maang-h	9cd608efb3	docs: Standardize OpenAI Docs (#25280 ) - Description: Standardize OpenAI Docs - Issue: the issue #24803 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-11 20:20:16 +00:00
Bagatur	fd546196ef	openai[patch]: Release 0.1.21 (#25269 )	2024-08-10 16:37:31 -07:00
Eugene Yurtsev	6dd9f053e3	core[patch]: Deprecating beta upsert APIs in vectorstore (#25069 ) This PR deprecates the beta upsert APIs in vectorstore. We'll introduce them in a V2 abstraction instead to keep the existing vectorstore implementations lighter weight. The main problem with the existing APIs is that it's a bit more challenging to implement the correct behavior w/ respect to IDs since ID can be present in both the function signature and as an optional attribute on the document object. But VectorStores that pass the standard tests should have implemented the semantics properly! --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-09 17:17:36 -04:00
Bagatur	ca9dcee940	standard-tests[patch]: test ToolMessage.status="error" (#25210 )	2024-08-09 13:00:14 -07:00
Eugene Yurtsev	dadb6f1445	cli[patch]: Update integration template for embedding models (#25248 ) Update integration template for embedding models	2024-08-09 14:28:57 -04:00
Eugene Yurtsev	b6f0174bb9	community[patch],core[patch]: Update EdenaiTool root_validator and add unit test in core (#25233 ) This PR gets rid `root_validators(allow_reuse=True)` logic used in EdenAI Tool in preparation for pydantic 2 upgrade. - add another test to secret_from_env_factory	2024-08-09 15:59:27 +00:00
blueoom	c3ced4c6ce	core[patch]: use time.monotonic() instead time.time() in InMemoryRateLimiter Description: The get time point method in the _consume() method of core.rate_limiters.InMemoryRateLimiter uses time.time(), which can be affected by system time backwards. Therefore, it is recommended to use the monotonically increasing monotonic() to obtain the time ```python with self._consume_lock: now = time.time() # time.time() -> time.monotonic() # initialize on first call to avoid a burst if self.last is None: self.last = now elapsed = now - self.last # when use time.time(), elapsed may be negative when system time backwards ```	2024-08-09 11:31:20 -04:00
Eugene Yurtsev	bd6c31617e	community[patch]: Remove more @allow_reuse=True validators (#25236 ) Remove some additional allow_reuse=True usage in @root_validators.	2024-08-09 11:10:27 -04:00
Eugene Yurtsev	6e57aa7c36	community[patch]: Remove usage of @root_validator(allow_reuse=True) (#25235 ) Remove usage of @root_validator(allow_reuse=True)	2024-08-09 10:57:42 -04:00
thiswillbeyourgithub	a2b4c33bd6	community[patch]: FAISS: ValueError mentions normalize_score_fn isntead of relevance_score_fn (#25225 ) Thank you for contributing to LangChain! - [X] PR title: "community: fix valueerror mentions wrong argument missing" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [X] PR message: *Delete this entire checklist* and replace with - Description: when faiss.py has a None relevance_score_fn it raises a ValueError that says a normalize_fn_score argument is needed. Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-09 14:40:29 +00:00
ccurme	4825dc0d76	langchain[patch]: add deprecations (#24792 )	2024-08-09 10:34:43 -04:00
ccurme	02300471be	langchain[patch]: extended-tests: drop logprobs from OAI expected config (#25234 ) Following https://github.com/langchain-ai/langchain/pull/25229	2024-08-09 14:23:11 +00:00
Shivendra Soni	66b7206ab6	community: Add llm-extraction option to FireCrawl Document Loader (#25231 ) Description: This minor PR aims to add `llm_extraction` to Firecrawl loader. This feature is supported on API and PythonSDK, but the langchain loader omits adding this to the response. Twitter handle: [scalable_pizza](https://x.com/scalablepizza) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-09 13:59:10 +00:00
blaufink	c81c77b465	partners: fix of issue #24880 (#25229 ) - Description: As described in the related issue: There is an error occuring when using langchain-openai>=0.1.17 which can be attributed to the following PR: #23691 Here, the parameter logprobs is added to requests per default. However, AzureOpenAI takes issue with this parameter as stated here: https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/chatgpt?tabs=python-new&pivots=programming-language-chat-completions -> "If you set any of these parameters, you get an error." Therefore, this PR changes the default value of logprobs parameter to None instead of False. This results in it being filtered before the request is sent. - Issue: #24880 - Dependencies: / Co-authored-by: blaufink <sebastian.brueckner@outlook.de>	2024-08-09 13:21:37 +00:00
ccurme	3b7437d184	docs: update integration api refs (#25195 ) - [x] toolkits - [x] retrievers (in this repo)	2024-08-09 12:27:32 +00:00
Bagatur	7040013140	core[patch]: fix deprecation pydantic bug (#25204 ) #25004 is incompatible with pydantic < 1.10.17. Introduces fix for this.	2024-08-08 16:39:38 -07:00
Isaac Francisco	dc7423e88f	[docs]: standardizing document loader integration pages (#25002 )	2024-08-08 16:33:09 -07:00
Casey Clements	25f2e25be1	partners[patch]: Mongodb Retrievers - CI final touches. (#25202 ) ## Description Contains 2 updates to for integration tests to run on langchain's CI. Addendum to #25057 to get release github action to succeed.	2024-08-08 15:38:31 -07:00
Eugene Yurtsev	429a0ee7fd	core[minor]: Add factory for looking up secrets from the env (#25198 ) Add factory method for looking secrets from the env.	2024-08-08 16:41:58 -04:00
Erick Friis	da9281feb2	cli: release 0.0.29 (#25196 )	2024-08-08 12:52:49 -07:00
Erick Friis	c6ece6a96d	core: autodetect more ls params (#25044 ) Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-08 12:44:21 -07:00
Eugene Yurtsev	86355640c3	experimental[patch]: Use get_fields adapter (#25193 ) Change all usages of __fields__ with get_fields adapter merged into langchain_core. Code mod generated using the following grit pattern: ``` engine marzano(0.1) language python `$X.__fields__` => `get_fields($X)` where { add_import(source="langchain_core.utils.pydantic", name="get_fields") } ```	2024-08-08 15:10:11 -04:00
Eugene Yurtsev	b9f65e5038	experimental[patch]: Migrate pydantic extra to literals (#25194 ) Migrate pydantic extra to literals Upgrade to using a literal for specifying the extra which is the recommended approach in pydantic 2. This works correctly also in pydantic v1. ```python from pydantic.v1 import BaseModel class Foo(BaseModel, extra="forbid"): x: int Foo(x=5, y=1) ``` And ```python from pydantic.v1 import BaseModel class Foo(BaseModel): x: int class Config: extra = "forbid" Foo(x=5, y=1) ``` ## Enum -> literal using grit pattern: ``` engine marzano(0.1) language python or { `extra=Extra.allow` => `extra="allow"`, `extra=Extra.forbid` => `extra="forbid"`, `extra=Extra.ignore` => `extra="ignore"` } ``` Resorted attributes in config and removed doc-string in case we will need to deal with going back and forth between pydantic v1 and v2 during the 0.3 release. (This will reduce merge conflicts.) ## Sort attributes in Config: ``` engine marzano(0.1) language python function sort($values) js { return $values.text.split(',').sort().join("\n"); } class_definition($name, $body) as $C where { $name <: `Config`, $body <: block($statements), $values = [], $statements <: some bubble($values) assignment() as $A where { $values += $A }, $body => sort($values), } ```	2024-08-08 19:05:54 +00:00
Eugene Yurtsev	30fb345342	core[minor]: Add from_env utility (#25189 ) Add a utility that can be used as a default factory The goal will be to start migrating from of the pydantic models to use `from_env` as a default factory if possible. ```python from pydantic import Field, BaseModel from langchain_core.utils import from_env class Foo(BaseModel): name: str = Field(default_factory=from_env('HELLO')) ```	2024-08-08 14:52:35 -04:00
Eugene Yurtsev	98779797fe	community[patch]: Use get_fields adapter for pydantic (#25191 ) Change all usages of __fields__ with get_fields adapter merged into langchain_core. Code mod generated using the following grit pattern: ``` engine marzano(0.1) language python `$X.__fields__` => `get_fields($X)` where { add_import(source="langchain_core.utils.pydantic", name="get_fields") } ```	2024-08-08 14:43:09 -04:00
Rajendra Kadam	663638d6a8	community[minor]: [SharePointLoader] Load extended metadata for the root folder (#24872 ) - Title: [SharePointLoader] Load extended metadata for the root folder - Description: - Ensure extended metadata loads correctly for the root folder. - Cleanup: Refactor SharePointLoader to remove unused fields(`file_id` & `site_id`). - Dependencies: NA - Add tests and docs: NA	2024-08-08 14:39:16 -04:00
Eugene Yurtsev	2f209d84fa	core[patch]: Add pydantic get_fields adapter (#25187 ) Add adapter to get fields	2024-08-08 17:47:42 +00:00
Eugene Yurtsev	c72e522e96	langchain[patch]: Upgrade pydantic extra (#25186 ) Upgrade to using a literal for specifying the extra which is the recommended approach in pydantic 2. This works correctly also in pydantic v1. ```python from pydantic.v1 import BaseModel class Foo(BaseModel, extra="forbid"): x: int Foo(x=5, y=1) ``` And ```python from pydantic.v1 import BaseModel class Foo(BaseModel): x: int class Config: extra = "forbid" Foo(x=5, y=1) ``` ## Enum -> literal using grit pattern: ``` engine marzano(0.1) language python or { `extra=Extra.allow` => `extra="allow"`, `extra=Extra.forbid` => `extra="forbid"`, `extra=Extra.ignore` => `extra="ignore"` } ``` Resorted attributes in config and removed doc-string in case we will need to deal with going back and forth between pydantic v1 and v2 during the 0.3 release. (This will reduce merge conflicts.) ## Sort attributes in Config: ``` engine marzano(0.1) language python function sort($values) js { return $values.text.split(',').sort().join("\n"); } class_definition($name, $body) as $C where { $name <: `Config`, $body <: block($statements), $values = [], $statements <: some bubble($values) assignment() as $A where { $values += $A }, $body => sort($values), } ```	2024-08-08 17:27:27 +00:00
Eugene Yurtsev	bf5193bb99	community[patch]: Upgrade pydantic extra (#25185 ) Upgrade to using a literal for specifying the extra which is the recommended approach in pydantic 2. This works correctly also in pydantic v1. ```python from pydantic.v1 import BaseModel class Foo(BaseModel, extra="forbid"): x: int Foo(x=5, y=1) ``` And ```python from pydantic.v1 import BaseModel class Foo(BaseModel): x: int class Config: extra = "forbid" Foo(x=5, y=1) ``` ## Enum -> literal using grit pattern: ``` engine marzano(0.1) language python or { `extra=Extra.allow` => `extra="allow"`, `extra=Extra.forbid` => `extra="forbid"`, `extra=Extra.ignore` => `extra="ignore"` } ``` Resorted attributes in config and removed doc-string in case we will need to deal with going back and forth between pydantic v1 and v2 during the 0.3 release. (This will reduce merge conflicts.) ## Sort attributes in Config: ``` engine marzano(0.1) language python function sort($values) js { return $values.text.split(',').sort().join("\n"); } class_definition($name, $body) as $C where { $name <: `Config`, $body <: block($statements), $values = [], $statements <: some bubble($values) assignment() as $A where { $values += $A }, $body => sort($values), } ```	2024-08-08 17:20:39 +00:00
Isaac Francisco	11adc09e02	[docs]: change rag reference in vector store pages (#25125 )	2024-08-08 10:08:14 -07:00
Eugene Yurtsev	425f6ffa5b	core[patch]: Fix aindex API (#25155 ) A previous PR accidentally broke the aindex API by renaming a positional argument vectorstore into vector_store. This PR reverts this change.	2024-08-08 12:08:18 -04:00
Isaac Francisco	15a36dd0a2	[docs]: combine tools and toolkits (#25158 )	2024-08-08 08:59:02 -07:00
ololand	249945a572	Update polygon.py for business subscription (#25085 ) For business subscription the status is STOCKSBUSINESS not OK Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-08 15:28:41 +00:00
ccurme	59b8850909	groq[patch]: update rate limit in integration tests (#25177 ) Divide by ~2 to account for testing python 3.8 and 3.12 in parallel.	2024-08-08 13:33:25 +00:00
ogawa	d895db11d6	community[patch]: gpt-4o-2024-08-06 costs (#25164 ) - Description: updated OpenAI cost definitions according to the following: - https://openai.com/api/pricing/ - Twitter handle: `@ogawa65a` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-08 13:22:11 +00:00
Brace Sproul	d77c7c4236	docs: Fix misspelling of instantiate in docs (#25107 )	2024-08-07 15:05:06 -07:00
Eugene Yurtsev	7b1a132aff	core[patch]: Add unit tests for Serializable (#25152 ) Add a few test cases for serializable (many other test cases already covered throguh runnable tests).	2024-08-07 21:01:36 +00:00
Bagatur	df99b832a7	core[patch]: support Field deprecation (#25004 ) ![Screenshot 2024-08-02 at 4 23 17 PM](https://github.com/user-attachments/assets/c757e093-877e-4af6-9dcd-984195454158)	2024-08-07 13:57:55 -07:00
ccurme	803eba3163	core[patch]: check for model_fields attribute (#25108 ) `__fields__` raises a warning in pydantic v2	2024-08-07 13:32:56 -07:00
Casey Clements	6e9a8b188f	mongodb: Add Hybrid and Full-Text Search Retrievers, release 0.2.0 (#25057 ) ## Description This pull-request extends the existing vector search strategies of MongoDBAtlasVectorSearch to include Hybrid (Reciprocal Rank Fusion) and Full-text via new Retrievers. There is a small breaking change in the form of the `prefilter` kwarg to search. For this, and because we have now added a great deal of features, including programmatic Index creation/deletion since 0.1.0, we plan to bump the version to 0.2.0. ### Checklist * Unit tests have been extended * formatting has been applied * One mypy error remains which will either go away in CI or be simplified. --------- Signed-off-by: Casey Clements <casey.clements@mongodb.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-07 20:10:29 +00:00
Bagatur	0b4608f71e	infra: temp skip oai embeddings test (#25148 )	2024-08-07 17:51:39 +00:00
Bagatur	a4086119f8	openai[patch]: Release 0.1.21rc2 (#25146 )	2024-08-07 16:59:15 +00:00
Bagatur	b4c12346cc	core[patch]: Release 0.2.29 (#25126 )	2024-08-07 09:50:20 -07:00
Erick Friis	dff83cce66	core[patch]: base language model disable_streaming (#25070 ) Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-08-07 09:26:21 -07:00
Bagatur	09fbce13c5	openai[patch]: ChatOpenAI.with_structured_output json_schema support (#25123 )	2024-08-07 08:09:07 -07:00
maang-h	0ba125c3cd	docs: Standardize QianfanLLMEndpoint LLM (#25139 ) - Description: Standardize QianfanLLMEndpoint LLM，include: - docs, the issue #24803 - model init arg names, the issue #20085	2024-08-07 10:57:27 -04:00
Eugene Yurtsev	28e0958ff4	core[patch]: Relax rate limit unit tests in terms of timing (#25140 ) Relax rate limit unit tests	2024-08-07 14:04:58 +00:00
Pat Patterson	7e7fcf5b1f	community: Fix ValidationError on creating GPT4AllEmbeddings with no gpt4all_kwargs (#25124 ) - Description: Instantiating `GPT4AllEmbeddings` with no `gpt4all_kwargs` argument raised a `ValidationError`. Root cause: #21238 added the capability to pass `gpt4all_kwargs` through to the `GPT4All` instance via `Embed4All`, but broke code that did not specify a `gpt4all_kwargs` argument. - Issue: #25119 - Dependencies: None - Twitter handle: [`@metadaddy`](https://twitter.com/metadaddy)	2024-08-07 13:34:01 +00:00
ZhangShenao	63d84e93b9	patch[doc] Fix word spelling error (#25128 ) Fix word spelling error	2024-08-07 09:16:17 -04:00
Eugene Yurtsev	4d28c70000	core[patch]: Sort Config attributes (#25127 ) This PR does an aesthetic sort of the config object attributes. This will make it a bit easier to go back and forth between pydantic v1 and pydantic v2 on the 0.3.x branch	2024-08-07 02:53:50 +00:00
Erick Friis	46a47710b0	partners/milvus: release 0.1.4 (#25058 )	2024-08-06 16:29:29 -07:00
Erick Friis	35ebd2620c	infra,cli: template matching registration (#25110 )	2024-08-06 15:29:55 -07:00
ccurme	23c9aba575	groq[patch]: allow warnings during tests (#25105 ) Among integration packages in libs/partners, Groq is an exception in that it errors on warnings. Following https://github.com/langchain-ai/langchain/pull/25084, Groq fails with > pydantic.warnings.PydanticDeprecatedSince20: The `__fields__` attribute is deprecated, use `model_fields` instead. Deprecated in Pydantic V2.0 to be removed in V3.0. Here we update the behavior to no longer fail on warning, which is consistent with the rest of the packages in libs/partners.	2024-08-06 18:02:20 -04:00
Bagatur	7882d5c978	openai[patch]: Release 0.1.21rc1 (#25116 )	2024-08-06 21:50:36 +00:00
Bagatur	70677202c7	core[patch]: Release 0.2.29rc1 (#25115 )	2024-08-06 21:36:56 +00:00
Bagatur	78403a3746	core[patch], openai[patch]: enable strict tool calling (#25111 ) Introduced https://openai.com/index/introducing-structured-outputs-in-the-api/	2024-08-06 21:21:06 +00:00
Eugene Yurtsev	d283f452cc	core[minor]: Add support for DocumentIndex in the index api (#25100 ) Support document index in the index api.	2024-08-06 12:30:49 -07:00
Virat Singh	264ab96980	community: Add stock market tools from financialdatasets.ai (#25025 ) Description: In this PR, I am adding three stock market tools from financialdatasets.ai (my API!): - get balance sheets - get cash flow statements - get income statements Twitter handle: [@virattt](https://twitter.com/virattt) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-06 18:28:12 +00:00
William FH	267855b3c1	Set Context in RunnableSequence & RunnableParallel (#25073 )	2024-08-06 11:10:37 -07:00
Naval Chand	71c0698ee4	Added bedrock 3-5 sonnet cost detials for BedrockAnthropicTokenUsageCallbackHandler (#25104 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Example: "community: Added bedrock 3-5 sonnet cost detials for BedrockAnthropicTokenUsageCallbackHandler" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Co-authored-by: Naval Chand <navalchand@192.168.1.36>	2024-08-06 17:28:47 +00:00
Isaac Francisco	a72fddbf8d	[docs]: vector store integration pages (#24858 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-06 17:20:27 +00:00
Bagatur	2c798622cd	docs: runnable docstring space (#25106 )	2024-08-06 16:46:50 +00:00
maang-h	1028af17e7	docs: Standardize Tongyi (#25103 ) - Description: Standardize Tongyi LLM，include: - docs, the issue #24803 - model init arg names, the issue #20085	2024-08-06 11:44:12 -04:00
Dobiichi-Origami	061ed250f6	delete the default model value from langchain and discard the need fo… (#24915 ) - description: I remove the limitation of mandatory existence of `QIANFAN_AK` and default model name which langchain uses cause there is already a default model nama underlying `qianfan` SDK powering langchain component. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-06 14:11:05 +00:00
Eugene Yurtsev	293a4a78de	core[patch]: Include dependencies in sys_info (#25076 ) `python -m langchain_core.sys_info` ```bash System Information ------------------ > OS: Linux > OS Version: #44~22.04.1-Ubuntu SMP PREEMPT_DYNAMIC Tue Jun 18 14:36:16 UTC 2 > Python Version: 3.11.4 (main, Sep 25 2023, 10:06:23) [GCC 11.4.0] Package Information ------------------- > langchain_core: 0.2.28 > langchain: 0.2.8 > langsmith: 0.1.85 > langchain_anthropic: 0.1.20 > langchain_openai: 0.1.20 > langchain_standard_tests: 0.1.1 > langchain_text_splitters: 0.2.2 > langgraph: 0.1.19 Optional packages not installed ------------------------------- > langserve Other Dependencies ------------------ > aiohttp: 3.9.5 > anthropic: 0.31.1 > async-timeout: Installed. No version info available. > defusedxml: 0.7.1 > httpx: 0.27.0 > jsonpatch: 1.33 > numpy: 1.26.4 > openai: 1.39.0 > orjson: 3.10.6 > packaging: 24.1 > pydantic: 2.8.2 > pytest: 7.4.4 > PyYAML: 6.0.1 > requests: 2.32.3 > SQLAlchemy: 2.0.31 > tenacity: 8.5.0 > tiktoken: 0.7.0 > typing-extensions: 4.12.2 ```	2024-08-06 09:57:39 -04:00
Dominik Fladung	ffa0c838d8	Allow ConfluenceLoader authorization via Personal Access Tokens (#25096 ) - community: Allow authorization to Confluence with bearer token - Description: Allow authorization to Confluence with [Personal Access Token](https://confluence.atlassian.com/enterprise/using-personal-access-tokens-1026032365.html) by checking for the keys `['client_id', token: ['access_token', 'token_type']]` - Issue: Currently the following error occurs when using an personal access token for authorization. ```python loader = ConfluenceLoader( url=os.getenv('CONFLUENCE_URL'), oauth2={ 'token': {"access_token": os.getenv("CONFLUENCE_ACCESS_TOKEN"), "token_type": "bearer"}, 'client_id': 'client_id', }, page_ids=['12345678'], ) ``` ``` ValueError: Error(s) while validating input: ["You have either omitted require keys or added extra keys to the oauth2 dictionary. key values should be `['access_token', 'access_token_secret', 'consumer_key', 'key_cert']`"] ``` With this PR the loader runs as expected. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-06 13:42:47 +00:00
orkhank	111c7df117	docs: update numbering of items in method docs (#25093 ) Some methods' doc strings have a wrong numbering of items. The numbers were adjusted accordingly	2024-08-06 09:21:52 -04:00
Bagatur	6eb42c657e	core[patch]: Remove default BaseModel init docstring (#25009 ) Currently a default init docstring gets appended to the class docstring of every BaseModel inherited object. This removes the default init docstring. ![Screenshot 2024-08-02 at 5 09 55 PM](https://github.com/user-attachments/assets/757fe4ae-a793-4e7d-8354-512de2c06818)	2024-08-06 01:04:04 +00:00
Gram Liu	88a9a6a758	core[patch]: Add pydantic metadata to subset model (#25032 ) - Description: This includes Pydantic field metadata in `_create_subset_model_v2` so that it gets included in the final serialized form that get sent out. - Issue: #25031 - Dependencies: n/a - Twitter handle: @gramliu --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-08-05 17:57:39 -07:00
BhujayKumarBhatta	8f33fce871	docs: change for optional variables in chatprompt (#25017 ) Fixes #24884	2024-08-05 23:57:44 +00:00
Bagatur	e572521f2a	core[patch]: exclude special pydantic init params (#25084 )	2024-08-05 23:32:51 +00:00
Isaac Francisco	63ddf0afb4	ollama: allow base_url, headers, and auth to be passed (#25078 )	2024-08-05 15:39:36 -07:00
Eugene Yurtsev	4bcd2aad6c	core[patch]: Relax time constraints on rate limit test (#25071 ) Try to keep the unit test fast, but also have it repeat more robustly	2024-08-05 17:04:22 -04:00
jigsawlabs-student	427a04151c	community: fix neo4j from_existing_graph (#24912 ) Fixes Neo4JVector.from_existing_graph integration with huggingface Previously threw an error with existing databases, because from_existing_graph query returns empty list of new nodes, which are then passed to embedding function, and huggingface errors with empty list. Fixes [24401](https://github.com/langchain-ai/langchain/issues/24401) --------- Co-authored-by: Jeff Katzy <jeffreyerickatz@gmail.com>	2024-08-05 21:01:46 +00:00
Tomaz Bratanic	d166967003	experimental: Add gliner graph transformer (#25066 ) You can use this with: ``` from langchain_experimental.graph_transformers import GlinerGraphTransformer gliner = GlinerGraphTransformer(allowed_nodes=["Person", "Organization", "Nobel"], allowed_relationships=["EMPLOYEE", "WON"]) from langchain_core.documents import Document text = """ Marie Curie, was a Polish and naturalised-French physicist and chemist who conducted pioneering research on radioactivity. She was the first woman to win a Nobel Prize, the first person to win a Nobel Prize twice, and the only person to win a Nobel Prize in two scientific fields. Her husband, Pierre Curie, was a co-winner of her first Nobel Prize, making them the first-ever married couple to win the Nobel Prize and launching the Curie family legacy of five Nobel Prizes. She was, in 1906, the first woman to become a professor at the University of Paris. """ documents = [Document(page_content=text)] gliner.convert_to_graph_documents(documents) ``` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-05 21:01:27 +00:00
Eugene Yurtsev	41dfad5104	core[minor]: Introduce DocumentIndex abstraction (#25062 ) This PR adds a minimal document indexer abstraction. The goal of this abstraction is to allow developers to create custom retrievers that also have a standard indexing API and allow updating the document content in them. The abstraction comes with a test suite that can verify that the indexer implements the correct semantics. This is an iteration over a previous PRs (https://github.com/langchain-ai/langchain/pull/24364). The main difference is that we're sub-classing from BaseRetriever in this iteration and as so have consolidated the sync and async interfaces. The main problem with the current design is that runt time search configuration has to be specified at init rather than provided at run time. We will likely resolve this issue in one of the two ways: (1) Define a method (`get_retriever`) that will allow creating a retriever at run time with a specific configuration.. If we do this, we will likely break the subclass on BaseRetriever (2) Generalize base retriever so it can support structured queries --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-05 18:06:33 +00:00
Stuart Marsh	16bd0697dc	milvus: fixed bug when using partition key and dynamic fields together (#25028 ) Description: This PR fixes a bug where if `enable_dynamic_field` and `partition_key_field` are enabled at the same time, a pymilvus error occurs. Milvus requires the partition key field to be a full schema defined field, and not a dynamic one, so it will throw the error "the specified partition key field {field} not exist" when creating the collection. When `enabled_dynamic_field` is set to `True`, all schema field creation based on `metadatas` is skipped. This code now checks if `partition_key_field` is set, and creates the field. Integration test added. Twitter handle: StuartMarshUK --------- Co-authored-by: Stuart Marsh <stuart.marsh@qumata.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-05 16:01:55 +00:00
Jim Baldwin	6890daa90c	community: make AthenaLoader profile_name optional and fix type hint (#24958 ) - Description: This PR makes the AthenaLoader profile_name optional and fixes the type hint which says the type is `str` but it should be `str` or `None` as None is handled in the loader init. This is a minor problem but it just confused me when I was using the Athena Loader to why we had to use a Profile, as I want that for local but not production. - Issue: #24957 - Dependencies: None.	2024-08-05 14:28:58 +00:00
Alexey Lapin	335894893b	langchain: Make RetryWithErrorOutputParser.from_llm() create a correct retry chain (#25053 ) Description: RetryWithErrorOutputParser.from_llm() creates a retry chain that returns a Generation instance, when it should actually just return a string. This class was forgotten when fixing the issue in PR #24687	2024-08-05 14:21:27 +00:00
Dobiichi-Origami	c5cb52a3c6	community: fix issue of the existence of numeric object in `additional_kwargs` a… (#24863 ) - Description: A previous PR breaks the code from `baidu_qianfan_endpoint.py` which causes the malfunction of streaming	2024-08-05 10:15:55 -04:00
ZhangShenao	cda79dbb6c	community[patch]: Optimize test case for `MoonshotChat` (#25050 ) Optimize test case for `MoonshotChat`. Use standard ChatModelIntegrationTests.	2024-08-05 10:11:25 -04:00
ZhangShenao	02c35da445	doc[Retriever] Enhance api docs for `MultiQueryRetriever` (#25035 ) Enhance api docs for `MultiQueryRetriever`: - Complete missing parameters - Unify parameter name	2024-08-04 13:56:38 -04:00
Alex Sherstinsky	208042e0f2	community: Fix Predibase Integration for HuggingFace-hosted fine-tuned adapters (#25015 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-03 14:05:43 -07:00
maang-h	f5da0d6d87	docs: Standardize MiniMaxEmbeddings (#24983 ) - Description: Standardize MiniMaxEmbeddings - docs, the issue #24856 - model init arg names, the issue #20085	2024-08-03 14:01:23 -04:00
ZhangShenao	2c3e3dc6b1	patch[Partners] Unified fix of incorrect variable declarations in all check_imports (#25014 ) There are some incorrect declarations of variable `has_failure` in check_imports. The purpose of this PR is to uniformly fix these errors.	2024-08-03 13:49:41 -04:00
maang-h	7de62abc91	docs: Standardize SparkLLMTextEmbeddings docstrings (#25021 ) - Description: Standardize SparkLLMTextEmbeddings docstrings - Issue: the issue #24856	2024-08-03 13:44:09 -04:00
Tomaz Bratanic	f9a11a9197	Add relik transformer config (#25019 )	2024-08-03 08:41:45 -04:00
Bagatur	1dcee68cb8	docs: show beta directive (#25013 ) ![Screenshot 2024-08-02 at 7 15 34 PM](https://github.com/user-attachments/assets/086831c7-36f3-4962-98dc-d707b6289747)	2024-08-03 03:07:45 +00:00
Bagatur	e81ddb32a6	docs: fix kwargs docstring (#25010 ) Fix: ![Screenshot 2024-08-02 at 5 33 37 PM](https://github.com/user-attachments/assets/7c56cdeb-ee81-454c-b3eb-86aa8a9bdc8d)	2024-08-02 19:54:54 -07:00
Bagatur	57747892ce	docs: show deprecation warning first in api ref (#25001 ) OLD ![Screenshot 2024-08-02 at 3 29 39 PM](https://github.com/user-attachments/assets/7f169121-1202-4770-a006-d72ac7a1aa33) NEW ![Screenshot 2024-08-02 at 3 29 45 PM](https://github.com/user-attachments/assets/9cc07cbd-2ae9-4077-95c5-03cb051e6cd7)	2024-08-02 17:35:25 -07:00
Isaac Francisco	73570873ab	docs: standardizing tavily tool docs (#24736 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-02 22:25:27 +00:00
Bagatur	4305f78e40	core[patch]: Release 0.2.28 (#25000 )	2024-08-02 21:07:06 +00:00
Bagatur	dd8e4cd020	text-splitters[patch]: Release 0.2.3 (#24998 )	2024-08-02 20:27:22 +00:00
Bagatur	0de0cd2d31	core[patch]: merge message runs nit (#24997 ) Only add separator if both chunks are non-empty	2024-08-02 20:25:43 +00:00
Bagatur	8e2316b8c2	community[patch]: Release 0.2.11 (#24989 )	2024-08-02 20:08:44 +00:00
ccurme	c2538e7834	experimental[patch]: bump min versions of core and community (#24996 ) Ollama functions unit test broken with min version of community.	2024-08-02 19:58:55 +00:00
ccurme	22c1a4041b	community[patch]: support named arguments in github toolkit (#24986 ) Parameters may be passed in by name if generated from tool calls.	2024-08-02 18:27:32 +00:00
ccurme	4797b806c2	experimental[patch]: release 0.0.64 (#24990 )	2024-08-02 18:00:57 +00:00
Tomaz Bratanic	7061869aec	Add relik graph transformer (#24982 ) Relik is a new library for graph extraction that offers smaller and cheaper models for graph construction	2024-08-02 13:55:41 -04:00
ccurme	c04d95b962	standard-tests: set integration test parameters independent of unit test (#24979 ) This ends up getting set in integration tests.	2024-08-02 10:40:11 -07:00
gbaian10	54e9ea433a	fix: Modify the order of init_chat_model import ollama package. (#24977 )	2024-08-02 08:32:56 -07:00
ZhangShenao	71c0564c9f	community[patch]: Add test case for MoonshotChat (#24960 ) Add test case for `MoonshotChat`.	2024-08-02 09:37:31 -04:00
ZhangShenao	c65e48996c	patch[partners] Fix check_imports bugs in pinecone and milvus (#24971 ) Fix wrong declared variables of `check_imports` in pinecone and milvus	2024-08-02 09:27:11 -04:00
Isaac Francisco	d7688a4328	community[patch]: adding artifact to Tavily search (#24376 ) This allows you to get raw content as well as the answer, instead of just getting the results. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-01 21:12:11 -07:00
Bagatur	7b08de8909	langchain[patch]: Release 0.2.12 (#24954 )	2024-08-02 04:04:49 +00:00
Bagatur	245cb5a252	core[patch]: Release 0.2.27 (#24952 )	2024-08-02 01:43:24 +00:00
Bagatur	199e9c5ae0	core[patch]: Fix tool args schema inherited field parsing (#24936 ) Fix #24925	2024-08-01 18:36:33 -07:00
Leonid Ganeline	4092876863	core: docstrings `BaseCallbackHandler update (#24948 ) Added missed docstrings	2024-08-01 20:46:53 -04:00
WU LIFU	ad16eed119	core[patch]: runnable config ensure_config deep copy from var_child_runnable… (#24862 ) issue: #24660 RunnableWithMessageHistory.stream result in error because the [evaluation](https://github.com/langchain-ai/langchain/blob/master/libs/core/langchain_core/runnables/branch.py#L220) of the branch [condition](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L328C1-L329C1)`) unexpectedly trigger the "[on_end](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L332)`)" (exit_history) callback of the default branch descriptions After a lot of investigation I'm convinced that the root cause is that 1. during the execution of the runnable, the [var_child_runnable_config](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L122)`) is shared between the branch [condition](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L328C1-L329C1)`) runnable and the [default branch runnable](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L332)`) within the same context 2. when the default branch runnable runs, it gets the [var_child_runnable_config](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L163)`) and may unintentionally [add more handlers ](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L325)`)to the callback manager of this config 3. when it is again the turn for the [condition](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L328C1-L329C1)`) to run, it gets the `var_child_runnable_config` whose callback manager has the handlers added by the default branch. When it runs that handler (`exit_history`) it leads to the error with the assumption that, the `ensure_config` function actually does want to create a immutable copy from `var_child_runnable_config` because it starts with an [`empty` variable ](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L156)`), i go ahead to do a deepcopy to ensure that future modification to the returned value won't affect the `var_child_runnable_config` variable Having said that I actually 1. don't know if this is a proper fix 2. don't know whether it will lead to other unintended consequence 3. don't know why only "stream" runs into this issue while "invoke" runs without problem so @nfcampos @hwchase17 please help review, thanks! --------- Co-authored-by: Lifu Wu <lifu@nextbillion.ai> Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-01 17:30:32 -07:00
ccurme	9cb69a8746	docs: update retriever template, add arxiv retriever (#24947 )	2024-08-01 16:53:18 -04:00
Casey Clements	db3ceb4d0a	partners/mongodb: Improved search index commands (#24745 ) Hardens index commands with try/except for free clusters and optional waits for syncing and tests. [efriis](https://github.com/efriis) These are the upgrades to the search index commands (CRUD) that I mentioned. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-01 20:16:32 +00:00
ccurme	41ed23a050	docs: update retriever integration pages (#24931 )	2024-08-01 14:37:07 -04:00
maang-h	ea505985c4	docs: Standardize ZhipuAIEmbeddings docstrings (#24933 ) - Description: Standardize ZhipuAIEmbeddings rich docstrings. - Issue: the issue #24856	2024-08-01 14:06:53 -04:00
Anneli Samuel	2204d8cb7d	community[patch]: Invoke on_llm_new_token callback before yielding chunk (#24938 ) Description: Invoke on_llm_new_token callback before yielding chunk in streaming mode Issue: [#16913](https://github.com/langchain-ai/langchain/issues/16913)	2024-08-01 16:39:04 +00:00
Eugene Yurtsev	75776e4a54	core[patch]: In unit tests, use `_schema()` instead of BaseModel.schema() (#24930 ) This PR introduces a module with some helper utilities for the pydantic 1 -> 2 migration. They're meant to be used in the following way: 1) Use the utility code to get unit tests pass without requiring modification to the unit tests 2) (If desired) upgrade the unit tests to match pydantic 2 output 3) (If desired) stop using the utility code Currently, this module contains a way to map `schema()` generated by pydantic 2 to (mostly) match the output from pydantic v1.	2024-08-01 11:59:04 -04:00
Serena Ruan	1827bb4042	community[patch]: support bind_tools for ChatMlflow (#24547 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - Description: Support ChatMlflow.bind_tools method Tested in Databricks: <img width="836" alt="image" src="https://github.com/user-attachments/assets/fa28ef50-0110-4698-8eda-4faf6f0b9ef8"> - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Signed-off-by: Serena Ruan <serena.rxy@gmail.com>	2024-08-01 08:43:07 -07:00
BottlePumpkin	bfc59c1d26	community: Fix KeyError in NotionDB loader when 'name' is missing (#24224 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Description: This PR fixes a KeyError in NotionDBLoader when the "name" key is missing in the "people" property. Issue: Fixes #24223 Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-01 13:55:40 +00:00
alexqiao	8eb0bdead3	community[patch]: Invoke callback prior to yielding token (#24917 ) Description: Invoke callback prior to yielding token in stream method for chat_models . Issue: https://github.com/langchain-ai/langchain/issues/16913 #16913	2024-08-01 13:19:55 +00:00
ZhangShenao	b2dd9ffaaf	patch[cli] Fix bug in `check_imports.py` (#24918 ) The variable `has_failure` in check_imports.py is wrong-declared. It's actually an another variable.	2024-08-01 09:08:12 -04:00
Erick Friis	600fc233ef	partners/ollama: release 0.1.1 (#24910 )	2024-07-31 17:31:29 -07:00
Bagatur	25b93cc4c0	core[patch]: stringify tool non-content blocks (#24626 ) Slightly breaking bugfix. Shouldn't cause too many issues since no models would be able to handle non-content block ToolMessage.content anyways.	2024-07-31 16:42:38 -07:00
Nikita Pakunov	c776471ac6	community: fix AttributeError: 'YandexGPT' object has no attribute '_grpc_metadata' (#24432 ) Fixes #24049 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-31 21:18:33 +00:00
Bagatur	752a71b688	integrations[patch]: release model packages (#24900 )	2024-07-31 20:48:20 +00:00
Jacob Lee	1213a59f87	docs[patch]: Update kv store docs pages (#24848 )	2024-07-31 13:23:24 -07:00
Erick Friis	a7380dd531	cli: release 0.0.28 (#24852 )	2024-07-31 13:03:24 -07:00
Erick Friis	e98e4be0f7	cli: register new integration doc templates (#24854 ) - wait to merge for retriever.ipynb merge #24836	2024-07-31 13:03:05 -07:00
Eugene Yurtsev	210623b409	core[minor]: Add support for pydantic 2 to utility to get fields (#24899 ) Add compatibility for pydantic 2 for a utility function. This will help push some small changes to master, so they don't have to be kept track of on a separate branch.	2024-07-31 19:11:07 +00:00
Bagatur	7d1694040d	core[patch]: Release 0.2.26 (#24898 )	2024-07-31 19:00:50 +00:00
Eugene Yurtsev	add16111b9	community[patch]: Make the pydantic linter stricter (#24897 ) Stricter linting of deprecated pydantic features.	2024-07-31 18:57:37 +00:00
Eugene Yurtsev	a4a444f73d	community[patch]: Fix arcee llm usage of root_validator(pre=False) (#24896 ) Should be pre=True	2024-07-31 18:49:20 +00:00
Eugene Yurtsev	69c656aa5f	langchain[minor]: Upgrade ambiguous root_validator to @pre_init (#24895 ) The @pre_init validator is a temporary solution for base models. It has similar (but not identical) semantics to @root_validator(), but it works strictly as a pre-init validator. It'll work as expected as long as the pydantic model type hints were correct.	2024-07-31 18:46:47 +00:00
Eugene Yurtsev	5099a9c9b4	core[patch]: Update unit tests with a workaround for using AnyID in pydantic 2 (#24892 ) Pydantic 2 ignores __eq__ overload for subclasses of strings.	2024-07-31 14:42:12 -04:00
Bagatur	8461934c2b	core[patch], integrations[patch]: convert TypedDict to tool schema support (#24641 ) supports following UX ```python class SubTool(TypedDict): """Subtool docstring""" args: Annotated[Dict[str, Any], {}, "this does bar"] class Tool(TypedDict): """Docstring Args: arg1: foo """ arg1: str arg2: Union[int, str] arg3: Optional[List[SubTool]] arg4: Annotated[Literal["bar", "baz"], ..., "this does foo"] arg5: Annotated[Optional[float], None] ``` - can parse google style docstring - can use Annotated to specify default value (second arg) - can use Annotated to specify arg description (third arg) - can have nested complex types	2024-07-31 18:27:24 +00:00
Eugene Yurtsev	d24b82357f	community[patch]: Add missing annotations (#24890 ) This PR adds annotations in comunity package. Annotations are only strictly needed in subclasses of BaseModel for pydantic 2 compatibility. This PR adds some unnecessary annotations, but they're not bad to have regardless for documentation pages.	2024-07-31 18:13:44 +00:00
Eugene Yurtsev	7720483432	langchain[patch]: Update unit tests to workaround a pydantic 2 issue (#24886 ) This will allow our unit tests to pass when using AnyID() with our pydantic models.	2024-07-31 14:09:40 -04:00
Eugene Yurtsev	2019e31bc5	langchain[patch]: Add missing type annotations (#24889 ) Adds missing type annotations in preparation for pydantic 2 upgrade.	2024-07-31 14:09:22 -04:00
ccurme	30f18c7b02	docs: add retriever integrations template (#24836 )	2024-07-31 13:50:44 -04:00
Anirudh31415926535	4da3d4b18e	docs: Minor corrections and updates to Cohere docs (#22726 ) - Description: Update the Cohere's provider and RagRetriever documentations with latest updates. - Twitter handle: Anirudh1810	2024-07-31 10:16:26 -07:00
Nishan Jain	b00c0fc558	[Community][minor]: Added prompt governance in pebblo_retrieval (#24874 ) Title: [pebblo_retrieval] Identifying entities in prompts given in PebbloRetrievalQA leading to prompt governance Description: Implemented identification of entities in the prompt using Pebblo prompt governance API. Issue: NA Dependencies: NA Add tests and docs: NA	2024-07-31 13:14:51 +00:00
Rajendra Kadam	a6add89bd4	community[minor]: [PebbloSafeLoader] Implement content-size-based batching (#24871 ) - Title: [PebbloSafeLoader] Implement content-size-based batching in the classification flow(loader/doc API) - Description: - Implemented content-size-based batching in the loader/doc API, set to 100KB with no external configuration option, intentionally hard-coded to prevent timeouts. - Remove unused field(pb_id) from doc_metadata - Issue: NA - Dependencies: NA - Add tests and docs: Updated	2024-07-31 09:10:28 -04:00
TrumanYan	096b66db4a	community: replace it with Tencent Cloud SDK (#24172 ) Description: The old method will be discontinued; use the official SDK for more model options. Issue: None Dependencies: None Twitter handle: None Co-authored-by: trumanyan <trumanyan@tencent.com>	2024-07-31 09:05:38 -04:00
Erick Friis	99eb31ec41	cli: embed docstring template (#24855 )	2024-07-31 02:16:40 +00:00
Erick Friis	3999e9035c	cli/docs: embedding template standardization (#24849 )	2024-07-30 18:54:03 -07:00
Erick Friis	1f5444817a	community: deprecate BedrockEmbeddings in favor of langchain-aws (#24846 )	2024-07-30 23:13:17 +00:00
Jacob Lee	21eb4c9e5d	docs[patch]: Adds first kv store doc matching new template (#24844 )	2024-07-30 15:58:51 -07:00
Erick Friis	b099cc3507	cli: release 0.0.27 (#24842 )	2024-07-30 22:07:50 +00:00
Bagatur	419f2c2585	cli[patch]: tool integration templates (#24837 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-30 14:59:33 -07:00
Erick Friis	88418af3f5	core: release 0.2.25 (#24833 )	2024-07-30 18:41:09 +00:00
Bagatur	37b060112a	langchain[patch]: fix ollama in init_chat_model (#24832 )	2024-07-30 18:38:53 +00:00
Jerron Lim	d8f3ea82db	langchain[patch]: init_chat_model() to import ChatOllama from langchain-ollama and fallback on langchain-community (#24821 ) Description: init_chat_model() should import ChatOllama from `langchain-ollama`. If that fails, fallback to `langchain-community`	2024-07-30 11:16:10 -07:00
Isaac Francisco	511242280b	[docs]: standardize vectorstores (#24797 )	2024-07-30 10:38:04 -07:00
Jacob Lee	ac649800df	docs[patch]: Adds kv store integration docs template (#24804 )	2024-07-30 10:07:57 -07:00
cffranco94	b01d938997	experimental: Add config to convert_to_graph_documents (#24012 ) PR title: Experimental: Add config to convert_to_graph_documents Description: In order to use langfuse, i need to pass the langfuse configuration when invoking the chain. langchain_experimental does not allow to add any parameters (beside the documents) to the convert_to_graph_documents method. This way, I cannot monitor the chain in langfuse. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Catarina Franco <catarina.franco@criticalsoftware.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-30 17:01:06 +00:00
Shailendra Mishra	f2d810b3c0	clob_bugfix... (#24813 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-30 12:44:04 -04:00
Anush	51b15448cc	community: Fix FastEmbedEmbeddings (#24462 ) ## Description This PR: - Fixes the validation error in `FastEmbedEmbeddings`. - Adds support for `batch_size`, `parallel` params. - Removes support for very old FastEmbed versions. - Updates the FastEmbed doc with the new params. Associated Issues: - Resolves #24039 - Resolves #https://github.com/qdrant/fastembed/issues/296	2024-07-30 12:42:46 -04:00
ccurme	73ec24fc56	docs[patch]: add toolkit template (#24791 )	2024-07-30 12:36:09 -04:00
Tamir Zitman	b3e1378f2b	langchain : text_splitters Added PowerShell (#24582 ) - Description: Added PowerShell support for text splitters language include docs relevant update - Issue: None - Dependencies: None --------- Co-authored-by: tzitman <tamir.zitman@intel.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-30 16:13:52 +00:00
Nuno Campos	68ecebf1ec	core: Fix implementation of trim_first_node/trim_last_node to use exact same definition of first/last node as in the getter methods (#24802 )	2024-07-30 08:44:27 -07:00
Igor Drozdov	c2706cfb9e	feat(community): add tools support for litellm (#23906 ) I used the following example to validate the behavior ```python from langchain_core.prompts import ChatPromptTemplate from langchain_core.runnables import ConfigurableField from langchain_anthropic import ChatAnthropic from langchain_community.chat_models import ChatLiteLLM from langchain_core.tools import tool from langchain.agents import create_tool_calling_agent, AgentExecutor @tool def multiply(x: float, y: float) -> float: """Multiply 'x' times 'y'.""" return x * y @tool def exponentiate(x: float, y: float) -> float: """Raise 'x' to the 'y'.""" return x**y @tool def add(x: float, y: float) -> float: """Add 'x' and 'y'.""" return x + y prompt = ChatPromptTemplate.from_messages([ ("system", "you're a helpful assistant"), ("human", "{input}"), ("placeholder", "{agent_scratchpad}"), ]) tools = [multiply, exponentiate, add] llm = ChatAnthropic(model="claude-3-sonnet-20240229", temperature=0) # llm = ChatLiteLLM(model="claude-3-sonnet-20240229", temperature=0) agent = create_tool_calling_agent(llm, tools, prompt) agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True) agent_executor.invoke({"input": "what's 3 plus 5 raised to the 2.743. also what's 17.24 - 918.1241", }) ``` `ChatAnthropic` version works: ``` > Entering new AgentExecutor chain... Invoking: `exponentiate` with `{'x': 5, 'y': 2.743}` responded: [{'text': 'To calculate 3 + 5^2.743, we can use the "exponentiate" and "add" tools:', 'type': 'text', 'index': 0}, {'id': 'toolu_01Gf54DFTkfLMJQX3TXffmxe', 'input': {}, 'name': 'exponentiate', 'type': 'tool_use', 'index': 1, 'partial_json': '{"x": 5, "y": 2.743}'}] 82.65606421491815 Invoking: `add` with `{'x': 3, 'y': 82.65606421491815}` responded: [{'id': 'toolu_01XUq9S56GT3Yv2N1KmNmmWp', 'input': {}, 'name': 'add', 'type': 'tool_use', 'index': 0, 'partial_json': '{"x": 3, "y": 82.65606421491815}'}] 85.65606421491815 Invoking: `add` with `{'x': 17.24, 'y': -918.1241}` responded: [{'text': '\n\nSo 3 + 5^2.743 = 85.66\n\nTo calculate 17.24 - 918.1241, we can use:', 'type': 'text', 'index': 0}, {'id': 'toolu_01BkXTwP7ec9JKYtZPy5JKjm', 'input': {}, 'name': 'add', 'type': 'tool_use', 'index': 1, 'partial_json': '{"x": 17.24, "y": -918.1241}'}] -900.8841[{'text': '\n\nTherefore, 17.24 - 918.1241 = -900.88', 'type': 'text', 'index': 0}] > Finished chain. ``` While `ChatLiteLLM` version doesn't. But with the changes in this PR, along with: - https://github.com/langchain-ai/langchain/pull/23823 - https://github.com/BerriAI/litellm/pull/4554 The result is _almost_ the same: ``` > Entering new AgentExecutor chain... Invoking: `exponentiate` with `{'x': 5, 'y': 2.743}` responded: To calculate 3 + 5^2.743, we can use the "exponentiate" and "add" tools: 82.65606421491815 Invoking: `add` with `{'x': 3, 'y': 82.65606421491815}` 85.65606421491815 Invoking: `add` with `{'x': 17.24, 'y': -918.1241}` responded: So 3 + 5^2.743 = 85.66 To calculate 17.24 - 918.1241, we can use: -900.8841 Therefore, 17.24 - 918.1241 = -900.88 > Finished chain. ``` If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-30 15:39:34 +00:00
David Robertson	bfb7f8d40a	Brave Search: Enhance search result details with extra snippets (#19209 ) Description: This update significantly improves the Brave Search Tool's utility within the LangChain library by enriching the search results it returns. The tool previously returned title, link, and snippet, with the snippet being a truncated 140-character description from the search engine. To make the search results more informative, this update enables extra_snippets by default and introduces additional result fields: title, link, description (enhancing and renaming the former snippet field), age, and snippets. The snippets field provides a list of strings summarizing the webpage, utilizing Brave's capability for more detailed search insights. This enhancement aims to make the search tool far more informative and beneficial for users. Issue: N/A Dependencies: No additional dependencies introduced. Twitter handle: @davidalexr987 Code Changes Summary: - Changed the default setting to include extra_snippets in search results. - Renamed the snippet field to description to accurately reflect its content and included an age field for search results. - Introduced a snippets field that lists webpage summaries, providing users with comprehensive search result insights. Backward Compatibility Note: The renaming of snippet to description improves the accuracy of the returned data field but may impact existing users who have developed integration's or analyses based on the snippet field. I believe this change is essential for clarity and utility, and it aligns better with the data provided by Brave Search. Additional Notes: This proposal focuses exclusively on the Brave Search package, without affecting other LangChain packages or introducing new dependencies.	2024-07-30 15:29:38 +00:00
Ben Chambers	435771fe74	[community]: Fix package name mismatch (#24824 ) - Description: fix a mismatch in pypi package names	2024-07-30 11:21:39 -04:00
ccurme	b7bbfc7c67	langchain: revert "init_chat_model() to support ChatOllama from langchain-ollama" (#24819 ) Reverts langchain-ai/langchain#24818 Overlooked discussion in https://github.com/langchain-ai/langchain/pull/24801.	2024-07-30 14:23:36 +00:00
Jerron Lim	5abfc85fec	langchain: init_chat_model() to support ChatOllama from langchain-ollama (#24818 ) Description: Since moving away from `langchain-community` is recommended, `init_chat_models()` should import ChatOllama from `langchain-ollama` instead.	2024-07-30 10:17:38 -04:00
Erick Friis	cdaea17b3e	cli/docs: llm integration template standardization (#24795 )	2024-07-29 17:47:13 -07:00
Bagatur	a6d1fb4275	core[patch]: introduce ToolMessage.status (#24628 ) Anthropic models (including via Bedrock and other cloud platforms) accept a status/is_error attribute on tool messages/results (specifically in `tool_result` content blocks for Anthropic API). Adding a ToolMessage.status attribute so that users can set this attribute when using those models	2024-07-29 14:01:53 -07:00
Isaac Francisco	78d97b49d9	[partner]: ollama llm fix (#24790 )	2024-07-29 13:00:02 -07:00
maang-h	4bb1a11e02	community: Add MiniMaxChat bind_tools and structured output (#24310 ) - Description: - Add `bind_tools` method to support tool calling - Add `with_structured_output` method to support structured output	2024-07-29 15:51:52 -04:00
John	0a2ff40fcc	partners/unstructured: fix client api_url (#24680 ) Description: Add empty string default for api_key and change `server_url` to `url` to match existing loaders. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-29 11:16:41 -07:00
maang-h	bf685c242f	docs: Standardize QianfanEmbeddingsEndpoint (#24786 ) - Description: Standardize QianfanEmbeddingsEndpoint, include: - docstrings, the issue #21983 - model init arg names, the issue #20085	2024-07-29 13:19:24 -04:00
ccurme	9998e55936	core[patch]: support tool calls with non-pickleable args in tools (#24741 ) Deepcopy raises with non-pickleable args.	2024-07-29 13:18:39 -04:00
Erick Friis	df78608741	mongodb: bson optional import (#24685 )	2024-07-29 09:54:01 -07:00
M. Ali	c086410677	fix docs typos (#23668 ) Thank you for contributing to LangChain! - [x] PR title: "docs: fix multiple typos" Co-authored-by: mohblnk <mohamed.ali@blnk.ai> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-29 16:10:55 +00:00
Pere Pasamonte	98175860ad	community: Fix AWS DocumentDB similarity_search when filter is None (#24777 ) Description Fixes DocumentDBVectorSearch similarity_search when no filter is used; it defaults to None but $match does not accept None, so changed default to empty {} before pipeline is created. Issue AWS DocumentDB similarity search does not work when no filter is used. Error msg: "the match filter must be an expression in an object" #24775 Dependencies No dependencies Twitter handle https://x.com/perepasamonte --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-29 15:32:05 +00:00
Lennart J. Kurzweg	7da0597ecb	partners[ollama]: Support seed parameter for ChatOllama (#24782 ) ## Description Adds seed parameter to ChatOllama ## Resolves Issues - #24703 ## Dependency Changes None Co-authored-by: Lennart J. Kurzweg (Nx2) <git@nx2.site>	2024-07-29 15:15:20 +00:00
ccurme	e264ccf484	standard-tests[patch]: update groq and structured output test (#24781 ) - Mixtral with Groq has started consistently failing tool calling tests. Here we restrict testing to llama 3.1. - `.schema` is deprecated in pydantic proper in favor of `.model_json_schema`.	2024-07-29 11:10:01 -04:00
ZhangShenao	4a05679fdb	patch[experimental] Fix prompt in `GenerativeAgentMemory` (#24771 ) There is an issue with the prompt format in `GenerativeAgentMemory` , try to fix it. The prompt is same as the one in method `_score_memory_importance`.	2024-07-29 07:02:31 -04:00
WU LIFU	2ba8393182	graph_transformers: bug fix for create_simple_model not passing in ll… (#24643 ) issue: #24615 descriptions: The _Graph pydantic model generated from create_simple_model (which LLMGraphTransformer uses when allowed nodes and relationships are provided) does not constrain the relationships (source and target types, relationship type), and the node and relationship properties with enums when using ChatOpenAI. The issue is that when calling optional_enum_field throughout create_simple_model the llm_type parameter is not passed in except for when creating node type. Passing it into each call fixes the issue. Co-authored-by: Lifu Wu <lifu@nextbillion.ai>	2024-07-29 07:00:56 -04:00
William FH	01ab2918a2	core[patch]: Respect injected in bound fns (#24733 ) Since right now you cant use the nice injected arg syntas directly with model.bind_tools()	2024-07-28 15:45:19 -07:00
Pavel	7fcfe7c1f4	openai[patch]: openai proxy added to base embeddings (#24539 ) - [ ] PR title: "langchain-openai: openai proxy added to base embeddings" - [ ] PR message: - Description: Dear langchain developers, You've already supported proxy for ChatOpenAI implementation in your package. At the same time, if somebody needed to use proxy for chat, it also could be necessary to be able to use it for OpenAIEmbeddings. That's why I think it's important to add proxy support for OpenAI embeddings. That's what I've done in this PR. @baskaryan --------- Co-authored-by: karpov <karpov@dohod.ru> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-07-28 20:54:13 +00:00
Chuck Wooters	56c2a7f6d4	partners: add missing key name to Field() for ChatFireworks model (#24721 ) Description: In the `ChatFireworks` class definition, the Field() call for the "stop" ("stop_sequences") parameter is missing the "default" keyword. Issue: Type checker reports "stop_sequences" as a missing arg (not recognizing the default value is None) Dependencies: None Twitter handle: None	2024-07-28 18:40:21 +00:00
AmosDinh	c113682328	community:Add support for specifying document_loaders.firecrawl api url. (#24747 ) community:Add support for specifying document_loaders.firecrawl api url. Add support for specifying document_loaders.firecrawl api url. This is mainly to support the [self-hosting](https://github.com/mendableai/firecrawl/blob/main/SELF_HOST.md) option firecrawl provides. Eg. now I can specify localhost:.... The corresponding firecrawl class already provides functionality to pass the argument. See here: `4c9d62f6d3/apps/python-sdk/firecrawl/firecrawl.py (L29)` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-28 14:30:36 -04:00
Jerron Lim	df37c0d086	partners[ollama]: Support base_url for ChatOllama (#24719 ) Add a class attribute `base_url` for ChatOllama to allow users to choose a different URL to connect to. Fixes #24555	2024-07-28 14:25:58 -04:00
Bagatur	8964f8a710	core: use mypy<1.11 (#24749 ) Bug in mypy 1.11.0 blocking CI, see example: https://github.com/langchain-ai/langchain/actions/runs/10127096903/job/28004492692?pr=24641	2024-07-27 16:37:02 -07:00
Isaac Francisco	152427eca1	make image inputs compatible with langchain_ollama (#24619 )	2024-07-26 17:39:57 -07:00
William FH	0535d72927	Add type() in error msg (#24723 )	2024-07-26 16:48:45 -07:00
Eugene Yurtsev	9be6b5a20f	core[patch]: Correct doc-string for InMemoryRateLimiter (#24730 ) Correct the documentaiton string.	2024-07-26 22:17:22 +00:00
Cristi Burcă	174e7d2ab2	langchain: Make OutputFixingParser.from_llm() create a useable retry chain (#24687 ) Description: OutputFixingParser.from_llm() creates a retry chain that returns a Generation instance, when it should actually just return a string. Issue: https://github.com/langchain-ai/langchain/issues/24600 Twitter handle: scribu --------- Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-07-26 13:55:47 -07:00
Bagatur	b3a23ddf93	integration releases (#24725 ) Release anthropic, openai, groq, mistralai, robocorp	2024-07-26 12:30:10 -07:00
Bagatur	315223ce26	core[patch]: Release 0.2.24 (#24722 )	2024-07-26 18:55:32 +00:00
Haijian Wang	cda3025ee1	Integrating the Yi family of models. (#24491 ) Thank you for contributing to LangChain! - [x] PR title: "community:add Yi LLM", "docs:add Yi Documentation" - [x] PR message: *Delete this entire checklist* and replace with - Description: This PR adds support for the Yi model to LangChain. - Dependencies: [langchain_core,requests,contextlib,typing,logging,json,langchain_community] - Twitter handle: 01.AI - [x] Add tests and docs: I've added the corresponding documentation to the relevant paths --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-07-26 10:57:33 -07:00
Bagatur	ad7581751f	core[patch]: ChatPromptTemplate.init same as ChatPromptTemplate.from_… (#24486 )	2024-07-26 10:48:39 -07:00
Marc Gibbons	cc451effd1	community[patch]: langchain_community.vectorstores.azuresearch Raise LangChainException instead of bare Exception (#23935 ) Raise `LangChainException` instead of `Exception`. This alleviates the need for library users to use bare try/except to handle exceptions raised by `AzureSearch`. Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-26 15:59:06 +00:00
Eugene Yurtsev	3a5365a33e	ai21: apply rate limiter in integration tests (#24717 ) Apply rate limiter in integration tests	2024-07-26 11:15:36 -04:00
Eugene Yurtsev	03d62a737a	together: Add rate limiter to integration tests (#24714 ) Rate limit the integration tests to avoid getting 429s.	2024-07-26 10:59:33 -04:00
Diverrez morgan	c4d2a53f18	community: creation score_threshold in flashrank_rerank.py (#24016 ) Description: add a optional score relevance threshold for select only coherent document, it's in complement of top_n Discussion: add relevance score threshold in flashrank_rerank document compressors #24013 Dependencies: no dependencies --------- Co-authored-by: Benjamin BERNARD <benjamin.bernard@openpathview.fr> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-26 13:34:39 +00:00
Cong Peng	190988d93e	community: Add parameter `allow_dangerous_requests` to `WebResearchRetriever.from_llm` construct (#24712 ) Description: To avoid ValueError when construct the retriever from method `from_llm()`.	2024-07-26 06:24:58 -07:00
monysun	5f593c172a	community: fix dashcope embeddings embed_query func post too much req to api (#24707 ) the fuc of embed_query of dashcope embeddings send a str param, and in the embed_with_retry func will send error content to api	2024-07-26 12:44:07 +00:00
yonarw	b65ac8d39c	community[minor]: Self query retriever for HANA Cloud Vector Engine (#24494 ) Description: - This PR adds a self query retriever implementation for SAP HANA Cloud Vector Engine. The retriever supports all operators except for contains. - Issue: N/A - Dependencies: no new dependencies added Add tests and docs: Added integration tests to: libs/community/tests/unit_tests/query_constructors/test_hanavector.py Documentation for self query retriever: /docs/integrations/retrievers/self_query/hanavector_self_query.ipynb --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-07-26 06:56:51 +00:00
nobbbbby	4f3b4fc7fe	community[patch]: Extend Baichuan model with tool support (#24529 ) Description: Expanded the chat model functionality to support tools in the 'baichuan.py' file. Updated module imports and added tool object handling in message conversions. Additional changes include the implementation of tool binding and related unit tests. The alterations offer enhanced model capabilities by enabling interaction with tool-like objects. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-25 23:20:44 -07:00
Rave Harpaz	ee399e3ec5	community[patch]: Add OCI Generative AI tool and structured output support (#24693 ) - [x] PR title: community: Add OCI Generative AI tool and structured output support - [x] PR message: - Description: adding tool calling and structured output support for chat models offered by OCI Generative AI services. This is an update to our last PR 22880 with changes in /langchain_community/chat_models/oci_generative_ai.py - Issue: NA - Dependencies: NA - Twitter handle: NA - [x] Add tests and docs: 1. we have updated our unit tests 2. we have updated our documentation under /docs/docs/integrations/chat/oci_generative_ai.ipynb - [x] Lint and test: `make format`, `make lint` and `make test` we run successfully --------- Co-authored-by: RHARPAZ <RHARPAZ@RHARPAZ-5750.us.oracle.com> Co-authored-by: Arthur Cheng <arthur.cheng@oracle.com>	2024-07-25 23:19:00 -07:00
Yuki Watanabe	2b6a262f84	community[patch]: Replace `filters` argument to `filter` in DatabricksVectorSearch (#24530 ) The [DatabricksVectorSearch](https://github.com/langchain-ai/langchain/blob/master/libs/community/langchain_community/vectorstores/databricks_vector_search.py#L21) class exposes similarity search APIs with argument `filters`, which is inconsistent with other VS classes who uses `filter` (singular). This PR updates the argument and add alias for backward compatibility. --------- Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>	2024-07-25 21:20:18 -07:00
Sunish Sheth	59880a9147	community[patch]: mlflow handle empty chunk(#24689 )	2024-07-25 20:36:29 -07:00
Eugene Yurtsev	20690db482	core[minor]: Add BaseModel.rate_limiter, RateLimiter abstraction and in-memory implementation (#24669 ) This PR proposes to create a rate limiter in the chat model directly, and would replace: https://github.com/langchain-ai/langchain/pull/21992 It resolves most of the constraints that the Runnable rate limiter introduced: 1. It's not annoying to apply the rate limiter to existing code; i.e., possible to roll out the change at the location where the model is instantiated, rather than at every location where the model is used! (Which is necessary if the model is used in different ways in a given application.) 2. batch rate limiting is enforced properly 3. the rate limiter works correctly with streaming 4. the rate limiter is aware of the cache 5. The rate limiter can take into account information about the inputs into the model (we can add optional inputs to it down-the road together with outputs!) The only downside is that information will not be properly reflected in tracing as we don't have any metadata evens about a rate limiter. So the total time spent on a model invocation will be: * time spent waiting for the rate limiter * time spend on the actual model request ## Example ```python from langchain_core.rate_limiters import InMemoryRateLimiter from langchain_groq import ChatGroq groq = ChatGroq(rate_limiter=InMemoryRateLimiter(check_every_n_seconds=1)) groq.invoke('hello') ```	2024-07-26 03:03:34 +00:00
Eugene Yurtsev	c623ae6661	experimental[patch]: Fix import test (#24672 ) Import test was misconfigured, the glob wasn't returning any file paths	2024-07-25 22:14:40 -04:00
Chaunte W. Lacewell	69eacaa887	Community[minor]: Update VDMS vectorstore (#23729 ) Description: - This PR exposes some functions in VDMS vectorstore, updates VDMS related notebooks, updates tests, and upgrade version of VDMS (>=0.0.20) Issue: N/A Dependencies: - Update vdms>=0.0.20	2024-07-25 22:13:04 -04:00
Nuno Campos	8734cabc09	core: Don't draw None edge labels (#24690 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-25 22:12:39 +00:00
Bagatur	4840db6892	docs: standardize groq chat model docs (#24616 ) part of #22296 --------- Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-07-25 14:10:49 -07:00
Isaac Francisco	218c554c4f	[docs]: add doctoring to ChatTogether (#24636 )	2024-07-25 14:10:41 -07:00
Isaac Francisco	5c7e589aaf	deprecating ollama_functions (#24632 )	2024-07-25 13:50:04 -07:00
KyrianC	0fdbaf4a8d	community: fix ChatEdenAI + EdenAI Tools (#23715 ) Fixes for Eden AI Custom tools and ChatEdenAI: - add missing import in __init__ of chat_models - add `args_schema` to custom tools. otherwise '__arg1' would sometimes be passed to the `run` method - fix IndexError when no human msg is added in ChatEdenAI	2024-07-25 15:19:14 -04:00
ccurme	58dd69f7f2	core[patch]: fix mutating tool calls (#24677 ) In some cases tool calls are mutated when passed through a tool.	2024-07-25 16:46:36 +00:00
ccurme	dfbd12b384	mistral[patch]: translate tool call IDs to mistral compatible format (#24668 ) Mistral appears to have added validation for the format of its tool call IDs: `{"object":"error","message":"Tool call id was abc123 but must be a-z, A-Z, 0-9, with a length of 9.","type":"invalid_request_error","param":null,"code":null}` This breaks compatibility of messages from other providers. Here we add a function that converts any string to a Mistral-valid tool call ID, and apply it to incoming messages.	2024-07-25 12:39:32 -04:00
maang-h	38d30e285a	docs: Standardize BaichuanTextEmbeddings docstrings (#24674 ) - Description: Standardize BaichuanTextEmbeddings docstrings. - Issue: the issue #21983	2024-07-25 12:12:00 -04:00
Eugene Yurtsev	89bcca3542	experimental[patch]: Bump core (#24671 )	2024-07-25 09:05:43 -07:00
rick-SOPTIM	cd563fb628	community[minor]: passthrough auth parameter on requests to Ollama-LLMs (#24068 ) Thank you for contributing to LangChain! Description: This PR allows users of `langchain_community.llms.ollama.Ollama` to specify the `auth` parameter, which is then forwarded to all internal calls of `requests.request`. This works in the same way as the existing `headers` parameters. The auth parameter enables the usage of the given class with Ollama instances, which are secured by more complex authentication mechanisms, that do not only rely on static headers. An example are AWS API Gateways secured by the IAM authorizer, which expects signatures dynamically calculated on the specific HTTP request. Issue: Integrating a remote LLM running through Ollama using `langchain_community.llms.ollama.Ollama` only allows setting static HTTP headers with the parameter `headers`. This does not work, if the given instance of Ollama is secured with an authentication mechanism that makes use of dynamically created HTTP headers which for example may depend on the content of a given request. Dependencies: None Twitter handle: None --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-25 15:48:35 +00:00
남광우	256bad3251	core[minor]: Support asynchronous in InMemoryVectorStore (#24472 ) ### Description * support asynchronous in InMemoryVectorStore * since embeddings might be possible to call asynchronously, ensure that both asynchronous and synchronous functions operate correctly.	2024-07-25 11:36:55 -04:00
Luca Dorigo	5fdbdd6bec	community[patch]: Fix invalid iohttp verify parameter (#24655 ) Should fix https://github.com/langchain-ai/langchain/issues/24654	2024-07-25 11:09:21 -04:00
Eugene Yurtsev	7dd6b32991	core[minor]: Add InMemoryRateLimiter (#21992 ) This PR introduces the following Runnables: 1. BaseRateLimiter: an abstraction for specifying a time based rate limiter as a Runnable 2. InMemoryRateLimiter: Provides an in-memory implementation of a rate limiter ## Example ```python from langchain_core.runnables import InMemoryRateLimiter, RunnableLambda from datetime import datetime foo = InMemoryRateLimiter(requests_per_second=0.5) def meow(x): print(datetime.now().strftime("%H:%M:%S.%f")) return x chain = foo \| meow for _ in range(10): print(chain.invoke('hello')) ``` Produces: ``` 17:12:07.530151 hello 17:12:09.537932 hello 17:12:11.548375 hello 17:12:13.558383 hello 17:12:15.568348 hello 17:12:17.578171 hello 17:12:19.587508 hello 17:12:21.597877 hello 17:12:23.607707 hello 17:12:25.617978 hello ``` ![image](https://github.com/user-attachments/assets/283af59f-e1e1-408b-8e75-d3910c3c44cc) ## Interface The rate limiter uses the following interface for acquiring a token: ```python class BaseRateLimiter(Runnable[Input, Output], abc.ABC): @abc.abstractmethod def acquire(self, *, blocking: bool = True) -> bool: """Attempt to acquire the necessary tokens for the rate limiter.``` ``` The flag `blocking` has been added to the abstraction to allow supporting streaming (which is easier if blocking=False). ## Limitations - The rate limiter is not designed to work across different processes. It is an in-memory rate limiter, but it is thread safe. - The rate limiter only supports time-based rate limiting. It does not take into account the size of the request or any other factors. - The current implementation does not handle streaming inputs well and will consume all inputs even if the rate limit has been reached. Better support for streaming inputs will be added in the future. - When the rate limiter is combined with another runnable via a RunnableSequence, usage of .batch() or .abatch() will only respect the average rate limit. There will be bursty behavior as .batch() and .abatch() wait for each step to complete before starting the next step. One way to mitigate this is to use batch_as_completed() or abatch_as_completed(). ## Bursty behavior in `batch` and `abatch` When the rate limiter is combined with another runnable via a RunnableSequence, usage of .batch() or .abatch() will only respect the average rate limit. There will be bursty behavior as .batch() and .abatch() wait for each step to complete before starting the next step. This becomes a problem if users are using `batch` and `abatch` with many inputs (e.g., 100). In this case, there will be a burst of 100 inputs into the batch of the rate limited runnable. 1. Using a RunnableBinding The API would look like: ```python from langchain_core.runnables import InMemoryRateLimiter, RunnableLambda rate_limiter = InMemoryRateLimiter(requests_per_second=0.5) def meow(x): return x rate_limited_meow = RunnableLambda(meow).with_rate_limiter(rate_limiter) ``` 2. Another option is to add some init option to RunnableSequence that changes `.batch()` to be depth first (e.g., by delegating to `batch_as_completed`) ```python RunnableSequence(first=rate_limiter, last=model, how='batch-depth-first') ``` Pros: Does not require Runnable Binding Cons: Feels over-complicated	2024-07-25 01:34:03 +00:00
Oleg Kulyk	4b1b7959a2	community[minor]: Add ScrapingAnt Loader Community Integration (#24514 ) Added [ScrapingAnt](https://scrapingant.com/) Web Loader integration. ScrapingAnt is a web scraping API that allows extracting web page data into accessible and well-formatted markdown. Description: Added ScrapingAnt web loader for retrieving web page data as markdown Dependencies: scrapingant-client Twitter: @WeRunTheWorld3 --------- Co-authored-by: Oleg Kulyk <oleg@scrapingant.com>	2024-07-24 21:11:43 -04:00
John	d59c656ea5	unstructured, community, initialize langchain-unstructured package (#22779 ) #### Update (2): A single `UnstructuredLoader` is added to handle both local and api partitioning. This loader also handles single or multiple documents. #### Changes in `community`: Changes here do not affect users. In the initial process of using the SDK for the API Loaders, the Loaders in community were refactored. Other changes include: The `UnstructuredBaseLoader` has a new check to see if both `mode="paged"` and `chunking_strategy="by_page"`. It also now has `Element.element_id` added to the `Document.metadata`. `UnstructuredAPIFileLoader` and `UnstructuredAPIFileIOLoader`. As such, now both directly inherit from `UnstructuredBaseLoader` and initialize their `file_path`/`file` attributes respectively and implement their own `_post_process_elements` methods. -------- #### Update: New SDK Loaders in a [partner package](https://python.langchain.com/v0.1/docs/contributing/integrations/#partner-package-in-langchain-repo) are introduced to prevent breaking changes for users (see discussion below). ##### TODO: - [x] Test docstring examples -------- - Description: UnstructuredAPIFileIOLoader and UnstructuredAPIFileLoader calls to the unstructured api are now made using the unstructured-client sdk. - New Dependencies: unstructured-client - [x] Add tests and docs: If you're adding a new integration, please include - [x] a test for the integration, preferably unit tests that do not rely on network access, - [x] update the description in `docs/docs/integrations/providers/unstructured.mdx` - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. TODO: - [x] Update https://python.langchain.com/v0.1/docs/integrations/document_loaders/unstructured_file/#unstructured-api - `langchain/docs/docs/integrations/document_loaders/unstructured_file.ipynb` - The description here needs to indicate that users should install `unstructured-client` instead of `unstructured`. Read over closely to look for any other changes that need to be made. - [x] Update the `lazy_load` method in `UnstructuredBaseLoader` to handle json responses from the API instead of just lists of elements. - This method may need to be overwritten by the API loaders instead of changing it in the `UnstructuredBaseLoader`. - [x] Update the documentation links in the class docstrings (the Unstructured documents have moved) - [x] Update Document.metadata to include `element_id` (see thread [here](https://unstructuredw-kbe4326.slack.com/archives/C044N0YV08G/p1718187499818419)) --------- Signed-off-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: ChengZi <chen.zhang@zilliz.com>	2024-07-24 23:21:20 +00:00
Joel Akeret	acfce30017	Adding compatibility for OllamaFunctions with ImagePromptTemplate (#24499 ) - [ ] PR title: "experimental: Adding compatibility for OllamaFunctions with ImagePromptTemplate" - [ ] PR message: - Description: Removes the outdated `_convert_messages_to_ollama_messages` method override in the `OllamaFunctions` class to ensure that ollama multimodal models can be invoked with an image. - Issue: #24174 --------- Co-authored-by: Joel Akeret <joel.akeret@ti&m.com> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-07-24 14:57:05 -07:00
Erick Friis	8f3c052db1	cli: release 0.0.26 (#24623 ) - cli: remove snapshot flag from pytest defaults - x - x	2024-07-24 13:13:58 -07:00
ChengZi	29a3b3a711	partners[milvus]: add dynamic field (#24544 ) add dynamic field feature to langchain_milvus more unittest, more robustic plan to deprecate the `metadata_field` in the future, because it's function is the same as `enable_dynamic_field`, but the latter one is a more advanced concept in milvus Signed-off-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-24 20:01:58 +00:00
Erick Friis	20fe4deea0	milvus: release 0.1.3 (#24624 )	2024-07-24 13:01:27 -07:00

... 5 6 7 8 9 ...

5699 Commits