langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-09-08 14:31:55 +00:00

Author	SHA1	Message	Date
Erick Friis	b28bc252c4	core[patch]: mmr util (#25689 )	2024-08-22 21:31:17 -07:00
ZhangShenao	ba89933c2c	Doc[Embeddings] Add docs for `ZhipuAIEmbeddings` (#25662 ) - Add docs for `ZhipuAIEmbeddings`. - Using integration doc template. - Source api reference: https://bigmodel.cn/dev/api#vector --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-23 01:33:43 +00:00
Erick Friis	6096c80b71	core: pydantic output parser streaming fix (#24415 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-22 18:00:09 -07:00
Eugene Yurtsev	c316361115	core[patch]: Add _api.rename_parameter to support renaming of parameters in functions (#25101 ) Add ability to rename parameters in function signatures ```python @rename_parameter(since="2.0.0", removal="3.0.0", old="old_name", new="new_name") def foo(new_name: str) -> str: """original doc""" return new_name ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-08-22 17:16:31 -07:00
Yusuke Fukasawa	0258cb96fa	core[patch]: add additionalProperties recursively to oai function if strict (#25169 ) Hello. First of all, thank you for maintaining such a great project. ## Description In https://github.com/langchain-ai/langchain/pull/25123, support for structured_output is added. However, `"additionalProperties": false` needs to be included at all levels when a nested object is generated. error from current code: https://gist.github.com/fufufukakaka/e9b475300e6934853d119428e390f204 ``` BadRequestError: Error code: 400 - {'error': {'message': "Invalid schema for response_format 'JokeWithEvaluation': In context=('properties', 'self_evaluation'), 'additionalProperties' is required to be supplied and to be false", 'type': 'invalid_request_error', 'param': 'response_format', 'code': None}} ``` Reference: [Introducing Structured Outputs in the API](https://openai.com/index/introducing-structured-outputs-in-the-api/) ```json { "model": "gpt-4o-2024-08-06", "messages": [ { "role": "system", "content": "You are a helpful math tutor." }, { "role": "user", "content": "solve 8x + 31 = 2" } ], "response_format": { "type": "json_schema", "json_schema": { "name": "math_response", "strict": true, "schema": { "type": "object", "properties": { "steps": { "type": "array", "items": { "type": "object", "properties": { "explanation": { "type": "string" }, "output": { "type": "string" } }, "required": ["explanation", "output"], "additionalProperties": false } }, "final_answer": { "type": "string" } }, "required": ["steps", "final_answer"], "additionalProperties": false } } } } ``` In the current code, `"additionalProperties": false` is only added at the last level. This PR introduces the `_add_additional_properties_key` function, which recursively adds `"additionalProperties": false` to the entire JSON schema for the request. Twitter handle: `@fukkaa1225` Thank you! --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-23 00:08:58 +00:00
Bagatur	b35ee09b3f	infra: xfail pydantic v2 arg to py function (#25686 ) Issue to track: #25687	2024-08-22 23:52:57 +00:00
Christophe Bornet	ee98da4f4e	core[patch]: Add UP(upgrade) ruff rules (#25358 )	2024-08-22 16:29:22 -07:00
William FH	294f7fcb38	core[patch]: Remove different parent run id warning (#25683 )	2024-08-22 16:10:35 -07:00
Vadym Barda	46d344c33d	core[patch]: support drawing nested subgraphs in draw_mermaid (#25581 ) Previously the code was able to only handle a single level of nesting for subgraphs in mermaid. This change adds support for arbitrary nesting of subgraphs.	2024-08-22 16:08:49 -07:00
Manuel Jaiczay	1c31234eed	community: fix HuggingFacePipeline pipeline_kwargs (#19920 ) Fix handling of pipeline_kwargs to prioritize class attribute defaults. #19770 Co-authored-by: jaizo <manuel.jaiczay@polygons.at> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-08-22 18:29:46 -04:00
Nobuhiko Otoba	4b63a217c2	"community: Fix GithubFileLoader source code", "docs: Fix GithubFileLoader code sample" (#19943 ) This PR adds tiny improvements to the `GithubFileLoader` document loader and its code sample, addressing the following issues: 1. Currently, the `file_extension` argument of `GithubFileLoader` does not change its behavior at all. 1. The `GithubFileLoader` sample code in `docs/docs/integrations/document_loaders/github.ipynb` does not work as it stands. The respective solutions I propose are the following: 1. Remove `file_extension` argument from `GithubFileLoader`. 1. Specify the branch as `master` (not the default `main`) and rename `documents` as `document`. --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-08-22 18:24:57 -04:00
Bagatur	cf9c484715	standard-tests[patch]: test Message.name (#25677 ) Tests: https://github.com/langchain-ai/langchain/actions/runs/10516092584	2024-08-22 14:47:31 -07:00
Nada Amin	ac7b71e0d7	langchain_community.graphs: Neo4JGraph: prop min_size might be None (#23944 ) When I used the Neo4JGraph enhanced_schema=True option, I ran into an error because a prop min_size of None was compared numerically with an int. The fix I applied is similar to the pattern of skipping embeddings elsewhere in the file. Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-22 20:29:52 +00:00
CastaChick	7d13a2f958	core[patch]: add option to specify the chunk separator in `merge_message_runs` (#24783 ) Description: LLM will stop generating text even in the middle of a sentence if `finish_reason` is `length` (for OpenAI) or `stop_reason` is `max_tokens` (for Anthropic). To obtain longer outputs from LLM, we should call the message generation API multiple times and merge the results into the text to circumvent the API's output token limit. The extra line breaks forced by the `merge_message_runs` function when seamlessly merging messages can be annoying, so I added the option to specify the chunk separator. Issue: No corresponding issues. Dependencies: No dependencies required. Twitter handle: @hanama_chem https://x.com/hanama_chem --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-22 19:46:25 +00:00
basirsedighi	0f3fe44e44	parsed_json is expected to be a list of dictionaries, but it seems to… (#24018 ) parsed_json is expected to be a list of dictionaries, but it seems to be a single dictionary instead. This occurs in `process_response` in libs/experimental/langchain_experimental/graph_transformers/llm.py. --------- Co-authored-by: based <basir.sedighi@nris.no> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 19:09:43 +00:00
ZhangShenao	8bde04079b	patch[experimental] Fix start_index in `SemanticChunker` (#24761 ) - Because chunks are joined by spaces, they often cannot be found in the original text, so the final `start_index` is very likely to be -1. - The simplest fix is to use the natural index of the chunk as `start_index`.	2024-08-22 14:59:40 -04:00
William FH	fad6fc866a	Rm DeepInfra Breakpoint Comment (#25206 ) tbh should rm the print statement too	2024-08-22 14:43:44 -04:00
yahya-mouman	e5bb4cb646	lagchain-pinecone: add id to similarity documents results (#25630 ) - Description: This change adds the ID field that's required in Pinecone to the result documents of the similarity search method. - Issue: Lack of document metadata namely the ID field - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 18:33:26 +00:00
Eric Pinzur	01ded5e2f9	community: add metadata filter to CassandraGraphVectorStore (#25663 ) - Description: - Added metadata filtering support to `langchain_community.graph_vectorstores.cassandra.CassandraGraphVectorStore` - Also fixed type conversion issues highlighted by mypy. - Dependencies: - `ragstack-ai-knowledge-store 0.2.0` (released July 23, 2024) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 14:27:16 -04:00
Ivan	5b9290a449	Fix UnionType type var replacement (#25566 ) [langchain_core] Fix UnionType type var replacement - Added a `types.UnionType` to `typing.Union` mapping. Type replacement raises `TypeError: 'type' object is not subscriptable` for union types when `_py_38_safe_origin` returns `types.UnionType` instead of `typing.Union` ```python >>> from types import UnionType >>> from typing import Union, get_origin >>> type_ = get_origin(str | None) >>> type_ <class 'types.UnionType'> >>> UnionType[(str, None)] Traceback (most recent call last): File "<stdin>", line 1, in <module> TypeError: 'type' object is not subscriptable >>> Union[(str, None)] typing.Optional[str] ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 14:22:09 -04:00
William FH	8230ba47f3	core[patch]: Improve some error messages and add another test for checking RunnableWithMessageHistory (#25209 ) Also add more useful error messages. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-08-22 18:14:27 +00:00
Hasan Kumar	b4fcda7657	langchain: Fix type warnings when passing Runnable as agent to AgentExecutor (#24750 ) Fix for https://github.com/langchain-ai/langchain/issues/13075 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 14:02:02 -04:00
Erick Friis	9447925d94	cli: release 0.0.30 (#25672 )	2024-08-22 10:21:19 -07:00
mschoenb97IL	e499caa9cd	community: Give more context on DeepInfra 500 errors (#25671 ) Description: DeepInfra 500 errors have useful information in the text field that isn't being exposed to the user. I updated the error message to fix this. As an example, this code ``` from langchain_community.chat_models import ChatDeepInfra from langchain_core.messages import HumanMessage model = "meta-llama/Meta-Llama-3-70B-Instruct" deepinfra_api_token = "..." model = ChatDeepInfra(model=model, deepinfra_api_token=deepinfra_api_token) messages = [HumanMessage("All work and no play makes Jack a dull boy\n" * 9000)] response = model.invoke(messages) ``` Currently gives this error: ``` langchain_community.chat_models.deepinfra.ChatDeepInfraException: DeepInfra Server: Error 500 ``` This change would give the following error: ``` langchain_community.chat_models.deepinfra.ChatDeepInfraException: DeepInfra Server error status 500: {"error":{"message":"Requested input length 99009 exceeds maximum input length 8192"}} ```	2024-08-22 10:10:51 -07:00
Rajendra Kadam	4ff2f4499e	community: Refactor PebbloRetrievalQA (#25583 ) Refactor PebbloRetrievalQA - Created `APIWrapper` and moved API logic into it. - Created smaller functions/methods for better readability. - Properly read environment variables. - Removed unused code. - Updated models Issue: NA Dependencies: NA tests: NA	2024-08-22 11:51:21 -04:00
Rajendra Kadam	1f1679e960	community: Refactor PebbloSafeLoader (#25582 ) Refactor PebbloSafeLoader - Created `APIWrapper` and moved API logic into it. - Moved helper functions to the utility file. - Created smaller functions and methods for better readability. - Properly read environment variables. - Removed unused code. Issue: NA Dependencies: NA tests: Updated	2024-08-22 11:46:52 -04:00
maang-h	5e3a321f71	docs: Add ChatZhipuAI tool calling and structured output docstring (#25669 ) - Description: Add `ChatZhipuAI` tool calling and structured output docstring.	2024-08-22 10:34:41 -04:00
Krishna Kulkarni	820da64983	limit the most recent documents to fetch from MongoDB database. (#25435 ) - Description: Added a `doc_limit` parameter that limits the number of documents fetched from the MongoDB database. - Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 10:33:45 -04:00
Noah Mayerhofer	0091947efd	community: add retry for session expired exception in neo4j (#25660 ) Description: The neo4j driver can raise a SessionExpired error, which is considered a retriable error. If a query fails with a SessionExpired error, this change retries every query once. This change will make the neo4j integration less flaky. Twitter handle: noahmay_	2024-08-22 13:07:36 +00:00
Yuki Watanabe	3981d736df	databricks: Add partner package directory and ChatDatabricks implementation (#25430 ) ### Summary Create `langchain-databricks` as a new partner package. This PR does not migrate all existing Databricks integrations, but the package will eventually contain: * `ChatDatabricks` (implemented in this PR) * `DatabricksVectorSearch` * `DatabricksEmbeddings` * ~`UCFunctionToolkit`~ (will be done after the UC SDK work, which drastically simplifies the implementation) Also, this PR does not add integration tests yet. These will be added once the Databricks test workspace is ready. Tagging @efriis as POC ### Tracker [✍️] Create a package and migrate ChatDatabricks [ ] Migrate DatabricksVectorSearch, DatabricksEmbeddings, and their docs ~[ ] Migrate UCFunctionToolkit and its doc~ [ ] Add provider document and update README.md [ ] Add integration tests and set up secrets (after moved to an external package) [ ] Add deprecation note to the community implementations. --------- Signed-off-by: B-Step62 <yuki.watanabe@databricks.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-21 17:19:28 -07:00
Scott Hurrey	fb1d67edf6	box: add retrievers and fix docs (#25633 ) Description: Adding `BoxRetriever` for langchain_box. This retriever handles two use cases: * Retrieve all documents that match a full-text search * Retrieve the answer to a Box AI prompt as a Document Twitter handle: @BoxPlatform --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-21 22:40:40 +00:00
Erick Friis	766b650fdc	chroma: add back fastapi optional dep (#25641 )	2024-08-21 20:00:47 +00:00
Bagatur	9daff60698	docs: fix openai api ref (#25639 )	2024-08-21 12:55:17 -07:00
Erick Friis	c8be0a9f70	partners/unstructured: release 0.1.2 (#25637 )	2024-08-21 12:53:55 -07:00
Christophe Bornet	b71ae52e65	[unstructured][security] Bump unstructured version (#25364 ) This ensures version 0.15.7+ is pulled. This version of unstructured uses a version of NLTK >= 3.8.2 that has a fix for a critical CVE: https://github.com/advisories/GHSA-cgvx-9447-vcch	2024-08-21 12:25:24 -07:00
Bagatur	39c44817ae	infra: test convert_message (#25632 )	2024-08-21 18:24:06 +00:00
Bagatur	628574b9c2	core[patch]: Release 0.2.34 (#25622 )	2024-08-21 16:26:51 +00:00
Bagatur	0bc3845e1e	core[patch]: support oai dicts as messages (#25621 ) and update langsmith example selector docs	2024-08-21 16:13:15 +00:00
ccurme	10a2ce2a26	together[patch]: use mixtral in standard integration tests (#25619 ) Mistral 7B occasionally fails tool-calling tests. Updating to Mixtral appears to improve this.	2024-08-21 14:26:25 +00:00
Dristy Srivastava	b002702af6	[Community][minor]: Updating metadata with full_path in SharePoint loader (#25593 ) - Description: Updating metadata for sharepoint loader with full path i.e., webUrl - Issue: NA - Dependencies: NA - Tests: NA - Docs NA Co-authored-by: dristy.cd <dristy@clouddefense.io> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-21 13:10:14 +00:00
ZhangShenao	34d0417eb5	Improvement[Doc] Improve api doc in of `PineconeVectorStore` (#25605 ) Complete missing arguments in api doc of `PineconeVectorStore`.	2024-08-21 08:58:00 -04:00
Scott Hurrey	55fd2e2158	box: add langchain box package and DocumentLoader (#25506 ) - Description: Adding new package: `langchain-box`: * `langchain_box.document_loaders.BoxLoader` — DocumentLoader functionality * `langchain_box.utilities.BoxAPIWrapper` — Box-specific code * `langchain_box.utilities.BoxAuth` — Helper class for Box authentication * `langchain_box.utilities.BoxAuthType` — enum used by BoxAuth class - Twitter handle: @boxplatform --------- Co-authored-by: Erick Friis <erickfriis@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-21 02:23:43 +00:00
Erick Friis	f878df404f	partners/chroma: release 0.1.3 (#25599 )	2024-08-20 23:24:32 +00:00
Erick Friis	60cf49a618	chroma: ban chromadb sdk versions 0.5.4 and 0.5.5 due to pydantic bug (#25586 ) also remove some unused dependencies (fastapi) and unused test/lint/dev dependencies (community, openai, textsplitters) chromadb 0.5.4 introduced usage of `model_fields` which is pydantic v2 specific. also released in 0.5.5	2024-08-20 23:21:38 +00:00
Erick Friis	e37caa9b9a	core: fix fallback context overwriting (#25550 ) fixes #25337	2024-08-20 16:07:12 -07:00
Bagatur	8a71f1b41b	core[minor]: add langsmith document loader (#25493 ) needs tests	2024-08-20 10:22:14 -07:00
Jabir	12e490ea56	Update azuresearch.py (#25577 ) This allows complex-type metadata to be returned. The current implementation throws an error when dealing with nested metadata. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-20 12:53:30 +00:00
Bagatur	4bd005adb6	core[patch]: Allow bound models as token_counter in trim_messages (#25563 )	2024-08-20 00:21:22 -07:00
Erick Friis	e01c6789c4	core,community: add beta decorator to missed GraphVectorStore extensions (#25562 )	2024-08-19 17:29:09 -07:00
Bagatur	6b98207eda	infra: test chat prompt ser/des (#25557 )	2024-08-19 15:27:36 -07:00
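
Note on 0258cb96fa (#25169): the commit message says `"additionalProperties": false` must appear at every level of a nested object schema when strict mode is used. The following is a minimal sketch of that recursive idea, not the code from the PR. The function name `add_additional_properties_false` is hypothetical (the PR's own helper is `_add_additional_properties_key`, whose body is not shown here), and the rule "any dict whose `type` is `object`" is an assumption made for illustration.

```python
from typing import Any


def add_additional_properties_false(schema: Any) -> Any:
    """Set "additionalProperties": False on every object schema, in place.

    Hypothetical helper for illustration only; it treats any dict whose
    "type" is "object" as an object schema and walks all nested values.
    """
    if isinstance(schema, dict):
        if schema.get("type") == "object":
            schema["additionalProperties"] = False
        # Copy the values first so the loop never iterates a dict being changed.
        for value in list(schema.values()):
            add_additional_properties_false(value)
    elif isinstance(schema, list):
        for item in schema:
            add_additional_properties_false(item)
    return schema


# Nested schema modelled on the "math_response" example in the commit message,
# with the "additionalProperties" keys left out on purpose.
math_response = {
    "type": "object",
    "properties": {
        "steps": {
            "type": "array",
            "items": {
                "type": "object",
                "properties": {
                    "explanation": {"type": "string"},
                    "output": {"type": "string"},
                },
                "required": ["explanation", "output"],
            },
        },
        "final_answer": {"type": "string"},
    },
    "required": ["steps", "final_answer"],
}

patched = add_additional_properties_false(math_response)
```

After the call, both the top-level object and the object nested under `steps.items` carry the key, while non-object schemas such as `final_answer` are left unchanged.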


6305 Commits
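
Note on 0091947efd (#25660): the commit message says a query that fails with a SessionExpired error is retried once. A minimal sketch of that retry-once pattern follows. It does not use the neo4j driver: the `SessionExpired` class here is a local stand-in, and `run_with_one_retry` and `run_query` are hypothetical names chosen for illustration.

```python
from typing import Any, Callable


class SessionExpired(Exception):
    """Local stand-in for the driver's session-expired error (assumption)."""


def run_with_one_retry(run_query: Callable[[str], Any], query: str) -> Any:
    """Run `query`; if the session has expired, try exactly one more time.

    A second SessionExpired is not caught, so it propagates to the caller.
    """
    try:
        return run_query(query)
    except SessionExpired:
        return run_query(query)
```

Retrying only once keeps the behaviour predictable: a transient expiry is absorbed, while a persistent failure still surfaces as an exception instead of looping.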