langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-06-20 22:03:52 +00:00

Author	SHA1	Message	Date
Bharat Ramanathan	103e573f9b	community[patch]: chore warn deprecate the wandb callback handler (#27062 ) - Description:: This PR deprecates the wandb callback handler in favor of the new [WeaveTracer](https://weave-docs.wandb.ai/guides/integrations/langchain#using-weavetracer) in W&B - Dependencies: No dependencies, just a deprecation warning. - Twitter handle: @parambharat @baskaryan	2024-10-03 11:59:20 -04:00
Erick Friis	95a87291fd	community: deprecate community ollama integrations (#26733 )	2024-10-01 09:18:07 -07:00
ZhangShenao	e317d457cf	Bug-Fix[Community] Fix `FastEmbedEmbeddings` (#26764 ) #26759 - Fix https://github.com/langchain-ai/langchain/issues/26759 - Change `model` param from private to public, which may not be initiated. - Add test case	2024-09-30 21:23:08 -04:00
Erick Friis	7ecd720120	multiple: update docs urls to latest 2 (#26837 )	2024-09-30 17:37:07 -07:00
Tomaz Bratanic	446144e7c6	Update neo4j vector procedures (#26775 )	2024-09-30 14:45:09 -07:00
Eugene Yurtsev	7fde2791dc	core[patch]: Add kwargs to Runnable (#27008 ) Fixes #26685 --------- Co-authored-by: Tibor Reiss <tibor.reiss@gmail.com>	2024-09-30 16:45:29 -04:00
Christophe Bornet	2a6abd3f0a	community[patch]: Add docstring for Links (#25969 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-09-30 20:33:50 +00:00
Ben Chambers	29bf89db25	community: Add conversions from GVS to networkx (#26906 ) These allow converting linked documents (such as those used with GraphVectorStore) to networkx for rendering and/or in-memory graph algorithms such as community detection.	2024-09-27 16:48:55 -04:00
Subhrajyoty Roy	7f37fd8b80	community[patch]: callback before yield for cloudflare (#26927 ) Description: Moves yield to after callback for `_stream` function for the cloudfare workersai model in the community llm package Issue: #16913	2024-09-27 08:42:01 -04:00
Youshin Kim	2d9a09dfa4	Fix typo in mlflow code example in mlflow.py (#26931 ) - [x] PR title: Fix typo in code example in mlflow.py - In libs/community/langchain_community/chat_models/mlflow.py	2024-09-27 12:41:39 +00:00
Subhrajyoty Roy	7037ba0f06	community[patch]: callback before yield for mlx pipeline (#26928 ) Description: Moves yield to after callback for `_stream` function for the MLX pipeline model in the community llm package Issue: #16913	2024-09-27 08:41:34 -04:00
Subhrajyoty Roy	adcfecdb67	community[patch]: callback before yield for textgen (#26929 ) Description: Moves callback to before yield for `_stream` and `_astream` function for the textgen model in the community llm package Issue: #16913	2024-09-27 08:41:13 -04:00
Subhrajyoty Roy	5f2cc4ecb2	community[patch]: callback before yield for titan takeoff (#26930 ) Description: Moves yield to after callback for `_stream` function for the titan takeoff model in the community llm package Issue: #16913	2024-09-27 08:40:22 -04:00
Abhi Agarwal	696114e145	community: add sqlite-vec vectorstore (#25003 ) Description: Adds a vector store integration with [sqlite-vec](https://alexgarcia.xyz/sqlite-vec/), the successor to sqlite-vss that is a single C file with no external dependencies. Pretty straightforward, just copy-pasted the sqlite-vss integration and made a few tweaks and added integration tests. Only question is whether all documentation should be directed away from sqlite-vss if it is defacto deprecated (cc @asg017). --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: philippe-oger <philippe.oger@adevinta.com>	2024-09-26 17:37:10 +00:00
Subhrajyoty Roy	ba467f1a36	community[patch]: callback before yield for gigachat (#26881 ) Description: Moves yield to after callback for `_stream` and `_astream` function for the gigachat model in the community llm package Issue: #16913	2024-09-26 12:47:28 -04:00
Subhrajyoty Roy	11e703a97e	community[patch]: callback before yield for google palm (#26882 ) Description: Moves yield to after callback for `_stream` function for the google palm model in the community package Issue: #16913	2024-09-26 12:47:05 -04:00
Erick Friis	6f3c8313ba	community: bump langchain version (#26876 )	2024-09-25 12:58:24 -07:00
Erick Friis	e068407f18	community: bump core versoin (#26875 )	2024-09-25 12:57:16 -07:00
Eugene Yurtsev	25cb44c9ee	0.3.1 release community (#26872 ) Release for 0.3.1 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-25 19:38:53 +00:00
Rajendra Kadam	51c4393298	community[patch]: Fix validation error in SettingsConfigDict across multiple Langchain modules (#26852 ) - Description: This pull request addresses the validation error in `SettingsConfigDict` due to extra fields in the `.env` file. The issue is prevalent across multiple Langchain modules. This fix ensures that extra fields in the `.env` file are ignored, preventing validation errors. Changes include: - Applied fixes to modules using `SettingsConfigDict`. - Issue: NA, similar https://github.com/langchain-ai/langchain/issues/26850 - Dependencies: NA	2024-09-25 10:02:14 -04:00
Rajendra Kadam	7e5a9c317f	community[minor]: [Pebblo] Enhance PebbloSafeLoader to take anonymize flag (#26812 ) - Description: The flag is named `anonymize_snippets`. When set to true, the Pebblo server will anonymize snippets by redacting all personally identifiable information (PII) from the snippets going into VectorDB and the generated reports - Issue: NA - Dependencies: NA - docs: Updated	2024-09-25 09:33:06 -04:00
Rajendra Kadam	92003b3724	community[patch]: [SharePointLoader] Fix validation error in _O365Settings due to extra fields in .env file (#26851 ) Description: Fix validation error in _O365Settings by ignoring extra fields in .env file Issue: https://github.com/langchain-ai/langchain/issues/26850 Dependencies: NA	2024-09-25 09:31:59 -04:00
Subhrajyoty Roy	b61fb98466	community[patch]: callback before yield for friendli (#26842 ) Description: Moves yield to after callback for `_stream` and `_astream` function for the friendli model in the community package Issue: #16913	2024-09-25 09:31:12 -04:00
Subhrajyoty Roy	b1da532522	community[patch]: callback before yield for deepsparse llm (#26822 ) Description: Moves yield to after callback for `_stream` and `_astream` function for the deepsparse model in the community package Issue: #16913	2024-09-24 13:55:52 -04:00
Jorge Piedrahita Ortiz	408a930d55	community: Add Sambanova Cloud Chat model community integration (#26333 ) Description: : Add SambaNova Cloud Chat model community integration Includes - chat model integration (following Standardize ChatModel docstrings) - tests - docs usage notebook (following Standardize ChatModel integration docs) https://cloud.sambanova.ai/ --------- Co-authored-by: luisfucros <luisfucros@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-09-24 14:11:32 +00:00
Tom	2b83c7c3ab	community[patch]: Fix `tool_calls` parsing when streaming from DeepInfra (#26813 ) - Description: This PR fixes the response parsing logic for `ChatDeepInfra`, more specifially `_convert_delta_to_message_chunk()`, which is invoked when streaming via `ChatDeepInfra`. - Issue: Streaming from DeepInfra via `ChatDeepInfra` is currently broken because the response parsing logic doesn't handle that `tool_calls` can be `None`. (There is no GitHub issue for this problem yet.) - Dependencies: – - Twitter handle: – Keeping this here as a reminder: > If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-09-24 13:47:36 +00:00
Subhrajyoty Roy	997d95c8f8	community[patch]: callback before yield for bedrock llm (#26804 ) Description: Moves yield to after callback for `_prepare_input_and_invoke_stream` and `_aprepare_input_and_invoke_stream` for bedrock llm in community package. Issue: #16913	2024-09-24 12:14:59 +00:00
Gabriel Altay	bb40a0fb32	Remove pydantic restricted namespaces from HuggingFaceInferenceAPIEmbedings (#26744 ) without this `model_config` importing this package produces warnings about "model_name" having conflicts with protected namespace "model_". Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-09-22 08:05:37 -04:00
Gor Hayrapetyan	f97ac92f00	community[patch]: Handle empty PR body in get_pull_request in Github utility (#26739 ) Description: When PR body is empty `get_pull_request` method fails with bellow exception. Issue: ``` TypeError('expected string or buffer')Traceback (most recent call last): File ".../.venv/lib/python3.9/site-packages/langchain_core/tools/base.py", line 661, in run response = context.run(self._run, tool_args, tool_kwargs) File ".../.venv/lib/python3.9/site-packages/langchain_community/tools/github/tool.py", line 52, in _run return self.api_wrapper.run(self.mode, query) File ".../.venv/lib/python3.9/site-packages/langchain_community/utilities/github.py", line 816, in run return json.dumps(self.get_pull_request(int(query))) File ".../.venv/lib/python3.9/site-packages/langchain_community/utilities/github.py", line 495, in get_pull_request add_to_dict(response_dict, "body", pull.body) File ".../.venv/lib/python3.9/site-packages/langchain_community/utilities/github.py", line 487, in add_to_dict tokens = get_tokens(value) File ".../.venv/lib/python3.9/site-packages/langchain_community/utilities/github.py", line 483, in get_tokens return len(tiktoken.get_encoding("cl100k_base").encode(text)) File "....venv/lib/python3.9/site-packages/tiktoken/core.py", line 116, in encode if match := _special_token_regex(disallowed_special).search(text): TypeError: expected string or buffer ``` Twitter:* __gorros__	2024-09-22 01:56:24 +00:00
Christophe Bornet	58f339a67c	community: Fix links in GraphVectorStore pydoc (#25959 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-20 23:17:53 +00:00
Christophe Bornet	e49c413977	core: Add docstring for GraphVectorStoreRetriever (#26224 ) Co-authored-by: Erick Friis <erickfriis@gmail.com>	2024-09-20 23:16:37 +00:00
ccurme	f2285376a5	community[patch]: add web loader tests (#26728 )	2024-09-20 18:29:54 -04:00
William FH	19ce95d3c9	Avoid copying runs (#26689 ) Also, re-unify run trees. Use a single shared client.	2024-09-20 10:57:41 -07:00
Eric	90031b1b3e	support epsilla cloud vector database in langchain (#26065 ) Description - support epsilla cloud in langchain --------- Co-authored-by: Leonid Ganeline <leo.gan.57@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-09-20 17:14:23 +00:00
stein1988	91594928c5	fix:fix ChatZhipuAI tool call bug (#26693 ) - [ ] PR title: "community:fix ChatZhipuAI tool call bug" - [ ] Description: ZhipuAI api response as follows: {'id': '20240920132549e379a9152a6a4d7c', 'created': 1726809949, 'model': 'glm-4-flash', 'choices': [{'index': 0, 'finish_reason': 'tool_calls', 'delta': {'role': 'assistant', 'tool_calls': [{'id': 'call_20240920132549e379a9152a6a4d7c', 'index': 0, 'type': 'function', 'function': {'name': 'get_datetime_offline', 'arguments': '{}'}}]}}]} so, tool_calls = dct.get("tool_call", None) in _convert_delta_to_message_chunk should be "tool_calls"	2024-09-20 13:06:42 +00:00
Erick Friis	311f861547	core, community: move graph vectorstores to community (#26678 ) remove beta namespace from core, add to community	2024-09-19 11:38:14 -07:00
Serena Ruan	c77c28e631	[community] Fix WorkspaceClient error with pydantic validation (#26649 ) Thank you for contributing to LangChain! Fix error like <img width="1167" alt="image" src="https://github.com/user-attachments/assets/2e219b26-ec7e-48ef-8111-e0ff2f5ac4c0"> After the fix: <img width="584" alt="image" src="https://github.com/user-attachments/assets/48f36fe7-628c-48b6-81b2-7fe741e4ca85"> - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Signed-off-by: serena-ruan <serena.rxy@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-19 18:25:33 +00:00
ccurme	f91bdd12d2	community[patch]: add to pypdf tests and run in CI (#26663 )	2024-09-19 14:45:49 +00:00
Rajendra Kadam	60dc19da30	[community] Added PebbloTextLoader for loading text data in PebbloSafeLoader (#26582 ) - Description: Added PebbloTextLoader for loading text in PebbloSafeLoader. - Since PebbloSafeLoader wraps document loaders, this new loader enables direct loading of text into Documents using PebbloSafeLoader. - Issue: NA - Dependencies: NA - [x] Tests: Added/Updated tests	2024-09-19 09:59:04 -04:00
Jorge Piedrahita Ortiz	55b641b761	community: fix error in sambastudio embeddings (#26260 ) fix error in samba studio embeddings result unpacking	2024-09-19 09:57:04 -04:00
Jorge Piedrahita Ortiz	37b72023fe	community: remove sambaverse (#26265 ) removing Sambaverse llm model and references given is not available after Sep/10/2024 <img width="1781" alt="image" src="https://github.com/user-attachments/assets/4dcdb5f7-5264-4a03-b8e5-95c88304e059">	2024-09-19 09:56:30 -04:00
Martin Triska	3fc0ea510e	community : [bugfix] Use document ids as keys in AzureSearch vectorstore (#25486 ) # Description [Vector store base class](`4cdaca67dc/libs/core/langchain_core/vectorstores/base.py (L65)`) currently expects `ids` to be passed in and that is what it passes along to the AzureSearch vector store when attempting to `add_texts()`. However AzureSearch expects `keys` to be passed in. When they are not present, AzureSearch `add_embeddings()` makes up new uuids. This is a problem when trying to run indexing. [Indexing code expects](`b297af5482/libs/core/langchain_core/indexing/api.py (L371)`) the documents to be uploaded using provided ids. Currently AzureSearch ignores `ids` passed from `indexing` and makes up new ones. Later when `indexer` attempts to delete removed file, it uses the `id` it had stored when uploading the document, however it was uploaded under different `id`. Twitter handle: @martintriska1	2024-09-19 09:37:18 -04:00
Erik	4e0a6ebe7d	community: Add warning when page_content is empty (#25955 ) Page content sometimes is empty when PyMuPDF can not find text on pages. For example, this can happen when the text of the PDF is not copyable "by hand". Then an OCR solution is need - which is not integrated here. This warning should accurately warn the user that some pages are lost during this process. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-19 05:22:09 +00:00
Tibor Reiss	85caaa773f	docs[community]: Fix raw string in docstring (#26350 ) Fixes #26212: replaced the raw string with backslashes. Alternative: raw-stringif the full docstring. --------- Co-authored-by: Erick Friis <erickfriis@gmail.com>	2024-09-19 04:18:56 +00:00
Tomaz Bratanic	03b9aca55d	community: Retry retriable errors in Neo4j (#26211 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-19 04:01:07 +00:00
ZhangShenao	c3b3f46cb8	Improvement[Community] Improve api doc of `BeautifulSoupTransformer` (#26423 ) - Add missing args Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-17 22:00:07 +00:00
ogawa	e2245fac82	community[patch]: o1-preview and o1-mini costs (#26411 ) updated OpenAI cost definitions according to the following: https://openai.com/api/pricing/ Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-17 21:59:46 +00:00
ZhangShenao	1a8e9023de	Improvement[Community] Improve `streamlit_callback_handler` (#26373 ) - add decorator for static methods Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-17 21:54:37 +00:00
Christophe Bornet	c6bdd6f482	community: Fix references in link extractors docstrings (#26314 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-17 21:26:25 +00:00
wlleiiwang	2ef4c9466f	community: modify document links for tencent vectordb (#26316 ) - modify document links for create a tencent vectordb database instance. Co-authored-by: wlleiiwang <wlleiiwang@tencent.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-17 21:11:10 +00:00
Erick Friis	194adc485c	docs: pypi readme image links (#26590 )	2024-09-17 20:41:34 +00:00
Nuno Campos	5fc44989bf	core[patch]: Fix "argument of type 'NoneType' is not iterable" error in LangChainTracer (#26576 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-17 10:29:46 -07:00
RUO	0a177ec2cc	community: Enhance MongoDBLoader with flexible metadata and optimized field extraction (#23376 ) ### Description: This pull request significantly enhances the MongodbLoader class in the LangChain community package by adding robust metadata customization and improved field extraction capabilities. The updated class now allows users to specify additional metadata fields through the metadata_names parameter, enabling the extraction of both top-level and deeply nested document attributes as metadata. This flexibility is crucial for users who need to include detailed contextual information without altering the database schema. Moreover, the include_db_collection_in_metadata flag offers optional inclusion of database and collection names in the metadata, allowing for even greater customization depending on the user's needs. The loader's field extraction logic has been refined to handle missing or nested fields more gracefully. It now employs a safe access mechanism that avoids the KeyError previously encountered when a specified nested field was absent in a document. This update ensures that the loader can handle diverse and complex data structures without failure, making it more resilient and user-friendly. ### Issue: This pull request addresses a critical issue where the MongodbLoader class in the LangChain community package could throw a KeyError when attempting to access nested fields that may not exist in some documents. The previous implementation did not handle the absence of specified nested fields gracefully, leading to runtime errors and interruptions in data processing workflows. This enhancement ensures robust error handling by safely accessing nested document fields, using default values for missing data, thus preventing KeyError and ensuring smoother operation across various data structures in MongoDB. This improvement is crucial for users working with diverse and complex data sets, ensuring the loader can adapt to documents with varying structures without failing. ### Dependencies: Requires motor for asynchronous MongoDB interaction. ### Twitter handle: N/A ### Add tests and docs Tests: Unit tests have been added to verify that the metadata inclusion toggle works as expected and that the field extraction correctly handles nested fields. Docs: An example notebook demonstrating the use of the enhanced MongodbLoader is included in the docs/docs/integrations directory. This notebook includes setup instructions, example usage, and outputs. (Here is the notebook link : [colab link](https://colab.research.google.com/drive/1tp7nyUnzZa3dxEFF4Kc3KS7ACuNF6jzH?usp=sharing)) Lint and test Before submitting, I ran make format, make lint, and make test as per the contribution guidelines. All tests pass, and the code style adheres to the LangChain standards. ```python import unittest from unittest.mock import patch, MagicMock import asyncio from langchain_community.document_loaders.mongodb import MongodbLoader class TestMongodbLoader(unittest.TestCase): def setUp(self): """Setup the MongodbLoader test environment by mocking the motor client and database collection interactions.""" # Mocking the AsyncIOMotorClient self.mock_client = MagicMock() self.mock_db = MagicMock() self.mock_collection = MagicMock() self.mock_client.get_database.return_value = self.mock_db self.mock_db.get_collection.return_value = self.mock_collection # Initialize the MongodbLoader with test data self.loader = MongodbLoader( connection_string="mongodb://localhost:27017", db_name="testdb", collection_name="testcol" ) @patch('langchain_community.document_loaders.mongodb.AsyncIOMotorClient', return_value=MagicMock()) def test_constructor(self, mock_motor_client): """Test if the constructor properly initializes with the correct database and collection names.""" loader = MongodbLoader( connection_string="mongodb://localhost:27017", db_name="testdb", collection_name="testcol" ) self.assertEqual(loader.db_name, "testdb") self.assertEqual(loader.collection_name, "testcol") def test_aload(self): """Test the aload method to ensure it correctly queries and processes documents.""" # Setup mock data and responses for the database operations self.mock_collection.count_documents.return_value = asyncio.Future() self.mock_collection.count_documents.return_value.set_result(1) self.mock_collection.find.return_value = [ {"_id": "1", "content": "Test document content"} ] # Run the aload method and check responses loop = asyncio.get_event_loop() results = loop.run_until_complete(self.loader.aload()) self.assertEqual(len(results), 1) self.assertEqual(results[0].page_content, "Test document content") def test_construct_projection(self): """Verify that the projection dictionary is constructed correctly based on field names.""" self.loader.field_names = ['content', 'author'] self.loader.metadata_names = ['timestamp'] expected_projection = {'content': 1, 'author': 1, 'timestamp': 1} projection = self.loader._construct_projection() self.assertEqual(projection, expected_projection) if __name__ == '__main__': unittest.main() ``` ### Additional Example for Documentation Sample Data: ```json [ { "_id": "1", "title": "Artificial Intelligence in Medicine", "content": "AI is transforming the medical industry by providing personalized medicine solutions.", "author": { "name": "John Doe", "email": "john.doe@example.com" }, "tags": ["AI", "Healthcare", "Innovation"] }, { "_id": "2", "title": "Data Science in Sports", "content": "Data science provides insights into player performance and strategic planning in sports.", "author": { "name": "Jane Smith", "email": "jane.smith@example.com" }, "tags": ["Data Science", "Sports", "Analytics"] } ] ``` Example Code: ```python loader = MongodbLoader( connection_string="mongodb://localhost:27017", db_name="example_db", collection_name="articles", filter_criteria={"tags": "AI"}, field_names=["title", "content"], metadata_names=["author.name", "author.email"], include_db_collection_in_metadata=True ) documents = loader.load() for doc in documents: print("Page Content:", doc.page_content) print("Metadata:", doc.metadata) ``` Expected Output: ``` Page Content: Artificial Intelligence in Medicine AI is transforming the medical industry by providing personalized medicine solutions. Metadata: {'author_name': 'John Doe', 'author_email': 'john.doe@example.com', 'database': 'example_db', 'collection': 'articles'} ``` Thank you. --- Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-09-17 10:23:17 -04:00
ccurme	900115a568	community: release 0.3 (#26472 )	2024-09-13 22:55:56 +00:00
Erick Friis	c2a3021bb0	multiple: pydantic 2 compatibility, v0.3 (#26443 ) Signed-off-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Dan O'Donovan <dan.odonovan@gmail.com> Co-authored-by: Tom Daniel Grande <tomdgrande@gmail.com> Co-authored-by: Grande <Tom.Daniel.Grande@statsbygg.no> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Tomaz Bratanic <bratanic.tomaz@gmail.com> Co-authored-by: ZhangShenao <15201440436@163.com> Co-authored-by: Friso H. Kingma <fhkingma@gmail.com> Co-authored-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: Morgante Pell <morgantep@google.com>	2024-09-13 14:38:45 -07:00
Bagatur	e32adad17a	community[patch]: Release 0.2.17 (#26432 )	2024-09-13 09:56:39 -07:00
Harrison Chase	28ad244e77	community, openai: support nested dicts (#26414 ) needed for thinking tokens --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-12 21:47:47 -07:00
Nuno Campos	212c688ee0	core[minor]: Remove serialized manifest from tracing requests for non-llm runs (#26270 ) - This takes a long time to compute, isn't used, and currently called on every invocation of every chain/retriever/etc	2024-09-10 12:58:24 -07:00
Christophe Bornet	9cf7ae0a52	community: Add docstring for HtmlLinkExtractor (#26213 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-10 00:27:37 +00:00
Christophe Bornet	56580b5fff	community: Add docstring for GLiNERLinkExtractor (#26218 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-10 00:27:23 +00:00
Christophe Bornet	e235a572a0	community: Add docstring for KeybertLinkExtractor (#26210 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-10 00:26:29 +00:00
Tomaz Bratanic	181e4fc0e0	Add session expired retry to neo4j graph (#26182 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-08 11:40:43 -07:00
Sebastian Cherny	b3c7ed4913	Adding bind_tools in ChatOctoAI (#26168 ) The object extends from langchain_community.chat_models.openai.ChatOpenAI which doesn't have `bind_tools` defined. I tried extending from `langchain_openai.ChatOpenAI` in https://github.com/langchain-ai/langchain/pull/25975 but that PR got closed because this is not correct. So adding our own `bind_tools` (which for now copying from ChatOpenAI is good enough) will solve the tool calling issue we are having now. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-08 18:38:43 +00:00
William FH	262e19b15d	infra: Clear cache for env-var checks (#26073 )	2024-09-06 21:29:29 +00:00
Bagatur	1241a004cb	fmt	2024-09-04 11:44:59 -07:00
Bagatur	4ba14ae9e5	fmt	2024-09-04 11:34:59 -07:00
Bagatur	dba308447d	fmt	2024-09-04 11:28:04 -07:00
ZhangShenao	c812237217	Improvement[Community] Improve args description in api doc of `DocArrayInMemorySearch` (#26024 ) - Add missing arg - Remove redundant arg	2024-09-04 09:26:26 -04:00
Tom Daniel Grande	0207dc1431	community: delta in openai choice can be None, creates handler for that (#25954 ) Thank you for contributing to LangChain! - [X ] PR title - [X ] PR message: Description: adds a handler for when delta choice is None Issue: Fixes #25951 Dependencies: Not applicable - [ X] Add tests and docs: Not applicable - [X ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Co-authored-by: Grande <Tom.Daniel.Grande@statsbygg.no> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-03 20:30:03 +00:00
Bagatur	0af447c90b	community[patch]: Release 0.2.16 (#25982 )	2024-09-03 18:34:18 +00:00
Dan O'Donovan	f49da71e87	community[patch]: change default Neo4j username/password (#25226 ) Description: Change the default Neo4j username/password (when not supplied as environment variable or in code) from `None` to `""`. Neo4j has an option to [disable auth](https://neo4j.com/docs/operations-manual/current/configuration/configuration-settings/#config_dbms.security.auth_enabled) which is helpful when developing. When auth is disabled, the username / password through the `neo4j` module should be `""` (ie an empty string). Empty strings get marked as false in `langchain_core.utils.env.get_from_dict_or_env` -- changing this code / behaviour would have a wide impact and is undesirable. In order to both _allow_ access to Neo4j with auth disabled and _not_ impact `langchain_core` this patch is presented. The downside would be that if a user forgets to set NEO4J_USERNAME or NEO4J_PASSWORD they would see an invalid credentials error rather than missing credentials error. This could be mitigated but would result in a less elegant patch! Issue: Fix issue where langchain cannot communicate with Neo4j if Neo4j auth is disabled.	2024-09-03 11:24:18 -07:00
Jorge Piedrahita Ortiz	c7154a4045	community: sambastudio llms api v2 support (#25063 ) - Description: SambaStudio GenericV2 API support	2024-09-03 10:18:15 -04:00
ZhangShenao	8d784db107	docs: Add missing args in api doc of `WebResearchRetriever` (#25949 ) Add missing args in api doc of `WebResearchRetriever`	2024-09-03 01:24:23 -07:00
Isaac Francisco	4833375200	community[patch]: added option to change how duckduckgosearchresults tool converts api outputs into string (#22580 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-09-02 22:42:19 +00:00
JonZeolla	78ff51ce83	community[patch]: update the default hf bge embeddings (#22627 ) Description: This updates the langchain_community > huggingface > default bge embeddings ([the current default recommends this change](https://huggingface.co/BAAI/bge-large-en)) Issue: None Dependencies: None Twitter handle: @jonzeolla --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-09-02 22:10:21 +00:00
Leonid Ganeline	150251fd49	docs: `integrations` reference updates 13 (#25711 ) Added missed provider pages and links. Fixed inconsistent formatting. Added arxiv references to docstirngs. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-09-02 22:08:50 +00:00
Yash Parmar	51dae57357	community[minor]: jina search tools integrating (jina reader) (#23339 ) - PR title: "community: add Jina Search tool" - Description: Added the Jina Search tool for querying the Jina search API. This includes the implementation of the JinaSearchAPIWrapper and the JinaSearch tool, along with a Jupyter notebook example demonstrating its usage. - Issue: N/A - Dependencies: N/A - Twitter handle: [Twitter handle](https://x.com/yashp3020?t=7wM0gQ7XjGciFoh9xaBtqA&s=09) - [x] Add tests and docs: If you're adding a new integration, please include 1. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-09-02 14:52:14 -07:00
Qingchuan Hao	3145995ed9	community[patch]: BingSearchResults returns raw snippets as artifact(#23304 ) Returns an array of results which is more specific and easier for later use. Tested locally: ``` resp = tool.invoke("what's the weather like in Shanghai?") for item in resp: print(item) ``` returns ``` {'snippet': '<b>Shanghai</b>, <b>Shanghai</b>, China <b>Weather</b> Forecast, with current conditions, wind, air quality, and what to expect for the next 3 days.', 'title': 'Shanghai, Shanghai, China Weather Forecast \| AccuWeather', 'link': 'https://www.accuweather.com/en/cn/shanghai/106577/weather-forecast/106577'} {'snippet': '5. 99 / 87 °F. 6. 99 / 86 °F. 7. Detailed forecast for 14 days. Need some help? Current <b>weather</b> <b>in Shanghai</b> and forecast for today, tomorrow, and next 14 days.', 'title': 'Weather for Shanghai, Shanghai Municipality, China - timeanddate.com', 'link': 'https://www.timeanddate.com/weather/china/shanghai'} {'snippet': '<b>Shanghai</b> - <b>Weather</b> warnings issued 14-day forecast. <b>Weather</b> warnings issued. Forecast - <b>Shanghai</b>. Day by day forecast. Last updated Friday at 01:05. Tonight, ... Temperature feels <b>like</b> 34 ...', 'title': 'Shanghai - BBC Weather', 'link': 'https://www.bbc.com/weather/1796236'} {'snippet': 'Current <b>weather</b> <b>in Shanghai</b>, <b>Shanghai</b>, China. Check current conditions <b>in Shanghai</b>, <b>Shanghai</b>, China with radar, hourly, and more.', 'title': 'Shanghai, Shanghai, China Current Weather \| AccuWeather', 'link': 'https://www.accuweather.com/en/cn/shanghai/106577/current-weather/106577'} 13-Day Beijing, Xi'an, Chengdu, <b>Shanghai</b> Chinese Language and Culture Immersion Tour. <b>Shanghai</b> in September. Average daily temperature range: 23–29°C (73–84°F) Average rainy days: 10. Average sunny days: 20. September ushers in pleasant autumn <b>weather</b>, making it one of the best months to visit <b>Shanghai</b>. <b>Weather</b> in <b>Shanghai</b>: Climate, Seasons, and Average Monthly Temperature. <b>Shanghai</b> has a subtropical maritime monsoon climate, meaning high humidity and lots of rain. Hot muggy summers, cool falls, cold winters with little snow, and warm springs are the norm. Midsummer through early fall is the best time to visit <b>Shanghai</b>. <b>Shanghai</b>, <b>Shanghai</b>, China <b>Weather</b> Forecast, with current conditions, wind, air quality, and what to expect for the next 3 days. 1165. 45.9. 121. Winter, from December to February, is quite cold: the average January temperature is 5 °C (41 °F). There may be cold periods, with highs around 5 °C (41 °F) or below, and occasionally, even snow can fall. The temperature dropped to -10 °C (14 °F) in January 1977 and to -7 °C (19.5 °F) in January 2016. 5. 99 / 87 °F. 6. 99 / 86 °F. 7. Detailed forecast for 14 days. Need some help? Current <b>weather</b> in <b>Shanghai</b> and forecast for today, tomorrow, and next 14 days. Everything you need to know about today's <b>weather</b> in <b>Shanghai</b>, <b>Shanghai</b>, China. High/Low, Precipitation Chances, Sunrise/Sunset, and today's Temperature History. <b>Shanghai</b> - <b>Weather</b> warnings issued 14-day forecast. <b>Weather</b> warnings issued. Forecast - <b>Shanghai</b>. Day by day forecast. Last updated Friday at 01:05. Tonight, ... Temperature feels <b>like</b> 34 ... <b>Shanghai</b> 14 Day Extended Forecast. <b>Weather</b> Today <b>Weather</b> Hourly 14 Day Forecast Yesterday/Past <b>Weather</b> Climate (Averages) Currently: 84 °F. Passing clouds. (<b>Weather</b> station: <b>Shanghai</b> Hongqiao Airport, China). See more current <b>weather</b>. Current <b>weather</b> in <b>Shanghai</b>, <b>Shanghai</b>, China. Check current conditions in <b>Shanghai</b>, <b>Shanghai</b>, China with radar, hourly, and more. <b>Shanghai</b> <b>Weather</b> Forecasts. <b>Weather Underground</b> provides local & long-range <b>weather</b> forecasts, weatherreports, maps & tropical <b>weather</b> conditions for the <b>Shanghai</b> area. ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-09-02 21:11:32 +00:00
Alexander KIRILOV	6a8f8a56ac	community[patch]: added content_columns option to CSVLoader (#23809 ) Description: Adding a new option to the CSVLoader that allows us to implicitly specify the columns that are used for generating the Document content. Currently these are implicitly set as "all fields not part of the metadata_columns". In some cases however it is useful to have a field both as a metadata and as part of the document content.	2024-09-02 20:25:53 +00:00
Bruno Alvisio	ab527027ac	community: Resolve refs recursively when generating openai_fn from OpenAPI spec (#19002 ) - Description: This PR is intended to improve the generation of payloads for OpenAI functions when converting from an OpenAPI spec file. The solution is to recursively resolve `$refs`. Currently when converting OpenAPI specs into OpenAI functions using `openapi_spec_to_openai_fn`, if the schemas have nested references, the generated functions contain `$ref` that causes the LLM to generate payloads with an incorrect schema. For example, for the for OpenAPI spec: ``` text = """ { "openapi": "3.0.3", "info": { "title": "Swagger Petstore - OpenAPI 3.0", "termsOfService": "http://swagger.io/terms/", "contact": { "email": "apiteam@swagger.io" }, "license": { "name": "Apache 2.0", "url": "http://www.apache.org/licenses/LICENSE-2.0.html" }, "version": "1.0.11" }, "externalDocs": { "description": "Find out more about Swagger", "url": "http://swagger.io" }, "servers": [ { "url": "https://petstore3.swagger.io/api/v3" } ], "tags": [ { "name": "pet", "description": "Everything about your Pets", "externalDocs": { "description": "Find out more", "url": "http://swagger.io" } }, { "name": "store", "description": "Access to Petstore orders", "externalDocs": { "description": "Find out more about our store", "url": "http://swagger.io" } }, { "name": "user", "description": "Operations about user" } ], "paths": { "/pet": { "post": { "tags": [ "pet" ], "summary": "Add a new pet to the store", "description": "Add a new pet to the store", "operationId": "addPet", "requestBody": { "description": "Create a new pet in the store", "content": { "application/json": { "schema": { "$ref": "#/components/schemas/Pet" } } }, "required": true }, "responses": { "200": { "description": "Successful operation", "content": { "application/json": { "schema": { "$ref": "#/components/schemas/Pet" } } } } } } } }, "components": { "schemas": { "Tag": { "type": "object", "properties": { "id": { "type": "integer", "format": "int64" }, "model_type": { "type": "number" } } }, "Category": { "type": "object", "required": [ "model", "year", "age" ], "properties": { "year": { "type": "integer", "format": "int64", "example": 1 }, "model": { "type": "string", "example": "Ford" }, "age": { "type": "integer", "example": 42 } } }, "Pet": { "required": [ "name" ], "type": "object", "properties": { "id": { "type": "integer", "format": "int64", "example": 10 }, "name": { "type": "string", "example": "doggie" }, "category": { "$ref": "#/components/schemas/Category" }, "tags": { "type": "array", "items": { "$ref": "#/components/schemas/Tag" } }, "status": { "type": "string", "description": "pet status in the store", "enum": [ "available", "pending", "sold" ] } } } } } } """ ``` Executing: ``` spec = OpenAPISpec.from_text(text) pet_openai_functions, pet_callables = openapi_spec_to_openai_fn(spec) response = model.invoke("Create a pet named Scott", functions=pet_openai_functions) ``` `pet_open_functions` contains unresolved `$refs`: ``` [ { "name": "addPet", "description": "Add a new pet to the store", "parameters": { "type": "object", "properties": { "json": { "properties": { "id": { "type": "integer", "schema_format": "int64", "example": 10 }, "name": { "type": "string", "example": "doggie" }, "category": { "ref": "#/components/schemas/Category" }, "tags": { "items": { "ref": "#/components/schemas/Tag" }, "type": "array" }, "status": { "type": "string", "enum": [ "available", "pending", "sold" ], "description": "pet status in the store" } }, "type": "object", "required": [ "name", "photoUrls" ] } } } } ] ``` and the generated JSON has an incorrect schema (e.g. category is filled with `id` and `name` instead of `model`, `year` and `age`: ``` { "id": 1, "name": "Scott", "category": { "id": 1, "name": "Dogs" }, "tags": [ { "id": 1, "name": "tag1" } ], "status": "available" } ``` With this change, the generated JSON by the LLM becomes, `pet_openai_functions` becomes: ``` [ { "name": "addPet", "description": "Add a new pet to the store", "parameters": { "type": "object", "properties": { "json": { "properties": { "id": { "type": "integer", "schema_format": "int64", "example": 10 }, "name": { "type": "string", "example": "doggie" }, "category": { "properties": { "year": { "type": "integer", "schema_format": "int64", "example": 1 }, "model": { "type": "string", "example": "Ford" }, "age": { "type": "integer", "example": 42 } }, "type": "object", "required": [ "model", "year", "age" ] }, "tags": { "items": { "properties": { "id": { "type": "integer", "schema_format": "int64" }, "model_type": { "type": "number" } }, "type": "object" }, "type": "array" }, "status": { "type": "string", "enum": [ "available", "pending", "sold" ], "description": "pet status in the store" } }, "type": "object", "required": [ "name" ] } } } } ] ``` and the JSON generated by the LLM is: ``` { "id": 1, "name": "Scott", "category": { "year": 2022, "model": "Dog", "age": 42 }, "tags": [ { "id": 1, "model_type": 1 } ], "status": "available" } ``` which has the intended schema. - Twitter handle:: @brunoalvisio --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-09-02 13:17:39 -07:00
Luiz F. G. dos Santos	36bbdc776e	community: fix bug to support for `file_search` tool from OpenAI (#25927 ) - Description: The function `_is_assistants_builtin_tool` didn't had support for `file_search` from OpenAI. This was creating conflict and blocking the usage of such. OpenAI Assistant changed from`retrieval` to `file_search`. The following code ``` agent = OpenAIAssistantV2Runnable.create_assistant( name="Data Analysis Assistant", instructions=prompt[0].content, tools={'type': 'file_search'}, model=self.chat_config.connection.deployment_name, client=llm, as_agent=True, tool_resources={ "file_search": { "vector_store_ids": vector_store_id } } ) ``` Was throwing the following error ``` Traceback (most recent call last): File "/Users/l.guedesdossantos/Documents/codes/shellai-nlp-backend/app/chat/chat_decorators.py", line 500, in get_response return await super().get_response(post, context) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/l.guedesdossantos/Documents/codes/shellai-nlp-backend/app/chat/chat_decorators.py", line 96, in get_response response = await self.inner_chat.get_response(post, context) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/l.guedesdossantos/Documents/codes/shellai-nlp-backend/app/chat/chat_decorators.py", line 96, in get_response response = await self.inner_chat.get_response(post, context) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/l.guedesdossantos/Documents/codes/shellai-nlp-backend/app/chat/chat_decorators.py", line 96, in get_response response = await self.inner_chat.get_response(post, context) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [Previous line repeated 4 more times] File "/Users/l.guedesdossantos/Documents/codes/shellai-nlp-backend/app/chat/azure_open_ai_chat.py", line 147, in get_response chain = chain_factory.get_chain(prompts, post.conversation.id, overrides, context) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/l.guedesdossantos/Documents/codes/shellai-nlp-backend/app/llm_connections/chains.py", line 1324, in get_chain agent = OpenAIAssistantV2Runnable.create_assistant( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/l.guedesdossantos/anaconda3/envs/shell-e/lib/python3.11/site-packages/langchain_community/agents/openai_assistant/base.py", line 256, in create_assistant tools=[_get_assistants_tool(tool) for tool in tools], # type: ignore ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/l.guedesdossantos/anaconda3/envs/shell-e/lib/python3.11/site-packages/langchain_community/agents/openai_assistant/base.py", line 256, in <listcomp> tools=[_get_assistants_tool(tool) for tool in tools], # type: ignore ^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/l.guedesdossantos/anaconda3/envs/shell-e/lib/python3.11/site-packages/langchain_community/agents/openai_assistant/base.py", line 119, in _get_assistants_tool return convert_to_openai_tool(tool) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/l.guedesdossantos/anaconda3/envs/shell-e/lib/python3.11/site-packages/langchain_core/utils/function_calling.py", line 255, in convert_to_openai_tool function = convert_to_openai_function(tool) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/l.guedesdossantos/anaconda3/envs/shell-e/lib/python3.11/site-packages/langchain_core/utils/function_calling.py", line 230, in convert_to_openai_function raise ValueError( ValueError: Unsupported function {'type': 'file_search'} Functions must be passed in as Dict, pydantic.BaseModel, or Callable. If they're a dict they must either be in OpenAI function format or valid JSON schema with top-level 'title' and 'description' keys. ``` With the proposed changes, this is fixed and the function will have support for `file_search`. This was the only place missing the support for `file_search`. Reference doc https://platform.openai.com/docs/assistants/tools/file-search - Twitter handle: luizf0992 --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-09-02 18:21:51 +00:00
xander-art	6cd452d985	Feature/update hunyuan (#25779 ) Description: - Add system templates and user templates in integration testing - initialize the response id field value to request_id - Adjust the default model to hunyuan-pro - Remove the default values of Temperature and TopP - Add SystemMessage all the integration tests have passed. 1、Execute integration tests for the first time <img width="1359" alt="71ca77a2-e9be-4af6-acdc-4d665002bd9b" src="https://github.com/user-attachments/assets/9298dc3a-aa26-4bfa-968b-c011a4e699c9"> 2、Run the integration test a second time <img width="1501" alt="image" src="https://github.com/user-attachments/assets/61335416-4a67-4840-bb89-090ba668e237"> Issue: None Dependencies: None Twitter handle: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-09-02 12:55:08 +00:00
Yuwen Hu	566e9ba164	community: add Intel GPU support to `ipex-llm` llm integration (#22458 ) Description: [IPEX-LLM](https://github.com/intel-analytics/ipex-llm) is a PyTorch library for running LLM on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) with very low latency. This PR adds Intel GPU support to `ipex-llm` llm integration. Dependencies: `ipex-llm` Contribution maintainer: @ivy-lv11 @Oscilloscope98 tests and docs: - Add: langchain/docs/docs/integrations/llms/ipex_llm_gpu.ipynb - Update: langchain/docs/docs/integrations/llms/ipex_llm_gpu.ipynb - Update: langchain/libs/community/tests/llms/test_ipex_llm.py --------- Co-authored-by: ivy-lv11 <zhicunlv@gmail.com>	2024-09-02 08:49:08 -04:00
Emmanuel Leroy	654da27255	improve llamacpp embeddings (#12972 ) - Description: Improve llamacpp embedding class by adding the `device` parameter so it can be passed to the model and used with `gpu`, `cpu` or Apple metal (`mps`). Improve performance by making use of the bulk client api to compute embeddings in batches. - Dependencies: none - Tag maintainer: @hwchase17 --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-31 18:27:59 +00:00
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟	64b62f6ae4	community[neo4j_vector]: make embedding dimension check optional (#25737 ) Description: Starting from Neo4j 5.23 (22 August 2024), with vector-2.0 indexes, `vector.dimensions` is not required to be set, which will cause it the key not exist error in index config if it's not set. Since the existence of vector.dimensions will only ensure additional checks, this commit turns embedding dimension check optional, and only do checks when it exists (not None). https://neo4j.com/release-notes/database/neo4j-5/ Twitter handle: @HollowM186 Signed-off-by: Hollow Man <hollowman@opensuse.org> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-31 12:36:20 +00:00
Christophe Bornet	0a752a74cc	community[patch], docs: Add API reference doc for GraphVectorStore (#25751 )	2024-08-30 17:42:00 -07:00
Bagatur	ca1c3bd9c0	community[patch]: bump + fix core dep (#25901 )	2024-08-30 15:54:07 -07:00
mehdiosa	c6f00e6bdc	community: Fix branch not being considered when using GithubFileLoader (#20075 ) - Description: Added `ref` query parameter so data is not loaded only from the default branch but any branch passed --------- Co-authored-by: Osama Mehdi <mehdi@hm.edu> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-30 21:47:11 +00:00
Alex Sherstinsky	617a4e617b	community: Fix a bug in handling kwargs overwrites in Predibase integration, and update the documentation. (#25893 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-30 12:41:42 -07:00
Anush	ade4bfdff1	qdrant: Updated class check in Self-Query Retriever factory (#25877 ) ## Description - Updates the self-query retriever factory to check for the new Qdrant vector store class. i.e. `langchain_qdrant.QdrantVectorstore`. - Deprecates `QdrantSparseVectorRetriever`, since the vector store implementation natively supports it now. Resolves #25798	2024-08-30 12:11:55 -04:00
Djordje	862ef32fdc	community: Fixed infinity embeddings async request (#25882 ) Description: Fix async infinity embeddings Issue: #24942 @baskaryan, @ccurme	2024-08-30 12:10:34 -04:00
rainsubtime	f75d5621e2	community:Fix a bug of LLM in moonshot (#25878 ) - Description: When useing LLM integration moonshot,it's occurring error "'Moonshot' object has no attribute '_client'",it's because of the "_client" that is private in pydantic v1.0 so that we can't use it.I turn "_client" into "client" , the error to be resolved! - Issue: the issue #24390 - Dependencies: none - Twitter handle: @Rainsubtime - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Co-authored-by: Cyue <Cyue_work2001@163.com>	2024-08-30 16:09:39 +00:00
ZhangShenao	fd0f147df3	Improvement[Community] Add tool-calling test case for `ChatZhipuAI` (#25884 ) - Add tool-calling test case for `ChatZhipuAI`	2024-08-30 12:05:43 -04:00
默奕	6377185291	add neo4j query constructor for self query (#25288 ) - [x] PR title - community: add neo4j query constructor for self query - [x] PR message - Description: adding a Neo4jTranslator so that the Neo4j vector database can use SelfQueryRetriever - Issue: this issue had been raised before in #19748 - Dependencies: none. - Twitter handle: @moyi_dang - p.s. I have not added the query constructor in BUILTIN_TRANSLATORS in this PR, I want to make changes to only one package at a time. - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-30 14:54:33 +00:00
Erick Friis	09b04c7e3b	"community: release 0.2.15" (#25867 )	2024-08-30 02:18:48 +00:00
Erick Friis	f7e62754a1	community: undo azure_ad_access_token breaking change (#25818 )	2024-08-30 02:06:14 +00:00
Kyle Winkelman	201bdf7148	community: Cap AzureOpenAIEmbeddings chunk_size at 2048 instead of 16. (#25852 ) Description: Within AzureOpenAIEmbeddings there is a validation to cap `chunk_size` at 16. The value of 16 is either an old limitation or was erroneously chosen. I have checked all of the `preview` and `stable` releases to ensure that the `embeddings` endpoint can handle 2048 entries [Azure/azure-rest-api-specs](https://github.com/Azure/azure-rest-api-specs/tree/main/specification/cognitiveservices/data-plane/AzureOpenAI/inference). I have also found many locations that confirm this limit should be 2048: - https://learn.microsoft.com/en-us/azure/ai-services/openai/reference#embeddings - https://learn.microsoft.com/en-us/azure/ai-services/openai/quotas-limits Issue: fixes #25462	2024-08-29 16:48:04 +00:00
Allan Ascencio	a8af396a82	added octoai test (#21793 ) - [ ] PR title: community: add tests for ChatOctoAI - [ ] PR message: Description: Added unit tests for the ChatOctoAI class in the community package to ensure proper validation and default values. These tests verify the correct initialization of fields, the handling of missing required parameters, and the proper setting of aliases. Issue: N/A Dependencies: None --------- Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-08-29 15:07:27 +00:00
Param Singh	69f9acb60f	premai[patch]: Standardize premai params (#21513 ) Thank you for contributing to LangChain! community:premai[patch]: standardize init args - updated `temperature` with Pydantic Field, updated the unit test. - updated `max_tokens` with Pydantic Field, updated the unit test. - updated `max_retries` with Pydantic Field, updated the unit test. Related to #20085 --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-29 11:01:28 -04:00
Guangdong Liu	fcf9230257	community(sparkllm): Add function call support in Sparkllm chat model. (#20607 ) - Description: Add function call support in Sparkllm chat model. Related documents https://www.xfyun.cn/doc/spark/Web.html#_2-function-call%E8%AF%B4%E6%98%8E - @baskaryan --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-29 14:38:39 +00:00
Jorge Piedrahita Ortiz	9ac953a948	Community: sambastudio embeddings GenericV2 API support (#25064 ) - Description: SambaStudio GenericV2 API support Minor changes for requests error handling	2024-08-29 09:52:49 -04:00
Sam Jove	bdce9a47d0	community[patch]: callback before yield for _astream (gigachat) (#25834 ) Description: Moves yield to after callback for _astream for gigachat in the community package Issue: #16913 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-29 13:29:28 +00:00
Jinoos Lee	703af9ffe3	Patch enable to use Amazon OpenSearch Serverless(aoss) for Semantic Cache store (#25833 ) - [x] PR title: "community: Patch enable to use Amazon OpenSearch Serverless for Semantic Cache store" - [x] PR message: - Description: OpenSearchSemanticCache class support Amazon OpenSearch Serverless for Semantic Cache store, it's only required to pass auth(http_auth) parameter to initializer - Dependencies: none If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Jinoos Lee <jinoos@amazon.com>	2024-08-29 13:28:22 +00:00
Mikhail Khludnev	a017f49fd3	comminity[patch]: fix #25575 YandexGPTs for _grpc_metadata (#25617 ) it fixes two issues: ### YGPTs are broken #25575 ``` File ....conda/lib/python3.11/site-packages/langchain_community/embeddings/yandex.py:211, in _make_request(self, texts, **kwargs) .. --> 211 res = stub.TextEmbedding(request, metadata=self._grpc_metadata) # type: ignore[attr-defined] AttributeError: 'YandexGPTEmbeddings' object has no attribute '_grpc_metadata' ``` My gut feeling that #23841 is the cause. I have to drop leading underscore from `_grpc_metadata` for quickfix, but I just don't know how to do it _pydantic_ enough. ### minor issue: if we use `api_key`, which is not the best practice the code fails with ``` File ~/git/...../python3.11/site-packages/langchain_community/embeddings/yandex.py:119, in YandexGPTEmbeddings.validate_environment(cls, values) ... AttributeError: 'tuple' object has no attribute 'append' ``` - Added new integration test. But it requires YGPT env available and active account. I don't know how int tests dis\enabled in CI. - added small unit tests with mocks. Should be fine. --------- Co-authored-by: mikhail-khludnev <mikhail_khludnev@rntgroup.com>	2024-08-28 18:48:10 -07:00
Serena Ruan	850bf89e48	community[patch]: Support passing extra params for executing functions in UCFunctionToolkit (#25652 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" Support passing extra params when executing UC functions: The params should be a dictionary with key EXECUTE_FUNCTION_ARG_NAME, the assumption is that the function itself doesn't use such variable name (starting and ending with double underscores), and if it does we raise Exception. If invalid params passing to the execute_statement, we raise Exception as well. - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Signed-off-by: Serena Ruan <serena.rxy@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-28 18:47:32 -07:00
崔浩	3555882a0d	community[patch]: optimize xinference llm import (#25809 ) Thank you for contributing to LangChain! - [ ] PR title: "community: optimize xinference llm import" - [ ] PR message: - Description: from xinferece_client import RESTfulClient when there is no importing xinference. - Dependencies: xinferece_client - Why do so: the total xinference(pip install xinference[all]) is too heavy for installing, let alone it is useless for langchain user except RESTfulClient. The modification has maintained consistency with the xinference embeddings [embeddings/xinference](../blob/master/libs/community/langchain_community/embeddings/xinference.py#L89).	2024-08-29 01:41:43 +00:00
ccurme	afe8ccaaa6	community[patch]: Add ID field back to Azure AI Search results (#25828 ) Commandeering https://github.com/langchain-ai/langchain/pull/23243 as maintainers don't have ability to modify that PR. Fixes https://github.com/langchain-ai/langchain/issues/22827 --------- Co-authored-by: Ming Quah <fleetadmiralbutter@icloud.com>	2024-08-28 17:56:50 -04:00
Erick Friis	5db6c6d96d	community: release 0.2.14 (#25822 )	2024-08-28 19:05:53 +00:00
Cillian Berragan	754f3c41f9	community: add score to PineconeHybridSearchRetriever (#25781 ) Description: Adds the 'score' returned by Pinecone to the `PineconeHybridSearchRetriever` list of returned Documents. There is currently no way to return the score when using Pinecone hybrid search, so in this PR I include it by default. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-28 13:11:06 +00:00
ZhangShenao	3f1d652f15	Improvement[Community] Improve api doc for `PineconeHybridSearchRetriever` (#25803 ) - Complete missing args in api doc	2024-08-28 08:38:56 -04:00
Moritz Schlager	555f97becb	community[patch]: fix model initialization bug for deepinfra (#25727 ) ### Description adds an init method to ChatDeepInfra to set the model_name attribute accordings to the argument ### Issue currently, the model_name specified by the user during initialization of the ChatDeepInfra class is never set. Therefore, it always chooses the default model (meta-llama/Llama-2-70b-chat-hf, however probably since this is deprecated it always uses meta-llama/Llama-3-70b-Instruct). We stumbled across this issue and fixed it as proposed in this pull request. Feel free to change the fix according to your coding guidelines and style, this is just a proposal and we want to draw attention to this problem. ### Dependencies no additional dependencies required Feel free to contact me or @timo282 and @finitearth if you have any questions. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-08-28 02:02:35 -07:00
Bagatur	b0ac6fe8d3	community[patch]: Release 0.2.13 (#25806 )	2024-08-28 08:57:49 +00:00
zysoong	25a6790e1a	community[patch]: Minor Improvement of extract hyperlinks tool output (#25728 ) Description: Make the hyperlink only appear once in the extract_hyperlinks tool output. (for some websites output contains meaningless '#' hyperlinks multiple times which will extend the tokens of context window without any advantage) Issue: None Dependencies: None	2024-08-28 08:02:40 +00:00
Isaac Francisco	d5ddaac1fc	docs minor fix (#25794 )	2024-08-28 04:14:36 +00:00
Tomaz Bratanic	f359e6b0a5	Add mmr to neo4j vector (#25765 )	2024-08-27 08:55:19 -04:00
Luis Valencia	99f9a664a5	community: Azure Search Vector Store is missing Access Token Authentication (#24330 ) Added Azure Search Access Token Authentication instead of API KEY auth. Fixes Issue: https://github.com/langchain-ai/langchain/issues/24263 Dependencies: None Twitter: @levalencia @baskaryan Could you please review? First time creating a PR that fixes some code. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-26 15:41:50 -07:00
ZhangShenao	44e3e2391c	Improvement[Community] Improve methods in `IMessageChatLoader` (#25746 ) - Add @staticmethod to static methods in `IMessageChatLoader`. - Format args name.	2024-08-26 14:20:22 -04:00
maang-h	a566a15930	Fix MoonshotChat instantiate with alias (#25755 ) - Description: - Fix `MoonshotChat` instantiate with alias - Add `MoonshotChat` to `__init__.py` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-26 17:33:22 +00:00
Ashvin	af3b3a4474	Update endpoint for AzureMLEndpointApiType class. (#25725 ) This addresses the issue mentioned in #25702 I have updated the endpoint used in validating the endpoint API type in the AzureMLBaseEndpoint class from `/v1/completions` to `/completions` and `/v1/chat/completions` to `/chat/completions`. Co-authored-by: = <=>	2024-08-26 08:50:02 -04:00
Dristy Srivastava	7205057c3e	[Community][minor]: Added langchain_version while calling discover API (#24428 ) - Description: Added langchain version while calling discover API during both ingestion and retrieval - Issue: NA - Dependencies: NA - Tests: NA - Docs NA --------- Co-authored-by: dristy.cd <dristy@clouddefense.io>	2024-08-26 08:47:48 -04:00
Dristy Srivastava	fbb4761199	[Community][minor]: Updating source path, and file path for SharePoint loader in PebbloSafeLoader (#25592 ) - Description: Updating source path and file path in Pebblo safe loader for SharePoint apps during loading - Issue: NA - Dependencies: NA - Tests: NA - Docs NA --------- Co-authored-by: dristy.cd <dristy@clouddefense.io>	2024-08-26 08:38:40 -04:00
Rajendra Kadam	745d1c2b8d	community[minor]: [Pebblo] Fix URL construction in newer Python versions (#25747 ) - PR message: Fix URL construction in newer Python versions - Description: - Update the URL construction logic to use the .value attribute for Routes enum members. - This adjustment resolves an issue where the code worked correctly in Python 3.9 but failed in Python 3.11. - Clean up unused routes. - Issue: NA - Dependencies: NA	2024-08-26 07:27:30 -04:00
Rajendra Kadam	58a98c7d8a	community: [PebbloRetrievalQA] Implemented Async support for prompt APIs (#25748 ) - Description: PebbloRetrievalQA: Implemented Async support for prompt APIs (classification and governance) - Issue: NA - Dependencies: NA	2024-08-26 07:27:05 -04:00
Christophe Bornet	038c287b3a	all: Improve make lint command (#25344 ) * Removed `ruff check --select I` as `I` is already selected and checked in the main `ruff check` command * Added checks for non-empty `PYTHON_FILES` * Run `ruff check` only on `PYTHON_FILES` Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-23 18:23:52 -07:00
Erick Friis	f6491ceb7d	community: remove integration test deps (#24460 ) they arent used	2024-08-23 23:25:17 +00:00
Sharmistha S. Gupta	90439b12f6	Added support for Nebula Chat model (#21925 ) Description: Added support for Nebula Chat model in addition to Nebula Instruct Dependencies: N/A Twitter handle: @Symbldotai --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-23 22:34:32 +00:00
Ian	64ace25eb8	<Community>: tidb vector support vector index (#19984 ) This PR introduces adjustments to ensure compatibility with the recently released preview version of [TiDB Serverless Vector Search](https://tidb.cloud/ai), aiming to prevent user confusion. - TiDB Vector now supports vector indexing with cosine and l2 distance strategies, although inner_product remains unsupported. - Changing the distance strategy is currently not supported, so the test cased should be adjusted.	2024-08-23 13:59:23 -04:00
Austin Burdette	f355a98bb6	community:yuan2[patch]: standardize init args (#21462 ) updated stop and request_timeout so they aliased to stop_sequences, and timeout respectively. Added test that both continue to set the same underlying attributes. Related to [20085](https://github.com/langchain-ai/langchain/issues/20085) Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-23 17:56:19 +00:00
Erick Friis	b365ee996b	community: remove unused verify_ssl kwarg from aiohttp request (#25707 ) it's not a valid kwarg in aiohttp request	2024-08-23 17:14:04 +00:00
Ashvin	2cd77a53a3	docs: Add docstrings for CassandraChatMessageHistory class and package namespace function. (#24222 ) - Modified docstring for CassandraChatMessageHistory in libs/community/langchain_community/chat_message_history/cassandra.py. - Added docstring for _package_namespace function in docs/api_reference/create_api_rst.py --------- Co-authored-by: ashvin <ashvin.anilkumar@qburst.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-23 15:49:41 +00:00
Leonid Ganeline	8788a34bfa	community: `NeptuneGraph` fix (#23281 ) Issue: the `service` optional parameter was mentioned but not used. Fix: added this parameter. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-23 15:34:26 +00:00
Djordje	22f9ae489f	community: Opensearch - added score function for similarity_score_threshold (#23928 ) This PR resolves the NotImplemented error for the similarity_score_threshold search type for OpenSearch.	2024-08-23 11:30:04 -04:00
ZhangShenao	b38c83ff93	patch[Community] Optimize methods in several ChatLoaders (#24806 ) There are some static methods in ChatLoaders, try to add @staticmethod decorator for them.	2024-08-23 11:00:41 -04:00
James Espichan Vilca	644e0d3463	Use extend method for embeddings concatenation in mlflow_gateway (#14358 ) ## Description There is a bug in the concatenation of embeddings obtained from MLflow that does not conform to the type hint requested by the function. ``` python def _query(self, texts: List[str]) -> List[List[float]]: ``` It is logical to expect a List[List[float]] for a List[str]. However, the append method encapsulates the response in a global List. To avoid this, the extend method should be used, which will add the embeddings of all strings at the same list level. ## Testing I have tried using OpenAI-ADA to obtain the embeddings, and the result of executing this snippet is as follows: ``` python embeds = await MlflowAIGatewayEmbeddings().aembed_documents(texts=["hi", "how are you?"]) print(embeds) ``` ``` python [[[-0.03512698, -0.020624293, -0.015343423, ...], [-0.021260535, -0.011461929, -0.00033121882, ...]]] ``` When in reality, the expected result should be: ``` python [[-0.03512698, -0.020624293, -0.015343423, ...], [-0.021260535, -0.011461929, -0.00033121882, ...]] ``` The above result complies with the expected type hint: List[List[float]] . As I mentioned, we can achieve that by using the extend method instead of the append method. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-23 14:43:43 +00:00
Christophe Bornet	7f1e444efa	partners: Use simsimd types (#25299 ) The simsimd package [now has types](https://github.com/ashvardanian/SimSIMD/releases/tag/v5.0.0)	2024-08-23 10:41:39 -04:00
clement.l	642f9530cd	community: add supported blockchains to Blockchain Document Loader (#25428 ) - Remove deprecated chains. - Add more supported chains. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-23 14:39:42 +00:00
conjuncts	818267bbc3	community: allow chroma DB delete() to use "where" argument (#19826 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" Description: Simply pass kwargs to allow arguments like "where" to be propagated Issue: Previously, db.delete(where={}) wouldn't work for chroma vectorstores Dependencies: N/A Twitter handle: N/A - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-08-23 10:10:57 -04:00
Kevin Engelke	3c7f12cbf5	community[minor]: Fix missing 'keep_newlines' parameter forward-pass to 'process_pages' function in confluence loader (#20086 ) (#20087 ) - Description: Fixed missing `keep_newlines` parameter forward-pass in confluence-loader - Issue: #20086 - Dependencies: None --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-23 12:59:38 +00:00
Erik Lindgren	583b0449eb	community[patch]: Fix Hybrid Search for non-Databricks managed embeddings (#25590 ) Description: Send both the query and query_embedding to the Databricks index for hybrid search. Issue: When using hybrid search with non-Databricks managed embedding we currently don't pass both the embedding and query_text to the index. Hybrid search requires both of these. This change fixes this issue for both `similarity_search` and `similarity_search_by_vector`. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-23 08:57:13 +00:00
Alejandro Companioni	bcd5842b5d	community[patch]: Updating default PPLX model to supported llama-3.1 model. (#25643 ) # Issue As of late July, Perplexity [no longer supports Llama 3 models](https://docs.perplexity.ai/changelog/introducing-new-and-improved-sonar-models). # Description This PR updates the default model and doc examples to reflect their latest supported model. (Mostly updating the same places changed by #23723.) # Twitter handle `@acompa_` on behalf of the team at Not Diamond. Check us out [here](https://notdiamond.ai). --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-08-23 08:33:30 +00:00
Jakub W.	b865ee49a0	community[patch]: Dynamodb history messages key (#25658 ) - Description: adding the history_messages_key so you don't have to use "History" as a key in langchain	2024-08-23 08:05:28 +00:00
Manuel Jaiczay	1c31234eed	community: fix HuggingFacePipeline pipeline_kwargs (#19920 ) Fix handling of pipeline_kwargs to prioritize class attribute defaults. #19770 Co-authored-by: jaizo <manuel.jaiczay@polygons.at> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-08-22 18:29:46 -04:00
Nobuhiko Otoba	4b63a217c2	"community: Fix GithubFileLoader source code", "docs: Fix GithubFileLoader code sample" (#19943 ) This PR adds tiny improvements to the `GithubFileLoader` document loader and its code sample, addressing the following issues: 1. Currently, the `file_extension` argument of `GithubFileLoader` does not change its behavior at all. 1. The `GithubFileLoader` sample code in `docs/docs/integrations/document_loaders/github.ipynb` does not work as it stands. The respective solutions I propose are the following: 1. Remove `file_extension` argument from `GithubFileLoader`. 1. Specify the branch as `master` (not the default `main`) and rename `documents` as `document`. --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-08-22 18:24:57 -04:00
Nada Amin	ac7b71e0d7	langchain_community.graphs: Neo4JGraph: prop min_size might be None (#23944 ) When I used the Neo4JGraph enhanced_schema=True option, I ran into an error because a prop min_size of None was compared numerically with an int. The fix I applied is similar to the pattern of skipping embeddings elsewhere in the file. Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-22 20:29:52 +00:00
William FH	fad6fc866a	Rm DeepInfra Breakpoint Comment (#25206 ) tbh should rm the print staement too	2024-08-22 14:43:44 -04:00
Eric Pinzur	01ded5e2f9	community: add metadata filter to CassandraGraphVectorStore (#25663 ) - Description: - Added metadata filtering support to `langchain_community.graph_vectorstores.cassandra.CassandraGraphVectorStore` - Also fixed type conversion issues highlighted by mypy. - Dependencies: - `ragstack-ai-knowledge-store 0.2.0` (released July 23, 2024) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-22 14:27:16 -04:00
mschoenb97IL	e499caa9cd	community: Give more context on DeepInfra 500 errors (#25671 ) Description: DeepInfra 500 errors have useful information in the text field that isn't being exposed to the user. I updated the error message to fix this. As an example, this code ``` from langchain_community.chat_models import ChatDeepInfra from langchain_core.messages import HumanMessage model = "meta-llama/Meta-Llama-3-70B-Instruct" deepinfra_api_token = "..." model = ChatDeepInfra(model=model, deepinfra_api_token=deepinfra_api_token) messages = [HumanMessage("All work and no play makes Jack a dull boy\n" * 9000)] response = model.invoke(messages) ``` Currently gives this error: ``` langchain_community.chat_models.deepinfra.ChatDeepInfraException: DeepInfra Server: Error 500 ``` This change would give the following error: ``` langchain_community.chat_models.deepinfra.ChatDeepInfraException: DeepInfra Server error status 500: {"error":{"message":"Requested input length 99009 exceeds maximum input length 8192"}} ```	2024-08-22 10:10:51 -07:00
Rajendra Kadam	4ff2f4499e	community: Refactor PebbloRetrievalQA (#25583 ) Refactor PebbloRetrievalQA - Created `APIWrapper` and moved API logic into it. - Created smaller functions/methods for better readability. - Properly read environment variables. - Removed unused code. - Updated models Issue: NA Dependencies: NA tests: NA	2024-08-22 11:51:21 -04:00
Rajendra Kadam	1f1679e960	community: Refactor PebbloSafeLoader (#25582 ) Refactor PebbloSafeLoader - Created `APIWrapper` and moved API logic into it. - Moved helper functions to the utility file. - Created smaller functions and methods for better readability. - Properly read environment variables. - Removed unused code. Issue: NA Dependencies: NA tests: Updated	2024-08-22 11:46:52 -04:00
maang-h	5e3a321f71	docs: Add ChatZhipuAI tool calling and structured output docstring (#25669 ) - Description: Add `ChatZhipuAI` tool calling and structured output docstring.	2024-08-22 10:34:41 -04:00
Noah Mayerhofer	0091947efd	community: add retry for session expired exception in neo4j (#25660 ) Description: The neo4j driver can raise a SessionExpired error, which is considered a retriable error. If a query fails with a SessionExpired error, this change retries every query once. This change will make the neo4j integration less flaky. Twitter handle: noahmay_	2024-08-22 13:07:36 +00:00
Dristy Srivastava	b002702af6	[Community][minor]: Updating metadata with full_path in SharePoint loader (#25593 ) - Description: Updating metadata for sharepoint loader with full path i.e., webUrl - Issue: NA - Dependencies: NA - Tests: NA - Docs NA Co-authored-by: dristy.cd <dristy@clouddefense.io> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-21 13:10:14 +00:00
Jabir	12e490ea56	Update azuresearch.py (#25577 ) This will allow complextype metadata to be returned. the current implementation throws error when dealing with nested metadata Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-20 12:53:30 +00:00
Erick Friis	e01c6789c4	core,community: add beta decorator to missed GraphVectorStore extensions (#25562 )	2024-08-19 17:29:09 -07:00
maang-h	015ab91b83	community[patch]: Add ToolMessage for ChatZhipuAI (#25547 ) - Description: Add ToolMessage for `ChatZhipuAI` to solve the issue #25490	2024-08-19 11:26:38 -04:00
Mohammad Mohtashim	75c3c81b8c	[Community]: Fix - Open AI Whisper `client.audio.transcriptions` returning Text Object which raises error (#25271 ) - Description: The following [line](`fd546196ef/libs/community/langchain_community/document_loaders/parsers/audio.py (L117)`) in `OpenAIWhisperParser` returns a text object for some odd reason despite the official documentation saying it should return `Transcript` Instance which should have the text attribute. But for the example given in the issue and even when I tried running on my own, I was directly getting the text. The small PR accounts for that. - Issue: : #25218 I was able to replicate the error even without the GenericLoader as shown below and the issue was with `OpenAIWhisperParser` ```python parser = OpenAIWhisperParser(api_key="sk-fxxxxxxxxx", response_format="srt", temperature=0) list(parser.lazy_parse(Blob.from_path('path_to_file.m4a'))) ```	2024-08-19 09:36:42 -04:00
maang-h	32f5147523	docs: Fix QianfanLLMEndpoint and Tongyi input text (#25529 ) - Description: Fix `QianfanLLMEndpoint` and `Tongyi` input text.	2024-08-19 09:23:09 -04:00
ZhangShenao	4255a30f20	Improvement[Community] Improve api doc for `SingleFileFacebookMessengerChatLoader` (#25536 ) Delete redundant args in api doc	2024-08-19 09:00:21 -04:00
ccurme	b83f1eb0d5	core, partners: implement standard tracing params for LLMs (#25410 )	2024-08-16 13:18:09 -04:00
Bagatur	253ceca76a	docs: fix mimetype parser docstring (#25463 )	2024-08-15 16:16:52 -07:00
ccurme	8afbab4cf6	langchain[patch]: deprecate various chains (#25310 ) - [x] NatbotChain: move to community, deprecate langchain version. Update to use `prompt \| llm \| output_parser` instead of LLMChain. - [x] LLMMathChain: deprecate + add langgraph replacement example to API ref - [x] HypotheticalDocumentEmbedder (retriever): update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] FlareChain: update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] ConstitutionalChain: deprecate + add langgraph replacement example to API ref - [x] LLMChainExtractor (document compressor): update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] LLMChainFilter (document compressor): update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] RePhraseQueryRetriever (retriever): update to use `prompt \| llm \| output_parser` instead of LLMChain	2024-08-15 10:49:26 -04:00
ccurme	ba167dc158	community[patch]: update connection string in azure cosmos integration test (#25438 )	2024-08-15 14:07:54 +00:00
Isaac Francisco	966b408634	[docs]: doc loader changes (#25417 )	2024-08-14 19:46:33 -07:00
Werner van der Merwe	1d3f7231b8	fix: typo where github should be gitlab (#25397 ) PR title: "GitLabToolkit: fix typo" - Description: fix typo where GitHub should have been GitLab - Dependencies: None	2024-08-14 18:36:25 +00:00
Bagatur	493e474063	docs: udpated api reference (#25172 ) - Move the API reference into the vercel build - Update api reference organization and styling	2024-08-14 07:00:17 -07:00
ccurme	27690506d0	multiple: update removal targets (#25361 )	2024-08-14 09:50:39 -04:00
Harrison Chase	967b6f21f6	docs: improve document loaders index (#25365 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-14 01:48:48 +00:00
Isaac Francisco	f4ffd692a3	[docs]: standardize doc loader doc strings (#25325 )	2024-08-13 23:18:56 +00:00
Isaac Francisco	e0bbb81d04	[docs]: standardize tool docstrings (#25351 )	2024-08-13 16:10:00 -07:00
thedavgar	9d08369442	community: fix AzureSearch vectorstore asyncronous methods (#24921 ) Description Fix the asyncronous methods to retrieve documents from AzureSearch VectorStore. The previous changes from [this commit](`ffe6ca986e`) create a similar code for the syncronous methods and the asyncronous ones but the asyncronous client return an asyncronous iterator "AsyncSearchItemPaged" as said in the issue #24740. To solve this issue, the syncronous iterators in asyncronous methods where changed to asyncronous iterators. @chrislrobert said in [this comment](https://github.com/langchain-ai/langchain/issues/24740#issuecomment-2254168302) that there was a still a flaw due to `with` blocks that close the client after each call. I removed this `with` blocks in the `async_client` following the same pattern as the sync `client`. In order to close up the connections, a __del__ method is included to gently close up clients once the vectorstore object is destroyed. Issue: #24740 and #24064 Dependencies: No new dependencies for this change Example notebook: I created a notebook just to test the changes work and gives the same results as the syncronous methods for vector and hybrid search. With these changes, the asyncronous methods in the retriever work as well. ![image](https://github.com/user-attachments/assets/697e431b-9d7f-4d0d-b205-59d051ac2b67) Lint and test: Passes the tests and the linter	2024-08-13 14:20:51 -07:00
Fedor Nikolaev	2b15518c5f	community: add args_schema to SearxSearchResults tool (#25350 ) This adds `args_schema` member to `SearxSearchResults` tool. This member is already present in the `SearxSearchRun` tool in the same file. I was having `TypeError: Type is not JSON serializable: AsyncCallbackManagerForToolRun` being thrown in langserve playground when I was using `SearxSearchResults` tool as a part of chain there. This fixes the issue, so the error is not raised anymore. This is a example langserve app that was giving me the error, but it works properly after the proposed fix: ```python #!/usr/bin/env python from fastapi import FastAPI from langchain_core.prompts import ChatPromptTemplate from langchain_core.output_parsers import StrOutputParser from langchain_core.runnables import RunnablePassthrough from langchain_openai import ChatOpenAI from langchain_community.utilities import SearxSearchWrapper from langchain_community.tools.searx_search.tool import SearxSearchResults from langserve import add_routes template = """Answer the question based only on the following context: {context} Question: {question} """ prompt = ChatPromptTemplate.from_template(template) model = ChatOpenAI() s = SearxSearchWrapper(searx_host="http://localhost:8080") search = SearxSearchResults(wrapper=s) search_chain = ( {"context": search, "question": RunnablePassthrough()} \| prompt \| model \| StrOutputParser() ) app = FastAPI() add_routes( app, search_chain, path="/chain", ) if __name__ == "__main__": import uvicorn uvicorn.run(app, host="localhost", port=8000) ```	2024-08-13 18:26:09 +00:00
maang-h	089f5e6cad	Standardize SparkLLM (#25239 ) - Description: Standardize SparkLLM, include: - docs, the issue #24803 - to support stream - update api url - model init arg names, the issue #20085	2024-08-13 09:50:12 -04:00
Chen Xiabin	24155aa1ac	qianfan generate/agenerate with usage_metadata (#25332 )	2024-08-13 09:24:41 -04:00
Erick Friis	2907ab2297	community: release 0.2.12 (#25324 )	2024-08-12 23:30:27 +00:00
Ben Chambers	1adc161642	community: kwargs for CassandraGraphVectorStore (#25300 ) - Description: pass kwargs from CassandraGraphVectorStore to underlying store Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-12 18:01:29 +00:00
ccurme	e77eeee6ee	core[patch]: add standard tracing params for retrievers (#25240 )	2024-08-12 14:51:59 +00:00
Mohammad Mohtashim	9927a4866d	[Community] - Added bind_tools and with_structured_output for ChatZhipuAI (#23887 ) - Description: This PR implements the `bind_tool` functionality for ChatZhipuAI as requested by the user. ChatZhipuAI models support tool calling according to the `OpenAI` tool format, as outlined in their official documentation [here](https://open.bigmodel.cn/dev/api#glm-4). - Issue: ##23868 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-12 14:11:43 +00:00
maang-h	bc60cddc1b	docs: Fix ChatBaichuan, QianfanChatEndpoint, ChatSparkLLM, ChatZhipuAI docs (#25265 ) - Description: Fix some chat models docs, include: - ChatBaichuan - QianfanChatEndpoint - ChatSparkLLM - ChatZhipuAI	2024-08-11 16:23:55 -04:00
ZhangShenao	43deed2a95	Improvement[Embeddings] Add dimension support to `ZhipuAIEmbeddings` (#25274 ) - In the in ` embedding-3 ` and later models of Zhipu AI, it is supported to specify the dimensions parameter of Embedding. Ref: https://bigmodel.cn/dev/api#text_embedding-3 . - Add test case for `embedding-3` model by assigning dimensions.	2024-08-11 16:20:37 -04:00
Eugene Yurtsev	6dd9f053e3	core[patch]: Deprecating beta upsert APIs in vectorstore (#25069 ) This PR deprecates the beta upsert APIs in vectorstore. We'll introduce them in a V2 abstraction instead to keep the existing vectorstore implementations lighter weight. The main problem with the existing APIs is that it's a bit more challenging to implement the correct behavior w/ respect to IDs since ID can be present in both the function signature and as an optional attribute on the document object. But VectorStores that pass the standard tests should have implemented the semantics properly! --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-09 17:17:36 -04:00
Eugene Yurtsev	b6f0174bb9	community[patch],core[patch]: Update EdenaiTool root_validator and add unit test in core (#25233 ) This PR gets rid `root_validators(allow_reuse=True)` logic used in EdenAI Tool in preparation for pydantic 2 upgrade. - add another test to secret_from_env_factory	2024-08-09 15:59:27 +00:00
Eugene Yurtsev	bd6c31617e	community[patch]: Remove more @allow_reuse=True validators (#25236 ) Remove some additional allow_reuse=True usage in @root_validators.	2024-08-09 11:10:27 -04:00
Eugene Yurtsev	6e57aa7c36	community[patch]: Remove usage of @root_validator(allow_reuse=True) (#25235 ) Remove usage of @root_validator(allow_reuse=True)	2024-08-09 10:57:42 -04:00
thiswillbeyourgithub	a2b4c33bd6	community[patch]: FAISS: ValueError mentions normalize_score_fn isntead of relevance_score_fn (#25225 ) Thank you for contributing to LangChain! - [X] PR title: "community: fix valueerror mentions wrong argument missing" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [X] PR message: *Delete this entire checklist* and replace with - Description: when faiss.py has a None relevance_score_fn it raises a ValueError that says a normalize_fn_score argument is needed. Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-09 14:40:29 +00:00
Shivendra Soni	66b7206ab6	community: Add llm-extraction option to FireCrawl Document Loader (#25231 ) Description: This minor PR aims to add `llm_extraction` to Firecrawl loader. This feature is supported on API and PythonSDK, but the langchain loader omits adding this to the response. Twitter handle: [scalable_pizza](https://x.com/scalablepizza) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-09 13:59:10 +00:00
ccurme	3b7437d184	docs: update integration api refs (#25195 ) - [x] toolkits - [x] retrievers (in this repo)	2024-08-09 12:27:32 +00:00
Eugene Yurtsev	98779797fe	community[patch]: Use get_fields adapter for pydantic (#25191 ) Change all usages of __fields__ with get_fields adapter merged into langchain_core. Code mod generated using the following grit pattern: ``` engine marzano(0.1) language python `$X.__fields__` => `get_fields($X)` where { add_import(source="langchain_core.utils.pydantic", name="get_fields") } ```	2024-08-08 14:43:09 -04:00
Rajendra Kadam	663638d6a8	community[minor]: [SharePointLoader] Load extended metadata for the root folder (#24872 ) - Title: [SharePointLoader] Load extended metadata for the root folder - Description: - Ensure extended metadata loads correctly for the root folder. - Cleanup: Refactor SharePointLoader to remove unused fields(`file_id` & `site_id`). - Dependencies: NA - Add tests and docs: NA	2024-08-08 14:39:16 -04:00
Eugene Yurtsev	bf5193bb99	community[patch]: Upgrade pydantic extra (#25185 ) Upgrade to using a literal for specifying the extra which is the recommended approach in pydantic 2. This works correctly also in pydantic v1. ```python from pydantic.v1 import BaseModel class Foo(BaseModel, extra="forbid"): x: int Foo(x=5, y=1) ``` And ```python from pydantic.v1 import BaseModel class Foo(BaseModel): x: int class Config: extra = "forbid" Foo(x=5, y=1) ``` ## Enum -> literal using grit pattern: ``` engine marzano(0.1) language python or { `extra=Extra.allow` => `extra="allow"`, `extra=Extra.forbid` => `extra="forbid"`, `extra=Extra.ignore` => `extra="ignore"` } ``` Resorted attributes in config and removed doc-string in case we will need to deal with going back and forth between pydantic v1 and v2 during the 0.3 release. (This will reduce merge conflicts.) ## Sort attributes in Config: ``` engine marzano(0.1) language python function sort($values) js { return $values.text.split(',').sort().join("\n"); } class_definition($name, $body) as $C where { $name <: `Config`, $body <: block($statements), $values = [], $statements <: some bubble($values) assignment() as $A where { $values += $A }, $body => sort($values), } ```	2024-08-08 17:20:39 +00:00
ololand	249945a572	Update polygon.py for business subscription (#25085 ) For business subscription the status is STOCKSBUSINESS not OK Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-08 15:28:41 +00:00
ogawa	d895db11d6	community[patch]: gpt-4o-2024-08-06 costs (#25164 ) - Description: updated OpenAI cost definitions according to the following: - https://openai.com/api/pricing/ - Twitter handle: `@ogawa65a` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-08 13:22:11 +00:00
maang-h	0ba125c3cd	docs: Standardize QianfanLLMEndpoint LLM (#25139 ) - Description: Standardize QianfanLLMEndpoint LLM，include: - docs, the issue #24803 - model init arg names, the issue #20085	2024-08-07 10:57:27 -04:00
Pat Patterson	7e7fcf5b1f	community: Fix ValidationError on creating GPT4AllEmbeddings with no gpt4all_kwargs (#25124 ) - Description: Instantiating `GPT4AllEmbeddings` with no `gpt4all_kwargs` argument raised a `ValidationError`. Root cause: #21238 added the capability to pass `gpt4all_kwargs` through to the `GPT4All` instance via `Embed4All`, but broke code that did not specify a `gpt4all_kwargs` argument. - Issue: #25119 - Dependencies: None - Twitter handle: [`@metadaddy`](https://twitter.com/metadaddy)	2024-08-07 13:34:01 +00:00
Virat Singh	264ab96980	community: Add stock market tools from financialdatasets.ai (#25025 ) Description: In this PR, I am adding three stock market tools from financialdatasets.ai (my API!): - get balance sheets - get cash flow statements - get income statements Twitter handle: [@virattt](https://twitter.com/virattt) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-06 18:28:12 +00:00
Naval Chand	71c0698ee4	Added bedrock 3-5 sonnet cost detials for BedrockAnthropicTokenUsageCallbackHandler (#25104 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Example: "community: Added bedrock 3-5 sonnet cost detials for BedrockAnthropicTokenUsageCallbackHandler" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Co-authored-by: Naval Chand <navalchand@192.168.1.36>	2024-08-06 17:28:47 +00:00
Isaac Francisco	a72fddbf8d	[docs]: vector store integration pages (#24858 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-06 17:20:27 +00:00
maang-h	1028af17e7	docs: Standardize Tongyi (#25103 ) - Description: Standardize Tongyi LLM，include: - docs, the issue #24803 - model init arg names, the issue #20085	2024-08-06 11:44:12 -04:00
Dobiichi-Origami	061ed250f6	delete the default model value from langchain and discard the need fo… (#24915 ) - description: I remove the limitation of mandatory existence of `QIANFAN_AK` and default model name which langchain uses cause there is already a default model nama underlying `qianfan` SDK powering langchain component. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-06 14:11:05 +00:00
Dominik Fladung	ffa0c838d8	Allow ConfluenceLoader authorization via Personal Access Tokens (#25096 ) - community: Allow authorization to Confluence with bearer token - Description: Allow authorization to Confluence with [Personal Access Token](https://confluence.atlassian.com/enterprise/using-personal-access-tokens-1026032365.html) by checking for the keys `['client_id', token: ['access_token', 'token_type']]` - Issue: Currently the following error occurs when using an personal access token for authorization. ```python loader = ConfluenceLoader( url=os.getenv('CONFLUENCE_URL'), oauth2={ 'token': {"access_token": os.getenv("CONFLUENCE_ACCESS_TOKEN"), "token_type": "bearer"}, 'client_id': 'client_id', }, page_ids=['12345678'], ) ``` ``` ValueError: Error(s) while validating input: ["You have either omitted require keys or added extra keys to the oauth2 dictionary. key values should be `['access_token', 'access_token_secret', 'consumer_key', 'key_cert']`"] ``` With this PR the loader runs as expected. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-06 13:42:47 +00:00
jigsawlabs-student	427a04151c	community: fix neo4j from_existing_graph (#24912 ) Fixes Neo4JVector.from_existing_graph integration with huggingface Previously threw an error with existing databases, because from_existing_graph query returns empty list of new nodes, which are then passed to embedding function, and huggingface errors with empty list. Fixes [24401](https://github.com/langchain-ai/langchain/issues/24401) --------- Co-authored-by: Jeff Katzy <jeffreyerickatz@gmail.com>	2024-08-05 21:01:46 +00:00
Jim Baldwin	6890daa90c	community: make AthenaLoader profile_name optional and fix type hint (#24958 ) - Description: This PR makes the AthenaLoader profile_name optional and fixes the type hint which says the type is `str` but it should be `str` or `None` as None is handled in the loader init. This is a minor problem but it just confused me when I was using the Athena Loader to why we had to use a Profile, as I want that for local but not production. - Issue: #24957 - Dependencies: None.	2024-08-05 14:28:58 +00:00
Dobiichi-Origami	c5cb52a3c6	community: fix issue of the existence of numeric object in `additional_kwargs` a… (#24863 ) - Description: A previous PR breaks the code from `baidu_qianfan_endpoint.py` which causes the malfunction of streaming	2024-08-05 10:15:55 -04:00
ZhangShenao	cda79dbb6c	community[patch]: Optimize test case for `MoonshotChat` (#25050 ) Optimize test case for `MoonshotChat`. Use standard ChatModelIntegrationTests.	2024-08-05 10:11:25 -04:00
Alex Sherstinsky	208042e0f2	community: Fix Predibase Integration for HuggingFace-hosted fine-tuned adapters (#25015 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-03 14:05:43 -07:00
maang-h	f5da0d6d87	docs: Standardize MiniMaxEmbeddings (#24983 ) - Description: Standardize MiniMaxEmbeddings - docs, the issue #24856 - model init arg names, the issue #20085	2024-08-03 14:01:23 -04:00
maang-h	7de62abc91	docs: Standardize SparkLLMTextEmbeddings docstrings (#25021 ) - Description: Standardize SparkLLMTextEmbeddings docstrings - Issue: the issue #24856	2024-08-03 13:44:09 -04:00
Bagatur	e81ddb32a6	docs: fix kwargs docstring (#25010 ) Fix: ![Screenshot 2024-08-02 at 5 33 37 PM](https://github.com/user-attachments/assets/7c56cdeb-ee81-454c-b3eb-86aa8a9bdc8d)	2024-08-02 19:54:54 -07:00
Isaac Francisco	73570873ab	docs: standardizing tavily tool docs (#24736 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-02 22:25:27 +00:00
Bagatur	8e2316b8c2	community[patch]: Release 0.2.11 (#24989 )	2024-08-02 20:08:44 +00:00
ccurme	22c1a4041b	community[patch]: support named arguments in github toolkit (#24986 ) Parameters may be passed in by name if generated from tool calls.	2024-08-02 18:27:32 +00:00
ZhangShenao	71c0564c9f	community[patch]: Add test case for MoonshotChat (#24960 ) Add test case for `MoonshotChat`.	2024-08-02 09:37:31 -04:00
Isaac Francisco	d7688a4328	community[patch]: adding artifact to Tavily search (#24376 ) This allows you to get raw content as well as the answer, instead of just getting the results. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-01 21:12:11 -07:00
maang-h	ea505985c4	docs: Standardize ZhipuAIEmbeddings docstrings (#24933 ) - Description: Standardize ZhipuAIEmbeddings rich docstrings. - Issue: the issue #24856	2024-08-01 14:06:53 -04:00
Anneli Samuel	2204d8cb7d	community[patch]: Invoke on_llm_new_token callback before yielding chunk (#24938 ) Description: Invoke on_llm_new_token callback before yielding chunk in streaming mode Issue: [#16913](https://github.com/langchain-ai/langchain/issues/16913)	2024-08-01 16:39:04 +00:00
Serena Ruan	1827bb4042	community[patch]: support bind_tools for ChatMlflow (#24547 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - Description: Support ChatMlflow.bind_tools method Tested in Databricks: <img width="836" alt="image" src="https://github.com/user-attachments/assets/fa28ef50-0110-4698-8eda-4faf6f0b9ef8"> - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Signed-off-by: Serena Ruan <serena.rxy@gmail.com>	2024-08-01 08:43:07 -07:00
BottlePumpkin	bfc59c1d26	community: Fix KeyError in NotionDB loader when 'name' is missing (#24224 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Description: This PR fixes a KeyError in NotionDBLoader when the "name" key is missing in the "people" property. Issue: Fixes #24223 Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-01 13:55:40 +00:00
alexqiao	8eb0bdead3	community[patch]: Invoke callback prior to yielding token (#24917 ) Description: Invoke callback prior to yielding token in stream method for chat_models . Issue: https://github.com/langchain-ai/langchain/issues/16913 #16913	2024-08-01 13:19:55 +00:00
Nikita Pakunov	c776471ac6	community: fix AttributeError: 'YandexGPT' object has no attribute '_grpc_metadata' (#24432 ) Fixes #24049 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-31 21:18:33 +00:00
Eugene Yurtsev	add16111b9	community[patch]: Make the pydantic linter stricter (#24897 ) Stricter linting of deprecated pydantic features.	2024-07-31 18:57:37 +00:00
Eugene Yurtsev	a4a444f73d	community[patch]: Fix arcee llm usage of root_validator(pre=False) (#24896 ) Should be pre=True	2024-07-31 18:49:20 +00:00
Eugene Yurtsev	d24b82357f	community[patch]: Add missing annotations (#24890 ) This PR adds annotations in comunity package. Annotations are only strictly needed in subclasses of BaseModel for pydantic 2 compatibility. This PR adds some unnecessary annotations, but they're not bad to have regardless for documentation pages.	2024-07-31 18:13:44 +00:00
ccurme	30f18c7b02	docs: add retriever integrations template (#24836 )	2024-07-31 13:50:44 -04:00
Anirudh31415926535	4da3d4b18e	docs: Minor corrections and updates to Cohere docs (#22726 ) - Description: Update the Cohere's provider and RagRetriever documentations with latest updates. - Twitter handle: Anirudh1810	2024-07-31 10:16:26 -07:00
Nishan Jain	b00c0fc558	[Community][minor]: Added prompt governance in pebblo_retrieval (#24874 ) Title: [pebblo_retrieval] Identifying entities in prompts given in PebbloRetrievalQA leading to prompt governance Description: Implemented identification of entities in the prompt using Pebblo prompt governance API. Issue: NA Dependencies: NA Add tests and docs: NA	2024-07-31 13:14:51 +00:00
Rajendra Kadam	a6add89bd4	community[minor]: [PebbloSafeLoader] Implement content-size-based batching (#24871 ) - Title: [PebbloSafeLoader] Implement content-size-based batching in the classification flow(loader/doc API) - Description: - Implemented content-size-based batching in the loader/doc API, set to 100KB with no external configuration option, intentionally hard-coded to prevent timeouts. - Remove unused field(pb_id) from doc_metadata - Issue: NA - Dependencies: NA - Add tests and docs: Updated	2024-07-31 09:10:28 -04:00
TrumanYan	096b66db4a	community: replace it with Tencent Cloud SDK (#24172 ) Description: The old method will be discontinued; use the official SDK for more model options. Issue: None Dependencies: None Twitter handle: None Co-authored-by: trumanyan <trumanyan@tencent.com>	2024-07-31 09:05:38 -04:00
Erick Friis	1f5444817a	community: deprecate BedrockEmbeddings in favor of langchain-aws (#24846 )	2024-07-30 23:13:17 +00:00
Shailendra Mishra	f2d810b3c0	clob_bugfix... (#24813 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-30 12:44:04 -04:00
Anush	51b15448cc	community: Fix FastEmbedEmbeddings (#24462 ) ## Description This PR: - Fixes the validation error in `FastEmbedEmbeddings`. - Adds support for `batch_size`, `parallel` params. - Removes support for very old FastEmbed versions. - Updates the FastEmbed doc with the new params. Associated Issues: - Resolves #24039 - Resolves #https://github.com/qdrant/fastembed/issues/296	2024-07-30 12:42:46 -04:00
ccurme	73ec24fc56	docs[patch]: add toolkit template (#24791 )	2024-07-30 12:36:09 -04:00
Igor Drozdov	c2706cfb9e	feat(community): add tools support for litellm (#23906 ) I used the following example to validate the behavior ```python from langchain_core.prompts import ChatPromptTemplate from langchain_core.runnables import ConfigurableField from langchain_anthropic import ChatAnthropic from langchain_community.chat_models import ChatLiteLLM from langchain_core.tools import tool from langchain.agents import create_tool_calling_agent, AgentExecutor @tool def multiply(x: float, y: float) -> float: """Multiply 'x' times 'y'.""" return x * y @tool def exponentiate(x: float, y: float) -> float: """Raise 'x' to the 'y'.""" return x**y @tool def add(x: float, y: float) -> float: """Add 'x' and 'y'.""" return x + y prompt = ChatPromptTemplate.from_messages([ ("system", "you're a helpful assistant"), ("human", "{input}"), ("placeholder", "{agent_scratchpad}"), ]) tools = [multiply, exponentiate, add] llm = ChatAnthropic(model="claude-3-sonnet-20240229", temperature=0) # llm = ChatLiteLLM(model="claude-3-sonnet-20240229", temperature=0) agent = create_tool_calling_agent(llm, tools, prompt) agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True) agent_executor.invoke({"input": "what's 3 plus 5 raised to the 2.743. also what's 17.24 - 918.1241", }) ``` `ChatAnthropic` version works: ``` > Entering new AgentExecutor chain... Invoking: `exponentiate` with `{'x': 5, 'y': 2.743}` responded: [{'text': 'To calculate 3 + 5^2.743, we can use the "exponentiate" and "add" tools:', 'type': 'text', 'index': 0}, {'id': 'toolu_01Gf54DFTkfLMJQX3TXffmxe', 'input': {}, 'name': 'exponentiate', 'type': 'tool_use', 'index': 1, 'partial_json': '{"x": 5, "y": 2.743}'}] 82.65606421491815 Invoking: `add` with `{'x': 3, 'y': 82.65606421491815}` responded: [{'id': 'toolu_01XUq9S56GT3Yv2N1KmNmmWp', 'input': {}, 'name': 'add', 'type': 'tool_use', 'index': 0, 'partial_json': '{"x": 3, "y": 82.65606421491815}'}] 85.65606421491815 Invoking: `add` with `{'x': 17.24, 'y': -918.1241}` responded: [{'text': '\n\nSo 3 + 5^2.743 = 85.66\n\nTo calculate 17.24 - 918.1241, we can use:', 'type': 'text', 'index': 0}, {'id': 'toolu_01BkXTwP7ec9JKYtZPy5JKjm', 'input': {}, 'name': 'add', 'type': 'tool_use', 'index': 1, 'partial_json': '{"x": 17.24, "y": -918.1241}'}] -900.8841[{'text': '\n\nTherefore, 17.24 - 918.1241 = -900.88', 'type': 'text', 'index': 0}] > Finished chain. ``` While `ChatLiteLLM` version doesn't. But with the changes in this PR, along with: - https://github.com/langchain-ai/langchain/pull/23823 - https://github.com/BerriAI/litellm/pull/4554 The result is _almost_ the same: ``` > Entering new AgentExecutor chain... Invoking: `exponentiate` with `{'x': 5, 'y': 2.743}` responded: To calculate 3 + 5^2.743, we can use the "exponentiate" and "add" tools: 82.65606421491815 Invoking: `add` with `{'x': 3, 'y': 82.65606421491815}` 85.65606421491815 Invoking: `add` with `{'x': 17.24, 'y': -918.1241}` responded: So 3 + 5^2.743 = 85.66 To calculate 17.24 - 918.1241, we can use: -900.8841 Therefore, 17.24 - 918.1241 = -900.88 > Finished chain. ``` If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-30 15:39:34 +00:00
David Robertson	bfb7f8d40a	Brave Search: Enhance search result details with extra snippets (#19209 ) Description: This update significantly improves the Brave Search Tool's utility within the LangChain library by enriching the search results it returns. The tool previously returned title, link, and snippet, with the snippet being a truncated 140-character description from the search engine. To make the search results more informative, this update enables extra_snippets by default and introduces additional result fields: title, link, description (enhancing and renaming the former snippet field), age, and snippets. The snippets field provides a list of strings summarizing the webpage, utilizing Brave's capability for more detailed search insights. This enhancement aims to make the search tool far more informative and beneficial for users. Issue: N/A Dependencies: No additional dependencies introduced. Twitter handle: @davidalexr987 Code Changes Summary: - Changed the default setting to include extra_snippets in search results. - Renamed the snippet field to description to accurately reflect its content and included an age field for search results. - Introduced a snippets field that lists webpage summaries, providing users with comprehensive search result insights. Backward Compatibility Note: The renaming of snippet to description improves the accuracy of the returned data field but may impact existing users who have developed integration's or analyses based on the snippet field. I believe this change is essential for clarity and utility, and it aligns better with the data provided by Brave Search. Additional Notes: This proposal focuses exclusively on the Brave Search package, without affecting other LangChain packages or introducing new dependencies.	2024-07-30 15:29:38 +00:00
Ben Chambers	435771fe74	[community]: Fix package name mismatch (#24824 ) - Description: fix a mismatch in pypi package names	2024-07-30 11:21:39 -04:00
maang-h	4bb1a11e02	community: Add MiniMaxChat bind_tools and structured output (#24310 ) - Description: - Add `bind_tools` method to support tool calling - Add `with_structured_output` method to support structured output	2024-07-29 15:51:52 -04:00
maang-h	bf685c242f	docs: Standardize QianfanEmbeddingsEndpoint (#24786 ) - Description: Standardize QianfanEmbeddingsEndpoint, include: - docstrings, the issue #21983 - model init arg names, the issue #20085	2024-07-29 13:19:24 -04:00
M. Ali	c086410677	fix docs typos (#23668 ) Thank you for contributing to LangChain! - [x] PR title: "docs: fix multiple typos" Co-authored-by: mohblnk <mohamed.ali@blnk.ai> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-29 16:10:55 +00:00
Pere Pasamonte	98175860ad	community: Fix AWS DocumentDB similarity_search when filter is None (#24777 ) Description Fixes DocumentDBVectorSearch similarity_search when no filter is used; it defaults to None but $match does not accept None, so changed default to empty {} before pipeline is created. Issue AWS DocumentDB similarity search does not work when no filter is used. Error msg: "the match filter must be an expression in an object" #24775 Dependencies No dependencies Twitter handle https://x.com/perepasamonte --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-29 15:32:05 +00:00
AmosDinh	c113682328	community:Add support for specifying document_loaders.firecrawl api url. (#24747 ) community:Add support for specifying document_loaders.firecrawl api url. Add support for specifying document_loaders.firecrawl api url. This is mainly to support the [self-hosting](https://github.com/mendableai/firecrawl/blob/main/SELF_HOST.md) option firecrawl provides. Eg. now I can specify localhost:.... The corresponding firecrawl class already provides functionality to pass the argument. See here: `4c9d62f6d3/apps/python-sdk/firecrawl/firecrawl.py (L29)` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-28 14:30:36 -04:00
Haijian Wang	cda3025ee1	Integrating the Yi family of models. (#24491 ) Thank you for contributing to LangChain! - [x] PR title: "community:add Yi LLM", "docs:add Yi Documentation" - [x] PR message: *Delete this entire checklist* and replace with - Description: This PR adds support for the Yi model to LangChain. - Dependencies: [langchain_core,requests,contextlib,typing,logging,json,langchain_community] - Twitter handle: 01.AI - [x] Add tests and docs: I've added the corresponding documentation to the relevant paths --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-07-26 10:57:33 -07:00
Marc Gibbons	cc451effd1	community[patch]: langchain_community.vectorstores.azuresearch Raise LangChainException instead of bare Exception (#23935 ) Raise `LangChainException` instead of `Exception`. This alleviates the need for library users to use bare try/except to handle exceptions raised by `AzureSearch`. Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-26 15:59:06 +00:00
Diverrez morgan	c4d2a53f18	community: creation score_threshold in flashrank_rerank.py (#24016 ) Description: add a optional score relevance threshold for select only coherent document, it's in complement of top_n Discussion: add relevance score threshold in flashrank_rerank document compressors #24013 Dependencies: no dependencies --------- Co-authored-by: Benjamin BERNARD <benjamin.bernard@openpathview.fr> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-26 13:34:39 +00:00
Cong Peng	190988d93e	community: Add parameter `allow_dangerous_requests` to `WebResearchRetriever.from_llm` construct (#24712 ) Description: To avoid ValueError when construct the retriever from method `from_llm()`.	2024-07-26 06:24:58 -07:00
monysun	5f593c172a	community: fix dashcope embeddings embed_query func post too much req to api (#24707 ) the fuc of embed_query of dashcope embeddings send a str param, and in the embed_with_retry func will send error content to api	2024-07-26 12:44:07 +00:00
yonarw	b65ac8d39c	community[minor]: Self query retriever for HANA Cloud Vector Engine (#24494 ) Description: - This PR adds a self query retriever implementation for SAP HANA Cloud Vector Engine. The retriever supports all operators except for contains. - Issue: N/A - Dependencies: no new dependencies added Add tests and docs: Added integration tests to: libs/community/tests/unit_tests/query_constructors/test_hanavector.py Documentation for self query retriever: /docs/integrations/retrievers/self_query/hanavector_self_query.ipynb --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-07-26 06:56:51 +00:00
nobbbbby	4f3b4fc7fe	community[patch]: Extend Baichuan model with tool support (#24529 ) Description: Expanded the chat model functionality to support tools in the 'baichuan.py' file. Updated module imports and added tool object handling in message conversions. Additional changes include the implementation of tool binding and related unit tests. The alterations offer enhanced model capabilities by enabling interaction with tool-like objects. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-25 23:20:44 -07:00
Rave Harpaz	ee399e3ec5	community[patch]: Add OCI Generative AI tool and structured output support (#24693 ) - [x] PR title: community: Add OCI Generative AI tool and structured output support - [x] PR message: - Description: adding tool calling and structured output support for chat models offered by OCI Generative AI services. This is an update to our last PR 22880 with changes in /langchain_community/chat_models/oci_generative_ai.py - Issue: NA - Dependencies: NA - Twitter handle: NA - [x] Add tests and docs: 1. we have updated our unit tests 2. we have updated our documentation under /docs/docs/integrations/chat/oci_generative_ai.ipynb - [x] Lint and test: `make format`, `make lint` and `make test` we run successfully --------- Co-authored-by: RHARPAZ <RHARPAZ@RHARPAZ-5750.us.oracle.com> Co-authored-by: Arthur Cheng <arthur.cheng@oracle.com>	2024-07-25 23:19:00 -07:00
Yuki Watanabe	2b6a262f84	community[patch]: Replace `filters` argument to `filter` in DatabricksVectorSearch (#24530 ) The [DatabricksVectorSearch](https://github.com/langchain-ai/langchain/blob/master/libs/community/langchain_community/vectorstores/databricks_vector_search.py#L21) class exposes similarity search APIs with argument `filters`, which is inconsistent with other VS classes who uses `filter` (singular). This PR updates the argument and add alias for backward compatibility. --------- Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>	2024-07-25 21:20:18 -07:00
Sunish Sheth	59880a9147	community[patch]: mlflow handle empty chunk(#24689 )	2024-07-25 20:36:29 -07:00
Chaunte W. Lacewell	69eacaa887	Community[minor]: Update VDMS vectorstore (#23729 ) Description: - This PR exposes some functions in VDMS vectorstore, updates VDMS related notebooks, updates tests, and upgrade version of VDMS (>=0.0.20) Issue: N/A Dependencies: - Update vdms>=0.0.20	2024-07-25 22:13:04 -04:00
KyrianC	0fdbaf4a8d	community: fix ChatEdenAI + EdenAI Tools (#23715 ) Fixes for Eden AI Custom tools and ChatEdenAI: - add missing import in __init__ of chat_models - add `args_schema` to custom tools. otherwise '__arg1' would sometimes be passed to the `run` method - fix IndexError when no human msg is added in ChatEdenAI	2024-07-25 15:19:14 -04:00

... 3 4 5 6 7 ...

1804 Commits