langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-01-23 05:09:12 +00:00

Author	SHA1	Message	Date
Bagatur	c24df760b5	fmt	2024-08-30 17:45:20 -07:00
William Fu-Hinthorn	1cc61a07a0	[Logging] Warn to debug	2024-08-30 11:09:30 -07:00
Eugene Yurtsev	b7c070d437	docs[patch]: Update code that checks API keys (#25444 ) Check whether the API key is already in the environment Update: ```python import getpass import os os.environ["DATABRICKS_HOST"] = "https://your-workspace.cloud.databricks.com" os.environ["DATABRICKS_TOKEN"] = getpass.getpass("Enter your Databricks access token: ") ``` To: ```python import getpass import os os.environ["DATABRICKS_HOST"] = "https://your-workspace.cloud.databricks.com" if "DATABRICKS_TOKEN" not in os.environ: os.environ["DATABRICKS_TOKEN"] = getpass.getpass( "Enter your Databricks access token: " ) ``` grit migration: ``` engine marzano(0.1) language python `os.environ[$Q] = getpass.getpass("$X")` as $CHECK where { $CHECK <: ! within if_statement(), $CHECK => `if $Q not in os.environ:\n $CHECK` } ```	2024-08-15 12:52:37 -04:00
Bagatur	60b65528c5	docs: fix api ref mod links in pkg page (#25447 )	2024-08-15 16:52:12 +00:00
Eugene Yurtsev	2ef9d12372	mistralai[patch]: Update more @root_validators for pydantic 2 compatibility (#25446 ) Update @root_validators in mistralai integration for pydantic 2 compatibility	2024-08-15 12:44:42 -04:00
Eugene Yurtsev	6910b0b3aa	docs[patch]: Fix integration notebook for Fireworks llm (#25442 ) Fix integration notebook	2024-08-15 12:42:33 -04:00
Eugene Yurtsev	831708beb7	together[patch]: Update @root_validator for pydantic 2 compatibility (#25423 ) This PR updates usage of @root_validator to be compatible with pydantic 2.	2024-08-15 11:27:42 -04:00
Eugene Yurtsev	a114255b82	ai21[patch]: Update @root_validators for pydantic2 migration (#25401 ) Update @root_validators for pydantic 2 migration.	2024-08-15 11:26:44 -04:00
Eugene Yurtsev	6f68c8d6ab	mistralai[patch]: Update root validator for compatibility with pydantic 2 (#25403 )	2024-08-15 11:26:24 -04:00
ccurme	8afbab4cf6	langchain[patch]: deprecate various chains (#25310 ) - [x] NatbotChain: move to community, deprecate langchain version. Update to use `prompt \| llm \| output_parser` instead of LLMChain. - [x] LLMMathChain: deprecate + add langgraph replacement example to API ref - [x] HypotheticalDocumentEmbedder (retriever): update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] FlareChain: update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] ConstitutionalChain: deprecate + add langgraph replacement example to API ref - [x] LLMChainExtractor (document compressor): update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] LLMChainFilter (document compressor): update to use `prompt \| llm \| output_parser` instead of LLMChain - [x] RePhraseQueryRetriever (retriever): update to use `prompt \| llm \| output_parser` instead of LLMChain	2024-08-15 10:49:26 -04:00
Luke	66e30efa61	experimental: Fix divide by 0 error (#25439 ) Within the semantic chunker, when calling `_threshold_from_clusters` there is the possibility for a divide by 0 error if the `number_of_chunks` is equal to the length of `distances`. Fix simply implements a check if these values match to prevent the error and enable chunking to continue.	2024-08-15 14:46:30 +00:00
ccurme	ba167dc158	community[patch]: update connection string in azure cosmos integration test (#25438 )	2024-08-15 14:07:54 +00:00
Eugene Yurtsev	44f69063b1	docs[patch]: Fix a few typos in the chat integration docs for TogetherAI (#25424 ) Fix a few minor typos	2024-08-15 09:48:36 -04:00
Isaac Francisco	f18b77fd59	[docs]: pdf loaders (#25425 )	2024-08-14 21:44:57 -07:00
Isaac Francisco	966b408634	[docs]: doc loader changes (#25417 )	2024-08-14 19:46:33 -07:00
ccurme	bd261456f6	langchain: bump core to 0.2.32 (#25421 )	2024-08-15 00:00:42 +00:00
Bagatur	ec8ffc8f40	core[patch]: Release 0.2.32 (#25420 )	2024-08-14 15:56:56 -07:00
Bagatur	2494cecabf	core[patch]: tool import fix (#25419 )	2024-08-14 22:54:13 +00:00
ccurme	df632b8cde	langchain: bump min core version (#25418 )	2024-08-14 22:51:35 +00:00
ccurme	1050e890c6	langchain: release 0.2.14 (#25416 ) Fixes https://github.com/langchain-ai/langchain/issues/25413	2024-08-14 22:29:39 +00:00
Isaac Francisco	c4779f5b9c	[docs]: sitemaploader update (#25363 )	2024-08-14 15:27:40 -07:00
gbaian10	0a99935794	docs: remove the extra period in docstring (#25414 ) Remove the period after the hyperlink in the docstring of BaseChatOpenAI.with_structured_output. I have repeatedly copied the extra period at the end of the hyperlink, which results in a "Page not found" page when pasted into the browser.	2024-08-14 18:07:15 -04:00
Isaac Francisco	63aba3fe5b	[docs]: link fix directory loader (#25411 )	2024-08-14 20:58:54 +00:00
Bagatur	dc80be5efe	docs: fix deprecated functions table (#25409 )	2024-08-14 12:25:39 -07:00
Erick Friis	ab29ee79a3	docs: fix tool index (#25404 )	2024-08-14 18:36:41 +00:00
Werner van der Merwe	1d3f7231b8	fix: typo where github should be gitlab (#25397 ) PR title: "GitLabToolkit: fix typo" - Description: fix typo where GitHub should have been GitLab - Dependencies: None	2024-08-14 18:36:25 +00:00
Bagatur	a58d4ba340	core[patch]: Release 0.2.31 (#25388 )	2024-08-14 11:26:49 -07:00
Bagatur	d178fb9dc3	docs: fix api ref package tables (#25400 )	2024-08-14 10:40:16 -07:00
Bagatur	414154fa59	experimental[patch]: refactor rl chain structure (#25398 ) can't have a class and function with same name but different capitalization in same file for api reference building	2024-08-14 17:09:43 +00:00
Flávio Knob	94c9cb7321	Update document_loader_custom.ipynb (#25393 ) Fix typo Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-14 12:33:21 -04:00
Jacob Lee	012929551c	docs[patch]: Hide deprecated integration pages (#25389 )	2024-08-14 09:17:39 -07:00
Bagatur	63c483ea01	standard-tests: import fix (#25395 )	2024-08-14 09:13:56 -07:00
Bagatur	eec7bb4f51	anthropic[patch]: Release 0.1.23 (#25394 )	2024-08-14 09:03:39 -07:00
Flávio Knob	f0f125dac7	Update document_loader_custom.ipynb (#25391 ) Fix typo and some `callout` tags Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-14 15:07:42 +00:00
Eugene Yurtsev	f4196f1fb8	ollama[patch]: Update extra in ollama package (#25383 ) Backwards compatible change that converts pydantic extras to literals which is consistent with pydantic 2 usage.	2024-08-14 10:30:01 -04:00
Chengyu Yan	d0ad713937	core: fix issue#24660, slove error messages about `ValueError` when use model with history (#25183 ) - Description: This PR will slove error messages about `ValueError` when use model with history. Detail in #24660. #22933 causes that `langchain_core.runnables.history.RunnableWithMessageHistory._get_output_messages` miss type check of `output_val` if `output_val` is `False`. After running `RunnableWithMessageHistory._is_not_async`, `output` is `False`. `249945a572/libs/core/langchain_core/runnables/history.py (L323-L334)` `15a36dd0a2/libs/core/langchain_core/runnables/history.py (L461-L471)` ~~I suggest that `_get_output_messages` return empty list when `output_val == False`.~~ - Issue: - #24660 - Dependencies:: No Change. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-08-14 14:26:22 +00:00
Jacob Lee	ddd7919f6a	docs[patch]: Add conceptual guide links to integration index pages (#25387 )	2024-08-14 07:14:24 -07:00
Bagatur	493e474063	docs: udpated api reference (#25172 ) - Move the API reference into the vercel build - Update api reference organization and styling	2024-08-14 07:00:17 -07:00
Leonid Ganeline	4a812e3193	docs: `integrations` references update (#25217 ) Added missed provider pages. Fixed formats and added descriptions and links. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-14 13:58:38 +00:00
Eugene Yurtsev	5f5e8c9a60	huggingface[patch], pinecone[patch], fireworks[patch], mistralai[patch], voyageai[patch], togetherai[path]: convert Pydantic extras to literals (#25384 ) Backwards compatible change that converts pydantic extras to literals which is consistent with pydantic 2 usage. - fireworks - voyage ai - mistralai - mistral ai - together ai - huggigng face - pinecone	2024-08-14 09:55:30 -04:00
Eugene Yurtsev	d00176e523	openai[patch]: Update extra to match pydantic 2 (#25382 ) Backwards compatible change that converts pydantic extras to literals which is consistent with pydantic 2 usage.	2024-08-14 09:55:18 -04:00
Eugene Yurtsev	dc51cc5690	core[minor]: Prevent PydanticOutputParser from encoding schema as ASCII (#25386 ) This allows users to provide parameter descriptions in the pydantic models in other languages. Continuing this PR: https://github.com/langchain-ai/langchain/pull/24809	2024-08-14 13:54:31 +00:00
ccurme	27690506d0	multiple: update removal targets (#25361 )	2024-08-14 09:50:39 -04:00
Ikko Eltociear Ashimine	4029f5650c	docs: update clarifai.ipynb (#25373 ) Intialize -> Initialize	2024-08-14 09:20:17 -04:00
Erick Friis	10e6725a7e	docs: tools index table (#25370 )	2024-08-14 02:38:03 +00:00
Harrison Chase	967b6f21f6	docs: improve document loaders index (#25365 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-14 01:48:48 +00:00
Erick Friis	4a78be7861	docs: remove sidebar comment (#25369 )	2024-08-14 01:47:12 +00:00
Eugene Yurtsev	d6c180996f	docs[patch]: Fix typo in CohereEmbeddings integration docs (#25367 ) Fix typo	2024-08-14 01:18:54 +00:00
Eugene Yurtsev	93dcc47463	docs: Partial integration update for cohere embeddings (#25250 ) This can be finished after the following issue is resolved: https://github.com/langchain-ai/langchain-cohere/issues/81 Related to: https://github.com/langchain-ai/langchain/issues/24856 ```json [ { "provider": "cohere", "js": true, "local": false, "serializable": false, } ] ``` --------- Co-authored-by: isaac hershenson <ihershenson@hmc.edu> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-08-14 00:53:13 +00:00
Eugene Yurtsev	27def6bddb	docs[patch]: Update integration docs for AzureOpenAIEmbeddings (#25311 ) https://github.com/langchain-ai/langchain/issues/24856 --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-08-14 00:33:13 +00:00
Eugene Yurtsev	b4e3bdb714	docs: Update nomic AI embeddings integration docs (#25308 ) Issue: https://github.com/langchain-ai/langchain/issues/24856 --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-08-14 00:32:07 +00:00
Eugene Yurtsev	f82c3f622a	docs: Update AI21Embeddings Integration docs (#25298 ) Update AI21 Integration docs Issue: https://github.com/langchain-ai/langchain/issues/24856 --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-08-14 00:30:16 +00:00
Eugene Yurtsev	d55d99222b	docs: update integration docs for mistral ai embedding model (#25253 ) Related issue: https://github.com/langchain-ai/langchain/issues/24856 ```json [ { "provider": "mistralai", "js": true, "local": false, "serializable": false, "native_async": true } ] ``` --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-08-14 00:25:36 +00:00
Eugene Yurtsev	0f6217f507	docs: together ai embeddings integration docs (#25252 ) Update together AI embedding integration docs Related issue: https://github.com/langchain-ai/langchain/issues/24856 ```json [ { "provider": "together", "js": true, "local": false, "serializable": false, } ] ``` --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-08-14 00:24:02 +00:00
Eugene Yurtsev	8645a49f31	docs: Update integration docs for OllamaEmbeddingsModel (#25314 ) Issue: https://github.com/langchain-ai/langchain/issues/24856 --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-08-14 00:23:05 +00:00
Eugene Yurtsev	a4ef830480	docs: update integration docs for openai embeddings (#25249 ) Related issue: https://github.com/langchain-ai/langchain/issues/24856 ```json { "provider": "openai", "js": true, "local": false, "serializable": false, "async_native": true } ``` --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-08-14 00:21:36 +00:00
Eugene Yurtsev	b1aed44540	docs: Updating integration docs for Fireworks Embeddings (#25247 ) Providers: * fireworks See related issue: * https://github.com/langchain-ai/langchain/issues/24856 Features: ```json [ { "provider": "fireworks", "js": true, "local": false, "serializable": false, } ] ``` --------- Co-authored-by: isaac hershenson <ihershenson@hmc.edu> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-08-13 17:04:18 -07:00
Isaac Francisco	f4ffd692a3	[docs]: standardize doc loader doc strings (#25325 )	2024-08-13 23:18:56 +00:00
Isaac Francisco	e0bbb81d04	[docs]: standardize tool docstrings (#25351 )	2024-08-13 16:10:00 -07:00
Erick Friis	d5b548b4ce	docs: index pages, sidebars (#25316 )	2024-08-13 15:52:51 -07:00
Isaac Francisco	0478f7f5e4	[docs]: LLM integration pages (#25005 )	2024-08-13 14:50:45 -07:00
thedavgar	9d08369442	community: fix AzureSearch vectorstore asyncronous methods (#24921 ) Description Fix the asyncronous methods to retrieve documents from AzureSearch VectorStore. The previous changes from [this commit](`ffe6ca986e`) create a similar code for the syncronous methods and the asyncronous ones but the asyncronous client return an asyncronous iterator "AsyncSearchItemPaged" as said in the issue #24740. To solve this issue, the syncronous iterators in asyncronous methods where changed to asyncronous iterators. @chrislrobert said in [this comment](https://github.com/langchain-ai/langchain/issues/24740#issuecomment-2254168302) that there was a still a flaw due to `with` blocks that close the client after each call. I removed this `with` blocks in the `async_client` following the same pattern as the sync `client`. In order to close up the connections, a __del__ method is included to gently close up clients once the vectorstore object is destroyed. Issue: #24740 and #24064 Dependencies: No new dependencies for this change Example notebook: I created a notebook just to test the changes work and gives the same results as the syncronous methods for vector and hybrid search. With these changes, the asyncronous methods in the retriever work as well. ![image](https://github.com/user-attachments/assets/697e431b-9d7f-4d0d-b205-59d051ac2b67) Lint and test: Passes the tests and the linter	2024-08-13 14:20:51 -07:00
Isaac Francisco	6bc451b942	[docs]: merge tool/toolkit duplicates (#25197 )	2024-08-13 12:19:17 -07:00
Fedor Nikolaev	2b15518c5f	community: add args_schema to SearxSearchResults tool (#25350 ) This adds `args_schema` member to `SearxSearchResults` tool. This member is already present in the `SearxSearchRun` tool in the same file. I was having `TypeError: Type is not JSON serializable: AsyncCallbackManagerForToolRun` being thrown in langserve playground when I was using `SearxSearchResults` tool as a part of chain there. This fixes the issue, so the error is not raised anymore. This is a example langserve app that was giving me the error, but it works properly after the proposed fix: ```python #!/usr/bin/env python from fastapi import FastAPI from langchain_core.prompts import ChatPromptTemplate from langchain_core.output_parsers import StrOutputParser from langchain_core.runnables import RunnablePassthrough from langchain_openai import ChatOpenAI from langchain_community.utilities import SearxSearchWrapper from langchain_community.tools.searx_search.tool import SearxSearchResults from langserve import add_routes template = """Answer the question based only on the following context: {context} Question: {question} """ prompt = ChatPromptTemplate.from_template(template) model = ChatOpenAI() s = SearxSearchWrapper(searx_host="http://localhost:8080") search = SearxSearchResults(wrapper=s) search_chain = ( {"context": search, "question": RunnablePassthrough()} \| prompt \| model \| StrOutputParser() ) app = FastAPI() add_routes( app, search_chain, path="/chain", ) if __name__ == "__main__": import uvicorn uvicorn.run(app, host="localhost", port=8000) ```	2024-08-13 18:26:09 +00:00
Matt Kandler	b6df3405fb	docs: Fix broken link to Runhouse documentation (#25349 ) - Description: Runhouse recently migrated from Read the Docs to a self-hosted solution. This PR updates a broken link from the old docs to www.run.house/docs. Also changed "The Runhouse" to "Runhouse" (it's cleaner). - Issue: None - Dependencies: None	2024-08-13 18:18:19 +00:00
maang-h	089f5e6cad	Standardize SparkLLM (#25239 ) - Description: Standardize SparkLLM, include: - docs, the issue #24803 - to support stream - update api url - model init arg names, the issue #20085	2024-08-13 09:50:12 -04:00
Leonid Ganeline	35e2230f56	docs: `integrations`references update (#25322 ) Added missed provider pages. Fixed formats and added descriptions and links.	2024-08-13 09:29:51 -04:00
Chen Xiabin	24155aa1ac	qianfan generate/agenerate with usage_metadata (#25332 )	2024-08-13 09:24:41 -04:00
Christophe Bornet	ebbe609193	Add README for astradb package (#25345 ) Similar to https://github.com/langchain-ai/langchain/blob/master/libs/partners/ibm/README.md	2024-08-13 09:17:23 -04:00
Eugene Yurtsev	f679ed72ca	ollama[patch]: Update API Reference for ollama embeddings (#25315 ) Update API reference for OllamaEmbeddings Issue: https://github.com/langchain-ai/langchain/issues/24856	2024-08-12 21:31:48 -04:00
Erick Friis	2907ab2297	community: release 0.2.12 (#25324 )	2024-08-12 23:30:27 +00:00
Erick Friis	06f8bd9946	langchain: release 0.2.13 (#25323 )	2024-08-12 22:24:06 +00:00
Erick Friis	252f0877d1	core: release 0.2.30 (#25321 )	2024-08-12 22:01:24 +00:00
Eugene Yurtsev	217a915b29	openai: Update API Reference docs for AzureOpenAI Embeddings (#25312 ) Update AzureOpenAI Embeddings docs	2024-08-12 19:41:18 +00:00
Eugene Yurtsev	056c7c2983	core[patch]: Update API reference for fake embeddings (#25313 ) Issue: https://github.com/langchain-ai/langchain/issues/24856 Using the same template for the fake embeddings in langchain_core as used in the integrations.	2024-08-12 19:40:05 +00:00
Ben Chambers	1adc161642	community: kwargs for CassandraGraphVectorStore (#25300 ) - Description: pass kwargs from CassandraGraphVectorStore to underlying store Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-12 18:01:29 +00:00
Hassan-Memon	deb27d8970	docs: remove unused imports in Conversational RAG tutorial (#25297 ) Cleaned up the "Tying it Together" section of the Conversational RAG tutorial by removing unnecessary imports that were not used. This reduces confusion and makes the code more concise. Thank you for contributing to LangChain! PR title: docs: remove unused imports in Conversational RAG tutorial PR message: Description: Removed unnecessary imports from the "Tying it Together" section of the Conversational RAG tutorial. These imports were not used in the code and created confusion. The updated code is now more concise and easier to understand. Issue: N/A Dependencies: None LinkedIn handle: [Hassan Memon](https://www.linkedin.com/in/hassan-memon-a109b3257/) Add tests and docs: Hi [LangChain Team Member’s Name], I hope you're doing well! I’m thrilled to share that I recently made my second contribution to the LangChain project. If possible, could you give me a shoutout on LinkedIn? It would mean a lot to me and could help inspire others to contribute to the community as well. Here’s my LinkedIn profile: [Hassan Memon](https://www.linkedin.com/in/hassan-memon-a109b3257/). Thank you so much for your support and for creating such a great platform for learning and collaboration. I'm looking forward to contributing more in the future! Best regards, Hassan Memon	2024-08-12 13:49:55 -04:00
gbaian10	5efd0fe9ae	docs: Change SqliteSaver to MemorySaver (#25306 ) fix: #25137 `SqliteSaver.from_conn_string()` has been changed to a `contextmanager` method in `langgraph >= 0.2.0`, the original usage is no longer applicable. Refer to <https://github.com/langchain-ai/langgraph/pull/1271#issue-2454736415> modification method to replace `SqliteSaver` with `MemorySaver`.	2024-08-12 13:45:32 -04:00
Eugene Yurtsev	1c9917dfa2	fireworks[patch]: Fix doc-string for API Referenmce (#25304 )	2024-08-12 17:16:13 +00:00
Eugene Yurtsev	ccff1ba8b8	ai21[patch]: Update API reference documentation (#25302 ) Issue: https://github.com/langchain-ai/langchain/issues/24856	2024-08-12 13:15:27 -04:00
Eugene Yurtsev	53ee5770d3	fireworks: Add APIReference for the FireworksEmbeddings model (#25292 ) Add API Reference documentation for the FireworksEmbedding model. Issue: https://github.com/langchain-ai/langchain/issues/24856	2024-08-12 13:13:43 -04:00
Eugene Yurtsev	8626abf8b5	togetherai[patch]: Update API Reference for together AI embeddings model (#25295 ) Issue: https://github.com/langchain-ai/langchain/issues/24856	2024-08-12 17:12:28 +00:00
Eugene Yurtsev	1af8456a2c	mistralai[patch]: Docs Update APIReference for MistralAIEmbeddings (#25294 ) Update API Reference for MistralAI embeddings Issue: https://github.com/langchain-ai/langchain/issues/24856	2024-08-12 15:25:37 +00:00
Eugene Yurtsev	0a3500808d	openai[patch]: Docs fix RST formatting in OpenAIEmbeddings (#25293 )	2024-08-12 11:24:35 -04:00
Eugene Yurtsev	ee8a585791	openai[patch]: Add API Reference docs to OpenAIEmbeddings (#25290 ) Issue: [24856](https://github.com/langchain-ai/langchain/issues/24856)	2024-08-12 14:53:51 +00:00
ccurme	e77eeee6ee	core[patch]: add standard tracing params for retrievers (#25240 )	2024-08-12 14:51:59 +00:00
Mohammad Mohtashim	9927a4866d	[Community] - Added bind_tools and with_structured_output for ChatZhipuAI (#23887 ) - Description: This PR implements the `bind_tool` functionality for ChatZhipuAI as requested by the user. ChatZhipuAI models support tool calling according to the `OpenAI` tool format, as outlined in their official documentation [here](https://open.bigmodel.cn/dev/api#glm-4). - Issue: ##23868 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-12 14:11:43 +00:00
Hassan-Memon	420534c8ca	docs: Replaced SqliteSaver with MemorySaver and updated installation instru… (#25285 ) …ctions to match LangGraph v2 documentation. Corrected code snippet to prevent validation errors. Here's how you can fill out the provided template for your pull request: --- Thank you for contributing to LangChain! - [ ] PR title: `docs: update checkpointer example in Conversational RAG tutorial` - [ ] PR message: - Description: Updated the Conversational RAG tutorial to correct the checkpointer example by replacing `SqliteSaver` with `MemorySaver`. Added installation instructions for `langgraph-checkpoint-memory` to match LangGraph v2 documentation and prevent validation errors. - Issue: N/A - Dependencies: `langgraph-checkpoint-memory` - Twitter handle: N/A - [ ] Add tests and docs: 1. No new integration tests are required. 2. Updated documentation in the Conversational RAG tutorial. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: [LangChain Contribution Guidelines](https://python.langchain.com/docs/contributing/) Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-12 09:24:51 -04:00
Yunus Emre Özdemir	794f28d4e2	docs: document upstash vector namespaces (#25289 ) Description: This PR rearranges the examples in Upstash Vector integration documentation to describe how to use namespaces and improve the description of metadata filtering.	2024-08-12 09:17:11 -04:00
JasonJ	f28ae20b81	docs: pip install bug fixed (#25287 ) Thank you for contributing to LangChain! - Description: Fixing package install bug in cookbook - Issue: zsh:1: no matches found: unstructured[all-docs] - Dependencies: N/A - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-12 05:12:44 +00:00
Soichi Sumi	9f0eda6a18	docs: Fix link for API reference of Gmail Toolkit (#25286 ) - Description: Fix link for API reference of Gmail Toolkit - Issue: I've just found this issue while I'm reading the doc - Dependencies: N/A - Twitter handle: [@soichisumi](https://x.com/soichisumi) TODO: If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-12 05:12:31 +00:00
Anush	472527166f	qdrant: Update API reference link and install command (#25245 ) ## Description As the title goes. The current API reference links to the deprecated class.	2024-08-11 16:54:14 -04:00
Aryan Singh	074fa0db73	docs: Fixed grammer error in functions.ipynb (#25255 ) Description: Grammer Error in functions.ipynb Issue: #25222	2024-08-11 20:53:27 +00:00
gbaian10	4fd1efc48f	docs: update "Build an Agent" Installation Hint in agents.ipynb (#25263 ) fix #25257	2024-08-11 16:51:34 -04:00
gbaian10	aa2722cbe2	docs: update numbering of items in docstring (#25267 ) A problem similar to #25093 . Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-11 20:50:24 +00:00
Maddy Adams	a82c0533f2	langchain: default to langsmith sdk for pulling prompts, fallback to langchainhub (#24156 ) Description: Deprecating langchainhub, replacing with langsmith sdk	2024-08-11 13:30:52 -07:00
maang-h	bc60cddc1b	docs: Fix ChatBaichuan, QianfanChatEndpoint, ChatSparkLLM, ChatZhipuAI docs (#25265 ) - Description: Fix some chat models docs, include: - ChatBaichuan - QianfanChatEndpoint - ChatSparkLLM - ChatZhipuAI	2024-08-11 16:23:55 -04:00
ZhangShenao	43deed2a95	Improvement[Embeddings] Add dimension support to `ZhipuAIEmbeddings` (#25274 ) - In the in ` embedding-3 ` and later models of Zhipu AI, it is supported to specify the dimensions parameter of Embedding. Ref: https://bigmodel.cn/dev/api#text_embedding-3 . - Add test case for `embedding-3` model by assigning dimensions.	2024-08-11 16:20:37 -04:00
maang-h	9cd608efb3	docs: Standardize OpenAI Docs (#25280 ) - Description: Standardize OpenAI Docs - Issue: the issue #24803 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-11 20:20:16 +00:00
Bagatur	fd546196ef	openai[patch]: Release 0.1.21 (#25269 )	2024-08-10 16:37:31 -07:00
Eugene Yurtsev	6dd9f053e3	core[patch]: Deprecating beta upsert APIs in vectorstore (#25069 ) This PR deprecates the beta upsert APIs in vectorstore. We'll introduce them in a V2 abstraction instead to keep the existing vectorstore implementations lighter weight. The main problem with the existing APIs is that it's a bit more challenging to implement the correct behavior w/ respect to IDs since ID can be present in both the function signature and as an optional attribute on the document object. But VectorStores that pass the standard tests should have implemented the semantics properly! --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-09 17:17:36 -04:00
Bagatur	ca9dcee940	standard-tests[patch]: test ToolMessage.status="error" (#25210 )	2024-08-09 13:00:14 -07:00
Eugene Yurtsev	dadb6f1445	cli[patch]: Update integration template for embedding models (#25248 ) Update integration template for embedding models	2024-08-09 14:28:57 -04:00
Eugene Yurtsev	b6f0174bb9	community[patch],core[patch]: Update EdenaiTool root_validator and add unit test in core (#25233 ) This PR gets rid `root_validators(allow_reuse=True)` logic used in EdenAI Tool in preparation for pydantic 2 upgrade. - add another test to secret_from_env_factory	2024-08-09 15:59:27 +00:00
blueoom	c3ced4c6ce	core[patch]: use time.monotonic() instead time.time() in InMemoryRateLimiter Description: The get time point method in the _consume() method of core.rate_limiters.InMemoryRateLimiter uses time.time(), which can be affected by system time backwards. Therefore, it is recommended to use the monotonically increasing monotonic() to obtain the time ```python with self._consume_lock: now = time.time() # time.time() -> time.monotonic() # initialize on first call to avoid a burst if self.last is None: self.last = now elapsed = now - self.last # when use time.time(), elapsed may be negative when system time backwards ```	2024-08-09 11:31:20 -04:00
Eugene Yurtsev	bd6c31617e	community[patch]: Remove more @allow_reuse=True validators (#25236 ) Remove some additional allow_reuse=True usage in @root_validators.	2024-08-09 11:10:27 -04:00
Eugene Yurtsev	6e57aa7c36	community[patch]: Remove usage of @root_validator(allow_reuse=True) (#25235 ) Remove usage of @root_validator(allow_reuse=True)	2024-08-09 10:57:42 -04:00
thiswillbeyourgithub	a2b4c33bd6	community[patch]: FAISS: ValueError mentions normalize_score_fn isntead of relevance_score_fn (#25225 ) Thank you for contributing to LangChain! - [X] PR title: "community: fix valueerror mentions wrong argument missing" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [X] PR message: *Delete this entire checklist* and replace with - Description: when faiss.py has a None relevance_score_fn it raises a ValueError that says a normalize_fn_score argument is needed. Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-09 14:40:29 +00:00
ccurme	4825dc0d76	langchain[patch]: add deprecations (#24792 )	2024-08-09 10:34:43 -04:00
ccurme	02300471be	langchain[patch]: extended-tests: drop logprobs from OAI expected config (#25234 ) Following https://github.com/langchain-ai/langchain/pull/25229	2024-08-09 14:23:11 +00:00
Shivendra Soni	66b7206ab6	community: Add llm-extraction option to FireCrawl Document Loader (#25231 ) Description: This minor PR aims to add `llm_extraction` to Firecrawl loader. This feature is supported on API and PythonSDK, but the langchain loader omits adding this to the response. Twitter handle: [scalable_pizza](https://x.com/scalablepizza) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-09 13:59:10 +00:00
blaufink	c81c77b465	partners: fix of issue #24880 (#25229 ) - Description: As described in the related issue: There is an error occuring when using langchain-openai>=0.1.17 which can be attributed to the following PR: #23691 Here, the parameter logprobs is added to requests per default. However, AzureOpenAI takes issue with this parameter as stated here: https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/chatgpt?tabs=python-new&pivots=programming-language-chat-completions -> "If you set any of these parameters, you get an error." Therefore, this PR changes the default value of logprobs parameter to None instead of False. This results in it being filtered before the request is sent. - Issue: #24880 - Dependencies: / Co-authored-by: blaufink <sebastian.brueckner@outlook.de>	2024-08-09 13:21:37 +00:00
ccurme	3b7437d184	docs: update integration api refs (#25195 ) - [x] toolkits - [x] retrievers (in this repo)	2024-08-09 12:27:32 +00:00
Bagatur	91ea4b7449	infra: avoid orjson 3.10.7 in vercel build (#25212 )	2024-08-09 02:23:18 +00:00
Isaac Francisco	652b3fa4a4	[docs]: playwright fix (#25163 )	2024-08-08 17:13:42 -07:00
Bagatur	7040013140	core[patch]: fix deprecation pydantic bug (#25204 ) #25004 is incompatible with pydantic < 1.10.17. Introduces fix for this.	2024-08-08 16:39:38 -07:00
Isaac Francisco	dc7423e88f	[docs]: standardizing document loader integration pages (#25002 )	2024-08-08 16:33:09 -07:00
Casey Clements	25f2e25be1	partners[patch]: Mongodb Retrievers - CI final touches. (#25202 ) ## Description Contains 2 updates to for integration tests to run on langchain's CI. Addendum to #25057 to get release github action to succeed.	2024-08-08 15:38:31 -07:00
Bagatur	786ef021a3	docs: redirect toolkits (#25190 )	2024-08-08 14:54:11 -07:00
Eugene Yurtsev	429a0ee7fd	core[minor]: Add factory for looking up secrets from the env (#25198 ) Add factory method for looking secrets from the env.	2024-08-08 16:41:58 -04:00
Erick Friis	da9281feb2	cli: release 0.0.29 (#25196 )	2024-08-08 12:52:49 -07:00
Erick Friis	c6ece6a96d	core: autodetect more ls params (#25044 ) Co-authored-by: ccurme <chester.curme@gmail.com>	2024-08-08 12:44:21 -07:00
Eugene Yurtsev	86355640c3	experimental[patch]: Use get_fields adapter (#25193 ) Change all usages of __fields__ with get_fields adapter merged into langchain_core. Code mod generated using the following grit pattern: ``` engine marzano(0.1) language python `$X.__fields__` => `get_fields($X)` where { add_import(source="langchain_core.utils.pydantic", name="get_fields") } ```	2024-08-08 15:10:11 -04:00
Eugene Yurtsev	b9f65e5038	experimental[patch]: Migrate pydantic extra to literals (#25194 ) Migrate pydantic extra to literals Upgrade to using a literal for specifying the extra which is the recommended approach in pydantic 2. This works correctly also in pydantic v1. ```python from pydantic.v1 import BaseModel class Foo(BaseModel, extra="forbid"): x: int Foo(x=5, y=1) ``` And ```python from pydantic.v1 import BaseModel class Foo(BaseModel): x: int class Config: extra = "forbid" Foo(x=5, y=1) ``` ## Enum -> literal using grit pattern: ``` engine marzano(0.1) language python or { `extra=Extra.allow` => `extra="allow"`, `extra=Extra.forbid` => `extra="forbid"`, `extra=Extra.ignore` => `extra="ignore"` } ``` Resorted attributes in config and removed doc-string in case we will need to deal with going back and forth between pydantic v1 and v2 during the 0.3 release. (This will reduce merge conflicts.) ## Sort attributes in Config: ``` engine marzano(0.1) language python function sort($values) js { return $values.text.split(',').sort().join("\n"); } class_definition($name, $body) as $C where { $name <: `Config`, $body <: block($statements), $values = [], $statements <: some bubble($values) assignment() as $A where { $values += $A }, $body => sort($values), } ```	2024-08-08 19:05:54 +00:00
Eugene Yurtsev	30fb345342	core[minor]: Add from_env utility (#25189 ) Add a utility that can be used as a default factory The goal will be to start migrating from of the pydantic models to use `from_env` as a default factory if possible. ```python from pydantic import Field, BaseModel from langchain_core.utils import from_env class Foo(BaseModel): name: str = Field(default_factory=from_env('HELLO')) ```	2024-08-08 14:52:35 -04:00
Eugene Yurtsev	98779797fe	community[patch]: Use get_fields adapter for pydantic (#25191 ) Change all usages of __fields__ with get_fields adapter merged into langchain_core. Code mod generated using the following grit pattern: ``` engine marzano(0.1) language python `$X.__fields__` => `get_fields($X)` where { add_import(source="langchain_core.utils.pydantic", name="get_fields") } ```	2024-08-08 14:43:09 -04:00
Rajendra Kadam	663638d6a8	community[minor]: [SharePointLoader] Load extended metadata for the root folder (#24872 ) - Title: [SharePointLoader] Load extended metadata for the root folder - Description: - Ensure extended metadata loads correctly for the root folder. - Cleanup: Refactor SharePointLoader to remove unused fields(`file_id` & `site_id`). - Dependencies: NA - Add tests and docs: NA	2024-08-08 14:39:16 -04:00
Eugene Yurtsev	2f209d84fa	core[patch]: Add pydantic get_fields adapter (#25187 ) Add adapter to get fields	2024-08-08 17:47:42 +00:00
Eugene Yurtsev	c72e522e96	langchain[patch]: Upgrade pydantic extra (#25186 ) Upgrade to using a literal for specifying the extra which is the recommended approach in pydantic 2. This works correctly also in pydantic v1. ```python from pydantic.v1 import BaseModel class Foo(BaseModel, extra="forbid"): x: int Foo(x=5, y=1) ``` And ```python from pydantic.v1 import BaseModel class Foo(BaseModel): x: int class Config: extra = "forbid" Foo(x=5, y=1) ``` ## Enum -> literal using grit pattern: ``` engine marzano(0.1) language python or { `extra=Extra.allow` => `extra="allow"`, `extra=Extra.forbid` => `extra="forbid"`, `extra=Extra.ignore` => `extra="ignore"` } ``` Resorted attributes in config and removed doc-string in case we will need to deal with going back and forth between pydantic v1 and v2 during the 0.3 release. (This will reduce merge conflicts.) ## Sort attributes in Config: ``` engine marzano(0.1) language python function sort($values) js { return $values.text.split(',').sort().join("\n"); } class_definition($name, $body) as $C where { $name <: `Config`, $body <: block($statements), $values = [], $statements <: some bubble($values) assignment() as $A where { $values += $A }, $body => sort($values), } ```	2024-08-08 17:27:27 +00:00
Eugene Yurtsev	bf5193bb99	community[patch]: Upgrade pydantic extra (#25185 ) Upgrade to using a literal for specifying the extra which is the recommended approach in pydantic 2. This works correctly also in pydantic v1. ```python from pydantic.v1 import BaseModel class Foo(BaseModel, extra="forbid"): x: int Foo(x=5, y=1) ``` And ```python from pydantic.v1 import BaseModel class Foo(BaseModel): x: int class Config: extra = "forbid" Foo(x=5, y=1) ``` ## Enum -> literal using grit pattern: ``` engine marzano(0.1) language python or { `extra=Extra.allow` => `extra="allow"`, `extra=Extra.forbid` => `extra="forbid"`, `extra=Extra.ignore` => `extra="ignore"` } ``` Resorted attributes in config and removed doc-string in case we will need to deal with going back and forth between pydantic v1 and v2 during the 0.3 release. (This will reduce merge conflicts.) ## Sort attributes in Config: ``` engine marzano(0.1) language python function sort($values) js { return $values.text.split(',').sort().join("\n"); } class_definition($name, $body) as $C where { $name <: `Config`, $body <: block($statements), $values = [], $statements <: some bubble($values) assignment() as $A where { $values += $A }, $body => sort($values), } ```	2024-08-08 17:20:39 +00:00
Isaac Francisco	11adc09e02	[docs]: change rag reference in vector store pages (#25125 )	2024-08-08 10:08:14 -07:00
Anush	6b32810b68	qdrant: Update doc with usage snippets (#25179 ) ## Description This PR adds back snippets demonstrating sparse and hybrid retrieval in the Qdrant notebook. Without the snippets, it's hard to grok the usage.	2024-08-08 12:58:26 -04:00
Eugene Yurtsev	3da2713172	docs: Update pydantic compatibility (#25145 ) Update pydantic compatibility	2024-08-08 12:10:44 -04:00
Eugene Yurtsev	425f6ffa5b	core[patch]: Fix aindex API (#25155 ) A previous PR accidentally broke the aindex API by renaming a positional argument vectorstore into vector_store. This PR reverts this change.	2024-08-08 12:08:18 -04:00
Isaac Francisco	15a36dd0a2	[docs]: combine tools and toolkits (#25158 )	2024-08-08 08:59:02 -07:00
ololand	249945a572	Update polygon.py for business subscription (#25085 ) For business subscription the status is STOCKSBUSINESS not OK Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-08 15:28:41 +00:00
ccurme	59b8850909	groq[patch]: update rate limit in integration tests (#25177 ) Divide by ~2 to account for testing python 3.8 and 3.12 in parallel.	2024-08-08 13:33:25 +00:00
Chad Juliano	4828c441a7	docs: Update notebook name for Kinetica (#25149 ) Description: Change notebook description in documentation. Issue: N/A Dependencies: N/A	2024-08-08 09:27:29 -04:00
Francisco Kurucz	725e4912ae	docs: Fix reference to SQL QA migration (#25157 ) Description: I found that the link to the notebook in the Migration notes is broken, i found that it was linked to this file https://github.com/langchain-ai/langchain/blob/v0.0.250/docs/extras/use_cases/tabular/sql_query.ipynb and i think now this tutorial https://github.com/JuanFKurucz/langchain/blob/master/docs/docs/tutorials/sql_qa.ipynb is the best fit for this reference Twitter handle: @juanfkurucz --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-08 09:26:13 -04:00
ogawa	d895db11d6	community[patch]: gpt-4o-2024-08-06 costs (#25164 ) - Description: updated OpenAI cost definitions according to the following: - https://openai.com/api/pricing/ - Twitter handle: `@ogawa65a` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-08 13:22:11 +00:00
Brace Sproul	d77c7c4236	docs: Fix misspelling of instantiate in docs (#25107 )	2024-08-07 15:05:06 -07:00
Eugene Yurtsev	7b1a132aff	core[patch]: Add unit tests for Serializable (#25152 ) Add a few test cases for serializable (many other test cases already covered throguh runnable tests).	2024-08-07 21:01:36 +00:00
Bagatur	df99b832a7	core[patch]: support Field deprecation (#25004 ) ![Screenshot 2024-08-02 at 4 23 17 PM](https://github.com/user-attachments/assets/c757e093-877e-4af6-9dcd-984195454158)	2024-08-07 13:57:55 -07:00
ccurme	803eba3163	core[patch]: check for model_fields attribute (#25108 ) `__fields__` raises a warning in pydantic v2	2024-08-07 13:32:56 -07:00
Casey Clements	6e9a8b188f	mongodb: Add Hybrid and Full-Text Search Retrievers, release 0.2.0 (#25057 ) ## Description This pull-request extends the existing vector search strategies of MongoDBAtlasVectorSearch to include Hybrid (Reciprocal Rank Fusion) and Full-text via new Retrievers. There is a small breaking change in the form of the `prefilter` kwarg to search. For this, and because we have now added a great deal of features, including programmatic Index creation/deletion since 0.1.0, we plan to bump the version to 0.2.0. ### Checklist * Unit tests have been extended * formatting has been applied * One mypy error remains which will either go away in CI or be simplified. --------- Signed-off-by: Casey Clements <casey.clements@mongodb.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-07 20:10:29 +00:00
Isaac Francisco	f337408b0f	[docs]: add sidebar for different tool categories (#25065 )	2024-08-07 12:57:58 -07:00
Bagatur	0b4608f71e	infra: temp skip oai embeddings test (#25148 )	2024-08-07 17:51:39 +00:00
Bagatur	a4086119f8	openai[patch]: Release 0.1.21rc2 (#25146 )	2024-08-07 16:59:15 +00:00
Bagatur	b4c12346cc	core[patch]: Release 0.2.29 (#25126 )	2024-08-07 09:50:20 -07:00
Erick Friis	dff83cce66	core[patch]: base language model disable_streaming (#25070 ) Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-08-07 09:26:21 -07:00
eric-langenberg	130e80b60f	docs: rag.ipynb - fixing typo (#25142 ) Just changing gpt-3.5 to gpt-4o-mini . That's what's used in the code examples now. It just didn't get updated in the main text.	2024-08-07 16:02:22 +00:00
Bagatur	09fbce13c5	openai[patch]: ChatOpenAI.with_structured_output json_schema support (#25123 )	2024-08-07 08:09:07 -07:00
maang-h	0ba125c3cd	docs: Standardize QianfanLLMEndpoint LLM (#25139 ) - Description: Standardize QianfanLLMEndpoint LLM，include: - docs, the issue #24803 - model init arg names, the issue #20085	2024-08-07 10:57:27 -04:00
Eugene Yurtsev	28e0958ff4	core[patch]: Relax rate limit unit tests in terms of timing (#25140 ) Relax rate limit unit tests	2024-08-07 14:04:58 +00:00
Eray Eroğlu	a2e9910268	Documentation Update for Upstash Semantic Caching (#25114 ) Thank you for contributing to LangChain! - [ ] PR title: "Documentation Update : Semantic Caching Update for Upstash" - Docs, llm caching integrations update - Description: Upstash supports semantic caching, and we would like to inform you about this - Twitter handle: You can mention eray_eroglu_ if you want to post a tweet about the PR --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-07 14:02:07 +00:00
Pat Patterson	7e7fcf5b1f	community: Fix ValidationError on creating GPT4AllEmbeddings with no gpt4all_kwargs (#25124 ) - Description: Instantiating `GPT4AllEmbeddings` with no `gpt4all_kwargs` argument raised a `ValidationError`. Root cause: #21238 added the capability to pass `gpt4all_kwargs` through to the `GPT4All` instance via `Embed4All`, but broke code that did not specify a `gpt4all_kwargs` argument. - Issue: #25119 - Dependencies: None - Twitter handle: [`@metadaddy`](https://twitter.com/metadaddy)	2024-08-07 13:34:01 +00:00
Atanu Dasgupta	04dd8d3b0a	Update google_search.ipynb (#25135 ) updated with langchain_google_community instead as the latest revision Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-07 13:30:59 +00:00
ZhangShenao	63d84e93b9	patch[doc] Fix word spelling error (#25128 ) Fix word spelling error	2024-08-07 09:16:17 -04:00
Eugene Yurtsev	4d28c70000	core[patch]: Sort Config attributes (#25127 ) This PR does an aesthetic sort of the config object attributes. This will make it a bit easier to go back and forth between pydantic v1 and pydantic v2 on the 0.3.x branch	2024-08-07 02:53:50 +00:00
Erick Friis	46a47710b0	partners/milvus: release 0.1.4 (#25058 )	2024-08-06 16:29:29 -07:00
Erick Friis	35ebd2620c	infra,cli: template matching registration (#25110 )	2024-08-06 15:29:55 -07:00
ccurme	23c9aba575	groq[patch]: allow warnings during tests (#25105 ) Among integration packages in libs/partners, Groq is an exception in that it errors on warnings. Following https://github.com/langchain-ai/langchain/pull/25084, Groq fails with > pydantic.warnings.PydanticDeprecatedSince20: The `__fields__` attribute is deprecated, use `model_fields` instead. Deprecated in Pydantic V2.0 to be removed in V3.0. Here we update the behavior to no longer fail on warning, which is consistent with the rest of the packages in libs/partners.	2024-08-06 18:02:20 -04:00
Bagatur	1331e8589c	docs: oai chat nit (#25117 )	2024-08-06 22:00:42 +00:00
Bagatur	7882d5c978	openai[patch]: Release 0.1.21rc1 (#25116 )	2024-08-06 21:50:36 +00:00
Bagatur	70677202c7	core[patch]: Release 0.2.29rc1 (#25115 )	2024-08-06 21:36:56 +00:00
Bagatur	78403a3746	core[patch], openai[patch]: enable strict tool calling (#25111 ) Introduced https://openai.com/index/introducing-structured-outputs-in-the-api/	2024-08-06 21:21:06 +00:00
ccurme	5d10139fc7	docs[patch]: add to qa with sources guide (#25112 )	2024-08-06 17:08:35 -04:00
Eugene Yurtsev	d283f452cc	core[minor]: Add support for DocumentIndex in the index api (#25100 ) Support document index in the index api.	2024-08-06 12:30:49 -07:00
Virat Singh	264ab96980	community: Add stock market tools from financialdatasets.ai (#25025 ) Description: In this PR, I am adding three stock market tools from financialdatasets.ai (my API!): - get balance sheets - get cash flow statements - get income statements Twitter handle: [@virattt](https://twitter.com/virattt) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-06 18:28:12 +00:00
William FH	267855b3c1	Set Context in RunnableSequence & RunnableParallel (#25073 )	2024-08-06 11:10:37 -07:00
Naval Chand	71c0698ee4	Added bedrock 3-5 sonnet cost detials for BedrockAnthropicTokenUsageCallbackHandler (#25104 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Example: "community: Added bedrock 3-5 sonnet cost detials for BedrockAnthropicTokenUsageCallbackHandler" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Co-authored-by: Naval Chand <navalchand@192.168.1.36>	2024-08-06 17:28:47 +00:00
Isaac Francisco	a72fddbf8d	[docs]: vector store integration pages (#24858 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-06 17:20:27 +00:00
Bagatur	2c798622cd	docs: runnable docstring space (#25106 )	2024-08-06 16:46:50 +00:00
Bagatur	3abf1b6905	docs: versions sidebar (#25061 )	2024-08-06 09:23:43 -07:00
maang-h	1028af17e7	docs: Standardize Tongyi (#25103 ) - Description: Standardize Tongyi LLM，include: - docs, the issue #24803 - model init arg names, the issue #20085	2024-08-06 11:44:12 -04:00
Dobiichi-Origami	061ed250f6	delete the default model value from langchain and discard the need fo… (#24915 ) - description: I remove the limitation of mandatory existence of `QIANFAN_AK` and default model name which langchain uses cause there is already a default model nama underlying `qianfan` SDK powering langchain component. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-06 14:11:05 +00:00
Eugene Yurtsev	293a4a78de	core[patch]: Include dependencies in sys_info (#25076 ) `python -m langchain_core.sys_info` ```bash System Information ------------------ > OS: Linux > OS Version: #44~22.04.1-Ubuntu SMP PREEMPT_DYNAMIC Tue Jun 18 14:36:16 UTC 2 > Python Version: 3.11.4 (main, Sep 25 2023, 10:06:23) [GCC 11.4.0] Package Information ------------------- > langchain_core: 0.2.28 > langchain: 0.2.8 > langsmith: 0.1.85 > langchain_anthropic: 0.1.20 > langchain_openai: 0.1.20 > langchain_standard_tests: 0.1.1 > langchain_text_splitters: 0.2.2 > langgraph: 0.1.19 Optional packages not installed ------------------------------- > langserve Other Dependencies ------------------ > aiohttp: 3.9.5 > anthropic: 0.31.1 > async-timeout: Installed. No version info available. > defusedxml: 0.7.1 > httpx: 0.27.0 > jsonpatch: 1.33 > numpy: 1.26.4 > openai: 1.39.0 > orjson: 3.10.6 > packaging: 24.1 > pydantic: 2.8.2 > pytest: 7.4.4 > PyYAML: 6.0.1 > requests: 2.32.3 > SQLAlchemy: 2.0.31 > tenacity: 8.5.0 > tiktoken: 0.7.0 > typing-extensions: 4.12.2 ```	2024-08-06 09:57:39 -04:00
Dominik Fladung	ffa0c838d8	Allow ConfluenceLoader authorization via Personal Access Tokens (#25096 ) - community: Allow authorization to Confluence with bearer token - Description: Allow authorization to Confluence with [Personal Access Token](https://confluence.atlassian.com/enterprise/using-personal-access-tokens-1026032365.html) by checking for the keys `['client_id', token: ['access_token', 'token_type']]` - Issue: Currently the following error occurs when using an personal access token for authorization. ```python loader = ConfluenceLoader( url=os.getenv('CONFLUENCE_URL'), oauth2={ 'token': {"access_token": os.getenv("CONFLUENCE_ACCESS_TOKEN"), "token_type": "bearer"}, 'client_id': 'client_id', }, page_ids=['12345678'], ) ``` ``` ValueError: Error(s) while validating input: ["You have either omitted require keys or added extra keys to the oauth2 dictionary. key values should be `['access_token', 'access_token_secret', 'consumer_key', 'key_cert']`"] ``` With this PR the loader runs as expected. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-06 13:42:47 +00:00
orkhank	111c7df117	docs: update numbering of items in method docs (#25093 ) Some methods' doc strings have a wrong numbering of items. The numbers were adjusted accordingly	2024-08-06 09:21:52 -04:00
Bagatur	6eb42c657e	core[patch]: Remove default BaseModel init docstring (#25009 ) Currently a default init docstring gets appended to the class docstring of every BaseModel inherited object. This removes the default init docstring. ![Screenshot 2024-08-02 at 5 09 55 PM](https://github.com/user-attachments/assets/757fe4ae-a793-4e7d-8354-512de2c06818)	2024-08-06 01:04:04 +00:00
Gram Liu	88a9a6a758	core[patch]: Add pydantic metadata to subset model (#25032 ) - Description: This includes Pydantic field metadata in `_create_subset_model_v2` so that it gets included in the final serialized form that get sent out. - Issue: #25031 - Dependencies: n/a - Twitter handle: @gramliu --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-08-05 17:57:39 -07:00
BhujayKumarBhatta	8f33fce871	docs: change for optional variables in chatprompt (#25017 ) Fixes #24884	2024-08-05 23:57:44 +00:00
Erick Friis	423d286546	infra: check doc script skip index page (#25088 )	2024-08-05 16:38:30 -07:00
Bagatur	e572521f2a	core[patch]: exclude special pydantic init params (#25084 )	2024-08-05 23:32:51 +00:00
Isaac Francisco	63ddf0afb4	ollama: allow base_url, headers, and auth to be passed (#25078 )	2024-08-05 15:39:36 -07:00
Eugene Yurtsev	4bcd2aad6c	core[patch]: Relax time constraints on rate limit test (#25071 ) Try to keep the unit test fast, but also have it repeat more robustly	2024-08-05 17:04:22 -04:00
jigsawlabs-student	427a04151c	community: fix neo4j from_existing_graph (#24912 ) Fixes Neo4JVector.from_existing_graph integration with huggingface Previously threw an error with existing databases, because from_existing_graph query returns empty list of new nodes, which are then passed to embedding function, and huggingface errors with empty list. Fixes [24401](https://github.com/langchain-ai/langchain/issues/24401) --------- Co-authored-by: Jeff Katzy <jeffreyerickatz@gmail.com>	2024-08-05 21:01:46 +00:00
Tomaz Bratanic	d166967003	experimental: Add gliner graph transformer (#25066 ) You can use this with: ``` from langchain_experimental.graph_transformers import GlinerGraphTransformer gliner = GlinerGraphTransformer(allowed_nodes=["Person", "Organization", "Nobel"], allowed_relationships=["EMPLOYEE", "WON"]) from langchain_core.documents import Document text = """ Marie Curie, was a Polish and naturalised-French physicist and chemist who conducted pioneering research on radioactivity. She was the first woman to win a Nobel Prize, the first person to win a Nobel Prize twice, and the only person to win a Nobel Prize in two scientific fields. Her husband, Pierre Curie, was a co-winner of her first Nobel Prize, making them the first-ever married couple to win the Nobel Prize and launching the Curie family legacy of five Nobel Prizes. She was, in 1906, the first woman to become a professor at the University of Paris. """ documents = [Document(page_content=text)] gliner.convert_to_graph_documents(documents) ``` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-05 21:01:27 +00:00
Bagatur	a74e466507	docs: aws pydantic v2 compat (#25075 )	2024-08-05 20:47:11 +00:00
Bagatur	a02a09c973	docs: remove redundant deprecation warning (#25067 )	2024-08-05 18:44:47 +00:00
Eugene Yurtsev	41dfad5104	core[minor]: Introduce DocumentIndex abstraction (#25062 ) This PR adds a minimal document indexer abstraction. The goal of this abstraction is to allow developers to create custom retrievers that also have a standard indexing API and allow updating the document content in them. The abstraction comes with a test suite that can verify that the indexer implements the correct semantics. This is an iteration over a previous PRs (https://github.com/langchain-ai/langchain/pull/24364). The main difference is that we're sub-classing from BaseRetriever in this iteration and as so have consolidated the sync and async interfaces. The main problem with the current design is that runt time search configuration has to be specified at init rather than provided at run time. We will likely resolve this issue in one of the two ways: (1) Define a method (`get_retriever`) that will allow creating a retriever at run time with a specific configuration.. If we do this, we will likely break the subclass on BaseRetriever (2) Generalize base retriever so it can support structured queries --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-05 18:06:33 +00:00
Vkzem	e7b95e0802	docs: update exa search (#24861 ) - [x] PR title: "docs: changed example for Exa search retriever usage" - [x] PR message: - Description: Changed Exa integration doc at `docs/docs/integrations/tools/exa_search.ipynb` to better reflect simple Exa use case - Issue: move toward more canonical use of Exa method (`search_and_contents` rather than just `search`) - Dependencies: no dependencies; docs only change - Twitter handle: n/a - small change If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. - will do --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-05 17:41:33 +00:00
Stuart Marsh	16bd0697dc	milvus: fixed bug when using partition key and dynamic fields together (#25028 ) Description: This PR fixes a bug where if `enable_dynamic_field` and `partition_key_field` are enabled at the same time, a pymilvus error occurs. Milvus requires the partition key field to be a full schema defined field, and not a dynamic one, so it will throw the error "the specified partition key field {field} not exist" when creating the collection. When `enabled_dynamic_field` is set to `True`, all schema field creation based on `metadatas` is skipped. This code now checks if `partition_key_field` is set, and creates the field. Integration test added. Twitter handle: StuartMarshUK --------- Co-authored-by: Stuart Marsh <stuart.marsh@qumata.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-05 16:01:55 +00:00
Jim Baldwin	6890daa90c	community: make AthenaLoader profile_name optional and fix type hint (#24958 ) - Description: This PR makes the AthenaLoader profile_name optional and fixes the type hint which says the type is `str` but it should be `str` or `None` as None is handled in the loader init. This is a minor problem but it just confused me when I was using the Athena Loader to why we had to use a Profile, as I want that for local but not production. - Issue: #24957 - Dependencies: None.	2024-08-05 14:28:58 +00:00
Alexey Lapin	335894893b	langchain: Make RetryWithErrorOutputParser.from_llm() create a correct retry chain (#25053 ) Description: RetryWithErrorOutputParser.from_llm() creates a retry chain that returns a Generation instance, when it should actually just return a string. This class was forgotten when fixing the issue in PR #24687	2024-08-05 14:21:27 +00:00
Dobiichi-Origami	c5cb52a3c6	community: fix issue of the existence of numeric object in `additional_kwargs` a… (#24863 ) - Description: A previous PR breaks the code from `baidu_qianfan_endpoint.py` which causes the malfunction of streaming	2024-08-05 10:15:55 -04:00
ZhangShenao	cda79dbb6c	community[patch]: Optimize test case for `MoonshotChat` (#25050 ) Optimize test case for `MoonshotChat`. Use standard ChatModelIntegrationTests.	2024-08-05 10:11:25 -04:00
orkhank	cea3f72485	docs: fix comment lines in code blocks (#25054 ) The comments inside some code blocks seems to be misplaced. The comment lines containing explanation about `default_key` behavior when operating with prompts are updated.	2024-08-05 14:11:09 +00:00
ZhangShenao	02c35da445	doc[Retriever] Enhance api docs for `MultiQueryRetriever` (#25035 ) Enhance api docs for `MultiQueryRetriever`: - Complete missing parameters - Unify parameter name	2024-08-04 13:56:38 -04:00
Alex Sherstinsky	208042e0f2	community: Fix Predibase Integration for HuggingFace-hosted fine-tuned adapters (#25015 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-08-03 14:05:43 -07:00
maang-h	f5da0d6d87	docs: Standardize MiniMaxEmbeddings (#24983 ) - Description: Standardize MiniMaxEmbeddings - docs, the issue #24856 - model init arg names, the issue #20085	2024-08-03 14:01:23 -04:00
ZhangShenao	2c3e3dc6b1	patch[Partners] Unified fix of incorrect variable declarations in all check_imports (#25014 ) There are some incorrect declarations of variable `has_failure` in check_imports. The purpose of this PR is to uniformly fix these errors.	2024-08-03 13:49:41 -04:00
maang-h	7de62abc91	docs: Standardize SparkLLMTextEmbeddings docstrings (#25021 ) - Description: Standardize SparkLLMTextEmbeddings docstrings - Issue: the issue #24856	2024-08-03 13:44:09 -04:00
Tomaz Bratanic	f9a11a9197	Add relik transformer config (#25019 )	2024-08-03 08:41:45 -04:00
Bagatur	1dcee68cb8	docs: show beta directive (#25013 ) ![Screenshot 2024-08-02 at 7 15 34 PM](https://github.com/user-attachments/assets/086831c7-36f3-4962-98dc-d707b6289747)	2024-08-03 03:07:45 +00:00
Bagatur	e81ddb32a6	docs: fix kwargs docstring (#25010 ) Fix: ![Screenshot 2024-08-02 at 5 33 37 PM](https://github.com/user-attachments/assets/7c56cdeb-ee81-454c-b3eb-86aa8a9bdc8d)	2024-08-02 19:54:54 -07:00
Bagatur	57747892ce	docs: show deprecation warning first in api ref (#25001 ) OLD ![Screenshot 2024-08-02 at 3 29 39 PM](https://github.com/user-attachments/assets/7f169121-1202-4770-a006-d72ac7a1aa33) NEW ![Screenshot 2024-08-02 at 3 29 45 PM](https://github.com/user-attachments/assets/9cc07cbd-2ae9-4077-95c5-03cb051e6cd7)	2024-08-02 17:35:25 -07:00
Bagatur	679843abb0	docs: separate deprecated classes (#25007 ) ![Screenshot 2024-08-02 at 4 58 54 PM](https://github.com/user-attachments/assets/29424dd5-0593-4818-9eed-901ff47246b9)	2024-08-02 17:12:47 -07:00
Isaac Francisco	73570873ab	docs: standardizing tavily tool docs (#24736 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-02 22:25:27 +00:00
Isaac Francisco	2ae76cecde	[docs]: updating mistral and hugging face chat model pages (#24731 )	2024-08-02 15:21:25 -07:00
Bagatur	4305f78e40	core[patch]: Release 0.2.28 (#25000 )	2024-08-02 21:07:06 +00:00
Bagatur	64ccddf3cb	docs: fmt concepts (#24999 )	2024-08-02 20:35:45 +00:00
Bagatur	dd8e4cd020	text-splitters[patch]: Release 0.2.3 (#24998 )	2024-08-02 20:27:22 +00:00
Bagatur	0de0cd2d31	core[patch]: merge message runs nit (#24997 ) Only add separator if both chunks are non-empty	2024-08-02 20:25:43 +00:00
Bagatur	8e2316b8c2	community[patch]: Release 0.2.11 (#24989 )	2024-08-02 20:08:44 +00:00
ccurme	c2538e7834	experimental[patch]: bump min versions of core and community (#24996 ) Ollama functions unit test broken with min version of community.	2024-08-02 19:58:55 +00:00
ccurme	acba38a18e	docs: update toolkit guides (#24992 )	2024-08-02 15:51:05 -04:00
ccurme	22c1a4041b	community[patch]: support named arguments in github toolkit (#24986 ) Parameters may be passed in by name if generated from tool calls.	2024-08-02 18:27:32 +00:00
ccurme	4797b806c2	experimental[patch]: release 0.0.64 (#24990 )	2024-08-02 18:00:57 +00:00
Tomaz Bratanic	7061869aec	Add relik graph transformer (#24982 ) Relik is a new library for graph extraction that offers smaller and cheaper models for graph construction	2024-08-02 13:55:41 -04:00
Erick Friis	98c22e9082	docs: feature table component (#24985 )	2024-08-02 17:41:47 +00:00
ccurme	c04d95b962	standard-tests: set integration test parameters independent of unit test (#24979 ) This ends up getting set in integration tests.	2024-08-02 10:40:11 -07:00
gbaian10	54e9ea433a	fix: Modify the order of init_chat_model import ollama package. (#24977 )	2024-08-02 08:32:56 -07:00
David Gao	fe1820cdaf	docs: add wikipedia integration docs (#24932 ) Dear langchain maintainers, I add the wikipedia integration docs according to the [web docs](https://python.langchain.com/v0.2/docs/integrations/retrievers/wikipedia/), and follow the format of [tavily example](https://github.com/langchain-ai/langchain/blob/master/docs/docs/integrations/retrievers/tavily.ipynb) and [retriever template](https://github.com/langchain-ai/langchain/blob/master/libs/cli/langchain_cli/integration_template/docs/retrievers.ipynb), this is my first time contributing large repo. please let me know if I'm doing anything wrong, thank you! Topic related: #24908 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-02 10:12:04 -04:00
ZhangShenao	71c0564c9f	community[patch]: Add test case for MoonshotChat (#24960 ) Add test case for `MoonshotChat`.	2024-08-02 09:37:31 -04:00
ZhangShenao	c65e48996c	patch[partners] Fix check_imports bugs in pinecone and milvus (#24971 ) Fix wrong declared variables of `check_imports` in pinecone and milvus	2024-08-02 09:27:11 -04:00
Isaac Francisco	d7688a4328	community[patch]: adding artifact to Tavily search (#24376 ) This allows you to get raw content as well as the answer, instead of just getting the results. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-01 21:12:11 -07:00
Bagatur	7b08de8909	langchain[patch]: Release 0.2.12 (#24954 )	2024-08-02 04:04:49 +00:00
Bagatur	245cb5a252	core[patch]: Release 0.2.27 (#24952 )	2024-08-02 01:43:24 +00:00
Bagatur	199e9c5ae0	core[patch]: Fix tool args schema inherited field parsing (#24936 ) Fix #24925	2024-08-01 18:36:33 -07:00
Bagatur	fba65ba04f	infra: test core on py 3.9, 10, 11 (#24951 )	2024-08-01 18:23:37 -07:00
Leonid Ganeline	4092876863	core: docstrings `BaseCallbackHandler update (#24948 ) Added missed docstrings	2024-08-01 20:46:53 -04:00
ccurme	6e45dba471	docs: fix redirect (#24950 )	2024-08-01 20:45:54 -04:00
WU LIFU	ad16eed119	core[patch]: runnable config ensure_config deep copy from var_child_runnable… (#24862 ) issue: #24660 RunnableWithMessageHistory.stream result in error because the [evaluation](https://github.com/langchain-ai/langchain/blob/master/libs/core/langchain_core/runnables/branch.py#L220) of the branch [condition](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L328C1-L329C1)`) unexpectedly trigger the "[on_end](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L332)`)" (exit_history) callback of the default branch descriptions After a lot of investigation I'm convinced that the root cause is that 1. during the execution of the runnable, the [var_child_runnable_config](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L122)`) is shared between the branch [condition](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L328C1-L329C1)`) runnable and the [default branch runnable](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L332)`) within the same context 2. when the default branch runnable runs, it gets the [var_child_runnable_config](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L163)`) and may unintentionally [add more handlers ](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L325)`)to the callback manager of this config 3. when it is again the turn for the [condition](`99eb31ec41/libs/core/langchain_core/runnables/history.py (L328C1-L329C1)`) to run, it gets the `var_child_runnable_config` whose callback manager has the handlers added by the default branch. When it runs that handler (`exit_history`) it leads to the error with the assumption that, the `ensure_config` function actually does want to create a immutable copy from `var_child_runnable_config` because it starts with an [`empty` variable ](`99eb31ec41/libs/core/langchain_core/runnables/config.py (L156)`), i go ahead to do a deepcopy to ensure that future modification to the returned value won't affect the `var_child_runnable_config` variable Having said that I actually 1. don't know if this is a proper fix 2. don't know whether it will lead to other unintended consequence 3. don't know why only "stream" runs into this issue while "invoke" runs without problem so @nfcampos @hwchase17 please help review, thanks! --------- Co-authored-by: Lifu Wu <lifu@nextbillion.ai> Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-08-01 17:30:32 -07:00
Jacob Lee	3ab09d87d6	docs[patch]: Adds components for prereqs, compatibility, fix chat model tab issue (#24585 ) Added to `docs/how_to/tools_runtime` as a proof of concept, will apply everywhere if we like. A bit more compact than the default callouts, will help standardize the layout of our pages since we frequently use these boxes. <img width="1088" alt="Screenshot 2024-07-23 at 4 49 02 PM" src="https://github.com/user-attachments/assets/7380801c-e092-4d31-bcd8-3652ee05f29e">	2024-08-01 15:04:13 -07:00
ccurme	9cb69a8746	docs: update retriever template, add arxiv retriever (#24947 )	2024-08-01 16:53:18 -04:00
Casey Clements	db3ceb4d0a	partners/mongodb: Improved search index commands (#24745 ) Hardens index commands with try/except for free clusters and optional waits for syncing and tests. [efriis](https://github.com/efriis) These are the upgrades to the search index commands (CRUD) that I mentioned. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-01 20:16:32 +00:00
ccurme	db42576b09	docs: delete old migration guide (#24881 ) Redirects to https://python.langchain.com/v0.2/docs/versions/migrating_chains/	2024-08-01 16:11:47 -04:00
Ikko Eltociear Ashimine	be5294e35d	docs: update agents.ipynb (#24945 ) initalize -> initialize	2024-08-01 14:37:37 -04:00
ccurme	41ed23a050	docs: update retriever integration pages (#24931 )	2024-08-01 14:37:07 -04:00
maang-h	ea505985c4	docs: Standardize ZhipuAIEmbeddings docstrings (#24933 ) - Description: Standardize ZhipuAIEmbeddings rich docstrings. - Issue: the issue #24856	2024-08-01 14:06:53 -04:00
ccurme	02db66d764	docs: fix kv store column headers (#24941 ) ![Screenshot 2024-08-01 at 12 32 19 PM](https://github.com/user-attachments/assets/888056b7-3065-4be0-a6b8-bcab5b729c2c)	2024-08-01 09:49:36 -07:00
Anneli Samuel	2204d8cb7d	community[patch]: Invoke on_llm_new_token callback before yielding chunk (#24938 ) Description: Invoke on_llm_new_token callback before yielding chunk in streaming mode Issue: [#16913](https://github.com/langchain-ai/langchain/issues/16913)	2024-08-01 16:39:04 +00:00
John	ff6274d32d	docs: update langchain-unstructured docs (#24935 ) - Description: The UnstructuredClient will have a breaking change in the near future. Add a note in the docs that the examples here may not use the latest version and users should refer to the SDK docs for the latest info.	2024-08-01 16:27:40 +00:00
ccurme	c72f0d2f20	docs: update toolkit integration pages (#24887 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-01 12:13:08 -04:00
Eugene Yurtsev	75776e4a54	core[patch]: In unit tests, use `_schema()` instead of BaseModel.schema() (#24930 ) This PR introduces a module with some helper utilities for the pydantic 1 -> 2 migration. They're meant to be used in the following way: 1) Use the utility code to get unit tests pass without requiring modification to the unit tests 2) (If desired) upgrade the unit tests to match pydantic 2 output 3) (If desired) stop using the utility code Currently, this module contains a way to map `schema()` generated by pydantic 2 to (mostly) match the output from pydantic v1.	2024-08-01 11:59:04 -04:00
Serena Ruan	1827bb4042	community[patch]: support bind_tools for ChatMlflow (#24547 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - Description: Support ChatMlflow.bind_tools method Tested in Databricks: <img width="836" alt="image" src="https://github.com/user-attachments/assets/fa28ef50-0110-4698-8eda-4faf6f0b9ef8"> - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Signed-off-by: Serena Ruan <serena.rxy@gmail.com>	2024-08-01 08:43:07 -07:00
Michal Gregor	769c3bb838	huggingface: Added a missing argument to a ChatHuggingFace doc notebook. (#24929 ) - Description: When adding docs for constructing ChatHuggingFace using a HuggingFacePipeline, I forgot to add `return_full_text=False` as an argument. In this setup, the chat response would incorrectly contain all the input text. I am fixing that here by adding that line to the offending notebook. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-01 15:42:35 +00:00
BottlePumpkin	bfc59c1d26	community: Fix KeyError in NotionDB loader when 'name' is missing (#24224 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Description: This PR fixes a KeyError in NotionDBLoader when the "name" key is missing in the "people" property. Issue: Fixes #24223 Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-08-01 13:55:40 +00:00
alexqiao	8eb0bdead3	community[patch]: Invoke callback prior to yielding token (#24917 ) Description: Invoke callback prior to yielding token in stream method for chat_models . Issue: https://github.com/langchain-ai/langchain/issues/16913 #16913	2024-08-01 13:19:55 +00:00
ZhangShenao	b2dd9ffaaf	patch[cli] Fix bug in `check_imports.py` (#24918 ) The variable `has_failure` in check_imports.py is wrong-declared. It's actually an another variable.	2024-08-01 09:08:12 -04:00
Jacob Lee	f14121faaf	docs[patch]: Update local RAG tutorial (#24909 )	2024-07-31 19:19:23 -07:00
Bagatur	b7abac9f92	infra: poetry lock root (#24913 )	2024-08-01 01:19:34 +00:00
Jacob Lee	42c686bc28	docs[patch]: Update local model how-to guide (#24911 ) Updates to use `langchain_ollama`, new models, chat model example	2024-07-31 18:01:55 -07:00
Erick Friis	600fc233ef	partners/ollama: release 0.1.1 (#24910 )	2024-07-31 17:31:29 -07:00
Bagatur	25b93cc4c0	core[patch]: stringify tool non-content blocks (#24626 ) Slightly breaking bugfix. Shouldn't cause too many issues since no models would be able to handle non-content block ToolMessage.content anyways.	2024-07-31 16:42:38 -07:00
Bagatur	492df75937	docs: chat model table nit (#24907 )	2024-07-31 15:14:27 -07:00
Bagatur	a24c445e02	docs: cleanup readme (#24905 )	2024-07-31 15:03:28 -07:00
Jacob Lee	5098f9dc79	infra: related section in docs (#24829 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-31 14:25:58 -07:00
Nikita Pakunov	c776471ac6	community: fix AttributeError: 'YandexGPT' object has no attribute '_grpc_metadata' (#24432 ) Fixes #24049 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-31 21:18:33 +00:00
Bagatur	752a71b688	integrations[patch]: release model packages (#24900 )	2024-07-31 20:48:20 +00:00
Jacob Lee	1213a59f87	docs[patch]: Update kv store docs pages (#24848 )	2024-07-31 13:23:24 -07:00
Erick Friis	17a06cb7a6	infra: check templates based on integration (#24857 ) instead of hardcoding a linter for each, iterate through the lines of the template notebook and find lines that start with `##` (includes lower headings), and enforce that those headings are found in new docs that are contributed	2024-07-31 13:19:50 -07:00
Erick Friis	a7380dd531	cli: release 0.0.28 (#24852 )	2024-07-31 13:03:24 -07:00
Erick Friis	e98e4be0f7	cli: register new integration doc templates (#24854 ) - wait to merge for retriever.ipynb merge #24836	2024-07-31 13:03:05 -07:00
Eugene Yurtsev	210623b409	core[minor]: Add support for pydantic 2 to utility to get fields (#24899 ) Add compatibility for pydantic 2 for a utility function. This will help push some small changes to master, so they don't have to be kept track of on a separate branch.	2024-07-31 19:11:07 +00:00
Bagatur	7d1694040d	core[patch]: Release 0.2.26 (#24898 )	2024-07-31 19:00:50 +00:00
Eugene Yurtsev	add16111b9	community[patch]: Make the pydantic linter stricter (#24897 ) Stricter linting of deprecated pydantic features.	2024-07-31 18:57:37 +00:00
Eugene Yurtsev	a4a444f73d	community[patch]: Fix arcee llm usage of root_validator(pre=False) (#24896 ) Should be pre=True	2024-07-31 18:49:20 +00:00
Eugene Yurtsev	69c656aa5f	langchain[minor]: Upgrade ambiguous root_validator to @pre_init (#24895 ) The @pre_init validator is a temporary solution for base models. It has similar (but not identical) semantics to @root_validator(), but it works strictly as a pre-init validator. It'll work as expected as long as the pydantic model type hints were correct.	2024-07-31 18:46:47 +00:00
Eugene Yurtsev	5099a9c9b4	core[patch]: Update unit tests with a workaround for using AnyID in pydantic 2 (#24892 ) Pydantic 2 ignores __eq__ overload for subclasses of strings.	2024-07-31 14:42:12 -04:00
Bagatur	8461934c2b	core[patch], integrations[patch]: convert TypedDict to tool schema support (#24641 ) supports following UX ```python class SubTool(TypedDict): """Subtool docstring""" args: Annotated[Dict[str, Any], {}, "this does bar"] class Tool(TypedDict): """Docstring Args: arg1: foo """ arg1: str arg2: Union[int, str] arg3: Optional[List[SubTool]] arg4: Annotated[Literal["bar", "baz"], ..., "this does foo"] arg5: Annotated[Optional[float], None] ``` - can parse google style docstring - can use Annotated to specify default value (second arg) - can use Annotated to specify arg description (third arg) - can have nested complex types	2024-07-31 18:27:24 +00:00
Eugene Yurtsev	d24b82357f	community[patch]: Add missing annotations (#24890 ) This PR adds annotations in comunity package. Annotations are only strictly needed in subclasses of BaseModel for pydantic 2 compatibility. This PR adds some unnecessary annotations, but they're not bad to have regardless for documentation pages.	2024-07-31 18:13:44 +00:00
Eugene Yurtsev	7720483432	langchain[patch]: Update unit tests to workaround a pydantic 2 issue (#24886 ) This will allow our unit tests to pass when using AnyID() with our pydantic models.	2024-07-31 14:09:40 -04:00
Eugene Yurtsev	2019e31bc5	langchain[patch]: Add missing type annotations (#24889 ) Adds missing type annotations in preparation for pydantic 2 upgrade.	2024-07-31 14:09:22 -04:00
ccurme	30f18c7b02	docs: add retriever integrations template (#24836 )	2024-07-31 13:50:44 -04:00
Anirudh31415926535	4da3d4b18e	docs: Minor corrections and updates to Cohere docs (#22726 ) - Description: Update the Cohere's provider and RagRetriever documentations with latest updates. - Twitter handle: Anirudh1810	2024-07-31 10:16:26 -07:00
ccurme	40b4a3de6e	docs: update chat model integration pages (#24882 ) to conform with template	2024-07-31 11:26:52 -04:00
Nishan Jain	b00c0fc558	[Community][minor]: Added prompt governance in pebblo_retrieval (#24874 ) Title: [pebblo_retrieval] Identifying entities in prompts given in PebbloRetrievalQA leading to prompt governance Description: Implemented identification of entities in the prompt using Pebblo prompt governance API. Issue: NA Dependencies: NA Add tests and docs: NA	2024-07-31 13:14:51 +00:00
Rajendra Kadam	a6add89bd4	community[minor]: [PebbloSafeLoader] Implement content-size-based batching (#24871 ) - Title: [PebbloSafeLoader] Implement content-size-based batching in the classification flow(loader/doc API) - Description: - Implemented content-size-based batching in the loader/doc API, set to 100KB with no external configuration option, intentionally hard-coded to prevent timeouts. - Remove unused field(pb_id) from doc_metadata - Issue: NA - Dependencies: NA - Add tests and docs: Updated	2024-07-31 09:10:28 -04:00
TrumanYan	096b66db4a	community: replace it with Tencent Cloud SDK (#24172 ) Description: The old method will be discontinued; use the official SDK for more model options. Issue: None Dependencies: None Twitter handle: None Co-authored-by: trumanyan <trumanyan@tencent.com>	2024-07-31 09:05:38 -04:00
Erick Friis	99eb31ec41	cli: embed docstring template (#24855 )	2024-07-31 02:16:40 +00:00
Noah Peterson	4b2a8ce6c7	docs: Shorten unreasonably long OllamaEmbeddings page (#24850 ) This change removes excessive embeddings output in the Jupyter Notebook on the [Ollama text embedding page](https://python.langchain.com/v0.2/docs/integrations/text_embedding/ollama/)	2024-07-31 01:57:04 +00:00
Erick Friis	3999e9035c	cli/docs: embedding template standardization (#24849 )	2024-07-30 18:54:03 -07:00
Bagatur	1181c10c65	docs: reorder integrations sidebar (#24847 )	2024-07-30 16:58:26 -07:00
Bagatur	943126c5fd	docs: chat model pkg links (#24845 )	2024-07-30 16:26:06 -07:00
Erick Friis	1f5444817a	community: deprecate BedrockEmbeddings in favor of langchain-aws (#24846 )	2024-07-30 23:13:17 +00:00
Jacob Lee	21eb4c9e5d	docs[patch]: Adds first kv store doc matching new template (#24844 )	2024-07-30 15:58:51 -07:00
Bagatur	a4e940550a	docs: integrations custom callout (#24843 )	2024-07-30 22:48:18 +00:00
Bagatur	61ecb10a77	docs: partner pkg table (#24840 )	2024-07-30 15:28:10 -07:00
Erick Friis	b099cc3507	cli: release 0.0.27 (#24842 )	2024-07-30 22:07:50 +00:00
Bagatur	419f2c2585	cli[patch]: tool integration templates (#24837 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-30 14:59:33 -07:00
mschoenb97IL	19b127f640	langchain: Update Langchain -> Langgraph migration docs for the deprecation of the `messages_modifier` parameter. (#24839 ) Description: Updated the Langgraph migration docs to use `state_modifier` rather than `messages_modifier` Issue: N/A Dependencies: N/A - [ X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-30 21:28:32 +00:00
ccurme	c123cb2b30	docs: update migration guide (#24835 ) Move to its own section in the sidebar.	2024-07-30 20:17:12 +00:00
Erick Friis	957b05b8d5	infra: py3.11 for community integration test compiling (#24834 ) e.g. https://github.com/langchain-ai/langchain/actions/runs/10167754785/job/28120861343?pr=24833	2024-07-30 18:43:10 +00:00
Erick Friis	88418af3f5	core: release 0.2.25 (#24833 )	2024-07-30 18:41:09 +00:00
Bagatur	37b060112a	langchain[patch]: fix ollama in init_chat_model (#24832 )	2024-07-30 18:38:53 +00:00
Jerron Lim	d8f3ea82db	langchain[patch]: init_chat_model() to import ChatOllama from langchain-ollama and fallback on langchain-community (#24821 ) Description: init_chat_model() should import ChatOllama from `langchain-ollama`. If that fails, fallback to `langchain-community`	2024-07-30 11:16:10 -07:00
Eugene Yurtsev	3a7f3d46c3	docs: Add pydantic compatibility to side bar (#24826 ) Add pydantic compatibility to side bar	2024-07-30 14:10:48 -04:00
Isaac Francisco	511242280b	[docs]: standardize vectorstores (#24797 )	2024-07-30 10:38:04 -07:00
Jacob Lee	ac649800df	docs[patch]: Adds kv store integration docs template (#24804 )	2024-07-30 10:07:57 -07:00
cffranco94	b01d938997	experimental: Add config to convert_to_graph_documents (#24012 ) PR title: Experimental: Add config to convert_to_graph_documents Description: In order to use langfuse, i need to pass the langfuse configuration when invoking the chain. langchain_experimental does not allow to add any parameters (beside the documents) to the convert_to_graph_documents method. This way, I cannot monitor the chain in langfuse. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Catarina Franco <catarina.franco@criticalsoftware.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-30 17:01:06 +00:00
Shailendra Mishra	f2d810b3c0	clob_bugfix... (#24813 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-30 12:44:04 -04:00
Anush	51b15448cc	community: Fix FastEmbedEmbeddings (#24462 ) ## Description This PR: - Fixes the validation error in `FastEmbedEmbeddings`. - Adds support for `batch_size`, `parallel` params. - Removes support for very old FastEmbed versions. - Updates the FastEmbed doc with the new params. Associated Issues: - Resolves #24039 - Resolves #https://github.com/qdrant/fastembed/issues/296	2024-07-30 12:42:46 -04:00
ccurme	73ec24fc56	docs[patch]: add toolkit template (#24791 )	2024-07-30 12:36:09 -04:00
Tamir Zitman	b3e1378f2b	langchain : text_splitters Added PowerShell (#24582 ) - Description: Added PowerShell support for text splitters language include docs relevant update - Issue: None - Dependencies: None --------- Co-authored-by: tzitman <tamir.zitman@intel.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-30 16:13:52 +00:00
ccurme	187ee96f7a	docs: update chat model feature table (#24822 )	2024-07-30 09:06:42 -07:00
Nuno Campos	68ecebf1ec	core: Fix implementation of trim_first_node/trim_last_node to use exact same definition of first/last node as in the getter methods (#24802 )	2024-07-30 08:44:27 -07:00
Igor Drozdov	c2706cfb9e	feat(community): add tools support for litellm (#23906 ) I used the following example to validate the behavior ```python from langchain_core.prompts import ChatPromptTemplate from langchain_core.runnables import ConfigurableField from langchain_anthropic import ChatAnthropic from langchain_community.chat_models import ChatLiteLLM from langchain_core.tools import tool from langchain.agents import create_tool_calling_agent, AgentExecutor @tool def multiply(x: float, y: float) -> float: """Multiply 'x' times 'y'.""" return x * y @tool def exponentiate(x: float, y: float) -> float: """Raise 'x' to the 'y'.""" return x**y @tool def add(x: float, y: float) -> float: """Add 'x' and 'y'.""" return x + y prompt = ChatPromptTemplate.from_messages([ ("system", "you're a helpful assistant"), ("human", "{input}"), ("placeholder", "{agent_scratchpad}"), ]) tools = [multiply, exponentiate, add] llm = ChatAnthropic(model="claude-3-sonnet-20240229", temperature=0) # llm = ChatLiteLLM(model="claude-3-sonnet-20240229", temperature=0) agent = create_tool_calling_agent(llm, tools, prompt) agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True) agent_executor.invoke({"input": "what's 3 plus 5 raised to the 2.743. also what's 17.24 - 918.1241", }) ``` `ChatAnthropic` version works: ``` > Entering new AgentExecutor chain... Invoking: `exponentiate` with `{'x': 5, 'y': 2.743}` responded: [{'text': 'To calculate 3 + 5^2.743, we can use the "exponentiate" and "add" tools:', 'type': 'text', 'index': 0}, {'id': 'toolu_01Gf54DFTkfLMJQX3TXffmxe', 'input': {}, 'name': 'exponentiate', 'type': 'tool_use', 'index': 1, 'partial_json': '{"x": 5, "y": 2.743}'}] 82.65606421491815 Invoking: `add` with `{'x': 3, 'y': 82.65606421491815}` responded: [{'id': 'toolu_01XUq9S56GT3Yv2N1KmNmmWp', 'input': {}, 'name': 'add', 'type': 'tool_use', 'index': 0, 'partial_json': '{"x": 3, "y": 82.65606421491815}'}] 85.65606421491815 Invoking: `add` with `{'x': 17.24, 'y': -918.1241}` responded: [{'text': '\n\nSo 3 + 5^2.743 = 85.66\n\nTo calculate 17.24 - 918.1241, we can use:', 'type': 'text', 'index': 0}, {'id': 'toolu_01BkXTwP7ec9JKYtZPy5JKjm', 'input': {}, 'name': 'add', 'type': 'tool_use', 'index': 1, 'partial_json': '{"x": 17.24, "y": -918.1241}'}] -900.8841[{'text': '\n\nTherefore, 17.24 - 918.1241 = -900.88', 'type': 'text', 'index': 0}] > Finished chain. ``` While `ChatLiteLLM` version doesn't. But with the changes in this PR, along with: - https://github.com/langchain-ai/langchain/pull/23823 - https://github.com/BerriAI/litellm/pull/4554 The result is _almost_ the same: ``` > Entering new AgentExecutor chain... Invoking: `exponentiate` with `{'x': 5, 'y': 2.743}` responded: To calculate 3 + 5^2.743, we can use the "exponentiate" and "add" tools: 82.65606421491815 Invoking: `add` with `{'x': 3, 'y': 82.65606421491815}` 85.65606421491815 Invoking: `add` with `{'x': 17.24, 'y': -918.1241}` responded: So 3 + 5^2.743 = 85.66 To calculate 17.24 - 918.1241, we can use: -900.8841 Therefore, 17.24 - 918.1241 = -900.88 > Finished chain. ``` If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-30 15:39:34 +00:00
David Robertson	bfb7f8d40a	Brave Search: Enhance search result details with extra snippets (#19209 ) Description: This update significantly improves the Brave Search Tool's utility within the LangChain library by enriching the search results it returns. The tool previously returned title, link, and snippet, with the snippet being a truncated 140-character description from the search engine. To make the search results more informative, this update enables extra_snippets by default and introduces additional result fields: title, link, description (enhancing and renaming the former snippet field), age, and snippets. The snippets field provides a list of strings summarizing the webpage, utilizing Brave's capability for more detailed search insights. This enhancement aims to make the search tool far more informative and beneficial for users. Issue: N/A Dependencies: No additional dependencies introduced. Twitter handle: @davidalexr987 Code Changes Summary: - Changed the default setting to include extra_snippets in search results. - Renamed the snippet field to description to accurately reflect its content and included an age field for search results. - Introduced a snippets field that lists webpage summaries, providing users with comprehensive search result insights. Backward Compatibility Note: The renaming of snippet to description improves the accuracy of the returned data field but may impact existing users who have developed integration's or analyses based on the snippet field. I believe this change is essential for clarity and utility, and it aligns better with the data provided by Brave Search. Additional Notes: This proposal focuses exclusively on the Brave Search package, without affecting other LangChain packages or introducing new dependencies.	2024-07-30 15:29:38 +00:00
Eugene Yurtsev	873f64751e	docs: Remove danger on how to migrate to astream events v2 (#24825 ) Users should migrate to v2 now	2024-07-30 15:28:07 +00:00
Ben Chambers	435771fe74	[community]: Fix package name mismatch (#24824 ) - Description: fix a mismatch in pypi package names	2024-07-30 11:21:39 -04:00
ccurme	b7bbfc7c67	langchain: revert "init_chat_model() to support ChatOllama from langchain-ollama" (#24819 ) Reverts langchain-ai/langchain#24818 Overlooked discussion in https://github.com/langchain-ai/langchain/pull/24801.	2024-07-30 14:23:36 +00:00
Jerron Lim	5abfc85fec	langchain: init_chat_model() to support ChatOllama from langchain-ollama (#24818 ) Description: Since moving away from `langchain-community` is recommended, `init_chat_models()` should import ChatOllama from `langchain-ollama` instead.	2024-07-30 10:17:38 -04:00
Eugene Yurtsev	4fab8996cf	docs: Update pydantic compatibility (#24625 ) Update pydantic compatibility. This will only be true after we release the partner packages.	2024-07-29 22:19:00 -04:00
Jacob Lee	d6ca1474e0	docs[patch]: Adds key-value store to conceptual guide (#24798 )	2024-07-29 18:45:16 -07:00
Erick Friis	cdaea17b3e	cli/docs: llm integration template standardization (#24795 )	2024-07-29 17:47:13 -07:00
Bagatur	a6d1fb4275	core[patch]: introduce ToolMessage.status (#24628 ) Anthropic models (including via Bedrock and other cloud platforms) accept a status/is_error attribute on tool messages/results (specifically in `tool_result` content blocks for Anthropic API). Adding a ToolMessage.status attribute so that users can set this attribute when using those models	2024-07-29 14:01:53 -07:00
Isaac Francisco	78d97b49d9	[partner]: ollama llm fix (#24790 )	2024-07-29 13:00:02 -07:00
maang-h	4bb1a11e02	community: Add MiniMaxChat bind_tools and structured output (#24310 ) - Description: - Add `bind_tools` method to support tool calling - Add `with_structured_output` method to support structured output	2024-07-29 15:51:52 -04:00
John	0a2ff40fcc	partners/unstructured: fix client api_url (#24680 ) Description: Add empty string default for api_key and change `server_url` to `url` to match existing loaders. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-29 11:16:41 -07:00
maang-h	bf685c242f	docs: Standardize QianfanEmbeddingsEndpoint (#24786 ) - Description: Standardize QianfanEmbeddingsEndpoint, include: - docstrings, the issue #21983 - model init arg names, the issue #20085	2024-07-29 13:19:24 -04:00
ccurme	9998e55936	core[patch]: support tool calls with non-pickleable args in tools (#24741 ) Deepcopy raises with non-pickleable args.	2024-07-29 13:18:39 -04:00
Erick Friis	df78608741	mongodb: bson optional import (#24685 )	2024-07-29 09:54:01 -07:00
M. Ali	c086410677	fix docs typos (#23668 ) Thank you for contributing to LangChain! - [x] PR title: "docs: fix multiple typos" Co-authored-by: mohblnk <mohamed.ali@blnk.ai> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-29 16:10:55 +00:00
Pere Pasamonte	98175860ad	community: Fix AWS DocumentDB similarity_search when filter is None (#24777 ) Description Fixes DocumentDBVectorSearch similarity_search when no filter is used; it defaults to None but $match does not accept None, so changed default to empty {} before pipeline is created. Issue AWS DocumentDB similarity search does not work when no filter is used. Error msg: "the match filter must be an expression in an object" #24775 Dependencies No dependencies Twitter handle https://x.com/perepasamonte --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-29 15:32:05 +00:00
Lennart J. Kurzweg	7da0597ecb	partners[ollama]: Support seed parameter for ChatOllama (#24782 ) ## Description Adds seed parameter to ChatOllama ## Resolves Issues - #24703 ## Dependency Changes None Co-authored-by: Lennart J. Kurzweg (Nx2) <git@nx2.site>	2024-07-29 15:15:20 +00:00
ccurme	e264ccf484	standard-tests[patch]: update groq and structured output test (#24781 ) - Mixtral with Groq has started consistently failing tool calling tests. Here we restrict testing to llama 3.1. - `.schema` is deprecated in pydantic proper in favor of `.model_json_schema`.	2024-07-29 11:10:01 -04:00
ZhangShenao	4a05679fdb	patch[experimental] Fix prompt in `GenerativeAgentMemory` (#24771 ) There is an issue with the prompt format in `GenerativeAgentMemory` , try to fix it. The prompt is same as the one in method `_score_memory_importance`.	2024-07-29 07:02:31 -04:00
WU LIFU	2ba8393182	graph_transformers: bug fix for create_simple_model not passing in ll… (#24643 ) issue: #24615 descriptions: The _Graph pydantic model generated from create_simple_model (which LLMGraphTransformer uses when allowed nodes and relationships are provided) does not constrain the relationships (source and target types, relationship type), and the node and relationship properties with enums when using ChatOpenAI. The issue is that when calling optional_enum_field throughout create_simple_model the llm_type parameter is not passed in except for when creating node type. Passing it into each call fixes the issue. Co-authored-by: Lifu Wu <lifu@nextbillion.ai>	2024-07-29 07:00:56 -04:00
William FH	01ab2918a2	core[patch]: Respect injected in bound fns (#24733 ) Since right now you cant use the nice injected arg syntas directly with model.bind_tools()	2024-07-28 15:45:19 -07:00
Pavel	7fcfe7c1f4	openai[patch]: openai proxy added to base embeddings (#24539 ) - [ ] PR title: "langchain-openai: openai proxy added to base embeddings" - [ ] PR message: - Description: Dear langchain developers, You've already supported proxy for ChatOpenAI implementation in your package. At the same time, if somebody needed to use proxy for chat, it also could be necessary to be able to use it for OpenAIEmbeddings. That's why I think it's important to add proxy support for OpenAI embeddings. That's what I've done in this PR. @baskaryan --------- Co-authored-by: karpov <karpov@dohod.ru> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-07-28 20:54:13 +00:00
Lakshmi Peri	821196c4ee	langchain-aws InMemoryVectorStore documentation updates (#24347 ) Thank you for contributing to LangChain! - [x] PR title: "Add documentaiton on InMemoryVectorStore driver for MemoryDB to langchain-aws" - Langchain-aws repo :Add MemoryDB documentation - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: Added documentation on InMemoryVectorStore driver to aws.mdx and usage example on MemoryDB clusuter - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include Add memorydb notebook to docs/docs/integrations/ folde - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-28 15:09:51 -04:00
Chuck Wooters	56c2a7f6d4	partners: add missing key name to Field() for ChatFireworks model (#24721 ) Description: In the `ChatFireworks` class definition, the Field() call for the "stop" ("stop_sequences") parameter is missing the "default" keyword. Issue: Type checker reports "stop_sequences" as a missing arg (not recognizing the default value is None) Dependencies: None Twitter handle: None	2024-07-28 18:40:21 +00:00
AmosDinh	c113682328	community:Add support for specifying document_loaders.firecrawl api url. (#24747 ) community:Add support for specifying document_loaders.firecrawl api url. Add support for specifying document_loaders.firecrawl api url. This is mainly to support the [self-hosting](https://github.com/mendableai/firecrawl/blob/main/SELF_HOST.md) option firecrawl provides. Eg. now I can specify localhost:.... The corresponding firecrawl class already provides functionality to pass the argument. See here: `4c9d62f6d3/apps/python-sdk/firecrawl/firecrawl.py (L29)` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-28 14:30:36 -04:00
Jerron Lim	df37c0d086	partners[ollama]: Support base_url for ChatOllama (#24719 ) Add a class attribute `base_url` for ChatOllama to allow users to choose a different URL to connect to. Fixes #24555	2024-07-28 14:25:58 -04:00
Bagatur	8964f8a710	core: use mypy<1.11 (#24749 ) Bug in mypy 1.11.0 blocking CI, see example: https://github.com/langchain-ai/langchain/actions/runs/10127096903/job/28004492692?pr=24641	2024-07-27 16:37:02 -07:00
Moritz	b81fbc962c	docs: fix typo in DSPy docs (#24748 ) Description: Just a missing "r" in metric Dependencies:N/A	2024-07-27 23:34:39 +00:00
Isaac Francisco	152427eca1	make image inputs compatible with langchain_ollama (#24619 )	2024-07-26 17:39:57 -07:00
William FH	0535d72927	Add type() in error msg (#24723 )	2024-07-26 16:48:45 -07:00
Eugene Yurtsev	9be6b5a20f	core[patch]: Correct doc-string for InMemoryRateLimiter (#24730 ) Correct the documentaiton string.	2024-07-26 22:17:22 +00:00
Erick Friis	d5b4b7e05c	infra: langchain max python 3.11 for resolution (#24729 )	2024-07-26 21:17:11 +00:00
Erick Friis	3c3d3e9579	infra: community max python 3.11 for resolution (#24728 )	2024-07-26 21:10:14 +00:00
Cristi Burcă	174e7d2ab2	langchain: Make OutputFixingParser.from_llm() create a useable retry chain (#24687 ) Description: OutputFixingParser.from_llm() creates a retry chain that returns a Generation instance, when it should actually just return a string. Issue: https://github.com/langchain-ai/langchain/issues/24600 Twitter handle: scribu --------- Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-07-26 13:55:47 -07:00
Bagatur	b3a23ddf93	integration releases (#24725 ) Release anthropic, openai, groq, mistralai, robocorp	2024-07-26 12:30:10 -07:00
Bagatur	315223ce26	core[patch]: Release 0.2.24 (#24722 )	2024-07-26 18:55:32 +00:00
Hayden Wolff	0345990a42	docs: Add NVIDIA NIMs to Model Tab and Feature Table (#24146 ) Description: Add NVIDIA NIMs to Model Tab and LLM Feature Table --------- Co-authored-by: Hayden Wolff <hwolff@nvidia.com> Co-authored-by: Erick Friis <erickfriis@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-26 18:20:52 +00:00
Haijian Wang	cda3025ee1	Integrating the Yi family of models. (#24491 ) Thank you for contributing to LangChain! - [x] PR title: "community:add Yi LLM", "docs:add Yi Documentation" - [x] PR message: *Delete this entire checklist* and replace with - Description: This PR adds support for the Yi model to LangChain. - Dependencies: [langchain_core,requests,contextlib,typing,logging,json,langchain_community] - Twitter handle: 01.AI - [x] Add tests and docs: I've added the corresponding documentation to the relevant paths --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-07-26 10:57:33 -07:00
Bagatur	ad7581751f	core[patch]: ChatPromptTemplate.init same as ChatPromptTemplate.from_… (#24486 )	2024-07-26 10:48:39 -07:00
Marc Gibbons	cc451effd1	community[patch]: langchain_community.vectorstores.azuresearch Raise LangChainException instead of bare Exception (#23935 ) Raise `LangChainException` instead of `Exception`. This alleviates the need for library users to use bare try/except to handle exceptions raised by `AzureSearch`. Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-26 15:59:06 +00:00
Jacob Lee	3d16dcd88d	docs[patch]: Hide deprecated ChatGPT plugins page (#24704 )	2024-07-26 08:24:33 -07:00
Eugene Yurtsev	3a5365a33e	ai21: apply rate limiter in integration tests (#24717 ) Apply rate limiter in integration tests	2024-07-26 11:15:36 -04:00
Eugene Yurtsev	03d62a737a	together: Add rate limiter to integration tests (#24714 ) Rate limit the integration tests to avoid getting 429s.	2024-07-26 10:59:33 -04:00
Eugene Yurtsev	e00cc74926	docs[minor]: Add how to guide for rate limiting a chat model (#24686 ) Add how-to guide for rate limiting a chat model.	2024-07-26 14:29:06 +00:00
Diverrez morgan	c4d2a53f18	community: creation score_threshold in flashrank_rerank.py (#24016 ) Description: add a optional score relevance threshold for select only coherent document, it's in complement of top_n Discussion: add relevance score threshold in flashrank_rerank document compressors #24013 Dependencies: no dependencies --------- Co-authored-by: Benjamin BERNARD <benjamin.bernard@openpathview.fr> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-26 13:34:39 +00:00
Cong Peng	190988d93e	community: Add parameter `allow_dangerous_requests` to `WebResearchRetriever.from_llm` construct (#24712 ) Description: To avoid ValueError when construct the retriever from method `from_llm()`.	2024-07-26 06:24:58 -07:00
monysun	5f593c172a	community: fix dashcope embeddings embed_query func post too much req to api (#24707 ) the fuc of embed_query of dashcope embeddings send a str param, and in the embed_with_retry func will send error content to api	2024-07-26 12:44:07 +00:00
yonarw	b65ac8d39c	community[minor]: Self query retriever for HANA Cloud Vector Engine (#24494 ) Description: - This PR adds a self query retriever implementation for SAP HANA Cloud Vector Engine. The retriever supports all operators except for contains. - Issue: N/A - Dependencies: no new dependencies added Add tests and docs: Added integration tests to: libs/community/tests/unit_tests/query_constructors/test_hanavector.py Documentation for self query retriever: /docs/integrations/retrievers/self_query/hanavector_self_query.ipynb --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-07-26 06:56:51 +00:00
nobbbbby	4f3b4fc7fe	community[patch]: Extend Baichuan model with tool support (#24529 ) Description: Expanded the chat model functionality to support tools in the 'baichuan.py' file. Updated module imports and added tool object handling in message conversions. Additional changes include the implementation of tool binding and related unit tests. The alterations offer enhanced model capabilities by enabling interaction with tool-like objects. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-25 23:20:44 -07:00
Rave Harpaz	ee399e3ec5	community[patch]: Add OCI Generative AI tool and structured output support (#24693 ) - [x] PR title: community: Add OCI Generative AI tool and structured output support - [x] PR message: - Description: adding tool calling and structured output support for chat models offered by OCI Generative AI services. This is an update to our last PR 22880 with changes in /langchain_community/chat_models/oci_generative_ai.py - Issue: NA - Dependencies: NA - Twitter handle: NA - [x] Add tests and docs: 1. we have updated our unit tests 2. we have updated our documentation under /docs/docs/integrations/chat/oci_generative_ai.ipynb - [x] Lint and test: `make format`, `make lint` and `make test` we run successfully --------- Co-authored-by: RHARPAZ <RHARPAZ@RHARPAZ-5750.us.oracle.com> Co-authored-by: Arthur Cheng <arthur.cheng@oracle.com>	2024-07-25 23:19:00 -07:00
Yuki Watanabe	2b6a262f84	community[patch]: Replace `filters` argument to `filter` in DatabricksVectorSearch (#24530 ) The [DatabricksVectorSearch](https://github.com/langchain-ai/langchain/blob/master/libs/community/langchain_community/vectorstores/databricks_vector_search.py#L21) class exposes similarity search APIs with argument `filters`, which is inconsistent with other VS classes who uses `filter` (singular). This PR updates the argument and add alias for backward compatibility. --------- Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>	2024-07-25 21:20:18 -07:00
Leonid Ganeline	148766ddc1	docs: `integrations` missed links (#24681 ) Added missed links; missed provider page	2024-07-25 20:38:25 -07:00
Sunish Sheth	59880a9147	community[patch]: mlflow handle empty chunk(#24689 )	2024-07-25 20:36:29 -07:00
Eugene Yurtsev	20690db482	core[minor]: Add BaseModel.rate_limiter, RateLimiter abstraction and in-memory implementation (#24669 ) This PR proposes to create a rate limiter in the chat model directly, and would replace: https://github.com/langchain-ai/langchain/pull/21992 It resolves most of the constraints that the Runnable rate limiter introduced: 1. It's not annoying to apply the rate limiter to existing code; i.e., possible to roll out the change at the location where the model is instantiated, rather than at every location where the model is used! (Which is necessary if the model is used in different ways in a given application.) 2. batch rate limiting is enforced properly 3. the rate limiter works correctly with streaming 4. the rate limiter is aware of the cache 5. The rate limiter can take into account information about the inputs into the model (we can add optional inputs to it down-the road together with outputs!) The only downside is that information will not be properly reflected in tracing as we don't have any metadata evens about a rate limiter. So the total time spent on a model invocation will be: * time spent waiting for the rate limiter * time spend on the actual model request ## Example ```python from langchain_core.rate_limiters import InMemoryRateLimiter from langchain_groq import ChatGroq groq = ChatGroq(rate_limiter=InMemoryRateLimiter(check_every_n_seconds=1)) groq.invoke('hello') ```	2024-07-26 03:03:34 +00:00
Eugene Yurtsev	c623ae6661	experimental[patch]: Fix import test (#24672 ) Import test was misconfigured, the glob wasn't returning any file paths	2024-07-25 22:14:40 -04:00
Chaunte W. Lacewell	69eacaa887	Community[minor]: Update VDMS vectorstore (#23729 ) Description: - This PR exposes some functions in VDMS vectorstore, updates VDMS related notebooks, updates tests, and upgrade version of VDMS (>=0.0.20) Issue: N/A Dependencies: - Update vdms>=0.0.20	2024-07-25 22:13:04 -04:00
sykp241095	703491e824	docs: update another TiDB Cloud link as it is already public beta (#24694 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-25 18:39:55 -07:00
Nuno Campos	8734cabc09	core: Don't draw None edge labels (#24690 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-25 22:12:39 +00:00
Jacob Lee	ce067c19e9	docs[patch]: Simplify tool calling guide, improve tool calling conceptual guide (#24637 ) Lots of duplicated content from concepts, missing pointers to the second half of the tool calling loop Simpler + more focused + a more prominent link to the second half of the loop was what I was aiming for, but down to be more conservative and just more prominently link the "passing tools back to the model" guide. I have also moved the tool calling conceptual guide out from under `Structured Output` (while leaving a small section for structured output-specific information) and added more content. The existing `#functiontool-calling` link will go to this new section.	2024-07-25 14:39:14 -07:00
Bagatur	4840db6892	docs: standardize groq chat model docs (#24616 ) part of #22296 --------- Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-07-25 14:10:49 -07:00
Isaac Francisco	218c554c4f	[docs]: add doctoring to ChatTogether (#24636 )	2024-07-25 14:10:41 -07:00
Bagatur	0fe29b4343	docs: standardize Together docs (#24617 ) Part of #22296 --------- Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-07-25 14:10:31 -07:00
Isaac Francisco	5c7e589aaf	deprecating ollama_functions (#24632 )	2024-07-25 13:50:04 -07:00
KyrianC	0fdbaf4a8d	community: fix ChatEdenAI + EdenAI Tools (#23715 ) Fixes for Eden AI Custom tools and ChatEdenAI: - add missing import in __init__ of chat_models - add `args_schema` to custom tools. otherwise '__arg1' would sometimes be passed to the `run` method - fix IndexError when no human msg is added in ChatEdenAI	2024-07-25 15:19:14 -04:00
Daniel Campos	871bf5a841	docs: Update snowflake.mdx for arctic-m-v1.5 (#24678 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-25 17:48:54 +00:00
Leonid Ganeline	8b7cffc363	docs: `integrations` missed references (#24631 ) Issue: Several packages are not referenced in the `providers` pages. Fix: Added the missed references. Fixed the notebook formatting.	2024-07-25 13:26:46 -04:00
ccurme	58dd69f7f2	core[patch]: fix mutating tool calls (#24677 ) In some cases tool calls are mutated when passed through a tool.	2024-07-25 16:46:36 +00:00
ccurme	dfbd12b384	mistral[patch]: translate tool call IDs to mistral compatible format (#24668 ) Mistral appears to have added validation for the format of its tool call IDs: `{"object":"error","message":"Tool call id was abc123 but must be a-z, A-Z, 0-9, with a length of 9.","type":"invalid_request_error","param":null,"code":null}` This breaks compatibility of messages from other providers. Here we add a function that converts any string to a Mistral-valid tool call ID, and apply it to incoming messages.	2024-07-25 12:39:32 -04:00
maang-h	38d30e285a	docs: Standardize BaichuanTextEmbeddings docstrings (#24674 ) - Description: Standardize BaichuanTextEmbeddings docstrings. - Issue: the issue #21983	2024-07-25 12:12:00 -04:00
Eugene Yurtsev	89bcca3542	experimental[patch]: Bump core (#24671 )	2024-07-25 09:05:43 -07:00
rick-SOPTIM	cd563fb628	community[minor]: passthrough auth parameter on requests to Ollama-LLMs (#24068 ) Thank you for contributing to LangChain! Description: This PR allows users of `langchain_community.llms.ollama.Ollama` to specify the `auth` parameter, which is then forwarded to all internal calls of `requests.request`. This works in the same way as the existing `headers` parameters. The auth parameter enables the usage of the given class with Ollama instances, which are secured by more complex authentication mechanisms, that do not only rely on static headers. An example are AWS API Gateways secured by the IAM authorizer, which expects signatures dynamically calculated on the specific HTTP request. Issue: Integrating a remote LLM running through Ollama using `langchain_community.llms.ollama.Ollama` only allows setting static HTTP headers with the parameter `headers`. This does not work, if the given instance of Ollama is secured with an authentication mechanism that makes use of dynamically created HTTP headers which for example may depend on the content of a given request. Dependencies: None Twitter handle: None --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-25 15:48:35 +00:00
남광우	256bad3251	core[minor]: Support asynchronous in InMemoryVectorStore (#24472 ) ### Description * support asynchronous in InMemoryVectorStore * since embeddings might be possible to call asynchronously, ensure that both asynchronous and synchronous functions operate correctly.	2024-07-25 11:36:55 -04:00
Luca Dorigo	5fdbdd6bec	community[patch]: Fix invalid iohttp verify parameter (#24655 ) Should fix https://github.com/langchain-ai/langchain/issues/24654	2024-07-25 11:09:21 -04:00
Daniel Glogowski	221486687a	docs: updated CHATNVIDIA notebooks (#24584 ) Updated notebook for tool calling support in chat models	2024-07-25 09:22:53 -04:00
Ken Jenney	d6631919f4	docs: tool calling is enabled in ChatOllama (#24665 ) Description: According to this page: https://python.langchain.com/v0.2/docs/integrations/chat/ollama_functions/ ChatOllama does support Tool Calling. Issue: The documentation is incorrect Dependencies: None Twitter handle: NA	2024-07-25 13:21:30 +00:00
sykp241095	235eb38d3e	docs: update TiDB Cloud links as vector search feature becomes public beta (#24667 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-25 13:20:02 +00:00
Eugene Yurtsev	7dd6b32991	core[minor]: Add InMemoryRateLimiter (#21992 ) This PR introduces the following Runnables: 1. BaseRateLimiter: an abstraction for specifying a time based rate limiter as a Runnable 2. InMemoryRateLimiter: Provides an in-memory implementation of a rate limiter ## Example ```python from langchain_core.runnables import InMemoryRateLimiter, RunnableLambda from datetime import datetime foo = InMemoryRateLimiter(requests_per_second=0.5) def meow(x): print(datetime.now().strftime("%H:%M:%S.%f")) return x chain = foo \| meow for _ in range(10): print(chain.invoke('hello')) ``` Produces: ``` 17:12:07.530151 hello 17:12:09.537932 hello 17:12:11.548375 hello 17:12:13.558383 hello 17:12:15.568348 hello 17:12:17.578171 hello 17:12:19.587508 hello 17:12:21.597877 hello 17:12:23.607707 hello 17:12:25.617978 hello ``` ![image](https://github.com/user-attachments/assets/283af59f-e1e1-408b-8e75-d3910c3c44cc) ## Interface The rate limiter uses the following interface for acquiring a token: ```python class BaseRateLimiter(Runnable[Input, Output], abc.ABC): @abc.abstractmethod def acquire(self, *, blocking: bool = True) -> bool: """Attempt to acquire the necessary tokens for the rate limiter.``` ``` The flag `blocking` has been added to the abstraction to allow supporting streaming (which is easier if blocking=False). ## Limitations - The rate limiter is not designed to work across different processes. It is an in-memory rate limiter, but it is thread safe. - The rate limiter only supports time-based rate limiting. It does not take into account the size of the request or any other factors. - The current implementation does not handle streaming inputs well and will consume all inputs even if the rate limit has been reached. Better support for streaming inputs will be added in the future. - When the rate limiter is combined with another runnable via a RunnableSequence, usage of .batch() or .abatch() will only respect the average rate limit. There will be bursty behavior as .batch() and .abatch() wait for each step to complete before starting the next step. One way to mitigate this is to use batch_as_completed() or abatch_as_completed(). ## Bursty behavior in `batch` and `abatch` When the rate limiter is combined with another runnable via a RunnableSequence, usage of .batch() or .abatch() will only respect the average rate limit. There will be bursty behavior as .batch() and .abatch() wait for each step to complete before starting the next step. This becomes a problem if users are using `batch` and `abatch` with many inputs (e.g., 100). In this case, there will be a burst of 100 inputs into the batch of the rate limited runnable. 1. Using a RunnableBinding The API would look like: ```python from langchain_core.runnables import InMemoryRateLimiter, RunnableLambda rate_limiter = InMemoryRateLimiter(requests_per_second=0.5) def meow(x): return x rate_limited_meow = RunnableLambda(meow).with_rate_limiter(rate_limiter) ``` 2. Another option is to add some init option to RunnableSequence that changes `.batch()` to be depth first (e.g., by delegating to `batch_as_completed`) ```python RunnableSequence(first=rate_limiter, last=model, how='batch-depth-first') ``` Pros: Does not require Runnable Binding Cons: Feels over-complicated	2024-07-25 01:34:03 +00:00
Oleg Kulyk	4b1b7959a2	community[minor]: Add ScrapingAnt Loader Community Integration (#24514 ) Added [ScrapingAnt](https://scrapingant.com/) Web Loader integration. ScrapingAnt is a web scraping API that allows extracting web page data into accessible and well-formatted markdown. Description: Added ScrapingAnt web loader for retrieving web page data as markdown Dependencies: scrapingant-client Twitter: @WeRunTheWorld3 --------- Co-authored-by: Oleg Kulyk <oleg@scrapingant.com>	2024-07-24 21:11:43 -04:00
Jacob Lee	afee851645	docs[patch]: Fix image caption document loader page and typo on custom tools page (#24635 )	2024-07-24 17:16:18 -07:00
Jacob Lee	a73e2222d4	docs[patch]: Updates LLM caching, HF sentence transformers, and DDG pages (#24633 )	2024-07-24 16:58:05 -07:00
Erick Friis	e160b669c8	infra: add unstructured api key to release (#24638 )	2024-07-24 16:47:24 -07:00
John	d59c656ea5	unstructured, community, initialize langchain-unstructured package (#22779 ) #### Update (2): A single `UnstructuredLoader` is added to handle both local and api partitioning. This loader also handles single or multiple documents. #### Changes in `community`: Changes here do not affect users. In the initial process of using the SDK for the API Loaders, the Loaders in community were refactored. Other changes include: The `UnstructuredBaseLoader` has a new check to see if both `mode="paged"` and `chunking_strategy="by_page"`. It also now has `Element.element_id` added to the `Document.metadata`. `UnstructuredAPIFileLoader` and `UnstructuredAPIFileIOLoader`. As such, now both directly inherit from `UnstructuredBaseLoader` and initialize their `file_path`/`file` attributes respectively and implement their own `_post_process_elements` methods. -------- #### Update: New SDK Loaders in a [partner package](https://python.langchain.com/v0.1/docs/contributing/integrations/#partner-package-in-langchain-repo) are introduced to prevent breaking changes for users (see discussion below). ##### TODO: - [x] Test docstring examples -------- - Description: UnstructuredAPIFileIOLoader and UnstructuredAPIFileLoader calls to the unstructured api are now made using the unstructured-client sdk. - New Dependencies: unstructured-client - [x] Add tests and docs: If you're adding a new integration, please include - [x] a test for the integration, preferably unit tests that do not rely on network access, - [x] update the description in `docs/docs/integrations/providers/unstructured.mdx` - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. TODO: - [x] Update https://python.langchain.com/v0.1/docs/integrations/document_loaders/unstructured_file/#unstructured-api - `langchain/docs/docs/integrations/document_loaders/unstructured_file.ipynb` - The description here needs to indicate that users should install `unstructured-client` instead of `unstructured`. Read over closely to look for any other changes that need to be made. - [x] Update the `lazy_load` method in `UnstructuredBaseLoader` to handle json responses from the API instead of just lists of elements. - This method may need to be overwritten by the API loaders instead of changing it in the `UnstructuredBaseLoader`. - [x] Update the documentation links in the class docstrings (the Unstructured documents have moved) - [x] Update Document.metadata to include `element_id` (see thread [here](https://unstructuredw-kbe4326.slack.com/archives/C044N0YV08G/p1718187499818419)) --------- Signed-off-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: ChengZi <chen.zhang@zilliz.com>	2024-07-24 23:21:20 +00:00
Leonid Ganeline	2394807033	docs: fix ChatGooglePalm fix (#24629 ) Issue: now the [ChatGooglePalm](https://python.langchain.com/v0.2/docs/integrations/vectorstores/scann/#retrievalqa-demo) class is not parsed and do not presented in the "API Reference:" line. PR: [Fixed it](https://langchain-7n5k5wkfs-langchain.vercel.app/v0.2/docs/integrations/vectorstores/scann/#retrievalqa-demo) by properly importing.	2024-07-24 18:09:08 -04:00
Joel Akeret	acfce30017	Adding compatibility for OllamaFunctions with ImagePromptTemplate (#24499 ) - [ ] PR title: "experimental: Adding compatibility for OllamaFunctions with ImagePromptTemplate" - [ ] PR message: - Description: Removes the outdated `_convert_messages_to_ollama_messages` method override in the `OllamaFunctions` class to ensure that ollama multimodal models can be invoked with an image. - Issue: #24174 --------- Co-authored-by: Joel Akeret <joel.akeret@ti&m.com> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-07-24 14:57:05 -07:00
Erick Friis	8f3c052db1	cli: release 0.0.26 (#24623 ) - cli: remove snapshot flag from pytest defaults - x - x	2024-07-24 13:13:58 -07:00
ChengZi	29a3b3a711	partners[milvus]: add dynamic field (#24544 ) add dynamic field feature to langchain_milvus more unittest, more robustic plan to deprecate the `metadata_field` in the future, because it's function is the same as `enable_dynamic_field`, but the latter one is a more advanced concept in milvus Signed-off-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-24 20:01:58 +00:00
Erick Friis	20fe4deea0	milvus: release 0.1.3 (#24624 )	2024-07-24 13:01:27 -07:00
Erick Friis	3a55f4bfe9	cli: remove snapshot flag from pytest defaults (#24622 )	2024-07-24 19:41:01 +00:00
Isaac Francisco	fea9ff3831	docs: add tables for search and code interpreter tools (#24586 )	2024-07-24 10:51:39 -07:00
Eugene Yurtsev	b55f6105c6	community[patch]: Add linter to prevent further usage of root_validator and validator (#24613 ) This linter is meant to move development to use __init__ instead of root_validator and validator. We need to investigate whether we need to lint some of the functionality of Field (e.g., `lt` and `gt`, `alias`) `alias` is the one that's most popular: (community) ➜ community git:(eugene/add_linter_to_community) ✗ git grep " Field(" \| grep "alias=" \| wc -l 144 (community) ➜ community git:(eugene/add_linter_to_community) ✗ git grep " Field(" \| grep "ge=" \| wc -l 10 (community) ➜ community git:(eugene/add_linter_to_community) ✗ git grep " Field(" \| grep "gt=" \| wc -l 4	2024-07-24 12:35:21 -04:00
Anush	4585eaef1b	qdrant: Fix vectors_config access (#24606 ) ## Description Fixes #24558 by accessing `vectors_config` after asserting it to be a dict.	2024-07-24 10:54:33 -04:00
ccurme	f337f3ed36	docs: update chain migration guide (#24501 ) - Update `ConversationChain` example to show use without session IDs; - Fix a minor bug (specify history_messages_key).	2024-07-24 10:45:00 -04:00
maang-h	22175738ac	docs: Add MongoDBChatMessageHistory docstrings (#24608 ) - Description: Add MongoDBChatMessageHistory rich docstrings. - Issue: the issue #21983	2024-07-24 10:12:44 -04:00
Anindyadeep	12c3454fd9	[Community] PremAI Tool Calling Functionality (#23931 ) This PR is under WIP and adds the following functionalities: - [X] Supports tool calling across the langchain ecosystem. (However streaming is not supported) - [X] Update documentation	2024-07-24 09:53:58 -04:00
Vishnu Nandakumar	e271965d1e	community: retrievers: added capability for using Product Quantization as one of the retriever. (#22424 ) - [ ] Community: "Retrievers: Product Quantization" - [X] This PR adds Product Quantization feature to the retrievers to the Langchain Community. PQ is one of the fastest retrieval methods if the embeddings are rich enough in context due to the concepts of quantization and representation through centroids - Description: Adding PQ as one of the retrievers - Dependencies: using the package nanopq for this PR - Twitter handle: vishnunkumar_ - [X] Add tests and docs: If you're adding a new integration, please include - [X] Added unit tests for the same in the retrievers. - [] Will add an example notebook subsequently - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ - done the same --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-24 13:52:15 +00:00
stydxm	b9bea36dd4	community: fix typo in warning message (#24597 ) - Description: This PR fixes a small typo in a warning message - Issue: ![](https://github.com/user-attachments/assets/5aa57724-26c5-49f6-8bc1-5a54bb67ed49) There were double `Use` and double `instead`	2024-07-24 13:19:07 +00:00
cüre	da06d4d7af	community: update finetuned model cost for 4o-mini (#24605 ) - Description: adds model price for. reference: https://openai.com/api/pricing/ - Issue: - - Dependencies: - - Twitter handle: cureef	2024-07-24 13:17:26 +00:00
Philippe PRADOS	5f73c836a6	openai[small]: Add the new model: gpt-4o-mini (#24594 )	2024-07-24 09:14:48 -04:00
Mateusz Szewczyk	597be7d501	docs: Update IBM docs about information to pass client into WatsonxLLM and WatsonxEmbeddings object. (#24602 ) Thank you for contributing to LangChain! - [x] PR title: Update IBM docs about information to pass client into WatsonxLLM and WatsonxEmbeddings object. - [x] PR message: - Description: Update IBM docs about information to pass client into WatsonxLLM and WatsonxEmbeddings object. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-07-24 09:12:13 -04:00
Jacob Lee	379803751e	docs[patch]: Remove very old document comparison notebook (#24587 )	2024-07-23 22:25:35 -07:00
ZhangShenao	ad18afc3ec	community[patch]: Fix param spelling error in `ElasticsearchChatMessageHistory` (#24589 ) Fix param spelling error in `ElasticsearchChatMessageHistory`	2024-07-23 19:29:42 -07:00
Isaac Francisco	464a525a5a	[partner]: minor change to embeddings for Ollama (#24521 )	2024-07-24 00:00:13 +00:00
Aayush Kataria	0f45ac4088	LangChain Community: VectorStores: Azure Cosmos DB Filtered Vector Search (#24087 ) Thank you for contributing to LangChain! - This PR adds vector search filtering for Azure Cosmos DB Mongo vCore and NoSQL. - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-23 16:59:23 -07:00
Gareth	ac41c97d21	pinecone: Add embedding Inference Support (#24515 ) Description Add support for Pinecone hosted embedding models as `PineconeEmbeddings`. Replacement for #22890 Dependencies Add `aiohttp` to support async embeddings call against REST directly - [x] Add tests and docs: If you're adding a new integration, please include Added `docs/docs/integrations/text_embedding/pinecone.ipynb` - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Twitter: `gdjdg17` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-23 22:50:28 +00:00
ccurme	aaf788b7cb	docs[patch]: fix chat model tabs in runnable-as-tool guide (#24580 )	2024-07-23 18:36:01 -04:00
Bagatur	47ae06698f	docs: update ChatModelTabs defaults (#24583 )	2024-07-23 21:56:30 +00:00
Erick Friis	03881c6743	docs: fix hf embeddings install (#24577 )	2024-07-23 21:03:30 +00:00
ccurme	2d6b0bf3e3	core[patch]: add to RunnableLambda docstring (#24575 ) Explain behavior when function returns a runnable.	2024-07-23 20:46:44 +00:00
Erick Friis	ee3955c68c	docs: add tool calling for ollama (#24574 )	2024-07-23 20:33:23 +00:00
Carlos André Antunes	325068bb53	community: Fix azure_openai.py (#24572 ) In some lines its trying to read a key that do not exists yet. In this cases I changed the direct access to dict.get() method - [ x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-07-23 16:22:21 -04:00
Bagatur	bff6ca78a2	docs: duplicate how to link (#24569 )	2024-07-23 18:52:05 +00:00
Nik Jmaeff	6878bc39b5	langchain: fix TrajectoryEvalChain.prep_inputs (#19959 ) The previous implementation would never be called. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-23 18:37:39 +00:00
Bagatur	55e66aa40c	langchain[patch]: init_chat_model support ChatBedrockConverse (#24564 )	2024-07-23 11:07:38 -07:00
Bagatur	9b7db08184	experimental[patch]: Release 0.0.63 (#24563 )	2024-07-23 16:28:37 +00:00
Bagatur	8691a5a37f	community[patch]: Release 0.2.10 (#24560 )	2024-07-23 09:24:57 -07:00
Bagatur	4919d5d6df	langchain[patch]: Release 0.2.11 (#24559 )	2024-07-23 09:18:44 -07:00
Bagatur	918e1c8a93	core[patch]: Release 0.2.23 (#24557 )	2024-07-23 09:01:18 -07:00
Lance Martin	58def6e34d	Add tool calling example to Ollama ntbk (#24522 )	2024-07-23 15:58:54 +00:00
Leonid Ganeline	e787532479	langchain: `globals` fix (#21281 ) Issue: functions from `globals`, like the `get_debug` are placed in the init.py file. As a result, they don't listed in the API Reference docs. [See this](https://langchain-9jq1kef7i-langchain.vercel.app/v0.2/docs/how_to/debugging/#set_debugtrue) and [broken this](https://api.python.langchain.com/en/latest/globals/langchain.globals.set_debug.html). Change: moved code from init.py into the `globals.py` file and removed `globals` directory. Similar to: #21266 BTW `globals` in core implemented exactly inside a file not inside a folder.	2024-07-23 11:23:18 -04:00
Ben Chambers	e80b0932ee	community[patch]: small fixes to link extractors (#24528 ) - Description: small fixes to imports / types in the link extraction work --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-23 14:28:06 +00:00
Morteza Hosseini	9e06991aae	community[patch]: Update URL to the 2markdown API (#24546 ) Update the URL to Markdown endpoint. API information is available here: https://2markdown.com/docs#url2md	2024-07-23 14:27:55 +00:00
ZhangShenao	a14e02ab33	core[patch]: Fix word spelling error in `globals.py` (#24532 ) Fix word spelling error in `globals.py` Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-23 14:27:16 +00:00
maang-h	378db2e1a5	docs: Add RedisChatMessageHistory docstrings (#24548 ) - Description: Add `RedisChatMessageHistory ` rich docstrings. - Issue: the issue #21983 Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-23 14:23:46 +00:00
ccurme	a197a8e184	openai[patch]: move test (#24552 ) No-override tests (https://github.com/langchain-ai/langchain/pull/24407) include a condition that integrations not implement additional tests.	2024-07-23 10:22:22 -04:00
Eugene Yurtsev	0bb54ab9f0	CI: Temporarily disable min version checking on pull request (#24551 ) Short term to fix CI	2024-07-23 14:12:08 +00:00
Eugene Yurtsev	f47b4edcc2	standard-test: Fix typo in skipif for chat model integration tests (#24553 )	2024-07-23 10:11:01 -04:00
Jesse Wright	837a3d400b	chore(docs): `SQARQL` -> `SPARQL` typo fix (#24536 ) nit picky typo fix	2024-07-23 13:39:34 +00:00
Eugene Yurtsev	20b72a044c	standard-tests: Add BaseModel variations tests to with_structured_output (#24527 ) After this standard tests will test with the following combinations: 1. pydantic.BaseModel 2. pydantic.v1.BaseModel If ran within a matrix, it'll covert both pydantic.BaseModel originating from pydantic 1 and the one defined in pydantic 2.	2024-07-23 09:01:26 -04:00
Bagatur	70c71efcab	core[patch]: merge_content fix (#24526 )	2024-07-22 22:20:22 -07:00
Ben Chambers	a5a3d28776	community[patch]: Remove targets_table from C* GraphVectorStore (#24502 ) - Description: Remove the unnecessary `targets_table` parameter	2024-07-22 22:09:36 -04:00
Alexander Golodkov	2a70a07aad	community[minor]: added new document loaders based on dedoc library (#24303 ) ### Description This pull request added new document loaders to load documents of various formats using [Dedoc](https://github.com/ispras/dedoc): - `DedocFileLoader` (determine file types automatically and parse) - `DedocPDFLoader` (for `PDF` and images parsing) - `DedocAPIFileLoader` (determine file types automatically and parse using Dedoc API without library installation) [Dedoc](https://dedoc.readthedocs.io) is an open-source library/service that extracts texts, tables, attached files and document structure (e.g., titles, list items, etc.) from files of various formats. The library is actively developed and maintained by a group of developers. `Dedoc` supports `DOCX`, `XLSX`, `PPTX`, `EML`, `HTML`, `PDF`, images and more. Full list of supported formats can be found [here](https://dedoc.readthedocs.io/en/latest/#id1). For `PDF` documents, `Dedoc` allows to determine textual layer correctness and split the document into paragraphs. ### Issue This pull request extends variety of document loaders supported by `langchain_community` allowing users to choose the most suitable option for raw documents parsing. ### Dependencies The PR added a new (optional) dependency `dedoc>=2.2.5` ([library documentation](https://dedoc.readthedocs.io)) to the `extended_testing_deps.txt` ### Twitter handle None ### Add tests and docs 1. Test for the integration: `libs/community/tests/integration_tests/document_loaders/test_dedoc.py` 2. Example notebook: `docs/docs/integrations/document_loaders/dedoc.ipynb` 3. Information about the library: `docs/docs/integrations/providers/dedoc.mdx` ### Lint and test Done locally: - `make format` - `make lint` - `make integration_tests` - `make docs_build` (from the project root) --------- Co-authored-by: Nasty <bogatenkova.anastasiya@mail.ru>	2024-07-23 02:04:53 +00:00
Ben Chambers	5ac936a284	community[minor]: add document transformer for extracting links (#24186 ) - Description: Add a DocumentTransformer for executing one or more `LinkExtractor`s and adding the extracted links to each document. - Issue: n/a - Depedencies: none --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-07-22 22:01:21 -04:00
Jacob Lee	3c4652c906	docs[patch]: Hide OllamaFunctions now that Ollama supports tool calling (#24523 )	2024-07-22 17:56:51 -07:00
Erick Friis	2c6b9e8771	standard-tests: add override check (#24407 )	2024-07-22 23:38:01 +00:00
Nithish Raghunandanan	1639ccfd15	couchbase: [patch] Return chat message history in order (#24498 ) Description: Fixes an issue where the chat message history was not returned in order. Fixed it now by returning based on timestamps. - [x] Add tests and docs: Updated the tests to check the order 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Nithish Raghunandanan <nithishr@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-22 23:30:29 +00:00
C K Ashby	ab036c1a4c	docs: Update .run() to .invoke() (#24520 )	2024-07-22 14:21:33 -07:00
Erick Friis	3dce2e1d35	all: add release notes to pypi (#24519 )	2024-07-22 13:59:13 -07:00
Bagatur	c48e99e7f2	docs: fix sql db note (#24505 )	2024-07-22 13:30:29 -07:00
Bagatur	8a140ee77c	core[patch]: don't serialize BasePromptTemplate.input_types (#24516 ) Candidate fix for #24513	2024-07-22 13:30:16 -07:00
MarkYQJ	df357f82ca	ignore the first turn to apply "history" mechanism (#14118 ) This will generate a meaningless string "system: " for generating condense question; this increases the probability to make an improper condense question and misunderstand user's question. Below is a case - Original Question: Can you explain the arguments of Meilisearch? - Condense Question - What are the benefits of using Meilisearch? (by CodeLlama) - What are the reasons for using Meilisearch? (by GPT-4) The condense questions (not matter from CodeLlam or GPT-4) are different from the original one. By checking the content of each dialogue turn, generating history string only when the dialog content is not empty. Since there is nothing before first turn, the "history" mechanism will be ignored at the very first turn. Doing so, the condense question will be "What are the arguments for using Meilisearch?". <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-22 20:11:17 +00:00
Bagatur	236e957abb	core,groq,openai,mistralai,robocorp,fireworks,anthropic[patch]: Update BaseModel subclass and instance checks to handle both v1 and proper namespaces (#24417 ) After this PR chat models will correctly handle pydantic 2 with bind_tools and with_structured_output. ```python import pydantic print(pydantic.__version__) ``` 2.8.2 ```python from langchain_openai import ChatOpenAI from pydantic import BaseModel, Field class Add(BaseModel): x: int y: int model = ChatOpenAI().bind_tools([Add]) print(model.invoke('2 + 5').tool_calls) model = ChatOpenAI().with_structured_output(Add) print(type(model.invoke('2 + 5'))) ``` ``` [{'name': 'Add', 'args': {'x': 2, 'y': 5}, 'id': 'call_PNUFa4pdfNOYXxIMHc6ps2Do', 'type': 'tool_call'}] <class '__main__.Add'> ``` ```python from langchain_openai import ChatOpenAI from pydantic.v1 import BaseModel, Field class Add(BaseModel): x: int y: int model = ChatOpenAI().bind_tools([Add]) print(model.invoke('2 + 5').tool_calls) model = ChatOpenAI().with_structured_output(Add) print(type(model.invoke('2 + 5'))) ``` ```python [{'name': 'Add', 'args': {'x': 2, 'y': 5}, 'id': 'call_hhiHYP441cp14TtrHKx3Upg0', 'type': 'tool_call'}] <class '__main__.Add'> ``` Addresses issues: https://github.com/langchain-ai/langchain/issues/22782 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-22 20:07:39 +00:00
C K Ashby	199e64d372	Please spell Lex's name correctly Fridman (#24517 ) https://www.youtube.com/watch?v=ZIyB9e_7a4c Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-22 19:38:32 +00:00
Erick Friis	1f01c0fd98	infra: remove core from min version pr testing (#24507 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-22 17:46:15 +00:00
Naka Masato	884f76e05a	fix: load google credentials properly in GoogleDriveLoader (#12871 ) - Description: - Fix #12870: set scope in `default` func (ref: https://google-auth.readthedocs.io/en/master/reference/google.auth.html) - Moved the code to load default credentials to the bottom for clarity of the logic - Add docstring and comment for each credential loading logic - Issue: https://github.com/langchain-ai/langchain/issues/12870 - Dependencies: no dependencies change - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: @gymnstcs <!-- If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-22 17:43:33 +00:00
Erick Friis	a45337ea07	ollama: release 0.1.0 (#24510 )	2024-07-22 10:35:26 -07:00
Isaac Francisco	1318d534af	[docs]: minor react change (#24509 )	2024-07-22 10:25:01 -07:00
Jorge Piedrahita Ortiz	10e3982b59	community: sambanova integration minor changes (#24503 ) - Minor changes in samabanova llm integration - default api - docstrings - minor changes in docs	2024-07-22 17:06:35 +00:00
maang-h	721f709dec	community: Improve QianfanChatEndpoint tool result to model (#24466 ) - Description: `QianfanChatEndpoint` When using tool result to answer questions, the content of the tool is required to be in Dict format. Of course, this can require users to return Dict format when calling the tool, but in order to be consistent with other Chat Models, I think such modifications are necessary.	2024-07-22 11:29:00 -04:00
Chaunte W. Lacewell	02f0a29293	Cookbook: Add Visual RAG example using VDMS (#24353 ) - Description: Adding notebook to demonstrate visual RAG which uses both video scene description generated by open source vision models (ex. video-llama, video-llava etc.) as text embeddings and frames as image embeddings to perform vector similarity search using VDMS. - Issue: N/A - Dependencies: N/A	2024-07-22 11:16:06 -04:00
ccurme	dcba7df2fe	community[patch]: deprecate langchain_community Chroma in favor of langchain_chroma (#24474 )	2024-07-22 11:00:13 -04:00
ccurme	0f7569ddbc	core[patch]: enable RunnableWithMessageHistory without config (#23775 ) Feedback that `RunnableWithMessageHistory` is unwieldy compared to ConversationChain and similar legacy abstractions is common. Legacy chains using memory typically had no explicit notion of threads or separate sessions. To use `RunnableWithMessageHistory`, users are forced to introduce this concept into their code. This possibly felt like unnecessary boilerplate. Here we enable `RunnableWithMessageHistory` to run without a config if the `get_session_history` callable has no arguments. This enables minimal implementations like the following: ```python from langchain_core.chat_history import InMemoryChatMessageHistory from langchain_core.runnables.history import RunnableWithMessageHistory from langchain_openai import ChatOpenAI llm = ChatOpenAI(model="gpt-3.5-turbo-0125") memory = InMemoryChatMessageHistory() chain = RunnableWithMessageHistory(llm, lambda: memory) chain.invoke("Hi I'm Bob") # Hello Bob! chain.invoke("What is my name?") # Your name is Bob. ```	2024-07-22 10:36:53 -04:00
Mohammad Mohtashim	5ade0187d0	[Commutiy]: Prompts Fixed for ZERO_SHOT_REACT React Agent Type in `create_sql_agent` function (#23693 ) - Description: The correct Prompts for ZERO_SHOT_REACT were not being used in the `create_sql_agent` function. They were not using the specific `SQL_PREFIX` and `SQL_SUFFIX` prompts if client does not provide any prompts. This is fixed. - Issue: #23585 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-22 14:04:20 +00:00
ZhangShenao	0f6737cbfe	[Vector Store] Fix function `add_texts` in `TencentVectorDB` (#24469 ) Regardless of whether `embedding_func` is set or not, the 'text' attribute of document should be assigned, otherwise the `page_content` in the document of the final search result will be lost	2024-07-22 09:50:22 -04:00
남광우	7ab82eb8cc	langchain: Copy libs/standard-tests folder when building devcontainer (#24470 ) ### Description * Fix `libs/langchain/dev.Dockerfile` file. copy the `libs/standard-tests` folder when building the devcontainer. * `poetry install --no-interaction --no-ansi --with dev,test,docs` command requires this folder, but it was not copied. ### Reference #### Error message when building the devcontainer from the master branch ``` ... [2024-07-20T14:27:34.779Z] ------ > [langchain langchain-dev-dependencies 7/7] RUN poetry install --no-interaction --no-ansi --with dev,test,docs: 0.409 0.409 Directory ../standard-tests does not exist ------ ... ``` #### After the fix Build success at vscode: <img width="866" alt="image" src="https://github.com/user-attachments/assets/10db1b50-6fcf-4dfe-83e1-d93c96aa2317">	2024-07-22 13:46:38 +00:00
rbrugaro	37b89fb7fc	fix RAG with quantized embeddings notebook (#24422 ) 1. Fix HuggingfacePipeline import error to newer partner package 2. Switch to IPEXModelForCausalLM for performance There are no dependency changes since optimum intel is also needed for QuantizedBiEncoderEmbeddings --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-22 13:44:03 +00:00
Thomas Meike	40c02cedaf	langchain[patch]: add async methods to ConversationSummaryBufferMemory (#20956 ) Added asynchronously callable methods according to the ConversationSummaryBufferMemory API documentation. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-22 09:21:43 -04:00
Steve Sharp	cecd875cdc	docs: Update streaming.ipynb (typo fix) (#24483 ) Description: Fixes typo `Le'ts` -> `Let's`. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-22 11:09:13 +00:00
Sheng Han Lim	0c6a3fdd6b	langchain: Update ContextualCompressionRetriever base_retriever type to RetrieverLike (#24192 ) Description: When initializing retrievers with `configurable_fields` as base retriever, `ContextualCompressionRetriever` validation fails with the following error: ``` ValidationError: 1 validation error for ContextualCompressionRetriever base_retriever Can't instantiate abstract class BaseRetriever with abstract method _get_relevant_documents (type=type_error) ``` Example code: ```python esearch_retriever = VertexAISearchRetriever( project_id=GCP_PROJECT_ID, location_id="global", data_store_id=SEARCH_ENGINE_ID, ).configurable_fields( filter=ConfigurableField(id="vertex_search_filter", name="Vertex Search Filter") ) # rerank documents with Vertex AI Rank API reranker = VertexAIRank( project_id=GCP_PROJECT_ID, location_id=GCP_REGION, ranking_config="default_ranking_config", ) retriever_with_reranker = ContextualCompressionRetriever( base_compressor=reranker, base_retriever=esearch_retriever ) ``` It seems like the issue stems from ContextualCompressionRetriever insisting that base retrievers must be strictly `BaseRetriever` inherited, and doesn't take into account cases where retrievers need to be chained and can have configurable fields defined. `0a1e475a30/libs/langchain/langchain/retrievers/contextual_compression.py (L15-L22)` This PR proposes that the base_retriever type be set to `RetrieverLike`, similar to how `EnsembleRetriever` validates its list of retrievers: `0a1e475a30/libs/langchain/langchain/retrievers/ensemble.py (L58-L75)`	2024-07-21 14:23:19 -04:00
clement.l	d98b830e4b	community: add flag to toggle progress bar (#24463 ) - Description: Add a flag to determine whether to show progress bar - Issue: n/a - Dependencies: n/a - Twitter handle: n/a --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-20 13:18:02 +00:00
chuanbei888	6b08a33fa4	community: fix QianfanChatEndpoint default model (#24464 ) the baidu_qianfan_endpoint has been changed from ERNIE-Bot-turbo to ERNIE-Lite-8K	2024-07-20 13:00:29 +00:00
Nuno Campos	947628311b	core[patch]: Accept configurable keys top-level (#23806 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-07-20 03:49:00 +00:00
Jesus Martinez	c1d1fc13c2	langchain[patch]: Remove multiagent return_direct validation (#24419 ) Description: When you use Agents with multi-input tool and some of these tools have `return_direct=True`, langchain thrown an error related to one validator. This change is implemented on [JS community](https://github.com/langchain-ai/langchainjs/pull/4643) as well Issue: This MR resolves #19843 Dependencies: None Co-authored-by: Jesus Martinez <jesusabraham.martinez@tyson.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-07-20 03:27:43 +00:00
Will Badart	74e3d796f1	core[patch]: ensure `iterator_` in scope for `_atransform_stream_with_config` except (#24454 ) Before, if an exception was raised in the outer `try` block in `Runnable._atransform_stream_with_config` before `iterator_` is assigned, the corresponding `finally` block would blow up with an `UnboundLocalError`: ```txt UnboundLocalError: cannot access local variable 'iterator_' where it is not associated with a value ``` By assigning an initial value to `iterator_` before entering the `try` block, this commit ensures that the `finally` can run, and not bury the "true" exception under a "During handling of the above exception [...]" traceback. Thanks for your consideration!	2024-07-20 03:24:04 +00:00
maang-h	7b28359719	docs: Add ChatSparkLLM docstrings (#24449 ) - Description: - Add `ChatSparkLLM` docstrings, the issue #22296 - To support `stream` method	2024-07-19 20:19:14 -07:00
Eugene Yurtsev	5e48f35fba	core[minor]: Relax constraints on type checking for tools and parsers (#24459 ) This will allow tools and parsers to accept pydantic models from any of the following namespaces: * pydantic.BaseModel with pydantic 1 * pydantic.BaseModel with pydantic 2 * pydantic.v1.BaseModel with pydantic 2	2024-07-19 21:47:34 -04:00
Isaac Francisco	838464de25	ollama: init package (#23615 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-20 00:43:29 +00:00
Erick Friis	f4ee3c8a22	infra: add min version testing to pr test flow (#24358 ) xfailing some sql tests that do not currently work on sqlalchemy v1 #22207 was very much not sqlalchemy v1 compatible. Moving forward, implementations should be compatible with both to pass CI	2024-07-19 22:03:19 +00:00
Erick Friis	50cb0a03bc	docs: advanced feature note (#24456 ) fixes #24430	2024-07-19 20:05:59 +00:00
Bagatur	842065a9cc	community[patch]: Release 0.2.9 (#24453 )	2024-07-19 12:50:22 -07:00
Bagatur	27ad6a4bb3	langchain[patch]: Release 0.2.10 (#24452 )	2024-07-19 12:50:13 -07:00
Bagatur	dda9438e87	community[patch]: gpt-4o-mini costs (#24421 )	2024-07-19 19:02:44 +00:00
Eugene Yurtsev	604dfe2d99	community[patch]: Force opt-in for WebResearchRetriever (CVE-2024-3095) (#24451 ) This PR addresses the issue raised by (CVE-2024-3095) https://huntr.com/bounties/e62d4895-2901-405b-9559-38276b6a5273 Unfortunately, we didn't do a good job writing the initial report. It's pointing at both the wrong package and the wrong code. The affected code is the Web Retriever not the AsyncHTMLLoader, and the WebRetriever lives in langchain-community The vulnerable code lives here: `0bd3f4e129/libs/community/langchain_community/retrievers/web_research.py (L233-L233)` This PR adds a forced opt-in for users to make sure they are aware of the risk and can mitigate by configuring a proxy: `0bd3f4e129/libs/community/langchain_community/retrievers/web_research.py (L84-L84)`	2024-07-19 18:51:35 +00:00
Bagatur	f101c759ed	docs: how to pass runtime secrets (#24450 )	2024-07-19 18:36:28 +00:00
Asi Greenholts	372c27f2e5	community[minor]: [GoogleApiYoutubeLoader] Replace API used in _get_document_for_channel from search to playlistItem (#24034 ) - Description: Search has a limit of 500 results, playlistItems doesn't. Added a class in except clause to catch another common error. - Issue: None - Dependencies: None - Twitter handle: @TupleType --------- Co-authored-by: asi-cider <88270351+asi-cider@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-19 14:04:34 -04:00
Rafael Pereira	6a45bf9554	community[minor]: GraphCypherQAChain to accept additional inputs as provided by the user for cypher generation (#24300 ) Description: This PR introduces a change to the `cypher_generation_chain` to dynamically concatenate inputs. This improvement aims to streamline the input handling process and make the method more flexible. The change involves updating the arguments dictionary with all elements from the `inputs` dictionary, ensuring that all necessary inputs are dynamically appended. This will ensure that any cypher generation template will not require a new `_call` method patch. Issue: This PR fixes issue #24260.	2024-07-19 14:03:14 -04:00
Philippe PRADOS	f5856680fe	community[minor]: add mongodb byte store (#23876 ) The `MongoDBStore` can manage only documents. It's not possible to use MongoDB for an `CacheBackedEmbeddings`. With this new implementation, it's possible to use: ```python CacheBackedEmbeddings.from_bytes_store( underlying_embeddings=embeddings, document_embedding_cache=MongoDBByteStore( connection_string=db_uri, db_name=db_name, collection_name=collection_name, ), ) ``` and use MongoDB to cache the embeddings !	2024-07-19 13:54:12 -04:00
yabooung	07715f815b	community[minor]: Add ability to specify file encoding and json encoding for FileChatMessageHistory (#24258 ) Description: Add UTF-8 encoding support Issue: Inability to properly handle characters from certain languages (e.g., Korean) Fix: Implement UTF-8 encoding in FileChatMessageHistory --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-19 13:53:21 -04:00
Dristy Srivastava	020cc1cf3e	Community[minor]: Added checksum in while send data to pebblo-cloud (#23968 ) - Description: - Updated checksum in doc metadata - Sending checksum and removing actual content, while sending data to `pebblo-cloud` if `classifier-location `is `pebblo-cloud` in `/loader/doc` API - Adding `pb_id` i.e. pebblo id to doc metadata - Refactoring as needed. - Sending `content-checksum` and removing actual content, while sending data to `pebblo-cloud` if `classifier-location `is `pebblo-cloud` in `prmopt` API - Issue: NA - Dependencies: NA - Tests: Updated - Docs NA --------- Co-authored-by: dristy.cd <dristy@clouddefense.io>	2024-07-19 13:52:54 -04:00
Eun Hye Kim	9aae8ef416	core[patch]: Fix utils.json_schema.dereference_refs (#24335 KeyError: 400 in JSON schema processing) (#24337 ) Description: This PR fixes a KeyError: 400 that occurs in the JSON schema processing within the reduce_openapi_spec function. The _retrieve_ref function in json_schema.py was modified to handle missing components gracefully by continuing to the next component if the current one is not found. This ensures that the OpenAPI specification is fully interpreted and the agent executes without errors. Issue: Fixes issue #24335 Dependencies: No additional dependencies are required for this change. Twitter handle: @lunara_x	2024-07-19 13:31:00 -04:00
keval dekivadiya	06f47678ae	community[minor]: Add TextEmbed Embedding Integration (#22946 ) Description: TextEmbed is a high-performance embedding inference server designed to provide a high-throughput, low-latency solution for serving embeddings. It supports various sentence-transformer models and includes the ability to deploy image and text embedding models. TextEmbed offers flexibility and scalability for diverse applications. - PyPI Package: [TextEmbed on PyPI](https://pypi.org/project/textembed/) - Docker Image: [TextEmbed on Docker Hub](https://hub.docker.com/r/kevaldekivadiya/textembed) - GitHub Repository: [TextEmbed on GitHub](https://github.com/kevaldekivadiya2415/textembed) PR Description This PR adds functionality for embedding documents and queries using the `TextEmbedEmbeddings` class. The implementation allows for both synchronous and asynchronous embedding requests to a TextEmbed API endpoint. The class handles batching and permuting of input texts to optimize the embedding process. Example Usage: ```python from langchain_community.embeddings import TextEmbedEmbeddings # Initialise the embeddings class embeddings = TextEmbedEmbeddings(model="your-model-id", api_key="your-api-key", api_url="your_api_url") # Define a list of documents documents = [ "Data science involves extracting insights from data.", "Artificial intelligence is transforming various industries.", "Cloud computing provides scalable computing resources over the internet.", "Big data analytics helps in understanding large datasets.", "India has a diverse cultural heritage." ] # Define a query query = "What is the cultural heritage of India?" # Embed all documents document_embeddings = embeddings.embed_documents(documents) # Embed the query query_embedding = embeddings.embed_query(query) # Print embeddings for each document for i, embedding in enumerate(document_embeddings): print(f"Document {i+1} Embedding:", embedding) # Print the query embedding print("Query Embedding:", query_embedding) --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-07-19 17:30:25 +00:00
Shikanime Deva	9c3da11910	Fix MultiQueryRetriever breaking Embeddings with empty lines (#21093 ) Fix MultiQueryRetriever breaking Embeddings with empty lines ``` [chain/end] [1:chain:ConversationalRetrievalChain > 2:retriever:Retriever > 3:retriever:Retriever > 4:chain:LLMChain] [2.03s] Exiting Chain run with output: [outputs] > /workspaces/Sfeir/sncf/metabot-backend/.venv/lib/python3.11/site-packages/langchain/retrievers/multi_query.py(116)_aget_relevant_documents() -> if self.include_original: (Pdb) queries ['## Alternative questions for "Hello, tell me about phones?":', '', '1. What are the latest trends in smartphone technology? (Focuses on recent advancements)', '2. How has the mobile phone industry evolved over the years? (Historical perspective)', '3. What are the different types of phones available in the market, and which one is best for me? (Categorization and recommendation)'] ``` Example of failure on VertexAIEmbeddings ``` grpc._channel._InactiveRpcError: <_InactiveRpcError of RPC that terminated with: status = StatusCode.INVALID_ARGUMENT details = "The text content is empty." debug_error_string = "UNKNOWN:Error received from peer ipv4:142.250.184.234:443 {created_time:"2024-04-30T09:57:45.625698408+00:00", grpc_status:3, grpc_message:"The text content is empty."}" ``` Fixes: #15959 --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-19 17:13:12 +00:00
John Kelly	5affbada61	langchain: Add `aadd_documents` to `ParentDocumentRetriever` (#23969 ) - Description: Add an async version of `add_documents` to `ParentDocumentRetriever` - Twitter handle: @johnkdev --------- Co-authored-by: John Kelly <j.kelly@mwam.com> Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-19 13:12:39 -04:00
Andrew Benton	f9d64d22e5	community[minor]: Add Riza Python/JS code execution tool (#23995 ) - Description: Add Riza Python/JS code execution tool - Issue: N/A - Dependencies: an optional dependency on the `rizaio` pypi package - Twitter handle: [@rizaio](https://x.com/rizaio) [Riza](https://riza.io) is a safe code execution environment for agent-generated Python and JavaScript that's easy to integrate into langchain apps. This PR adds two new tool classes to the community package.	2024-07-19 17:03:22 +00:00
Ben Chambers	3691701d58	community[minor]: Add keybert-based link extractor (#24311 ) - Description: Add a `KeybertLinkExtractor` for graph vectorstores. This allows extracting links from keywords in a Document and linking nodes that have common keywords. - Issue: None - Dependencies: None. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-19 12:25:07 -04:00
Erick Friis	ef049769f0	core[patch]: Release 0.2.22 (#24423 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-07-19 09:09:24 -07:00
Bagatur	cd19ba9a07	core[patch]: core lint fix (#24447 )	2024-07-19 09:01:22 -07:00
Ben Chambers	83f3d95ffa	community[minor]: GLiNER link extraction (#24314 ) - Description: This allows extracting links between documents with common named entities using [GLiNER](https://github.com/urchade/GLiNER). - Issue: None - Dependencies: None --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-19 15:34:54 +00:00
Anas Khan	b5acb91080	Mask API keys for various LLM/ChatModel Modules (#13885 ) Description: - Added masking of the API Keys for the modules: - `langchain/chat_models/openai.py` - `langchain/llms/openai.py` - `langchain/llms/google_palm.py` - `langchain/chat_models/google_palm.py` - `langchain/llms/edenai.py` - Updated the modules to utilize `SecretStr` from pydantic to securely manage API key. - Added unit/integration tests - `langchain/chat_models/asure_openai.py` used the `open_api_key` that is derived from the `ChatOpenAI` Class and it was assuming `openai_api_key` is a str so we changed it to expect `SecretStr` instead. Issue: https://github.com/langchain-ai/langchain/issues/12165 , Dependencies: none, Tag maintainer: @eyurtsev --------- Co-authored-by: HassanA01 <anikeboss@gmail.com> Co-authored-by: Aneeq Hassan <aneeq.hassan@utoronto.ca> Co-authored-by: kristinspenc <kristinspenc2003@gmail.com> Co-authored-by: faisalt14 <faisalt14@gmail.com> Co-authored-by: Harshil-Patel28 <76663814+Harshil-Patel28@users.noreply.github.com> Co-authored-by: kristinspenc <146893228+kristinspenc@users.noreply.github.com> Co-authored-by: faisalt14 <90787271+faisalt14@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-19 15:23:34 +00:00
ccurme	f99369a54c	community[patch]: fix formatting (#24443 ) Somehow this got through CI: https://github.com/langchain-ai/langchain/pull/24363	2024-07-19 14:38:53 +00:00
Ben Chambers	242b085be7	Merge pull request #24315 * community: Add Hierarchy link extractor * add example * lint	2024-07-19 09:42:26 -04:00
Rhuan Barros	c3308f31bc	Merge pull request #24363 * important email fields	2024-07-19 09:41:20 -04:00
Piotr Romanowski	c50dd79512	docs: Update langchain-openai package version in chat_token_usage_tracking (#24436 ) This PR updates docs to mention correct version of the `langchain-openai` package required to use the `stream_usage` parameter. As it can be noticed in the details of this [merge commit](`722c8f50ea`), that functionality is available only in `langchain-openai >= 0.1.9` while docs state it's available in `langchain-openai >= 0.1.8`.	2024-07-19 13:07:37 +00:00
Han Sol Park	aade9bfde5	Mask API key for ChatOpenAI based chat_models (#14293 ) - Description: Mask API key for ChatOpenAi based chat_models (openai, azureopenai, anyscale, everlyai). Made changes to all chat_models that are based on ChatOpenAI since all of them assumes that openai_api_key is str rather than SecretStr. - Issue:: #12165 - Dependencies: N/A - Tag maintainer: @eyurtsev - Twitter handle: N/A --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-19 02:25:38 +00:00
William FH	0ee6ed76ca	[Evaluation] Pass in seed directly (#24403 ) adding test rn	2024-07-18 19:12:28 -07:00
Nuno Campos	62b6965d2a	core: In ensure_config don't copy dunder configurable keys to metadata (#24420 )	2024-07-18 22:28:52 +00:00
Eugene Yurtsev	ef22ebe431	standard-tests[patch]: Add pytest assert rewrites (#24408 ) This will surface nice error messages in subclasses that fail assertions.	2024-07-18 21:41:11 +00:00
Eugene Yurtsev	f62b323108	core[minor]: Support all versions of pydantic base model in argsschema (#24418 ) This adds support to any pydantic base model for tools. The only potential issue is that `get_input_schema()` will not always return a v1 base model.	2024-07-18 17:14:23 -04:00
Prakul	b2bc15e640	docs: Update mongodb README.md (#24412 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-18 14:02:34 -07:00
Evan Harris	61ea7bf60b	Add a `ListRerank` document compressor (#13311 ) - Description: This PR adds a new document compressor called `ListRerank`. It's derived from `BaseDocumentCompressor`. It's a near exact implementation of introduced by this paper: [Zero-Shot Listwise Document Reranking with a Large Language Model](https://arxiv.org/pdf/2305.02156.pdf) which it finds to outperform pointwise reranking, which is somewhat implemented in LangChain as [LLMChainFilter](https://github.com/langchain-ai/langchain/blob/master/libs/langchain/langchain/retrievers/document_compressors/chain_filter.py). - Issue: None - Dependencies: None - Tag maintainer: @hwchase17 @izzymsft - Twitter handle: @HarrisEMitchell Notes: 1. I didn't add anything to `docs`. I wasn't exactly sure which patterns to follow as [cohere reranker is under Retrievers](https://python.langchain.com/docs/integrations/retrievers/cohere-reranker) with other external document retrieval integrations, but other contextual compression is [here](https://python.langchain.com/docs/modules/data_connection/retrievers/contextual_compression/). Happy to contribute to either with some direction. 2. I followed syntax, docstrings, implementation patterns, etc. as well as I could looking at nearby modules. One thing I didn't do was put the default prompt in a separate `.py` file like [Chain Filter](https://github.com/langchain-ai/langchain/blob/master/libs/langchain/langchain/retrievers/document_compressors/chain_filter_prompt.py) and [Chain Extract](https://github.com/langchain-ai/langchain/blob/master/libs/langchain/langchain/retrievers/document_compressors/chain_extract_prompt.py). Happy to follow that pattern if it would be preferred. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-18 20:34:38 +00:00
Srijan Dubey	4c651ba13a	Adding LangChain v0.2 support for nvidia ai endpoint, langchain-nvidia-ai-endpoints. Removed deprecated classes from nvidia_ai_endpoints.ipynb (#24411 ) Description: added support for LangChain v0.2 for nvidia ai endpoint. Implremented inMemory storage for chains using RunnableWithMessageHistory which is analogous to using `ConversationChain` which was used in v0.1 with the default `ConversationBufferMemory`. This class is deprecated in favor of `RunnableWithMessageHistory` in LangChain v0.2 Issue: None Dependencies: None. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-18 15:59:26 -04:00
Erick Friis	334fc1ed1c	mongodb: release 0.1.7 (#24409 )	2024-07-18 18:13:27 +00:00
ccurme	ba74341eee	docs: update tool calling how-to to pass functions to bind_tools (#24402 )	2024-07-18 08:53:48 -07:00
Harrison Chase	3adf710f1d	docs: improve docs on tools (#24404 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-18 08:52:12 -07:00
Eun Hye Kim	07c5c60f63	community: fix tool appending logic and update planner prompt in OpenAPI agent toolkit (#24384 ) Description: - Updated the format for the 'Action' section in the planner prompt to ensure it must be one of the tools without additional words. Adjusted the phrasing from "should be" to "must be" for clarity and enforceability. - Corrected the tool appending logic in the `_create_api_controller_agent` function to ensure that `RequestsDeleteToolWithParsing` and `RequestsPatchToolWithParsing` are properly added to the tools list for "DELETE" and "PATCH" operations. Issue: #24382 Dependencies: None Twitter handle: @lunara_x --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-18 13:37:46 +00:00
Casey Clements	aade1550c6	docs: Adds MongoDBAtlasVectorSearch to VectorStore list compatible with Indexing API (#24374 ) Adds MongoDBAtlasVectorSearch to list of VectorStores compatible with the Indexing API. (One line change.) As of `langchain-mongodb = "0.1.7"`, the requirements that the VectorStore have both add_documents and delete methods with an ids kwarg is satisfied. #23535 contains the implementation of that, and has been merged.	2024-07-18 09:37:29 -04:00
Chen Xiabin	63c60a31f0	[fix] baidu qianfan AiMessage with usage_metadata (#24389 ) make AIMessage usage_metadata has error	2024-07-18 09:28:16 -04:00
João Dinis Ferreira	242de9aa5e	docs: remove redundant `--quiet` option in `pip install` command (#24397 ) - Description: Removes a redundant option in a `pip install` command in the documentation. - Issue: N/A - Dependencies: N/A	2024-07-18 13:24:42 +00:00
ZhangShenao	916b813107	community[patch]: Fix spelling error in ConversationVectorStoreTokenBufferMemory doc-string (#24385 ) Fix word spelling error in `ConversationVectorStoreTokenBufferMemory`	2024-07-18 12:27:36 +00:00
Rajendra Kadam	1c65529fd7	community[minor]: [PebbloSafeLoader] Rename loader type and add SharePointLoader to supported loaders (#24393 ) Thank you for contributing to LangChain! - [x] PR title: [PebbloSafeLoader] Rename loader type and add SharePointLoader to supported loaders - Description: Minor fixes in the PebbloSafeLoader: - Renamed the loader type from `remote_db` to `cloud_folder`. - Added `SharePointLoader` to the list of loaders supported by PebbloSafeLoader. - Issue: NA - Dependencies: NA - [x] Add tests and docs: NA	2024-07-18 08:23:12 -04:00
Eugene Yurtsev	6182a402f1	experimental[patch]: block a few more things from PALValidator (#24379 ) * Please see security warning already in existing class. * The approach here is fundamentally insecure as it's relying on a block approach rather than an approach based on only running allowed nodes. So users should only use this code if its running from a properly sandboxed environment.	2024-07-18 08:22:45 -04:00
Paolo Ráez	0dec72cab0	Community[patch]: Missing "stream" parameter in cloudflare_workersai (#23987 ) ### Description Missing "stream" parameter. Without it, you'd never receive a stream of tokens when using stream() or astream() ### Issue No existing issue available	2024-07-18 02:09:39 +00:00
Eugene Yurtsev	570566b858	core[patch]: Update API reference for astream events (#24359 ) Update the API reference for astream events to include information about custom events.	2024-07-17 21:48:53 -04:00
Bagatur	f9baaae3ec	docs: clean up tool how to titles (#24373 )	2024-07-17 17:08:31 -07:00
Bagatur	4da1df568a	docs: tools concepts (#24368 )	2024-07-17 17:08:16 -07:00
Erick Friis	96ccba9c27	infra: 15s retry wait on test pypi (#24375 )	2024-07-17 23:41:22 +00:00
Bagatur	a4c101ae97	core[patch]: Release 0.2.21 (#24372 )	2024-07-17 22:44:35 +00:00
William FH	c5a07e2dd8	core[patch]: add InjectedToolArg annotation (#24279 ) ```python from typing_extensions import Annotated from langchain_core.tools import tool, InjectedToolArg from langchain_anthropic import ChatAnthropic @tool def multiply(x: int, y: int, not_for_model: Annotated[dict, InjectedToolArg]) -> str: """multiply.""" return x * y ChatAnthropic(model='claude-3-sonnet-20240229',).bind_tools([multiply]).invoke('5 times 3').tool_calls ''' -> [{'name': 'multiply', 'args': {'x': 5, 'y': 3}, 'id': 'toolu_01Y1QazYWhu4R8vF4hF4z9no', 'type': 'tool_call'}] ''' ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-07-17 15:28:40 -07:00
Erick Friis	80f3d48195	openai: release 0.1.18 (#24369 )	2024-07-17 22:26:33 +00:00
Bagatur	7d83189b19	openai[patch]: use model_name in AzureOpenAI.ls_model_name (#24366 )	2024-07-17 15:24:05 -07:00
Nithish Raghunandanan	eb26b5535a	couchbase: Add chat message history (#24356 ) Description: : Add support for chat message history using Couchbase - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Nithish Raghunandanan <nithishr@users.noreply.github.com>	2024-07-17 15:22:42 -07:00
Eugene Yurtsev	96bac8e20d	core[patch]: Fix regression requiring input_variables in few chat prompt templates (#24360 ) * Fix regression that requires users passing input_variables=[]. * Regression introduced by my own changes to this PR: https://github.com/langchain-ai/langchain/pull/22851	2024-07-17 18:14:57 -04:00
Brice Fotzo	034a8c7c1b	community: support advanced text extraction options for pdf documents (#20265 ) Description: - Updated constructors in PyPDFParser and PyPDFLoader to handle `extraction_mode` and additional kwargs, aligning with the capabilities of `PageObject.extract_text()` from pypdf. - Added `test_pypdf_loader_with_layout` along with a corresponding example text file to validate layout extraction from PDFs. Issue: fixes #19735 Dependencies: This change requires updating the pypdf dependency from version 3.4.0 to at least 4.0.0. Additional changes include the addition of a new test test_pypdf_loader_with_layout and an example text file to ensure the functionality of layout extraction from PDFs aligns with the new capabilities. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-17 20:47:09 +00:00
hmasdev	a402de3dae	langchain[patch]: fix wrong `dict` key in `OutputFixingParser`, `RetryOutputParser` and `RetryWithErrorOutputParser` (#23967 ) # Description This PR aims to solve a bug in `OutputFixingParser`, `RetryOutputParser` and `RetryWithErrorOutputParser` The bug is that the wrong keyword argument was given to `retry_chain`. The correct keyword argument is 'completion', but 'input' is used. This pull request makes the following changes: 1. correct a `dict` key given to `retry_chain`; 2. add a test when using the default prompt. - `NAIVE_FIX_PROMPT` for `OutputFixingParser`; - `NAIVE_RETRY_PROMPT` for `RetryOutputParser`; - `NAIVE_RETRY_WITH_ERROR_PROMPT` for `RetryWithErrorOutputParser`; 3. ~~add comments on `retry_chain` input and output types~~ clarify `InputType` and `OutputType` of `retry_chain` # Issue The bug is pointed out in https://github.com/langchain-ai/langchain/pull/19792#issuecomment-2196512928 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-17 20:34:46 +00:00
Casey Clements	a47f69a120	partners/mongodb : Significant MongoDBVectorSearch ID enhancements (#23535 ) ## Description This pull-request improves the treatment of document IDs in `MongoDBAtlasVectorSearch`. Class method signatures of add_documents, add_texts, delete, and from_texts now include an `ids:Optional[List[str]]` keyword argument permitting the user greater control. Note that, as before, IDs may also be inferred from `Document.metadata['_id']` if present, but this is no longer required, IDs can also optionally be returned from searches. This PR closes the following JIRA issues. * [PYTHON-4446](https://jira.mongodb.org/browse/PYTHON-4446) MongoDBVectorSearch delete / add_texts function rework * [PYTHON-4435](https://jira.mongodb.org/browse/PYTHON-4435) Add support for "Indexing" * [PYTHON-4534](https://jira.mongodb.org/browse/PYTHON-4534) Ensure datetimes are json-serializable --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-17 13:26:20 -07:00
Erick Friis	cc2cbfabfc	milvus: release 0.1.2 (#24365 )	2024-07-17 19:42:44 +00:00
Eugene Yurtsev	9e4a0e76f6	core[patch]: Fix one unit test for chat prompt template (#24362 ) Minor change that fixes a unit test that had missing assertions.	2024-07-17 18:56:48 +00:00
Erick Friis	81639243e2	openai: release 0.1.17 (#24361 )	2024-07-17 18:50:42 +00:00
Erick Friis	61976a4147	pinecone: release 0.1.2 (#24355 )	2024-07-17 17:09:07 +00:00
Bagatur	b5360e2e5f	community[patch]: Release 0.2.8 (#24354 )	2024-07-17 17:07:27 +00:00
ccurme	4cf67084d3	openai[patch]: fix key collision and _astream (#24345 ) Fixes small issues introduced in https://github.com/langchain-ai/langchain/pull/24150 (unreleased).	2024-07-17 12:59:26 -04:00
Luis Moros	bcb5f354ad	community: Fix SQLDatabse.from_databricks issue when ran from Job (#24346 ) - Description: When SQLDatabase.from_databricks is ran from a Databricks Workflow job, line 205 (default_host = context.browserHostName) throws an ``AttributeError`` as the ``context`` object has no ``browserHostName`` attribute. The fix handles the exception and sets the ``default_host`` variable to null --------- Co-authored-by: lmorosdb <lmorosdb> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-07-17 12:40:12 -04:00
Bagatur	24e9b48d15	langchain[patch]: Release 0.2.9 (#24327 )	2024-07-17 09:39:57 -07:00
Rafael Pereira	cf28708e7b	Neo4j: Update with non-deprecated cypher methods, and new method to associate relationship embeddings (#23725 ) Description: At the moment neo4j wrapper is using setVectorProperty, which is deprecated ([link](https://neo4j.com/docs/operations-manual/5/reference/procedures/#procedure_db_create_setVectorProperty)). I replaced with the non-deprecated version. Neo4j recently introduced a new cypher method to associate embeddings into relations using "setRelationshipVectorProperty" method. In this PR I also implemented a new method to perform this association maintaining the same format used in the "add_embeddings" method which is used to associate embeddings into Nodes. I also included a test case for this new method.	2024-07-17 12:37:47 -04:00
maang-h	2a3288b15d	docs: Add ChatBaichuan docstrings (#24348 ) - Description: Add ChatBaichuan rich docstrings. - Issue: the issue #22296	2024-07-17 12:00:16 -04:00
Srijan Dubey	1792684e8f	removed deprecated classes from pipelineai.ipynb, added support for LangChain v0.2 for PipelineAI integration (#24333 ) Description: added support for LangChain v0.2 for PipelineAI integration. Removed deprecated classes and incorporated support for LangChain v0.2 to integrate with PipelineAI. Removed LLMChain and replaced it with Runnable interface. Also added StrOutputParser, that parses LLMResult into the top likely string. Issue: None Dependencies: None. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-17 13:48:32 +00:00
Tobias Sette	e60ad12521	docs(infobip.ipynb): fix typo (#24328 )	2024-07-17 13:33:34 +00:00
Rafael Pereira	fc41730e28	neo4j: Fix test for order-insensitive comparison and floating-point precision issues (#24338 ) Description: This PR addresses two main issues in the `test_neo4jvector.py`: 1. Order-insensitive Comparison: Modified the `test_retrieval_dictionary` to ensure that it passes regardless of the order of returned values by parsing `page_content` into a structured format (dictionary) before comparison. 2. Floating-point Precision: Updated `test_neo4jvector_relevance_score` to handle minor floating-point precision differences by using the `isclose` function for comparing relevance scores with a relative tolerance. Errors addressed: - test_neo4jvector_relevance_score: ``` AssertionError: assert [(Document(page_content='foo', metadata={'page': '0'}), 1.0000014305114746), (Document(page_content='bar', metadata={'page': '1'}), 0.9998371005058289), (Document(page_content='baz', metadata={'page': '2'}), 0.9993508458137512)] == [(Document(page_content='foo', metadata={'page': '0'}), 1.0), (Document(page_content='bar', metadata={'page': '1'}), 0.9998376369476318), (Document(page_content='baz', metadata={'page': '2'}), 0.9993523359298706)] At index 0 diff: (Document(page_content='foo', metadata={'page': '0'}), 1.0000014305114746) != (Document(page_content='foo', metadata={'page': '0'}), 1.0) Full diff: - [(Document(page_content='foo', metadata={'page': '0'}), 1.0), + [(Document(page_content='foo', metadata={'page': '0'}), 1.0000014305114746), ? +++++++++++++++ - (Document(page_content='bar', metadata={'page': '1'}), 0.9998376369476318), ? ^^^ ------ + (Document(page_content='bar', metadata={'page': '1'}), 0.9998371005058289), ? ^^^^^^^^^ - (Document(page_content='baz', metadata={'page': '2'}), 0.9993523359298706), ? ---------- + (Document(page_content='baz', metadata={'page': '2'}), 0.9993508458137512), ? ++++++++++ ] ``` - test_retrieval_dictionary: ``` AssertionError: assert [Document(page_content='skills:\n- Python\n- Data Analysis\n- Machine Learning\nname: John\nage: 30\n')] == [Document(page_content='skills:\n- Python\n- Data Analysis\n- Machine Learning\nage: 30\nname: John\n')] At index 0 diff: Document(page_content='skills:\n- Python\n- Data Analysis\n- Machine Learning\nname: John\nage: 30\n') != Document(page_content='skills:\n- Python\n- Data Analysis\n- Machine Learning\nage: 30\nname: John\n') Full diff: - [Document(page_content='skills:\n- Python\n- Data Analysis\n- Machine Learning\nage: 30\nname: John\n')] ? --------- + [Document(page_content='skills:\n- Python\n- Data Analysis\n- Machine Learning\nage: John\nage: 30\n')] ? +++++++++ ```	2024-07-17 09:28:25 -04:00
Erick Friis	47ed7f766a	infra: fix release prerelease deps bug (#24323 )	2024-07-16 15:13:41 -07:00
Bagatur	80e7cd6cff	core[patch]: Release 0.2.20 (#24322 )	2024-07-16 15:04:36 -07:00
Erick Friis	6c3e65a878	infra: prerelease dep checking on release (#23269 )	2024-07-16 21:48:15 +00:00
Eugene Yurtsev	616196c620	Docs: Add how to dispatch custom callback events (#24278 ) * Add how-to guide for dispatching custom callback events. * Add links from index to the how to guide * Add link from streaming from within a tool * Update versionadded to correct release https://github.com/langchain-ai/langchain/releases/tag/langchain-core%3D%3D0.2.15	2024-07-16 17:38:32 -04:00
Erick Friis	dd7938ace8	docs: readthedocs deprecation fix (#24321 ) https://about.readthedocs.com/blog/2024/07/addons-by-default/#how-does-it-affect-my-projects we use build.command so we're already using addons, so I think this is it	2024-07-16 20:32:51 +00:00
Srijan Dubey	ef07308c30	Upgraded shaleprotocol to use langchain v0.2 removed deprecated classes (#24320 ) Description: Added support for langchain v0.2 for shale protocol. Replaced LLMChain with Runnable interface which allows any two Runnables to be 'chained' together into sequences. Also added StreamingStdOutCallbackHandler. Callback handler for streaming. Issue: None Dependencies: None.	2024-07-16 20:07:36 +00:00
pbharti0831	049bc37111	Cookbook for applying RAG locally using open source models and tools on CPU (#24284 ) This cookbook guides user to implement RAG locally on CPU using langchain tools and open source models. It enables Llama2 model to answer queries about Intel Q1 2024 earning release using RAG pipeline. Main libraries are langchain, llama-cpp-python and gpt4all. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Sriragavi <sriragavi.r@intel.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-16 15:17:10 -04:00
Leonid Ganeline	5ccf8ebfac	core: docstrings `vectorstores` update (#24281 ) Added missed docstrings. Formatted docstrings to the consistent form. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-16 16:58:11 +00:00
Erick Friis	1e9cc02ed8	openai: raw response headers (#24150 )	2024-07-16 09:54:54 -07:00
Bagatur	dc42279eb5	core[patch]: fix Typing.cast import (#24313 ) Fixes #24287	2024-07-16 16:53:48 +00:00
Anush	e38bf08139	qdrant: Fixed typos in Qdrant vectorstore docs (#24312 ) ## Description As that title goes.	2024-07-16 09:44:07 -07:00
bovlb	5caa381177	community[minor]: Add ApertureDB as a vectorstore (#24088 ) Thank you for contributing to LangChain! - [X] ApertureDB as vectorstore: "community: Add ApertureDB as a vectorestore" - Description:* this change provides a new community integration that uses ApertureData's ApertureDB as a vector store. - Issue: none - Dependencies: depends on ApertureDB Python SDK - Twitter handle: ApertureData - [X] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. Integration tests rely on a local run of a public docker image. Example notebook additionally relies on a local Ollama server. - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ All lint tests pass. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Gautam <gautam@aperturedata.io>	2024-07-16 09:32:59 -07:00
frob	c59e663365	community[patch]: Fix docstring for ollama parameter "keep_alive" (#23973 ) Fix doc-string for ollama integration	2024-07-16 14:48:38 +00:00
Mazen Ramadan	0c1889c713	docs: fix parameter typo in scrapfly loader docs (#24307 ) Fixed wrong parameter typo in [ScrapflyLoader](https://github.com/langchain-ai/langchain/blob/master/libs/community/langchain_community/document_loaders/scrapfly.py) docs, where `ignore_scrape_failures` is used instead of `continue_on_failure`. - Description: Fix wrong param typo in ScrapflyLoader docs.	2024-07-16 14:48:13 +00:00
Leonid Ganeline	5fcf2ef7ca	core: docstrings `documents` (#23506 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-07-16 10:43:54 -04:00
Rafael Pereira	77dd327282	Docs: Fix Concepts Integration Tools Link (#24301 ) - Description: This PR fix concepts integrations tools link. - Issue: Fixes issue #24112 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-16 10:29:30 -04:00
Rahul Raghavendra Choudhury	f5a38772a8	community[patch]: Update TavilySearch to use TavilyClient instead of the deprecated Client (#24270 ) On using TavilySearchAPIRetriever with any conversation chain getting error : `TypeError: Client.__init__() got an unexpected keyword argument 'api_key'` It is because the retreiver class is using the depreciated `Client` class, `TavilyClient` need to be used instead. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-07-16 13:35:28 +00:00
Shenhai Ran	5f2dea2b20	core[patch]: Add encoding options when create prompt template from a file (#24054 ) - Uses default utf-8 encoding for loading prompt templates from file --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-16 09:35:09 -04:00
Chen Xiabin	69b1603173	baidu qianfan AiMessage with usage_metadata (#24288 ) add usage_metadata to qianfan AIMessage. Thanks	2024-07-16 09:30:50 -04:00
amcastror	d83164f837	Update retrievers.ipynb (#24289 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-16 13:30:41 +00:00
Leonid Ganeline	198b85334f	core[patch]: docstrings `langchain_core/` files update (#24285 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-07-16 09:21:51 -04:00
Dobiichi-Origami	7aeaa1974d	community[patch]: change the class of `qianfan_ak` and `qianfan_sk` parameters (#24293 ) - Description: we changed the class of two parameters to fix a bug, which causes validation failure when using QianfanEmbeddingEndpoint	2024-07-16 09:17:48 -04:00
Tibor Reiss	1c753d1e81	core[patch]: Update typing for template format to include jinja2 as a Literal (#24144 ) Fixes #23929 via adjusting the typing	2024-07-16 09:09:42 -04:00
Jacob Lee	6716379f0c	docs[patch]: Fix rendering issue in code splitter page (#24291 )	2024-07-15 23:08:21 -07:00
Jacob Lee	58fdb070fa	docs[patch]: Update intro diagram (#24290 ) CC @agola11	2024-07-15 22:04:42 -07:00
Erick Friis	1d7a3ae7ce	infra: add test deps to add_dependents (#24283 )	2024-07-15 15:48:53 -07:00
Erick Friis	d2f671271e	langchain: fix extended test (#24282 )	2024-07-15 15:29:48 -07:00
Lage Ragnarsson	a3c10fc6ce	community: Add support for specifying hybrid search for Databricks vector search (#23528 ) Description: Databricks Vector Search recently added support for hybrid keyword-similarity search. See [usage examples](https://docs.databricks.com/en/generative-ai/create-query-vector-search.html#query-a-vector-search-endpoint) from their documentation. This PR updates the Langchain vectorstore interface for Databricks to enable the user to pass the query_type parameter to similarity_search to make use of this functionality. By default, there will not be any changes for existing users of this interface. To use the new hybrid search feature, it is now possible to do ```python # ... dvs = DatabricksVectorSearch(index) dvs.similarity_search("my search query", query_type="HYBRID") ``` Or using the retriever: ```python retriever = dvs.as_retriever( search_kwargs={ "query_type": "HYBRID", } ) retriever.invoke("my search query") ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-15 22:14:08 +00:00
Christopher Tee	5171ffc026	community(you): Integrate You.com conversational APIs (#23046 ) You.com is releasing two new conversational APIs — Smart and Research. This PR: - integrates those APIs with Langchain, as an LLM - streaming is supported If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-15 17:46:58 -04:00
maang-h	6c7d9f93b9	feat: Add ChatTongyi structured output (#24187 ) - Description: Add `with_structured_output` method to ChatTongyi to support structured output.	2024-07-15 15:57:21 -04:00
Chen Xiabin	8f4620f4b8	baidu qianfan streaming token_usage (#24117 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-15 19:52:31 +00:00
maang-h	9d97de34ae	community[patch]: Improve ChatBaichuan init args and role (#23878 ) - Description: Improve ChatBaichuan init args and role - ChatBaichuan adds `system` role - alias: `baichuan_api_base` -> `base_url` - `with_search_enhance` is deprecated - Add `max_tokens` argument	2024-07-15 15:17:00 -04:00
Erick Friis	56cca23745	openai: remove some params from default serialization (#24280 )	2024-07-15 18:53:36 +00:00
mrugank-wadekar	66bebeb76a	partners: add similarity search by image functionality to langchain_chroma partner package (#22982 ) - Description: This pull request introduces two new methods to the Langchain Chroma partner package that enable similarity search based on image embeddings. These methods enhance the package's functionality by allowing users to search for images similar to a given image URI. Also introduces a notebook to demonstrate it's use. - Issue: N/A - Dependencies: None - Twitter handle: @mrugank9009 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-15 18:48:22 +00:00
pm390	b0aa915dea	community[patch]: use asyncio.sleep instead of sleep in OpenAI Assistant async (#24275 ) Description: Implemented async sleep using asyncio instead of synchronous sleep in openAI Assistants Issue: 24194 Dependencies: asyncio Twitter handle: pietromald60939	2024-07-15 18:14:39 +00:00
Anush	d93ae756e6	qdrant: Documentation for the new QdrantVectorStore class (#24166 ) ## Description Follow up on #24165. Adds a page to document the latest usage of the new `QdrantVectorStore` class.	2024-07-15 10:39:23 -07:00
Erick Friis	1244e66bd4	docs: remove couchbase from docs linking (#24277 ) `pip install couchbase` adds 12 minutes to the docs build...	2024-07-15 17:34:41 +00:00
wenngong	a001037319	retrievers: MultiVectorRetriever similarity_score_threshold search type (#23539 ) Description: support MultiVectorRetriever similarity_score_threshold search type. Issue: #23387 #19404 --------- Co-authored-by: gongwn1 <gongwn1@lenovo.com>	2024-07-15 13:31:34 -04:00
Carlos André Antunes	20151384d7	fix azure_openai.py: some keys do not exists (#24158 ) In some lines its trying to read a key that do not exists yet. In this cases I changed the direct access to dict.get() method Thank you for contributing to LangChain! - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-07-15 17:17:05 +00:00
blueoom	d895614d19	text_splitters: add request parameters for function HTMLHeaderTextSplitter.split_text… (#24178 ) Description: The `split_text_from_url` method of `HTMLHeaderTextSplitter` does not include parameters like `timeout` when using `requests` to send a request. Therefore, I suggest adding a `kwargs` parameter to the function, which can be passed as arguments to `requests.get()` internally, allowing control over the `get` request. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-15 16:43:56 +00:00
Bagatur	9d0c1d2dc9	docs: specify init_chat_model version (#24274 )	2024-07-15 16:29:06 +00:00
MoraxMa	a7296bddc2	docs: updated Tongyi package (#24259 ) * updated pip install package	2024-07-15 16:25:35 +00:00
Bagatur	c9473367b1	langchain[patch]: Release 0.2.8 (#24273 )	2024-07-15 16:05:51 +00:00
JP-Ellis	f77659463a	core[patch]: allow message utils to work with lcel (#23743 ) The functions `convert_to_messages` has had an expansion of the arguments it can take: 1. Previously, it only could take a `Sequence` in order to iterate over it. This has been broadened slightly to an `Iterable` (which should have no other impact). 2. Support for `PromptValue` and `BaseChatPromptTemplate` has been added. These are generated when combining messages using the overloaded `+` operator. Functions which rely on `convert_to_messages` (namely `filter_messages`, `merge_message_runs` and `trim_messages`) have had the type of their arguments similarly expanded. Resolves #23706. <!-- If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --> --------- Signed-off-by: JP-Ellis <josh@jpellis.me> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-07-15 08:58:05 -07:00
Harold Martin	ccdaf14eff	docs: Spell check fixes (#24217 ) Description: Spell check fixes for docs, comments, and a couple of strings. No code change e.g. variable names. Issue: none Dependencies: none Twitter handle: hmartin	2024-07-15 15:51:43 +00:00
Leonid Ganeline	cacdf96f9c	core docstrings `tracers` update (#24211 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-07-15 11:37:09 -04:00
Leonid Ganeline	36ee083753	core: docstrings `utils` update (#24213 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-07-15 11:36:00 -04:00
thehunmonkgroup	e8a21146d3	community[patch]: upgrade default model for ChatAnyscale (#24232 ) Old default `meta-llama/Llama-2-7b-chat-hf` no longer supported.	2024-07-15 11:34:59 -04:00
Bagatur	a0958c0607	docs: more tool call -> tool message docs (#24271 )	2024-07-15 07:55:07 -07:00
Bagatur	620b118c70	core[patch]: Release 0.2.19 (#24272 )	2024-07-15 07:51:30 -07:00
ccurme	888fbc07b5	core[patch]: support passing `args_schema` through `as_tool` (#24269 ) Note: this allows the schema to be passed in positionally. ```python from langchain_core.pydantic_v1 import BaseModel, Field from langchain_core.runnables import RunnableLambda class Add(BaseModel): """Add two integers together.""" a: int = Field(..., description="First integer") b: int = Field(..., description="Second integer") def add(input: dict) -> int: return input["a"] + input["b"] runnable = RunnableLambda(add) as_tool = runnable.as_tool(Add) as_tool.args_schema.schema() ``` ``` {'title': 'Add', 'description': 'Add two integers together.', 'type': 'object', 'properties': {'a': {'title': 'A', 'description': 'First integer', 'type': 'integer'}, 'b': {'title': 'B', 'description': 'Second integer', 'type': 'integer'}}, 'required': ['a', 'b']} ```	2024-07-15 07:51:05 -07:00
ccurme	ab2d7821a7	fireworks[patch]: use firefunction-v2 in standard tests (#24264 )	2024-07-15 13:15:08 +00:00
ccurme	6fc7610b1c	standard-tests[patch]: update test_bind_runnables_as_tools (#24241 ) Reduce number of tool arguments from two to one.	2024-07-15 08:35:07 -04:00
Bagatur	0da5078cad	langchain[minor]: Generic configurable model (#23419 ) alternative to [23244](https://github.com/langchain-ai/langchain/pull/23244). allows you to use chat model declarative methods ![Screenshot 2024-06-25 at 1 07 10 PM](https://github.com/langchain-ai/langchain/assets/22008038/910d1694-9b7b-46bc-bc2e-3792df9321d6)	2024-07-15 01:11:01 +00:00
Bagatur	d0728b0ba0	core[patch]: add tool name to tool message (#24243 ) Copying current ToolNode behavior	2024-07-15 00:42:40 +00:00
Bagatur	9224027e45	docs: tool artifacts how to (#24198 )	2024-07-14 17:04:47 -07:00
Bagatur	5c3e2612da	core[patch]: Release 0.2.18 (#24230 )	2024-07-13 09:14:43 -07:00
Bagatur	65321bf975	core[patch]: fix ToolCall "type" when streaming (#24218 )	2024-07-13 08:59:03 -07:00
Jacob Lee	2b7d1cdd2f	docs[patch]: Update tool child run docs (#24160 ) Documents #24143	2024-07-13 07:52:37 -07:00
Anush	a653b209ba	qdrant: test new QdrantVectorStore (#24165 ) ## Description This PR adds integration tests to follow up on #24164. By default, the tests use an in-memory instance. To run the full suite of tests, with both in-memory and Qdrant server: ``` $ docker run -p 6333:6333 qdrant/qdrant $ make test $ make integration_test ``` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-12 23:59:30 +00:00
Roman Solomatin	f071581aea	openai[patch]: update openai params (#23691 ) Description: Explicitly add parameters from openai API - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-12 16:53:33 -07:00
Leonid Ganeline	f0a7581b50	milvus: docstring (#23151 ) Added missed docstrings. Format docstrings to the consistent format (used in the API Reference) --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-12 23:25:31 +00:00
Christian D. Glissov	474b88326f	langchain_qdrant: Added method "_asimilarity_search_with_relevance_scores" to Qdrant class (#23954 ) I stumbled upon a bug that led to different similarity scores between the async and sync similarity searches with relevance scores in Qdrant. The reason being is that _asimilarity_search_with_relevance_scores is missing, this makes langchain_qdrant use the method of the vectorstore baseclass leading to drastically different results. To illustrate the magnitude here are the results running an identical search in a test vectorstore. Output of asimilarity_search_with_relevance_scores: [0.9902903374601824, 0.9472135924938804, 0.8535534011299859] Output of similarity_search_with_relevance_scores: [0.9805806749203648, 0.8944271849877607, 0.7071068022599718] Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-12 23:25:20 +00:00
Bagatur	bdc03997c9	standard-tests[patch]: check for ToolCall["type"] (#24209 )	2024-07-12 16:17:34 -07:00
Nada Amin	3f1cf00d97	docs: Improve neo4j semantic templates (#23939 ) I made some changes based on the issues I stumbled on while following the README of neo4j-semantic-ollama. I made the changes to the ollama variant, and can also port the relevant ones to the layer variant once this is approved. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-12 23:08:25 +00:00
Nada Amin	6b47c7361e	docs: fix code usage to use the ollama variant (#23937 ) Description: the template neo4j-semantic-ollama uses an import from the neo4j-semantic-layer template instead of its own. Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-12 23:07:42 +00:00
Anirudh31415926535	7677ceea60	docs: model parameter mandatory for cohere embedding and rerank (#23349 ) Latest langchain-cohere sdk mandates passing in the model parameter into the Embeddings and Reranker inits. This PR is to update the docs to reflect these changes.	2024-07-12 23:07:28 +00:00
Miroslav	aee55eda39	community: Skip Login to HuggubgFaceHub when token is not set (#21561 ) Thank you for contributing to LangChain! - [ ] HuggingFaceEndpoint: "Skip Login to HuggingFaceHub" - Where: langchain, community, llm, huggingface_endpoint - [ ] PR message: *Delete this entire checklist* and replace with - Description: Skip login to huggingface hub when when `huggingfacehub_api_token` is not set. This is needed when using custom `endpoint_url` outside of HuggingFaceHub. - Issue: the issue # it fixes https://github.com/langchain-ai/langchain/issues/20342 and https://github.com/langchain-ai/langchain/issues/19685 - Dependencies: None - [ ] Add tests and docs: 1. Tested with locally available TGI endpoint 2. Example Usage ```python from langchain_community.llms import HuggingFaceEndpoint llm = HuggingFaceEndpoint( endpoint_url='http://localhost:8080', server_kwargs={ "headers": {"Content-Type": "application/json"} } ) resp = llm.invoke("Tell me a joke") print(resp) ``` Also tested against HF Endpoints ```python from langchain_community.llms import HuggingFaceEndpoint huggingfacehub_api_token = "hf_xyz" repo_id = "mistralai/Mistral-7B-Instruct-v0.2" llm = HuggingFaceEndpoint( huggingfacehub_api_token=huggingfacehub_api_token, repo_id=repo_id, ) resp = llm.invoke("Tell me a joke") print(resp) ``` Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-12 22:10:32 +00:00
Anush	d09dda5a08	qdrant: Bump patch version (#24168 ) # Description To release a new version of `langchain-qdrant` after #24165 and #24166.	2024-07-12 14:48:50 -07:00
Bagatur	12950cc602	standard-tests[patch]: improve runnable tool description (#24210 )	2024-07-12 21:33:56 +00:00
Erick Friis	e8ee781a42	ibm: move to external repo (#24208 )	2024-07-12 21:14:24 +00:00
Bagatur	02e71cebed	together[patch]: Release 0.1.4 (#24205 )	2024-07-12 13:59:58 -07:00
Bagatur	259d4d2029	anthropic[patch]: Release 0.1.20 (#24204 )	2024-07-12 13:59:15 -07:00
Bagatur	3aed74a6fc	fireworks[patch]: Release 0.1.5 (#24203 )	2024-07-12 13:58:58 -07:00
Bagatur	13b0d7ec8f	openai[patch]: Release 0.1.16 (#24202 )	2024-07-12 13:58:39 -07:00
Bagatur	71cd6e6feb	groq[patch]: Release 0.1.7 (#24201 )	2024-07-12 13:58:19 -07:00
Bagatur	99054e19eb	mistralai[patch]: Release 0.1.10 (#24200 )	2024-07-12 13:57:58 -07:00
Bagatur	7a1321e2f9	ibm[patch]: Release 0.1.10 (#24199 )	2024-07-12 13:57:38 -07:00
Bagatur	cb5031f22f	integrations[patch]: require core >=0.2.17 (#24207 )	2024-07-12 20:54:01 +00:00
Nithish Raghunandanan	f1618ec540	couchbase: Add standard and semantic caches (#23607 ) Thank you for contributing to LangChain! Description: Add support for caching (standard + semantic) LLM responses using Couchbase - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Nithish Raghunandanan <nithishr@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-12 20:30:03 +00:00
Eugene Yurtsev	8d82a0d483	core[patch]: Mark GraphVectorStore as beta (#24195 ) * This PR marks graph vectorstore as beta	2024-07-12 14:28:06 -04:00
Bagatur	0a1e475a30	core[patch]: Release 0.2.17 (#24189 )	2024-07-12 17:08:29 +00:00
Bagatur	6166ea67a8	core[minor]: rename ToolMessage.raw_output -> artifact (#24185 )	2024-07-12 09:52:44 -07:00
Jean Nshuti	d77d9bfc00	community[patch]: update typo document content returned from semanticscholar (#24175 ) Update "astract" -> abstract	2024-07-12 15:40:47 +00:00
Leonid Ganeline	aa3e3cfa40	core[patch]: docstrings `runnables` update (#24161 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-07-12 11:27:06 -04:00
mumu	14ba1d4b45	docs: fix numeric errors in tools_chain.ipynb (#24169 ) Description: Corrected several numeric errors in the docs/docs/how_to/tools_chain.ipynb file to ensure the accuracy of the documentation.	2024-07-12 11:26:26 -04:00
Ikko Eltociear Ashimine	18da9f5e59	docs: update custom_chat_model.ipynb (#24170 ) characetrs -> characters	2024-07-12 06:48:22 -04:00
Tomaz Bratanic	d3a2b9fae0	Fix neo4j type error on missing constraint information (#24177 ) If you use `refresh_schema=False`, then the metadata constraint doesn't exist. ATM, we used default `None` in the constraint check, but then `any` fails because it can't iterate over None value	2024-07-12 06:39:29 -04:00
Anush	7014d07cab	qdrant: new Qdrant implementation (#24164 )	2024-07-12 04:52:02 +02:00
Xander Dumaine	35784d1c33	langchain[minor]: add document_variable_name to create_stuff_documents_chain (#24083 ) - Description: `StuffDocumentsChain` uses `LLMChain` which is deprecated by langchain runnables. `create_stuff_documents_chain` is the replacement, but needs support for `document_variable_name` to allow multiple uses of the chain within a longer chain. - Issue: none - Dependencies: none	2024-07-12 02:31:46 +00:00
Eugene Yurtsev	8858846607	milvus[patch]: Fix Milvus vectorstore for newer versions of langchain-core (#24152 ) Fix for: https://github.com/langchain-ai/langchain/issues/24116 This keeps the old behavior of add_documents and add_texts	2024-07-11 18:51:18 -07:00
thedavgar	ffe6ca986e	community: Fix Bug in Azure Search Vectorstore search asyncronously (#24081 ) Thank you for contributing to LangChain! Description: This PR fixes a bug described in the issue in #24064, when using the AzureSearch Vectorstore with the asyncronous methods to do search which is also the method used for the retriever. The proposed change includes just change the access of the embedding as optional because is it not used anywhere to retrieve documents. Actually, the syncronous methods of retrieval do not use the embedding neither. With this PR the code given by the user in the issue works. ```python vectorstore = AzureSearch( azure_search_endpoint=os.getenv("AI_SEARCH_ENDPOINT_SECRET"), azure_search_key=os.getenv("AI_SEARCH_API_KEY"), index_name=os.getenv("AI_SEARCH_INDEX_NAME_SECRET"), fields=fields, embedding_function=encoder, ) retriever = vectorstore.as_retriever(search_type="hybrid", k=2) await vectorstore.avector_search("what is the capital of France") await retriever.ainvoke("what is the capital of France") ``` Issue: The Azure Search Vectorstore is not working when searching for documents with asyncronous methods, as described in issue #24064 Dependencies: There are no extra dependencies required for this change. --------- Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-07-11 18:32:19 -07:00
Anush	7790d67f94	qdrant: New sparse embeddings provider interface - PART 1 (#24015 ) ## Description This PR introduces a new sparse embedding provider interface to work with the new Qdrant implementation that will follow this PR. Additionally, an implementation of this interface is provided with https://github.com/qdrant/fastembed. This PR will be followed by https://github.com/Anush008/langchain/pull/3.	2024-07-11 17:07:25 -07:00
Erick Friis	1132fb801b	core: release 0.2.16 (#24159 )	2024-07-11 23:59:41 +00:00
Nuno Campos	1d37aa8403	core: Remove extra newline (#24157 )	2024-07-11 23:55:36 +00:00
ccurme	cb95198398	standard-tests[patch]: add tests for runnables as tools and streaming usage metadata (#24153 )	2024-07-11 18:30:05 -04:00
Erick Friis	d002fa902f	infra: fix redundant matrix config (#24151 )	2024-07-11 15:15:41 -07:00
Bagatur	8d100c58de	core[patch]: Tool accept RunnableConfig (#24143 ) Relies on #24038 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-11 22:13:17 +00:00
Bagatur	5fd1e67808	core[minor], integrations...[patch]: Support ToolCall as Tool input and ToolMessage as Tool output (#24038 ) Changes: - ToolCall, InvalidToolCall and ToolCallChunk can all accept a "type" parameter now - LLM integration packages add "type" to all the above - Tool supports ToolCall inputs that have "type" specified - Tool outputs ToolMessage when a ToolCall is passed as input - Tools can separately specify ToolMessage.content and ToolMessage.raw_output - Tools emit events for validation errors (using on_tool_error and on_tool_end) Example: ```python @tool("structured_api", response_format="content_and_raw_output") def _mock_structured_tool_with_raw_output( arg1: int, arg2: bool, arg3: Optional[dict] = None ) -> Tuple[str, dict]: """A Structured Tool""" return f"{arg1} {arg2}", {"arg1": arg1, "arg2": arg2, "arg3": arg3} def test_tool_call_input_tool_message_with_raw_output() -> None: tool_call: Dict = { "name": "structured_api", "args": {"arg1": 1, "arg2": True, "arg3": {"img": "base64string..."}}, "id": "123", "type": "tool_call", } expected = ToolMessage("1 True", raw_output=tool_call["args"], tool_call_id="123") tool = _mock_structured_tool_with_raw_output actual = tool.invoke(tool_call) assert actual == expected tool_call.pop("type") with pytest.raises(ValidationError): tool.invoke(tool_call) actual_content = tool.invoke(tool_call["args"]) assert actual_content == expected.content ``` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-11 14:54:02 -07:00
Bagatur	eeb996034b	core[patch]: Release 0.2.15 (#24149 )	2024-07-11 21:34:25 +00:00
Nuno Campos	03fba07d15	core[patch]: Update styles for mermaid graphs (#24147 )	2024-07-11 14:19:36 -07:00
Jacob Lee	c481a2715d	docs[patch]: Add structural example to style guide (#24133 ) CC @nfcampos	2024-07-11 13:20:14 -07:00
ccurme	8ee8ca7c83	core[patch]: propagate `parse_docstring` to tool decorator (#24123 ) Disabled by default. ```python from langchain_core.tools import tool @tool(parse_docstring=True) def foo(bar: str, baz: int) -> str: """The foo. Args: bar: this is the bar baz: this is the baz """ return bar foo.args_schema.schema() ``` ```json { "title": "fooSchema", "description": "The foo.", "type": "object", "properties": { "bar": { "title": "Bar", "description": "this is the bar", "type": "string" }, "baz": { "title": "Baz", "description": "this is the baz", "type": "integer" } }, "required": [ "bar", "baz" ] } ```	2024-07-11 20:11:45 +00:00
Jacob Lee	4121d4151f	docs[patch]: Fix typo (#24132 ) CC @efriis	2024-07-11 20:10:48 +00:00
Erick Friis	bd18faa2a0	infra: add SQLAlchemy to min version testing (#23186 ) preventing issues like #22546 Notes: - this will only affect release CI. We may want to consider adding running unit tests with min versions to PR CI in some form - because this only affects release CI, it could create annoying issues releasing while I'm on vacation. Unless anyone feels strongly, I'll wait to merge this til when I'm back	2024-07-11 20:09:57 +00:00
Jacob Lee	f1f1f75782	community[patch]: Make AzureML endpoint return AI messages for type assistant (#24085 )	2024-07-11 21:45:30 +02:00
Eugene Yurtsev	4ba14adec6	core[patch]: Clean up indexing test code (#24139 ) Refactor the code to use the existing InMemroyVectorStore. This change is needed for another PR that moves some of the imports around (and messes up the mock.patch in this file)	2024-07-11 18:54:46 +00:00
Atul R	457677c1b7	community: Fixes use of ImagePromptTemplate with Ollama (#24140 ) Description: ImagePromptTemplate for Multimodal llms like llava when using Ollama Twitter handle: https://x.com/a7ulr Details: When using llava models / any ollama multimodal llms and passing images in the prompt as urls, langchain breaks with this error. ```python image_url_components = image_url.split(",") ^^^^^^^^^^^^^^^^^^^^ AttributeError: 'dict' object has no attribute 'split' ``` From the looks of it, there was bug where the condition did check for a `url` field in the variable but missed to actually assign it. This PR fixes ImagePromptTemplate for Multimodal llms like llava when using Ollama specifically. @hwchase17	2024-07-11 11:31:48 -07:00
Matt	8327925ab7	community:support additional Azure Search Options (#24134 ) - Description: Support additional kwargs options for the Azure Search client (Described here https://github.com/Azure/azure-sdk-for-python/blob/main/sdk/core/azure-core/README.md#configurations) - Issue: N/A - Dependencies: No additional Dependencies ---------	2024-07-11 18:22:36 +00:00
ccurme	122e80e04d	core[patch]: add versionadded to `as_tool` (#24138 )	2024-07-11 18:08:08 +00:00
Erick Friis	c4417ea93c	core: release 0.2.14, remove poetry 1.7 incompatible flag from root (#24137 )	2024-07-11 17:59:51 +00:00
Isaac Francisco	7a62d3dbd6	standard-tests[patch]: test that bind_tools can accept regular python function (#24135 )	2024-07-11 17:42:17 +00:00
Nuno Campos	2428984205	core: Add metadata to graph json repr (#24131 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-11 17:23:52 +00:00
Harley Gross	ea3cd1ebba	community[minor]: added support for C in RecursiveCharacterTextSplitter (#24091 ) Description: Added support for C in RecursiveCharacterTextSplitter by reusing the separators for C++	2024-07-11 16:47:48 +00:00
Nuno Campos	3e454d7568	core: fix docstring (#24129 )	2024-07-11 16:38:14 +00:00
Eugene Yurtsev	08638ccc88	community[patch]: QianfanLLMEndpoint fix type information for the keys (#24128 ) Fix for issue: https://github.com/langchain-ai/langchain/issues/24126	2024-07-11 16:24:26 +00:00
Nuno Campos	ee3fe20af4	core: mermaid: Render metadata key-value pairs when drawing mermaid graph (#24103 ) - if node is runnable binding with metadata attached	2024-07-11 16:22:23 +00:00
Eugene Yurtsev	1e7d8ba9a6	ci[patch]: Update community linter to provide a helpful error message (#24127 ) Update community import linter to explain what's wrong	2024-07-11 16:22:08 +00:00
maang-h	16e178a8c2	docs: Add MiniMaxChat docstrings (#24026 ) - Description: Add MiniMaxChat rich docstrings. - Issue: the issue #22296	2024-07-11 10:55:02 -04:00
Christophe Bornet	5fc5ef2b52	community[minor]: Add graph store extractors (#24065 ) This adds an extractor interface and an implementation for HTML pages. Extractors are used to create GraphVectorStore Links on loaded content. Twitter handle: cbornet_	2024-07-11 10:35:31 -04:00
maang-h	9bcf8f867d	docs: Add SQLChatMessageHistory docstring (#23978 ) - Description: Add SQLChatMessageHistory docstring. - Issue: the issue #21983 Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-11 14:24:28 +00:00
Rafael Pereira	092e9ee0e6	community[minor]: Neo4j Fixed similarity docs (#23913 ) Description: There was missing some documentation regarding the `filter` and `params` attributes in similarity search methods. --------- Co-authored-by: rpereira <rafael.pereira@criticalsoftware.com>	2024-07-11 10:16:48 -04:00
Mis	10d8c3cbfa	docs: Fix column positioning in the text splitting section for AI21SemanticTextSplitter (#24062 )	2024-07-11 09:38:04 -04:00
Jacob Lee	555c6d3c20	docs[patch]: Updates tool error handling guide, add admonition (#24102 ) @eyurtsev	2024-07-10 21:10:46 -07:00
Eugene Yurtsev	dc131ac42a	core[minor]: Add dispatching for custom events (#24080 ) This PR allows dispatching adhoc events for a given run. # Context This PR allows users to send arbitrary data to the callback system and to the astream events API from within a given runnable. This can be extremely useful to surface custom information to end users about progress etc. Integration with langsmith tracer will be done separately since the data cannot be currently visualized. It'll be accommodated using the events attribute of the Run # Examples with astream events ```python from langchain_core.callbacks import adispatch_custom_event from langchain_core.tools import tool @tool async def foo(x: int) -> int: """Foo""" await adispatch_custom_event("event1", {"x": x}) await adispatch_custom_event("event2", {"x": x}) return x + 1 async for event in foo.astream_events({'x': 1}, version='v2'): print(event) ``` ```python {'event': 'on_tool_start', 'data': {'input': {'x': 1}}, 'name': 'foo', 'tags': [], 'run_id': 'fd6fb7a7-dd37-4191-962c-e43e245909f6', 'metadata': {}, 'parent_ids': []} {'event': 'on_custom_event', 'run_id': 'fd6fb7a7-dd37-4191-962c-e43e245909f6', 'name': 'event1', 'tags': [], 'metadata': {}, 'data': {'x': 1}, 'parent_ids': []} {'event': 'on_custom_event', 'run_id': 'fd6fb7a7-dd37-4191-962c-e43e245909f6', 'name': 'event2', 'tags': [], 'metadata': {}, 'data': {'x': 1}, 'parent_ids': []} {'event': 'on_tool_end', 'data': {'output': 2}, 'run_id': 'fd6fb7a7-dd37-4191-962c-e43e245909f6', 'name': 'foo', 'tags': [], 'metadata': {}, 'parent_ids': []} ``` ```python from langchain_core.callbacks import adispatch_custom_event from langchain_core.runnables import RunnableLambda @RunnableLambda async def foo(x: int) -> int: """Foo""" await adispatch_custom_event("event1", {"x": x}) await adispatch_custom_event("event2", {"x": x}) return x + 1 async for event in foo.astream_events(1, version='v2'): print(event) ``` ```python {'event': 'on_chain_start', 'data': {'input': 1}, 'name': 'foo', 'tags': [], 'run_id': 'ce2beef2-8608-49ea-8eba-537bdaafb8ec', 'metadata': {}, 'parent_ids': []} {'event': 'on_custom_event', 'run_id': 'ce2beef2-8608-49ea-8eba-537bdaafb8ec', 'name': 'event1', 'tags': [], 'metadata': {}, 'data': {'x': 1}, 'parent_ids': []} {'event': 'on_custom_event', 'run_id': 'ce2beef2-8608-49ea-8eba-537bdaafb8ec', 'name': 'event2', 'tags': [], 'metadata': {}, 'data': {'x': 1}, 'parent_ids': []} {'event': 'on_chain_stream', 'run_id': 'ce2beef2-8608-49ea-8eba-537bdaafb8ec', 'name': 'foo', 'tags': [], 'metadata': {}, 'data': {'chunk': 2}, 'parent_ids': []} {'event': 'on_chain_end', 'data': {'output': 2}, 'run_id': 'ce2beef2-8608-49ea-8eba-537bdaafb8ec', 'name': 'foo', 'tags': [], 'metadata': {}, 'parent_ids': []} ``` # Examples with handlers This is copy pasted from unit tests ```python class CustomCallbackManager(BaseCallbackHandler): def __init__(self) -> None: self.events: List[Any] = [] def on_custom_event( self, name: str, data: Any, , run_id: UUID, tags: Optional[List[str]] = None, metadata: Optional[Dict[str, Any]] = None, *kwargs: Any, ) -> None: assert kwargs == {} self.events.append( ( name, data, run_id, tags, metadata, ) ) callback = CustomCallbackManager() run_id = uuid.UUID(int=7) @RunnableLambda def foo(x: int, config: RunnableConfig) -> int: dispatch_custom_event("event1", {"x": x}) dispatch_custom_event("event2", {"x": x}, config=config) return x foo.invoke(1, {"callbacks": [callback], "run_id": run_id}) assert callback.events == [ ("event1", {"x": 1}, UUID("00000000-0000-0000-0000-000000000007"), [], {}), ("event2", {"x": 1}, UUID("00000000-0000-0000-0000-000000000007"), [], {}), ] ```	2024-07-11 02:25:12 +00:00
Jacob Lee	14a8bbc21a	docs[patch]: Adds tool intermediate streaming guide (#24098 ) Can merge now and update when we add support for custom events. CC @eyurtsev @vbarda	2024-07-10 17:38:51 -07:00
Erick Friis	1de1182a9f	docs: discourage unconfirmed partner packages (#24099 )	2024-07-11 00:34:37 +00:00
Erick Friis	71c2221f8c	openai: release 0.1.15 (#24097 )	2024-07-10 16:45:42 -07:00
Erick Friis	6ea6f9f7bc	core: release 0.2.13 (#24096 )	2024-07-10 16:39:15 -07:00
ccurme	975b6129f6	core[patch]: support conversion of runnables to tools (#23992 ) Open to other thoughts on UX. string input: ```python as_tool = retriever.as_tool() as_tool.invoke("cat") # [Document(...), ...] ``` typed dict input: ```python class Args(TypedDict): key: int def f(x: Args) -> str: return str(x["key"] * 2) as_tool = RunnableLambda(f).as_tool( name="my tool", description="description", # name, description are inferred if not supplied ) as_tool.invoke({"key": 3}) # "6" ``` for untyped dict input, allow specification of parameters + types ```python def g(x: Dict[str, Any]) -> str: return str(x["key"] * 2) as_tool = RunnableLambda(g).as_tool(arg_types={"key": int}) result = as_tool.invoke({"key": 3}) # "6" ``` Passing the `arg_types` is slightly awkward but necessary to ensure tool calls populate parameters correctly: ```python from typing import Any, Dict from langchain_core.runnables import RunnableLambda from langchain_openai import ChatOpenAI def f(x: Dict[str, Any]) -> str: return str(x["key"] * 2) runnable = RunnableLambda(f) as_tool = runnable.as_tool(arg_types={"key": int}) llm = ChatOpenAI().bind_tools([as_tool]) result = llm.invoke("Use the tool on 3.") tool_call = result.tool_calls[0] args = tool_call["args"] assert args == {"key": 3} as_tool.run(args) ``` Contrived (?) example with langgraph agent as a tool: ```python from typing import List, Literal from typing_extensions import TypedDict from langchain_openai import ChatOpenAI from langgraph.prebuilt import create_react_agent llm = ChatOpenAI(temperature=0) def magic_function(input: int) -> int: """Applies a magic function to an input.""" return input + 2 agent_1 = create_react_agent(llm, [magic_function]) class Message(TypedDict): role: Literal["human"] content: str agent_tool = agent_1.as_tool( arg_types={"messages": List[Message]}, name="Jeeves", description="Ask Jeeves.", ) agent_2 = create_react_agent(llm, [agent_tool]) ``` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-10 19:29:59 -04:00
Jacob Lee	b63a48b7d3	docs[patch]: Fix typos, add prereq sections (#24095 )	2024-07-10 23:15:37 +00:00
Erick Friis	9de562f747	infra: create individual jobs in check_diff, do max milvus testing in 3.11 (#23829 ) pickup from #23721	2024-07-10 22:45:18 +00:00
Erick Friis	141943a7e1	infra: docs ignore step in script (#24090 )	2024-07-10 15:18:00 -07:00
Bagatur	6928f4c438	core[minor]: Add ToolMessage.raw_output (#23994 ) Decisions to discuss: 1. is a new attr needed or could additional_kwargs be used for this 2. is raw_output a good name for this attr 3. should raw_output default to {} or None 4. should raw_output be included in serialization 5. do we need to update repr/str to exclude raw_output	2024-07-10 20:11:10 +00:00
jongwony	14dd89a1ee	docs: add itemgetter in how_to/dynamic_chain (#23951 ) Hello! I am honored to be able to contribute to the LangChain project for the first time. - Description: Using `RunnablePassthrough` logic without providing `chat_history` key will result in nested keys in `question`, so I submit a pull request to resolve this issue. I am attaching a LangSmith screenshot below. This is the result of the current version of the document. <img width="1112" alt="image" src="https://github.com/langchain-ai/langchain/assets/12846075/f0597089-c375-472f-b2bf-793baaecd836"> without `chat_history`: <img width="1112" alt="image" src="https://github.com/langchain-ai/langchain/assets/12846075/5c0e3ae7-3afe-417c-9132-770387f0fff2"> - Lint and test: <img width="777" alt="image" src="https://github.com/langchain-ai/langchain/assets/12846075/575d2545-3aed-4338-9779-1a0b17365418">	2024-07-10 17:17:51 +00:00
Eugene Yurtsev	c4e149d4f1	community[patch]: Add linter to catch @root_validator (#24070 ) - Add linter to prevent further usage of vanilla root validator - Udpate remaining root validators	2024-07-10 14:51:03 +00:00
ccurme	9c6efadec3	community[patch]: propagate cost information to OpenAI callback (#23996 ) This is enabled following https://github.com/langchain-ai/langchain/pull/22716.	2024-07-10 14:50:35 +00:00
Dismas Banda	91b37b2d81	docs: fix spelling mistake in concepts.mdx: Fouth -> Fourth (#24067 ) Description: Corrected the spelling for fourth. Twitter handle: @dismasbanda	2024-07-10 14:35:54 +00:00
William FH	1e1fd30def	[Core] Fix fstring in logger warning (#24043 )	2024-07-09 19:53:18 -07:00
Jacob Lee	66265aaac4	docs[patch]: Update GPT4All docs (#24044 ) CC @efriis	2024-07-10 02:39:42 +00:00
Jacob Lee	8dac0fb3f1	docs[patch]: Remove deprecated Airbyte loaders from listings (#23927 ) CC @efriis	2024-07-10 02:21:25 +00:00
G Sreejith	68fee3e44b	docs: template readme update, fix docstring typo in a runnable (#24002 ) URL https://python.langchain.com/v0.2/docs/templates/openai-functions-tool-retrieval-agent/ Checklist I added a url - https://python.langchain.com/v0.2/docs/templates/openai-functions-agent/	2024-07-09 14:03:31 -07:00
Ethan Yang	13855ef0c3	[HuggingFace Pipeline] add streaming support (#23852 )	2024-07-09 17:02:00 -04:00
Erick Friis	34a02efcf9	infra: remove double heading in release notes (#24037 )	2024-07-09 20:48:17 +00:00
Nuno Campos	859e434932	core: Speed up json parse for large strings (#24036 ) for a large string: - old 4.657918874989264 - new 0.023724667000351474	2024-07-09 12:26:50 -07:00
Nuno Campos	160fc7f246	core: Move json parsing in base chat model / output parser to bg thread (#24031 ) - add version of AIMessageChunk.__add__ that can add many chunks, instead of only 2 - In agenerate_from_stream merge and parse chunks in bg thread - In output parse base classes do more work in bg threads where appropriate --------- Co-authored-by: William FH <13333726+hinthornw@users.noreply.github.com>	2024-07-09 12:26:36 -07:00
Nuno Campos	73966e693c	openai: Create msg chunk in bg thread (#24032 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-09 12:01:51 -07:00
Erick Friis	007c5a85d5	multiple: use modern installer in poetry (#23998 )	2024-07-08 18:50:48 -07:00
Erick Friis	e80c150c44	community: release 0.2.7 (prev was langchain) (#23997 )	2024-07-08 23:43:32 +00:00
Erick Friis	9f8fd08955	community: release 0.2.7 (#23993 )	2024-07-08 22:04:58 +00:00
Bhadresh Savani	5d78b34a6f	[Docs] typo Update in azureopenai.ipynb (#23945 ) Update documentation for a typo.	2024-07-08 17:48:33 -04:00
Erick Friis	bedd893cd1	core: release 0.2.12 (#23991 )	2024-07-08 21:29:29 +00:00
Bagatur	1e957c0c23	docs: rm discord (#23985 )	2024-07-08 14:27:58 -07:00
Eugene Yurtsev	f765e8fa9d	core[minor],community[patch],standard-tests[patch]: Move InMemoryImplementation to langchain-core (#23986 ) This PR moves the in memory implementation to langchain-core. * The implementation remains importable from langchain-community. * Supporting utilities are marked as private for now.	2024-07-08 14:11:51 -07:00
Eugene Yurtsev	aa8c9bb4a9	community[patch]: Add constraint for pdfminer.six to unbreak CI (#23988 ) Something changed in pdfminer six. This PR unreaks CI without fixing the underlying PDF parser.	2024-07-08 20:55:19 +00:00
Eugene Yurtsev	2c180d645e	core[minor],community[minor]: Upgrade all @root_validator() to @pre_init (#23841 ) This PR introduces a @pre_init decorator that's a @root_validator(pre=True) but with all the defaults populated!	2024-07-08 16:09:29 -04:00
Mustafa Abdul-Kader	f152d6ed3d	docs(llamacpp): fix copy paste error (#23983 )	2024-07-08 20:06:04 +00:00
JonasDeitmersATACAMA	4d6f28cdde	Update annoy.ipynb (#23970 ) mmemory in the description -> memory (corrected spelling mistake) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-08 12:52:05 +00:00
Zheng Robert Jia	bf8d4716a7	Update concepts.mdx (#23955 ) Added link to list of built-in tools. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-08 08:47:51 -04:00
Zheng Robert Jia	4ec5fdda8d	Update index.mdx (#23956 ) Added reference to built-in tools list.	2024-07-08 08:47:28 -04:00
ccurme	ee579c77c1	docs: chain migration guide (#23844 ) Co-authored-by: jacoblee93 <jacoblee93@gmail.com>	2024-07-05 16:37:34 -07:00
Eugene Yurtsev	9787552b00	core[patch]: Use InMemoryChatMessageHistory in unit tests (#23916 ) Update unit test to use the existing implementation of chat message history	2024-07-05 20:10:54 +00:00
Rajendra Kadam	8b84457b17	community[minor]: Support PGVector in PebbloRetrievalQA (#23874 ) - Description: Support PGVector in PebbloRetrievalQA - Identity and Semantic Enforcement support for PGVector - Refactor Vectorstore validation and name check - Clear the overridden identity and semantic enforcement filters - Issue: NA - Dependencies: NA - Tests: NA(already added) - Docs: Updated - Twitter handle: [@Raj__725](https://twitter.com/Raj__725)	2024-07-05 16:02:25 -04:00
Eugene Yurtsev	e0186df56b	core[patch]: Clarify upsert response semantics (#23921 )	2024-07-05 15:59:47 -04:00
Leonid Ganeline	fcd018be47	docs: langgraph link fix (#23848 ) Link for the LangGraph doc is instead the LG repo link. Fixed the link	2024-07-05 15:50:45 -04:00
Robbie Cronin	0990ab146c	community: update import in chatbot tutorial to use InMemoryChatMessageHistory (#23903 ) Summary of change: - Replace ChatMessageHistory with InMemoryChatMessageHistory Fixes #23892 --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-05 15:48:11 -04:00
Rajendra Kadam	ee8aa54f53	community[patch]: Fix source path mismatch in PebbloSafeLoader (#23857 ) Description: Fix for source path mismatch in PebbloSafeLoader. The fix involves storing the full path in the doc metadata in VectorDB Issue: NA, caught in internal testing Dependencies: NA Add tests: Updated tests	2024-07-05 15:24:17 -04:00
Eugene Yurtsev	5b7d5f7729	core[patch]: Add comment to clarify aadd_documents (#23920 ) Add comment to clarify how add documents works	2024-07-05 15:20:16 -04:00
Eugene Yurtsev	e0889384d9	standard-tests[minor]: add unit tests for testing get_by_ids, aget_by_ids, upsert, aupsert_by_ids (#23919 ) These standard unit tests provide standard tests for functionality introduced in these PRs: * https://github.com/langchain-ai/langchain/pull/23774 * https://github.com/langchain-ai/langchain/pull/23594	2024-07-05 19:11:54 +00:00
ccurme	74c7198906	core, anthropic[patch]: support streaming tool calls when function has no arguments (#23915 ) resolves https://github.com/langchain-ai/langchain/issues/23911 When an AIMessageChunk is instantiated, we attempt to parse tool calls off of the tool_call_chunks. Here we add a special-case to this parsing, where `""` will be parsed as `{}`. This is a reaction to how Anthropic streams tool calls in the case where a function has no arguments: ``` {'id': 'toolu_01J8CgKcuUVrMqfTQWPYh64r', 'input': {}, 'name': 'magic_function', 'type': 'tool_use', 'index': 1} {'partial_json': '', 'type': 'tool_use', 'index': 1} ``` The `partial_json` does not accumulate to a valid json string-- most other providers tend to emit `"{}"` in this case.	2024-07-05 18:57:41 +00:00
Mateusz Szewczyk	902b57d107	IBM: Added WatsonxChat passing params to invoke method (#23758 ) Thank you for contributing to LangChain! - [x] PR title: "IBM: Added WatsonxChat to chat models preview, update passing params to invoke method" - [x] PR message: - Description: Added WatsonxChat passing params to invoke method, added integration tests - Dependencies: `ibm_watsonx_ai` - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-05 18:07:50 +00:00
ccurme	1f5a163f42	langchain[patch]: deprecate QAGenerationChain (#23730 )	2024-07-05 18:06:19 +00:00
ccurme	25de47878b	langchain[patch]: deprecate AnalyzeDocumentChain (#23769 )	2024-07-05 14:00:23 -04:00
Christophe Bornet	42d049f618	core[minor]: Add Graph Store component (#23092 ) This PR introduces a GraphStore component. GraphStore extends VectorStore with the concept of links between documents based on document metadata. This allows linking documents based on a variety of techniques, including common keywords, explicit links in the content, and other patterns. This works with existing Documents, so it’s easy to extend existing VectorStores to be used as GraphStores. The interface can be implemented for any Vector Store technology that supports metadata, not only graph DBs. When retrieving documents for a given query, the first level of search is done using classical similarity search. Next, links may be followed using various traversal strategies to get additional documents. This allows documents to be retrieved that aren’t directly similar to the query but contain relevant information. 2 retrieving methods are added to the VectorStore ones : * traversal_search which gets all linked documents up to a certain depth * mmr_traversal_search which selects linked documents using an MMR algorithm to have more diverse results. If a depth of retrieval of 0 is used, GraphStore is effectively a VectorStore. It enables an easy transition from a simple VectorStore to GraphStore by adding links between documents as a second step. An implementation for Apache Cassandra is also proposed. See https://github.com/datastax/ragstack-ai/blob/main/libs/knowledge-store/notebooks/astra_support.ipynb for a notebook explaining how to use GraphStore and that shows that it can answer correctly to questions that a simple VectorStore cannot. Twitter handle: _cbornet	2024-07-05 12:24:10 -04:00
Leonid Ganeline	77f5fc3d55	core: docstrings `load` (#23787 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-07-05 12:23:19 -04:00
Eugene Yurtsev	6f08e11d7c	core[minor]: add upsert, streaming_upsert, aupsert, astreaming_upsert methods to the VectorStore abstraction (#23774 ) This PR rolls out part of the new proposed interface for vectorstores (https://github.com/langchain-ai/langchain/pull/23544) to existing store implementations. The PR makes the following changes: 1. Adds standard upsert, streaming_upsert, aupsert, astreaming_upsert methods to the vectorstore. 2. Updates `add_texts` and `aadd_texts` to be non required with a default implementation that delegates to `upsert` and `aupsert` if those have been implemented. The original `add_texts` and `aadd_texts` methods are problematic as they spread object specific information across document and *kwargs. (e.g., ids are not a part of the document) 3. Adds a default implementation to `add_documents` and `aadd_documents` that delegates to `upsert` and `aupsert` respectively. 4. Adds standard unit tests to verify that a given vectorstore implements a correct read/write API. A downside of this implementation is that it creates `upsert` with a very similar signature to `add_documents`. The reason for introducing `upsert` is to: Remove any ambiguities about what information is allowed in `kwargs`. Specifically kwargs should only be used for information common to all indexed data. (e.g., indexing timeout). *Allow inheriting from an anticipated generalized interface for indexing that will allow indexing `BaseMedia` (i.e., allow making a vectorstore for images/audio etc.) `add_documents` can be deprecated in the future in favor of `upsert` to make sure that users have a single correct way of indexing content. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-05 12:21:40 -04:00
G Sreejith	3c752238c5	core[patch]: Fix typo in docstring (graphm -> graph) (#23910 ) Changes has been as per the request Replaced graphm with graph	2024-07-05 16:20:33 +00:00
Leonid Ganeline	12c92b6c19	core: docstrings `outputs` (#23889 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-07-05 12:18:17 -04:00
Leonid Ganeline	1eca98ec56	core: docstrings `prompts` (#23890 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-07-05 12:17:52 -04:00
Philippe PRADOS	289960bc60	community[patch]: Redis.delete should be a regular method not a static method (#23873 ) The `langchain_common.vectostore.Redis.delete()` must not be a `@staticmethod`. With the current implementation, it's not possible to have multiple instances of Redis vectorstore because all versions must share the `REDIS_URL`. It's not conform with the base class.	2024-07-05 12:04:58 -04:00
Mohammad Mohtashim	2274d2b966	core[patch]: Accounting for Optional Input Variables in BasePromptTemplate (#22851 ) Description: After reviewing the prompts API, it is clear that the only way a user can explicitly mark an input variable as optional is through the `MessagePlaceholder.optional` attribute. Otherwise, the user must explicitly pass in the `input_variables` expected to be used in the `BasePromptTemplate`, which will be validated upon execution. Therefore, to semantically handle a `MessagePlaceholder` `variable_name` as optional, we will treat the `variable_name` of `MessagePlaceholder` as a `partial_variable` if it has been marked as optional. This approach aligns with how the `variable_name` of `MessagePlaceholder` is already handled [here](https://github.com/keenborder786/langchain/blob/optional_input_variables/libs/core/langchain_core/prompts/chat.py#L991). Additionally, an attribute `optional_variable` has been added to `BasePromptTemplate`, and the `variable_name` of `MessagePlaceholder` is also made part of `optional_variable` when marked as optional. Moreover, the `get_input_schema` method has been updated for `BasePromptTemplate` to differentiate between optional and non-optional variables. Issue: #22832, #21425 --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-05 15:49:40 +00:00
Klaudia Lemiec	a2082bc1f8	docs: Arxiv docs update (#23871 ) - [X] PR title - [X] PR message: *Delete this entire checklist* and replace with - Description: Update of docstrings and docpages - Issue: [22866](https://github.com/langchain-ai/langchain/issues/22866) - [X] Add tests and docs - [X] Lint and test	2024-07-05 11:43:51 -04:00
jonathan \| ヨナタン	d311f22182	Langchain: fixed a typo in the imports (#23864 ) Description: Fixed a typo during the imports for the GoogleDriveSearchTool Issue: It's only for the docs, but it bothered me so i decided to fix it quickly :D	2024-07-05 15:42:50 +00:00
Arun Sasidharan	db6512aa35	docs: fix typo in llm_chain.ipynb (#23907 ) - Fix typo in the tutorial step - Add some context on `text`	2024-07-05 15:41:46 +00:00
André Quintino	99b1467b63	community: add support for 'cloud' parameter in JiraAPIWrapper (#23057 ) - Description: Enhance JiraAPIWrapper to accept the 'cloud' parameter through an environment variable. This update allows more flexibility in configuring the environment for the Jira API. - Twitter handle: Andre_Q_Pereira --------- Co-authored-by: André Quintino <andre.quintino@tui.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-05 15:11:10 +00:00
wenngong	b1e90b3075	community: add model_name param valid for GPT4AllEmbeddings (#23867 ) Description: add model_name param valid for GPT4AllEmbeddings Issue: #23863 #22819 --------- Co-authored-by: gongwn1 <gongwn1@lenovo.com>	2024-07-05 10:46:34 -04:00
volodymyr-memsql	a4eb6d0fb1	community: add SingleStoreDB semantic cache (#23218 ) This PR adds a `SingleStoreDBSemanticCache` class that implements a cache based on SingleStoreDB vector store, integration tests, and a notebook example. Additionally, this PR contains minor changes to SingleStoreDB vector store: - change add texts/documents methods to return a list of inserted ids - implement delete(ids) method to delete documents by list of ids - added drop() method to drop a correspondent database table - updated integration tests to use and check functionality implemented above CC: @baskaryan, @hwchase17 --------- Co-authored-by: Volodymyr Tkachuk <vtkachuk-ua@singlestore.com>	2024-07-05 09:26:06 -04:00
Igor Drozdov	bb597b1286	feat(community): add bind_tools function for ChatLiteLLM (#23823 ) It's a follow-up to https://github.com/langchain-ai/langchain/pull/23765 Now the tools can be bound by calling `bind_tools` ```python from langchain_core.pydantic_v1 import BaseModel, Field from langchain_core.utils.function_calling import convert_to_openai_tool from langchain_community.chat_models import ChatLiteLLM class GetWeather(BaseModel): '''Get the current weather in a given location''' location: str = Field(..., description="The city and state, e.g. San Francisco, CA") class GetPopulation(BaseModel): '''Get the current population in a given location''' location: str = Field(..., description="The city and state, e.g. San Francisco, CA") prompt = "Which city is hotter today and which is bigger: LA or NY?" # tools = [convert_to_openai_tool(GetWeather), convert_to_openai_tool(GetPopulation)] tools = [GetWeather, GetPopulation] llm = ChatLiteLLM(model="claude-3-sonnet-20240229").bind_tools(tools) ai_msg = llm.invoke(prompt) print(ai_msg.tool_calls) ``` If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Co-authored-by: Igor Drozdov <idrozdov@gitlab.com>	2024-07-05 09:19:41 -04:00
eliasecchig	efb48566d0	docs: add Vertex Feature Store, edit BigQuery Vector Search (#23709 ) Add Vertex Feature Store, edit BigQuery Vector Search docs --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-05 12:12:21 +00:00
Yuki Watanabe	0e916d0d55	community: Overhaul MLflow Integration documentation (#23067 )	2024-07-03 22:52:17 -04:00
ccurme	e62f8f143f	infra: remove cohere from monorepo scheduled tests (#23846 )	2024-07-03 21:48:39 +00:00
Jiejun Tan	2be66a38d8	huggingface: Fix huggingface tei support (#22653 ) Update former pull request: https://github.com/langchain-ai/langchain/pull/22595. Modified `libs/partners/huggingface/langchain_huggingface/embeddings/huggingface_endpoint.py`, where the API call function does not match current [Text Embeddings Inference API](https://huggingface.github.io/text-embeddings-inference/#/Text%20Embeddings%20Inference/embed). One example is: ```json { "inputs": "string", "normalize": true, "truncate": false } ``` Parameters in `_model_kwargs` are not passed properly in the latest version. By the way, the issue [why cause 413? #50](https://github.com/huggingface/text-embeddings-inference/issues/50) might be solved.	2024-07-03 13:30:29 -07:00
Eugene Yurtsev	9ccc4b1616	core[patch]: Fix logic in BaseChatModel that processes the llm string that is used as a key for caching chat models responses (#23842 ) This PR should fix the following issue: https://github.com/langchain-ai/langchain/issues/23824 Introduced as part of this PR: https://github.com/langchain-ai/langchain/pull/23416 I am unable to reproduce the issue locally though it's clear that we're getting a `serialized` object which is not a dictionary somehow. The test below passes for me prior to the PR as well ```python def test_cache_with_sqllite() -> None: from langchain_community.cache import SQLiteCache from langchain_core.globals import set_llm_cache cache = SQLiteCache(database_path=".langchain.db") set_llm_cache(cache) chat_model = FakeListChatModel(responses=["hello", "goodbye"], cache=True) assert chat_model.invoke("How are you?").content == "hello" assert chat_model.invoke("How are you?").content == "hello" ```	2024-07-03 16:23:55 -04:00
Vadym Barda	9bb623381b	core[minor]: update conversion utils to handle RemoveMessage (#23840 )	2024-07-03 16:13:31 -04:00
Eugene Yurtsev	4ab78572e7	core[patch]: Speed up unit tests for imports (#23837 ) Speed up unit tests for imports	2024-07-03 15:55:15 -04:00
Nico Puhlmann	4a15fce516	langchain: update declarative_base import (#20056 ) Description: The ``declarative_base()`` function is now available as sqlalchemy.orm.declarative_base(). (depreca ted since: 2.0) (Background on SQLAlchemy 2.0 at: https://sqlalche.me/e/b8d9) --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-07-03 15:52:35 -04:00
Mu Xian Ming	c06c666ce5	docs: fix docs/tutorials/llm_chain.ipynb (#23807 ) to correctly display the link Co-authored-by: Mu Xianming <mu.xianming@lmwn.com>	2024-07-03 15:38:31 -04:00
Vadym Barda	d206df8d3d	docs: improve structure in the agent migration to langgraph guide (#23817 )	2024-07-03 12:25:11 -07:00
Théo Deschamps	39b19cf764	core[patch]: extract input variables for `path` and `detail` keys in order to format an `ImagePromptTemplate` (#22613 ) - Description: Add support for `path` and `detail` keys in `ImagePromptTemplate`. Previously, only variables associated with the `url` key were considered. This PR allows for the inclusion of a local image path and a detail parameter as input to the format method. - Issues: - fixes #20820 - related to #22024 - Dependencies: None - Twitter handle: @DeschampsTho5 --------- Co-authored-by: tdeschamps <tdeschamps@kameleoon.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-07-03 18:58:42 +00:00
Bagatur	a4798802ef	cli[patch]: ruff 0.5 (#23833 )	2024-07-03 18:33:15 +00:00
Leonid Ganeline	55f6f91f17	core[patch]: docstrings `output_parsers` (#23825 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-07-03 14:27:40 -04:00
Philippe PRADOS	26cee2e878	partners[patch]: MongoDB vectorstore to return and accept string IDs (#23818 ) The mongdb have some errors. - `add_texts() -> List` returns a list of `ObjectId`, and not a list of string - `delete()` with `id` never remove chunks. --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-07-03 14:14:08 -04:00
Ikko Eltociear Ashimine	75734fbcf1	community: fix typo in unit tests for test_zenguard.py (#23819 ) enviroment -> environment - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM"	2024-07-03 14:05:42 -04:00
Bagatur	a0c2281540	infra: update mypy 1.10, ruff 0.5 (#23721 ) ```python """python scripts/update_mypy_ruff.py""" import glob import tomllib from pathlib import Path import toml import subprocess import re ROOT_DIR = Path(__file__).parents[1] def main(): for path in glob.glob(str(ROOT_DIR / "libs/*/pyproject.toml"), recursive=True): print(path) with open(path, "rb") as f: pyproject = tomllib.load(f) try: pyproject["tool"]["poetry"]["group"]["typing"]["dependencies"]["mypy"] = ( "^1.10" ) pyproject["tool"]["poetry"]["group"]["lint"]["dependencies"]["ruff"] = ( "^0.5" ) except KeyError: continue with open(path, "w") as f: toml.dump(pyproject, f) cwd = "/".join(path.split("/")[:-1]) completed = subprocess.run( "poetry lock --no-update; poetry install --with typing; poetry run mypy . --no-color", cwd=cwd, shell=True, capture_output=True, text=True, ) logs = completed.stdout.split("\n") to_ignore = {} for l in logs: if re.match("^(.)\:(\d+)\: error:.\[(.)\]", l): path, line_no, error_type = re.match( "^(.)\:(\d+)\: error:.\[(.*)\]", l ).groups() if (path, line_no) in to_ignore: to_ignore[(path, line_no)].append(error_type) else: to_ignore[(path, line_no)] = [error_type] print(len(to_ignore)) for (error_path, line_no), error_types in to_ignore.items(): all_errors = ", ".join(error_types) full_path = f"{cwd}/{error_path}" try: with open(full_path, "r") as f: file_lines = f.readlines() except FileNotFoundError: continue file_lines[int(line_no) - 1] = ( file_lines[int(line_no) - 1][:-1] + f" # type: ignore[{all_errors}]\n" ) with open(full_path, "w") as f: f.write("".join(file_lines)) subprocess.run( "poetry run ruff format .; poetry run ruff --select I --fix .", cwd=cwd, shell=True, capture_output=True, text=True, ) if __name__ == "__main__": main() ```	2024-07-03 10:33:27 -07:00
William FH	6cd56821dc	[Core] Unify function schema parsing (#23370 ) Use pydantic to infer nested schemas and all that fun. Include bagatur's convenient docstring parser Include annotation support Previously we didn't adequately support many typehints in the bind_tools() method on raw functions (like optionals/unions, nested types, etc.)	2024-07-03 09:55:38 -07:00
Oguz Vuruskaner	2a2c0d1a94	community[deepinfra]: fix tool call parsing. (#23162 ) This PR includes fix for DeepInfra tool call parsing.	2024-07-03 12:11:37 -04:00
maang-h	525109e506	feat: Implement ChatBaichuan asynchronous interface (#23589 ) - Description: Add interface to `ChatBaichuan` to support asynchronous requests - `_agenerate` method - `_astream` method --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-03 12:10:04 -04:00
Bagatur	8842a0d986	docs: fireworks nit (#23822 )	2024-07-03 15:36:27 +00:00
Leonid Ganeline	716a316654	core: docstrings `indexing` (#23785 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-07-03 11:27:34 -04:00
Leonid Ganeline	30fdc2dbe7	core: docstrings `messages` (#23788 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-07-03 11:25:00 -04:00
ccurme	54e730f6e4	fireworks[patch]: read from tool calls attribute (#23820 )	2024-07-03 11:11:17 -04:00
Bagatur	e787249af1	docs: fireworks standard page (#23816 )	2024-07-03 14:33:05 +00:00
Jacob Lee	27aa4d38bf	docs[patch]: Update structured output docs to have more discussion (#23786 ) CC @agola11 @ccurme	2024-07-02 16:53:31 -07:00
Bagatur	ebb404527f	anthropic[patch]: Release 0.1.19 (#23783 )	2024-07-02 18:17:25 -04:00
Bagatur	6168c846b2	openai[patch]: Release 0.1.14 (#23782 )	2024-07-02 18:17:15 -04:00
Bagatur	cb9812593f	openai[patch]: expose model request payload (#23287 ) ![Screenshot 2024-06-21 at 3 12 12 PM](https://github.com/langchain-ai/langchain/assets/22008038/6243a01f-1ef6-4085-9160-2844d9f2b683)	2024-07-02 17:43:55 -04:00
Bagatur	ed200bf2c4	anthropic[patch]: expose payload (#23291 ) ![Screenshot 2024-06-21 at 4 56 02 PM](https://github.com/langchain-ai/langchain/assets/22008038/a2c6224f-3741-4502-9607-1a726a0551c9)	2024-07-02 17:43:47 -04:00
Bagatur	7a3d8e5a99	core[patch]: Release 0.2.11 (#23780 )	2024-07-02 17:35:57 -04:00
Bagatur	d677dadf5f	core[patch]: mark RemoveMessage beta (#23656 )	2024-07-02 21:27:21 +00:00
ccurme	1d54ac93bb	ai21[patch]: release 0.1.7 (#23781 )	2024-07-02 21:24:13 +00:00
Asaf Joseph Gardin	320dc31822	partners: AI21 Labs Jamba Streaming Support (#23538 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - [x] PR message: *Delete this entire checklist* and replace with - Description: Added support for streaming in AI21 Jamba Model - Twitter handle: https://github.com/AI21Labs - [x] Add tests and docs: If you're adding a new integration, please include - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. --------- Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-02 17:15:46 -04:00
Qingchuan Hao	5cd4083457	community: make bing web search as the only option (#23523 ) This PR make bing web search as the option for BingSearchAPIWrapper to facilitate and simply the user interface on Langchain. This is a follow-up work of https://github.com/langchain-ai/langchain/pull/23306.	2024-07-02 17:13:54 -04:00
William W Wang	76e7e4e9e6	Update docs: LangChain agent memory (#23673 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" Description: Update docs content on agent memory If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-02 17:06:32 -04:00
ccurme	7c1cddf1b7	anthropic[patch]: release 0.1.18 (#23778 )	2024-07-02 16:46:47 -04:00
ccurme	c9dac59008	anthropic[patch]: fix model name in some integration tests (#23779 )	2024-07-02 20:45:52 +00:00
Bagatur	7a6c06cadd	anthropic[patch]: tool output parser fix (#23647 )	2024-07-02 16:33:22 -04:00
ccurme	46cbf0e4aa	anthropic[patch]: use core output parsers for structured output (#23776 ) Also add to standard tests for structured output.	2024-07-02 16:15:26 -04:00
kiarina	dc396835ed	langchain_anthropic: add stop_reason in ChatAnthropic stream result (#23689 ) `ChatAnthropic` can get `stop_reason` from the resulting `AIMessage` in `invoke` and `ainvoke`, but not in `stream` and `astream`. This is a different behavior from `ChatOpenAI`. It is possible to get `stop_reason` from `stream` as well, since it is needed to determine the next action after the LLM call. This would be easier to handle in situations where only `stop_reason` is needed. - Issue: NA - Dependencies: NA - Twitter handle: https://x.com/kiarina37	2024-07-02 15:16:20 -04:00
Bagatur	27ce58f86e	docs: google genai standard page (#23766 ) Part of #22296	2024-07-02 13:54:34 -04:00
maang-h	e4e28a6ff5	community[patch]: Fix MiniMaxChat validate_environment error (#23770 ) - Description: Fix some issues in MiniMaxChat - Fix `minimax_api_host` not in `values` error - Remove `minimax_group_id` from reading environment variables, the `minimax_group_id` no longer use in MiniMaxChat - Invoke callback prior to yielding token, the issus #16913	2024-07-02 13:23:32 -04:00
SN	acc457f645	core[patch]: fix nested sections for mustache templating (#23747 ) The prompt template variable detection only worked for singly-nested sections because we just kept track of whether we were in a section and then set that to false as soon as we encountered an end block. i.e. the following: ``` {{#outerSection}} {{variableThatShouldntShowUp}} {{#nestedSection}} {{nestedVal}} {{/nestedSection}} {{anotherVariableThatShouldntShowUp}} {{/outerSection}} ``` Would yield `['outerSection', 'anotherVariableThatShouldntShowUp']` as input_variables (whereas it should just yield `['outerSection']`). This fixes that by keeping track of the current depth and using a stack.	2024-07-02 10:20:45 -07:00
Karim Lalani	acc8fb3ead	docs[patch]: Update OllamaFunctions docs to match chat model integration template (#23179 ) Added Tool Calling Agent Example with langgraph to OllamaFunctions documentation	2024-07-02 10:05:44 -07:00
Bagatur	79c07a8ade	docs: standardize bedrock page (#23738 ) Part of #22296	2024-07-02 12:03:36 -04:00
Teja Hara	a77a263e24	Added langchain-community installation (#23741 ) PR title: Docs enhancement - Description: Adding installation instructions for integrations requiring langchain-community package since 0.2 - Issue: https://github.com/langchain-ai/langchain/issues/22005 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-02 11:03:07 -04:00
Eugene Yurtsev	46ff0f7a3c	community[patch]: Update @root_validators to use explicit pre=True or pre=False (#23737 )	2024-07-02 10:47:21 -04:00
Igor Drozdov	b664dbcc36	feat(community): add support for tool_calls response (#23765 ) When `model_kwargs={"tools": tools}` are passed to `ChatLiteLLM`, they are executed, but the response is not recognized correctly Let's add `tool_calls` to the `additional_kwargs` Thank you for contributing to LangChain! ## ChatAnthropic I used the following example to verify the output of llm with tools: ```python from langchain_core.pydantic_v1 import BaseModel, Field from langchain_anthropic import ChatAnthropic class GetWeather(BaseModel): '''Get the current weather in a given location''' location: str = Field(..., description="The city and state, e.g. San Francisco, CA") class GetPopulation(BaseModel): '''Get the current population in a given location''' location: str = Field(..., description="The city and state, e.g. San Francisco, CA") llm = ChatAnthropic(model="claude-3-sonnet-20240229") llm_with_tools = llm.bind_tools([GetWeather, GetPopulation]) ai_msg = llm_with_tools.invoke("Which city is hotter today and which is bigger: LA or NY?") print(ai_msg.tool_calls) ``` I get the following response: ```json [{'name': 'GetWeather', 'args': {'location': 'Los Angeles, CA'}, 'id': 'toolu_01UfDA89knrhw3vFV9X47neT'}, {'name': 'GetWeather', 'args': {'location': 'New York, NY'}, 'id': 'toolu_01NrYVRYae7m7z7tBgyPb3Gd'}, {'name': 'GetPopulation', 'args': {'location': 'Los Angeles, CA'}, 'id': 'toolu_01EPFEpDgzL6vV2dTpD9SVP5'}, {'name': 'GetPopulation', 'args': {'location': 'New York, NY'}, 'id': 'toolu_01B5J6tPJXgwwfhQX9BHP2dt'}] ``` ## LiteLLM Based on https://litellm.vercel.app/docs/completion/function_call ```python from langchain_core.pydantic_v1 import BaseModel, Field from langchain_core.utils.function_calling import convert_to_openai_tool import litellm class GetWeather(BaseModel): '''Get the current weather in a given location''' location: str = Field(..., description="The city and state, e.g. San Francisco, CA") class GetPopulation(BaseModel): '''Get the current population in a given location''' location: str = Field(..., description="The city and state, e.g. San Francisco, CA") prompt = "Which city is hotter today and which is bigger: LA or NY?" tools = [convert_to_openai_tool(GetWeather), convert_to_openai_tool(GetPopulation)] response = litellm.completion(model="claude-3-sonnet-20240229", messages=[{'role': 'user', 'content': prompt}], tools=tools) print(response.choices[0].message.tool_calls) ``` ```python [ChatCompletionMessageToolCall(function=Function(arguments='{"location": "Los Angeles, CA"}', name='GetWeather'), id='toolu_01HeDWV5vP7BDFfytH5FJsja', type='function'), ChatCompletionMessageToolCall(function=Function(arguments='{"location": "New York, NY"}', name='GetWeather'), id='toolu_01EiLesUSEr3YK1DaE2jxsQv', type='function'), ChatCompletionMessageToolCall(function=Function(arguments='{"location": "Los Angeles, CA"}', name='GetPopulation'), id='toolu_01Xz26zvkBDRxEUEWm9pX6xa', type='function'), ChatCompletionMessageToolCall(function=Function(arguments='{"location": "New York, NY"}', name='GetPopulation'), id='toolu_01SDqKnsLjvUXuBsgAZdEEpp', type='function')] ``` ## ChatLiteLLM When I try the following ```python from langchain_core.pydantic_v1 import BaseModel, Field from langchain_core.utils.function_calling import convert_to_openai_tool from langchain_community.chat_models import ChatLiteLLM class GetWeather(BaseModel): '''Get the current weather in a given location''' location: str = Field(..., description="The city and state, e.g. San Francisco, CA") class GetPopulation(BaseModel): '''Get the current population in a given location''' location: str = Field(..., description="The city and state, e.g. San Francisco, CA") prompt = "Which city is hotter today and which is bigger: LA or NY?" tools = [convert_to_openai_tool(GetWeather), convert_to_openai_tool(GetPopulation)] llm = ChatLiteLLM(model="claude-3-sonnet-20240229", model_kwargs={"tools": tools}) ai_msg = llm.invoke(prompt) print(ai_msg) print(ai_msg.tool_calls) ``` ```python content="Okay, let's find out the current weather and populations for Los Angeles and New York City:" response_metadata={'token_usage': Usage(prompt_tokens=329, completion_tokens=193, total_tokens=522), 'model': 'claude-3-sonnet-20240229', 'finish_reason': 'tool_calls'} id='run-748b7a84-84f4-497e-bba1-320bd4823937-0' [] ``` --- When I apply the changes of this PR, the output is ```json [{'name': 'GetWeather', 'args': {'location': 'Los Angeles, CA'}, 'id': 'toolu_017D2tGjiaiakB1HadsEFZ4e'}, {'name': 'GetWeather', 'args': {'location': 'New York, NY'}, 'id': 'toolu_01WrDpJfVqLkPejWzonPCbLW'}, {'name': 'GetPopulation', 'args': {'location': 'Los Angeles, CA'}, 'id': 'toolu_016UKyYrVAV9Pz99iZGgGU7V'}, {'name': 'GetPopulation', 'args': {'location': 'New York, NY'}, 'id': 'toolu_01Sgv1imExFX1oiR1Cw88zKy'}] ``` If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Co-authored-by: Igor Drozdov <idrozdov@gitlab.com>	2024-07-02 10:42:08 -04:00
Eugene Yurtsev	338cef35b4	community[patch]: update @root_validator in utilities namespace (#23768 ) Update all utilities to use `pre=True` or `pre=False` https://github.com/langchain-ai/langchain/issues/22819	2024-07-02 14:33:01 +00:00
wenngong	ee5eedfa04	partners: support reading HuggingFace params from env (#23309 ) Description: 1. partners/HuggingFace module support reading params from env. Not adjust langchain_community/.../huggingfaceXX modules since they are deprecated. 2. pydantic 2 @root_validator migration. Issue: #22448 #22819 --------- Co-authored-by: gongwn1 <gongwn1@lenovo.com>	2024-07-02 10:12:45 -04:00
antonpibm	ffde8a6a09	Milvus vectorstore: fix pass ids as argument after upsert (#23761 ) Description: Milvus vectorstore supports both `add_documents` via the base class and `upsert` method which deletes and re-adds documents based on their ids Issue: Due to mismatch in the interfaces the ids used by `upsert` are neglected in `add_documents`, as `ids` are passed as argument in `upsert` but via `kwargs` is `add_documents` This caused exceptions and inconsistency in the DB, tested with `auto_id=False` Fix: pass `ids` via `kwargs` to `add_documents`	2024-07-02 13:45:30 +00:00
Eugene Yurtsev	d084172b63	community[patch]: root validator set explicit pre=False or pre=True (#23764 ) See issue: https://github.com/langchain-ai/langchain/issues/22819	2024-07-02 09:42:05 -04:00
Khelan Modi	4457e64e13	Update azure_cosmos_db for mongodb documentation (#23740 ) added pre-filtering documentation Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: - Description: added filter vector search - Issue: N/A - Dependencies: N/A - Twitter handle:: n/a - [x] Add tests and docs: If you're adding a new integration, please include - No need for tests, just a simple doc update 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-02 12:53:05 +00:00
panwg3	bc98f90ba3	update wrong words (#23749 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-02 08:50:20 -04:00
mattthomps1	cc55823486	docs: updated PPLX model (#23723 ) Description: updated pplx docs to reference a currently [supported model](https://docs.perplexity.ai/docs/model-cards). pplx-70b-online ->llama-3-sonar-small-32k-online --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-07-02 08:48:49 -04:00
Bagatur	aa165539f6	docs: standardize cohere page (#23739 ) Part of #22296	2024-07-01 19:34:13 -04:00
Jacob Lee	7791d92711	community[patch]: Fix requests alias for load_tools (#23734 ) CC @baskaryan	2024-07-01 15:02:14 -07:00
Eugene Yurtsev	f24e38876a	community[patch]: Update root_validators to use explicit pre=True or pre=False (#23736 )	2024-07-01 17:13:23 -04:00
Yannick Stephan	5b1de2ae93	mistralai: Fixed streaming in MistralAI with ainvoke and callbacks (#22000 ) # Fix streaming in mistral with ainvoke - [x] PR title - [x] PR message - [x] Add tests and docs: 1. [x] Added a test for the fixed integration. 2. [x] An example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Ran `make format`, `make lint` and `make test` from the root of the package(s) I've modified. Hello * I Identified an issue in the mistral package where the callback streaming (see on_llm_new_token) was not functioning correctly when the streaming parameter was set to True and call with `ainvoke`. * The root cause of the problem was the streaming not taking into account. ( I think it's an oversight ) * To resolve the issue, I added the `streaming` attribut. * Now, the callback with streaming works as expected when the streaming parameter is set to True. ## How to reproduce ``` from langchain_mistralai.chat_models import ChatMistralAI chain = ChatMistralAI(streaming=True) # Add a callback chain.ainvoke(..) # Oberve on_llm_new_token # Now, the callback is given as streaming tokens, before it was in grouped format. ``` Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-01 20:53:09 +00:00
Jacob Lee	f4b2e553e7	docs[patch]: Update Unstructured loader notebooks and install instructions (#23726 ) CC @baskaryan @MthwRobinson	2024-07-01 13:36:48 -07:00
Eugene Yurtsev	5d2262af34	community[patch]: Update root_validators to use pre=True or pre=False (#23731 ) Update root_validators in preparation for pydantic 2 migration.	2024-07-01 20:10:15 +00:00
Erick Friis	6019147b66	infra: filter template check (#23727 )	2024-07-01 13:00:33 -07:00
Eugene Yurtsev	ebcee4f610	core[patch]: Add versionadded to get_by_ids (#23728 )	2024-07-01 15:16:00 -04:00
Eugene Yurtsev	e800f6bb57	core[minor]: Create BaseMedia object (#23639 ) This PR implements a BaseContent object from which Document and Blob objects will inherit proposed here: https://github.com/langchain-ai/langchain/pull/23544 Alternative: Create a base object that only has an identifier and no metadata. For now decided against it, since that refactor can be done at a later time. It also feels a bit odd since our IDs are optional at the moment. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-01 15:07:30 -04:00
Chip Davis	04bc5f1a95	partners[azure]: fix having openai_api_base set for other packages (#22068 ) This fix is for #21726. When having other packages installed that require the `openai_api_base` environment variable, users are not able to instantiate the AzureChatModels or AzureEmbeddings. This PR adds a new value `ignore_openai_api_base` which is a bool. When set to True, it sets `openai_api_base` to `None` Two new tests were added for the `test_azure` and a new file `test_azure_embeddings` A different approach may be better for this. If you can think of better logic, let me know and I can adjust it. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-01 18:35:20 +00:00
Nuno Campos	b36e95caa9	core[patch]: use async messages where possible (#23718 ) Fix #23716 Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-01 18:33:05 +00:00
Spyros Avlonitis	8cfb2fa1b7	core[minor]: Add maxsize for InMemoryCache (#23405 ) This PR introduces a maxsize parameter for the InMemoryCache class, allowing users to specify the maximum number of items to store in the cache. If the cache exceeds the specified maximum size, the oldest items are removed. Additionally, comprehensive unit tests have been added to ensure all functionalities are thoroughly tested. The tests are written using pytest and cover both synchronous and asynchronous methods. Twitter: @spyrosavl --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-07-01 14:21:21 -04:00
maang-h	96af8f31ae	community[patch]: Invoke callback prior to yielding token (#23638 ) - Description: Invoke callback prior to yielding token in stream and astream methods for ChatZhipuAI. - Issue: the issue #16913	2024-07-01 18:12:24 +00:00
Eugene Yurtsev	b5aef4cf97	core[patch]: Fix llm string representation for serializable models (#23416 ) Fix LLM string representation for serializable objects. Fix for issue: https://github.com/langchain-ai/langchain/issues/23257 The llm string of serializable chat models is the serialized representation of the object. LangChain serialization dumps some basic information about non serializable objects including their repr() which includes an object id. This means that if a chat model has any non serializable fields (e.g., a cache), then any new instantiation of the those fields will change the llm representation of the chat model and cause chat misses. i.e., re-instantiating a postgres cache would result in cache misses!	2024-07-01 14:06:33 -04:00
nobbbbby	3904f2cd40	core: fix NameError (#23658 ) Description: In the chat_models module of the language model, the import statement for BaseModel has been moved from the conditionally imported section to the main import area, fixing `NameError `. Issue: fix `NameError `	2024-07-01 17:51:23 +00:00
Jacob Lee	d2c7379f1c	👥 Update LangChain people data (#23697 ) 👥 Update LangChain people data --------- Co-authored-by: github-actions <github-actions@github.com>	2024-07-01 17:42:55 +00:00
Jordy Jackson Antunes da Rocha	a50eabbd48	experimental: LLMGraphTransformer add missing conditional adding restrictions to prompts for LLM that do not support function calling (#22793 ) - Description: Modified the prompt created by the function `create_unstructured_prompt` (which is called for LLMs that do not support function calling) by adding conditional checks that verify if restrictions on entity types and rel_types should be added to the prompt. If the user provides a sufficiently large text, the current prompt may fail to produce results in some LLMs. I have first seen this issue when I implemented a custom LLM class that did not support Function Calling and used Gemini 1.5 Pro, but I was able to replicate this issue using OpenAI models. By loading a sufficiently large text ```python from langchain_community.llms import Ollama from langchain_openai import ChatOpenAI, OpenAI from langchain_core.prompts import PromptTemplate import re from langchain_experimental.graph_transformers import LLMGraphTransformer from langchain_core.documents import Document with open("texto-longo.txt", "r") as file: full_text = file.read() partial_text = full_text[:4000] documents = [Document(page_content=partial_text)] # cropped to fit GPT 3.5 context window ``` And using the chat class (that has function calling) ```python chat_openai = ChatOpenAI(model="gpt-3.5-turbo", model_kwargs={"seed": 42}) chat_gpt35_transformer = LLMGraphTransformer(llm=chat_openai) graph_from_chat_gpt35 = chat_gpt35_transformer.convert_to_graph_documents(documents) ``` It works: ``` >>> print(graph_from_chat_gpt35[0].nodes) [Node(id="Jesu, Joy of Man's Desiring", type='Music'), Node(id='Godel', type='Person'), Node(id='Johann Sebastian Bach', type='Person'), Node(id='clever way of encoding the complicated expressions as numbers', type='Concept')] ``` But if you try to use the non-chat LLM class (that does not support function calling) ```python openai = OpenAI( model="gpt-3.5-turbo-instruct", max_tokens=1000, ) gpt35_transformer = LLMGraphTransformer(llm=openai) graph_from_gpt35 = gpt35_transformer.convert_to_graph_documents(documents) ``` It uses the prompt that has issues and sometimes does not produce any result ``` >>> print(graph_from_gpt35[0].nodes) [] ``` After implementing the changes, I was able to use both classes more consistently: ```shell >>> chat_gpt35_transformer = LLMGraphTransformer(llm=chat_openai) >>> graph_from_chat_gpt35 = chat_gpt35_transformer.convert_to_graph_documents(documents) >>> print(graph_from_chat_gpt35[0].nodes) [Node(id="Jesu, Joy Of Man'S Desiring", type='Music'), Node(id='Johann Sebastian Bach', type='Person'), Node(id='Godel', type='Person')] >>> gpt35_transformer = LLMGraphTransformer(llm=openai) >>> graph_from_gpt35 = gpt35_transformer.convert_to_graph_documents(documents) >>> print(graph_from_gpt35[0].nodes) [Node(id='I', type='Pronoun'), Node(id="JESU, JOY OF MAN'S DESIRING", type='Song'), Node(id='larger memory', type='Memory'), Node(id='this nice tree structure', type='Structure'), Node(id='how you can do it all with the numbers', type='Process'), Node(id='JOHANN SEBASTIAN BACH', type='Composer'), Node(id='type of structure', type='Characteristic'), Node(id='that', type='Pronoun'), Node(id='we', type='Pronoun'), Node(id='worry', type='Verb')] ``` The results are a little inconsistent because the GPT 3.5 model may produce incomplete json due to the token limit, but that could be solved (or mitigated) by checking for a complete json when parsing it.	2024-07-01 17:33:51 +00:00
Eugene Yurtsev	4f1821db3e	core[minor]: Add get_by_ids to vectorstore interface (#23594 ) This PR adds a part of the indexing API proposed in this RFC https://github.com/langchain-ai/langchain/pull/23544/files. It allows rolling out `get_by_ids` which should be uncontroversial to existing vectorstores without introducing new abstractions. The semantics for this method depend on the ability of identifying returned documents using the new optional ID field on documents: https://github.com/langchain-ai/langchain/pull/23411 Alternatives are: 1. Relax the sequence requirement ```python def get_by_ids(self, ids: Iterable[str], /) -> Iterable[Document]: ``` Rejected: - implementations are more likley to start batching with bad defaults - users would need to call list() or we'd need to introduce another convenience method 2. Support more kwargs ```python def get_by_ids(self, ids: Sequence[str], /, **kwargs) -> List[Document]: ... ``` Rejected: - No need for `batch` parameter since IDs is a sequence - Output cannot be customized since `Document` is fixed. (e.g., parameters could be useful to grab extra metadata like the vector that was indexed with the Document or to project a part of the document)	2024-07-01 13:04:33 -04:00
Valentin	bf402f902e	community: Fix LanceDB similarity search bug (#23591 ) Description: LanceDB didn't allow querying the database using similarity score thresholds because the metrics value was missing. This PR simply fixes that bug. Issue: not applicable Dependencies: none Twitter handle: not available --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-07-01 16:33:45 +00:00
Bagatur	389a568f9a	standard-tests[patch]: add anthropic format integration test (#23717 )	2024-07-01 11:06:04 -04:00
Rafael Pereira	4b9517db85	Jira: Allow Jira access using only the token (#23708 ) - Description: At the moment the Jira wrapper only accepts the the usage of the Username and Password/Token at the same time. However Jira allows the connection using only is useful for enterprise context. Co-authored-by: rpereira <rafael.pereira@criticalsoftware.com>	2024-07-01 13:13:51 +00:00
Francesco Kruk	7538f3df58	Update jina embedding notebook to show multimodal capability more clearly (#23702 ) After merging the [PR #22594 to include Jina AI multimodal capabilities in the Langchain documentation](https://github.com/langchain-ai/langchain/pull/22594), we updated the notebook to showcase the difference between text and multimodal capabilities more clearly.	2024-07-01 09:13:19 -04:00
Tim Van Wassenhove	24916c6703	community: Register pandas df in duckdb when creating vector_store (#23690 ) - Description: Register pandas df in duckdb when creating vector_store - Issue: Resolves #23308 - Dependencies: None - Twitter handle: @timvw Co-authored-by: Tim Van Wassenhove <tim.van.wassenhove@telenetgroup.be>	2024-07-01 09:12:06 -04:00
Sourav Biswal	b60df8bb4f	Update chatbot.ipynb (#23688 ) DOC: missing parenthesis #23687 Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-07-01 13:00:34 +00:00
Jacob Lee	9604cb833b	ci[patch]: Update people PR CI permissions (#23696 ) CC @agola11	2024-06-30 22:25:08 -07:00
Bagatur	29aa9d6750	groq[patch]: Release 0.1.6 (#23655 )	2024-06-29 07:35:23 -04:00
Bagatur	f2d0c13a15	fireworks[patch]: Release 0.1.4 (#23654 )	2024-06-29 07:35:16 -04:00
Bagatur	9a5e35d1ba	mistralai[patch]: Release 0.1.9 (#23653 )	2024-06-29 07:35:09 -04:00
Bagatur	74321e546d	infra: update release permissions (#23662 )	2024-06-29 07:31:36 -04:00
Mateusz Szewczyk	a78ccb993c	ibm: Add support for Chat Models (#22979 )	2024-06-29 01:59:25 -07:00
Jacob Lee	16c59118eb	docs[patch]: Adds short tracing how-tos and conceptual guide (#23657 ) CC @agola11	2024-06-28 18:28:49 -07:00
Jacob Lee	c0bb26e85b	docs[patch]: Typo fix (#23652 )	2024-06-28 17:27:44 -07:00
Jacob Lee	72175c57bd	docs[patch]: Fix docs bugs in response to feedback (#23649 ) - Update Meta Llama 3 cookbook link - Add prereq section and information on `messages_modifier` to LangGraph migration guide - Update `PydanticToolsParser` explanation and entrypoint in tool calling guide - Add more obvious warning to `OllamaFunctions` - Fix Wikidata tool install flow - Update Bedrock LLM initialization @baskaryan can you add a bit of information on how to authenticate into the `ChatBedrock` and `BedrockLLM` models? I wasn't able to figure it out :(	2024-06-28 17:24:55 -07:00
Bagatur	af2c05e5f3	openai[patch]: Release 0.1.13 (#23651 )	2024-06-28 17:10:30 -07:00
Bagatur	b63c7f10bc	anthropic[patch]: Release 0.1.17 (#23650 )	2024-06-28 17:07:08 -07:00
Bagatur	fc8fd49328	openai, anthropic, ...: with_structured_output to pass in explicit tool choice (#23645 ) ...community, mistralai, groq, fireworks part of #23644	2024-06-28 16:39:53 -07:00
Bagatur	c5f35a72da	docs: vllm pkg nit (#23648 )	2024-06-28 16:09:36 -07:00
Bagatur	81064017a9	docs: azure openai docstring (#23643 ) part of #22296	2024-06-28 15:15:58 -07:00
Bagatur	381aedcc61	docs: standardize azure openai page (#23642 ) part of #22296	2024-06-28 15:15:41 -07:00
Vadym Barda	e8d77002ea	core: add RemoveMessage (#23636 ) This change adds a new message type `RemoveMessage`. This will enable `langgraph` users to manually modify graph state (or have the graph nodes modify the state) to remove messages by `id` Examples: * allow users to delete messages from state by calling ```python graph.update_state(config, values=[RemoveMessage(id=state.values[-1].id)]) ``` * allow nodes to delete messages ```python graph.add_node("delete_messages", lambda state: [RemoveMessage(id=state[-1].id)]) ```	2024-06-28 14:40:02 -07:00
ccurme	8fce8c6771	community: fix extended tests (#23640 )	2024-06-28 16:35:38 -04:00
ccurme	5d93916665	openai[patch]: release 0.1.12 (#23641 )	2024-06-28 19:51:16 +00:00
Jacob Lee	a032583b17	docs[patch]: Update diagrams (#23613 )	2024-06-28 12:36:00 -07:00
ccurme	390ee8d971	standard-tests: add test for structured output (#23631 ) - add test for structured output - fix bug with structured output for Azure - better testing on Groq (break out Mixtral + Llama3 and add xfails where needed)	2024-06-28 15:01:40 -04:00
Eugene Yurtsev	6c1ba9731d	docs: Resurface some methods in API reference and clarify note at top of Reference (#23633 ) This PR modifies the API Reference in the following way: 1. Relist standard methods: invoke, ainvoke, batch, abatch, batch_as_completed, abatch_as_completed, stream, astream, astream_events. These are the main entry points for a lot of runnables, so we'll keep them for each runnable. 2. Relist methods from Runnable Serializable: to_json, configurable_fields, configurable_alternatives. 3. Expand the note in the API reference documentation to explain that additional methods are available.	2024-06-28 12:31:37 -04:00
Brace Sproul	800b0ff3b9	docs[minor]: Hide langserve pages (#23618 )	2024-06-28 08:25:08 -07:00
j pradhan	5f21eab491	community:perplexity[patch]: standardize init args (#21794 ) updated request_timeout default alias value per related docstring. Related to [20085](https://github.com/langchain-ai/langchain/issues/20085) Thank you for contributing to LangChain! --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-28 13:26:12 +00:00
mackong	11483b0fb8	community[patch]: set tool name for tongyi&qianfan llm (#22889 ) - Description: The name of ToolMessage is default to None, which makes tool message send to LLM likes ```json {"role": "tool", "tool_call_id": "", "content": "{\"time\": \"12:12\"}", "name": null} ``` But the name seems essential for some LLMs like TongYi Qwen. so we need to set the name use agent_action's tool value. - Issue: N/A - Dependencies: N/A	2024-06-28 09:17:05 -04:00
Leonid Ganeline	e4caa41aa9	community: docstrings `toolkits` (#23616 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-06-28 08:40:52 -04:00
clement.l	19eb82e68b	docs: Fix link in LLMChain tutorial (#23620 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-28 03:59:24 +00:00
Bagatur	bd68a38723	docs: update chatmodel.with_structured_output feat in table (#23610 )	2024-06-27 20:38:49 -07:00
ccurme	adf2dc13de	community: fix lint (#23611 )	2024-06-27 22:12:16 +00:00
Bagatur	ef0593db58	docs: tool call run model (#23609 )	2024-06-27 22:02:12 +00:00
Leonid Ganeline	75a44fe951	core: `chat_*` docstrings (#23412 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-06-27 17:29:38 -04:00
Bagatur	3b1fcb2a65	chroma[patch]: Release 0.1.2 (#23604 )	2024-06-27 13:58:24 -07:00
Eugene Yurtsev	68f348357e	community[patch]: Test InMemoryVectorStore with RWAPI test suite (#23603 ) Add standard test suite to InMemoryVectorStore implementation.	2024-06-27 16:43:43 -04:00
Eugene Yurtsev	da7beb1c38	core[patch]: Add unit test when catching generator exit (#23402 ) This pr adds a unit test for: https://github.com/langchain-ai/langchain/pull/22662 And narrows the scope where the exception is caught.	2024-06-27 20:36:07 +00:00
NG Sai Prasanth	5e6d23f27d	community: Standardise tool import for arxiv & semantic scholar (#23578 ) - Description: Fixing the way users have to import Arxiv and Semantic Scholar - Issue: Changed to use `from langchain_community.tools.arxiv import ArxivQueryRun` instead of `from langchain_community.tools.arxiv.tool import ArxivQueryRun` - Dependencies: None - Twitter handle: Nope	2024-06-27 16:35:50 -04:00
ccurme	d04f657424	langchain[patch]: deprecate ConversationChain (#23504 ) Would like some feedback on how to best incorporate legacy memory objects into `RunnableWithMessageHistory`.	2024-06-27 16:32:44 -04:00
Ayo Ayibiowu	c6f700b7cb	fix(community): allow support for disabling max_tokens args (#21534 ) This PR fixes an issue with not able to use unlimited/infinity tokens from the respective provider for the LiteLLM provider. This is an issue when working in an agent environment that the token usage can drastically increase beyond the initial value set causing unexpected behavior.	2024-06-27 16:28:59 -04:00
WU LIFU	2a0d6788f7	docs[patch]: extraction_examples fix the examples given to the llm (#23393 ) Descriptions: currently in the [doc](https://python.langchain.com/v0.2/docs/how_to/extraction_examples/) it sets "Data" as the LLM's structured output schema, however its examples given to the LLM output's "Person", which causes the LLM to be confused and might occasionally return "Person" as the function to call issue: #23383 Co-authored-by: Lifu Wu <lifu@nextbillion.ai>	2024-06-27 16:22:26 -04:00
Leonid Ganeline	c0fdbaac85	langchain: docstrings in `agents` root (#23561 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-06-27 15:52:18 -04:00
Leonid Ganeline	b64c4b4750	langchain: docstrings `agents` nested (#23598 ) Added missed docstrings. Formatted docstrings to the consistent form. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-27 19:49:41 +00:00
mackong	70834cd741	community[patch]: support convert FunctionMessage for Tongyi (#23569 ) Description: For function call agent with Tongyi, cause the AgentAction will be converted to FunctionMessage by `47f69fe0d8/libs/core/langchain_core/agents.py (L188)` But now Tongyi's convert_message_to_dict doesn't support FunctionMessage `47f69fe0d8/libs/community/langchain_community/chat_models/tongyi.py (L184-L207)` Then next round conversation will be failed by the TypeError exception. This patch adds the support to convert FunctionMessage for Tongyi. Issue: N/A Dependencies: N/A	2024-06-27 15:49:26 -04:00
Bagatur	d45ece0e58	chroma[patch]: loosen py req (#23599 ) currently causes issues if you try adding to a project that supports py<4	2024-06-27 12:40:59 -07:00
Mohammad Mohtashim	4796b7eb15	[Community [HuggingFace]]: Small Fix for ChatHuggingFace. (#22925 ) - Description: A small fix where I moved the `available_endpoints` in order to avoid the token error in the below issue. Also I have added conftest file and updated the `scripy`,`numpy` versions to support newer python versions in poetry files. - Issue: #22804 --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-27 19:37:20 +00:00
Jacob Lee	644723adda	docs[patch]: Add search keyword, update contribution guide (#23602 ) CC @vbarda @hinthornw	2024-06-27 12:36:02 -07:00
ccurme	bffc3c24a0	openai[patch]: release 0.1.11 (#23596 )	2024-06-27 18:48:40 +00:00
ccurme	a1520357c8	openai[patch]: revert addition of "name" to supported properties for tool messages (#23600 )	2024-06-27 18:40:04 +00:00
joshc-ai21	16a293cc3a	Small bug fixes (#23353 ) Small bug fixes according to your comments --------- Signed-off-by: Joffref <mariusjoffre@gmail.com> Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Baskar Gopinath <73015364+baskargopinath@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Mathis Joffre <51022808+Joffref@users.noreply.github.com> Co-authored-by: Baur <baur.krykpayev@gmail.com> Co-authored-by: Nuradil <nuradil.maksut@icloud.com> Co-authored-by: Nuradil <133880216+yaksh0nti@users.noreply.github.com> Co-authored-by: Jacob Lee <jacoblee93@gmail.com> Co-authored-by: Rave Harpaz <rave.harpaz@oracle.com> Co-authored-by: RHARPAZ <RHARPAZ@RHARPAZ-5750.us.oracle.com> Co-authored-by: Arthur Cheng <arthur.cheng@oracle.com> Co-authored-by: Tomaz Bratanic <bratanic.tomaz@gmail.com> Co-authored-by: RUO <61719257+comsa33@users.noreply.github.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Luis Rueda <userlerueda@gmail.com> Co-authored-by: Jib <Jibzade@gmail.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev> Co-authored-by: S M Zia Ur Rashid <smziaurrashid@gmail.com> Co-authored-by: Ikko Eltociear Ashimine <eltociear@gmail.com> Co-authored-by: yuncliu <lyc1990@qq.com> Co-authored-by: wenngong <76683249+wenngong@users.noreply.github.com> Co-authored-by: gongwn1 <gongwn1@lenovo.com> Co-authored-by: Mirna Wong <89008547+mirnawong1@users.noreply.github.com> Co-authored-by: Rahul Triptahi <rahul.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: maang-h <55082429+maang-h@users.noreply.github.com> Co-authored-by: asafg <asafg@ai21.com> Co-authored-by: Asaf Joseph Gardin <39553475+Josephasafg@users.noreply.github.com>	2024-06-27 17:58:22 +00:00
panwg3	9308bf32e5	spelling errors in words (#23559 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-27 17:16:22 +00:00
clement.l	182fc06769	docs: Fix typo in LLMChain tutorial (#23593 ) When using `model_with_tools.invoke`, the `content` returns as an empty string. For more details, please refer to my [trace log](https://smith.langchain.com/public/6fd24bc4-86c4-4627-8565-9a8adaf4ad7d/r).	2024-06-27 17:01:05 +00:00
ccurme	5536420bee	openai[patch]: add comment (#23595 ) Forgot to push this to https://github.com/langchain-ai/langchain/pull/23551	2024-06-27 16:47:14 +00:00
andrewmjc	9f0f3c7e29	partners[openai]: Add name field to tool message to match OpenAI spec (#23551 ) Discovered alongside @t968914 - Description: According to OpenAI docs, tool messages (response from calling tools) must have a 'name' field. https://cookbook.openai.com/examples/how_to_call_functions_with_chat_models - Issue: N/A (as of right now) - Dependencies: N/A - Twitter handle: N/A Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-06-27 12:42:36 -04:00
Krista Pratico	85e36b0f50	partners[openai]: only add stream_options to kwargs if requested (#23552 ) - Description: This PR https://github.com/langchain-ai/langchain/pull/22854 added the ability to pass `stream_options` through to the openai service to get token usage information in the response. Currently OpenAI supports this parameter, but Azure OpenAI does not yet. For users who proxy their calls to both services through ChatOpenAI, this breaks when targeting Azure OpenAI (see related discussion opened in openai-python: https://github.com/openai/openai-python/issues/1469#issuecomment-2192658630). > Error code: 400 - {'error': {'code': None, 'message': 'Unrecognized request argument supplied: stream_options', 'param': None, 'type': 'invalid_request_error'}} This PR fixes the issue by only adding `stream_options` to the request if it's actually requested by the user (i.e. set to True). If I'm not mistaken, we have a test case that already covers this scenario: https://github.com/langchain-ai/langchain/blob/master/libs/partners/openai/tests/integration_tests/chat_models/test_base.py#L398-L399 - Issue: Issue opened in openai-python: https://github.com/openai/openai-python/issues/1469 - Dependencies: N/A - Twitter handle: N/A --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-27 12:23:05 -04:00
Eugene Yurtsev	96b72edac8	core[minor]: Add optional ID field to Document schema (#23411 ) This PR adds an optional ID field to the document schema. # 1. Optional or Required - An optional field will will requrie additional checking for the type in user code (annoying). - However, vectorstores currently don't respect this field. So if we make it required and start returning random UUIDs that might be even more confusing to users. Proposal: Start with Optional and convert to Required (with default set to uuid4()) in 1-2 major releases. # 2. Override __str__ or generic solution in prompts Overriding __str__ as a simple way to avoid changing user code that relies on default str(document) in prompts. I considered rolling out a more general solution in prompts (https://github.com/langchain-ai/langchain/pull/8685), but to do that we need to: 1. Make things serializable 2. The more general solution would likely need to be backwards compatible as well 3. It's unclear that one wants to format a List[int] in the same way as List[Document]. The former should be `,` seperated (likely), the latter should be `---` separated (likely). Proposal Start with __str__ override and focus on the vectorstore APIs, we generalize prompts later	2024-06-27 12:15:58 -04:00
ccurme	5bfcb898ad	openai[patch]: bump sdk version (#23592 ) Tests failing with `TypeError: Completions.create() got an unexpected keyword argument 'parallel_tool_calls'`	2024-06-27 11:57:24 -04:00
Jacob Lee	60fc15a56b	docs[patch]: Update docs introduction and README (#23558 ) CC @hwchase17 @baskaryan	2024-06-27 08:51:43 -07:00
panwg3	2445b997ee	Correction of incorrect words (#23557 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-06-27 15:13:15 +00:00
Aditya	6721b991ab	docs: realigned sections for langchain-google-vertexai (#23584 ) - Description: Re-aligned sections in documentation of Vertex AI LLMs - Issue: NA - Dependencies: NA - Twitter handle:NA --------- Co-authored-by: adityarane@google.com <adityarane@google.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-27 10:42:32 -04:00
mackong	daf733b52e	langchain[minor]: fix comment typo (#23564 ) Description: fix typo of comment Issue: N/A Dependencies: N/A	2024-06-27 10:09:18 -04:00
Jacob Lee	47f69fe0d8	docs[patch]: Add ReAct agent conceptual guide, improve search (#23554 ) @baskaryan	2024-06-26 19:02:03 -07:00
Jacob Lee	672fcbb8dc	docs[patch]: Fix bad link format (#23553 )	2024-06-26 16:43:26 -07:00
Jacob Lee	13254715a2	docs[patch]: Update installation guide with diagram (#23548 ) CC @baskaryan	2024-06-26 15:10:22 -07:00
Leonid Ganeline	2c9b84c3a8	core[patch]: docstrings `agents` (#23502 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-06-26 17:50:48 -04:00
Jacob Lee	79d8556c22	docs[patch]: Address feedback from docs users (#23550 ) - Updates chat few shot prompt tutorial to show off a more cohesive example - Fix async Chromium loader guide - Fix Excel loader install instructions - Reformat Html2Text page - Add install instructions to Azure OpenAI embeddings page - Add missing dep install to SQL QA tutorial @baskaryan	2024-06-26 14:47:01 -07:00
Leonid Ganeline	2a5d59b3d7	core[patch]: `callbacks` docstrings (#23375 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-06-26 17:11:06 -04:00
Leonid Ganeline	1141b08eb8	core: docstrings `example_selectors` (#23542 ) Added missed docstrings. Formatted docstrings to the consistent form.	2024-06-26 17:10:40 -04:00
wenngong	3bf1d98dbf	langchain[patch]: update agent and chains modules root_validators (#23256 ) Description: update agent and chains modules Pydantic root_validators. Issue: the issue #22819 --------- Co-authored-by: gongwn1 <gongwn1@lenovo.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-06-26 17:09:50 -04:00
Bagatur	a7ab93479b	anthropic[patch]: Release 0.1.16 (#23549 )	2024-06-26 20:49:13 +00:00
Jib	c0fcf76e93	LangChain-MongoDB: [Experimental] Driver-side index creation helper (#19359 ) ## Description Created a helper method to make vector search indexes via client-side pymongo. Recent Update -- Removed error suppressing/overwriting layer in favor of letting the original exception provide information. ## ToDo's - [x] Make _wait_untils for integration test delete index functionalities. - [x] Add documentation for its use. Highlight it's experimental - [x] Post Integration Test Results in a screenshot - [x] Get review from MongoDB internal team (@shaneharvey, @blink1073 , @NoahStapp , @caseyclements) - [x] Add tests and docs: If you're adding a new integration, please include 1. Added new integration tests. Not eligible for unit testing since the operation is Atlas Cloud specific. 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. ![image](https://github.com/langchain-ai/langchain/assets/2887713/a3fc8ee1-e04c-4976-accc-fea0eeae028a) - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-06-26 15:07:28 -04:00
Jacob Lee	b1dfb8ea1e	docs[patch]: Update contribution guides (#23382 ) CC @vbarda @hwchase17	2024-06-26 11:12:41 -07:00
maang-h	5070004e8a	docs: Update Tongyi ChatModel docstring (#23540 ) - Description: Update Tongyi ChatModel rich docstring - Issue: the issue #22296	2024-06-26 13:07:13 -04:00
Nuradil	2f976c5174	community: fix code example in ZenGuard docs (#23541 ) Thank you for contributing to LangChain! - [X] PR title: "community: fix code example in ZenGuard docs" - [X] PR message: - Description: corrected the docs by indicating in the code example that the tool accepts a list of prompts instead of just one - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Thank you for review --------- Co-authored-by: Baur <baur.krykpayev@gmail.com>	2024-06-26 13:05:59 -04:00
yonarw	6d0ebbca1e	community: SAP HANA Vector Engine fix for latest HANA release (#23516 ) - Description: This PR fixes an issue with SAP HANA Cloud QRC03 version. In that version the number to indicate no length being set for a vector column changed from -1 to 0. The change in this PR support both behaviours (old/new). - Dependencies: No dependencies have been introduced. - Tests: The change is covered by previous unit tests.	2024-06-26 13:15:51 +00:00
Roman Solomatin	1e3e05b0c3	openai[patch]: add support for extra_body (#23404 ) Description: Add support passing extra_body parameter Some OpenAI compatible API's have additional parameters (for example [vLLM](https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html#extra-parameters)) that can be passed thought `extra_body`. Same question in https://github.com/openai/openai-python/issues/767 <!-- If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. -->	2024-06-26 13:11:59 +00:00
Alireza Kashani	c39521b70d	Update grobid.py (#23399 ) fixed potential `IndexError: list index out of range` in case there is no title Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-06-26 09:11:02 -04:00
Qingchuan Hao	ee282a1d2e	community: add missing link (#23526 )	2024-06-26 09:06:28 -04:00
Lincoln Stein	c314222796	Add a conversation memory that combines a (optionally persistent) vectorstore history with a token buffer (#22155 ) langchain: ConversationVectorStoreTokenBufferMemory -Description: This PR adds ConversationVectorStoreTokenBufferMemory. It is similar in concept to ConversationSummaryBufferMemory. It maintains an in-memory buffer of messages up to a preset token limit. After the limit is hit timestamped messages are written into a vectorstore retriever rather than into a summary. The user's prompt is then used to retrieve relevant fragments of the previous conversation. By persisting the vectorstore, one can maintain memory from session to session. -Issue: n/a -Dependencies: none -Twitter handle: Please no!!! - [X] Add tests and docs: I looked to see how the unit tests were written for the other ConversationMemory modules, but couldn't find anything other than a test for successful import. I need to know whether you are using pytest.mock or another fixture to simulate the LLM and vectorstore. In addition, I would like guidance on where to place the documentation. Should it be a notebook file in docs/docs? - [X] Lint and test: I am seeing some linting errors from a couple of modules unrelated to this PR. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Lincoln Stein <lstein@gmail.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-06-25 20:17:10 -07:00
Bagatur	32f8f39974	core[patch]: use args_schema doc for tool description (#23503 )	2024-06-25 15:26:35 -07:00
ccurme	6f7fe82830	text-splitters: release 0.2.2 (#23508 )	2024-06-25 18:26:05 -04:00
ccurme	62b16fcc6b	experimental: release 0.0.62 (#23507 )	2024-06-25 22:01:35 +00:00
ccurme	99ce84ef23	community: release 0.2.6 (#23501 )	2024-06-25 21:29:52 +00:00
ccurme	03c41e725e	langchain: release 0.2.6 (#23426 )	2024-06-25 21:03:41 +00:00
ccurme	86ca44d451	core: release 0.2.10 (#23420 )	2024-06-25 16:26:31 -04:00
Isaac Francisco	85f5d14cef	[docs]: split up tool docs (#22919 )	2024-06-25 13:15:08 -07:00
ccurme	f788d0982d	docs: update trim messages guide (#23418 ) - rerun to remove warnings following https://github.com/langchain-ai/langchain/pull/23363 - `raise` -> `return`	2024-06-25 19:50:53 +00:00
ccurme	c9619349d6	docs: rerun chatbot tutorial to remove warnings (#23417 )	2024-06-25 19:26:54 +00:00
Nuradil	c93d9e66e4	Community: Update and fix ZenGuardTool docs and add ZenguardTool to init files (#23415 ) Thank you for contributing to LangChain! - [x] PR title: "community: update docs and add tool to init.py" - [x] PR message: - Description: Fixed some errors and comments in the docs and added our ZenGuardTool and additional classes to init.py for easy access when importing - Question: when will you update the langchain-community package in pypi to make our tool available? - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Thank you for review! --------- Co-authored-by: Baur <baur.krykpayev@gmail.com>	2024-06-25 19:26:32 +00:00
William FH	8955bc1866	[Core] Logging: Suppress missing parent warning (#23363 )	2024-06-25 14:57:23 -04:00
ccurme	730c551819	core[patch]: export tool output parsers from langchain_core.output_parsers (#23305 ) These currently read off AIMessage.tool_calls, and only fall back to OpenAI parsing if tool calls aren't populated. Importing these from `openai_tools` (e.g., in our [tool calling docs](https://python.langchain.com/v0.2/docs/how_to/tool_calling/#tool-calls)) can lead to confusion. After landing, would need to release core and update docs.	2024-06-25 14:40:42 -04:00
Eugene Yurtsev	7e9e69c758	core[patch]: Add unit test for str and repr for Document (#23414 )	2024-06-25 18:28:21 +00:00
Bagatur	f055f2a1e3	infra: install integration deps as needed (#23413 )	2024-06-25 11:17:43 -07:00
Bagatur	92ac0fc9bd	openai[patch]: Release 0.1.10 (#23410 )	2024-06-25 17:40:02 +00:00
Bagatur	fb3df898b5	docs: Update README.md (#23409 )	2024-06-25 17:35:00 +00:00
Bagatur	9d145b9630	openai[patch]: fix tool calling token counting (#23408 ) Resolves https://github.com/langchain-ai/langchain/issues/23388	2024-06-25 10:34:25 -07:00
Tomaz Bratanic	22fa32e164	LLM Graph transformer dealing with empty strings (#23368 ) Pydantic allows empty strings: ``` from langchain.pydantic_v1 import Field, BaseModel class Property(BaseModel): """A single property consisting of key and value""" key: str = Field(..., description="key") value: str = Field(..., description="value") x = Property(key="", value="") ``` Which can produce errors downstream. We simply ignore those records	2024-06-25 13:01:53 -04:00
Rajendra Kadam	d3520a784f	docs: Added providers page for Pebblo and docs for PebbloRetrievalQA (#20746 ) - Description: Added providers page for Pebblo and docs for PebbloRetrievalQA - Issue: NA - Dependencies: None - Unit tests: NA	2024-06-25 12:46:11 -04:00
clement.l	a75b32a54a	docs: Fix typo in LLMChain tutorial (#23380 ) Description: Fix a typo Issue: n/a Dependencies: None Twitter handle:	2024-06-25 13:03:24 +00:00
Riccardo Schirone	4530d851e4	Merge pull request #22662 * core: runnables: special handling GeneratorExit because no error	2024-06-25 08:42:03 -04:00
Qingchuan Hao	ad50702934	community: add default value to bing_search_url (#23306 ) bing_search_url is an endpoint to requests bing search resource and is normally invariant to users, we can give it the default value to simply the uesages of this utility/tool	2024-06-25 08:08:41 -04:00
ccurme	68e0ae3286	langchain[patch]: update removal target for LLMChain (#23373 ) to 1.0 Also improve replacement example in docstring.	2024-06-24 21:51:29 +00:00
wenngong	b33d2346db	community: FlashrankRerank support loading customer client (#23350 ) Description: FlashrankRerank Document compressor support loading customer client Issue: #23338 Co-authored-by: gongwn1 <gongwn1@lenovo.com>	2024-06-24 17:50:08 -04:00
maang-h	f58c40b4e3	docs: Update QianfanChatEndpoint ChatModel docstring (#23337 ) - Description: Update QianfanChatEndpoint ChatModel rich docstring - Issue: the issue #22296	2024-06-24 17:42:46 -04:00
Rahul Triptahi	9ef93ecd7c	community[minor]: Added classification_location parameter in PebbloSafeLoader. (#22565 ) Description: Add classifier_location feature flag. This flag enables Pebblo to decide the classifier location, local or pebblo-cloud. Unit Tests: N/A Documentation: N/A --------- Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-06-24 17:30:38 -04:00
Mirna Wong	2115fb76de	Replace llm variable with model (#23280 ) The code snippet under ‘pdfs_qa’ contains an small incorrect code example , resulting in users getting errors. This pr replaces ‘llm’ variable with ‘model’ to help user avoid a NameError message. Resolves #22689 If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-24 17:08:02 -04:00
wenngong	af620db9c7	partners: add lint docstrings for azure-dynamic-sessions/together modules (#23303 ) Description: add lint docstrings for azure-dynamic-sessions/together modules Issue: #23188 @baskaryan test: ruff check passed. <img width="782" alt="image" src="https://github.com/langchain-ai/langchain/assets/76683249/bf11783d-65b3-4e56-a563-255eae89a3e4"> --------- Co-authored-by: gongwn1 <gongwn1@lenovo.com>	2024-06-24 16:26:54 -04:00
yuncliu	398b2b9c51	community[minor]: Add Ascend NPU optimized Embeddings (#20260 ) - Description: Add NPU support for embeddings --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-24 20:15:11 +00:00
Ikko Eltociear Ashimine	7b1066341b	docs: update sql_query_checking.ipynb (#23345 ) creat -> create	2024-06-24 16:03:32 -04:00
S M Zia Ur Rashid	d5b2a93c6d	package: security update urllib3 to @1.26.19 (#23366 ) urllib3 version update 1.26.18 to 1.26.19 to address a security vulnerability. Reference: https://security.snyk.io/vuln/SNYK-PYTHON-URLLIB3-7267250	2024-06-24 19:44:39 +00:00
Jacob Lee	57c13b4ef8	docs[patch]: Fix typo in how to guide for message history (#23364 )	2024-06-24 15:43:05 -04:00
Luis Rueda	168e9ed3a5	partners: add custom options to MongoDBChatMessageHistory (#22944 ) Description: Adds options for configuring MongoDBChatMessageHistory (no breaking changes): - session_id_key: name of the field that stores the session id - history_key: name of the field that stores the chat history - create_index: whether to create an index on the session id field - index_kwargs: additional keyword arguments to pass to the index creation Discussion: https://github.com/langchain-ai/langchain/discussions/22918 Twitter handle: @userlerueda --------- Co-authored-by: Jib <Jibzade@gmail.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-06-24 19:42:56 +00:00
Eugene Yurtsev	1e750f12f6	standard-tests[minor]: Add standard read write test suite for vectorstores (#23355 ) Add standard read write test suite for vectorstores	2024-06-24 19:40:56 +00:00
Eugene Yurtsev	3b3ed72d35	standard-tests[minor]: Add standard tests for BaseStore (#23360 ) Add standard tests to base store abstraction. These only work on [str, str] right now. We'll need to check if it's possible to add encoder/decoders to generalize	2024-06-24 19:38:50 +00:00
ccurme	e1190c8f3c	mongodb[patch]: fix CI for python 3.12 (#23369 )	2024-06-24 19:31:20 +00:00
RUO	2b87e330b0	community: fix issue with nested field extraction in MongodbLoader (#22801 ) Description: This PR addresses an issue in the `MongodbLoader` where nested fields were not being correctly extracted. The loader now correctly handles nested fields specified in the `field_names` parameter. Issue: Fixes an issue where attempting to extract nested fields from MongoDB documents resulted in `KeyError`. Dependencies: No new dependencies are required for this change. Twitter handle: (Optional, your Twitter handle if you'd like a mention when the PR is announced) ### Changes 1. Field Name Parsing: - Added logic to parse nested field names and safely extract their values from the MongoDB documents. 2. Projection Construction: - Updated the projection dictionary to include nested fields correctly. 3. Field Extraction: - Updated the `aload` method to handle nested field extraction using a recursive approach to traverse the nested dictionaries. ### Example Usage Updated usage example to demonstrate how to specify nested fields in the `field_names` parameter: ```python loader = MongodbLoader( connection_string=MONGO_URI, db_name=MONGO_DB, collection_name=MONGO_COLLECTION, filter_criteria={"data.job.company.industry_name": "IT", "data.job.detail": { "$exists": True }}, field_names=[ "data.job.detail.id", "data.job.detail.position", "data.job.detail.intro", "data.job.detail.main_tasks", "data.job.detail.requirements", "data.job.detail.preferred_points", "data.job.detail.benefits", ], ) docs = loader.load() print(len(docs)) for doc in docs: print(doc.page_content) ``` ### Testing Tested with a MongoDB collection containing nested documents to ensure that the nested fields are correctly extracted and concatenated into a single page_content string. ### Note This change ensures backward compatibility for non-nested fields and improves functionality for nested field extraction. ### Output Sample ```python print(docs[:3]) ``` ```shell # output sample: [ Document( # Here in this example, page_content is the combined text from the fields below # "position", "intro", "main_tasks", "requirements", "preferred_points", "benefits" page_content='all combined contents from the requested fields in the document', metadata={'database': 'Your Database name', 'collection': 'Your Collection name'} ), ... ] ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-24 19:29:11 +00:00
Tomaz Bratanic	aeeda370aa	Sanitize backticks from neo4j labels and types for import (#23367 )	2024-06-24 19:05:31 +00:00
Jacob Lee	d2db561347	docs[patch]: Adds callout in LLM concept docs, remove deprecated code (#23361 ) CC @baskaryan @hwchase17	2024-06-24 12:03:18 -07:00
Rave Harpaz	f5ff7f178b	Add OCI Generative AI new model support (#22880 ) - [x] PR title: community: Add OCI Generative AI new model support - [x] PR message: - Description: adding support for new models offered by OCI Generative AI services. This is a moderate update of our initial integration PR 16548 and includes a new integration for our chat models under /langchain_community/chat_models/oci_generative_ai.py - Issue: NA - Dependencies: No new Dependencies, just latest version of our OCI sdk - Twitter handle: NA - [x] Add tests and docs: 1. we have updated our unit tests 2. we have updated our documentation including a new ipynb for our new chat integration - [x] Lint and test: `make format`, `make lint`, and `make test` run successfully --------- Co-authored-by: RHARPAZ <RHARPAZ@RHARPAZ-5750.us.oracle.com> Co-authored-by: Arthur Cheng <arthur.cheng@oracle.com>	2024-06-24 14:48:23 -04:00
Jacob Lee	753edf9c80	docs[patch]: Update chatbot tools how-to guide (#23362 )	2024-06-24 11:46:06 -07:00
Baur	aa358f2be4	community: Add ZenGuard tool (#22959 ) Description This is the community integration of ZenGuard AI - the fastest guardrails for GenAI applications. ZenGuard AI protects against: - Prompts Attacks - Veering of the pre-defined topics - PII, sensitive info, and keywords leakage. - Toxicity - Etc. Twitter Handle : @zenguardai - [x] Add tests and docs: If you're adding a new integration, please include 1. Added an integration test 2. Added colab - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. --------- Co-authored-by: Nuradil <nuradil.maksut@icloud.com> Co-authored-by: Nuradil <133880216+yaksh0nti@users.noreply.github.com>	2024-06-24 17:40:56 +00:00
Mathis Joffre	60103fc4a5	community: Fix OVHcloud 401 Unauthorized on embedding. (#23260 ) They are now rejecting with code 401 calls from users with expired or invalid tokens (while before they were being considered anonymous). Thus, the authorization header has to be removed when there is no token. Related to: #23178 --------- Signed-off-by: Joffref <mariusjoffre@gmail.com>	2024-06-24 12:58:32 -04:00
Baskar Gopinath	4964ba74db	Update multimodal_prompts.ipynb (#23301 ) fixes #23294 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-24 15:58:51 +00:00
Eugene Yurtsev	d90379210a	standard-tests[minor]: Add standard tests for cache (#23357 ) Add standard tests for cache abstraction	2024-06-24 15:15:03 +00:00
Leonid Ganeline	987099cfcd	community: `toolkits` docstrings (#23286 ) Added missed docstrings. Formatted docstrings to the consistent form. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-22 14:37:52 +00:00
Rahul Triptahi	0cd3f93361	Enhance metadata of sharepointLoader. (#22248 ) Description: 2 feature flags added to SharePointLoader in this PR: 1. load_auth: if set to True, adds authorised identities to metadata 2. load_extended_metadata, adds source, owner and full_path to metadata Unit tests:N/A Documentation: To be done. --------- Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-06-21 17:03:38 -07:00
Yuki Watanabe	5d4133d82f	community: Overhaul Databricks provider documentation (#23203 ) Description: Update [Databricks Provider](https://python.langchain.com/v0.2/docs/integrations/providers/databricks/) documentations to the latest component notebooks and draw better navigation path to related notebooks. --------- Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>	2024-06-21 16:57:35 -07:00
Bagatur	bcac6c3aff	openai[patch]: temp fix ignore lint (#23290 )	2024-06-21 16:52:52 -07:00
William FH	efb4c12abe	[Core] Add support for inferring Annotated types (#23284 ) in bind_tools() / convert_to_openai_function	2024-06-21 15:16:30 -07:00
Vadym Barda	9ac302cb97	core[minor]: update draw_mermaid node label processing (#23285 ) This fixes processing issue for nodes with numbers in their labels (e.g. `"node_1"`, which would previously be relabeled as `"node__"`, and now are correctly processed as `"node_1"`)	2024-06-21 21:35:32 +00:00
Rajendra Kadam	7ee2822ec2	community: Fix TypeError in PebbloRetrievalQA (#23170 ) Description: Fix "`TypeError: 'NoneType' object is not iterable`" when the auth_context is absent in PebbloRetrievalQA. The auth_context is optional; hence, PebbloRetrievalQA should work without it, but it throws an error at the moment. This PR fixes that issue. Issue: NA Dependencies: None Unit tests: NA --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-21 17:04:00 -04:00
Iurii Umnov	3b7b933aa2	community[minor]: OpenAPI agent. Add support for PUT, DELETE and PATCH (#22962 ) Description: Add PUT, DELETE and PATCH tools to tool list for OpenAPI agent if dangerous requests are allowed. Issue: https://github.com/langchain-ai/langchain/issues/20469	2024-06-21 20:44:23 +00:00
Guangdong Liu	3c42bf8d97	community(patch):Fix PineconeHynridSearchRetriever not having search_kwargs (#21577 ) - close #21521	2024-06-21 16:27:52 -04:00
Rahul Triptahi	4bb3d5c488	[community][quick-fix]: changed from blob.path to blob.path.name in 0365BaseLoader. (#22287 ) Description: file_metadata_ was not getting propagated to returned documents. Changed the lookup key to the name of the blob's path. Changed blob.path key to blob.path.name for metadata_dict key lookup. Documentation: N/A Unit tests: N/A Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-21 15:51:03 -04:00
Bagatur	f824f6d925	docs: fix merge message runs docstring (#23279 )	2024-06-21 19:50:50 +00:00
wenngong	f9aea3db07	partners: add lint docstrings for chroma module (#23249 ) Description: add lint docstrings for chroma module Issue: the issue #23188 @baskaryan test: ruff check passed. ![image](https://github.com/langchain-ai/langchain/assets/76683249/5e168a0c-32d0-464f-8ddb-110233918019) --------- Co-authored-by: gongwn1 <gongwn1@lenovo.com>	2024-06-21 19:49:24 +00:00
Bagatur	9eda8f2fe8	docs: fix trim_messages code blocks (#23271 )	2024-06-21 17:15:31 +00:00
Jacob Lee	86326269a1	docs[patch]: Adds prereqs to trim messages (#23270 ) CC @baskaryan	2024-06-21 10:09:41 -07:00
Bagatur	4c97a9ee53	docs: fix message transformer docstrings (#23264 )	2024-06-21 16:10:03 +00:00
Vwake04	0deb98ac0c	pinecone: Fix multiprocessing issue in PineconeVectorStore (#22571 ) Description: Currently, the `langchain_pinecone` library forces the `async_req` (asynchronous required) argument to Pinecone to `True`. This design choice causes problems when deploying to environments that do not support multiprocessing, such as AWS Lambda. In such environments, this restriction can prevent users from successfully using `langchain_pinecone`. This PR introduces a change that allows users to specify whether they want to use asynchronous requests by passing the `async_req` parameter through `kwargs`. By doing so, users can set `async_req=False` to utilize synchronous processing, making the library compatible with AWS Lambda and other environments that do not support multithreading. Issue: This PR does not address a specific issue number but aims to resolve compatibility issues with AWS Lambda by allowing synchronous processing. Dependencies:** None, that I'm aware of. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-21 15:46:01 +00:00
ccurme	75c7c3a1a7	openai: release 0.1.9 (#23263 )	2024-06-21 11:15:29 -04:00
Brace Sproul	abe7566d7d	core[minor]: BaseChatModel with_structured_output implementation (#22859 )	2024-06-21 08:14:03 -07:00
mackong	360a70c8a8	core[patch]: fix no current event loop for sql history in async mode (#22933 ) - Description: When use RunnableWithMessageHistory/SQLChatMessageHistory in async mode, we'll get the following error: ``` Error in RootListenersTracer.on_chain_end callback: RuntimeError("There is no current event loop in thread 'asyncio_3'.") ``` which throwed by `ddfbca38df/libs/community/langchain_community/chat_message_histories/sql.py (L259)`. and no message history will be add to database. In this patch, a new _aexit_history function which will'be called in async mode is added, and in turn aadd_messages will be called. In this patch, we use `afunc` attribute of a Runnable to check if the end listener should be run in async mode or not. - Issue: #22021, #22022 - Dependencies: N/A	2024-06-21 10:39:47 -04:00
Philippe PRADOS	1c2b9cc9ab	core[minor]: Update pgvector transalor for langchain_postgres (#23217 ) The SelfQuery PGVectorTranslator is not correct. The operator is "eq" and not "$eq". This patch use a new version of PGVectorTranslator from langchain_postgres. It's necessary to release a new version of langchain_postgres (see [here](https://github.com/langchain-ai/langchain-postgres/pull/75) before accepting this PR in langchain.	2024-06-21 10:37:09 -04:00
Mu Yang	401d469a92	langchain: fix systax warning in create_json_chat_agent (#23253 ) fix systax warning in `create_json_chat_agent` ``` .../langchain/agents/json_chat/base.py:22: SyntaxWarning: invalid escape sequence '\ ' """Create an agent that uses JSON to format its logic, build for Chat Models. ```	2024-06-21 10:05:38 -04:00
mackong	b108b4d010	core[patch]: set schema format for AsyncRootListenersTracer (#23214 ) - Description: AsyncRootListenersTracer support on_chat_model_start, it's schema_format should be "original+chat". - Issue: N/A - Dependencies:	2024-06-21 09:30:27 -04:00
Bagatur	976b456619	docs: BaseChatModel key methods table (#23238 ) If we're moving documenting inherited params think these kinds of tables become more important ![Screenshot 2024-06-20 at 3 59 12 PM](https://github.com/langchain-ai/langchain/assets/22008038/722266eb-2353-4e85-8fae-76b19bd333e0)	2024-06-20 21:00:22 -07:00
Jacob Lee	5da7eb97cb	docs[patch]: Update link (#23240 ) CC @agola11	2024-06-20 17:43:12 -07:00
ccurme	a7b4175091	standard tests: add test for tool calling (#23234 ) Including streaming	2024-06-20 17:20:11 -04:00
Bagatur	12e0c28a6e	docs: fix chat model methods table (#23233 ) rst table not md ![Screenshot 2024-06-20 at 12 37 46 PM](https://github.com/langchain-ai/langchain/assets/22008038/7a03b869-c1f4-45d0-8d27-3e16f4c6eb19)	2024-06-20 19:51:10 +00:00
Zheng Robert Jia	a349fce880	docs[minor],community[patch]: Minor tutorial docs improvement, minor import error quick fix. (#22725 ) minor changes to module import error handling and minor issues in tutorial documents. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-06-20 15:36:49 -04:00
Eugene Yurtsev	7545b1d29b	core[patch]: Fix doc-strings for code blocks (#23232 ) Code blocks need extra space around them to be rendered properly by sphinx	2024-06-20 19:34:52 +00:00
Luis Moros	d5be160af0	community[patch]: Fix sql_databse.from_databricks issue when ran from Job (#23224 ) Desscription: When the ``sql_database.from_databricks`` is executed from a Workflow Job, the ``context`` object does not have a "browserHostName" property, resulting in an error. This change manages the error so the "DATABRICKS_HOST" env variable value is used instead of stoping the flow Co-authored-by: lmorosdb <lmorosdb>	2024-06-20 19:34:15 +00:00
Cory Waddingham	cd6812342e	pinecone[patch]: Update Poetry requirements for pinecone-client >=3.2.2 (#22094 ) This change updates the requirements in `libs/partners/pinecone/pyproject.toml` to allow all versions of `pinecone-client` greater than or equal to 3.2.2. This change resolves issue [21955](https://github.com/langchain-ai/langchain/issues/21955). --------- Co-authored-by: Erick Friis <erickfriis@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-20 18:59:36 +00:00
ccurme	abb3066150	docs: clarify streaming with RunnableLambda (#23228 )	2024-06-20 14:49:00 -04:00
ccurme	bf7763d9b0	docs: add serialization guide (#23223 )	2024-06-20 12:50:24 -04:00
Eugene Yurtsev	59d7adff8f	core[patch]: Add clarification about streaming to RunnableLambda (#23227 ) Add streaming clarification to runnable lambda docstring.	2024-06-20 16:47:16 +00:00
Jacob Lee	60db79a38a	docs[patch]: Update Anthropic chat model docs (#23226 ) CC @baskaryan	2024-06-20 09:46:43 -07:00
maang-h	bc4cd9c5cc	community[patch]: Update root_validators ChatModels: ChatBaichuan, QianfanChatEndpoint, MiniMaxChat, ChatSparkLLM, ChatZhipuAI (#22853 ) This PR updates root validators for: - ChatModels: ChatBaichuan, QianfanChatEndpoint, MiniMaxChat, ChatSparkLLM, ChatZhipuAI Issues #22819 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-06-20 16:36:41 +00:00
ChrisDEV	cb6cf4b631	Fix return value type of dumpd (#20123 ) The return type of `json.loads` is `Any`. In fact, the return type of `dumpd` must be based on `json.loads`, so the correction here is understandable. Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-06-20 16:31:41 +00:00
Guangdong Liu	0bce28cd30	core(patch): Fix encoding problem of load_prompt method (#21559 ) - description: Add encoding parameters. - @baskaryan, @efriis, @eyurtsev, @hwchase17. ![54d25ac7b1d5c2e47741a56fe8ed8ba](https://github.com/langchain-ai/langchain/assets/48236177/ffea9596-2001-4e19-b245-f8a6e231b9f9)	2024-06-20 09:25:54 -07:00
Philippe PRADOS	8711c61298	core[minor]: Adds an in-memory implementation of RecordManager (#13200 ) Description: langchain offers three technologies to save data: - [vectorstore](https://python.langchain.com/docs/modules/data_connection/vectorstores/) - [docstore](https://js.langchain.com/docs/api/schema/classes/Docstore) - [record manager](https://python.langchain.com/docs/modules/data_connection/indexing) If you want to combine these technologies in a sample persistence stategy you need a common implementation for each. `DocStore` propose `InMemoryDocstore`. We propose the class `MemoryRecordManager` to complete the system. This is the prelude to another full-request, which needs a consistent combination of persistence components. Tag maintainer: @baskaryan Twitter handle: @pprados --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-06-20 12:19:10 -04:00
Eugene Yurtsev	3ab49c0036	docs: API reference remove Prev/Up/Next buttons (#23225 ) These do not work anyway. Let's remove them for now for simplicity.	2024-06-20 16:15:45 +00:00
Eugene Yurtsev	61daa16e5d	docs: Update clean up API reference (#23221 ) - Fix bug with TypedDicts rendering inherited methods if inherting from typing_extensions.TypedDict rather than typing.TypedDict - Do not surface inherited pydantic methods for subclasses of BaseModel - Subclasses of RunnableSerializable will not how methods inherited from Runnable or from BaseModel - Subclasses of Runnable that not pydantic models will include a link to RunnableInterface (they still show inherited methods, we can fix this later)	2024-06-20 11:35:00 -04:00
Leonid Ganeline	51e75cf59d	community: docstrings (#23202 ) Added missed docstrings. Format docstrings to the consistent format (used in the API Reference)	2024-06-20 11:08:13 -04:00
Julian Weng	6a1a0d977a	partners[minor]: Fix value error message for with_structured_output (#22877 ) Currently, calling `with_structured_output()` with an invalid method argument raises `Unrecognized method argument. Expected one of 'function_calling' or 'json_format'`, but the JSON mode option [is now referred to](https://python.langchain.com/v0.2/docs/how_to/structured_output/#the-with_structured_output-method) by `'json_mode'`. This fixes that. Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-06-20 15:03:21 +00:00
Qingchuan Hao	dd4d4411c9	doc: replace function all with tool call (#23184 ) - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-06-20 09:27:39 -04:00
Yahkeef Davis	b03c801523	Docs: Update Rag tutorial so it includes an additional notebook cell with pip installs of required langchain_chroma and langchain_community. (#23204 ) Description: Update Rag tutorial notebook so it includes an additional notebook cell with pip installs of required langchain_chroma and langchain_community. This fixes the issue with the rag tutorial gives you a 'missing modules' error if you run code in the notebook as is. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-20 09:22:49 -04:00
Leonid Ganeline	41f7620989	huggingface: docstrings (#23148 ) Added missed docstrings. Format docstrings to the consistent format (used in the API Reference) Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-20 13:22:40 +00:00
ccurme	066a5a209f	huggingface[patch]: fix CI for python 3.12 (#23197 )	2024-06-20 09:17:26 -04:00
xyd	9b3a025f9c	fix https://github.com/langchain-ai/langchain/issues/23215 (#23216 ) fix bug The ZhipuAIEmbeddings class is not working. Co-authored-by: xu yandong <shaonian@acsx1.onexmail.com>	2024-06-20 13:04:50 +00:00
Bagatur	ad7f2ec67d	standard-tests[patch]: test stop not stop_sequences (#23200 )	2024-06-19 18:07:33 -07:00
Bagatur	bd5c92a113	docs: standard params (#23199 )	2024-06-19 17:57:05 -07:00
David DeCaprio	a4bcb45f65	core:Add optional max_messages to MessagePlaceholder (#16098 ) - Description: Add optional max_messages to MessagePlaceholder - Issue: [16096](https://github.com/langchain-ai/langchain/issues/16096) - Dependencies: None - Twitter handle: @davedecaprio Sometimes it's better to limit the history in the prompt itself rather than the memory. This is needed if you want different prompts in the chain to have different history lengths. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-06-19 23:39:51 +00:00
shaunakgodbole	7193634ae6	fireworks[patch]: fix api_key alias in Fireworks LLM (#23118 ) Thank you for contributing to LangChain! Description The current code snippet for `Fireworks` had incorrect parameters. This PR fixes those parameters. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-19 21:14:42 +00:00
Eugene Yurtsev	1fcf875fe3	core[patch]: Document agent schema (#23194 ) * Document agent schema * Refer folks to langgraph for more information on how to create agents.	2024-06-19 20:16:57 +00:00
Bagatur	255ad39ae3	infra: run CI on large diffs (#23192 ) currently we skip CI on diffs >= 300 files. think we should just run it on all packages instead --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-19 19:30:56 +00:00
Eugene Yurtsev	c2d43544cc	core[patch]: Document messages namespace (#23154 ) - Moved doc-strings below attribtues in TypedDicts -- seems to render better on APIReference pages. * Provided more description and some simple code examples	2024-06-19 15:00:00 -04:00
Eugene Yurtsev	3c917204dc	core[patch]: Add doc-strings to outputs, fix @root_validator (#23190 ) - Document outputs namespace - Update a vanilla @root_validator that was missed	2024-06-19 14:59:06 -04:00
Bagatur	8698cb9b28	infra: add more formatter rules to openai (#23189 ) Turns on https://docs.astral.sh/ruff/settings/#format_docstring-code-format and https://docs.astral.sh/ruff/settings/#format_skip-magic-trailing-comma ```toml [tool.ruff.format] docstring-code-format = true skip-magic-trailing-comma = true ```	2024-06-19 11:39:58 -07:00
Michał Krassowski	710197e18c	community[patch]: restore compatibility with SQLAlchemy 1.x (#22546 ) - Description: Restores compatibility with SQLAlchemy 1.4.x that was broken since #18992 and adds a test run for this version on CI (only for Python 3.11) - Issue: fixes #19681 - Dependencies: None - Twitter handle: `@krassowski_m` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-19 17:58:57 +00:00
Erick Friis	48d6ea427f	upstage: move to external repo (#22506 )	2024-06-19 17:56:07 +00:00
Bagatur	0a4ee864e9	openai[patch]: image token counting (#23147 ) Resolves #23000 --------- Co-authored-by: isaac hershenson <ihershenson@hmc.edu> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-19 10:41:47 -07:00
Jorge Piedrahita Ortiz	b3e53ffca0	community[patch]: sambanova llm integration improvement (#23137 ) - Description: sambanova sambaverse integration improvement: removed input parsing that was changing raw user input, and was making to use process prompt parameter as true mandatory	2024-06-19 10:30:14 -07:00
Jorge Piedrahita Ortiz	e162893d7f	community[patch]: update sambastudio embeddings (#23133 ) Description: update sambastudio embeddings integration, now compatible with generic endpoints and CoE endpoints	2024-06-19 10:26:56 -07:00
Philippe PRADOS	db6f46c1a6	langchain[small]: Change type to BasePromptTemplate (#23083 ) ```python Change from_llm( prompt: PromptTemplate ... ) ``` to ```python Change from_llm( prompt: BasePromptTemplate ... ) ```	2024-06-19 13:19:36 -04:00
Sergey Kozlov	94452a94b1	core[patch[: add exceptions propagation test for astream_events v2 (#23159 ) Description: `astream_events(version="v2")` didn't propagate exceptions in `langchain-core<=0.2.6`, fixed in the #22916. This PR adds a unit test to check that exceptions are propagated upwards. Co-authored-by: Sergey Kozlov <sergey.kozlov@ludditelabs.io>	2024-06-19 13:00:25 -04:00
Leonid Ganeline	50484be330	prompty: docstring (#23152 ) Added missed docstrings. Format docstrings to the consistent format (used in the API Reference) --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-19 12:50:58 -04:00
Qingchuan Hao	9b82707ea6	docs: add bing search tool to ms platform (#23183 ) - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-06-19 12:43:05 -04:00
chenxi	505a2e8743	fix: MoonshotChat fails when setting the moonshot_api_key through the OS environment. (#23176 ) Close #23174 Co-authored-by: tianming <tianming@bytenew.com>	2024-06-19 16:28:24 +00:00
Bagatur	677408bfc9	core[patch]: fix chat history circular import (#23182 )	2024-06-19 09:08:36 -07:00
Eugene Yurtsev	883e90d06e	core[patch]: Add an example to the Document schema doc-string (#23131 ) Add an example to the document schema	2024-06-19 11:35:30 -04:00
ccurme	2b08e9e265	core[patch]: update test to catch circular imports (#23172 ) This raises ImportError due to a circular import: ```python from langchain_core import chat_history ``` This does not: ```python from langchain_core import runnables from langchain_core import chat_history ``` Here we update `test_imports` to run each import in a separate subprocess. Open to other ways of doing this!	2024-06-19 15:24:38 +00:00
Eugene Yurtsev	ae4c0ed25a	core[patch]: Add documentation to load namespace (#23143 ) Document some of the modules within the load namespace	2024-06-19 15:21:41 +00:00
Eugene Yurtsev	a34e650f8b	core[patch]: Add doc-string to document compressor (#23085 )	2024-06-19 11:03:49 -04:00
Eugene Yurtsev	1007a715a5	community[patch]: Prevent unit tests from making network requests (#23180 ) * Prevent unit tests from making network requests	2024-06-19 14:56:30 +00:00
ccurme	ca798bc6ea	community: move test to integration tests (#23178 ) Tests failing on master with > FAILED tests/unit_tests/embeddings/test_ovhcloud.py::test_ovhcloud_embed_documents - ValueError: Request failed with status code: 401, {"message":"Bad token; invalid JSON"}	2024-06-19 14:39:48 +00:00
Eugene Yurtsev	4fe8403bfb	core[patch]: Expand documentation in the indexing namespace (#23134 )	2024-06-19 10:11:44 -04:00
Eugene Yurtsev	fe4f10047b	core[patch]: Document embeddings namespace (#23132 ) Document embeddings namespace	2024-06-19 10:11:16 -04:00
Eugene Yurtsev	a3bae56a48	core[patch]: Update documentation in LLM namespace (#23138 ) Update documentation in lllm namespace.	2024-06-19 10:10:50 -04:00
Leonid Ganeline	a70b7a688e	ai21: docstrings (#23142 ) Added missed docstrings. Format docstrings to the consistent format (used in the API Reference)	2024-06-19 08:51:15 -04:00
Jacob Lee	0c2ebe5f47	docs[patch]: Standardize prerequisites in tutorial docs (#23150 ) CC @baskaryan	2024-06-18 23:10:13 -07:00
bilk0h	3d54784e6d	text-splitters: Fix/recursive json splitter data persistence issue (#21529 ) Thank you for contributing to LangChain! Description: Noticed an issue with when I was calling `RecursiveJsonSplitter().split_json()` multiple times that I was getting weird results. I found an issue where `chunks` list in the `_json_split` method. If chunks is not provided when _json_split (which is the case when split_json calls _json_split) then the same list is used for subsequent calls to `_json_split`. You can see this in the test case i also added to this commit. Output should be: ``` [{'a': 1, 'b': 2}] [{'c': 3, 'd': 4}] ``` Instead you get: ``` [{'a': 1, 'b': 2}] [{'a': 1, 'b': 2, 'c': 3, 'd': 4}] ``` --------- Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: isaac hershenson <ihershenson@hmc.edu> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-06-18 20:21:55 -07:00
Yuki Watanabe	9ab7a6df39	docs: Overhaul Databricks components documentation (#22884 ) Description: Documentation at [integrations/llms/databricks](https://python.langchain.com/v0.2/docs/integrations/llms/databricks/) is not up-to-date and includes examples about chat model and embeddings, which should be located in the different corresponding subdirectories. This PR split the page into correct scope and overhaul the contents. Note: This PR might be hard to review on the diffs view, please use the following preview links for the changed pages. - `ChatDatabricks`: https://langchain-git-fork-b-step62-chat-databricks-doc-langchain.vercel.app/v0.2/docs/integrations/chat/databricks/ - `Databricks`: https://langchain-git-fork-b-step62-chat-databricks-doc-langchain.vercel.app/v0.2/docs/integrations/llms/databricks/ - `DatabricksEmbeddings`: https://langchain-git-fork-b-step62-chat-databricks-doc-langchain.vercel.app/v0.2/docs/integrations/text_embedding/databricks/ - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Signed-off-by: B-Step62 <yuki.watanabe@databricks.com>	2024-06-18 20:10:54 -07:00
鹿鹿鹿鲨	6b46b5e9ce	community: add request_kwargs and expect TimeError AsyncHtmlLoader (#23068 ) - Description: add `request_kwargs` and expect `TimeError` in `_fetch` function for AsyncHtmlLoader. This allows you to fill in the kwargs parameter when using the `load()` method of the `AsyncHtmlLoader` class. Co-authored-by: Yucolu <yucolu@tencent.com>	2024-06-18 20:02:46 -07:00
Leonid Ganeline	109a70fc64	ibm: docstrings (#23149 ) Added missed docstrings. Format docstrings to the consistent format (used in the API Reference)	2024-06-18 20:00:27 -07:00
Ryan Elston	86ee4f0daa	text-splitters: Introduce Experimental Markdown Syntax Splitter (#22257 ) #### Description This MR defines a `ExperimentalMarkdownSyntaxTextSplitter` class. The main goal is to replicate the functionality of the original `MarkdownHeaderTextSplitter` which extracts the header stack as metadata but with one critical difference: it keeps the whitespace of the original text intact. This draft reimplements the `MarkdownHeaderTextSplitter` with a very different algorithmic approach. Instead of marking up each line of the text individually and aggregating them back together into chunks, this method builds each chunk sequentially and applies the metadata to each chunk. This makes the implementation simpler. However, since it's designed to keep white space intact its not a full drop in replacement for the original. Since it is a radical implementation change to the original code and I would like to get feedback to see if this is a worthwhile replacement, should be it's own class, or is not a good idea at all. Note: I implemented the `return_each_line` parameter but I don't think it's a necessary feature. I'd prefer to remove it. This implementation also adds the following additional features: - Splits out code blocks and includes the language in the `"Code"` metadata key - Splits text on the horizontal rule `---` as well - The `headers_to_split_on` parameter is now optional - with sensible defaults that can be overridden. #### Issue Keeping the whitespace keeps the paragraphs structure and the formatting of the code blocks intact which allows the caller much more flexibility in how they want to further split the individuals sections of the resulting documents. This addresses the issues brought up by the community in the following issues: - https://github.com/langchain-ai/langchain/issues/20823 - https://github.com/langchain-ai/langchain/issues/19436 - https://github.com/langchain-ai/langchain/issues/22256 #### Dependencies N/A #### Twitter handle @RyanElston --------- Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-06-18 19:44:00 -07:00
Bagatur	93d0ad97fe	anthropic[patch]: test image input (#23155 )	2024-06-19 02:32:15 +00:00
Leonid Ganeline	3dfd055411	anthropic: docstrings (#23145 ) Added missed docstrings. Format docstrings to the consistent format (used in the API Reference)	2024-06-18 22:26:45 -04:00
Bagatur	90559fde70	openai[patch], standard-tests[patch]: don't pass in falsey stop vals (#23153 ) adds an image input test to standard-tests as well	2024-06-18 18:13:13 -07:00
Bagatur	e8a8286012	core[patch]: runnablewithchathistory from core.runnables (#23136 )	2024-06-19 00:15:18 +00:00
Jacob Lee	2ae718796e	docs[patch]: Fix typo in feedback (#23146 )	2024-06-18 16:32:04 -07:00
Jacob Lee	74749c909d	docs[patch]: Adds feedback input after thumbs up/down (#23141 ) CC @baskaryan	2024-06-18 16:08:22 -07:00
Bagatur	cf38981bb7	docs: use trim_messages in chatbot how to (#23139 )	2024-06-18 15:48:03 -07:00
Vadym Barda	b483bf5095	core[minor]: handle boolean data in draw_mermaid (#23135 ) This change should address graph rendering issues for edges with boolean data Example from langgraph: ```python from typing import Annotated, TypedDict from langchain_core.messages import AnyMessage from langgraph.graph import END, START, StateGraph from langgraph.graph.message import add_messages class State(TypedDict): messages: Annotated[list[AnyMessage], add_messages] def branch(state: State) -> bool: return 1 + 1 == 3 graph_builder = StateGraph(State) graph_builder.add_node("foo", lambda state: {"messages": [("ai", "foo")]}) graph_builder.add_node("bar", lambda state: {"messages": [("ai", "bar")]}) graph_builder.add_conditional_edges( START, branch, path_map={True: "foo", False: "bar"}, then=END, ) app = graph_builder.compile() print(app.get_graph().draw_mermaid()) ``` Previous behavior: ```python AttributeError: 'bool' object has no attribute 'split' ``` Current behavior: ```python %%{init: {'flowchart': {'curve': 'linear'}}}%% graph TD; __start__[__start__]:::startclass; __end__[__end__]:::endclass; foo([foo]):::otherclass; bar([bar]):::otherclass; __start__ -. ('a',) .-> foo; foo --> __end__; __start__ -. ('b',) .-> bar; bar --> __end__; classDef startclass fill:#ffdfba; classDef endclass fill:#baffc9; classDef otherclass fill:#fad7de; ```	2024-06-18 20:15:42 +00:00
Bagatur	093ae04d58	core[patch]: Pin pydantic in py3.12.4 (#23130 )	2024-06-18 12:00:02 -07:00
hmasdev	ff0c06b1e5	langchain[patch]: fix `OutputType` of OutputParsers and fix legacy API in OutputParsers (#19792 ) # Description This pull request aims to address specific issues related to the ambiguity and error-proneness of the output types of certain output parsers, as well as the absence of unit tests for some parsers. These issues could potentially lead to runtime errors or unexpected behaviors due to type mismatches when used, causing confusion for developers and users. Through clarifying output types, this PR seeks to improve the stability and reliability. Therefore, this pull request - fixes the `OutputType` of OutputParsers to be the expected type; - e.g. `OutputType` property of `EnumOutputParser` raises `TypeError`. This PR introduce a logic to extract `OutputType` from its attribute. - and fixes the legacy API in OutputParsers like `LLMChain.run` to the modern API like `LLMChain.invoke`; - Note: For `OutputFixingParser`, `RetryOutputParser` and `RetryWithErrorOutputParser`, this PR introduces `legacy` attribute with False as default value in order to keep the backward compatibility - and adds the tests for the `OutputFixingParser` and `RetryOutputParser`. The following table shows my expected output and the actual output of the `OutputType` of OutputParsers. I have used this table to fix `OutputType` of OutputParsers. \| Class Name of OutputParser \| My Expected `OutputType` (after this PR)\| Actual `OutputType` [evidence](#evidence) (before this PR)\| Fix Required \| \|---------\|--------------\|---------\|--------\| \| BooleanOutputParser \| `<class 'bool'>` \| `<class 'bool'>` \| NO \| \| CombiningOutputParser \| `typing.Dict[str, Any]` \| `TypeError` is raised \| YES \| \| DatetimeOutputParser \| `<class 'datetime.datetime'>` \| `<class 'datetime.datetime'>` \| NO \| \| EnumOutputParser(enum=MyEnum) \| `MyEnum` \| `TypeError` is raised \| YES \| \| OutputFixingParser \| The same type as `self.parser.OutputType` \| `~T` \| YES \| \| CommaSeparatedListOutputParser \| `typing.List[str]` \| `typing.List[str]` \| NO \| \| MarkdownListOutputParser \| `typing.List[str]` \| `typing.List[str]` \| NO \| \| NumberedListOutputParser \| `typing.List[str]` \| `typing.List[str]` \| NO \| \| JsonOutputKeyToolsParser \| `typing.Any` \| `typing.Any` \| NO \| \| JsonOutputToolsParser \| `typing.Any` \| `typing.Any` \| NO \| \| PydanticToolsParser \| `typing.Any` \| `typing.Any` \| NO \| \| PandasDataFrameOutputParser \| `typing.Dict[str, Any]` \| `TypeError` is raised \| YES \| \| PydanticOutputParser(pydantic_object=MyModel) \| `<class '__main__.MyModel'>` \| `<class '__main__.MyModel'>` \| NO \| \| RegexParser \| `typing.Dict[str, str]` \| `TypeError` is raised \| YES \| \| RegexDictParser \| `typing.Dict[str, str]` \| `TypeError` is raised \| YES \| \| RetryOutputParser \| The same type as `self.parser.OutputType` \| `~T` \| YES \| \| RetryWithErrorOutputParser \| The same type as `self.parser.OutputType` \| `~T` \| YES \| \| StructuredOutputParser \| `typing.Dict[str, Any]` \| `TypeError` is raised \| YES \| \| YamlOutputParser(pydantic_object=MyModel) \| `MyModel` \| `~T` \| YES \| NOTE: In "Fix Required", "YES" means that it is required to fix in this PR while "NO" means that it is not required. # Issue No issues for this PR. # Twitter handle - [hmdev3](https://twitter.com/hmdev3) # Questions: 1. Is it required to create tests for legacy APIs `LLMChain.run` in the following scripts? - libs/langchain/tests/unit_tests/output_parsers/test_fix.py; - libs/langchain/tests/unit_tests/output_parsers/test_retry.py. 2. Is there a more appropriate expected output type than I expect in the above table? - e.g. the `OutputType` of `CombiningOutputParser` should be SOMETHING... # Actual outputs (before this PR) <div id='evidence'></div> <details><summary>Actual outputs</summary> ## Requirements - Python==3.9.13 - langchain==0.1.13 ```python Python 3.9.13 (tags/v3.9.13:6de2ca5, May 17 2022, 16:36:42) [MSC v.1929 64 bit (AMD64)] on win32 Type "help", "copyright", "credits" or "license" for more information. >>> import langchain >>> langchain.__version__ '0.1.13' >>> from langchain import output_parsers ``` ### `BooleanOutputParser` ```python >>> output_parsers.BooleanOutputParser().OutputType <class 'bool'> ``` ### `CombiningOutputParser` ```python >>> output_parsers.CombiningOutputParser(parsers=[output_parsers.DatetimeOutputParser(), output_parsers.CommaSeparatedListOutputParser()]).OutputType Traceback (most recent call last): File "<stdin>", line 1, in <module> File "D:\workspace\venv\lib\site-packages\langchain_core\output_parsers\base.py", line 160, in OutputType raise TypeError( TypeError: Runnable CombiningOutputParser doesn't have an inferable OutputType. Override the OutputType property to specify the output type. ``` ### `DatetimeOutputParser` ```python >>> output_parsers.DatetimeOutputParser().OutputType <class 'datetime.datetime'> ``` ### `EnumOutputParser` ```python >>> from enum import Enum >>> class MyEnum(Enum): ... a = 'a' ... b = 'b' ... >>> output_parsers.EnumOutputParser(enum=MyEnum).OutputType Traceback (most recent call last): File "<stdin>", line 1, in <module> File "D:\workspace\venv\lib\site-packages\langchain_core\output_parsers\base.py", line 160, in OutputType raise TypeError( TypeError: Runnable EnumOutputParser doesn't have an inferable OutputType. Override the OutputType property to specify the output type. ``` ### `OutputFixingParser` ```python >>> output_parsers.OutputFixingParser(parser=output_parsers.DatetimeOutputParser()).OutputType ~T ``` ### `CommaSeparatedListOutputParser` ```python >>> output_parsers.CommaSeparatedListOutputParser().OutputType typing.List[str] ``` ### `MarkdownListOutputParser` ```python >>> output_parsers.MarkdownListOutputParser().OutputType typing.List[str] ``` ### `NumberedListOutputParser` ```python >>> output_parsers.NumberedListOutputParser().OutputType typing.List[str] ``` ### `JsonOutputKeyToolsParser` ```python >>> output_parsers.JsonOutputKeyToolsParser(key_name='tool').OutputType typing.Any ``` ### `JsonOutputToolsParser` ```python >>> output_parsers.JsonOutputToolsParser().OutputType typing.Any ``` ### `PydanticToolsParser` ```python >>> from langchain.pydantic_v1 import BaseModel >>> class MyModel(BaseModel): ... a: int ... >>> output_parsers.PydanticToolsParser(tools=[MyModel, MyModel]).OutputType typing.Any ``` ### `PandasDataFrameOutputParser` ```python >>> output_parsers.PandasDataFrameOutputParser().OutputType Traceback (most recent call last): File "<stdin>", line 1, in <module> File "D:\workspace\venv\lib\site-packages\langchain_core\output_parsers\base.py", line 160, in OutputType raise TypeError( TypeError: Runnable PandasDataFrameOutputParser doesn't have an inferable OutputType. Override the OutputType property to specify the output type. ``` ### `PydanticOutputParser` ```python >>> output_parsers.PydanticOutputParser(pydantic_object=MyModel).OutputType <class '__main__.MyModel'> ``` ### `RegexParser` ```python >>> output_parsers.RegexParser(regex='$', output_keys=['a']).OutputType Traceback (most recent call last): File "<stdin>", line 1, in <module> File "D:\workspace\venv\lib\site-packages\langchain_core\output_parsers\base.py", line 160, in OutputType raise TypeError( TypeError: Runnable RegexParser doesn't have an inferable OutputType. Override the OutputType property to specify the output type. ``` ### `RegexDictParser` ```python >>> output_parsers.RegexDictParser(output_key_to_format={'a':'a'}).OutputType Traceback (most recent call last): File "<stdin>", line 1, in <module> File "D:\workspace\venv\lib\site-packages\langchain_core\output_parsers\base.py", line 160, in OutputType raise TypeError( TypeError: Runnable RegexDictParser doesn't have an inferable OutputType. Override the OutputType property to specify the output type. ``` ### `RetryOutputParser` ```python >>> output_parsers.RetryOutputParser(parser=output_parsers.DatetimeOutputParser()).OutputType ~T ``` ### `RetryWithErrorOutputParser` ```python >>> output_parsers.RetryWithErrorOutputParser(parser=output_parsers.DatetimeOutputParser()).OutputType ~T ``` ### `StructuredOutputParser` ```python >>> from langchain.output_parsers.structured import ResponseSchema >>> response_schemas = [ResponseSchema(name="foo",description="a list of strings",type="List[string]"),ResponseSchema(name="bar",description="a string",type="string"), ] >>> output_parsers.StructuredOutputParser.from_response_schemas(response_schemas).OutputType Traceback (most recent call last): File "<stdin>", line 1, in <module> File "D:\workspace\venv\lib\site-packages\langchain_core\output_parsers\base.py", line 160, in OutputType raise TypeError( TypeError: Runnable StructuredOutputParser doesn't have an inferable OutputType. Override the OutputType property to specify the output type. ``` ### `YamlOutputParser` ```python >>> output_parsers.YamlOutputParser(pydantic_object=MyModel).OutputType ~T ``` <div> --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-06-18 18:59:42 +00:00
Artem Mukhin	e271f75bee	docs: Fix URL formatting in deprecation warnings (#23075 ) Description Updated the URLs in deprecation warning messages. The URLs were previously written as raw strings and are now formatted to be clickable HTML links. Example of a broken link in the current API Reference: https://api.python.langchain.com/en/latest/chains/langchain.chains.openai_functions.extraction.create_extraction_chain_pydantic.html <img width="942" alt="Screenshot 2024-06-18 at 13 21 07" src="https://github.com/langchain-ai/langchain/assets/4854600/a1b1863c-cd03-4af2-a9bc-70375407fb00">	2024-06-18 14:49:58 -04:00
Gabriel Petracca	c6660df58e	community[minor]: Implement Doctran async execution (#22372 ) Description The DoctranTextTranslator has an async transform function that was not implemented because [the doctran library](https://github.com/psychic-api/doctran) uses a sync version of the `execute` method. - I implemented the `DoctranTextTranslator.atransform_documents()` method using `asyncio.to_thread` to run the function in a separate thread. - I updated the example in the Notebook with the new async version. - The performance improvements can be appreciated when a big document is divided into multiple chunks. Relates to: - Issue #14645: https://github.com/langchain-ai/langchain/issues/14645 - Issue #14437: https://github.com/langchain-ai/langchain/issues/14437 - https://github.com/langchain-ai/langchain/pull/15264 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-06-18 18:17:37 +00:00
Eugene Yurtsev	aa6415aa7d	core[minor]: Support multiple keys in get_from_dict_or_env (#23086 ) Support passing multiple keys for ge_from_dict_or_env	2024-06-18 14:13:28 -04:00
nold	226802f0c4	community: add args_schema to SearxSearch (#22954 ) This change adds args_schema (pydantic BaseModel) to SearxSearchRun for correct schema formatting on LLM function calls Issue: currently using SearxSearchRun with OpenAI function calling returns the following error "TypeError: SearxSearchRun._run() got an unexpected keyword argument '__arg1' ". This happens because the schema sent to the LLM is "input: '{"__arg1":"foobar"}'" while the method should be called with the "query" parameter. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-06-18 17:27:39 +00:00
Bagatur	01783d67fc	core[patch]: Release 0.2.9 (#23091 )	2024-06-18 17:15:04 +00:00
Finlay Macklon	616d06d7fe	community: glob multiple patterns when using DirectoryLoader (#22852 ) - Description: Updated community.langchain_community.document_loaders.directory.py to enable the use of multiple glob patterns in the `DirectoryLoader` class. Now, the glob parameter is of type `list[str] \| str` and still defaults to the same value as before. I updated the docstring of the class to reflect this, and added a unit test to community.tests.unit_tests.document_loaders.test_directory.py named `test_directory_loader_glob_multiple`. This test also shows an example of how to use the new functionality. - ~~Issue:~~Discussion Thread: https://github.com/langchain-ai/langchain/discussions/18559 - Dependencies: None - Twitter handle: N/a - [x] Add tests and docs - Added test (described above) - Updated class docstring - [x] Lint and test --------- Co-authored-by: isaac hershenson <ihershenson@hmc.edu> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-06-18 09:24:50 -07:00
Eugene Yurtsev	5564d9e404	core[patch]: Document BaseStore (#23082 ) Add doc-string to BaseStore	2024-06-18 11:47:47 -04:00
Takuya Igei	9f791b6ad5	core[patch],community[patch],langchain[patch]: `tenacity` dependency to version `>=8.1.0,<8.4.0` (#22973 ) Fix https://github.com/langchain-ai/langchain/issues/22972. - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-06-18 10:34:28 -04:00
Raghav Dixit	74c4cbb859	LanceDB example minor change (#23069 ) Removed package version `0.6.13` in the example.	2024-06-18 09:16:17 -04:00
Bagatur	ddfbca38df	docs: add trim_messages to chatbot (#23061 )	2024-06-17 22:39:39 -07:00
Lance Martin	931b41b30f	Update Fireworks link (#23058 )	2024-06-17 21:16:18 -07:00
Leonid Ganeline	6a66d8e2ca	docs: `AWS` platform page update (#23063 ) Added a reference to the `GlueCatalogLoader` new document loader.	2024-06-17 21:01:58 -07:00
Raviraj	858ce264ef	SemanticChunker : Feature Addition ("Semantic Splitting with gradient") (#22895 ) ```SemanticChunker``` currently provide three methods to split the texts semantically: - percentile - standard_deviation - interquartile I propose new method ```gradient```. In this method, the gradient of distance is used to split chunks along with the percentile method (technically) . This method is useful when chunks are highly correlated with each other or specific to a domain e.g. legal or medical. The idea is to apply anomaly detection on gradient array so that the distribution become wider and easy to identify boundaries in highly semantic data. I have tested this merge on a set of 10 domain specific documents (mostly legal). Details : - Issue: Improvement - Dependencies: NA - Twitter handle: [x.com/prajapat_ravi](https://x.com/prajapat_ravi) @hwchase17 --------- Co-authored-by: Raviraj Prajapat <raviraj.prajapat@sirionlabs.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-06-17 21:01:08 -07:00
Raghav Dixit	55705c0f5e	LanceDB integration update (#22869 ) Added : - [x] relevance search (w/wo scores) - [x] maximal marginal search - [x] image ingestion - [x] filtering support - [x] hybrid search w reranking make test, lint_diff and format checked.	2024-06-17 20:54:26 -07:00
Chang Liu	62c8a67f56	community: add KafkaChatMessageHistory (#22216 ) Add chat history store based on Kafka. Files added: `libs/community/langchain_community/chat_message_histories/kafka.py` `docs/docs/integrations/memory/kafka_chat_message_history.ipynb` New issue to be created for future improvement: 1. Async method implementation. 2. Message retrieval based on timestamp. 3. Support for other configs when connecting to cloud hosted Kafka (e.g. add `api_key` field) 4. Improve unit testing & integration testing.	2024-06-17 20:34:01 -07:00
shimajiroxyz	3e835a1aa1	langchain: add id_key option to EnsembleRetriever for metadata-based document merging (#22950 ) Description: - What I changed - By specifying the `id_key` during the initialization of `EnsembleRetriever`, it is now possible to determine which documents to merge scores for based on the value corresponding to the `id_key` element in the metadata, instead of `page_content`. Below is an example of how to use the modified `EnsembleRetriever`: ```python retriever = EnsembleRetriever(retrievers=[ret1, ret2], id_key="id") # The Document returned by each retriever must keep the "id" key in its metadata. ``` - Additionally, I added a script to easily test the behavior of the `invoke` method of the modified `EnsembleRetriever`. - Why I changed - There are cases where you may want to calculate scores by treating Documents with different `page_content` as the same when using `EnsembleRetriever`. For example, when you want to ensemble the search results of the same document described in two different languages. - The previous `EnsembleRetriever` used `page_content` as the basis for score aggregation, making the above usage difficult. Therefore, the score is now calculated based on the specified key value in the Document's metadata. Twitter handle: @shimajiroxyz	2024-06-18 03:29:17 +00:00
mackong	39f6c4169d	langchain[patch]: add tool messages formatter for tool calling agent (#22849 ) - Description: add tool_messages_formatter for tool calling agent, make tool messages can be formatted in different ways for your LLM. - Issue: N/A - Dependencies: N/A	2024-06-17 20:29:00 -07:00
Lucas Tucker	e25a5966b5	docs: Standardize DocumentLoader docstrings (#22932 ) Standardizing DocumentLoader docstrings (of which there are many) This PR addresses issue #22866 and adds docstrings according to the issue's specified format (in the appendix) for files csv_loader.py and json_loader.py in langchain_community.document_loaders. In particular, the following sections have been added to both CSVLoader and JSONLoader: Setup, Instantiate, Load, Async load, and Lazy load. It may be worth adding a 'Metadata' section to the JSONLoader docstring to clarify how we want to extract the JSON metadata (using the `metadata_func` argument). The files I used to walkthrough the various sections were `example_2.json` from [HERE](https://support.oneskyapp.com/hc/en-us/articles/208047697-JSON-sample-files) and `hw_200.csv` from [HERE](https://people.sc.fsu.edu/~jburkardt/data/csv/csv.html). --------- Co-authored-by: lucast2021 <lucast2021@headroyce.org> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-06-18 03:26:36 +00:00
Leonid Ganeline	a56ff199a7	docs: embeddings classes (#22927 ) Added a table with all Embedding classes.	2024-06-17 20:17:24 -07:00
Mohammad Mohtashim	60ba02f5db	[Community]: Fixed DDG DuckDuckGoSearchResults Docstring (#22968 ) - Description: A very small fix in the Docstring of `DuckDuckGoSearchResults` identified in the following issue. - Issue: #22961 --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-06-18 03:16:24 +00:00
Eun Hye Kim	70761af8cf	community: Fix #22975 (Add SSL Verification Option to Requests Class in langchain_community) (#22977 ) - PR title: "community: Fix #22975 (Add SSL Verification Option to Requests Class in langchain_community)" - PR message: - Description: - Added an optional verify parameter to the Requests class with a default value of True. - Modified the get, post, patch, put, and delete methods to include the verify parameter. - Updated the _arequest async context manager to include the verify parameter. - Added the verify parameter to the GenericRequestsWrapper class and passed it to the Requests class. - Issue: This PR fixes issue #22975. - Dependencies: No additional dependencies are required for this change. - Twitter handle: @lunara_x You can check this change with below code. ```python from langchain_openai.chat_models import ChatOpenAI from langchain.requests import RequestsWrapper from langchain_community.agent_toolkits.openapi import planner from langchain_community.agent_toolkits.openapi.spec import reduce_openapi_spec with open("swagger.yaml") as f: data = yaml.load(f, Loader=yaml.FullLoader) swagger_api_spec = reduce_openapi_spec(data) llm = ChatOpenAI(model='gpt-4o') swagger_requests_wrapper = RequestsWrapper(verify=False) # modified point superset_agent = planner.create_openapi_agent(swagger_api_spec, swagger_requests_wrapper, llm, allow_dangerous_requests=True, handle_parsing_errors=True) superset_agent.run( "Tell me the number and types of charts and dashboards available." ) ``` --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-06-18 03:12:40 +00:00
Mohammad Mohtashim	bf839676c7	[Community]: FIxed the DocumentDBVectorSearch `_similarity_search_without_score` (#22970 ) - Description: The PR #22777 introduced a bug in `_similarity_search_without_score` which was raising the `OperationFailure` error. The mistake was syntax error for MongoDB pipeline which has been corrected now. - Issue: #22770	2024-06-17 20:08:42 -07:00
Nuno Campos	f01f12ce1e	Include "no escape" and "inverted section" mustache vars in Prompt.input_variables and Prompt.input_schema (#22981 )	2024-06-17 19:24:13 -07:00
Bella Be	7a0b36501f	docs: Update how to docs for pydantic compatibility (#22983 ) Add missing imports in docs from langchain_core.tools BaseTool --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-06-18 01:49:56 +00:00
Jacob Lee	3b7b276f6f	docs[patch]: Adds evaluation sections (#23050 ) Also want to add an index/rollup page to LangSmith docs to enable linking to a how-to category as a group (e.g. https://docs.smith.langchain.com/how_to_guides/evaluation/) CC @agola11 @hinthornw	2024-06-17 17:25:04 -07:00
Jacob Lee	6605ae22f6	docs[patch]: Update docs links (#23013 )	2024-06-17 15:58:28 -07:00
Bagatur	c2b2e3266c	core[minor]: message transformer utils (#22752 )	2024-06-17 15:30:07 -07:00
Qingchuan Hao	c5e0acf6f0	docs: add bing search integration to agent (#22929 ) - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-06-17 18:08:52 -04:00
Anders Swanson	aacc6198b9	community: OCI GenAI embedding batch size (#22986 ) Thank you for contributing to LangChain! - [x] PR title: "community: OCI GenAI embedding batch size" - [x] PR message: - Issue: #22985 - [ ] Add tests and docs: N/A - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Signed-off-by: Anders Swanson <anders.swanson@oracle.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-17 22:06:45 +00:00
Bagatur	8235bae48e	core[patch]: Release 0.2.8 (#23012 )	2024-06-17 20:55:39 +00:00
Bagatur	5ee6e22983	infra: test all dependents on any change (#22994 )	2024-06-17 20:50:31 +00:00
Nuno Campos	bd4b68cd54	core: run_in_executor: Wrap StopIteration in RuntimeError (#22997 ) - StopIteration can't be set on an asyncio.Future it raises a TypeError and leaves the Future pending forever so we need to convert it to a RuntimeError	2024-06-17 20:40:01 +00:00
Bagatur	d96f67b06f	standard-tests[patch]: Update chat model standard tests (#22378 ) - Refactor standard test classes to make them easier to configure - Update openai to support stop_sequences init param - Update groq to support stop_sequences init param - Update fireworks to support max_retries init param - Update ChatModel.bind_tools to type tool_choice - Update groq to handle tool_choice="any". this may be controversial --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-17 13:37:41 -07:00
Bob Lin	14f0cdad58	docs: Add some 3rd party tutorials (#22931 ) Langchain is very popular among developers in China, but there are still no good Chinese books or documents, so I want to add my own Chinese resources on langchain topics, hoping to give Chinese readers a better experience using langchain. This is not a translation of the official langchain documentation, but my understanding. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-17 20:12:49 +00:00
Jacob Lee	893299c3c9	docs[patch]: Reorder streaming guide, add tags (#22993 ) CC @hinthornw	2024-06-17 13:10:51 -07:00
Oguz Vuruskaner	dd25d08c06	community[minor]: add tool calling for DeepInfraChat (#22745 ) DeepInfra now supports tool calling for supported models. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-17 15:21:49 -04:00
Bagatur	158701ab3c	docs: update universal init title (#22990 )	2024-06-17 12:13:31 -07:00
Lance Martin	a54deba6bc	Add RAG to conceptual guide (#22790 ) Co-authored-by: jacoblee93 <jacoblee93@gmail.com>	2024-06-17 11:20:28 -07:00
maang-h	c6b7db6587	community: Add Baichuan Embeddings batch size (#22942 ) - Support batch size Baichuan updates the document, indicating that up to 16 documents can be imported at a time - Standardized model init arg names - baichuan_api_key -> api_key - model_name -> model	2024-06-17 14:11:04 -04:00
ccurme	722c8f50ea	openai[patch]: add stream_usage parameter (#22854 ) Here we add `stream_usage` to ChatOpenAI as: 1. a boolean attribute 2. a kwarg to _stream and _astream. Question: should the `stream_usage` attribute be `bool`, or `bool \| None`? Currently I've kept it `bool` and defaulted to False. It was implemented on [ChatAnthropic](`e832bbb486/libs/partners/anthropic/langchain_anthropic/chat_models.py (L535)`) as a bool. However, to maintain support for users who access the behavior via OpenAI's `stream_options` param, this ends up being possible: ```python llm = ChatOpenAI(model_kwargs={"stream_options": {"include_usage": True}}) assert not llm.stream_usage ``` (and this model will stream token usage). Some options for this: - it's ok - make the `stream_usage` attribute bool or None - make an \_\_init\_\_ for ChatOpenAI, set a `._stream_usage` attribute and read `.stream_usage` from a property Open to other ideas as well.	2024-06-17 13:35:18 -04:00
Shubham Pandey	56ac94e014	community[minor]: add `ChatSnowflakeCortex` chat model (#21490 ) Description: This PR adds a chat model integration for [Snowflake Cortex](https://docs.snowflake.com/en/user-guide/snowflake-cortex/llm-functions), which gives an instant access to industry-leading large language models (LLMs) trained by researchers at companies like Mistral, Reka, Meta, and Google, including [Snowflake Arctic](https://www.snowflake.com/en/data-cloud/arctic/), an open enterprise-grade model developed by Snowflake. Dependencies: Snowflake's [snowpark](https://pypi.org/project/snowflake-snowpark-python/) library is required for using this integration. Twitter handle: [@gethouseware](https://twitter.com/gethouseware) - [x] Add tests and docs: 1. integration tests: `libs/community/tests/integration_tests/chat_models/test_snowflake.py` 2. unit tests: `libs/community/tests/unit_tests/chat_models/test_snowflake.py` 3. example notebook: `docs/docs/integrations/chat/snowflake.ipynb` - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-06-17 09:47:05 -07:00
Lance Martin	ea96133890	docs: Update llamacpp ntbk (#22907 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-17 15:42:56 +00:00
Bagatur	e2304ebcdb	standard-tests[patch]: Release 0.1.1 (#22984 )	2024-06-17 15:31:34 +00:00
Hakan Özdemir	c437b1aab7	[Partner]: Add metadata to stream response (#22716 ) Adds `response_metadata` to stream responses from OpenAI. This is returned with `invoke` normally, but wasn't implemented for `stream`. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-17 09:46:50 -04:00
Baskar Gopinath	42a379c75c	docs: Standardise formatting (#22948 ) Standardised formatting ![image](https://github.com/langchain-ai/langchain/assets/73015364/ea3b5c5c-e7a6-4bb7-8c6b-e7d8cbbbf761)	2024-06-17 09:00:05 -04:00
Ikko Eltociear Ashimine	3e7bb7690c	docs: update databricks.ipynb (#22949 ) arbitary -> arbitrary	2024-06-17 08:57:49 -04:00
Baskar Gopinath	19356b6445	Update sql_qa.ipynb (#22966 ) fixes #22798 fixes #22963	2024-06-17 08:57:16 -04:00
Bagatur	9ff249a38d	standard-tests[patch]: don't require str chunk contents (#22965 )	2024-06-17 08:52:24 -04:00
Daniel Glogowski	892bd4c29b	docs: nim model name update (#22943 ) NIM Model name change in a notebook and mdx file. Thanks!	2024-06-15 16:38:28 -04:00
Christopher Tee	ada03dd273	community(you): Better support for You.com News API (#22622 ) ## Description While `YouRetriever` supports both You.com's Search and News APIs, news is supported as an afterthought. More specifically, not all of the News API parameters are exposed for the user, only those that happen to overlap with the Search API. This PR: - improves support for both APIs, exposing the remaining News API parameters while retaining backward compatibility - refactor some REST parameter generation logic - updates the docstring of `YouSearchAPIWrapper` - add input validation and warnings to ensure parameters are properly set by user - 🚨 Breaking: Limit the news results to `k` items If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-06-15 20:05:19 +00:00
ccurme	e09c6bb58b	infra: update integration test workflow (#22945 )	2024-06-15 19:52:43 +00:00
Tomaz Bratanic	1c661fd849	Improve llm graph transformer docstring (#22939 )	2024-06-15 15:33:26 -04:00
maang-h	7a0af56177	docs: update ZhipuAI ChatModel docstring (#22934 ) - Description: Update ZhipuAI ChatModel rich docstring - Issue: the issue #22296	2024-06-15 09:12:21 -04:00
Appletree24	6838804116	docs:Fix mispelling in streaming doc (#22936 ) Description: Fix mispelling Issue: None Dependencies: None Twitter handle: None Co-authored-by: qcloud <ubuntu@localhost.localdomain>	2024-06-15 12:24:50 +00:00
Bitmonkey	570d45b2a1	Update ollama.py with optional raw setting. (#21486 ) Ollama has a raw option now. https://github.com/ollama/ollama/blob/main/docs/api.md Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com> Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-06-14 17:19:26 -07:00
caiyueliang	9944ad7f5f	community: 'Solve the issue where the _search function in ElasticsearchStore supports passing a query_vector parameter, but the parameter does not take effect. (#21532 ) Issue: When using the similarity_search_with_score function in ElasticsearchStore, I expected to pass in the query_vector that I have already obtained. I noticed that the _search function does support the query_vector parameter, but it seems to be ineffective. I am attempting to resolve this issue. Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-06-14 17:13:11 -07:00
Erick Friis	764f1958dd	docs: add ollama json mode (#22926 ) fixes #22910	2024-06-14 23:27:55 +00:00
Erick Friis	c374c98389	experimental: release 0.0.61 (#22924 )	2024-06-14 15:55:07 -07:00
BuxianChen	af65cac609	cli[minor]: remove redefined DEFAULT_GIT_REF (#21471 ) remove redefined DEFAULT_GIT_REF Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-06-14 15:49:15 -07:00
Erick Friis	79a64207f5	community: release 0.2.5 (#22923 )	2024-06-14 15:45:07 -07:00
Jiejun Tan	c8c67dde6f	text-splitters[patch]: Fix HTMLSectionSplitter (#22812 ) Update former pull request: https://github.com/langchain-ai/langchain/pull/22654. Modified `langchain_text_splitters.HTMLSectionSplitter`, where in the latest version `dict` data structure is used to store sections from a html document, in function `split_html_by_headers`. The header/section element names serve as dict keys. This can be a problem when duplicate header/section element names are present in a single html document. Latter ones can replace former ones with the same name. Therefore some contents can be miss after html text splitting is conducted. Using a list to store sections can hopefully solve the problem. A Unit test considering duplicate header names has been added. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-14 22:40:39 +00:00
Erick Friis	fbeeb6da75	langchain: release 0.2.5 (#22922 )	2024-06-14 15:37:54 -07:00
Erick Friis	551640a030	templates: remove lockfiles (#22920 ) poetry will default to latest versions without	2024-06-14 21:42:30 +00:00
Baskar Gopinath	c4f2bc9540	docs: Fix wrongly referenced class name in confluence.py (#22879 ) Fixes #22542 Changed ConfluenceReader to ConfluenceLoader	2024-06-14 14:00:48 -07:00
ccurme	32966a08a9	infra: remove nvidia from monorepo scheduled tests (#22915 ) Scheduled tests run in https://github.com/langchain-ai/langchain-nvidia/tree/main	2024-06-14 13:23:04 -07:00
Erick Friis	9ef15691d6	core: release 0.2.7 (#22917 )	2024-06-14 20:03:58 +00:00
Nuno Campos	338180f383	core: in astream_events v2 always await task even if already finished (#22916 ) - this ensures exceptions propagate to the caller	2024-06-14 19:54:20 +00:00
Istvan/Nebulinq	513e491ce9	experimental: LLMGraphTransformer - added relationship properties. (#21856 ) - Description: The generated relationships in the graph had no properties, but the Relationship class was properly defined with properties. This made it very difficult to transform conditional sentences into a graph. Adding properties to relationships can solve this issue elegantly. The changes expand on the existing LLMGraphTransformer implementation but add the possibility to define allowed relationship properties like this: LLMGraphTransformer(llm=llm, relationship_properties=["Condition", "Time"],) - Issue: no issue found - Dependencies: n/a - Twitter handle: @IstvanSpace -Quick Test ================================================================= from dotenv import load_dotenv import os from langchain_community.graphs import Neo4jGraph from langchain_experimental.graph_transformers import LLMGraphTransformer from langchain_openai import ChatOpenAI from langchain_core.prompts import ChatPromptTemplate from langchain_core.documents import Document load_dotenv() os.environ["NEO4J_URI"] = os.getenv("NEO4J_URI") os.environ["NEO4J_USERNAME"] = os.getenv("NEO4J_USERNAME") os.environ["NEO4J_PASSWORD"] = os.getenv("NEO4J_PASSWORD") graph = Neo4jGraph() llm = ChatOpenAI(temperature=0, model_name="gpt-4o") llm_transformer = LLMGraphTransformer(llm=llm) #text = "Harry potter likes pies, but only if it rains outside" text = "Jack has a dog named Max. Jack only walks Max if it is sunny outside." documents = [Document(page_content=text)] llm_transformer_props = LLMGraphTransformer( llm=llm, relationship_properties=["Condition"], ) graph_documents_props = llm_transformer_props.convert_to_graph_documents(documents) print(f"Nodes:{graph_documents_props[0].nodes}") print(f"Relationships:{graph_documents_props[0].relationships}") graph.add_graph_documents(graph_documents_props) --------- Co-authored-by: Istvan Lorincz <istvan.lorincz@pm.me> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-14 14:41:04 -04:00
ccurme	694ae87748	docs: add groq to chatmodeltabs (#22913 )	2024-06-14 14:31:48 -04:00
Eugene Yurtsev	c816d03699	dcos: Add admonition to PythonREPL tool (#22909 ) Add admonition to the documentation to make sure users are aware that the tool allows execution of code on the host machine using a python interpreter (by design).	2024-06-14 14:06:40 -04:00
kiarina	8171efd07a	core[patch]: Fix FunctionCallbackHandler._on_tool_end (#22908 ) If the global `debug` flag is enabled, the agent will get the following error in `FunctionCallbackHandler._on_tool_end` at runtime. ``` Error in ConsoleCallbackHandler.on_tool_end callback: AttributeError("'list' object has no attribute 'strip'") ``` By calling str() before strip(), the error was avoided. This error can be seen at [debugging.ipynb](https://github.com/langchain-ai/langchain/blob/master/docs/docs/how_to/debugging.ipynb). - Issue: NA - Dependencies: NA - Twitter handle: https://x.com/kiarina37	2024-06-14 17:59:29 +00:00
Philippe PRADOS	b61de9728e	community[minor]: Fix long_context_reorder.py async (#22839 ) Implement `async def atransform_documents( self, documents: Sequence[Document], **kwargs: Any ) -> Sequence[Document]` for `LongContextReorder`	2024-06-14 13:55:18 -04:00
Eugene Yurtsev	c72bcda4f2	community[major], experimental[patch]: Remove Python REPL from community (#22904 ) Remove the REPL from community, and suggest an alternative import from langchain_experimental. Fix for this issue: https://github.com/langchain-ai/langchain/issues/14345 This is not a bug in the code or an actual security risk. The python REPL itself is behaving as expected. The PR is done to appease blanket security policies that are just looking for the presence of exec in the code. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-14 17:53:29 +00:00
Eugene Yurtsev	9a877c7adb	community[patch]: SitemapLoader restrict depth of parsing sitemap (CVE-2024-2965) (#22903 ) This PR restricts the depth to which the sitemap can be parsed. Fix for: CVE-2024-2965	2024-06-14 13:04:40 -04:00
Eugene Yurtsev	4a77a3ab19	core[patch]: fix validation of @deprecated decorator (#22513 ) This PR moves the validation of the decorator to a better place to avoid creating bugs while deprecating code. Prevent issues like this from arising: https://github.com/langchain-ai/langchain/issues/22510 we should replace with a linter at some point that just does static analysis	2024-06-14 16:52:30 +00:00
Jacob Lee	181a61982f	anthropic[minor]: Adds streaming tool call support for Anthropic (#22687 ) Preserves string content chunks for non tool call requests for convenience. One thing - Anthropic events look like this: ``` RawContentBlockStartEvent(content_block=TextBlock(text='', type='text'), index=0, type='content_block_start') RawContentBlockDeltaEvent(delta=TextDelta(text='<thinking>\nThe', type='text_delta'), index=0, type='content_block_delta') RawContentBlockDeltaEvent(delta=TextDelta(text=' provide', type='text_delta'), index=0, type='content_block_delta') ... RawContentBlockStartEvent(content_block=ToolUseBlock(id='toolu_01GJ6x2ddcMG3psDNNe4eDqb', input={}, name='get_weather', type='tool_use'), index=1, type='content_block_start') RawContentBlockDeltaEvent(delta=InputJsonDelta(partial_json='', type='input_json_delta'), index=1, type='content_block_delta') ``` Note that `delta` has a `type` field. With this implementation, I'm dropping it because `merge_list` behavior will concatenate strings. We currently have `index` as a special field when merging lists, would it be worth adding `type` too? If so, what do we set as a context block chunk? `text` vs. `text_delta`/`tool_use` vs `input_json_delta`? CC @ccurme @efriis @baskaryan	2024-06-14 09:14:43 -07:00
ccurme	f40b2c6f9d	fireworks[patch]: add usage_metadata to (a)invoke and (a)stream (#22906 )	2024-06-14 12:07:19 -04:00
Mohammad Mohtashim	d1b7a934aa	[Community]: HuggingFaceCrossEncoder `score` accounting for <not-relevant score,relevant score> pairs. (#22578 ) - Description: Some of the Cross-Encoder models provide scores in pairs, i.e., <not-relevant score (higher means the document is less relevant to the query), relevant score (higher means the document is more relevant to the query)>. However, the `HuggingFaceCrossEncoder` `score` method does not currently take into account the pair situation. This PR addresses this issue by modifying the method to consider only the relevant score if score is being provided in pair. The reason for focusing on the relevant score is that the compressors select the top-n documents based on relevance. - Issue: #22556 - Please also refer to this [comment](https://github.com/UKPLab/sentence-transformers/issues/568#issuecomment-729153075)	2024-06-14 08:28:24 -07:00
Baskar Gopinath	83643cbdfe	docs: Fix typo in tutorial about structured data extraction (#22888 ) [Fixed typo](docs: Fix typo in tutorial about structured data extraction)	2024-06-14 15:19:55 +00:00
Thanh Nguyen	b5e2ba3a47	community[minor]: add chat model llamacpp (#22589 ) - PR title: [community] add chat model llamacpp - PR message: - Description: This PR introduces a new chat model integration with llamacpp_python, designed to work similarly to the existing ChatOpenAI model. + Work well with instructed chat, chain and function/tool calling. + Work with LangGraph (persistent memory, tool calling), will update soon - Dependencies: This change requires the llamacpp_python library to be installed. @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-14 14:51:43 +00:00
Bagatur	e4279f80cd	docs: doc loader feat table alignment (#22900 )	2024-06-14 14:25:01 +00:00
Isaac Francisco	984c7a9d42	docs: generate table for document loaders (#22871 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-14 07:03:27 -07:00
Jacob Lee	8e89178047	docs[patch]: Expand embeddings docs (#22881 )	2024-06-13 23:06:07 -07:00
ccurme	73c76b9628	anthropic[patch]: always add tool_result type to ToolMessage content (#22721 ) Anthropic tool results can contain image data, which are typically represented with content blocks having `"type": "image"`. Currently, these content blocks are passed as-is as human/user messages to Anthropic, which raises BadRequestError as it expects a tool_result block to follow a tool_use. Here we update ChatAnthropic to nest the content blocks inside a tool_result content block. Example: ```python import base64 import httpx from langchain_anthropic import ChatAnthropic from langchain_core.messages import AIMessage, HumanMessage, ToolMessage from langchain_core.pydantic_v1 import BaseModel, Field # Fetch image image_url = "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg" image_data = base64.b64encode(httpx.get(image_url).content).decode("utf-8") class FetchImage(BaseModel): should_fetch: bool = Field(..., description="Whether an image is requested.") llm = ChatAnthropic(model="claude-3-sonnet-20240229").bind_tools([FetchImage]) messages = [ HumanMessage(content="Could you summon a beautiful image please?"), AIMessage( content=[ { "type": "tool_use", "id": "toolu_01Rn6Qvj5m7955x9m9Pfxbcx", "name": "FetchImage", "input": {"should_fetch": True}, }, ], tool_calls=[ { "name": "FetchImage", "args": {"should_fetch": True}, "id": "toolu_01Rn6Qvj5m7955x9m9Pfxbcx", }, ], ), ToolMessage( name="FetchImage", content=[ { "type": "image", "source": { "type": "base64", "media_type": "image/jpeg", "data": image_data, }, }, ], tool_call_id="toolu_01Rn6Qvj5m7955x9m9Pfxbcx", ), ] llm.invoke(messages) ``` Trace: https://smith.langchain.com/public/d27e4fc1-a96d-41e1-9f52-54f5004122db/r	2024-06-13 20:14:23 -07:00
Lucas Tucker	7114aed78f	docs: Standardize ChatGroq (#22751 ) Updated ChatGroq doc string as per issue https://github.com/langchain-ai/langchain/issues/22296:"langchain_groq: updated docstring for ChatGroq in langchain_groq to match that of the description (in the appendix) provided in issue https://github.com/langchain-ai/langchain/issues/22296. " Issue: This PR is in response to issue https://github.com/langchain-ai/langchain/issues/22296, and more specifically the ChatGroq model. In particular, this PR updates the docstring for langchain/libs/partners/groq/langchain_groq/chat_model.py by adding the following sections: Instantiate, Invoke, Stream, Async, Tool calling, Structured Output, and Response metadata. I used the template from the Anthropic implementation and referenced the Appendix of the original issue post. I also noted that: `usage_metadata `returns none for all ChatGroq models I tested; there is no mention of image input in the ChatGroq documentation; unlike that of ChatHuggingFace, `.stream(messages)` for ChatGroq returned blocks of output. --------- Co-authored-by: lucast2021 <lucast2021@headroyce.org> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-14 03:08:36 +00:00
Anush	e002c855bd	qdrant[patch]: Use collection_exists API instead of exceptions (#22764 ) ## Description Currently, the Qdrant integration relies on exceptions raised by [`get_collection` ](https://qdrant.tech/documentation/concepts/collections/#collection-info) to check if a collection exists. Using [`collection_exists`](https://qdrant.tech/documentation/concepts/collections/#check-collection-existence) is recommended to avoid missing any unhandled exceptions. This PR addresses this. ## Testing All integration and unit tests pass. No user-facing changes.	2024-06-13 20:01:32 -07:00
Anindyadeep	c417803908	community[minor]: Prem Templates (#22783 ) This PR adds the feature add Prem Template feature in ChatPremAI. Additionally it fixes a minor bug for API auth error when API passed through arguments.	2024-06-13 19:59:28 -07:00
Stefano Lottini	4160b700e6	docs: Astra DB vectorstore, adjust syntax for automatic-embedding example (#22833 ) Description: Adjusting the syntax for creating the vectorstore collection (in the case of automatic embedding computation) for the most idiomatic way to submit the stored secret name. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-14 02:52:32 +00:00
maang-h	1055b9a309	community[minor]: Implement ZhipuAIEmbeddings interface (#22821 ) - Description: Implement ZhipuAIEmbeddings interface, include: - The `embed_query` method - The `embed_documents` method refer to [ZhipuAI Embedding-2](https://open.bigmodel.cn/dev/api#text_embedding) --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-06-13 19:45:11 -07:00
Leonid Ganeline	46c9784127	docs: `ReAct` reference (#22830 ) The `ReAct` is used all across LangChain but it is not referenced properly. Added references to the original paper.	2024-06-13 19:39:28 -07:00
Giacomo Berardi	712aa0c529	docs: fixes for Elasticsearch integrations, cache doc and providers list (#22817 ) Some minor fixes in the documentation: - ElasticsearchCache initilization is now correct - List of integrations for ES updated	2024-06-13 19:39:10 -07:00
Isaac Francisco	f9a6d5c845	infra: lint new docs to match doc loader template (#22867 )	2024-06-13 19:34:50 -07:00
Bagatur	8bd368d07e	cli[patch]: Release 0.0.25 (#22876 )	2024-06-14 02:31:04 +00:00
Isaac Francisco	75e966a2fa	docs, cli[patch]: document loaders doc template (#22862 ) From: https://github.com/langchain-ai/langchain/pull/22290 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-06-13 19:28:57 -07:00
Hayden Wolff	d1cdde267a	docs: update NVIDIA Riva tool to use NVIDIA NIM for LLM (#22873 ) Description: Update the NVIDIA Riva tool documentation to use NVIDIA NIM for the LLM. Show how to use NVIDIA NIMs and link to documentation for LangChain with NIM. --------- Co-authored-by: Hayden Wolff <hwolff@nvidia.com> Co-authored-by: Isaac Francisco <78627776+isahers1@users.noreply.github.com>	2024-06-13 19:26:05 -07:00
Zeeshan Qureshi	ada1e5cc64	docs: s/path_images/images/ for ImageCaptionLoader keyword arguments (#22857 ) Quick update to `ImageCaptionLoader` documentation to reflect what's in code.	2024-06-13 18:37:12 -07:00
liuzc9	41e232cb82	Fix typo in vearch.md (#22840 ) Fix typo	2024-06-13 18:24:51 -07:00
Kagura Chen	57783c5e55	Fix: lint errors and update Field alias in models.py and AutoSelectionScorer initialization (#22846 ) This PR addresses several lint errors in the core package of LangChain. Specifically, the following issues were fixed: 1.Unexpected keyword argument "required" for "Field" [call-arg] 2.tests/integration_tests/chains/test_cpal.py:263: error: Unexpected keyword argument "narrative_input" for "QueryModel" [call-arg]	2024-06-13 18:18:00 -07:00
Erick Friis	5bc774827b	langchain: release 0.2.4 (#22872 )	2024-06-14 00:14:48 +00:00
Erick Friis	7234fd0f51	core: release 0.2.6 (#22868 )	2024-06-13 22:22:34 +00:00
Jacob Lee	bcbb43480c	core[patch]: Treat type as a special field when merging lists (#22750 ) Should we even log a warning? At least for Anthropic, it's expected to get e.g. `text_block` followed by `text_delta`. @ccurme @baskaryan @efriis	2024-06-13 15:08:24 -07:00
Nuno Campos	bae82e966a	core: In astream_events v2 propagate cancel/break to the inner astream call (#22865 ) - previous behavior was for the inner astream to continue running with no interruption - also propagate break in core runnable methods	2024-06-13 15:02:48 -07:00
Eugene Yurtsev	a766815a99	experimental[patch]/docs[patch]: Update links to security docs (#22864 ) Minor update to newest version of security docs (content should be identical).	2024-06-13 20:29:34 +00:00
Eugene Yurtsev	8f7cc73817	ci: Add script to check for pickle usage in community (#22863 ) Add script to check for pickle usage in community.	2024-06-13 16:13:15 -04:00
Eugene Yurtsev	77209f315e	community[patch]: FAISS VectorStore deserializer should be opt-in (#22861 ) FAISS deserializer uses pickle module. Users have to opt-in to de-serialize.	2024-06-13 15:48:13 -04:00
Eugene Yurtsev	ce0b0f22a1	experimental[major]: Force users to opt-in into code that relies on the python repl (#22860 ) This should make it obvious that a few of the agents in langchain experimental rely on the python REPL as a tool under the hood, and will force users to opt-in.	2024-06-13 15:41:24 -04:00
Isaac Francisco	869523ad72	[docs]: added info for TavilySearchResults (#22765 )	2024-06-13 12:14:11 -07:00
ccurme	42257b120f	partners: fix numpy dep (#22858 ) Following https://github.com/langchain-ai/langchain/pull/22813, which added python 3.12 to CI, here we update numpy accordingly in partner packages.	2024-06-13 14:46:42 -04:00
Isaac Francisco	345fd3a556	minor functionality change: adding API functionality to tavilysearch (#22761 )	2024-06-13 11:10:28 -07:00
Isaac Francisco	034257e9bf	docs: improved recursive url loader docs (#22648 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-13 11:09:35 -07:00
Isaac Francisco	e832bbb486	[docs]: bind tools (#22831 )	2024-06-13 09:50:43 -07:00
ccurme	b626c3ca23	groq[patch]: add usage_metadata to (a)invoke and (a)stream (#22834 )	2024-06-13 10:26:27 -04:00
Jacob Lee	e01e5d5a91	docs[patch]: Improve Groq integration page (#22844 ) Was bare bones and got marked by folks as unhelpful. CC @efriis @colemccracken	2024-06-13 03:40:29 -07:00
Jacob Lee	12eff6a130	docs[patch]: Readd Pydantic compatibility docs (#22836 ) As a how-to guide. CC @eyurtsev @hwchase17	2024-06-13 02:56:10 -07:00
Jacob Lee	cb654a3245	docs[patch]: Adds multimodal column to chat models table, move up in concepts (#22837 ) CC @hwchase17 @baskaryan	2024-06-13 02:26:55 -07:00
James Braza	45b394268c	core[patch]: allowing latest `packaging` versions (#22792 ) Allowing version 24 of https://github.com/pypa/packaging --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-12 23:22:20 +00:00
Jacob Lee	00ad197502	docs[patch]: Add structured output to conceptual docs (#22791 ) This downgrades `Function/tool calling` from a h3 to an h4 which means it'll no longer show up in the right sidebar, but any direct links will still work. I think that is ok, but LMK if you disapprove. CC @hwchase17 @eyurtsev @rlancemartin	2024-06-12 15:30:51 -07:00
Karim Lalani	276be6cdd4	[experimental][llms][OllamaFunctions] tool calling related fixes (#22339 ) Fixes issues with tool calling to handle tool objects correctly. Added support to handle ToolMessage correctly. Added additional checks for error conditions. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-12 16:34:43 -04:00
Christophe Bornet	d04e899b56	ci: add testing with Python 3.12 (#22813 ) We need to use a different version of numpy for py3.8 and py3.12 in pyproject. And so do projects that use that Python version range and import langchain. - Twitter handle: _cbornet	2024-06-12 16:31:36 -04:00
HyoJin Kang	b6bf2bb234	community[patch]: fix database uri type in SQLDatabase (#22661 ) Description sqlalchemy uses "sqlalchemy.engine.URL" type for db uri argument. Added 'URL' type for compatibility. Issue: None Dependencies: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-12 15:11:00 -04:00
Eugene Yurtsev	5dbbdcbf8e	core[patch]: Update remaining root_validators (#22829 ) This PR updates the remaining root_validators in core to either be explicit pre-init or post-init validators.	2024-06-12 14:47:40 -04:00
Eugene Yurtsev	265e650e64	community[patch]: Update root_validators embeddings: llamacpp, jina, dashscope, mosaicml, huggingface_hub, Toolkits: Connery, ChatModels: PAI_EAS, (#22828 ) This PR updates root validators for: * Embeddings: llamacpp, jina, dashscope, mosaicml, huggingface_hub * Toolkits: Connery * ChatModels: PAI_EAS Following this issue: https://github.com/langchain-ai/langchain/issues/22819	2024-06-12 13:59:05 -04:00
JonZeolla	32ba8cfab0	community[minor]: implement huggingface show_progress consistently (#22682 ) - Description: This implements `show_progress` more consistently (i.e. it is also added to the `HuggingFaceBgeEmbeddings` object). - Issue: This implements `show_progress` more consistently in the embeddings huggingface classes. Previously this could have been set via `encode_kwargs`. - Dependencies: None - Twitter handle: @jonzeolla	2024-06-12 17:30:56 +00:00
Eugene Yurtsev	74e705250f	core[patch]: update some root_validators (#22787 ) Update some of the @root_validators to be explicit pre=True or pre=False, skip_on_failure=True for pydantic 2 compatibility.	2024-06-12 13:04:57 -04:00
bincat	3d6e8547f9	docs: fix function name in tutorials/agents.ipynb (#22809 ) the function called in the flowing example is `create_react_agent`, not `create_tool_calling_executor `	2024-06-12 12:30:35 -04:00
mrhbj	a1268d9e9a	community[patch]: fix hunyuan message include chinese signature error (#22795 ) (#22796 ) … (#22795) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-06-12 12:30:22 -04:00
Kagura Chen	513f1d8037	docs: update repo_structure.mdx to reflect latest code changes (#22810 ) Description: This PR updates the documentation to reflect the recent code changes. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-12 12:30:04 -04:00
Mr. Lance E Sloan «UMich»	08c466c603	community[patch]: bugfix for `YoutubeLoader`'s `LINES` format (#22815 ) - Description: A change I submitted recently introduced a bug in `YoutubeLoader`'s `LINES` output format. In those conditions, curly braces ("`{}`") creates a set, not a dictionary. This bugfix explicitly specifies that a dictionary is created. - Issue: N/A - Dependencies: N/A - Twitter: lsloan_umich - Mastodon: [lsloan@mastodon.social](https://mastodon.social/@lsloan)	2024-06-12 12:29:34 -04:00
Philippe PRADOS	23c22fcbc9	langchain[minor]: Make EmbeddingsFilters async (#22737 ) Add native async implementation for EmbeddingsFilter	2024-06-12 12:27:26 -04:00
endrajeet	b45bf78d2e	Update index.mdx (#22818 ) changed "# 🌟Recognition" to "### 🌟 Recognition" to match the rest of the subheadings. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-06-12 12:27:16 -04:00
Bagatur	8203c1ff87	infra: lint new docs to match templates (#22786 )	2024-06-11 13:26:35 -07:00
ccurme	936aedd10c	mistral[patch]: add usage_metadata to (a)invoke and (a)stream (#22781 )	2024-06-11 15:34:50 -04:00
Jiří Spilka	20e3662acf	docs: Correct code examples in the Apify's notebooks (#22768 ) Description: Correct code examples in the Apify document load notebook and Apify Dataset notebook Issue: None Dependencies: None Twitter handle: None	2024-06-11 15:20:16 -04:00
mrhbj	9212c9fcb8	community[patch]: fix hunyuan client json analysis (#22452 ) (#22767 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-11 19:05:18 +00:00
Rohan Aggarwal	86e8224cf1	community[patch]: Support for old clients (Thin and Thick) Oracle Vector Store (#22766 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" Support for old clients (Thin and Thick) Oracle Vector Store - [ ] PR message: *Delete this entire checklist* and replace with Support for old clients (Thin and Thick) Oracle Vector Store - [ ] Add tests and docs: If you're adding a new integration, please include Have our own local tests --------- Co-authored-by: rohan.aggarwal@oracle.com <rohaagga@phoenix95642.dev3sub2phx.databasede3phx.oraclevcn.com>	2024-06-11 11:36:06 -07:00
Jacob Lee	232908a46d	docs[patch]: Adds streaming conceptual doc (#22760 ) CC @hwchase17 @baskaryan	2024-06-11 11:03:52 -07:00
Mr. Lance E Sloan «UMich»	84dc2dd059	community[patch]: Load YouTube transcripts (captions) as fixed-duration chunks with start times (#21710 ) - Description: Add a new format, `CHUNKS`, to `langchain_community.document_loaders.youtube.YoutubeLoader` which creates multiple `Document` objects from YouTube video transcripts (captions), each of a fixed duration. The metadata of each chunk `Document` includes the start time of each one and a URL to that time in the video on the YouTube website. I had implemented this for UMich (@umich-its-ai) in a local module, but it makes sense to contribute this to LangChain community for all to benefit and to simplify maintenance. - Issue: N/A - Dependencies: N/A - Twitter: lsloan_umich - Mastodon: [lsloan@mastodon.social](https://mastodon.social/@lsloan) With regards to tests and documentation, most existing features of the `YoutubeLoader` class are not tested. Only the `YoutubeLoader.extract_video_id()` static method had a test. However, while I was waiting for this PR to be reviewed and merged, I had time to add a test for the chunking feature I've proposed in this PR. I have added an example of using chunking to the `docs/docs/integrations/document_loaders/youtube_transcript.ipynb` notebook. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-11 17:44:36 +00:00
Aayush Kataria	71811e0547	community[minor]: Adds a vector store for Azure Cosmos DB for NoSQL (#21676 ) This PR add supports for Azure Cosmos DB for NoSQL vector store. Summary: Description: added vector store integration for Azure Cosmos DB for NoSQL Vector Store, Dependencies: azure-cosmos dependency, Tag maintainer: @hwchase17, @baskaryan @efriis @eyurtsev --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-11 10:34:01 -07:00
Mohammad Mohtashim	36cad5d25c	[Community]: Added Metadata filter support for DocumentDB Vector Store (#22777 ) - Description: As pointed out in this issue #22770, DocumentDB `similarity_search` does not support filtering through metadata which this PR adds by passing in the parameter `filter`. Also this PR fixes a minor Documentation error. - Issue: #22770 --------- Co-authored-by: Erick Friis <erickfriis@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-11 16:37:53 +00:00
Dmitry Stepanov	912751e268	Ollama vision support (#22734 ) Description: Ollama vision with messages in OpenAI-style support `{ "image_url": { "url": ... } }` Issue: #22460 Added flexible solution for ChatOllama to support chat messages with images. Works when you provide either `image_url` as a string or as a dict with "url" inside (like OpenAI does). So it makes available to use tuples with `ChatPromptTemplate.from_messages()` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-11 16:10:19 +00:00
Philippe PRADOS	0908b01cb2	langchain[minor]: Add native async implementation to LLMFilter, add concurrency to both sync and async paths (#22739 ) Thank you for contributing to LangChain! - [ ] PR title: "langchain: Fix chain_filter.py to be compatible with async" - [ ] PR message: - Description: chain_filter is not compatible with async. - Twitter handle: pprados - [X ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Signed-off-by: zhangwangda <zhangwangda94@163.com> Co-authored-by: Prakul <discover.prakul@gmail.com> Co-authored-by: Lei Zhang <zhanglei@apache.org> Co-authored-by: Gin <ictgtvt@gmail.com> Co-authored-by: wangda <38549158+daziz@users.noreply.github.com> Co-authored-by: Max Mulatz <klappradla@posteo.net>	2024-06-11 10:55:40 -04:00
Jaeyeon Kim(김재연)	ce4e29ae42	community[minor]: fix redis store docstring and streamline initialization code (#22730 ) Thank you for contributing to LangChain! ### Description Fix the example in the docstring of redis store. Change the initilization logic and remove redundant check, enhance error message. ### Issue The example in docstring of how to use redis store was wrong. ![image](https://github.com/langchain-ai/langchain/assets/37469330/78c5d9ce-ee66-45b3-8dfe-ea29f125e6e9) ### Dependencies Nothing - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-06-11 14:08:05 +00:00
am-kinetica	ad101adec8	community[patch]: Kinetica Integrations handled error in querying; quotes in table names; updated gpudb API (#22724 ) - [ ] Miscellaneous updates and fixes: - Description: Handled error in querying; quotes in table names; updated gpudb API - Issue: Threw an error with an error message difficult to understand if a query failed or returned no records - Dependencies: Updated GPUDB API version to `7.2.0.9` @baskaryan @hwchase17	2024-06-11 10:01:26 -04:00
NithinBairapaka	27b9ea14a5	docs: Updated integration docs with required package installations (#22392 ) Title: Updated integration docs with required package installations Issue: #22005	2024-06-11 01:44:05 +00:00
Albert Gil López	1710423de3	docs: correct path in readme (#22383 ) Description: Fix incorrect path in README instructions. Issue: N/A Dependencies: None Twitter handle: @jddam --------- Co-authored-by: isaac hershenson <ihershenson@hmc.edu>	2024-06-10 17:47:39 -07:00
Greg Tracy	7e115da16c	docs: Fix pixelation in stack graphic (#21554 ) This change updates the stack graphic displayed in the top-level README. The LangChain tile is pixelated in the current graphic.	2024-06-10 22:52:22 +00:00
Leonid Ganeline	55bd8e582b	docs: `integrations` cache: added class table (#22368 ) Added a table with the cache classes. See [this table here](https://langchain-rnpqvikie-langchain.vercel.app/v0.2/docs/integrations/llm_caching/#cache-classes-summary-table).	2024-06-10 15:09:03 -07:00
Jacob Lee	89804c3026	docs: Adds pointers from LLM pages to equivalent chat model pages (#22759 ) @baskaryan	2024-06-10 14:13:22 -07:00
Qingchuan Hao	7f180f996b	docs: fix langchain expression language link (#22683 )	2024-06-10 21:12:47 +00:00
Mathis Joffre	ea43f40daf	community[minor]: Add support for OVHcloud AI Endpoints Embedding (#22667 ) Description: Add support for [OVHcloud AI Endpoints](https://endpoints.ai.cloud.ovh.net/) Embedding models. Inspired by: https://gist.github.com/gmasse/e1f99339e161f4830df6be5d0095349a Signed-off-by: Joffref <mariusjoffre@gmail.com>	2024-06-10 21:07:25 +00:00
Erick Friis	2aaf86ddae	core: fix mustache falsy cases (#22747 )	2024-06-10 14:00:12 -07:00
Eugene Yurtsev	5a7eac191a	core[patch]: Add missing type annotations (#22756 ) Add missing type annotations. The missing type annotations will raise exceptions with pydantic 2.	2024-06-10 16:59:41 -04:00
Eugene Yurtsev	05d31a2f00	community[patch]: Add missing type annotations (#22758 ) Add missing type annotations to objects in community. These missing type annotations will raise type errors in pydantic 2.	2024-06-10 16:59:28 -04:00
Naka Masato	3237909221	langchain[patch]: allow to use partial variables in create_sql_query_chain (#22688 ) - Description: allow to use partial variables to pass `top_k` and `table_info` - Issue: no - Dependencies: no - Twitter handle: @gymnstcs --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-10 20:58:30 +00:00
Bharat Ramanathan	2b5631a6be	community[patch]: fix `WandbTracer` to work with new "RunV2" API (#22673 ) - Description: This PR updates the `WandbTracer` to work with the new RunV2 API so that wandb Traces logging works correctly for new LangChain versions. Here's an example [run](https://wandb.ai/parambharat/langchain-tracing/runs/wpm99ftq) from the existing tests - Issue: https://github.com/wandb/wandb/issues/7762 - Twitter handle: @ParamBharat _If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17._	2024-06-10 13:56:35 -07:00
Oguz Vuruskaner	f0f4532579	community[patch]: fix deepinfra inference (#22680 ) This PR includes: 1. Update of default model to LLama3. 2. Handle some 400x errors with more user friendly error messages. 3. Handle user errors.	2024-06-10 13:55:55 -07:00
Lucas Tucker	cb79e80b0b	docs: standardize ChatHuggingFace (#22693 ) Updated ChatHuggingFace doc string as per issue #22296: "langchain_huggingface: updated docstring for ChatHuggingFace in langchain_huggingface to match that of the description (in the appendix) provided in issue #22296. " Issue: This PR is in response to issue #22296, and more specifically ChatHuggingFace model. In particular, this PR updates the docstring for langchain/libs/partners/hugging_face/langchain_huggingface/chat_models/huggingface.py by adding the following sections: Instantiate, Invoke, Stream, Async, Tool calling, and Response metadata. I used the template from the Anthropic implementation and referenced the Appendix of the original issue post. I also noted that: langchain_community hugging face llms do not work with langchain_huggingface's ChatHuggingFace model (at least for me); the .stream(messages) functionality of ChatHuggingFace only returned a block of response. --------- Co-authored-by: lucast2021 <lucast2021@headroyce.org> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-10 20:54:36 +00:00
Erick Friis	d92f2251c8	docs: couchbase partner package (#22757 )	2024-06-10 20:53:03 +00:00
Tomaz Bratanic	76a193decc	community[patch]: Add function response to graph cypher qa chain (#22690 ) LLMs struggle with Graph RAG, because it's different from vector RAG in a way that you don't provide the whole context, only the answer and the LLM has to believe. However, that doesn't really work a lot of the time. However, if you wrap the context as function response the accuracy is much better. btw... `union[LLMChain, Runnable]` is linting fun, that's why so many ignores	2024-06-10 13:52:17 -07:00
X-HAN	34edfe4a16	community[minor]: add Volcengine Rerank (#22700 ) Description: this PR adds Volcengine Rerank capability to Langchain, you can find Volcengine Rerank API from [here](https://www.volcengine.com/docs/84313/1254474) & [here](https://www.volcengine.com/docs/84313/1254605). [Volcengine](https://www.volcengine.com/) is a cloud service platform developed by ByteDance, the parent company of TikTok. You can obtain Volcengine API AK/SK from [here](https://www.volcengine.com/docs/84313/1254553). Dependencies: VolcengineRerank depends on `volcengine` python package. Twitter handle: my twitter/x account is https://x.com/LastMonopoly and I'd like a mention, thank you! Tests and docs 1. integration test: `test_volcengine_rerank.py` 2. example notebook: `volcengine_rerank.ipynb` Lint and test: I have run `make format`, `make lint` and `make test` from the root of the package I've modified.	2024-06-10 13:41:05 -07:00
Prakul	9eacce9356	docs:Update reference to langchain-mongodb (#22705 ) Description: Update reference to langchain-mongodb	2024-06-10 13:35:21 -07:00
Ikko Eltociear Ashimine	4197c9c85f	docs: update azure_container_apps_dynamic_sessions_data_analyst.ipynb (#22718 ) colum -> column	2024-06-10 13:33:40 -07:00
Jacob Lee	e4183cbc4e	docs[patch]: Add caution on OpenAI LLMs integration page (#22754 ) @baskaryan do we like? <img width="1040" alt="Screenshot 2024-06-10 at 12 16 45 PM" src="https://github.com/langchain-ai/langchain/assets/6952323/8893063f-1acf-4a56-9ee5-a8a2b1560277">	2024-06-10 13:27:22 -07:00
Mohammad Mohtashim	c3cce98d86	community[patch]: Small Fix in OutlookMessageLoader (Close the Message once Open) (#22744 ) - Description: A very small fix where we close the message when it opened - Issue: #22729	2024-06-10 13:08:39 -07:00
Bagatur	86a3f6edf1	docs: standardize ChatVertexAI (#22686 ) Part of #22296. Part two of https://github.com/langchain-ai/langchain-google/pull/287	2024-06-10 12:50:50 -07:00
ccurme	f9fdca6cc2	openai: add `parallel_tool_calls` to api ref (#22746 ) ![Screenshot 2024-06-10 at 1 41 24 PM](https://github.com/langchain-ai/langchain/assets/26529506/2626bf9c-41c6-4431-b2e1-f59de1e4e468)	2024-06-10 17:44:43 +00:00
Max Mulatz	058a64c563	Community[minor]: Add language parser for Elixir (#22742 ) Hi 👋 First off, thanks a ton for your work on this 💚 Really appreciate what you're providing here for the community. ## Description This PR adds a basic language parser for the [Elixir](https://elixir-lang.org/) programming language. The parser code is based upon the approach outlined in https://github.com/langchain-ai/langchain/pull/13318: it's using `tree-sitter` under the hood and aligns with all the other `tree-sitter` based parses added that PR. The `CHUNK_QUERY` I'm using here is probably not the most sophisticated one, but it worked for my application. It's a starting point to provide "core" parsing support for Elixir in LangChain. It enables people to use the language parser out in real world applications which may then lead to further tweaking of the queries. I consider this PR just the ground work. - Dependencies: requires `tree-sitter` and `tree-sitter-languages` from the extended dependencies - Twitter handle:`@bitcrowd` ## Checklist - [x] PR title: "package: description" - [x] Add tests and docs - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. <!-- If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. -->	2024-06-10 15:56:57 +00:00
wangda	28e956735c	docs:Correcting spelling mistakes in readme (#22664 ) Signed-off-by: zhangwangda <zhangwangda94@163.com>	2024-06-10 15:33:41 +00:00
Gin	6f54abc252	docs: Add a missing dot in concepts.mdx (#22677 )	2024-06-10 15:30:56 +00:00
Philippe PRADOS	2d4689d721	langchain[minor]: Add pgvector to list of supported vectorstores in self query retriever (#22678 ) The fact that we outsourced pgvector to another project has an unintended effect. The mapping dictionary found by `_get_builtin_translator()` cannot recognize the new version of pgvector because it comes from another package. `SelfQueryRetriever` no longer knows `PGVector`. I propose to fix this by creating a global dictionary that can be populated by various database implementations. Thus, importing `langchain_postgres` will allow the registration of the `PGvector` mapping. But for the moment I'm just adding a lazy import Furthermore, the implementation of _get_builtin_translator() reconstructs the BUILTIN_TRANSLATORS variable with each invocation, which is not very efficient. A global map would be an optimization. - Twitter handle: pprados @eyurtsev, can you review this PR? And unlock the PR [Add async mode for pgvector](https://github.com/langchain-ai/langchain-postgres/pull/32) and PR [community[minor]: Add SQL storage implementation](https://github.com/langchain-ai/langchain/pull/22207)? Are you in favour of a global dictionary-based implementation of Translator?	2024-06-10 11:27:47 -04:00
Lei Zhang	5ba1899cd7	infra: Scheduled GitHub Actions to run only on the upstream repository (#22707 ) Description: Scheduled GitHub Actions to run only on the upstream repository Issue: Fixes #22706 Twitter handle: @coolbeevip	2024-06-10 11:07:42 -04:00
Prakul	3f76c9e908	docs: Update MongoDB information in llm_caching (#22708 ) Description:: Update MongoDB information in llm_caching	2024-06-10 11:05:55 -04:00
fzowl	c1fced9269	docs: VoyageAI new embedding and reranking models (#22719 )	2024-06-09 09:12:43 -07:00
Enzo Poggio	8f019e91d7	community[patch]: Use Custom Logger Instead of Root Logger in get_user_agent Function (#22691 ) ## Description This PR addresses a logging inconsistency in the `get_user_agent` function. Previously, the function was using the root logger to log a warning message when the "USER_AGENT" environment variable was not set. This bypassed the custom logger `log` that was created at the start of the module, leading to potential inconsistencies in logging behavior. Changes: - Replaced `logging.warning` with `log.warning` in the `get_user_agent` function to ensure that the custom logger is used. This change ensures that all logging in the `get_user_agent` function respects the configurations of the custom logger, leading to more consistent and predictable logging behavior. ## Dependencies None ## Issue None ## Tests and docs ☝🏻 see description ## `make format`, `make lint` & `cd libs/community; make test` ```shell > make format poetry run ruff format docs templates cookbook 1417 files left unchanged poetry run ruff check --select I --fix docs templates cookbook All checks passed! ``` ```shell > make lint poetry run ruff check docs templates cookbook All checks passed! poetry run ruff format docs templates cookbook --diff 1417 files already formatted poetry run ruff check --select I docs templates cookbook All checks passed! git grep 'from langchain import' docs/docs templates cookbook \| grep -vE 'from langchain import (hub)' && exit 1 \|\| exit 0 ``` ~cd libs/community; make test~ too much dependencies for integration ... ```shell > poetry run pytest tests/unit_tests .... ==== 884 passed, 466 skipped, 4447 warnings in 15.93s ==== ``` I choose you randomly : @ccurme	2024-06-08 02:33:07 +00:00
Philippe PRADOS	9aabb446c5	community[minor]: Add SQL storage implementation (#22207 ) Hello @eyurtsev - package: langchain-comminity - Description: Add SQL implementation for docstore. A new implementation, in line with my other PR ([async PGVector](https://github.com/langchain-ai/langchain-postgres/pull/32), [SQLChatMessageMemory](https://github.com/langchain-ai/langchain/pull/22065)) - Twitter handler: pprados --------- Signed-off-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Piotr Mardziel <piotrm@gmail.com> Co-authored-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-06-07 21:17:02 +00:00
Nithish Raghunandanan	f2f0e0e13d	couchbase: Add the initial version of Couchbase partner package (#22087 ) Co-authored-by: Nithish Raghunandanan <nithishr@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-07 14:04:08 -07:00
Cahid Arda Öz	6c07eb0c12	community[minor]: Add UpstashRatelimitHandler (#21885 ) Adding `UpstashRatelimitHandler` callback for rate limiting based on number of chain invocations or LLM token usage. For more details, see [upstash/ratelimit-py repository](https://github.com/upstash/ratelimit-py) or the notebook guide included in this PR. Twitter handle: @cahidarda --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-06-07 21:02:06 +00:00
Erick Friis	9b3ce16982	docs: remove nonexistent headings (#22685 )	2024-06-07 20:02:06 +00:00
Erick Friis	9e03864d64	core: add error message for non-structured llm to StructuredPrompt (#22684 ) previously was the blank `NotImplementedError` from `BaseLanguageModel.with_structured_output`	2024-06-07 19:42:09 +00:00
Jacob Lee	02ff78deb8	docs[patch]: Adds LangGraph and LangSmith links, adds more crosslinks between pages (#22656 ) @baskaryan @hwchase17	2024-06-07 10:22:29 -07:00
Mateusz Szewczyk	c3a8716589	docs: Updated product version in Embeddings notebook (#22062 )	2024-06-07 08:11:03 -07:00
ccurme	f32d57f6f0	anthropic: refactor streaming to use events api; add streaming usage metadata (#22628 ) - Refactor streaming to use raw events; - Add `stream_usage` class attribute and kwarg to stream methods that, if True, will include separate chunks in the stream containing usage metadata. There are two ways to implement streaming with anthropic's python sdk. They have slight differences in how they surface usage metadata. 1. [Use helper functions](https://github.com/anthropics/anthropic-sdk-python?tab=readme-ov-file#streaming-helpers). This is what we are doing now. ```python count = 1 with client.messages.stream(params) as stream: for text in stream.text_stream: snapshot = stream.current_message_snapshot print(f"{count}: {snapshot.usage} -- {text}") count = count + 1 final_snapshot = stream.get_final_message() print(f"{count}: {final_snapshot.usage}") ``` ``` 1: Usage(input_tokens=8, output_tokens=1) -- Hello 2: Usage(input_tokens=8, output_tokens=1) -- ! 3: Usage(input_tokens=8, output_tokens=1) -- How 4: Usage(input_tokens=8, output_tokens=1) -- can 5: Usage(input_tokens=8, output_tokens=1) -- I 6: Usage(input_tokens=8, output_tokens=1) -- assist 7: Usage(input_tokens=8, output_tokens=1) -- you 8: Usage(input_tokens=8, output_tokens=1) -- today 9: Usage(input_tokens=8, output_tokens=1) -- ? 10: Usage(input_tokens=8, output_tokens=12) ``` To do this correctly, we need to emit a new chunk at the end of the stream containing the usage metadata. 2. [Handle raw events](https://github.com/anthropics/anthropic-sdk-python?tab=readme-ov-file#streaming-responses) ```python stream = client.messages.create(params, stream=True) count = 1 for event in stream: print(f"{count}: {event}") count = count + 1 ``` ``` 1: RawMessageStartEvent(message=Message(id='msg_01Vdyov2kADZTXqSKkfNJXcS', content=[], model='claude-3-haiku-20240307', role='assistant', stop_reason=None, stop_sequence=None, type='message', usage=Usage(input_tokens=8, output_tokens=1)), type='message_start') 2: RawContentBlockStartEvent(content_block=TextBlock(text='', type='text'), index=0, type='content_block_start') 3: RawContentBlockDeltaEvent(delta=TextDelta(text='Hello', type='text_delta'), index=0, type='content_block_delta') 4: RawContentBlockDeltaEvent(delta=TextDelta(text='!', type='text_delta'), index=0, type='content_block_delta') 5: RawContentBlockDeltaEvent(delta=TextDelta(text=' How', type='text_delta'), index=0, type='content_block_delta') 6: RawContentBlockDeltaEvent(delta=TextDelta(text=' can', type='text_delta'), index=0, type='content_block_delta') 7: RawContentBlockDeltaEvent(delta=TextDelta(text=' I', type='text_delta'), index=0, type='content_block_delta') 8: RawContentBlockDeltaEvent(delta=TextDelta(text=' assist', type='text_delta'), index=0, type='content_block_delta') 9: RawContentBlockDeltaEvent(delta=TextDelta(text=' you', type='text_delta'), index=0, type='content_block_delta') 10: RawContentBlockDeltaEvent(delta=TextDelta(text=' today', type='text_delta'), index=0, type='content_block_delta') 11: RawContentBlockDeltaEvent(delta=TextDelta(text='?', type='text_delta'), index=0, type='content_block_delta') 12: RawContentBlockStopEvent(index=0, type='content_block_stop') 13: RawMessageDeltaEvent(delta=Delta(stop_reason='end_turn', stop_sequence=None), type='message_delta', usage=MessageDeltaUsage(output_tokens=12)) 14: RawMessageStopEvent(type='message_stop') ``` Here we implement the second option, in part because it should make things easier when implementing streaming tool calls in the near future. This would add two new chunks to the stream-- one at the beginning and one at the end-- with blank content and containing usage metadata. We add kwargs to the stream methods and a class attribute allowing for this behavior to be toggled. I enabled it by default. If we merge this we can add the same kwargs / attribute to OpenAI. Usage: ```python from langchain_anthropic import ChatAnthropic model = ChatAnthropic( model="claude-3-haiku-20240307", temperature=0 ) full = None for chunk in model.stream("hi"): full = chunk if full is None else full + chunk print(chunk) print(f"\nFull: {full}") ``` ``` content='' id='run-8a20843f-25c7-4025-ad72-9add395899e3' usage_metadata={'input_tokens': 8, 'output_tokens': 0, 'total_tokens': 8} content='Hello' id='run-8a20843f-25c7-4025-ad72-9add395899e3' content='!' id='run-8a20843f-25c7-4025-ad72-9add395899e3' content=' How' id='run-8a20843f-25c7-4025-ad72-9add395899e3' content=' can' id='run-8a20843f-25c7-4025-ad72-9add395899e3' content=' I' id='run-8a20843f-25c7-4025-ad72-9add395899e3' content=' assist' id='run-8a20843f-25c7-4025-ad72-9add395899e3' content=' you' id='run-8a20843f-25c7-4025-ad72-9add395899e3' content=' today' id='run-8a20843f-25c7-4025-ad72-9add395899e3' content='?' id='run-8a20843f-25c7-4025-ad72-9add395899e3' content='' id='run-8a20843f-25c7-4025-ad72-9add395899e3' usage_metadata={'input_tokens': 0, 'output_tokens': 12, 'total_tokens': 12} Full: content='Hello! How can I assist you today?' id='run-8a20843f-25c7-4025-ad72-9add395899e3' usage_metadata={'input_tokens': 8, 'output_tokens': 12, 'total_tokens': 20} ```	2024-06-07 13:21:46 +00:00
Bagatur	235d91940d	community[patch]: Release 0.2.4 (#22643 )	2024-06-06 17:47:44 -07:00
Francesco Kruk	344adad056	docs: Update jina embedding notebook to include multimodal capability (#22594 ) After merging the [PR #22416 to include Jina AI multimodal capabilities](https://github.com/langchain-ai/langchain/pull/22416), we updated the Jina AI embedding notebook accordingly.	2024-06-07 00:02:20 +00:00
William FH	be79ce9336	[Core] Unified Enable/Disable Tracing (#22576 )	2024-06-06 16:54:35 -07:00
Leonid Ganeline	57c1239643	docs: `arxiv` page update (#22574 ) Added a link to search the arXiv papers with references to LangChain. Updated table: better format (no horizontal scroll in table anymore).	2024-06-06 16:51:02 -07:00
Bagatur	fe2e5a3b74	langchain[patch]: Release 0.2.3 (#22644 )	2024-06-06 16:29:18 -07:00
Erick Friis	a24a9c6427	multiple: get rid of pyproject extras (#22581 ) They cause `poetry lock` to take a ton of time, and `uv pip install` can resolve the constraints from these toml files in trivial time (addressing problem with #19153) This allows us to properly upgrade lockfile dependencies moving forward, which revealed some issues that were either fixed or type-ignored (see file comments)	2024-06-06 15:45:22 -07:00
Bagatur	4367e89c9a	core[patch]: Release 0.2.5 (#22642 )	2024-06-06 15:44:26 -07:00
Eugene Yurtsev	28f744c1f5	core[patch]: Correctly order parent ids in astream events (from root to immediate parent), add defensive check for cycles (#22637 ) This PR makes two changes: 1. Fixes the order of parent IDs to be from root to immediate parent 2. Adds a simple defensive check for cycles	2024-06-06 20:37:52 +00:00
Satyam Kumar	835926153b	updated oracleai_demo.ipynb (#22635 ) The outer try/except block handles connection errors, and the inner try/except block handles SQL execution errors, providing detailed error messages for both. try: conn = oracledb.connect(user=username, password=password, dsn=dsn) print("Connection successful!") cursor = conn.cursor() try: cursor.execute( """ begin -- Drop user begin execute immediate 'drop user testuser cascade'; exception when others then dbms_output.put_line('Error dropping user: ' \|\| SQLERRM); end; --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-06 20:29:24 +00:00
Eugene Yurtsev	035a9c9609	core[minor]: Add parent_ids to astream_events API (#22563 ) Include a list of parent ids for each event in astream events.	2024-06-06 16:14:28 -04:00
Tomaz Bratanic	67e58fdc2e	docs[patch]: Fix diffbot docs (#22584 )	2024-06-06 16:08:59 -04:00
Eugene Yurtsev	6b8963ad92	docs: Add information about run time binding values to tools (#22623 ) Add how-to guide that shows a design pattern for creating tools at run time	2024-06-06 16:05:34 -04:00
CharlesCNorton	aa49163bdf	docs[patch]: typo in AutoGPT example notebook (#22631 ) Corrected a typo in the AutoGPT example notebook. Changed "Needed synce jupyter runs an async eventloop" to "Needed since Jupyter runs an async event loop". Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-06-06 16:05:11 -04:00
CharlesCNorton	ffe75d1e46	docs: typo in dev container documentation (#22630 ) removed an extra space before the period in the "Click Create codespace on master." line. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-06-06 16:04:48 -04:00
Nicolas Nkiere	51005e2776	core[minor]: Add an async root listener and with_alisteners method (#22151 ) - [x] Adding AsyncRootListener: "langchain_core: Adding AsyncRootListener" - Description: Adding an AsyncBaseTracer, AsyncRootListener and `with_alistener` function. This is to enable binding async root listener to runnables. This currently only supported for sync listeners. - Issue: None - Dependencies: None - [x] Add tests and docs: Added units tests and example snippet code within the function description of `with_alistener` - [x] Lint and test: Run make format_diff, make lint_diff and make test	2024-06-06 16:03:44 -04:00
seyf97	2904c50cd5	openai[patch]: correct grammar in exception message in embeddings/base.py (#22629 ) Correct the grammar error for missing transformers package ValueError	2024-06-06 18:55:04 +00:00
Anush	80560419b0	qdrant[patch]: Make path optional in from_existing_collection() (#21875 ) ## Description The `path` param is used to specify the local persistence directory, which isn't required if using Qdrant server. This is a breaking but necessary change.	2024-06-06 10:37:08 -07:00
ccurme	b57aa89f34	multiple: implement ls_params (#22621 ) implement ls_params for ai21, fireworks, groq.	2024-06-06 16:51:37 +00:00
Xiangrui Meng	f26ab93df8	community: support Databricks Unity Catalog functions as LangChain tools (#22555 ) This PR adds support for using Databricks Unity Catalog functions as LangChain tools, which runs inside a Databricks SQL warehouse. * An example notebook is provided.	2024-06-06 09:38:50 -07:00
ccurme	c1ef731503	anthropic: update attribute name and alias (#22625 ) update name to `stop_sequences` and alias to `stop` (instead of the other way around), since `stop_sequences` is the name used by anthropic.	2024-06-06 12:29:10 -04:00
lucasiscovici	05bf98b2f9	community[patch]: pgvector replace nin_ by not_in (#22619 ) - [ ] community: "pgvector: replace nin_ by not_in" - [ ] PR message: nin_ do not exist in sqlalchemy orm, it's not_in	2024-06-06 12:17:22 -04:00
ccurme	3999761201	multiple: add `stop` attribute (#22573 )	2024-06-06 12:11:52 -04:00
ccurme	e08879147b	Revert "anthropic: stream token usage" (#22624 ) Reverts langchain-ai/langchain#20180	2024-06-06 12:05:08 -04:00
Bagatur	0d495f3f63	anthropic: stream token usage (#20180 ) open to other ideas <img width="1181" alt="Screenshot 2024-04-08 at 5 34 08 PM" src="https://github.com/langchain-ai/langchain/assets/22008038/03eb11c4-5eb5-43e3-9109-a13f76098fa4"> --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-06 11:51:34 -04:00
liuzc9	e0e40f3f63	docs: Fix typo in llmonitor.md (#22590 )	2024-06-06 15:26:51 +00:00
Bagatur	feb73d4281	docs: Add ChatGoogleGenerativeAI to model feat table (#22617 )	2024-06-06 08:07:13 -07:00
Satyam Kumar	17b486a37b	openai, azure: update model_name in ChatResult to use name from API response (#22569 ) The response.get("model", self.model_name) checks if the model key exists in the response dictionary. If it does, it uses that value; otherwise, it uses self.model_name. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-06 11:00:09 -04:00
Suganth Solamanraja	02495ae7c5	docs: Correct return type in docstring (#22597 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: - Description: This PR corrects the return type in the docstring of the `docs/api_reference/create_api_rst.py/_load_package_modules` function. The return type was previously described as a list of Co-authored-by: suganthsolamanraja <suganth.solamanraja@techjays..com>	2024-06-06 14:51:46 +00:00
svmpsp-rc	51942c03eb	docs: correct typos in Italian words (#22606 ) Description Fix typos in Italian words.	2024-06-06 07:46:07 -07:00
Gabriele Ghisleni	95883a99a9	docs: ElasticsearchCacheStore in stores integrations documentation (#22612 ) The package for LangChain integrations with Elasticsearch https://github.com/langchain-ai/langchain-elastic contains a Elasticsearch byte store cache integration (see https://github.com/langchain-ai/langchain-elastic/pull/27). This is the documentation contribution on the page dedicated to stores integrations Co-authored-by: Gabriele Ghisleni <gabriele.ghisleni@spaziodati.eu>	2024-06-06 14:36:43 +00:00
Christophe Bornet	12ddb4fc6f	core[patch]: Use explicit classes for InMemoryByteStore and InMemoryStore (#22608 ) The current implementation doesn't work well with type checking. Instead replace with class definition that correctly works with type checking.	2024-06-06 07:34:43 -07:00
andyjessen	cfed68e06f	docs: Fix description (#22611 ) This commit fixes the description of the hair_color field.	2024-06-06 07:25:27 -07:00
ccurme	1925bde32e	together: bump langchain-core (#22616 ) langchain-together depends on langchain-openai ^0.1.8 langchain-openai 0.1.8 has langchain-core >= 0.2.2 Here we bump langchain-core to 0.2.2, just to pass minimum dependency version tests.	2024-06-06 14:09:40 +00:00
ccurme	35f4aa927b	together[patch]: Release 0.1.3 (#22615 )	2024-06-06 13:58:35 +00:00
Asi Greenholts	f23bec7be6	docs: Fix typo (#22596 ) Fix typo	2024-06-06 08:39:54 -04:00
CharlesCNorton	abb0cecb44	fix: typo in Agents section of README (#22599 ) Corrected the phrase "complete done" to "completely done" for better grammatical accuracy and clarity in the Agents section of the README. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-06 07:44:36 -04:00
Kirushikesh DB	db7e7b69e3	docs: Removed unwanted cell in refine segment (#22604 ) Description: There is one unwanted duplicate cell in refine section of summarization documentation, i have removed it.	2024-06-06 07:40:26 -04:00
andyjessen	8b40428f58	docs: Fix typo (#22603 ) This commit changes minor typo in the field description.	2024-06-06 07:38:36 -04:00
Isaac Francisco	ba3e219d83	community[patch]: recursive url loader fix and unit tests (#22521 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-05 17:56:20 -07:00
Jacob Lee	234394f631	docs[minor]: Add "Build a PDF ingestion and Question/Answering system" tutorial (#22570 ) More direct entrypoint for a common use-case. Meant to give people a more hands-on intro to document loaders/loading data from different data sources as well. Some duplicate content for RAG and extraction (to show what you can do with the loaded documents), but defers to the appropriate sections rather than going too in-depth. @baskaryan @hwchase17	2024-06-05 17:09:28 -07:00
Jeffrey Mak	5fc5ed463c	community[patch]:Support filter for AzureAISearchRetriever (#22303 ) Description: The AzureAISearchRetriever does not support the "$filter" argument offered in the AISearch API: https://learn.microsoft.com/en-us/rest/api/searchservice/documents/search-get?view=rest-searchservice-2023-11-01&tabs=HTTP The $filter allows filtering of indexes based on values in metadata. Issue: https://github.com/langchain-ai/langchain/issues/19885 Dependencies: No Twitter handle: @Jeffreym9M - [ ] Add tests and docs: Not relevant - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-06-05 16:53:19 -07:00
Isaac Francisco	148088a588	docs: duckduckgosearch options listed (#22568 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-05 23:29:47 +00:00
Mikhail Khludnev	ef868bc24b	docs: mentioning query_instruction with regards to BGE-M3 (#22405 ) see https://github.com/langchain-ai/langchain/pull/18017#issuecomment-2143942760 https://huggingface.co/BAAI/bge-m3#faq Co-authored-by: mikhail-khludnev <mikhail_khludnev@rntgroup.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-05 22:44:40 +00:00
X-HAN	62f13f95e4	community[minor]: add DashScope Rerank (#22403 ) Description: this PR adds DashScope Rerank capability to Langchain, you can find DashScope Rerank API from [here](https://help.aliyun.com/document_detail/2780058.html?spm=a2c4g.2780059.0.0.6d995024FlrJ12) & [here](https://help.aliyun.com/document_detail/2780059.html?spm=a2c4g.2780058.0.0.63f75024cr11N9). [DashScope](https://dashscope.aliyun.com/) is the generative AI service from Alibaba Cloud (Aliyun). You can create DashScope API key from [here](https://bailian.console.aliyun.com/?apiKey=1#/api-key). Dependencies: DashScopeRerank depends on `dashscope` python package. Twitter handle: my twitter/x account is https://x.com/LastMonopoly and I'd like a mention, thanks you! Tests and docs 1. integration test: `test_dashscope_rerank.py` 2. example notebook: `dashscope_rerank.ipynb` Lint and test: I have run `make format`, `make lint` and `make test` from the root of the package I've modified. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-05 15:40:21 -07:00
Ethan Yang	29064848f9	[Community]add option to delete the prompt from HF output (#22225 ) This will help to solve pattern mismatching issue when parsing the output in Agent. https://github.com/langchain-ai/langchain/issues/21912	2024-06-05 18:38:54 -04:00
Jacob Lee	c040dc7017	docs[patch]: Adds heading keywords to concepts page (#22577 ) @efriis @baskaryan	2024-06-05 15:28:58 -07:00
Erick Friis	24fa17593f	docs: update agentexecutor title to legacy (#22575 )	2024-06-05 15:09:41 -07:00
Bagatur	584a1e30ac	community[patch]: AzureSearch async functions (#22075 )	2024-06-05 14:39:54 -07:00
Bagatur	1a911018bc	langchain[minor]: add universal init_model (#22039 ) decisions to discuss - only chat models - model_provider isn't based on any existing values like llm-type, package names, class names - implemented as function not as a wrapper ChatModel - function name (init_model) - in langchain as opposed to community or core - marked beta	2024-06-05 14:39:40 -07:00
Isaac Francisco	67012c2558	docs: deprecation of max_length parameter used in Exa search (#22567 )	2024-06-05 12:09:53 -07:00
ccurme	af129974a3	community: update how OpenAIAssistantV2Runnable creates threads with tool_resources (#22549 ) https://github.com/langchain-ai/langchain/issues/22503	2024-06-05 14:19:41 -04:00
Bagatur	51a0d4574e	community[patch]: Release 0.2.3 (#22562 )	2024-06-05 17:27:24 +00:00
Bagatur	b2daba37c7	nomic[patch]: Release 0.1.2 (#22561 )	2024-06-05 17:06:58 +00:00
Zach Nussbaum	14f3014cce	embeddings: nomic embed vision (#22482 ) Thank you for contributing to LangChain! Description: Adds Langchain support for Nomic Embed Vision Twitter handle: nomic_ai,zach_nussbaum - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Lance Martin <122662504+rlancemartin@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-05 09:47:17 -07:00
leila-messallem	3280a5b49b	community[patch]: improve test setup to accurately test filtering of labels in neo4j (#22531 ) Description: This PR addresses an issue with an existing test that was not effectively testing the intended functionality. The previous test setup did not adequately validate the filtering of the labels in neo4j, because the nodes and relationship in the test data did not have any properties set. Without properties these labels would not have been returned, regardless of the filtering. --------- Co-authored-by: Oskar Hane <oh@oskarhane.com>	2024-06-05 15:56:53 +00:00
Mohammad Mohtashim	7fcef2556c	[Experimental]: Async agenerate method ollama functions (#21682 ) - Description: : Added Async method for Generate for OllamaFunctions which was missing and was raising errors for the users. - Issue: #21422	2024-06-05 11:50:36 -04:00
Stefano Lottini	328d0c99f2	community[minor]: Add support for metadata indexing policy in Cassandra vector store (#22548 ) This PR adds a constructor `metadata_indexing` parameter to the Cassandra vector store to allow optional fine-tuning of which fields of the metadata are to be indexed. This is a feature supported by the underlying CassIO library. Indexing mode of "all", "none" or deny- and allow-list based choices are available. The rationale is, in some cases it's advisable to programmatically exclude some portions of the metadata from the index if one knows in advance they won't ever be used at search-time. this keeps the index more lightweight and performant and avoids limitations on the length of _indexed_ strings. I added a integration test of the feature. I also added the possibility of running the integration test with Cassandra on an arbitrary IP address (e.g. Dockerized), via `CASSANDRA_CONTACT_POINTS=10.1.1.5,10.1.1.6 poetry run pytest [...]` or similar. While I was at it, I added a line to the `.gitignore` since the mypy _test_ cache was not ignored yet. My X (Twitter) handle: @rsprrs.	2024-06-05 11:23:26 -04:00
Emilien Chauvet	c3d4126eb1	community[minor]: add user agent for web scraping loaders (#22480 ) Description: This PR adds a `USER_AGENT` env variable that is to be used for web scraping. It creates a util to get that user agent and uses it in the classes used for scraping in [this piece of doc](https://python.langchain.com/v0.1/docs/use_cases/web_scraping/). Identifying your scraper is considered a good politeness practice, this PR aims at easing it. Issue: `None` Dependencies: `None` Twitter handle: `None`	2024-06-05 15:20:34 +00:00
Philippe PRADOS	8250c177de	community[minor]: Add native async support to SQLChatMessageHistory (#22065 ) # package community: Fix SQLChatMessageHistory ## Description Here is a rewrite of `SQLChatMessageHistory` to properly implement the asynchronous approach. The code circumvents [issue 22021](https://github.com/langchain-ai/langchain/issues/22021) by accepting a synchronous call to `def add_messages()` in an asynchronous scenario. This bypasses the bug. For the same reasons as in [PR 22](https://github.com/langchain-ai/langchain-postgres/pull/32) of `langchain-postgres`, we use a lazy strategy for table creation. Indeed, the promise of the constructor cannot be fulfilled without this. It is not possible to invoke a synchronous call in a constructor. We compensate for this by waiting for the next asynchronous method call to create the table. The goal of the `PostgresChatMessageHistory` class (in `langchain-postgres`) is, among other things, to be able to recycle database connections. The implementation of the class is problematic, as we have demonstrated in [issue 22021](https://github.com/langchain-ai/langchain/issues/22021). Our new implementation of `SQLChatMessageHistory` achieves this by using a singleton of type (`Async`)`Engine` for the database connection. The connection pool is managed by this singleton, and the code is then reentrant. We also accept the type `str` (optionally complemented by `async_mode`. I know you don't like this much, but it's the only way to allow an asynchronous connection string). In order to unify the different classes handling database connections, we have renamed `connection_string` to `connection`, and `Session` to `session_maker`. Now, a single transaction is used to add a list of messages. Thus, a crash during this write operation will not leave the database in an unstable state with a partially added message list. This makes the code resilient. We believe that the `PostgresChatMessageHistory` class is no longer necessary and can be replaced by: ``` PostgresChatMessageHistory = SQLChatMessageHistory ``` This also fixes the bug. ## Issue - [issue 22021](https://github.com/langchain-ai/langchain/issues/22021) - Bug in _exit_history() - Bugs in PostgresChatMessageHistory and sync usage - Bugs in PostgresChatMessageHistory and async usage - [issue 36](https://github.com/langchain-ai/langchain-postgres/issues/36) ## Twitter handle: pprados ## Tests - libs/community/tests/unit_tests/chat_message_histories/test_sql.py (add async test) @baskaryan, @eyurtsev or @hwchase17 can you check this PR ? And, I've been waiting a long time for validation from other PRs. Can you take a look? - [PR 32](https://github.com/langchain-ai/langchain-postgres/pull/32) - [PR 15575](https://github.com/langchain-ai/langchain/pull/15575) - [PR 13200](https://github.com/langchain-ai/langchain/pull/13200) --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-06-05 15:10:38 +00:00
Vincent Min	59bef31997	community[minor]: Improve InMemoryVectorStore with ability to persist to disk and filter on metadata. (#22186 ) - Description: The InMemoryVectorStore is a nice and simple vector store implementation for quick development and debugging. The current implementation is quite limited in its functionalities. This PR extends the functionalities by adding utility function to persist the vector store to a json file and to load it from a json file. We choose the json file format because it allows inspection of the database contents in a text editor, which is great for debugging. Furthermore, it adds a `filter` keyword that can be used to filter out documents on their `page_content` or `metadata`. - Issue: - - Dependencies: - - Twitter handle: @Vincent_Min	2024-06-05 10:40:34 -04:00
Christophe Bornet	c34ad8c163	core[patch]: Improve VectorStore API doc (#22547 )	2024-06-05 10:23:44 -04:00
maang-h	89128b7a49	community[patch]: add detailed paragraph and example for BaichuanTextEmbeddings (#22031 ) - Description: add detailed paragraph and example for BaichuanTextEmbeddings - Issue: the issue #21983	2024-06-05 10:18:11 -04:00
Anthony Bernabeu	4e676a63b8	community[minor]: Added filter search for LanceDB (#22461 ) - [ ] community: "vectorstore: added filtering support for LanceDB vector store" - [ ] This PR adds filtering capabilities to LanceDB: - Description: In LanceDB filtering can be applied when searching for data into the vectorstore. It is using the SQL language as mentioned in the LanceDB documentation. - Issue: #18235 - Dependencies: No - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-06-05 09:33:54 -04:00
Erick Friis	4050d6ea2b	huggingface: remove text-generation dep (#22543 )	2024-06-05 12:13:40 +00:00
Erick Friis	a6fc74f379	ai21: fix core version (#22544 )	2024-06-05 08:09:19 -04:00
Asaf Joseph Gardin	75cba742e5	ai21: fix ai21 unittests (#22526 ) Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-05 08:00:42 -04:00
Erick Friis	58192d617f	community: fix huggingface deprecations (#22522 )	2024-06-05 04:13:13 +00:00
Jacob Lee	1e748a6d40	docs[patch]: Adds links to deprecations page (#22514 ) @baskaryan	2024-06-04 16:19:32 -07:00
William FH	91fed3ace7	[Docs] Structured output Keywords (#22511 )	2024-06-04 20:56:05 +00:00
Christophe Bornet	8ba868d3b0	core[patch]: Add similarity_score_threshold to VectorStore search types (#22477 )	2024-06-04 13:43:55 -07:00
Eugene Yurtsev	9120cf5df2	core[patch]: Deduplicate of callback handlers in merge_configs (#22478 ) This PR adds deduplication of callback handlers in merge_configs. Fix for this issue: https://github.com/langchain-ai/langchain/issues/22227 The issue appears when the code is: 1) running python >=3.11 2) invokes a runnable from within a runnable 3) binds the callbacks to the child runnable from the parent runnable using with_config In this case, the same callbacks end up appearing twice: (1) the first time from with_config, (2) the second time with langchain automatically propagating them on behalf of the user. Prior to this PR this will emit duplicate events: ```python @tool async def get_items(question: str, callbacks: Callbacks): # <--- Accept callbacks """Ask question""" template = ChatPromptTemplate.from_messages( [ ( "human", "'{question}" ) ] ) chain = template \| chat_model.with_config( { "callbacks": callbacks, # <-- Propagate callbacks } ) return await chain.ainvoke({"question": question}) ``` Prior to this PR this will work work correctly (no duplicate events): ```python @tool async def get_items(question: str, callbacks: Callbacks): # <--- Accept callbacks """Ask question""" template = ChatPromptTemplate.from_messages( [ ( "human", "'{question}" ) ] ) chain = template \| chat_model return await chain.ainvoke({"question": question}, {"callbacks": callbacks}) ``` This will also work (as long as the user is using python >= 3.11) -- as langchain will automatically propagate callbacks ```python @tool async def get_items(question: str,): """Ask question""" template = ChatPromptTemplate.from_messages( [ ( "human", "'{question}" ) ] ) chain = template \| chat_model return await chain.ainvoke({"question": question}) ```	2024-06-04 16:19:00 -04:00
Jacob Lee	64dbc52cae	docs[patch]: Update quickstart tutorial (#22504 ) Mentions LCEL more, hopefully flags it to more people as a simple entrypoint @baskaryan @hwchase17	2024-06-04 13:04:56 -07:00
Ofer Mendelevitch	ad502e8d50	community[minor]: Vectara Integration Update - Streaming, FCS, Chat, updates to documentation and example notebooks (#21334 ) Thank you for contributing to LangChain! Description: update to the Vectara / Langchain integration to integrate new Vectara capabilities: - Full RAG implemented as a Runnable with as_rag() - Vectara chat supported with as_chat() - Both support streaming response - Updated documentation and example notebook to reflect all the changes - Updated Vectara templates Twitter handle: ofermend Add tests and docs: no new tests or docs, but updated both existing tests and existing docs	2024-06-04 12:57:28 -07:00
Bagatur	cb183a9bf1	docs: update anthropic chat model (#22483 ) Related to #22296 And update anthropic to accept base_url	2024-06-04 12:42:06 -07:00
Erick Friis	d700ce8545	robocorp: typo (#22509 )	2024-06-04 15:33:38 -04:00
Erick Friis	39fd44579a	robocorp: release 0.0.9.post1 (#22507 )	2024-06-04 15:32:30 -04:00
Erick Friis	339e3b7f55	ai21: release 0.1.6 (#22508 )	2024-06-04 15:31:23 -04:00
ccurme	3c53cea760	together, upstage: bump minimum langchain-openai version (#22505 )	2024-06-04 15:20:41 -04:00
Erick Friis	c438b5b78e	docs: fix api ref link generation (#22438 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-04 12:09:22 -07:00
Bagatur	efcb04f84b	mongodb[patch]: Release 0.1.6 (#22501 )	2024-06-04 12:01:37 -07:00
Bagatur	222b1ba112	groq[patch]: Release 0.1.5 (#22500 )	2024-06-04 12:01:17 -07:00
Bagatur	f021be510e	milvus[patch]: Release 0.1.1 (#22499 )	2024-06-04 12:00:53 -07:00
Bagatur	64d68c17cd	upstage[patch]: Release 0.1.6 (#22498 )	2024-06-04 11:58:44 -07:00
Bagatur	48fba40fce	experimental[patch]: Release 0.0.60 (#22497 )	2024-06-04 11:56:42 -07:00
Bagatur	e60f88ccdd	community[patch]: Release 0.2.2 (#22496 )	2024-06-04 11:42:11 -07:00
Bagatur	85aa218564	langchain[patch]: Release 0.2.2 (#22495 )	2024-06-04 11:33:45 -07:00
Bagatur	8e86080def	mistralai[patch]: Release 0.1.8 (#22494 )	2024-06-04 11:33:06 -07:00
Bagatur	e850de2422	huggingface[patch]: release 0.0.2 (#22493 )	2024-06-04 11:32:36 -07:00
Jacob Lee	593de8a913	docs[patch]: Add robots.txt and root sitemap (#22492 ) CC @efriis @baskaryan	2024-06-04 11:26:40 -07:00
Bagatur	99a3cad258	text-splitters[patch]: Release 0.2.1 (#22490 )	2024-06-04 11:19:21 -07:00
Bagatur	161b02a8be	core[patch]: Release 0.2.4 (#22489 )	2024-06-04 11:14:54 -07:00
Ragul Kachiappan	50258a7dda	docs: Update chroma docs link for collection reference (#22472 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: - Description: Updated dead link referencing chroma docs in Chroma notebook under vectorstores	2024-06-04 18:01:13 +00:00
nareshnagpal06	9b45374118	docs: Added Semantic Cache Example with BedrockChat using Bedrock Embedding… (#22190 ) …s and Opensearch Semantic Cache Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-04 17:40:29 +00:00
Joydeep Banik Roy	3796672c67	community, milvus, pinecone, qdrant, mongo: Broadcast operation failure while using simsimd beyond v3.7.7 (#22271 ) - [ ] Packages affected: - community: fix `cosine_similarity` to support simsimd beyond 3.7.7 - partners/milvus: fix `cosine_similarity` to support simsimd beyond 3.7.7 - partners/mongodb: fix `cosine_similarity` to support simsimd beyond 3.7.7 - partners/pinecone: fix `cosine_similarity` to support simsimd beyond 3.7.7 - partners/qdrant: fix `cosine_similarity` to support simsimd beyond 3.7.7 - [ ] Broadcast operation failure while using simsimd beyond v3.7.7: - Description: I was using simsimd 4.3.1 and the unsupported operand type issue popped up. When I checked out the repo and ran the tests, they failed as well (have attached a screenshot for that). Looks like it is a variant of https://github.com/langchain-ai/langchain/issues/18022 . Prior to 3.7.7, simd.cdist returned an ndarray but now it returns simsimd.DistancesTensor which is ineligible for a broadcast operation with numpy. With this change, it also remove the need to explicitly cast `Z` to numpy array - Issue: #19905 - Dependencies: No - Twitter handle: https://x.com/GetzJoydeep <img width="1622" alt="Screenshot 2024-05-29 at 2 50 00 PM" src="https://github.com/langchain-ai/langchain/assets/31132555/fb27b383-a9ae-4a6f-b355-6d503b72db56"> - [ ] Considerations: 1. I started with community but since similar changes were there in Milvus, MongoDB, Pinecone, and QDrant so I modified their files as well. If touching multiple packages in one PR is not the norm, then I can remove them from this PR and raise separate ones 2. I have run and verified that the tests work. Since, only MongoDB had tests, I ran theirs and verified it works as well. Screenshots attached : <img width="1573" alt="Screenshot 2024-05-29 at 2 52 13 PM" src="https://github.com/langchain-ai/langchain/assets/31132555/ce87d1ea-19b6-4900-9384-61fbc1a30de9"> <img width="1614" alt="Screenshot 2024-05-29 at 3 33 51 PM" src="https://github.com/langchain-ai/langchain/assets/31132555/6ce1d679-db4c-4291-8453-01028ab2dca5"> I have added a test for simsimd. I feel it may not go well with the CI/CD setup as installing simsimd is not a dependency requirement. I have just imported simsimd to ensure simsimd cosine similarity is invoked. However, its not a good approach. Suggestions are welcome and I can make the required changes on the PR. Please provide guidance on the same as I am new to the community. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-04 17:36:31 +00:00
KyrianC	03178ee74f	community[minor]: Add tools calls to `ChatEdenAI` (#22320 ) ### Description Add tools implementation to `ChatEdenAI`: - `bind_tools()` - `with_structured_output()` ### Documentation Updated `docs/docs/integrations/chat/edenai.ipynb` ### Notes We don´t support stream with tools as of yet. If stream is called with tools we directly yield the whole message from `generate` (implemented the same way as Anthropic did).	2024-06-04 10:29:28 -07:00
pranavvuppala	9d4350e69a	docs : Update docstrings for OpenAI base.py (#22221 ) - [x] PR title: Update docstrings for OpenAI base.py -Description: Updated the docstring of few OpenAI functions for a better understanding of the function. - Issue: #21983 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-04 17:24:17 +00:00
Anindyadeep	7a197539aa	communty[patch]: Native RAG Support in Prem AI langchain (#22238 ) This PR adds native RAG support in langchain premai package. The same has been added in the docs too.	2024-06-04 10:19:54 -07:00
Rahul Triptahi	77ad857934	community[minor]: Enable retrieval api calls in PebbloRetrievalQA (#21958 ) Description: Enable app discovery and Prompt/Response apis in PebbloSafeRetrieval Documentation: NA Unit test: N/A --------- Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-06-04 10:18:50 -07:00
liugz18	8fd231086e	experimental[patch]: Fix graph_transformers llms #21482 (#22417 ) Fix AttributeError on calling LLMGraphTransformer.convert_to_graph_documents #21482 since raw_schema is always a str @baskaryan	2024-06-04 17:07:38 +00:00
ccurme	6db25b4e31	core[patch]: bump langsmith (#22476 ) Noticing errors logged in some situations when tracing with Langsmith: ```python from langchain_core.pydantic_v1 import BaseModel from langchain_anthropic import ChatAnthropic class AnswerWithJustification(BaseModel): """An answer to the user question along with justification for the answer.""" answer: str justification: str llm = ChatAnthropic(model="claude-3-haiku-20240307") structured_llm = llm.with_structured_output(AnswerWithJustification) list(structured_llm.stream("What weighs more a pound of bricks or a pound of feathers")) ``` ``` Error in LangChainTracer.on_chain_end callback: AttributeError("'NoneType' object has no attribute 'append'") [AnswerWithJustification(answer='A pound of bricks and a pound of feathers weigh the same amount.', justification='This is because a pound is a unit of mass, not volume. By definition, a pound of any material, whether bricks or feathers, will weigh the same - one pound. The physical size or volume of the materials does not matter when measuring by mass. So a pound of bricks and a pound of feathers both weigh exactly one pound.')] ```	2024-06-04 10:05:53 -07:00
Bagatur	17c127531a	community[patch]: deprecate all HF classes (#22444 )	2024-06-04 09:48:25 -07:00
Nuno Campos	58b118544e	Use immutable sequence type for batch/batch_as_completed types (#22433 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-06-04 08:04:09 -07:00
Christophe Bornet	9a8fe58ebe	community[minor]: Improve Cassandra VectorStore as_retriever (#22465 ) The Vectorstore's API `as_retriever` doesn't expose explicitly the parameters `search_type` and `search_kwargs` and so these are not well documented. This PR improves `as_retriever` for the Cassandra VectorStore by making these parameters explicit. NB: An alternative would have been to modify `as_retriever` in `Vectorstore`. But there's probably a good reason these were not exposed in the first place ? Is it because implementations may decide to not support them and have fixed values when creating the VectorStoreRetriever ?	2024-06-04 09:51:17 -04:00
Christophe Bornet	23bba18f92	core[patch]: Fix VectorStore's as_retriever mutating tags param (#22470 ) The current VectorStore `as_retriever` implementation mutates the `tags` param when it's passed in kwargs. This fix ensures that a copy is done.	2024-06-04 09:50:36 -04:00
Michal Gregor	98b2e7b195	huggingface[patch]: Support for HuggingFacePipeline in ChatHuggingFace. (#22194 ) - Description: Added support for using HuggingFacePipeline in ChatHuggingFace (previously it was only usable with API endpoints, probably by oversight). - Issue: #19997 - Dependencies: none - Twitter handle: none --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-04 00:47:35 +00:00
Fahreddin Özcan	0061ded002	community[patch]: Upstash Vector Store Namespace Support (#22251 ) This PR introduces namespace support for Upstash Vector Store, which would allow users to partition their data in the vector index. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-03 17:30:56 -07:00
Isaac Francisco	25cf1a74d5	docs: rag tutorial small fixes (#22450 )	2024-06-04 00:16:54 +00:00
Jacob Lee	b0f014666d	docs[patch]: Adds search keywords for common queries (#22449 ) CC @baskaryan @efriis @ccurme	2024-06-03 16:30:17 -07:00
Guangdong Liu	bc7e32f315	core(patch):fix partial_variables not working with SystemMessagePromptTemplate (#20711 ) - Issue: close #17560 - @baskaryan, @eyurtsev	2024-06-03 16:22:42 -07:00
Martin Kolb	f2dd31b9e8	docs: Fix doc issue for HANA Cloud Vector Engine (#22260 ) - Description: This PR fixes a rendering issue in the docs (Python notebook) of HANA Cloud Vector Engine. - Issue: N/A - Dependencies: no new dependencies added File of the fixed notebook: `docs/docs/integrations/vectorstores/hanavector.ipynb`	2024-06-03 15:53:43 -07:00
Dristy Srivastava	ef3df45d9d	community[minor]: Updating payload for pebblo discover API (#22309 ) Description: Updating response for pebblo discover API. Also updating filed name case type Documentation: N/A Unit tests: N/A	2024-06-03 15:36:17 -07:00
Miroslav	cbd5720011	huggingface[patch]: Skip Login to HuggingFaceHub when token is not set (#22365 )	2024-06-03 15:20:32 -07:00
Stefano Lottini	f78ae1d932	docs: Astra DB vectorstore, add automatic-embedding example (#22350 ) Description: Adding an example showcasing the newly-introduced API-side embedding computation option for the Astra DB vector store	2024-06-03 15:13:57 -07:00
bhardwaj-vipul	f397a84a59	langchain[patch]: Fix MongoDBAtlasVectorSearch reference in self query retriever (#22401 ) Description: SelfQuery Retriever with MongoDBAtlasVectorSearch (from langchain_mongodb import MongoDBAtlasVectorSearch) and Chroma (from langchain_chroma import Chroma) is not supported. The imports in the [builtin translators](`8cbce684d4/libs/langchain/langchain/retrievers/self_query/base.py (L73)`) points to the [deprecated](`acaf214a45/libs/community/langchain_community/vectorstores/mongodb_atlas.py (L36)`) vectorstore. Issue: #22272 --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-03 22:10:15 +00:00
ccurme	afe89a1411	community: add standard chat model params to Ollama (#22446 )	2024-06-03 17:45:03 -04:00
Isaac Francisco	5119ab2fb9	docs: agents tutorial wording (#22447 )	2024-06-03 14:40:01 -07:00
Ethan Yang	52da6a160d	community[patch]: Update OpenVINO embedding and reranker to support static input shape (#22171 ) It can help to deploy embedding models on NPU device	2024-06-03 13:27:17 -07:00
Tom Clelford	c599732e1a	text-splitters[patch]: fix HTMLSectionSplitter parsing of xslt paths (#22176 ) ## Description This PR allows passing the HTMLSectionSplitter paths to xslt files. It does so by fixing two trivial bugs with how passed paths were being handled. It also changes the default value of the param `xslt_path` to `None` so the special case where the file was part of the langchain package could be handled. ## Issue #22175	2024-06-03 20:26:59 +00:00
maang-h	01352bb55f	community[minor]: Implement MiniMaxChat interface (#22391 ) - Description: Implement MiniMaxChat interface, include: - No longer inherits the LLM class (like other chat model) - Update request parameters (v1 -> v2) - update `base url` - update message role (system, user, assistant) - add `stream` function - no longer use `group id` - Implement the `_stream`, `_agenerate`, and `_astream` interfaces [minimax v2 api document](https://platform.minimaxi.com/document/guides/chat-model/V2?id=65e0736ab2845de20908e2dd)	2024-06-03 13:22:38 -07:00
Brandon Sharp	56e5aa4dd9	community[patch]: Airtable to allow for addtl params (#22092 ) - [X] PR title: "community: added optional params to Airtable table.all()" - [X] PR message: - Description: Add's kwargs to AirtableLoader to allow for kwargs: https://pyairtable.readthedocs.io/en/latest/api.html#pyairtable.Table.all - Issue: N/A - Dependencies: N/A - Twitter handle: parakoopa88 - [X] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [X] Lint and test**: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-03 13:05:56 -07:00
Harichandan Roy	1f751343e2	community[patch]: update embeddings/oracleai.py (#22240 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" "community/embeddings: update oracleai.py" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! Adding oracle VECTOR_ARRAY_T support. - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. Tests are not impacted. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Done. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-06-03 12:38:51 -07:00
maang-h	13140dc4ff	community[patch]: Update the default api_url and reqeust_body of sparkllm embedding (#22136 ) - Description: When I was running the SparkLLMTextEmbeddings, app_id, api_key and api_secret are all correct, but it cannot run normally using the current URL. ```python # example from langchain_community.embeddings import SparkLLMTextEmbeddings embedding= SparkLLMTextEmbeddings( spark_app_id="my-app-id", spark_api_key="my-api-key", spark_api_secret="my-api-secret" ) embedding= "hello" print(spark.embed_query(text1)) ``` ![sparkembedding](https://github.com/langchain-ai/langchain/assets/55082429/11daa853-4f67-45b2-aae2-c95caa14e38c) So I updated the url and request body parameters according to [Embedding_api](https://www.xfyun.cn/doc/spark/Embedding_api.html), now it is runnable.	2024-06-03 12:38:11 -07:00
Yuwen Hu	ba0dca46d7	community[minor]: Add IPEX-LLM BGE embedding support on both Intel CPU and GPU (#22226 ) Description: [IPEX-LLM](https://github.com/intel-analytics/ipex-llm) is a PyTorch library for running LLM on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) with very low latency. This PR adds ipex-llm integrations to langchain for BGE embedding support on both Intel CPU and GPU. Dependencies: `ipex-llm`, `sentence-transformers` Contribution maintainer: @Oscilloscope98 tests and docs: - langchain/docs/docs/integrations/text_embedding/ipex_llm.ipynb - langchain/docs/docs/integrations/text_embedding/ipex_llm_gpu.ipynb - langchain/libs/community/tests/integration_tests/embeddings/test_ipex_llm.py --------- Co-authored-by: Shengsheng Huang <shannie.huang@gmail.com>	2024-06-03 12:37:10 -07:00
Jacob Lee	c01467b1f4	core[patch]: RFC: Allow concatenation of messages with multi part content (#22002 ) Anthropic's streaming treats tool calls as different content parts (streamed back with a different index) from normal content in the `content`. This means that we need to update our chunk-merging logic to handle chunks with multi-part content. The alternative is coerceing Anthropic's responses into a string, but we generally like to preserve model provider responses faithfully when we can. This will also likely be useful for multimodal outputs in the future. This current PR does unfortunately make `index` a magic field within content parts, but Anthropic and OpenAI both use it at the moment to determine order anyway. To avoid cases where we have content arrays with holes and to simplify the logic, I've also restricted merging to chunks in order. TODO: tests CC @baskaryan @ccurme @efriis	2024-06-03 09:46:40 -07:00
Dan	86509161b0	community: fix AzureSearch delete documents (#22315 ) Description Fix AzureSearch delete documents method by using FIELDS_ID variable instead of the hard coded "id" value Issue: This is linked to this issue: https://github.com/langchain-ai/langchain/issues/22314 Co-authored-by: dseban <dan.seban@neoxia.com>	2024-06-03 15:55:06 +00:00
Harrison Chase	8fad2e209a	fix error message (#22437 ) Was confusing when language is in Enum but not implemented	2024-06-03 15:48:26 +00:00
Bagatur	678a19a5f7	infra: bump anthropic mypy 1 (#22373 )	2024-06-03 08:21:55 -07:00
Nuno Campos	ceb73ad06f	core: In BaseRetriever make get_relevant_docs delegate to invoke (#22434 ) - This fixes all the tracing issues with people still using get_relevant_docs, and a change we need for 0.3 anyway Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-06-03 07:34:53 -07:00
Zheng Robert Jia	1ad1dc5303	docs: resolve minor syntax error. (#22375 ) Used the correct magic command. Changed from `% pip...` to `%pip` Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-03 14:34:24 +00:00
Charles John	2d81a72884	community: fix missing `apify_api_token` field in ApifyWrapper (#22421 ) - Description: The `ApifyWrapper` class expects `apify_api_token` to be passed as a named parameter or set as an environment variable. But the corresponding field was missing in the class definition causing the argument to be ignored when passed as a named param. This patch fixes that.	2024-06-03 14:32:57 +00:00
Klaudia Lemiec	dac355fc62	docs: notebook loader: change .html to .ipynb (#22407 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-03 14:26:28 +00:00
Joan Fontanals	a7ae16f912	add `embed_image` API to JinaEmbedding (#22416 ) - Description: Add `embed_image` to JinaEmbedding to embed images - Twitter handle: https://x.com/JinaAI_	2024-06-03 10:23:37 -04:00
Qingchuan Hao	3e92ed8056	docs: add Microsoft Azure to ChatModelTabs (#22367 ) Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-03 10:19:00 -04:00
Nuno Campos	ed8e9c437a	core: In RunnableSequence pass kwargs to the first step (#22393 ) - This is a pattern that shows up occasionally in langgraph questions, people chain a graph to something else after, and want to pass the graph some kwargs (eg. stream_mode)	2024-06-03 14:18:10 +00:00
Jeffrey Morgan	eabcfaa3d6	Update Ollama instructions (#22394 )	2024-06-03 10:17:35 -04:00
Harrison Chase	acaf214a45	update agent docs (#22370 ) to use create_react_agent --------- Co-authored-by: William Fu-Hinthorn <13333726+hinthornw@users.noreply.github.com>	2024-06-01 08:28:32 -07:00
Jacob Lee	16cce76a68	👥 Update LangChain people data (#22388 ) 👥 Update LangChain people data Co-authored-by: github-actions <github-actions@github.com>	2024-06-01 07:36:45 -07:00
Jacob Lee	8a57102918	docs[patch]: Fix typo (#22377 )	2024-05-31 16:37:05 -07:00
Bagatur	4d82cea71f	docs: fix llm caches redirect (#22371 )	2024-05-31 19:37:06 +00:00
Bagatur	a8098f5ddb	anthropic[patch]: Release 0.1.15, fix sdk tools break (#22369 )	2024-05-31 12:10:22 -07:00
Erick Friis	6ffa0acf32	ai21: fix text-splitters version (#22366 )	2024-05-31 11:41:05 -04:00
Erick Friis	1bad0ac946	docs: redirect integration links to 0.2 (#22326 )	2024-05-31 11:40:48 -04:00
ccurme	8cbce684d4	docs: update retriever how-to content (#22362 ) - [x] How to: use a vector store to retrieve data - [ ] How to: generate multiple queries to retrieve data for - [x] How to: use contextual compression to compress the data retrieved - [x] How to: write a custom retriever class - [x] How to: add similarity scores to retriever results ^ done last month - [x] How to: combine the results from multiple retrievers - [x] How to: reorder retrieved results to mitigate the "lost in the middle" effect - [x] How to: generate multiple embeddings per document ^ this PR - [ ] How to: retrieve the whole document for a chunk - [ ] How to: generate metadata filters - [ ] How to: create a time-weighted retriever - [ ] How to: use hybrid vector and keyword retrieval ^ todo	2024-05-31 10:57:35 -04:00
Jacob Lee	75ed9ee929	docs: Fix Solar and OCI integration page typos (#22343 ) @efriis @baskaryan	2024-05-31 10:36:12 -04:00
Bagatur	0214246dc6	docs: list tool calling models (#22334 )	2024-05-30 14:32:33 -07:00
Bagatur	410e9add44	infra: run scheduled tests on aws, google, cohere, nvidia (#22328 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-30 13:57:12 -07:00
Harrison Chase	0c9a034ed7	add simpler agent tutorial (#22249 ) 1/ added section at start with full code 2/ removed retriever tool (was just distracting) 3/ added section on starting a new conversation --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-30 12:33:32 -07:00
Bagatur	2b9f1469d8	core[patch]: Release 0.2.3 (#22329 )	2024-05-30 11:35:09 -07:00
Harrison Chase	ee32369265	core[patch]: fix runnable history and add docs (#22283 )	2024-05-30 11:26:41 -07:00
William FH	dcec133b85	[Core] Update Tracing Interops (#22318 ) LangSmith and LangChain context var handling evolved in parallel since originally we didn't expect people to want to interweave the decorator and langchain code. Once we get a new langsmith release, this PR will let you seemlessly hand off between @traceable context and runnable config context so you can arbitrarily nest code. It's expected that this fails right now until we get another release of the SDK	2024-05-30 10:34:49 -07:00
ccurme	f34337447f	openai: update ChatOpenAI api ref (#22324 ) Update to reflect that token usage is no longer default in streaming mode. Add detail for streaming context under Token Usage section.	2024-05-30 12:31:28 -04:00
ChengZi	2443e85533	docs: fix milvus import and update template (#22306 ) docs: fix milvus import problem update milvus-rag template with milvus-lite Signed-off-by: ChengZi <chen.zhang@zilliz.com>	2024-05-30 08:28:55 -07:00
WU LIFU	86698b02a9	doc: fix wrong documentation on FAISS load_local function (#22310 ) ### Issue: #22299 ### descriptions The documentation appears to be wrong. When the user actually sets this parameter "asynchronous" to be True, it fails because the __init__ function of FAISS class doesn't allow this parameter. In fact, most of the class/instance functions of this class have both the sync/async version, so it looks like what we need is just to remove this parameter from the doc. Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. Co-authored-by: Lifu Wu <lifu@nextbillion.ai>	2024-05-30 15:15:04 +00:00
maang-h	596c062cba	community[patch]: Standardize qianfan model init args name (#22322 ) - Description: - Standardize qianfan chat model intialization arguments name - qianfan_ak (qianfan api key) -> api_key - qianfan_sk (qianfan secret key) -> secret_key - Delete unuse variable - Issue: #20085	2024-05-30 11:08:32 -04:00
KhoPhi	c64b0a3095	Docs: Ollama (LLM, Chat Model & Text Embedding) (#22321 ) - [x] Docs Update: Ollama - llm/ollama - Switched to using llama3 as model with reference to templating and prompting - Added concurrency notes to llm/ollama docs - chat_models/ollama - Added concurrency notes to llm/ollama docs - text_embedding/ollama - include example for specific embedding models from Ollama	2024-05-30 11:06:45 -04:00
Dobiichi-Origami	10b12e1c08	community: adding tool_call_id for every ToolCall (#22323 ) - Description: This PR contains a bugfix which result in malfunction of multi-turn conversation in QianfanChatEndpoint and adaption for ToolCall and ToolMessage	2024-05-30 10:59:08 -04:00
Bagatur	569d325a59	docs: link GH org (#22308 )	2024-05-30 00:17:59 -07:00
Bagatur	93049d1563	docs: make llm cache its own section (#22301 )	2024-05-30 00:17:33 -07:00
Bagatur	04631439c9	docs: add v0.2 links to README (#22300 )	2024-05-29 16:22:01 -07:00
ccurme	f39e1a2288	community, docs: update token usage tracking callback + how-to guides (#22145 )	2024-05-29 17:00:47 -04:00
Bagatur	2bc50fb895	docs, cli[patch]: chat model template nit (#22294 )	2024-05-29 20:53:58 +00:00
Bagatur	aa6c31df53	cli[patch]: Release 0.0.24 (#22293 )	2024-05-29 13:37:34 -07:00
Bagatur	627a337887	docs, cli[patch]: chat model doc template (#22290 ) Update ChatModel integration doc template, integration docstring, and adds langchain-cli command to easily create just doc (for updating existing integrations): ```bash langchain-cli integration create-doc --name "foo-bar" ```	2024-05-29 13:34:58 -07:00
Wu Enze	f40e341a03	docs : Added integrations for memory with langchain_community (#22265 ) PR title: Integration Docs enhancement Description: Adding installation instructions for integrations requiring langchain-community package since 0.2 Issue: [#22005](https://github.com/langchain-ai/langchain/issues/22005)	2024-05-29 16:12:05 -04:00
ccurme	6e1df72a88	openai[patch]: Release 0.1.8 (#22291 )	2024-05-29 20:08:30 +00:00
ccurme	e71b0b5827	core[patch]: Release 0.2.2 (#22289 )	2024-05-29 19:51:37 +00:00
William FH	9d6cabe84a	Update sequence.ipynb (#22288 )	2024-05-29 19:34:44 +00:00
Daniel Glogowski	7ff05357ba	docs: updating NIM documentation (#22258 ) Updating NVIDIA NIM notebooks and readme file. Thanks! Daniel	2024-05-29 10:28:39 -07:00
Bagatur	6dd0f095c3	docs: revamp ChatOpenAI (#22253 ) Can build API ref docs by running ```bash make api_docs_clean; make api_docs_quick_preview API_PKG=openai ``` only builds openai ref, takes ~20 sec	2024-05-29 10:20:14 -07:00
Erick Friis	00c70d98c2	robocorp: release 0.0.9 (#22282 )	2024-05-29 16:49:18 +00:00
Mikko Korpela	fc5909ad6f	langchain-robocorp: Fix parsing of Union types (such as Optional). (#22277 )	2024-05-29 09:47:02 -07:00
ccurme	af1f723ada	openai: don't override stream_options default (#22242 ) ChatOpenAI supports a kwarg `stream_options` which can take values `{"include_usage": True}` and `{"include_usage": False}`. Setting include_usage to True adds a message chunk to the end of the stream with usage_metadata populated. In this case the final chunk no longer includes `"finish_reason"` in the `response_metadata`. This is the current default and is not yet released. Because this could be disruptive to workflows, here we remove this default. The default will now be consistent with OpenAI's API (see parameter [here](https://platform.openai.com/docs/api-reference/chat/create#chat-create-stream_options)). Examples: ```python from langchain_openai import ChatOpenAI llm = ChatOpenAI() for chunk in llm.stream("hi"): print(chunk) ``` ``` content='' id='run-8cff4721-2acd-4551-9bf7-1911dae46b92' content='Hello' id='run-8cff4721-2acd-4551-9bf7-1911dae46b92' content='!' id='run-8cff4721-2acd-4551-9bf7-1911dae46b92' content='' response_metadata={'finish_reason': 'stop'} id='run-8cff4721-2acd-4551-9bf7-1911dae46b92' ``` ```python for chunk in llm.stream("hi", stream_options={"include_usage": True}): print(chunk) ``` ``` content='' id='run-39ab349b-f954-464d-af6e-72a0927daa27' content='Hello' id='run-39ab349b-f954-464d-af6e-72a0927daa27' content='!' id='run-39ab349b-f954-464d-af6e-72a0927daa27' content='' response_metadata={'finish_reason': 'stop'} id='run-39ab349b-f954-464d-af6e-72a0927daa27' content='' id='run-39ab349b-f954-464d-af6e-72a0927daa27' usage_metadata={'input_tokens': 8, 'output_tokens': 9, 'total_tokens': 17} ``` ```python llm = ChatOpenAI().bind(stream_options={"include_usage": True}) for chunk in llm.stream("hi"): print(chunk) ``` ``` content='' id='run-59918845-04b2-41a6-8d90-f75fb4506e0d' content='Hello' id='run-59918845-04b2-41a6-8d90-f75fb4506e0d' content='!' id='run-59918845-04b2-41a6-8d90-f75fb4506e0d' content='' response_metadata={'finish_reason': 'stop'} id='run-59918845-04b2-41a6-8d90-f75fb4506e0d' content='' id='run-59918845-04b2-41a6-8d90-f75fb4506e0d' usage_metadata={'input_tokens': 8, 'output_tokens': 9, 'total_tokens': 17} ```	2024-05-29 10:30:40 -04:00
Karim Lalani	a1899439fc	[experimental][llms][ollama_functions] Update OllamaFunctions to send `tool_calls` attribute (#21625 ) Update OllamaFunctions to return `tool_calls` for AIMessages when used for tool calling.	2024-05-29 09:38:33 -04:00
Bagatur	d61bdeba25	core[patch]: allow access RunnableWithFallbacks.runnable attrs (#22139 ) RFC, candidate fix for #13095 #22134	2024-05-28 13:18:09 -07:00
SteveLiao	7496fe2b16	Update parent_document_retriever.py about kwargs (#22219 ) Add kwargs in add_documents function langchain: Add kwargs in parent_document_retriever" - Add kwargs for `add_document` in `parent_document_retriever.py` If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-05-28 11:35:38 -07:00
Mark Cusack	8dfa3c5f1a	Update/fix docs to list Yellowbrick as a supported indexed vectorstore (#22235 ) Update/fix docs to list Yellowbrick as a supported indexed vectorstore and fix the Jupyter notebook.	2024-05-28 11:34:49 -07:00
Erick Friis	93240fac68	milvus: fix core dep (#22239 )	2024-05-28 10:21:37 -07:00
Erick Friis	611faa22c7	infra: allow first releases 2 (#22237 )	2024-05-28 09:53:21 -07:00
Erick Friis	26c6e4a5ef	infra: allow first releases (#22236 )	2024-05-28 09:39:40 -07:00
ChengZi	404d92ded0	milvus: New langchain_milvus package and new milvus features (#21077 ) New features: - New langchain_milvus package in partner - Milvus collection hybrid search retriever - Zilliz cloud pipeline retriever - Milvus Local guid - Rag-milvus template --------- Signed-off-by: ChengZi <chen.zhang@zilliz.com> Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> Co-authored-by: Jael Gu <mengjia.gu@zilliz.com> Co-authored-by: Jackson <jacksonxie612@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Erick Friis <erickfriis@gmail.com>	2024-05-28 08:24:20 -07:00
Leonid Ganeline	d7f70535ba	docs: `arxiv` page, added cookbooks (#22215 ) Issue: The `arXiv` page is missing the arxiv paper references from the `langchain/cookbook`. PR: Added the cookbook references. Result: `Found 29 arXiv references in the 3 docs, 21 API Refs, 5 Templates, and 18 Cookbooks.` - much more references are visible now.	2024-05-27 15:47:02 -07:00
Leonid Ganeline	d6995e814b	ai21[patch]: added `license` (#22153 ) The `pyproject.toml` missed the `license` parameter. I've added it as `MIT`	2024-05-27 15:14:14 -07:00
Maddy Adams	8332a36f69	infra: update langchainhub and add integration test (#22154 ) Description: Update langchainhub integration test dependency and add an integration test for pulling private prompt Dependencies: langchainhub 0.1.16	2024-05-27 14:58:10 -07:00
Will Higgins	83d10df78d	community[patch]: Update firecrawl api key name (#22183 ) Change 'FIREWALL' to 'FIRECRAWL' as I believe this may have been in error. Other docs refer to 'FIRECRAWL_API_KEY'. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-27 21:39:29 +00:00
hmasdev	bbd7015b5d	core[patch]: Add `TypeError` handler into `get_graph` of `Runnable` (#19856 ) # Description ## Problem `Runnable.get_graph` fails when `InputType` or `OutputType` property raises `TypeError`. - `003c98e5b4/libs/core/langchain_core/runnables/base.py (L250-L274)` - `003c98e5b4/libs/core/langchain_core/runnables/base.py (L394-L396)` This problem prevents getting a graph of `Runnable` objects whose `InputType` or `OutputType` property raises `TypeError` but whose `invoke` works well, such as `langchain.output_parsers.RegexParser`, which I have already pointed out in #19792 that a `TypeError` would occur. ## Solution - Add `try-except` syntax to handle `TypeError` to the codes which get `input_node` and `output_node`. # Issue - #19801 # Twitter Handle - [hmdev3](https://twitter.com/hmdev3) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-27 21:34:34 +00:00
acho98	753353411f	docs: Fix Clova embeddings example document (#22181 ) - [ ] PR title: "Fix list handling in Clova embeddings example documentation" - Description: Fixes a bug in the Clova Embeddings example documentation where document_text was incorrectly wrapped in an additional list. - Rationale The embed_documents method expects a list, but the previous example wrapped document_text in an unnecessary additional list, causing an error. The updated example correctly passes document_text directly to the method, ensuring it functions as intended.	2024-05-27 14:31:34 -07:00
Mohammad Mohtashim	577ed68b59	mistralai[patch]: Added Json Mode for ChatMistralAI (#22213 ) - Description: Powered [ChatMistralAI.with_structured_output](`fbfed65fb1/libs/partners/mistralai/langchain_mistralai/chat_models.py (L609)`) via json mode - Issue: #22081 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-27 21:16:52 +00:00
Pranith	25c270b5a5	docs : Added integrations for tools with langchain_community (#22188 ) PR title: Docs enhancement Description: Adding installation instructions for integrations requiring langchain-community package since 0.2 Issue: https://github.com/langchain-ai/langchain/issues/22005	2024-05-27 14:06:40 -07:00
Ibrahim	cfea0e231a	Update llm_chain.ipynb text (#22198 ) Added the missing verb "is" and a comma to the text in the Prompt Templates description within the Build a Simple LLM Application tutorial for more clarity.	2024-05-27 19:57:41 +00:00
Aditya	bf81ecd3b4	docs:updated documentation for llama, falcon and gemma on Vertex AI Model garden (#22201 ) - Description: updated documentation for llama, falcona and gemma on Vertex AI Model garden - Issue: NA - Dependencies: NA - Twitter handle: NA @lkuligin for review --------- Co-authored-by: adityarane@google.com <adityarane@google.com>	2024-05-27 12:56:11 -07:00
Pavlo Paliychuk	342df7cf83	community[minor]: Add Zep Cloud components + docs + examples (#21671 ) Thank you for contributing to LangChain! - [x] PR title: community: Add Zep Cloud components + docs + examples - [x] PR message: We have recently released our new zep-cloud sdks that are compatible with Zep Cloud (not Zep Open Source). We have also maintained our Cloud version of langchain components (ChatMessageHistory, VectorStore) as part of our sdks. This PRs goal is to port these components to langchain community repo, and close the gap with the existing Zep Open Source components already present in community repo (added ZepCloudMemory,ZepCloudVectorStore,ZepCloudRetriever). Also added a ZepCloudChatMessageHistory components together with an expression language example ported from our repo. We have left the original open source components intact on purpose as to not introduce any breaking changes. - Issue: - - Dependencies: Added optional dependency of our new cloud sdk `zep-cloud` - Twitter handle: @paulpaliychuk51 - [x] Add tests and docs - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-27 12:50:13 -07:00
Jan Soubusta	cccc8fbe2f	community[patch]: DuckDB VS - expose similarity, improve performance of from_texts (#20971 ) 3 fixes of DuckDB vector store: - unify defaults in constructor and from_texts (users no longer have to specify `vector_key`). - include search similarity into output metadata (fixes #20969) - significantly improve performance of `from_documents` Dependencies: added Pandas to speed up `from_documents`. I was thinking about CSV and JSON options, but I expect trouble loading JSON values this way and also CSV and JSON options require storing data to disk. Anyway, the poetry file for langchain-community already contains a dependency on Pandas. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-05-24 15:17:52 -07:00
Surya Pratap Singh Shekhawat	42207f5bef	Update agent_executor.ipynb (#22104 ) fixed typos in the doc. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-05-24 22:14:41 +00:00
Erick Friis	8acadc34f5	docs: edit links, direct for notebooks (#22051 )	2024-05-24 19:44:46 +00:00
Erick Friis	42ffcb2ff1	anthropic: release 0.1.14rc2, test release note gen (#22147 )	2024-05-24 12:40:10 -07:00
Erick Friis	6ee8de62c0	infra: auto-generated release notes based on git log (#22141 ) Generates release notes based on a `git log` command with title names Aiming to improve to splitting out features vs. bugfixes using conventional commits in the coming weeks. Will work for any monorepo packages	2024-05-24 11:43:28 -07:00
Ameya Shenoy	8ba492ed6a	community[minor]: clickhouse -- ability to use secure connection (#22108 ) - Description: this PR gives clickhouse client the ability to use a secure connection to the clickhosue server - Issue: fixes #22082 - Dependencies: - - Twitter handle: `_codingcoffee_` Signed-off-by: Ameya Shenoy <shenoy.ameya@gmail.com> Co-authored-by: Shresth Rana <shresth@grapevine.in>	2024-05-24 17:30:22 +00:00
ccurme	9a010fb761	openai: read stream_options (#21548 ) OpenAI recently added a `stream_options` parameter to its chat completions API (see [release notes](https://platform.openai.com/docs/changelog/added-chat-completions-stream-usage)). When this parameter is set to `{"usage": True}`, an extra "empty" message is added to the end of a stream containing token usage. Here we propagate token usage to `AIMessage.usage_metadata`. We enable this feature by default. Streams would now include an extra chunk at the end, after the chunk with `response_metadata={'finish_reason': 'stop'}`. New behavior: ``` [AIMessageChunk(content='', id='run-4b20dbe0-3817-4f62-b89d-03ef76f25bde'), AIMessageChunk(content='Hello', id='run-4b20dbe0-3817-4f62-b89d-03ef76f25bde'), AIMessageChunk(content='!', id='run-4b20dbe0-3817-4f62-b89d-03ef76f25bde'), AIMessageChunk(content='', response_metadata={'finish_reason': 'stop'}, id='run-4b20dbe0-3817-4f62-b89d-03ef76f25bde'), AIMessageChunk(content='', id='run-4b20dbe0-3817-4f62-b89d-03ef76f25bde', usage_metadata={'input_tokens': 8, 'output_tokens': 9, 'total_tokens': 17})] ``` Old behavior (accessible by passing `stream_options={"include_usage": False}` into (a)stream: ``` [AIMessageChunk(content='', id='run-1312b971-c5ea-4d92-9015-e6604535f339'), AIMessageChunk(content='Hello', id='run-1312b971-c5ea-4d92-9015-e6604535f339'), AIMessageChunk(content='!', id='run-1312b971-c5ea-4d92-9015-e6604535f339'), AIMessageChunk(content='', response_metadata={'finish_reason': 'stop'}, id='run-1312b971-c5ea-4d92-9015-e6604535f339')] ``` From what I can tell this is not yet implemented in Azure, so we enable only for ChatOpenAI.	2024-05-24 13:20:56 -04:00
Patrick Zhang	eb7c767e5b	docs: update the name of the tool passio_nutrition_ai (#22116 ) Updating the name of the Passion Nutrition AI tool so that the name of the tool is correctly displayed in the sidebar menu. Currently the name of the tool says "Quickstart" in the side bar. The patch fixed the name to be Passio Nutrition AI. <img width="681" alt="image" src="https://github.com/langchain-ai/langchain/assets/4603110/9609975e-78ea-4032-9024-10c4f838170a">	2024-05-24 17:15:16 +00:00
Leonid Ganeline	fd4ee08167	docs: `integrations/platforms/microsoft` update (#22100 ) Added the `Azure Container Apps dynamic sessions` tool reference	2024-05-24 13:14:51 -04:00
Rahul Triptahi	1a485f59b9	community[patch]: Put authorized identities behind a feature flag in SharepointLoader (#22125 ) Description: Put authorised identities behind a feature flag, load_auth. Documentation: N/A Unit tests: N/A --------- Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-05-24 12:42:57 -04:00
Anindyadeep	ee689412ab	docs: Update PremAI Docs (#22114 ) Thank you for contributing to LangChain! - [X] PR title: community: Updated langchain-community PremAI documentation - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-05-24 11:55:32 -04:00
sasha	1c9ceff503	community: add metadata to chain logging; (#22122 ) Hey, I'm Sasha. The SDK engineer from [Comet](https://comet.com). This PR updates the CometTracer class. Added metadata to CometTracerr. From now on, both chains and spans will send it.	2024-05-24 15:29:40 +00:00
Jirka Lhotka	7c0459faf2	community: Update costs of openai finetuned models (#22124 ) - Description: Update costs of finetuned models and add gpt-3-turbo-0125. Source: https://openai.com/api/pricing/ - Issue: N/A - Dependencies: None	2024-05-24 15:25:17 +00:00
Eugene Yurtsev	d3db83abe3	community[major]: lint for usage of xml library (#22132 ) * Lint for usage of standard xml library * Add forced opt-in for quip client * Actual security issue is with underlying QuipClient not LangChain integration (since the client is doing the parsing), but adding enforcement at the LangChain level.	2024-05-24 15:23:53 +00:00
Tom Aarsen	5b5ea2af30	docs: Add explanation on how to use Hugging Face embeddings (#22118 ) - Description: I've added a tab on embedding text with LangChain using Hugging Face models to here: https://python.langchain.com/v0.2/docs/how_to/embed_text/. HF was mentioned in the running text, but not in the tabs, which I thought was odd. - Issue: N/A - Dependencies: N/A - Twitter handle: No need, this is tiny :) Also, I had a ton of issues with the poetry docs/lint install, so I haven't linted this. Apologies for that. cc @Jofthomas - Tom Aarsen	2024-05-24 11:21:03 -04:00
Bagatur	baa3c975cb	anthropic[patch]: allow tool call mutation (#22130 ) If tool_use blocks and tool_calls with overlapping IDs are present, prefer the values of the tool_calls. Allows for mutating AIMessages just via tool_calls.	2024-05-24 08:18:14 -07:00
Christophe Bornet	c838de5027	doc: Add doc for CassandraByteStore (#22126 ) Preview: https://langchain-git-fork-cbornet-doc-cassandrabytestore-langchain.vercel.app/v0.2/docs/integrations/stores/cassandra/	2024-05-24 10:57:55 -04:00
Vadym Barda	2edb512282	docs: improve how-to docs for message history (#22072 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-23 20:12:24 -04:00
Artem	eb7c453b98	docs: update `hub.pull("rlm/map-prompt")` to `hub.pull("rlm/reduce-prompt")` for reduce prompt (#22088 ) PR message: Update `hub.pull("rlm/map-prompt")` to `hub.pull("rlm/reduce-prompt")` in summarization.ipynb Description: Fix typo in prompt hub link from `reduce_prompt = hub.pull("rlm/map-prompt")` to `reduce_prompt = hub.pull("rlm/reduce-prompt")` following next issue Issue: #22014 Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-23 23:07:37 +00:00
Leonid Ganeline	2416737c5f	docs: compact the API Reference links (#21285 ) This PR is opinionated. Issue: the `API Reference` sections in the examples hold too much vertical space and make us scroll the page too much. See an [example](https://python.langchain.com/docs/get_started/quickstart/#conversation-retrieval-chain). These sections are important. So, the compacting should not make these sections less noticeable. Change: compacting the `API Reference` sections. See the [same example after change applied](https://langchain-j6nya46lf-langchain.vercel.app/docs/get_started/quickstart/#conversation-retrieval-chain). It is more compact and now looks like references (footnotes). Note: I would also change the section style, so it would be more noticeable (maybe to look like the footnotes. Smaller wider font?) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-23 15:50:23 -07:00
ccurme	0ea1e89b2c	groq: read tool calls from .tool_calls attribute (#22096 )	2024-05-23 18:16:06 -04:00
Bagatur	96c21dfe56	docs: hf feat table tool calling (#22091 )	2024-05-23 15:09:30 -07:00
Eugene Yurtsev	63004a0945	codespell ignore remaining issues (#22097 )	2024-05-23 21:51:39 +00:00
Eugene Yurtsev	2d693c484e	docs: fix some spelling mistakes caught by newest version of code spell (#22090 ) Going to merge this even though it doesn't pass all tests, and open a separate PR for the remaining spelling mistakes.	2024-05-23 16:59:11 -04:00
Bagatur	38783d07c9	infra: api docs quick preview (#22093 )	2024-05-23 13:57:45 -07:00
Pavel Zloi	fe26f937e4	community[minor]: ManticoreSearch engine added to vectorstore (#19117 ) Description: ManticoreSearch engine added to vectorstores Issue: no issue, just a new feature Dependencies: https://pypi.org/project/manticoresearch-dev/ Twitter handle: @EvilFreelancer - Example notebook with test integration: https://github.com/EvilFreelancer/langchain/blob/manticore-search-vectorstore/docs/docs/integrations/vectorstores/manticore_search.ipynb --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-23 13:56:18 -07:00
Erick Friis	95c3e5f85f	cli: model name substitution fix, release 0.0.23 (#22089 )	2024-05-23 13:09:38 -07:00
Kartheek Yakkala	18b8c8628a	docs : Added integrations for tools with langchain_community (#22056 ) - PR title: Docs enhancement - Description: Adding installation instructions for integrations requiring `langchain-community` package since 0.2 - Issue: https://github.com/langchain-ai/langchain/issues/22005	2024-05-23 15:09:34 -04:00
ccurme	152c8cac33	anthropic, openai: cut pre-releases (#22083 )	2024-05-23 15:02:23 -04:00
ccurme	cd07521170	core: bump to 0.2.1rc (#22080 )	2024-05-23 18:36:50 +00:00
Harrison Chase	170cc8aec3	docs: add multi-modal-docs (#21734 ) We dont really have any abstractions around multi-modal... so add a section explaining we dont have any abstrations and then how to guides for openai and anthropic (probably need to add for more) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Tomaz Bratanic <bratanic.tomaz@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: junefish <junefish@users.noreply.github.com> Co-authored-by: William Fu-Hinthorn <13333726+hinthornw@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-23 18:33:25 +00:00
ccurme	fbfed65fb1	core, partners: add token usage attribute to AIMessage (#21944 ) ```python class UsageMetadata(TypedDict): """Usage metadata for a message, such as token counts. Attributes: input_tokens: (int) count of input (or prompt) tokens output_tokens: (int) count of output (or completion) tokens total_tokens: (int) total token count """ input_tokens: int output_tokens: int total_tokens: int ``` ```python class AIMessage(BaseMessage): ... usage_metadata: Optional[UsageMetadata] = None """If provided, token usage information associated with the message.""" ... ```	2024-05-23 14:21:58 -04:00
Bagatur	3d26807b92	community[patch]: Release. 0.2.1 (#22073 )	2024-05-23 10:40:32 -07:00
Bagatur	2d968213d7	langchain[patch]: Release 0.2.1 (#22074 )	2024-05-23 10:09:36 -07:00
maang-h	9aba9e3e33	community[patch]: Update the default “API URL” and “MODEL” of sparkllm (#22070 ) - Description: When I was running the sparkllm, I found that the default parameters currently used could no longer run correctly. - original parameters & values: - spark_api_url: "wss://spark-api.xf-yun.com/v3.1/chat" - spark_llm_domain: "generalv3" ```python # example from langchain_community.chat_models import ChatSparkLLM spark = ChatSparkLLM(spark_app_id="my_app_id", spark_api_key="my_api_key", spark_api_secret="my_api_secret") spark.invoke("hello") ``` ![sparkllm](https://github.com/langchain-ai/langchain/assets/55082429/5369bfdf-4305-496a-bcf5-2d3f59d39414) So I updated them to 3.5 (same as sparkllm official website). After the update, they can be used normally. - new parameters & values: - spark_api_url: "wss://spark-api.xf-yun.com/v3.5/chat" - spark_llm_domain: "generalv3.5"	2024-05-23 12:25:20 -04:00
junkeon	4fda7bf4f2	upstage[patch] : fix error handling in Layout Analysis parser (#22054 ) This pull request addresses and fixes exception handling in the UpstageLayoutAnalysisParser and enhances the test coverage by adding error exception tests for the document loader. These improvements ensure robust error handling and increase the reliability of the system when dealing with external API calls and JSON responses. ### Changes Made 1. Fix Request Exception Handling: - Issue: The existing implementation of UpstageLayoutAnalysisParser did not properly handle exceptions thrown by the requests library, which could lead to unhandled exceptions and potential crashes. - Solution: Added comprehensive exception handling for requests.RequestException to catch any request-related errors. This includes logging the error details and raising a ValueError with a meaningful error message. 2. Add Error Exception Tests for Document Loader: - New Tests: Introduced new test cases to verify the robustness of the UpstageLayoutAnalysisLoader against various error scenarios. The tests ensure that the loader gracefully handles: - RequestException: Simulates network issues or invalid API requests to ensure appropriate error handling and user feedback. - JSONDecodeError: Simulates scenarios where the API response is not a valid JSON, ensuring the system does not crash and provides clear error messaging.	2024-05-23 11:45:34 -04:00
JuHyung Son	d9eff44400	partner-upstage[patch]: embeddings empty list bug (#22057 ) Fixed an error in `embed_documents` when the input was given as an empty list. And I have revised the document.	2024-05-23 11:44:30 -04:00
Martin Triska	2df8ac402a	community[minor]: Added propagation of document metadata from O365BaseLoader (#20663 ) Description: - Added propagation of document metadata from O365BaseLoader to FileSystemBlobLoader (O365BaseLoader uses FileSystemBlobLoader under the hood). - This is done by passing dictionary `metadata_dict`: key=filename and value=dictionary containing document's metadata - Modified `FileSystemBlobLoader` to accept the `metadata_dict`, use `mimetype` from it (if available) and pass metadata further into blob loader. Issue: - `O365BaseLoader` under the hood downloads documents to temp folder and then uses `FileSystemBlobLoader` on it. - However metadata about the document in question is lost in this process. In particular: - `mime_type`: `FileSystemBlobLoader` guesses `mime_type` from the file extension, but that does not work 100% of the time. - `web_url`: this is useful to keep around since in RAG LLM we might want to provide link to the source document. In order to work well with document parsers, we pass the `web_url` as `source` (`web_url` is ignored by parsers, `source` is preserved) Dependencies: None Twitter handle: @martintriska1 Please review @baskaryan --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-05-23 11:42:19 -04:00
Eugene Yurtsev	e5541d1da7	community[patch]: Update doc-string in CloudBlobLoader (#22069 ) Update doc-string	2024-05-23 15:31:41 +00:00
Maxime Perrin	8ba4f77734	docs : Adding correct imports to the integrations callbacks doc (#22059 ) - Description: Adding correct imports to the integrations callbacks doc (langchain-community package) - Issue: #22005 --------- Co-authored-by: Maxime Perrin <mperrin@doing.fr>	2024-05-23 11:27:36 -04:00
Philippe PRADOS	6dd621d636	community[minor]: Add CloudBlobLoader that supports loading data from cloud buckets (#21957 ) Thank you for contributing to LangChain! - [ ] PR title: "Add CloudBlobLoader" - community: Add CloudBlobLoader - [ ] PR message: Add cloud blob loader - Description: Langchain provides several approaches to read different file formats: Specific loaders (`CVSLoader`) or blob-compatible loaders (`FileSystemBlobLoader`). The only implementation proposed for BlobLoader is `FileSystemBlobLoader`. Many projects retrieve files from cloud storage. We propose a new implementation of `BlobLoader` to read files from the three cloud storage systems. The interface is strictly identical to `FileSystemBlobLoader`. The only difference is the constructor, which takes a cloud "url" object such as `s3://my-bucket`, `az://my-bucket`, or `gs://my-bucket`. By streamlining the process, this novel implementation eliminates the requirement to pre-download files from cloud storage to local temporary files (which are seldom removed). The code relies on the [CloudPathLib](https://cloudpathlib.drivendata.org/stable/) library to interpret cloud URLs. This has been added as an optional dependency. ```Python loader = CloudBlobLoader("s3://mybucket/id") for blob in loader.yield_blobs(): print(blob) ``` - [X] Dependencies: CloudPathLib - [X] Twitter handle: pprados - [X] Add tests and docs: Add unit test, but it's easy to convert to integration test, with some files in a cloud storage (see `test_cloud_blob_loader.py`) - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. Hello from Paris @hwchase17. Can you review this PR? --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-05-23 10:59:55 -04:00
Christophe Bornet	74947ec894	community[minor]: Add Cassandra ByteStore (#22064 )	2024-05-23 10:46:23 -04:00
Christophe Bornet	fea6b99b16	community[minor]: Add async methods to CassandraChatMessageHistory (#21975 )	2024-05-23 10:13:05 -04:00
Eugene Yurtsev	37cfc00310	docs: concepts callbacks fix admonition (#22048 ) Correct the admonition text	2024-05-22 20:33:28 -04:00
Erick Friis	53293dace8	docs: version increases (#22050 )	2024-05-22 16:20:10 -07:00
Sky	12d65f17ff	community[patch]: surrealdb provide functions for MMR (Maximal Marginal Relevance) (#21185 ) This PR contains 4 added functions: - max_marginal_relevance_search_by_vector - amax_marginal_relevance_search_by_vector - max_marginal_relevance_search - amax_marginal_relevance_search I'm no langchain expert, but tried do inspect other vectorstore sources like chroma, to build these functions for SurrealDB. If someone has some changes for me, please let me know. Otherwise I would be happy, if these changes are added to the repository, so that I can use the orignal repo and not my local monkey patched version. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-22 22:53:55 +00:00
Erick Friis	58b6c72375	docs: add astream v2 migration guide links (#21845 ) - docs: v0.2 version sidebar - x - x	2024-05-22 15:48:42 -07:00
Bruno Alvisio	5eabe90494	community[patch]: Adding HEADER to the list of supported locations (#21946 ) Description: adds headers to the list of supported locations when generating the openai function schema	2024-05-22 22:47:56 +00:00
Bagatur	50186da0a1	infra: rm unused # noqa violations (#22049 ) Updating #21137	2024-05-22 15:21:08 -07:00
acho98	45ed5f3f51	community[minor]: Add Clova Embeddings for LangChain Community (#21890 ) - [ ] PR title: "Add Naver ClovaX embedding to LangChain community" - HyperClovaX is a large language model developed by [Naver](https://clova-x.naver.com/welcome). It's a powerful and purpose-trained LLM. - You can visit the embedding service provided by [ClovaX](https://www.ncloud.com/product/aiService/clovaStudio) - You may get CLOVA_EMB_API_KEY, CLOVA_EMB_APIGW_API_KEY, CLOVA_EMB_APP_ID From https://www.ncloud.com/product/aiService/clovaStudio --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-22 22:08:47 +00:00
arpitkumar980	444c2a3d9f	community[patch]: sharepoint loader identity enabled (#21176 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines:https://github.com/arpitkumar980/langchain.git - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-22 22:08:31 +00:00
Eugene Yurtsev	8a877120c3	docs: add admonitions to how-to callbacks (#22046 ) Add admonitions with more information.	2024-05-22 22:05:57 +00:00
HuiyuanYan	bf3aefce93	community[patch]: Update tongyi.py to support MultimodalConversation in dashscope. (#21249 ) Add the support of multimodal conversation in dashscope,now we can use multimodal language model "qwen-vl-v1", "qwen-vl-chat-v1", "qwen-audio-turbo" to processing picture an audio. :) - [ ] PR title: "community: add multimodal conversation support in dashscope" - [ ] PR message: *Delete this entire checklist* and replace with - Description: add multimodal conversation support in dashscope - Issue: - Dependencies: dashscope≥1.18.0 - Twitter handle: none :) - [ ] How to use it?: - ```python Tongyi_chat = ChatTongyi( top_p=0.5, dashscope_api_key=api_key, model="qwen-vl-v1" ) response= Tongyi_chat.invoke( input = [ { "role": "user", "content": [ {"image": "https://dashscope.oss-cn-beijing.aliyuncs.com/images/dog_and_girl.jpeg"}, {"text": "这是什么?"} ] } ] ) ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-22 22:04:58 +00:00
mochi	63284ffebf	experimental[patch], docs: refine notebook for MyScale `SelfQueryRetriever` (#22016 ) - Description: upgrade model to `gpt-4o`	2024-05-22 21:49:01 +00:00
MSubik	d948783a4c	community[patch]: standardize init args, update for javelin sdk release. (#21980 ) Related to [20085](https://github.com/langchain-ai/langchain/issues/20085) Updated the Javelin chat model to standardize the initialization argument. Also fixed an existing bug, where code was initialized with incorrect call to the JavelinClient defined in the javelin_sdk, resulting in an initialization error. See related [Javelin Documentation](https://docs.getjavelin.io/docs/javelin-python/quickstart).	2024-05-22 21:47:28 +00:00
Mohammad Mohtashim	16617dd239	community[patch]: AzureSearchVectorStoreRetriever Fixed to account for search_kwargs (#21572 ) - Description: Fixed `AzureSearchVectorStoreRetriever` to account for search_kwargs. More explanation is in the mentioned issue. - Issue: #21492 --------- Co-authored-by: MAC <mac@MACs-MacBook-Pro.local> Co-authored-by: Massimiliano Pronesti <massimiliano.pronesti@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-22 14:46:41 -07:00
Klaudia Lemiec	45351d1bc6	docs: Chroma docstrings update (#22001 ) Thank you for contributing to LangChain! - [X] PR title: "docs: Chroma docstrings update" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [X] PR message: - Description: Added and updated Chroma docstrings - Issue: https://github.com/langchain-ai/langchain/issues/21983 - [X] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - only docs - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-05-22 21:45:30 +00:00
Jerron Lim	28456c2c33	community[patch]: add args_schema to WikipediaQueryRun (#22019 ) Description: This change adds args_schema (pydantic BaseModel) to WikipediaQueryRun for correct schema formatting on LLM function calls Issue: currently using WikipediaQueryRun with OpenAI function calling returns the following error "TypeError: WikipediaQueryRun._run() got an unexpected keyword argument '__arg1' ". This happens because the schema sent to the LLM is "input: '{"__arg1":"Hunter x Hunter"}'" while the method should be called with the "query" parameter. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-22 21:31:58 +00:00
Mazen Ramadan	3c1d77dd64	community[minor]: Add Scrapfly Loader community integration (#22036 ) Added [Scrapfly](https://scrapfly.io/) Web Loader integration. Scrapfly is a web scraping API that allows extracting web page data into accessible markdown or text datasets. - __Description__: Added Scrapfly web loader for retrieving web page data as markdown or text. - Dependencies: scrapfly-sdk - Twitter: @thealchemi1st --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-22 21:29:13 +00:00
Chad Juliano	9a66c43146	docs: Use Kinetica Sql context API (#21993 ) Update python notebook to use new Kinetica SQL context API.	2024-05-22 14:26:20 -07:00
ccurme	b51a1eba4d	langchain, community: move OpenAIAssistantV2Runnable to community (#22044 )	2024-05-22 21:22:50 +00:00
Mirna Wong	b4d5f3181b	docs: updates code examples in neo4j_cypher.ipynb (#21973 ) Resolves #19134 Thank you for contributing to LangChain! - [x ] PR message: *Delete this entire checklist* and replace with - Description: this pr replaces `title` with `name` in the [add examples in cypher generation prompt](https://python.langchain.com/v0.1/docs/integrations/graphs/neo4j_cypher/#add-examples-in-the-cypher-generation-prompt) section. - Issue: 19134 - Dependencies: any dependencies required for this change - Twitter handle: @mirna_wong	2024-05-22 20:48:09 +00:00
CaroFG	6b98140b38	community[patch]: update for compatibility with Meilisearch v1.8 (#21979 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: Updates Meilisearch vectorstore for compatibility with v1.8. Adds [”showRankingScore”: true”](https://www.meilisearch.com/docs/reference/api/search#ranking-score) in the search parameters and replaces `_semanticScore` field with ` _rankingScore` - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-05-22 13:37:01 -07:00
Oleksii Pokotylo	98c0b093bb	community[patch]: Extend AzureSearch with `maximal_marginal_relevance`, `from_embeddings` (#21065 ) Description: - Extend AzureSearch with `maximal_marginal_relevance` (for vector and hybrid search) - Add construction `from_embeddings` - if the user has already embedded the texts - Add `add_embeddings` - Refactor common parts (`_simple_search`, `_results_to_documents`, `_reorder_results_with_maximal_marginal_relevance`) - Add `vector_search_dimensions` as a parameter to the constructor to avoid extra calls to `embed_query` (most of the time the user applies the same model and knows the dimension) Issue: none Dependencies: none - [x] Add tests and docs: The docstrings have been added to the new functions, and unified for the existing ones. The example notebook is great in illustrating the main usage of AzureSearch, adding the new methods would only dilute the main content. - [x] Lint and test --------- Co-authored-by: Oleksii Pokotylo <oleksii.pokotylo@pwc.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-22 13:36:06 -07:00
Erick Friis	ed5914ff61	docs: move feedback into paginator from content (#22041 ) we only index what's in the `<article>` tags for search. We should not have the feedback in the article.	2024-05-22 13:21:27 -07:00
SaschaStoll	709664a079	community[patch]: Performant filter columns option for Hanavector (#21971 ) Description: Backwards compatible extension of the initialisation interface of HanaDB to allow the user to specify specific_metadata_columns that are used for metadata storage of selected keys which yields increased filter performance. Any not-mentioned metadata remains in the general metadata column as part of a JSON string. Furthermore switched to executemany for batch inserts into HanaDB. Issue: N/A Dependencies: no new dependencies added Twitter handle: @sapopensource --------- Co-authored-by: Martin Kolb <martin.kolb@sap.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-22 13:21:21 -07:00
Bagatur	16b55b0704	langchain[patch]: remove dataclasses-json dep (#22042 ) vestigial dep afaict	2024-05-22 13:20:57 -07:00
Christos Boulmpasakos	c3bcfad66d	text-splitters[patch]: Extend TextSplitter:keep_separator functionality (#21130 ) Description: Added extra functionality to `CharacterTextSplitter`, `TextSplitter` classes. The user can select whether to append the separator to the previous chunk with `keep_separator='end' ` or else prepend to the next chunk. Previous functionality prepended by default to next chunk. Issue: Fixes #20908 --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-22 13:17:45 -07:00
Bagatur	b859765752	docs: fix partner api ref build (#22007 )	2024-05-22 13:16:07 -07:00
Eric Zhang	e7e41eaabe	langchain: add RankLLM Reranker (#21171 ) Integrate RankLLM reranker (https://github.com/castorini/rank_llm) into LangChain An example notebook is given in `docs/docs/integrations/retrievers/rankllm-reranker.ipynb` --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-05-22 20:12:55 +00:00
Eugene Yurtsev	14a9c7c44e	concepts: update callback concepts (#22040 ) Update callback concepts	2024-05-22 15:58:02 -04:00
maang-h	fc93bed8c4	community: Fix CSVLoader columns is None (#20701 ) - Bug code: In langchain_community/document_loaders/csv_loader.py:100 - Description: currently, when 'CSVLoader' reads the column as None in the 'csv' file, it will report an error because the 'CSVLoader' does not verify whether the column is of str type and does not consider how to handle the corresponding 'row_data' when the column is' None 'in the csv. This pr provides a solution. - Issue: Fix #20699 - thinking: 1. Refer to the processing method for 'langchain_community/document_loaders/csv_loader.py:100' when 'v' equals'None', and apply the same method to 'k'. (Reference`csv.DictReader` ,'k' will only be None when ` len(columns) < len(number_row_data)` is established) 2. ‘k’ equals None only holds when it is the last column, and its corresponding 'v' type is a list. Therefore, I referred to the data format in 'Document' and used ',' to concatenated the elements in the list.(But I'm not sure if you accept this form, if you have any other ideas, communicate) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-22 12:57:46 -07:00
Nithin James Padayatti	403142eaba	langchain: added revision_example prompt template (#20916 ) Description: Added revision_example prompt template to include the revision request and revision examples in the revision chain. Issue: Not Applicable Dependencies: Not Applicable Twitter handle: @nithinjp09	2024-05-22 19:57:32 +00:00
Sihan Chen	1f81277b9b	community[minor]: allow enabling proxy in aiohttp session in AsyncHTML (#19499 ) Allow enabling proxy in aiohttp session async html	2024-05-22 18:25:06 +00:00
Eugene Yurtsev	36813d2f00	community[patch]: Fix remaining __inits__ in community (#22037 ) Fixes the __init__ files in community to use __all__ which is statically defined.	2024-05-22 17:42:17 +00:00
Eugene Yurtsev	b7d08bf764	docs: update doc feedback to populate URL (#22033 ) Update docfeedback to populate URL	2024-05-22 13:38:11 -04:00
Eugene Yurtsev	58360a1e53	community[patch]: Add unit test to verify that init is correctly defined (#22030 ) Fix some __init__ files and add a unit test	2024-05-22 17:19:00 +00:00
Erick Friis	ef53ccf54b	robocorp: release 0.0.8 (#22034 )	2024-05-22 16:41:41 +00:00
Eugene Yurtsev	4633b4cf2b	ci: update documentation template to include URL (#22032 ) update documentation template to include URL	2024-05-22 12:01:28 -04:00
Matthew Hoffman	4f2e3bd7fd	community[patch]: fix public interface for embeddings module (#21650 ) ## Description The existing public interface for `langchain_community.emeddings` is broken. In this file, `__all__` is statically defined, but is subsequently overwritten with a dynamic expression, which type checkers like pyright do not support. pyright actually gives the following diagnostic on the line I am requesting we remove: [reportUnsupportedDunderAll](https://github.com/microsoft/pyright/blob/main/docs/configuration.md#reportUnsupportedDunderAll): ``` Operation on "__all__" is not supported, so exported symbol list may be incorrect ``` Currently, I get the following errors when attempting to use publicablly exported classes in `langchain_community.emeddings`: ```python import langchain_community.embeddings langchain_community.embeddings.HuggingFaceEmbeddings(...) # error: "HuggingFaceEmbeddings" is not exported from module "langchain_community.embeddings" (reportPrivateImportUsage) ``` This is solved easily by removing the dynamic expression.	2024-05-22 11:42:15 -04:00
Maxime Perrin	6548052f9e	docs : Integrations vector stores with langchain-community install (#22028 ) - Description: Adding installation instruction for integrations requiring `langchain-community` package since 0.2 - Issue: #22005 --------- Co-authored-by: Maxime Perrin <mperrin@doing.fr>	2024-05-22 15:32:01 +00:00
Eugene Yurtsev	8d82160a8a	community[patch]: Clean up logic in import checking unit test (#22026 ) Clean up unit test	2024-05-22 15:30:10 +00:00
Tomaz Bratanic	d8a1f1114d	community[patch]: Handle exceptions where node props aren't consistent in neo4j schema (#22027 )	2024-05-22 11:21:56 -04:00
WeichenXu	b0ef5e778a	community[patch]: Fix ChatDatabricsk in case that streaming response doesn't have role field in delta chunk (#21897 ) Thank you for contributing to LangChain! - [X] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" Description: Fix ChatDatabricsk in case that streaming response doesn't have role field in delta chunk - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Signed-off-by: Weichen Xu <weichen.xu@databricks.com>	2024-05-22 08:12:53 -07:00
Eugene Yurtsev	aed64daabb	community[patch]: Add unit test to catch bad __all__ definitions (#21996 ) This will catch all dynamic __all__ definitions.	2024-05-22 09:32:13 -04:00
Brian Thorne	25ba733218	docs: Update import in wikipedia tool documentation (#21565 ) Updates docs so the example doesn't lead to a warning: ``` LangChainDeprecationWarning: Importing tools from langchain is deprecated. Importing from langchain will no longer be supported as of langchain==0.2.0. Please import from langchain-community instead: `from langchain_community.tools import WikipediaQueryRun`. To install langchain-community run `pip install -U langchain-community`. ```	2024-05-21 17:20:51 -07:00
Bagatur	3b0437c05b	core[patch]: Release 0.2.1 (#22003 )	2024-05-22 00:05:04 +00:00
Kefan You	24b5c27bb1	community[patch]: raise_for_status logic missing in async _fetch of WebBaseLoader (#21948 ) ## 'raise_for_status' parameter of WebBaseLoader works in sync load but not in async load. In webBaseLoader: Sync load is calling `_scrape` and has `raise_for_status` properly handled. ``` def _scrape( self, url: str, parser: Union[str, None] = None, bs_kwargs: Optional[dict] = None, ) -> Any: from bs4 import BeautifulSoup if parser is None: if url.endswith(".xml"): parser = "xml" else: parser = self.default_parser self._check_parser(parser) html_doc = self.session.get(url, self.requests_kwargs) if self.raise_for_status: html_doc.raise_for_status() if self.encoding is not None: html_doc.encoding = self.encoding elif self.autoset_encoding: html_doc.encoding = html_doc.apparent_encoding return BeautifulSoup(html_doc.text, parser, (bs_kwargs or {})) ``` Async load is calling `_fetch` but missing `raise_for_status` logic. ``` async def _fetch( self, url: str, retries: int = 3, cooldown: int = 2, backoff: float = 1.5 ) -> str: async with aiohttp.ClientSession() as session: for i in range(retries): try: async with session.get( url, headers=self.session.headers, ssl=None if self.session.verify else False, cookies=self.session.cookies.get_dict(), ) as response: return await response.text() ``` Co-authored-by: kefan.you <darkfss@sina.com>	2024-05-21 23:51:03 +00:00
Mateusz Szewczyk	80f8fe1793	docs: update IBM WatsonxLLM docs with deprecated LLMChain (#21960 ) Thank you for contributing to LangChain! - [x] PR title: "update IBM WatsonxLLM docs with deprecated LLMChain" - [x] PR message: - Description: update IBM WatsonxLLM docs with deprecated LLMChain - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-05-21 16:43:02 -07:00
Surya Rath	eb096675a8	OpenAI Assistants v2 api support for OpenAIAssistantRunnable (#21484 ) Title: "langchain: OpenAI Assistants v2 api support" *Descriptions* - [x] "attachments" support added along with backward compatibility of "file_ids" - [x] "tool_resources" support added while creating new assistant - [ ] "tool_choice" parameter support - [ ] Streaming support - Dependencies: OpenAI v2 API (openai>=1.23.0) - Twitter handle: @skanta_rath --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-21 15:32:29 -07:00
Eugene Yurtsev	7a5d042bd2	langchain[patch]: Add unit test to detect changes to community imports (#21998 ) Add unit tests for community imports	2024-05-21 17:45:26 -04:00
Eugene Yurtsev	90f4d8842f	langchain[patch]: Turn on all deprecations for 0.2 (#21999 ) - Turn on all 0.2 import deprecations. - Update error messag with URL to upgrade instructions.	2024-05-21 17:33:43 -04:00
Asaf Joseph Gardin	a042e804b4	ai21: AI21 Jamba docs (#21978 ) - Updated docs to have an example to use Jamba instead of J2 --------- Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-21 19:27:46 +00:00
Pengcheng Liu	4cf523949a	community[patch]: Update model client to support vision model in Tong… (#21474 ) - Description: Tongyi uses different client for chat model and vision model. This PR chooses proper client based on model name to support both chat model and vision model. Reference [tongyi document](https://help.aliyun.com/zh/dashscope/developer-reference/tongyi-qianwen-vl-plus-api?spm=a2c4g.11186623.0.0.27404c9a7upm11) for details. ``` from langchain_core.messages import HumanMessage from langchain_community.chat_models import ChatTongyi llm = ChatTongyi(model_name='qwen-vl-max') image_message = { "image": "https://lilianweng.github.io/posts/2023-06-23-agent/agent-overview.png" } text_message = { "text": "summarize this picture", } message = HumanMessage(content=[text_message, image_message]) llm.invoke([message]) ``` - Issue: None - Dependencies: None - Twitter handle: None	2024-05-21 11:58:27 -07:00
Erick Friis	98b64f3ae3	infra: only tag core releases as github latest (#21991 )	2024-05-21 11:39:03 -07:00
Sevin F. Varoglu	1bc0ea5496	community[patch]: update OctoAIEmbeddings to subclass OpenAIEmbeddings (#21805 )	2024-05-21 11:29:41 -07:00
Eugene Yurtsev	ded53297e0	core[patch]: Add unit test for RunnableGenerator for eventstream v2 (#21990 ) No unit tests with runnable generator	2024-05-21 14:29:15 -04:00
Nuno Campos	fb6108c8f5	core[patch]: In astream_events(version=v2) tap output of root run (#21977 ) - if tap_output_iter/aiter is called multiple times for the same run issue events only once - if chat model run is tapped don't issue duplicate on_llm_new_token events - if first chunk arrives after run has ended do not emit it as a stream event --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-05-21 14:03:57 -04:00
Bagatur	72d4a8eeed	community[patch]: AzureSearch dont overwrite default async (#21989 )	2024-05-21 11:01:28 -07:00
ccurme	a983465694	docs: set default anthropic model (#21988 ) `ChatAnthropic()` raises ValidationError.	2024-05-21 11:01:18 -07:00
Muhammed Al-Dulaimi	5448e16fe6	Fix grammar error (#21985 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-05-21 10:59:48 -07:00
ccurme	4be5537837	Revert "anthropic: set default model" (#21987 ) Reverts langchain-ai/langchain#21986	2024-05-21 17:28:32 +00:00
ccurme	35439cf3bd	anthropic: set default model (#21986 ) Various docs reference `ChatAnthropic()`, but this currently raises ValidationError.	2024-05-21 17:24:31 +00:00
ccurme	0923136851	langchain: default to Runnable in MultiQueryRetriever (#21770 ) - `llm_chain` becomes `Union[LLMChain, Runnable]` - `.from_llm` creates a runnable tested by verifying that docs/how_to/MultiQueryRetriever.ipynb runs unchanged with sync/async invoke (and that it runs if we specifically instantiate with LLMChain).	2024-05-21 17:01:05 +00:00
Yulong Wang	8e1aeb8ad5	community[patch]: Fix typo in arxiv tool's doc (#21970 ) Fix typo in arxiv tool's doc	2024-05-21 13:44:59 +00:00
Robert Caulk	54adcd9e82	community[minor]: add AskNews retriever and AskNews tool (#21581 ) We add a tool and retriever for the [AskNews](https://asknews.app) platform with example notebooks. The retriever can be invoked with: ```py from langchain_community.retrievers import AskNewsRetriever retriever = AskNewsRetriever(k=3) retriever.invoke("impact of fed policy on the tech sector") ``` To retrieve 3 documents in then news related to fed policy impacts on the tech sector. The included notebook also includes deeper details about controlling filters such as category and time, as well as including the retriever in a chain. The tool is quite interesting, as it allows the agent to decide how to obtain the news by forming a query and deciding how far back in time to look for the news: ```py from langchain_community.tools.asknews import AskNewsSearch from langchain import hub from langchain.agents import AgentExecutor, create_openai_functions_agent from langchain_openai import ChatOpenAI tool = AskNewsSearch() instructions = """You are an assistant.""" base_prompt = hub.pull("langchain-ai/openai-functions-template") prompt = base_prompt.partial(instructions=instructions) llm = ChatOpenAI(temperature=0) asknews_tool = AskNewsSearch() tools = [asknews_tool] agent = create_openai_functions_agent(llm, tools, prompt) agent_executor = AgentExecutor( agent=agent, tools=tools, verbose=True, ) agent_executor.invoke({"input": "How is the tech sector being affected by fed policy?"}) ``` --------- Co-authored-by: Emre <e@emre.pm>	2024-05-20 18:23:06 -07:00
Jesse S	fc79b372cb	community[minor]: add aerospike vectorstore integration (#21735 ) Please let me know if you see any possible areas of improvement. I would very much appreciate your constructive criticism if time allows. Description: - Added a aerospike vector store integration that utilizes [Aerospike-Vector-Search](https://aerospike.com/products/vector-database-search-llm/) add-on. - Added both unit tests and integration tests - Added a docker compose file for spinning up a test environment - Added a notebook Dependencies: any dependencies required for this change - aerospike-vector-search Twitter handle: - No twitter, you can use my GitHub handle or LinkedIn if you'd like Thanks! --------- Co-authored-by: Jesse Schumacher <jschumacher@aerospike.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-21 01:01:47 +00:00
Prince Canuma	3587c60396	community[patch]: Fix MLX LLM Stream (#20575 ) Closes #20561 This PR fixes MLX LLM stream `AttributeError`. Recently, `mlx-lm` changed the token decoding logic, which affected the LC+MLX integration. Additionally, I made minor fixes such as: docs example broken link and enforcing pipeline arguments (max_tokens, temp and etc) for invoke. - Issue: #20561 - Twitter handle: @Prince_Canuma	2024-05-20 17:17:08 -07:00
Rahul Triptahi	96bd0b0844	community[patch]: Remove redundant pebblo cloud api call (#21589 ) Description: removed redundant pebblo cloud api call. Changed classified `doc` key to `ai_apps_data`. Documentation: N/A Unit tests: N/A	2024-05-20 17:15:16 -07:00
Param Singh	d07885f8b7	community[patch]: standardized sparkllm init args (#21633 ) Related to #20085 @baskaryan Thank you for contributing to LangChain! community:sparkllm[patch]: standardized init args updated `spark_api_key` so that aliased to `api_key`. Added integration test for `sparkllm` to test that it continues to set the same underlying attribute. updated temperature with Pydantic Field, added to the integration test. Ran `make format`,`make test`, `make lint`, `make spell_check`	2024-05-20 17:11:36 -07:00
Dhruv Chawla	d4359d3de6	community[patch]: Update UpTrain Callback Handler to support the new UpTrain evaluation schema (#21656 ) UpTrain has a new dashboard now that makes it easier to view projects and evaluations. Using this requires specifying both project_name and evaluation_name when performing evaluations. I have updated the code to support it.	2024-05-20 17:06:00 -07:00
Alex Riina	c0e3c3a350	openai[patch], community[patch]: add pricing and max context window for GPT-4o (#21673 ) # Add pricing and max context window for GPT-4o - community: add cost per 1k tokens and max context window - partners: add max context window Description: adds static information about GPT-4o based on https://openai.com/api/pricing/ and https://platform.openai.com/docs/models/gpt-4o so that GPT-4o reporting is accurate. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-20 23:47:43 +00:00
缨缨	bd39b2ccdf	community: enable SupabaseVectorStore to support extended table fields (#21762 ) Thank you for contributing to LangChain! - [x] PR title: "community: enable SupabaseVectorStore to support extended table fields" - [x] PR message: - Added extension fields to the function _add_vectors so that users can add other custom fields when insert a record into the database. eg: ![image](https://github.com/langchain-ai/langchain/assets/10885578/e1d5ca20-936e-4cab-ba69-8fdd23b8ce8f) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-20 16:32:26 -07:00
Jerome Choo	2316635add	docs: Clean up Diffbot docs (#21781 ) The Diffbot DocumentLoader page doesn't actually run for a number of reasons. This PR fixes it along with some light details on the Graph Transformer and Provider pages. ## Full Changelog [Document Loader Page](https://python.langchain.com/v0.1/docs/integrations/document_loaders/diffbot/) * Fixed the notebook so that it actually runs (missing required modules, env variables, etc..) * Added "open in colab" button like the Graph Transformer page [Graph Transformer Page](https://python.langchain.com/v0.2/docs/integrations/graphs/diffbot/) * Fixed broken colab link * Moved "open in colab" button to below description so the description in the [Graphs category page](https://python.langchain.com/v0.2/docs/integrations/graphs/) shows up correctly [Provider Page](https://python.langchain.com/v0.2/docs/integrations/providers/diffbot/) * Clarified explanations of Diffbot products * Added section and link to LangChain Graph Transformer page --------- Co-authored-by: jeromechoo <hello@jeromechoo.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-20 23:09:22 +00:00
Rohan Aggarwal	d8a101074f	docs: updates for OracleDB (#21745 ) Thank you for contributing to LangChain! Documentation change for OracleDB Fixed several things in Oracle Documentation.	2024-05-20 16:01:35 -07:00
Leonid Ganeline	9799437bc2	docs: `YouTube` page update (#21780 ) Greatly simplified to get a cleaner look. Only the YouTube pages with 40K+ views.	2024-05-20 15:50:41 -07:00
Leonid Ganeline	e98a4fd19a	ai21[patch]: configuration fix (#21790 ) added "repository" and "Source Code" parameters (these parameters are missed only in this partner package configuration).	2024-05-20 15:49:38 -07:00
Trayan Azarov	f54cbf8ff5	chroma[patch]: Chroma - remove reference to collection upon delete_collection (#21817 ) Description: - Reference to `Collection` object is set to `None` when deleting a collection `delete_collection()` - Added utility method `reset_collection()` to allow recreating the collection - Moved collection creation out of `__init__` into `__ensure_collection()` to be reused by object init and `reset_collection()` - `_collection` is now a property to avoid breaking changes Issues: - chroma-core/chroma#2213 Twitter: @t_azarov	2024-05-20 15:42:36 -07:00
Jens	b0b302ec6b	community[patch]: fixed aleph alpha default emedding request (#21826 ) - Description: In the aleph alpha client the paramater `normalize` is not optional. Setting this to `None` gives an error. - Dependencies: None Co-authored-by: Jens Lücke <jens.luecke@tngtech.com> Co-authored-by: Jens <jens.luecke@hu-berlin.de> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-20 22:39:43 +00:00
Leonid Ganeline	6a59f76f2b	docs: added template to `arxiv` page (#21846 ) Updated `arXiv` page with the arxiv references from Templates (were references from Docs and API Refs, not Templates). Re #21450 CC @eyurtsev	2024-05-20 15:30:35 -07:00
Jorge Piedrahita Ortiz	e6207ad4f3	community[patch]: Sambanova integration api update (#21848 ) - Description:: SambaStudio generic endpoint compatibility added Improved error description, and handling streaming examples added	2024-05-20 15:29:59 -07:00
Bagatur	c6da9533ac	docs: correct langserve link (#21940 )	2024-05-20 22:15:31 +00:00
Michael Reed	7a5e1bcf99	core[patch]: Fix NPE in function_calling._get_python_function_required_args (#21863 ) Example error message: line 206, in _get_python_function_required_args if is_function_type and required[0] == "self": ~~~~~~~~^^^ IndexError: list index out of range Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-20 22:06:27 +00:00
Liuww	332ffed393	community[patch]: Adopting the lighter-weight xinference_client (#21900 ) While integrating the xinference_embedding, we observed that the downloaded dependency package is quite substantial in size. With a focus on resource optimization and efficiency, if the project requirements are limited to its vector processing capabilities, we recommend migrating to the xinference_client package. This package is more streamlined, significantly reducing the storage space requirements of the project and maintaining a feature focus, making it particularly suitable for scenarios that demand lightweight integration. Such an approach not only boosts deployment efficiency but also enhances the application's maintainability, rendering it an optimal choice for our current context. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-20 22:05:09 +00:00
Tomaz Bratanic	a43515ca65	experimental[patch]: Pass enum only to openai in llm graph transformer (#21860 ) Some models like Groq return bad request if you pass in `enum` parameter in tool definition	2024-05-20 15:02:48 -07:00
Ozan Kaşıkçı	aab9cb666f	docs: Update agents.ipynb, add missing word "see" (#21872 ) - Description: Add missing see word in the docs	2024-05-20 22:00:03 +00:00
Jiří Spilka	6499897c87	community[patch]: update apify integration to attribute API activity to langchain (#21909 ) Description: Add `Origin/langchain` to Apify's client's user-agent to attribute API activity to LangChain (at Apify, we aim to monitor our integrations to evaluate whether we should invest more in the LangChain integration regarding functionality and content) Issue: None Dependencies: None Twitter handle: None	2024-05-20 14:49:23 -07:00
Mohammad Mohtashim	711b8f1e52	docs: HuggingFace Endpoint Documentation Fixed (#21914 ) Fixed Documentation for HuggingFaceEndpoint as per the issue #21903 --------- Co-authored-by: keenborder786 <mohammad.mohtashim78@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-20 21:23:28 +00:00
Jared Van Bortel	25d1c1c9bb	nomic: implement local embeddings with the inference_mode parameter (#21934 ) ## Description This PR implements local and dynamic mode in the Nomic Embed integration using the inference_mode and device parameters. They work as documented [here](https://docs.nomic.ai/reference/python-api/embeddings#local-inference). <!-- If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --> --------- Co-authored-by: Erick Friis <erickfriis@gmail.com>	2024-05-20 14:17:07 -07:00
ccurme	0e72ed39a0	infra: fix CI on text-splitters (#21935 )	2024-05-20 14:03:42 -07:00
Ozan Kaşıkçı	f4ffef98a2	docs: how to: tool calling: Fix typo in sentence (#21877 ) - Description: Fix grammar error.	2024-05-20 20:58:52 +00:00
Erick Friis	6b97418836	docs: rewrite old home, fix v0.1 infinite redirect (#21936 )	2024-05-20 13:44:41 -07:00
Bagatur	1418d3af00	docs: link to langsmith+langgraph docs (#21930 )	2024-05-20 13:05:22 -07:00
ccurme	e8bdf245eb	update maintainers (#21305 )	2024-05-20 19:07:53 +00:00
ccurme	4470d3b4a0	partners: bump core in packages implementing ls_params (#21868 ) These packages all import `LangSmithParams` which was released in langchain-core==0.2.0. N.B. we will need to release `openai` and then bump `langchain-openai` in `together` and `upstage`.	2024-05-20 11:51:43 -07:00
junefish	0614a53d9c	docs: update notebook for latest Pinecone API + serverless (#21921 ) Thank you for contributing to LangChain! - [x] PR title: "docs: update notebook for latest Pinecone API + serverless" - [x] PR message: Published notebook is incompatible with latest `pinecone-client` and not runnable. Updated for use with latest Pinecone Python SDK. Also updated to be compatible with serverless indexes (only index type available on Pinecone free tier). - [x] Add tests and docs: N/A (tested in Colab) - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --- - To see the specific tasks where the Asana app for GitHub is being used, see below: - https://app.asana.com/0/0/1207328087952499	2024-05-20 11:51:03 -07:00
ccurme	9c76739425	mistral: implement ls_params (#21867 )	2024-05-20 11:49:48 -07:00
junefish	68a90e2252	docs: update notebook for new Pinecone API + serverless (#21923 ) Thank you for contributing to LangChain! - [x] PR title: "docs: update notebook for new Pinecone API + serverless" - [x] PR message: The published notebook is not runnable after `pinecone-client` v2, which is deprecated. `langchain-pinecone` is not compatible with the latest `pinecone-client` (v4), so I hardcoded it to the last v3. Also updated for serverless indexes (only index type available on Pinecone free plan). - [x] Add tests and docs: N/A (tested in Colab) - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --- - To see the specific tasks where the Asana app for GitHub is being used, see below: - https://app.asana.com/0/0/1207328087952500	2024-05-20 11:48:55 -07:00
Eugene Yurtsev	8ed2ba9301	docs: migrate integrations using langchain-cli (#21929 ) Migrate integration docs	2024-05-20 18:14:49 +00:00
Eugene Yurtsev	c98bd8505f	docs: migrate tutorials using langchain-cli migrate (#21928 ) Migrate tutorials	2024-05-20 13:45:35 -04:00
Eugene Yurtsev	b2f58d37db	docs: run migration script against how-to docs (#21927 ) Upgrade imports in how-to docs	2024-05-20 17:32:59 +00:00
Tomaz Bratanic	d85e46321a	community[patch]: Better error message for neo4j vector when text is null (#21861 )	2024-05-20 10:25:58 -07:00
Stefano Lottini	f2e75f9500	cli[minor]: fix import path for two Astra DB classes in the migration json data (#21926 ) This PR fixes two mistakes in the import paths from community for the json data aiding the cli migration to 0.2. It is intended as a quick follow-up to https://github.com/langchain-ai/langchain/pull/21913 . @nicoloboschi FYI	2024-05-20 12:25:10 -04:00
WilliamEspegren	30bca57aae	doc list not empty (#21208 ) Make sure the doc list is not empty, and set Metadata: true in param, to enable the user to disable metadata for slightly faster crawls.	2024-05-20 08:24:06 -07:00
David Charles	8da35fba7f	langchain[minor]: add libs/partners to dev.Dockerfile (#21902 ) Resolves #21886 by adding "COPY libs/partners ../partners/" to libs/dev.Dockerfile Twitter: @kabakongo	2024-05-20 15:20:56 +00:00
Eugene Yurtsev	8530bbac2d	docs: update how to install (#21920 ) Fix installation instructions in how-to install	2024-05-20 15:14:20 +00:00
TJ	8cd6ed3e1e	community[patch]: Update documentation string in databricks chat model (#21915 ) Update typos in documentation string in databricks chat model	2024-05-20 14:33:57 +00:00
Maxime Perrin	5ae982145e	docs: fix wrong langchain-cli migration commands (#21906 ) Co-authored-by: Maxime Perrin <mperrin@doing.fr>	2024-05-20 10:29:50 -04:00
Nicolò Boschi	dd00aac7ad	cli[minor]: add astradb in the cli migration to 0.2 (#21913 ) astradb has a new partner package but the automatic migration cli tool doesn't take care of migration astradb integrations	2024-05-20 10:29:17 -04:00
Jacob Lee	242eeb537f	docs[patch]: Adds callback docs (#21889 ) @efriis @hwchase17	2024-05-19 21:57:33 -07:00
Jacob Lee	da4fef8131	docs[patch]: Update 0.2 banner copy (#21888 ) @nfcampos	2024-05-19 17:21:02 -07:00
Coozywana	b6c8b6f944	Fix base.py typo (#21862 ) ChatOpenaAI --> ChatOpenAI Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-18 13:05:02 +00:00
fzowl	d3624eaba1	partners: Remove unnecessary print from voyageai embeddings (#21865 ) Thank you for contributing to LangChain! Remove unnecessary print from voyageai embeddings - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-18 08:57:17 -04:00
Eugene Yurtsev	61ebe7991c	docs: how to remove conversion to openai function from index (#21836 ) - bind_tools interface is a better alternative. - openai doesn't use functions but tools in its API now. - the underlying content appears in some redirects, so will need to investigate if we can remove.	2024-05-17 23:00:07 -04:00
Eugene Yurtsev	0812723789	docs: how to tools human in the loop (#21858 ) Update information in how to guide tools human in the loop.	2024-05-17 22:59:51 -04:00
Eugene Yurtsev	875230d5bc	docs: how-to index page fix minor typo (#21859 ) Fix typo	2024-05-17 22:45:47 -04:00
Bagatur	8b3c5f93f5	docs: lcel how to and cheatsheet (#21851 )	2024-05-17 19:04:45 -07:00
Erick Friis	c3caec5aaf	docs: update announcement bar (#21854 )	2024-05-18 00:35:07 +00:00
Jacob Lee	0180716a95	docs[patch]: Remove padding from first sidebar link (#21852 ) CC @efriis	2024-05-17 17:09:58 -07:00
Nuno Campos	b1e7b40b6a	core: Tap output of sync iterators for astream_events (#21842 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-17 16:57:41 -07:00
Erick Friis	9a39f92aba	docs: v0.2 version sidebar (#21844 ) ![image](https://github.com/langchain-ai/langchain/assets/9557659/189f2e04-0c08-4395-b729-f48982c6f53b)	2024-05-17 23:45:51 +00:00
Max Jakob	e6b7a1769b	docs: update Elasticsearch strategy names (#21530 ) Update documentation with the [new names for retrieval strategies](https://github.com/langchain-ai/langchain-elastic/pull/22) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-17 23:21:46 +00:00
Erick Friis	cdc8e2d0c2	docs: resolve local links script escape (#21840 ) Fixing warnings. Needs to be propagated to 0.1 branch if this works. ![Screenshot 2024-05-17 at 2 34 15 PM](https://github.com/langchain-ai/langchain/assets/9557659/e6ac95a9-5686-4747-9ab8-4cb49942dc8d)	2024-05-17 22:59:27 +00:00
Erick Friis	d02380c504	docs: remove postgres from docs build (#21847 )	2024-05-17 15:36:35 -07:00
Eugene Yurtsev	67b6f6c82a	core[patch]: Check if event loop is closed in memory stream (#21841 ) Check if event stream is closed in memory loop. Using try/except here to avoid race condition, but this may incur a small overhead in versions prios to 3.11	2024-05-17 21:53:59 +00:00
Erick Friis	d8f89a5e9b	docs: fix vercel core dep 2 (#21839 )	2024-05-17 14:24:25 -07:00
Erick Friis	5285336cb1	docs: fix vercel core dep (#21837 )	2024-05-17 14:18:57 -07:00
Erick Friis	2d3f4e1a16	experimental: release 0.0.59 (#21835 )	2024-05-17 21:02:45 +00:00
Erick Friis	169f525cfb	community: release 0.2.0 (#21834 )	2024-05-17 13:49:29 -07:00
Eugene Yurtsev	2656bfe941	docs: how to guide tool calling using prompts (#21827 ) Update tool calling using prompts. - Add required concepts - Update names of tool invoking function. - Add doc-string to function, and add information about `config` (which users often forget) - Remove steps that show how to use single function only. This makes the how-to guide a bit shorter and more to the point. - Add diagram from another how-to guide that shows how the thing works overall.	2024-05-17 16:46:59 -04:00
Erick Friis	e5046cbd72	langchain: release 0.2.0, fix min deps (#21833 )	2024-05-17 13:40:51 -07:00
Erick Friis	1b555021f7	text-splitters: release 0.2.0 (#21832 )	2024-05-17 13:30:54 -07:00
Erick Friis	0ad8de5eb7	langchain: release 0.2.0 (#21831 )	2024-05-17 13:18:31 -07:00
Eugene Yurtsev	33dbad02fe	docs: update how-to for built in tools and toolkits (#21828 ) Fix some typos	2024-05-17 16:05:28 -04:00
Erick Friis	23310626b3	core: release 0.2.0 (#21829 )	2024-05-17 13:04:39 -07:00
Eugene Yurtsev	e3f30b4cde	docs: clean up link to bing search (#21825 ) Documentation should be inlined, not linking to medium article.	2024-05-17 19:06:56 +00:00
Eugene Yurtsev	22d9aed508	docs: how to tools, merge built in tools and toolkits (#21824 ) * Rename tools to built in tools * Merge built in tools and toolkits * Update links from providers	2024-05-17 14:35:57 -04:00
Leonid Ganeline	c4508ca7ef	docs: `arXiv` references page (#21450 ) Since the LangChain based on many research papers, the LC documentation has several references to the arXiv papers. It would be beneficial to create a single page with all referenced papers. PR: 1. Developed code to search the arXiv references in the LangChain Documentation and the LangChain code base. Those references are included in a newly generated documentation page. 2. Page is linked to the Docs menu. Controversial: 1. The `arxiv_references` page is automatically generated. But this generation now started only manually. It is not included in the doc generation scripts. The reason for this is simple. I don't want to mangle into the current documentation refactoring. If you think, we need to regenerate this page in each build, let me know. Note: This script has a dependency on the `arxiv` package. 2. The link for this page in the menu is not obvious. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-05-17 18:28:57 +00:00
ccurme	181dfef118	core, standard tests, partner packages: add test for model params (#21677 ) 1. Adds `.get_ls_params` to BaseChatModel which returns ```python class LangSmithParams(TypedDict, total=False): ls_provider: str ls_model_name: str ls_model_type: Literal["chat"] ls_temperature: Optional[float] ls_max_tokens: Optional[int] ls_stop: Optional[List[str]] ``` by default it will only return ```python {ls_model_type="chat", ls_stop=stop} ``` 2. Add these params to inheritable metadata in `CallbackManager.configure` 3. Implement `.get_ls_params` and populate all params for Anthropic + all subclasses of BaseChatOpenAI Sample trace: https://smith.langchain.com/public/d2962673-4c83-47c7-b51e-61d07aaffb1b/r OpenAI: <img width="984" alt="Screenshot 2024-05-17 at 10 03 35 AM" src="https://github.com/langchain-ai/langchain/assets/26529506/2ef41f74-a9df-4e0e-905d-da74fa82a910"> Anthropic: <img width="978" alt="Screenshot 2024-05-17 at 10 06 07 AM" src="https://github.com/langchain-ai/langchain/assets/26529506/39701c9f-7da5-4f1a-ab14-84e9169d63e7"> Mistral (and all others for which params are not yet populated): <img width="977" alt="Screenshot 2024-05-17 at 10 08 43 AM" src="https://github.com/langchain-ai/langchain/assets/26529506/37d7d894-fec2-4300-986f-49a5f0191b03">	2024-05-17 13:51:26 -04:00
Eugene Yurtsev	4ca2149b70	docs: Remove duplicated content from how to tools (#21821 ) Content is duplicated, and is covered in how to use chat models.	2024-05-17 17:30:43 +00:00
Matthew Koski	e59afe292d	langchain: Fixing import in docs per https://github.com/langchain-ai/langchain/issues/21814 (#21815 ) Description: The example in the How-To guide had an import which did not work. I changed it to use an import from langchain_core. Issue: https://github.com/langchain-ai/langchain/issues/21814	2024-05-17 17:19:57 +00:00
Sen Lin	eb7f07ae36	community[patch]: fix typo in ValueError message in load_local function (#21818 ) Description: Corrected an error in the `allow_dangerous_deserialization` message within the `load_local` functions	2024-05-17 17:19:04 +00:00
Jorge Piedrahita Ortiz	700b1c7212	community: sambaverse api update (#21816 ) - Description: fix sambaverse integration to make it compatible with sambaverse API update / minor changes in docs	2024-05-17 10:18:08 -07:00
Erick Friis	7976fb1663	docs: cookbook redirect (#21822 )	2024-05-17 17:07:30 +00:00
maang-h	9f8d18c028	community[patch]: Fix unintended newline in print statement in exception for BaichuanTextEmbeddings (#21820 ) - Code: langchain_community/embeddings/baichuan.py:82 - Description: When I make an error using 'baichuan embeddings', the printed error message is wrapped (there is actually no need to wrap) ```python # example from langchain_community.embeddings import BaichuanTextEmbeddings # error key BAICHUAN_API_KEY = "sk-xxxxxxxxxxxxx" embeddings = BaichuanTextEmbeddings(baichuan_api_key=BAICHUAN_API_KEY) text_1 = "今天天气不错" query_result = embeddings.embed_query(text_1) ``` ![unintended newline](https://github.com/langchain-ai/langchain/assets/55082429/e1178ce8-62bb-405d-a4af-e3b28eabc158)	2024-05-17 16:38:38 +00:00
Eugene Yurtsev	aa648298ae	docs: minor updates to migration docs (#21819 ) Minor aesthetic updates to migration docs	2024-05-17 12:28:56 -04:00
Eugene Yurtsev	fc644c0e1c	docs: Update v0.2 information (#21796 ) Update information about v0.2 upgrade	2024-05-17 11:43:58 -04:00
Bakar Tavadze	3b5ac44e03	langchain-robocorp[minor]: Enable passing additional headers to the action server. (#21809 ) Actions can optionally receive secrets via request headers. This PR enables this functionality.	2024-05-17 15:08:48 +00:00
Erick Friis	09919c2cd5	docs: version dropdown (#21784 )	2024-05-16 17:01:34 -07:00
Chad Juliano	685c13e157	docs: fix errors and table formatting in notebook (#21696 ) There are 2 issues fixed here: * In the notebook pandas dataframes are formatted as HTML in the cells. On the documentation site the renderer that converts notebooks incorrectly displays the raw HTML. I can't find any examples of where this is working and so I am formatting the dataframes as text. * Some incorrect table names were referenced resulting in errors.	2024-05-16 16:00:14 -07:00
Asaf Joseph Gardin	f3289b898c	partners: Revert AI21 Labs docs scan feature (#21699 ) Description: Reverted commit #21614 --------- Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-16 22:58:40 +00:00
github-user-en	ec8d406441	Made a grammatical correction in streaming.ipynb (#21707 ) The only change is replacing the word "operators" with "operates," to make the sentence grammatically correct. Thank you for contributing to LangChain! - [x] PR title: "docs: Made a grammatical correction in streaming.ipynb to use the word "operates" instead of the word "operators"" - [x] PR message: - Description: The use of the word "operators" was incorrect, given the context and grammar of the sentence. This PR updates the documentation to use the word "operates" instead of the word "operators". - Issue: Makes the documentation more easily understandable. - Dependencies: -no dependencies- - Twitter handle: -- - [x] Add tests and docs: Since no new integration is being made, no new tests/example notebooks are required. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ - No formatting changes made to the documentation Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-16 22:47:40 +00:00
Brace Sproul	6febb283f6	docs[minor]: Hide prev/next buttons on docs in how to / tutorials (#21789 ) These buttons don't navigate to the proper prev/next page. Hide in those pages	2024-05-16 15:35:17 -07:00
Eugene Yurtsev	8607735b80	langchain[patch],community[patch]: Move unit tests that depend on community to community (#21685 )	2024-05-16 17:24:27 -04:00
Eugene Yurtsev	97a4ae50d2	How To: Custom tools (#21725 ) - Remove double implementations of functions. The single input is just taking up space. - Added tool specific information for `async + showing invoke vs. ainvoke. - Added more general information about about `async` (this should live in a different place eventually since it's not specific to tools). - Changed ordering of custom tools (StructuredTool is simpler and should appear before the inheritance) - Improved the error handling section (not convinced it should be here though)	2024-05-16 21:06:33 +00:00
Bagatur	1cf80a5956	docs: link runnable api (#21783 )	2024-05-16 20:49:37 +00:00
Bagatur	aee3842a21	docs: intro nit (#21785 )	2024-05-16 13:46:11 -07:00
Marco Lamina	d0fae6cd54	community: Add token cost for GPT-4o model (#21771 ) Adding [token cost for the new GPT-4o model](https://openai.com/api/pricing/): * Input cost US$5.00 / 1M tokens * Output cost US$15.00 / 1M tokens	2024-05-16 20:36:23 +00:00
Bagatur	4231cf0696	docs: update chat feat table (#21778 )	2024-05-16 12:58:51 -07:00
Massimiliano Pronesti	0c0db7c5db	feat(community): support semantic hybrid score threshold in Azure AI Search (#21527 ) Support semantic hybrid search with a score threshold -- similar to what we do for similarity search and for hybrid search (#20907).	2024-05-16 15:54:32 -04:00
Erick Friis	5e445a7e4e	docs: dont rewrite ipynb links that have double slash (#21775 )	2024-05-16 19:06:30 +00:00
Eugene Yurtsev	e3a03b324d	docs: concepts -- add information about tool calling models, update tools section (#21760 ) - Add information about naitve tool calling capabilities - Add information about standard langchain interface for tool calling - Update description for tools --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-05-16 15:03:25 -04:00
Bagatur	6416d16d39	anthropic[patch]: Release 0.1.13, tool_choice support (#21773 )	2024-05-16 17:56:29 +00:00
Stefano Lottini	040597e832	community: init signature revision for Cassandra LLM cache classes + small maintenance (#17765 ) This PR improves on the `CassandraCache` and `CassandraSemanticCache` classes, mainly in the constructor signature, and also introduces several minor improvements around these classes. ### Init signature A (sigh) breaking change is tentatively introduced to the constructor. To me, the advantages outweigh the possible discomfort: the new syntax places the DB-connection objects `session` and `keyspace` later in the param list, so that they can be given a default value. This is what enables the pattern of _not_ specifying them, provided one has previously initialized the Cassandra connection through the versatile utility method `cassio.init(...)`. In this way, a much less unwieldy instantiation can be done, such as `CassandraCache()` and `CassandraSemanticCache(embedding=xyz)`, everything else falling back to defaults. A downside is that, compared to the earlier signature, this might turn out to be breaking for those doing positional instantiation. As a way to mitigate this problem, this PR typechecks its first argument trying to detect the legacy usage. (And to make this point less tricky in the future, most arguments are left to be keyword-only). If this is considered too harsh, I'd like guidance on how to further smoothen this transition. Our plan is to make the pattern of optional session/keyspace a standard across all Cassandra classes, so that a repeatable strategy would be ideal. A possibility would be to keep positional arguments for legacy reasons but issue a deprecation warning if any of them is actually used, to later remove them with 0.2 - please advise on this point. ### Other changes - class docstrings: enriched, completely moved to class level, added note on `cassio.init(...)` pattern, added tiny sample usage code. - semantic cache: revised terminology to never mention "distance" (it is in fact a similarity!). Kept the legacy constructor param with a deprecation warning if used. - `llm_caching` notebook: uniform flow with the Cassandra and Astra DB separate cases; better and Cassandra-first description; all imports made explicit and from community where appropriate. - cache integration tests moved to community (incl. the imported tools), env var bugfix for `CASSANDRA_CONTACT_POINTS`. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-16 17:22:24 +00:00
fzowl	8db4a14648	docs: new voyageai text_embeddings model: voyage-large-2-instruct (#21706 )	2024-05-16 10:06:22 -07:00
Bagatur	901e09aa30	docs: datacamp course (#21767 )	2024-05-16 16:56:32 +00:00
Kyle Cassidy	eca8c4bcc6	Standardized openai init params (#21739 ) ## Patch Summary community:openai[patch]: standardize init args ## Details I made changes to the OpenAI Chat API wrapper test in the Langchain open-source repository - File: `libs/community/tests/unit_tests/chat_models/test_openai.py` - Changes: - Updated `max_retries` with Pydantic Field - Updated the corresponding unit test - Related Issues: #20085 - Updated max_retries with Pydantic Field, updated the unit test. --------- Co-authored-by: JuHyung Son <sonju0427@gmail.com>	2024-05-16 16:30:52 +00:00
laishzh	c03fd93fc1	docs: Remove unnecessary comment marks from the Makefile help section (#21749 ) Previous screenshot: <img width="758" alt="image" src="https://github.com/langchain-ai/langchain/assets/1683919/7b90626e-35ab-4486-b41d-b664e69eec0b"> Current: <img width="744" alt="image" src="https://github.com/langchain-ai/langchain/assets/1683919/cdb69512-dc6c-4b7f-a466-4be92d94c076">	2024-05-16 09:05:44 -07:00
Ethan Yang	e44b448ec3	community: update openvino doc with streaming support (#21519 ) Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-16 15:54:45 +00:00
Eugene Yurtsev	7022260bc5	How to: Streaming (#21715 ) Update the how to guide on streaming --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-05-16 11:48:11 -04:00
ccurme	19e6bf814b	community: fix CI (#21766 )	2024-05-16 15:41:03 +00:00
Michael Ozery	dda5a9c97a	docs: sql_qa.ipynb tutorial update (#21756 ) 1. Updated deprecated method usage. 2. Added LangGraph required installation in tutorial. X: MichaelOzery	2024-05-16 15:23:20 +00:00
Mish Ushakov	d77e60a7f4	community: updated Browserbase loader (#21757 ) Thank you for contributing to LangChain! - [x] PR title: "community: updated Browserbase loader" - [x] PR message: Updates the Browserbase loader with more options and improved docs. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-05-16 08:21:23 -07:00
Ikko Eltociear Ashimine	1e6517ba73	docs: update sql_large_db.ipynb (#21765 ) mispelling -> misspelling	2024-05-16 15:20:55 +00:00
Eugene Yurtsev	6ed0aa3239	core[major]: only use function description (#21622 ) Do not prefix function signature --- * Reason for this is that information is already present with tool calling models. * This will save on tokens for those models, and makes it more obvious what the description is! * The @tool can get more parameters to allow a user to re-introduce the the signature if we want	2024-05-16 11:17:53 -04:00
William FH	8498b41cda	Finish agent migration doc (#21731 )	2024-05-16 14:43:19 +00:00
Cheese	0ead09f84d	community: Implement `bind_tools` for ChatTongyi (#20725 ) ## Description Implement `bind_tools` in ChatTongyi. Usage example: ```py from langchain_core.tools import tool from langchain_community.chat_models.tongyi import ChatTongyi @tool def multiply(first_int: int, second_int: int) -> int: """Multiply two integers together.""" return first_int * second_int llm = ChatTongyi(model="qwen-turbo") llm_with_tools = llm.bind_tools([multiply]) msg = llm_with_tools.invoke("What's 5 times forty two") print(msg) ``` Streaming is also supported. ## Dependencies No Dependency is required for this change. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-16 10:39:35 -04:00
yoogle	b216a1dddb	docs: fix monorepo typo (#21761 ) ### Description fix monorepo typo. `monorep` -> `monorepo`	2024-05-16 14:15:10 +00:00
Bagatur	347166874f	docs: aca-ds nit (#21759 )	2024-05-16 13:53:08 +00:00
Bagatur	867adbf27b	docs: add aca-ds (#21746 )	2024-05-16 08:52:07 +00:00
Bagatur	74f54599f4	docs: aza-ds cookbook (#21747 )	2024-05-16 01:27:13 -07:00
Erick Friis	be15740084	fireworks: add secret (#21744 )	2024-05-15 19:48:51 -07:00
Erick Friis	06110e20b9	pinecone: bump min core version (#21742 )	2024-05-15 19:31:43 -07:00
Erick Friis	bd3e7d50f3	fireworks: bump min core version (#21741 )	2024-05-15 19:29:13 -07:00
Erick Friis	1647b28a87	infra: release min version dont clobber current lib (#21740 )	2024-05-15 19:27:39 -07:00
Erick Friis	f5c31078d7	airbyte[patch]: airbyte-cdk compatible pydantic versions (#21738 )	2024-05-15 19:13:25 -07:00
Erick Friis	3d33b89fa4	ibm[patch]: release 0.1.7 (#21737 )	2024-05-15 19:10:15 -07:00
Erick Friis	e41d801369	openai[patch]: fix embedding float precision issue (#21736 ) also clean up + comment some of the embedding batching code	2024-05-16 02:06:51 +00:00
JuHyung Son	38c297a025	upstage: Support batch input in embedding request. (#21730 ) Description: upstage embedding now supports batch input.	2024-05-15 18:13:44 -07:00
junefish	c5a981e3b4	docs: Update Pinecone example notebook with embedded widget (#21719 ) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-15 21:20:46 +00:00
Erick Friis	0aea7f4b1d	docs: fix installation link (#21728 )	2024-05-15 21:10:12 +00:00
Harrison Chase	15be439719	Harrison/move flashrank rerank (#21448 ) third party integration, should be in community	2024-05-15 13:08:52 -07:00
Harrison Chase	c6c2649a5a	move installation (#21711 )	2024-05-15 12:59:45 -07:00
Erick Friis	aca98fd150	multiple: releases with relaxed core dep (#21724 )	2024-05-15 19:29:35 +00:00
Bagatur	af284518bc	openai[patch]: Release 0.1.7, bump tiktoken 0.7.0 (#21723 )	2024-05-15 12:19:29 -07:00
Bagatur	0405933914	docs: add feedback link to 0.2 banner (#21600 )	2024-05-15 10:53:48 -07:00
William FH	ca768c8353	[Core] Check is async callable (#21714 ) To permit proper coercion of objects like the following: ```python class MyAsyncCallable: async def __call__(self, foo): return await ... class MyAsyncGenerator: async def __call__(self, foo): await ... yield ```	2024-05-15 10:49:49 -07:00
ccurme	7128c2d8ad	docs: add tutorial for vector stores and retrievers (#21683 ) also update how-to guide for parent document retriever	2024-05-15 11:50:24 -04:00
Eugene Yurtsev	5c2cfabec6	core[minor]: Add v2 implementation of astream events (#21638 ) This PR introduces a v2 implementation of astream events that removes intermediate abstractions and fixes some issues with v1 implementation. The v2 implementation significantly reduces relevant code that's associated with the astream events implementation together with overhead. After this PR, the astream events implementation: - Uses an async callback handler - No longer relies on BaseTracer - No longer relies on json patch As a result of this re-write, a number of issues were discovered with the existing implementation. ## Changes in V2 vs. V1 ### on_chat_model_end `output` The outputs associated with `on_chat_model_end` changed depending on whether it was within a chain or not. As a root level runnable the output was: ```python "data": {"output": AIMessageChunk(content="hello world!", id='some id')} ``` As part of a chain the output was: ``` "data": { "output": { "generations": [ [ { "generation_info": None, "message": AIMessageChunk( content="hello world!", id=AnyStr() ), "text": "hello world!", "type": "ChatGenerationChunk", } ] ], "llm_output": None, } }, ``` After this PR, we will always use the simpler representation: ```python "data": {"output": AIMessageChunk(content="hello world!", id='some id')} ``` NOTE Non chat models (i.e., regular LLMs) are still associated with the more verbose format. ### Remove some `_stream` events `on_retriever_stream` and `on_tool_stream` events were removed -- these were not real events, but created as an artifact of implementing on top of astream_log. The same information is already available in the `x_on_end` events. ### Propagating Names Names of runnables have been updated to be more consistent ```python model = GenericFakeChatModel(messages=infinite_cycle).configurable_fields( messages=ConfigurableField( id="messages", name="Messages", description="Messages return by the LLM", ) ) ``` Before: ```python "name": "RunnableConfigurableFields", ``` After: ```python "name": "GenericFakeChatModel", ``` ### on_retriever_end on_retriever_end will always return `output` which is a list of documents (rather than a dict containing a key called "documents") ### Retry events Removed the `on_retry` callback handler. It was incorrectly showing that the failed function being retried has invoked `on_chain_end` https://github.com/langchain-ai/langchain/pull/21638/files#diff-e512e3f84daf23029ebcceb11460f1c82056314653673e450a5831147d8cb84dL1394	2024-05-15 11:48:47 -04:00
Rajendra Kadam	54e003268e	langchain[minor]: Add PebbloRetrievalQA chain with Identity & Semantic Enforcement support (#20641 ) - Description: PebbloRetrievalQA chain introduces identity enforcement using vector-db metadata filtering - Dependencies: None - Issue: None - Documentation: Adding documentation for PebbloRetrievalQA chain in a separate PR(https://github.com/langchain-ai/langchain/pull/20746) - Unit tests: New unit-tests added --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-05-15 13:14:52 +00:00
Bagatur	f2f970f93d	docs: openai bind tools nit (#21692 )	2024-05-15 01:20:53 +00:00
Erick Friis	5fa5a73dc0	docs: disable contextual search (#21691 )	2024-05-14 16:59:11 -07:00
Erick Friis	3ee0747382	infra: remove prints from notebook build (#21688 )	2024-05-14 16:27:56 -07:00
Erick Friis	024c11ff9c	docs: v0.2 search index (#21619 )	2024-05-14 15:37:42 -07:00
Bagatur	241a6e43a5	docs: update structured how to (#21679 )	2024-05-14 22:19:51 +00:00
Jib	f369495fa0	mongodb: [performance] Increase DEFAULT_INSERT_BATCH_SIZE to 100,000 and introduce sizing constraints (#19608 )	2024-05-14 22:11:26 +00:00
Eugene Yurtsev	e69a9bedf8	core[patch]: Update mypy config (#21684 ) Update mypy config to ignore checking deps from numpy and pytest (which are optional in langsmith sdk)	2024-05-14 17:29:07 -04:00
Erick Friis	9973547aef	mongodb: release 0.1.4 (#21678 )	2024-05-14 11:54:23 -07:00
Jib	a97473c846	mongodb[patch]: Make ObjectId JSON-serializable on generation (#21394 )	2024-05-14 11:52:29 -07:00
ccurme	12b599c47f	docs: add how-to on multi-modal tool calling (#21667 ) Can move this to a dedicated multi-modal section if desired.	2024-05-14 12:26:25 -04:00
Eugene Yurtsev	5c64c004cc	core[patch]: Add unit tests with some streaming scenarios (#21668 ) Add unit tests that show differences between sync / async versions when streaming. The inner on_chain_chunk event is missing if mixing sync and async functionality. Likely due to missing tap_output_iter implementation on the sync variant of `_transform_stream_with_config`	2024-05-14 15:30:57 +00:00
Eugene Yurtsev	2ac4d2960c	core[patch]: Add unit test to catch ordering (#21669 ) Add unit test to catch ordering issues	2024-05-14 15:25:33 +00:00
ccurme	3390dc2266	docs: style nits (#21666 )	2024-05-14 10:18:13 -04:00
ccurme	2463c8060c	docs: how-to on adding scores to retriever results (#21626 )	2024-05-14 09:41:36 -04:00
Zhao Blake	972d2071c6	core[patch]: Fix typo in VectorStoreExampleSelector doc-string (#21574 )	2024-05-14 13:31:37 +00:00
William FH	714cba96a8	[docs] Update langgraph migration guide (#21644 ) - add links to references where appropriate - use the create_react_agent - Fix the timeout recommendation	2024-05-14 06:13:17 +00:00
Erick Friis	5144c94603	docs: add 0.2 search notice (#21653 )	2024-05-14 04:00:18 +00:00
Erick Friis	2a984e8e3f	docs: huggingface package (#21645 )	2024-05-14 03:17:40 +00:00
Anush	cd1879f5e7	docs: Qdrant partner package reference (#21649 ) ## Description: As the title goes.	2024-05-13 19:51:57 -07:00
Erick Friis	c77d2f2b06	multiple: core 0.2 nonbreaking dep, check_diff community->langchain dep (#21646 ) 0.2 is not a breaking release for core (but it is for langchain and community) To keep the core+langchain+community packages in sync at 0.2, we will relax deps throughout the ecosystem to tolerate `langchain-core` 0.2	2024-05-13 19:50:36 -07:00
Anush	edd68e4ad4	qdrant: init package (#21146 ) ## Description This PR introduces the new `langchain-qdrant` partner package, intending to deprecate the community package. ## Changes - Moved the Qdrant vector store implementation `/libs/partners/qdrant` with integration tests. - The conditional imports of the client library are now regular with minor implementation improvements. - Added a deprecation warning to `langchain_community.vectorstores.qdrant.Qdrant`. - Replaced references/imports from `langchain_community` with either `langchain_core` or by moving the definitions to the `langchain_qdrant` package itself. - Updated the Qdrant vector store documentation to reflect the changes. ## Testing - `QDRANT_URL` and [`QDRANT_API_KEY`](`583e36bf6b`) env values need to be set to [run integration tests](`d608c93d1f`) in the [cloud](https://cloud.qdrant.tech). - If a Qdrant instance is running at `http://localhost:6333`, the integration tests will use it too. - By default, tests use an [`in-memory`](https://github.com/qdrant/qdrant-client?tab=readme-ov-file#local-mode) instance(Not comprehensive). --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Erick Friis <erickfriis@gmail.com>	2024-05-13 18:20:03 -07:00
Erick Friis	fe8c9d621a	docs: ignore nb echo:false blocks (#21624 ) not working currently	2024-05-13 17:18:26 -07:00
Prashanth Rao	63c3a0e56c	[community][graph]: Update KuzuQAChain and docs (#21218 ) This PR makes some small updates for `KuzuQAChain` for graph QA. - Updated Cypher generation prompt (we now support `WHERE EXISTS`) and generalize it more - Support different LLMs for Cypher generation and QA - Update docs and examples	2024-05-13 17:17:14 -07:00
Bagatur	752b1e85f8	docs: gh feedback link (#21606 ) Co-authored-by: bracesproul <braceasproul@gmail.com>	2024-05-14 00:11:37 +00:00
Bagatur	506df439eb	docs: how to index nits (#21623 )	2024-05-13 23:52:50 +00:00
Bagatur	b514a479c0	docs: standardize capitalization (#21641 )	2024-05-13 16:25:51 -07:00
Bagatur	89aae3e043	docs: add Techniques to Concepts (#21636 ) - Adds Techniques section - Moves function calling, retrieval types to Techniques - Removes Installation section (not conceptual) - Reorders a few things (chat models before llms, package descriptions before diagram) - Add text splitter types to Techniques	2024-05-13 16:06:16 -07:00
Tomaz Bratanic	89ff6a3d3b	Add sentiment and confidence levels to diffbotgraphtransformer (#21590 ) Co-authored-by: Erick Friis <erickfriis@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-13 23:00:52 +00:00
Bagatur	526ba235f3	docs: fix prereq links (#21630 )	2024-05-13 15:40:53 -07:00
Erick Friis	0541e06e21	infra: 0.2 docs 404 page (#21634 )	2024-05-13 22:11:28 +00:00
Erick Friis	e861b5bcb7	infra: fix api ref link generation (#21631 )	2024-05-13 14:52:26 -07:00
Erick Friis	9b51ca08bc	huggingface: fix community dep checking (#21628 )	2024-05-13 21:52:18 +00:00
Erick Friis	91a2ea5cd6	chroma, mongodb: fix docstrings (#21629 )	2024-05-13 21:27:43 +00:00
Jofthomas	afd85b60fc	huggingface: init package (#21097 ) First Pr for the langchain_huggingface partner Package - Moved some of the hugging face related class from `community` to the new `partner package` Still needed : - Documentation - Tests - Support for the new apply_chat_template in `ChatHuggingFace` - Confirm choice of class to support for embeddings witht he sentence-transformer team. cc : @efriis --------- Co-authored-by: Cyril Kondratenko <kkn1993@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-13 20:53:15 +00:00
Tomaz Bratanic	9fce03e7db	community[patch]: Fix neo4j enhanced schema (#21582 )	2024-05-13 15:26:06 -04:00
Christophe Bornet	66a4da8ad0	community[patch]: Improve Cassandra VectorStore docsctrings (#21620 )	2024-05-13 15:24:26 -04:00
adreo00	40aff1eacc	core[major]: AsyncCallbackManagerForToolRun no longer casts return object to string (#20374 ) - Description: Stops `AsyncCallbackManagerForToolRun` from converting the output to str - Issue: #20372 - Dependencies: None	2024-05-13 15:09:12 -04:00
Eugene Yurtsev	25fbe356b4	community[patch]: upgrade to recent version of mypy (#21616 ) This PR upgrades community to a recent version of mypy. It inserts type: ignore on all existing failures.	2024-05-13 14:55:07 -04:00
Eugene Yurtsev	b923951062	langchain[patch]: CI add lint rule for community imports (#21618 ) Add a rule to check for imports from community in global scope	2024-05-13 14:51:25 -04:00
Jorge Piedrahita Ortiz	4378fbbef0	community[patch]: Fix typos in Sambanova integration doc-strings (#21617 ) - Description: Sambanova integration docstrings updated, bad formated --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-05-13 18:35:16 +00:00
Erick Friis	0f5bf94f9f	infra: remove ai21 docs scan features (#21614 ) ai21 depends on ai21-tokenizer which depends on too restrictive/old version of `tokenizers`	2024-05-13 18:05:53 +00:00
ccurme	fe08421207	docs: add hybrid retrieval how-to guide (#21613 ) Updating v0.2 docs with https://github.com/langchain-ai/langchain/pull/21245	2024-05-13 14:03:55 -04:00
Christophe Bornet	bcf53f93e1	[community]: Add missing docstring param to CassandraLoader (#21611 )	2024-05-13 16:03:18 +00:00
Christophe Bornet	e6fa4547b1	community[minor]: Add alazy_load to AsyncHtmlLoader (#21536 ) Also fixes a bug that `_scrape` was called and was doing a second HTTP request synchronously. Twitter handle: cbornet_	2024-05-13 12:01:03 -04:00
Leonid Ganeline	4c48732f94	docs: `providers` updates 1 (#20256 ) - Proviers pages: added missed integrations; fixed format - `mistralai` converted from notebook to .mdx format	2024-05-13 11:54:51 -04:00
ccurme	15cb1133e7	docs: fix path for state_of_the_union sample file (#21609 )	2024-05-13 11:46:02 -04:00
Bagatur	83a8fdcfd1	infra: fix local doc make command (#21608 )	2024-05-13 08:30:30 -07:00
Eugene Yurtsev	4dc625057e	README: Update downloads to show downloads of langchain-core (#21387 ) Update downloads to keep track of langchain-core	2024-05-13 11:26:50 -04:00
Wang Guan	b53548dcda	langchain[minor]: allow CacheBackedEmbeddings to cache queries (#20073 ) Add optional caching of queries to cache backed embeddings	2024-05-13 15:18:04 +00:00
Guangdong Liu	a156aace2b	core[patch]:Fix Incorrect listeners parameters for Runnable.with_listeners() and .map() (#20661 ) - Issue: fix #20509 - @baskaryan, @eyurtsev ![image](https://github.com/langchain-ai/langchain/assets/48236177/f799a976-b983-4d8b-b373-64392e1fd6c6)	2024-05-13 11:16:17 -04:00
ccurme	b0f5a47f25	docs: update some retrievers how-to guides (#21607 )	2024-05-13 11:03:33 -04:00
junkeon	480c02bf55	upstage[minor]: add merge_and_split function for document loader (#21603 ) - Introduce the `merge_and_split` function in the `UpstageLayoutAnalysisLoader`. - The `merge_and_split` function takes a list of documents and a splitter as inputs. - This function merges all documents and then divides them using the `split_documents` method, which is a proprietary function of the splitter. - If the provided splitter is `None` (which is the default setting), the function will simply merge the documents without splitting them.	2024-05-13 10:55:19 -04:00
Leonid Ganeline	500569da48	community[patch]: `vectorstores` import update (#21169 ) Issue: we have several helper functions to import third-party libraries like lancedb.import_lancedb in [community.vectorstores](https://api.python.langchain.com/en/latest/vectorstores/langchain_community.vectorstores.lancedb.import_lancedb.html#langchain_community.vectorstores.lancedb.import_lancedb). And we have core.utils.utils.guard_import that works exactly for this purpose. The import_<package> functions work inconsistently and rather be private functions. Change: replaced these functions with the guard_import function. Related to #21133	2024-05-13 10:45:31 -04:00
ccurme	3003363605	langchain, community: remove cap on sqlalchemy and bump duckdb (#21509 )	2024-05-13 10:16:09 -04:00
ccurme	01a3228d8e	standard tests: add test for few-shot examples (#21019 )	2024-05-13 10:06:12 -04:00
David Duong	db22fcb58b	docs: style fixes for api reference docs (#21602 ) - Make sure the left nav bar is horizontally scrollable - Make sure the navigation dropdown is vertically scrollable and height capped at 80% of viewport height --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-13 06:49:50 -07:00
Chuyuan Qu	af875cff57	prompty: adding Microsoft langchain_prompty package (#21346 ) Co-authored-by: Micky Liu <wayliu@microsoft.com> Co-authored-by: wayliums <wayliums@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-11 04:03:44 +00:00
Erick Friis	56c6b5868b	infra: run codespell on v0.1 prs (#21545 )	2024-05-10 12:51:42 -07:00
Matt Florence	d3ca2cc8c3	langchain: Fix broken `OpenAIModerationChain` and implement async (#18537 ) Thank you for contributing to LangChain! ## PR title lancghain[patch]: fix `OpenAIModerationChain` and implement async ## PR message Description: fix `OpenAIModerationChain` and implement async Issues: - https://github.com/langchain-ai/langchain/issues/18533 - https://github.com/langchain-ai/langchain/issues/13685 Dependencies: none Twitter handle: mattflo ## Add tests and docs Existing documentation is broken: https://python.langchain.com/docs/guides/safety/moderation - [ x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Emilia Katari <emilia@outpace.com> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Erick Friis <erickfriis@gmail.com>	2024-05-10 19:04:13 +00:00
ccurme	4170e72a42	openai: fix loads unit test (#21542 ) following changes to tests in core here: https://github.com/langchain-ai/langchain/pull/21342/files	2024-05-10 18:46:34 +00:00
ccurme	d3ff9c5d6a	infra: turn off fail-fast for standard tests (#21541 )	2024-05-10 18:28:57 +00:00
Erick Friis	e8efe8384d	docs: announcement bar dark mode 0.2 (#21540 )	2024-05-10 10:13:02 -07:00
Erick Friis	64c47224a0	docs: baseUrl for ganalytics, throw on broken links (#21455 )	2024-05-10 13:49:59 +00:00
Usama Jamil	913792f5e6	docs: myscale code typo (#21522 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-10 13:33:22 +00:00
Sevin F. Varoglu	85cbc55f86	docs: update OctoAI LLM doc (#21528 ) This PR updates OctoAI doc to remove warnings when running the example code.	2024-05-10 09:31:16 -04:00
Daniel Glogowski	70a79f45d7	docs: update nvidia nbs (#21498 )	2024-05-10 04:38:35 -04:00
Eugene Yurtsev	39e9b644b9	docs: Add langchain over time (#21434 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-10 00:34:35 +00:00
Erick Friis	3db85cbb5b	community: deps (#21508 )	2024-05-09 15:12:34 -07:00
ccurme	9c2828aaa8	docs: add local LLMs page to v0.2 docs (#21493 ) Adding this page from v0.1 docs: https://python.langchain.com/v0.1/docs/guides/development/local_llms/	2024-05-09 17:57:56 -04:00
Erick Friis	8580e350be	cli: release 0.0.22 (#21507 )	2024-05-09 21:45:20 +00:00
Anthony Chu	c735849e76	azure-dynamic-sessions: add Python REPL tool (#21264 ) Adds a Python REPL that executes code in a code interpreter session using Azure Container Apps dynamic sessions. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-09 21:39:04 +00:00
Erick Friis	02701c277f	langchain: core min version (#21506 )	2024-05-09 13:45:44 -07:00
ccurme	81ae184cc9	docs: add response metadata page to v0.2 docs (#21489 ) Adding this page from v0.1 docs: https://python.langchain.com/v0.1/docs/modules/model_io/chat/response_metadata/	2024-05-09 16:17:04 -04:00
Erick Friis	13b01104c9	langchain: drop sqlalchemy max, release 0.2.0rc2 (#21504 )	2024-05-09 13:12:38 -07:00
ccurme	375f447e58	community: fix builds with min dependencies (#21495 )	2024-05-09 13:01:44 -07:00
Erick Friis	2be4b1b2c9	Revert "docs: redirect base slug" (#21499 ) Reverts langchain-ai/langchain#21457	2024-05-09 12:20:16 -07:00
Erick Friis	d1fc841b1a	docs: redirect base slug (#21457 )	2024-05-09 10:52:36 -07:00
Trayan Azarov	ba7d53689c	community: Chroma Adding create_collection_if_not_exists flag to Chroma constructor (#21420 ) - Description: Adds the ability to either `get_or_create` or simply `get_collection`. This is useful when dealing with read-only Chroma instances where users are constraint to using `get_collection`. Targeted at Http/CloudClients mostly. - Issue: chroma-core/chroma#2163 - Dependencies: N/A - Twitter handle: `@t_azarov` \| Collection Exists \| create_collection_if_not_exists \| Outcome \| test \| \|-------------------\|---------------------------------\|----------------------------------------------------------------\|----------------------------------------------------------\| \| True \| False \| No errors, collection state unchanged \| `test_create_collection_if_not_exist_false_existing` \| \| True \| True \| No errors, collection state unchanged \| `test_create_collection_if_not_exist_true_existing` \| \| False \| False \| Error, `get_collection()` fails \| `test_create_collection_if_not_exist_false_non_existing` \| \| False \| True \| No errors, `get_or_create_collection()` creates the collection \| `test_create_collection_if_not_exist_true_non_existing` \|	2024-05-09 11:45:10 -04:00
ccurme	3bb9bec314	bedrock: add unit test for retriever (#21485 ) This was implemented in https://github.com/langchain-ai/langchain/pull/21349 but dropped before merge.	2024-05-09 11:37:03 -04:00
Renu Rozera	4035a1d234	Add source metadata to bedrock retriever response (#21349 ) Thank you for contributing to LangChain! - [X] PR title: "community: Add source metadata to bedrock retriever response" - [X] PR message: - Description: Bedrock retrieve API returns extra metadata in the response which is currently not returned in the retriever response - Issue: The change adds the metadata from bedrock retrieve API response to the bedrock retriever in a backward compatible way. Renamed metadata to sourceMetadata as metadata term is being used in the Document already. This is in sync with what we are doing in llama-index as well. - Dependencies: No - [X] Add tests and docs: 1. Added unit tests 2. Notebook already exists and does not need any change 3. Response from end to end testing, just to ensure backward compatibility: `[Document(page_content='Exoplanets.', metadata={'location': {'s3Location': {'uri': 's3://bucket/file_name.txt'}, 'type': 'S3'}, 'score': 0.46886647, 'source_metadata': {'x-amz-bedrock-kb-source-uri': 's3://bucket/file_name.txt', 'tag': 'space', 'team': 'Nasa', 'year': 1946.0}})]` - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Piyush Jain <piyushjain@duck.com>	2024-05-09 11:06:22 -04:00
ccurme	9fa17bfabe	docs; fix links in v0.2.0 (#21483 )	2024-05-09 11:05:17 -04:00
Erick Friis	f178c67ad0	community: release 0.2.0rc1, bump deps (#21470 )	2024-05-08 23:32:44 -07:00
William FH	b28be5d407	Pass through Run ID Explicitly (#21469 )	2024-05-08 22:20:51 -07:00
Erick Friis	83eecd54fe	experimental: 0.2 relax (#21468 )	2024-05-08 21:39:42 -07:00
roiperlman	9992beaff9	community: Add arguments to whisper parser (#20378 ) Description: Added a few additional arguments to the whisper parser, which can be consumed by the underlying API. The prompt is especially important to fine-tune transcriptions. --------- Co-authored-by: Roi Perlman <roi@fivesigmalabs.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-08 17:53:13 -07:00
Erick Friis	5542eacad8	docs: sidebar autogen hidden support (#21454 )	2024-05-09 00:23:52 +00:00
Yash	cb31c3611f	Ndb enterprise (#21233 ) Description: Adds NeuralDBClientVectorStore to the langchain, which is our enterprise client. --------- Co-authored-by: kartikTAI <129414343+kartikTAI@users.noreply.github.com> Co-authored-by: Kartik Sarangmath <kartik@thirdai.com>	2024-05-08 16:30:58 -07:00
Erick Friis	74044e44a5	docs: useBaseUrl on svg paths (#21446 )	2024-05-08 21:55:42 +00:00
Oguz Vuruskaner	5b35f077f9	[community][fix](DeepInfraEmbeddings): Implement chunking for large batches (#21189 ) Description: This PR introduces chunking logic to the `DeepInfraEmbeddings` class to handle large batch sizes without exceeding maximum batch size of the backend. This enhancement ensures that embedding generation processes large batches by breaking them down into smaller, manageable chunks, each conforming to the maximum batch size limit. Issue: Fixes #21189 Dependencies: No new dependencies introduced.	2024-05-08 14:45:42 -07:00
Sokolov Fedor	f4ddf64faa	community: Add MarkdownifyTransformer to langchain_community.document_transformers (#21247 ) - Added new document_transformer: MarkdonifyTransformer, that uses `markdonify` package with customizable options to convert HTML to Markdown. It's similar to Html2TextTransformer, but has more flexible options and also I've noticed that sometimes MarkdownifyTransformer performs better than html2text one, so that's why I use markdownify on my project. - Added docs and tests - Usage: ```python from langchain_community.document_transformers import MarkdownifyTransformer markdownify = MarkdownifyTransformer() docs_transform = markdownify.transform_documents(docs) ``` - Example of better performance on simple task, that I've noticed: ``` <html> <head><title>Reports on product movement</title></head> <body> <p data-block-key="2wst7">The reports on product movement will be useful for forming supplier orders and controlling outcomes.</p> </body> ``` Html2TextTransformer: ```python [Document(page_content='The reports on product movement will be useful for forming supplier orders and\ncontrolling outcomes.\n\n')] # Here we can see 'and\ncontrolling', which has extra '\n' in it ``` MarkdownifyTranformer: ```python [Document(page_content='Reports on product movement\n\nThe reports on product movement will be useful for forming supplier orders and controlling outcomes.')] ``` --------- Co-authored-by: Sokolov Fedor <f.sokolov@sokolov-macbook.bbrouter> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Sokolov Fedor <f.sokolov@sokolov-macbook.local> Co-authored-by: Sokolov Fedor <f.sokolov@192.168.1.6>	2024-05-08 14:45:13 -07:00
Alex JW	d3ce6aad2e	community: Instantiate GPT4AllEmbeddings with parameters (#21238 ) ### GPT4AllEmbeddings parameters --- Description: As of right now the Embed4All class inside _GPT4AllEmbeddings_ is instantiated as it's default which leaves no room to customize the chosen model and it's behavior. Thus: - GPT4AllEmbeddings can now be instantiated with custom parameters like a different model that shall be used. --------- Co-authored-by: AlexJauchWalser <alexander.jauch-walser@knime.com>	2024-05-08 14:44:47 -07:00
Philippe PRADOS	7be68228da	community[patch]: Make sql record manager fully compatible with async (#20735 ) The `_amake_session()` method does not allow modifying the `self.session_factory` with anything other than `async_sessionmaker`. This prohibits advanced uses of `index()`. In a RAG architecture, it is necessary to import document chunks. To keep track of the links between chunks and documents, we can use the `index()` API. This API proposes to use an SQL-type record manager. In a classic use case, using `SQLRecordManager` and a vector database, it is impossible to guarantee the consistency of the import. Indeed, if a crash occurs during the import (problem with the network, ...) there is an inconsistency between the SQL database and the vector database. With the [PR](https://github.com/langchain-ai/langchain-postgres/pull/32) we are proposing for `langchain-postgres`, it is now possible to guarantee the consistency of the import of chunks into a vector database. It's possible only if the outer session is built with the connection. ```python def main(): db_url = "postgresql+psycopg://postgres:password_postgres@localhost:5432/" engine = create_engine(db_url, echo=True) embeddings = FakeEmbeddings() pgvector:VectorStore = PGVector( embeddings=embeddings, connection=engine, ) record_manager = SQLRecordManager( namespace="namespace", engine=engine, ) record_manager.create_schema() with engine.connect() as connection: session_maker = scoped_session(sessionmaker(bind=connection)) # NOTE: Update session_factories record_manager.session_factory = session_maker pgvector.session_maker = session_maker with connection.begin(): loader = CSVLoader( "data/faq/faq.csv", source_column="source", autodetect_encoding=True, ) result = index( source_id_key="source", docs_source=loader.load()[:1], cleanup="incremental", vector_store=pgvector, record_manager=record_manager, ) print(result) ``` The same thing is possible asynchronously, but a bug in `sql_record_manager.py` in `_amake_session()` must first be fixed. ```python async def _amake_session(self) -> AsyncGenerator[AsyncSession, None]: """Create a session and close it after use.""" # FIXME: REMOVE if not isinstance(self.session_factory, async_sessionmaker):~~ if not isinstance(self.engine, AsyncEngine): raise AssertionError("This method is not supported for sync engines.") async with self.session_factory() as session: yield session ``` Then, it is possible to do the same thing asynchronously: ```python async def main(): db_url = "postgresql+psycopg://postgres:password_postgres@localhost:5432/" engine = create_async_engine(db_url, echo=True) embeddings = FakeEmbeddings() pgvector:VectorStore = PGVector( embeddings=embeddings, connection=engine, ) record_manager = SQLRecordManager( namespace="namespace", engine=engine, async_mode=True, ) await record_manager.acreate_schema() async with engine.connect() as connection: session_maker = async_scoped_session( async_sessionmaker(bind=connection), scopefunc=current_task) record_manager.session_factory = session_maker pgvector.session_maker = session_maker async with connection.begin(): loader = CSVLoader( "data/faq/faq.csv", source_column="source", autodetect_encoding=True, ) result = await aindex( source_id_key="source", docs_source=loader.load()[:1], cleanup="incremental", vector_store=pgvector, record_manager=record_manager, ) print(result) asyncio.run(main()) ``` --------- Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Sean <sean@upstage.ai> Co-authored-by: JuHyung-Son <sonju0427@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: YISH <mokeyish@hotmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Jason_Chen <820542443@qq.com> Co-authored-by: Joan Fontanals <joan.fontanals.martinez@jina.ai> Co-authored-by: Pavlo Paliychuk <pavlo.paliychuk.ca@gmail.com> Co-authored-by: fzowl <160063452+fzowl@users.noreply.github.com> Co-authored-by: samanhappy <samanhappy@gmail.com> Co-authored-by: Lei Zhang <zhanglei@apache.org> Co-authored-by: Tomaz Bratanic <bratanic.tomaz@gmail.com> Co-authored-by: merdan <48309329+merdan-9@users.noreply.github.com> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Andres Algaba <andresalgaba@gmail.com> Co-authored-by: davidefantiniIntel <115252273+davidefantiniIntel@users.noreply.github.com> Co-authored-by: Jingpan Xiong <71321890+klaus-xiong@users.noreply.github.com> Co-authored-by: kaka <kaka@zbyte-inc.cloud> Co-authored-by: jingsi <jingsi@leadincloud.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Rahul Triptahi <rahul.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Shengsheng Huang <shannie.huang@gmail.com> Co-authored-by: Michael Schock <mjschock@users.noreply.github.com> Co-authored-by: Anish Chakraborty <anish749@users.noreply.github.com> Co-authored-by: am-kinetica <85610855+am-kinetica@users.noreply.github.com> Co-authored-by: Dristy Srivastava <58721149+dristysrivastava@users.noreply.github.com> Co-authored-by: Matt <matthew.gotteiner@microsoft.com> Co-authored-by: William FH <13333726+hinthornw@users.noreply.github.com>	2024-05-08 17:31:11 -04:00
Andreas Motl	17e42bbd18	community[patch]: pgvector: Slight refactoring to make code a bit more reusable (#16243 ) - Description: Improve [pgvector vector store adapter](https://github.com/langchain-ai/langchain/blob/v0.1.1/libs/community/langchain_community/vectorstores/pgvector.py) to make it reusable by adapters deriving from that. - Issue: NA - Dependencies: NA - References: https://github.com/crate-workbench/langchain/pull/1 - Addressed to: @eyurtsev, @cbornet Hi from the CrateDB team, first of all, thanks a stack for conceiving and maintaining LangChain. We are currently [preparing a patch](https://github.com/crate-workbench/langchain/pull/1) for adding [CrateDB](https://github.com/crate/crate) to the list of community adapters. Because CrateDB aims to be compatible with PostgreSQL to some degree, the vector store subsystem in LangChain derives functionality from the corresponding implementation for pgvector. Therefore, in order to make the implementation more reusable, we needed to rename the private methods `__from` and `__query_collection` to the less private counterparts `_from` and `_query_collection`, so they can be overwritten, in order to unlock other adapters deriving from [pgvector](https://github.com/langchain-ai/langchain/blob/v0.1.1/libs/community/langchain_community/vectorstores/pgvector.py). With kind regards, Andreas.	2024-05-08 17:21:30 -04:00
Mehrdad Shokri	f103927b88	bugfix(community): fix Playwright import paths. (#21395 ) - Description: Fix import class name exporeted from 'playwright.async_api' and 'playwright.sync_api' to match the correct name in playwright tool. Change import from inline guard_import to helper function that calls guard_import to make code more readable in gmail tool. Upgrade playwright version to 1.43.0 - Issue: #21354 - Dependencies: upgrade playwright version(this is not required for the bugfix itself, just trying to keep dependencies fresh. I can remove the playwright version upgrade if you want.)	2024-05-08 14:20:25 -07:00
Shailendra Mishra	aa966b6161	Replaced bind variable in SQL with formatted string for compatibility with sql syntax. (#21439 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-08 13:51:30 -07:00
Eugene Yurtsev	f92006de3c	multiple: langchain 0.2 in master (#21191 ) 0.2rc migrations - [x] Move memory - [x] Move remaining retrievers - [x] graph_qa chains - [x] some dependency from evaluation code potentially on math utils - [x] Move openapi chain from `langchain.chains.api.openapi` to `langchain_community.chains.openapi` - [x] Migrate `langchain.chains.ernie_functions` to `langchain_community.chains.ernie_functions` - [x] migrate `langchain/chains/llm_requests.py` to `langchain_community.chains.llm_requests` - [x] Moving `langchain_community.cross_enoders.base:BaseCrossEncoder` -> `langchain_community.retrievers.document_compressors.cross_encoder:BaseCrossEncoder` (namespace not ideal, but it needs to be moved to `langchain` to avoid circular deps) - [x] unit tests langchain -- add pytest.mark.community to some unit tests that will stay in langchain - [x] unit tests community -- move unit tests that depend on community to community - [x] mv integration tests that depend on community to community - [x] mypy checks Other todo - [x] Make deprecation warnings not noisy (need to use warn deprecated and check that things are implemented properly) - [x] Update deprecation messages with timeline for code removal (likely we actually won't be removing things until 0.4 release) -- will give people more time to transition their code. - [ ] Add information to deprecation warning to show users how to migrate their code base using langchain-cli - [ ] Remove any unnecessary requirements in langchain (e.g., is SQLALchemy required?) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-08 16:46:52 -04:00
ccurme	6b392d6d12	robocorp: release 0.0.6 (#21441 )	2024-05-08 16:16:24 -04:00
Erick Friis	21d14549a9	docs: v0.2 docs in master (#21438 ) current python.langchain.com is building from branch `v0.1`. Iterate on v0.2 docs here. --------- Signed-off-by: Weichen Xu <weichen.xu@databricks.com> Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: jacoblee93 <jacoblee93@gmail.com> Co-authored-by: Leonid Ganeline <leo.gan.57@gmail.com> Co-authored-by: Leonid Kuligin <lkuligin@yandex.ru> Co-authored-by: Averi Kitsch <akitsch@google.com> Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: Nuno Campos <nuno@boringbits.io> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Martín Gotelli Ferenaz <martingotelliferenaz@gmail.com> Co-authored-by: Fayfox <admin@fayfox.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev> Co-authored-by: Dawson Bauer <105886620+djbauer2@users.noreply.github.com> Co-authored-by: Ravindu Somawansa <ravindu.somawansa@gmail.com> Co-authored-by: Dhruv Chawla <43818888+Dominastorm@users.noreply.github.com> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: WeichenXu <weichen.xu@databricks.com> Co-authored-by: Benito Geordie <89472452+benitoThree@users.noreply.github.com> Co-authored-by: kartikTAI <129414343+kartikTAI@users.noreply.github.com> Co-authored-by: Kartik Sarangmath <kartik@thirdai.com> Co-authored-by: Sevin F. Varoglu <sfvaroglu@octoml.ai> Co-authored-by: MacanPN <martin.triska@gmail.com> Co-authored-by: Prashanth Rao <35005448+prrao87@users.noreply.github.com> Co-authored-by: Hyeongchan Kim <kozistr@gmail.com> Co-authored-by: sdan <git@sdan.io> Co-authored-by: Guangdong Liu <liugddx@gmail.com> Co-authored-by: Rahul Triptahi <rahul.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: pjb157 <84070455+pjb157@users.noreply.github.com> Co-authored-by: Eun Hye Kim <ehkim1440@gmail.com> Co-authored-by: kaijietti <43436010+kaijietti@users.noreply.github.com> Co-authored-by: Pengcheng Liu <pcliu.fd@gmail.com> Co-authored-by: Tomer Cagan <tomer@tomercagan.com> Co-authored-by: Christophe Bornet <cbornet@hotmail.com>	2024-05-08 12:29:59 -07:00
Tommi Holmgren	ee35b9ba56	langchain-robocorp: remove toolkit return content max length (#21436 ) Robocorp (action server) toolkit had a limitation that the content length returned by the tool was always cut to max 5000 chars. This was from the time when context windows were much more limited. This PR removes the limitation. Whatever the underlying tool provides gets sent back to the agent. As the robocorp toolkit no longer restricts the content, the implication is that either the Action (tool) developer or the agent developer needs to be aware of potentially oversized tool responses. Our point of view is this should be the agent developer's responsibility, them being in control of the use case and aware of the context window the LLM has.	2024-05-08 15:05:55 -04:00
JuHyung Son	710e57d779	upstage: deprecate UPSTAGE_DOCUMENT_AI_API_KEY (#21363 ) Description: We are merging UPSTAGE_DOCUMENT_AI_API_KEY and UPSTAGE_API_KEY into one, and only UPSTAGE_API_KEY will be used going forward. And we changed the base class of ChatUpstage to BaseChatOpenAI. --------- Co-authored-by: Sean <chosh0615@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-08 18:02:26 +00:00
Erick Friis	6a295d1ec0	upstage: release 0.1.4 (#21432 )	2024-05-08 17:57:40 +00:00
Mateusz Szewczyk	7926cc1929	ibm: Fix llm and embeddings "verify" attribute default value (#21429 ) Thank you for contributing to LangChain! - [x] PR title: "langchain-ibm: Fix llm and embeddings 'verify' attribute default value" - [x] PR message: - Description: fix default value of "verify" attribute - Dependencies: `ibm_watsonx_ai` - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-08 17:23:14 +00:00
Kevin Zhang	0715545378	docs: fix typo in text (#21393 ) Description: The previous text had an unclosed parenthesis, this fix adds the closing parenthesis	2024-05-08 15:58:15 +00:00
Dobiichi-Origami	5b00885b49	community: add `bind_tools` and `with_structured_output` support to `QianfanChatEndpoint` (#21412 ) …Endpoint` Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: add `bind_tools` and `with_structured_output` support to `QianfanChatEndpoint` - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-05-08 11:35:10 -04:00
Silas Xu	aafaf3e193	The predict_and_parse is deprecated, instead pass an output parser directly to LLMChain. (#20130 ) The `predict_and_parse` method is deprecated, instead pass an output parser directly to LLMChain. - [x] PR title: "langchain: update chain_extract.py" ![image](https://github.com/langchain-ai/langchain/assets/40889019/e950d79f-5a0f-4086-86e9-89f627990fe5) --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-08 09:32:17 -04:00
ccurme	3c31bd0ed0	langchain: update use of predict_and_parse in LLMChainFilter (#21389 ) Following https://github.com/langchain-ai/langchain/pull/20130 Removes deprecation warnings in docs here: https://python.langchain.com/docs/modules/data_connection/retrievers/contextual_compression/ Tested using the same docs notebook + existing integration test.	2024-05-08 09:31:33 -04:00
Tomaz Bratanic	dd70f2f473	Update graph docs (#21414 ) Update the deprecated docs and added node properties to graph construction	2024-05-08 09:05:39 -04:00
Erick Friis	bbdf0f8801	experimental[patch]: core and langchain dep (#21402 )	2024-05-07 21:39:34 -07:00
Erick Friis	e4aca0d052	experimental[patch]: release 0.0.58 (#21397 )	2024-05-08 03:52:44 +00:00
Erick Friis	893f06b5de	infra: rewrite ipynb links to md (#21392 )	2024-05-07 23:16:52 +00:00
Hassan El Mghari	225ceedcb6	docs: Add together docs in chat models & update provider docs (#21391 ) - Added Together docs in chat models section - Update Together provider docs to match the LLM & chat models sections --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-07 22:40:57 +00:00
Heidi Steen	af97d58c9e	docs: update docs/integrations/retrievers/azure_ai_search.ipynb (#21160 ) This is a doc update. It fixes up formatting and product name references. The example code is updated to use a local built-in text file. @mmhangami Please take a look --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-05-07 22:33:46 +00:00
snova-jamesv	ca753e7c15	community: updated performance limitation wording in sambanova.ipynb (#21390 ) - Description: updated performance limitation wording in sambanova.ipynb - Issue: NA - Dependencies: NA - Twitter handle: NA	2024-05-07 22:21:46 +00:00
Leonid Ganeline	791d59a2c8	community: `callbacks` guard_imports (#21173 ) Issue: we have several helper functions to import third-party libraries like import_uptrain in [community.callbacks](https://api.python.langchain.com/en/latest/callbacks/langchain_community.callbacks.uptrain_callback.import_uptrain.html#langchain_community.callbacks.uptrain_callback.import_uptrain). And we have core.utils.utils.guard_import that works exactly for this purpose. The import_<package> functions work inconsistently and rather be private functions. Change: replaced these functions with the guard_import function. Related to #21133	2024-05-07 15:04:54 -07:00
Hassan El Mghari	416549bed2	docs: Updated Together integration docs (#21388 ) Description: Updated the together integration docs by leading with the streaming example, explicitly specifying a model to show users how to do that, and updating the sections to more closely match other integrations.	2024-05-07 21:51:42 +00:00
Rahul Triptahi	7994cba18d	[Community][Minor]: Fetch loader_source of GoogleDriveLoader in PebbloSafeLoader. (#21314 ) Description: This PR includes fix for loader_source to be fetched from metadata in case of GdriveLoaders. Documentation: NA Unit Test: NA Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-05-07 14:45:58 -07:00
Leonid Ganeline	7cbf1c31aa	docs: table legend updated (#21351 ) Compacted the table column legends. Added links. Similar to #21259	2024-05-07 14:45:04 -07:00
Erick Friis	d5bde4fa91	infra: use nbconvert for docs build (#21135 ) todo - [x] remove quarto build semantics - [x] remove quarto download/install - [x] make `uv` not verbose	2024-05-07 12:30:17 -07:00
Nuno Campos	ad0f3c14c2	core: allow mermaid node labels to have any characters (#21385 ) - it's only node ids that are limited Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-07 12:16:49 -07:00
Eugene Yurtsev	6a1d61dbf1	community[patch]: Fix in memory vectorstore to take into account ids when adding docs (#21384 ) Should respect `ids` if passed	2024-05-07 15:05:16 -04:00
Ikko Eltociear Ashimine	80170da6c5	docs: update cassandra_database.ipynb (#21145 ) Enviroment -> Environment	2024-05-07 15:00:24 -04:00
Miroslav	04e2611fea	Added additional headers for HuggingFaceInferenceAPIEmbeddings endpoint. (#21282 ) Thank you for contributing to LangChain! - [ ] HuggingFaceInferenceAPIEmbeddings: "Additional Headers" - Where: langchain, community, embeddings. huggingface.py. - Community: add additional headers when needed by custom HuggingFace TEI embedding endpoints. HuggingFaceInferenceAPIEmbeddings" - [ ] PR message: *Delete this entire checklist* and replace with - Description: Adding the `additional_headers` to be passed to requests library if needed - Dependencies: none - [ ] Add tests and docs: If you're adding a new integration, please include 1. Tested with locally available TEI endpoints with and without `additional_headers` 2. Example Usage ```python embeddings=HuggingFaceInferenceAPIEmbeddings( api_key=MY_CUSTOM_API_KEY, api_url=MY_CUSTOM_TEI_URL, additional_headers={ "Content-Type": "application/json" } ) ``` Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Massimiliano Pronesti <massimiliano.pronesti@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-05-07 14:17:53 -04:00
Ikko Eltociear Ashimine	c34419e200	docs: update quick_start.ipynb (#21358 ) initalize -> initialize - [x] PR title: "package: description"	2024-05-07 08:44:48 -07:00
Guangdong Liu	1fe66f5d39	community(patch) fix MoonshotChat moonshot_api_key is invaild for api key (#21361 ) Description: close https://github.com/langchain-ai/langchain/issues/21237 @baskaryan, @eyurtsev	2024-05-07 08:44:30 -07:00
snova-jamesv	c2ed484653	community: add Sambaverse rate limitation info to sambanova.ipynb (#21379 ) - Description: add Sambaverse rate limitation info to sambanova.ipynb - Issue: NA - Dependencies: NA	2024-05-07 15:42:44 +00:00
Tomaz Bratanic	0bf7596839	Add simple node properties to llm graph transformer (#21369 ) Add support for simple node properties in llm graph transformer. Linter and dynamic pydantic classes aren't friends, hence I added two ignores	2024-05-07 08:41:09 -07:00
ccurme	080af0ec53	langchain: sync -> async methods in OpenAI assistants (#21378 )	2024-05-07 10:25:55 -04:00
Tomaz Bratanic	ad3fd44a7f	experimental: Fix llm graph transformer bug (#21362 )	2024-05-06 23:59:55 -07:00
Erick Friis	bb81ae5c8c	together: fix chat model and embedding classes (#21353 )	2024-05-06 18:26:03 -07:00
Hassan El Mghari	d6ef5fe86a	together: add chat models, use openai base (#21337 ) Description: Adding chat completions to the Together AI package, which is our most popular API. Also staying backwards compatible with the old API so folks can continue to use the completions API as well. Also moved the embedding API to use the OpenAI library to standardize it further. Twitter handle: @nutlope - [x] Add tests and docs: If you're adding a new integration, please include - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-06 17:47:06 -07:00
Jacob Lee	a2d31307bb	Adds confirmation logs after creating a new project (#12618 ) @efriis @hwchase17 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-06 23:28:12 +00:00
Erick Friis	0fb93cd740	core: release 0.1.52 (#21350 )	2024-05-06 22:20:35 +00:00
Wu Enze	32c61b3ece	community[patch]: chat message history mypy fixes #17048 (#20114 ) Relates [#17048] Description : Applied fix to redis and neo4j file. Error was : `Cannot override writeable attribute with read-only property` fix with the same solution of [[langchain/libs/community/langchain_community/chat_message_histories/elasticsearch.py](`d5c412b0a9/libs/community/langchain_community/chat_message_histories/elasticsearch.py (L170-L175)`)] --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-06 22:17:45 +00:00
nrpd25	95cc8e3fc3	premai[patch]:Standardized model init args (#21308 ) [Standardized model init args #20085](https://github.com/langchain-ai/langchain/issues/20085) - Enable premai chat model to be initialized with `model_name` as an alias for `model`, `api_key` as an alias for `premai_api_key`. - Add initialization test `test_premai_initialization` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-06 18:12:29 -04:00
Nuno Campos	6f17158606	fix: core: Include in json output also fields set outside the constructor (#21342 )	2024-05-06 14:37:36 -07:00
Tomaz Bratanic	ac14f171ac	Add indexed properties to neo4j enhanced schema (#21335 )	2024-05-06 14:28:34 -07:00
scaserini	a6cdf6572f	community: add Kendra DocumentRelevanceOverrideConfigurations request parameter (#20695 ) - Description: add DocumentRelevanceOverrideConfigurations request parameter to Kendra retriever Co-authored-by: Simone Caserini <simone.caserini@klarna.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-06 14:26:36 -07:00
Nuno Campos	0345bcf4ef	Fix failing test for serialization (#21344 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-06 21:19:54 +00:00
Trayan Azarov	93226b1945	community: Updated Chroma version range to include 0.5.0 release (#21224 ) - Updated Chroma version range to allow releases in 0.5.x. - Bumped mypy version as linting was failing	2024-05-06 13:31:40 -07:00
Jorge Piedrahita Ortiz	e65652c3e8	community: add SambaNova embeddings integration (#21227 ) - Description: SambaNova hosted embeddings integration	2024-05-06 13:29:59 -07:00
Jorge Piedrahita Ortiz	df1c10260c	community: minor changes sambanova integration (#21231 ) - Description: fix: variable names in root validator not allowing pass credentials as named parameters in llm instancing, also added sambanova's sambaverse and sambastudio llms to __init__.py for module import	2024-05-06 13:28:35 -07:00
Jan Soubusta	d9a61c0fa9	fix: respect table_name argument when calling from_texts (#21252 ) valid for from_documents() as well fixes #21251	2024-05-06 20:28:22 +00:00
Pedro Lima	bebf46c4a2	community: added args_schema to YahooFinanceNewsTool (#21232 ) Description: this change adds args_schema (pydantic BaseModel) to YahooFinanceNewsTool for correct schema formatting on LLM function calls Issue: currently using YahooFinanceNewsTool with OpenAI function calling returns the following error "TypeError("YahooFinanceNewsTool._run() got an unexpected keyword argument '__arg1'")". This happens because the schema sent to the LLM is "input: "{'__arg1': 'MSFT'}"" while the method should be called with the "query" parameter. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-06 13:27:54 -07:00
Mark Cusack	060987d755	community[minor]: Add indexing via locality sensitive hashing to the Yellowbrick vector store (#20856 ) - Description: Add LSH-based indexing to the Yellowbrick vector store module - Twitter handle: @markcusack --------- Co-authored-by: markcusack <markcusack@markcusacksmac.lan> Co-authored-by: markcusack <markcusack@Mark-Cusack-sMac.local> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-05-06 20:18:02 +00:00
Rashmi Pawar	a2fdabdad2	mark NemoEmbeddings as deprecated (#21239 ) The NemoEmbeddings is deprecated, instead use langchain-nvidia-ai-endpoints NVIDIAEmbeddings interface. cc: @mattf --------- Co-authored-by: Daniel Glogowski <167348611+dglogo@users.noreply.github.com> Co-authored-by: andyjessen <62343929+andyjessen@users.noreply.github.com> Co-authored-by: Chris Germann <88305668+TAAGECH9@users.noreply.github.com> Co-authored-by: gere <gere@kapo.zh.ch> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-06 19:44:58 +00:00
Erick Friis	9e4b24a2d6	langchain: release 0.1.18 (#21338 )	2024-05-06 19:39:46 +00:00
Erick Friis	5c000f8d79	community: release 0.0.37 (#21332 )	2024-05-06 12:17:42 -07:00
Leonid Ganeline	8c13e8a79b	langchain: `qa_chain` fix (#21279 ) Issue: `load_qa_chain` is placed in the __init__.py file. As a result, it is not listed in the API Reference docs. BTW `load_qa_chain` is heavily presented in the doc examples, but is missed in API Ref. Change: moved code from init.py into a new file. Related: #21266	2024-05-06 14:45:51 -04:00
Erick Friis	7ecf9996f1	community: Revert "community: langkit dependency" (#21333 ) Reverts langchain-ai/langchain#21174 Hey team - going to revert this because it doesn't seem necessary for testing. We should only be adding optional + extended_testing dependencies for deps that have extended tests. otherwise it just increases probability of dependency conflicts in the community lockfile.	2024-05-06 18:44:41 +00:00
Param Singh	fee91d43b7	baichuan[patch]:standardize chat init args (#21298 ) Thank you for contributing to LangChain! community:baichuan[patch]: standardize init args updated `baichuan_api_key` so that aliased to `api_key`. Added test that it continues to set the same underlying attribute. Test checks for `SecretStr` updated `temperature` with Pydantic Field, added unit test. Related to https://github.com/langchain-ai/langchain/issues/20085	2024-05-06 18:33:57 +00:00
Leonid Ganeline	62559b20b3	docs: `chains` page format (#21259 ) Compacted the table column descriptions.	2024-05-06 11:33:38 -07:00
Christophe Bornet	484a009012	community[minor]: Relax constraints on Cassandra VectorStore constructors (#21209 ) If Session and/or keyspace are not provided, they are resolved from cassio's context. So they are not required. This change is fully backward compatible.	2024-05-06 14:32:32 -04:00
Daniel Glogowski	27e73ebe57	docs: update nvidia docs v2 (#21288 ) More doc updates por favor @baskaryan!	2024-05-06 11:29:02 -07:00
Leonid Ganeline	6feddfae88	community: langkit dependency (#21174 ) Issue: the `langkit` package is not presented in the `pyproject.toml` but it is a requirement for the `WhyLabsCallbackHandler` Change: added `langkit` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-06 18:09:31 +00:00
Erick Friis	811e9cee8b	core: release 0.1.51 (#21328 )	2024-05-06 10:40:19 -07:00
Pengcheng Liu	144f2821af	docs: add example for loading data from LarkSuite wiki. (#21311 ) Description: Update LarkSuite loader doc to give an example for loading data from LarkSuite wiki. Issue: None Dependencies: None Twitter handle: None	2024-05-06 09:56:12 -07:00
Mateusz Szewczyk	682d21c3de	ibm: Add support for ibm-watsonx-ai new major version (#21313 ) Thank you for contributing to LangChain! - [x] PR title: "langchain-ibm: Add support for ibm-watsonx-ai new major version" - [x] PR message: - Description: Add support for ibm-watsonx-ai new major version - Dependencies: `ibm_watsonx_ai` - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-06 16:48:26 +00:00
Chris Papademetrious	ee6c922c91	langchain[minor]: enhance `LocalFileStore` to offer `update_atime` parameter that updates access times on read (#20951 ) Description: The `LocalFileStore` class can be used to create an on-disk `CacheBackedEmbeddings` cache. The number of files in these embeddings caches can grow to be quite large over time (hundreds of thousands) as embeddings are computed for new versions of content, but the embeddings for old/deprecated content are not removed. A least-recently-used (LRU) cache policy could be applied to the `LocalFileStore` directory to delete cache entries that have not been referenced for some time: ```bash # delete files that have not been accessed in the last 90 days find embeddings_cache_dir/ -atime 90 -print0 \| xargs -0 rm ``` However, most filesystems in enterprise environments disable access time modification on read to improve performance. As a result, the access times of these cache entry files are not updated when their values are read. To resolve this, this pull request updates the `LocalFileStore` constructor to offer an `update_atime` parameter that causes access times to be updated when a cache entry is read. For example, ```python file_store = LocalFileStore(temp_dir, update_atime=True) ``` The default is `False`, which retains the original behavior. Testing: I updated the LocalFileStore unit tests to test the access time update.	2024-05-06 11:52:29 -04:00
Tomaz Bratanic	5b6d1a907d	Add the extract types to diffbot graph transformer (#21315 ) Before you could only extract triples (diffbot calls it facts) from diffbot to avoid isolated nodes. However, sometimes isolated nodes can still be useful like for prefiltering, so we want to allow users to extract them if they want. Default behaviour is unchanged.	2024-05-06 09:19:52 -04:00
Jagadish Krishnamoorthy	c038991590	docs: Update pandas.ipynb (#21289 ) Remove the redundant comment.	2024-05-05 20:22:17 +00:00
aditya thomas	b868c78a12	partners[anthropic]: update unit test for key passed in from the environment (#21290 ) Description: Update unit test for ChatAnthropic Issue: Test for key passed in from the environment should not have the key initialized in the constructor Dependencies: None	2024-05-05 16:19:10 -04:00
tanersekmen	d310f9c71e	docs:update code structure (#21302 ) update the structure of llm_chain variables Co-authored-by: tanersemenn <0418>	2024-05-05 17:18:15 +00:00
Christophe Bornet	ba9dc04ffa	docs: Add doc for hybrid search (#21245 ) See [preview](https://langchain-git-fork-cbornet-doc-hybrid-search-langchain.vercel.app/docs/use_cases/question_answering/hybrid/) In the model of [per user retrieval](https://python.langchain.com/docs/use_cases/question_answering/per_user/)	2024-05-04 08:22:56 -04:00
Rohan Aggarwal	8021d2a2ab	community[minor]: Oraclevs integration (#21123 ) Thank you for contributing to LangChain! - Oracle AI Vector Search Oracle AI Vector Search is designed for Artificial Intelligence (AI) workloads that allows you to query data based on semantics, rather than keywords. One of the biggest benefit of Oracle AI Vector Search is that semantic search on unstructured data can be combined with relational search on business data in one single system. This is not only powerful but also significantly more effective because you don't need to add a specialized vector database, eliminating the pain of data fragmentation between multiple systems. - Oracle AI Vector Search is designed for Artificial Intelligence (AI) workloads that allows you to query data based on semantics, rather than keywords. One of the biggest benefit of Oracle AI Vector Search is that semantic search on unstructured data can be combined with relational search on business data in one single system. This is not only powerful but also significantly more effective because you don't need to add a specialized vector database, eliminating the pain of data fragmentation between multiple systems. This Pull Requests Adds the following functionalities Oracle AI Vector Search : Vector Store Oracle AI Vector Search : Document Loader Oracle AI Vector Search : Document Splitter Oracle AI Vector Search : Summary Oracle AI Vector Search : Oracle Embeddings - We have added unit tests and have our own local unit test suite which verifies all the code is correct. We have made sure to add guides for each of the components and one end to end guide that shows how the entire thing runs. - We have made sure that make format and make lint run clean. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: skmishraoracle <shailendra.mishra@oracle.com> Co-authored-by: hroyofc <harichandan.roy@oracle.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-04 03:15:35 +00:00
ccurme	c9e9470c5a	langchain: fix deprecation decorators on extraction chains (#21276 ) Calling any of these raises ``` ValueError: A pending deprecation cannot have a scheduled removal ```	2024-05-03 18:29:40 -04:00
Wickes Wong	ee1adaacaa	langchain[patch]: Fix summary buffer memory with return message flag (#21115 ) ## Description Memory return could be set as `str` or `message` by `return_messages` flag as mentioned in https://python.langchain.com/docs/modules/memory/#whether-memory-is-a-string-or-a-list-of-messages, where `langchain.chains.conversation.memory.ConversationSummaryBufferMemory` did not implement that. This commit added `buffer_as_str` and `buffer_as_messages` function, and `buffer` now affected by `return_messages` flag. ## Example Test Code and Output ```python # Fix: ConversationSummaryBufferMemory with return_messages flag function # Test code from langchain.chains.conversation.memory import ConversationSummaryBufferMemory from langchain_community.llms.ollama import Ollama llm = Ollama() # Create an instance of ConversationSummaryBufferMemory with return_messages set to True memory = ConversationSummaryBufferMemory(return_messages=True, llm=llm) # Add user and AI messages to the chat memory memory.chat_memory.add_user_message("hi!") memory.chat_memory.add_ai_message("what's up?") # Print the buffer print("Buffer:") print(map(type, memory.buffer), sep="\n") print(memory.buffer, "\n") # Print the buffer as a string print("Buffer as String:") print(type(memory.buffer_as_str)) print(memory.buffer_as_str, "\n") # Print the buffer as messages print("Buffer as Messages:") print(map(type, memory.buffer_as_messages), sep="\n") print(memory.buffer_as_messages, "\n") # Print the buffer after setting return_messages to False memory.return_messages = False print("Buffer after setting return_messages to False:") print(type(memory.buffer)) print(memory.buffer, "\n") ``` ```plaintext Buffer: <class 'langchain_core.messages.human.HumanMessage'> <class 'langchain_core.messages.ai.AIMessage'> [HumanMessage(content='hi!'), AIMessage(content="what's up?")] Buffer as String: <class 'str'> Human: hi! AI: what's up? Buffer as Messages: <class 'langchain_core.messages.human.HumanMessage'> <class 'langchain_core.messages.ai.AIMessage'> [HumanMessage(content='hi!'), AIMessage(content="what's up?")] Buffer after setting return_messages to False: <class 'str'> Human: hi! AI: what's up? ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-03 17:25:09 -04:00
Leonid Ganeline	9639457222	community[patch]: `tools` imports (#21156 ) Issue: we have several helper functions to import third-party libraries like tools.gmail.utils.import_google in [community.tools](https://api.python.langchain.com/en/latest/community_api_reference.html#id37). And we have core.utils.utils.guard_import that works exactly for this purpose. The import_<package> functions work inconsistently and rather be private functions. Change: replaced these functions with the guard_import function. Related to #21133	2024-05-03 17:22:45 -04:00
Leonid Ganeline	3ef8b24277	core[patch]: `utils.guard_import` fix (#21133 ) Issues (nit): 1. `utils.guard_import` prints wrong error message when there is an import `error.` It prints the whole `module_name` but should be only the first part as the pip package name. E.i. `langchain_core.utils` -> print not `langchain-core` but `langchain_core.utils`. Also replace '_' with '-' in the pip package name. 2. it does not handle the `ModuleNotFoundError` which raised if `guard_import("wrong_module")` Fixed issues; added ut-s. Controversial: I've reraised `ModuleNotFoundError` as `ImportError`, since in case of the error, the proposed action is the same - we need to install a missed package.	2024-05-03 17:21:36 -04:00
Erick Friis	36c2ca3c8b	mistralai: relax tokenizers dep (#21277 )	2024-05-03 14:16:22 -07:00
Nuno Campos	6e1e0c7d5c	fix: core: draw_mermaid() would create subgroup for edges with same src and tgt (#21275 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-05-03 13:51:08 -07:00
Eugene Yurtsev	26a37dce0a	langchain[patch]: Remove jsonpatch from poetry file (#21272 ) jsonpatch is only used in langchain-core not in langchain	2024-05-03 15:46:05 -04:00
Eugene Yurtsev	335bd01e45	langchain[patch]: Update deprecation warning (#21268 ) Update deprecation warning	2024-05-03 15:31:29 -04:00
Leonid Ganeline	23a05c3986	langchain: `summarize` chain fix (#21266 ) Issue: `load_summarize_chain` is placed in the __init__.py file. As a result, it doesn't listed in the API Reference docs. Change: moved code from __init__.py into a new file.	2024-05-03 14:44:39 -04:00
ccurme	6da3d92b42	(all): update removal in deprecation warnings from 0.2 to 0.3 (#21265 ) We are pushing out the removal of these to 0.3. `find . -type f -name "*.py" -exec sed -i '' 's/removal="0\.2/removal="0.3/g' {} +`	2024-05-03 14:29:36 -04:00
Eugene Yurtsev	d6e34f9ee5	langchain[patch]: Improve deprecation warnings (#21262 ) * Remove spurious derprecation warning * Make deprecation warnings consistent with 0.1 namespaces that were announced as deprecated	2024-05-03 13:40:16 -04:00
Eugene Yurtsev	487aff7e46	langchain[patch]: Revert 20794 until 0.2 release (#21257 ) PR of 2079 was already released as part of 0.1.17rc. Issue for 0.2 release: https://github.com/langchain-ai/langchain/issues/21080	2024-05-03 17:02:48 +00:00
Eugene Yurtsev	ba4a309d98	langchain[patch]: Revert breaking change until 0.2 release (#21256 ) Reverts a minor breaking change until 0.2 release	2024-05-03 09:42:27 -07:00
Eugene Yurtsev	66a1e3f083	langchain[patch]: Fix flaky unit test (#21258 ) Should sort the results of the import test since it depends on import order	2024-05-03 15:55:46 +00:00
Eugene Yurtsev	0989c48028	langchain[minor]: Re-add deleted ainetwork tool (#21254 ) * Adding __init__.py to turn it into a package in community * Adding proxy imports that assume that langchain_community is optional	2024-05-03 11:39:40 -04:00
Christophe Bornet	2fbe82f5e6	community[minor]: Relax constraints on CassandraChatMessageHistory constructor (#21241 )	2024-05-03 10:20:39 -04:00
Chris Germann	3a8d1d8838	Hotfix RetrievalQA Docs: docs: Fix formatting (#21183 ) # Newline Characters breaking formatting Description: As you can see in the image below, the formatting in the documentation is broken. As far as I can see the two added `\n` characters are breaking the documentation. Therefore I would propose to remove those ![image](https://github.com/langchain-ai/langchain/assets/88305668/23b6e726-71b2-4812-91ea-3e8600683733) Dependencies: None Twitter Handle - epu9byj --------- Co-authored-by: gere <gere@kapo.zh.ch> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-03 12:46:29 +00:00
andyjessen	64e17bd793	docs: Fix comment within "handle long text" example (#21248 ) The current doc-string comment is referring to the wrong schema.	2024-05-03 12:36:53 +00:00
Daniel Glogowski	c3d169ab00	docs: Update Nvidia documentation (#21240 ) Updating Nvidia docs ahead for 5/15 competition. Thanks!	2024-05-03 12:29:03 +00:00
Bagatur	70bde15480	docs: add tool choice to tool calling (#21229 )	2024-05-03 03:10:22 -04:00
Bagatur	67a5cc34c6	openai[patch]: Release 0.1.6 (#21236 )	2024-05-03 04:10:39 +00:00
Erick Friis	c1eb95b967	core: release 0.1.50 (#21230 )	2024-05-02 22:44:18 +00:00
Nuno Campos	47ce8d5a57	core: tracer: remove numeric execution order (#21220 ) - this hasn't been used in a long time and requires some additional bookkeeping i'm going to streamline in the next pr	2024-05-02 15:38:55 -07:00
Bagatur	6ac6158a07	openai[patch]: support tool_choice="required" (#21216 ) Co-authored-by: ccurme <chester.curme@gmail.com>	2024-05-02 18:33:25 -04:00
Erick Friis	aa9faa8512	docs: model table keywords, remove tool calling from llm (#21225 )	2024-05-02 21:04:29 +00:00
xindoo	c1aa237bc2	langchain: fix syntax error in code comment for create_tool_calling_agent (#21205 ) PR message: - Description: Corrected a syntax error in the code comments within the `create_tool_calling_agent` function in the langchain package. - Issue: N/A - Dependencies: No additional dependencies required. - Twitter handle: N/A	2024-05-02 19:17:23 +00:00
ccurme	eb0a2fd53a	mistral: release 0.1.6 (#21214 )	2024-05-02 13:59:19 -04:00
ccurme	2d77e5e3a1	(standard tests): add test for basic conversation sequence (#21213 )	2024-05-02 13:47:10 -04:00
Maxime Perrin	1ebb5a70ad	partners(mistralai): Removing unused variable in completion request (using tool_calls or content) (#21201 ) This PR fixes #21196. The error was occurring when calling chat completion API with a chat history. Indeed, the Mistral API does not accept both `content` and `tool_calls` in the same body. This PR removes one of theses variables depending on the necessity. --------- Co-authored-by: Maxime Perrin <mperrin@doing.fr> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-05-02 13:20:14 -04:00
Christophe Bornet	683fb45c6b	community[patch]: Refactor CassandraDatabase wrapper (#21075 ) * Introduce individual `fetch_` methods for easier typing. * Rework some docstrings to google style * Move some logic to the tool * Merge the 2 cassandra utility files	2024-05-02 13:13:08 -04:00
Bagatur	b00fd1dbde	infra: Undo gh cache removal (#21210 ) Co-authored-by: Nuno Campos <nuno@langchain.dev>	2024-05-02 17:12:32 +00:00
Aditya	ee2c55ca09	docs: Added documentation on Anthropic models on vertex (#21070 ) Description:Added documentation on Anthropic models on Vertex @lkuligin for review --------- Co-authored-by: adityarane@google.com <adityarane@google.com>	2024-05-02 13:12:01 -04:00
Raghav Dixit	7d451d0041	community[patch]: Update lancedb.py (#21192 ) very minor update in LanceDB integration, 'metric' argument was missing.	2024-05-02 17:06:39 +00:00
Bagatur	d297d90ad9	core[patch]: Release 0.1.49 (#21211 )	2024-05-02 17:06:27 +00:00
Nuno Campos	663747b730	core[patch]: Fixes for convert_messages (#21207 ) - support two-tuples of any sequence type (eg. json.loads never produces tuples) - support type alias for role key - if id is passed in in dict form use it - if tool_calls passed in in dict form use them --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-02 16:55:42 +00:00
Eugene Yurtsev	df49404794	langchain[patch]: Make more memory code handle community dependency as optional (#21199 )	2024-05-02 11:05:26 -04:00
ccurme	bd5d2c2674	langchain: import InMemoryChatMessageHistory from core (#21198 )	2024-05-02 14:53:07 +00:00
Eugene Yurtsev	3cd7fced5f	langchain[patch],community[minor]: Migrate memory implementations to community (#20845 ) Migrates memory implementations to community	2024-05-02 10:46:50 -04:00
Eugene Yurtsev	b5c3a04e4b	langchain[patch]: chat histories to handle optional community dependence (#21194 )	2024-05-02 10:36:08 -04:00
Eugene Yurtsev	c9119b0e75	langchain[patch],community[minor]: Move some unit tests from langchain to community, use core for fake models (#21190 )	2024-05-02 09:57:52 -04:00
Eugene Yurtsev	c306364b06	langchain[patch]: Update more code to use langchain community as an optional dependency (#21170 ) More code to use langchain community as an optional dependency	2024-05-02 09:05:48 -04:00
Erick Friis	cd4c54282a	infra: cleanup docs build (#21134 ) Refactors the docs build in order to: - run the same `make build` command in both vercel and local build - incrementally build artifacts in 2 distinct steps, instead of building all docs in-place (in vercel) or in a _dist dir (locally) Highlights: - introduces `make build` in order to build the docs - collects and generates all files for the build in `docs/build/intermediate` - renders those jupyter notebook + markdown files into `docs/build/outputs` And now the outputs to host are in `docs/build/outputs`, which will need a vercel settings change. Todo: - [ ] figure out how to point the right directory (right now deleting and moving docs dir in vercel_build.sh isn't great)	2024-05-01 17:34:05 -07:00
Bagatur	6fa8626e2f	openai[patch]: fix azure open lc serialization, release 0.1.5 (#21159 )	2024-05-01 18:03:29 -04:00
Eugene Yurtsev	94a838740e	langchain[patch]: Migrate more code in utils to use optional langchain import (#21166 ) Moving is interactive util to avoid circular deps	2024-05-01 17:18:42 -04:00
Eugene Yurtsev	23fdd320bc	langchain[patch]: Migrate more code to use optional community in agents namespace (#21167 )	2024-05-01 16:25:44 -04:00
Tomaz Bratanic	9e53fa7d2e	Some more fixes to neo4j enhanced schema (#21139 )	2024-05-01 13:12:43 -07:00
Erick Friis	0694538c39	ai21: fix core version (#21168 )	2024-05-01 13:10:22 -07:00
Eugene Yurtsev	44602bdc20	langchain[patch],community[minor]: Move load_tools to community (#21158 ) Move load tools to community	2024-05-01 16:05:41 -04:00
Eugene Yurtsev	9932f49b3e	langchain[patch]: Migrate llms to use optional community imports (#21101 )	2024-05-01 16:04:45 -04:00
Eugene Yurtsev	57e8e70daa	langchain[patch]: Migrate chat models to optional community imports (#21090 ) Migrate chat models to optional community imports	2024-05-01 16:04:12 -04:00
Eugene Yurtsev	2914abd747	langchain[patch]: Fix how the serializable test identifies serializable objects (#21165 ) dir() will not work if we're using optional imports. The only way to do this is by using contents of __all__	2024-05-01 15:56:11 -04:00
Eugene Yurtsev	23c5d87311	langchain[patch]: Migrate utils to use optional langchain_community (#21163 ) Migrate utils to use optional imports from langchain community	2024-05-01 15:24:02 -04:00
Eugene Yurtsev	bec3eee3fa	langchain[patch]: Migrate retrievers to use optional langchain community imports (#21155 )	2024-05-01 14:44:44 -04:00
Eugene Yurtsev	43110daea5	langchain[patch]: Update some agent tool kits to handle community import as optional (#21157 ) A few things that were not caught by the migration script	2024-05-01 14:22:54 -04:00
Eugene Yurtsev	59f10ab3e0	langchain[patch]: Migrate embeddings to optional imports (#21099 )	2024-05-01 13:47:37 -04:00
Eugene Yurtsev	2f709d94d7	langchain[patch]: Migrate vectorstores to use optional langchain community imports (#21150 )	2024-05-01 13:33:37 -04:00
Eugene Yurtsev	7230e430db	langchain[patch]: Migrate top level files to use optional langchain community (#21152 ) Migrate a few top level files to treat langchain community as an optional dependency	2024-05-01 13:23:03 -04:00
Erick Friis	daab9789a8	ai21: release 0.1.4 (#21151 )	2024-05-01 17:16:27 +00:00
Asaf Joseph Gardin	642975dd9f	partners: AI21 Labs Jamba Support (#20815 ) Description: Added support for AI21 new model - Jamba Twitter handle: https://github.com/AI21Labs --------- Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-05-01 10:12:44 -07:00
Eugene Yurtsev	7a39fe60da	langchain[patch]: Migrate utilities to handle langchain community as optional (#21149 )	2024-05-01 13:09:34 -04:00
Eugene Yurtsev	b879184595	langchain[patch]: embedddings distance move import of openai embeddings into local scope (#21148 )	2024-05-01 12:51:51 -04:00
Bagatur	8b4b75e543	docs: standardize vertexai params (#20167 ) Related to #20085 Requires https://github.com/langchain-ai/langchain-google/pull/121	2024-05-01 11:42:18 -04:00
Eugene Yurtsev	0e5bf16d00	langchain[patch]: Migrate document loaders to use optional langchain community imports (#21095 )	2024-05-01 11:26:25 -04:00
Jacob Lee	bd38073d76	👥 Update LangChain people data (#21143 ) 👥 Update LangChain people data Co-authored-by: github-actions <github-actions@github.com>	2024-05-01 11:01:43 -04:00
Harrison Chase	4d1c21d97d	community[patch]: Fix alternative name in deprecation notice for sql_database (#21144 )	2024-05-01 10:59:42 -04:00
East Agile	2a6f78a53f	community[minor]: Rememberizer retriever (#20052 ) Description: This pull request introduces a new feature for LangChain: the integration with the Rememberizer API through a custom retriever. This enables LangChain applications to allow users to load and sync their data from Dropbox, Google Drive, Slack, their hard drive into a vector database that LangChain can query. Queries involve sending text chunks generated within LangChain and retrieving a collection of semantically relevant user data for inclusion in LLM prompts. User knowledge dramatically improved AI applications. The Rememberizer integration will also allow users to access general purpose vectorized data such as Reddit channel discussions and US patents. Issue: N/A Dependencies: N/A Twitter handle: https://twitter.com/Rememberizer	2024-05-01 10:41:44 -04:00
Eugene Yurtsev	1ce1a10f2b	langchain[patch],community[minor]: Move graph index creator (#20795 ) Move graph index creator to community	2024-05-01 10:04:30 -04:00
Eugene Yurtsev	aa0bc7467c	langchain[patch]: Migrate agents module into optional imports for community (#21088 )	2024-05-01 09:36:03 -04:00
Eugene Yurtsev	86ff8a3fb4	langchain[patch]: Update docstore module to use optional imports from community (#21091 )	2024-05-01 09:35:05 -04:00
Eugene Yurtsev	d640605694	langchain[patch]: Migrate chat loaders to optional community imports (#21089 ) Migrate chat loaders to optional community imports	2024-05-01 09:34:44 -04:00
Charlie Marsh	2b10c4dd52	ci: Use `ruff check` in Makefile (#21138 ) ## Summary `ruff /path/to/file.py` works but is deprecated, and we now recommend `ruff check /path/to/file.py` (to match `ruff format /path/to/file.py`).	2024-05-01 09:34:15 -04:00
Eugene Yurtsev	2fcab9acd9	langchain[patch]: Upgrade storage to treat langchain community as optional (#21105 )	2024-05-01 09:33:31 -04:00
William FH	ab55f6996d	[Core] Tracing: update parent run_tree's child_runs (#21049 )	2024-05-01 06:33:08 -07:00
Abhishek Bhagwat	86fe484e24	docs: Docs (sample notebook) for Vertex DIY RAG Ranking API (#21054 ) Vertex DIY RAG APIs helps to build complex RAG systems and provide more granular control, and are suited for custom use cases. The Ranking API takes in a list of documents and reranks those documents based on how relevant the documents are to a given query. Compared to embeddings that look purely at the semantic similarity of a document and a query, the ranking API can give you a more precise score for how well a document answers a given query. [Reference](https://cloud.google.com/generative-ai-app-builder/docs/ranking) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-01 05:39:39 +00:00
Stuart Leeks	8a01760a0f	infra: Sync devcontainer.json and compose file mount location (#20461 ) Sync the config in `devcontainer.json` and `docker-compose.yml` Issue: when opening the current `master` branch in a dev container in VS Code, I get the following message as VS Code cannot find the mounted source folder: ![image](https://github.com/langchain-ai/langchain/assets/1824461/41cf20c0-d1e0-4648-9578-edf80b99c2db) Opening in a GitHub Codespace works (it seems to ignore the mounts in the `docker-compose.yml`. This PR updates the mount in `docker-compose.yml` and the config in `devcontainer.json` so that the two align. I have tested these changes in GitHub Codespaces and a VS Code dev container and both loaded successfully. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-01 01:32:12 -04:00
aditya thomas	12b1caf295	openai[patch]: add tests for secret_str for keys (#20982 ) Description: Add tests to check API keys and Active Directory tokens are masked Issue: Resolves #12165 for OpenAI and Azure OpenAI models Dependencies: None Also resolves #12473 which may be closed. Additional contributors @alex4321 (#12473) and @onesolpark (#12542)	2024-05-01 01:26:20 -04:00
Noah	45ddf4d26f	community[patch]: Update comments for lazy_load method (#21063 ) - [ ] PR message: - Description: Refactored the lazy_load method to use asynchronous execution for improved performance. The method now initiates scraping of all URLs simultaneously using asyncio.gather, enhancing data fetching efficiency. Each Document object is yielded immediately once its content becomes available, streamlining the entire process. - Issue: N/A - Dependencies: Requires the asyncio library for handling asynchronous tasks, which should already be part of standard Python libraries in Python 3.7 and above. - Email: [r73327118@gmail.com](mailto:r73327118@gmail.com) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-01 01:20:57 -04:00
Liu Xiaodong	3b473d10f2	experimental: clean python repl input（experimental：Added code for PythonREPL） (#20930 ) Update python.py（experimental：Added code for PythonREPL） Added code for PythonREPL, defining a static method 'sanitize_input' that takes the string 'query' as input and returns a sanitizing string. The purpose of this method is to remove unwanted characters from the input string, Specifically: 1. Delete the whitespace at the beginning and end of the string (' \s'). 2. Remove the quotation marks (`` ` ``) at the beginning and end of the string. 3. Remove the keyword "python" at the beginning of the string (case insensitive) because the user may have typed it. This method uses regular expressions (regex) to implement sanitizing. It all started with this code： from langchain.agents import Tool from langchain_experimental.utilities import PythonREPL python_repl = PythonREPL() repl_tool = Tool( name="python_repl", description="Remove redundant formatting marks at the beginning and end of source code from input.Use a Python shell to execute python commands. If you want to see the output of a value, you should print it out with `print(...)`.", func=python_repl.run, ) When I call the agent to write a piece of code for me and execute it with the defined code, I must get an error: SyntaxError('invalid syntax', ('<string>', 1, 1,'In', 1, 2)) After checking, I found that pythonREPL has less formatting of input code than the soon-to-be deprecated pythonREPL tool, so I added this step to it, so that no matter what code I ask the agent to write for me, it can be executed smoothly and get the output result. I have tried modifying the prompt words to solve this problem before, but it did not work, and by adding a simple format check, the problem is well resolved. <img width="1271" alt="image" src="https://github.com/langchain-ai/langchain/assets/164149097/c49a685f-d246-4b11-b655-fd952fc2f04c"> --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-01 05:19:09 +00:00
Ismail Hossain Polas	1fdf63fa6c	community[patch]: update package name to bagelML (#19948 ) Description This pull request updates the Bagel Network package name from "betabageldb" to "bagelML" to align with the latest changes made by the Bagel Network team. The following modifications have been made: - Updated all references to the old package name ("betabageldb") with the new package name ("bagelML") throughout the codebase. - Modified the documentation, and any relevant scripts to reflect the package name change. - Tested the changes to ensure that the functionality remains intact and no breaking changes were introduced. By merging this pull request, our project will stay up to date with the latest Bagel Network package naming convention, ensuring compatibility and smooth integration with their updated library. Please review the changes and provide any feedback or suggestions. Thank you!	2024-05-01 01:17:33 -04:00
Tomaz Bratanic	7860e4c649	experimental[patch]: Add support for non-function calling LLMs in llm graph transformers (#21014 )	2024-05-01 01:16:07 -04:00
Erick Friis	67e6744e0f	docs: fix some notebook formatting (#21136 )	2024-04-30 21:39:03 -07:00
tianzedavid	5a8909440b	docs: remove repetitive words (#21058 ) remove repetitive words Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-05-01 01:10:42 +00:00
Leonid Kuligin	a36935b520	docs: updated docs on langchain_google_community (#21064 ) Thank you for contributing to LangChain! - [ ] PR title: "docs: updated docs on langchain_google_community" - [ ] PR message: - Description: updated docs on langchain_google_community	2024-04-30 20:20:49 -04:00
Tomaz Bratanic	c9e96bb5e2	community[patch]: Fix neo4j enhanced schema bugs (#21072 )	2024-04-30 20:16:26 -04:00
junkeon	8d2909ee25	upstage[minor]: Update few codes and add upstage loader in pdf section (#21085 ) Description: Update UpstageLayoutAnalysisParser and Loader and add upstage loader example in pdf section Dependencies: langchain_community Twitter handle: [@upstageai](https://twitter.com/upstageai) - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-30 20:15:49 -04:00
Bagatur	bef50ded63	openai[patch]: fix special token default behavior (#21131 ) By default handle special sequences as regular text	2024-04-30 20:08:24 -04:00
MacanPN	0f7f448603	community[patch]: add delete() method to AzureSearch vector store (#21127 ) Issue: Currently `AzureSearch` vector store does not implement `delete` method. This PR implements it. This also makes it compatible with LangChain indexer. Dependencies: None Twitter handle: @martintriska1 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 23:46:18 +00:00
Jorge Piedrahita Ortiz	3441a11b21	docs: minor changes in sambanova community integration docs (#21129 ) - Description: minor changes in sambanova community integration notebook docs --------- Co-authored-by: Renate Kempf <165940384+renate-snova@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 23:44:26 +00:00
Bagatur	6d3e9eaf84	docs: format (#21132 )	2024-04-30 23:32:41 +00:00
Erick Friis	14422a4220	langchain: fix core dep (#21128 )	2024-04-30 14:55:12 -07:00
Erick Friis	6c938da302	langchain: release 0.1.17 (#21125 )	2024-04-30 14:43:59 -07:00
Erick Friis	5f8a307565	infra: same tagging for langchain (#21126 )	2024-04-30 14:43:45 -07:00
Eugene Yurtsev	bf95414758	langchain[minor]: enhance unit test to test imports recursively (#21122 )	2024-04-30 17:05:53 -04:00
Eugene Yurtsev	e4f51f59a2	langchain[patch]: Migrate tools to treat community imports as optional (#21117 ) Migrate tools to treat community imports as optional	2024-04-30 16:26:18 -04:00
Eugene Yurtsev	9e788f09c6	langchain[patch]: Migrate output parsers to support optional community imports (#21103 ) Migrate output parsers	2024-04-30 16:24:29 -04:00
Eugene Yurtsev	3853fe9f64	langchain[patch]: Migrate graphs to use optional community imports (#21100 ) Migrate graphs to use optional community imports.	2024-04-30 16:24:06 -04:00
Eugene Yurtsev	8658d52587	langchain[patch]: Upgrade prompts to optional imports (#21078 ) Upgrades prompts module to use optional imports. This code was generated with a migration script, but had to be adjusted manually a bit. Testing in preparation for applying this code modification across the rest of the modules in langchain package to reverse the dependency between langchain community and langchain.	2024-04-30 16:23:39 -04:00
Eugene Yurtsev	9b6d04a187	langchain[patch]: Migrate document transformers (#21098 ) Migrate document transformers	2024-04-30 16:20:02 -04:00
Eugene Yurtsev	aec13a6123	langchain[patch]: Migrate callbacks module to use optional imports for community (#21086 )	2024-04-30 16:19:13 -04:00
Erick Friis	8a62fb0570	community: release 0.0.36 (#21118 )	2024-04-30 13:18:44 -07:00
Erick Friis	2407c353be	core: release 0.1.48 (#21113 )	2024-04-30 19:52:36 +00:00
Erick Friis	dbdfa3d34e	infra: fix minimum version install to force pypi install (#21112 )	2024-04-30 12:41:26 -07:00
Charlie Marsh	fd94aa8366	partner[patch]: Upgrade to Ruff v0.4.2 (#21108 ) ## Summary No new diagnostics (given that the set of enabled rules hasn't changed), but gains access to our new parser (much faster) and reduced false positives all around.	2024-04-30 15:06:42 -04:00
Jamsheed Mistri	3e749369ef	community[minor]: bump version of LayerupSecurity, add support for untrusted_input parameter (#19985 ) Description: update version of LayerupSecurity package for the Layerup Security integration. Add untrusted_input parameter.	2024-04-30 14:55:26 -04:00
fubuki8087	f1c3687aa5	community[patch]: Using the right encoding to parse the web page in RecursiveUrlLoader (#20632 ) As shown in #13749 , `RecursiveUrlLoader` has encoding issue. This PR is to solve this. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 18:41:36 +00:00
Jakub Pawłowski	b0b1a67771	community[patch]: Skip unexpected 404 HTTP Error in Arxiv download (#21042 ) ### Description: When attempting to download PDF files from arXiv, an unexpected 404 error frequently occurs. This error halts the operation, regardless of whether there are additional documents to process. As a solution, I suggest implementing a mechanism to ignore and communicate this error and continue processing the next document from the list. Proposed Solution: To address the issue of unexpected 404 errors during PDF downloads from arXiv, I propose implementing the following solution: - Error Handling: Implement error handling mechanisms to catch and handle 404 errors gracefully. - Communication: Inform the user or logging system about the occurrence of the 404 error. - Continued Processing: After encountering a 404 error, continue processing the remaining documents from the list without interruption. This solution ensures that the application can handle unexpected errors without terminating the entire operation. It promotes resilience and robustness in the face of intermittent issues encountered during PDF downloads from arXiv. ### Issue: #20909 ### Dependencies: none --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-30 18:29:22 +00:00
Erick Friis	b9c53e95b7	community: release 0.0.35 (#21104 )	2024-04-30 17:48:56 +00:00
Eugene Yurtsev	3c064a757f	core[minor],langchain[patch],community[patch]: Move storage interfaces to core (#20750 ) * Move storage interface to core * Move in memory and file system implementation to core	2024-04-30 13:14:26 -04:00
Charlie Marsh	8f38b7a725	multiple: Remove unnecessary Ruff suppression comments (#21050 ) ## Summary I ran `ruff check --extend-select RUF100 -n` to identify `# noqa` comments that weren't having any effect in Ruff, and then `ruff check --extend-select RUF100 -n --fix` on select files to remove all of the unnecessary `# noqa: F401` violations. It's possible that these were needed at some point in the past, but they're not necessary in Ruff v0.1.15 (used by LangChain) or in the latest release. Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-30 17:13:48 +00:00
Erick Friis	748f2ba9ea	core: release 0.1.47 (#21094 )	2024-04-30 09:22:05 -07:00
Erick Friis	efe27ef849	infra: tag non-langchain releases (#20805 )	2024-04-30 16:15:46 +00:00
Eugene Yurtsev	c8f18a2524	langchain[patch]: Update import handling in `adapters` (#21079 )	2024-04-30 10:55:29 -04:00
William FH	5c63ac3dd7	[Patch] Dedent docstring (#20959 ) Technically a slight prompt breaking change, but I think positive EV in that it saves tokens and results in more sane / in-distribution prompts	2024-04-30 07:40:57 -07:00
Eugene Yurtsev	845d8e0025	langchain[patch]: Update handling of deprecation warnings (#21083 ) Chains should not be emitting deprecation warnings.	2024-04-30 10:30:23 -04:00
Christophe Bornet	5c77f45b06	community[minor]: Add async methods to CassandraCache and CassandraSemanticCache (#20654 )	2024-04-30 10:27:44 -04:00
Christophe Bornet	d6e9bd3011	docs: Bump cassio min version in docs (#21081 ) Cassio 0.6+ is recommended for async vector store (not blocking on getting the embedding dimension) and for hybrid search support.	2024-04-30 10:25:37 -04:00
William FH	db14d4326d	[Core] Feat Pretty Print Tool calls (#20997 ) Right now, `tool_calls` are not included in the `pretty_print()` output. Would be nice to show! ![image](https://github.com/langchain-ai/langchain/assets/13333726/6a0ffca3-d02f-4e18-bc76-513eeca2e964)	2024-04-30 07:14:43 -07:00
Kuro Denjiro	fa4124b821	community[minor]: add mintbase loader to langchain (#20089 ) - [x] Add Near NFT loader: "community: Load NFT near block chain using mintbase graph API" - [x] PR message: - Description: a description of the change - Twitter handle:Kurodenjiro --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 04:11:56 +00:00
Alexander Dicke	d7e12750df	community[patch]: allows using `text-generation-inference` /generate route with `HuggingFaceEndpoint` (#20100 ) - Description: allows to use the /generate route of `text-generation-inference` with the `HuggingFaceEndpoint`	2024-04-29 23:09:55 -04:00
Jonathan Evans	ea43c669f2	community[patch]: Fix Bedrock Mistral stop sequence request key (#20115 ) - Description: Change Bedrock's Mistral stop sequence key mapping to "stop" rather than "stop_sequences" which is the correct key [Bedrock docs link](https://docs.aws.amazon.com/bedrock/latest/userguide/model-parameters-mistral.html) `{ "prompt": string, "max_tokens" : int, "stop" : [string], "temperature": float, "top_p": float, "top_k": int }` - Issue: #20053 - Dependencies: N/A - Twitter handle: N/a	2024-04-29 20:14:36 -04:00
davidkgp	28b0b0d863	community[patch]: Fix for github issue #17690 (#20117 ) …/17690 Thank you for contributing to LangChain! - [x] Fix Google Lens knowledge graph issue: "langchain: community" - Fix for [No "knowledge_graph" property in Google Lens API call from SerpAPI](https://github.com/langchain-ai/langchain/issues/17690) - [x] PR message: *Delete this entire checklist* and replace with - Description: handled the existence of keys in the json response of Google Lens - Issue: [No "knowledge_graph" property in Google Lens API call from SerpAPI](https://github.com/langchain-ai/langchain/issues/17690) - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-30 00:10:08 +00:00
高远	a7a4630bf4	community[patch]: Modify the text field type and add new exception handling (#20116 ) Co-authored-by: gaoyuan <gaoyuan.20001218@bytedance.com>	2024-04-29 20:06:00 -04:00
Rahul Triptahi	c172611647	community[patch]: Add classifier_url argument in PebbloSafeLoader and documentation update. (#21030 ) Description: Add classifier_url argument in PebbloSafeLoader. Documentation: Updated PebbloSafeLoader documentation with above change and new links for pebblo github pages. --------- Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-29 17:41:09 -04:00
Leonid Ganeline	08d08d7c83	docs: langchain docstrings updates (#21032 ) Added missed docstings. Formatted docstrings into a consistent format.	2024-04-29 17:40:44 -04:00
Leonid Ganeline	85094cbb3a	docs: community docstring updates (#21040 ) Added missed docstrings. Updated docstrings to consistent format.	2024-04-29 17:40:23 -04:00
Rodrigo Nogueira	90f19028e5	community[patch]: Add maritalk streaming (sync and async) (#19203 ) Co-authored-by: RosevalJr <rdmalajr@gmail.com> Co-authored-by: Roseval Donisete Malaquias Junior <roseval@maritaca.ai>	2024-04-29 21:31:14 +00:00
Cahid Arda Öz	cc6191cb90	community[minor]: Add support for Upstash Vector (#20824 ) ## Description Adding `UpstashVectorStore` to utilize [Upstash Vector](https://upstash.com/docs/vector/overall/getstarted)! #17012 was opened to add Upstash Vector to langchain but was closed to wait for filtering. Now filtering is added to Upstash vector and we open a new PR. Additionally, [embedding feature](https://upstash.com/docs/vector/features/embeddingmodels) was added and we add this to our vectorstore aswell. ## Dependencies [upstash-vector](https://pypi.org/project/upstash-vector/) should be installed to use `UpstashVectorStore`. Didn't update dependencies because of [this comment in the previous PR](https://github.com/langchain-ai/langchain/pull/17012#pullrequestreview-1876522450). ## Tests Tests are added and they pass. Tests are naturally network bound since Upstash Vector is offered through an API. There was [a discussion in the previous PR about mocking the unittests](https://github.com/langchain-ai/langchain/pull/17012#pullrequestreview-1891820567). We didn't make changes to this end yet. We can update the tests if you can explain how the tests should be mocked. --------- Co-authored-by: ytkimirti <yusuftaha9@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 17:25:01 -04:00
Leonid Ganeline	1a2ff56cd8	core[patch[: docstring update (#21036 ) Added missed docstrings. Updated docstrings to consistent format.	2024-04-29 15:35:34 -04:00
Eugene Yurtsev	f479a337cc	langchain[patch]: replace deprecated imports with imports from langchain_core (#21033 ) * Output of running the migration script. * Ran only against langchain code itself and not the unit tests.	2024-04-29 15:34:31 -04:00
Eugene Yurtsev	82d4afcac0	langchain[minor]: Code to handle dynamic imports (#20893 ) Proposing to centralize code for handling dynamic imports. This allows treating langchain-community as an optional dependency. --- The proposal is to scan the code base and to replace all existing imports with dynamic imports using this functionality.	2024-04-29 15:34:03 -04:00
Erick Friis	854ae3e1de	mistralai: release 0.1.5, allow client passing in (#21034 )	2024-04-29 17:14:26 +00:00
chyroc	3e241956d3	community[minor]: add coze chat model (#20770 ) add coze chat model, to call coze.com apis	2024-04-29 12:26:16 -04:00
Eugene Yurtsev	29493bb598	cli[minor]: improve confirmation message with more details (#21027 ) Improve confirmation message with more details	2024-04-29 12:20:42 -04:00
Eugene Yurtsev	aab78a37f3	cli[patch]: Ignore imports that change the name of the class (#21026 ) Not currently handeled by migration script	2024-04-29 12:20:30 -04:00
Massimiliano Pronesti	ce89b34fc0	community[patch]: support hybrid search with threshold in Azure AI Search Retriever (#20907 ) Support hybrid search with a score threshold -- similar to what we do for similarity search.	2024-04-29 12:11:44 -04:00
Andrei Panferov	b3efa38cc0	community[patch]: GigaChat model selection fix (#20988 ) Fixed the error that the model name is never actually put into GigaChat request payload, always defaulting to `GigaChat-Lite`. With this fix, model selection through ```python import os from langchain.chat_models.gigachat import GigaChat chat = GigaChat( name="GigaChat-Pro", # <- HERE!!!!! ... ) ``` should actually work, as intended in [here](`804390ba4b/libs/community/langchain_community/llms/gigachat.py (L36)`). --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-29 16:08:26 +00:00
Patrick McFadin	3331865f6b	community[minor]: add Cassandra Database Toolkit (#20246 ) Description: ToolKit and Tools for accessing data in a Cassandra Database primarily for Agent integration. Initially, this includes the following tools: - `cassandra_db_schema` Gathers all schema information for the connected database or a specific schema. Critical for the agent when determining actions. - `cassandra_db_select_table_data` Selects data from a specific keyspace and table. The agent can pass paramaters for a predicate and limits on the number of returned records. - `cassandra_db_query` Expiriemental alternative to `cassandra_db_select_table_data` which takes a query string completely formed by the agent instead of parameters. May be removed in future versions. Includes unit test and two notebooks to demonstrate usage. Dependencies: cassio Twitter handle: @PatrickMcFadin --------- Co-authored-by: Phil Miesle <phil.miesle@datastax.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 15:51:43 +00:00
Igor Brai	b3e74f2b98	community[minor]: add mojeek search util (#20922 ) Description: This pull request introduces a new feature to community tools, enhancing its search capabilities by integrating the Mojeek search engine Dependencies: None --------- Co-authored-by: Igor Brai <igor@mojeek.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-04-29 15:49:53 +00:00
hmn falahi	4822beb298	Ignore self/cls from required args of class functions in convert_to_openai_tool (#20691 ) Removed redundant self/cls from required args of class functions in _get_python_function_required_args: ```python class MemberTool: def search_member( self, keyword: str, args, *kwargs, ): """Search on members with any keyword like first_name, last_name, email Args: keyword: Any keyword of member """ headers = dict(authorization=kwargs['token']) members = [] try: members = request_( method='SEARCH', url=f'{service_url}/apiv1/members', headers=headers, json=dict(query=keyword), ) except Exception as e: logger.info(e.__doc__) return members convert_to_openai_tool(MemberTool.search_member) ``` expected result: ``` {'type': 'function', 'function': {'name': 'search_member', 'description': 'Search on members with any keyword like first_name, last_name, username, email', 'parameters': {'type': 'object', 'properties': {'keyword': {'type': 'string', 'description': 'Any keyword of member'}}, 'required': ['keyword']}}} ``` #20685 --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 11:46:26 -04:00
Rahul Triptahi	a64a1943fd	docs: Document update for load_extended_matadata in GoogleDriveLoader (#20950 ) Document: Updated google_drive,ipynb for loading following extended metadata. - full_path - Full path of the file/s in google drive. - owner - owner of the file/s. - size - size of the file/s. Code changes: [langchain-google/pull/179.](https://github.com/langchain-ai/langchain-google/pull/179) Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-29 11:41:57 -04:00
Eugene Yurtsev	4f4ee8e2cf	cli[patch]: Update migrations file manually (#21021 ) We need to replace occurrences in the code of RunnableMap not just the import, so for now, we don't replace RunnableMap.	2024-04-29 10:53:31 -04:00
Tomaz Bratanic	67428c4052	community[patch]: Neo4j enhanced schema (#20983 ) Scan the database for example values and provide them to an LLM for better inference of Text2cypher	2024-04-29 10:45:55 -04:00
Leonid Kuligin	dc70c23a11	docs: switched GCSLoaders docs to langchain-google-community (#20985 ) Thank you for contributing to LangChain! - [ ] PR title: "docs: switched GCSLoaders docs to langchain-google-community" - [ ] PR message: *Delete this entire checklist* and replace with - Description: switched GCSLoaders docs to langchain-google-community	2024-04-29 10:45:11 -04:00
aditya thomas	8b59bddc03	anthropic[patch]: add tests for secret_str for api key (#20986 ) Description: Add tests to check API keys are masked Issue: Resolves https://github.com/langchain-ai/langchain/issues/12165 for Anthropic models Dependencies: None	2024-04-29 10:39:14 -04:00
Pengcheng Liu	1fad39be1c	community[minor]: Add LarkSuite wiki document loader. (#21016 ) Description: Add LarkSuite wiki document loader. Refer to [LarkSuite api document ](https://open.feishu.cn/document/server-docs/docs/wiki-v2/space-node/list)for details. Issue: None Dependencies: None Twitter handle: None	2024-04-29 10:37:50 -04:00
Tomaz Bratanic	d36332476c	docs: Add neo4j relationship vector index docs (#20990 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 14:36:47 +00:00
Leonid Ganeline	dc7c06bc07	community[minor]: import fix (#20995 ) Issue: When the third-party package is not installed, whenever we need to `pip install <package>` the ImportError is raised. But sometimes, the `ValueError` or `ModuleNotFoundError` is raised. It is bad for consistency. Change: replaced the `ValueError` or `ModuleNotFoundError` with `ImportError` when we raise an error with the `pip install <package>` message. Note: Ideally, we replace all `try: import... except... raise ... `with helper functions like `import_aim` or just use the existing [langchain_core.utils.utils.guard_import](https://api.python.langchain.com/en/latest/utils/langchain_core.utils.utils.guard_import.html#langchain_core.utils.utils.guard_import) But it would be much bigger refactoring. @baskaryan Please, advice on this.	2024-04-29 10:32:50 -04:00
Karim Lalani	2ddac9a7c3	experimental[minor]: Add bind_tools and with_structured_output functions to OllamaFunctions (#20881 ) Implemented bind_tools for OllamaFunctions. Made OllamaFunctions sub class of ChatOllama. Implemented with_structured_output for OllamaFunctions. integration unit test has been updated. notebook has been updated. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 14:13:33 +00:00
Eugene Yurtsev	d781560722	cli[minor]: Add ipynb support, add text_splitters (#20963 )	2024-04-29 10:11:21 -04:00
Vadym Barda	5e0b6b3e75	docs: update langserve link in LCEL docs (#20992 )	2024-04-29 09:06:10 -04:00
Aditya	07ce39bfe7	docs: updated tutorials for Image generation and Vector Search (#21000 ) Description: docs: updated tutorials for Image generation and Vector Search @lkuligin for review --------- Co-authored-by: adityarane@google.com <adityarane@google.com>	2024-04-29 09:04:11 -04:00
Aditya	17bbb7d2a5	docs: updated tutorial for Gemini versions, included safety attribute updates (#21006 ) Description:updated tutorial for Gemini versions, included safety attribute updates @lkuligin For review --------- Co-authored-by: adityarane@google.com <adityarane@google.com>	2024-04-29 09:01:54 -04:00
WilliamEspegren	804390ba4b	community: Spider integration (#20937 ) Added the [Spider.cloud](https://spider.cloud) document loader. [Spider](https://github.com/spider-rs/spider) is the [fastest](https://github.com/spider-rs/spider/blob/main/benches/BENCHMARKS.md) and cheapest crawler that returns LLM-ready data. ``` - Description: Adds Spider data loader - Dependencies: spider-client - Twitter handle: @WilliamEspegren ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: = <=> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-27 21:45:03 +00:00
Jamie Lemon	6342217b93	docs: Moves "Using PyMuPDF" to higher up the page. (#20832 ) Description: This PR moves the PyMuPDF PDF loader solution to be underneath PyPDF. This is because it is the the 2nd most popular PyPI package after PyPDF. Please refer to these numbers, at the time of writing as follows: PyPDF https://www.pepy.tech/projects/PyPDF2 160 million PyMuPDF https://www.pepy.tech/projects/pymupdf 60 million PDFPlumber https://www.pepy.tech/projects/pdfplumber 23 million PDFMiner https://www.pepy.tech/projects/pdfminer 16 million PyPDFium2 https://www.pepy.tech/projects/pypdfium2 8 million Unstructured https://www.pepy.tech/projects/unstructured 8 million Please note I am an active contributor to https://github.com/pymupdf/PyMuPDF Many thanks! ---- Twitter handle: @artifex	2024-04-27 20:40:20 +00:00
Chouaieb Nemri	8097bec472	Added LogEntry, Any, Dict, List, Optional, TypedDict imports (#20970 ) Thank you for contributing to LangChain! - [ ] PR title: "package: docs" - [ ] PR message: - Description: Uptaded docs: Rag streaming use-cases notebook with LogEntry, Any, Dict, List, Optional, TypedDict imports - Twitter handle: c_nemri --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-27 20:13:54 +00:00
ccurme	9ec7151317	fireworks: fix integration tests (#20973 )	2024-04-27 19:49:46 +00:00
William FH	9fa9f05e5d	Catch System Error in ast parse (#20961 ) I can't seem to reproduce, but i got this: ``` SystemError: AST constructor recursion depth mismatch (before=102, after=37) ``` And the operation isn't critical for the actual forward pass so seems preferable to expand our caught exceptions	2024-04-26 19:31:55 -07:00
YH	2aca7fcdcf	core[patch]: Enhance link extraction with query parameters (#20259 ) Description: This update enhances the `extract_sub_links` function within the `langchain_core/utils/html.py` module to include query parameters in the extracted URLs. Issue: N/A Dependencies: No additional dependencies required for this change. Twitter handle: N/A Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-27 02:22:36 +00:00
CT	0e917e319b	docs: Add langchainhub to pip install (#20185 ) Added langchainhub package in import statement which is required for "from langchain import hub" to work. Added sample code to add OpenAI key Co-authored-by: Chi Yan Tang <100466443+poochiekittie@users.noreply.github.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-27 02:21:40 +00:00
Pamela Fox	45092a36a2	docs: Fix langgraph link (#20244 ) Just a simple PR to fix a broken link. Apparently having backticks outside a link makes it render as code. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-27 02:18:52 +00:00
Chip Davis	e818c75f8a	infra: test directory loader multithreaded (#20281 ) This is a unit test for #20230 which was a fix for using multithreaded mode with directory loader @eyurtsev	2024-04-26 19:16:47 -07:00
Guilherme Zanotelli	f931a9ce60	community[patch]: Pass kwargs to SPARQLStore from RdfGraph (#20385 ) This introduces `store_kwargs` which behaves similarly to `graph_kwargs` on the `RdfGraph` object, which will enable users to pass `headers` and other arguments to the underlying `SPARQLStore` object. I have also made a [PR in `rdflib` to support passing `default_graph`](https://github.com/RDFLib/rdflib/pull/2761). Example usage: ```python from langchain_community.graphs import RdfGraph graph = RdfGraph( query_endpoint="http://localhost/sparql", standard="rdf", store_kwargs=dict( default_graph="http://example.com/mygraph" ) ) ``` <!--If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.--> --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-27 01:38:29 +00:00
Chandre Van Der Westhuizen	e57cf73cf5	docs: Added MindsDB provider (#20322 ) MindsDB integrates with LangChain, enabling users to deploy, serve, and fine-tune models available via LangChain within MindsDB, making them accessible to numerous data sources. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-27 01:36:08 +00:00
Jorge Piedrahita Ortiz	40b2e2916b	community[minor]: Sambanova llm integration (#20955 ) - Description: Added [Sambanova systems](https://sambanova.ai/) integration, including sambaverse and sambastudio LLMs - Dependencies: sseclient-py (optional) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-27 01:05:13 +00:00
Rahul Triptahi	955cf186d2	community[patch]: Ingest source, owner and full_path if present in Document's metadata. (#20949 ) Description: The PebbloSafeLoader should first check for owner, full_path and size in metadata before implementing its own logic. Dependencies: None Documentation: NA. Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-26 17:50:57 -07:00
Amine Djeghri	790ea75cf7	community[minor]: add exllamav2 library for GPTQ & EXL2 models (#17817 ) Added 3 files : - Library : ExLlamaV2 - Test integration - Notebook --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-27 00:44:43 +00:00
Naveen Tatikonda	8bbdb4f6a0	community[patch]: Add OpenSearch as semantic cache (#20254 ) ### Description Use OpenSearch vector store as Semantic Cache. ### Twitter Handle @OpenSearchProj --------- Signed-off-by: Naveen Tatikonda <navtat@amazon.com> Co-authored-by: Harish Tatikonda <harishtatikonda@Harishs-MacBook-Air.local> Co-authored-by: EC2 Default User <ec2-user@ip-172-31-31-155.ec2.internal> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-27 00:20:24 +00:00
Giacomo Berardi	61f14f00d7	docs: `ElasticsearchCache` in cache integrations documentation (#20790 ) The package for LangChain integrations with Elasticsearch https://github.com/langchain-ai/langchain-elastic is going to contain a LLM cache integration in the next release (see https://github.com/langchain-ai/langchain-elastic/pull/14). This is the documentation contribution on the page dedicated to cache integrations	2024-04-26 15:43:58 -07:00
Mayank Solanki	8c085fc697	community[patch]: Added a function `from_existing_collection` in `Qdrant` vector database. (#20779 ) Issue: #20514 The current implementation of `construct_instance` expects a `texts: List[str]` that will call the embedding function. This might not be needed when we already have a client with collection and `path, you don't want to add any text. This PR adds a class method that returns a qdrant instance with an existing client. Here everytime `cb6e5e56c2/libs/community/langchain_community/vectorstores/qdrant.py (L1592)` `construct_instance` is called, this line sends some text for embedding generation. --------- Co-authored-by: Anush <anushshetty90@gmail.com>	2024-04-26 15:34:09 -07:00
Leonid Kuligin	893a924b90	core[minor], community[patch], langchain[patch]: move BaseChatLoader to core (#19607 ) Thank you for contributing to LangChain! - [ ] PR title: "core: move BaseChatLoader and BaseToolkit from community" - [ ] PR message: move BaseChatLoader and BaseToolkit --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-26 21:45:51 +00:00
Erick Friis	d4befd0cfb	core: fix batch ordering test (#20952 )	2024-04-26 21:17:26 +00:00
Eugene Yurtsev	8ed150b2fe	cli[minor]: Fix bug to account for name changes (#20948 ) * Fix bug to account for name changes / aliases * Generate migration list from langchain to langchain_core	2024-04-26 15:45:11 -04:00
ccurme	989e4a92c2	(infra) pass input to test-release (#20947 )	2024-04-26 15:17:40 -04:00
Eugene Yurtsev	2fa0ff1a2d	cli[minor]: update code to generate migrations from langchain to community (#20946 ) Updates code that generates migrations from langchain to community	2024-04-26 15:11:32 -04:00
Erick Friis	078c5d9bc6	infra: nonmaster release checkbox (#20945 ) Co-authored-by: ccurme <chester.curme@gmail.com>	2024-04-26 14:50:07 -04:00
Leonid Kuligin	d4aec8fc8f	docs: adding langchain_google_community to the docs (#20665 ) Thank you for contributing to LangChain! - [ ] PR title: "docs: step1. adjusting langchain_community -> langchain_google_community" - [ ] - Description: step1. adjusting langchain_community -> langchain_google_community	2024-04-26 18:49:03 +00:00
ccurme	bf16cefd18	langchain: deprecate create_structured_output_runnable (#20933 )	2024-04-26 14:00:40 -04:00
Erick Friis	38eccab3ae	upstage: release 0.1.3 (#20941 )	2024-04-26 10:36:11 -07:00
Sean	e1c2e2fdfa	upstage: Upstage Groundedness Check parameter update (#20914 ) * Groundedness Check takes `str` or `list[Document]` as input. * Deprecate `GroundednessCheck` due to its naming. * Added `UpstageGroundednessCheck`. * Hotfix for Groundedness Check parameter. The name `query` was misleading and it should be `answer` instead. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-26 17:34:05 +00:00
ccurme	84b8e67c9c	mistral: release 0.1.4 (#20940 )	2024-04-26 13:06:02 -04:00
ccurme	465fbaa30b	openai: release 0.1.4 (#20939 )	2024-04-26 09:56:49 -07:00
Eugene Yurtsev	12c906f6ce	cli[minor]: Improve partner migrations (#20938 ) This auto generates partner migrations. At the moment the migration is from community -> partner. So one would need to run the migration script twice to go from langchain to partner.	2024-04-26 12:30:15 -04:00
Eugene Yurtsev	5653f36adc	cli[minor]: Add script to generate migrations for partner packages (#20932 ) Add script to help generate migrations. This works well for partner packages. Migrations are generated based on run time rather than static analysis (much simpler to get the correct migrations implemented). The script for generating migrations from langchain to community still needs work.	2024-04-26 11:17:20 -04:00
ccurme	fe1304afc4	openai: add unit test (#20931 ) Test a helper function that was added earlier.	2024-04-26 15:02:19 +00:00
Eugene Yurtsev	6598757037	cli[minor]: Add first version of migrate (#20902 ) Adds a first version of the migrate script.	2024-04-26 10:50:21 -04:00
Pengcheng Liu	d95e9fb67f	docs: add tool calling example in Tongyi chat model integration. (#20925 ) Description: add tool calling example in Tongyi chat model integration. Issue: None Dependencies: None	2024-04-26 10:18:54 -04:00
Lei Zhang	9281841cfe	community[patch]: fix integrated test case test_recursive_url_loader.py assertions (issue-20919) (#20920 ) Description: Fix integrated test case test_recursive_url_loader.py Local testing successful ```shell (venv) lei@LeideMacBook-Pro community % poetry run pytest tests/integration_tests/document_loaders/test_recursive_url_loader.py ================================================================================ test session starts ================================================================================ platform darwin -- Python 3.11.4, pytest-7.4.4, pluggy-1.4.0 -- /Users/zhanglei/Work/github/langchain/venv/bin/python cachedir: .pytest_cache rootdir: /Users/zhanglei/Work/github/langchain/libs/community configfile: pyproject.toml plugins: syrupy-4.6.1, asyncio-0.20.3, cov-4.1.0, vcr-1.0.2, mock-3.12.0, anyio-3.7.1, dotenv-0.5.2, requests-mock-1.11.0, socket-0.6.0 asyncio: mode=Mode.AUTO collected 6 items tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader PASSED [ 16%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader_deterministic PASSED [ 33%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader FAILED [ 50%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_equivalent PASSED [ 66%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_loading_invalid_url PASSED [ 83%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_metadata_necessary_properties PASSED [100%] ===================================================================================== FAILURES ====================================================================================== __________________________________________________________________________ test_sync_recursive_url_loader ___________________________________________________________________________ def test_sync_recursive_url_loader() -> None: url = "https://docs.python.org/3.9/" loader = RecursiveUrlLoader( url, extractor=lambda _: "placeholder", use_async=False, max_depth=2 ) docs = loader.load() > assert len(docs) == 23 E AssertionError: assert 24 == 23 E + where 24 = len([Document(page_content='placeholder', metadata={'source': 'https://docs.python.org/3.9/', 'content_type': 'text/html', 'title': '3.9.18 Documentation', 'language': None}), Document(page_content='placeholder', metadata={'source': 'https://docs.python.org/3.9/py-modindex.html', 'content_type': 'text/html', 'title': 'Python Module Index — Python 3.9.18 documentation', 'language': None}), Document(page_content='placeholder', metadata={'source': 'https://docs.python.org/3.9/download.html', 'content_type': 'text/html', 'title': 'Download — Python 3.9.18 documentation', 'language': None}), Document(page_content='placeholder', metadata={'source': 'https://docs.python.org/3.9/howto/index.html', 'content_type': 'text/html', 'title': 'Python HOWTOs — Python 3.9.18 documentation', 'language': None}), Document(page_content='placeholder', metadata={'source': 'https://docs.python.org/3.9/whatsnew/index.html', 'content_type': 'text/html', 'title': 'Whatâ\x80\x99s New in Python — Python 3.9.18 documentation', 'language': None}), Document(page_content='placeholder', metadata={'source': 'https://docs.python.org/3.9/c-api/index.html', 'content_type': 'text/html', 'title': 'Python/C API Reference Manual — Python 3.9.18 documentation', 'language': None}), ...]) tests/integration_tests/document_loaders/test_recursive_url_loader.py:38: AssertionError ================================================================================= warnings summary ================================================================================== tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader_deterministic tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_equivalent tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_metadata_necessary_properties /Users/zhanglei/.pyenv/versions/3.11.4/lib/python3.11/html/parser.py:170: XMLParsedAsHTMLWarning: It looks like you're parsing an XML document using an HTML parser. If this really is an HTML document (maybe it's XHTML?), you can ignore or filter this warning. If it's XML, you should know that using an XML parser will be more reliable. To parse this document as XML, make sure you have the lxml package installed, and pass the keyword argument `features="xml"` into the BeautifulSoup constructor. k = self.parse_starttag(i) -- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html ================================================================================ slowest 5 durations ================================================================================ 56.75s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader_deterministic 38.99s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader 31.20s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_metadata_necessary_properties 30.37s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_equivalent 15.44s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader ============================================================================== short test summary info ============================================================================== FAILED tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader - AssertionError: assert 24 == 23 ================================================================ 1 failed, 5 passed, 5 warnings in 172.97s (0:02:52) ================================================================ (venv) zhanglei@LeideMacBook-Pro community % poetry run pytest tests/integration_tests/document_loaders/test_recursive_url_loader.py ================================================================================ test session starts ================================================================================ platform darwin -- Python 3.11.4, pytest-7.4.4, pluggy-1.4.0 -- /Users/zhanglei/Work/github/langchain/venv/bin/python cachedir: .pytest_cache rootdir: /Users/zhanglei/Work/github/langchain/libs/community configfile: pyproject.toml plugins: syrupy-4.6.1, asyncio-0.20.3, cov-4.1.0, vcr-1.0.2, mock-3.12.0, anyio-3.7.1, dotenv-0.5.2, requests-mock-1.11.0, socket-0.6.0 asyncio: mode=Mode.AUTO collected 6 items tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader PASSED [ 16%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader_deterministic PASSED [ 33%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader PASSED [ 50%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_equivalent PASSED [ 66%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_loading_invalid_url PASSED [ 83%] tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_metadata_necessary_properties PASSED [100%] ================================================================================= warnings summary ================================================================================== tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader_deterministic tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_equivalent tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_metadata_necessary_properties /Users/zhanglei/.pyenv/versions/3.11.4/lib/python3.11/html/parser.py:170: XMLParsedAsHTMLWarning: It looks like you're parsing an XML document using an HTML parser. If this really is an HTML document (maybe it's XHTML?), you can ignore or filter this warning. If it's XML, you should know that using an XML parser will be more reliable. To parse this document as XML, make sure you have the lxml package installed, and pass the keyword argument `features="xml"` into the BeautifulSoup constructor. k = self.parse_starttag(i) -- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html ================================================================================ slowest 5 durations ================================================================================ 46.99s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader_deterministic 32.43s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_async_recursive_url_loader 31.23s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_equivalent 30.75s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_async_metadata_necessary_properties 15.89s call tests/integration_tests/document_loaders/test_recursive_url_loader.py::test_sync_recursive_url_loader ===================================================================== 6 passed, 5 warnings in 157.42s (0:02:37) ===================================================================== (venv) lei@LeideMacBook-Pro community % ``` Issue: https://github.com/langchain-ai/langchain/issues/20919 Twitter handle: @coolbeevip	2024-04-26 10:00:08 -04:00
ccurme	7d8d0229fa	remove placeholder error message (#20340 )	2024-04-26 13:48:48 +00:00
William FH	4c437ebb9c	Use lstv2 (#20747 )	2024-04-25 16:51:42 -07:00
ccurme	891ae37437	langchain: support PineconeVectorStore in self query retriever (#20905 ) `langchain_pinecone.Pinecone` is deprecated in favor of `PineconeVectorStore`, and is currently a subclass of `PineconeVectorStore`. ```python @deprecated(since="0.0.3", removal="0.2.0", alternative="PineconeVectorStore") class Pinecone(PineconeVectorStore): """Deprecated. Use PineconeVectorStore instead.""" pass ```	2024-04-25 20:54:58 +00:00
Matt	28df4750ef	community[patch]: Add initial tests for AzureSearch vector store (#17663 ) Description: AzureSearch vector store has no tests. This PR adds initial tests to validate the code can be imported and used. Issue: N/A Dependencies: azure-search-documents and azure-identity are added as optional dependencies for testing --------- Co-authored-by: Matt Gotteiner <[email protected]> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 20:42:01 +00:00
Dristy Srivastava	5f1d1666e3	community[patch]: Add support for pebblo server and client version (#20269 ) Description: _PebbloSafeLoader_: Add support for pebblo server and client version Documentation: NA Unit test: NA Issue: NA Dependencies: None --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-25 20:39:17 +00:00
am-kinetica	b54b19ba1c	community[minor]: Implemented Kinetica Document Loader and added notebooks (#20002 ) - [ ] Kinetica Document Loader: "community: a class to load Documents from Kinetica" - [ ] Kinetica Document Loader: - Description: implemented KineticaLoader in `kinetica_loader.py` - Dependencies: install the Kinetica API using `pip install gpudb==7.2.0.1 `	2024-04-25 13:39:00 -07:00
Michael Schock	5e60d65917	experimental[patch]: return from HuggingGPT task executor task.run() exception (#20219 ) Description: Fixes a bug in the HuggingGPT task execution logic here: except Exception as e: self.status = "failed" self.message = str(e) self.status = "completed" self.save_product() where a caught exception effectively just sets `self.message` and can then throw an exception if, e.g., `self.product` is not defined. Issue: None that I'm aware of. Dependencies: None Twitter handle: https://twitter.com/michaeljschock Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-25 20:16:39 +00:00
Anish Chakraborty	898362de81	core[patch]: improve comma separated list output parser to handle non-space separated list (#20434 ) - Description: Changes `lanchain_core.output_parsers.CommaSeparatedListOutputParser` to handle `,` as a delimiter alongside the previous implementation which used `, ` as delimiter. - Issue: Started noticing that some results returned by LLMs were not getting parsed correctly when the output contained `,` instead of `, `. - Dependencies: No - Twitter handle: not active on twitter. <!--- If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. -->	2024-04-25 20:10:56 +00:00
Michael Schock	63a07f52df	experimental[patch]: remove \n from AutoGPT feedback_tool exit check (#20132 )	2024-04-25 20:10:33 +00:00
Shengsheng Huang	fd1061e7bf	community[patch]: add more data types support to ipex-llm llm integration (#20833 ) - Description: - add support for more data types: by default `IpexLLM` will load the model in int4 format. This PR adds more data types support such as `sym_in5`, `sym_int8`, etc. Data formats like NF3, NF4, FP4 and FP8 are only supported on GPU and will be added in future PR. - Fix a small issue in saving/loading, update api docs - Dependencies: `ipex-llm` library - Document: In `docs/docs/integrations/llms/ipex_llm.ipynb`, added instructions for saving/loading low-bit model. - Tests: added new test cases to `libs/community/tests/integration_tests/llms/test_ipex_llm.py`, added config params. - Contribution maintainer: @shane-huang	2024-04-25 12:58:18 -07:00
Rahul Triptahi	dc921f0823	community[patch]: Add semantic info to metadata, classified by pebblo-server. (#20468 ) Description: Add support for Semantic topics and entities. Classification done by pebblo-server is not used to enhance metadata of Documents loaded by document loaders. Dependencies: None Documentation: Updated. Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-25 12:55:33 -07:00
Eugene Yurtsev	a5028b6356	cli[minor]: Add __version__ (#20903 ) Add __version__ to cli	2024-04-25 15:51:33 -04:00
Jingpan Xiong	1202017c56	community[minor]: Add relyt vector database (#20316 ) Co-authored-by: kaka <kaka@zbyte-inc.cloud> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: jingsi <jingsi@leadincloud.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-25 19:49:29 +00:00
davidefantiniIntel	f386f71bb3	community: fix tqdm import (#20263 ) Description: Fix tqdm import in QuantizedBiEncoderEmbeddings	2024-04-25 19:44:53 +00:00
Andres Algaba	05ae8ca7d4	community[patch]: deprecate persist method in Chroma (#20855 ) Thank you for contributing to LangChain! - [x] PR title - [x] PR message: - Description: Deprecate persist method in Chroma no longer exists in Chroma 0.4.x - Issue: #20851 - Dependencies: None - Twitter handle: AndresAlgaba1 - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-25 19:42:03 +00:00
ccurme	fdabd3cdf5	mistral, openai: support custom tokenizers in chat models (#20901 )	2024-04-25 15:23:29 -04:00
ccurme	6986e44959	docs: update chat model feature table (#20899 )	2024-04-25 15:05:43 -04:00
ccurme	b8db73233c	core, community: deprecate tool.__call__ (#20900 ) Does not update docs.	2024-04-25 14:50:39 -04:00
merdan	52896258ee	docs: hide model import in multiple_tools.ipynb (#20883 ) Description: This PR removes an unnecessary code snippet from the documentation. The snippet in question is not relevant to the content and does not contribute to the overall understanding of the topic. It contained redundant imports and unused code, potentially causing confusion for readers. Issue: There is no specific issue number associated with this change. Dependencies: No additional dependencies are required for this change. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 18:47:22 +00:00
Tomaz Bratanic	520972fd0f	community[patch]: Support passing graph object to Neo4j integrations (#20876 ) For driver connection reusage, we introduce passing the graph object to neo4j integrations	2024-04-25 11:30:22 -07:00
Lei Zhang	748a6ae609	community[patch]: add HTTP response headers Content-Type to metadata of RecursiveUrlLoader document (#20875 ) Description: The RecursiveUrlLoader loader offers a link_regex parameter that can filter out URLs. However, this filtering capability is limited, and if the internal links of the website change, unexpected resources may be loaded. These resources, such as font files, can cause problems in subsequent embedding processing. > https://blog.langchain.dev/assets/fonts/source-sans-pro-v21-latin-ext_latin-regular.woff2?v=0312715cbf We can add the Content-Type in the HTTP response headers to the document metadata so developers can choose which resources to use. This allows developers to make their own choices. For example, the following may be a good choice for text knowledge. - text/plain - simple text file - text/html - HTML web page - text/xml - XML format file - text/json - JSON format data - application/pdf - PDF file - application/msword - Word document and ignore the following - text/css - CSS stylesheet - text/javascript - JavaScript script - application/octet-stream - binary data - image/jpeg - JPEG image - image/png - PNG image - image/gif - GIF image - image/svg+xml - SVG image - audio/mpeg - MPEG audio files - video/mp4 - MP4 video file - application/font-woff - WOFF font file - application/font-ttf - TTF font file - application/zip - ZIP compressed file - application/octet-stream - binary data Twitter handle: @coolbeevip --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 11:29:41 -07:00
samanhappy	37cbbc00a9	docs: Fix broken link in agents.ipynb (#20872 )	2024-04-25 10:42:06 -07:00
fzowl	a6b8ff23bd	docs: Use voyage-law-2 in the examples (#20784 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" Description: In VoyageAI text-embedding examples use voyage-law-2 model - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-25 10:41:36 -07:00
Erick Friis	eca3640af7	upstage: release 0.1.2 (#20898 )	2024-04-25 10:41:19 -07:00
Pavlo Paliychuk	82b5bdc7a1	docs: Fix misplaced zep cloud example links (#20867 ) Thank you for contributing to LangChain! - [x] PR title: Fix misplaced zep cloud example links - [x] PR message: - Description: Fixes misplaced links for vector store and memory zep cloud examples - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-25 10:41:08 -07:00
Joan Fontanals	baefbfb14e	community[mionr]: add Jina Reranker in retrievers module (#19406 ) - Description: Adapt JinaEmbeddings to run with the new Jina AI Rerank API - Twitter handle: https://twitter.com/JinaAI_ - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 10:27:10 -07:00
Erick Friis	92969d49cb	multiple: remove external repo mds (#20896 ) api docs build doesn't tolerate them	2024-04-25 17:18:29 +00:00
Jason_Chen	53bb7dbd29	community[patch]: add BeautifulSoupTransformer remove_unwanted_classnames method (#20467 ) Add the remove_unwanted_classnames method to the BeautifulSoupTransformer class, which can filter more effectively. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 17:04:04 +00:00
YISH	ed26149a29	openai[patch]: Allow disablling safe_len_embeddings(OpenAIEmbeddings) (#19743 ) OpenAI API compatible server may not support `safe_len_embedding`， use `disable_safe_len_embeddings=True` to disable it. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 09:45:52 -07:00
Bagatur	5b83130855	core[minor], langchain[patch], community[patch]: mv StructuredQuery (#20849 ) mv StructuredQuery to core	2024-04-25 09:40:26 -07:00
Sean	540f384197	partner: Upstage quick documentation update (#20869 ) * Updating the provider docs page. The RAG example was meant to be moved to cookbook, but was merged by mistake. * Fix bug in Groundedness Check --------- Co-authored-by: JuHyung-Son <sonju0427@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-25 16:36:54 +00:00
Bagatur	ffad3985a1	core[patch]: Release 0.1.46 (#20891 )	2024-04-25 15:40:17 +00:00
Mish Ushakov	6ccecf2363	community[minor]: added Browserbase loader (#20478 )	2024-04-25 01:11:03 +00:00
aditya thomas	9e694963a4	docs: custom callback handlers page (#20494 ) Description: Update to the Callbacks page on custom callback handlers Issue: #20493 Dependencies: None --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 01:08:36 +00:00
Erick Friis	5da9dd1195	mistral: comment batching param (#20868 ) Addresses #20523	2024-04-25 00:38:21 +00:00
Ivaylo Bratoev	7c5063ef60	infra: fix how Poetry is installed in the dev container (#20521 ) Currently, when a new dev container is created, poetry does not work in it with the error "No module named 'rapidfuzz'". Install Poetry outside the project venv so that poetry and project dependencies do not get mixed. Use pipx to install poetry securely in its own isolated environment. Issue: #12237 Twitter handle: https://twitter.com/ibratoev Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-24 17:33:25 -07:00
GustavoSept	c2d09a5186	experimental[patch]: Makes regex customizable in text_splitter.py (SemanticChunker class) (#20485 ) - Description: Currently, the regex is static (`r"(?<=[.?!])\s+"`), which is only useful for certain use cases. The current change only moves this to be a parameter of split_text(). Which adds flexibility without making it more complex (as the default regex is still the same). - Issue: Not applicable (I searched, no one seems to have created this issue yet). - Dependencies: None. _If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17._ --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 00:32:40 +00:00
William FH	a936f696a6	[Core] Feat: update config CVar in tool.invoke (#20808 )	2024-04-24 17:17:21 -07:00
Lei Zhang	2cd907ad7e	text-splitters[patch]: fix MarkdownHeaderTextSplitter fails to parse headers with non-printable characters (#20645 ) Description: MarkdownHeaderTextSplitter Fails to Parse Headers with non-printable characters. more #20643 The following is the official test case. Just replacing `# Foo\n\n` with `\ufeff# Foo\n\n` will cause the test case to fail. chunk metadata is empty ```python def test_md_header_text_splitter_1() -> None: """Test markdown splitter by header: Case 1.""" markdown_document = ( "\ufeff# Foo\n\n" " ## Bar\n\n" "Hi this is Jim\n\n" "Hi this is Joe\n\n" " ## Baz\n\n" " Hi this is Molly" ) headers_to_split_on = [ ("#", "Header 1"), ("##", "Header 2"), ] markdown_splitter = MarkdownHeaderTextSplitter( headers_to_split_on=headers_to_split_on, ) output = markdown_splitter.split_text(markdown_document) expected_output = [ Document( page_content="Hi this is Jim \nHi this is Joe", metadata={"Header 1": "Foo", "Header 2": "Bar"}, ), Document( page_content="Hi this is Molly", metadata={"Header 1": "Foo", "Header 2": "Baz"}, ), ] assert output == expected_output ``` twitter: @coolbeevip Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-25 00:07:42 +00:00
jtanios	2968f20970	docs: git dependency name correction (#20662 ) This PR corrects the name of the `git` python package to `GitPython`. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-24 23:43:44 +00:00
ccurme	481d3855dc	patch: remove usage of llm, chat model __call__ (#20788 ) - `llm(prompt)` -> `llm.invoke(prompt)` - `llm(prompt=prompt` -> `llm.invoke(prompt)` (same with `messages=`) - `llm(prompt, callbacks=callbacks)` -> `llm.invoke(prompt, config={"callbacks": callbacks})` - `llm(prompt, kwargs)` -> `llm.invoke(prompt, kwargs)`	2024-04-24 19:39:23 -04:00
Raghav Dixit	9b7fb381a4	community[patch]: LanceDB integration patch update (#20686 ) Description : - added functionalities - delete, index creation, using existing connection object etc. - updated usage - Added LaceDB cloud OSS support make lint_diff , make test checks done	2024-04-24 16:27:43 -07:00
Nikita Pokidyshev	9e983c9500	langchain[patch]: fix agent_token_buffer_memory not working with openai tools (#20708 ) - Description: fix a bug in the agent_token_buffer_memory - Issue: agent_token_buffer_memory was not working with openai tools - Dependencies: None - Twitter handle: @pokidyshef	2024-04-24 15:51:58 -07:00
Salika Dave	6353991498	docs: [Retrieval > .. > PDF] update package installation instructions for Unstructured and PDFMiner (#20723 ) Description: Adds the command to install packages required before using _Unstructured_ and _PDFMiner_ from `langchain.community` Documentation Page Being Updated: [LangChain > Retrieval > Document loaders > PDF > Using Unstructured](https://python.langchain.com/docs/modules/data_connection/document_loaders/pdf/#using-unstructured) Issue: #20719 Dependencies: no dependencies Twitter handle: SalikaDave <!-- Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --> --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-24 22:24:11 +00:00
dpdjvhxm	a9e2e98708	docs: Update apache_age.ipynb (#20722 ) typo	2024-04-24 22:18:59 +00:00
Erick Friis	1aef8116de	upstage: release 0.1.1 (#20864 )	2024-04-24 15:18:30 -07:00
junkeon	c8fd51e8c8	upstage: Add Upstage partner package LA and GC (#20651 ) --------- Co-authored-by: Sean <chosh0615@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Sean Cho <sean@upstage.ai>	2024-04-24 15:17:20 -07:00
hsmtkk	5ecebf168c	docs: imported List is not used (#20720 ) # Description Minor sample code fix # Issue Imported `List` is not used. # Dependencies N/A # Twitter handle N/A	2024-04-24 15:17:07 -07:00
Alex Lee	243ba71b28	langchain[patch]: add `aprep_output` method to `langchain/chains/base.py` (#20748 ) ## Description Add `aprep_output` method to `langchain/chains/base.py`. Some downstream `ChatMessageHistory` objects that use async connections require an async way to append to the context. It turned out that `ainvoke()` was calling `prep_output` which is synchronous. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-24 22:16:25 +00:00
Harrison Chase	43c041cda5	support messages in messages out (#20862 )	2024-04-24 14:58:58 -07:00
back2nix	a1614b88ac	groq[patch]: groq proxy support (#20758 ) # Proxy Fix for Groq Class 🐛 🚀 ## Description This PR fixes a bug related to proxy settings in the `Groq` class, allowing users to connect to LangChain services via a proxy. ## Changes Made - ✅ FIX support for specifying proxy settings in the `Groq` class. - ✅ Resolved the bug causing issues with proxy settings. - ❌ Did not include unit tests and documentation updates. - ❌ Did not run make format, make lint, and make test to ensure code quality and functionality because I couldn't get it to run, so I don't program in Python and couldn't run `ruff`. - ❔ Ensured that the changes are backwards compatible. - ✅ No additional dependencies were added to `pyproject.toml`. ### Error Before Fix ```python Traceback (most recent call last): File "/home/bg/Documents/code/github.com/back2nix/test/groq/main.py", line 9, in <module> chat = ChatGroq( ^^^^^^^^^ File "/home/bg/Documents/code/github.com/back2nix/test/groq/venv310/lib/python3.11/site-packages/langchain_core/load/serializable.py", line 120, in __init__ super().__init__(**kwargs) File "/home/bg/Documents/code/github.com/back2nix/test/groq/venv310/lib/python3.11/site-packages/pydantic/v1/main.py", line 341, in __init__ raise validation_error pydantic.v1.error_wrappers.ValidationError: 1 validation error for ChatGroq __root__ Invalid `http_client` argument; Expected an instance of `httpx.AsyncClient` but got <class 'httpx.Client'> (type=type_error) ``` ### Example usage after fix ```python3 import os import httpx from langchain_core.prompts import ChatPromptTemplate from langchain_groq import ChatGroq chat = ChatGroq( temperature=0, groq_api_key=os.environ.get("GROQ_API_KEY"), model_name="mixtral-8x7b-32768", http_client=httpx.Client( proxies="socks5://127.0.0.1:1080", transport=httpx.HTTPTransport(local_address="0.0.0.0"), ), http_async_client=httpx.AsyncClient( proxies="socks5://127.0.0.1:1080", transport=httpx.HTTPTransport(local_address="0.0.0.0"), ), ) system = "You are a helpful assistant." human = "{text}" prompt = ChatPromptTemplate.from_messages([("system", system), ("human", human)]) chain = prompt \| chat out = chain.invoke({"text": "Explain the importance of low latency LLMs"}) print(out) ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-24 21:58:03 +00:00
volodymyr-memsql	493afe4d8d	community[patch]: add hybrid search to singlestoredb vectorstore (#20793 ) Implemented the ability to enable full-text search within the SingleStore vector store, offering users a versatile range of search strategies. This enhancement allows users to seamlessly combine full-text search with vector search, enabling the following search strategies: * Search solely by vector similarity. * Conduct searches exclusively based on text similarity, utilizing Lucene internally. * Filter search results by text similarity score, with the option to specify a threshold, followed by a search based on vector similarity. * Filter results by vector similarity score before conducting a search based on text similarity. * Perform searches using a weighted sum of vector and text similarity scores. Additionally, integration tests have been added to comprehensively cover all scenarios. Updated notebook with examples. CC: @baskaryan, @hwchase17 --------- Co-authored-by: Volodymyr Tkachuk <vtkachuk-ua@singlestore.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-24 21:34:50 +00:00
Tomaz Bratanic	9efab3ed66	community[patch]: Add driver config param for neo4j graph (#20772 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-24 21:14:41 +00:00
Leonid Ganeline	13751c3297	community: `tigergraph` fixes (#20034 ) - added guard on the `pyTigerGraph` import - added a missed example page in the `docs/integrations/graphs/` - formatted the `docs/integrations/providers/` page to the consistent format. Added links.	2024-04-24 16:49:21 -04:00
Martin Kolb	0186e4e633	community[patch]: Advanced filtering for HANA Cloud Vector Engine (#20821 ) - Description: This PR adds support for advanced filtering to the integration of HANA Vector Engine. The newly supported filtering operators are: $eq, $ne, $gt, $gte, $lt, $lte, $between, $in, $nin, $like, $and, $or - Issue: N/A - Dependencies: no new dependencies added Added integration tests to: `libs/community/tests/integration_tests/vectorstores/test_hanavector.py` Description of the new capabilities in notebook: `docs/docs/integrations/vectorstores/hanavector.ipynb`	2024-04-24 13:47:27 -07:00
Alex Sherstinsky	12e5ec6de3	community: Support both Predibase SDK-v1 and SDK-v2 in Predibase-LangChain integration (#20859 )	2024-04-24 13:31:01 -07:00
Erick Friis	8c95ac3145	docs, multiple: de-beta with_structured_output (#20850 )	2024-04-24 19:34:57 +00:00
Nuno Campos	477eb1745c	Better support for subgraphs in graph viz (#20840 )	2024-04-24 12:32:52 -07:00
aditya thomas	a9c7d47c03	docs: update openai llm documentation (#20827 ) Description: Bring OpenAI LLM page to the LCEL era Issue: See discussion #20810 Dependencies: None	2024-04-24 12:26:57 -07:00
JeffKatzy	5ab3f9a995	community[patch]: standardize chat init args (#20844 ) Thank you for contributing to LangChain! community:perplexity[patch]: standardize init args updated pplx_api_key and request_timeout so that aliased to api_key, and timeout respectively. Added test that both continue to set the same underlying attributes. Related to [20085](https://github.com/langchain-ai/langchain/issues/20085) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-24 12:26:05 -07:00
Pavlo Paliychuk	70ae59bcfe	docs: Update Zep Messaging, add links to Zep Cloud Docs (#20848 ) Thank you for contributing to LangChain! - [x] PR title: docs: Update Zep Messaging, add links to Zep Cloud Docs - [x] PR message: - Description: This PR updates Zep messaging in the docs + links to Langchain Zep Cloud examples in our documentation - Twitter handle: @paulpaliychuk51 - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-24 19:14:54 +00:00
Massimiliano Pronesti	8d1167b32f	community[patch]: add support for similarity_score_threshold search in… (#20852 ) See https://github.com/langchain-ai/langchain/issues/20600#issuecomment-2075569338 for details. @chrislrobert	2024-04-24 19:14:33 +00:00
Bagatur	87d31a3ec0	docs: contributing note (#20843 )	2024-04-24 10:41:19 -07:00
Eugene Yurtsev	d8aa72f51d	core[minor],langchain[patch]: Move base indexing interface and logic to core (#20667 ) This PR moves the interface and the logic to core. The following changes to namespaces: `indexes` -> `indexing` `indexes._api` -> `indexing.api` Testing code is intentionally duplicated for now since it's testing different implementations of the record manager (in-memory vs. SQL). Common logic will need to be pulled out into the test client. A follow up PR will move the SQL based implementation outside of LangChain.	2024-04-24 13:18:42 -04:00
ccurme	3bcfbcc871	groq: handle null queue_time (#20839 )	2024-04-24 09:50:09 -07:00
Eugene Yurtsev	30e48c9878	core[patch],community[patch]: Move file chat history back to community (#20834 ) Marking as patch since we haven't had releases in between. This just reverting part of a PR from yesterday.	2024-04-24 12:47:25 -04:00
ccurme	6debadaa70	groq: bump core (#20838 )	2024-04-24 11:51:46 -04:00
Erick Friis	7984206c95	groq: release 0.1.3 (#20836 ) Fixes #20811	2024-04-24 08:06:06 -07:00
Nestor Qin	9111d3a636	community[patch]: Fix message formatting for Anthropic models on Amazon Bedrock (#20801 ) Description: This PR fixes an issue in message formatting function for Anthropic models on Amazon Bedrock. Currently, LangChain BedrockChat model will crash if it uses Anthropic models and the model return a message in the following type: - `AIMessageChunk` Moreover, when use BedrockChat with for building Agent, the following message types will trigger the same issue too: - `HumanMessageChunk` - `FunctionMessage` Issue: https://github.com/langchain-ai/langchain/issues/18831 Dependencies: No. Testing: Manually tested. The following code was failing before the patch and works after. ``` @tool def square_root(x: str): "Useful when you need to calculate the square root of a number" return math.sqrt(int(x)) llm = ChatBedrock( model_id="anthropic.claude-3-sonnet-20240229-v1:0", model_kwargs={ "temperature": 0.0 }, ) prompt = ChatPromptTemplate.from_messages( [ ("system", FUNCTION_CALL_PROMPT), ("human", "Question: {user_input}"), MessagesPlaceholder(variable_name="agent_scratchpad"), ] ) tools = [square_root] tools_string = format_tool_to_anthropic_function(square_root) agent = ( RunnablePassthrough.assign( user_input=lambda x: x['user_input'], agent_scratchpad=lambda x: format_to_openai_function_messages( x["intermediate_steps"] ) ) \| prompt \| llm \| AnthropicFunctionsAgentOutputParser() ) agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True, return_intermediate_steps=True) output = agent_executor.invoke({ "user_input": "What is the square root of 2?", "tools_string": tools_string, }) ``` List of messages returned from Bedrock: ``` <SystemMessage> content='You are a helpful assistant.' <HumanMessage> content='Question: What is the square root of 2?' <AIMessageChunk> content="Okay, let's calculate the square root of 2.<scratchpad>\nTo calculate the square root of a number, I can use the square_root tool:\n\n<function_calls>\n <invoke>\n <tool_name>square_root</tool_name>\n <parameters>\n <__arg1>2</__arg1>\n </parameters>\n </invoke>\n</function_calls>\n</scratchpad>\n\n<function_results>\n<search_result>\nThe square root of 2 is approximately 1.414213562373095\n</search_result>\n</function_results>\n\n<answer>\nThe square root of 2 is approximately 1.414213562373095\n</answer>" id='run-92363df7-eff6-4849-bbba-fa16a1b2988c'" <FunctionMessage> content='1.4142135623730951' name='square_root' ```	2024-04-23 22:40:39 +00:00
ccurme	06b04b80b8	groq: fix warning filter for integration test (#20806 )	2024-04-23 18:11:41 -04:00
ccurme	5a3c65a756	standard tests: add xfails (#20659 )	2024-04-23 17:14:16 -04:00
Erick Friis	ddc2274aea	standard-tests: split tool calling test (#20803 ) just making it a bit easier to grok	2024-04-23 20:59:45 +00:00
ccurme	6622829c67	mistral: catch GatedRepoError, release 0.1.3 (#20802 ) https://github.com/langchain-ai/langchain/issues/20618 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-23 20:56:42 +00:00
Eugene Yurtsev	a7c347ab35	langchain[patch]: Update evaluation logic that instantiates a default LLM (#20760 ) Favor langchain_openai over langchain_community for evaluation logic. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-04-23 16:09:32 -04:00
Eugene Yurtsev	72f720fa38	langchain[major]: Remove default instantations of LLMs from VectorstoreToolkit (#20794 ) Remove default instantiation from vectorstore toolkit.	2024-04-23 16:09:14 -04:00
ccurme	42de5168b1	langchain: deprecate LLMChain, RetrievalQA, and ConversationalRetrievalChain (#20751 )	2024-04-23 15:55:34 -04:00
Erick Friis	30c7951505	core: use qualname in beta message (#20361 )	2024-04-23 11:20:13 -07:00
Aliaksandr Kuzmik	5560cc448c	community[patch]: fix CometTracer bug (#20796 ) Hi! My name is Alex, I'm an SDK engineer from [Comet](https://www.comet.com/site/) This PR updates the `CometTracer` class. Fixed an issue when `CometTracer` failed while logging the data to Comet because this data is not JSON-encodable. The problem was in some of the `Run` attributes that could contain non-default types inside, now these attributes are taken not from the run instance, but from the `run.dict()` return value.	2024-04-23 13:24:41 -04:00
Eugene Yurtsev	1c89e45c14	langchain[major]: breaks some chains to remove hidden defaults (#20759 ) Breaks some chains in langchain to remove hidden chat model / llm instantiation.	2024-04-23 11:11:40 -04:00
Eugene Yurtsev	ad6b5f84e5	community[patch],core[minor]: Move in memory cache implementation to core (#20753 ) This PR moves the InMemoryCache implementation from community to core.	2024-04-23 11:10:11 -04:00
Stefano Ottolenghi	4f67ce485a	docs: Fix typo to render list (#20774 ) This _should_ fix the currently broken list in the [Neo4jVector page](https://python.langchain.com/docs/integrations/vectorstores/neo4jvector/). ![Screenshot from 2024-04-23 08-40-37](https://github.com/langchain-ai/langchain/assets/114478074/ab5ad622-879e-4764-93db-5f502eae479b)	2024-04-23 14:46:58 +00:00
Eugene Yurtsev	a2cc9b55ba	core[patch]: Remove autoupgrade to addable dict in Runnable/RunnableLambda/RunnablePassthrough transform (#20677 ) Causes an issue for this code ```python from langchain.chat_models.openai import ChatOpenAI from langchain.output_parsers.openai_tools import JsonOutputToolsParser from langchain.schema import SystemMessage prompt = SystemMessage(content="You are a nice assistant.") + "{question}" llm = ChatOpenAI( model_kwargs={ "tools": [ { "type": "function", "function": { "name": "web_search", "description": "Searches the web for the answer to the question.", "parameters": { "type": "object", "properties": { "query": { "type": "string", "description": "The question to search for.", }, }, }, }, } ], }, streaming=True, ) parser = JsonOutputToolsParser(first_tool_only=True) llm_chain = prompt \| llm \| parser \| (lambda x: x) for chunk in llm_chain.stream({"question": "tell me more about turtles"}): print(chunk) # message = llm_chain.invoke({"question": "tell me more about turtles"}) # print(message) ``` Instead by definition, we'll assume that RunnableLambdas consume the entire stream and that if the stream isn't addable then it's the last message of the stream that's in the usable format. --- If users want to use addable dicts, they can wrap the dict in an AddableDict class. --- Likely, need to follow up with the same change for other places in the code that do the upgrade	2024-04-23 10:35:06 -04:00
Oleksandr Yaremchuk	9428923bab	experimental[minor]: upgrade the prompt injection model (#20783 ) - Description: In January, Laiyer.ai became part of ProtectAI, which means the model became owned by ProtectAI. In addition to that, yesterday, we released a new version of the model addressing issues the Langchain's community and others mentioned to us about false-positives. The new model has a better accuracy compared to the previous version, and we thought the Langchain community would benefit from using the [latest version of the model](https://huggingface.co/protectai/deberta-v3-base-prompt-injection-v2). - Issue: N/A - Dependencies: N/A - Twitter handle: @alex_yaremchuk	2024-04-23 10:23:39 -04:00
Eugene Yurtsev	645b1e142e	core[minor],langchain[patch],community[patch]: Move InMemory and File implementations of Chat History to core (#20752 ) This PR moves the implementations for chat history to core. So it's easier to determine which dependencies need to be broken / add deprecation warnings	2024-04-23 10:22:11 -04:00
ccurme	7a922f3e48	core, openai: support custom token encoders (#20762 )	2024-04-23 13:57:05 +00:00
Chen94yue	b481b73805	Update custom_retriever.ipynb (#20776 ) Fixed an error in the sample code to ensure that the code can run directly. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-23 13:47:08 +00:00
Bagatur	ed980601e1	docs: update examples in api ref (#20768 )	2024-04-23 00:47:52 +00:00
Bagatur	be51cd3bc9	docs: fix api ref link autogeneration (#20766 )	2024-04-22 17:36:41 -07:00
monke111	c807f0a6dd	Update google_drive.ipynb (#20731 ) langchain_community.document_loaders depricated new langchain_google_community Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-22 23:30:46 +00:00
Katarina Supe	dc61e23886	docs: update Memgraph docs (#20736 ) - Description: Memgraph Platform is being run differently now so I updated this (I am DX engineer from Memgraph).	2024-04-22 19:27:12 -04:00
Tabish Mir	6a0d44d632	docs: Fix link for `partition_pdf` in Semi_Structured_RAG.ipynb cookbook (#20763 ) docs: Fix link for `partition_pdf` in Semi_Structured_RAG.ipynb cookbook - Description: Fix incorrect link to unstructured-io `partition_pdf` section	2024-04-22 23:22:55 +00:00
Bagatur	fa4d6f9f8b	docs: install partner pkgs vercel (#20761 )	2024-04-22 23:08:02 +00:00
Christophe Bornet	0ae5027d98	community[patch]: Remove usage of deprecated StoredBlobHistory in CassandraChatMessageHistory (#20666 )	2024-04-22 17:11:05 -04:00
Bagatur	eb18f4e155	infra: rm sep repo partner dirs (#20756 ) so you can `poetry run pip install -e libs/partners/*/` to your hearts content	2024-04-22 14:05:39 -07:00
Bagatur	2a11a30572	docs: automatically add api ref links (#20755 ) ![Screenshot 2024-04-22 at 1 51 13 PM](https://github.com/langchain-ai/langchain/assets/22008038/b8b09fec-3800-4b97-bd26-5571b8308f4a)	2024-04-22 14:05:29 -07:00
Eugene Yurtsev	936c6cc74a	langchain[patch]: Add missing deprecation for openai adapters (#20668 ) Add missing deprecation for openai adapters	2024-04-22 14:05:55 -04:00
Eugene Yurtsev	38adbfdf34	community[patch],core[minor]: Move BaseToolKit to core.tools (#20669 )	2024-04-22 14:04:30 -04:00
Mark Needham	ce23f8293a	Community patch clickhouse make it possible to not specify index (#20460 ) Vector indexes in ClickHouse are experimental at the moment and can sometimes break/change behaviour. So this PR makes it possible to say that you don't want to specify an index type. Any queries against the embedding column will be brute force/linear scan, but that gives reasonable performance for small-medium dataset sizes. --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-22 10:46:37 -07:00
ccurme	c010ec8b71	patch: deprecate (a)get_relevant_documents (#20477 ) - `.get_relevant_documents(query)` -> `.invoke(query)` - `.get_relevant_documents(query=query)` -> `.invoke(query)` - `.get_relevant_documents(query, callbacks=callbacks)` -> `.invoke(query, config={"callbacks": callbacks})` - `.get_relevant_documents(query, kwargs)` -> `.invoke(query, kwargs)` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-22 11:14:53 -04:00
A Noor	939d113d10	docs: Fixed grammar mistake (#20697 ) Description: Changed "You are" to "You are a". Grammar issue. Dependencies: None Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-22 02:55:05 +00:00
Matheus Henrique Raymundo	bb69819267	community: Fix the stop sequence key name for Mistral in Bedrock (#20709 ) Fixing the wrong stop sequence key name that causes an error on AWS Bedrock. You can check the MistralAI bedrock parameters [here](https://docs.aws.amazon.com/bedrock/latest/userguide/model-parameters-mistral.html) This change fixes this [issue](https://github.com/langchain-ai/langchain/issues/20095)	2024-04-21 20:06:06 -04:00
Bagatur	1c7b3c75a7	community[patch], experimental[patch]: support tool-calling sql and p… (#20639 ) d agents	2024-04-21 15:43:09 -07:00
Bagatur	d0cee65cdc	langchain[patch]: langchain-pinecone self query support (#20702 )	2024-04-21 15:42:39 -07:00
Leonid Kuligin	5ae738c4fe	docs: on google-genai vs google-vertexai (#20713 ) Thank you for contributing to LangChain! - [ ] PR title: "docs: added a description of differences langchain_google_genai vs langchain_google_vertexai" - [ ] - Description: added a description of differences langchain_google_genai vs langchain_google_vertexai	2024-04-21 12:53:19 -07:00
shumway743	cb6e5e56c2	community[minor]: add graph store implementation for apache age (#20582 ) Description: implemented GraphStore class for Apache Age graph db Dependencies: depends on psycopg2 Unit and integration tests included. Formatting and linting have been run. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-20 14:31:04 -07:00
Christophe Bornet	c909ae0152	community[minor]: Add async methods to CassandraVectorStore (#20602 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-20 02:09:58 +00:00
Leonid Ganeline	06d18c106d	langchain[patch]: `example_selector` import fix (#20676 ) Cleaned up updated imports	2024-04-19 21:42:18 -04:00
Leonid Ganeline	d6470aab60	langchain: `dosctore` import fix (#20678 ) Cleaned up imports	2024-04-19 21:41:36 -04:00
Leonid Ganeline	3a750e130c	templates: `utilities` import fix (#20679 ) Updated imports from `from langchain.utilities` to `from langchain_community.utilities`	2024-04-19 21:41:15 -04:00
Dmitry Tyumentsev	f111efeb6e	community[patch]: YandexGPT API add ability to disable request logging (#20670 ) Closes (#20622) Added the ability to [disable logging of requests to YandexGPT](https://yandex.cloud/en/docs/foundation-models/operations/yandexgpt/disable-logging).	2024-04-19 21:40:37 -04:00
Erick Friis	e5f5d9ff56	docs: aws listing (#20674 )	2024-04-19 21:27:35 +00:00
Mateusz Szewczyk	75ffe51bbe	ibm: Add support for Embedding Models (#20647 ) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-19 20:56:24 +00:00
Erick Friis	73809817ff	community: release 0.0.34 (#20672 )	2024-04-19 12:44:41 -07:00
Tomaz Bratanic	e4b38e2822	Update neo4j cypher templates to the function callback (#20515 ) Update Neo4j Cypher templates to use function callback to pass context instead of passing it in user prompt. Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-19 18:33:32 +00:00
Tomaz Bratanic	3d9b26fc28	Update neo4j vector documentation (#20455 ) Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-19 18:32:13 +00:00
Tomaz Bratanic	8c08cf4619	community: Add support for relationship indexes in neo4j vector (#20657 ) Neo4j has added relationship vector indexes. We can't populate them, but we can use existing indexes for retrieval	2024-04-19 11:22:42 -07:00
Erick Friis	940242c1ec	core: release 0.1.45 (#20664 )	2024-04-19 09:55:02 -07:00
Saurabh Chalke	3dd6266bcc	docs: Remove Duplicate --quiet Flag in Installation Command in LangSmith Docs (#20121 ) Description: This pull request removes a duplicated `--quiet` flag in the pip install command found in the LangSmith Walkthrough section of the documentation. Issue: N/A Dependencies: None	2024-04-19 11:16:44 -04:00
Aditya	6a97448928	Updated Tutorials for Vertex Vector Search (#20376 ) Thank you for contributing to LangChain! - [ ] PR title: "package: docs" - [ ] PR message: - Description: Updated Tutorials for Vertex Vector Search - Issue: NA - Dependencies: NA - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! @lkuligin for review --------- Co-authored-by: adityarane@google.com <adityarane@google.com> Co-authored-by: Leonid Kuligin <lkuligin@yandex.ru> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-19 10:38:00 -04:00
Boris Djurdjevic	c5aab9afe3	docs: Fix minor typo in data_connection/document_loaders/custom (#20648 ) Description: Minor documentation typo fix in `data_connection/document_loaders/custom`: `thta's` -> `that's`	2024-04-19 14:17:00 +00:00
Souls-R	36084e7500	docs: fix variable name typo in example code (#20658 ) This pull request corrects a mistake in the variable name within the example code. The variable doc_schema has been changed to dog_schema to fix the error.	2024-04-19 14:08:25 +00:00
Leonid Ganeline	beebd73f95	docs: `integrations/retrievers` cleanup (#20357 ) Fixed format inconsistencies; added descriptions, links.	2024-04-19 10:02:41 -04:00
Leonid Ganeline	0b99e9201d	docs: providers `alibaba` update (#20560 ) Added missed integrations to the Alibaba Cloud provider page	2024-04-18 23:11:17 -07:00
Leonid Ganeline	27a4682415	docs: imports update (#20625 ) Updated imports in docs Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-18 23:04:07 -07:00
Ethan Yang	53ae77b13e	docs: Update openvino example documents links (#20638 )	2024-04-18 22:57:28 -07:00
Sivaudha	baedc3ec0a	langchain[minor]: Databricks vector search self query integration (#20627 ) - Enable self querying feature for databricks vector search --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-19 03:44:38 +00:00
ccurme	6d530481c1	openai: fix allowed block types (#20636 )	2024-04-18 22:12:57 -04:00
Erick Friis	764871f97d	infra: add test-doc-imports to ci failure (#20637 )	2024-04-19 02:06:57 +00:00
Erick Friis	5c216ad08f	upstage[patch]: un-xfail tool calling test, release 0.1.0 (#20635 )	2024-04-19 02:02:21 +00:00
Nuno Campos	48307e46a3	core[patch]: Fix runnable map ser/de (#20631 )	2024-04-18 18:52:33 -07:00
Charlie Holtz	1cbab0ebda	community: update Replicate to work with official models (#20633 ) Description: you don't need to pass a version for Replicate official models. That was broken on LangChain until now! You can now run: ``` llm = Replicate( model="meta/meta-llama-3-8b-instruct", model_kwargs={"temperature": 0.75, "max_length": 500, "top_p": 1}, ) prompt = """ User: Answer the following yes/no question by reasoning step by step. Can a dog drive a car? Assistant: """ llm(prompt) ``` I've updated the replicate.ipynb to reflect that. twitter: @charliebholtz --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-19 01:43:40 +00:00
Congyu	dd5139e304	community[patch]: truncate zhipuai `temperature` and `top_p` parameters to [0.01, 0.99] (#20261 ) ZhipuAI API only accepts `temperature` parameter between `(0, 1)` open interval, and if `0` is passed, it responds with status code `400`. However, 0 and 1 is often accepted by other APIs, for example, OpenAI allows `[0, 2]` for temperature closed range. This PR truncates temperature parameter passed to `[0.01, 0.99]` to improve the compatibility between langchain's ecosystem's and ZhipuAI (e.g., ragas `evaluate` often generates temperature 0, which results in a lot of 400 invalid responses). The PR also truncates `top_p` parameter since it has the same restriction. Reference: [glm-4 doc](https://open.bigmodel.cn/dev/api#glm-4) (which unfortunately is in Chinese though). --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-19 01:31:30 +00:00
Lance Martin	d5c22b80a5	community[patch]: Fix Ollama for LLaMA3 (#20624 ) We see verbose generations w/ LLaMA3 and Ollama - https://smith.langchain.com/public/88c4cd21-3d57-4229-96fe-53443398ca99/r --- Fix here implies that when stop was being set to an empty list, the stream had no conditions under which to stop, which could lead to excessive or unintended output. Test LLaMA2 - https://smith.langchain.com/public/57dfc64a-591b-46fa-a1cd-8783acaefea2/r Test LLaMA3 - https://smith.langchain.com/public/76ff5f47-ac89-4772-a7d2-5caa907d3fd6/r https://smith.langchain.com/public/a31d2fad-9094-4c93-949a-964b27630ccb/r Test Mistral - https://smith.langchain.com/public/a4fe7114-c308-4317-b9fd-6c86d31f1c5b/r --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-19 00:20:32 +00:00
Erick Friis	726234eee5	infra: fix doc imports ci (#20629 )	2024-04-18 23:42:03 +00:00
Erick Friis	3425988de7	core: deprecation default to qualname (#20578 )	2024-04-18 15:35:17 -07:00
hulitaitai	7d0a008744	community[minor]: Add audio-parser "faster-whisper" in audio.py (#20012 ) faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is up to 4 times faster than enai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU. It can automatically detect the following 14 languages and transcribe the text into their respective languages: en, zh, fr, de, ja, ko, ru, es, th, it, pt, vi, ar, tr. The gitbub repository for faster-whisper is : https://github.com/SYSTRAN/faster-whisper --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-18 20:50:59 +00:00
Guangdong Liu	e3c2431c5b	comminuty[patch]:Fix Error in apache doris insert (#19989 ) - Issue: #19886	2024-04-18 16:34:32 -04:00
naaive	6f0d4f3f09	docs: Update body_func to hybrid_query in ElasticsearchRetriever (#20498 )	2024-04-18 20:19:02 +00:00
Tomaz Bratanic	27370b679e	community[patch]: Ignore null and invalid embedding values for neo4j metadata filtering (#20558 )	2024-04-18 16:15:45 -04:00
Eugene Yurtsev	718c9cbe3a	mistral[patch]: Support both model and model_name (#20557 )	2024-04-18 16:12:33 -04:00
Eugene Yurtsev	e3bd521654	docs: Remove example vsdx data (#20620 ) VSDX data contains EMF files. Some of these apparently can contain exploits with some Adobe tools. This is likely a false positive from antivirus software, but we can remove it nonetheless.	2024-04-18 16:10:40 -04:00
Dhruv Chawla	c0548eb632	docs: Update uptrain.ipynb to show outputs (#20551 ) Hey @eyurtsev, I noticed that the notebook isn't displaying the outputs properly. I've gone ahead and rerun the cells to ensure that readers can easily understand the functionality without having to run the code themselves.	2024-04-18 16:10:23 -04:00
Leonid Ganeline	95dc90609e	experimental[patch]: `prompts` import fix (#20534 ) Replaced `from langchain.prompts` with `from langchain_core.prompts` where it is appropriate. Most of the changes go to `langchain_experimental` Similar to #20348	2024-04-18 16:09:11 -04:00
Massimiliano Pronesti	2542a09abc	community[patch]: AzureSearch incorrectly converted to retriever (#20601 ) Closes #20600. Please see the issue for more details.	2024-04-18 16:06:47 -04:00
Leonid Ganeline	520ef24fb9	docs: import update (#20610 ) Updated imports	2024-04-18 16:05:17 -04:00
Christophe Bornet	8f0b5687a3	community[minor]: Add hybrid search to Cassandra VectorStore (#20286 ) Only supported by Astra DB at the moment. Twitter handle: cbornet_	2024-04-18 15:58:43 -04:00
Christophe Bornet	d2d01370bc	community[minor]: Add async methods to CassandraLoader (#20609 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-18 19:45:20 +00:00
Eugene Yurtsev	8c29b7bf35	mistralai[patch]: Use public attribute for eventsource.response (#20580 ) Minor change, use the public attribute instead of the protected one.	2024-04-18 14:12:12 -04:00
Erick Friis	66fb0b1f35	core: fix fireworks mapping (#20613 )	2024-04-18 18:08:40 +00:00
balloonio	e786da7774	community[patch]: Invoke callback prior to yielding token fix [HuggingFaceTextGenInference] (#20426 ) …gFaceTextGenInference) - [x] PR title: community[patch]: Invoke callback prior to yielding token fix for [HuggingFaceTextGenInference] - [x] PR message: - Description: Invoke callback prior to yielding token in stream method in [HuggingFaceTextGenInference] - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None - Twitter handle: @bolun_zhang If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-18 14:25:20 +00:00
Ethan Yang	2d6d796040	community: Add save_model function for openvino reranker and embedding (#19896 )	2024-04-18 10:20:33 -04:00
zR	9c1d7f2405	update zhipuai notebook (#20595 ) fix timeout issue fix zhipuai usecase notebookbook Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-18 10:12:12 -04:00
MajorDouble	9c175bc618	Update README.md -- broken hyperlink (#20422 ) fixed broken `LangGraph` hyperlink Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-18 14:07:52 +00:00
Ikko Eltociear Ashimine	7a884eb416	Update RAPTOR.ipynb (#20586 ) Langauge -> Language	2024-04-18 09:47:17 -04:00
Justsosostar	697d98cac9	fix typo in langchain/docs/docs/intergrations/tools/nuclia.ipynb (#20591 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-18 13:46:45 +00:00
ccurme	c897264b9b	community: (milvus) check for num_shards (#20603 ) @rgupta2508 I believe this change is necessary following https://github.com/langchain-ai/langchain/pull/20318 because of how Milvus handles defaults: `59bf5e811a/pymilvus/client/prepare.py (L82-L85)` ```python num_shards = kwargs[next(iter(same_key))] if not isinstance(num_shards, int): msg = f"invalid num_shards type, got {type(num_shards)}, expected int" raise ParamError(message=msg) req.shards_num = num_shards ``` this way lets Milvus control the default value (instead of maintaining a separate default in Langchain). Let me know if I've got this wrong or you feel it's unnecessary. Thanks.	2024-04-18 09:44:56 -04:00
Rohit Gupta	25c4c24e89	Support to create shards_num in milvus vectorstores (#20318 ) To support number of the shards for the collection to create in milvus vvectorstores. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-18 08:58:00 -04:00
aditya thomas	8bad536c6c	docs[callbacks]: update to the FileCallbackHandler documentation (#20496 ) Description: Update to the `FileCallbackHandler` documentation Issue: #20493 Dependencies: None	2024-04-17 22:32:21 -04:00
aditya thomas	cea379e7c7	community, core[callbacks]: move FileCallbackHandler from community to core (#20495 ) Description: Move `FileCallbackHandler` from community to core Issue: #20493 Dependencies: None (imo) `FileCallbackHandler` is a built-in LangChain callback handler like `StdOutCallbackHandler` and should properly be in in core.	2024-04-17 22:29:30 -04:00
Erick Friis	084bedd16e	docs: nits (#20577 )	2024-04-18 00:20:44 +00:00
Erick Friis	e7e94b37f1	upstage: fix core dep (#20576 )	2024-04-17 16:33:09 -07:00
Erick Friis	e395115807	docs: aws docs updates (#20571 )	2024-04-17 23:32:00 +00:00
Erick Friis	f09bd0b75b	upstage: init package (#20574 ) Co-authored-by: Sean Cho <sean@upstage.ai> Co-authored-by: JuHyung-Son <sonju0427@gmail.com>	2024-04-17 23:25:36 +00:00
Marco Perini	11c9ed3362	community[patch]: exposing headless flag parameter to AsyncChromiumLoader class (#20424 ) - Description: added the headless parameter as optional argument to the langchain_community.document_loaders AsyncChromiumLoader class - Dependencies: None - Twitter handle: @perinim_98 If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-17 16:00:28 -07:00
Bagatur	54e9271504	anthropic[patch]: fix msg mutation (#20572 )	2024-04-17 15:47:19 -07:00
Nuno Campos	719da8746e	core: fix attributeerror in runnablelambda.deps (#20569 ) - would happen when user's code tries to access attritbute that doesnt exist, we prefer to let this crash in the user's code, rather than here - also catch more cases where a runnable is invoked/streamed inside a lambda. before we weren't seeing these as deps	2024-04-17 15:38:39 -07:00
Jacob Lee	8b09e81496	Lock low level dep to fix Vercel docs build (#20573 ) @baskaryan @efriis TODO: Figure out why our lockfile isn't being respected here	2024-04-17 15:21:28 -07:00
Christophe Bornet	a22da4315b	community[patch]: Replace function in CassandraVectorStore with simpler lambda (#20323 )	2024-04-17 17:13:13 -04:00
Christophe Bornet	75733c5cc1	community[minor]: Improve CassandraVectorStore from_texts (#20284 )	2024-04-17 17:12:28 -04:00
Tomer Cagan	463160c3f6	community: fix `DirectoryLoader` progress bar (#19821 ) Description: currently, the `DirectoryLoader` progress-bar maximum value is based on an incorrect number of files to process In langchain_community/document_loaders/directory.py:127: ```python paths = p.rglob(self.glob) if self.recursive else p.glob(self.glob) items = [ path for path in paths if not (self.exclude and any(path.match(glob) for glob in self.exclude)) ] ``` `paths` returns both files and directories. `items` is later used to determine the maximum value of the progress-bar which gives an incorrect progress indication.	2024-04-17 21:12:16 +00:00
Bagatur	984e7e36c2	anthropic[patch]: Release 0.1.10 (#20568 )	2024-04-17 14:05:42 -07:00
Pengcheng Liu	ecd19a9e58	community[patch]: Add function call support in Tongyi chat model. (#20119 ) - [ ] PR message: - Description: This pr adds function calling support in Tongyi chat model. - Issue: None - Dependencies: None - Twitter handle: None Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-17 20:42:23 +00:00
kaijietti	80679ab906	zep[patch]: implement add_messages and aadd_messages (#20099 ) This PR implement `add_messages` and `aadd_messages` to avoid unnecessary round-trips.	2024-04-17 13:40:24 -07:00
Guangdong Liu	55dd349472	docs: Get rid of ZeroShotAgent and use create_react_agent instead (#20154 ) - Issue: close #20122 - @baskaryan, @eyurtsev.	2024-04-17 13:35:14 -07:00
Guangdong Liu	1e3b07aae2	docs: Get rid of ZeroShotAgent and use create_react_agent instead (#20155 ) - Issue: #20122 - @baskaryan,@eyurtsev	2024-04-17 13:34:57 -07:00
ccurme	2238490069	mistral, openai: allow anthropic-style messages in message histories (#20565 )	2024-04-17 15:55:45 -04:00
Eugene Yurtsev	7a7851aa06	anthropic[patch]: Handle empty text block (#20566 ) Handle empty text block	2024-04-17 15:37:04 -04:00
Bagatur	7917e2c418	core[patch]: Release 0.1.44 (#20564 )	2024-04-17 18:34:44 +00:00
ccurme	4a17951900	mistral: read tool calls from AIMessage (#20554 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-17 13:38:24 -04:00
Eugene Yurtsev	f257909699	mistralai[patch]: Surface http errors (#20555 ) Do not swallow errors when streaming with httpx. Update affected code if this PR gets merged to httpx: https://github.com/florimondmanca/httpx-sse/pull/25/files	2024-04-17 10:47:56 -04:00
Sevin F. Varoglu	3f156e0ece	community[minor]: add ChatOctoAI (#20059 ) This PR adds ChatOctoAI, a chat model integration for OctoAI.	2024-04-17 03:20:56 -07:00
Eun Hye Kim	b34f1086fe	community[patch]: Add streaming logic in ChatHuggingFace (#18784 ) - Add functions (_stream, _astream) - Connect to _generate and _agenerate Thank you for contributing to LangChain! - [x] PR title: "community: Add streaming logic in ChatHuggingFace" - [x] PR message: *Delete this entire checklist* and replace with - Description: Addition functions (_stream, _astream) and connection to _generate and _agenerate - Issue: #18782 - Dependencies: none - Twitter handle: @lunara_x	2024-04-16 19:17:03 -07:00
Bagatur	c05c379b26	docs: add structred output to feat table (#20539 )	2024-04-16 19:14:26 -07:00
pjb157	479be3cc91	community[minor]: Unify Titan Takeoff Integrations and Adding Embedding Support (#18775 ) Community: Unify Titan Takeoff Integrations and Adding Embedding Support Description: Titan Takeoff no longer reflects this either of the integrations in the community folder. The two integrations (TitanTakeoffPro and TitanTakeoff) where causing confusion with clients, so have moved code into one place and created an alias for backwards compatibility. Added Takeoff Client python package to do the bulk of the work with the requests, this is because this package is actively updated with new versions of Takeoff. So this integration will be far more robust and will not degrade as badly over time. Issue: Fixes bugs in the old Titan integrations and unified the code with added unit test converge to avoid future problems. Dependencies: Added optional dependency takeoff-client, all imports still work without dependency including the Titan Takeoff classes but just will fail on initialisation if not pip installed takeoff-client Twitter @MeryemArik9 Thanks all :) --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-17 01:43:35 +00:00
Rahul Triptahi	2cbfc94bcb	community[patch]: Add support for authorized identities in PebbloSafeLoader. (#20055 ) Description: Add support for authorized identities in PebbloSafeLoader. Now with this change, PebbloSafeLoader will extract authorized_identities from metadata and send it to pebblo server Dependencies: None Documentation: None Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-16 18:34:06 -07:00
Rahul Triptahi	475892ca0e	docs: Add Documentation to enable authorized access identities in GoogleDriveLoader. (#20065 ) Description: Document update. GoogleDriveLoader: Added documentation for `load_auth` a new argument in document_loaders/GoogleDriveLoader. Dependencies: None Documentation: https://python.langchain.com/docs/integrations/document_loaders/google_drive/ Associated PR: https://github.com/langchain-ai/langchain-google/pull/110 Twitter handle: @rahul_tripathi2 Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-16 18:33:10 -07:00
Guangdong Liu	b78ede2f96	community[patch]: standardize init args (#20166 ) Related to https://github.com/langchain-ai/langchain/issues/20085 @baskaryan	2024-04-16 18:30:26 -07:00
Guangdong Liu	3729bec1a2	community[patch]: standardize init args (#20210 ) Related to https://github.com/langchain-ai/langchain/issues/20085 @baskaryan	2024-04-16 18:29:57 -07:00
sdan	a7c5e41443	community[minor]: Added VLite as VectorStore (#20245 ) Support [VLite](https://github.com/sdan/vlite) as a new VectorStore type. Description: vlite is a simple and blazing fast vector database(vdb) made with numpy. It abstracts a lot of the functionality around using a vdb in the retrieval augmented generation(RAG) pipeline such as embeddings generation, chunking, and file processing while still giving developers the functionality to change how they're made/stored. Before submitting: Added tests [here](`c09c2ebd5c/libs/community/tests/integration_tests/vectorstores/test_vlite.py`) Added ipython notebook [here](`c09c2ebd5c/docs/docs/integrations/vectorstores/vlite.ipynb`) Added simple docs on how to use [here](`c09c2ebd5c/docs/docs/integrations/providers/vlite.mdx`) Profiles Maintainers: @sdan Twitter handles: [@sdand](https://x.com/sdand) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-17 01:24:38 +00:00
Hyeongchan Kim	7824291252	community[patch]: Fix not to cast to str type when `file_path` is None (#20057 ) From `langchain_community 0.0.30`, there's a bug that cannot send a file-like object via `file` parameter instead of `file path` due to casting the `file_path` to str type even if `file_path` is None. which means that when I call the `partition_via_api()`, exactly one of `filename` and `file` must be specified by the following error message. however, from `langchain_community 0.0.30`, `file_path` is casted into `str` type even `file_path` is None in `get_elements_from_api()` and got an error at `exactly_one(filename=filename, file=file)`. here's an error message ``` ---> 51 exactly_one(filename=filename, file=file) 53 if metadata_filename and file_filename: 54 raise ValueError( 55 "Only one of metadata_filename and file_filename is specified. " 56 "metadata_filename is preferred. file_filename is marked for deprecation.", 57 ) File /opt/homebrew/lib/python3.11/site-packages/unstructured/partition/common.py:441, in exactly_one(**kwargs) 439 else: 440 message = f"{names[0]} must be specified." --> 441 raise ValueError(message) ValueError: Exactly one of filename and file must be specified. ``` So, I simply made a change that casting to str type when `file_path` is not None. I use `UnstructuredAPIFileLoader` like below. ``` from langchain_community.document_loaders.unstructured import UnstructuredAPIFileLoader documents: list = UnstructuredAPIFileLoader( file_path=None, file=file, # file-like object, io.BytesIO type mode='elements', url='http://127.0.0.1:8000/general/v0/general', content_type='application/pdf', metadata_filename='asdf.pdf', ).load_and_split() ```	2024-04-16 18:06:21 -07:00
Prashanth Rao	295b9b704b	community[patch]: Improve Kuzu Cypher generation prompt (#20481 ) - [x] PR title: "community: improve kuzu cypher generation prompt" - [x] PR message: *Delete this entire checklist* and replace with - Description: Improves the Kùzu Cypher generation prompt to be more robust to open source LLM outputs - Issue: N/A - Dependencies: N/A - Twitter handle: @kuzudb - [x] Add tests and docs: If you're adding a new integration, please include No new tests (non-breaking. change) - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-04-16 18:01:36 -07:00
MacanPN	bce69ae43d	community[patch]: Changes to base_o365 and sharepoint document loaders (#20373 ) ## Description: The PR introduces 3 changes: 1. added `recursive` property to `O365BaseLoader`. (To keep the behavior unchanged, by default is set to `False`). When `recursive=True`, `_load_from_folder()` also recursively loads all nested folders. 2. added `folder_id` to SharePointLoader.(similar to (this PR)[https://github.com/langchain-ai/langchain/pull/10780] ) This provides an alternative to `folder_path` that doesn't seem to reliably work. 3. when none of `document_ids`, `folder_id`, `folder_path` is provided, the loader fetches documets from root folder. Combined with `recursive=True` this provides an easy way of loading all compatible documents from SharePoint. The PR contains the same logic as [this stale PR](https://github.com/langchain-ai/langchain/pull/10780) by @WaleedAlfaris. I'd like to ask his blessing for moving forward with this one. ## Issue: - As described in https://github.com/langchain-ai/langchain/issues/19938 and https://github.com/langchain-ai/langchain/pull/10780 the sharepoint loader often does not seem to work with folder_path. - Recursive loading of subfolders is a missing functionality ## Dependecies: None Twitter handle: @martintriska1 @WRhetoric This is my first PR here, please be gentle :-) Please review @baskaryan --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-17 00:36:15 +00:00
Sevin F. Varoglu	54d388d898	community[patch]: update OctoAI endpoint to subclass BaseOpenAI (#19757 ) This PR updates OctoAIEndpoint LLM to subclass BaseOpenAI as OctoAI is an OpenAI-compatible service. The documentation and tests have also been updated.	2024-04-16 17:32:20 -07:00
Erick Friis	0c95ddbcd8	docs: add snowflake provider page (#20538 )	2024-04-17 00:31:27 +00:00
Benito Geordie	57b226532d	community[minor]: Added integrations for ThirdAI's NeuralDB as a Retriever (#17334 ) Description: Adds ThirdAI NeuralDB retriever integration. NeuralDB is a CPU-friendly and fine-tunable text retrieval engine. We previously added a vector store integration but we think that it will be easier for our customers if they can also find us under under langchain-community/retrievers. --------- Co-authored-by: kartikTAI <129414343+kartikTAI@users.noreply.github.com> Co-authored-by: Kartik Sarangmath <kartik@thirdai.com>	2024-04-16 16:36:55 -07:00
WeichenXu	e9fc87aab1	community[patch]: Make ChatDatabricks model supports streaming response (#19912 ) Description: Make ChatDatabricks model supports stream Issue: N/A Dependencies: MLflow nightly build version (we will release next MLflow version soon) Twitter handle: N/A Manually test: (Before testing, please install `pip install git+https://github.com/mlflow/mlflow.git`) ```python # Test Databricks Foundation LLM model from langchain.chat_models import ChatDatabricks chat_model = ChatDatabricks( endpoint="databricks-llama-2-70b-chat", max_tokens=500 ) from langchain_core.messages import AIMessageChunk for chunk in chat_model.stream("What is mlflow?"): print(chunk.content, end="\|") ``` - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Signed-off-by: Weichen Xu <weichen.xu@databricks.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-16 23:34:49 +00:00
ccurme	a892f985d3	standardized-tests[patch]: test tool call messages (#20519 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-16 23:25:50 +00:00
Erick Friis	e7fe5f7d3f	anthropic[patch]: serialization in partner package (#18828 )	2024-04-16 16:05:58 -07:00
Bagatur	f74d5d642e	anthropic[patch]: bump to core 0.1.43 (#20537 )	2024-04-16 22:47:07 +00:00
Bagatur	96d8769eae	anthropic[patch]: release 0.1.9, use tool calls if content is empty (#20535 )	2024-04-16 15:27:29 -07:00
Erick Friis	6adca37eb7	core: default chat/llm _identifying_params to lc_attributes (#20232 )	2024-04-16 14:55:47 -07:00
ccurme	22da9f5f3f	update scheduled tests (#20526 ) repurpose scheduled tests to test over provider packages	2024-04-16 16:49:46 -04:00
Nuno Campos	806a54908c	Runnable graph viz improvements (#20529 ) - Add conditional: bool property to json representation of the graphs - Add option to generate mermaid graph stripped of styles (useful as a text representation of graph)	2024-04-16 20:17:47 +00:00
Nuno Campos	f3aa26d6bf	Fix getattr in runnable binding for cases where config is passed in as arg too (#20528 ) …s arg too Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-16 13:10:29 -07:00
Dhruv Chawla	d6d559d50d	community[minor]: add UpTrainCallbackHandler (#19956 ) - Description: This PR adds a callback handler for UpTrain. It performs evaluations in the RAG pipeline to check the quality of retrieved documents, generated queries and responses. - Dependencies: - The UpTrainCallbackHandler requires the uptrain package --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-04-16 19:32:03 +00:00
Bagatur	07f23bd4ff	docs: response metadata (#20527 )	2024-04-16 12:17:27 -07:00
Leonid Ganeline	45d045b2c5	core[minor], langchain[patch]: `tools` dependencies refactoring (#18759 ) The `langchain.tools` [namespace](https://api.python.langchain.com/en/latest/langchain_api_reference.html#module-langchain.tools) can be completely eliminated by moving one class and 3 functions into `core`. It makes sense since the class and functions are very core.	2024-04-16 14:15:09 -04:00
Erick Friis	77eba10f47	standard-tests: fix default fixtures (#20520 )	2024-04-16 16:12:36 +00:00
Ravindu Somawansa	5acc7ba622	community[minor]: Add glue catalog loader (#20220 ) Add Glue Catalog loader	2024-04-16 11:39:23 -04:00
Dawson Bauer	aab075345e	core[patch]: Fix imports defined in messages sub-package (#20500 ) core[patch]: Fix imports defined in messages sub-package (#20500)	2024-04-16 14:19:51 +00:00
Fayfox	9fd36efdb5	anthropic[patch]: env ANTHROPIC_API_URL not work (#20507 ) enviroment variable ANTHROPIC_API_URL will not work if anthropic_api_url has default value --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-04-16 10:16:51 -04:00
Martín Gotelli Ferenaz	b48add4353	community[patch]: Fix pgvector deprecated filter clause usage with OR and AND conditions (#20446 ) Description: Support filter by OR and AND for deprecated PGVector version Issue: #20445 Dependencies: N/A Twitter handle: @martinferenaz	2024-04-16 14:08:07 +00:00
Eugene Yurtsev	c50099161b	community[patch]: Use uuid4 not uuid1 (#20487 ) Using UUID1 is incorrect since it's time dependent, which makes it easy to generate the exact same uuid	2024-04-16 09:40:44 -04:00
Bagatur	f7667c614b	docs: update tool use case (#20404 )	2024-04-16 04:27:27 +00:00
Erick Friis	86cf1d3ee1	community: release 0.0.33 (#20490 )	2024-04-16 00:30:05 +00:00
Erick Friis	90184255f8	core: release 0.1.43 (#20489 )	2024-04-15 22:48:34 +00:00
Erick Friis	7997f3b7f8	core: forward config params to default (#20402 ) nuno's fault not mine --------- Co-authored-by: Nuno Campos <nuno@boringbits.io> Co-authored-by: Nuno Campos <nuno@langchain.dev>	2024-04-15 15:42:39 -07:00
Nuno Campos	97b2191e99	core: Add concept of conditional edge to graph rendering (#20480 ) - implement for mermaid, graphviz and ascii - this is to be used in langgraph	2024-04-15 13:49:06 -07:00
Averi Kitsch	30b00090ef	docs: Add Google Firestore Vectorstore doc (#20078 ) - Description:Add Google Firestore Vector store docs - Issue: NA - Dependencies: NA --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-15 20:09:32 +00:00
Leonid Kuligin	cc3c343673	docs: changed model's name in google-vertex-ai integration to a publicly available model (#20482 ) docs: changed model's name in google-vertex-ai integration to a publicly available model	2024-04-15 15:18:27 -04:00
Leonid Ganeline	7ea80bcb22	docs: tutorials update (#20483 ) Added the `freeCodeCamp` tutorials link	2024-04-15 15:17:32 -04:00
Ángel Igareta	60c7a17781	Remove logic to exclude intermediate nodes from rendering time (#20459 ) Description: For simplicity, migrate the logic of excluding intermediate nodes in the .get_graph() of langgraph package (https://github.com/langchain-ai/langgraph/pull/310) at graph creation time instead of graph rendering time. Note: #20381 needs to be approved first --------- Co-authored-by: Angel Igareta <angel.igareta@klarna.com> Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: Nuno Campos <nuno@boringbits.io>	2024-04-15 16:40:51 +00:00
Mohammed Noumaan Ahamed	4dd05791a2	docs: quickstart retrieval chain for Cohere(API) (#20475 ) - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! Description: fixes LangChainDeprecationWarning: The class `langchain_community.embeddings.cohere.CohereEmbeddings` was deprecated in langchain-community 0.0.30 and will be removed in 0.2.0. An updated version of the class exists in the langchain-cohere package and should be used instead. To use it run `pip install -U langchain-cohere` and import as `from langchain_cohere import CohereEmbeddings`. ![Screenshot 2024-04-15 200948](https://github.com/langchain-ai/langchain/assets/93511919/085b967d-a6fd-42c6-9404-faab8c5630ec) Dependencies : langchain_cohere Twitter handle: @Mo_Noumaan	2024-04-15 11:28:39 -04:00
Ángel Igareta	d55a365c6c	Fix CDN URL in mermaid graph renderer (#20381 ) Description of features on mermaid graph renderer: - Fixing CDN to use official Mermaid JS CDN: https://www.jsdelivr.com/package/npm/mermaid?tab=files - Add device_scale_factor to allow increasing quality of resulting PNG.	2024-04-15 08:01:35 -07:00
Eugene Yurtsev	3cbc4693f5	docs: Add integration doc for postgres vectorstore (#20473 ) Adds a postgres vectorstore via langchain-postgres.	2024-04-15 14:20:27 +00:00
Leonid Kuligin	676c68d318	community[patch]: deprecating remaining google_community integrations (#20471 ) Deprecating remaining google community integrations	2024-04-15 09:57:12 -04:00
balloonio	b66a4f48fa	community[patch]: Invoke callback prior to yielding token fix [DeepInfra] (#20427 ) - [x] PR title: community[patch]: Invoke callback prior to yielding token fix for [DeepInfra] - [x] PR message: - Description: Invoke callback prior to yielding token in stream method in [DeepInfra] - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None - Twitter handle: @bolun_zhang If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-14 14:32:52 -04:00
Juan Carlos José Camacho	450c458f8f	community[minor]: Add Datahareld tool (#19680 ) Description: Integrate [dataherald](https://www.dataherald.com) tool, It is a natural language-to-SQL tool. Dependencies: Install dataherald sdk to use it, ``` pip install dataherald ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Christophe Bornet <cbornet@hotmail.com>	2024-04-13 23:27:16 +00:00
Alexander Smirnov	ece008f117	docs: Refine RunnablePassthrough docstring (#19812 ) Description: This update refines the documentation for `RunnablePassthrough` by removing an unnecessary import and correcting a minor syntactical error in the example provided. This change enhances the clarity and correctness of the documentation, ensuring that users have a more accurate guide to follow. Issue: N/A Dependencies: None This PR focuses solely on documentation improvements, specifically targeting the `RunnablePassthrough` class within the `langchain_core` module. By clarifying the example provided in the docstring, users are offered a more straightforward and error-free guide to utilizing the `RunnablePassthrough` class effectively. As this is a documentation update, it does not include changes that require new integrations, tests, or modifications to dependencies. It adheres to the guidelines of minimal package interference and backward compatibility, ensuring that the overall integrity and functionality of the LangChain package remain unaffected. Thank you for considering this documentation refinement for inclusion in the LangChain project.	2024-04-13 16:23:32 -07:00
Egor Krasheninnikov	c8391d4ff1	community[patch]: Fix YandexGPT embeddings (#19720 ) Fix of YandexGPT embeddings. The current version uses a single `model_name` for queries and documents, essentially making the `embed_documents` and `embed_query` methods the same. Yandex has a different endpoint (`model_uri`) for encoding documents, see [this](https://yandex.cloud/en/docs/yandexgpt/concepts/embeddings). The bug may impact retrievers built with `YandexGPTEmbeddings` (for instance FAISS database as retriever) since they use both `embed_documents` and `embed_query`. A simple snippet to test the behaviour: ```python from langchain_community.embeddings.yandex import YandexGPTEmbeddings embeddings = YandexGPTEmbeddings() q_emb = embeddings.embed_query('hello world') doc_emb = embeddings.embed_documents(['hello world', 'hello world']) q_emb == doc_emb[0] ``` The response is `True` with the current version and `False` with the changes I made. Twitter: @egor_krash --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-13 16:23:01 -07:00
Guangdong Liu	4be7ca7b4c	community[patch]:sparkllm standardize init args (#20194 ) Related to https://github.com/langchain-ai/langchain/issues/20085 @baskaryan	2024-04-13 16:03:19 -07:00
Rohit Agarwal	7d7a08e458	docs: Update Portkey provider integration (#20412 ) Description: Updates the documentation for Portkey and Langchain. Also updates the notebook. The current documentation is fairly old and is non-functional. Twitter handle: @portkeyai --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-13 23:01:48 +00:00
Yuki Oshima	0758da8940	community[patch]: Set default value for _ListSQLDatabaseToolInput tool_input (#20409 ) Description: `_ListSQLDatabaseToolInput` raise error if model returns `{}`. For example, gpt-4-turbo returns `{}` with SQL Agent initialized by `create_sql_agent`. So, I set default value `""` for `_ListSQLDatabaseToolInput` tool_input. This is actually a gpt-4-turbo issue, not a LangChain issue, but I thought it would be helpful to set a default value `""`. This problem is discussed in detail in the following Issue. Issue: https://github.com/langchain-ai/langchain/issues/20405 Dependencies: none Sorry, I did not add or change the test code, as tests for this components was not exist . However, I have tested the following code based on the [SQL Agent Document](https://python.langchain.com/docs/use_cases/sql/agents/), to make sure it works. ``` from langchain_community.agent_toolkits.sql.base import create_sql_agent from langchain_community.utilities.sql_database import SQLDatabase from langchain_openai import ChatOpenAI db = SQLDatabase.from_uri("sqlite:///Chinook.db") llm = ChatOpenAI(model="gpt-4-turbo", temperature=0) agent_executor = create_sql_agent(llm, db=db, agent_type="openai-tools", verbose=True) result = agent_executor.invoke("List the total sales per country. Which country's customers spent the most?") print(result["output"]) ```	2024-04-13 15:58:47 -07:00
Kenneth Choe	b507cd222b	docs: changed the link to more helpful source (#20411 ) docs: changed a link to better source [Previous link](https://www.philschmid.de/custom-inference-huggingface-sagemaker) is about how to upload embeddings model. [New link](https://huggingface.co/blog/kchoe/deploy-any-huggingface-model-to-sagemaker) is about how to upload cross encoder model, which directly addresses what is needed here. For full disclosure, I wrote this article and the sample `inference.py` is the result of this new article. Co-authored-by: Kenny Choe <kchoe@amazon.com>	2024-04-13 15:54:33 -07:00
saberuster	160bcaeb93	text-splitters[minor]: Add lua code splitting (#20421 ) - Description: Complete the support for Lua code in langchain.text_splitter module. - Dependencies: No - Twitter handle: @saberuster If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-13 22:42:51 +00:00
ccurme	4b6b0a87b6	groq[patch]: Make stream robust to ToolMessage (#20417 ) ```python from langchain.agents import AgentExecutor, create_tool_calling_agent, tool from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder from langchain_groq import ChatGroq prompt = ChatPromptTemplate.from_messages( [ ("system", "You are a helpful assistant"), ("human", "{input}"), MessagesPlaceholder("agent_scratchpad"), ] ) model = ChatGroq(model_name="mixtral-8x7b-32768", temperature=0) @tool def magic_function(input: int) -> int: """Applies a magic function to an input.""" return input + 2 tools = [magic_function] agent = create_tool_calling_agent(model, tools, prompt) agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True) agent_executor.invoke({"input": "what is the value of magic_function(3)?"}) ``` ``` > Entering new AgentExecutor chain... Invoking: `magic_function` with `{'input': 3}` 5The value of magic\_function(3) is 5. > Finished chain. {'input': 'what is the value of magic_function(3)?', 'output': 'The value of magic\\_function(3) is 5.'} ```	2024-04-13 15:40:55 -07:00
Leonid Ganeline	6dc4f592ba	docs: tutorials update (#20401 ) Added 3 new `LangChain.ai` playlists	2024-04-12 21:56:14 -04:00
ccurme	38faa74c23	community[patch]: update use of deprecated llm methods (#20393 ) .predict and .predict_messages for BaseLanguageModel and BaseChatModel	2024-04-12 17:28:23 -04:00
Corey Zumar	3a068b26f3	community[patch]: Databricks - fix scope of dangerous deserialization error in Databricks LLM connector (#20368 ) fix scope of dangerous deserialization error in Databricks LLM connector --------- Signed-off-by: dbczumar <corey.zumar@databricks.com>	2024-04-12 17:27:26 -04:00
Bagatur	f1248f8d9a	core[patch]: configurable init params (#20070 ) Proposed fix for #20061. need to test --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-12 21:18:43 +00:00
Eugene Yurtsev	4808441d29	Docs: Add guide for implementing custom retriever (#20350 ) Add longer guide for implementing custom retriever. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-04-12 17:18:35 -04:00
aditya thomas	4f75b230ed	partner[ai21]: masking of the api key for ai21 models (#20257 ) Description: Masking of the API key for AI21 models Issue: Fixes #12165 for AI21 Dependencies: None Note: This fix came in originally through #12418 but was possibly missed in the refactor to the AI21 partner package --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-12 20:19:31 +00:00
Leonid Ganeline	e512d3c6a6	langchain: `callbacks` imports fix (#20348 ) Replaced all `from langchain.callbacks` into `from langchain_core.callbacks` . Changes in the `langchain` and `langchain_experimental` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-12 20:13:14 +00:00
Erick Friis	d83b720c40	templates: readme langsmith not private beta (#20173 )	2024-04-12 13:08:10 -07:00
michael	525226fb0b	docs: fix extraction/quickstart.ipynb example code (#20397 ) - Description: The pydantic schema fields are supposed to be optional but the use of `...` makes them required. This causes a `ValidationError` when running the example code. I replaced `...` with `default=None` to make the fields optional as intended. I also standardized the format for all fields. - Issue: n/a - Dependencies: none - Twitter handle: https://twitter.com/m_atoms --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-12 19:59:32 +00:00
balloonio	e7b1a44c5b	community[patch]: Invoke callback prior to yielding token fix for Llamafile (#20365 ) - [x] PR title: community[patch]: Invoke callback prior to yielding token fix for Llamafile - [x] PR message: - Description: Invoke callback prior to yielding token in stream method in community llamafile.py - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None - Twitter handle: @bolun_zhang If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-12 19:26:12 +00:00
milind	1b272fa2f4	Update index.mdx (#20395 ) spelling error fixed Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-12 19:22:08 +00:00
balloonio	93caa568f9	community[patch]: Invoke callback prior to yielding token fix for HuggingFaceEndpoint (#20366 ) - [x] PR title: community[patch]: Invoke callback prior to yielding token fix for HuggingFaceEndpoint - [x] PR message: - Description: Invoke callback prior to yielding token in stream method in community HuggingFaceEndpoint - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None - Twitter handle: @bolun_zhang If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-12 19:16:34 +00:00
Nicolas	ad04585e30	community[minor]: Firecrawl.dev integration (#20364 ) Added the [FireCrawl](https://firecrawl.dev) document loader. Firecrawl crawls and convert any website into LLM-ready data. It crawls all accessible subpages and give you clean markdown for each. - Description: Adds FireCrawl data loader - Dependencies: firecrawl-py - Twitter handle: @mendableai ccing contributors: (@ericciarla @nickscamara) --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-12 19:13:48 +00:00
Tomaz Bratanic	a1b105ac00	experimental[patch]: Skip pydantic validation for llm graph transformer and fix JSON response where possible (#19915 ) LLMs might sometimes return invalid response for LLM graph transformer. Instead of failing due to pydantic validation, we skip it and manually check and optionally fix error where we can, so that more information gets extracted	2024-04-12 11:29:25 -07:00
Erick Friis	20f5cd7c95	docs: langchain-chroma package (#20394 )	2024-04-12 11:17:05 -07:00
Haris Ali	6786fa9186	docs: Adding api documentation link at the end of each output parser class description page. (#20391 ) - Description: Added cross-links for easy access of api documentation of each output parser class from it's description page. - Issue: related to issue #19969 Co-authored-by: Haris Ali <haris.ali@formulatrix.com>	2024-04-12 17:58:18 +00:00
P. Taylor Goetz	9317df7f16	community[patch]: Add "model" attribute to the payload sent to Ollama in `ChatOllama` (#20354 ) Example Ollama API calls: Request without "model": ``` curl --location 'http://localhost:11434/api/chat' \ --header 'Content-Type: application/json' \ --data '{ "messages": [ { "role": "user", "content": "What is the capitol of PA?" } ], "stream": false }' ``` Response: ``` {"error":"model is required"} ``` Request with "model": ``` curl --location 'http://localhost:11434/api/chat' \ --header 'Content-Type: application/json' \ --data '{ "model": "openchat", "messages": [ { "role": "user", "content": "What is the capitol of PA?" } ], "stream": false }' ``` Response: ``` { "eval_duration" : 733248000, "created_at" : "2024-04-11T23:04:08.735766843Z", "model" : "openchat", "message" : { "content" : " The capital city of Pennsylvania is Harrisburg.", "role" : "assistant" }, "total_duration" : 3138731168, "prompt_eval_count" : 25, "load_duration" : 466562959, "done" : true, "prompt_eval_duration" : 1938495000, "eval_count" : 10 } ```	2024-04-12 13:32:53 -04:00
Bagatur	57bb940c17	docs: vertexai tool call update (#20362 )	2024-04-12 10:09:54 -07:00
Alex Sherstinsky	fad0962643	community: for Predibase -- enable both Predibase-hosted and HuggingFace-hosted fine-tuned adapter repositories (#20370 )	2024-04-12 08:32:00 -07:00
ccurme	5395c409cb	docs: add Cohere to ChatModelTabs (#20386 )	2024-04-12 10:35:10 -04:00
Eugene Yurtsev	6470b30173	langchain[patch]: Add deprecation warning to extraction chains (#20224 ) Add deprecation warnings to extraction chains	2024-04-12 10:24:32 -04:00
Eugene Yurtsev	b65a1d4cfd	langchain[patch]: Add another unit test for indexing code (#20387 ) Add another unit test for indexing	2024-04-12 10:19:18 -04:00
Erick Friis	29282371db	core: bind_tools interface on basechatmodel (#20360 )	2024-04-12 01:32:19 +00:00
Erick Friis	e6806a08d4	multiple: standard chat model tests (#20359 )	2024-04-11 18:23:13 -07:00
Bagatur	f78564d75c	docs: show tool msg in tool call docs (#20358 )	2024-04-11 16:42:04 -07:00
Isak Nyberg	bac9fb9a7c	community: add gpt-4 pricing in callback (#20292 ) Added the pricing for `gpt-4-turbo` and `gpt-4-turbo-2024-04-09` in the callback method. related to issue #17173 https://openai.com/pricing#language-models	2024-04-11 18:02:39 -04:00
Ikko Eltociear Ashimine	cb29b42285	docs: Update ibm_watsonx.ipynb (#20329 ) avaliable -> available - Description: fixed typo - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out!	2024-04-11 17:59:23 -04:00
Jack Wotherspoon	204a16addc	docs: add Cloud SQL for MySQL vector store integration docs (#20278 ) Adding docs page for `Google Cloud SQL for MySQL` vector store integration. This was recently released as part of the Cloud SQL for MySQL LangChain package ([release](https://github.com/googleapis/langchain-google-cloud-sql-mysql-python/releases/tag/v0.2.0)) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-11 21:57:46 +00:00
Leonid Ganeline	7cf2d2759d	community[patch]: docstrings update (#20301 ) Added missed docstrings. Format docstings to the consistent form.	2024-04-11 16:23:27 -04:00
Eugene Yurtsev	2900720cd3	core[patch]: Update documentation for base retriever (#20345 ) Updating in code documentation for base retriever to direct folks toward the .invoke and .ainvoke methods + explain how to implement	2024-04-11 16:20:14 -04:00
Bagatur	d2f4153fe6	docs: tool call nits (#20356 )	2024-04-11 12:56:36 -07:00
Bagatur	eafd8c580b	docs: tool agent nit (#20353 )	2024-04-11 19:41:31 +00:00
Erick Friis	ec0273fc92	chroma: release 0.1.0 (#20355 )	2024-04-11 12:39:52 -07:00
Bagatur	a889cd14f3	docs: use vertexai in chat model tabs (#20352 )	2024-04-11 12:34:19 -07:00
Bagatur	9d302c1b57	docs: update anthropic tool call (#20344 )	2024-04-11 11:38:26 -07:00
Erick Friis	da707d0755	chroma: remove relevance score int test (#20346 ) deprecating feature in #20302	2024-04-11 11:29:33 -07:00
Eugene Yurtsev	de938a4451	docs: Update chat model providers include package information (#20336 ) Include package information	2024-04-11 13:29:42 -04:00
Bagatur	56fe4ab382	docs: update tool-calling table (#20338 )	2024-04-11 09:50:20 -07:00
Bagatur	43a98592c1	docs: tool agent nit (#20337 )	2024-04-11 09:43:12 -07:00
Bagatur	562b546bcc	docs: update chat openai (#20331 )	2024-04-11 09:29:46 -07:00
Bagatur	2c4741b5ed	docs: add tool-calling agent (#20328 )	2024-04-11 09:29:40 -07:00
ccurme	f02e55aaf7	docs: add component page for tool calls (#20282 ) Note: includes links to API reference pages for ToolCall and other objects that currently don't exist (e.g., https://api.python.langchain.com/en/latest/messages/langchain_core.messages.tool.ToolCall.html#langchain_core.messages.tool.ToolCall).	2024-04-11 09:29:25 -07:00
Bagatur	6608089030	langchain[patch]: Release 0.1.16 (#20335 )	2024-04-11 09:28:37 -07:00
Eugene Yurtsev	0e74fb4ec1	docs: Update list of chat models tool calling providers (#20330 ) Will follow up with a few missing providers	2024-04-11 12:22:49 -04:00
Eugene Yurtsev	653489a1a9	docs: Update documentation for custom LLMs (#19972 ) Update documentation for customizing LLMs	2024-04-11 12:21:27 -04:00
Bagatur	799714c629	release anthropic, fireworks, openai, groq, mistral (#20333 )	2024-04-11 09:19:52 -07:00
Bagatur	e72330aacc	core[patch]: Release 0.1.42 (#20332 )	2024-04-11 09:10:27 -07:00
ccurme	795c728f71	mistral[patch]: add IDs to tool calls (#20299 ) Mistral gives us one ID per response, no individual IDs for tool calls. ```python from langchain.agents import AgentExecutor, create_tool_calling_agent, tool from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder from langchain_mistralai import ChatMistralAI prompt = ChatPromptTemplate.from_messages( [ ("system", "You are a helpful assistant"), ("human", "{input}"), MessagesPlaceholder("agent_scratchpad"), ] ) model = ChatMistralAI(model="mistral-large-latest", temperature=0) @tool def magic_function(input: int) -> int: """Applies a magic function to an input.""" return input + 2 tools = [magic_function] agent = create_tool_calling_agent(model, tools, prompt) agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True) agent_executor.invoke({"input": "what is the value of magic_function(3)?"}) ``` --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-11 11:09:30 -04:00
Eugene Yurtsev	22fd844e8a	community[patch]: Add deprecation warnings to postgres implementation (#20222 ) Add deprecation warnings to postgres implementation that are in langchain-postgres.	2024-04-11 10:33:22 -04:00
Eugene Yurtsev	f02f708f52	core[patch]: For now remove user warning (#20321 ) Remove warning since it creates a lot of noise.	2024-04-11 10:33:01 -04:00
Mayank Solanki	f709ab4cdf	docs: added backtick on RunnablePassthrough (#20310 ) added backtick on RunnablePassthrough Isuue: #20094	2024-04-11 08:39:10 -04:00
Bagatur	c706689413	openai[patch]: use tool_calls in request (#20272 )	2024-04-11 03:55:52 -07:00
Bagatur	e936fba428	langchain[patch]: agents check prompt partial vars (#20303 )	2024-04-11 03:55:09 -07:00
Bagatur	cb25fa0d55	core[patch]: fix ChatGeneration.text with content blocks (#20294 )	2024-04-10 15:54:06 -07:00
Bagatur	03b247cca1	core[patch]: include tool_calls in ai msg chunk serialization (#20291 )	2024-04-10 22:27:40 +00:00
Erick Friis	0fa551c278	chroma: bump rc, keep optional (#20298 )	2024-04-10 14:22:56 -07:00
Erick Friis	16f8fff14f	chroma: add required fastapi dep to restrict to <1 (#20297 )	2024-04-10 14:16:13 -07:00
Erick Friis	991fd82532	chroma: add optional fastapi dep to restrict to <1 (#20295 )	2024-04-10 12:49:44 -07:00
killind-dev	f8a54d1d73	chroma: Add chroma partner package (#19292 ) Description: Adds chroma to the partners package. Tests & code mirror those in the community package. Dependencies: None Twitter handle: @akiradev0x --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-10 19:33:45 +00:00
Yuki Watanabe	eef19954f3	core[patch]: fix duplicated kwargs in `_load_sql_databse_chain` (#19908 ) `kwargs` is specified twice in [this line](`3218463f6a/libs/langchain/langchain/chains/loading.py (L386)`), causing runtime error when passing any keyword arguments.	2024-04-10 12:20:28 -07:00
ccurme	39471a9c87	docs: update tool calling cookbook (#20290 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-10 15:06:33 -04:00
Nuno Campos	15271ac832	core: mustache prompt templates (#19980 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-10 11:25:32 -07:00
Leonid Ganeline	4cb5f4c353	community[patch]: import flattening fix (#20110 ) This PR should make it easier for linters to do type checking and for IDEs to jump to definition of code. See #20050 as a template for this PR. - As a byproduct: Added 3 missed `test_imports`. - Added missed `SolarChat` in to __init___.py Added it into test_import ut. - Added `# type: ignore` to fix linting. It is not clear, why linting errors appear after ^ changes. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-10 13:01:19 -04:00
Yuki Oshima	12190ad728	openai[patch]: Fix langchain-openai unknown parameter error with gpt-4-turbo (#20271 ) Description: I fixed langchain-openai unknown parameter error with gpt-4-turbo. It seems that the behavior of the Chat Completions API implicitly changed when using the latest gpt-4-turbo model, differing from previous models. It now appears to reject parameters that are not listed in the [API Reference](https://platform.openai.com/docs/api-reference/chat/create). So I found some errors and fixed them. Issue: https://github.com/langchain-ai/langchain/issues/20264 Dependencies: none Twitter handle: https://twitter.com/oshima_123	2024-04-10 09:51:38 -07:00
ccurme	21c1ce0bc1	update agents to use tool call messages (#20074 ) ```python from langchain.agents import AgentExecutor, create_tool_calling_agent, tool from langchain_anthropic import ChatAnthropic from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder prompt = ChatPromptTemplate.from_messages( [ ("system", "You are a helpful assistant"), MessagesPlaceholder("chat_history", optional=True), ("human", "{input}"), MessagesPlaceholder("agent_scratchpad"), ] ) model = ChatAnthropic(model="claude-3-opus-20240229") @tool def magic_function(input: int) -> int: """Applies a magic function to an input.""" return input + 2 tools = [magic_function] agent = create_tool_calling_agent(model, tools, prompt) agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True) agent_executor.invoke({"input": "what is the value of magic_function(3)?"}) ``` ``` > Entering new AgentExecutor chain... Invoking: `magic_function` with `{'input': 3}` responded: [{'text': '<thinking>\nThe user has asked for the value of magic_function applied to the input 3. Looking at the available tools, magic_function is the relevant one to use here, as it takes an integer input and returns an integer output.\n\nThe magic_function has one required parameter:\n- input (integer)\n\nThe user has directly provided the value 3 for the input parameter. Since the required parameter is present, we can proceed with calling the function.\n</thinking>', 'type': 'text'}, {'id': 'toolu_01HsTheJPA5mcipuFDBbJ1CW', 'input': {'input': 3}, 'name': 'magic_function', 'type': 'tool_use'}] 5 Therefore, the value of magic_function(3) is 5. > Finished chain. {'input': 'what is the value of magic_function(3)?', 'output': 'Therefore, the value of magic_function(3) is 5.'} ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-10 11:54:51 -04:00
Erick Friis	9eb6f538f0	infra, multiple: rc release versions (#20252 )	2024-04-09 17:54:58 -07:00
Bagatur	0d0458d1a7	mistralai[patch]: Pre-release 0.1.2-rc.1 (#20251 )	2024-04-10 00:25:38 +00:00
Bagatur	e4046939d0	anthropic[patch]: Pre-release 0.1.8-rc.1 (#20250 )	2024-04-10 00:23:10 +00:00
Bagatur	a8eb0f5b1b	openai[patch]: pre-release 0.1.3-rc.1 (#20249 )	2024-04-10 00:22:08 +00:00
Bagatur	a43b9e4f33	core[patch]: Pre-release 0.1.42-rc.1 (#20248 )	2024-04-09 19:10:38 -05:00
Bagatur	9514bc4d67	core[minor], ...: add tool calls message (#18947 ) core[minor], langchain[patch], openai[minor], anthropic[minor], fireworks[minor], groq[minor], mistralai[minor] ```python class ToolCall(TypedDict): name: str args: Dict[str, Any] id: Optional[str] class InvalidToolCall(TypedDict): name: Optional[str] args: Optional[str] id: Optional[str] error: Optional[str] class ToolCallChunk(TypedDict): name: Optional[str] args: Optional[str] id: Optional[str] index: Optional[int] class AIMessage(BaseMessage): ... tool_calls: List[ToolCall] = [] invalid_tool_calls: List[InvalidToolCall] = [] ... class AIMessageChunk(AIMessage, BaseMessageChunk): ... tool_call_chunks: Optional[List[ToolCallChunk]] = None ... ``` Important considerations: - Parsing logic occurs within different providers; - ~Changing output type is a breaking change for anyone doing explicit type checking;~ - ~Langsmith rendering will need to be updated: https://github.com/langchain-ai/langchainplus/pull/3561~ - ~Langserve will need to be updated~ - Adding chunks: - ~AIMessage + ToolCallsMessage = ToolCallsMessage if either has non-null .tool_calls.~ - Tool call chunks are appended, merging when having equal values of `index`. - additional_kwargs accumulate the normal way. - During streaming: - ~Messages can change types (e.g., from AIMessageChunk to AIToolCallsMessageChunk)~ - Output parsers parse additional_kwargs (during .invoke they read off tool calls). Packages outside of `partners/`: - https://github.com/langchain-ai/langchain-cohere/pull/7 - https://github.com/langchain-ai/langchain-google/pull/123/files --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-09 18:41:42 -05:00
Erick Friis	00552918ac	groq: xfail tool_choice tests (#20247 )	2024-04-09 23:29:59 +00:00
Bagatur	2d83505be9	experimental[patch]: Release 0.0.57 (#20243 )	2024-04-09 17:08:01 -05:00
Bagatur	f06cb59ab9	groq[patch]: Release 0.1.1 (#20242 )	2024-04-09 21:59:58 +00:00
Erick Friis	ad3f1a9e85	docs: fix external repo partner docs (#20238 )	2024-04-09 21:58:04 +00:00
Bagatur	0b2f0307d7	openai[patch]: Release 0.1.2 (#20241 )	2024-04-09 21:55:19 +00:00
Bagatur	4b84c9b28c	anthropic[patch]: Release 0.1.7 (#20240 )	2024-04-09 21:53:16 +00:00
Bagatur	74d04a4e80	mistralai[patch]: Release 0.1.1 (#20239 )	2024-04-09 21:53:01 +00:00
Bagatur	e5913c8758	langchain[patch]: Release 0.1.15 (#20237 )	2024-04-09 21:50:32 +00:00
Bagatur	e39fdfddf1	community[patch]: Release 0.0.32 (#20236 )	2024-04-09 21:37:10 +00:00
Bagatur	a07238d14e	core[patch]: Release 0.1.41 (#20233 )	2024-04-09 21:11:37 +00:00
Chip Davis	806d4ae48f	community[patch]: fixed multithreading returning List[List[Documents]] instead of List[Documents] (#20230 ) Description: When multithreading is set to True and using the DirectoryLoader, there was a bug that caused the return type to be a double nested list. This resulted in other places upstream not being able to utilize the from_documents method as it was no longer a `List[Documents]` it was a `List[List[Documents]]`. The change made was to just loop through the `future.result()` and yield every item. Issue: #20093 Dependencies: N/A Twitter handle: N/A	2024-04-09 17:06:37 -04:00
Sholto Armstrong	230376f183	docs: Fix typo in citations example (#20218 ) Small typo in the citations notebook "ojbects" changed to "objects"	2024-04-09 21:05:33 +00:00
Eugene Yurtsev	fe35e13083	langchain[patch]: Update unit test (#20228 ) This unit test fails likely validation by the openai client. Newer openai library seems to be doing more validation so the existing test fails since http_client needs to be of httpx instance	2024-04-09 16:44:23 -04:00
Casper da Costa-Luis	b972f394c8	langchain[patch]: make BooleanOutputParser check words not substrings (#20064 ) - Description: fixes BooleanOutputParser detecting sub-words ("NOW this is likely (YES)" -> `True`, not `AmbiguousError`) - Issue(s): fixes #11408 (follow-up to #17810) - Dependencies: None - GitHub handle: @casperdcl <!-- if unreviewd after a few days, @-mention one of baskaryan, efriis, eyurtsev, hwchase17 --> - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-09 20:43:31 +00:00
seray	add31f46d0	community[patch]: OpenLLM Async Client Fixes and Timeout Parameter (#20007 ) Same changes as this merged [PR](https://github.com/langchain-ai/langchain/pull/17478) (https://github.com/langchain-ai/langchain/pull/17478), but for the async client, as the same issues persist. - Replaced 'responses' attribute of OpenLLM's GenerationOutput schema to 'outputs'. reference: `66de54eae7/openllm-core/src/openllm_core/_schemas.py (L135)` - Added timeout parameter for the async client. --------- Co-authored-by: Seray Arslan <seray.arslan@knime.com>	2024-04-09 16:34:56 -04:00
Erick Friis	37a9e23c05	community: switch to falkordb python client (#20229 )	2024-04-09 20:19:44 +00:00
Christophe Bornet	f43b48aebc	core[minor]: Implement aformat_messages for _StringImageMessagePromptTemplate (#20036 )	2024-04-09 15:59:39 -04:00
Christophe Bornet	19001e6cb9	core[minor]: Implement aformat for FewShotPromptWithTemplates (#20039 )	2024-04-09 15:58:41 -04:00
Erick Friis	855ba46f80	standard-tests: a standard unit and integration test set (#20182 ) just chat models for now	2024-04-09 12:43:00 -07:00
Erick Friis	9b5cae045c	together: release 0.1.0 (#20225 ) Resolved #20217	2024-04-09 12:23:52 -07:00
Eugene Yurtsev	7cfb643a1c	langchain-postgres: Remove remaining README.md file (#20221 ) Repository has moved to langchain-ai/langchain-postgres	2024-04-09 14:02:15 -04:00
Eugene Yurtsev	2fa7266ebb	Remove postgres package (#20207 ) Package moved	2024-04-09 13:51:17 -04:00
Simon Kelly	a682f0d12b	openai[patch]: wrap stream code in context manager blocks (#18013 ) Description: Use the `Stream` context managers in `ChatOpenAi` `stream` and `astream` method. Using the context manager returned by the OpenAI client makes it possible to terminate the stream early since the response connection will be closed when the context manager exists. Issue: #5340 Twitter handle: @snopoke --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-09 17:40:16 +00:00
Shotaro Sano	6c11c8dac6	docs: Add documentation of `ElasticsearchStore.BM25RetrievalStrategy` (#20098 ) This pull request follows up on https://github.com/langchain-ai/langchain/pull/19314 and https://github.com/langchain-ai/langchain-elastic/pull/6, adding documentation for the `ElasticsearchStore.BM25RetrievalStrategy`. Like other retrieval strategies, we are now introducing BM25RetrievalStrategy. ### Background - The `BM25RetrievalStrategy` has been introduced to `langchain-elastic` via the pull request https://github.com/langchain-ai/langchain-elastic/pull/6. - This PR was initially created in the main `langchain` repository but was moved to `langchain-elastic` during the review process due to the migration of the partner package. - The original PR can be found at https://github.com/langchain-ai/langchain/pull/19314. - As [commented](https://github.com/langchain-ai/langchain/pull/19314#issuecomment-2023202401) by @joemcelroy, documenting the new retrieval strategy is part of the requirements for its introduction. Although the `BM25RetrievalStrategy` has been merged into `langchain-elastic`, its documentation is still to be maintained in the main `langchain` repository. Therefore, this pull request adds the documentation portion of `BM25RetrievalStrategy`. The content of the documentation remains the same as that included in the original PR, https://github.com/langchain-ai/langchain/pull/19314. --------- Co-authored-by: Max Jakob <max.jakob@elastic.co>	2024-04-09 12:37:15 -05:00
David Lee	0394c6e126	community[minor]: add allow_dangerous_requests for OpenAPI toolkits (#19493 ) OpenAPI allow_dangerous_requests: community: add allow_dangerous_requests for OpenAPI toolkits Description: a description of the change Due to BaseRequestsTool changes, we need to pass allow_dangerous_requests manually. `b617085af0/libs/community/langchain_community/tools/requests/tool.py (L26-L46)` While OpenAPI toolkits didn't pass it in the arguments. `b617085af0/libs/community/langchain_community/agent_toolkits/openapi/planner.py (L262-L269)` Issue: the issue # it fixes, if applicable https://github.com/langchain-ai/langchain/issues/19440 If not passing allow_dangerous_requests, it won't be able to do requests. Dependencies: any dependencies required for this change Not much --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-09 17:14:02 +00:00
Guangdong Liu	301dc3dfd2	docs: Get rid of ZeroShotAgent and use create_react_agent instead (#20157 ) - Issue: #20122 - @baskaryan, @eyurtsev.	2024-04-09 12:00:29 -05:00
Timothy	0c848a25ad	community[patch]: GCSDirectoryLoader bugfix (#20005 ) - Description: Bug fix. Removed extra line in `GCSDirectoryLoader` to allow catching Exceptions. Now also logs the file path if Exception is raised for easier debugging. - Issue: #20198 Bug since langchain-community==0.0.31 - Dependencies: No change - Twitter handle: timothywong731 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-09 16:57:00 +00:00
jeff kit	ac42e96e4c	community[patch], langchain[minor]: Enhance Tencent Cloud VectorDB, langchain: make Tencent Cloud VectorDB self query retrieve compatible (#19651 ) - make Tencent Cloud VectorDB support metadata filtering. - implement delete function for Tencent Cloud VectorDB. - support both Langchain Embedding model and Tencent Cloud VDB embedding model. - Tencent Cloud VectorDB support filter search keyword, compatible with langchain filtering syntax. - add Tencent Cloud VectorDB TranslationVisitor, now work with self query retriever. - more documentations. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-09 16:50:48 +00:00
Bagatur	1a34c65e01	community[patch]: pass through sql agent kwargs (#19962 ) Fix #19961	2024-04-09 16:47:32 +00:00
Haris Ali	1b480914b4	docs: Fix the class links in openai_tools and openai_functions description in output parser documentations (#20197 ) - Description: In this PR I fixed the links which points to the API docs for classes in OpenAI functions and OpenAI tools section of output parsers. - Issue: It fixed the issue #19969 Co-authored-by: Haris Ali <haris.ali@formulatrix.com>	2024-04-09 16:07:19 +00:00
Guangdong Liu	97d91ec17c	community[patch]: standardize baichuan init args (#20209 ) Related to https://github.com/langchain-ai/langchain/issues/20085 @baskaryan	2024-04-09 11:00:40 -05:00
Piyush Jain	cd7abc495a	community[minor]: add neptune analytics graph (#20047 ) Replacement for PR [#19772](https://github.com/langchain-ai/langchain/pull/19772). --------- Co-authored-by: Dave Bechberger <dbechbe@amazon.com> Co-authored-by: bechbd <bechbd@users.noreply.github.com>	2024-04-09 09:20:59 -05:00
Shuqian	ad9750403b	community[minor]: add bedrock anthropic callback for token usage counting (#19864 ) Description: add bedrock anthropic callback for token usage counting, consulted openai callback. --------- Co-authored-by: Massimiliano Pronesti <massimiliano.pronesti@gmail.com>	2024-04-09 09:18:48 -05:00
Prince Canuma	1f9f4d8742	community[minor]: Add support for MLX models (chat & llm) (#18152 ) Description: This PR adds support for MLX models both chat (i.e., instruct) and llm (i.e., pretrained) types/ Dependencies: mlx, mlx_lm, transformers Twitter handle: @Prince_Canuma --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-09 14:17:07 +00:00
aditya thomas	6baeaf4802	docs: TogetherAI as a drop-in replacement for OpenAI (#19900 ) Description: TogetherAI as a drop-in replacement for OpenAI Issue: None Dependencies: None @baskaryan apropos #20032	2024-04-09 09:12:52 -05:00
Leonid Ganeline	2f8dd1a161	community[patch]: `cross_encoders` flatten namespaces (#20183 ) Issue `langchain_community.cross_encoders` didn't have flattening namespace code in the __init__.py file. Changes: - added code to flattening namespaces (used #20050 as a template) - added ut for a change - added missed `test_imports` for `chat_loaders` and `chat_message_histories` modules	2024-04-08 20:50:23 -04:00
Bagatur	1af7133828	docs: add vertexai to structured output (#20171 )	2024-04-08 16:09:49 -05:00
kaijietti	a812839f0c	community: add request_timeout and max_retries to ChatAnthropic (#19402 ) This PR make `request_timeout` and `max_retries` configurable for ChatAnthropic. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-08 21:04:17 +00:00
Richmond Alake	c769421aa4	cookbook: MongoDB Cookbook for Chat history and semantic cache (#19998 ) Thank you for contributing to LangChain! - [ ] PR title: "community: Add semantic caching and memory using MongoDB" - [ ] PR message: - Description: This PR introduces functionality for adding semantic caching and chat message history using MongoDB in RAG applications. By leveraging the MongoDBCache and MongoDBChatMessageHistory classes, developers can now enhance their retrieval-augmented generation applications with efficient semantic caching mechanisms and persistent conversation histories, improving response times and consistency across chat sessions. - Issue: N/A - Dependencies: Requires `datasets`, `langchain`, `langchain-mongodb`, `langchain-openai`, `pymongo`, and `pandas` for implementation. MongoDB Atlas is used for database services, and the OpenAI API for model access. - Twitter handle: @richmondalake Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-08 20:21:24 +00:00
Erick Friis	391e8f2050	pinecone[patch]: fix core min version (#20177 )	2024-04-08 20:06:59 +00:00
Harry Jiang	1ee208541c	langchain: fix pinecone upsert when async_req is set to False (#19793 ) Issue: When async_req is the default value True, pinecone client return the multiprocessing AsyncResult object. When async_req is set to False, pinecone client return the result directly. `[{'upserted_count': 1}]` . Calling get() method will throw an error in this case.	2024-04-08 12:55:59 -07:00
Alex Sherstinsky	5f563e040a	community: extend Predibase integration to support fine-tuned LLM adapters (#19979 ) - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: Langchain-Predibase integration was failing, because it was not current with the Predibase SDK; in addition, Predibase integration tests were instantiating the Langchain Community `Predibase` class with one required argument (`model`) missing. This change updates the Predibase SDK usage and fixes the integration tests. - Twitter handle: `@alexsherstinsky` - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-08 18:54:29 +00:00
Bagatur	a27d88f12a	anthropic[patch]: standardize init args (#20161 ) Related to #20085	2024-04-08 12:09:06 -05:00
Bagatur	3490d70238	mistralai[patch]: standardize model params (#20163 ) Related to #20085	2024-04-08 11:48:38 -05:00
Bagatur	17182406f3	docs: standardize fireworks params (#20162 ) Related to #20085	2024-04-08 10:57:56 -05:00
Bagatur	5ae0e687b3	docs: use standard openai params (#20160 ) Part of #20085	2024-04-08 10:56:53 -05:00
david02871	e1a24d09c5	community: Add PHP language parser to document_loaders (#19850 ) Description: Added a PHP language parser to document_loaders Issue: N/A Dependencies: N/A Twitter handle: N/A --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-08 11:30:28 -04:00
Marlene	2f03bc397e	Community: Updating Azure Retriever and Docs to be Azure AI Search instead of Azure Cognitive Search (#19925 ) Last year Microsoft [changed the name](https://learn.microsoft.com/en-us/azure/search/search-what-is-azure-search) of Azure Cognitive Search to Azure AI Search. This PR updates the Langchain Azure Retriever API and it's associated docs to reflect this change. It may be confusing for users to see the name Cognitive here and AI in the Microsoft documentation which is why this is needed. I've also added a more detailed example to the Azure retriever doc page. There are more places that need a similar update but I'm breaking it up so the PRs are not too big 😄 Fixing my errors from the previous PR. Twitter: @marlene_zw Two new tests added to test backward compatibility in `libs/community/tests/integration_tests/retrievers/test_azure_cognitive_search.py` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-08 11:12:41 -04:00
Rahul Triptahi	820b713086	community[minor]: Add support for Pebblo cloud_api_key in PebbloSafeLoader (#19855 ) Description: _PebbloSafeLoader_: Add support for pebblo's cloud api-key in PebbloSafeLoader - This Pull request enables PebbloSafeLoader to accept pebblo's cloud api-key and send the semantic classification data to pebblo cloud. Documentation: Updated Unit test: Added Issue: NA Dependencies: - None Twitter handle: @rahul_tripathi2 Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-08 11:10:04 -04:00
Eugene Yurtsev	34a24d4df6	postgres[minor]: Add pgvector community as is (#20096 ) This moves langchain pgvector community as is The only modification is support for psycopg3 rather than psycopg2!	2024-04-08 09:34:10 -04:00
Eugene Yurtsev	ba9e0d76c1	postgres[minor]: add postgres checkpoint implementation (#20025 ) Adds checkpoint implementation using psycopg	2024-04-08 09:27:15 -04:00
William FH	039b7a472d	[core] fix: manually specifying run_id for chat models.invoke() and .ainvoke() (#20082 )	2024-04-06 16:57:32 -07:00
Chris Germann	ba602dc562	Documentation: Fixed the typo of Discord -> Telegram (#20008 ) Description: Just fixed one string Issues: None Dependencies: None Twitter handle: @epu9byj Co-authored-by: gere <gere@kapo.zh.ch>	2024-04-06 20:00:03 +00:00
Erick Friis	96dc0ea49d	pinecone[patch]: release 0.1.0 (#20109 )	2024-04-06 18:41:28 +00:00
donbr	de496062b3	templates: migrate to langchain_anthropic package to support Claude 3 models (#19393 ) - Description: update langchain anthropic templates to support Claude 3 (iterative search, chain of note, summarization, and XML response) - Issue: issue # N/A. Stability issues and errors encountered when trying to use older langchain and anthropic libraries. - Dependencies: - langchain_anthropic version 0.1.4\ - anthropic package version in the range ">=0.17.0,<1" to support langchain_anthropic. - Twitter handle: @d_w_b7 - [ x]Add tests and docs: If you're adding a new integration, please include 1. used instructions in the README for testing - [ x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-06 00:33:59 +00:00
Maxime Perrin	5ac0d1f67b	partners[anthropic]: fix anthropic chat model message type lookup keys (#19034 ) - Description: Fixing message formatting issue in ChatAnthropic model by adding dictionary keys for `AIMessageChunk `and `HumanMessageChunk` - Issue: #19025 - Twitter handle: @maximeperrin_ Co-authored-by: Maxime Perrin <mperrin@doing.fr> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-06 00:22:14 +00:00
Krista Pratico	d64bd32b20	templates: add rag azure search template (#18143 ) - Description: Adds a template for performing RAG with the AzureSearch vectorstore. - Issue: N/A - Dependencies: N/A - Twitter handle: N/A --------- Co-authored-by: Erick Friis <erickfriis@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-06 00:20:40 +00:00
Bagatur	46f580d42d	docs: anthropic tool docstring (#20091 )	2024-04-05 21:50:40 +00:00
Erick Friis	28dfde2cb2	cohere: move package to external repo (#20081 )	2024-04-05 14:29:15 -07:00
Jacob Lee	58a2123ca0	docs[patch]: Add missing redirects (#20076 )	2024-04-05 12:54:00 -07:00
Eugene Yurtsev	520ff50adc	community[patch]: Improve import callbacks to make it IDE friendly (#20050 ) * declares __all__ as a list of strings (instead of dynamically computing it) * import type definitions when TYPE_CHECKING is true	2024-04-05 15:17:51 -04:00
Guangdong Liu	5a76087965	langchain-core[minor]: Allow passing local cache to language models (#19331 ) After this PR it will be possible to pass a cache instance directly to a language model. This is useful to allow different language models to use different caches if needed. - Issue: close #19276 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-05 11:19:54 -04:00
Eugene Yurtsev	e4fc0e7502	core[patch]: Document BaseCache abstraction in code (#20046 ) Document the base cache abstraction in the cache.	2024-04-05 10:56:57 -04:00
Christophe Bornet	4d8a6a27a3	core[minor]: Implement aformat_prompt and ainvoke in BasePromptTemplate (#20035 )	2024-04-05 10:36:43 -04:00
Christophe Bornet	7e5c1905b1	core[minor]: Add async aformat_document method (#20037 )	2024-04-05 10:29:53 -04:00
Christophe Bornet	927793d088	Merge pull request #20038 * Implement aformat_messages for ChatMessagePromptTemplate	2024-04-05 10:25:27 -04:00
Erick Friis	ebd24bb5d6	docs: fix title cap (#20048 )	2024-04-05 02:36:33 +00:00
Eugene Yurtsev	1ee8cf7b20	Docs: Update custom chat model (#19967 ) * Clean up in the existing tutorial * Add model_name to identifying params * Add table to summarize messages	2024-04-04 22:36:03 -04:00
Erick Friis	5fc7bb01e9	docs: weaviate docs (#20042 )	2024-04-04 19:01:02 -07:00
Bagatur	38fb1429fe	docs: fix together model tab (#20032 )	2024-04-04 15:33:43 -07:00
Jacob Lee	b69af26717	docs[patch]: Fix Model I/O quickstart (#20031 ) @baskaryan	2024-04-04 15:28:58 -07:00
Usama Ahmed	94ac42c573	docs: fixing typo in argument name (#20028 ) it's "mode" instead of "model", I fixed it	2024-04-04 22:28:28 +00:00
Bagatur	07eeeb84f3	docs: hide experimental anthropic (#20030 )	2024-04-04 15:27:52 -07:00
Lance Martin	e76b9210dd	Update example cookbook for Anthropic tool use (#20029 )	2024-04-04 14:53:18 -07:00
Leonid Ganeline	3856dedff4	docs: `integrations/providers` update 9 (#19941 ) - Added missed providers - Added links, descriptions in related examples - Formatted in a consistent format Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-04 21:37:48 +00:00
Bagatur	644ff46100	docs: mark anthropic tools wrapper as deprecated (#20024 )	2024-04-04 21:33:55 +00:00
Leonid Ganeline	69bf6262aa	docs: `integrations/providers/unstructured` update (#19892 ) Updated a page with existing document loaders with links to examples. Fixed formatting of one example. Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-04 21:31:27 +00:00
Bagatur	1b7ed6071a	anthropic[patch]: Release 0.1.6 (#20026 )	2024-04-04 14:29:50 -07:00
Bagatur	6860450e48	anthropic[patch]: use anthropic 0.23 (#20022 )	2024-04-04 14:23:53 -07:00
Leonid Ganeline	4c969286fe	docs `integrations/providers` update 10 (#19970 ) Fixed broken links. Formatted to get consistent forms. Added missed imports in the example code	2024-04-04 14:22:45 -07:00
Leonid Ganeline	82f0198be2	docs: `graphs` update (#19675 ) Issue: The `graph` code was moved into the `community` package a long ago. But the related documentation is still in the [use_cases](https://python.langchain.com/docs/use_cases/graph/integrations/diffbot_graphtransformer) section and not in the `integrations`. Changes: - moved the `use_cases/graph/integrations` notebooks into the `integrations/graphs` - renamed files and changed titles to follow the consistent format - redirected old page URLs to new URLs in `vercel.json` and in several other pages - added descriptions and links when necessary - formatted into the consistent format	2024-04-04 14:13:22 -07:00
Bagatur	be3dd62de4	anthropic[patch]: fix experimental tests (#20021 )	2024-04-04 13:37:43 -07:00
Lance Martin	a6926772f0	Add cookbook for Anthropic .with_structured_output() (#20017 )	2024-04-04 13:30:44 -07:00
Bagatur	86fdb79454	anthropic[patch]: bump core dep (#20019 ) ]	2024-04-04 13:28:23 -07:00
Bagatur	209de0a561	anthropic[minor]: tool use (#20016 )	2024-04-04 13:22:48 -07:00
Leonid Ganeline	3aacd11846	community[minor]: added missed class to __all__ (#19888 ) Added missed `UnstructuredCHMLoader` class to the document_loader.\_\_init\_\_.py \_\_all\_\_	2024-04-04 16:16:51 -04:00
Jacob Lee	7f0cb3bfba	docs[patch]: Make Docusaurus and Vercel add trailing slashes when navigating by default (#20014 ) Should hopefully avoid weird broken link edge cases. Relative links now trip up the Docusaurus broken link checker, so this PR also removes them. Also snuck in a small addition about asyncio	2024-04-04 12:49:15 -07:00
Chris Papademetrious	a954dedb77	langchain[minor]: enhance `LocalFileStore` to allow directory/file permissions to be specified (#18857 ) Description: The `LocalFileStore` class can be used to create an on-disk `CacheBackedEmbeddings` cache. However, the default `umask` settings gives file/directory write permissions only to the original user. Once the cache directory is created by the first user, other users cannot write their own cache entries into the directory. To make the cache usable by multiple users, this pull request updates the `LocalFileStore` constructor to allow the permissions for newly created directories and files to be specified. The specified permissions override the default `umask` values. For example, when configured as follows: ```python file_store = LocalFileStore(temp_dir, chmod_dir=0o770, chmod_file=0o660) ``` then "user" and "group" (but not "other") have permissions to access the store, which means: * Anyone in our group could contribute embeddings to the cache. * If we implement cache cleanup/eviction in the future, anyone in our group could perform the cleanup. The default values for the `chmod_dir` and `chmod_file` parameters is `None`, which retains the original behavior of using the default `umask` settings. Issue: Implements enhancement #18075. Testing: I updated the `LocalFileStore` unit tests to test the permissions. --------- Signed-off-by: chrispy <chrispy@synopsys.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-04 16:40:16 +00:00
Tomaz Bratanic	df25829f33	community[minor]: Add metadata filtering support for neo4j vector (#20001 )	2024-04-04 11:37:06 -04:00
Ben Mitchell	b52b78478f	community[minor]: Implement Async OpenSearch `afrom_texts` & `afrom_embeddings` (#20009 ) - Description: Adds async variants of afrom_texts and afrom_embeddings into `OpenSearchVectorSearch`, which allows for `afrom_documents` to be called. - Issue: I implemented this because my use case involves an async scraper generating documents as and when they're ready to be ingested by Embedding/OpenSearch - Dependencies: None that I'm aware Co-authored-by: Ben Mitchell <b.mitchell@reply.com>	2024-04-04 15:36:14 +00:00
Christophe Bornet	02152d3909	[docs][minor]: Fix typo in Custom Document Loader doc (#20003 )	2024-04-04 10:59:33 -04:00
Jan Nissen	31e3ecc728	core[minor]: support pydantic V2 for JSONOutputParser, allow for other sources of JSON schemas (#19716 ) This PR supports using Pydantic v2 objects to generate the schema for the JSONOutputParser (#19441). This also adds a `json_schema` parameter to allow users to pass any JSON schema to validate with, not just pydantic.	2024-04-04 10:57:47 -04:00
Christophe Bornet	f97de4e275	core[minor]: Add aformat to FewShotPromptTemplate (#19652 )	2024-04-04 10:24:55 -04:00
Utkarsha Gupte	b27f81c51c	core[patch]: mypy ignore fixes #17048 (#19931 ) core/langchain_core/_api[Patch]: mypy ignore fixes #17048 Related to #17048 Applied mypy fixes to below two files: libs/core/langchain_core/_api/deprecation.py libs/core/langchain_core/_api/beta_decorator.py Summary of Fixes: Issue 1 class _deprecated_property(type(obj)): # type: ignore error: Unsupported dynamic base class "type" [misc] Fix: 1. Added an __init__ method to _deprecated_property to initialize the fget, fset, fdel, and __doc__ attributes. 2. In the __get__, __set__, and __delete__ methods, we now use the self.fget, self.fset, and self.fdel attributes to call the original methods after emitting the warning.  3. The finalize function now creates an instance of _deprecated_property with the fget, fset, fdel, and doc attributes from the original obj property.   Issue 2     def finalize( # type: ignore wrapper: Callable[..., Any], new_doc: str ) -> T:   error: All conditional function variants must have identical signatures    Fix: Ensured that both definitions of the finalize function have the same signature Twitter Handle - https://x.com/gupteutkarsha?s=11&t=uwHe4C3PPpGRvoO5Qpm1aA	2024-04-04 10:22:38 -04:00
harry-cohere	e103492eb8	cohere: Add citations to agent, flexibility to tool parsing, fix SDK issue (#19965 ) Description: Citations are the main addition in this PR. We now emit them from the multihop agent! Additionally the agent is now more flexible with observations (`Any` is now accepted), and the Cohere SDK version is bumped to fix an issue with the most recent version of pydantic v1 (1.10.15)	2024-04-04 07:02:30 -07:00
Jacob Lee	605c3f23e1	docs: reorg and visual refresh (#19765 ) - put use cases in main sidebar - move modules to own sidebar, rename components - cleanup lcel section - cleanup guides - update font, cell highlighting --------- Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-04 00:58:36 -07:00
Erick Friis	51bdfe04e9	groq: handle streaming tool call case (#19978 )	2024-04-03 15:22:59 -07:00
Erick Friis	5acb564d6f	groq: fix core version (#19976 )	2024-04-03 14:49:57 -07:00
Erick Friis	9e60159043	groq: release 0.1.0 (#19975 )	2024-04-03 14:41:48 -07:00
Graden Rea	88cf8a2905	groq: Add tool calling support (#19971 ) Description: Add with_structured_output to groq chat models Issue: Dependencies: N/A Twitter handle: N/A	2024-04-03 14:40:20 -07:00
Eugene Yurtsev	6f20f140ca	cli[minor]: Add disable sockets in unit tests (#19877 )	2024-04-03 17:17:50 -04:00
Eugene Yurtsev	ea276d6547	docs: Custom Document Loaders (#19935 ) Add information that shows how to create custom document loaders	2024-04-03 15:34:01 -04:00
Erick Friis	83f62fdacf	core: fix try_load_from_hub for older langchain versions load_chain (#19964 )	2024-04-03 17:00:25 +00:00
Tomaz Bratanic	09a0ecd000	langchain[minor]: Tests update metadata filtering examples of documents (#19963 ) Removing metadata properties that are dicts as some databases don't support that, and those properties aren't used in tests anyhow..	2024-04-03 12:44:14 -04:00
happy-go-lucky	c6432abdbe	community[patch]: Implement delete method and all async methods in opensearch_vector_search (#17321 ) - Description: In order to use index and aindex in libs/langchain/langchain/indexes/_api.py, I implemented delete method and all async methods in opensearch_vector_search - Dependencies: No changes	2024-04-03 09:40:49 -07:00
Cheng, Penghui	cc407e8a1b	community[minor]: weight only quantization with intel-extension-for-transformers. (#14504 ) Support weight only quantization with intel-extension-for-transformers. [Intel® Extension for Transformers](https://github.com/intel/intel-extension-for-transformers) is an innovative toolkit to accelerate Transformer-based models on Intel platforms, in particular effective on 4th Intel Xeon Scalable processor [Sapphire Rapids](https://www.intel.com/content/www/us/en/products/docs/processors/xeon-accelerated/4th-gen-xeon-scalable-processors.html) (codenamed Sapphire Rapids). The toolkit provides the below key features: * Seamless user experience of model compressions on Transformer-based models by extending [Hugging Face transformers](https://github.com/huggingface/transformers) APIs and leveraging [Intel® Neural Compressor](https://github.com/intel/neural-compressor) * Advanced software optimizations and unique compression-aware runtime. * Optimized Transformer-based model packages. * [NeuralChat](https://github.com/intel/intel-extension-for-transformers/blob/main/intel_extension_for_transformers/neural_chat), a customizable chatbot framework to create your own chatbot within minutes by leveraging a rich set of plugins and SOTA optimizations. * [Inference](https://github.com/intel/intel-extension-for-transformers/blob/main/intel_extension_for_transformers/llm/runtime/graph) of Large Language Model (LLM) in pure C/C++ with weight-only quantization kernels. This PR is an integration of weight only quantization feature with intel-extension-for-transformers. Unit test is in lib/langchain/tests/integration_tests/llm/test_weight_only_quantization.py The notebook is in docs/docs/integrations/llms/weight_only_quantization.ipynb. The document is in docs/docs/integrations/providers/weight_only_quantization.mdx. --------- Signed-off-by: Cheng, Penghui <penghui.cheng@intel.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-03 16:21:34 +00:00
Eugene Yurtsev	d6d843ec24	langchain-postgres: Initial package with postgres chat history implementation (#19884 ) - [x] Add in code examples for the chat message history class - [ ] ~Add docs with notebook examples~ (can this be done later?) - [x] Update README.md	2024-04-03 10:57:21 -04:00
Eugene Yurtsev	d293431e10	core[minor]: Add aload to document loader (#19936 ) Add aload to document loader	2024-04-03 10:46:47 -04:00
Ángel Igareta	31a641a155	core: fix return of draw_mermaid_png and change to not save image by default (#19950 ) - Description: Improvement for #19599: fixing missing return of graph.draw_mermaid_png and improve it to make the saving of the rendered image optional Co-authored-by: Angel Igareta <angel.igareta@klarna.com>	2024-04-03 06:20:35 -07:00
Bagatur	4328c54aab	core[patch]: Release 0.1.39 (#19940 )	2024-04-03 00:25:56 +00:00
Nuno Campos	f4568fe0c6	core: BaseChatModel modify chat message before passing to run_manager (#19939 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-02 16:40:27 -07:00
aditya thomas	73ebe78249	docs: update cohere documentation (#19700 ) Description: Update of Cohere documentation (main provider page) Issue: After addition of the Cohere partner package, the documentation was out of date Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-02 18:16:48 -04:00
Leonid Kuligin	eb0521064e	deprecating integrations moved to langchain_google_community (#19841 ) Thank you for contributing to LangChain! - [ ] PR title: "community: deprecating integrations moved to langchain_google_community" - [ ] PR message: deprecating integrations moved to langchain_google_community --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-04-02 17:06:07 -04:00
Erick Friis	f0d5b59962	core[patch]: remove requests (#19891 ) Removes required usage of `requests` from `langchain-core`, all of which has been deprecated. - removes Tracer V1 implementations - removes old `try_load_from_hub` github-based hub implementations Removal done in a way where imports will still succeed, and usage will fail with a `RuntimeError`.	2024-04-02 20:28:10 +00:00
Erick Friis	d5a2ff58e9	pinecone[patch]: source tag (#19739 )	2024-04-02 19:53:59 +00:00
Wang Guan	8638029a37	docs: mention caveats with CacheBackedEmbeddings.embed_query (#19926 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: - Description: mention not-caching methods in CacheBackedEmbeddings - Issue: n/a I almost created one until I read the code - Dependencies: n/a - Twitter handle: `tarsylia` - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-02 19:19:29 +00:00
harry-cohere	beab9adffb	cohere: Improve integration test stability, fix documents bug (#19929 ) Description: Improves the stability of all Cohere partner package integration tests. Fixes a bug with document parsing (both dicts and Documents are handled).	2024-04-02 11:22:30 -07:00
harry-cohere	37fc1c525a	cohere: simplify integration test (#19928 ) Description: This PR simplifies an integration test within the Cohere partner package: * It no longer relies on exact model answers * It no longer relies on a third party tool	2024-04-02 10:57:25 -07:00
billytrend-cohere	de6c0cf248	cohere, docs: update imports and installs to langchain_cohere (#19918 ) cohere: update imports and installs to langchain_cohere --------- Co-authored-by: Harry M <127103098+harry-cohere@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-02 09:47:58 -07:00
Erick Friis	146d1a6347	cohere[patch]: release 0.1.0rc2 (#19924 )	2024-04-02 16:24:23 +00:00
harry-cohere	e2b83c87b1	cohere[patch]: Add multihop tool agent (#19919 ) Description: Adds an agent that uses Cohere with multiple hops and multiple tools. This PR is a continuation of https://github.com/langchain-ai/langchain/pull/19650 - which was previously approved. Conceptually nothing has changed, but this PR has extra fixes, documentation and testing. --------- Co-authored-by: BeatrixCohere <128378696+BeatrixCohere@users.noreply.github.com> Co-authored-by: Erick Friis <erickfriis@gmail.com>	2024-04-02 09:18:50 -07:00
Max Jakob	22dbcc9441	langchain[patch]: fix ElasticsearchStore reference for self query (#19907 ) Initializing self query with an ElasticsearchStore from the partners packages failed previously, see https://github.com/langchain-ai/langchain/discussions/18976.	2024-04-02 08:39:12 -07:00
Bagatur	3218463f6a	core[patch]: Release 0.1.38 (#19895 )	2024-04-01 22:47:46 -07:00
Mohammad Mohtashim	9ae2df36fc	Core[major]: Base Tracer to propagate raw output from tool for on_tool_end (#18932 ) This PR completes work for PR #18798 to expose raw tool output in on_tool_end. Affected APIs: * astream_log * astream_events * callbacks sent to langsmith via langsmith-sdk * Any other code that relies on BaseTracer! --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-02 01:24:46 +00:00
Nuno Campos	2ae6dcdf01	core: Assign missing message ids in BaseChatModel (#19863 ) - This ensures ids are stable across streamed chunks - Multiple messages in batch call get separate ids - Also fix ids being dropped when combining message chunks Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-02 01:18:36 +00:00
Peter Vandenabeele	e830a4e731	community[patch]: Add remove_comments option (default True): do not extract html comments (#13259 ) - Description: add `remove_comments` option (default: True): do not extract html _comments_, - Issue: None, - Dependencies: None, - Tag maintainer: @nfcampos , - Twitter handle: peter_v I ran `make format`, `make lint` and `make test`. Discussion: I my use case, I prefer to not have the comments in the extracted text: * e.g. from a Google tag that is added in the html as comment * e.g. content that the authors have temporarily hidden to make it non visible to the regular reader Removing the comments makes the extracted text more alike the intended text to be seen by the reader. Choice to make: do we prefer to make the default for this `remove_comments` option to be True or False? I have changed it to True in a second commit, since that is how I would prefer to use it by default. Have the cleaned text (without technical Google tags etc.) and also closer to the actually visible and intended content. I am not sure what is best aligned with the conventions of langchain in general ... INITIAL VERSION (new version above): ~Choice to make: do we prefer to make the default for this `ignore_comments` option to be True or False? I have set it to False now to be backwards compatible. On the other hand, I would use it mostly with True. I am not sure what is best aligned with the conventions of langchain in general ...~ --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-02 00:19:12 +00:00
Jamsheed Mistri	4f70bc119d	community[minor]: add Layerup Security integration (#19787 ) Description: adds integration with [Layerup Security](https://uselayerup.com). Docs can be found [here](https://docs.uselayerup.com). Integrates directly with our Python SDK. Dependencies: [LayerupSecurity](https://pypi.org/project/LayerupSecurity/) Note: all methods for our product require a paid API key, so I only included 1 test which checks for an invalid API key response. I have tested extensively locally. Twitter handle: [@layerup_](https://twitter.com/layerup_) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-01 23:49:00 +00:00
Brace Sproul	22f78c37c8	docs[patch]: Hide google from function calling docs (#19887 )	2024-04-01 14:26:31 -07:00
Massimiliano Pronesti	06dac394a6	cohere[patch]: support request timeout in BaseCohere (#19641 ) As in #19346, this PR exposes `request_timeout` in `BaseCohere`, while `max_retires` is no longer a parameter of the beneath client (`cohere.Client`) and it is already configured in `langchain_cohere.llms.Cohere`. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-01 14:16:32 -07:00
Mayank Solanki	d5c412b0a9	core: Add docs for RunnableConfigurableFields (#19849 ) - [x] docs: core: Add docs for `RunnableConfigurableFields` - Description: Added incode docs for `RunnableConfigurableFields` with example - Issue: #18803 - Dependencies: NA - Twitter handle: NA --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-01 20:40:10 +00:00
Mahdi Setayesh	c28efb878c	text-splitters[minor]: Adding a new section aware splitter to langchain (#16526 ) - Description: the layout of html pages can be variant based on the bootstrap framework or the styles of the pages. So we need to have a splitter to transform the html tags to a proper layout and then split the html content based on the provided list of tags to determine its html sections. We are using BS4 library along with xslt structure to split the html content using an section aware approach. - Dependencies: No new dependencies - Twitter handle: @m_setayesh Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-01 20:32:26 +00:00
Eugene Yurtsev	356a139b0a	cli[minor]: Add __version__ to integration package template (#19876 ) Packages should export __version__	2024-04-01 15:34:38 -04:00
northern-64bit	dfbc10c943	docs: Fix link in Unstructured notebook (#19851 ) Description: This PR fixes the link to the Unstructured documentation in the docs.	2024-04-01 15:26:48 -04:00
Brace Sproul	7538c4de19	docs[patch]: Revert quarto update (#19880 )	2024-04-01 12:11:27 -07:00
Anıl Berk Altuner	4384fa8e49	community[minor]: Add Dria retriever (#17098 ) [Dria](https://dria.co/) is a hub of public RAG models for developers to both contribute and utilize a shared embedding lake. This PR adds a retriever that can retrieve documents from Dria.	2024-04-01 12:04:19 -07:00
Erick Friis	0b0a55192f	robocorp[patch]: fix core min version (#19879 )	2024-04-01 11:34:14 -07:00
Mikko Korpela	3f06cef60c	robocorp[patch]: Fix nested arguments descriptors and tool names (#19707 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: - Description: Fix argument translation from OpenAPI spec to OpenAI function call (and similar) - Issue: OpenGPTs failures with calling Action Server based actions. - Dependencies: None - Twitter handle: mikkorpela - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, ~2. an example notebook showing its use. It lives in `docs/docs/integrations` directory.~ - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-01 11:29:39 -07:00
Ethan Yang	48f84e253e	community[minor]: Add OpenVINO rerank model support (#19791 ) @eaidova @AlexKoff88 Could you help to review, thanks --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-01 18:27:23 +00:00
Erick Friis	4fbdc2a7ee	openai[patch]: remove openai chunk size validation (#19878 )	2024-04-01 18:26:06 +00:00
Chenhui Zhang	a1f3e9f537	community[minor]: Update ChatZhipuAI to support GLM-4 model (#16695 ) Description: Update `ChatZhipuAI` to support the latest `glm-4` model. Issue: N/A Dependencies: httpx, httpx-sse, PyJWT The previous `ChatZhipuAI` implementation requires the `zhipuai` package, and cannot call the latest GLM model. This is because - The old version `zhipuai==1.` doesn't support the latest model. - `zhipuai==2.` requires `pydantic V2`, which is incompatible with 'langchain-community'. This re-implementation invokes the GLM model by sending HTTP requests to [open.bigmodel.cn](https://open.bigmodel.cn/dev/api) via the `httpx` package, and uses the `httpx-sse` package to handle stream events. --------- Co-authored-by: zR <2448370773@qq.com>	2024-04-01 18:11:21 +00:00
Bagatur	d25b5b6f25	community[patch]: Release 0.0.31 (#19873 )	2024-04-01 10:50:22 -07:00
Erick Friis	e3ed6a7c28	ai21[patch]: fix core dep (#19874 )	2024-04-01 10:48:16 -07:00
Nuno Campos	aa5797d908	openai[patch]: Partially Revert Update openai chat model to new base class interface (#19871 ) Partially Reverts langchain-ai/langchain#19729 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-01 10:31:06 -07:00
Erick Friis	be92cf57ca	openai[patch]: fix azure embedding length check (#19870 )	2024-04-01 10:26:15 -07:00
Bagatur	d62e84c4f5	community[patch]: Revert " Fix the bug that Chroma does not specify `e… (#19866 ) …mbedding_function` (#19277)" This reverts commit `7042934b5f`. Fixes #19848	2024-04-01 10:10:44 -07:00
Jacob Lee	f06229bbf1	👥 Update LangChain people data (#19858 ) 👥 Update LangChain people data Co-authored-by: github-actions <github-actions@github.com>	2024-04-01 09:57:31 -07:00
Erick Friis	7376e4dbe9	ai21[patch]: release 0.1.3 (#19867 )	2024-04-01 09:56:23 -07:00
Ángel Igareta	c2ccf22dfd	core: generate mermaid syntax and render visual graph (#19599 ) - Description: Add functionality to generate Mermaid syntax and render flowcharts from graph data. This includes support for custom node colors and edge curve styles, as well as the ability to export the generated graphs to PNG images using either the Mermaid.INK API or Pyppeteer for local rendering. - Dependencies: Optional dependencies are `pyppeteer` if rendering wants to be done using Pypeteer and Javascript code. --------- Co-authored-by: Angel Igareta <angel.igareta@klarna.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-01 08:14:46 -07:00
Ikko Eltociear Ashimine	8711a05a51	Update cross_encoder_reranker.ipynb (#19846 ) HuggingFace -> Hugging Face	2024-04-01 10:49:54 -04:00
Vardhaman	039f314f20	docs: remove unnecessary args from the pip install (#19823 ) Description: An additional `U` argument was added for the instructions to install the pip packages for the MediaWiki Dump Document loader which was leading to error in installing the package. Removing the argument fixed the command to install. Issue: #19820 Dependencies: No dependency change requierd Twitter handle: [@vardhaman722](https://twitter.com/vardhaman722)	2024-04-01 10:47:26 -04:00
Bagatur	003c98e5b4	experimental[patch]: Release 0.0.56 (#19840 )	2024-03-31 22:00:59 -07:00
Bagatur	c4eb841c37	langchain[patch]: Release 0.1.14 (#19839 )	2024-03-31 21:44:01 -07:00
Bagatur	0242bce38c	community[patch]: Release 0.0.30 (#19838 )	2024-03-31 21:26:30 -07:00
Bagatur	08c10bd66a	core[patch]: Release 0.1.37 (#19831 )	2024-03-31 14:50:39 -07:00
Giannis	8cf1d75d08	cohere[patch]: Fix retriever (#19771 ) * Replace `source_documents` with `documents` * Pass `documents` as a named arg vs keyword * Make `parsed_docs` more robust * Fix edge case of doc page_content being `None`	2024-03-31 14:47:03 -07:00
Guangdong Liu	b6ebddbacc	langchain[patch]: Upgrade openai's sdk and solve some interface adaptation problems. #19548 (#19785 ) - #19548 - @baskaryan @eyurtsev PTAL --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-31 21:35:38 +00:00
Yash Mathur	c42ec58578	together[minor]: Update endpoint to non deprecated version (#19649 ) - Updating Together.ai Endpoint: "langchain_together: Updated Deprecated endpoint for partner package" - Description: The inference API of together is deprecates, do replaced with completions and made corresponding changes. - Twitter handle: @dev_yashmathur --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-31 21:21:46 +00:00
hsuyuming	5ab6b39098	community[patch]: add attribution_token within GoogleVertexAISearchRetriever (#18520 ) - Description: Add attribution_token within GoogleVertexAISearchRetriever so user can provide this information to Google support team or product team during debug session. Reference: https://cloud.google.com/generative-ai-app-builder/docs/view-analytics#user-events Attribution tokens. Attribution tokens are unique IDs generated by Vertex AI Search and returned with each search request. Make sure to include that attribution token as UserEvent.attributionToken with any user events resulting from a search. This is needed to identify if a search is served by the API. Only user events with a Google-generated attribution token are used to compute metrics. - Issue: No - Dependencies: No - Twitter handle: abehsu1992626 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-31 13:54:56 -07:00
Kenneth Choe	f98d7f7494	langchain[minor], community[minor]: add CrossEncoderReranker with HuggingFaceCrossEncoder and SagemakerEndpointCrossEncoder (#13687 ) - Description: Support reranking based on cross encoder models available from HuggingFace. - Added `CrossEncoder` schema - Implemented `HuggingFaceCrossEncoder` and `SagemakerEndpointCrossEncoder` - Implemented `CrossEncoderReranker` that performs similar functionality to `CohereRerank` - Added `cross-encoder-reranker.ipynb` to demonstrate how to use it. Please let me know if anything else needs to be done to make it visible on the table-of-contents navigation bar on the left, or on the card list on [retrievers documentation page](https://python.langchain.com/docs/integrations/retrievers). - Issue: N/A - Dependencies: None other than the existing ones. --------- Co-authored-by: Kenny Choe <kchoe@amazon.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-31 20:51:31 +00:00
cxumol	3f7da03dd8	docs: fix a dead link (#19814 ) Description Google Colab returned 404 when trying to click an "Open In Colab" button from document. This PR corrected the link.	2024-03-31 10:28:51 -04:00
aditya thomas	b8271bbc4a	docs: (minor) updates to voyage ai documentation (#19819 ) Description: Updates to Voyage AI documentation Issue: Not Applicable Dependencies: None	2024-03-31 10:27:19 -04:00
Tomaz Bratanic	ed49cca191	templates: Update neo4j templates (#19789 )	2024-03-30 14:40:05 +00:00
aditya thomas	765d6762bc	docs[minor]: include tab info for togetherai (#19796 ) Description: Included information for the TogetherAI tab Issue: The tab for TogetherAI information was not correct Dependencies: None	2024-03-30 09:23:45 -04:00
LunarECL	b7d180a70d	experimental[minor]: Create Closed Captioning Chain for .mp4 videos (#14059 ) Description: Video imagery to text (Closed Captioning) This pull request introduces the VideoCaptioningChain, a tool for automated video captioning. It processes audio and video to generate subtitles and closed captions, merging them into a single SRT output. Issue: https://github.com/langchain-ai/langchain/issues/11770 Dependencies: opencv-python, ffmpeg-python, assemblyai, transformers, pillow, torch, openai Tag maintainer: @baskaryan @hwchase17 Hello!  We are a group of students from the University of Toronto (@LunarECL, @TomSadan, @nicoledroi1, @A2113S) that want to make a contribution to the LangChain community! We have ran make format, make lint and make test locally before submitting the PR. To our knowledge, our changes do not introduce any new errors. Thank you for taking the time to review our PR! --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-30 01:57:53 +00:00
Harrison Chase	56525f2ac1	dont mutate metadata/tags (#19742 )	2024-03-29 17:55:27 -07:00
Kamal Zhang	368e35c3b1	community[patch]: introduce convert_to_secret() to bananadev llm (#14283 ) - Description: Per #12165, this PR add to BananaLLM the function convert_to_secret_str() during environment variable validation. - Issue: #12165 - Tag maintainer: @eyurtsev - Twitter handle: @treewatcha75751 --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-30 00:52:25 +00:00
DrKroll	c4da8d0813	langchain[patch]: load ReadFileTool (#14301 ) --------- Co-authored-by: Dr. Simon Kroll <krolls@fida.de> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-30 00:46:24 +00:00
anshaneel	0884e5de7f	community[minor]: Add Alpha Vantage API Tool (#14332 ) ### Description This implementation adds functionality from the AlphaVantage API, renowned for its comprehensive financial data. The class encapsulates various methods, each dedicated to fetching specific types of financial information from the API. ### Implemented Functions - `search_symbols`: - Searches the AlphaVantage API for financial symbols using the provided keywords. - `_get_market_news_sentiment`: - Retrieves market news sentiment for a specified stock symbol from the AlphaVantage API. - `_get_time_series_daily`: - Fetches daily time series data for a specific symbol from the AlphaVantage API. - `_get_quote_endpoint`: - Obtains the latest price and volume information for a given symbol from the AlphaVantage API. - `_get_time_series_weekly`: - Gathers weekly time series data for a particular symbol from the AlphaVantage API. - `_get_top_gainers_losers`: - Provides details on top gainers, losers, and most actively traded tickers in the US market from the AlphaVantage API. ### Issue: - #11994 ### Dependencies: - 'requests' library for HTTP requests. (import requests) - 'pytest' library for testing. (import pytest) --------- Co-authored-by: Adam Badar <94140103+adam-badar@users.noreply.github.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-30 00:44:01 +00:00
Alex Sherstinsky	a9bc212bf2	community[minor]: fix failing Predibase integration (#19776 ) - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: Langchain-Predibase integration was failing, because it was not current with the Predibase SDK; in addition, Predibase integration tests were instantiating the Langchain Community `Predibase` class with one required argument (`model`) missing. This change updates the Predibase SDK usage and fixes the integration tests. - Twitter handle: `@alexsherstinsky` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-30 00:38:13 +00:00
ethynic	e9caa22d47	community[patch]: Update minimax.py (#14384 ) MiniMaxChat class _generate method shoud return a ChatResult object not str Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 23:57:06 +00:00
Ahmed Moubtahij	f5d4ce840f	langchain[patch]: Simplify ensemble retriever (#14427 ) - Description: code simplification to improve readability and remove unnecessary memory allocations. - Tag maintainer: @baskaryan, @eyurtsev, @hwchase17. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 16:49:49 -07:00
Snehil Kumar	b36f4147b0	docs: Google Drive Loader always set the env var (#14791 ) - Description: Code written by following, the official documentation of [Google Drive Loader](https://python.langchain.com/docs/integrations/document_loaders/google_drive), gives errors. I have opened an issue regarding this. See #14725. This is a pull request for modifying the documentation to use an approach that makes the code work. Basically, the change is that we need to always set the GOOGLE_APPLICATION_CREDENTIALS env var to an emtpy string, rather than only in case of RefreshError. Also, rewrote 2 paragraphs to make the instructions more clear. - Issue: See this related [issue # 14725](https://github.com/langchain-ai/langchain/issues/14725) - Dependencies: NA - Tag maintainer: @baskaryan - Twitter handle: NA Co-authored-by: Snehil <snehil@example.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-29 23:19:37 +00:00
M.Abdulrahman Alnaseer	ba54f1577f	community[minor]: add support for llmsherpa (#19741 ) Thank you for contributing to LangChain! - [x] PR title: "community: added support for llmsherpa library" - [x] Add tests and docs: 1. Integration test: 'docs/docs/integrations/document_loaders/test_llmsherpa.py'. 2. an example notebook: `docs/docs/integrations/document_loaders/llmsherpa.ipynb`. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 16:04:57 -07:00
Naveenkhasyap	a99bd098ac	docs: fix for #16702 and #16703 (#16705 ) - Description: Quickstart Documentation updates for missing dependency installation steps. - Issue: the issue # it prompts users to install required dependency. - Dependencies: no, - Twitter handle: @naveenkashyap_ --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 15:57:51 -07:00
Brace Sproul	6d93a03bef	docs[patch]: Fix or remove broken mdx links (#19777 ) this pr also drops the community added action for checking broken links in mdx. It does not work well for our use case, throwing errors for local paths, plus the rest of the errors our in house solution had.	2024-03-29 15:25:08 -07:00
Bagatur	2f5606a318	mistralai[patch]: correct integration_test (#19774 )	2024-03-29 21:47:35 +00:00
Pierre Véron	ace7b66261	mistralai[patch]: add missing _combine_llm_outputs implementation in ChatMistralAI (#18603 ) # Description Implementing `_combine_llm_outputs` to `ChatMistralAI` to override the default implementation in `BaseChatModel` returning `{}`. The implementation is inspired by the one in `ChatOpenAI` from package `langchain-openai`. # Issue None # Dependencies None # Twitter handle None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 14:43:20 -07:00
lvliang-intel	0175906437	templates: add RAG template for Intel Xeon Scalable Processors (#18424 ) Description: This template utilizes Chroma and TGI (Text Generation Inference) to execute RAG on the Intel Xeon Scalable Processors. It serves as a demonstration for users, illustrating the deployment of the RAG service on the Intel Xeon Scalable Processors and showcasing the resulting performance enhancements. Issue: None Dependencies: The template contains the poetry project requirements to run this template. CPU TGI batching is WIP. Twitter handle: None --------- Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 14:37:32 -07:00
Nuno Campos	d4673a3507	openai[patch]: Update openai chat model to new base class interface (#19729 )	2024-03-29 14:30:28 -07:00
harry-cohere	23fcc14650	cohere[patch]: support kwargs in with_structured_output (#19736 ) Description: We'd like to support passing additional kwargs in `with_structured_output`. I believe this is the accepted approach to enable additional arguments on API calls.	2024-03-29 14:30:14 -07:00
Brace Sproul	ce0a588ae6	docs[minor]: Add chat model tabs to docs pages (#19589 )	2024-03-29 14:23:55 -07:00
BeatrixCohere	bd02b83acd	cohere[patch]: Allow overriding of the base URL in Cohere Client (#19766 ) This PR adds the ability for a user to override the base API url for the Cohere client for embeddings and chat llm.	2024-03-29 14:22:30 -07:00
Nisarg Trivedi	1252ccce6f	text-splitters[minor]: Added Haskell support in langchain.text_splitter module (#16191 ) - Description: Haskell language support added in text_splitter module - Dependencies: No - Twitter handle: @nisargtr If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 20:17:50 +00:00
Hrvoje Milković	b7344e3347	community[minor]: Infobip tool integration (#16805 ) Description: Adding Tool that wraps Infobip API for sending sms or emails and email validation. Dependencies: None, Twitter handle: @hmilkovic Implementation: ``` libs/community/langchain_community/utilities/infobip.py ``` Integration tests: ``` libs/community/tests/integration_tests/utilities/test_infobip.py ``` Example notebook: ``` docs/docs/integrations/tools/infobip.ipynb ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 19:01:27 +00:00
Luka Krapic	727a2ea9f1	community[patch]: history size support for DynamoDBChatMessageHistory (#16794 ) Description: PR adds support for limiting number of messages preserved in a session history for DynamoDBChatMessageHistory --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 18:56:21 +00:00
Dt22	6dbf1a2de0	community[patch]: fix redis input type for index_schema field (#16874 ) ### Subject: Fix Type Misdeclaration for index_schema in redis/base.py I noticed a type misdeclaration for the index_schema column in the redis/base.py file. When following the instructions outlined in [Redis Custom Metadata Indexing](https://python.langchain.com/docs/integrations/vectorstores/redis) to create our own index_schema, it leads to a Pylance type error. <br/> The error message indicates that Dict[str, list[Dict[str, str]]] is incompatible with the type Optional[Union[Dict[str, str], str, os.PathLike]]. ``` index_schema = { "tag": [{"name": "credit_score"}], "text": [{"name": "user"}, {"name": "job"}], "numeric": [{"name": "age"}], } rds, keys = Redis.from_texts_return_keys( texts, embeddings, metadatas=metadata, redis_url="redis://localhost:6379", index_name="users_modified", index_schema=index_schema, ) ``` Therefore, I have created this pull request to rectify the type declaration problem. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 18:55:54 +00:00
morgana	074ad5095f	community[patch]: mmr search for Rockset vectorstore integration (#16908 ) - Description: Adding support for mmr search in the Rockset vectorstore integration. - Issue: N/A - Dependencies: N/A - Twitter handle: `@_morgan_adams_` --------- Co-authored-by: Rockset API Bot <admin@rockset.io> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-29 18:45:22 +00:00
shahrin014	f51e6a35ba	community[patch]: OllamaEmbeddings - Pass headers to post request (#16880 ) ## Feature - Set additional headers in constructor - Headers will be sent in post request This feature is useful if deploying Ollama on a cloud service such as hugging face, which requires authentication tokens to be passed in the request header. ## Tests - Test if header is passed - Test if header is not passed Similar to https://github.com/langchain-ai/langchain/pull/15881 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 18:44:52 +00:00
Lance Martin	e0f137dbe0	docs: Agentic and Self-RAG w/ LangGraph (#16910 ) To do: [ ] Add streaming [ ] Move to LangGraph	2024-03-29 11:11:35 -07:00
Jan Chorowski	b8b42ccbc5	community[minor]: Pathway vectorstore(#14859 ) - Description: Integration with pathway.com data processing pipeline acting as an always updated vectorstore - Issue: not applicable - Dependencies: optional dependency on [`pathway`](https://pypi.org/project/pathway/) - Twitter handle: pathway_com The PR provides and integration with `pathway` to provide an easy to use always updated vector store: ```python import pathway as pw from langchain.embeddings.openai import OpenAIEmbeddings from langchain.text_splitter import CharacterTextSplitter from langchain.vectorstores import PathwayVectorClient, PathwayVectorServer data_sources = [] data_sources.append( pw.io.gdrive.read(object_id="17H4YpBOAKQzEJ93xmC2z170l0bP2npMy", service_user_credentials_file="credentials.json", with_metadata=True)) text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0) embeddings_model = OpenAIEmbeddings(openai_api_key=os.environ["OPENAI_API_KEY"]) vector_server = PathwayVectorServer( *data_sources, embedder=embeddings_model, splitter=text_splitter, ) vector_server.run_server(host="127.0.0.1", port="8765", threaded=True, with_cache=False) client = PathwayVectorClient( host="127.0.0.1", port="8765", ) query = "What is Pathway?" docs = client.similarity_search(query) ``` The `PathwayVectorServer` builds a data processing pipeline which continusly scans documents in a given source connector (google drive, s3, ...) and builds a vector store. The `PathwayVectorClient` implements LangChain's `VectorStore` interface and connects to the server to retrieve documents. --------- Co-authored-by: Mateusz Lewandowski <lewymati@users.noreply.github.com> Co-authored-by: mlewandowski <mlewandowski@MacBook-Pro-mlewandowski.local> Co-authored-by: Berke <berkecanrizai1@gmail.com> Co-authored-by: Adrian Kosowski <adrian@pathway.com> Co-authored-by: mlewandowski <mlewandowski@macbook-pro-mlewandowski.home> Co-authored-by: berkecanrizai <63911408+berkecanrizai@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: mlewandowski <mlewandowski@MBPmlewandowski.ht.home> Co-authored-by: Szymon Dudycz <szymond@pathway.com> Co-authored-by: Szymon Dudycz <szymon.dudycz@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-29 10:50:39 -07:00
ccurme	0dbd5f5012	add script to check imports (#19611 )	2024-03-29 13:30:20 -04:00
Arturs Konfino	2319212d54	community[patch]: avoid executing `toolkit.get_context()` when not necessary (#19762 ) If `prompt` is passed into `create_sql_agent()`, then `toolkit.get_context()` shouldn't be executed against the database unless relevant prompt variables (`table_info` or `table_names`) are present .	2024-03-29 16:42:21 +00:00
高璟琦	ec7a59c96c	community[minor]: Add solar embedding (#19761 ) Solar is a large language model developed by [Upstage](https://upstage.ai/). It's a powerful and purpose-trained LLM. You can visit the embedding service provided by Solar within this pr. You may get SOLAR_API_KEY from https://console.upstage.ai/services/embedding You can refer to more details about accepted llm integration at https://python.langchain.com/docs/integrations/llms/solar.	2024-03-29 09:36:05 -07:00
Tomaz Bratanic	dec00d3050	community[patch]: Add the ability to pass maps to neo4j retrieval query (#19758 ) Makes it easier to flatten complex values to text, so you don't have to use a lot of Cypher to do it.	2024-03-29 08:33:48 -07:00
Robby	f7e8a382cc	community[minor]: add hugging face text-to-speech inference API (#18880 ) Description: I implemented a tool to use Hugging Face text-to-speech inference API. Issue: n/a Dependencies: n/a Twitter handle: No Twitter, but do have [LinkedIn](https://www.linkedin.com/in/robby-horvath/) lol. --------- Co-authored-by: Robby <h0rv@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-29 15:02:29 +00:00
DasDingoCodes	73eb3f8fd9	community[minor]: Implement DirectoryLoader lazy_load function (#19537 ) Thank you for contributing to LangChain! - [x] PR title: "community: Implement DirectoryLoader lazy_load function" - [x] Description: The `lazy_load` function of the `DirectoryLoader` yields each document separately. If the given `loader_cls` of the `DirectoryLoader` also implemented `lazy_load`, it will be used to yield subdocuments of the file. - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access: `libs/community/tests/unit_tests/document_loaders/test_directory_loader.py` 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory: `docs/docs/integrations/document_loaders/directory.ipynb` - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-29 14:46:52 +00:00
Christophe Bornet	6b2b511f68	core[minor]: Add aformat_messages to FewShotChatMessagePromptTemplate and ChatPromptTemplate (#19648 ) Needed since the example selector may use a vector store.	2024-03-29 10:31:32 -04:00
Leonid Ganeline	5f814820f6	docs: providers pinecone fix (#19737 ) Current providers page use link to the old package. - Fixed installation instructions - Added a reference to the Pinecone retriever	2024-03-29 08:30:30 -04:00
Bob Lin	53a74ad12b	docs: use markdown cell instead of code block (#19740 ) I found that the code of async and async batch was divided into two blocks: <img width="823" alt="Screenshot 2024-03-29 at 7 45 59 AM" src="https://github.com/langchain-ai/langchain/assets/10000925/0fa59d29-a692-4309-afb8-2260f03242ec"> so I changed it to unified.	2024-03-29 08:27:48 -04:00
Ekaterina Aidova	4ce36af335	docs: fix link in openvino integration doc (#19749 ) - Description: fix incorrect link in docs - Dependencies: None	2024-03-29 12:24:07 +00:00
Jialei	f7c903e24a	community[minor]: add support for Moonshot llm and chat model (#17100 )	2024-03-29 08:54:23 +00:00
Gustavo Isturiz	824dccf5e2	docs: fixed xml URL on sitemap docs exmaple, issue #17236 (#17304 )	2024-03-29 01:36:54 -07:00
Ethan Yang	7164015135	community[minor]: Add Openvino embedding support (#19632 ) This PR is used to support both HF and BGE embeddings with openvino --------- Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com>	2024-03-29 01:34:51 -07:00
Guangdong Liu	cd55d587c2	langchain[patch]: Upgrade openai's sdk and solve some interface adaptation problems. (#19548 ) - Issue: close #19534	2024-03-29 01:25:17 -07:00
Kirushikesh DB	12861273e1	experimental[patch]: Removed 'SQLResults:' from the LLMResponse in SQLDatabaseChain (#17104 ) Description: When using the SQLDatabaseChain with Llama2-70b LLM and, SQLite database. I was getting `Warning: You can only execute one statement at a time.`. ``` from langchain.sql_database import SQLDatabase from langchain_experimental.sql import SQLDatabaseChain sql_database_path = '/dccstor/mmdataretrieval/mm_dataset/swimming_record/rag_data/swimmingdataset.db' sql_db = get_database(sql_database_path) db_chain = SQLDatabaseChain.from_llm(mistral, sql_db, verbose=True, callbacks = [callback_obj]) db_chain.invoke({ "query": "What is the best time of Lance Larson in men's 100 meter butterfly competition?" }) ``` Error: ``` Warning Traceback (most recent call last) Cell In[31], line 3 1 import langchain 2 langchain.debug=False ----> 3 db_chain.invoke({ 4 "query": "What is the best time of Lance Larson in men's 100 meter butterfly competition?" 5 }) File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/langchain/chains/base.py:162, in Chain.invoke(self, input, config, kwargs) 160 except BaseException as e: 161 run_manager.on_chain_error(e) --> 162 raise e 163 run_manager.on_chain_end(outputs) 164 final_outputs: Dict[str, Any] = self.prep_outputs( 165 inputs, outputs, return_only_outputs 166 ) File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/langchain/chains/base.py:156, in Chain.invoke(self, input, config, kwargs) 149 run_manager = callback_manager.on_chain_start( 150 dumpd(self), 151 inputs, 152 name=run_name, 153 ) 154 try: 155 outputs = ( --> 156 self._call(inputs, run_manager=run_manager) 157 if new_arg_supported 158 else self._call(inputs) 159 ) 160 except BaseException as e: 161 run_manager.on_chain_error(e) File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/langchain_experimental/sql/base.py:198, in SQLDatabaseChain._call(self, inputs, run_manager) 194 except Exception as exc: 195 # Append intermediate steps to exception, to aid in logging and later 196 # improvement of few shot prompt seeds 197 exc.intermediate_steps = intermediate_steps # type: ignore --> 198 raise exc File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/langchain_experimental/sql/base.py:143, in SQLDatabaseChain._call(self, inputs, run_manager) 139 intermediate_steps.append( 140 sql_cmd 141 ) # output: sql generation (no checker) 142 intermediate_steps.append({"sql_cmd": sql_cmd}) # input: sql exec --> 143 result = self.database.run(sql_cmd) 144 intermediate_steps.append(str(result)) # output: sql exec 145 else: File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/langchain_community/utilities/sql_database.py:436, in SQLDatabase.run(self, command, fetch, include_columns) 425 def run( 426 self, 427 command: str, 428 fetch: Literal["all", "one"] = "all", 429 include_columns: bool = False, 430 ) -> str: 431 """Execute a SQL command and return a string representing the results. 432 433 If the statement returns rows, a string of the results is returned. 434 If the statement returns no rows, an empty string is returned. 435 """ --> 436 result = self._execute(command, fetch) 438 res = [ 439 { 440 column: truncate_word(value, length=self._max_string_length) (...) 443 for r in result 444 ] 446 if not include_columns: File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/langchain_community/utilities/sql_database.py:413, in SQLDatabase._execute(self, command, fetch) 410 elif self.dialect == "postgresql": # postgresql 411 connection.exec_driver_sql("SET search_path TO %s", (self._schema,)) --> 413 cursor = connection.execute(text(command)) 414 if cursor.returns_rows: 415 if fetch == "all": File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/sqlalchemy/engine/base.py:1416, in Connection.execute(self, statement, parameters, execution_options) 1414 raise exc.ObjectNotExecutableError(statement) from err 1415 else: -> 1416 return meth( 1417 self, 1418 distilled_parameters, 1419 execution_options or NO_OPTIONS, 1420 ) File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/sqlalchemy/sql/elements.py:516, in ClauseElement._execute_on_connection(self, connection, distilled_params, execution_options) 514 if TYPE_CHECKING: 515 assert isinstance(self, Executable) --> 516 return connection._execute_clauseelement( 517 self, distilled_params, execution_options 518 ) 519 else: 520 raise exc.ObjectNotExecutableError(self) File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/sqlalchemy/engine/base.py:1639, in Connection._execute_clauseelement(self, elem, distilled_parameters, execution_options) 1627 compiled_cache: Optional[CompiledCacheType] = execution_options.get( 1628 "compiled_cache", self.engine._compiled_cache 1629 ) 1631 compiled_sql, extracted_params, cache_hit = elem._compile_w_cache( 1632 dialect=dialect, 1633 compiled_cache=compiled_cache, (...) 1637 linting=self.dialect.compiler_linting \| compiler.WARN_LINTING, 1638 ) -> 1639 ret = self._execute_context( 1640 dialect, 1641 dialect.execution_ctx_cls._init_compiled, 1642 compiled_sql, 1643 distilled_parameters, 1644 execution_options, 1645 compiled_sql, 1646 distilled_parameters, 1647 elem, 1648 extracted_params, 1649 cache_hit=cache_hit, 1650 ) 1651 if has_events: 1652 self.dispatch.after_execute( 1653 self, 1654 elem, (...) 1658 ret, 1659 ) File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/sqlalchemy/engine/base.py:1848, in Connection._execute_context(self, dialect, constructor, statement, parameters, execution_options, args, kw) 1843 return self._exec_insertmany_context( 1844 dialect, 1845 context, 1846 ) 1847 else: -> 1848 return self._exec_single_context( 1849 dialect, context, statement, parameters 1850 ) File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/sqlalchemy/engine/base.py:1988, in Connection._exec_single_context(self, dialect, context, statement, parameters) 1985 result = context._setup_result_proxy() 1987 except BaseException as e: -> 1988 self._handle_dbapi_exception( 1989 e, str_statement, effective_parameters, cursor, context 1990 ) 1992 return result File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/sqlalchemy/engine/base.py:2346, in Connection._handle_dbapi_exception(self, e, statement, parameters, cursor, context, is_sub_exec) 2344 else: 2345 assert exc_info[1] is not None -> 2346 raise exc_info[1].with_traceback(exc_info[2]) 2347 finally: 2348 del self._reentrant_error File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/sqlalchemy/engine/base.py:1969, in Connection._exec_single_context(self, dialect, context, statement, parameters) 1967 break 1968 if not evt_handled: -> 1969 self.dialect.do_execute( 1970 cursor, str_statement, effective_parameters, context 1971 ) 1973 if self._has_events or self.engine._has_events: 1974 self.dispatch.after_cursor_execute( 1975 self, 1976 cursor, (...) 1980 context.executemany, 1981 ) File ~/.conda/envs/guardrails1/lib/python3.9/site-packages/sqlalchemy/engine/default.py:922, in DefaultDialect.do_execute(self, cursor, statement, parameters, context) 921 def do_execute(self, cursor, statement, parameters, context=None): --> 922 cursor.execute(statement, parameters) Warning: You can only execute one statement at a time. ``` Issue:* The Error occurs because when generating the SQLQuery, the llm_input includes the stop character of "\nSQLResult:", so for this user query the LLM generated response is SELECT Time FROM men_butterfly_100m WHERE Swimmer = 'Lance Larson';\nSQLResult: it is required to remove the SQLResult suffix on the llm response before executing it on the database. ``` llm_inputs = { "input": input_text, "top_k": str(self.top_k), "dialect": self.database.dialect, "table_info": table_info, "stop": ["\nSQLResult:"], } sql_cmd = self.llm_chain.predict( callbacks=_run_manager.get_child(), llm_inputs, ).strip() if SQL_RESULT in sql_cmd: sql_cmd = sql_cmd.split(SQL_RESULT)[0].strip() result = self.database.run(sql_cmd) ``` <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle:** we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-29 01:22:35 -07:00
T Cramer	540ebf35a9	community[patch]: Add explicit error message to Bedrock error output. (#17328 ) - Description: Propagate Bedrock errors into Langchain explicitly. Use-case: unset region error is hidden behind 'Could not load credentials...' message - Issue: [17654](https://github.com/langchain-ai/langchain/issues/17654) - Dependencies: None --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-29 03:07:33 +00:00
Marcus Virginia	69bb96c80f	community[patch]: surrealdb handle for empty metadata and allow collection names with complex characters (#17374 ) - Description: Handle for empty metadata and allow collection names with complex characters - Issue: #17057 - Dependencies: `surrealdb` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-29 01:04:27 +00:00
ale-delfino	0df76bee37	core[patch]:: XML parser to cover the case when the xml only contains the root level tag (#17456 ) Description: Fix xml parser to handle strings that only contain the root tag Issue: N/A Dependencies: None Twitter handle: N/A A valid xml text can contain only the root level tag. Example: <body> Some text here </body> The example above is a valid xml string. If parsed with the current implementation the result is {"body": []}. This fix checks if the root level text contains any non-whitespace character and if that's the case it returns {root.tag: root.text}. The result is that the above text is correctly parsed as {"body": "Some text here"} @ale-delfino Thank you for contributing to LangChain! Checklist: - [x] PR title: Please title your PR "package: description", where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: Delete this entire template message and replace it with the following bulleted list - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Pass lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified to check that you're passing lint and testing. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @efriis, @eyurtsev, @hwchase17. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-29 00:55:23 +00:00
kYLe	124ab79c23	community[minor]: Add Anyscale embedding support (#17605 ) Description: Add embedding model support for Anyscale Endpoint Dependencies: openai --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 00:53:53 +00:00
Lance Martin	12843f292f	community[patch]: llama cpp embeddings reset default n_batch (#17594 ) When testing Nomic embeddings -- ``` from langchain_community.embeddings import LlamaCppEmbeddings embd_model_path = "/Users/rlm/Desktop/Code/llama.cpp/models/nomic-embd/nomic-embed-text-v1.Q4_K_S.gguf" embd_lc = LlamaCppEmbeddings(model_path=embd_model_path) embedding_lc = embd_lc.embed_query(query) ``` We were seeing this error for strings > a certain size -- ``` File ~/miniforge3/envs/llama2/lib/python3.9/site-packages/llama_cpp/llama.py:827, in Llama.embed(self, input, normalize, truncate, return_count) 824 s_sizes = [] 826 # add to batch --> 827 self._batch.add_sequence(tokens, len(s_sizes), False) 828 t_batch += n_tokens 829 s_sizes.append(n_tokens) File ~/miniforge3/envs/llama2/lib/python3.9/site-packages/llama_cpp/_internals.py:542, in _LlamaBatch.add_sequence(self, batch, seq_id, logits_all) 540 self.batch.token[j] = batch[i] 541 self.batch.pos[j] = i --> 542 self.batch.seq_id[j][0] = seq_id 543 self.batch.n_seq_id[j] = 1 544 self.batch.logits[j] = logits_all ValueError: NULL pointer access ``` The default `n_batch` of llama-cpp-python's Llama is `512` but we were explicitly setting it to `8`. These need to be set to equal for embedding models. * The embedding.cpp example has an assertion to make sure these are always equal. * Apparently this is not being done properly in llama-cpp-python. With `n_batch` set to 8, if more than 8 tokens are passed the batch runs out of space and it crashes. This also explains why the CPU compute buffer size was small: raw client with default `n_batch=512` ``` llama_new_context_with_model: CPU input buffer size = 3.51 MiB llama_new_context_with_model: CPU compute buffer size = 21.00 MiB ``` langchain with `n_batch=8` ``` llama_new_context_with_model: CPU input buffer size = 0.04 MiB llama_new_context_with_model: CPU compute buffer size = 0.33 MiB ``` We can work around this by passing `n_batch=512`, but this will not be obvious to some users: ``` embedding = LlamaCppEmbeddings(model_path=embd_model_path, n_batch=512) ``` From discussion w/ @cebtenzzre. Related: https://github.com/abetlen/llama-cpp-python/issues/1189 Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 00:47:22 +00:00
Zijian Han	8e976545f3	community[patch]: support OpenAI whisper base url (#17695 ) Description: The base URL for OpenAI is retrieved from the environment variable "OPENAI_BASE_URL", whereas for langchain it is obtained from "OPENAI_API_BASE". By adding `base_url = os.environ.get("OPENAI_API_BASE")`, the OpenAI proxy can execute correctly. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 00:35:27 +00:00
Paulo Nascimento	44a3484503	community[patch]: add NotebookLoader unit test (#17721 ) Thank you for contributing to LangChain! - Description: added unit tests for NotebookLoader. Linked PR: https://github.com/langchain-ai/langchain/pull/17614 - Issue: [#17614](https://github.com/langchain-ai/langchain/pull/17614) - Twitter handle: @paulodoestech - [x] Pass lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified to check that you're passing lint and testing. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: lachiewalker <lachiewalker1@hotmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 00:27:46 +00:00
Paulo Nascimento	4c3a67122f	community[patch]: add Integration for OpenAI image gen with v1 sdk (#17771 ) Description: Created a Langchain Tool for OpenAI DALLE Image Generation. Issue: [#15901](https://github.com/langchain-ai/langchain/issues/15901) Dependencies: n/a Twitter handle: @paulodoestech - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 00:23:14 +00:00
Kaixin Yang	a8104ea8e9	openai[patch]: add checking codes for calling AI model get error (#17909 ) Description:: adding checking codes for calling AI model get error in chat_models/base.py and llms/base.py Issue: Sometimes the AI Model calling will get error, we should raise it. Otherwise, the next code 'choices.extend(response["choices"])' will throw a "TypeError: 'NoneType' object is not iterable" error to mask the true error. Because 'response["choices"]' is None. Dependencies: None --------- Co-authored-by: yangkx <yangkx@asiainfo-int.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-29 00:17:32 +00:00
Vincent Chen	833d61adb3	docs: update Together README.md (#18004 ) ## PR message Description: This PR adds a README file for the Together API in the `libs/partners` folder of this repository. The README includes: - A brief description of the package - Installation instructions and class introductions - Simple usage examples Issue: #17545 This PR only contains document changes. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-29 00:02:32 +00:00
Jiaming	3d3cc71287	community[patch]: fix bugs for bilibili Loader (#18036 ) - Description: 1. Fix the BiliBiliLoader that can receive cookie parameters, it requires 3 other parameters to run. The change is backward compatible. 2. Add test; 3. Add example in docs - Issue: [#14213] Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-28 16:39:38 -07:00
Ethan Knights	1ef3fa0411	docs: improve readability of Langchain Expression Language get_started.ipynb (#18157 ) Description: A few grammatical changes to improve readability of the LCEL .ipynb and tidy some null characters. Issue: N/A Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-28 23:38:30 +00:00
Sachin Paryani	25c9f3d1d1	community[patch]: Support Streaming in Azure Machine Learning (#18246 ) - [x] PR title: "community: Support streaming in Azure ML and few naming changes" - [x] PR message: - Description: Added support for streaming for azureml_endpoint. Also, renamed and AzureMLEndpointApiType.realtime to AzureMLEndpointApiType.dedicated. Also, added new classes CustomOpenAIChatContentFormatter and CustomOpenAIContentFormatter and updated the classes LlamaChatContentFormatter and LlamaContentFormatter to now show a deprecated warning message when instantiated. --------- Co-authored-by: Sachin Paryani <saparan@microsoft.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 23:38:20 +00:00
xiaohuanshu	ecb11a4a32	langchain[patch]: fix BaseChatMemory get output data error with extra key (#18117 ) Description: At times, BaseChatMemory._get_input_output may acquire some extra keys such as 'intermediate_steps' (agent_executor with return_intermediate_steps set to True) and 'messages' (agent_executor.iter with memory). In these instances, _get_input_output can raise an error due to the presence of multiple keys. The 'output' field should be used as the default field in these cases. Issue: #16791	2024-03-28 16:38:08 -07:00
Isaac Francisco	f5e84c8858	docs: fixing markdown for tips (#18199 ) Previous markdown code was not working as intended, new code should add green box around the tip so it is highlighted Co-authored-by: Hershenson, Isaac (Extern) <isaac.hershenson.extern@bayer04.de> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 23:37:37 +00:00
Hayden Wolff	85deee521a	docs: Nvidia Riva Runnables Documentation (#18237 ) - Description: Documents how to use the Riva runnables to add streamed automatic-speech-recognition (ASR) and text-to-speech (TTS) to chains. - Issue: None - Dependencies: None - Twitter handle: @HaydenWolff1 --------- Co-authored-by: Hayden Wolff <hwolff@Haydens-Laptop.local> Co-authored-by: Hayden Wolff <hwolff@MacBook-Pro.local> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 23:35:00 +00:00
Victor Adan	afa2d85405	community[patch]: Added missing from_documents method to KNNRetriever. (#18411 ) - Description: Added missing `from_documents` method to `KNNRetriever`, providing the ability to supply metadata to LangChain `Document`s, and to give it parity to the other retrievers, which do have `from_documents`. - Issue: None - Dependencies: None - Twitter handle: None Co-authored-by: Victor Adan <vadan@netroadshow.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-28 23:18:50 +00:00
Smit Parmar	dfc4177b50	community[patch]: mypy ignore fix (#18483 ) Relates to #17048 Description : Applied fix to dynamodb and elasticsearch file. Error was : `Cannot override writeable attribute with read-only property` Suggestion: instead of adding ``` @messages.setter def messages(self, messages: List[BaseMessage]) -> None: raise NotImplementedError("Use add_messages instead") ``` we can change base class property `messages: List[BaseMessage]` to ``` @property def messages(self) -> List[BaseMessage]:... ``` then we don't need to add `@messages.setter` in all child classes.	2024-03-28 15:36:53 -07:00
aditya thomas	dc9e9a66db	docs: update docstring of the ChatAnthropic and AnthropicLLM classes (#18649 ) Description: Update docstring of the ChatAnthropic and AnthropicLLM classes Issue: Not applicable Dependencies: None	2024-03-28 15:33:54 -07:00
Luca Dorigo	f19229c564	core[patch]: fix beta, deprecated typing (#18877 ) Description: While not technically incorrect, the TypeVar used for the `@beta` decorator prevented pyright (and thus most vscode users) from correctly seeing the types of functions/classes decorated with `@beta`. This is in part due to a small bug in pyright (https://github.com/microsoft/pyright/issues/7448 ) - however, the `Type` bound in the typevar `C = TypeVar("C", Type, Callable)` is not doing anything - classes are `Callables` by default, so by my understanding binding to `Type` does not actually provide any more safety - the modified annotation still works correctly for both functions, properties, and classes. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 22:33:43 +00:00
aditya thomas	263ee78886	core[runnables]: docstring for class RunnableSerializable, method configurable_fields (#19722 ) Description: Update to the docstring for class RunnableSerializable, method configurable_fields Issue: [Add in code documentation to core Runnable methods #18804](https://github.com/langchain-ai/langchain/issues/18804) Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-03-28 18:15:18 -04:00
HuangZiy	e1f10a697e	openai[patch]: perform judgment processing on chat model streaming delta (#18983 ) PR title: partners: openai chat model PR message: perform judgment processing on chat model streaming delta Closes #18977 Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-28 14:46:27 -07:00
wulixuan	b7c8bc8268	community[patch]: fix yuan2 errors in LLMs (#19004 ) 1. fix yuan2 errors while invoke Yuan2. 2. update tests.	2024-03-28 14:37:44 -07:00
Bob Lin	aba4bd0d13	docs: Add async batch case (#19686 )	2024-03-28 14:00:46 -07:00
aditya thomas	ec4dcfca7f	core[runnables]: docstring of class RunnableSerializable, method configurable_alternatives (#19724 ) Description: Update to the docstring for class RunnableSerializable, method configurable_alternatives Issue: [Add in code documentation to core Runnable methods #18804](https://github.com/langchain-ai/langchain/issues/18804) Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-03-28 17:00:08 -04:00
Davide Menini	824dbc49ee	langchain[patch]: add template_tool_response arg to create_json_chat (#19696 ) In this small PR I added the `template_tool_response` arg to the `create_json_chat` function, so that users can customize this prompt in case of need. Thanks for your reviews! --------- Co-authored-by: taamedag <Davide.Menini@swisscom.com>	2024-03-28 13:59:54 -07:00
高远	688ca48019	community[patch]: Adding validation when vector does not exist (#19698 ) Adding validation when vector does not exist Co-authored-by: gaoyuan <gaoyuan.20001218@bytedance.com>	2024-03-28 13:58:23 -07:00
Erick Friis	f55b11fb73	infra: Revert run partner CI on core PRs (#19733 ) Reverts parts of langchain-ai/langchain#19688	2024-03-28 20:45:59 +00:00
Alessandro Rossi	665f15bd48	docs: fix typos and make quickstart more readable (#19712 ) Description: minor docs changes to make it more readable. Issue: N/A Dependencies: N/A Twitter handle: _kubealex	2024-03-28 20:10:32 +00:00
standby24x7	36090c84f2	docs: Update function "run" to "invoke" in llm_symbolic_math.ipynb (#19713 ) This patch updates multiple function "run" to "invoke" in llm_symbolic_math.ipynb. Without this patch, you see following message. The function `run` was deprecated in LangChain 0.1.0 and will be removed in 0.2.0. Use invoke instead. Signed-off-by: Masanari Iida <standby24x7@gmail.com>	2024-03-28 13:08:22 -07:00
Chaunte W. Lacewell	4a49fc5a95	community[patch]: Fix bug in vdms (#19728 ) Description: Fix embedding check in vdms Contribution maintainer: [@cwlacewe](https://github.com/cwlacewe)	2024-03-28 12:54:24 -07:00
高璟琦	75173d31db	community[minor]: Add solar model chat model (#18556 ) Add our solar chat models, available model choices: * solar-1-mini-chat * solar-1-mini-translate-enko * solar-1-mini-translate-koen More documents and pricing can be found at https://console.upstage.ai/services/solar. The references to our solar model can be found at * https://arxiv.org/abs/2402.17032 --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 12:31:11 -07:00
Erick Friis	e576d6c6b4	cohere[patch]: release 0.1.0rc1 (rc1-2 never released) (#19731 )	2024-03-28 19:12:22 +00:00
harry-cohere	ea57050122	cohere: add with_structured_output to ChatCohere (#19730 ) Description: Adds support for `with_structured_output` to Cohere, which supports single function calling. --------- Co-authored-by: BeatrixCohere <128378696+BeatrixCohere@users.noreply.github.com>	2024-03-28 12:09:25 -07:00
Guangdong Liu	0571f886d1	core[patch]: Fix jsonOutputParser fails if a json value contains ``` inside it. (#19717 ) - Issue: fix #19646 - @baskaryan, @eyurtsev PTAL	2024-03-28 12:01:09 -07:00
Davide Menini	f7042321f1	community[patch]: gather token usage info in BedrockChat during generation (#19127 ) This PR allows to calculate token usage for prompts and completion directly in the generation method of BedrockChat. The token usage details are then returned together with the generations, so that other downstream tasks can access them easily. This allows to define a callback for tokens tracking and cost calculation, similarly to what happens with OpenAI (see [OpenAICallbackHandler](https://api.python.langchain.com/en/latest/_modules/langchain_community/callbacks/openai_info.html#OpenAICallbackHandler). I plan on adding a BedrockCallbackHandler later. Right now keeping track of tokens in the callback is already possible, but it requires passing the llm, as done here: https://how.wtf/how-to-count-amazon-bedrock-anthropic-tokens-with-langchain.html. However, I find the approach of this PR cleaner. Thanks for your reviews. FYI @baskaryan, @hwchase17 --------- Co-authored-by: taamedag <Davide.Menini@swisscom.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 18:58:46 +00:00
ligang-super	a662468dde	community[patch]: Fix the error of Baidu Qianfan not passing the stop parameter (#18666 ) - [x] PR title: "community: fix baidu qianfan missing stop parameter" - [x] PR message: - **Description: Baidu Qianfan lost the stop parameter when requesting service due to extracting it from kwargs. This bug can cause the agent to receive incorrect results --------- Co-authored-by: ligang33 <ligang33@baidu.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 18:21:49 +00:00
BeatrixCohere	d1a2e194c3	cohere[patch]: misc fixs tool use agent and cohere chat (#19705 ) Bug fixes in this PR: * allows for other params such as "message" not just the input param to the prompt for the cohere tools agent * fixes to documents kwarg from messages * fixes to tool_calls API call --------- Co-authored-by: Harry M <127103098+harry-cohere@users.noreply.github.com>	2024-03-28 10:19:38 -07:00
ccurme	b35e68c41f	docs: update use_cases/question_answering/chat_history (#19349 ) Update following https://github.com/langchain-ai/langchain/issues/19344	2024-03-28 12:51:01 -04:00
Erick Friis	8c2ed85a45	core[patch], infra: release 0.1.36, run partner CI on core PRs (#19688 )	2024-03-28 08:55:10 -07:00
Erick Friis	5327bc9ec4	elasticsearch[patch]: move to repo (#19620 )	2024-03-28 08:54:57 -07:00
Nilanjan De	239dd7c0c0	langchain[patch]: Use map() and avoid "ValueError: max() arg is an empty sequence" in MergerRetriever (#18679 ) - Issue: When passing an empty list to MergerRetriever it fails with error: ValueError: max() arg is an empty sequence - Description: We have a use case where we dynamically select retrievers and use MergerRetriever for merging the output of the retrievers. We faced this issue when the retriever_docs list is empty. Adding a default 0 for cases when retriever_docs is an empty list to avoid "ValueError: max() arg is an empty sequence". Also, changed to use map() which is more than twice as fast compared to the current implementation. ``` import timeit # Sample retriever_docs with varying lengths of sublists retriever_docs = [[i for i in range(j)] for j in range(1, 1000)] # First code snippet code1 = ''' max_docs = max(len(docs) for docs in retriever_docs) ''' # Second code snippet code2 = ''' max_docs = max(map(len, retriever_docs), default=0) ''' # Benchmarking time1 = timeit.timeit(stmt=code1, globals=globals(), number=10000) time2 = timeit.timeit(stmt=code2, globals=globals(), number=10000) # Output print(f"Execution time for code snippet 1: {time1} seconds") print(f"Execution time for code snippet 2: {time2} seconds") ``` - Dependencies: none	2024-03-27 23:52:57 -07:00
aditya thomas	4cd38fe89f	docs: update docstring of the ChatGroq class (#18645 ) Description: Update docstring of the ChatGroq class Issue: Not applicable Dependencies: None	2024-03-27 23:46:52 -07:00
Jaid	e4d7b1a482	voyageai[patch]: top level reranker import (#19645 ) The previous version didn't had Voyage rerank in the init file - [ ] PR title: langchain_voyageai reranker is not working - [ ] PR message: - Description: This fix let you run reranker from voyage - Issue: Was not able to run reranker from voyage @efriis	2024-03-28 06:37:55 +00:00
Xinwei Xiong	26eed70c11	infra: Optimize Makefile for Better Usability and Maintenance (#18859 ) Previous screenshots： ![image](https://github.com/langchain-ai/langchain/assets/86140903/e2f326e3-4d97-4b22-aacb-e789a9d815e4) Current screenshot： ![image](https://github.com/langchain-ai/langchain/assets/86140903/bd8a3ea7-1b8a-4803-9168-df45f6fa4893)	2024-03-27 23:37:39 -07:00
Juan Jose Miguel Ovalle Villamil	51baa1b5cf	langchain[patch]: fix-cohere-reranker-rerank-method with cohere v5 (#19486 ) #### Description Fixed the following error with `rerank` method from `CohereRerank`: ``` ---> [79](https://vscode-remote+wsl-002bubuntu.vscode-resource.vscode-cdn.net/home/jjmov99/legal-colombia/~/legal-colombia/.venv/lib/python3.11/site-packages/langchain/retrievers/document_compressors/cohere_rerank.py:79) results = self.client.rerank( [80](https://vscode-remote+wsl-002bubuntu.vscode-resource.vscode-cdn.net/home/jjmov99/legal-colombia/~/legal-colombia/.venv/lib/python3.11/site-packages/langchain/retrievers/document_compressors/cohere_rerank.py:80) query, docs, model, top_n=top_n, max_chunks_per_doc=max_chunks_per_doc [81](https://vscode-remote+wsl-002bubuntu.vscode-resource.vscode-cdn.net/home/jjmov99/legal-colombia/~/legal-colombia/.venv/lib/python3.11/site-packages/langchain/retrievers/document_compressors/cohere_rerank.py:81) ) [82](https://vscode-remote+wsl-002bubuntu.vscode-resource.vscode-cdn.net/home/jjmov99/legal-colombia/~/legal-colombia/.venv/lib/python3.11/site-packages/langchain/retrievers/document_compressors/cohere_rerank.py:82) result_dicts = [] [83](https://vscode-remote+wsl-002bubuntu.vscode-resource.vscode-cdn.net/home/jjmov99/legal-colombia/~/legal-colombia/.venv/lib/python3.11/site-packages/langchain/retrievers/document_compressors/cohere_rerank.py:83) for res in results.results: TypeError: BaseCohere.rerank() takes 1 positional argument but 4 positional arguments (and 2 keyword-only arguments) were given ``` This was easily fixed going from this: ``` def rerank( self, documents: Sequence[Union[str, Document, dict]], query: str, , model: Optional[str] = None, top_n: Optional[int] = -1, max_chunks_per_doc: Optional[int] = None, ) -> List[Dict[str, Any]]: ... if len(documents) == 0: # to avoid empty api call return [] docs = [ doc.page_content if isinstance(doc, Document) else doc for doc in documents ] model = model or self.model top_n = top_n if (top_n is None or top_n > 0) else self.top_n results = self.client.rerank( query, docs, model, top_n=top_n, max_chunks_per_doc=max_chunks_per_doc ) result_dicts = [] for res in results: result_dicts.append( {"index": res.index, "relevance_score": res.relevance_score} ) return result_dicts ``` to this: ``` def rerank( self, documents: Sequence[Union[str, Document, dict]], query: str, , model: Optional[str] = None, top_n: Optional[int] = -1, max_chunks_per_doc: Optional[int] = None, ) -> List[Dict[str, Any]]: ... if len(documents) == 0: # to avoid empty api call return [] docs = [ doc.page_content if isinstance(doc, Document) else doc for doc in documents ] model = model or self.model top_n = top_n if (top_n is None or top_n > 0) else self.top_n results = self.client.rerank( query=query, documents=docs, model=model, top_n=top_n, max_chunks_per_doc=max_chunks_per_doc <------------- ) result_dicts = [] for res in results.results: <------------- result_dicts.append( {"index": res.index, "relevance_score": res.relevance_score} ) return result_dicts ``` #### Unit & Integration tests I added a unit test to check the behaviour of `rerank`. Also fixed the original integration test which was failing. #### Format & Linting Everything worked properly with `make lint_diff`, `make format_diff` and `make format`. However I noticed an error coming from other part of the library when doing `make lint`: ``` (langchain-py3.9) ➜ langchain git:(master) make format [ "." = "" ] \|\| poetry run ruff format . 1636 files left unchanged [ "." = "" ] \|\| poetry run ruff --select I --fix . (langchain-py3.9) ➜ langchain git:(master) make lint ./scripts/check_pydantic.sh . ./scripts/lint_imports.sh poetry run ruff . [ "." = "" ] \|\| poetry run ruff format . --diff 1636 files already formatted [ "." = "" ] \|\| poetry run ruff --select I . [ "." = "" ] \|\| mkdir -p .mypy_cache && poetry run mypy . --cache-dir .mypy_cache langchain/agents/openai_assistant/base.py:252: error: Argument "file_ids" to "create" of "Assistants" has incompatible type "Optional[Any]"; expected "Union[list[str], NotGiven]" [arg-type] langchain/agents/openai_assistant/base.py:374: error: Argument "file_ids" to "create" of "AsyncAssistants" has incompatible type "Optional[Any]"; expected "Union[list[str], NotGiven]" [arg-type] Found 2 errors in 1 file (checked 1634 source files) make: *** [Makefile:65: lint] Error 1 ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 06:32:03 +00:00
Shuqian	332996b4b2	openai[patch]: fix ChatOpenAI model's openai proxy (#19559 ) Due to changes in the OpenAI SDK, the previous method of setting the OpenAI proxy in ChatOpenAI no longer works. This PR fixes this issue, making the previous way of setting the OpenAI proxy in ChatOpenAI effective again. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-27 23:16:55 -07:00
Bagatur	b15c7fdde6	anthropic[patch]: fix response metadata type (#19683 )	2024-03-27 23:16:26 -07:00
kaijietti	9c4b6dc979	community[patch]: fix bug in cohere that `async for` a coroutine in ChatCohere (#19381 ) Without `await`, the `stream` returned from the `async_client` is actually a coroutine, which could not be used in `async for`.	2024-03-27 21:34:46 -07:00
Christian Galo	1adaa3c662	community[minor]: Update Azure Cognitive Services to Azure AI Services (#19488 ) This is a follow up to #18371. These are the changes: - New Azure AI Services toolkit and tools to replace those of Azure Cognitive Services. - Updated documentation for Microsoft platform. - The image analysis tool has been rewritten to use the new package `azure-ai-vision-imageanalysis`, doing a proper replacement of `azure-ai-vision`. These changes: - Update outdated naming from "Azure Cognitive Services" to "Azure AI Services". - Update documentation to use non-deprecated methods to create and use agents. - Removes need to depend on yanked python package (`azure-ai-vision`) There is one new dependency that is needed as a replacement to `azure-ai-vision`: - `azure-ai-vision-imageanalysis`. This is optional and declared within a function. There is a new `azure_ai_services.ipynb` notebook showing usage; Changes have been linted and formatted. I am leaving the actions of adding deprecation notices and future removal of Azure Cognitive Services up to the LangChain team, as I am not sure what the current practice around this is. --- If this PR makes it, my handle is @galo@mastodon.social --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-03-28 03:19:02 +00:00
Shengsheng Huang	ac1dd8ad94	community[minor]: migrate `bigdl-llm` to `ipex-llm` (#19518 ) - Description: `bigdl-llm` library has been renamed to [`ipex-llm`](https://github.com/intel-analytics/ipex-llm). This PR migrates the `bigdl-llm` integration to `ipex-llm` . - Issue: N/A. The original PR of `bigdl-llm` is https://github.com/langchain-ai/langchain/pull/17953 - Dependencies: `ipex-llm` library - Contribution maintainer: @shane-huang Updated doc: docs/docs/integrations/llms/ipex_llm.ipynb Updated test: libs/community/tests/integration_tests/llms/test_ipex_llm.py	2024-03-27 20:12:59 -07:00
Chaunte W. Lacewell	a31f692f4e	community[minor]: Add VDMS vectorstore (#19551 ) - Description: Add support for Intel Lab's [Visual Data Management System (VDMS)](https://github.com/IntelLabs/vdms) as a vector store - Dependencies: `vdms` library which requires protobuf = "4.24.2". There is a conflict with dashvector in `langchain` package but conflict is resolved in `community`. - Contribution maintainer: [@cwlacewe](https://github.com/cwlacewe) - Added tests: libs/community/tests/integration_tests/vectorstores/test_vdms.py - Added docs: docs/docs/integrations/vectorstores/vdms.ipynb - Added cookbook: cookbook/multi_modal_RAG_vdms.ipynb --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 03:12:11 +00:00
William FH	b7b62e29fb	community[patch], mongodb[patch]: Stop spamming SIMD import warnings (#19531 ) If you use an embedding dist function in an eval loop, you get warned every time. Would prefer to just check once and forget about it. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-28 03:11:02 +00:00
Tomaz Bratanic	b04e663426	experimental[patch]: Flatten relationships in LLM graph transformer (#19642 )	2024-03-27 19:35:34 -07:00
billytrend-cohere	36abb5dd41	cohere[patch]: Fix positional argument (#19678 ) cohere: Fix positional argument Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-28 02:26:08 +00:00
Nuno Campos	fdfb51ad8d	core: Two updates to chat model interface (#19684 ) - .stream() and .astream() call on_llm_new_token, removing the need for subclasses to do so. Backwards compatible because now we don't pass run_manager into ._stream and ._astream - .generate() and .agenerate() now handle `stream: bool` kwarg for _generate and _agenerate. Subclasses handle this arg by delegating to ._stream(), now one less thing they need to do. Backwards compat because this is an optional arg that we now never pass to the subclasses - .generate() and .agenerate() now inspect callback handlers to decide on a default value for stream:bool if not passed in. This auto enables streaming when using astream_events and astream_log - as a result of these three changes any usage of .astream_events and .astream_log should now yield chat model stream events - In future PRs we can update all subclasses to reflect these two things now handled by base class, but in meantime all will continue to work	2024-03-27 18:45:01 -07:00
harry-cohere	3685f8ceac	cohere[patch]: Add cohere tools agent (#19602 ) Description: Adds a cohere tools agent and related notebook. --------- Co-authored-by: BeatrixCohere <128378696+BeatrixCohere@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-27 18:35:43 -07:00
William FH	5c41f4083e	[Evals] Fix function calling support (#19658 ) Current implementation is overzealous in validating chat datasets Fixes [#langsmith-sdk:557](https://github.com/langchain-ai/langsmith-sdk/issues/557)	2024-03-27 17:23:35 -07:00
yongheng.liu	7e29b6061f	community[minor]: integrate China Mobile Ecloud vector search (#15298 ) - Description: integrate China Mobile Ecloud vector search, - Dependencies: elasticsearch==7.10.1 Co-authored-by: liuyongheng <liuyongheng@cmss.chinamobile.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-27 23:02:40 +00:00
Hyeongchan Kim	9b70131aed	community[patch]: refactor the type hint of `file_path` in `UnstructuredAPIFileLoader` class (#18839 ) * Description: add `None` type for `file_path` along with `str` and `List[str]` types. * `file_path`/`filename` arguments in `get_elements_from_api()` and `partition()` can be `None`, however, there's no `None` type hint for `file_path` in `UnstructuredAPIFileLoader` and `UnstructuredFileLoader` currently. * calling the function with `file_path=None` is no problem, but my IDE annoys me lol. * Issue: N/A * Dependencies: N/A Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-27 22:31:54 +00:00
CaroFG	cf96060ab7	community[patch]: update for compatibility with latest Meilisearch version (#18970 ) - Description: Updates Meilisearch vectorstore for compatibility with v1.6 and above. Adds embedders settings and embedder_name which are now required. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-27 22:08:27 +00:00
chyroc	be2adb1083	community[patch]: support unstructured_kwargs for s3 loader (#15473 ) fix https://github.com/langchain-ai/langchain/issues/15472 Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-27 22:03:48 +00:00
Bagatur	b901649032	docs: move extraction up (#19667 )	2024-03-27 14:55:16 -07:00
Kahlil Wehmeyer	9c08cdea92	core[patch]: ToolException docs/exception message (#17590 ) Description: This PR adds a slightly more helpful message to a Tool Exception ``` # current state langchain_core.tools.ToolException: Too many arguments to single-input tool # proposed state langchain_core.tools.ToolException: Too many arguments to single-input tool. Consider using a StructuredTool instead. ``` Issue: Somewhat discussed here 👉 #6197 Dependencies: None Twitter handle: N/A --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-27 21:52:36 +00:00
Evgenii Zheltonozhskii	5b1f9c6d3a	infra: Consistent lxml requirements (#19520 ) Update the dependency for lxml to be consistent among different packages; should fix https://github.com/langchain-ai/langchain/issues/19040	2024-03-27 20:27:59 +00:00
Filip Michalsky	2fceec3771	docs: update cookbook example for SalesGPT - include Stripe Payment Link Generation (#19622 ) Thank you for contributing to LangChain! - [ ] cookbook - update example for SalesGPT - include Stripe Payment Link Generation - Description: We updated the Jupyter notebook example with the ability of the AI Agent to negotiate with customers and then close the deal by generating a custom Stripe payment link. - Issue: N/A - Dependencies: N/a - Twitter handle: @FilipMichalsky @0xtotaylor If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Filip Michalsky <filip_michalsky@g.harvard.edu> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-27 20:16:21 +00:00
Christophe Bornet	33fa8cfcd0	core[minor]: Add async methods to MaxMarginalRelevanceExampleSelector (#19639 )	2024-03-27 16:03:18 -04:00
Taqi Jaffri	72c8b3127d	cli[patch]: Fix typo in dev script name for the --chat-playground option on the cli (#19673 ) Fixes typo --------- Co-authored-by: Taqi Jaffri <tjaffri@docugami.com>	2024-03-27 15:56:11 -04:00
Jan Nissen	2e0ddd6fb8	core[minor]: support pydantic v2 models in PydanticOutputParser (#18811 ) As mentioned in #18322, the current PydanticOutputParser won't work for anyone trying to parse to pydantic v2 models. This PR adds a separate `PydanticV2OutputParser`, as well as a `langchain_core.pydantic_v2` namespace that will fail on import to any projects using pydantic<2. Happy to update the docs for output parsers if this is something we're interesting in adding. On a separate note, I also updated `check_pydantic.sh` to detect pydantic imports with leading whitespace and excluded the internal namespaces. That change can be separated into its own PR if needed. --------- Co-authored-by: Jan Nissen <jan23@gmail.com>	2024-03-27 15:37:52 -04:00
Kangmoon Seo	d0accc3275	docs: fix error output in XMLOutputParser documentation (#19569 ) - Description: I've made a fix to a ParseError call in the XMLOutputParser documentation. - Issue: None - Dependencies: None Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-27 18:29:00 +00:00
Tomaz Bratanic	87d2a6b777	community[minor]: Add the option to omit schema refresh in Neo4jGraph (#19654 )	2024-03-27 14:20:12 -04:00
Bagatur	5fc6531c74	docs: use first_tool_only instead of return_single (#19666 )	2024-03-27 18:19:39 +00:00
jhicks2306	bcb8ab5216	docs: Improve docstring for Runnable bind method (#19659 ) Added example to the docstring of the "bind" method of Runnable. This makes it easier to understand the purpose of the method when reviewing in code editors. E.g. VS Code below. <img width="833" alt="Screenshot 2024-03-27 at 16 24 18" src="https://github.com/langchain-ai/langchain/assets/45722942/ad022d4e-7bc0-4f4b-aa7a-838f1816cc52"> --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-03-27 14:05:41 -04:00
ccurme	4e9b358ed8	docs: Fix broken imports in documentation (#19655 ) Found via script in https://github.com/langchain-ai/langchain/pull/19611	2024-03-27 13:54:05 -04:00
Rajendra Kadam	0019d8a948	community[minor]: Add support for non-file-based Document Loaders in PebbloSafeLoader (#19574 ) Description: PebbloSafeLoader: Add support for non-file-based Document Loaders This pull request enhances PebbloSafeLoader by introducing support for several non-file-based Document Loaders. With this update, PebbloSafeLoader now seamlessly integrates with the following loaders: - GoogleDriveLoader - SlackDirectoryLoader - Unstructured EmailLoader Issue: NA Dependencies: - None Twitter handle: @Raj__725 --------- Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-03-27 17:39:52 +00:00
Christophe Bornet	9954c6a38e	langchain[minor]: Add async methods to EncoderBackedStore (#19597 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-27 17:36:36 +00:00
Erick Friis	929ed65554	cohere[patch]: release 0.1.0rc1 (#19663 )	2024-03-27 17:14:56 +00:00
hulitaitai	dc2c9dd4d7	Update text2vec.py (#19657 ) Add that URL of the embedding tool "text2vec". Fix minor mistakes in the doc-string.	2024-03-27 13:13:30 -04:00
Erick Friis	7630e9529c	Revert "community: added `partners/package-name` folders" (#19662 ) Reverts langchain-ai/langchain#19290	2024-03-27 17:09:30 +00:00
Christophe Bornet	409c6eeb0b	core: Add async methods to LengthBasedExampleSelector (#19640 )	2024-03-27 13:05:58 -04:00
Bagatur	c7f1962f73	core[patch]: Release 0.1.35 (#19660 )	2024-03-27 16:54:03 +00:00
Eugene Yurtsev	e8339b1d83	core[patch]: Patch XML vulnerability in XMLOutputParser (CVE-2024-1455) (#19653 ) Patch potential XML vulnerability CVE-2024-1455 This patches a potential XML vulnerability in the XMLOutputParser in langchain-core. The vulnerability in some situations could lead to a denial of service attack. At risk are users that: 1) Running older distributions of python that have older version of libexpat 2) Are using XMLOutputParser with an agent 3) Accept inputs from untrusted sources with this agent (e.g., endpoint on the web that allows an untrusted user to interact wiith the parser)	2024-03-27 12:41:52 -04:00
Guangdong Liu	7042934b5f	community[patch]: Fix the bug that Chroma does not specify `embedding_function` (#19277 ) - Issue: close #18291 - @baskaryan, @eyurtsev PTAL	2024-03-27 11:43:38 -04:00
billytrend-cohere	85f57ab4cd	cohere[patch]: Fix cohere rerank (#19624 ) Fix cohere rerank inspired by https://github.com/langchain-ai/langchain/pull/19486	2024-03-27 08:41:53 -07:00
Eugene Yurtsev	8ab7bb3166	core[patch]: XMLOutputParser fix to handle changes to xml standard library (#19612 ) Newest python micro releases broke streaming in the XMLOutputParser. This fixes the parsing code to work with trailing junk after the XML content.	2024-03-27 09:25:28 -04:00
yuwenzho	3a7d2cf443	community[minor]: Add ITREX optimized Embeddings (#18474 ) Introduction [Intel® Extension for Transformers](https://github.com/intel/intel-extension-for-transformers) is an innovative toolkit designed to accelerate GenAI/LLM everywhere with the optimal performance of Transformer-based models on various Intel platforms Description adding ITREX runtime embeddings using intel-extension-for-transformers. added mdx documentation and example notebooks added embedding import testing. --------- Signed-off-by: yuwenzho <yuwen.zhou@intel.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-27 07:22:06 +00:00
Juan Jose Miguel Ovalle Villamil	1fe10a3e3d	experimental[patch]: Enhance LLMGraphTransformer with async processing and improved readability (#19205 ) - [x] PR title: "experimental: Enhance LLMGraphTransformer with async processing and improved readability" - [x] PR message: - Description: This pull request refactors the `process_response` and `convert_to_graph_documents` methods in the LLMGraphTransformer class to improve code readability and adds async versions of these methods for concurrent processing. The main changes include: - Simplifying list comprehensions and conditional logic in the process_response method for better readability. - Adding async versions aprocess_response and aconvert_to_graph_documents to enable concurrent processing of documents. These enhancements aim to improve the overall efficiency and maintainability of the `LLMGraphTransformer` class. - Issue: N/A - Dependencies: No additional dependencies required. - Twitter handle: @jjovalle99 - [x] Add tests and docs: N/A (This PR does not introduce a new integration) - [x] Lint and test: Ran make format, make lint, and make test from the root of the modified package(s). All tests pass successfully. Additional notes: - The changes made in this PR are backwards compatible and do not introduce any breaking changes. - The PR touches only the `LLMGraphTransformer` class within the experimental package. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 23:40:21 -07:00
Fabrizio Ruocco	f12cb0bea4	community[patch]: Microsoft Azure Document Intelligence updates (#16932 ) - Description: Update Azure Document Intelligence implementation by Microsoft team and RAG cookbook with Azure AI Search --------- Co-authored-by: Lu Zhang (AI) <luzhan@microsoft.com> Co-authored-by: Yateng Hong <yatengh@microsoft.com> Co-authored-by: teethache <hongyateng2006@126.com> Co-authored-by: Lu Zhang <44625949+luzhang06@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 23:36:59 -07:00
Guangdong Liu	cd79305eb9	openai[patch]: fix AzureChatOpenAI missing parameter problem (#19258 ) - Issue: close #19255 - PTAL @baskaryan @eyurtsev	2024-03-26 22:31:36 -07:00
Leonid Ganeline	3a978a4bdc	docs: `output_parsers` page fix (#19623 ) Issue with this [page](https://python.langchain.com/docs/modules/model_io/output_parsers/): Table: "Input Type" columns: strings `str \\| Message` (the escape char "\" doesn't work inside backticked text).	2024-03-26 22:17:41 -07:00
Ethan Yang	28cd5522c2	docs: fix typo in openvino document (#19627 )	2024-03-26 22:13:54 -07:00
xsai9101	1c27de6ce2	docs: Fix oracle doc loader format issue (#19628 )	2024-03-26 22:13:36 -07:00
Timothy	ad77fa15ee	community[patch]: Adding try-except block for GCSDirectoryLoader (#19591 ) - Description: Implemented try-except block for `GCSDirectoryLoader`. Reason: Users processing large number of unstructured files in a folder may experience many different errors. A try-exception block is added to capture these errors. A new argument `use_try_except=True` is added to enable silent failure so that error caused by processing one file does not break the whole function. - Issue: N/A - Dependencies: no new dependencies - Twitter handle: timothywong731 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-27 00:12:24 +00:00
fzowl	aea2be5bf3	voyageai[patch]: VoyageAI rerank (#19521 ) Adding VoyageAI reranking --------- Co-authored-by: fodizoltan <zoltan@conway.expert> Co-authored-by: Yujie Qian <thomasq0809@gmail.com>	2024-03-26 17:07:23 -07:00
Leonid Ganeline	4d85485e71	docs: `PromptTemplate` import from `core` (#19616 ) Changed import of `PromptTemplate` from `langchain` to `langchain_core` in all examples (notebooks)	2024-03-26 17:03:36 -07:00
Leonid Ganeline	3dc0f3c371	experimental[patch]: `PromptTemplate` import fix (#19617 ) Changed import of `PromptTemplate` from `langchain` to `langchain_core` in `langchain_experimental`	2024-03-26 17:03:13 -07:00
xsai9101	160a8eb178	community[minor]: add oracle autonomous database doc loader integration (#19536 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: Adding oracle autonomous database document loader integration. This will allow users to connect to oracle autonomous database through connection string or TNS configuration. https://www.oracle.com/autonomous-database/ - Issue: None - Dependencies: oracledb python package https://pypi.org/project/oracledb/ - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. Unit test and doc are added. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-26 17:02:18 -07:00
Ethan Yang	5784dfed00	docs: update openvino documents (#19543 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 22:15:30 +00:00
Erick Friis	bf8ba00520	cli[patch]: release 0.0.22rc0, chat playground (#19614 )	2024-03-26 15:08:56 -07:00
Leonid Ganeline	a3d24bc10b	docs: release date fix (#19585 ) Replaced the overdue release promise.	2024-03-26 14:51:09 -07:00
Raghav Rawat	b5640a0883	docs: Update apify.ipynb for Document class import (#19598 ) - Description: Update to correctly import Document class - from langchain_core.documents import Document - Issue: Fixes the notebook and the hosted documentation [here](https://python.langchain.com/docs/integrations/tools/apify) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 21:46:29 +00:00
jhicks2306	087823aefa	docs: Update docstring for MessagesPlaceholder (#19601 ) Update to docstring for MessagesPlaceholder so that it shows helpful information in code editors. E.g. VS Code as shown below. <img width="587" alt="Screenshot 2024-03-26 at 17 18 58" src="https://github.com/langchain-ai/langchain/assets/45722942/8f49d09f-ed8d-4f61-a9d4-3611dbe9c9c5"> --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-26 14:34:00 -07:00
Christophe Bornet	7c2578bd55	langchain[patch]: Add async methods to EmbeddingRouterChain (#19603 )	2024-03-26 14:33:36 -07:00
Christophe Bornet	b3d7b5a653	langchain[patch[: Add async methods to TimeWeightedVectorStoreRetriever (#19606 )	2024-03-26 14:03:47 -07:00
Adam Law	aeb7b6b11d	community[patch]: use semantic_configurations in AzureSearch (#19347 ) - Description: Currently the semantic_configurations are not used when creating an AzureSearch instance, instead creating a new one with default values. This PR changes the behavior to use the passed semantic_configurations if it is present, and the existing default configuration if not. --------- Co-authored-by: Adam Law <adamlaw@microsoft.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-26 13:57:39 -07:00
Christophe Bornet	a7274f006e	langchain[patch]: Add async methods to VectorstoreIndexCreator (#19582 )	2024-03-26 13:57:13 -07:00
Bagatur	241774012a	core[patch]: Release 0.1.34 (#19609 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-26 13:50:48 -07:00
Nuno Campos	c78eb55859	load: Optionally disable reading secrets from env (#19596 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-26 20:32:56 +00:00
Eugene Yurtsev	d3c9974da2	core[patch]: Temporarily disable test for streaming xml parser (#19610 ) Test is failing due to micro version bump in python interpreter which changed something about how std xml parser works	2024-03-26 20:24:20 +00:00
Eugene Yurtsev	8bc5cdccee	core[patch]: Reverting changes with defusedXML (#19604 ) DefusedXML is causing parsing errors on previously functional code with the 0.7.x versions. These do not seem to support newer version of python well. 0.8.x has only been released as rc, so we're not going to to use it in the core package	2024-03-26 15:13:09 -04:00
Giannis	9ea2a9b0c1	cohere[patch]: Add additional kwargs support for Cohere SDK params (#19533 ) * Adds support for `additional_kwargs` in `get_cohere_chat_request` * This functionality passes in Cohere SDK specific parameters from `BaseMessage` based classes to the API --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-26 18:30:37 +00:00
Adrian Valente	2763d8cbe5	community: add len() implementation to Chroma (#19419 ) Thank you for contributing to LangChain! - [x] Add len() implementation to Chroma: "package: community" - [x] PR message: - Description: add an implementation of the __len__() method for the Chroma vectostore, for convenience. - Issue: no exposed method to know the size of a Chroma vectorstore - Dependencies: None - Twitter handle: lowrank_adrian - [x] Add tests and docs - [x] Lint and test --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 12:53:10 -04:00
Tom Aarsen	e0a1278d2b	docs: HFEmbeddings: Add more information to model_kwargs/encode_kwargs (#19594 ) - Description: Be more explicit with the `model_kwargs` and `encode_kwargs` for `HuggingFaceEmbeddings`. - Issue: - - Dependencies: - I received some reports by my users that they didn't realise that you could change the default `batch_size` with `HuggingFaceEmbeddings`, which may be attributed to how the `model_kwargs` and `encode_kwargs` don't give much information about what you can specify. I've added some parameter names & links to the Sentence Transformers documentation to help clear it up. Let me know if you'd rather have Markdown/Sphinx-style hyperlinks rather than a "bare URL". - Tom Aarsen	2024-03-26 12:46:04 -04:00
Dobiichi-Origami	18e6f9376d	community[Qianfan]: add function_call in additional_kwargs (#19550 ) - Description: add lacked `function_call` field in `additional_kwargs` in previous version - Dependencies: None of new dependency	2024-03-26 12:20:19 -04:00
Eugene Yurtsev	9c7e860cf6	core[patch]: Remove anyio dependency (#19583 ) The dependency isn't used anymore	2024-03-26 11:59:22 -04:00
mwmajewsk	f7a1fd91b8	community: better support of pathlib paths in document loaders (#18396 ) So this arose from the https://github.com/langchain-ai/langchain/pull/18397 problem of document loaders not supporting `pathlib.Path`. This pull request provides more uniform support for Path as an argument. The core ideas for this upgrade: - if there is a local file path used as an argument, it should be supported as `pathlib.Path` - if there are some external calls that may or may not support Pathlib, the argument is immidiately converted to `str` - if there `self.file_path` is used in a way that it allows for it to stay pathlib without conversion, is is only converted for the metadata. Twitter handle: https://twitter.com/mwmajewsk	2024-03-26 11:51:52 -04:00
Guangdong Liu	94b869a974	github action: Add dead link check for .mdx files (#19492 ) - Description: Add dead link check for .mdx files. I checked the logs and found that files with .mdx suffix were not checked. https://github.com/langchain-ai/langchain/actions/runs/8409525467/job/23026924465#logs - @baskaryan, @efriis, @eyurtsev, @hwchase17.	2024-03-26 08:42:34 -07:00
Christophe Bornet	6f477e3cb6	docs: Remove chromadb from required dependency in examples with VectorstoreIndexCreator (#19578 )	2024-03-26 11:12:21 -04:00
Yuki Watanabe	cfecbda48b	community[minor]: Allow passing `allow_dangerous_deserialization` when loading LLM chain (#18894 ) ### Issue Recently, the new `allow_dangerous_deserialization` flag was introduced for preventing unsafe model deserialization that relies on pickle without user's notice (#18696). Since then some LLMs like Databricks requires passing in this flag with true to instantiate the model. However, this breaks existing functionality to loading such LLMs within a chain using `load_chain` method, because the underlying loader function [load_llm_from_config](`f96dd57501/libs/langchain/langchain/chains/loading.py (L40)`) (and load_llm) ignores keyword arguments passed in. ### Solution This PR fixes this issue by propagating the `allow_dangerous_deserialization` argument to the class loader iff the LLM class has that field. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 11:07:55 -04:00
hulitaitai	d7c14cb6f9	community[minor]: Add embeddings integration for text2vec (#19267 ) Create a Class which allows to use the "text2vec" open source embedding model. It should install the model by running 'pip install -U text2vec'. Example to call the model through LangChain: from langchain_community.embeddings.text2vec import Text2vecEmbeddings embedding = Text2vecEmbeddings() bookend.embed_documents([ "This is a CoSENT(Cosine Sentence) model.", "It maps sentences to a 768 dimensional dense vector space.", ]) bookend.embed_query( "It can be used for text matching or semantic search." ) --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-26 11:06:58 -04:00
Shotaro Sano	55c624a694	infra: Resolve the endless dependency resolution during the build of `dev.Dockerfile` by copying `poetry.lock` (#19465 ) ## Description This PR proposes a modification to the `libs/langchain/dev.Dockerfile` configuration to copy the `libs/langchain/poetry.lock` into the working directory. The change aims to address the issue where the Poetry install command, the last command in the `dev.Dockerfile`, takes excessively long hours, and to ensure the reproducibility of the poetry environment in the devcontainer. ## Problem The `dev.Dockerfile`, prepared for development environments such as `.devcontainer`, encounters an unending dependency resolution when attempting the Poetry installation. ### Steps to Reproduce Execute the following build command: ```bash docker build -f libs/langchain/dev.Dockerfile . ``` ### Current Behavior The Docker build process gets stuck at the following step, which, in my experience, did not conclude even after an entire night: ``` => [langchain-dev-dependencies 4/6] COPY libs/community/ ../community/ 0.9s => [langchain-dev-dependencies 5/6] COPY libs/text-splitters/ ../text-splitters/ 0.0s => [langchain-dev-dependencies 6/6] RUN poetry install --no-interaction --no-ansi --with dev,test,docs 12.3s => => # Updating dependencies => => # Resolving dependencies... ``` ### Expected Behavior The Docker build completes in a realistic timeframe. By applying this PR, the build finishes within a few minutes. ### Analysis The complexity of LangChain's dependencies has reached a point where Poetry is required to resolve dependencies akin to threading a needle. Consequently, poetry install fails to complete in a practical timeframe. ## Solution The solution for dependency resolution is already recorded in `libs/langchain/poetry.lock`, so we can use it. When copying `project.toml` and `poetry.toml`, the `poetry.lock` located in the same directory should also be copied. ```diff # Copy only the dependency files for installation -COPY libs/langchain/pyproject.toml libs/langchain/poetry.toml ./ +COPY libs/langchain/pyproject.toml libs/langchain/poetry.toml libs/langchain/poetry.lock ./ ``` ## Note I am not intimately familiar with the historical context of the `dev.Dockerfile` and thus do not know why `poetry.lock` has not been copied until now. It might have been an oversight, or perhaps dependency resolution used to complete quickly even without the `poetry.lock` file in the past. However, if there are deliberate reasons why copying `poetry.lock` is not advisable, please just close this PR.	2024-03-26 10:54:53 -04:00
Kalyan Mudumby	d27600c6f7	community[patch]: GPTCache pydantic validation error on lookup (#19427 ) Description: this change fixes the pydantic validation error when looking up from GPTCache, the `ChatOpenAI` class returns `ChatGeneration` as response which is not handled. use the existing `_loads_generations` and `_dumps_generations` functions to handle it Trace ``` File "/home/theinhumaneme/Documents/NebuLogic/conversation-bot/development/scripts/chatbot-postgres-test.py", line 90, in <module> print(llm.invoke("tell me a joke")) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/theinhumaneme/Documents/NebuLogic/conversation-bot/venv/lib/python3.11/site-packages/langchain_core/language_models/chat_models.py", line 166, in invoke self.generate_prompt( File "/home/theinhumaneme/Documents/NebuLogic/conversation-bot/venv/lib/python3.11/site-packages/langchain_core/language_models/chat_models.py", line 544, in generate_prompt return self.generate(prompt_messages, stop=stop, callbacks=callbacks, kwargs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/theinhumaneme/Documents/NebuLogic/conversation-bot/venv/lib/python3.11/site-packages/langchain_core/language_models/chat_models.py", line 408, in generate raise e File "/home/theinhumaneme/Documents/NebuLogic/conversation-bot/venv/lib/python3.11/site-packages/langchain_core/language_models/chat_models.py", line 398, in generate self._generate_with_cache( File "/home/theinhumaneme/Documents/NebuLogic/conversation-bot/venv/lib/python3.11/site-packages/langchain_core/language_models/chat_models.py", line 585, in _generate_with_cache cache_val = llm_cache.lookup(prompt, llm_string) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/theinhumaneme/Documents/NebuLogic/conversation-bot/venv/lib/python3.11/site-packages/langchain_community/cache.py", line 807, in lookup return [ ^ File "/home/theinhumaneme/Documents/NebuLogic/conversation-bot/venv/lib/python3.11/site-packages/langchain_community/cache.py", line 808, in <listcomp> Generation(generation_dict) for generation_dict in json.loads(res) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/theinhumaneme/Documents/NebuLogic/conversation-bot/venv/lib/python3.11/site-packages/langchain_core/load/serializable.py", line 120, in __init__ super().__init__(**kwargs) File "/home/theinhumaneme/Documents/NebuLogic/conversation-bot/venv/lib/python3.11/site-packages/pydantic/v1/main.py", line 341, in __init__ raise validation_error pydantic.v1.error_wrappers.ValidationError: 1 validation error for Generation type unexpected value; permitted: 'Generation' (type=value_error.const; given=ChatGeneration; permitted=('Generation',)) ``` Although I don't seem to find any issues here, here's an [issue](https://github.com/zilliztech/GPTCache/issues/585) raised in GPTCache. Please let me know if I need to do anything else Thank you --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 10:52:30 -04:00
Leonid Ganeline	4159a4723c	experimental[patch]: update module doc strings (#19539 ) Added missed module descriptions. Fixed format.	2024-03-26 10:38:10 -04:00
Piyush Jain	72ba738bf5	community[minor]: Improvements for NeptuneRdfGraph, Improve discovery of graph schema using database statistics (#19546 ) Fixes linting for PR [19244](https://github.com/langchain-ai/langchain/pull/19244) --------- Co-authored-by: mhavey <mchavey@gmail.com>	2024-03-26 10:36:51 -04:00
aditya thomas	fc6b92bb9a	docs: add cohere to the list of partners (#19552 ) Description: Add Cohere to the list of LangChain partners Issue: The Cohere partner package was recently added [#19049](https://github.com/langchain-ai/langchain/pull/19049) Dependencies: None	2024-03-26 10:22:03 -04:00
Christophe Bornet	1f422318b7	core[minor]: Use BaseChatMessageHistory async methods in RunnableWithMessageHistory (#19565 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-26 14:13:58 +00:00
Christophe Bornet	8595c3ab59	community[minor]: Add InMemoryVectorStore to module level imports (#19576 )	2024-03-26 14:07:44 +00:00
Christophe Bornet	a9457d269e	core: Add async methods to BaseExampleSelector and SemanticSimilarityExampleSelector (#19399 ) Few-Shot prompt template may use a `SemanticSimilarityExampleSelector` that in turn uses a `VectorStore` that does I/O operations. So to work correctly on the event loop, we need: * async methods for the `VectorStore` (OK) * async methods for the `SemanticSimilarityExampleSelector` (this PR) * async methods for `BasePromptTemplate` and `BaseChatPromptTemplate` (future work)	2024-03-26 10:06:43 -04:00
Christophe Bornet	29c58528c7	core[minor]: Add default implementations to amax_marginal_relevance_search_by_vector and adelete (#19269 )	2024-03-26 10:03:22 -04:00
Christophe Bornet	999365186b	langchain[major]: Use InMemoryVectorStore by default in VectorstoreIndexCreator (#19575 ) This is a small breaking change but I think it should be done as: * No external dependency needs to be installed anymore for the default to work * It is vendor-neutral	2024-03-26 10:01:23 -04:00
standby24x7	16e64d889a	docs: Update function "run" to "invoke" in fake_llm.ipynb (#19570 ) This patch updates function "run" to "invoke" in fake_llm.ipynb. Without this patch, you see following warning. LangChainDeprecationWarning: The function `run` was deprecated in LangChain 0.1.0 and will be removed in 0.2.0. Use invoke instead. Signed-off-by: Masanari Iida <standby24x7@gmail.com>	2024-03-26 09:54:31 -04:00
Guangdong Liu	c93d4ea91c	docs: Add in code documentation to core Runnable map methods (docs only) (#19517 ) - Issue: #18804 - @baskaryan, @eyurtsev	2024-03-25 19:18:30 -07:00
Leonid Ganeline	0199b73188	docs: added `partners/package-name` folders (#19290 ) Added references to new integration packages from Google, by adding subfolders to `partners/`. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-26 02:16:59 +00:00
Aayush Kataria	03c38005cb	community[patch]: Fixing some caching issues for AzureCosmosDBSemanticCache (#18884 ) Fixing some issues for AzureCosmosDBSemanticCache - Added the entry for "AzureCosmosDBSemanticCache" which was missing in langchain/cache.py - Added application name when creating the MongoClient for the AzureCosmosDBVectorSearch, for tracking purposes. @baskaryan, can you please review this PR, we need this to go in asap. These are just small fixes which we found today in our testing.	2024-03-25 19:06:17 -07:00
Clément Tamines	a6cbb755a7	community[patch]: fix semantic answer bug in AzureSearch vector store (#18938 ) - Description: The `semantic_hybrid_search_with_score_and_rerank` method of `AzureSearch` contains a hardcoded field name "metadata" for the document metadata in the Azure AI Search Index. Adding such a field is optional when creating an Azure AI Search Index, as other snippets from `AzureSearch` test for the existence of this field before trying to access it. Furthermore, the metadata field name shouldn't be hardcoded as "metadata" and use the `FIELDS_METADATA` variable that defines this field name instead. In the current implementation, any index without a metadata field named "metadata" will yield an error if a semantic answer is returned by the search in `semantic_hybrid_search_with_score_and_rerank`. - Issue: https://github.com/langchain-ai/langchain/issues/18731 - Prior fix to this bug: This bug was fixed in this PR https://github.com/langchain-ai/langchain/pull/15642 by adding a check for the existence of the metadata field named `FIELDS_METADATA` and retrieving a value for the key called "key" in that metadata if it exists. If the field named `FIELDS_METADATA` was not present, an empty string was returned. This fix was removed in this PR https://github.com/langchain-ai/langchain/pull/15659 (see `ed1ffca911`#). @lz-chen: could you confirm this wasn't intentional? - New fix to this bug: I believe there was an oversight in the logic of the fix from [#1564](https://github.com/langchain-ai/langchain/pull/15642) which I explain below. The `semantic_hybrid_search_with_score_and_rerank` method creates a dictionary `semantic_answers_dict` with semantic answers returned by the search as follows. `5c2f7e6b2b/libs/community/langchain_community/vectorstores/azuresearch.py (L574-L581)` The keys in this dictionary are the unique document ids in the index, if I understand the [documentation of semantic answers](https://learn.microsoft.com/en-us/azure/search/semantic-answers) in Azure AI Search correctly. When the method transforms a search result into a `Document` object, an "answer" key is added to the document's metadata. The value for this "answer" key should be the semantic answer returned by the search from this document, if such an answer is returned. The match between a `Document` object and the semantic answers returned by the search should be done through the unique document id, which is used as a key for the `semantic_answers_dict` dictionary. This id is defined in the search result's field named `FIELDS_ID`. I added a check to avoid any error in case no field named `FIELDS_ID` exists in a search result (which shouldn't happen in theory). A benefit of this approach is that this fix should work whether or not the Azure AI Search Index contains a metadata field. @levalencia could you confirm my analysis and test the fix? @raunakshrivastava7 do you agree with the fix? Thanks for the help!	2024-03-25 18:51:54 -07:00
miri-bar	55db737302	ai21[minor]: AI21 Labs Semantic Text Splitter support (#19510 ) Description: Added support for AI21 Labs model - Segmentation, as a Text Splitter Dependencies: ai21, langchain-text-splitter Twitter handle: https://github.com/AI21Labs --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-26 01:39:37 +00:00
Anindyadeep	b2a11ce686	community[minor]: Prem AI langchain integration (#19113 ) ### Prem SDK integration in LangChain This PR adds the integration with [PremAI's](https://www.premai.io/) prem-sdk with langchain. User can now access to deployed models (llms/embeddings) and use it with langchain's ecosystem. This PR adds the following: ### This PR adds the following: - [x] Add chat support - [X] Adding embedding support - [X] writing integration tests - [X] writing tests for chat - [X] writing tests for embedding - [X] writing unit tests - [X] writing tests for chat - [X] writing tests for embedding - [X] Adding documentation - [X] writing documentation for chat - [X] writing documentation for embedding - [X] run `make test` - [X] run `make lint`, `make lint_diff` - [X] Final checks (spell check, lint, format and overall testing) --------- Co-authored-by: Anindyadeep Sannigrahi <anindyadeepsannigrahi@Anindyadeeps-MacBook-Pro.local> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 01:37:19 +00:00
Alessandro D'Armiento	37eb3a4a9e	docs: Some import nits (#19130 ) - Description: fixes some minor issues in the documentation --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-26 01:25:44 +00:00
Souhail Hanfi	cbec43afa9	community[patch]: avoid creating extension PGvector while using readOnly Databases (#19268 ) - Description: PgVector class always runs "create extension" on init and this statement crashes on ReadOnly databases (read only replicas). but wierdly the next create collection etc work even in readOnly databases - Dependencies: no new dependencies - Twitter handle: @VenOmaX666 Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 01:25:01 +00:00
Dixing (Dex) Xu	903541f439	docs: update dependecy for autogpt/marathon.ipynb (#19491 ) fixes the import error from notebook based on the [documentation](https://api.python.langchain.com/en/latest/agents/langchain_experimental.agents.agent_toolkits.pandas.base.create_pandas_dataframe_agent.html) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-25 18:14:22 -07:00
Mauricio Cruz	fb9ce95184	cli[patch]: Fix Tuple typing problem when create new langchain app (#19141 ) Thank you for contributing to LangChain! When run command langchain app new my-app, i get this error: File "/home/mauricio/.local/lib/python3.8/site-packages/langchain_cli/utils/pyproject.py", line 15, in <module> pyproject_toml: Path, local_editable_dependencies: Iterable[tuple[str, Path]] TypeError: 'type' object is not subscriptable This PR fix the error.	2024-03-26 01:09:51 +00:00
Anthony Shaw	6c9b0f96f3	docs: Add guidance for splitting Chinese, Japanese, and Thai (#19295 ) The existing default list of separators for the `RecursiveTextSplitter` assumes spaces are word boundaries. Some languages [don't use spaces between words](https://en.wikipedia.org/wiki/Category:Writing_systems_without_word_boundaries) (Chinese, Japanese, Thai, Burmese). This PR extends the documentation to explain how to cater for those languages by adding additional punctuation to the separators and zero-width spaces which are used by some typesetters and will assist the splitter to not split in words. Ideally, these separators could be a constant in the module but for now, defining them in the documentation is a start.	2024-03-26 00:34:00 +00:00
Erick Friis	441a8012b3	mistralai[patch]: release 0.1.0 (#19540 )	2024-03-25 17:29:40 -07:00
Barun Amalkumar Halder	9246ec6b36	community[patch] : [Fiddler] ensure dataset is not added if model is present (#19293 ) Description: - minor PR to speed up onboarding by not trying to add a dataset, if a model is already present. - replace batch publish API with streaming when single events are published. Dependencies: any dependencies required for this change Twitter handle: behalder Co-authored-by: Barun Halder <barun@fiddler.ai>	2024-03-25 17:28:05 -07:00
JSDu	6e090280fd	community[patch]: milvus will autoflush, manual flush is slowly (#19300 ) reference: https://milvus.io/docs/configure_quota_limits.md#quotaAndLimitsflushRateenabled https://github.com/milvus-io/milvus/issues/31407 Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 00:26:58 +00:00
mackong	e65dc4b95b	community[patch]: clean warning when delete by ids (#19301 ) * Description: rearrange to avoid variable overwrite, which cause warning always. * Issue: N/A * Dependencies: N/A	2024-03-25 17:23:22 -07:00
Ian	d5415dbd68	docs: improve tidb integrations documents (#19321 ) This PR aims to enhance the documentation for TiDB integration, driven by feedback from our users. It provides detailed introductions to key features, ensuring developers can fully leverage TiDB for AI application development.	2024-03-25 17:08:23 -07:00
Stefano Mosconi	01fc69c191	community[patch]: expanding version in confluence loader (#19324 ) Description: Expanding version in all the Confluence API calls so to get when the page was last modified/created in all cases. Issue: #12812 Twitter handle: zzste	2024-03-25 17:08:01 -07:00
Dmitry Tyumentsev	08b769d539	community[patch]: YandexGPT Use recent yandexcloud sdk version (#19341 ) Fixed inability to work with [yandexcloud SDK](https://pypi.org/project/yandexcloud/) version higher 0.265.0	2024-03-25 17:05:57 -07:00
Marlene	f1313339ac	community[patch]: Fixing incorrect base URLs for Azure Cognitive Search Retriever (#19352 ) This PR adds code to make sure that the correct base URL is being created for the Azure Cognitive Search retriever. At the moment an incorrect base URL is being generated. I think this is happening because the original code was based on a depreciated API version. No dependencies need to be added. I've also added more context to the test doc strings. I should also note that ACS is now Azure AI Search. I will open a separate PR to make these changes as that would be a breaking change and should potentially be discussed. Twitter: @marlene_zw - No new tests added, however the current ACS retriever tests are now passing when I run them. - Code was linted. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-26 00:04:59 +00:00
Tridib Roy Arjo	d667b1ea8f	docs: Update async_chromium.ipynb (#19514 ) In Jupyter, asyncio would throw an error before `.load()` unless `nest_asyncio` is applied (Issue #8494 mentioned this) +Minor typo fixes..	2024-03-26 00:02:50 +00:00
Bob Lin	5b6b1f9e1d	docs: Fix several sample code errors (#19382 )	2024-03-25 16:59:52 -07:00
FinTech秋田	03ba1d4731	community[patch]: Add Support for GPU Index Types in Milvus 2.4 (#19468 ) - Description: This commit introduces support for the newly available GPU index types introduced in Milvus 2.4 within the LangChain project's `milvus.py`. With the release of Milvus 2.4, a range of GPU-accelerated index types have been added, offering enhanced search capabilities and performance optimizations for vector search operations. This update ensures LangChain users can fully utilize the new performance benefits for vector search operations. - Reference: https://milvus.io/docs/gpu_index.md Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-25 23:39:54 +00:00
Hamid Ali	c281ec8887	docs: Fix broken link in semantic-chunker.ipynb (#19464 ) Corrected a broken link within the semantic-chunker.ipynb notebook, ensuring that users can access the referenced resource. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-25 23:39:32 +00:00
Ash Vardanian	d01bad5169	core[patch]: Convert SimSIMD back to NumPy (#19473 ) This patch fixes the #18022 issue, converting the SimSIMD internal zero-copy outputs to NumPy. I've also noticed, that oftentimes `dtype=np.float32` conversion is used before passing to SimSIMD. Which numeric types do LangChain users generally care about? We support `float64`, `float32`, `float16`, and `int8` for cosine distances and `float16` seems reasonable for practically any kind of embeddings and any modern piece of hardware, so we can change that part as well 🤗	2024-03-25 16:36:26 -07:00
Ikko Eltociear Ashimine	980658cb47	docs: Update streaming.ipynb (#19500 ) Fixed typo. occuring -> occurring	2024-03-25 16:21:45 -07:00
Leonid Kuligin	91f4c80143	docs: fixed links (#19503 ) - [ ] PR title: "docs: fixed broken links" - [ ] PR message: - Description: fixed links in the documentation	2024-03-25 16:19:28 -07:00
Mikelarg	dac2e0165a	community[minor]: Added GigaChat Embeddings support + updated previous GigaChat integration (#19516 ) - Description: Added integration with [GigaChat](https://developers.sber.ru/portal/products/gigachat) embeddings. Also added support for extra fields in GigaChat LLM and fixed docs.	2024-03-25 16:08:37 -07:00
Martin Kolb	e5bdb26f76	community[patch]: More flexible handling for entity names in vector store "HANA Cloud" (#19523 ) - Description: Added support for lower-case and mixed-case names The names for tables and columns previouly had to be UPPER_CASE. With this enhancement, also lower_case and MixedCase are supported, - Issue: N/A - Dependencies: no new dependecies added - Twitter handle: @sapopensource	2024-03-25 15:52:45 -07:00
Erica Clark	a1ff21f90f	docs: Update local llms article to use invoke instead of deprecated __call__ (#19528 ) - Description: Since the implicit `__call__` has been deprecated in favor of `invoke`, the local_llms article also needed to be updated. This article was my introduction to Lanchain, and as it was helpful in getting me setup with running LLMs locally, it is nice to not have any warnings when running the example code. With this change, the warnings go away when running the example code. - Issue: N/A - Dependencies: N/A - Twitter handle: clarkerican	2024-03-25 15:51:39 -07:00
Orest Xherija	0b1e09029f	openai[patch]: increase max batch size for Azure OpenAI Embeddings API (#19532 ) Description: Azure OpenAI has increased its maximum batch size from 16 to 2048 for the Embeddings API per this How-To [page](https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/embeddings?tabs=console#best-practices) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-25 15:50:07 -07:00
Eugene Yurtsev	56f4c5459b	core[patch]: fix xml output parser transform (#19530 ) Previous PR passed _parser attribute which apparently is not meant to be used by user code and causes non deterministic failures on CI when testing the transform and a transform methods. Reverting this change temporarily.	2024-03-25 21:34:45 +00:00
Erick Friis	e6952b04d5	cohere[patch]: fix release (#19529 )	2024-03-25 13:46:29 -07:00
aditya thomas	aa68fd7e91	core[runnables]: docstring for class runnable, method with_listeners() (#19515 ) Description: Docstring for method with_listerners() of class Runnable Issue: [Add in code documentation to core Runnable methods #18804](https://github.com/langchain-ai/langchain/issues/18804) Dependencies: None	2024-03-25 16:24:58 -04:00
billytrend-cohere	63343b4987	cohere[patch]: add cohere as a partner package (#19049 ) Description: adds support for langchain_cohere --------- Co-authored-by: Harry M <127103098+harry-cohere@users.noreply.github.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-25 20:23:47 +00:00
Eugene Yurtsev	727d5023ce	core[patch]: Use defusedxml in XMLOutputParser (#19526 ) This mitigates a security concern for users still using older versions of libexpat that causes an attacker to compromise the availability of the system if an attacker manages to surface malicious payload to this XMLParser.	2024-03-25 16:21:52 -04:00
Zachary Wilkins	e1a6341940	langchain: Passthrough batch_size on index()/aindex() calls (#19443 ) Description: This change passes through `batch_size` to `add_documents()`/`aadd_documents()` on calls to `index()` and `aindex()` such that the documents are processed in the expected batch size. Issue: #19415 Dependencies: N/A Twitter handle: N/A	2024-03-25 11:58:29 -04:00
ccurme	82de8fd6c9	add kwargs (#19519 ) `HanaDB.add_texts` is missing **kwargs.	2024-03-25 11:56:01 -04:00
Nikhil Kumar	3d3b46a782	docs: Update docs for `HuggingFacePipeline` (#19306 ) Updated `HuggingFacePipeline` docs to be in sync with list of supported tasks, including translation. - [x] PR title: "community: Update docs for `HuggingFacePipeline`" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: - Description: Update docs for `HuggingFacePipeline`, was earlier missing `translation` as a valid task - Issue: N/A - Dependencies: N/A - Twitter handle: None - [x] Add tests and docs: - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-03-25 00:29:21 -07:00
Igor Muniz Soares	743f888580	community[minor]: Dappier chat model integration (#19370 ) Description: This PR adds [Dappier](https://dappier.com/) for the chat model. It supports generate, async generate, and batch functionalities. We added unit and integration tests as well as a notebook with more details about our chat model. Dependencies: No extra dependencies are needed.	2024-03-25 07:29:05 +00:00
Jacob Lezberg	64e1df3d3a	infra: Update package version to apply CVE-related patch (#19490 ) - Description: [CVE 2024-21503](https://www.cve.org/CVERecord?id=CVE-2024-21503) was recently identified. The python linter "black" suffers from a potential Regex-related denial of service attack. Updated version from the vulnerable 24.2.0 to the patched 24.3.0. - Issue: N/A - Dependencies: The 'black' package in both `langchain` (top-level) and `templates/python-lint`. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-25 07:11:23 +00:00
Hugoberry	96dc180883	community[minor]: Add `DuckDB` as a vectorstore (#18916 ) DuckDB has a cosine similarity function along list and array data types, which can be used as a vector store. - Description: The latest version of DuckDB features a cosine similarity function, which can be used with its support for list or array column types. This PR surfaces this functionality to langchain. - Dependencies: duckdb 0.10.0 - Twitter handle: @igocrite --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-25 07:02:35 +00:00
Ethan Yang	fa6397d76a	docs: Add OpenVINO llms docs (#19489 ) Add OpenVINOpipeline instructions in docs. OpenVINO users can find more details in this page.	2024-03-24 23:57:30 -07:00
preak95	6ea3e57a63	community[minor]: S3FileLoader to use expose mode and post_processors arguments of unstructured loader (#19270 ) Description: Update s3_file.py to use arguments mode and post_processors from the base class UnstructuredBaseLoader to include more metadata about the files from the S3 bucket such as 'page_number', 'languages' etc. Issue: NA Dependencies: None Twitter handle: preak95 --------- Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-25 06:56:55 +00:00
Guangdong Liu	560e2182d8	docs: docstring Runnable `pipe` and `pick` methods (docs only) (#19395 ) - Issue: #18804 - @eyurtsev @ccurme PTAL --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-24 23:50:04 -07:00
Christophe Bornet	63898dbda0	langchain[patch]: Use async memory in Chain when needed (#19429 )	2024-03-24 23:49:00 -07:00
Lance Martin	db7403d667	docs: Remove non-rendering images & output spamming from doc ntbks (#19475 ) Looking at tokens / page of our docs, we see a few outliers: <img width="761" alt="image" src="https://github.com/langchain-ai/langchain/assets/122662504/677aa2d6-0a29-45e4-882a-db2bbf46d02b"> It is due to non-rendering images in one case, and output spamming. Clean these, along with other cases of excessing output spamming in docs. All get sucked into chat-langchain for retrieval.	2024-03-24 23:47:38 -07:00
Erick Friis	b617085af0	mistralai[patch]: streaming tool calls (#19469 )	2024-03-23 19:24:53 +00:00
aditya thomas	b43a9d5808	docs: adding voyageai to the list of partner packages (#19376 ) Description: Adding VoyageAI to the list of partners Issue: A standalone langchain-voyageai package has been added Dependencies: None	2024-03-22 17:08:15 -07:00
Zeeland	2549df00cd	docs: fix error bilibili url (#19375 ) Thank you for contributing to LangChain! bilibili-api-python use https://github.com/Nemo2011/bilibili-api repo. Change to the correct address. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-22 17:06:17 -07:00
aditya thomas	375ab7bf59	docs: update module imports for fireworks documentation (#19377 ) Description: Update module imports for Fireworks documentation Issue: Module imports not present or in incorrect location Dependencies: None	2024-03-22 17:05:27 -07:00
aditya thomas	0cc0467267	docs: update import paths and move to lcel for llama.cpp examples (#19391 ) Description: Update import paths and move to lcel for llama.cpp examples Issue: Update import paths to reflect package refactoring and move chains to LCEL in examples Dependencies: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-23 00:04:12 +00:00
fengjial	3b52ee05d1	community[patch]: fix bugs in baiduvectordb as vectorstore (#19380 ) fix small bugs in vectorstore/baiduvectordb	2024-03-22 17:03:59 -07:00
Cailin Wang	5402aef32e	docs: Add `partition` parameter to DashVector (#19385 ) Description: Add `partition` parameter to DashVector dashvector.ipynb Related PR: https://github.com/langchain-ai/langchain/pull/19023 Twitter handle: @CailinWang_ --------- Co-authored-by: root <root@Bluedot-AI>	2024-03-22 17:00:29 -07:00
aditya thomas	515aab3312	community[patch]: invoke callback prior to yielding token (openai) (#19389 ) Description: Invoke callback prior to yielding token for BaseOpenAI & OpenAIChat Issue: [Callback for on_llm_new_token should be invoked before the token is yielded by the model #16913](https://github.com/langchain-ai/langchain/issues/16913) Dependencies: None	2024-03-22 16:45:55 -07:00
aditya thomas	49e932cd24	community[patch]: invoke callback prior to yielding token (fireworks) (#19388 ) Description: Invoke callback prior to yielding token for Fireworks Issue: [Callback for on_llm_new_token should be invoked before the token is yielded by the model #16913](https://github.com/langchain-ai/langchain/issues/16913) Dependencies: None	2024-03-22 16:44:06 -07:00
aditya thomas	16ef88a87d	docs: moving FireworksEmbeddings documentation to docs folder (#19398 ) Description: Moving FireworksEmbeddings documentation to the location docs/integration/text_embedding/ from langchain_fireworks/docs/ Issue: FireworksEmbeddings documentation was not in the correct location Dependencies: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-22 23:24:22 +00:00
Leonid Ganeline	06190063e7	infra: makefile `api_docs_clean` fix (#19405 ) Fixed a Makefile command that cleans up the api_docs	2024-03-22 15:45:55 -07:00
Christophe Bornet	1b813fe6fe	langchain[patch]: Add async methods to VectorStoreRetrieverMemory (#19408 )	2024-03-22 15:44:24 -07:00
Tarun Jain	ef6d3d66d6	community[patch]: docarray requires hnsw installation (#19416 ) I have a small dataset, and I tried to use docarray: ``DocArrayHnswSearch ``. But when I execute, it returns: ```bash raise ImportError( ImportError: Could not import docarray python package. Please install it with `pip install "langchain[docarray]"`. ``` Instead of docarray it needs to be ```bash docarray[hnswlib] ``` Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-22 22:39:07 +00:00
German Swan	d4dc98a9f9	community[patch]: RecursiveUrlLoader: add base_url option (#19421 ) RecursiveUrlLoader does not currently provide an option to set `base_url` other than the `url`, though it uses a function with such an option. For example, this causes it unable to parse the `https://python.langchain.com/docs`, as it returns the 404 page, and `https://python.langchain.com/docs/get_started/introduction` has no child routes to parse. `base_url` allows setting the `https://python.langchain.com/docs` to filter by, while the starting URL is anything inside, that contains relevant links to continue crawling. I understand that for this case, the docusaurus loader could be used, but it's a common issue with many websites. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-22 15:34:31 -07:00
Erick Friis	e71daa7a03	openai[patch]: add test coverage to output (#19462 )	2024-03-22 15:33:10 -07:00
igeni	4babefcb2f	cli[patch]: Modified regular expression (#19449 ) - Description: Modified regular expression to add support for unicode chars and simplify pattern Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-22 15:24:08 -07:00
Ray Bell	7d36ee38b7	docs: point to titantic dataset on web (#19455 ) Updated `pd.read_csv("titantic.csv")` to `pd.read_csv("https://raw.githubusercontent.com/pandas-dev/pandas/main/doc/data/titanic.csv")` i.e. it will read it https://raw.githubusercontent.com/pandas-dev/pandas/main/doc/data/titanic.csv and allow anyone to run the code. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-22 22:22:41 +00:00
Ray Bell	f959fad56e	docs: use invoke instead of run (#19457 ) Updated the deprecated run with invoke Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-22 15:08:26 -07:00
Bagatur	d93d49bc43	openai[patch]: tool use integration test (#19460 )	2024-03-22 14:49:54 -07:00
Erick Friis	a99e644913	openai[patch]: integration test structured output (#19459 )	2024-03-22 21:43:24 +00:00
Erick Friis	ac57123f40	openai[patch]: release 0.1.1 (#19458 )	2024-03-22 21:36:21 +00:00
Luca Dorigo	47cfbe7522	openai[patch]: [URGENT REGRESSION FIX] Don't fail if tool message already doesn't contain name (#19435 ) - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-22 14:33:50 -07:00
aditya thomas	bc028294d0	docs: delete mistralai embeddings doc from incorrect location (#19432 ) Description: Delete MistralAIEmbeddings usage document from folder partners/mistralai/docs Issue: The document is present in the folder docs/docs Dependencies: None	2024-03-22 14:02:59 -07:00
Erick Friis	11e37943ed	mistralai[patch]: fix core version (#19454 )	2024-03-22 20:48:13 +00:00
Erick Friis	3b093160c4	mistralai[patch]: release 0.1.0rc1 (#19453 )	2024-03-22 20:34:36 +00:00
aditya thomas	4856a87261	community[patch]: invoke callback prior to yielding token (llama.cpp) (#19392 ) Description: Invoke callback prior to yielding token for llama.cpp Issue: [Callback for on_llm_new_token should be invoked before the token is yielded by the model #16913](https://github.com/langchain-ai/langchain/issues/16913) Dependencies: None	2024-03-22 16:17:56 -04:00
ccurme	c4599444ee	mistralai: update tool calling (#19451 ) ```python from langchain.agents import tool from langchain_mistralai import ChatMistralAI llm = ChatMistralAI(model="mistral-large-latest", temperature=0) @tool def get_word_length(word: str) -> int: """Returns the length of a word.""" return len(word) tools = [get_word_length] llm_with_tools = llm.bind_tools(tools) llm_with_tools.invoke("how long is the word chrysanthemum") ``` currently raises ``` AttributeError: 'dict' object has no attribute 'model_dump' ``` Same with `.with_structured_output` ```python from langchain_mistralai import ChatMistralAI from langchain_core.pydantic_v1 import BaseModel class AnswerWithJustification(BaseModel): """An answer to the user question along with justification for the answer.""" answer: str justification: str llm = ChatMistralAI(model="mistral-large-latest", temperature=0) structured_llm = llm.with_structured_output(AnswerWithJustification) structured_llm.invoke("What weighs more a pound of bricks or a pound of feathers") ``` This appears to fix.	2024-03-22 16:03:48 -04:00
Erick Friis	cceaca3e4f	cookbook[patch]: add strip of quotes (#19452 )	2024-03-22 19:10:39 +00:00
ccurme	8a2528c34a	[langchain] fix OpenAIAssistantRunnable.create_assistant (#19081 ) - Description: OpenAI assistants support some pre-built tools (e.g., `"retrieval"` and `"code_interpreter"`) and expect these as `{"type": "code_interpreter"}`. This may have been upset by https://github.com/langchain-ai/langchain/pull/18935 - Issue: https://github.com/langchain-ai/langchain/issues/19057	2024-03-22 13:23:19 -04:00
Harrison Chase	b40c80007f	core[minor]: Add utility code to create tool examples (#18602 ) Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-03-22 13:17:40 -04:00
Erick Friis	53ac1ebbbc	mistralai[minor]: 0.1.0rc0, remove mistral sdk (#19420 )	2024-03-22 01:24:58 +00:00
William FH	e980c14d6a	core[patch]: allow "placeholder" type in from_messages tuples (#19152 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-21 22:09:24 +00:00
billytrend-cohere	f6bcd42421	community[patch]: Replace positional argument with text=text for cohere>=5 compatibility (#19407 ) - Description: Replace positional argument with text=text for cohere>=5 compatibility	2024-03-21 10:42:51 -07:00
enfeng	b20c2640da	anthropic[patch]: update base_url of anthropic (#18634 ) A small change ~ - [ ] update base_url: "package: langchain_anthropic" --------- Co-authored-by: yangenfeng <yangenfeng@xiaoniangao.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-03-20 21:04:55 -07:00
Erick Friis	a9cda536ad	openai[patch]: fix core min version (#19366 )	2024-03-20 15:38:29 -07:00
Erick Friis	0b20c098df	openai[patch]: fix name param (#19365 )	2024-03-20 22:22:09 +00:00
Erick Friis	f6c8700326	openai[patch]: release 0.1.0, message id and name support (#19363 )	2024-03-20 15:11:39 -07:00
Bagatur	3fa711dce0	experimental[patch]: Release 0.0.55 (#19353 )	2024-03-20 13:06:39 -07:00
Erick Friis	2bcd760c46	robocorp[patch]: run integration tests on release (#19358 )	2024-03-20 19:31:12 +00:00
Erick Friis	a031c183ae	robocorp[patch]: release 0.0.4 (#19357 )	2024-03-20 12:28:41 -07:00
Bagatur	d95ea3550e	langchain[patch]: Release 0.1.13 (#19351 )	2024-03-20 18:25:12 +00:00
Bagatur	b58b38769d	community[patch]: Release 0.0.29 (#19350 )	2024-03-20 18:09:48 +00:00
Bagatur	5d220975fc	core[patch]: Release 0.1.33 (#19348 )	2024-03-20 17:28:56 +00:00
Eugene Yurtsev	aa9ccca775	langchain[patch]: Add tests for indexing (#19342 ) This PR adds tests for the indexing API	2024-03-20 13:00:22 -04:00
William FH	68298cdc82	[Feat] Accept non-dict if only 1 prompt input variable (#19156 ) For prompt templates with only 1 variable (common in e.g., MessageGraph), it's convenient to wrap the incoming object in the variable before formatting. The downside of this, of course, would be that some number of invocations will successfully format when the user may have intended to format it properly before	2024-03-20 09:59:32 -07:00
mackong	d9396bdec1	langchain[patch]: add stop for various non-openai agents (#19333 ) * Description: add stop for various non-openai agents. * Issue: N/A * Dependencies: N/A --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-20 11:34:10 -04:00
Yudhajit Sinha	7d216ad1e1	community[patch]: Invoke callback prior to yielding token (titan_takeoff_pro) (#18624 ) ## PR title community[patch]: Invoke callback prior to yielding token ## PR message - Description: Invoke callback prior to yielding token in _stream_ method in llms/titan_takeoff_pro. - Issue: #16913 - Dependencies: None	2024-03-20 07:58:18 -07:00
Yudhajit Sinha	455a74486b	community[patch]: Invoke callback prior to yielding token (sparkllm) (#18625 ) ## PR title community[patch]: Invoke callback prior to yielding token ## PR message - Description: Invoke callback prior to yielding token in _stream_ method in llms/sparkllm. - Issue: #16913 - Dependencies: None	2024-03-20 07:57:53 -07:00
Yudhajit Sinha	5ac1860484	community[patch]: Invoke callback prior to yielding token (replicate) (#18626 ) ## PR title community[patch]: Invoke callback prior to yielding token ## PR message - Description: Invoke callback prior to yielding token in _stream_ method in llms/replicate. - Issue: #16913 - Dependencies: None	2024-03-20 07:57:27 -07:00
Yudhajit Sinha	9525e392de	community[patch]: Invoke callback prior to yielding token (pai_eas_endpoint) (#18627 ) ## PR title community[patch]: Invoke callback prior to yielding token ## PR message - Description: Invoke callback prior to yielding token in _stream_ method in llms/pai_eas_endpoint. - Issue: #16913 - Dependencies: None	2024-03-20 07:56:58 -07:00
Yudhajit Sinha	140f06e59a	community[patch]: Invoke callback prior to yielding token (openai) (#18628 ) ## PR title community[patch]: Invoke callback prior to yielding token ## PR message - Description: Invoke callback prior to yielding token in _stream_ method in llms/openai. - Issue: #16913 - Dependencies: None	2024-03-20 07:56:30 -07:00
Yudhajit Sinha	280a914920	community[patch]: Invoke callback prior to yielding token (ollama) (#18629 ) ## PR title community[patch]: Invoke callback prior to yielding token ## PR message - Description: Invoke callback prior to yielding token in _stream_ & _astream_ methods in llms/ollama. - Issue: #16913 - Dependencies: None	2024-03-20 07:56:09 -07:00
老阿張	9dfce56b31	docs: Fix typo in infino.ipynb (#18640 ) Description: "conquerer should be conqueror "? 🤔 Issue: Typo Dependencies: Nope Twitter handle: laoazhang	2024-03-20 07:51:58 -07:00
Christophe Bornet	00614f332a	community[minor]: Add InMemoryVectorStore (#19326 ) This is a basic VectorStore implementation using an in-memory dict to store the documents. It doesn't need any extra/optional dependency as it uses numpy which is already a dependency of langchain. This is useful for quick testing, demos, examples. Also it allows to write vendor-neutral tutorials, guides, etc...	2024-03-20 10:21:07 -04:00
Devesh Rahatekar	3c4529ac69	core: Updated docstring for RunnablePick (#18832 ) Description: : Updated the docstring for RunnablePick. Added Overview and an Example for RunnablePick class. Issue: : #18803	2024-03-20 13:54:42 +00:00
aditya thomas	e46419c851	docs: contribute / integrations code examples update (#19319 ) Description: Update to make the code examples consistent with the actual use Issue: Code examples were different from actual use in the LangChain code Dependencies: Changes on top of https://github.com/langchain-ai/langchain/pull/19294 Note: If these changes are acceptable, please merge them after https://github.com/langchain-ai/langchain/pull/19294.	2024-03-20 09:27:53 -04:00
Leonid Ganeline	8609afbd10	core[patch]: Update `messages` namespace to fix API reference docs (#19161 ) Classes and functions defined in __init__.py are not parsed into the API Reference. For example: - libs/core/langchain_core/messages/__init__.py : AnyMessage, MessageLikeRepresentation, get_buffer_string(), messages_from_dict(), ... Opinionated: __init__.py is not a typical place to define artifacts. Moved artifacts from __init__ into utils.py. Added `MessageLikeRepresentation` to __all__ since it is used outside of `messages`, for example, in `libs/core/langchain_core/language_models/base.py` Added `_message_from_dict` to __all__ since it is used outside of `messages`(???) I would add `message_from_dict` (without underscore) as an alias. Please, advise.	2024-03-20 09:25:09 -04:00
Christophe Bornet	4c2e887276	core: Simplify astream logic in BaseChatModel and BaseLLM (#19332 ) Covered by tests in `libs/core/tests/unit_tests/language_models/chat_models/test_base.py`, `libs/core/tests/unit_tests/language_models/llms/test_base.py` and `libs/core/tests/unit_tests/runnables/test_runnable_events.py`	2024-03-20 09:05:51 -04:00
Brace Sproul	40f846e65d	docs[minor]: Add chat model selection tabs component (#19296 ) <img width="1728" alt="image" src="https://github.com/langchain-ai/langchain/assets/46789226/45e70a92-c2ee-48c8-9964-100eed22687b">	2024-03-19 18:12:46 -07:00
Erick Friis	69e9610f62	openai[patch]: pass message name (#17537 )	2024-03-19 19:57:27 +00:00
Guangdong Liu	e5d7e455dc	splitters: Add ensure_ascii parameter (#18485 ) - Description: Add ensure_ascii parameter	2024-03-19 12:51:16 -07:00
Nithish Raghunandanan	7ad0a3f2a7	community: add Couchbase Vector Store (#18994 ) - Description: Added support for Couchbase Vector Search to LangChain. - Dependencies: couchbase>=4.1.12 - Twitter handle: @nithishr --------- Co-authored-by: Nithish Raghunandanan <nithishr@users.noreply.github.com>	2024-03-19 12:39:51 -07:00
Chris Papademetrious	305d74c67a	core: implement a batch_size parameter for CacheBackedEmbeddings (#18070 ) Description: Currently, `CacheBackedEmbeddings` computes vectors for all uncached documents before updating the store. This pull request updates the embedding computation loop to compute embeddings in batches, updating the store after each batch. I noticed this when I tried `CacheBackedEmbeddings` on our 30k document set and the cache directory hadn't appeared on disk after 30 minutes. The motivation is to minimize compute/data loss when problems occur: * If there is a transient embedding failure (e.g. a network outage at the embedding endpoint triggers an exception), at least the completed vectors are written to the store instead of being discarded. * If there is an issue with the store (e.g. no write permissions), the condition is detected early without computing (and discarding!) all the vectors. Issue: Implements enhancement #18026. Testing: I was unable to run unit tests; details in [this post](https://github.com/langchain-ai/langchain/discussions/15019#discussioncomment-8576684). --------- Signed-off-by: chrispy <chrispy@synopsys.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-19 18:55:43 +00:00
William FH	89af30807b	Permit function eval on llm data type (#19287 )	2024-03-19 11:53:50 -07:00
Jib	f8078e41e5	mongodb[patch]: Added scoring threshold to caching (#19286 ) ## Description Semantic Cache can retrieve noisy information if the score threshold for the value is too low. Adding the ability to set a `score_threshold` on cache construction can allow for less noisy scores to appear. - [x] Add tests and docs 1. Added tests that confirm the `score_threshold` query is valid. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-19 11:30:02 -07:00
Christophe Bornet	30e4a35d7a	community: Use langchain-astradb for AstraDB caches (#18419 ) - [x] Needs https://github.com/langchain-ai/langchain-datastax/pull/4 - [x] Needs a new release of langchain-astradb	2024-03-19 14:04:36 -04:00
Brace Sproul	17c62e0f3a	ci[minor]: Bump LC scripts package, add retry option (#19285 ) The `retryFailed` option will retry all failed links, once at a time with the goal of not triggering bot protection `microsoft.com` is now hard coded into the whitelist	2024-03-19 10:42:59 -07:00
Erick Friis	7eb376d5fc	docs: integration deprecation docs (#19283 )	2024-03-19 17:11:15 +00:00
Guangdong Liu	2c835baae4	code[patch]: Add in code documentation to core Runnable with_retry method (docs only) (#19192 ) - Description: Add in code documentation to core Runnable with_retry method (docs only) - Issue: #18804 @baskaryan @eyurtsev PTAL --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-03-19 12:52:29 -04:00
Eugene Yurtsev	4b3dd34544	core[patch]: Pass sync run manager for sync stream fallback in astream (#19280 ) This PR patches the fallback in chat models and language models to pass in the appropriate version of the run manager (sync vs. async)	2024-03-19 16:32:33 +00:00
Leonid Ganeline	d314acb2d5	core[patch]: Move `globals` to a module instead of a package (non breaking change) (#19159 ) Classes and functions defined in __init__.py are not parsed into the API Reference. For example: libs/core/langchain_core/globals/__init__.py : `set_verbose` `get_llm_cache`, `set_llm_cache`, ... And the whole `langchain_core.globals` namespace is not visible in the API Reference. The refactoring is just file renaming.	2024-03-19 12:29:12 -04:00
Al-Ekram Elahee Hridoy	50f93d86ec	core[minor]: Enhance cache flexibility in BaseChatModel (#17386 ) - Description: Enhanced the `BaseChatModel` to support an `Optional[Union[bool, BaseCache]]` type for the `cache` attribute, allowing for both boolean flags and custom cache implementations. Implemented logic within chat model methods to utilize the provided custom cache implementation effectively. This change aims to provide more flexibility in caching strategies for chat models. - Issue: Implements enhancement request #17242. - Dependencies: No additional dependencies required for this change. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-19 11:26:58 -04:00
HatsuneMK00	4761c09e94	docs: update slack toolkit ipynb in integration (#19219 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - PR message: - Description: Update the slack toolkit doc to use an agent that support multiple inputs. Using ReAct agent will cause a ValidationError when invoking the slack tools. This is because the agent return a string like `'{"channel": "C05LDF54S21", "message": "Hello, world!"}'` but the ReAct agent does not support multiple inputs. - Issue: This is related to this [Discussion#18083](https://github.com/langchain-ai/langchain/discussions/18083) - Dependencies: No dependencies required Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-03-19 10:39:09 -04:00
Zihong	ff31cc1648	experimental: update the notebook link of semantic chunk. (#19253 ) update the notebook link of semantic chunk.	2024-03-19 07:24:51 -04:00
Frederico Wu	f36418a5b0	langchain: creating assistants with file_ids (#19199 ) Changing OpenAIAssistantRunnable.create_assistant to send the `file_ids` parameter to openai.beta.assistants.create Co-authored-by: Frederico Wu <fred.diaswu@coxautoinc.com>	2024-03-18 21:34:03 -07:00
Vittorio Rigamonti	9b2f9ee952	community: VectorStore Infinispan, adding autoconfiguration (#18967 ) Description: this PR enable VectorStore autoconfiguration for Infinispan: if metadatas are only of basic types, protobuf config will be automatically generated for the user.	2024-03-18 21:33:45 -07:00
Max Jakob	6f544a6a25	elasticsearch: check for deployed models (#18973 ) When creating a new index, if we use a retrieval strategy that expects a model to be deployed in Elasticsearch, check if a model with this name is indeed deployed before creating an index. This lowers the probability to get into a state in which an index was created with a faulty model ID, which cannot be overwritten any more (the index has to manually be deleted).	2024-03-18 21:32:00 -07:00
gonvee	b82644078e	community: Add `keep_alive` parameter to control how long the model w… (#19005 ) Add `keep_alive` parameter to control how long the model will stay loaded into memory with Ollama。 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-19 04:29:01 +00:00
Anthony Shaw	bb0dd8f82f	docs: Embellish article on splitting by tokens with more examples and missing details (#18997 ) Description This PR adds some missing details from the "Split by tokens" page in the documentation. Specifically: - The `.from_tiktoken_encoder()` class methods for both the `CharacterTextSplitter` and `RecursiveCharacterTextSplitter` default to the old `gpt-2` encoding. I've added a comment to suggest specifying `model_name` or `encoding` - The docs didn't mention that the `from_tiktoken_encoder()` class method passes additional kwargs down to the constructor of the splitter. I only discovered this by reading the source code - Added an example of using the `.from_tiktoken_encoder()` class method with `RecursiveCharacterTextSplitter` which is the recommended approach for most scenarios above `CharacterTextSplitter` - Added a warning that `TokenTextSplitter` can split characters which have multiple tokens (e.g. 猫 has 3 cl100k_base tokens) between multiple chunks which creates malformed Unicode strings and should not be used in these situations. Side note: I think the default argument of `gpt2` for `.from_tiktoken_encoder()` should be updated? Twitter handle anthonypjshaw --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-18 21:28:17 -07:00
Roshan Santhosh	7afecec280	core: update _rm_titles to account for title argument name bug (#19036 ) Issue : For functions which have an argument with the name 'title', the convert_pydantic_to_openai_function generates an incorrect output and omits the argument all together. This is because the _rm_titles function removes all instances of the the key 'title' from the output. Description : Updates the _rm_titles function to check the presence of the 'type' key as well before removing the 'title' key. As the title key that we wish to omit always has a type key along with it. Potential gap if there is a function defined which has both title and key as argument names, in which case this would fail. Maybe we could set a filter on the function argument names and reject those with keyword argument names. No dependencies. Passed all tests. - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-18 21:25:06 -07:00
Harrison Chase	efcdf54edd	Josha91 fix docstring (#19249 ) Co-authored-by: Josha van Houdt <josha.van.houdt@sap.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-18 21:19:56 -07:00
Simon Stone	58c7687174	langchain: preserve document metadata in `FlashrankRerank` (#19148 ) Description: Preserves document metadata in `FlashrankRerank` - Issue: #19142 - Dependencies: None - Twitter handle: n/a --------- Co-authored-by: Simon Stone <simon.stone@dartmouth.edu>	2024-03-19 04:15:18 +00:00
Aaron Jimenez	bc648f6cfc	core: Updated docstring for Context class (#19079 ) - Description: Improves the docstring for `class Context` by providing an overview and an example. - Issue: #18803	2024-03-18 21:15:14 -07:00
Taqi Jaffri	044bc22acc	Community: Add mistral oss model support to azureml endpoints, plus configurable timeout (#19123 ) - Description: There was no formatter for mistral models for Azure ML endpoints. Adding that, plus a configurable timeout (it was hard coded before) - Dependencies: none - Twitter handle: @tjaffri @docugami	2024-03-18 21:10:42 -07:00
Kangmoon Seo	07de4abe70	core: Fix Exception handling in XMLOutputParser (#19126 ) - Description: - Exception handling in `XMLOutputParser` 1. Add Exception handling at `root = ET.fromstring(text)` // raises `ET.ParseError` 2. Fix Exception class (commonly uses in `BaseOutputParser` class) - AS-IS: raise `ValueError`, `ET.ParserError` without handling ```python # langchain_core/output_parsers/xml.py text = text.strip() if (text.startswith("<") or text.startswith("\n<")) and ( text.endswith(">") or text.endswith(">\n") ): root = ET.fromstring(text) return self._root_to_dict(root) else: raise ValueError(f"Could not parse output: {text}") ``` - TO-BE: raise `OutputParserException` ```python # langchain_core/output_parsers/xml.py text = text.strip() if (text.startswith("<") or text.startswith("\n<")) and ( text.endswith(">") or text.endswith(">\n") ): try: root = ET.fromstring(text) return self._root_to_dict(root) except ET.ParseError: raise OutputParserException(f"Could not parse output: {text}") else: raise OutputParserException(f"Could not parse output: {text}") ``` - Issue: #19107 - Dependencies: None	2024-03-18 21:08:32 -07:00
Hamza Muhammad Farooqi	24a0a4472a	Add docstrings for Clickhouse class methods (#19195 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-19 04:03:12 +00:00
Simon Stone	dc4ce82ddd	docs: fix import path for `FlashrankRerank` example notebook (#19146 ) Description: Fixes the import paths for the `FlashrankRerank` example notebook. Issue: #19139 Dependencies: None Twitter handle: n/a --------- Co-authored-by: Simon Stone <simon.stone@dartmouth.edu> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-18 21:03:00 -07:00
Saurav Kumar	bde199d128	Updating format of pip install (#19198 ) Thank you for contributing to LangChain! - [x] PR title: "Updating format of pip install in two files of docs/cookbook" - pip install is not reflecting properly in some of the files in cookbook - Example: [docs/expression_language/cookbook/sql_db](https://python.langchain.com/docs/expression_language/cookbook/sql_db) - [x] PR message: Updating format of pip install in two files of docs/cookbook - Description: a description of the change - Issue: #19197 - Note - let's do squash merge for the PR If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-19 04:01:24 +00:00
Rohit Gupta	785f8ab174	[langchain_community] milvus vectorstores upsert: add kwargs to make it use for other argument also (#19193 ) add kwargs in add_documents for upsert, to make it use for other argument also. Lets use this, it was unused as of now. - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. Co-authored-by: Rohit Gupta <rohit.gupta2@walmart.com>	2024-03-18 21:01:12 -07:00
Cycle	77868b1974	experimental: add buffer_size hyperparameter to SemanticChunker as in source video (#19208 ) add buffer_size hyperparameter which used in combine_sentences function	2024-03-19 03:54:20 +00:00
HowardChan	ae3c7f702c	docs:Make url as a markdown link (#19212 ) Description: same as the title Co-authored-by: ChenZhengHao <chenzhenghao@mail.teletraan.io>	2024-03-19 03:47:52 +00:00
Shotaro Sano	ca9c8c58ea	text-splitters, infra: fix `libs/langchain/dev.Dockerfile` so that the `text-splitter` directory is copied before poetry installation (#19214 ) ## Description This PR modifies the settings in `libs/langchain/dev.Dockerfile` to ensure that the `text-splitters` directory is copied before the poetry installation process begins. Without this modification, the `docker build` command fails for `dev.Dockerfile`, preventing the setup of some development environments, including `.devcontainer`. ## Bug Details ### Repro Run the following command: ```bash docker build -f libs/langchain/dev.Dockerfile . ``` ### Current Behavior The docker build command fails, raising the following error: ``` ... => [langchain-dev-dependencies 4/5] COPY libs/community/ ../community/ 0.4s => ERROR [langchain-dev-dependencies 5/5] RUN poetry install --no-interaction --no-ansi --with dev,test,docs 1.1s ------ > [langchain-dev-dependencies 5/5] RUN poetry install --no-interaction --no-ansi --with dev,test,docs: #13 0.970 #13 0.970 Directory ../text-splitters does not exist ------ executor failed running [/bin/sh -c poetry install --no-interaction --no-ansi --with dev,test,docs]: exit code: 1 ``` ### Expected Behavior The `docker build` command successfully completes without the poetry error. ### Analysis The error occurs because the `text-splitters` directory is not copied into the build environment, unlike the other packages under the `libs` directory. I suspect that the `COPY` setting was overlooked since `text-splitters` was separated in a recent PR. ## Fix Add the following lines to the `libs/langchain/dev.Dockerfile`: ```dockerfile # Copy the text-splitters library for installation COPY libs/text-splitters/ ../text-splitters/ ```	2024-03-18 20:45:35 -07:00
Guangdong Liu	c3310c5e7f	community: Fix Milvus got multiple values for keyword argument 'timeout' (#19232 ) - Description: Fix Milvus got multiple values for keyword argument 'timeout' - Issue: fix #18580 - @baskaryan @eyurtsev PTAL	2024-03-18 20:44:25 -07:00
Erick Friis	95904fe443	langchain[patch]: update base imports to core (#19248 ) still deprecated, but was misleading before	2024-03-19 03:17:07 +00:00
Asaf Joseph Gardin	21c45475c5	ai21[patch]: AI21 Labs bump SDK version (#19114 ) Description: Added support AI21 SDK version 2.1.2 Twitter handle: https://github.com/AI21Labs --------- Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-18 19:47:08 -07:00
daniel ung	edf9d1c905	templates: Added template for JaguarDB (#16757 ) - Description:: added langchain template for JaguarDB --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-19 02:36:24 +00:00
gustavo-yt	7c26ef88a1	templates: Add rag lantern template (#16523 ) Replace this entire comment with: - Description: Added a template for lantern rag usage. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-19 02:34:46 +00:00
Jib	516cc44b3f	langchain-mongodb: [test-fix] add explicit index_name setting on test vector creation (#19245 ) - Description: Tests fail to do value lookup because it does not specify the index name - Issue: the issue # Failing integration test - [x] Add tests and docs: Tests now pass - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-03-18 15:52:28 -07:00
Estephania Calvo Carvajal	94e58dd827	docs:Fix links to LangSmith docs on Evaluation page (#19210 ) (#19216 ) - Description: Same as the title - Issue: #19210	2024-03-18 22:27:43 +00:00
William FH	780337488e	[Enhancement] Add support for directly providing a run_id (#18990 ) The root run id (~trace id's) is useful for assigning feedback, but the current recommended approach is to use callbacks to retrieve it, which has some drawbacks: 1. Doesn't work for streaming until after the first event 2. Doesn't let you call other endpoints with the same trace ID in parallel (since you have to wait until the call is completed/started to use This PR lets you provide = "run_id" in the runnable config. Couple considerations: 1. For batch calls, we split the trace up into separate trees (to permit better rendering). We keep the provided run ID for the first one and generate a unique one for other elements of the batch. 2. For nested calls, the provided ID is ONLY used on the top root/trace. ### Example Usage ``` chain.invoke("foo", {"run_id": uuid.uuid4()}) ```	2024-03-18 15:03:04 -07:00
Jacob Lee	bd329e9aad	core[patch]: Add LLM output to message response_metadata (#19158 ) This will more easily expose token usage information. CC @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-18 13:58:32 -07:00
Erick Friis	6fa1438334	mongodb[patch]: release 0.1.2 (#19243 )	2024-03-18 13:35:45 -07:00
Leonid Ganeline	7de1d9acfd	community: `llms` imports fixes (#18943 ) Classes are missed in __all__ and in different places of __init__.py - BaichuanLLM - ChatDatabricks - ChatMlflow - Llamafile - Mlflow - Together Added classes to __all__. I also sorted __all__ list. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-18 20:24:40 +00:00
Anush	aee5138930	templates: update qdrant self query (#19218 ) ## Description This PR - Updates the Qdrant self-query template to reflect the recent updates. - Enables reading config values from `env` files as the README [mentions it](https://github.com/Anush008/langchain/tree/self-query-qdrant/templates/self-query-qdrant#environment-setup). Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-18 19:59:08 +00:00
Kenzie Mihardja	21f75991d4	deprecate community docugami loader (#19230 ) Thank you for contributing to LangChain! - [x] PR title: "community: deprecate DocugamiLoader" - [x] PR message: Deprecate the langchain_community and use the docugami_langchain DocugamiLoader --------- Co-authored-by: Kenzie Mihardja <kenzie28@cs.washington.edu>	2024-03-18 12:56:47 -07:00
Jib	ec026004cb	mongodb[patch]: Remove in-memory cache from cache abstractions (#18987 ) ## Description * In memory cache easily gets out of sync with the server cache, so we will remove it entirely to reduce the issues around invalidated caches. ## Dependencies None - [x] If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-18 19:44:34 +00:00
Jib	866d6408af	mongodb[patch]: Remove embedding retrieval from mongodb payload (#19035 ) ## Description Returning the embedding is not necessary in the vector search functionality unless specified as a debugging step. This change defaults the behavior such that the server _only_ returns the embedding key if explicitly requested, such as in the case of `max_marginal_relevance_search`. - [x] Add tests and docs: If you're adding a new integration, please include * Added `test_from_documents_no_embedding_return` - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-18 19:43:50 +00:00
Leonid Kuligin	366ba77459	core[minor]: moved fake llms and embeddings to core (#19226 ) - [ ] PR title: "core: moved fake llms and embeddings to core" - [ ] PR message: - Description: moved fake llms and embeddings to core"	2024-03-18 10:01:26 -07:00
Pengfei Jiang	514fe80778	community[patch]: add stop parameter support to volcengine maas (#19052 ) - Description: add stop parameter to volcengine maas model - Dependencies: no --------- Co-authored-by: 江鹏飞 <jiangpengfei.jiangpf@bytedance.com>	2024-03-17 01:58:50 +00:00
htaoruan	bcc771e37c	docs: ChatTongyi example error (#19013 )	2024-03-17 01:55:56 +00:00
Anubhav Madhav	9235dade90	docs: provided hyperlinks to text and fixed grammar (#19092 ) 1) Provided links to text in the prompt (Refer Page Link 1, Page Link 2 and Page Link 3) 2) Fixed Grammar in Considerations of Model I/O Concepts documentation page - Update concepts.mdx (Page Link 4) Issues are on the following pages: Page Link 1: https://python.langchain.com/docs/modules/model_io/concepts#prompttemplate Page Link 2: https://python.langchain.com/docs/modules/model_io/concepts#messageprompttemplate Page Link 3: https://python.langchain.com/docs/modules/model_io/concepts#chatprompttemplate Page Link 4: https://python.langchain.com/docs/modules/model_io/concepts#considerations Fix 1: Description: Fixed Grammar in Considerations of Model I/O Documentation Page Issue: "to work well with the model are you using" # "to work well with the model you are using" Dependencies: None Twitter handle: @Anubhav_Madhav (https://twitter.com/Anubhav_Madhav) Fix 2: Description: Provided links to text in the prompt (Refer Page Link 1, Page Link 2 and Page Link 3) Issue: links not provided # links have been provided to the text Dependencies: None Twitter handle: @Anubhav_Madhav (https://twitter.com/Anubhav_Madhav) baskaryan, efriis, eyurtsev, hwchase17. For Fix 1 Refer to the first word 'This" word in the image attached with this PR. PFA <img width="839" alt="Screenshot 2024-03-15 at 3 04 17 AM" src="https://github.com/langchain-ai/langchain/assets/42323737/94e8db16-249f-48c3-a1d1-dee8d36067fa"> If no one reviews your PR within a few days, please @-mention one of --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-17 01:37:42 +00:00
primate88	5aa68936e0	community: Fix import path for StreamingStdOutCallbackHandler example (#19170 ) - Description: - Updated the import path for `StreamingStdOutCallbackHandler` in the streaming response example within `huggingface_endpoint.py`. This change corrects the import statement to reflect the actual location of `StreamingStdOutCallbackHandler` in `langchain_core.callbacks.streaming_stdout`. - Issue: - None - Dependencies: - No additional dependencies are required for this change. - Twitter handle: - None ## Note: I have tested this change locally and confirmed that the `StreamingStdOutCallbackHandler` works as expected with the updated import path. This PR does not require the addition of new tests since it is a correction to documentation/examples rather than functional code.	2024-03-17 00:50:37 +00:00
Bagatur	611d5a1618	openai[patch]: fix async http client (#19164 ) Fix #19116	2024-03-16 17:50:22 -07:00
Nikhil Kumar	635b3372bd	community[minor]: Add support for translation in HuggingFacePipeline (#19190 ) - [x] Support for translation: "community: Add support for translation in `HuggingFacePipeline`" - [x] Add support for translation in `HuggingFacePipeline`: - Description: Add support for translation in `HuggingFacePipeline`, which earlier used to support only text summarization and generation. - Issue: N/A - Dependencies: N/A - Twitter handle: None	2024-03-17 00:48:13 +00:00
Nikhil Kumar	a1b26dd9b6	docs: Add docs for RouterRunnable (#19191 ) - [x] Docs for `RouterRunnable`: core: Add docs for `RouterRunnable` - [x] Add docs for `RouterRunnable`: - Description: Add docs for `RouterRunnable`, which was previously missing documentation - Issue: #18803 - Dependencies: N/A - Twitter handle: None	2024-03-17 00:48:00 +00:00
k.muto	8d2c34e655	community: Fix all page numbers were the same for _BaseGoogleVertexAISearchRetriever (#19175 ) - Description: - This pull request is to fix a bug where page numbers were not set correctly. In the current code, all chunks share the same metadata object doc_metadata, so the page number is set with the same value for all documents. To fix this, I changed to using separate metadata objects for each chunk. - Issue: - None - Dependencies: - No additional dependencies are required for this change. - Twitter handle: - @eycjur - Test - Even if it's not a bug, there are cases where everything ends up with the same number of pages, so it's very difficult for me to write integration tests.	2024-03-16 22:28:56 +00:00
Matt Frediani	160a7077b0	Update README.md (#19172 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-16 15:23:25 -07:00
inpyeong	7c092f479f	docs: Update why.ipynb (#19173 ) I think that cell type for pip command may be 'code'. Please check, thank you :) If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-16 22:21:51 +00:00
Vitalii Korsakov	d96e0b2de7	docs: Remove duplicated line in Get Started section (#19182 ) Line `from langchain_openai import ChatOpenAI` is put twice in Get Started / Serving with LangServe section. Imports on lines 559 and 566 are identical Co-authored-by: Vitalii <vitalii@localhost>	2024-03-16 22:21:25 +00:00
Cailin Wang	7cd87d2f6a	community: Add `partition` parameter to DashVector (#19023 ) Description: DashVector Add partition parameter Twitter handle: @CailinWang_ --------- Co-authored-by: root <root@Bluedot-AI>	2024-03-16 15:20:30 -07:00
Rodrigo Nogueira	e64cf1aba4	community: Add model argument for maritalk models and better error handling (#19187 )	2024-03-16 15:18:56 -07:00
samanhappy	ff94f86ce1	docs: fix link to interface TextSplitter (#19177 )	2024-03-16 15:16:34 -07:00
Sergey Kozlov	1a55e950aa	community[patch]: support fastembed v1 and v2 (#19125 ) Description: #18040 forces `fastembed>2.0`, and this causes dependency conflicts with the new `unstructured` package (different `onnxruntime`). There may be other dependency conflicts.. The only way to use `langchain-community>=0.0.28` is rollback to `unstructured 0.10.X`. But new `unstructured` contains many fixes. This PR allows to use both `fastembed` `v1` and `v2`. How to reproduce: `pyproject.toml`: ```toml [tool.poetry] name = "depstest" version = "0.0.0" description = "test" authors = ["<dev@example.org>"] [tool.poetry.dependencies] python = ">=3.10,<3.12" langchain-community = "^0.0.28" fastembed = "^0.2.0" unstructured = {extras = ["pdf"], version = "^0.12"} ``` ```bash $ poetry lock ``` Co-authored-by: Sergey Kozlov <sergey.kozlov@ludditelabs.io>	2024-03-15 18:33:51 -07:00
six17	fd4f536c77	text-splitters[patch]: fix json split of RecursiveJsonSplitter (#19119 ) - Description: This modification addresses the issue of mutable default parameters in functions. In the original code, the `chunks` parameter is defaulted to a list containing an empty dictionary, which is mutable. Since default parameters in Python are evaluated only once at function definition time, modifications to the parameter would persist across future calls. By changing the default to `None` and checking/initializing within the function, a new list is created for each call, thus avoiding potential issues. --------- Co-authored-by: sixiang <sixiang@lixiang.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-15 16:46:49 -07:00
aditya thomas	05008c4f94	docs: update stale links in Together AI documentation (#19011 ) Description: Update stales link in Together AI documentation Issue: Some links pointed to legacy webpages on the Together AI website Dependencies: None Lint and test: `make format`, `make lint` were run	2024-03-15 16:38:04 -07:00
aditya thomas	80eb510a7b	docs: update docstring of Together class (#19008 ) Description: Update docstring of Together class to show example and update API URL Issue: Improves usability Dependencies: None Lint and test: `make format`, `make lint` and `make test` were run	2024-03-15 16:30:45 -07:00
高远	ef9813dae6	docs: add vikingdb docstrings(#19016 ) Co-authored-by: gaoyuan <gaoyuan.20001218@bytedance.com>	2024-03-15 16:29:29 -07:00
wulixuan	0e0030f494	community[patch]: fix yuan2 chat model errors while invoke. (#19015 ) 1. fix yuan2 chat model errors while invoke. 2. update related tests. 3. fix some deprecationWarning.	2024-03-15 16:28:36 -07:00
Shuai Liu	c244e1a50b	community[patch]: Fixed bug in merging `generation_info` during chunk concatenation in Tongyi and ChatTongyi (#19014 ) - Description: In #16218 , during the `GenerationChunk` and `ChatGenerationChunk` concatenation, the `generation_info` merging changed from simple keys & values replacement to using the util method [`merge_dicts`](https://github.com/langchain-ai/langchain/blob/master/libs/core/langchain_core/utils/_merge.py): ![image](https://github.com/langchain-ai/langchain/assets/2098020/10f315bf-7fe0-43a7-a0ce-6a3834b99a15) The `merge_dicts` method could not handle merging values of `int` or some other types, and would raise a [`TypeError`](https://github.com/langchain-ai/langchain/blob/master/libs/core/langchain_core/utils/_merge.py#L55). This PR fixes this issue in the Tongyi and ChatTongyi Model by adopting the `generation_info` of the last chunk and discarding the `generation_info` of the intermediate chunks, ensuring that `stream` and `astream` function correctly. - Issue: - Related issues or PRs about Tongyi & ChatTongyi: #16605, #17105 - Other models or cases: #18441, #17376 - Dependencies: No new dependencies	2024-03-15 16:27:53 -07:00
wulixuan	f79d0cb9fb	docs: update docs for yuan2 in LLMs and Chat models integration. (#19028 ) update yuan2.0 notebook in LLMs and Chat models. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-03-15 16:03:18 -07:00
Taraka Nithin Vankala	eec023766e	docs: Corrected error (#19030 ) - [ ] PR title: "docs: correction in "https://github.com/langchain-ai/langchain/blob/master/docs/docs/get_started/quickstart.mdx", line 289". - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: - Corrected the spelling mistake - #18981	2024-03-15 16:02:33 -07:00
Christophe Bornet	f2a7dda4bd	community[patch]: Use langchain-astradb for AstraDB doc loader (#19071 ) Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-15 22:57:25 +00:00
Leonid Ganeline	a49ac55964	docs: `providers` update 8 (#19053 ) Added missed providers. Added missed integrations. Fixed format.	2024-03-15 15:49:14 -07:00
Holt Skinner	cee03630d9	community[patch]: Add Blended Search Support to `GoogleVertexAISearchRetriever` (#19082 ) https://cloud.google.com/generative-ai-app-builder/docs/create-data-store-es#multi-data-stores --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-15 22:39:31 +00:00
Eugene Yurtsev	0ddfe7fc9d	langchain[patch]: make hub work with older langchainhub versions (#19076 ) Make it work with older clients	2024-03-15 15:37:52 -07:00
William W Wang	0a784074d1	docs: Update llm_caching.ipynb (#19085 )	2024-03-15 22:35:48 +00:00
William W Wang	6327be9048	docsUpdate azure_cosmos_db.ipynb (#19087 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-15 22:33:26 +00:00
Anubhav Madhav	553a520ab6	docs: Fixed Grammar in Considerations of Model I/O Concepts (#19091 ) Fixed Grammar in Considerations of Model I/O Concepts documentation page - Update concepts.mdx Page Link: https://python.langchain.com/docs/modules/model_io/concepts#considerations - Description: Fixed Grammar in Considerations of Model I/O Documentation Page - Issue: "to work well with the model are you using" # "to work well with the model you are using" - Dependencies: None - Twitter handle: @Anubhav_Madhav (https://twitter.com/Anubhav_Madhav) If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-15 22:31:39 +00:00
Shotaro Sano	d647ff1a9a	docs: Fix execution results of `docs/docs/modules/data_connection/indexing.ipynb` (#19112 ) ## Description This PR addresses a documentation issue in the [Indexing](https://python.langchain.com/docs/modules/data_connection/indexing) page. Specifically, it corrects the execution results of the Jupyter notebook under the [Source](https://python.langchain.com/docs/modules/data_connection/indexing#source) section, which were broken as detailed below. ## Problem The execution results following the statement, `This should delete the old versions of documents associated with doggy.txt source and replace them with the new versions.`, appear to be incorrect, as described below. ### Current Behavior - For some reason, the `index` function fails to add the new content of `doggy.txt`. Although it deletes the document objects associated with the `doggy.txt` source, it does not add the objects in `changed_doggy_docs`. Consequently, the execution result displays `num_added: 0`. - This unexpected behavior also impacts the results of `vectorstore.similarity_search("dog", k=30)`, showing only the contents of `kitty.txt`. It appears as though the contents of `doggy.txt` have been completely removed from the index: ``` Document(page_content='tty kitty', metadata={'source': 'kitty.txt'}), Document(page_content='tty kitty ki', metadata={'source': 'kitty.txt'}), Document(page_content='kitty kit', metadata={'source': 'kitty.txt'})] ``` ### Expected Behavior - The `index` function should successfully add the objects in `changed_doggy_docs` after removing the old content of `doggy.txt`. The anticipated execution result is `num_added: 2`. - Subsequently, the modified content of `doggy.txt` should appear in the results of `vectorstore.similarity_search("dog", k=30)` as follows: ``` [Document(page_content='woof woof', metadata={'source': 'doggy.txt'}), Document(page_content='woof woof woof', metadata={'source': 'doggy.txt'}), Document(page_content='tty kitty', metadata={'source': 'kitty.txt'}), Document(page_content='tty kitty ki', metadata={'source': 'kitty.txt'}), Document(page_content='kitty kit', metadata={'source': 'kitty.txt'})] ``` ## Fix I reran `docs/docs/modules/data_connection/indexing.ipynb` and have included the diff in this PR.	2024-03-15 22:27:15 +00:00
case-k	ebc4a64f9e	docs: fix databricks document url (#19096 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-15 22:25:11 +00:00
Guangdong Liu	4468e5bdbe	docs: Add in code documentation to core Runnable with_fallbacks method (docs only) (#19104 ) - Description: [a description of the change] Add in code documentation to core Runnable with_fallbacks method (docs only) - Issue: the issue #18804 @eyurtsev PTAL	2024-03-15 15:21:10 -07:00
Guangdong Liu	cced3eb9bc	community[patch]: Fix sparkllm embeddings api bug. (#19122 ) - Description: Fix sparkllm embeddings api bug. @baskaryan PTAL	2024-03-15 15:08:49 -07:00
samanhappy	b9c62fb905	docs: fix API link for BaseLoader (#19128 ) The link to the BaseLoader API requires an update as it has been moved into the `langchain_core` package.	2024-03-15 14:46:05 -07:00
kaijietti	c20aeef79a	community[patch]: implement qdrant _aembed_query and use it in other async funcs (#19155 ) `amax_marginal_relevance_search ` and `asimilarity_search_with_score ` should use an async version of `_embed_query `.	2024-03-15 21:20:12 +00:00
Kostas Botsas	527676a753	docs: Fix source column xata.ipynb (#19137 ) Docs fix: replace column name search with source. The Xata integration expects metadata column named "source". The docs suggest the name "search", which if used, yields the following error: ``` File "/usr/local/lib/python3.11/site-packages/langchain_community/vectorstores/xata.py", line 95, in _add_vectors raise Exception(f"Error adding vectors to Xata: {r.status_code} {r}") Exception: Error adding vectors to Xata: 400 {'errors': [{'status': 400, 'message': 'invalid record: column [source]: column not found'}]} ```	2024-03-15 14:06:18 -07:00
Barun Amalkumar Halder	34d6f0557d	community[patch] : publishes duration as milliseconds to Fiddler (#19166 ) Description: Many LLM steps complete in sub-second duration, which can lead to non-collection of duration field for Fiddler. This PR updates duration from seconds to milliseconds. Issue: [INTERNAL] FDL-17568 Dependencies: NA Twitter handle: behalder Co-authored-by: Barun Halder <barun@fiddler.ai>	2024-03-15 14:04:56 -07:00
Eugene Yurtsev	745d2476a2	langchain: upgrade mypy (#19163 ) Update mypy in langchain	2024-03-15 16:37:09 -04:00
Maxime Perrin	aa785fa6ec	core[minor]: allow LLMs async streaming to fallback on sync streaming (#18960 ) - Description: Handling fallbacks when calling async streaming for a LLM that doesn't support it. - Issue: #18920 - Twitter handle:@maximeperrin_ --------- Co-authored-by: Maxime Perrin <mperrin@doing.fr>	2024-03-15 16:06:50 -04:00
Erick Friis	caf47ab666	infra: run min version ci before integration tests (#18945 )	2024-03-15 12:14:44 -07:00
Barun Amalkumar Halder	b551d49cf5	community[patch] : adds feedback and status for Fiddler callback handler events (#19157 ) Description: This PR adds updates the fiddler events schema to also pass user feedback, and llm status to fiddler Tickets: [INTERNAL] FDL-17559 Dependencies: NA Twitter handle: behalder Co-authored-by: Barun Halder <barun@fiddler.ai>	2024-03-15 12:03:49 -07:00
Juan Felipe Arias	f5b9aedc48	community[patch]: add args_schema to sql_database tools for langGraph integration (#18595 ) - Description: This modification adds pydantic input definition for sql_database tools. This helps for function calling capability in LangGraph. Since actions nodes will usually check for the args_schema attribute on tools, This update should make these tools compatible with it (only implemented on the InfoSQLDatabaseTool) - Issue: N/A - Dependencies: N/A - Twitter handle: juanfe8881	2024-03-15 19:03:36 +00:00
fengjial	c922ea36cb	community[minor]: Add Baidu VectorDB as vector store (#17997 ) Co-authored-by: fengjialin <fengjialin@MacBook-Pro.local>	2024-03-15 19:01:58 +00:00
aditya thomas	190887c5cd	docs: update the list of providers (#19012 ) Description: Update the list of LangChain providers Issue: Make the list of LangChain providers current Dependencies: None	2024-03-15 12:00:24 -07:00
Erick Friis	bbe164ad28	docs: voyageai as provider (#19154 )	2024-03-15 10:12:37 -07:00
Erick Friis	781aee0068	community, langchain, infra: revert store extended test deps outside of poetry (#19153 ) Reverts langchain-ai/langchain#18995 Because it makes installing dependencies in python 3.11 extended testing take 80 minutes	2024-03-15 17:10:47 +00:00
Leonid Kuligin	e3ff107e4f	docs: updated google integration related imports in the documentation (#19131 ) updated imports in the documentation for google vertex	2024-03-15 09:30:50 -04:00
Erick Friis	9e569d85a4	community, langchain, infra: store extended test deps outside of poetry (#18995 ) poetry can't reliably handle resolving the number of optional "extended test" dependencies we have. If we instead just rely on pip to install extended test deps in CI, this isn't an issue.	2024-03-15 05:55:30 +00:00
Bagatur	191ddbc77e	core[patch]: rc release 0.1.33-rc.1 (#19103 )	2024-03-14 20:21:54 -07:00
Nuno Campos	508f75853c	core[patch]: Change structured prompt lc id to match js (#19099 )	2024-03-14 20:02:52 -07:00
Erick Friis	7ce81eb6f4	voyageai[patch]: init package (#19098 ) Co-authored-by: fodizoltan <zoltan@conway.expert> Co-authored-by: Yujie Qian <thomasq0809@gmail.com> Co-authored-by: fzowl <160063452+fzowl@users.noreply.github.com>	2024-03-15 00:56:10 +00:00
Brace Sproul	5157b15446	ci[patch]: Set root dir to ./docs (#19102 )	2024-03-14 17:55:04 -07:00
Brace Sproul	98cd8f673b	docs[minor]ci[minor]: Add script & CI to check recurring links daily (#19100 )	2024-03-14 17:42:22 -07:00
Asaf Joseph Gardin	4d7f6fa968	ai21[patch]: AI21 Labs Batch Support in Embeddings (#18633 ) Description: Added support for batching when using AI21 Embeddings model Twitter handle: https://github.com/AI21Labs --------- Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-14 23:10:23 +00:00
Tomaz Bratanic	321db89e87	templates: Switch neo4j generation template to LLMGraphTransformer (#19024 )	2024-03-14 16:00:42 -07:00
Erick Friis	d5cf360329	ibm[patch]: release 0.1.3 (#19094 )	2024-03-14 15:59:42 -07:00
Mateusz Szewczyk	b15d150d22	ibm[patch]: add async tests, add tokenize support (#18898 ) - Description: add async tests, add tokenize support - Dependencies: [ibm-watsonx-ai](https://pypi.org/project/ibm-watsonx-ai/), - Tag maintainer: Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally -> ✅ Please make sure integration_tests passing locally -> ✅ --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-14 22:57:05 +00:00
billytrend-cohere	7253b816cc	community: Add support for cohere SDK v5 (keeps v4 backwards compatibility) (#19084 ) - Description: Add support for cohere SDK v5 (keeps v4 backwards compatibility) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-14 15:53:24 -07:00
Eugene Yurtsev	06165efb5b	core[patch]: RunnablePassthrough transform to autoupgrade to AddableDict (#19051 ) Follow up on https://github.com/langchain-ai/langchain/pull/18743 which missed RunnablePassthrough Issues: https://github.com/langchain-ai/langchain/issues/18741 https://github.com/langchain-ai/langgraph/issues/136 https://github.com/langchain-ai/langserve/issues/504	2024-03-14 16:59:46 -04:00
Eugene Yurtsev	41e2f60cd2	Updated security policy (#19089 ) Updated security policy	2024-03-14 20:58:47 +00:00
Eugene Yurtsev	6cdca4355d	community[minor]: Revamp PGVector Filtering (#18992 ) This PR makes the following updates in the pgvector database: 1. Use JSONB field for metadata instead of JSON 2. Update operator syntax to include required `$` prefix before the operators (otherwise there will be name collisions with fields) 3. The change is non-breaking, old functionality is still the default, but it will emit a deprecation warning 4. Previous functionality has bugs associated with comparisons due to casting to text (so lexical ordering is used incorrectly for numeric fields) 5. Adds an a GIN index on the JSONB field for more efficient querying	2024-03-14 16:56:00 -04:00
Bagatur	e276817e1d	docs: fix vercel build script (#19090 ) amazon linux 2023 doesn't have `amazon-linux-extras` but shoudl have python3.9 by default	2024-03-14 20:53:43 +00:00
Guangdong Liu	d4b025c812	code[patch]: Add in code documentation to core Runnable assign method (docs only) (#18951 ) PR message: *Delete this entire checklist* and replace with - Description: [a description of the change](docs: Add in code documentation to core Runnable assign method) - Issue: the issue #18804	2024-03-14 15:41:19 -04:00
Anthony Yang	688a5bd106	docs:fixed typo in streaming document (#19045 ) Fixed typo in line 661 - from 'mimimize' to 'minimize - [ ] PR message: - Description: Fixed typo in streaming document - change 'mimimize' to 'minimize If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-14 19:38:53 +00:00
Bagatur	573f48e34d	core[patch]: Release 0.1.32 (#19088 )	2024-03-14 12:01:58 -07:00
YHW	69a8ef2693	core: Runnable pass kwargs to _astream_log_implementation in astream_log (#19055 ) - Description: When calling the `_stream_log_implementation` from the `astream_log` method in the `Runnable` class, it is not handing over the `kwargs` argument. Therefore, even if i want to customize APIHandler and implement additional features with additional arguments, it is not possible. Conversely, the `astream_events` method normally handing over the `kwargs` argument. - Issue: https://github.com/langchain-ai/langchain/issues/19054 - Dependencies: - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! Co-authored-by: hyungwookyang <hyungwookyang@worksmobile.com>	2024-03-14 14:39:46 -04:00
Nuno Campos	751fb7de20	Add new beta StructuredPrompt (#19080 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-14 10:40:34 -07:00
Bagatur	0ae39ab30e	docs: make links internal (#19063 ) So they can be properly link checked	2024-03-14 16:22:56 +00:00
Anton Parkhomenko	ae73b9d839	community[patch]: Fix NotionDBLoader 400 Error by conditionally adding filter parameter (#19075 ) - Description: This change fixes a bug where attempts to load data from Notion using the NotionDBLoader resulted in a 400 Bad Request error. The issue was traced to the unconditional addition of an empty 'filter' object in the request payload, which Notion's API does not accept. The modification ensures that the 'filter' object is only included in the payload when it is explicitly provided and not empty, thus preventing the 400 error from occurring. - Issue: Fixes [#18009](https://github.com/langchain-ai/langchain/issues/18009) - Dependencies: None - Twitter handle: @gunnzolder Co-authored-by: Anton Parkhomenko <anton@merge.rocks>	2024-03-14 13:56:57 +00:00
Erick Friis	2999d06938	docs: deprecate old airbyte loader docs (#19048 )	2024-03-13 23:18:30 +00:00
Prakul	4c53e31377	docs: Updated index definition and reference to LangChain-MongoDB (#19047 ) Description: Updates to LangChain-MongoDB documentation: updates to the Atlas vector search index definition Issue: NA Dependencies: NA Twitter handle: iprakul	2024-03-13 15:44:13 -07:00
Erick Friis	5e0c58f9c2	infra: update upload-artifact and download-artifact to v4 (#19044 )	2024-03-13 20:08:29 +00:00
Tomaz Bratanic	e5e15c8d59	docs: Add graph construction docs (#18904 )	2024-03-13 12:27:58 -07:00
Nuno Campos	2b7c3c548d	core[minor]: Add Runnable.batch_as_completed (#17603 ) This PR adds `batch as completed` method to the standard Runnable interface. It takes in a list of inputs and yields the corresponding outputs as the inputs are completed.	2024-03-13 11:18:02 -07:00
Erick Friis	71d0981f18	templates: fix rag-lancedb dep (#19010 )	2024-03-13 04:36:24 +00:00
Erick Friis	74b2c0aa01	templates, cli: more security deps (#19006 )	2024-03-12 20:48:56 -07:00
Erick Friis	9052d05442	template: bump more lockfiles (#19003 ) - templates: bump lockfile deps - x	2024-03-13 01:43:33 +00:00
Erick Friis	49f3cc0f6b	templates: bump lockfile deps (#19001 )	2024-03-13 01:25:45 +00:00
Erick Friis	2ffb2144a6	experimental[patch]: release 0.0.54 (#19000 )	2024-03-13 00:38:46 +00:00
Erick Friis	873d06c009	langchain[patch]: release 0.1.12 (#18999 )	2024-03-13 00:22:21 +00:00
Leonid Ganeline	9c8523b529	community[patch]: flattening imports 3 (#18939 ) @eyurtsev	2024-03-12 15:18:54 -07:00
Erick Friis	af50f21765	community[patch]: release 0.0.28 (#18993 )	2024-03-12 21:55:29 +00:00
Erick Friis	4881bb669c	core[patch]: release 0.1.31 (#18989 )	2024-03-12 19:45:21 +00:00
Erick Friis	a29e8d8594	elasticsearch[patch]: fix integration tests for release (#18980 )	2024-03-12 10:22:07 -07:00
Erick Friis	0d1f6c417c	elasticsearch[patch]: release 0.1.1 (#18978 )	2024-03-12 16:46:22 +00:00
Max Jakob	911ccf9aa6	docs: elasticsearch retriever (#18965 ) Add documentation notebook for `ElasticsearchRetriever`. ## Dependencies - [ ] Release new `langchain-elasticsearch` version 0.2.0 that includes `ElasticsearchRetriever`	2024-03-12 09:42:36 -07:00
Dobiichi-Origami	471f2ed40a	community[patch]: re-arrange the addtional_kwargs of returned qianfan structure to avoid _merge_dict issue (#18889 ) fix issue: https://github.com/langchain-ai/langchain/issues/18441 PTAL, thanks @baskaryan, @efriis, @eyurtsev, @hwchase17. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-12 05:43:56 +00:00
Naman Jain	75122646b5	core[patch]: fixed circular dependency with json schema (#18657 ) Description: Circular dependencies when parsing references leading to `RecursionError: maximum recursion depth exceeded` issue. This PR address the issue by handling previously seen refs as in any typical DFS to avoid infinite depths. Issue: https://github.com/langchain-ai/langchain/issues/12163 Twitter handle: https://twitter.com/theBhulawat - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-03-12 05:42:45 +00:00
Tymofii	0bec1f6877	commnity[patch]: refactor code for faiss vectorstore, update faiss vectorstore documentation (#18092 ) Description: Refactor code of FAISS vectorcstore and update the related documentation. Details: - replace `.format()` with f-strings for strings formatting; - refactor definition of a filtering function to make code more readable and more flexible; - slightly improve efficiency of `max_marginal_relevance_search_with_score_by_vector` method by removing unnecessary looping over the same elements; - slightly improve efficiency of `delete` method by using set data structure for checking if the element was already deleted; Issue: fix small inconsistency in the documentation (the old example was incorrect and unappliable to faiss vectorstore) Dependencies: basic langchain-community dependencies and `faiss` (for CPU or for GPU) Twitter handle: antonenkodev	2024-03-11 22:33:03 -07:00
Roshan Santhosh	acf1ecc081	langchain[patch]: update llm_router.py (#18865 ) Issue : _call method of LLMRouterChain uses predict_and_parse, which is slated for deprecation. Description : Instead of using predict_and_parse, this replaces it with individual predict and parse functions.	2024-03-11 22:30:07 -07:00
Bagatur	18de77cc8c	core[minor]: add streaming support to OAI tool parsers (#18940 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-11 21:53:56 -07:00
Bagatur	e0e688a277	core[minor]: generation info on msg (#18592 ) related to #16403 #17188	2024-03-12 04:43:17 +00:00
Tomaz Bratanic	cda43c5a11	experimental[patch]: Fix LLM graph transformer default prompt (#18856 ) Some LLMs do not allow multiple user messages in sequence.	2024-03-11 20:11:52 -07:00
Bagatur	19721246f5	core[patch]: support labeled json schema as tools (#18935 )	2024-03-11 19:51:35 -07:00
Jacob Lee	950ab056eb	templates[patch]: Update pirate-speak deps, add messages placeholder (#18949 ) CC @efriis	2024-03-11 19:20:30 -07:00
Leonid Ganeline	fad308a764	docs: `providers` update 2 (#18407 ) Formatted pages into a consistent form. Added descriptions and links when needed.	2024-03-11 18:35:37 -07:00
Erick Friis	239f0a615e	templates: redis multi-modal multi-vector rag (#18946 ) --------- Co-authored-by: Tyler Hutcherson <tyler.hutcherson@redis.com>	2024-03-12 00:32:25 +00:00
Bagatur	915c1f8673	infra: rm api build CI (#18944 )	2024-03-11 16:12:34 -07:00
Brace Sproul	578e67c017	docs[patch]: properly load/use env vars (#18942 )	2024-03-11 15:38:05 -07:00
Erick Friis	0d888a65cb	core[patch]: move some attr/methods to BaseLanguageModel (#18936 ) Cleans up some shared code between `BaseLLM` and `BaseChatModel`. One functional difference to make it more consistent (see comment)	2024-03-11 14:59:45 -07:00
Brace Sproul	4ff6aa5c78	docs[minor]: Swap gtag for supabase (#18937 ) Added deps: - `@supabase/supabase-js` - for sending inserts - `supabase` - dev dep, for generating types via cli - `dotenv` for loading env vars Added script: - `yarn gen` - will auto generate the database schema types using the supabase CLI. Not necessary for development, but is useful. Requires authing with the supabase CLI (will error out w/ instructions if you're not authed). Added functionality: - pulls users IP address (using a free endpoint: `https://api.ipify.org` so we can filter out abuse down the line) TODO: - [x] add env vars to vercel	2024-03-11 14:23:12 -07:00
aditya thomas	5c2f7e6b2b	partners[openai]: update the docstring of OpenAI, OpenAIEmbeddings and ChatOpenAI classes (#18908 ) Description: Update the docstring of OpenAI, OpenAIEmbeddings and ChatOpenAI classes Issue: Update import module paths to the current LangChain API Dependencies: None Lint and test: `make format` and `make lint` were run This incorporates the review comments from langchain-ai/langchain#18637 which I closed due to an issue I had in updating that pr branch --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-11 20:48:54 +00:00
Leonid Ganeline	11195cfa42	community[patch]: speed up import times in the community package (#18928 ) This PR speeds up import times in the community package	2024-03-11 16:37:36 -04:00
fjk	a7fc731720	docs: change sparkllm spark_app_url to spark_api_url (#18000 ) community: fix - change sparkllm spark_app_url to spark_api_url - Description: - Change the variable name from `sparkllm spark_app_url` to `spark_api_url` in the community package. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-11 20:01:30 +00:00
Sevin F. Varoglu	8639624d40	docs: update OctoAI doc (#18913 ) This PR updates the OctoAI LLM doc.	2024-03-11 13:01:10 -07:00
Alexander Kozlov	a7500ab0fb	docs: Update huggingface pipelines notebook (#18801 )	2024-03-11 20:00:31 +00:00
Conroy Whitney	96d7fe0f85	docs: Change saved/configured chain variable name (#18863 ) Description: Variable name was `openai_poem` but it didn't pass in the `"prompt": "poem"` config, so the examples were showing a joke being returned from a variable called `_poem`. We could have gone one of two ways: 1. Updating the config line and the output line, or 2. Updating the variable name The latter seemed simpler, so that's what I went with. But I'd be glad to re-do this PR if you prefer the former. Thanks for everything, y'all. You rock 🤘 Issue:* N/A Dependencies: N/A Twitter handle: `conroywhitney`	2024-03-11 12:59:24 -07:00
aditya thomas	8544f748f2	community[patch]: update AnthropicLLM deprecation message (#18869 ) Description: Update AnthropicLLM deprecation message import path for ChatAnthropic Issue: Incorrect import path in deprecation message Dependencies: None Lint and test: `make format`, `make lint` and `make test` were run	2024-03-11 12:59:10 -07:00
Virat Singh	cafffe8a21	community: Add PolygonAggregates tool (#18882 ) Description: In this PR, I am adding a `PolygonAggregates` tool, which can be used to get historical stock price data (called aggregates by Polygon) for a given ticker. Polygon [docs](https://polygon.io/docs/stocks/get_v2_aggs_ticker__stocksticker__range__multiplier___timespan___from___to) for this endpoint. Twitter: [@virattt](https://twitter.com/virattt)	2024-03-11 11:58:10 -07:00
Bagatur	2d172181e0	Revert "update api build script (#18930 )" (#18931 )	2024-03-11 11:47:18 -07:00
Bagatur	def329b5f2	update api build script (#18930 )	2024-03-11 11:44:37 -07:00
Bagatur	c24c871d88	docs: update readme diagram (#18929 )	2024-03-11 11:17:45 -07:00
Bagatur	34284c25d4	docs: turn on link check (#18924 )	2024-03-11 10:50:39 -07:00
Erick Friis	93ef8ead0b	mongodb[patch]: fix core dep (#18926 )	2024-03-11 10:27:29 -07:00
Mohammad Mohtashim	43db4cd20e	core[major]: On Tool End Observation Casting Fix (#18798 ) This PR updates the on_tool_end handlers to return the raw output from the tool instead of casting it to a string. This is technically a breaking change, though it's impact is expected to be somewhat minimal. It will fix behavior in `astream_events` as well. Fixes the following issue #18760 raised by @eyurtsev --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-11 10:59:04 -04:00
Prashanth Rao	a96a6e0f2c	docs: Fix typo and add KùzuDB to graphs docs (#18915 ) - Description: Adding Kùzu (an embedded graph DB that uses Cypher) to the graph docs, and fixing a typo - Issue: docs update	2024-03-11 14:42:46 +00:00
aditya thomas	3d15498612	docs: Update callbacks documentation (#18899 ) Description: Update callbacks documentation Issue: Change some module imports and a method invocation to reflect the current LangChainAPI Dependencies: None	2024-03-11 10:40:11 -04:00
Massimiliano Pronesti	8113d612bb	community[patch]: support modin document loader (#18866 ) Langchain community document loaders support `pyspark`, `polars`, and `pandas` dataframes but not `modin`'s. This PR addresses this point.	2024-03-10 18:40:04 -07:00
Leonid Ganeline	dee256ef5a	docs: `platforms/google` fixed broken links (#18878 ) Several links are broken. Fixed them.	2024-03-10 18:19:43 -07:00
Pol Ruiz Farre	a7f63d8cb4	community[patch]: Fix BasePDFLoader suffix for s3 presigned urls (#18844 ) BasePDFLoader doesn't parse the suffix of the file correctly when parsing S3 presigned urls. This fix enables the proper detection and parsing of S3 presigned URLs to prevent errors such as `OSError: [Errno 36] File name too long`. No additional dependencies required.	2024-03-11 00:58:51 +00:00
Joshua Carroll	ddaf9de169	community: Fix bug with StreamlitChatMessageHistory (#18834 ) - Description: Fix Streamlit bug which was introduced by https://github.com/langchain-ai/langchain/pull/18250, update integration test - Issue: https://github.com/langchain-ai/langchain/issues/18684 - Dependencies: None	2024-03-09 13:42:22 -08:00
Kushagra	5fcbe9dd2a	community[patch]: documented the feature to filter documents in MongoDBloader (#18842 ) "community[docs]: documented the feature to filter documents in MongoDBloader" - Description: documented the feature to filter documents in MongoDBloader - Feature: the feature https://github.com/langchain-ai/langchain/discussions/18251 - Dependencies: No - Twitter handle: https://twitter.com/im_Kushagra	2024-03-09 13:41:34 -08:00
Ikko Eltociear Ashimine	c3580d3c64	docs: fix typo in google_cloud_sql_mysql.ipynb (#18847 ) arbitary -> arbitrary	2024-03-09 13:39:36 -08:00
Luan Fernandes	5a006f7264	docs: update typo in docs about agent tools (#18850 ) fixes #18849	2024-03-09 13:39:18 -08:00
Leonid Ganeline	3dabd3f214	docs: platform pages update (#17836 ) `Integrations` platform page ToC-s: sections there are placed without order. For example, the [google](https://python.langchain.com/docs/integrations/platforms/google) page. The `LLM` section is not the first section, as it is in the [Components](https://python.langchain.com/docs/integrations/components) menu. Updates: * reorganized the page sections so they follow the Component menu order. * fixed names for the section names: "Text Embedding Models" -> "Embedding Models"	2024-03-09 13:34:33 -08:00
Leonid Ganeline	07c518ad3e	docs: `providers` update 4 (#18540 ) Created the `facebook` page from `facebook_faiss` and `facebook_chat` pages. Added another Facebook integrations into this page. Updated `discord` page.	2024-03-09 13:30:48 -08:00
Leonid Ganeline	9c0f84ae95	docs: `providers` update 6 (#18610 ) Cleaned up the `Integrations/Components/Memory` navbar by shortening the page titles. Updated page titles and file names to consistent formats.	2024-03-09 13:29:44 -08:00
Tomaz Bratanic	a28be31a96	Switch to md5 for deduplication in neo4j integrations (#18846 ) Deduplicate documents using MD5 of the page_content. Also allows for custom deduplication with graph ingestion method by providing metadata id attribute --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-03-09 13:28:55 -08:00
Tomaz Bratanic	246724faab	LLM graph transformer prompt engineering (#18843 ) A bit of prompt engineering to improve results	2024-03-09 11:27:16 -08:00
Tomaz Bratanic	e778d60aec	Fix broken link in graph docs (#18837 )	2024-03-09 10:40:33 -08:00
Erick Friis	b48865bf94	langchain[patch]: attach hub metadata (#18830 )	2024-03-08 18:40:49 -08:00
Ammar	34b31a8cc7	core: add in-code docs for RunnableAssign class (#18826 ) Description: Improves the docstring for `RunnableAssign` by providing a concise description and a self-contained code example. Issue: #18803	2024-03-09 02:04:52 +00:00
Leonid Ganeline	5d65b47e41	docs: chat menu item as icon (#18806 ) Update chat icon in docs	2024-03-08 21:00:21 -05:00
Leonid Ganeline	476d6dc596	community[patch]: Use getattr for `toolkits` imports (#18825 ) This will preserve the namespace, without actually loading the underlying packages on init.	2024-03-08 20:54:28 -05:00
Erick Friis	bbb609ac9d	core[patch]: fix arbitrary config keys (#18827 )	2024-03-08 17:35:13 -08:00
Luis Antonio Vieira Junior	67c880af74	community[patch]: adding linearization config to AmazonTextractPDFLoader (#17489 ) - Description: Adding an optional parameter `linearization_config` to the `AmazonTextractPDFLoader` so the caller can define how the output will be linearized, instead of forcing a predefined set of linearization configs. It will still have a default configuration as this will be an optional parameter. - Issue: #17457 - Dependencies: The same ones that already exist for `AmazonTextractPDFLoader` - Twitter handle: [@lvieirajr19](https://twitter.com/lvieirajr19) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 17:25:22 -08:00
Anis ZAKARI	37e89ba5b1	community[patch]: Bedrock add support for mistral models (#18756 ) Description*: My previous [PR](https://github.com/langchain-ai/langchain/pull/18521) was mistakenly closed, so I am reopening this one. Context: AWS released two Mistral models on Bedrock last Friday (March 1, 2024). This PR includes some code adjustments to ensure their compatibility with the Bedrock class. --------- Co-authored-by: Anis ZAKARI <anis.zakari@hymaia.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-09 01:20:38 +00:00
Alexander Dicke	66576948e0	experimental[minor]: adds mixtral wrapper (#17423 ) Description: Adds a chat wrapper for Mixtral models using the [prompt template](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1#instruction-format). --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 17:14:23 -08:00
Erick Friis	4f4300723b	docs: pinecone client version note (#17491 )	2024-03-08 17:09:17 -08:00
Keith Chan	914af69b44	community[patch]: Update azuresearch vectorstore from_texts() method to include fields argument (#17661 ) - Description: Update azuresearch vectorstore from_texts() method to include fields argument, necessary for creating an Azure AI Search index with custom fields. - Issue: Currently index fields are fixed to default fields if Azure Search index is created using from_texts() method - Dependencies: None - Twitter handle: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 17:05:35 -08:00
al1p	46f0cea2b9	community[patch][: improved the suffix prompt to avoid loop (#17791 ) Small improvement to the openapi prompt. The agent was not finding the server base URL (looping through all nodes). This small change narrows the search and enables finding the url faster. No dependency Twitter : @al1pra	2024-03-08 16:53:09 -08:00
Dmitry Kankalovich	f5117e907d	openai[patch]: Proper example for AzureOpenAI usage in error message (#17798 ) # Proper example for AzureOpenAI usage in error message The original error message is wrong in part of a usage example it gives. Corrected to the right one. Co-authored-by: Dzmitry Kankalovich <dzmitry_kankalovich@epam.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 16:52:55 -08:00
Pranav Agarwal	bd9b5dc2f3	docs: Updating cookbook README for amazon personalize (#17854 ) This PR is a successor to this PR - https://github.com/langchain-ai/langchain/pull/17436 This PR updates the cookbook README with the notebook so that it is available on langchain docs for discoverability. cc: @baskaryan, @3coins --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 16:52:36 -08:00
AtomicVar	23e62f8f8d	docs: fix lists display issue (#17911 ) Description: Fix lists display issues in Docs > Use Cases > Q&A with RAG > Quickstart. In essence, this PR changes: ```markdown Some paragraph. - Item a. - Item b. ``` to: ```markdown Some paragraph. - Item a. - Item b. ``` There needs an extra empty line to make the list rendered properly. FYI, the old version is displayed not properly as: <img width="856" alt="image" src="https://github.com/langchain-ai/langchain/assets/22856433/65202577-8ea2-47c6-b310-39bf42796fac"> - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 16:52:16 -08:00
Théo LEBRUN	cf94091cd0	community[patch]: Skip nested directories when using S3DirectoryLoader (#17829 ) - Description: `S3DirectoryLoader` is failing if prefix is a folder (ex: `my_folder/`) because `S3FileLoader` will try to load that folder and will fail. This PR skip nested directories so prefix can be set to folder instead of `my_folder/files_prefix`. - Issue: - #11917 - #6535 - #4326 - Dependencies: none - Twitter handle: @Falydoor - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-03-08 16:50:58 -08:00
Venkatesan	7a18b63dbf	community[patch]: Mongo index creation (#17748 ) - [ ] Title: Mongodb: MongoDB connection performance improvement. - [ ] Message: - Description: I made collection index_creation as optional. Index Creation is one time process. - Issue: MongoDBChatMessageHistory class object is attempting to create an index during connection, causing each request to take longer than usual. This should be optional with a parameter. - Dependencies: N/A - Branch to be checked: origin/mongo_index_creation --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 16:43:17 -08:00
wt3639	5b5b37a999	community[patch]: Add embedding instruction to HuggingFaceBgeEmbeddings (#18017 ) - Description: Add embedding instruction to HuggingFaceBgeEmbeddings, so that it can be compatible with nomic and other models that need embedding instruction. --------- Co-authored-by: Tao Wu <tao.wu@rwth-aachen.de> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 16:39:29 -08:00
Brace Sproul	9c218d0154	docs[patch]: Update how GA4 is collected (#18821 ) There's some issue/setting with the current python GA4 app. I created a new one just for feedback.	2024-03-08 14:32:40 -08:00
Erick Friis	a8de6d1533	anthropic[patch]: integration test update (#18823 )	2024-03-08 13:47:31 -08:00
wewebber-merlin	d1f5bc4906	anthropic[patch]: add kwargs to format_output base (#18715 ) _generate() and _agenerate() both accept kwargs, then pass them on to _format_output; but _format_output doesn't accept kwargs. Attempting to pass, e.g., timeout=50 to _generate (or invoke()) results in a TypeError. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-08 21:47:21 +00:00
Erick Friis	aa7bce6b13	anthropic[patch]: release 0.1.4 (#18822 )	2024-03-08 21:34:47 +00:00
Erick Friis	a5bcddc738	anthropic[patch]: streaming param (#18819 )	2024-03-08 13:32:57 -08:00
Erick Friis	8c0b215c02	anthropic[patch]: fix format output args (#18816 )	2024-03-08 12:34:11 -08:00
Ishani Vyas	2b0cbd65ba	community[patch]: Add Passio Nutrition AI Food Search Tool to Community Package (#18278 ) ## Add Passio Nutrition AI Food Search Tool to Community Package ### Description We propose adding a new tool to the `community` package, enabling integration with Passio Nutrition AI for food search functionality. This tool will provide a simple interface for retrieving nutrition facts through the Passio Nutrition AI API, simplifying user access to nutrition data based on food search queries. ### Implementation Details - Class Structure: Implement `NutritionAI`, extending `BaseTool`. It includes an `_run` method that accepts a query string and, optionally, a `CallbackManagerForToolRun`. - API Integration: Use `NutritionAIAPI` for the API wrapper, encapsulating all interactions with the Passio Nutrition AI and providing a clean API interface. - Error Handling: Implement comprehensive error handling for API request failures. ### Expected Outcome - User Benefits: Enable easy querying of nutrition facts from Passio Nutrition AI, enhancing the utility of the `langchain_community` package for nutrition-related projects. - Functionality: Provide a straightforward method for integrating nutrition information retrieval into users' applications. ### Dependencies - `langchain_core` for base tooling support - `pydantic` for data validation and settings management - Consider `requests` or another HTTP client library if not covered by `NutritionAIAPI`. ### Tests and Documentation - Unit Tests: Include tests that mock network interactions to ensure tool reliability without external API dependency. - Documentation: Create an example notebook in `docs/docs/integrations/tools/passio_nutrition_ai.ipynb` showing usage, setup, and example queries. ### Contribution Guidelines Compliance - Adhere to the project's linting and formatting standards (`make format`, `make lint`, `make test`). - Ensure compliance with LangChain's contribution guidelines, particularly around dependency management and package modifications. ### Additional Notes - Aim for the tool to be a lightweight, focused addition, not introducing significant new dependencies or complexity. - Potential future enhancements could include caching for common queries to improve performance. ### Twitter Handle - Here is our Passio AI [twitter handle](https://twitter.com/@passio_ai) where we announce our products. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-08 20:33:22 +00:00
Aaron Jimenez	bd9f98a20b	docs: Fix typo in modules/chains.ipynb (#18808 ) Description: Fix a minor typo in `modules/chains.ipynb`. - Issue: fixes #17851	2024-03-08 12:09:20 -08:00
Kushagra	b1f22bf76c	community[minor]: added a feature to filter documents in Mongoloader (#18253 ) "community: added a feature to filter documents in Mongoloader" - Description: added a feature to filter documents in Mongoloader - Feature: the feature #18251 - Dependencies: No - Twitter handle: https://twitter.com/im_Kushagra	2024-03-08 12:06:35 -08:00
Tomaz Bratanic	c0bdd4d45b	docs: Add main graph documentation (#18021 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-08 20:03:03 +00:00
Leonid Ganeline	7c8c4e5743	docs: `providers` update 7 (#18620 ) Added missed providers. Added missed integrations. Formatted to the consistent form. Fixed outdated imports.	2024-03-08 12:00:27 -08:00
Eugene Yurtsev	1f50274df7	community[patch]: Add pgvector to docker compose and update settings used in integration test (#18815 )	2024-03-08 14:39:28 -05:00
Erick Friis	ad29806255	nvidia-trt, nvidia-ai-endpoints: move to repo (#18814 ) NVIDIA maintained in https://github.com/langchain-ai/langchain-nvidia	2024-03-08 19:30:50 +00:00
Christophe Bornet	e54a49b697	community[minor]: Add lazy_table_reflection param to SqlDatabase (#18742 ) For some DBs with lots of tables, reflection of all the tables can take very long. So this change will make the tables be reflected lazily when get_table_info() is called and `lazy_table_reflection` is True.	2024-03-08 14:10:23 -05:00
Christophe Bornet	ead2a74806	community: Implement lazy_load() for JSONLoader (#18643 ) Covered by `tests/unit_tests/document_loaders/test_json_loader.py`	2024-03-08 13:58:17 -05:00
Erick Friis	a88f62ec3c	langchain[patch]: getattr import from langchain.chains (#18160 )	2024-03-08 10:36:14 -08:00
kAIto47802	ff70cc4e80	docs: fix typo (#18810 ) Fixed typo in docs	2024-03-08 13:28:17 -05:00
Eugene Yurtsev	cdfb5b4ca1	core[minor]: Chat Models to fallback astream to fallback on sync stream if available (#18748 ) Allows all chat models that implement _stream, but not _astream to still have async streaming to work. Amongst other things this should resolve issues with streaming community model implementations through langserve since langserve is exclusively async.	2024-03-08 13:27:29 -05:00
Leonid Ganeline	3624f56ccb	docs: update imports of `retrievers` to use `langchain_community` (#18707 ) Updated `langchain` imports to `langchain_community`.	2024-03-08 13:04:38 -05:00
Leonid Ganeline	48eed86931	docs: update imports of `memory` to use `langchain_community` (#18689 ) Refactored imports from `langchain` to `langchain_community` whenever it is applicable	2024-03-08 13:02:31 -05:00
aditya thomas	e00c1ff2b0	infra: ChatOpenAI unit tests for invoke() and ainvoke() (#18792 ) Description: Replacing the deprecated predict() and apredict() methods in the unit tests Issue: Not applicable Dependencies: None Lint and test: `make format`, `make lint` and `make test` have been run	2024-03-08 09:48:38 -08:00
aditya thomas	a35203b164	docs: (minor) update to anthropic doc (#18794 ) Description: Minor update to Anthropic documentation Issue: Not applicable Dependencies: None Lint and test: `make format` and `make lint` was done	2024-03-08 09:48:04 -08:00
Bagatur	3e29c04213	core[minor]: add BaseMessage.response_metadata (#18699 )	2024-03-08 09:35:56 -08:00
standby24x7	67d48ea600	docs:Update function "run" to "invoke" in llm_bash.ipynb (#18663 ) This path updates function "run" to "invoke" in llm_bash.ipynb. Without this path, you see following warning. LangChainDeprecationWarning: The function `run` was deprecated in LangChain 0.1.0 and will be removed in 0.2.0. Use invoke instead. Signed-off-by: Masanari Iida <standby24x7@gmail.com>	2024-03-08 09:35:36 -08:00
Bagatur	bc6249c889	langchain[patch]: runnable agent streaming param (#18761 ) Usage: ```python agent = RunnableAgent(runnable=runnable, .., stream_runnable=False) ``` or for convenience ```python agent_executor = AgentExecutor(agent=agent, ..., stream_runnable=False) ```	2024-03-07 20:53:53 -08:00
Tomaz Bratanic	c8c592d3f1	experimental[minor]: Add LLM graph transformer (#18733 ) Add a class that constructs knowledge graphs based on text using an LLM.	2024-03-07 20:52:53 -08:00
Phat Vo	3ecb903d49	community[patch] : Tidy up and update Clarifai SDK functions (#18314 ) Description : * Tidy up, add missing docstring and fix unused params * Enable using session token	2024-03-07 19:47:44 -08:00
Paul Sanders	93b87f2bfb	docs: Fix typo (#18545 ) Fixing a minor typo in the package name. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-03-07 19:40:42 -08:00
Aaron Jimenez	fcf6213c22	docs: Fix link to HF TEI in text_embeddings_inference.ipynb (#18682 ) - [ ] PR title: docs: Fix link to HF TEI in text_embeddings_inference.ipynb - [ ] PR message: - Description: Fix the link to [Hugging Face Text Embeddings Inference (TEI)](https://huggingface.co/docs/text-embeddings-inference/index) in text_embeddings_inference.ipynb - Issue: Fix #18576	2024-03-07 19:38:39 -08:00
Max Jakob	61a2eba081	elasticsearch[patch]: add top-level import, remove obsolete dependency (#18644 ) Make `ElasticsearchRetriever` available as top-level import. The `langchain` package depends on `langchain-community` so we do not need to depend on it explicitly.	2024-03-07 19:38:31 -08:00
Averi Kitsch	8accee57a9	docs: update Google Cloud database integration docs (#18711 ) Description: update Google Cloud database integration docs Issue: NA Dependencies: NA	2024-03-07 19:36:00 -08:00
Tomaz Bratanic	010a234f1e	docs: Fix diffbot graph transformer description (#18736 ) The previous docstring was invalid	2024-03-07 19:25:41 -08:00
Jan Nissen	b8922480ed	core[patch]: improve PydanticOutputParser typing (#18740 ) This PR adds generic typing to `PydanticOutputParser` so we get a typed output from `.parse` instead of `Any`. It should provide a better DX by way of Intellisense and for anyone strictly typing. Pre-change: ![Screenshot 2024-03-07 at 10 22 31 AM](https://github.com/langchain-ai/langchain/assets/22690160/fd22dde0-9fdc-4283-b283-4c98f0bc46e5) Post-change: ![Screenshot 2024-03-07 at 10 26 31 AM](https://github.com/langchain-ai/langchain/assets/22690160/7e23d2b7-8f8c-494f-80b3-187530a173ee) I haven't dug too deep, but I think a similar change could probably be added to `JsonOutputParser` so we don't have to pull up `.parse`. Co-authored-by: Jan Nissen <jan23@gmail.com>	2024-03-07 19:25:24 -08:00
Massimiliano Pronesti	3b975c6ebe	experimental[minor]: add support for modin in pandas agent (#18749 ) Added support for Intel's [modin](https://github.com/modin-project/modin) in `create_pandas_dataframe_agent`.	2024-03-07 19:23:07 -08:00
Tomaz Bratanic	4bfe888717	comunity[patch]: Fix neo4j sanitizing values (#18750 ) Fixing sanitization for when deeply nested lists appear	2024-03-07 19:21:52 -08:00
Ian	7f504c1f81	docs: Improve the tidb vector store notebook (#18773 ) Remove redundant useless content, and fix some minor oversight	2024-03-07 19:15:55 -08:00
Eugene Yurtsev	6caceb5473	core[patch]: Automatic upgrade to AddableDict in transform and atransform (#18743 ) Automatic upgrade to transform and atransform Closes: https://github.com/langchain-ai/langchain/issues/18741 https://github.com/langchain-ai/langgraph/issues/136 https://github.com/langchain-ai/langserve/issues/504	2024-03-07 21:23:12 -05:00
Yunmo Koo	fee6f983ef	community[minor]: Integration for `Friendli` LLM and `ChatFriendli` ChatModel. (#17913 ) ## Description - Add [Friendli](https://friendli.ai/) integration for `Friendli` LLM and `ChatFriendli` chat model. - Unit tests and integration tests corresponding to this change are added. - Documentations corresponding to this change are added. ## Dependencies - Optional dependency [`friendli-client`](https://pypi.org/project/friendli-client/) package is added only for those who use `Frienldi` or `ChatFriendli` model. ## Twitter handle - https://twitter.com/friendliai	2024-03-08 02:20:47 +00:00
Smit Parmar	aed46cd6f2	community[patch]: Added support for filter out AWS Kendra search by score confidence (#12920 ) Description: It will add support for filter out kendra search by score confidence which will make result more accurate. For example ``` retriever = AmazonKendraRetriever( index_id=kendra_index_id, top_k=5, region_name=region, score_confidence="HIGH" ) ``` Result will not include the records which has score confidence "LOW" or "MEDIUM". Relevant docs https://boto3.amazonaws.com/v1/documentation/api/latest/reference/services/kendra/client/query.html https://boto3.amazonaws.com/v1/documentation/api/latest/reference/services/kendra/client/retrieve.html Issue: the issue # it resolve #11801 twitter: [@SmitCode](https://twitter.com/SmitCode)	2024-03-07 17:28:09 -08:00
Ian	390ef6abe3	community[minor]: Add Initial Support for TiDB Vector Store (#15796 ) This pull request introduces initial support for the TiDB vector store. The current version is basic, laying the foundation for the vector store integration. While this implementation provides the essential features, we plan to expand and improve the TiDB vector store support with additional enhancements in future updates. Upcoming Enhancements: * Support for Vector Index Creation: To enhance the efficiency and performance of the vector store. * Support for max marginal relevance search. * Customized Table Structure Support: Recognizing the need for flexibility, we plan for more tailored and efficient data store solutions. Simple use case exmaple ```python from typing import List, Tuple from langchain.docstore.document import Document from langchain_community.vectorstores import TiDBVectorStore from langchain_openai import OpenAIEmbeddings db = TiDBVectorStore.from_texts( embedding=embeddings, texts=['Andrew like eating oranges', 'Alexandra is from England', 'Ketanji Brown Jackson is a judge'], table_name="tidb_vector_langchain", connection_string=tidb_connection_url, distance_strategy="cosine", ) query = "Can you tell me about Alexandra?" docs_with_score: List[Tuple[Document, float]] = db.similarity_search_with_score(query) for doc, score in docs_with_score: print("-" * 80) print("Score: ", score) print(doc.page_content) print("-" * 80) ```	2024-03-07 17:18:20 -08:00
Bagatur	3b1eb1f828	community[patch]: chat hf typing fix (#18693 )	2024-03-07 17:06:38 -08:00
Eugene Yurtsev	1e1cac50d8	Docs: remove sales from security (#18762 ) Remove sales from security	2024-03-07 17:35:46 -05:00
Jib	d60e93b6ae	langchain-mongodb: Standardize mongodb collection/index names in tests (#18755 ) ## Description: MongoDB integration tests link to a provided Atlas Cluster. We have very stringent permissions set against the cluster provided. In order to make it easier to track and isolate the collections each test gets run against, we've updated the collection names to map the test file name. i.e. `langchain_{filename}` => `langchain_test_vectorstores` Fixes integration test results ![image](https://github.com/langchain-ai/langchain/assets/2887713/41f911b9-55f7-4fe4-9134-5514b82009f9) ## Dependencies: Provided MONGODB_ATLAS_URI - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ cc: @shaneharvey, @blink1073 , @NoahStapp , @caseyclements	2024-03-07 17:16:04 -05:00
Eugene Yurtsev	ca299a8e08	Docs: Add custom parsing documentation and extending langchain (#18331 ) * Added extending langchain.mdx -- we'll need to add links as we add more custom documentation * Added partial documentation about parsers	2024-03-07 16:30:57 -05:00
Eugene Yurtsev	8c71f92cb2	core: upgrade mypy to recent mypy (#18753 ) Testing this works per package on CI	2024-03-07 15:25:19 -05:00
Eugene Yurtsev	e188d4ecb0	Add dangerous parameter to requests tool (#18697 ) The tools are already documented as dangerous. Not clear whether adding an opt-in parameter is necessary or not	2024-03-07 15:10:56 -05:00
Leonid Ganeline	dad949eb99	docs: update imports of `adapters` to use langchain_community (#18751 ) Updated imports from `langchain` to `langchain_community`	2024-03-07 15:04:25 -05:00
Erick Friis	fcaa9cf2f1	community[patch]: deprecate community anthropic (#18745 )	2024-03-07 13:51:55 -05:00
Erick Friis	1beb84b061	community[patch]: move pdf text tests to integration (#18746 )	2024-03-07 10:34:22 -08:00
Christophe Bornet	4a7d73b39d	community: If load() has been overridden, use it in default lazy_load() (#18690 )	2024-03-07 11:52:19 -05:00
Christophe Bornet	6cd7607816	community[patch]: Implement lazy_load() for MHTMLLoader (#18648 ) Covered by `tests/unit_tests/document_loaders/test_mhtml.py`	2024-03-07 11:50:18 -05:00
axiangcoding	9745b5894d	community[patch]: Chroma use uuid4 instead of uuid1 to generate random ids (#18723 ) - Description: Chroma use uuid4 instead of uuid1 as random ids. Use uuid1 may leak mac address, changing to uuid4 will not cause other effects. - Issue: None - Dependencies: None - Twitter handle: None	2024-03-07 11:48:25 -05:00
Leonid Ganeline	1af2130ff7	docs: update imports of tools to use langchain_community (#18705 ) Updated imports from `langchain` to `langchain_community`.	2024-03-07 11:46:09 -05:00
Guangdong Liu	ced5e7bae7	community[patch]: Fix sparkllm authentication problem. (#18651 ) - Description: fix sparkllm authentication problem.The current timestamp is in RFC1123 format. The time deviation must be controlled within 300s. I changed to re-obtain the url every time I ask a question. https://www.xfyun.cn/doc/spark/general_url_authentication.html#_1-2-%E9%89%B4%E6%9D%83%E5%8F%82%E6%95%B0	2024-03-06 18:43:16 -08:00
Erick Friis	89d32ffbbd	community[patch]: release 0.0.27 (#18708 )	2024-03-07 01:08:43 +00:00
Erick Friis	c09b520ce4	core[patch]: release 0.1.30 (#18706 )	2024-03-06 16:12:18 -08:00
Piyush Jain	2b234a4d96	Support for claude v3 models. (#18630 ) Fixes #18513. ## Description This PR attempts to fix the support for Anthropic Claude v3 models in BedrockChat LLM. The changes here has updated the payload to use the `messages` format instead of the formatted text prompt for all models; `messages` API is backwards compatible with all models in Anthropic, so this should not break the experience for any models. ## Notes The PR in the current form does not support the v3 models for the non-chat Bedrock LLM. This means, that with these changes, users won't be able to able to use the v3 models with the Bedrock LLM. I can open a separate PR to tackle this use-case, the intent here was to get this out quickly, so users can start using and test the chat LLM. The Bedrock LLM classes have also grown complex with a lot of conditions to support various providers and models, and is ripe for a refactor to make future changes more palatable. This refactor is likely to take longer, and requires more thorough testing from the community. Credit to PRs [18579](https://github.com/langchain-ai/langchain/pull/18579) and [18548](https://github.com/langchain-ai/langchain/pull/18548) for some of the code here. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-06 15:46:18 -08:00
Sam Khano	1b4dcf22f3	community[minor]: Add DocumentDBVectorSearch VectorStore (#17757 ) Description: - Added Amazon DocumentDB Vector Search integration (HNSW index) - Added integration tests - Updated AWS documentation with DocumentDB Vector Search instructions - Added notebook for DocumentDB integration with example usage --------- Co-authored-by: EC2 Default User <ec2-user@ip-172-31-95-226.ec2.internal>	2024-03-06 15:11:34 -08:00
Vittorio Rigamonti	51f3902bc4	community[minor]: Adding support for Infinispan as VectorStore (#17861 ) Description: This integrates Infinispan as a vectorstore. Infinispan is an open-source key-value data grid, it can work as single node as well as distributed. Vector search is supported since release 15.x For more: [Infinispan Home](https://infinispan.org) Integration tests are provided as well as a demo notebook	2024-03-06 15:11:02 -08:00
Max Jakob	cca0167917	elasticsearch[patch], community[patch]: update references, deprecate community classes (#18506 ) Follow up on https://github.com/langchain-ai/langchain/pull/17467. - Update all references to the Elasticsearch classes to use the partners package. - Deprecate community classes. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-06 15:09:12 -08:00
José Luis Di Biase	6041ec3dd1	templates: rag-multi-modal typo, replace serch with search (#18519 ) Thank you for contributing to LangChain! - [x] PR title: "templates: rag-multi-modal typo, replace serch with search " - Description: Two little typos in multi modal templates (replace serch string with search) Signed-off-by: José Luis Di Biase <josx@interorganic.com.ar>	2024-03-06 15:08:55 -08:00
Djordje	12b4a4d860	community[patch]: Opensearch delete method added - indexing supported (#18522 ) - Description: Added delete method for OpenSearchVectorSearch, therefore indexing supported - Issue: No - Dependencies: No - Twitter handle: stkbmf	2024-03-06 15:08:47 -08:00
Erick Friis	687d27567d	openai[patch]: unit test azure init (#18703 )	2024-03-06 14:17:09 -08:00
Christophe Bornet	db8db6faae	community: Implement lazy_load() for PlaywrightURLLoader (#18676 ) Integration tests: `tests/integration_tests/document_loaders/test_url_playwright.py`	2024-03-06 16:52:13 -05:00
Aaron Yi	c092db862e	community[patch]: make metadata and text optional as expected in DocArray (#18678 ) ValidationError: 2 validation errors for DocArrayDoc text Field required [type=missing, input_value={'embedding': [-0.0191128...9, 0.01005221541175212]}, input_type=dict] For further information visit https://errors.pydantic.dev/2.5/v/missing metadata Field required [type=missing, input_value={'embedding': [-0.0191128...9, 0.01005221541175212]}, input_type=dict] For further information visit https://errors.pydantic.dev/2.5/v/missing ``` In the `_get_doc_cls` method, the `DocArrayDoc` class is defined as follows: ```python class DocArrayDoc(BaseDoc): text: Optional[str] embedding: Optional[NdArray] = Field(**embeddings_params) metadata: Optional[dict] ```	2024-03-06 16:51:41 -05:00
Eugene Yurtsev	4c25b49229	community[major]: breaking change in some APIs to force users to opt-in for pickling (#18696 ) This is a PR that adds a dangerous load parameter to force users to opt in to use pickle. This is a PR that's meant to raise user awareness that the pickling module is involved.	2024-03-06 16:43:01 -05:00
Eugene Yurtsev	0e52961562	community[patch]: Patch tdidf retriever (CVE-2024-2057) (#18695 ) This is a patch for `CVE-2024-2057`: https://www.cve.org/CVERecord?id=CVE-2024-2057 This affects users that: * Use the `TFIDFRetriever` * Attempt to de-serialize it from an untrusted source that contains a malicious payload	2024-03-06 15:49:04 -05:00
Leonid Ganeline	81cbf0f2fd	docs: update import paths for callbacks to use langchain_community callbacks where applicable (#18691 ) Refactored imports from `langchain` to `langchain_community` whenever it is applicable	2024-03-06 14:49:06 -05:00
Erick Friis	2619420df1	mongodb[patch]: release 0.1.1 (#18692 )	2024-03-06 19:44:14 +00:00
Leonid Ganeline	fb686333ac	docs: fix `streamlit` provider (#18606 ) There is a wrong python package import. Fixed it.	2024-03-06 11:42:26 -08:00
Christophe Bornet	ea141511d8	core: Move document loader interfaces to core (#17723 ) This is needed to be able to move document loaders to partner packages. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-06 13:59:00 -05:00
aditya thomas	97de498d39	docs: update to the streaming tutorial notebook in the lcel documentation (#18378 ) Description: Update to the streaming tutorial notebook in the LCEL documentation Issue: Fixed an import and (minor) changes in documentation language Dependencies: None	2024-03-06 10:47:22 -08:00
Guangdong Liu	32db9e74e4	docs: Fix some issues with sparkllm use cases (#17674 )	2024-03-06 10:46:51 -08:00
Christophe Bornet	5985454269	Merge pull request #18539 * Implement lazy_load() for GitLoader	2024-03-06 13:25:14 -05:00
Christophe Bornet	9a6f7e213b	Merge pull request #18423 * Implement lazy_load() for BSHTMLLoader	2024-03-06 13:25:01 -05:00
Christophe Bornet	b3a0c44838	Merge pull request #18673 * Implement lazy_load() for PDFMinerPDFasHTMLLoader and PyMuPDFLoader	2024-03-06 13:24:36 -05:00
Christophe Bornet	68fc0cf909	Merge pull request #18674 * Implement lazy_load() for TextLoader	2024-03-06 13:23:42 -05:00
Christophe Bornet	5b92f962f1	Merge pull request #18671 * Implement lazy_load() for MastodonTootsLoader	2024-03-06 13:23:14 -05:00
Christophe Bornet	15b1770326	Merge pull request #18421 * Implement lazy_load() for AssemblyAIAudioTranscriptLoader	2024-03-06 13:16:05 -05:00
Christophe Bornet	bb284eebe4	Merge pull request #18436 * Implement lazy_load() for ConfluenceLoader	2024-03-06 13:15:24 -05:00
Christophe Bornet	691480f491	Merge pull request #18647 * Implement lazy_load() for UnstructuredBaseLoader	2024-03-06 13:13:10 -05:00
Christophe Bornet	52ac67c5d8	Merge pull request #18654 * Implement lazy_load() for ObsidianLoader	2024-03-06 13:06:55 -05:00
Christophe Bornet	b9c0cf9025	Merge pull request #18656 * Implement lazy_load() for PsychicLoader	2024-03-06 13:05:04 -05:00
Christophe Bornet	aa7ac57b67	community: Implement lazy_load() for TrelloLoader (#18658 ) Covered by `tests/unit_tests/document_loaders/test_trello.py`	2024-03-06 13:04:36 -05:00
Christophe Bornet	302985fea1	community: Implement lazy_load() for SlackDirectoryLoader (#18675 ) Integration tests: `tests/integration_tests/document_loaders/test_slack.py`	2024-03-06 13:04:13 -05:00
Christophe Bornet	ed36f9f604	community: Implement lazy_load() for WhatsAppChatLoader (#18677 ) Integration test: `tests/integration_tests/document_loaders/test_whatsapp_chat.py`	2024-03-06 13:03:46 -05:00
Christophe Bornet	f414f5cdb9	community[minor]: Implement lazy_load() for WikipediaLoader (#18680 ) Integration test: `tests/integration_tests/document_loaders/test_wikipedia.py`	2024-03-06 13:03:21 -05:00
Bagatur	4cbfeeb1c2	community[patch]: Release 0.0.26 (#18683 )	2024-03-06 09:41:18 -08:00
Eugene Yurtsev	b9f3c7a0c9	Use Case: Extraction set temperature to 0, qualify a statement (#18672 ) Minor changes: 1) Set temperature to 0 (important) 2) Better qualify one of the statements with confidence	2024-03-06 12:35:45 -05:00
Eugene Yurtsev	a4a6978224	Docs: Revamp Extraction Use Case (#18588 ) Revamp the extraction use case documentation --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-03-06 09:18:25 -05:00
Christophe Bornet	1100f8de7a	community[minor]: Implement lazy_load() for ArxivLoader (#18664 ) Integration tests: `tests/integration_tests/utilities/test_arxiv.py` and `tests/integration_tests/document_loaders/test_arxiv.py`	2024-03-06 09:16:49 -05:00
Christophe Bornet	2d96803ddd	community[minor]: Implement lazy_load() for OutlookMessageLoader (#18668 ) Integration test: `tests/integration_tests/document_loaders/test_email.py`	2024-03-06 09:15:57 -05:00
Christophe Bornet	ae167fb5b2	community[minor]: Implement lazy_load() for SitemapLoader (#18667 ) Integration tests: `test_sitemap.py` and `test_docusaurus.py`	2024-03-06 09:15:35 -05:00
Christophe Bornet	623dfcc55c	community[minor]: Implement lazy_load() for FacebookChatLoader (#18669 ) Integration test: `tests/integration_tests/document_loaders/test_facebook_chat.py`	2024-03-06 09:15:00 -05:00
Christophe Bornet	20794bb889	community[minor]: Implement lazy_load() for GitbookLoader (#18670 ) Integration test: `tests/integration_tests/document_loaders/test_gitbook.py`	2024-03-06 09:14:36 -05:00
Liang Zhang	81985b31e6	community[patch]: Databricks SerDe uses cloudpickle instead of pickle (#18607 ) - Description: Databricks SerDe uses cloudpickle instead of pickle when serializing a user-defined function transform_input_fn since pickle does not support functions defined in `__main__`, and cloudpickle supports this. - Dependencies: cloudpickle>=2.0.0 Added a unit test.	2024-03-05 18:04:45 -08:00
Erick Friis	f3e28289f6	infra: reorder api docs build steps (#18618 )	2024-03-05 17:33:36 -08:00
Leonid Ganeline	114d64d4a7	docs: `providers` update (#18527 ) Added missed pages. Added links and descriptions. Foratted to the consistent form.	2024-03-05 17:32:59 -08:00
Christophe Bornet	7d6de96186	community[patch]: Implement lazy_load() for CubeSemanticLoader (#18535 ) Covered by `test_cube_semantic.py`	2024-03-05 17:32:31 -08:00
Christophe Bornet	a6b5d45e31	community[patch]: Implement lazy_load() for EverNoteLoader (#18538 ) Covered by `test_evernote_loader.py`	2024-03-05 17:29:52 -08:00
PSV	d7dd3cd248	docs: structured_output (#18608 ) - Description: Fixed some typos and copy errors in the Beta Structured Output docs - Issue: N/A - Dependencies: Docs only - Twitter handle: @psvann Co-authored-by: P.S. Vann <psvann@yahoo.com>	2024-03-05 17:20:06 -08:00
Bagatur	29f1619d61	docs: why lcel nit (#18616 )	2024-03-05 17:10:47 -08:00
Max Jakob	ee7a7954b9	elasticsearch: add `ElasticsearchRetriever` (#18587 ) Implement [Retriever](https://python.langchain.com/docs/modules/data_connection/retrievers/) interface for Elasticsearch. I opted to only expose the `body`, which gives you full flexibility, and none the other 68 arguments of the [search method](https://elasticsearch-py.readthedocs.io/en/v8.12.1/api/elasticsearch.html#elasticsearch.Elasticsearch.search). Added a user agent header for usage tracking in Elastic Cloud. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-06 00:42:50 +00:00
Jib	8bc347c5fc	mongodb[patch]: include LLM caches in toplevel library import (#18601 )	2024-03-05 16:35:13 -08:00
Bagatur	080904689c	docs: text splitters install (#18589 )	2024-03-05 16:19:37 -08:00
Sunchao Wang	dc81dba6cf	community[patch]: Improve amadeus tool and doc (#18509 ) Description: This pull request addresses two key improvements to the langchain repository: Fix for Crash in Flight Search Interface: Previously, the code would crash when encountering a failure scenario in the flight ticket search interface. This PR resolves this issue by implementing a fix to handle such scenarios gracefully. Now, the code handles failures in the flight search interface without crashing, ensuring smoother operation. Documentation Update for Amadeus Toolkit: Prior to this update, examples provided in the documentation for the Amadeus Toolkit were unable to run correctly due to outdated information. This PR includes an update to the documentation, ensuring that all examples can now be executed successfully. With this update, users can effectively utilize the Amadeus Toolkit with accurate and functioning examples. These changes aim to enhance the reliability and usability of the langchain repository by addressing issues related to error handling and ensuring that documentation remains up-to-date and actionable. Issue: https://github.com/langchain-ai/langchain/issues/17375 Twitter Handle: SingletonYxx	2024-03-05 16:17:22 -08:00
Christophe Bornet	f77f7dc3ec	community[patch]: Fix VectorStoreQATool (#18529 ) Fix #18460	2024-03-05 15:56:58 -08:00
Utkarsh Kapil	539a13dbda	docs: minor spelling errors (#18429 ) Description: Noticed spelling errors. 'Colab' mispelt as 'Collab'. https://python.langchain.com/docs/use_cases Dependencies: n/a	2024-03-05 15:54:15 -08:00
Dounx	ad48f55357	community[minor]: add Yuque document loader (#17924 ) This pull request support loading documents from Yuque with Langchain. Yuque is a professional cloud-based knowledge base for team collaboration in documentation. Website: https://www.yuque.com OpenAPI: https://www.yuque.com/yuque/developer/openapi	2024-03-05 15:54:07 -08:00
Kazuki Maeda	60c5d964a8	community[minor]: use jq schema for content_key in json_loader (#18003 ) ### Description Changed the value specified for `content_key` in JSONLoader from a single key to a value based on jq schema. I created [similar PR](https://github.com/langchain-ai/langchain/pull/11255) before, but it has several conflicts because of the architectural change associated stable version release, so I re-create this PR to fit new architecture. ### Why For json data like the following, specify `.data[].attributes.message` for page_content and `.data[].attributes.id` or `.data[].attributes.attributes. tags`, etc., the `content_key` must also parse the json structure. <details> <summary>sample json data</summary> ```json { "data": [ { "attributes": { "message": "message1", "tags": [ "tag1" ] }, "id": "1" }, { "attributes": { "message": "message2", "tags": [ "tag2" ] }, "id": "2" } ] } ``` </details> <details> <summary>sample code</summary> ```python def metadata_func(record: dict, metadata: dict) -> dict: metadata["source"] = None metadata["id"] = record.get("id") metadata["tags"] = record["attributes"].get("tags") return metadata sample_file = "sample1.json" loader = JSONLoader( file_path=sample_file, jq_schema=".data[]", content_key=".attributes.message", ## content_key is parsable into jq schema is_content_key_jq_parsable=True, ## this is added parameter metadata_func=metadata_func ) data = loader.load() data ``` </details> ### Dependencies none ### Twitter handle [kzk_maeda](https://twitter.com/kzk_maeda)	2024-03-05 15:51:24 -08:00
Rodrigo Nogueira	f4bb33bbf3	docs: fix link and missing package (#18405 ) Issue: fix broken links and missing package on colab example	2024-03-05 15:50:06 -08:00
Max Jakob	81e9ab6e3a	docs: Update elasticsearch README (#18497 ) Update Elasticsearch README with information on how to start a deployment. Also make some cosmetic changes to the [Elasticsearch docs](https://python.langchain.com/docs/integrations/vectorstores/elasticsearch). Follow up on https://github.com/langchain-ai/langchain/pull/17467	2024-03-05 15:49:16 -08:00
Hech	6a08134661	community[patch], langchain[minor]: Add retriever self_query and score_threshold in DingoDB (#18106 )	2024-03-05 15:47:29 -08:00
Mikhail Khludnev	d039dcb6ba	nvidia-trt[patch]: add TritonTensorRTLLM(verbose_client=False) (#16848 ) - Description: adding verbose flag to TritonTensorRTLLM, - Issue: nope, - Dependencies: not any, - Twitter handle:	2024-03-05 15:44:13 -08:00
Bagatur	1569b19191	docs: query analysis links (#18614 )	2024-03-05 15:05:44 -08:00
Asaf Joseph Gardin	27441555d0	ai21[patch]: AI21 Labs Contextual Answers support (#18270 ) Description: Added support for AI21 Labs model - Contextual Answers Dependencies: ai21, ai21-tokenizer Twitter handle: https://github.com/AI21Labs --------- Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-05 22:42:04 +00:00
Erick Friis	e169ee8863	anthropic[patch]: handle lists in function calling (#18609 )	2024-03-05 14:19:40 -08:00
Erick Friis	1831733c2e	anthropic[patch]: fix argument integration test (#18605 )	2024-03-05 13:05:25 -08:00
Leonid Ganeline	bd4993141d	docs: `providers` update 5 (#18550 ) Added missed sections. Added descriptions.	2024-03-05 12:55:13 -08:00
Yudhajit Sinha	4570b477b9	community[patch]: Invoke callback prior to yielding token (titan_takeoff) (#18560 ) ## PR title community[patch]: Invoke callback prior to yielding token ## PR message - Description: Invoke callback prior to yielding token in _stream_ method in llms/titan_takeoff. - Issue: #16913 - Dependencies: None	2024-03-05 12:54:26 -08:00
Tomaz Bratanic	ea51cdaede	Remove neo4j bloom labels from graph schema (#18564 ) Neo4j tools use particular node labels and relationship types to store metadata, but are irrelevant for text2cypher or graph generation, so we want to ignore them in the schema representation.	2024-03-05 12:54:05 -08:00
standby24x7	a2779738aa	docs:Update function "run" to "invoke" in smart_llm.ipynb (#18568 ) This patch updates function "run" to "invoke" in smart_llm.ipynb. Without this patch, you see following warning. LangChainDeprecationWarning: The function `run` was deprecated in LangChain 0.1.0 and will be removed in 0.2.0. Use invoke instead. Signed-off-by: Masanari Iida <standby24x7@gmail.com> Signed-off-by: Masanari Iida <standby24x7@gmail.com>	2024-03-05 12:52:48 -08:00
Erick Friis	e1924b3e93	core[patch]: deprecate hwchase17/langchain-hub, address path traversal (#18600 ) Deprecates the old langchain-hub repository. Does not deprecate the new https://smith.langchain.com/hub @PinkDraconian has correctly raised that in the event someone is loading unsanitized user input into the `try_load_from_hub` function, they have the ability to load files from other locations in github than the hwchase17/langchain-hub repository. This PR adds some more path checking to that function and deprecates the functionality in favor of the hub built into LangSmith.	2024-03-05 12:49:38 -08:00
Reuben Zotz-Wilson	96cd50938a	community:update telegram notebook (#18569 ) Description: modified the user_name to username to conform with the expected inputs to TelegramChatApiLoader Issue: Current code fails in langchain-community 0.0.24 <loader = TelegramChatApiLoader( chat_entity="<CHAT_URL>", # recommended to use Entity here api_hash="<API HASH >", api_id="<API_ID>", user_name="", # needed only for caching the session. )>	2024-03-05 11:47:17 -08:00
Jib	fc35262356	langchain-mongodb: add unit tests for MongoDBChatMessageHistory (#18599 ) ## Description Adding in Unit Test variation for `MongoDBChatMessageHistory` package Follow-up to #18590 - [x] Add tests and docs: Unit test is what's being added - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-03-05 11:44:31 -08:00
Erick Friis	48e303ea10	airbyte[patch]: release 0.1.1, python 3.9 compat (#18597 )	2024-03-05 19:22:08 +00:00
Jib	9da1e0cf34	mongodb[patch]: Migrate MongoDBChatMessageHistory (#18590 ) ## Description Migrate the `MongoDBChatMessageHistory` to the managed `langchain-mongodb` partner-package ## Dependencies None ## Twitter handle @mongodb ## tests and docs - [x] Migrate existing integration test - [x ]~ Convert existing integration test to a unit test~ Creation is out of scope for this ticket - [x ] ~Considering delaying work until #17470 merges to leverage the `MockCollection` object. ~ - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-05 18:53:02 +00:00
Jib	f92f7d2e03	mongodb[minor]: Add MongoDB LLM Cache (#17470 ) # Description - Description: Adding MongoDB LLM Caching Layer abstraction - Issue: N/A - Dependencies: None - Twitter handle: @mongodb Checklist: - [x] PR title: Please title your PR "package: description", where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR Message (above) - [x] Pass lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified to check that you're passing lint and testing. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @efriis, @eyurtsev, @hwchase17. --------- Co-authored-by: Jib <jib@byblack.us>	2024-03-05 10:38:39 -08:00
Tomaz Bratanic	449d8781ec	Update link in neo4j semantic ollama templates (#18574 )	2024-03-05 09:42:34 -08:00
Tomaz Bratanic	353248838d	Add precedence for input params over env variables in neo4j integration (#18581 ) input parameters take precedence over env variables	2024-03-05 09:36:56 -08:00
Christophe Bornet	c8a171a154	community: Implement lazy_load() for GithubFileLoader (#18584 )	2024-03-05 09:35:50 -08:00
Leonid Kuligin	04d134df17	marked MatchingEngine as deprecated (#18585 ) Thank you for contributing to LangChain! - [ ] PR title: "community: deprecate vectorstores.MatchingEngine" - [ ] PR message: - Description: announced a deprecation since this integration has been moved to langchain_google_vertexai	2024-03-05 09:34:53 -08:00
Erick Friis	07f23c2d45	docs: anthropic multimodal (#18586 )	2024-03-05 16:58:06 +00:00
Erick Friis	4ac2cb4adc	anthropic[minor]: add tool calling (#18554 )	2024-03-05 08:30:16 -08:00
Bagatur	5fc67ca2c7	langchain[patch]: Release 0.1.11 (#18558 )	2024-03-04 23:58:34 -08:00
Erick Friis	68c1878380	anthropic[patch]: model type string (#18510 )	2024-03-04 19:25:19 -08:00
Akash A Desai	eb0756f3ee	templates: fix rag-lancedb template (#18551 )	2024-03-04 18:56:16 -08:00
Erick Friis	25c7d52140	anthropic[patch]: multimodal (#18517 ) - anthropic[minor]: claude 3 - x - x --------- Co-authored-by: William FH <13333726+hinthornw@users.noreply.github.com>	2024-03-04 17:50:13 -08:00
Erick Friis	343438e872	community[patch]: deprecate community fireworks (#18544 )	2024-03-05 01:04:26 +00:00
William FH	ca1d42785d	Evals wording (#18542 )	2024-03-04 16:32:33 -08:00
Brace Sproul	328a498a78	docs[minor]: Add thumbs up/down to all docs pages (#18526 )	2024-03-04 15:14:28 -08:00
Erick Friis	10874d5002	docs: update stack graphic (#18532 )	2024-03-04 23:07:28 +00:00
Bagatur	dd07eddf24	core[patch]: Release 0.1.29 (#18530 )	2024-03-04 14:37:08 -08:00
William FH	30ccc009e6	[Evals] Support list examples by dataset version tag (#18534 ) previously only supported by timestamp	2024-03-04 14:23:32 -08:00
Lance Martin	72ae744588	RAPTOR (#18467 ) Cookbook for RAPTOR paper	2024-03-04 13:16:33 -08:00
aditya thomas	7803b973c7	docs: update documentation of stackexchange component (#18486 ) Description: Update documentation of the StackExchange component Issue: None Dependencies: None	2024-03-04 10:45:29 -08:00
aditya thomas	5c387a173f	docs: update to docstrings of ChatAnthropic class (#18493 ) Description: Update docstrings of ChatAnthropic class Issue: Change to ChatAnthropic from ChatAnthropicMessages Dependencies: None Lint and test: `make format`, `make lint` and `make test` passed	2024-03-04 10:44:54 -08:00
Martin Kolb	63702a2044	docs: Improved notebook for vector store "HANA Cloud" (#18496 ) - Description: This PR fixes some issues in the Jupyter notebook for the VectorStore "SAP HANA Cloud Vector Engine": * Slight textual adaptations * Fix of wrong column name VEC_META (was: VEC_METADATA) - Issue: N/A - Dependencies: no new dependecies added - Twitter handle: @sapopensource path to notebook: `docs/docs/integrations/vectorstores/hanavector.ipynb`	2024-03-04 10:44:16 -08:00
standby24x7	8461700738	docs: Update function "run" to "invoke" (#18499 ) Currently llm_checker.ipynb uses a function "run". Update to "invoke" to avoid following warning. LangChainDeprecationWarning: The function `run` was deprecated in LangChain 0.1.0 and will be removed in 0.2.0. Use invoke instead. Signed-off-by: Masanari Iida <standby24x7@gmail.com>	2024-03-04 10:42:53 -08:00
standby24x7	6c9177681d	docs: Update function "run" to "invoke" in llm_math.ipynb (#18505 ) This patch updates function "run" to "invoke". Without this patch you see following warning. LangChainDeprecationWarning: The function `run` was deprecated in LangChain 0.1.0 and will be removed in 0.2.0. Use invoke instead. Signed-off-by: Masanari Iida <standby24x7@gmail.com>	2024-03-04 10:42:36 -08:00
Bagatur	1c1a3a7415	docs: quickstart models (#18511 )	2024-03-04 08:33:19 -08:00
aditya thomas	a727eec6ed	docs: add groq to list of providers (#18503 ) Description: Add Groq to the list of providers Issue: None Dependencies: None	2024-03-04 08:20:40 -08:00
Erick Friis	24f9c700f2	anthropic[minor]: claude 3 (#18508 )	2024-03-04 15:03:51 +00:00
William De Vena	172499404a	Docs: Updated callbacks/index.mdx adding example on invoke method (#18403 ) ## PR title Docs: Updated callbacks/index.mdx adding example on runnable methods ## PR message - Description: Updated callbacks/index.mdx adding an example on how to pass callbacks to the runnable methods (invoke, batch, ...) - Issue: #16379 - Dependencies: None	2024-03-04 09:11:48 -05:00
Jacob Lee	de2d9447c6	👥 Update LangChain people data (#18473 ) 👥 Update LangChain people data Co-authored-by: github-actions <github-actions@github.com>	2024-03-03 19:58:58 -08:00
William FH	1cdb813196	Improve notebook wording (#18472 )	2024-03-03 18:31:15 -08:00
William FH	1eec67e8fe	Evaluate on Version (#18471 )	2024-03-03 17:47:35 -08:00
William FH	55b69d5ad1	Update Notebook Image (#18470 )	2024-03-03 17:22:59 -08:00
Harrison Chase	73d653324f	[Evals] Session-level feedback (#18463 ) Co-authored-by: William Fu-Hinthorn <13333726+hinthornw@users.noreply.github.com>	2024-03-03 17:18:29 -08:00
Scott Nath	b051bba1a9	community: Add you.com tool, add async to retriever, add async testing, add You tool doc (#18032 ) - Description: finishes adding the you.com functionality including: - add async functions to utility and retriever - add the You.com Tool - add async testing for utility, retriever, and tool - add a tool integration notebook page - Dependencies: any dependencies required for this change - Twitter handle: @scottnath	2024-03-03 14:30:05 -08:00
mackong	b89d9fc177	langchain[patch]: add tools renderer for various non-openai agents (#18307 ) - Description: add tools_renderer for various non-openai agents, make tools can be render in different ways for your LLM. - Issue: N/A - Dependencies: N/A --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-03-03 14:25:12 -08:00
Harrison Chase	7ce2f32c64	improve query analysis docs (#18426 )	2024-03-03 14:24:33 -08:00
William De Vena	a63cee04ac	nvidia-trt[patch]: Invoke callback prior to yielding token (#18446 ) ## PR title nvidia-trt[patch]: Invoke callback prior to yielding ## PR message - Description: Invoke on_llm_new_token callback prior to yielding token in _stream method. - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None	2024-03-03 14:15:11 -08:00
William De Vena	275877980e	community[patch]: Invoke callback prior to yielding token (#18447 ) ## PR title community[patch]: Invoke callback prior to yielding token ## PR message Description: Invoke callback prior to yielding token in _stream method in llms/vertexai. Issue: https://github.com/langchain-ai/langchain/issues/16913 Dependencies: None	2024-03-03 14:14:40 -08:00
William De Vena	67375e96e0	community[patch]: Invoke callback prior to yielding token (#18448 ) ## PR title community[patch]: Invoke callback prior to yielding token ## PR message - Description: Invoke callback prior to yielding token in _stream method in llms/tongyi. - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None	2024-03-03 14:14:22 -08:00
William De Vena	2087cbae64	community[patch]: Invoke callback prior to yielding token (#18449 ) ## PR title community[patch]: Invoke callback prior to yielding token ## PR message - Description: Invoke callback prior to yielding token in _stream method in chat_models/perplexity. - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None	2024-03-03 14:14:00 -08:00
William De Vena	eb04d0d3e2	community[patch]: Invoke callback prior to yielding token (#18452 ) ## PR title community[patch]: Invoke callback prior to yielding token ## PR message - Description: Invoke callback prior to yielding token in _stream and _astream methods in llms/anthropic. - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None	2024-03-03 14:13:41 -08:00
William De Vena	371bec79bc	community[patch]: Invoke callback prior to yielding token (#18454 ) ## PR title community[patch]: Invoke callback prior to yielding token ## PR message - Description: Invoke callback prior to yielding token in _stream and _astream methods in llms/baidu_qianfan_endpoint. - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None	2024-03-03 14:13:22 -08:00
Aayush Kataria	7c2f3f6f95	community[minor]: Adding Azure Cosmos Mongo vCore Vector DB Cache (#16856 ) Description: This pull request introduces several enhancements for Azure Cosmos Vector DB, primarily focused on improving caching and search capabilities using Azure Cosmos MongoDB vCore Vector DB. Here's a summary of the changes: - AzureCosmosDBSemanticCache: Added a new cache implementation called AzureCosmosDBSemanticCache, which utilizes Azure Cosmos MongoDB vCore Vector DB for efficient caching of semantic data. Added comprehensive test cases for AzureCosmosDBSemanticCache to ensure its correctness and robustness. These tests cover various scenarios and edge cases to validate the cache's behavior. - HNSW Vector Search: Added HNSW vector search functionality in the CosmosDB Vector Search module. This enhancement enables more efficient and accurate vector searches by utilizing the HNSW (Hierarchical Navigable Small World) algorithm. Added corresponding test cases to validate the HNSW vector search functionality in both AzureCosmosDBSemanticCache and AzureCosmosDBVectorSearch. These tests ensure the correctness and performance of the HNSW search algorithm. - LLM Caching Notebook - The notebook now includes a comprehensive example showcasing the usage of the AzureCosmosDBSemanticCache. This example highlights how the cache can be employed to efficiently store and retrieve semantic data. Additionally, the example provides default values for all parameters used within the AzureCosmosDBSemanticCache, ensuring clarity and ease of understanding for users who are new to the cache implementation. @hwchase17,@baskaryan, @eyurtsev,	2024-03-03 14:04:15 -08:00
Bagatur	db47b5deee	docs: anthropic quickstart (#18440 )	2024-03-03 13:59:28 -08:00
Bagatur	74f3908182	docs: anthropic qa quickstart (#18459 )	2024-03-03 13:33:24 -08:00
Harrison Chase	bc768a12ed	more query analysis docs (#18358 )	2024-03-02 08:44:22 -08:00
Erick Friis	f96dd57501	langchain[patch]: release 0.1.10 (#18410 )	2024-03-02 01:48:57 +00:00
Erick Friis	1fd1ac8e95	community[patch]: release 0.0.25 (#18408 )	2024-03-02 00:56:04 +00:00
aditya thomas	44b33fcc76	infra: update to pathspec for 'git grep' in lint check (#18178 ) Description: Update to the pathspec for 'git grep' in lint check in the Makefile Issue: The pathspec {docs/docs,templates,cookbook} is not handled correctly leading to the error during 'make lint' - "fatal: ambiguous argument '{docs/docs,templates,cookbook}': unknown revision or path not in the working tree." See changes made in https://github.com/langchain-ai/langchain/pull/18058 Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-01 22:03:45 +00:00
standby24x7	57c733e560	docs: Fix spelling typos in apache_kafka notebook (#17998 ) This patch fixes some spelling typos in apache_kafka_message_handling.ipynb Signed-off-by: Masanari Iida <standby24x7@gmail.com>	2024-03-01 13:58:04 -08:00
Erick Friis	9fda6ac7e6	docs: stop copying source (#18404 )	2024-03-01 13:57:53 -08:00
Sourav Pradhan	50abeb7ed9	community[patch]: fix Chroma add_images (#17964 ) ### Description Fixed a small bug in chroma.py add_images(), previously whenever we are not passing metadata the documents is containing the base64 of the uris passed, but when we are passing the metadata the documents is containing normal string uris which should not be the case. ### Issue In add_images() method when we are calling upsert() we have to use "b64_texts" instead of normal string "uris". ### Twitter handle https://twitter.com/whitepegasus01	2024-03-01 21:55:58 +00:00
Sanjaypranav V M	d722525c70	templates: remove gemini_function_agent unused file (#18112 ) - [X] Gemini Agent Executor imported `agent.py` has Gemini agent executor which was not utilised in current template of gemini function agent 🧑‍💻 instead openai_function_agent has been used @sbusso @jarib please someone review it --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-01 21:55:20 +00:00
Kate Silverstein	b7c71e2e07	community[minor]: llamafile embeddings support (#17976 ) * Description: adds `LlamafileEmbeddings` class implementation for generating embeddings using [llamafile](https://github.com/Mozilla-Ocho/llamafile)-based models. Includes related unit tests and notebook showing example usage. * Issue: N/A * Dependencies: N/A	2024-03-01 13:49:18 -08:00
Massimiliano Pronesti	c3c987dd70	docs: update Azure OpenAI to v1 and langchain API to 0.1 (#18005 ) Description: Updated Azure OpenAI docs to OpenAI API v1 and LLM invocation to langchain 0.1	2024-03-01 13:47:00 -08:00
Mateusz Szewczyk	9298a0b941	langchain_ibm[patch] update docstring, dependencies, tests (#18386 ) - Description: Update docstring, dependencies, tests, README - Dependencies: [ibm-watsonx-ai](https://pypi.org/project/ibm-watsonx-ai/), - Tag maintainer: : Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally -> ✅ Please make sure integration_tests passing locally -> ✅ --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-01 21:01:53 +00:00
Jib	c2b1abe91b	mongodb[patch]: Set delete_many only if count_documents is not 0 (#18402 ) - [x] PR message: *Delete this entire checklist* and replace with - Description: Remove the assert statement on the `count_documents` in setup_class. It should just delete if there are documents present - Issue: the issue # Crashes on class setup - Dependencies: None - Twitter handle: @mongodb - [x] Add tests and docs: If you're adding a new integration, please include 1. N/A - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. Co-authored-by: Jib <jib@byblack.us>	2024-03-01 13:01:28 -08:00
Kate Silverstein	c9153a3fd4	docs: add llamafile info to 'Local LLMs' guides (#18049 ) - Description: add information about [llamafile](https://github.com/Mozilla-Ocho/llamafile) (setup, example usage) to ['Run LLMs locally'](https://python.langchain.com/docs/guides/local_llms) and ['Using local models for Q&A with RAG'](https://python.langchain.com/docs/use_cases/question_answering/local_retrieval_qa) guides. - Issue: N/A - Dependencies: N/A	2024-03-01 12:44:31 -08:00
Tomaz Bratanic	f6bfb969ba	community[patch]: Add an option for indexed generic label when import neo4j graph documents (#18122 ) Current implementation doesn't have an indexed property that would optimize the import. I have added a `baseEntityLabel` parameter that allows you to add a secondary node label, which has an indexed id `property`. By default, the behaviour is identical to previous version. Since multi-labeled nodes are terrible for text2cypher, I removed the secondary label from schema representation object and string, which is used in text2cypher.	2024-03-01 12:33:52 -08:00
aditya thomas	e6e60e2492	docs: ChatOpenAI update module import path and calling method (#18169 ) Description: (a) Update to the module import path to reflect the splitting up of langchain into separate packages (b) Update to the documentation to include the new calling method (invoke)	2024-03-01 12:32:20 -08:00
Arun Sathiya	4adac20d7b	community[patch]: Make cohere_api_key a SecretStr (#12188 ) This PR makes `cohere_api_key` in `llms/cohere` a SecretStr, so that the API Key is not leaked when `Cohere.cohere_api_key` is represented as a string. --------- Signed-off-by: Arun <arun@arun.blog> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-03-01 20:27:53 +00:00
Ryan Meinzer	d883fd4a37	docs: Correct WebBaseLoader URL: docs: python.langchain.com/docs/get_started/quickstartQuickstart (#17981 ) Description: The URL of the data to index, specified to `WebBaseLoader` to import is incorrect, causing the `langsmith_search` retriever to return a `404: NOT_FOUND`. Incorrect URL: https://docs.smith.langchain.com/overview Correct URL: https://docs.smith.langchain.com Issue: This commit corrects the URL and prevents the LangServe Playground from returning an error from its inability to use the retriever when inquiring, "how can langsmith help with testing?". Dependencies: None. Twitter Handle: @ryanmeinzer	2024-03-01 12:21:53 -08:00
Petteri Johansson	6c1989d292	community[minor], langchain[minor], docs: Gremlin Graph Store and QA Chain (#17683 ) - Description: New feature: Gremlin graph-store and QA chain (including docs). Compatible with Azure CosmosDB. - Dependencies: no changes	2024-03-01 12:21:14 -08:00
Ather Fawaz	a5ccf5d33c	community[minor]: Add support for Perplexity chat model(#17024 ) - Description: This PR adds support for [Perplexity AI APIs](https://blog.perplexity.ai/blog/introducing-pplx-api). - Issues: None - Dependencies: None - Twitter handle: [@atherfawaz](https://twitter.com/AtherFawaz) --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-01 12:19:23 -08:00
Rodrigo Nogueira	3438d2cbcc	community[minor]: add maritalk chat (#17675 ) Description: Adds the MariTalk chat that is based on a LLM specially trained for Portuguese. Twitter handle: @MaritacaAI	2024-03-01 12:18:23 -08:00
sarahberenji	08fa38d56d	community[patch]: the syntax error for Redis generated query (#17717 ) To fix the reported error: https://github.com/langchain-ai/langchain/discussions/17397	2024-03-01 12:18:10 -08:00
certified-dodo	43e3244573	community[patch]: Fix MongoDBAtlasVectorSearch max_marginal_relevance_search (#17971 ) Description: * `self._embedding_key` is accessed after deletion, breaking `max_marginal_relevance_search` search * Introduced in: `e135e5257c` * Updated but still persists in: `ce22e10c4b` Issue: https://github.com/langchain-ai/langchain/issues/17963 Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-01 12:17:42 -08:00
Nikita Titov	9f2ab37162	community[patch]: don't try to parse json in case of errored response (#18317 ) Related issue: #13896. In case Ollama is behind a proxy, proxy error responses cannot be viewed. You aren't even able to check response code. For example, if your Ollama has basic access authentication and it's not passed, `JSONDecodeError` will overwrite the truth response error. <details> <summary><b>Log now:</b></summary> ``` { "name": "JSONDecodeError", "message": "Expecting value: line 1 column 1 (char 0)", "stack": "--------------------------------------------------------------------------- JSONDecodeError Traceback (most recent call last) File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/requests/models.py:971, in Response.json(self, kwargs) 970 try: --> 971 return complexjson.loads(self.text, kwargs) 972 except JSONDecodeError as e: 973 # Catch JSON-related errors and raise as requests.JSONDecodeError 974 # This aliases json.JSONDecodeError and simplejson.JSONDecodeError File /opt/miniforge3/envs/.gpt/lib/python3.10/json/__init__.py:346, in loads(s, cls, object_hook, parse_float, parse_int, parse_constant, object_pairs_hook, kw) 343 if (cls is None and object_hook is None and 344 parse_int is None and parse_float is None and 345 parse_constant is None and object_pairs_hook is None and not kw): --> 346 return _default_decoder.decode(s) 347 if cls is None: File /opt/miniforge3/envs/.gpt/lib/python3.10/json/decoder.py:337, in JSONDecoder.decode(self, s, _w) 333 \"\"\"Return the Python representation of ``s`` (a ``str`` instance 334 containing a JSON document). 335 336 \"\"\" --> 337 obj, end = self.raw_decode(s, idx=_w(s, 0).end()) 338 end = _w(s, end).end() File /opt/miniforge3/envs/.gpt/lib/python3.10/json/decoder.py:355, in JSONDecoder.raw_decode(self, s, idx) 354 except StopIteration as err: --> 355 raise JSONDecodeError(\"Expecting value\", s, err.value) from None 356 return obj, end JSONDecodeError: Expecting value: line 1 column 1 (char 0) During handling of the above exception, another exception occurred: JSONDecodeError Traceback (most recent call last) Cell In[3], line 1 ----> 1 print(translate_func().invoke('text')) File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_core/runnables/base.py:2053, in RunnableSequence.invoke(self, input, config) 2051 try: 2052 for i, step in enumerate(self.steps): -> 2053 input = step.invoke( 2054 input, 2055 # mark each step as a child run 2056 patch_config( 2057 config, callbacks=run_manager.get_child(f\"seq:step:{i+1}\") 2058 ), 2059 ) 2060 # finish the root run 2061 except BaseException as e: File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py:165, in BaseChatModel.invoke(self, input, config, stop, kwargs) 154 def invoke( 155 self, 156 input: LanguageModelInput, (...) 160 kwargs: Any, 161 ) -> BaseMessage: 162 config = ensure_config(config) 163 return cast( 164 ChatGeneration, --> 165 self.generate_prompt( 166 [self._convert_input(input)], 167 stop=stop, 168 callbacks=config.get(\"callbacks\"), 169 tags=config.get(\"tags\"), 170 metadata=config.get(\"metadata\"), 171 run_name=config.get(\"run_name\"), 172 kwargs, 173 ).generations[0][0], 174 ).message File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py:543, in BaseChatModel.generate_prompt(self, prompts, stop, callbacks, kwargs) 535 def generate_prompt( 536 self, 537 prompts: List[PromptValue], (...) 540 kwargs: Any, 541 ) -> LLMResult: 542 prompt_messages = [p.to_messages() for p in prompts] --> 543 return self.generate(prompt_messages, stop=stop, callbacks=callbacks, kwargs) File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py:407, in BaseChatModel.generate(self, messages, stop, callbacks, tags, metadata, run_name, kwargs) 405 if run_managers: 406 run_managers[i].on_llm_error(e, response=LLMResult(generations=[])) --> 407 raise e 408 flattened_outputs = [ 409 LLMResult(generations=[res.generations], llm_output=res.llm_output) 410 for res in results 411 ] 412 llm_output = self._combine_llm_outputs([res.llm_output for res in results]) File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py:397, in BaseChatModel.generate(self, messages, stop, callbacks, tags, metadata, run_name, kwargs) 394 for i, m in enumerate(messages): 395 try: 396 results.append( --> 397 self._generate_with_cache( 398 m, 399 stop=stop, 400 run_manager=run_managers[i] if run_managers else None, 401 kwargs, 402 ) 403 ) 404 except BaseException as e: 405 if run_managers: File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py:576, in BaseChatModel._generate_with_cache(self, messages, stop, run_manager, kwargs) 572 raise ValueError( 573 \"Asked to cache, but no cache found at `langchain.cache`.\" 574 ) 575 if new_arg_supported: --> 576 return self._generate( 577 messages, stop=stop, run_manager=run_manager, kwargs 578 ) 579 else: 580 return self._generate(messages, stop=stop, kwargs) File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_community/chat_models/ollama.py:250, in ChatOllama._generate(self, messages, stop, run_manager, kwargs) 226 def _generate( 227 self, 228 messages: List[BaseMessage], (...) 231 kwargs: Any, 232 ) -> ChatResult: 233 \"\"\"Call out to Ollama's generate endpoint. 234 235 Args: (...) 247 ]) 248 \"\"\" --> 250 final_chunk = self._chat_stream_with_aggregation( 251 messages, 252 stop=stop, 253 run_manager=run_manager, 254 verbose=self.verbose, 255 kwargs, 256 ) 257 chat_generation = ChatGeneration( 258 message=AIMessage(content=final_chunk.text), 259 generation_info=final_chunk.generation_info, 260 ) 261 return ChatResult(generations=[chat_generation]) File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_community/chat_models/ollama.py:183, in ChatOllama._chat_stream_with_aggregation(self, messages, stop, run_manager, verbose, kwargs) 174 def _chat_stream_with_aggregation( 175 self, 176 messages: List[BaseMessage], (...) 180 kwargs: Any, 181 ) -> ChatGenerationChunk: 182 final_chunk: Optional[ChatGenerationChunk] = None --> 183 for stream_resp in self._create_chat_stream(messages, stop, kwargs): 184 if stream_resp: 185 chunk = _chat_stream_response_to_chat_generation_chunk(stream_resp) File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_community/chat_models/ollama.py:156, in ChatOllama._create_chat_stream(self, messages, stop, kwargs) 147 def _create_chat_stream( 148 self, 149 messages: List[BaseMessage], 150 stop: Optional[List[str]] = None, 151 kwargs: Any, 152 ) -> Iterator[str]: 153 payload = { 154 \"messages\": self._convert_messages_to_ollama_messages(messages), 155 } --> 156 yield from self._create_stream( 157 payload=payload, stop=stop, api_url=f\"{self.base_url}/api/chat/\", kwargs 158 ) File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_community/llms/ollama.py:234, in _OllamaCommon._create_stream(self, api_url, payload, stop, kwargs) 228 raise OllamaEndpointNotFoundError( 229 \"Ollama call failed with status code 404. \" 230 \"Maybe your model is not found \" 231 f\"and you should pull the model with `ollama pull {self.model}`.\" 232 ) 233 else: --> 234 optional_detail = response.json().get(\"error\") 235 raise ValueError( 236 f\"Ollama call failed with status code {response.status_code}.\" 237 f\" Details: {optional_detail}\" 238 ) 239 return response.iter_lines(decode_unicode=True) File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/requests/models.py:975, in Response.json(self, kwargs) 971 return complexjson.loads(self.text, kwargs) 972 except JSONDecodeError as e: 973 # Catch JSON-related errors and raise as requests.JSONDecodeError 974 # This aliases json.JSONDecodeError and simplejson.JSONDecodeError --> 975 raise RequestsJSONDecodeError(e.msg, e.doc, e.pos) JSONDecodeError: Expecting value: line 1 column 1 (char 0)" } ``` </details> <details> <summary><b>Log after a fix:</b></summary> ``` { "name": "ValueError", "message": "Ollama call failed with status code 401. Details: <html>\r <head><title>401 Authorization Required</title></head>\r <body>\r <center><h1>401 Authorization Required</h1></center>\r <hr><center>nginx/1.18.0 (Ubuntu)</center>\r </body>\r </html>\r ", "stack": "--------------------------------------------------------------------------- ValueError Traceback (most recent call last) Cell In[2], line 1 ----> 1 print(translate_func().invoke('text')) File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_core/runnables/base.py:2053, in RunnableSequence.invoke(self, input, config) 2051 try: 2052 for i, step in enumerate(self.steps): -> 2053 input = step.invoke( 2054 input, 2055 # mark each step as a child run 2056 patch_config( 2057 config, callbacks=run_manager.get_child(f\"seq:step:{i+1}\") 2058 ), 2059 ) 2060 # finish the root run 2061 except BaseException as e: File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py:165, in BaseChatModel.invoke(self, input, config, stop, kwargs) 154 def invoke( 155 self, 156 input: LanguageModelInput, (...) 160 kwargs: Any, 161 ) -> BaseMessage: 162 config = ensure_config(config) 163 return cast( 164 ChatGeneration, --> 165 self.generate_prompt( 166 [self._convert_input(input)], 167 stop=stop, 168 callbacks=config.get(\"callbacks\"), 169 tags=config.get(\"tags\"), 170 metadata=config.get(\"metadata\"), 171 run_name=config.get(\"run_name\"), 172 kwargs, 173 ).generations[0][0], 174 ).message File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py:543, in BaseChatModel.generate_prompt(self, prompts, stop, callbacks, kwargs) 535 def generate_prompt( 536 self, 537 prompts: List[PromptValue], (...) 540 kwargs: Any, 541 ) -> LLMResult: 542 prompt_messages = [p.to_messages() for p in prompts] --> 543 return self.generate(prompt_messages, stop=stop, callbacks=callbacks, kwargs) File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py:407, in BaseChatModel.generate(self, messages, stop, callbacks, tags, metadata, run_name, kwargs) 405 if run_managers: 406 run_managers[i].on_llm_error(e, response=LLMResult(generations=[])) --> 407 raise e 408 flattened_outputs = [ 409 LLMResult(generations=[res.generations], llm_output=res.llm_output) 410 for res in results 411 ] 412 llm_output = self._combine_llm_outputs([res.llm_output for res in results]) File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py:397, in BaseChatModel.generate(self, messages, stop, callbacks, tags, metadata, run_name, kwargs) 394 for i, m in enumerate(messages): 395 try: 396 results.append( --> 397 self._generate_with_cache( 398 m, 399 stop=stop, 400 run_manager=run_managers[i] if run_managers else None, 401 kwargs, 402 ) 403 ) 404 except BaseException as e: 405 if run_managers: File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py:576, in BaseChatModel._generate_with_cache(self, messages, stop, run_manager, kwargs) 572 raise ValueError( 573 \"Asked to cache, but no cache found at `langchain.cache`.\" 574 ) 575 if new_arg_supported: --> 576 return self._generate( 577 messages, stop=stop, run_manager=run_manager, kwargs 578 ) 579 else: 580 return self._generate(messages, stop=stop, kwargs) File /opt/miniforge3/envs/.gpt/lib/python3.10/site-packages/langchain_community/chat_models/ollama.py:250, in ChatOllama._generate(self, messages, stop, run_manager, kwargs) 226 def _generate( 227 self, 228 messages: List[BaseMessage], (...) 231 kwargs: Any, 232 ) -> ChatResult: 233 \"\"\"Call out to Ollama's generate endpoint. 234 235 Args: (...) 247 ]) 248 \"\"\" --> 250 final_chunk = self._chat_stream_with_aggregation( 251 messages, 252 stop=stop, 253 run_manager=run_manager, 254 verbose=self.verbose, 255 kwargs, 256 ) 257 chat_generation = ChatGeneration( 258 message=AIMessage(content=final_chunk.text), 259 generation_info=final_chunk.generation_info, 260 ) 261 return ChatResult(generations=[chat_generation]) File /storage/gpt-project/Repos/repo_nikita/gpt_lib/langchain/ollama.py:328, in ChatOllamaCustom._chat_stream_with_aggregation(self, messages, stop, run_manager, verbose, kwargs) 319 def _chat_stream_with_aggregation( 320 self, 321 messages: List[BaseMessage], (...) 325 kwargs: Any, 326 ) -> ChatGenerationChunk: 327 final_chunk: Optional[ChatGenerationChunk] = None --> 328 for stream_resp in self._create_chat_stream(messages, stop, kwargs): 329 if stream_resp: 330 chunk = _chat_stream_response_to_chat_generation_chunk(stream_resp) File /storage/gpt-project/Repos/repo_nikita/gpt_lib/langchain/ollama.py:301, in ChatOllamaCustom._create_chat_stream(self, messages, stop, kwargs) 292 def _create_chat_stream( 293 self, 294 messages: List[BaseMessage], 295 stop: Optional[List[str]] = None, 296 kwargs: Any, 297 ) -> Iterator[str]: 298 payload = { 299 \"messages\": self._convert_messages_to_ollama_messages(messages), 300 } --> 301 yield from self._create_stream( 302 payload=payload, stop=stop, api_url=f\"{self.base_url}/api/chat\", kwargs 303 ) File /storage/gpt-project/Repos/repo_nikita/gpt_lib/langchain/ollama.py:134, in _OllamaCommonCustom._create_stream(self, api_url, payload, stop, **kwargs) 132 else: 133 optional_detail = response.text --> 134 raise ValueError( 135 f\"Ollama call failed with status code {response.status_code}.\" 136 f\" Details: {optional_detail}\" 137 ) 138 return response.iter_lines(decode_unicode=True) ValueError: Ollama call failed with status code 401. Details: <html>\r <head><title>401 Authorization Required</title></head>\r <body>\r <center><h1>401 Authorization Required</h1></center>\r <hr><center>nginx/1.18.0 (Ubuntu)</center>\r </body>\r </html>\r " } ``` </details> The same is true for timeout errors or when you simply mistyped in `base_url` arg and get response from some other service, for instance. Real Ollama errors are still clearly readable: ``` ValueError: Ollama call failed with status code 400. Details: {"error":"invalid options: unknown_option"} ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-01 12:17:29 -08:00
Yudhajit Sinha	e2b901c35b	community[patch]: chat message histrory mypy fix (#18250 ) Description: Fixed type: ignore's for mypy for chat_message_histories(streamlit) Adresses #17048 Planning to add more based on reviews	2024-03-01 12:17:18 -08:00
Gabriel Altay	b9416dc96a	docs: update pinecone README to use PineconeVectorStore (#18170 )	2024-03-01 12:12:52 -08:00
老阿張	1701f7b8e9	docs: Fix typo in baidu_qianfan_endpoint.ipynb & baidu_qianfan_endpoint.ipynb (#18176 ) Description: "sucessfully should be successfully "? 🤔 Issue: Typo Dependencies: Nope Twitter handle: laoazhang	2024-03-01 12:10:23 -08:00
Hemslo Wang	58a2abf089	community[patch]: fix RecursiveUrlLoader metadata_extractor return type (#18193 ) Description: Fix `metadata_extractor` type for `RecursiveUrlLoader`, the default `_metadata_extractor` returns `dict` instead of `str`. Issue: N/A Dependencies: N/A Twitter handle: N/A Signed-off-by: Hemslo Wang <hemslo.wang@gmail.com>	2024-03-01 12:08:20 -08:00
Maxime Perrin	98380cff9b	community[patch]: removing "response_mode" parameter in llama_index retriever (#18180 ) - Description: Removing this line ```python response = index.query(query, response_mode="no_text", self.query_kwargs) ``` to ```python response = index.query(query, self.query_kwargs) ``` Since llama index query does not support response_mode anymore : ``` \| TypeError: BaseQueryEngine.query() got an unexpected keyword argument 'response_mode'```` - Twitter handle: @maximeperrin_ --------- Co-authored-by: Maxime Perrin <mperrin@doing.fr>	2024-03-01 12:05:09 -08:00
Leonid Kuligin	e080281623	docs: cookbook on gemma integrations (#18213 ) - [ ] PR title: "cookbook: using Gemma on LangChain" - [ ] PR message: - Description: added a tutorial how to use Gemma with LangChain (from VertexAI or locally from Kaggle or HF) - Dependencies: langchain-google-vertexai==0.0.7 - Twitter handle: lkuligin	2024-03-01 11:50:55 -08:00
Christophe Bornet	177f51c7bd	community: Use default load() implementation in doc loaders (#18385 ) Following https://github.com/langchain-ai/langchain/pull/18289	2024-03-01 14:46:52 -05:00
William De Vena	42341bc787	infra: fake model invoke callback prior to yielding token (#18286 ) ## PR title core[patch]: Invoke callback prior to yielding ## PR message Description: Invoke on_llm_new_token callback prior to yielding token in _stream and _astream methods. Issue: https://github.com/langchain-ai/langchain/issues/16913 Dependencies: None Twitter handle: None	2024-03-01 11:46:18 -08:00
Ikko Eltociear Ashimine	31b4e78174	docs: fix typo in milvus.ipynb (#18373 ) retreival -> retrieval	2024-03-01 11:22:39 -08:00
Tabby	dd6f85caf1	docs: Update Google El Carro for Oracle Workload Documentation. (#18394 ) In this commit we update the documentation for Google El Carro for Oracle Workloads. We amend the documentation in the Google Providers page to use the correct name which is El Carro for Oracle Workloads. We also add changes to the document_loaders and memory pages to reflect changes we made in our repo.	2024-03-01 11:21:35 -08:00
mwmajewsk	e192f6b6eb	community[patch]: fix, better error message in deeplake vectoriser (#18397 ) If the document loader recieves Pathlib path instead of str, it reads the file correctly, but the problem begins when the document is added to Deeplake. This problem arises from casting the path to str in the metadata. ```python deeplake = True fname = Path('./lorem_ipsum.txt') loader = TextLoader(fname, encoding="utf-8") docs = loader.load_and_split() text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=100) chunks= text_splitter.split_documents(docs) if deeplake: db = DeepLake(dataset_path=ds_path, embedding=embeddings, token=activeloop_token) db.add_documents(chunks) else: db = Chroma.from_documents(docs, embeddings) ``` So using this snippet of code the error message for deeplake looks like this: ``` [part of error message omitted] Traceback (most recent call last): File "/home/mwm/repositories/sources/fixing_langchain/main.py", line 53, in <module> db.add_documents(chunks) File "/home/mwm/repositories/sources/langchain/libs/core/langchain_core/vectorstores.py", line 139, in add_documents return self.add_texts(texts, metadatas, **kwargs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/mwm/repositories/sources/langchain/libs/community/langchain_community/vectorstores/deeplake.py", line 258, in add_texts return self.vectorstore.add( ^^^^^^^^^^^^^^^^^^^^^ File "/home/mwm/anaconda3/envs/langchain/lib/python3.11/site-packages/deeplake/core/vectorstore/deeplake_vectorstore.py", line 226, in add return self.dataset_handler.add( ^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/mwm/anaconda3/envs/langchain/lib/python3.11/site-packages/deeplake/core/vectorstore/dataset_handlers/client_side_dataset_handler.py", line 139, in add dataset_utils.extend_or_ingest_dataset( File "/home/mwm/anaconda3/envs/langchain/lib/python3.11/site-packages/deeplake/core/vectorstore/vector_search/dataset/dataset.py", line 544, in extend_or_ingest_dataset extend( File "/home/mwm/anaconda3/envs/langchain/lib/python3.11/site-packages/deeplake/core/vectorstore/vector_search/dataset/dataset.py", line 505, in extend dataset.extend(batched_processed_tensors, progressbar=False) File "/home/mwm/anaconda3/envs/langchain/lib/python3.11/site-packages/deeplake/core/dataset/dataset.py", line 3247, in extend raise SampleExtendError(str(e)) from e.__cause__ deeplake.util.exceptions.SampleExtendError: Failed to append a sample to the tensor 'metadata'. See more details in the traceback. If you wish to skip the samples that cause errors, please specify `ignore_errors=True`. ``` Which is does not explain the error well enough. The same error for chroma looks like this ``` During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/home/mwm/repositories/sources/fixing_langchain/main.py", line 56, in <module> db = Chroma.from_documents(docs, embeddings) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/mwm/repositories/sources/langchain/libs/community/langchain_community/vectorstores/chroma.py", line 778, in from_documents return cls.from_texts( ^^^^^^^^^^^^^^^ File "/home/mwm/repositories/sources/langchain/libs/community/langchain_community/vectorstores/chroma.py", line 736, in from_texts chroma_collection.add_texts( File "/home/mwm/repositories/sources/langchain/libs/community/langchain_community/vectorstores/chroma.py", line 309, in add_texts raise ValueError(e.args[0] + "\n\n" + msg) ValueError: Expected metadata value to be a str, int, float or bool, got lorem_ipsum.txt which is a <class 'pathlib.PosixPath'> Try filtering complex metadata from the document using langchain_community.vectorstores.utils.filter_complex_metadata. ``` Which is way more user friendly, so I just added information about possible mismatch of the type in the error message, the same way it is covered in chroma https://github.com/langchain-ai/langchain/blob/master/libs/community/langchain_community/vectorstores/chroma.py#L224	2024-03-01 11:21:21 -08:00
Daniel Chico	7d962278f6	community[patch]: type ignore fixes (#18395 ) Related to #17048	2024-03-01 11:21:02 -08:00
Christophe Bornet	69be82c86d	community[patch]: Implement lazy_load() for CSVLoader (#18391 ) Covered by `test_csv_loader.py`	2024-03-01 11:17:08 -08:00
Bagatur	c54d6eb5da	fireworks[patch]: support "any" tool_choice (#18343 ) per https://readme.fireworks.ai/docs/function-calling	2024-03-01 11:12:28 -08:00
Leonid Ganeline	d937fa4f9c	docs: `Tutorials` update (#18230 ) A big update of the `Tutorials` page. Cleaned it up. Added several new resources.	2024-03-01 11:07:39 -08:00
Erick Friis	6afb135baa	astradb: move to langchain-datastax repo (#18354 )	2024-03-01 19:04:43 +00:00
Akash A Desai	b641be2edf	templates: Lanceb RAG template (#17809 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-01 18:52:50 +00:00
Guangdong Liu	760a16ff32	community[patch]: Fix ChatModel for sparkllm Bug. (#18375 ) PR message: *Delete this entire checklist* and replace with - Description: fix sparkllm paramer error - Issue: close #18370 - Dependencies: change `IFLYTEK_SPARK_APP_URL` to `IFLYTEK_SPARK_API_URL` - Twitter handle: No	2024-03-01 10:49:30 -08:00
Yujie Qian	cbb65741a7	community[patch]: Voyage AI updates default model and batch size (#17655 ) - Description: update the default model and batch size in VoyageEmbeddings - Issue: N/A - Dependencies: N/A - Twitter handle: N/A --------- Co-authored-by: fodizoltan <zoltan@conway.expert>	2024-03-01 10:22:24 -08:00
Shengsheng Huang	ae471a7dcb	community[minor]: add BigDL-LLM integrations (#17953 ) - Description: [`bigdl-llm`](https://github.com/intel-analytics/BigDL) is a library for running LLM on Intel XPU (from Laptop to GPU to Cloud) using INT4/FP4/INT8/FP8 with very low latency (for any PyTorch model). This PR adds bigdl-llm integrations to langchain. - Issue: NA - Dependencies: `bigdl-llm` library - Contribution maintainer: @shane-huang Examples added: - docs/docs/integrations/llms/bigdl.ipynb	2024-03-01 10:04:53 -08:00
Ethan Yang	f61cb8d407	community[minor]: Add openvino backend support (#11591 ) - Description: add openvino backend support by HuggingFace Optimum Intel, - Dependencies: “optimum[openvino]”, --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-03-01 10:04:24 -08:00
Leonid Ganeline	a89f007947	docs: `runnable` module description (#17966 ) Added a module description. Added `batch` description.	2024-03-01 10:01:32 -08:00
Leonid Ganeline	6d0af4e805	docs: nvidia: provider page update (#18054 ) Nvidia provider page is missing a Triton Inference Server package reference. Changes: - added the Triton Inference Server reference - copied the example notebook from the package into the doc files. - added the Triton Inference Server description and links, the link to the above example notebook - formatted page to the consistent format NOTE: It seems that the [example notebook](https://github.com/langchain-ai/langchain/blob/master/libs/partners/nvidia-trt/docs/llms.ipynb) was originally created in wrong place. It should be in the LangChain docs [here](https://github.com/langchain-ai/langchain/tree/master/docs/docs/integrations/llms). So, I've created a copy of this example. The original example is still in the nvidia-trt package.	2024-03-01 10:00:42 -08:00
RadhikaBansal97	8bafd2df5e	community[patch]: Change github endpoint in GithubLoader (#17622 ) Description- - Changed the GitHub endpoint as existing was not working and giving 404 not found error - Also the existing function was failing if file_filter is not passed as the tree api return all paths including directory as well, and when get_file_content was iterating over these path, the function was failing for directory as the api was returning list of files inside the directory, so added a condition to ignore the paths if it a directory - Fixes this issue - https://github.com/langchain-ai/langchain/issues/17453 Co-authored-by: Radhika Bansal <Radhika.Bansal@veritas.com>	2024-03-01 09:36:31 -08:00
Yufei (Benny) Chen	2b93206f02	fireworks[patch]: Fix fireworks async stream (#18372 ) - Description: Fix the async stream issue with Fireworks - Dependencies: fireworks >= 0.13.0 ``` tests/integration_tests/test_chat_models.py .......... [ 45%] tests/integration_tests/test_compile.py . [ 50%] tests/integration_tests/test_embeddings.py .. [ 59%] tests/integration_tests/test_llms.py ......... [100%] ``` ``` tests/unit_tests/test_embeddings.py . [ 16%] tests/unit_tests/test_imports.py . [ 33%] tests/unit_tests/test_llms.py .... [100%] ``` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-01 09:20:26 -08:00
William FH	1deb8cadd5	Add dataset version info (#18299 )	2024-02-29 22:00:44 -08:00
Anush	9d663f31fa	community[patch]: FastEmbed to latest (#18040 ) ## Description Updates the `langchain_community.embeddings.fastembed` provider as per the recent updates to [`FastEmbed`](https://github.com/qdrant/fastembed) library.	2024-02-29 21:15:51 -08:00
Jacob Lee	590d47bff4	docs[patch]: Add Neo4j GraphAcademy to tutorials section (#18353 )	2024-02-29 20:50:24 -07:00
Erick Friis	3c8a115e21	fireworks[patch]: remove custom async and stream implementations (#18363 )	2024-03-01 03:20:02 +00:00
Bagatur	4730ee2766	docs: update api ref nav (#18362 )	2024-02-29 19:04:56 -08:00
Bagatur	12f19b8a6a	infra: update create_api_rst (#18361 )	2024-02-29 19:04:44 -08:00
Erick Friis	1317578ad1	templates: use langchain-text-splitters (#18360 ) - deps - import - import	2024-03-01 03:00:58 +00:00
Bagatur	f220af3dce	docs: text splitters readme (#18359 )	2024-03-01 03:00:42 +00:00
Bagatur	0d7fb5f60a	langchain[patch]: langchain-text-splitters dep (#18357 )	2024-02-29 18:48:55 -08:00
Eugene Yurtsev	51b661cfe8	community[patch]: BaseLoader load method should just delegate to lazy_load (#18289 ) load() should just reference lazy_load()	2024-02-29 21:45:28 -05:00
Bagatur	5efb5c099f	text-splitters[minor], langchain[minor], community[patch], templates, docs: langchain-text-splitters 0.0.1 (#18346 )	2024-02-29 18:33:21 -08:00
Nuno Campos	7891934173	Fix missing labels (#18356 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-29 18:11:18 -08:00
William FH	fdab931fd3	[Core] Patch: rm dumpd of outputs from runnables/base (#18295 ) It obstructs evaluations when your return a pydantic object.	2024-02-29 18:04:53 -08:00
Erick Friis	c7d5ed6f5c	infra: tolerate partner package move in ci (#18355 )	2024-02-29 17:49:28 -08:00
William FH	f481cbb32d	fireworks[patch]: Fix fireworks bind tools (#18352 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-03-01 01:18:15 +00:00
Erick Friis	eefb49680f	multiple[patch]: fix deprecation versions (#18349 )	2024-02-29 16:58:33 -08:00
Erick Friis	11cb42c2c1	core[patch]: deprecation docstring with lib (#18350 )	2024-03-01 00:44:13 +00:00
Erick Friis	bce0684327	docs: airbyte deps note (#18243 )	2024-02-29 16:02:13 -08:00
Erick Friis	7bbff98dc7	mongodb[patch]: core 0.1.5 dep (#18348 )	2024-02-29 15:39:04 -08:00
Erick Friis	4e27e66938	infra: mongodb env vars (#18347 )	2024-02-29 15:24:28 -08:00
Jib	72bfc1d3db	mongodb[minor]: MongoDB Partner Package -- Porting MongoDBAtlasVectorSearch (#17652 ) This PR migrates the existing MongoDBAtlasVectorSearch abstraction from the `langchain_community` section to the partners package section of the codebase. - [x] Run the partner package script as advised in the partner-packages documentation. - [x] Add Unit Tests - [x] Migrate Integration Tests - [x] Refactor `MongoDBAtlasVectorStore` (autogenerated) to `MongoDBAtlasVectorSearch` - [x] ~Remove~ deprecate the old `langchain_community` VectorStore references. ## Additional Callouts - Implemented the `delete` method - Included any missing async function implementations - `amax_marginal_relevance_search_by_vector` - `adelete` - Added new Unit Tests that test for functionality of `MongoDBVectorSearch` methods - Removed [`del res[self._embedding_key]`](`e0c81e1cb0/libs/community/langchain_community/vectorstores/mongodb_atlas.py (L218)`) in `_similarity_search_with_score` function as it would make the `maximal_marginal_relevance` function fail otherwise. The `Document` needs to store the embedding key in metadata to work. Checklist: - [x] PR title: Please title your PR "package: description", where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message - [x] Pass lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified to check that you're passing lint and testing. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ - [x] Add tests and docs: If you're adding a new integration, please include 1. Existing tests supplied in docs/docs do not change. Updated docstrings for new functions like `delete` 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. (This already exists) If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --------- Co-authored-by: Steven Silvester <steven.silvester@ieee.org> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-29 23:09:48 +00:00
William De Vena	412148773c	Updated partners/fireworks README (#18267 ) ## PR title partners: changed the README file for the Fireworks integration in the libs/partners/fireworks folder ## PR message Description: Changed the README file of partners/fireworks following the docs on https://python.langchain.com/docs/integrations/llms/Fireworks The README includes: - Brief description - Installation - Setting-up instructions (API key, model id, ...) - Basic usage Issue: https://github.com/langchain-ai/langchain/issues/17545 Dependencies: None Twitter handle: None	2024-02-29 14:55:03 -08:00
Kai Kugler	df234fb171	community[patch]: Fixing embedchain document mapping (#18255 ) - Description: The current embedchain implementation seems to handle document metadata differently than done in the current implementation of langchain and a KeyError is thrown. I would love for someone else to test this... --------- Co-authored-by: KKUGLER <kai.kugler@mercedes-benz.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Deshraj Yadav <deshraj@gatech.edu>	2024-02-29 14:54:37 -08:00
Erick Friis	040271f33a	community[patch]: remove llmlingua extended tests (#18344 )	2024-02-29 13:51:29 -08:00
William De Vena	87dca8e477	Updated partners/ibm README (#18268 ) ## PR title partners: changed the README file for the IBM Watson AI integration in the libs/partners/ibm folder. ## PR message Description: Changed the README file of partners/ibm following the docs on https://python.langchain.com/docs/integrations/llms/ibm_watsonx The README includes: - Brief description - Installation - Setting-up instructions (API key, project id, ...) - Basic usage: - Loading the model - Direct inference - Chain invoking - Streaming the model output Issue: https://github.com/langchain-ai/langchain/issues/17545 Dependencies: None Twitter handle: None --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: William FH <13333726+hinthornw@users.noreply.github.com>	2024-02-29 13:29:28 -08:00
Erick Friis	dfd9787388	infra: ci dirs in wrong order (#18340 )	2024-02-29 21:13:29 +00:00
Bagatur	9e46535ebc	core[patch]: Release 0.1.28 (#18341 )	2024-02-29 13:03:13 -08:00
Tomaz Bratanic	5999c4a240	Add support for parameters in neo4j retrieval query (#18310 ) Sometimes, you want to use various parameters in the retrieval query of Neo4j Vector to personalize/customize results. Before, when there were only predefined chains, it didn't really make sense. Now that it's all about custom chains and LCEL, it is worth adding since users can inject any params they wish at query time. Isn't prone to SQL injection-type attacks since we use parameters and not concatenating strings.	2024-02-29 13:00:54 -08:00
Hasan	15d1b73a00	Add optional output_parser param in create_react_agent (#18320 ) Description: Add facility to pass the optional output parser to customize the parsing logic --------- Co-authored-by: hasan <hasan@m2sys.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-29 12:35:43 -08:00
Bagatur	a6f0506aaf	docs: query analysis use case (#17766 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-29 12:33:49 -08:00
kkdamowang	6782dac420	docs: remove duplicate quote in AzureOpenAIEmbeddings doc (#18315 ) - Description: Remove duplicate quote in AzureOpenAIEmbeddings doc, remove trailing spaces. - Issue: No - Dependencies: No	2024-02-29 11:25:50 -08:00
Filip Schouwenaars	4c62362eab	Add links to relevant DataCamp code alongs (#18332 ) This PR adds links to some more free resources for people to get acquainted with Langhchain without having to configure their system. <!-- If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17. --> Co-authored-by: Filip Schouwenaars <filipsch@users.noreply.github.com>	2024-02-29 11:25:01 -08:00
Virat Singh	cd926ac3dd	community: Add PolygonFinancials Tool (#18324 ) Description: In this PR, I am adding a `PolygonFinancials` tool, which can be used to get financials data for a given ticker. The financials data is the fundamental data that is found in income statements, balance sheets, and cash flow statements of public US companies. Twitter: [@virattt](https://twitter.com/virattt)	2024-02-29 10:56:05 -08:00
Leonid Ganeline	d43fa2eab1	docs `providers` update (#18336 ) Formatted pages into a consistent form. Added descriptions and links when needed.	2024-02-29 10:53:12 -08:00
Erick Friis	68be5a7658	infra: skip ibm api docs (#18335 )	2024-02-29 10:16:57 -08:00
Erick Friis	43534a4c08	skip airbyte api docs (#18334 )	2024-02-29 09:57:52 -08:00
Bagatur	6a5b084704	docs: update func calling doc (#18300 )	2024-02-29 09:45:07 -08:00
Bagatur	68ad3414a2	experimental[patch]: Release 0.0.53 (#18330 )	2024-02-29 09:13:21 -08:00
William FH	8af4425abd	[Evaluation] Config Fix (#18231 )	2024-02-29 00:06:46 -08:00
Averi Kitsch	1b63530274	docs: update Google documentation (#18297 ) Description: update Google documentation Issue: Dependencies:	2024-02-29 01:42:44 +00:00
Leonid Ganeline	1d865a7e86	docs: `google` provider page fixes (#18290 ) Several URL-s were broken (in the yesterday PR). Like [Integrations/platforms/google/Document Loaders](https://python.langchain.com/docs/integrations/platforms/google#document-loaders) page, Example link to "Document Loaders / Cloud SQL for PostgreSQL" and most of the new example links in the Document Loaders, Vectorstores, Memory sections. - fixed URL-s (manually verified all example links) - sorted sections in page to follow the "integrations/components" menu item order. - fixed several page titles to fix Navbar item order --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-29 00:45:03 +00:00
William De Vena	0486404a74	langchain_openai[patch]: Invoke callback prior to yielding token (#18269 ) ## PR title langchain_openai[patch]: Invoke callback prior to yielding token ## PR message Description: Invoke callback prior to yielding token in _stream and _astream methods for langchain_openai. Issue: https://github.com/langchain-ai/langchain/issues/16913 Dependencies: None Twitter handle: None	2024-02-29 00:00:08 +00:00
William De Vena	5ee76fccd5	langchain_groq[patch]: Invoke callback prior to yielding token (#18272 ) ## PR title langchain_groq[patch]: Invoke callback prior to yielding ## PR message Description:Invoke callback prior to yielding token in _stream and _astream methods for groq. Issue: https://github.com/langchain-ai/langchain/issues/16913 Dependencies: None Twitter handle: None	2024-02-28 23:43:16 +00:00
aditya thomas	eb0c178d75	docs: update to the list of partner packages in the list of providers (#18252 ) Description: Update to the list of partner packages in the list of providers Issue: Google & Nvidia had two entries each, both pointing to the same page Dependencies: None	2024-02-28 15:40:14 -08:00
ccurme	9bf58ec7dd	update extraction use-case docs (#17979 ) Update extraction use-case docs to showcase and explain all modes of `create_structured_output_runnable`.	2024-02-28 17:32:04 -05:00
Christophe Bornet	8a81fcd5d3	community: Fix deprecation version of AstraDB VectorStore (#17991 )	2024-02-28 17:15:09 -05:00
Stefano Lottini	6d863bed51	partner[minor]: Astra DB clients identify themselves as coming through LangChain package (#18131 ) Description This PR sets the "caller identity" of the Astra DB clients used by the integration plugins (`AstraDBChatMessageHistory`, `AstraDBStore`, `AstraDBByteStore` and, pending #17767 , `AstraDBVectorStore`). In this way, the requests to the Astra DB Data API coming from within LangChain are identified as such (the purpose is anonymous usage stats to best improve the Astra DB service).	2024-02-28 17:13:22 -05:00
kkdamowang	4899a72b56	docs: remove duplicate word in lcel/streaming (#18249 ) - Description: Remove duplicate word in lcel/streaming. - Issue: No. - Dependencies: No.	2024-02-28 21:50:26 +00:00
mackong	2c42f3a955	ollama[patch]: delete suffix slash to avoid redirect (#18260 ) - Description: see [ollama](https://github.com/ollama/ollama/blob/main/server/routes.go#L949)'s route definitions - Issue: N/A - Dependencies: N/A	2024-02-28 16:44:48 -05:00
William De Vena	6b58943917	community[patch]: Invoke callback prior to yielding token (#18288 ) ## PR title community[patch]: Invoke callback prior to yielding PR message Description: Invoke on_llm_new_token callback prior to yielding token in _stream and _astream methods. Issue: https://github.com/langchain-ai/langchain/issues/16913 Dependencies: None Twitter handle: None	2024-02-28 21:40:53 +00:00
Brace Sproul	ca4f5e2408	ci: Update issue template required checks (#18283 )	2024-02-28 13:27:39 -08:00
William De Vena	23722e3653	langchain[patch]: Invoke callback prior to yielding token (#18282 ) ## PR title langchain[patch]: Invoke callback prior to yielding ## PR message Description: Invoke on_llm_new_token callback prior to yielding token in _stream and _astream methods in langchain/tests/fake_chat_model. Issue: https://github.com/langchain-ai/langchain/issues/16913 Dependencies: None Twitter handle: None	2024-02-28 16:15:02 -05:00
Eugene Yurtsev	cd52433ba0	community[minor]: Add `SQLDatabaseLoader` document loader (#18281 ) - Description: A generic document loader adapter for SQLAlchemy on top of LangChain's `SQLDatabaseLoader`. - Needed by: https://github.com/crate-workbench/langchain/pull/1 - Depends on: GH-16655 - Addressed to: @baskaryan, @cbornet, @eyurtsev Hi from CrateDB again, in the same spirit like GH-16243 and GH-16244, this patch breaks out another commit from https://github.com/crate-workbench/langchain/pull/1, in order to reduce the size of this patch before submitting it, and to separate concerns. To accompany the SQLAlchemy adapter implementation, the patch includes integration tests for both SQLite and PostgreSQL. Let me know if corresponding utility resources should be added at different spots. With kind regards, Andreas. ### Software Tests ```console docker compose --file libs/community/tests/integration_tests/document_loaders/docker-compose/postgresql.yml up ``` ```console cd libs/community pip install psycopg2-binary pytest -vvv tests/integration_tests -k sqldatabase ``` ``` 14 passed ``` ![image](https://github.com/langchain-ai/langchain/assets/453543/42be233c-eb37-4c76-a830-474276e01436) --------- Co-authored-by: Andreas Motl <andreas.motl@crate.io>	2024-02-28 21:02:28 +00:00
William De Vena	a37dc83a9e	langchain_anthropic[patch]: Invoke callback prior to yielding token (#18274 ) ## PR title langchain_anthropic[patch]: Invoke callback prior to yielding ## PR message - Description: Invoke callback prior to yielding token in _stream and _astream methods for anthropic. - Issue: https://github.com/langchain-ai/langchain/issues/16913 - Dependencies: None - Twitter handle: None	2024-02-28 20:19:22 +00:00
David Ruan	af35e2525a	community[minor]: add hugging_face_model document loader (#17323 ) - Description: add hugging_face_model document loader, - Issue: NA, - Dependencies: NA, --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-28 20:05:35 +00:00
Sanjaypranav V M	b9a495e56e	community[patch]: added latin-1 decoder to gmail search tool (#18116 ) some mails from flipkart , amazon are encoded with other plain text format so to handle UnicodeDecode error , added exception and latin decoder Thank you for contributing to LangChain! @hwchase17	2024-02-28 19:28:29 +00:00
Nuno Campos	6da08d0f22	Add PNG drawer for Runnable.get_graph() (#18239 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-28 11:25:19 -08:00
Nuno Campos	d9fd1194f5	Remove check preventing passing non-declared config keys (#18276 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-28 18:28:53 +00:00
William De Vena	7ac74f291e	langchain_nvidia_ai_endpoints[patch]: Invoke callback prior to yielding token (#18271 ) ## PR title langchain_nvidia_ai_endpoints[patch]: Invoke callback prior to yielding ## PR message Description: Invoke callback prior to yielding token in _stream and _astream methods for nvidia_ai_endpoints. Issue: https://github.com/langchain-ai/langchain/issues/16913 Dependencies: None	2024-02-28 18:10:57 +00:00
Erick Friis	b4f6066a57	docs: airbyte github cookbook (#18275 )	2024-02-28 18:04:15 +00:00
Ashley Xu	e3211c2b3d	community[patch]: BigQueryVectorSearch JSON type unsupported for metadatas (#18234 )	2024-02-28 08:19:53 -08:00
Jack Wotherspoon	92c34d4803	docs: update documentation for Google Cloud database integrations (#18265 ) Description: Fixing typos and rendering issues for Google Cloud database integrations. Issue: NA Dependencies: NA	2024-02-28 15:32:43 +00:00
Erick Friis	2e31f1c2f8	infra: api docs folder move (#18223 )	2024-02-28 07:10:27 -08:00
Mateusz Szewczyk	db643f6283	ibm[patch]: release 0.1.0 Add possibility to pass ModelInference or Model object to WatsonxLLM class (#18189 ) - Description: Add possibility to pass ModelInference or Model object to WatsonxLLM class - Dependencies: [ibm-watsonx-ai](https://pypi.org/project/ibm-watsonx-ai/), - Tag maintainer: : Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. ✅	2024-02-28 07:03:15 -08:00
Averi Kitsch	76eb553084	docs: add documentation for Google Cloud database integrations (#18225 ) Description: add documentation for Google Cloud database integrations Issue: NA Dependencies: NA	2024-02-27 21:17:30 -08:00
Erick Friis	d7a77054ed	airbyte[patch]: core version 0.1.5 (#18244 )	2024-02-27 19:54:43 -08:00
Erick Friis	be8d2ff5f7	airbyte[patch]: init pkg (#18236 )	2024-02-27 19:37:53 -08:00
Ayo Ayibiowu	ac1d7d9de8	community[feat]: Adds LLMLingua as a document compressor (#17711 ) Description: This PR adds support for using the [LLMLingua project ](https://github.com/microsoft/LLMLingua) especially the LongLLMLingua (Enhancing Large Language Model Inference via Prompt Compression) as a document compressor / transformer. The LLMLingua project is an interesting project that can greatly improve RAG system by compressing prompts and contexts while keeping their semantic relevance. Issue: https://github.com/microsoft/LLMLingua/issues/31 Dependencies: [llmlingua](https://pypi.org/project/llmlingua/) @baskaryan --------- Co-authored-by: Ayodeji Ayibiowu <ayodeji.ayibiowu@getinge.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-27 19:23:56 -08:00
Nuno Campos	a99eb3abf4	openai[patch]: Assign message id in ChatOpenAI (#17837 )	2024-02-27 17:32:54 -08:00
Isaac Francisco	733367b795	docs: deprecation of OpenAI functions agent, astream_events docstring (#18164 ) Co-authored-by: Hershenson, Isaac (Extern) <isaac.hershenson.extern@bayer04.de> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-27 09:14:53 -08:00
Harrison Chase	b0ccaf5917	Harrison/add structured output (#18165 )	2024-02-27 08:25:09 -08:00
Bagatur	242af4b5a4	openai[patch], mistral[patch], fireworks[patch]: releases 0.0.8, 0.0.5, 0.0.2 (#18186 )	2024-02-27 04:22:24 -08:00
Bagatur	7e66d964c6	core[patch]: Release 0.1.27 (#18159 )	2024-02-26 17:27:38 -08:00
Harrison Chase	d7c607ca00	core[minor]: move document compressor base (#17910 )	2024-02-26 17:20:50 -08:00
Bagatur	b3f4de38ae	mistral[minor]: Function calling and with_structured_output (#18150 ) ![Screenshot 2024-02-26 at 2 07 06 PM](https://github.com/langchain-ai/langchain/assets/22008038/20cacb47-3b24-45b5-871b-dd169f1acd37)	2024-02-26 16:22:30 -08:00
Bagatur	c53aa5cd37	core[patch]: support JS message serial namespaces (#18151 )	2024-02-26 16:19:46 -08:00
Harrison Chase	c673717c2b	add optimization notebook (#18155 )	2024-02-26 16:09:31 -08:00
Max Jakob	5ab69f907f	partners: add Elasticsearch package (#17467 ) ### Description This PR moves the Elasticsearch classes to a partners package. Note that we will not move (and later remove) `ElasticKnnSearch`. It were previously deprecated. `ElasticVectorSearch` is going to stay in the community package since it is used quite a lot still. Also note that I left the `ElasticsearchTranslator` for self query untouched because it resides in main `langchain` package. ### Dependencies There will be another PR that updates the notebooks (potentially pulling them into the partners package) and templates and removes the classes from the community package, see https://github.com/langchain-ai/langchain/pull/17468 #### Open question How to make the transition smooth for users? Do we move the import aliases and require people to install `langchain-elasticsearch`? Or do we remove the import aliases from the `langchain` package all together? What has worked well for other partner packages? --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-26 23:19:47 +00:00
matt haigh	a4896da2a0	Experimental: Add other threshold types to SemanticChunker (#16807 ) Description Adding different threshold types to the semantic chunker. I’ve had much better and predictable performance when using standard deviations instead of percentiles. ![image](https://github.com/langchain-ai/langchain/assets/44395485/066e84a8-460e-4da5-9fa1-4ff79a1941c5) For all the documents I’ve tried, the distribution of distances look similar to the above: positively skewed normal distribution. All skews I’ve seen are less than 1 so that explains why standard deviations perform well, but I’ve included IQR if anyone wants something more robust. Also, using the percentile method backwards, you can declare the number of clusters and use semantic chunking to get an ‘optimal’ splitting. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-26 13:50:48 -08:00
Jaskirat Singh	ce682f5a09	community: vectorstores.kdbai - Added support for when no docs are present (#18103 ) - Description: By default it expects a list but that's not the case in corner scenarios when there is no document ingested(use case: Bootstrap application). \ Hence added as check, if the instance is panda Dataframe instead of list then it will procced with return immediately. - Issue: NA - Dependencies: NA - Twitter handle: jaskiratsingh1 --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-26 12:47:06 -08:00
am-kinetica	9b8f6455b1	Langchain vectorstore integration with Kinetica (#18102 ) - Description: New vectorstore integration with the Kinetica database - Issue: - Dependencies: the Kinetica Python API `pip install gpudb==7.2.0.1`, - Tag maintainer: @baskaryan, @hwchase17 - Twitter handle: --------- Co-authored-by: Chad Juliano <cjuliano@kinetica.com>	2024-02-26 12:46:48 -08:00
Bagatur	1e8ab83d7b	langchain[patch], core[patch], openai[patch], fireworks[minor]: ChatFireworks.with_structured_output (#18078 ) <img width="1192" alt="Screenshot 2024-02-24 at 3 39 39 PM" src="https://github.com/langchain-ai/langchain/assets/22008038/1cf74774-a23f-4b06-9b9b-85dfa2f75b63">	2024-02-26 12:46:39 -08:00
GoodBai	3589a135ef	community: make `SET allow_experimental_[engine]_index` configurabe in vectorstores.clickhouse (#18107 ) ## Description & Issue While following the official doc to use clickhouse as a vectorstore, I found only the default `annoy` index is properly supported. But I want to try another engine `usearch` for `annoy` is not properly supported on ARM platforms. Here is the settings I prefer: ``` python settings = ClickhouseSettings( table="wiki_Ethereum", index_type="usearch", # annoy by default index_param=[], ) ``` The above settings do not work for the command `set allow_experimental_annoy_index=1` is hard-coded. This PR will make sure the experimental feature follow the `index_type` which is also consistent with Clickhouse's naming conventions.	2024-02-26 12:39:17 -08:00
Dan Stambler	69344a0661	community: Add Laser Embedding Integration (#18111 ) - Description: Added Integration with Meta AI's LASER Language-Agnostic SEntence Representations embedding library, which supports multilingual embedding for any of the languages listed here: https://github.com/facebookresearch/flores/blob/main/flores200/README.md#languages-in-flores-200, including several low resource languages - Dependencies: laser_encoders	2024-02-26 12:16:37 -08:00
Erick Friis	257879e98d	infra: api docs setup action location (#18148 )	2024-02-26 11:50:21 -08:00
Erick Friis	28cf3aab45	infra: api docs build commit dir (#18147 )	2024-02-26 11:47:04 -08:00
Heidi Steen	166f3d8351	Docs: azuresearch.ipynb (in docs/docs/integrations/vectorstores) -- fixed headings and comments (#18135 ) This PR updates azuresearch.ipynb with an edit to the introduction sentence, consistent heading levels, and disambiguation in code comments.	2024-02-26 11:46:55 -08:00
Luan Fernandes	e867557936	[docs] Update doc-string for buffer_as_messages method in ConversationBufferWindowMemory (#18136 ) minor fix stated in #18080	2024-02-26 11:46:43 -08:00
Barun Amalkumar Halder	23fc7c8c90	docs [patch] : fix import to use community path for handler in fiddler notebook (#18140 ) Description: Update the example fiddler notebook to use community path, instead of langchain.callback Dependencies: None Twitter handle: @bhalder Co-authored-by: Barun Halder <barun@fiddler.ai>	2024-02-26 11:41:07 -08:00
Bagatur	767523f364	core[patch], langchain[patch], templates: move openai functions parsers to core (#18060 ) ![Screenshot 2024-02-23 at 7 48 03 PM](https://github.com/langchain-ai/langchain/assets/22008038/e5540c4d-0020-4ece-869f-ae19db2a1f3f)	2024-02-26 11:12:53 -08:00
Bagatur	96bff0ed5d	infra: create api rst for specific pkg (#18144 ) Example: create rst for libs/core only ```bash poetry run python docs/api_reference/create_api_rst.py core ```	2024-02-26 11:04:22 -08:00
Nuno Campos	cd3ab3703b	Improve runnable generator error messages (#18142 ) h/t @hinthornw Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-26 18:54:25 +00:00
Nuno Campos	62a30efb12	Fix bug with using configurable_fields after configurable_alternatives (#18139 ) Closes #17915	2024-02-26 10:27:07 -08:00
Erick Friis	f5cf6975ba	docs: anthropic partner package docs (#18109 )	2024-02-26 17:51:44 +00:00
Nuno Campos	b1d9ce541d	Add BaseMessage.id (#17835 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-26 09:27:47 -08:00
Harrison Chase	935aefa8db	add run name for query constructor (#18101 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-26 08:17:05 -08:00
Mohammad Mohtashim	719a1cde75	langchain[patch]: Update doc-string for a method in ConversationBufferWindowMemory (#18090 ) A minor doc fix stated in #18080	2024-02-26 10:15:02 -05:00
Simon Schmidt	2716d58603	langchain: Import from langchain_core in langchain.smith to avoid deprecation warning (#18129 ) Avoids deprecation warning that triggered at import time, e.g. with `python -c 'import langchain.smith'` /opt/venv/lib/python3.12/site-packages/langchain/callbacks/__init__.py:37: LangChainDeprecationWarning: Importing this callback from langchain is deprecated. Importing it from langchain will no longer be supported as of langchain==0.2.0. Please import from langchain-community instead: `from langchain_community.callbacks import base`. To install langchain-community run `pip install -U langchain-community`.	2024-02-26 10:14:10 -05:00
rongchenlin	9147a437f1	docs: Fix the bug in MongoDBChatMessageHistory notebook (#18128 ) I tried to configure MongoDBChatMessageHistory using the code from the original documentation to store messages based on the passed session_id in MongoDB. However, this configuration did not take effect, and the session id in the database remained as 'test_session'. To resolve this issue, I found that when configuring MongoDBChatMessageHistory, it is necessary to set session_id=session_id instead of session_id=test_session. Issue: DOC: Ineffective Configuration of MongoDBChatMessageHistory for Custom session_id Storage previous code： ```python chain_with_history = RunnableWithMessageHistory( chain, lambda session_id: MongoDBChatMessageHistory( session_id="test_session", connection_string="mongodb://root:Y181491117cLj@123.56.224.232:27017", database_name="my_db", collection_name="chat_histories", ), input_messages_key="question", history_messages_key="history", ) config = {"configurable": {"session_id": "mmm"}} chain_with_history.invoke({"question": "Hi! I'm bob"}, config) ``` ![image](https://github.com/langchain-ai/langchain/assets/83388493/c372f785-1ec1-43f5-8d01-b7cc07b806b7) Modified code: ```python chain_with_history = RunnableWithMessageHistory( chain, lambda session_id: MongoDBChatMessageHistory( session_id=session_id, # here is my modify code connection_string="mongodb://root:Y181491117cLj@123.56.224.232:27017", database_name="my_db", collection_name="chat_histories", ), input_messages_key="question", history_messages_key="history", ) config = {"configurable": {"session_id": "mmm"}} chain_with_history.invoke({"question": "Hi! I'm bob"}, config) ``` Effect after modification (it works)： ![image](https://github.com/langchain-ai/langchain/assets/83388493/5776268c-9098-4da3-bf41-52825be5fafb)	2024-02-26 15:02:56 +00:00
Erick Friis	e3b7779926	docs: api docs for external repos (#17904 ) Stacked on google removal PR. Will make google continue to show up in API docs even from external repo	2024-02-26 06:19:09 +00:00
Erick Friis	248c5b84ee	google-genai, google-vertexai: move to langchain-google (#17899 ) These packages have moved to https://github.com/langchain-ai/langchain-google Left tombstone readmes incase anyone ends up at the "Source Code" link from old pypi releases. Can keep these around for a few months.	2024-02-25 21:58:05 -08:00
Erick Friis	3b5bdbfee8	anthropic[minor]: package move (#17974 )	2024-02-25 21:57:26 -08:00
Christophe Bornet	a2d5fa7649	community[patch]: Fix GenericRequestsWrapper _aget_resp_content must be async (#18065 ) There are existing tests in `libs/community/tests/unit_tests/tools/requests/test_tool.py`	2024-02-25 19:07:07 -08:00
Neli Hateva	a01e8473f8	community[patch]: Fix GraphSparqlQAChain so that it works with Ontotext GraphDB (#15009 ) - Description: Introduce a new parameter `graph_kwargs` to `RdfGraph` - parameters used to initialize the `rdflib.Graph` if `query_endpoint` is set. Also, do not set `rdflib.graph.DATASET_DEFAULT_GRAPH_ID` as default value for the `rdflib.Graph` `identifier` if `query_endpoint` is set. - Issue: N/A - Dependencies: N/A - Twitter handle: N/A	2024-02-25 19:05:21 -08:00
Christophe Bornet	4d6cd5b46a	astradb[patch]: Use astrapy's upsert_one method in AstraDBStore (#18063 ) As `upsert` is deprecated	2024-02-25 19:04:18 -08:00
Danny McAteer	e42110f720	docs: Additional examples for partners/exa README (#18081 ) Description: Add additional examples for other modules to partners/exa README Issue: #17545 Dependencies: None Twitter handle: @DannyMcAteer8 --------- Co-authored-by: Daniel McAteer <danielmcateer@Daniels-MBP.attlocal.net> Co-authored-by: Daniel McAteer <danielmcateer@Daniels-MacBook-Pro.local>	2024-02-25 18:53:47 -08:00
dokato	5afb242161	langchain[patch]: Make BooleanOutputParser more robust to non-binary responses (#17810 ) - Description: I encountered this error when I tried to use LLMChainFilter. Even if the message slightly differs, like `Not relevant (NO)` this results in an error. It has been reported already here: https://github.com/langchain-ai/langchain/issues/. This change hopefully makes it more robust. - Issue: #11408 - Dependencies: No - Twitter handle: dokatox	2024-02-25 18:48:33 -08:00
Matt	3b08617a89	docs: update azure search langchain notebook (#18053 ) Description: Update the azure search notebook to have more descriptive comments, and an option to choose between OpenAI and AzureOpenAI Embeddings --------- Co-authored-by: Matt Gotteiner <[email protected]> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-25 18:48:13 -08:00
kYLe	17ecf6e119	community[patch]: Remove model limitation on Anyscale LLM (#17662 ) Description: Llama Guard is deprecated from Anyscale public endpoint. Issue: Change the default model. and remove the limitation of only use Llama Guard with Anyscale LLMs Anyscale LLM can also works with all other Chat model hosted on Anyscale. Also added `async_client` for Anyscale LLM	2024-02-25 18:21:19 -08:00
Barun Amalkumar Halder	cc69976860	community[minor] : adds callback handler for Fiddler AI (#17708 ) Description: Callback handler to integrate fiddler with langchain. This PR adds the following - 1. `FiddlerCallbackHandler` implementation into langchain/community 2. Example notebook `fiddler.ipynb` for usage documentation [Internal Tracker : FDL-14305] Issue: NA Dependencies: - Installation of langchain-community is unaffected. - Usage of FiddlerCallbackHandler requires installation of latest fiddler-client (2.5+) Twitter handle: @fiddlerlabs @behalder Co-authored-by: Barun Halder <barun@fiddler.ai>	2024-02-25 18:17:03 -08:00
Christophe Bornet	b8b5ce0c8c	astradb: Add AstraDBChatMessageHistory to langchain-astradb package (#17732 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-25 18:14:49 -08:00
Maxime Perrin	c06a8732aa	community[patch]: fix llama index imports and fields access (#17870 ) - Description: Fixing outdated imports after v0.10 llama index update and updating metadata and source text access - Issue: #17860 - Twitter handle: @maximeperrin_ --------- Co-authored-by: Maxime Perrin <mperrin@doing.fr>	2024-02-25 18:14:23 -08:00
BeatrixCohere	5d2d80a9a8	docs: Add Cohere examples in documentation (#17794 ) - Description: Add cohere examples to documentation - Issue:N/A - Dependencies: N/A --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-25 18:10:09 -08:00
Jacob Lee	c9eac3287e	docs[patch]: Remove redundant Pinecone import (#18079 ) CC @efriis	2024-02-24 19:27:54 -08:00
2jimoo	7fc903464a	community: Add document manager and mongo document manager (#17320 ) - Description: - Add DocumentManager class, which is a nosql record manager. - In order to use index and aindex in libs/langchain/langchain/indexes/_api.py, DocumentManager inherits RecordManager. - Also I added the MongoDB implementation of Document Manager too. - Dependencies: pymongo, motor <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: Add DocumentManager class, which is a no sql record manager. To use index method and aindex method in indexes._api.py, Document Manager inherits RecordManager.Add the MongoDB implementation of Document Manager. - Dependencies: pymongo, motor Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-23 21:32:52 -05:00
Leonid Ganeline	3f6bf852ea	experimental: docstrings update (#18048 ) Added missed docstrings. Formatted docsctrings to the consistent format.	2024-02-23 21:24:16 -05:00
kYLe	56b955fc31	community[minor]: Add async_client for Anyscale Chat model (#18050 ) Add `async_client` for Anyscale Chat_model	2024-02-23 21:22:54 -05:00
Eugene Yurtsev	68527b809d	core[patch]: Runnable with message history to use add_messages (#17958 ) This PR updates RunnableWithMessageHistory to use add_messages which will save on round-trips for any chat history abstractions that implement the optimization. If the optimization isn't implemented, add_messages automatically invokes add_message serially.	2024-02-23 21:19:38 -05:00
Bagatur	1c1bb1152e	openai[patch]: refactor with_structured_output (#18052 ) - make schema Optional with default val None, since in json_mode you don't need it if not parsing to pydantic - change return_type -> include_raw - expand docstring examples	2024-02-23 17:02:11 -08:00
Erick Friis	e85948d46b	docs: fireworks tool calling docs (#18057 )	2024-02-24 00:49:11 +00:00
Erick Friis	e566a3077e	infra: simplify and fix CI for docs-only changes (#18058 ) Current success check will fail on docs-only changes	2024-02-23 16:39:08 -08:00
Erick Friis	1a3383fba1	docs: fireworks fixes (#18056 )	2024-02-23 15:58:53 -08:00
Erick Friis	a05fb19f42	openai[patch]: remove numpy dep (#18034 )	2024-02-23 21:12:05 +00:00
Danny McAteer	e8be34f8c7	exa[patch]: update readme (#18047 )	2024-02-23 21:05:42 +00:00
Yufei (Benny) Chen	ee6a773456	fireworks[patch]: Add Fireworks partner packages (#17694 ) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-23 20:45:47 +00:00
Erick Friis	11cf95e810	docs: recommend lambdas over runnablebranch (#18033 )	2024-02-23 11:34:27 -08:00
Erick Friis	9ebbca3695	infra: CI success for partner packages 2 (#18043 )	2024-02-23 11:10:39 -08:00
Erick Friis	b948f6da67	infra: CI success for partner packages (#18037 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-23 11:00:48 -08:00
Bagatur	22b964f802	community[patch]: Release 0.0.24 (#18038 )	2024-02-23 10:49:29 -08:00
Erick Friis	29e0445490	community[patch]: BaseLLM typing in init (#18029 )	2024-02-23 17:51:27 +00:00
Nicolò Boschi	4c132b4cc6	community: fix openai streaming throws 'AIMessageChunk' object has no attribute 'text' (#18006 ) After upgrading langchain-community to 0.0.22, it's not possible to use openai from the community package with streaming=True ``` File "/home/runner/work/ragstack-ai/ragstack-ai/ragstack-e2e-tests/.tox/langchain/lib/python3.11/site-packages/langchain_community/chat_models/openai.py", line 434, in _generate return generate_from_stream(stream_iter) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/runner/work/ragstack-ai/ragstack-ai/ragstack-e2e-tests/.tox/langchain/lib/python3.11/site-packages/langchain_core/language_models/chat_models.py", line 65, in generate_from_stream for chunk in stream: File "/home/runner/work/ragstack-ai/ragstack-ai/ragstack-e2e-tests/.tox/langchain/lib/python3.11/site-packages/langchain_community/chat_models/openai.py", line 418, in _stream run_manager.on_llm_new_token(chunk.text, chunk=cg_chunk) ^^^^^^^^^^ AttributeError: 'AIMessageChunk' object has no attribute 'text' ``` Fix regression of https://github.com/langchain-ai/langchain/pull/17907 Twitter handle: @nicoloboschi	2024-02-23 12:12:47 -05:00
Bagatur	9b982b2aba	community[patch]: Release 0.0.23 (#18027 )	2024-02-23 08:54:31 -08:00
Guangdong Liu	4197efd67a	community: Fix SparkLLM error (#18015 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - Description: fix SparkLLM error - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out!	2024-02-23 06:40:29 -08:00
Bagatur	d9e6ca2279	lanchain[patch]: Release 0.1.9 (#17999 )	2024-02-22 21:45:30 -08:00
Bagatur	b46d6b04e1	community[patch]: Release 0.0.22 (#17994 )	2024-02-22 21:35:04 -08:00
Bagatur	cc0290fdf3	openai[patch]: Release 0.0.7 (#17993 )	2024-02-22 21:33:59 -08:00
Erick Friis	a2886c4509	infra: skip codespell ambr (#17992 )	2024-02-23 01:26:55 +00:00
Erick Friis	8dda7c32ba	infra: ci failure job (#17989 )	2024-02-23 01:22:35 +00:00
Bagatur	e045655657	core[patch]: Release 0.1.26 (#17990 )	2024-02-22 17:12:51 -08:00
Reid Falconer	0534ba5a7d	langchain[patch]: return formatted SPARQL query on demand (#11263 ) - Description: Added the `return_sparql_query` feature to the `GraphSparqlQAChain` class, allowing users to get the formatted SPARQL query along with the chain's result. - Issue: NA - Dependencies: None Note: I've ensured that the PR passes linting and testing by running make format, make lint, and make test locally. I have added a test for the integration (which relies on network access) and I have added an example to the notebook showing its use.	2024-02-22 17:03:26 -08:00
Leo Diegues	b15fccbb99	community[patch]: Skip `OpenAIWhisperParser` extremely small audio chunks to avoid api error (#11450 ) Description This PR addresses a rare issue in `OpenAIWhisperParser` that causes it to crash when processing an audio file with a duration very close to the class's chunk size threshold of 20 minutes. Issue #11449 Dependencies None Tag maintainer @agola11 @eyurtsev Twitter handle leonardodiegues --------- Co-authored-by: Leonardo Diegues <leonardo.diegues@grupofolha.com.br> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-22 17:02:43 -08:00
Issac	46505742eb	Update quickstart.mdx (#17659 ) https://github.com/langchain-ai/langchain/issues/17657 Thank you for contributing to LangChain! Checklist: - [ ] PR title: Please title your PR "package: description", where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: Delete this entire template message and replace it with the following bulleted list - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Pass lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified to check that you're passing lint and testing. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-22 17:01:40 -08:00
Erick Friis	afc1def49b	infra: ci end check, consolidation (#17987 ) Consolidates CI checks into check_diffs.yml in order to properly consolidate them into a single success status	2024-02-22 16:53:10 -08:00
Jorge Villegas	f6a98032e4	docs: langchain-anthropic README updates (#17684 ) # PR Message - Description: This PR adds a README file for the Anthropic API in the `libs/partners` folder of this repository. The README includes: - A brief description of the Anthropic package - Installation & API instructions - Usage examples - Issue: [17545](https://github.com/langchain-ai/langchain/issues/17545) - Dependencies: None Additional notes: This change only affects the docs package and does not introduce any new dependencies. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-22 16:22:30 -08:00
Erick Friis	cd806400fc	infra: ci end check (#17986 )	2024-02-22 16:18:50 -08:00
mackong	9678797625	community[patch]: callback before yield for _stream/_astream (#17907 ) - Description: callback on_llm_new_token before yield chunk for _stream/_astream for some chat models, make all chat models in a consistent behaviour. - Issue: N/A - Dependencies: N/A	2024-02-22 16:15:21 -08:00
Stan Duprey	15e42f1799	docs: Added `langchainhub` install and fixed typo (#17985 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-22 16:03:40 -08:00
Chad Juliano	50ba3c68bb	community[minor]: add Kinetica LLM wrapper (#17879 ) Description: Initial pull request for Kinetica LLM wrapper Issue: N/A Dependencies: No new dependencies for unit tests. Integration tests require gpudb, typeguard, and faker Twitter handle: @chad_juliano Note: There is another pull request for Kinetica vectorstore. Ultimately we would like to make a partner package but we are starting with a community contribution.	2024-02-22 16:02:00 -08:00
Matt	6ef12fdfd2	docs: Update Azure Search vector store notebook (#17901 ) - Description: Update the Azure Search vector store notebook for the latest version of the SDK --------- Co-authored-by: Matt Gotteiner <[email protected]>	2024-02-22 15:59:43 -08:00
Averi Kitsch	c05cbf0533	docs: Update Google Provider documentation (#17970 ) Description: Clean up Google product names and fix document loader section Issue: NA Dependencies: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-22 15:58:52 -08:00
Erick Friis	ed789be8f4	docs, templates: update schema imports to core (#17885 ) - chat models, messages - documents - agentaction/finish - baseretriever,document - stroutputparser - more messages - basemessage - format_document - baseoutputparser --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-22 15:58:44 -08:00
Leonid Ganeline	971d29e718	docs: robocorpai dosctrings (#17968 ) Added missing docstrings --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-02-22 15:55:01 -08:00
Bagatur	b0cfb86c48	langchain[minor]: openai tools structured_output_chain (#17296 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-22 15:42:47 -08:00
Bagatur	b5f8cf9509	core[minor], openai[minor], langchain[patch]: BaseLanguageModel.with_structured_output #17302 ) ```python class Foo(BaseModel): bar: str structured_llm = ChatOpenAI().with_structured_output(Foo) ``` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-22 15:33:34 -08:00
Leonid Ganeline	f685d2f50c	docs: partner package list (#17978 ) Updated partner package list	2024-02-22 18:23:07 -05:00
Erick Friis	29660f8918	docs: logo (#17972 )	2024-02-22 15:20:34 -08:00
Bagatur	9b0b0032c2	community[patch]: fix lint (#17984 )	2024-02-22 15:15:27 -08:00
bear	e8633e53c4	docs: Rerun the Tongyi Qwen model to fix incorrect responses. (#17693 ) This PR updates the docs of Tongyi Qwen model. 1. fix the previously incorrect responses of the Tongyi Qwen. 2. rewrite the case with LCEL.	2024-02-22 13:20:04 -08:00
esque	78521caf51	templates: Update README.md - Fixing a typo (#17689 ) - Description: PR to fix typo in readme - Issue: typo in readme - Dependencies: no - Twitter handle: p_moolrajani	2024-02-22 13:19:37 -08:00
Christophe Bornet	4f88a5130e	langchain[patch]: Support langchain-astradb AstraDBVectorStore in self-query retriever (#17728 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-22 13:19:27 -08:00
Muhammad Abdullah Hashmi	9775de46cc	community[patch]: Remove subscript for Result type object (#17823 ) Resolved 'TypeError: 'type' object is not subscriptable' by removing subscription of Result type object Thank you for contributing to LangChain! - [x] PR title: "Langchain: Resolve type error for SQLAlchemy Result object in QuerySQLDataBaseTool class" - Description: Resolve type error for SQLAlchemy Result object in QuerySQLDataBaseTool class - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-22 13:16:14 -08:00
Mateusz Szewczyk	f6e3aa9770	docs: update IBM watsonx.ai docs (#17932 ) - Description: Update IBM watsonx.ai docs and add IBM as a provider docs - Dependencies: [ibm-watsonx-ai](https://pypi.org/project/ibm-watsonx-ai/), - Tag maintainer: : Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. ✅	2024-02-22 10:22:18 -08:00
David Loving	d068e8ea54	community[patch]: compatibility with SQLAlchemy 1.4.x (#17954 ) Description: Change type hint on `QuerySQLDataBaseTool` to be compatible with SQLAlchemy v1.4.x. Issue: Users locked to `SQLAlchemy < 2.x` are unable to import `QuerySQLDataBaseTool`. closes https://github.com/langchain-ai/langchain/issues/17819 Dependencies: None	2024-02-22 13:17:07 -05:00
Erick Friis	e237dcec91	pinecone[patch]: integration test debug (#17960 )	2024-02-22 09:11:21 -08:00
kartikTAI	9cf6661dc5	community: use NeuralDB object to initialize NeuralDBVectorStore (#17272 ) Description: This PR adds an `__init__` method to the NeuralDBVectorStore class, which takes in a NeuralDB object to instantiate the state of NeuralDBVectorStore. Issue: N/A Dependencies: N/A Twitter handle: N/A	2024-02-22 12:05:01 -05:00
hongbo.mo	a51a257575	langchain_openai[patch]: fix typos in langchain_openai (#17923 ) Just a small typo	2024-02-22 12:03:16 -05:00
Brad Erickson	ecd72d26cf	community: Bugfix - correct Ollama API path to avoid HTTP 307 (#17895 ) Sets the correct /api/generate path, without ending /, to reduce HTTP requests. Reference: https://github.com/ollama/ollama/blob/efe040f8/docs/api.md#generate-request-streaming Before: DEBUG: Starting new HTTP connection (1): localhost:11434 DEBUG: http://localhost:11434 "POST /api/generate/ HTTP/1.1" 307 0 DEBUG: http://localhost:11434 "POST /api/generate HTTP/1.1" 200 None After: DEBUG: Starting new HTTP connection (1): localhost:11434 DEBUG: http://localhost:11434 "POST /api/generate HTTP/1.1" 200 None	2024-02-22 11:59:55 -05:00
Erick Friis	a53370a060	pinecone[patch], docs: PineconeVectorStore, release 0.0.3 (#17896 )	2024-02-22 08:24:08 -08:00
Graden Rea	e5e38e89ce	partner: Add groq partner integration and chat model (#17856 ) Description: Add a Groq chat model issue: TODO Dependencies: groq Twitter handle: N/A	2024-02-22 07:36:16 -08:00
William FH	da957a22cc	Redirect the expression language guides (#17914 )	2024-02-22 00:39:57 -08:00
Leonid Ganeline	919b8a387f	docs: sorting `Examples using ...` section (#17588 ) The API Reference docs. If the class has a long list of the examples that works with this class, then the `Examples using` list is [hard to comprehend](https://api.python.langchain.com/en/latest/llms/langchain_community.llms.openai.OpenAI.html#langchain-community-llms-openai-openai). If this list is sorted it would be much easier. - sorting the `Examples using <ClassName>` list	2024-02-21 17:04:23 -08:00
Hasan	7248e98b9e	community[patch]: Return PK in similarity search Document (#17561 ) Issue: #17390 Co-authored-by: hasan <hasan@m2sys.com>	2024-02-21 17:03:50 -08:00
Raunak	1ec8199c8e	community[patch]: Added more functions in NetworkxEntityGraph class (#17624 ) - Description: 1. Added add_node(), remove_node(), has_node(), remove_edge(), has_edge() and get_neighbors() functions in NetworkxEntityGraph class. 2. Added the above functions in graph_networkx_qa.ipynb documentation.	2024-02-21 17:02:56 -08:00
William FH	42f158c128	docs: typo (#17710 )	2024-02-21 16:53:41 -08:00
Christophe Bornet	0e26b16930	docs: Fix AstraDBVectorStore docstring (#17706 )	2024-02-21 16:53:08 -08:00
Neli Hateva	66e1005898	docs: Update Links to resources in the GraphDB QA Chain documentation (#17720 ) - Description: Update Links to resources in the GraphDB QA Chain documentation - Issue: N/A - Dependencies: N/A - Twitter handle: N/A	2024-02-21 16:51:32 -08:00
Christophe Bornet	3d91be94b1	community[patch]: Add missing async_astra_db_client param to AstraDBChatMessageHistory (#17742 )	2024-02-21 16:46:42 -08:00
Xudong Sun	c524bf31f5	docs: add helpful comments to sparkllm.py (#17774 ) Adding helpful comments to sparkllm.py, help users to use ChatSparkLLM more effectively	2024-02-21 16:42:54 -08:00
Ian	3019a594b7	community[minor]: Add tidb loader support (#17788 ) This pull request support loading data from TiDB database with Langchain. A simple usage: ``` from langchain_community.document_loaders import TiDBLoader CONNECTION_STRING = "mysql+pymysql://root@127.0.0.1:4000/test" QUERY = "select id, name, description from items;" loader = TiDBLoader( connection_string=CONNECTION_STRING, query=QUERY, page_content_columns=["name", "description"], metadata_columns=["id"], ) documents = loader.load() print(documents) ```	2024-02-21 16:42:33 -08:00
Christophe Bornet	815ec74298	docs: Add docstring to AstraDBStore (#17793 )	2024-02-21 16:41:47 -08:00
Jacob Lee	375051a64e	👥 Update LangChain people data (#17900 ) 👥 Update LangChain people data --------- Co-authored-by: github-actions <github-actions@github.com>	2024-02-21 16:38:28 -08:00
Bagatur	762f49162a	docs: fix api build (#17898 )	2024-02-21 16:34:37 -08:00
ehude	9e54c227f1	community[patch]: Bug Neo4j VectorStore when having multiple indexes the sort is not working and the store that returned is random (#17396 ) Bug fix: when having multiple indexes the sort is not working and the store that returned is random. The following small fix resolves the issue.	2024-02-21 16:33:33 -08:00
Michael Feil	242981b8f0	community[minor]: infinity embedding local option (#17671 ) drop-in-replacement for sentence-transformers inference. https://github.com/langchain-ai/langchain/discussions/17670 tldr from the discussion above -> around a 4x-22x speedup over using SentenceTransformers / huggingface embeddings. For more info: https://github.com/michaelfeil/infinity (pure-python dependency) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-21 16:33:13 -08:00
Aymen EL Amri	581095b9b5	docs: fix a small typo (#17859 ) Just a small typo	2024-02-21 16:31:31 -08:00
Leonid Ganeline	ed0b7c3b72	docs: added `community` modules descriptions (#17827 ) API Reference: Several `community` modules (like [adapter](https://api.python.langchain.com/en/latest/community_api_reference.html#module-langchain_community.adapters) module) are missing descriptions. It happens when langchain was split to the core, langchain and community packages. - Copied module descriptions from other packages - Fixed several descriptions to the consistent format.	2024-02-21 16:18:36 -08:00
Christophe Bornet	5019951a5d	docs: AstraDB VectorStore docstring (#17834 )	2024-02-21 16:16:31 -08:00
Leonid Ganeline	2f2b77602e	docs: modules descriptions (#17844 ) Several `core` modules do not have descriptions, like the [agent](https://api.python.langchain.com/en/latest/core_api_reference.html#module-langchain_core.agents) module. - Added missed module descriptions. The descriptions are mostly copied from the `langchain` or `community` package modules.	2024-02-21 15:58:21 -08:00
aditya thomas	d9aa11d589	docs: Change module import path for SQLDatabase in the documentation (#17874 ) Description: This PR changes the module import path for SQLDatabase in the documentation Issue: Updates the documentation to reflect the move of integrations to langchain-community	2024-02-21 15:57:30 -08:00
Christophe Bornet	f8a3b8e83f	docs: Update langchain-astradb README with AstraDBStore (#17864 )	2024-02-21 15:51:40 -08:00
Rohit Gupta	3acd0c74fc	community[patch]: added SCANN index in default search params (#17889 ) This will enable users to add data in same collection for index type SCANN for milvus	2024-02-21 15:47:47 -08:00
Karim Assi	afc1ba0329	community[patch]: add possibility to search by vector in OpenSearchVectorSearch (#17878 ) - Description: implements the missing `similarity_search_by_vector` function for `OpenSearchVectorSearch` - Issue: N/A - Dependencies: N/A	2024-02-21 15:44:55 -08:00
Matthew Kwiatkowski	144f59b5fe	docs: Fix URL typo in tigris.ipynb (#17894 ) - Description: The URL in the tigris tutorial was htttps instead of https, leading to a bad link. - Issue: N/A - Dependencies: N/A - Twitter handle: Speucey	2024-02-21 15:39:38 -08:00
Nathan Voxland (Activeloop)	9ece134d45	docs: Improved deeplake.py init documentation (#17549 ) Description: Updated documentation for DeepLake init method. Especially the exec_option docs needed improvement, but did a general cleanup while I was looking at it. Issue: n/a Dependencies: None --------- Co-authored-by: Nathan Voxland <nathan@voxland.net>	2024-02-21 15:33:00 -08:00
Zachary Toliver	29ee0496b6	community[patch]: Allow override of 'fetch_schema_from_transport' in the GraphQL tool (#17649 ) - Description: In order to override the bool value of "fetch_schema_from_transport" in the GraphQLAPIWrapper, a "fetch_schema_from_transport" value needed to be added to the "_EXTRA_OPTIONAL_TOOLS" dictionary in load_tools in the "graphql" key. The parameter "fetch_schema_from_transport" must also be passed in to the GraphQLAPIWrapper to allow reading of the value when creating the client. Passing as an optional parameter is probably best to avoid breaking changes. This change is necessary to support GraphQL instances that do not support fetching schema, such as TigerGraph. More info here: [TigerGraph GraphQL Schema Docs](https://docs.tigergraph.com/graphql/current/schema) - Threads handle: @zacharytoliver --------- Co-authored-by: Zachary Toliver <zt10191991@hotmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-21 15:32:43 -08:00
mackong	31891092d8	community[patch]: add missing chunk parameter for _stream/_astream (#17807 ) - Description: Add missing chunk parameter for _stream/_astream for some chat models, make all chat models in a consistent behaviour. - Issue: N/A - Dependencies: N/A	2024-02-21 15:32:28 -08:00
ccurme	1b0802babe	core: fix .bind when used with RunnableLambda async methods (#17739 ) Description: Here is a minimal example to illustrate behavior: ```python from langchain_core.runnables import RunnableLambda def my_function(args, kwargs): return 3 + kwargs.get("n", 0) runnable = RunnableLambda(my_function).bind(n=1) assert 4 == runnable.invoke({}) assert [4] == list(runnable.stream({})) assert 4 == await runnable.ainvoke({}) assert [4] == [item async for item in runnable.astream({})] ``` Here, `runnable.invoke({})` and `runnable.stream({})` work fine, but `runnable.ainvoke({})` raises ``` TypeError: RunnableLambda._ainvoke.<locals>.func() got an unexpected keyword argument 'n' ``` and similarly for `runnable.astream({})`: ``` TypeError: RunnableLambda._atransform.<locals>.func() got an unexpected keyword argument 'n' ``` Here we assume that this behavior is undesired and attempt to fix it. Issue:* https://github.com/langchain-ai/langchain/issues/17241, https://github.com/langchain-ai/langchain/discussions/16446	2024-02-21 15:31:52 -08:00
Gianluca Giudice	f541545c96	Docs: Fix typo (#17733 ) - Description: fix doc typo	2024-02-21 15:31:43 -08:00
qqubb	41726dfa27	docs: minor grammatical correction. (#17724 ) - Description: a minor grammatical correction.	2024-02-21 15:31:37 -08:00
volodymyr-memsql	0a9a519a39	community[patch]: Added add_images method to SingleStoreDB vector store (#17871 ) In this pull request, we introduce the add_images method to the SingleStoreDB vector store class, expanding its capabilities to handle multi-modal embeddings seamlessly. This method facilitates the incorporation of image data into the vector store by associating each image's URI with corresponding document content, metadata, and either pre-generated embeddings or embeddings computed using the embed_image method of the provided embedding object. the change includes integration tests, validating the behavior of the add_images. Additionally, we provide a notebook showcasing the usage of this new method. --------- Co-authored-by: Volodymyr Tkachuk <vtkachuk-ua@singlestore.com>	2024-02-21 15:16:32 -08:00
Guangdong Liu	7735721929	docs: update sparkllm intro doc (#17848 ) Description: update sparkllm intro doc. Issue: None Dependencies: None Twitter handle: None	2024-02-21 15:02:20 -08:00
Leonid Ganeline	6f5b7b55bd	docs: API Reference builder bug fix (#17890 ) Issue in the API Reference: If the `Classes` of `Functions` section is empty, it still shown in API Reference. Here is an [example](https://api.python.langchain.com/en/latest/core_api_reference.html#module-langchain_core.agents) where `Functions` table is empty but still presented. It happens only if this section has only the "private" members (with names started with '_'). Those members are not shown but the whole member section (empty) is shown.	2024-02-21 15:59:35 -05:00
Shashank	8381f859b4	community[patch]: Graceful handling of redis errors in RedisCache and AsyncRedisCache (#17171 ) - Description: The existing `RedisCache` implementation lacks proper handling for redis client failures, such as `ConnectionRefusedError`, leading to subsequent failures in pipeline components like LLM calls. This pull request aims to improve error handling for redis client issues, ensuring a more robust and graceful handling of such errors. - Issue: Fixes #16866 - Dependencies: No new dependency - Twitter handle: N/A Co-authored-by: snsten <> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-21 12:15:19 -05:00
Christophe Bornet	e6311d953d	community[patch]: Add AstraDBLoader docstring (#17873 )	2024-02-21 11:41:34 -05:00
nbyrneKX	c1bb5fd498	community[patch]: typo in doc-string for kdbai vectorstore (#17811 ) community[patch]: typo in doc-string for kdbai vectorstore (#17811)	2024-02-21 10:35:11 -05:00
Jacob Lee	5395c254d5	👥 Update LangChain people data (#17743 ) 👥 Update LangChain people data --------- Co-authored-by: github-actions <github-actions@github.com>	2024-02-20 18:30:11 -08:00
Erick Friis	a206d3cf69	docs: remove stale redirects (#17831 ) Removes /platform redirects as well as any redirects whose source hasn't been touched in over 6 months	2024-02-20 17:11:43 -08:00
Christophe Bornet	f59ddcab74	partners/astradb: Use single file instead of module for AstraDBVectorStore (#17644 )	2024-02-20 16:58:56 -08:00
Savvas Mantzouranidis	691ff67096	partners/openai: fix depracation errors of pydantic's .dict() function (reopen #16629 ) (#17404 ) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-20 16:57:34 -08:00
Christophe Bornet	bebe401b1a	astradb[patch]: Add AstraDBStore to langchain-astradb package (#17789 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-20 16:54:35 -08:00
Bagatur	4e28888d45	core[patch]: Release 0.1.25 (#17833 )	2024-02-20 16:43:28 -08:00
Erick Friis	f154cd64fe	astradb[patch]: relaxed httpx version constraint (#17826 ) relock to newest sdk	2024-02-20 15:45:25 -08:00
Nuno Campos	223e5eff14	Add JSON representation of runnable graph to serialized representation (#17745 ) Sent to LangSmith Thank you for contributing to LangChain! Checklist: - [ ] PR title: Please title your PR "package: description", where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: Delete this entire template message and replace it with the following bulleted list - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Pass lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified to check that you're passing lint and testing. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-20 14:51:09 -08:00
Erick Friis	6e854ae371	docs: fix api docs search (#17820 )	2024-02-20 13:33:20 -08:00
Guangdong Liu	47b1b7092d	community[minor]: Add SparkLLM to community (#17702 )	2024-02-20 11:23:47 -08:00
Guangdong Liu	3ba1cb8650	community[minor]: Add SparkLLM Text Embedding Model and SparkLLM introduction (#17573 )	2024-02-20 11:22:27 -08:00
Christophe Bornet	33555e5cbc	docs: Add typehints in both signature and description of API docs (#17815 ) This way we can document APIs in methods signature only where they are checked by the typing system and we get them also in the param description without having to duplicate in the docstrings (where they are unchecked). Twitter: @cbornet_	2024-02-20 14:21:08 -05:00
Virat Singh	92e52e89ca	community: Add PolygonTickerNews Tool (#17808 ) Description: In this PR, I am adding a PolygonTickerNews Tool, which can be used to get the latest news for a given ticker / stock. Twitter handle: [@virattt](https://twitter.com/virattt)	2024-02-20 10:15:29 -08:00
Eugene Yurtsev	441160d6b3	Docs: Update contributing documentation (#17557 ) This PR adds more details about how to contribute to documentation.	2024-02-20 12:28:15 -05:00
Christophe Bornet	b13e52b6ac	community[patch]: Fix AstraDBCache docstrings (#17802 )	2024-02-20 11:39:30 -05:00
Eugene Yurtsev	865cabff05	Docs: Add custom chat model documenation (#17595 ) This PR adds documentation about how to implement a custom chat model.	2024-02-19 22:03:49 -05:00
Nuno Campos	07ee41d284	Cache calls to create_model for get_input_schema and get_output_schema (#17755 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-02-19 13:26:42 -08:00
Bagatur	5ed16adbde	experimental[patch]: Release 0.0.52 (#17763 )	2024-02-19 13:12:22 -08:00
Bagatur	da7bca2178	langchain[patch]: bump community to 0.0.21 (#17754 )	2024-02-19 12:58:32 -08:00
Bagatur	441448372d	langchain[patch]: Release 0.1.8 (#17751 )	2024-02-19 11:27:37 -08:00
Bagatur	a9d3c100a2	infra: PR template nits (#17752 )	2024-02-19 11:22:31 -08:00
Bagatur	ad285ca15c	community[patch]: Release 0.0.21 (#17750 )	2024-02-19 11:13:33 -08:00
Karim Lalani	ea61302f71	community[patch]: bug fix - add empty metadata when metadata not provided (#17669 ) Code fix to include empty medata dictionary to aadd_texts if metadata is not provided.	2024-02-19 10:54:52 -08:00
CogniJT	919ebcc596	community[minor]: CogniSwitch Agent Toolkit for LangChain (#17312 ) Description: CogniSwitch focusses on making GenAI usage more reliable. It abstracts out the complexity & decision making required for tuning processing, storage & retrieval. Using simple APIs documents / URLs can be processed into a Knowledge Graph that can then be used to answer questions. Dependencies: No dependencies. Just network calls & API key required Tag maintainer: @hwchase17 Twitter handle: https://github.com/CogniSwitch Documentation: Please check `docs/docs/integrations/toolkits/cogniswitch.ipynb` Tests: The usual tool & toolkits tests using `test_imports.py` PR has passed linting and testing before this submission. --------- Co-authored-by: Saicharan Sridhara <145636106+saiCogniswitch@users.noreply.github.com>	2024-02-19 10:54:13 -08:00
Christophe Bornet	6275d8b1bf	docs: Fix AstraDBChatMessageHistory docstrings (#17740 )	2024-02-19 10:47:38 -08:00
Pranav Agarwal	86ae48b781	experimental[minor]: Amazon Personalize support (#17436 ) ## Amazon Personalize support on Langchain This PR is a successor to this PR - https://github.com/langchain-ai/langchain/pull/13216 This PR introduces an integration with [Amazon Personalize](https://aws.amazon.com/personalize/) to help you to retrieve recommendations and use them in your natural language applications. This integration provides two new components: 1. An `AmazonPersonalize` client, that provides a wrapper around the Amazon Personalize API. 2. An `AmazonPersonalizeChain`, that provides a chain to pull in recommendations using the client, and then generating the response in natural language. We have added this to langchain_experimental since there was feedback from the previous PR about having this support in experimental rather than the core or community extensions. Here is some sample code to explain the usage. ```python from langchain_experimental.recommenders import AmazonPersonalize from langchain_experimental.recommenders import AmazonPersonalizeChain from langchain.llms.bedrock import Bedrock recommender_arn = "<insert_arn>" client=AmazonPersonalize( credentials_profile_name="default", region_name="us-west-2", recommender_arn=recommender_arn ) bedrock_llm = Bedrock( model_id="anthropic.claude-v2", region_name="us-west-2" ) chain = AmazonPersonalizeChain.from_llm( llm=bedrock_llm, client=client ) response = chain({'user_id': '1'}) ``` Reviewer: @3coins	2024-02-19 10:36:37 -08:00
Aymeric Roucher	0d294760e7	Community: Fuse HuggingFace Endpoint-related classes into one (#17254 ) ## Description Fuse HuggingFace Endpoint-related classes into one: - [HuggingFaceHub](`5ceaf784f3/libs/community/langchain_community/llms/huggingface_hub.py`) - [HuggingFaceTextGenInference](`5ceaf784f3/libs/community/langchain_community/llms/huggingface_text_gen_inference.py`) - and [HuggingFaceEndpoint](`5ceaf784f3/libs/community/langchain_community/llms/huggingface_endpoint.py`) Are fused into - HuggingFaceEndpoint ## Issue The deduplication of classes was creating a lack of clarity, and additional effort to develop classes leads to issues like [this hack](`5ceaf784f3/libs/community/langchain_community/llms/huggingface_endpoint.py (L159)`). ## Dependancies None, this removes dependancies. ## Twitter handle If you want to post about this: @AymericRoucher --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-19 10:33:15 -08:00
Bagatur	8009be862e	core[patch]: Release 0.1.24 (#17744 )	2024-02-19 10:27:26 -08:00
Raghav Dixit	6c18f73ca5	community[patch]: LanceDB integration improvements/fixes (#16173 ) Hi, I'm from the LanceDB team. Improves LanceDB integration by making it easier to use - now you aren't required to create tables manually and pass them in the constructor, although that is still backward compatible. Bug fix - pandas was being used even though it's not a dependency for LanceDB or langchain PS - this issue was raised a few months ago but lost traction. It is a feature improvement for our users kindly review this , Thanks !	2024-02-19 10:22:02 -08:00
Christophe Bornet	e92e96193f	community[minor]: Add async methods to the AstraDB BaseStore (#16872 ) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-02-19 10:11:49 -08:00
Mohammad Mohtashim	43dc5d3416	community[patch]: OpenLLM Client Fixes + Added Timeout Parameter (#17478 ) - OpenLLM was using outdated method to get the final text output from openllm client invocation which was raising the error. Therefore corrected that. - OpenLLM `_identifying_params` was getting the openllm's client configuration using outdated attributes which was raising error. - Updated the docstring for OpenLLM. - Added timeout parameter to be passed to underlying openllm client.	2024-02-19 10:09:11 -08:00
Leonid Ganeline	1d2aa19aee	docs: Fix bug that caused the word "Beta" to appear twice in doc-strings (#17704 ) The current issue: Several beta descriptions in the API Reference are duplicated. For example: `[Beta] Get a context value.[Beta] Get a context value.` for the [ContextGet class](https://api.python.langchain.com/en/latest/core_api_reference.html#module-langchain_core.beta) description. NOTE: I've tested it only with a new ut! I cannot build API Reference locally :( This PR related to #17615	2024-02-18 21:38:37 -05:00
Guangdong Liu	73edf17b4e	community[minor]: Add Apache Doris as vector store (#17527 ) --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-18 12:05:58 -07:00
Bagatur	a058c8812d	community[patch]: add VoyageEmbeddings truncation (#17638 )	2024-02-18 10:21:21 -07:00
Eugene Yurtsev	d7c26c89b2	ci: rename makefile -> Makefile in docker (#17648 ) Minor file rename.	2024-02-16 16:59:18 -05:00
Mohammad Mohtashim	8d4547ae97	[Langchain_community]: Corrected the imports to make them compatible with Sqlachemy <2.0 (#17653 ) - Small Change in Imports in sql_database module to make it work with Sqlachemy <2.0 - This was identified in the following issue: #17616	2024-02-16 16:59:08 -05:00
Christophe Bornet	75465a2a3c	partners/astradb: Add dotenv to langchain-astradb integration tests (#17629 )	2024-02-16 11:48:30 -05:00
Stefano Lottini	2a239710a0	docs: update astradb imports to in docs/sample notebook to import from partner package (#17627 ) This PR replaces the imports of the Astra DB vector store with the newly-released partner package, in compliance with the deprecation notice now attached to the community "legacy" store.	2024-02-16 11:30:13 -05:00
Christophe Bornet	19ebc7418e	community: Use _AstraDBCollectionEnvironment in AstraDB VectorStore (community) (#17635 ) Another PR will be done for the langchain-astradb package. Note: for future PRs, devs will be done in the partner package only. This one is just to align with the rest of the components in the community package and it fixes a bunch of issues.	2024-02-16 11:28:16 -05:00
ccurme	0b33abc8b1	docs: update documentation for RunnableWithMessageHistory (#17602 ) - Description: Update documentation for RunnableWithMessageHistory - Issue: https://github.com/langchain-ai/langchain/issues/16642 I don't have access to an Anthropic API key so I updated things to use OpenAI. Let me know if you'd prefer another provider.	2024-02-16 11:25:49 -05:00
Mateusz Szewczyk	e25b722ea9	watsonx[patch]: Invoke callback prior to yielding token when streaming (#17625 ) Description: Invoke callback prior to yielding token in stream method for watsonx. Issue: https://github.com/langchain-ai/langchain/issues/16913	2024-02-16 09:45:12 -05:00
Nejc Habjan	b4fa847a90	community[minor]: add exclude parameter to DirectoryLoader (#17316 ) - Description: adds an `exclude` parameter to the DirectoryLoader class, based on similar behavior in GenericLoader - Issue: discussed in https://github.com/langchain-ai/langchain/discussions/9059 and I think in some other issues that I cannot find at the moment 🙇 - Dependencies: None - Twitter handle: don't have one sorry! Just https://github/nejch --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-16 09:42:42 -05:00
Bagatur	8f14234afb	infra: ignore flakey lua test (#17618 )	2024-02-16 05:02:58 -07:00
Krista Pratico	bf8e3c6dd1	community[patch]: add fixes for AzureSearch after update to stable azure-search-documents library (#17599 ) - Description: Addresses the bugs described in linked issue where an import was erroneously removed and the rename of a keyword argument was missed when migrating from beta --> stable of the azure-search-documents package - Issue: https://github.com/langchain-ai/langchain/issues/17598 - Dependencies: N/A - Twitter handle: N/A	2024-02-15 22:23:52 -08:00
William FH	64743dea14	core[patch], community[patch], langchain[patch], experimental[patch], robocorp[patch]: bump LangSmith 0.1.* (#17567 )	2024-02-15 23:17:59 -07:00
morgana	9d7ca7df6e	community[patch]: update copy of metadata in rockset vectorstore integration (#17612 ) - Description: This fixes an issue with working with RecordManager. RecordManager was generating new hashes on documents because `add_texts` was modifying the metadata directly. Additionally moved some tests to unit tests since that was a more appropriate home. - Issue: N/A - Dependencies: N/A - Twitter handle: `@_morgan_adams_`	2024-02-15 23:13:40 -07:00
Erick Friis	c8d96f30bd	exa[patch]: fix lint (#17610 )	2024-02-15 20:45:16 -08:00
Erick Friis	8f5c70769d	astradb[patch]: fix core dep 3 (#17617 )	2024-02-15 20:42:30 -08:00
Kartheek Yakkala	44db4412c0	ci[minor] : Added graphdb in docker compose for integration tests (#17510 ) This PR adds graphdb to the docker compose so it can be used in integration tests. Co-authored-by: KARTHEEK YAKKALA <kartheekyakkala.se@gmail.com>	2024-02-15 23:03:22 -05:00
Leonid Ganeline	0835ebad70	docs: Fix bug that caused the word "Deprecated" to appear twice in doc-strings (#17615 ) The current issue: Most of the deprecation descriptions are duplicated. For example: `[Deprecated] Chat Agent.[Deprecated] Chat Agent.` for the [ChatAgent class](https://api.python.langchain.com/en/latest/langchain_api_reference.html#classes) description. NOTE: I've tested it only with new ut! I cannot build API Reference locally :(	2024-02-15 22:52:26 -05:00
Kevin	88af4fd514	docs: quickstart example returns 404 (#17609 ) Description: Appears a legacy URL in the quickstart returns a 404. Updated to use Langchain homepage and ran through tutorial to confirm results.	2024-02-15 16:50:41 -08:00
Erick Friis	aa31025dd7	astradb[patch]: fix core dep 2 (#17608 )	2024-02-15 16:33:02 -08:00
Erick Friis	cc562e7c58	astradb[patch]: fix core dep (#17606 )	2024-02-15 16:09:38 -08:00
Stefano Lottini	5240ecab99	astradb: bootstrapping Astra DB as Partner Package (#16875 ) Description: This PR introduces a new "Astra DB" Partner Package. So far only the vector store class is _duplicated_ there, all others following once this is validated and established. Along with the move to separate package, incidentally, the class name will change `AstraDB` => `AstraDBVectorStore`. The strategy has been to duplicate the module (with prospected removal from community at LangChain 0.2). Until then, the code will be kept in sync with minimal, known differences (there is a makefile target to automate drift control. Out of convenience with this check, the community package has a class `AstraDBVectorStore` aliased to `AstraDB` at the end of the module). With this PR several bugfixes and improvement come to the vector store, as well as a reshuffling of the doc pages/notebooks (Astra and Cassandra) to align with the move to a separate package. Dependencies: A brand new pyproject.toml in the new package, no changes otherwise. Twitter handle: `@rsprrs` --------- Co-authored-by: Christophe Bornet <cbornet@hotmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-15 15:50:59 -08:00
Erick Friis	f6f0ca1bae	docs: ai21 sidebars (#17600 )	2024-02-15 14:43:48 -08:00
Erick Friis	6cc6faa00e	ai21: init package (#17592 ) Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: etang <etang@ai21.com> Co-authored-by: asafgardin <147075902+asafgardin@users.noreply.github.com>	2024-02-15 12:25:05 -08:00
Moshe Berchansky	20a56fe0a2	community[minor]: Add QuantizedEmbedders (#17391 ) Description: * adding Quantized embedders using optimum-intel and intel-extension-for-pytorch. * added mdx documentation and example notebooks * added embedding import testing. Dependencies: optimum = {extras = ["neural-compressor"], version = "^1.14.0", optional = true} intel_extension_for_pytorch = {version = "^2.2.0", optional = true} Dependencies have been added to pyproject.toml for the community lib. Twitter handle: @peter_izsak --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-15 11:01:24 -08:00
Amir Karbasi	bccc9241ea	community[patch]: Resolve KuzuQAChain API Changes (#16885 ) - Description: Updates to the Kuzu API had broken this functionality. These updates resolve those issues and add a new test to demonstrate the updates. - Issue: #11874 - Dependencies: No new dependencies - Twitter handle: @amirk08 Test results: ``` tests/integration_tests/graphs/test_kuzu.py::TestKuzu::test_query_no_params PASSED [ 33%] tests/integration_tests/graphs/test_kuzu.py::TestKuzu::test_query_params PASSED [ 66%] tests/integration_tests/graphs/test_kuzu.py::TestKuzu::test_refresh_schema PASSED [100%] =================================================== slowest 5 durations =================================================== 0.53s call tests/integration_tests/graphs/test_kuzu.py::TestKuzu::test_refresh_schema 0.34s call tests/integration_tests/graphs/test_kuzu.py::TestKuzu::test_query_no_params 0.28s call tests/integration_tests/graphs/test_kuzu.py::TestKuzu::test_query_params 0.03s teardown tests/integration_tests/graphs/test_kuzu.py::TestKuzu::test_refresh_schema 0.02s teardown tests/integration_tests/graphs/test_kuzu.py::TestKuzu::test_query_params ==================================================== 3 passed in 1.27s ==================================================== ```	2024-02-15 10:18:37 -08:00
Rafail Giavrimis	a84a3add25	Community[patch]: Adjusted import to be compatible with SQLAlchemy<2 (#17520 ) - Description: Adjusts an import to directly import `Result` from `sqlalchemy.engine`. - Issue: #17519 - Dependencies: N/A - Twitter handle: @grafail	2024-02-15 11:12:13 -05:00
Zachary Toliver	6746adf363	community[patch]: pass bool value for fetch_schema_from_transport in GraphQLAPIWrapper (#17552 ) - Description: Allow a bool value to be passed to fetch_schema_from_transport since not all GraphQL instances support this feature, such as TigerGraph. - Threads: @zacharytoliver --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-15 09:54:04 -05:00
Christophe Bornet	789cd5198d	community[patch]: Use astrapy built-in pagination prefetch in AstraDBLoader (#17569 )	2024-02-15 09:52:56 -05:00
Christophe Bornet	387cacb881	community[minor]: Add async methods to AstraDBChatMessageHistory (#17572 )	2024-02-15 09:48:42 -05:00
Christophe Bornet	ff1f985a2a	community: Fix some mypy types in cassandra doc loader (#17570 ) Thank you!	2024-02-15 09:45:22 -05:00
Mo Latif	f3e4a0e27f	langchain[patch]: Update Chain prep_inputs docstring (#17575 ) Description: @eyurtsev Following up on #16644 to fix the docstring, because `prep_inputs` is not longer doing any validation.	2024-02-15 09:44:35 -05:00
William FH	53b8c86309	fix dataset link (#17565 )	2024-02-14 23:18:07 -08:00
William FH	fc1617c44f	Update contact link (#17563 )	2024-02-14 22:37:32 -08:00
Eugene Yurtsev	79119b4345	Docs: Add repository structure to contributors guide (#17553 ) Adding another high level overview page to the contributors guide	2024-02-14 23:20:45 -05:00
Christophe Bornet	ca2d4078f3	community: Add async methods to AstraDBCache (#17415 ) Adds async methods to AstraDBCache	2024-02-14 23:10:08 -05:00
Eugene Yurtsev	e438fe6be9	Docs: Contributing changes (#17551 ) A few minor changes for contribution: 1) Updating link to say "Contributing" rather than "Developer's guide" 2) Minor changes after going through the contributing documentation page.	2024-02-14 17:55:09 -05:00
Jan Cap	7ae3ce60d2	community[patch]: Fix pwd import that is not available on windows (#17532 ) - Description: Resolving problem in `langchain_community\document_loaders\pebblo.py` with `import pwd`. `pwd` is not available on windows. import moved to try catch block - Issue: #17514	2024-02-14 13:45:10 -08:00
nvpranak	91bcc9c5c9	community[minor]: Nemo embeddings(#16206 ) This PR is adding support for NVIDIA NeMo embeddings issue #16095. --------- Co-authored-by: Praveen Nakshatrala <pnakshatrala@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-14 13:25:42 -08:00
Mattt394	7c6009b76f	experimental[patch]: Fixed typos in SmartLLMChain ideation and critique prompts (#11507 ) Noticed and fixed a few typos in the SmartLLMChain default ideation and critique prompts --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-14 13:20:10 -08:00
Erick Friis	86d3e42853	core[minor]: add name to basemessage (#17539 ) Adds an optional name param to our base message to support passing names into LLMs. OpenAI supports having a name on anything except tool message now (system, ai, user/human).	2024-02-14 12:21:59 -08:00
Mateusz Szewczyk	916332ef5b	ibm: added partners package `langchain_ibm`, added llm (#16512 ) - Description: Added `langchain_ibm` as an langchain partners package of IBM [watsonx.ai](https://www.ibm.com/products/watsonx-ai) LLM provider (`WatsonxLLM`) - Dependencies: [ibm-watsonx-ai](https://pypi.org/project/ibm-watsonx-ai/), - Tag maintainer: : --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-14 12:12:19 -08:00
Shawn	f6d3a3546f	community[patch]: document_loaders: modified athena key logic to handle s3 uris without a prefix (#17526 ) https://github.com/langchain-ai/langchain/issues/17525 ### Example Code ```python from langchain_community.document_loaders.athena import AthenaLoader database_name = "database" s3_output_path = "s3://bucket-no-prefix" query="""SELECT CAST(extract(hour FROM current_timestamp) AS INTEGER) AS current_hour, CAST(extract(minute FROM current_timestamp) AS INTEGER) AS current_minute, CAST(extract(second FROM current_timestamp) AS INTEGER) AS current_second; """ profile_name = "AdministratorAccess" loader = AthenaLoader( query=query, database=database_name, s3_output_uri=s3_output_path, profile_name=profile_name, ) documents = loader.load() print(documents) ``` ### Error Message and Stack Trace (if applicable) NoSuchKey: An error occurred (NoSuchKey) when calling the GetObject operation: The specified key does not exist ### Description Athena Loader errors when result s3 bucket uri has no prefix. The Loader instance call results in a "NoSuchKey: An error occurred (NoSuchKey) when calling the GetObject operation: The specified key does not exist." error. If s3_output_path contains a prefix like: ```python s3_output_path = "s3://bucket-with-prefix/prefix" ``` Execution works without an error. ## Suggested solution Modify: ```python key = "/".join(tokens[1:]) + "/" + query_execution_id + ".csv" ``` to ```python key = "/".join(tokens[1:]) + ("/" if tokens[1:] else "") + query_execution_id + ".csv" ``` `9e8a3fc4ff/libs/community/langchain_community/document_loaders/athena.py (L128)` ### System Info System Information ------------------ > OS: Darwin > OS Version: Darwin Kernel Version 22.6.0: Fri Sep 15 13:41:30 PDT 2023; root:xnu-8796.141.3.700.8~1/RELEASE_ARM64_T8103 > Python Version: 3.9.9 (main, Jan 9 2023, 11:42:03) [Clang 14.0.0 (clang-1400.0.29.102)] Package Information ------------------- > langchain_core: 0.1.23 > langchain: 0.1.7 > langchain_community: 0.0.20 > langsmith: 0.0.87 > langchain_openai: 0.0.6 > langchainhub: 0.1.14 Packages not installed (Not Necessarily a Problem) -------------------------------------------------- The following packages were not found: > langgraph > langserve --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-14 11:48:31 -08:00
wulixuan	c776cfc599	community[minor]: integrate with model Yuan2.0 (#15411 ) 1. integrate with [`Yuan2.0`](https://github.com/IEIT-Yuan/Yuan-2.0/blob/main/README-EN.md) 2. update `langchain.llms` 3. add a new doc for [Yuan2.0 integration](docs/docs/integrations/llms/yuan2.ipynb) --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-14 11:46:20 -08:00
Philippe PRADOS	d07db457fc	community[patch]: Fix SQLAlchemyMd5Cache race condition (#16279 ) If the SQLAlchemyMd5Cache is shared among multiple processes, it is possible to encounter a race condition during the cache update. Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-14 11:45:28 -08:00
Alex Peplowski	70c296ae96	community[patch]: Expose Anthropic Retry Logic (#17069 ) Description: Expose Anthropic's retry logic, so that `max_retries` can be configured via langchain. Anthropic's retry logic is implemented in their Python SDK here: https://github.com/anthropics/anthropic-sdk-python?tab=readme-ov-file#retries --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-14 11:44:28 -08:00
DanisJiang	de9a6cdf16	experimental[patch]: Enhance protection against arbitrary code execution in PALChain (#17091 ) - Description: Block some ways to trigger arbitrary code execution bug in PALChain. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-14 11:44:07 -08:00
Lyndsey	8562a1e7d4	community[patch]: support query filters for NotionDBLoader (#17217 ) - Description: Support filtering databases in the use case where devs do not want to query ALL entries within a DB, - Issue: N/A, - Dependencies: N/A, - Twitter handle: I don't have Twitter but feel free to tag my Github! --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-14 11:43:41 -08:00
volodymyr-memsql	e36bc379f2	community[patch]: Add vector index support to SingleStoreDB VectorStore (#17308 ) This pull request introduces support for various Approximate Nearest Neighbor (ANN) vector index algorithms in the VectorStore class, starting from version 8.5 of SingleStore DB. Leveraging this enhancement enables users to harness the power of vector indexing, significantly boosting search speed, particularly when handling large sets of vectors. --------- Co-authored-by: Volodymyr Tkachuk <vtkachuk-ua@singlestore.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-14 11:43:12 -08:00
Kate Silverstein	0bc4a9b3fc	community[minor]: Adds Llamafile as an LLM (#17431 ) * Description: Adds a simple LLM implementation for interacting with [llamafile](https://github.com/Mozilla-Ocho/llamafile)-based models. * Dependencies: N/A * Issue: N/A Detail [llamafile](https://github.com/Mozilla-Ocho/llamafile) lets you run LLMs locally from a single file on most computers without installing any dependencies. To use the llamafile LLM implementation, the user needs to: 1. Download a llamafile e.g. https://huggingface.co/jartine/TinyLlama-1.1B-Chat-v1.0-GGUF/resolve/main/TinyLlama-1.1B-Chat-v1.0.Q5_K_M.llamafile?download=true 2. Make the file executable. 3. Run the llamafile in 'server mode'. (All llamafiles come packaged with a lightweight server; by default, the server listens at `http://localhost:8080`.) ```bash wget https://url/of/model.llamafile chmod +x model.llamafile ./model.llamafile --server --nobrowser ``` Now, the user can invoke the LLM via the LangChain client: ```python from langchain_community.llms.llamafile import Llamafile llm = Llamafile() llm.invoke("Tell me a joke.") ```	2024-02-14 11:15:24 -08:00
Rakib Hosen	5ce1827d31	community[patch]: fix import in language parser (#17538 ) - Description: Resolving import error in language_parser.py during "from langchain.langchain.text_splitter import Language - Issue: the issue #17536 - Dependencies: NO - Twitter handle: @iRakibHosen --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-14 11:11:23 -08:00
Raunak	685d62b032	community[patch]: Added functions in NetworkxEntityGraph class (#17535 ) - Description: 1. Added _clear_edges()_ and _get_number_of_nodes()_ functions in NetworkxEntityGraph class. 2. Added the above two function in graph_networkx_qa.ipynb documentation.	2024-02-14 11:02:24 -08:00
Erick Friis	bfaa8c3048	anthropic[patch]: de-beta anthropic messages, release 0.0.2 (#17540 )	2024-02-14 10:31:45 -08:00
Erick Friis	a99c667c22	partners: version constraints (#17492 ) Core should be ^0.1 by default Careful about 0.x.y and 0.0.z packages	2024-02-14 08:57:46 -08:00
Erick Friis	d7418acbe1	nomic[patch]: release 0.0.2, dimensionality (#17534 ) - nomic[patch]: release 0.0.2 - x	2024-02-14 08:38:07 -08:00
Bagatur	9e8a3fc4ff	infra: rm @ from pr template (#17507 )	2024-02-13 21:29:22 -08:00
shibuiwilliam	c502736841	infra: add test for ensemble retriever to ensure multiple retrievers (#8401 ) Add tests to ensemble retriever to ensure it works with combination of multiple retrievers --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-13 21:22:03 -08:00
Qihui Xie	5738143d4b	add mongodb_store (#13801 ) # Add MongoDB storage - Description: Add MongoDB Storage as an option for large doc store. Example usage: ```Python # Instantiate the MongodbStore with a MongoDB connection from langchain.storage import MongodbStore mongo_conn_str = "mongodb://localhost:27017/" mongodb_store = MongodbStore(mongo_conn_str, db_name="test-db", collection_name="test-collection") # Set values for keys doc1 = Document(page_content='test1') doc2 = Document(page_content='test2') mongodb_store.mset([("key1", doc1), ("key2", doc2)]) # Get values for keys values = mongodb_store.mget(["key1", "key2"]) # [doc1, doc2] # Iterate over keys for key in mongodb_store.yield_keys(): print(key) # Delete keys mongodb_store.mdelete(["key1", "key2"]) ``` - Dependencies: Use `mongomock` for integration test. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-13 22:33:22 -05:00
Mo Latif	50b48a8e6a	langchain[patch]: Invoke chain prep_inputs and prep_outputs inside try block to catch validation errors (#16644 ) - Description: Callback manager can't catch chain input or output validation errors because `prepare_input` and `prepare_output` are not part of the try/raise logic, this PR fixes that logic. - Issue: #15954	2024-02-13 22:23:11 -05:00
Christophe Bornet	a8f530bc4d	Add async methods to CacheBackedEmbeddings (#16873 ) Adds async methods to CacheBackedEmbeddings	2024-02-13 22:16:27 -05:00
Bagatur	dd68a8716e	infra: update rtd yaml (#17502 )	2024-02-13 18:16:44 -08:00
Bagatur	1aeb52caac	infra: merge in master during api docs build (#17494 )	2024-02-13 18:08:07 -08:00
Bagatur	54373fb384	infra: add api docs build GHA (#17493 )	2024-02-13 16:46:58 -08:00
Bagatur	50de7a31f0	langchain[patch]: structured output chain nits (#17291 )	2024-02-13 16:45:29 -08:00
Nat Noordanus	8a3b74fe1f	community[patch]: Fix pydantic ForwardRef error in BedrockBase (#17416 ) - Description: Fixes a type annotation issue in the definition of BedrockBase. This issue was that the annotation for the `config` attribute includes a ForwardRef to `botocore.client.Config` which is only imported when `TYPE_CHECKING`. This can cause pydantic to raise an error like `pydantic.errors.ConfigError: field "config" not yet prepared so type is still a ForwardRef, ...`. - Issue: N/A - Dependencies: N/A - Twitter handle: `@__nat_n__`	2024-02-13 16:15:55 -08:00
Bagatur	2c076bebc9	docs: fix self query redirect (#17490 )	2024-02-13 15:44:56 -08:00
Ashley Xu	f746a73e26	Add the BQ job usage tracking from LangChain (#17123 ) - Description: Add the BQ job usage tracking from LangChain --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-13 14:47:57 -08:00
Bagatur	5dca107621	docs: update providers (#17488 )	2024-02-13 14:00:15 -08:00
JongRok BAEK	8d6cc90fc5	langchain.core : Use shallow copy for schema manipulation in JsonOutputParser.get_format_instructions (#17162 ) - Description : Fix: Use shallow copy for schema manipulation in get_format_instructions Prevents side effects on the original schema object by using a dictionary comprehension for a safer and more controlled manipulation of schema key-value pairs, enhancing code reliability. - Issue: #17161 - Dependencies: None - Twitter handle: None	2024-02-13 13:30:53 -08:00
Rave Harpaz	90f55e6bd1	Documentation/add update documentation for oci (#17473 ) Thank you for contributing to LangChain! Checklist: - PR title: docs: add & update docs for Oracle Cloud Infrastructure (OCI) integrations - Description: adding and updating documentation for two integrations - OCI Generative AI & OCI Data Science (1) adding integration page for OCI Generative AI embeddings (@baskaryan request, docs/docs/integrations/text_embedding/oci_generative_ai.ipynb) (2) updating integration page for OCI Generative AI llms (docs/docs/integrations/llms/oci_generative_ai.ipynb) (3) adding platform documentation for OCI (@baskaryan request, docs/docs/integrations/platforms/oci.mdx). this combines the integrations of OCI Generative AI & OCI Data Science (4) if possible, requesting to be added to 'Featured Community Providers' so supplying a modified docs/docs/integrations/platforms/index.mdx to reflect the addition - Issue: none - Dependencies: no new dependencies - Twitter handle: --------- Co-authored-by: MING KANG <ming.kang@oracle.com>	2024-02-13 13:26:23 -08:00
Bagatur	b5d3416563	experimental[patch]: Release 0.0.51 (#17484 )	2024-02-13 13:14:38 -08:00
Bagatur	de7c4b277c	langchain[patch]: Release 0.1.7 (#17482 )	2024-02-13 13:13:04 -08:00
Bagatur	39342d98d6	community[patch]: Release 0.0.20 (#17480 )	2024-02-13 13:01:51 -08:00
Bagatur	89b765ec27	core[patch]: Release 0.1.23 (#17479 )	2024-02-13 12:55:45 -08:00
Max Jakob	ab3d944667	community[patch]: ElasticsearchStore: preserve user headers (#16830 ) Users can provide an Elasticsearch connection with custom headers. This PR makes sure these headers are preserved when adding the langchain user agent header.	2024-02-13 12:37:35 -08:00
Erick Friis	112e10e933	infra: azure release integration testing secrets (#17476 )	2024-02-13 12:17:06 -08:00
Erick Friis	9eb1b56e73	pinecone[patch]: release 0.0.2 (#17477 )	2024-02-13 12:01:45 -08:00
Erick Friis	37678471c4	openai[patch]: relax tiktoken constraint, release 0.0.6 (#17472 )	2024-02-13 11:25:55 -08:00
Wendy H. Chun	2df7387c91	langchain[patch]: Fix to avoid infinite loop during collapse chain in map reduce (#16253 ) - Description: Depending on `token_max` used in `load_summarize_chain`, it could cause an infinite loop when documents cannot collapse under `token_max`. This change would not affect the existing feature, but it also gives an option to users to avoid the situation. - Issue: https://github.com/langchain-ai/langchain/issues/16251 - Dependencies: None - Twitter handle: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-13 10:55:32 -08:00
wulixuan	5d06797905	community[minor]: integrate chat models with Yuan2.0 (#16575 ) 1. integrate chat models with [`Yuan2.0`](https://github.com/IEIT-Yuan/Yuan-2.0/blob/main/README-EN.md) 2. add a new doc for [Yuan2.0 integration](docs/docs/integrations/llms/yuan2.ipynb) Yuan2.0 is a new generation Fundamental Large Language Model developed by IEIT System. We have published all three models, Yuan 2.0-102B, Yuan 2.0-51B, and Yuan 2.0-2B. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-13 10:55:14 -08:00
Taha Khabouss	15baffc484	langchain[patch]: Ensure that the Elasticsearch Query Translator functions accurately w… (#17044 ) Description: Addresses a problem where the Date type within an Elasticsearch SelfQueryRetriever would encounter difficulties in generating a valid query. Issue: #17042 --------- Co-authored-by: Max Jakob <max.jakob@elastic.co> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-13 10:54:24 -08:00
Erick Friis	e5c76f9dbd	pinecone[patch]: poetry update (#17471 )	2024-02-13 10:32:29 -08:00
Erick Friis	10bdf2422c	pinecone[patch]: release 0.0.2rc0, remove simsimd dep (#17469 )	2024-02-13 10:02:16 -08:00
Erick Friis	065cde69b1	google-genai[patch]: release 0.0.9, safety settings docs (#17432 )	2024-02-13 10:01:25 -08:00
Sergey Kozlov	db6f266d97	core: improve None value processing in merge_dicts() (#17462 ) - Description: fix `None` and `0` merging in `merge_dicts()`, add tests. ```python from langchain_core.utils._merge import merge_dicts assert merge_dicts({"a": None}, {"a": 0}) == {"a": 0} ``` --------- Co-authored-by: Sergey Kozlov <sergey.kozlov@ludditelabs.io>	2024-02-13 08:48:02 -08:00
Ian Gregory	e5472b5eb8	Framework for supporting more languages in LanguageParser (#13318 ) ## Description I am submitting this for a school project as part of a team of 5. Other team members are @LeilaChr, @maazh10, @Megabear137, @jelalalamy. This PR also has contributions from community members @Harrolee and @Mario928. Initial context is in the issue we opened (#11229). This pull request adds: - Generic framework for expanding the languages that `LanguageParser` can handle, using the [tree-sitter](https://github.com/tree-sitter/py-tree-sitter#py-tree-sitter) parsing library and existing language-specific parsers written for it - Support for the following additional languages in `LanguageParser`: - C - C++ - C# - Go - Java (contributed by @Mario928 https://github.com/ThatsJustCheesy/langchain/pull/2) - Kotlin - Lua - Perl - Ruby - Rust - Scala - TypeScript (contributed by @Harrolee https://github.com/ThatsJustCheesy/langchain/pull/1) Here is the [design document](https://docs.google.com/document/d/17dB14cKCWAaiTeSeBtxHpoVPGKrsPye8W0o_WClz2kk) if curious, but no need to read it. ## Issues - Closes #11229 - Closes #10996 - Closes #8405 ## Dependencies `tree_sitter` and `tree_sitter_languages` on PyPI. We have tried to add these as optional dependencies. ## Documentation We have updated the list of supported languages, and also added a section to `source_code.ipynb` detailing how to add support for additional languages using our framework. ## Maintainer - @hwchase17 (previously reviewed https://github.com/langchain-ai/langchain/pull/6486) Thanks!! ## Git commits We will gladly squash any/all of our commits (esp merge commits) if necessary. Let us know if this is desirable, or if you will be squash-merging anyway. <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Maaz Hashmi <mhashmi373@gmail.com> Co-authored-by: LeilaChr <87657694+LeilaChr@users.noreply.github.com> Co-authored-by: Jeremy La <jeremylai511@gmail.com> Co-authored-by: Megabear137 <zubair.alnoor27@gmail.com> Co-authored-by: Lee Harrold <lhharrold@sep.com> Co-authored-by: Mario928 <88029051+Mario928@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-13 08:45:49 -08:00
merlin-quix	729c6d6827	docs: add use case for managing chat messages via Apache Kafka (#16771 ) Adding a new notebook that demonstrates how to use LangChain's standard chat features while passing the chat messages back and forth via Apache Kafka. This goal is to simulate an architecture where the chat front end and the LLM are running as separate services that need to communicate with one another over an internal nework. It's an alternative to typical pattern of requesting a reponse from the model via a REST API (there's more info on why you would want to do this at the end of the notebook). NOTE: Assuming "uses cases" is the right place for this but feel free to propose another location. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-13 08:09:15 -08:00
Bagatur	3925071dd6	langchain[patch], templates[patch]: fix multi query retriever, web re… (#17434 ) …search retriever Fixes #17352	2024-02-12 22:52:07 -08:00
Bagatur	c0ce93236a	experimental[patch]: fix zero-shot pandas agent (#17442 )	2024-02-12 21:58:35 -08:00
Abhishek Jain	37e1275f9e	community[patch]: Fixed the 'aembed' method of 'CohereEmbeddings'. (#16497 ) Description: - The existing code was trying to find a `.embeddings` property on the `Coroutine` returned by calling `cohere.async_client.embed`. - Instead, the `.embeddings` property is present on the value returned by the `Coroutine`. - Also, it seems that the original cohere client expects a value of `max_retries` to not be `None`. Hence, setting the default value of `max_retries` to `3`. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-12 21:57:27 -08:00
Sridhar Ramaswamy	9f1cbbc6ed	community[minor]: Add pebblo safe document loader (#16862 ) - Description: Pebblo opensource project enables developers to safely load data to their Gen AI apps. It identifies semantic topics and entities found in the loaded data and summarizes them in a developer-friendly report. - Dependencies: none - Twitter handle: srics @hwchase17	2024-02-12 21:56:12 -08:00
Preetam D'Souza	0834457f28	docs: Fix broken link in summarization use-case (#16554 ) - Description: Fix broken link to `StuffDocumentsChain` - Issue: N/A - Dependencies: None - Twitter handle: [@preetamdsouza](https://twitter.com/preetamdsouza)	2024-02-12 21:40:57 -08:00
Sheil Naik	d70a5bbf15	docs: Fix broken link in LLMs index.mdx (#16557 ) - Description: The [LLMs](https://python.langchain.com/docs/modules/model_io/llms/) page has a broken link. This fixes the link. - Issue: N/A - Dependencies: N/A - Twitter handle: @sheilnaik	2024-02-12 21:39:56 -08:00
mhavey	1bbb64d956	community[minor], langchian[minor]: Add Neptune Rdf graph and chain (#16650 ) Description: This PR adds a chain for Amazon Neptune graph database RDF format. It complements the existing Neptune Cypher chain. The PR also includes a Neptune RDF graph class to connect to, introspect, and query a Neptune RDF graph database from the chain. A sample notebook is provided under docs that demonstrates the overall effect: invoking the chain to make natural language queries against Neptune using an LLM. Issue: This is a new feature Dependencies: The RDF graph class depends on the AWS boto3 library if using IAM authentication to connect to the Neptune database. --------- Co-authored-by: Piyush Jain <piyushjain@duck.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-12 21:30:20 -08:00
Michael Feil	e1cfd0f3e7	community[patch]: infinity embeddings update incorrect default url (#16759 ) The default url has always been incorrect (7797 instead 7997). Here is a update to the correct url.	2024-02-12 20:05:08 -08:00
Massimiliano Pronesti	df7cbd6fbb	community[minor]: add FlashRank ranker (#16785 ) Description: This PR adds support for [flashrank](https://github.com/PrithivirajDamodaran/FlashRank) for reranking as alternative to Cohere. I'm not sure `libs/langchain` is the right place for this change. At first, I wanted to put it under `libs/community`. All the compressors were under `libs/langchain/retrievers/document_compressors` though. Hope this makes sense!	2024-02-12 20:00:52 -08:00
Andreas Motl	1fdd9bd980	community/SQLDatabase: Generalize and trim software tests (#16659 ) - Description: Improve test cases for `SQLDatabase` adapter component, see [suggestion](https://github.com/langchain-ai/langchain/pull/16655#pullrequestreview-1846749474). - Depends on: GH-16655 - Addressed to: @baskaryan, @cbornet, @eyurtsev _Remark: This PR is stacked upon GH-16655, so that one will need to go in first._ Edit: Thank you for bringing in GH-17191, @eyurtsev. This is a little aftermath, improving/streamlining the corresponding test cases.	2024-02-12 22:58:34 -05:00
Theo / Taeyoon Kang	1987f905ed	core[patch]: Support .yml extension for YAML (#16783 ) - Description: [AS-IS] When dealing with a yaml file, the extension must be .yaml. [TO-BE] In the absence of extension length constraints in the OS, the extension of the YAML file is yaml, but control over the yml extension must still be made. It's as if it's an error because it's a .jpg extension in jpeg support. - Issue: - - Dependencies: no dependencies required for this change,	2024-02-12 19:57:20 -08:00
Kapil Sachdeva	cd00a87db7	community[patch] - in FAISS vector store, support passing custom DocStore implementation when using from_xxx methods (#16801 ) - Description: The from__xx methods of FAISS class have hardcoded InMemoryStore implementation and thereby not let users pass a custom DocStore implementation, - Issue: no referenced issue, - Dependencies: none, - Twitter handle: ksachdeva	2024-02-12 19:51:55 -08:00
Chris	f9f5626ca4	community[patch]: Fix github search issues and PRs PaginatedList has no len() error (#16806 ) Description: Bugfix: Langchain_community's GitHub Api wrapper throws a TypeError when searching for issues and/or PRs (the `search_issues_and_prs` method). This is because PyGithub's PageinatedList type does not support the len() method. See https://github.com/PyGithub/PyGithub/issues/1476 ![image](https://github.com/langchain-ai/langchain/assets/8849021/57390b11-ed41-4f48-ba50-f3028610789c) Dependencies: None Twitter handle: @ChrisKeoghNZ I haven't registered an issue as it would take me longer to fill the template out than to make the fix, but I'm happy to if that's deemed essential. I've added a simple integration test to cover this as there were no existing unit tests and it was going to be tricky to set them up. Co-authored-by: Chris Keogh <chris.keogh@xero.com>	2024-02-12 19:50:59 -08:00
morgana	722aae4fd1	community: add delete method to rocksetdb vectorstore to support recordmanager (#17030 ) - Description: This adds a delete method so that rocksetdb can be used with `RecordManager`. - Issue: N/A - Dependencies: N/A - Twitter handle: `@_morgan_adams_` --------- Co-authored-by: Rockset API Bot <admin@rockset.io>	2024-02-12 19:50:20 -08:00
yin1991	c454dc36fc	community[proxy]: Enhancement/add proxy support playwrighturlloader 16751 (#16822 ) - Description: Enhancement/add proxy support playwrighturlloader 16751 - Issue: [Enhancement: Add Proxy Support to PlaywrightURLLoader Class](https://github.com/langchain-ai/langchain/issues/16751) - Dependencies: - Twitter handle: @ootR77013489 --------- Co-authored-by: root <root@ip-172-31-46-160.ap-southeast-1.compute.internal> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-12 19:48:29 -08:00
Bhupesh Varshney	e3b775e035	infra: make `.gitignore` consistent with standard python gitignore (#16828 ) - The new .gitignore version is inherited from the one maintained by the github community over at https://github.com/github/gitignore/blob/main/Python.gitignore - This should cover all the cases of how a langchain app can be used.	2024-02-12 19:43:41 -08:00
James Braza	64938ae6f2	infra: unit testing `check_package_version` (#16825 ) Wrote a unit test for `check_package_version` in the core package. Note that this is a revival of https://github.com/langchain-ai/langchain/pull/16387 after GitHub incident (see https://github.com/langchain-ai/langchain/discussions/16796).	2024-02-12 19:39:58 -08:00
Max Jakob	604e117411	docs: another auth method for ElasticsearchStore (#16831 ) Users can also use their own Elasticsearch client object to configure the connection.	2024-02-12 19:29:54 -08:00
Zeeland	4986e7227e	docs: rm unnecessary imports (#16876 ) - Description: optimize the document of memory usage - Issue: it lose some install guide	2024-02-12 19:25:54 -08:00
Lingzhen Chen	30af711c34	community[patch]: update AzureSearch class to work with azure-search-documents=11.4.0 (#15659 ) - Description: Updates `libs/community/langchain_community/vectorstores/azuresearch.py` to support the stable version `azure-search-documents=11.4.0` - Issue: https://github.com/langchain-ai/langchain/issues/14534, https://github.com/langchain-ai/langchain/issues/15039, https://github.com/langchain-ai/langchain/issues/15355 - Dependencies: azure-search-documents>=11.4.0 --------- Co-authored-by: Clément Tamines <Skar0@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-12 19:23:35 -08:00
Robby	e135dc70c3	community[patch]: Invoke callback prior to yielding token (#17348 ) Description: Invoke callback prior to yielding token in stream method for Ollama. Issue: [Callback for on_llm_new_token should be invoked before the token is yielded by the model #16913](https://github.com/langchain-ai/langchain/issues/16913) Co-authored-by: Robby <h0rv@users.noreply.github.com>	2024-02-12 19:22:55 -08:00
Christophe Bornet	ab025507bc	community[patch]: Add async methods to VectorStoreQATool (#16949 )	2024-02-12 19:19:50 -08:00
Christophe Bornet	fb7552bfcf	Add async methods to InMemoryCache (#17425 ) Add async methods to InMemoryCache	2024-02-12 22:02:38 -05:00
Eugene Yurtsev	93472ee9e6	core[patch]: Replace memory stream implementation used by LogStreamCallbackHandler (#17185 ) This PR replaces the memory stream implementation used by the LogStreamCallbackHandler. This implementation resolves an issue in which streamed logs and streamed events originating from sync code would arrive only after the entire sync code would finish execution (rather than arriving in real time as they're generated). One example is if trying to stream tokens from an llm within a tool. If the tool was an async tool, but the llm was invoked via stream (sync variant) rather than astream (async variant), then the tokens would fail to stream in real time and would all arrived bunched up after the tool invocation completed.	2024-02-12 21:57:38 -05:00
yin1991	37ef6ac113	community[patch]: Add Pagination to GitHubIssuesLoader for Efficient GitHub Issues Retrieval (#16934 ) - Description: Add Pagination to GitHubIssuesLoader for Efficient GitHub Issues Retrieval - Issue: [the issue # it fixes if applicable,](https://github.com/langchain-ai/langchain/issues/16864) --------- Co-authored-by: root <root@ip-172-31-46-160.ap-southeast-1.compute.internal> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-12 18:30:36 -08:00
Leonid Ganeline	b87d6f9f48	docs: `Redis` page update (#16906 ) - Reordered sections - Applied consistent formatting - Fixed headers (there were 2 H1 headers; this breaks CoT) - Added `Settings` header and moved all related sections under it	2024-02-12 18:23:35 -08:00
Bagatur	22638e5927	community[patch]: give reranker default client val (#17289 )	2024-02-12 17:21:53 -08:00
Naveenkhasyap	841e5f514e	docs: Updated doc for integrations/chat/anthropic_functions #15664 (#17226 ) Description: Updated doc for integrations/chat/anthropic_functions with new functions: invoke. Changed structure of the document to match the required one. Issue: https://github.com/langchain-ai/langchain/issues/15664 Dependencies: None Twitter handle: None --------- Co-authored-by: NaveenMaltesh <naveen@onmeta.in>	2024-02-12 17:09:38 -08:00
Robby	ece4b43a81	community[patch]: doc loaders mypy fixes (#17368 ) Description: Fixed `type: ignore`'s for mypy for some document_loaders. Issue: [Remove "type: ignore" comments #17048 ](https://github.com/langchain-ai/langchain/issues/17048) --------- Co-authored-by: Robby <h0rv@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-12 16:51:06 -08:00
Robby	0653aa469a	community[patch]: Invoke callback prior to yielding token (#17346 ) Description: Invoke callback prior to yielding token in stream method for watsonx. Issue: [Callback for on_llm_new_token should be invoked before the token is yielded by the model #16913](https://github.com/langchain-ai/langchain/issues/16913) Co-authored-by: Robby <h0rv@users.noreply.github.com>	2024-02-12 16:36:33 -08:00
Min-Seong Lee	ce9a68791b	docs: fix typo in question_answering quickstart.ipynb (#17393 ) - Description: typo in docs (facillitate -> facilitate) - Issue: Typo - Dependencies: Nope - Twitter handle: None	2024-02-12 16:33:47 -08:00
Pennlaine	e1bc623f8f	docs: Updated docs for sitemap loader to use correct URL (#17395 ) - Description: Updated URL for sitemap loader from "https://langchain.readthedocs.io/sitemap.xml" to "https://api.python.langchain.com/sitemap.xml" - Issue: Fixes #17236	2024-02-12 16:20:32 -08:00
Bagatur	bd0ad6637a	infra: pr template nit (#17438 )	2024-02-12 16:19:14 -08:00
Bagatur	37629516cd	infra: update pr template (#17437 )	2024-02-12 16:17:30 -08:00
Ikko Eltociear Ashimine	b48fa8b695	docs: fix typo in vikingdb.ipynb (#17429 ) retreival -> retrieval	2024-02-12 15:51:12 -08:00
Bagatur	f7e453971d	community[patch]: remove print (#17435 )	2024-02-12 15:21:38 -08:00
Spencer Kelly	54fa78c887	community[patch]: fixed vector similarity filtering (#16967 ) Description: changed filtering so that failed filter doesn't add document to results. Currently filtering is entirely broken and all documents are returned whether or not they pass the filter. fixes issue introduced in https://github.com/langchain-ai/langchain/pull/16190	2024-02-12 14:52:57 -08:00
Aditya	a23c719c8b	google-genai[minor]: add safety settings (#16836 ) Replace this entire comment with: - Description:Expose safety_settings for Gemini integrations on google-generativeai - Issue:NA, - Dependencies:NA - Twitter handle:@aditya_rane @lkuligin for review --------- Co-authored-by: adityarane@google.com <adityarane@google.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-12 13:44:24 -08:00
Abhijeeth Padarthi	584b647b96	community[minor]: AWS Athena Document Loader (#15625 ) - Description: Adds the document loader for [AWS Athena](https://aws.amazon.com/athena/), a serverless and interactive analytics service. - Dependencies: Added boto3 as a dependency	2024-02-12 12:53:40 -08:00
david-tempelmann	93da18b667	community[minor]: Add mmr and similarity_score_threshold retrieval to DatabricksVectorSearch (#16829 ) - Description: This PR adds support for `search_types="mmr"` and `search_type="similarity_score_threshold"` to retrievers using `DatabricksVectorSearch`, - Issue: - Dependencies: - Twitter handle: --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-12 12:51:37 -08:00
Erick Friis	42648061ad	openai[patch]: code cleaning (#17355 ) h/t @tdene for finding cleanup op in #17047	2024-02-12 12:36:12 -08:00
Harrison Chase	a9d6da609a	add self discover notebook (#17387 )	2024-02-12 09:38:43 -08:00
ByeongUk Choi	ac970c9497	Update Docs for TFIDFRetriever Import Path (#17322 ) This PR updates the `TF-IDF.ipynb` documentation to reflect the new import path for TFIDFRetriever in the langchain-community package. The previous path, `from langchain.retrievers import TFIDFRetriever`, has been updated to `from langchain_community.retrievers import TFIDFRetriever` to align with the latest changes in the langchain library.	2024-02-11 21:26:08 -08:00
Michael Hunger	1c902ce3d1	tools:docs: update google_search.ipynb - change tool name (#17354 ) according to https://youtu.be/rZus0JtRqXE?si=aFo1JTDnu5kSEiEN&t=678 by @efriis - Description: Seems the requirements for tool names have changed and spaces are no longer allowed. Changed the tool name from Google Search to google_search in the notebook - Issue: n/a - Dependencies: none - Twitter handle: @mesirii	2024-02-11 21:25:19 -08:00
Massimiliano Pronesti	3894b4d9a5	community: add gpt-4-turbo and gpt-4-0125 costs (#17349 ) Ref: https://openai.com/pricing <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-02-11 21:24:24 -08:00
jiangzf93	d6a1c88ca7	docs: update documentation for file system tool integration (#17377 ) - Description: Update the docs for the tool integration module `file system` - Issue: [For New Contributors: Update Integration Documentation #15664](https://github.com/langchain-ai/langchain/issues/15664#top) - Dependencies: N/A	2024-02-11 21:19:40 -08:00
Pennlaine	2384267900	Updated doc for tools/pubmed with new functions: invoke. (#17378 ) Updated doc for integrations/chat/anthropic_functions #15664 - Description: Adds `pip install` instructions Update `run` with `invoke` - Issue: Fixes #15664	2024-02-11 21:19:31 -08:00
Tomaz Bratanic	19a1c9183d	Improve graph cypher qa prompt (#17380 ) Unlike vector results, the LLM has to completely trust the context of a graph database result, even if it doesn't provide whole context. We tried with instructions, but it seems that adding a single example is the way to go to solve this issue.	2024-02-11 21:15:46 -08:00
Sandeep Banerjee	183daa6e6f	google-genai[patch]: on_llm_new_token fix (#16924 ) ### This pull request makes the following changes: * Fixed issue #16913 Fixed the google gen ai chat_models.py code to make sure that the callback is called before the token is yielded <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-09 18:00:24 -08:00
Bagatur	10c10f2dea	cli[patch]: integration template nits (#14691 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-09 17:59:34 -08:00
Erick Friis	99540d3d75	infra: no print in newer partner packages (#17353 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-02-09 16:40:02 -08:00
William FH	7c03cc5ed4	Support serialization when inputs/outputs contain generators (#17338 ) Pydantic's `dict()` function raises an error here if you pass in a generator. We have a more robust serialization function in lagnsmith that we will use instead.	2024-02-09 16:24:54 -08:00
Erick Friis	3a2eb6e12b	infra: add print rule to ruff (#16221 ) Added noqa for existing prints. Can slowly remove / will prevent more being intro'd	2024-02-09 16:13:30 -08:00
Jael Gu	c07c0da01a	community[patch]: Fix Milvus add texts when ids=None (#17021 ) - Description: Fix Milvus add texts when ids=None (auto_id=True) Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-09 18:48:37 -05:00
Quang Hoa	54c1fb3f25	community[patch]: Make some functions work with Milvus (#10695 ) Description Make some functions work with Milvus: 1. get_ids: Get primary keys by field in the metadata 2. delete: Delete one or more entities by ids 3. upsert: Update/Insert one or more entities Issue None Dependencies None Tag maintainer: @hwchase17 Twitter handle: None --------- Co-authored-by: HoaNQ9 <hoanq.1811@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-09 15:21:31 -08:00
kYLe	c9999557bf	community[patch]: Modify LLMs/Anyscale work with OpenAI API v1 (#14206 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> - Description: 1. Modify LLMs/Anyscale to work with OAI v1 2. Get rid of openai_ prefixed variables in Chat_model/ChatAnyscale 3. Modify `anyscale_api_base` to `anyscale_base_url` to follow OAI name convention (reverted) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-09 15:11:18 -08:00
Charlie Marsh	24c0bab57b	infra, multiple: Upgrade configuration for Ruff v0.2.0 (#16905 ) ## Summary This PR upgrades LangChain's Ruff configuration in preparation for Ruff's v0.2.0 release. (The changes are compatible with Ruff v0.1.5, which LangChain uses today.) Specifically, we're now warning when linter-only options are specified under `[tool.ruff]` instead of `[tool.ruff.lint]`. --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-09 14:28:02 -08:00
Bagatur	01409add5a	google-vertexai[patch]: rm deps (#17077 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-09 14:12:10 -08:00
Erick Friis	d9e7675f7e	templates: gemini-functions-agent readme update (#17288 )	2024-02-09 14:10:23 -08:00
Erick Friis	1c2facf88d	nvidia-ai-endpoints[patch]: release 0.0.3 (#17345 )	2024-02-09 13:55:01 -08:00
Vadim Kudlay	5f9ac6986e	nvidia-ai-endpoints[patch]: model arguments (e.g. temperature) on construction bug (#17290 ) - Issue: Issue with model argument support (been there for a while actually): - Non-specially-handled arguments like temperature don't work when passed through constructor. - Such arguments DO work quite well with `bind`, but also do not abide by field requirements. - Since initial push, server-side error messages have gotten better and v0.0.2 raises better exceptions. So maybe it's better to let server-side handle such issues? - Description: - Removed ChatNVIDIA's argument fields in favor of `model_kwargs`/`model_kws` arguments which aggregates constructor kwargs (from constructor pathway) and merges them with call kwargs (bind pathway). - Shuffled a few functions from `_NVIDIAClient` to `ChatNVIDIA` to streamline construction for future integrations. - Minor/Optional: Old services didn't have stop support, so client-side stopping was implemented. Now do both. - Any Breaking Changes: Minor breaking changes if you strongly rely on chat_model.temperature, etc. This is captured by chat_model.model_kwargs. PR passes tests and example notebooks and example testing. Still gonna chat with some people, so leaving as draft for now. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-09 13:46:02 -08:00
Leonid Ganeline	932c52c333	community[patch]: docstrings (#16810 ) - added missed docstrings - formated docstrings to the consistent form	2024-02-09 12:48:57 -08:00
Leonid Ganeline	ae66bcbc10	core[patch]: docstring update (#16813 ) - added missed docstrings - formated docstrings to consistent form	2024-02-09 12:47:41 -08:00
Eugene Yurtsev	e10030e241	core[patch]: Add unit test to cover different streaming format for json parsing (#17063 ) Add unit test to cover this issue: https://github.com/langchain-ai/langchain/issues/16423 which was resolved by this PR: https://github.com/langchain-ai/langchain/pull/16670/files --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-09 11:28:55 -05:00
Kononov Pavel	15bc201967	langchain_community: Fix typo bug (#17324 ) Problem from #17095 This error wasn't in the v1.4.0	2024-02-09 11:27:33 -05:00
Eugene Yurtsev	344a227b5b	CI: Update documentation template (#17325 ) Update the documentation template	2024-02-09 11:27:18 -05:00
Erick Friis	023cb59e8a	templates: gemini-functions-agent genai package bump (#17286 )	2024-02-08 19:47:58 -08:00
Erick Friis	e660a1685b	google-genai[patch]: release 0.0.8 (#17285 )	2024-02-08 19:39:44 -08:00
Erick Friis	12d3159dd6	templates: simplify tool in gemini-functions-agent 2 (#17283 )	2024-02-08 19:39:29 -08:00
Erick Friis	febf9540b9	google-genai[patch]: fix tool format, use protos (#17284 )	2024-02-08 19:36:49 -08:00
Erick Friis	d8913b9428	templates: simplify tool in gemini-functions-agent (#17282 )	2024-02-08 19:09:27 -08:00
German Martin	1032faba5f	langchain_google_genai : Add missing _identifying_params property. (#17224 ) Description: Missing _identifying_params create issues when dealing with callbacks to get current run model parameters. All other model partners implementation provide this property and also provide _default_params. I'm not sure about the default values to include or if we can re-use the same as for _VertexAICommon(), this change allows you to access the model parameters correctly. Issue: Not exactly this issue but could be related https://github.com/langchain-ai/langchain/issues/14711 Twitter handle:@musicaoriginal2	2024-02-08 17:40:21 -08:00
Erick Friis	e4da7918f3	google-genai[patch]: fix streaming, function calling (#17268 )	2024-02-08 17:29:53 -08:00
Ruben Hakopian	96b5711a0c	google-vertexai[patch]: Fixed SafetySettings handling in streaming API in VertexAI (#17278 ) The streaming API doesn't separate safety_settings from the generation_config payload. As the result the following error is observed when using `stream` API. The functionality is correct with `invoke` API. The fix separates the `safety_settings` from params and sets it as argument to the `send_message` method. ``` ERROR: Unknown field for GenerationConfig: safety_settings Traceback (most recent call last): File "/Users/user/Library/Caches/pypoetry/virtualenvs/chatbot-worker-main-Ju-qIM-X-py3.12/lib/python3.12/site-packages/langchain_core/language_models/chat_models.py", line 250, in stream raise e File "/Users/user/Library/Caches/pypoetry/virtualenvs/chatbot-worker-main-Ju-qIM-X-py3.12/lib/python3.12/site-packages/langchain_core/language_models/chat_models.py", line 234, in stream for chunk in self._stream( File "/Users/user/Library/Caches/pypoetry/virtualenvs/chatbot-worker-main-Ju-qIM-X-py3.12/lib/python3.12/site-packages/langchain_google_vertexai/chat_models.py", line 501, in _stream for response in responses: File "/Users/user/Library/Caches/pypoetry/virtualenvs/chatbot-worker-main-Ju-qIM-X-py3.12/lib/python3.12/site-packages/vertexai/generative_models/_generative_models.py", line 921, in _send_message_streaming for chunk in stream: File "/Users/user/Library/Caches/pypoetry/virtualenvs/chatbot-worker-main-Ju-qIM-X-py3.12/lib/python3.12/site-packages/vertexai/generative_models/_generative_models.py", line 514, in _generate_content_streaming request = self._prepare_request( ^^^^^^^^^^^^^^^^^^^^^^ File "/Users/user/Library/Caches/pypoetry/virtualenvs/chatbot-worker-main-Ju-qIM-X-py3.12/lib/python3.12/site-packages/vertexai/generative_models/_generative_models.py", line 256, in _prepare_request gapic_generation_config = gapic_content_types.GenerationConfig( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/user/Library/Caches/pypoetry/virtualenvs/chatbot-worker-main-Ju-qIM-X-py3.12/lib/python3.12/site-packages/proto/message.py", line 576, in __init__ raise ValueError( ValueError: Unknown field for GenerationConfig: safety_settings ``` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-08 17:25:28 -08:00
Kartheek Yakkala	b18c6ab9ad	docs: Added LangGraph in framework parts of readme file (#17279 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-02-08 17:19:47 -08:00
Bagatur	65e97c9b53	infra: mv SQLDatabase tests to community (#17276 )	2024-02-08 17:05:43 -08:00
Bagatur	72c7af0bc0	langchain[patch]: undo redis cache import (#17275 )	2024-02-08 16:39:55 -08:00
Bagatur	8bad4157ad	langchain[patch]: Release 0.1.6 (#17133 )	2024-02-08 16:25:06 -08:00
Bagatur	7fa4dc593f	core[patch]: Release 0.1.22 (#17274 )	2024-02-08 16:13:33 -08:00
Bagatur	02ef9164b5	langchain[patch]: expose cohere rerank score, add parent doc param (#16887 )	2024-02-08 16:07:18 -08:00
Bagatur	35c1bf339d	infra: rm boto3, gcaip from pyproject (#17270 )	2024-02-08 15:28:22 -08:00
Leonid Ganeline	389b055bd6	docs: `Toolkits` menu (#16217 ) The Integrations `Toolkits` menu was named as [`Agents and toolkits`](https://python.langchain.com/docs/integrations/toolkits). This name has a historical reason that is not correct anymore. Now this menu is all about community `Toolkits`. There is a separate menu for [Agents](https://python.langchain.com/docs/modules/agents/). Also Agents are officially not part of Integrations (Community package) but part of LangChain package.	2024-02-08 14:52:26 -08:00
Alex	de5e96b5f9	community[patch]: updated openai prices in mapping (#17009 ) - Description: there are january prices update for chatgpt [blog](https://openai.com/blog/new-embedding-models-and-api-updates), also there are updates on their website on page [pricing](https://openai.com/pricing) - Issue: N/A --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-08 14:43:44 -08:00
Mohammad Mohtashim	e35c7fa3b2	[Langchain_core]: Added Docstring for RunnableConfigurableAlternatives (#17263 ) I noticed that RunnableConfigurableAlternatives which is an important composition in LCEL has no Docstring. Therefore I added the detailed Docstring for it. @baskaryan, @eyurtsev, @hwchase17 please have a look and let me if the docstring is looking good. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-08 17:05:33 -05:00
Armin Stepanyan	641efcf41c	community: add runtime kwargs to HuggingFacePipeline (#17005 ) This PR enables changing the behaviour of huggingface pipeline between different calls. For example, before this PR there's no way of changing maximum generation length between different invocations of the chain. This is desirable in cases, such as when we want to scale the maximum output size depending on a dynamic prompt size. Usage example: ```python from langchain_community.llms.huggingface_pipeline import HuggingFacePipeline from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline model_id = "gpt2" tokenizer = AutoTokenizer.from_pretrained(model_id) model = AutoModelForCausalLM.from_pretrained(model_id) pipe = pipeline("text-generation", model=model, tokenizer=tokenizer) hf = HuggingFacePipeline(pipeline=pipe) hf("Say foo:", pipeline_kwargs={"max_new_tokens": 42}) ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-08 13:58:31 -08:00
Scott Nath	a32798abd7	community: Add you.com utility, update you retriever integration docs (#17014 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> - Description: changes to you.com files - general cleanup - adds community/utilities/you.py, moving bulk of code from retriever -> utility - removes `snippet` as endpoint - adds `news` as endpoint - adds more tests <s>Description: update community MAKE file - adds `integration_tests` - adds `coverage`</s> - Issue: the issue # it fixes if applicable, - [For New Contributors: Update Integration Documentation](https://github.com/langchain-ai/langchain/issues/15664#issuecomment-1920099868) - Dependencies: n/a - Twitter handle: @scottnath - Mastodon handle: scottnath@mastodon.social --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-08 13:47:50 -08:00
joelsprunger	3984f6604f	langchain: adds recursive json splitter (#17144 ) - Description: This adds a recursive json splitter class to the existing text_splitters as well as unit tests - Issue: splitting text from structured data can cause issues if you have a large nested json object and you split it as regular text you may end up losing the structure of the json. To mitigate against this you can split the nested json into large chunks and overlap them, but this causes unnecessary text processing and there will still be times where the nested json is so big that the chunks get separated from the parent keys. As an example you wouldn't want the following to be split in half: ```shell {'val0': 'DFWeNdWhapbR', 'val1': {'val10': 'QdJo', 'val11': 'FWSDVFHClW', 'val12': 'bkVnXMMlTiQh', 'val13': 'tdDMKRrOY', 'val14': 'zybPALvL', 'val15': 'JMzGMNH', 'val16': {'val160': 'qLuLKusFw', 'val161': 'DGuotLh', 'val162': 'KztlcSBropT', -----------------------------------------------------------------------split----- 'val163': 'YlHHDrN', 'val164': 'CtzsxlGBZKf', 'val165': 'bXzhcrWLmBFp', 'val166': 'zZAqC', 'val167': 'ZtyWno', 'val168': 'nQQZRsLnaBhb', 'val169': 'gSpMbJwA'}, 'val17': 'JhgiyF', 'val18': 'aJaqjUSFFrI', 'val19': 'glqNSvoyxdg'}} ``` Any llm processing the second chunk of text may not have the context of val1, and val16 reducing accuracy. Embeddings will also lack this context and this makes retrieval less accurate. Instead you want it to be split into chunks that retain the json structure. ```shell {'val0': 'DFWeNdWhapbR', 'val1': {'val10': 'QdJo', 'val11': 'FWSDVFHClW', 'val12': 'bkVnXMMlTiQh', 'val13': 'tdDMKRrOY', 'val14': 'zybPALvL', 'val15': 'JMzGMNH', 'val16': {'val160': 'qLuLKusFw', 'val161': 'DGuotLh', 'val162': 'KztlcSBropT', 'val163': 'YlHHDrN', 'val164': 'CtzsxlGBZKf'}}} ``` and ```shell {'val1':{'val16':{ 'val165': 'bXzhcrWLmBFp', 'val166': 'zZAqC', 'val167': 'ZtyWno', 'val168': 'nQQZRsLnaBhb', 'val169': 'gSpMbJwA'}, 'val17': 'JhgiyF', 'val18': 'aJaqjUSFFrI', 'val19': 'glqNSvoyxdg'}} ``` This recursive json text splitter does this. Values that contain a list can be converted to dict first by using split(... convert_lists=True) otherwise long lists will not be split and you may end up with chunks larger than the max chunk. In my testing large json objects could be split into small chunks with ✅ Increased question answering accuracy ✅ The ability to split into smaller chunks meant retrieval queries can use fewer tokens - Dependencies: json import added to text_splitter.py, and random added to the unit test - Twitter handle: @joelsprunger --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-08 13:45:34 -08:00
Schalkje	f0ada1a396	docs: Update quickstart.mdx - Fix 422 error in example with LangServe client code (#17163 ) Description:: Fix 422 error in example with LangServe client code httpx.HTTPStatusError: Client error '422 Unprocessable Entity' for url 'http://localhost:8000/agent/invoke'	2024-02-08 13:35:39 -08:00
Leonid Kuligin	1862900078	google-genai[patch]: added parsing of function call / response (#17245 )	2024-02-08 13:34:46 -08:00
Cailin Wang	a210a8bc53	langchain[patch]: Fix create_retriever_tool missing on_retriever_end Document content (#16933 ) - Description: In create_retriever_tool create_tool, fix create_retriever_tool's missing Document content for on_retriever_end, caused by create_retriever_tool's missing callbacks parameter, - Twitter handle: @CailinWang_ --------- Co-authored-by: root <root@Bluedot-AI> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-08 13:18:43 -08:00
Kartheek Yakkala	3a22157d92	docs: Added LCEL for alibabacloud and anyscale (#17252 ) --------- Co-authored-by: KARTHEEK YAKKALA <kartheekyakkala@KARTHEEKs-Air.lan> Co-authored-by: KARTHEEK YAKKALA <kartheekyakkala.se@gmail.com>	2024-02-08 13:18:09 -08:00
Sparsh Jain	a2167614b7	google-genai[patch]: Invoke callback prior to yielding token (#17092 ) - Description: Invoke callback prior to yielding token in stream and astream methods for Google-genai, - Issue: the issue # 16913, - Twitter handle: Sparsh10649446 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-08 13:13:46 -08:00
Liang Zhang	7306600e2f	community[patch]: Support SerDe transform functions in Databricks LLM (#16752 ) Description: Databricks LLM does not support SerDe the transform_input_fn and transform_output_fn. After saving and loading, the LLM will be broken. This PR serialize these functions into a hex string using pickle, and saving the hex string in the yaml file. Using pickle to serialize a function can be flaky, but this is a simple workaround that unblocks many use cases. If more sophisticated SerDe is needed, we can improve it later. Test: Added a simple unit test. I did manual test on Databricks and it works well. The saved yaml looks like: ``` llm: _type: databricks cluster_driver_port: null cluster_id: null databricks_uri: databricks endpoint_name: databricks-mixtral-8x7b-instruct extra_params: {} host: e2-dogfood.staging.cloud.databricks.com max_tokens: null model_kwargs: null n: 1 stop: null task: null temperature: 0.0 transform_input_fn: 80049520000000000000008c085f5f6d61696e5f5f948c0f7472616e73666f726d5f696e7075749493942e transform_output_fn: null ``` @baskaryan ```python from langchain_community.embeddings import DatabricksEmbeddings from langchain_community.llms import Databricks from langchain.chains import RetrievalQA from langchain.document_loaders import TextLoader from langchain.text_splitter import CharacterTextSplitter from langchain.vectorstores import FAISS import mlflow embeddings = DatabricksEmbeddings(endpoint="databricks-bge-large-en") def transform_input(**request): request["messages"] = [ { "role": "user", "content": request["prompt"] } ] del request["prompt"] return request llm = Databricks(endpoint_name="databricks-mixtral-8x7b-instruct", transform_input_fn=transform_input) persist_dir = "faiss_databricks_embedding" # Create the vector db, persist the db to a local fs folder loader = TextLoader("state_of_the_union.txt") documents = loader.load() text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0) docs = text_splitter.split_documents(documents) db = FAISS.from_documents(docs, embeddings) db.save_local(persist_dir) def load_retriever(persist_directory): embeddings = DatabricksEmbeddings(endpoint="databricks-bge-large-en") vectorstore = FAISS.load_local(persist_directory, embeddings) return vectorstore.as_retriever() retriever = load_retriever(persist_dir) retrievalQA = RetrievalQA.from_llm(llm=llm, retriever=retriever) with mlflow.start_run() as run: logged_model = mlflow.langchain.log_model( retrievalQA, artifact_path="retrieval_qa", loader_fn=load_retriever, persist_dir=persist_dir, ) # Load the retrievalQA chain loaded_model = mlflow.pyfunc.load_model(logged_model.model_uri) print(loaded_model.predict([{"query": "What did the president say about Ketanji Brown Jackson"}])) ```	2024-02-08 13:09:50 -08:00
cjpark-data	ce22e10c4b	community[patch]: Fix KeyError 'embedding' (MongoDBAtlasVectorSearch) (#17178 ) - Description: Embedding field name was hard-coded named "embedding". So I suggest that change `res["embedding"]` into `res[self._embedding_key]`. - Issue: #17177, - Twitter handle: [@bagcheoljun17](https://twitter.com/bagcheoljun17)	2024-02-08 12:06:42 -08:00
Neli Hateva	9bb5157a3d	langchain[patch], community[patch]: Fixes in the Ontotext GraphDB Graph and QA Chain (#17239 ) - Description: Fixes in the Ontotext GraphDB Graph and QA Chain related to the error handling in case of invalid SPARQL queries, for which `prepareQuery` doesn't throw an exception, but the server returns 400 and the query is indeed invalid - Issue: N/A - Dependencies: N/A - Twitter handle: @OntotextGraphDB	2024-02-08 12:05:43 -08:00
ByeongUk Choi	b88329e9a5	community[patch]: Implement Unique ID Enforcement in FAISS (#17244 ) Description: Implemented unique ID validation in the FAISS component to ensure all document IDs are distinct. This update resolves issues related to non-unique IDs, such as inconsistent behavior during deletion processes.	2024-02-08 12:03:33 -08:00
Jorge Campo	88609565a3	docs: Fix typo in github.ipynb (#17259 ) 'agiven' -> 'a given'	2024-02-08 12:03:00 -08:00
Bagatur	852973d616	langchain[minor], core[minor]: update json, pydantic parser. add openai-json structured output runnable (#16914 )	2024-02-08 11:59:06 -08:00
hsuyuming	e22c4d4eb0	google-vertexai[patch]: fix _parse_response_candidate issue (#16647 ) Description: enable _parse_response_candidate to support complex structure format. Issue: currently, if Gemini response complex args format, people will get "TypeError: Object of type RepeatedComposite is not JSON serializable" error from _parse_response_candidate. response candidate example ``` content { role: "model" parts { function_call { name: "Information" args { fields { key: "people" value { list_value { values { string_value: "Joe is 30, his mom is Martha" } } } } } } } } finish_reason: STOP safety_ratings { category: HARM_CATEGORY_HARASSMENT probability: NEGLIGIBLE } safety_ratings { category: HARM_CATEGORY_HATE_SPEECH probability: NEGLIGIBLE } safety_ratings { category: HARM_CATEGORY_SEXUALLY_EXPLICIT probability: NEGLIGIBLE } safety_ratings { category: HARM_CATEGORY_DANGEROUS_CONTENT probability: NEGLIGIBLE } ``` error msg: ``` Traceback (most recent call last): File "/home/jupyter/user/abehsu/gemini_langchain_tools/example2.py", line 36, in <module> print(tagging_chain.invoke({"input": "Joe is 30, his mom is Martha"})) File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/site-packages/langchain_core/runnables/base.py", line 2053, in invoke input = step.invoke( File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/site-packages/langchain_core/runnables/base.py", line 3887, in invoke return self.bound.invoke( File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py", line 165, in invoke self.generate_prompt( File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py", line 543, in generate_prompt return self.generate(prompt_messages, stop=stop, callbacks=callbacks, kwargs) File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py", line 407, in generate raise e File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py", line 397, in generate self._generate_with_cache( File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/site-packages/langchain_core/language_models/chat_models.py", line 576, in _generate_with_cache return self._generate( File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/site-packages/langchain_google_vertexai/chat_models.py", line 406, in _generate generations = [ File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/site-packages/langchain_google_vertexai/chat_models.py", line 408, in <listcomp> message=_parse_response_candidate(c), File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/site-packages/langchain_google_vertexai/chat_models.py", line 280, in _parse_response_candidate function_call["arguments"] = json.dumps( File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/json/__init__.py", line 231, in dumps return _default_encoder.encode(obj) File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/json/encoder.py", line 199, in encode chunks = self.iterencode(o, _one_shot=True) File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/json/encoder.py", line 257, in iterencode return _iterencode(o, 0) File "/opt/conda/envs/gemini_langchain_tools/lib/python3.10/json/encoder.py", line 179, in default raise TypeError(f'Object of type {o.__class__.__name__} ' TypeError: Object of type RepeatedComposite is not JSON serializable ``` Twitter handle:** @abehsu1992626	2024-02-08 11:48:25 -08:00
Erick Friis	d77bb7b4e9	google-vertexai[patch]: integration test fix, release 0.0.5 (#17258 )	2024-02-08 11:45:33 -08:00
Aditya	98176ac982	langchain_google_vertexai : added logic to override get_num_tokens_from_messages() for ChatVertexAI (#16784 ) <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: added logic to override get_num_tokens_from_messages() for ChatVertexAI. Currently ChatVertexAI was inheriting get_num_tokens_from_messages() from BaseChatModel which in-turn was calling GPT-2 tokenizer - Issue: NA - Dependencies: NA - Twitter handle:@aditya_rane @lkuligin for review --------- Co-authored-by: adityarane@google.com <adityarane@google.com> Co-authored-by: Leonid Kuligin <lkuligin@yandex.ru>	2024-02-08 11:30:42 -08:00
Bagatur	00a09e1b71	docs: use PromptTemplate.from_template (#17218 ) Ran ```python import glob import re def update_prompt(x): return re.sub( r"(?P<start>\b)PromptTemplate$template=(?P<template>.), input_variables=(?:.)$", "\g<start>PromptTemplate.from_template(\g<template>)", x ) for fn in glob.glob("docs/*/", recursive=True): try: content = open(fn).readlines() except: continue content = [update_prompt(l) for l in content] with open(fn, "w") as f: f.write("".join(content)) ```	2024-02-07 19:52:42 -08:00
sana-google	7f55c95790	docs: add missing link to Quickstart (#17085 ) Replace this entire comment with: - Description: Added missing link for Quickstart in Model IO documentation, - Issue: N/A, - Dependencies: N/A, - Twitter handle: N/A <!-- If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-07 22:26:10 -05:00
Bassem Yacoube	4e3ed7f043	community[patch]: octoai embeddings bug fix (#17216 ) fixes a bug in octoa_embeddings provider	2024-02-07 22:25:52 -05:00
Eugene Yurtsev	780e84ae79	community[minor]: SQLDatabase Add fetch mode `cursor`, query parameters, query by selectable, expose execution options, and documentation (#17191 ) - Description: Improve `SQLDatabase` adapter component to promote code re-use, see [suggestion](https://github.com/langchain-ai/langchain/pull/16246#pullrequestreview-1846590962). - Needed by: GH-16246 - Addressed to: @baskaryan, @cbornet ## Details - Add `cursor` fetch mode - Accept SQL query parameters - Accept both `str` and SQLAlchemy selectables as query expression - Expose `execution_options` - Documentation page (notebook) about `SQLDatabase` [^1] See [About SQLDatabase](https://github.com/langchain-ai/langchain/blob/c1c7b763/docs/docs/integrations/tools/sql_database.ipynb). [^1]: Apparently there hasn't been any yet? --------- Co-authored-by: Andreas Motl <andreas.motl@crate.io>	2024-02-07 22:23:43 -05:00
Tomaz Bratanic	7e4b676d53	community[patch]: Better error propagation for neo4jgraph (#17190 ) There are other errors that could happen when refreshing the schema, so we want to propagate specific errors for more clarity	2024-02-07 22:16:14 -05:00
Leonid Ganeline	d903fa313e	docs: titles fix (#17206 ) Several notebooks have Title != file name. That results in corrupted sorting in Navbar (ToC). - Fixed titles and file names. - Changed text formats to the consistent form - Redirected renamed files in the `Vercel.json`	2024-02-07 22:09:34 -05:00
Luiz Ferreira	34d2daffb3	community[patch]: Fix chat openai unit test (#17124 ) - Description: Actually the test named `test_openai_apredict` isn't testing the apredict method from ChatOpenAI. - Twitter handle: https://twitter.com/OAlmofadas	2024-02-07 22:08:26 -05:00
Dmitry Kankalovich	f92738a6f6	langchain[minor], community[minor], core[minor]: Async Cache support and AsyncRedisCache (#15817 ) * This PR adds async methods to the LLM cache. * Adds an implementation using Redis called AsyncRedisCache. * Adds a docker compose file at the /docker to help spin up docker * Updates redis tests to use a context manager so flushing always happens by default	2024-02-07 22:06:09 -05:00
Harrison Chase	19546081c6	templates: add gemini functions agent (#17141 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-07 17:27:01 -08:00
Bagatur	aeb6b38901	docs: cleanup fleet integration (#17214 ) Causing search issues	2024-02-07 17:18:48 -08:00
Erick Friis	4153837502	google-genai[patch]: release 0.0.7 (#17193 )	2024-02-07 17:15:09 -08:00
Erick Friis	927ab77d6e	google-genai[patch]: no error for FunctionMessage (#17215 ) Both should eventually match this: https://github.com/langchain-ai/langchain/blob/master/libs/partners/google-vertexai/langchain_google_vertexai/chat_models.py#L179 But seems undocumented / can't find types in genai package	2024-02-07 17:14:50 -08:00
Erick Friis	2ecf318218	google-genai[patch]: match function call interface (#17213 ) should match vertex	2024-02-07 17:07:31 -08:00
Erick Friis	e17173c403	google-vertexai[patch]: function calling integration test (#17209 )	2024-02-07 15:49:56 -08:00
Erick Friis	52be84a603	google-vertexai[patch]: serializable citation metadata, release 0.0.4 (#17145 ) was breaking in langserve before	2024-02-07 15:47:32 -08:00
Nuno Campos	19ff81e74f	Fix stream events/log with some kinds of non addable output (#17205 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-02-07 15:46:13 -08:00
Bagatur	6f1403b9b6	community[patch]: Release 0.0.19 (#17207 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-07 15:37:01 -08:00
Erick Friis	a13dc47a08	cli[patch]: copyright 2024 default (#17204 )	2024-02-07 14:52:37 -08:00
Bagatur	00757567ba	core[patch]: Release 0.1.21 (#17202 )	2024-02-07 14:20:20 -08:00
Bagatur	af74301ab9	core[patch], community[patch]: link extraction continue on failure (#17200 )	2024-02-07 14:15:30 -08:00
Henry	2281f00198	langchain: Standardize `output_parser.py` across all agent types for custom `FORMAT_INSTRUCTIONS` (#17168 ) - Description: This PR standardizes the `output_parser.py` file across all agent types to ensure a uniform parsing mechanism is implemented. It introduces a cohesive structure and common interface for output parsing, facilitating easier modifications and extensions by users. The standardized approach enhances maintainability and scalability of the codebase by providing a consistent pattern for output parsing, which can be easily understood and utilized across different agent types. This PR builds upon the foundation set by a previously merged PR, which focused exclusively on standardizing the `output_parser.py` for the `conversational_agent` ([PR #16945](https://github.com/langchain-ai/langchain/pull/16945)). With this new update, I extend the standardization efforts to encompass `output_parser.py` files across all agent types. This enhancement not only unifies the parsing mechanism across the board but also introduces the flexibility for users to incorporate custom `FORMAT_INSTRUCTIONS`. - Issue: https://github.com/langchain-ai/langchain/issues/10721 https://github.com/langchain-ai/langchain/issues/4044 - Dependencies: No new dependencies required for this change - Twitter handle: With my github user is enough. Thanks I hope you accept my PR.	2024-02-07 13:46:17 -08:00
Erick Friis	1cf5a5858f	remove pg_essay.txt (#17198 ) Added in #16159	2024-02-07 12:58:01 -08:00
Tomaz Bratanic	ecf8042a10	templates: Add neo4j semantic layer with ollama template (#17192 ) A template with JSON-based agent using Mixtral via Ollama. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-07 12:50:54 -08:00
Erick Friis	f87acf0340	infra: better conditional (#17197 )	2024-02-07 12:49:02 -08:00
Erick Friis	4ae91733aa	infra: fix core release (#17195 ) core doesn't have any min deps to test	2024-02-07 12:35:27 -08:00
Bagatur	78409634fe	core[patch]: Release 0.1.20 (#17194 )	2024-02-07 12:28:05 -08:00
Nuno Campos	65798289a4	core[minor]: Use batched tracing in sdk (#16305 ) Remove threadpool executor usage in langchain tracer, this is now handled by sdk	2024-02-07 12:10:58 -08:00
chyroc	f87b38a559	google-genai[minor]: support functions call (#15146 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-07 12:09:30 -08:00
Tomaz Bratanic	302989a2b1	allow optional newline in the action responses of JSON Agent parser (#17186 ) Based on my experiments, the newline isn't always there, so we can make the regex slightly more robust by allowing an optional newline after the bacticks	2024-02-07 10:26:14 -08:00
William FH	9fa07076da	Add trace_as_chain_group metadata (#17187 )	2024-02-07 09:42:44 -08:00
Leonid Ganeline	5ceaf784f3	docs `Integraions/Components` menu reordered (#17151 ) This PR is opinionated. - Moved `Embedding models` item to place after `LLMs` and `Chat model`, so all items with models are together. - Renamed `Text embedding models` to `Embedding models`. Now, it is shorter and easier to read. `Text` is obvious from context. The same as the `Text LLMs` vs. `LLMs` (we also have multi-modal LLMs).	2024-02-06 20:33:41 -08:00
Leonid Ganeline	0af0fc5d25	docs `integraions/providers` nav fix (#17148 ) Issue: `Provides` page is presented as the index page (on the `Providers` item) and as the `Providers/Providers` item. The latter should not be in the menu. See the picture. ![image](https://github.com/langchain-ai/langchain/assets/2256422/6894023f-f13a-4f0d-8fe2-ed5b0ae2bdd2) This PR fixes this.	2024-02-06 20:33:14 -08:00
Leonid Ganeline	bf55279d39	docs: tutorials update (#17132 ) Added the course and the one-pager links	2024-02-06 20:30:30 -08:00
Erick Friis	f499a222de	infra: release min version debugging 2 (#17152 )	2024-02-06 18:20:19 -08:00
Erick Friis	deb02de051	infra: release min version debugging (#17150 )	2024-02-06 18:10:37 -08:00
Erick Friis	9710346095	infra: poetry run min versions 2 (#17149 )	2024-02-06 17:57:43 -08:00
Erick Friis	181a033226	infra: poetry run min versions (#17146 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-02-06 17:37:36 -08:00
Erick Friis	d397721a34	docs: format (#17143 )	2024-02-06 16:32:53 -08:00
Erick Friis	2187268208	infra: fix release (#17142 )	2024-02-06 16:22:20 -08:00
Erick Friis	3e58df43c2	mistralai[patch]: release 0.0.4 (#17139 )	2024-02-06 16:05:20 -08:00
Erick Friis	22b6a03a28	infra: read min versions (#17135 )	2024-02-06 16:05:11 -08:00
Erick Friis	f881a3330c	mistralai[patch]: 16k token batching logic embed (#17136 )	2024-02-06 15:59:08 -08:00
Arno Schutijzer	863f96b2e0	docs: fix typo in ollama notebook (#17127 ) - Description: typo fix in ollama notebook	2024-02-06 16:54:40 -05:00
Leonid Ganeline	42c812a549	API References sorted `Partner libs` menu (#17130 ) The `Partner libs` menu is not sorted. Now it is long enough, and items should be sorted to simplify a package search. - Sorted items in the `Partner libs` menu	2024-02-06 16:49:23 -05:00
Bagatur	226f376d59	community[patch]: Release 0.0.18 (#17129 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-06 13:40:00 -08:00
Erick Friis	37062549f9	infra: update to cache v4 (#17126 ) stop using nodejs 16. Use 20 (stop deprecation annotation on all ci) Changelog: https://github.com/actions/cache?tab=readme-ov-file#whats-new	2024-02-06 12:55:01 -08:00
Erick Friis	980e30c361	nvidia-ai-endpoints[patch]: release 0.0.2 (#17125 )	2024-02-06 12:48:25 -08:00
Erick Friis	15bd1154a7	pinecone[patch]: integration test new namespace (#17121 )	2024-02-06 11:56:00 -08:00
Erick Friis	3ccffa5dcc	infra: add integration deps to partner lint (#17122 )	2024-02-06 11:51:04 -08:00
Mikhail Khludnev	14ff1438e6	nvidia-trt[patch]: propagate InferenceClientException to the caller. (#16936 ) - Description: before the change I've got 1. propagate InferenceClientException to the caller. 2. stop grpc receiver thread on exception ``` for token in result_queue: > result_str += token E TypeError: can only concatenate str (not "InferenceServerException") to str ../../langchain_nvidia_trt/llms.py:207: TypeError ``` And stream thread keeps running. after the change request thread stops correctly and caller got a root cause exception: ``` E tritonclient.utils.InferenceServerException: [request id: 4529729] expected number of inputs between 2 and 3 but got 10 inputs for model 'vllm_model' ../../langchain_nvidia_trt/llms.py:205: InferenceServerException ``` - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: [t.me/mkhl_spb](https://t.me/mkhl_spb) I'm not sure about test coverage. Should I setup deep mocks or there's a kind of triton stub via testcontainers or so.	2024-02-06 11:47:07 -08:00
Erick Friis	6af912d7e0	infra: add pinecone secret (#17120 )	2024-02-06 11:27:04 -08:00
Junyoung Park	1ed73f1992	community[minor]: Add SelfQueryRetriever support to PGVector (#16991 ) - Description: Add SelfQueryRetriever support to PGVector - Issue: - - Dependencies: - - Twitter handle: - --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-06 10:50:50 -08:00
Bagatur	cd945e3a5b	core[patch]: Release 0.1.19 (#17117 )	2024-02-06 09:54:22 -08:00
Frank	ef082c77b1	community[minor]: add github file loader to load any github file content b… (#15305 ) ### Description support load any github file content based on file extension. Why not use [git loader](https://python.langchain.com/docs/integrations/document_loaders/git#load-existing-repository-from-disk) ? git loader clones the whole repo even only interested part of files, that's too heavy. This GithubFileLoader only downloads that you are interested files. ### Twitter handle my twitter: @shufanhaotop --------- Co-authored-by: Hao Fan <h_fan@apple.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-06 09:42:33 -08:00
老阿張	ac662b3698	docs: Fix typo in amadeus.ipynb (#16916 ) Description: "enviornment should be environment"? 🤔 Issue: Typo Dependencies: Nope Twitter handle: laoazhang	2024-02-06 09:42:05 -08:00
Henry	eaeb8a5f71	langchain[patch]: `output_parser.py` in conversation_chat is customizable (#16945 ) Description: With this modification, users can customize the `FORMAT_INSTRUCTIONS` template, allowing them to create their own prompts As it is happening in [this](https://github.com/langchain-ai/langchain/issues/10721) issue, the `FORMAT_INSTRUCTIONS` is not customizable for the output parser, unless you create your own class `ConvoOutputParser`. To avoid this, a modification was done, creating a `format_instruction` variable that users can customize with ease after initialize the agent. For example: ``` agent = initialize_agent( agent = AgentType.CHAT_CONVERSATIONAL_REACT_DESCRIPTION, tools = tools, llm = llm_agent, verbose = True, max_iterations = 3, early_stopping_method = 'generate', memory = b_w_memory, handle_parsing_errors = True, agent_kwargs={ 'system_message':PREFIX, 'human_message':SUFFIX, 'template_tool_response':TEMPLATE_TOOL_RESPONSE, } ) agent.agent.output_parser.format_instructions = "MY CUSTOM FORMAT INSTRUCTIONS" print(agent.agent.output_parser.get_format_instructions()) MY CUSTOM FORMAT INSTRUCTIONS ``` Other parameters like `system_message`, `human_message`, or `template_tool_response` are already customizable and with this PR, the last parameter `FORMAT_INSTRUCTIONS` in `langchain.agents.conversational_chat.prompt` can be modified. Issue: https://github.com/langchain-ai/langchain/issues/10721 Dependencies: No new dependencies required for this change Twitter handle: With my github user is enough. Thanks I hope you accept my PR. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-06 09:41:53 -08:00
Ryan Kraus	f027696b5f	community: Added new Utility runnables for NVIDIA Riva. (#15966 ) Please tag this issue with `nvidia_genai` - Description: Added new Runnables for integration NVIDIA Riva into LCEL chains for Automatic Speech Recognition (ASR) and Text To Speech (TTS). - Issue: N/A - Dependencies: To use these runnables, the NVIDIA Riva client libraries are required. It they are not installed, an error will be raised instructing how to install them. The Runnables can be safely imported without the riva client libraries. - Twitter handle: N/A All of the Riva Runnables are inside a single folder in the Utilities module. In this folder are four files: - common.py - Contains all code that is common to both TTS and ASR - stream.py - Contains a class representing an audio stream that allows the end user to put data into the stream like a queue. - asr.py - Contains the RivaASR runnable - tts.py - Contains the RivaTTS runnable The following Python function is an example of creating a chain that makes use of both of these Runnables: ```python def create( config: Configuration, audio_encoding: RivaAudioEncoding, sample_rate: int, audio_channels: int = 1, ) -> Runnable[ASRInputType, TTSOutputType]: """Create a new instance of the chain.""" _LOGGER.info("Instantiating the chain.") # create the riva asr client riva_asr = RivaASR( url=str(config.riva_asr.service.url), ssl_cert=config.riva_asr.service.ssl_cert, encoding=audio_encoding, audio_channel_count=audio_channels, sample_rate_hertz=sample_rate, profanity_filter=config.riva_asr.profanity_filter, enable_automatic_punctuation=config.riva_asr.enable_automatic_punctuation, language_code=config.riva_asr.language_code, ) # create the prompt template prompt = PromptTemplate.from_template("{user_input}") # model = ChatOpenAI() model = ChatNVIDIA(model="mixtral_8x7b") # type: ignore # create the riva tts client riva_tts = RivaTTS( url=str(config.riva_asr.service.url), ssl_cert=config.riva_asr.service.ssl_cert, output_directory=config.riva_tts.output_directory, language_code=config.riva_tts.language_code, voice_name=config.riva_tts.voice_name, ) # construct and return the chain return {"user_input": riva_asr} \| prompt \| model \| riva_tts # type: ignore ``` The following code is an example of creating a new audio stream for Riva: ```python input_stream = AudioStream(maxsize=1000) # Send bytes into the stream for chunk in audio_chunks: await input_stream.aput(chunk) input_stream.close() ``` The following code is an example of how to execute the chain with RivaASR and RivaTTS ```python output_stream = asyncio.Queue() while not input_stream.complete: async for chunk in chain.astream(input_stream): output_stream.put(chunk) ``` Everything should be async safe and thread safe. Audio data can be put into the input stream while the chain is running without interruptions. --------- Co-authored-by: Hayden Wolff <hwolff@nvidia.com> Co-authored-by: Hayden Wolff <hwolff@Haydens-Laptop.local> Co-authored-by: Hayden Wolff <haydenwolff99@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-05 19:50:50 -08:00
Jan de Boer	2d8015554c	docs: Link to Brave Website added (#16958 ) Description: Link to the Brave Website added to the `brave-search.ipynb` notebook. This notebook is shown in the docs as an example for the brave tool. Issue: There was to reference on where / how to get an api key Dependencies: none Twitter handle: not for this one :)	2024-02-05 18:29:16 -08:00
os1ma	fd88e0f800	docs: update StreamlitCallbackHandler example (#16970 ) - Description: docs: update StreamlitCallbackHandler example. - Issue: None - Dependencies: None I have updated the example for StreamlitCallbackHandler in the documentation bellow. https://python.langchain.com/docs/integrations/callbacks/streamlit Previously, the example used `initialize_agent`, which has been deprecated, so I've updated it to use `create_react_agent` instead. Many langchain users are likely searching examples of combining `create_react_agent` or `openai_tools_agent_chain` with StreamlitCallbackHandler. I'm sure this update will be really helpful for them! Unfortunately, writing unit tests for this example is difficult, so I have not written any tests. I have run this code in a standalone Python script file and ensured it runs correctly.	2024-02-05 18:20:59 -08:00
Marc Mahe	f08a9139d2	docs: update mistral docs for version 0.1+ (#17011 ) Description: Updated integration page for mistralai.	2024-02-05 18:03:12 -08:00
François Paupier	929f071513	community[patch]: Fix error in `LlamaCpp` community LLM with Configurable Fields, 'grammar' custom type not available (#16995 ) - Description: Ensure the `LlamaGrammar` custom type is always available when instantiating a `LlamaCpp` LLM - Issue: #16994 - Dependencies: None - Twitter handle: @fpaupier --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-05 17:56:58 -08:00
Leonid Ganeline	563f325034	experimental[patch]: fixed import in `experimental` (#17078 )	2024-02-05 17:47:13 -08:00
Ikko Eltociear Ashimine	5f5f5acbc5	docs: fix typo in dspy.ipynb (#16996 ) langugage -> language	2024-02-05 17:31:06 -08:00
Eugene Yurtsev	fbab8baac5	core[patch]: Add astream events config test (#17055 ) Verify that astream events propagates config correctly --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-05 17:24:58 -08:00
Eugene Yurtsev	609ea019b2	docs: Update streaming documentation (#17066 ) Updating streaming documentation following fix of JSON parser for streaming json.	2024-02-05 17:24:46 -08:00
Erick Friis	64785822dc	templates: bump (#17074 )	2024-02-05 17:12:12 -08:00
Scott Nath	10bd901139	infra: add integration_tests and coverage to MAKEFILE (#17053 ) - Description: update community MAKE file - adds `integration_tests` - adds `coverage` - Issue: the issue # it fixes if applicable, - moving out of https://github.com/langchain-ai/langchain/pull/17014 - Dependencies: n/a - Twitter handle: @scottnath - Mastodon handle: scottnath@mastodon.social --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-05 16:39:55 -08:00
Giulio Zani	9f0b63dba0	experimental[patch]: Fixes issue #17060 (#17062 ) As described in issue #17060, in the case in which text has only one sentence the following function fails. Checking for that and adding a return case fixed the issue. ```python def split_text(self, text: str) -> List[str]: """Split text into multiple components.""" # Splitting the essay on '.', '?', and '!' single_sentences_list = re.split(r"(?<=[.?!])\s+", text) sentences = [ {"sentence": x, "index": i} for i, x in enumerate(single_sentences_list) ] sentences = combine_sentences(sentences) embeddings = self.embeddings.embed_documents( [x["combined_sentence"] for x in sentences] ) for i, sentence in enumerate(sentences): sentence["combined_sentence_embedding"] = embeddings[i] distances, sentences = calculate_cosine_distances(sentences) start_index = 0 # Create a list to hold the grouped sentences chunks = [] breakpoint_percentile_threshold = 95 breakpoint_distance_threshold = np.percentile( distances, breakpoint_percentile_threshold ) # If you want more chunks, lower the percentile cutoff indices_above_thresh = [ i for i, x in enumerate(distances) if x > breakpoint_distance_threshold ] # The indices of those breakpoints on your list # Iterate through the breakpoints to slice the sentences for index in indices_above_thresh: # The end index is the current breakpoint end_index = index # Slice the sentence_dicts from the current start index to the end index group = sentences[start_index : end_index + 1] combined_text = " ".join([d["sentence"] for d in group]) chunks.append(combined_text) # Update the start index for the next group start_index = index + 1 # The last group, if any sentences remain if start_index < len(sentences): combined_text = " ".join([d["sentence"] for d in sentences[start_index:]]) chunks.append(combined_text) return chunks ``` Co-authored-by: Giulio Zani <salamanderxing@Giulios-MBP.homenet.telecomitalia.it>	2024-02-05 16:18:57 -08:00
Jimmy Moore	912210ac19	core[patch]: fix _sql_record_manager mypy for #17048 (#17073 ) - Description: Add relevant type annotations for relevant session and query objects to resolve mypy errors when `# type: ignore` comments are removed. - Issue: #17048 - Dependencies: None, - Twitter handle: [clesiemo3](https://twitter.com/clesiemo3) I attempted to solve the `UpsertionRecord` ignore but it would require added a deprecated plugin or moving completely to sqlalchemy 2.0+ from my understanding. I'm assuming this is not something desired at this point in time.	2024-02-05 16:18:40 -08:00
William FH	3d5e988c55	Add prompt metadata + tags (#17054 )	2024-02-05 16:17:31 -08:00
Bagatur	d8f41d0521	docs: add youtube link (#17065 )	2024-02-05 16:12:56 -08:00
Bagatur	6e2ed9671f	infra: fix breebs test lint (#17075 )	2024-02-05 16:09:48 -08:00
T Cramer	cf01fc3790	docs: update parse_partial_json source info (#17036 ) - Description: Update source-link following recent license update at open-interpreter project - Issue: N/A - Dependencies: None	2024-02-05 15:54:34 -08:00
Harrison Chase	83fbf0e11a	docs: add structured tools howto to agents (#15772 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-05 15:53:01 -08:00
Alex Boury	334b6ebdf3	community[minor]: Breebs docs retriever (#16578 ) - Description: Implementation of breeb retriever with integration tests -> libs/community/tests/integration_tests/retrievers/test_breebs.py and documentation (notebook) -> docs/docs/integrations/retrievers/breebs.ipynb. - Dependencies: None	2024-02-05 15:51:08 -08:00
Nova Kwok	eb7b05885f	docs: Fix typo in quickstart.ipynb (#16859 ) - Description: "load HTML form web URLs" should be "load HTML from web URLs"? 🤔 - Issue: Typo - Dependencies: Nope - Twitter handle: n0vad3v	2024-02-05 15:50:11 -08:00
Shorthills AI	cf0b29b6d2	docs: fixing a minor grammatical mistake (#16931 )	2024-02-05 15:49:47 -08:00
Shivani Modi	fcb875629d	docs: Updating documentation for Konko provider (#16953 ) - Description: A small update to the Konko provider documentation. --------- Co-authored-by: Shivani Modi <shivanimodi@Shivanis-MacBook-Pro.local>	2024-02-05 15:49:13 -08:00
Benjamin Muskalla	973ba0d84b	docs: Fix Copilot name (#16956 ) The official name is "GitHub Copilot"	2024-02-05 15:48:47 -08:00
IMRAN KHAN	4b17699818	docs: add 2 more tutorials to the list in youtube.mdx (#16998 ) - Description: add 2 more tutorials to the list in youtube.mdx, - Twitter handle: EhThing	2024-02-05 15:48:34 -08:00
Serena Ruan	9b279ac127	community[patch]: MLflow callback update (#16687 ) Signed-off-by: Serena Ruan <serena.rxy@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-02-05 15:46:46 -08:00
Mohammad Mohtashim	3c4b24b69a	community[patch]: Fix the _call of HuggingFaceHub (#16891 ) Fixed the following identified issue: #16849 @baskaryan	2024-02-05 15:34:42 -08:00
Tyler Titsworth	304f3f5fc1	community[patch]: Add Progress bar to HuggingFaceEmbeddings (#16758 ) - Description: Adds a function parameter to HuggingFaceEmbeddings called `show_progress` that enables a `tqdm` progress bar if enabled. Does not function if `multi_process = True`. - Issue: n/a - Dependencies: n/a	2024-02-05 14:33:34 -08:00
Supreet Takkar	ae33979813	community[patch]: Allow adding ARNs as model_id to support Amazon Bedrock custom models (#16800 ) - Description: Adds an additional class variable to `BedrockBase` called `provider` that allows sending a model provider such as amazon, cohere, ai21, etc. Up until now, the model provider is extracted from the `model_id` using the first part before the `.`, such as `amazon` for `amazon.titan-text-express-v1` (see [supported list of Bedrock model IDs here](https://docs.aws.amazon.com/bedrock/latest/userguide/model-ids-arns.html)). But for custom Bedrock models where the ARN of the provisioned throughput must be supplied, the `model_id` is like `arn:aws:bedrock:...` so the `model_id` cannot be extracted from this. A model `provider` is required by the LangChain Bedrock class to perform model-based processing. To allow the same processing to be performed for custom-models of a specific base model type, passing this `provider` argument can help solve the issues. The alternative considered here was the use of `provider.arn:aws:bedrock:...` which then requires ARN to be extracted and passed separately when invoking the model. The proposed solution here is simpler and also does not cause issues for current models already using the Bedrock class. - Issue: N/A - Dependencies: N/A --------- Co-authored-by: Piyush Jain <piyushjain@duck.com>	2024-02-05 14:28:03 -08:00
T Cramer	e022bfaa7d	langchain: add partial parsing support to JsonOutputToolsParser (#17035 ) - Description: Add partial parsing support to JsonOutputToolsParser - Issue: [16736](https://github.com/langchain-ai/langchain/issues/16736) --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-05 14:18:30 -08:00
calvinweb	dcf973c22c	Langchain: `json_chat` don't need stop sequenes (#16335 ) This is a PR about #16334 The Stop sequenes isn't meanful in `json_chat` because it depends json to work, not completions <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-05 14:18:16 -08:00
Bagatur	66e45e8ab7	community[patch]: chat model mypy fixes (#17061 ) Related to #17048	2024-02-05 13:42:59 -08:00
Bagatur	d93de71d08	community[patch]: chat message history mypy fixes (#17059 ) Related to #17048	2024-02-05 13:13:25 -08:00
Bagatur	af5ae24af2	community[patch]: callbacks mypy fixes (#17058 ) Related to #17048	2024-02-05 12:37:27 -08:00
Vadim Kudlay	75b6fa1134	nvidia-ai-endpoints[patch]: Support User-Agent metadata and minor fixes. (#16942 ) - Description: Several meta/usability updates, including User-Agent. - Issue: - User-Agent metadata for tracking connector engagement. @milesial please check and advise. - Better error messages. Tries harder to find a request ID. @milesial requested. - Client-side image resizing for multimodal models. Hope to upgrade to Assets API solution in around a month. - `client.payload_fn` allows you to modify payload before network request. Use-case shown in doc notebook for kosmos_2. - `client.last_inputs` put back in to allow for advanced support/debugging. - Dependencies: - Attempts to pull in PIL for image resizing. If not installed, prints out "please install" message, warns it might fail, and then tries without resizing. We are waiting on a more permanent solution. For LC viz: @hinthornw For NV viz: @fciannella @milesial @vinaybagade --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-05 12:24:53 -08:00
Nuno Campos	ae56fd020a	Fix condition on custom root type in runnable history (#17017 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-02-05 12:15:11 -08:00
Nuno Campos	f0ffebb944	Shield callback methods from cancellation: Fix interrupted runs marked as pending forever (#17010 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-02-05 12:09:47 -08:00
Bagatur	e7b3290d30	community[patch]: fix agent_toolkits mypy (#17050 ) Related to #17048	2024-02-05 11:56:24 -08:00
Erick Friis	6ffd5b15bc	pinecone: init pkg (#16556 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-02-05 11:55:01 -08:00
Erick Friis	1183769cf7	template: tool-retrieval-fireworks (#17052 ) - Initial commit oss-tool-retrieval-agent - README update - lint - lock - format imports - Rename to retrieval-agent-fireworks - cr <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Taqi Jaffri <tjaffri@docugami.com>	2024-02-05 11:50:17 -08:00
Harrison Chase	4eda647fdd	infra: add -p to mkdir in lint steps (#17013 ) Previously, if this did not find a mypy cache then it wouldnt run this makes it always run adding mypy ignore comments with existing uncaught issues to unblock other prs --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-02-05 11:22:06 -08:00
Erick Friis	db6af21395	docs: exa contents (#16555 )	2024-02-05 11:15:06 -08:00
Eugene Yurtsev	fb245451d2	core[patch]: Add langsmith to printed sys information (#16899 )	2024-02-05 11:13:30 -08:00
Mikhail Khludnev	2145636f1d	Nvidia trt model name for stop_stream() (#16997 ) just removing some legacy leftover.	2024-02-05 10:45:06 -08:00
Christophe Bornet	2ef69fe11b	Add async methods to BaseChatMessageHistory and BaseMemory (#16728 ) Adds: * async methods to BaseChatMessageHistory * async methods to ChatMessageHistory * async methods to BaseMemory * async methods to BaseChatMemory * async methods to ConversationBufferMemory * tests of ConversationBufferMemory's async methods Twitter handle: cbornet_	2024-02-05 13:20:28 -05:00
Ryan Kraus	b3c3b58f2c	core[patch]: Fixed bug in dict to message conversion. (#17023 ) - Description: We discovered a bug converting dictionaries to messages where the ChatMessageChunk message type isn't handled. This PR adds support for that message type. - Issue: #17022 - Dependencies: None - Twitter handle: None	2024-02-05 10:13:25 -08:00
Nicolas Grenié	54fcd476bb	docs: Update ollama examples with new community libraries (#17007 ) - Description: Updating one line code sample for Ollama with new langchain_community package - Issue: - Dependencies: none - Twitter handle: @picsoung	2024-02-04 15:13:29 -08:00
Killinsun - Ryota Takeuchi	bcfce146d8	community[patch]: Correct the calling to collection_name in qdrant (#16920 ) ## Description In #16608, the calling `collection_name` was wrong. I made a fix for it. Sorry for the inconvenience! ## Issue https://github.com/langchain-ai/langchain/issues/16962 ## Dependencies N/A <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Kumar Shivendu <kshivendu1@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-02-04 10:45:35 -08:00
Erick Friis	849051102a	google-genai[patch]: fix new core typing (#16988 )	2024-02-03 17:45:44 -08:00
Bagatur	35446c814e	openai[patch]: rm tiktoken model warning (#16964 )	2024-02-03 16:36:57 -08:00
ccurme	0826d87ecd	langchain_mistralai[patch]: Invoke callback prior to yielding token (#16986 ) - Description: Invoke callback prior to yielding token in stream and astream methods for ChatMistralAI. - Issue: https://github.com/langchain-ai/langchain/issues/16913	2024-02-03 16:30:50 -08:00
Bagatur	267e71606e	docs: Update README.md (#16966 )	2024-02-02 16:50:58 -08:00
Erick Friis	2b7e47a668	infra: install integration deps for test linting (#16963 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-02-02 15:59:10 -08:00
Erick Friis	afdd636999	docs: partner packages (#16960 )	2024-02-02 15:12:21 -08:00
Erick Friis	06660bc78c	core[patch]: handle some optional cases in tools (#16954 ) primary problem in pydantic still exists, where `Optional[str]` gets turned to `string` in the jsonschema `.schema()` Also fixes the `SchemaSchema` naming issue --------- Co-authored-by: William Fu-Hinthorn <13333726+hinthornw@users.noreply.github.com>	2024-02-02 15:05:54 -08:00
Mohammad Mohtashim	f8943e8739	core[patch]: Add doc-string to RunnableEach (#16892 ) Add doc-string to Runnable Each --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-02-02 14:11:09 -08:00
Ashley Xu	66adb95284	docs: BigQuery Vector Search went public review and updated docs (#16896 ) Update the docs for BigQuery Vector Search	2024-02-02 10:26:44 -08:00
Massimiliano Pronesti	71f9ea33b6	docs: add quantization to vllm and update API (#16950 ) - Description: Update vLLM docs to include instructions on how to use quantized models, as well as to replace the deprecated methods.	2024-02-02 10:24:49 -08:00
Bagatur	2a510c71a0	core[patch]: doc init positional args (#16854 )	2024-02-02 10:24:16 -08:00
Bagatur	d80c612c92	core[patch]: Message content as positional arg (#16921 )	2024-02-02 10:24:02 -08:00
Bagatur	c29e9b6412	core[patch]: fix chat prompt partial messages placeholder var (#16918 )	2024-02-02 10:23:37 -08:00
Radhakrishnan	3b0fa9079d	docs: Updated integration doc for aleph alpha (#16844 ) Description: Updated doc for llm/aleph_alpha with new functions: invoke. Changed structure of the document to match the required one. Issue: https://github.com/langchain-ai/langchain/issues/15664 Dependencies: None Twitter handle: None --------- Co-authored-by: Radhakrishnan Iyer <radhakrishnan.iyer@ibm.com>	2024-02-02 09:28:06 -08:00
hmasdev	cc17334473	core[minor]: add validation error handler to `BaseTool` (#14007 ) - Description: add a ValidationError handler as a field of [`BaseTool`](https://github.com/langchain-ai/langchain/blob/master/libs/core/langchain_core/tools.py#L101) and add unit tests for the code change. - Issue: #12721 #13662 - Dependencies: None - Tag maintainer: - Twitter handle: @hmdev3 - NOTE: - I'm wondering if the update of document is required. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-02-01 20:09:19 -08:00
William FH	bdacfafa05	core[patch]: Remove deep copying of run prior to submitting it to LangChain Tracing (#16904 )	2024-02-01 18:46:05 -08:00
William FH	e02efd513f	core[patch]: Hide aliases when serializing (#16888 ) Currently, if you dump an object initialized with an alias, we'll still dump the secret values since they're retained in the kwargs	2024-02-01 17:55:37 -08:00
William FH	131c043864	Fix loading of ImagePromptTemplate (#16868 ) We didn't override the namespace of the ImagePromptTemplate, so it is listed as being in langchain.schema This updates the mapping to let the loader deserialize. Alternatively, we could make a slight breaking change and update the namespace of the ImagePromptTemplate since we haven't broadly publicized/documented it yet..	2024-02-01 17:54:04 -08:00
Erick Friis	6fc2835255	docs: fix broken links (#16855 )	2024-02-01 17:29:38 -08:00
Eugene Yurtsev	a265878d71	langchain_openai[patch]: Invoke callback prior to yielding token (#16909 ) All models should be calling the callback for new token prior to yielding the token. Not doing this can cause callbacks for downstream steps to be called prior to the callback for the new token; causing issues in astream_events APIs and other things that depend in callback ordering being correct. We need to make this change for all chat models.	2024-02-01 16:43:10 -08:00
Erick Friis	b1a847366c	community: revert SQL Stores (#16912 ) This reverts commit `cfc225ecb3`. https://github.com/langchain-ai/langchain/pull/15909#issuecomment-1922418097 These will have existed in langchain-community 0.0.16 and 0.0.17.	2024-02-01 16:37:40 -08:00
akira wu	f7c709b40e	doc: fix typo in message_history.ipynb (#16877 ) - Description: just fixed a small typo in the documentation in the `expression_language/how_to/message_history` session [here](https://python.langchain.com/docs/expression_language/how_to/message_history)	2024-02-01 13:30:29 -08:00
Leonid Ganeline	c2ca6612fe	refactor `langchain.prompts.example_selector` (#15369 ) The `langchain.prompts.example_selector` [still holds several artifacts](https://api.python.langchain.com/en/latest/langchain_api_reference.html#module-langchain.prompts) that belongs to `community`. If they moved to `langchain_community.example_selectors`, the `langchain.prompts` namespace would be effectively removed which is great. - moved a class and afunction to `langchain_community` Note: - Previously, the `langchain.prompts.example_selector` artifacts were moved into the `langchain_core.exampe_selectors`. See the flattened namespace (`.prompts` was removed)! Similar flattening was implemented for the `langchain_core` as the `langchain_core.exampe_selectors`. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-02-01 12:05:57 -08:00
Erick Friis	13a6756067	infra: ci naming 2 (#16893 )	2024-02-01 11:39:00 -08:00
Lance Martin	b1e7130d8a	Minor update to Nomic cookbook (#16886 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-02-01 11:28:58 -08:00
Shorthills AI	0bca0f4c24	Docs: Fixed grammatical mistake (#16858 ) Co-authored-by: Vishal <141389263+VishalYadavShorthillsAI@users.noreply.github.com> Co-authored-by: Sanskar Tanwar <142409040+SanskarTanwarShorthillsAI@users.noreply.github.com> Co-authored-by: UpneetShorthillsAI <144228282+UpneetShorthillsAI@users.noreply.github.com> Co-authored-by: HarshGuptaShorthillsAI <144897987+HarshGuptaShorthillsAI@users.noreply.github.com> Co-authored-by: AdityaKalraShorthillsAI <143726711+AdityaKalraShorthillsAI@users.noreply.github.com> Co-authored-by: SakshiShorthillsAI <144228183+SakshiShorthillsAI@users.noreply.github.com> Co-authored-by: AashiGuptaShorthillsAI <144897730+AashiGuptaShorthillsAI@users.noreply.github.com> Co-authored-by: ShamshadAhmedShorthillsAI <144897733+ShamshadAhmedShorthillsAI@users.noreply.github.com> Co-authored-by: ManpreetShorthillsAI <142380984+ManpreetShorthillsAI@users.noreply.github.com> Co-authored-by: Aayush <142384656+AayushShorthillsAI@users.noreply.github.com> Co-authored-by: BajrangBishnoiShorthillsAi <148060486+BajrangBishnoiShorthillsAi@users.noreply.github.com>	2024-02-01 11:28:15 -08:00
Erick Friis	5b3fc86cfd	infra: ci naming (#16890 ) Make it clearer how to run equivalent commands locally Not a perfect 1:1, but will help people get started ![Screenshot 2024-02-01 at 10 53 34 AM](https://github.com/langchain-ai/langchain/assets/9557659/da271aaf-d5db-41e3-9379-cb1d8a0232c5)	2024-02-01 11:09:37 -08:00
Qihui Xie	c5b01ac621	community[patch]: support LIKE comparator (full text match) in Qdrant (#12769 ) Description: Support [Qdrant full text match filtering](https://qdrant.tech/documentation/concepts/filtering/#full-text-match) by adding Comparator.LIKE to QdrantTranslator.	2024-02-01 11:03:25 -08:00
Christophe Bornet	9d458d089a	community: Factorize AstraDB components constructors (#16779 ) * Adds `AstraDBEnvironment` class and use it in `AstraDBLoader`, `AstraDBCache`, `AstraDBSemanticCache`, `AstraDBBaseStore` and `AstraDBChatMessageHistory` * Create an `AsyncAstraDB` if we only have an `AstraDB` and vice-versa so: * we always have an instance of `AstraDB` * we always have an instance of `AsyncAstraDB` for recent versions of astrapy * Create collection if not exists in `AstraDBBaseStore` * Some typing improvements Note: `AstraDB` `VectorStore` not using `AstraDBEnvironment` at the moment. This will be done after the `langchain-astradb` package is out.	2024-02-01 10:51:07 -08:00
Harel Gal	93366861c7	docs: Indicated Guardrails for Amazon Bedrock preview status (#16769 ) Added notification about limited preview status of Guardrails for Amazon Bedrock feature to code example. --------- Co-authored-by: Piyush Jain <piyushjain@duck.com>	2024-02-01 10:41:48 -08:00
Christophe Bornet	78a1af4848	langchain[patch]: Add async methods to MultiVectorRetriever (#16878 ) Adds async support to multi vector retriever	2024-02-01 10:33:06 -08:00
Bagatur	7d03d8f586	docs: fix docstring examples (#16889 )	2024-02-01 10:17:26 -08:00
Bagatur	c2d09fb151	infra: bump exp min test reqs (#16884 )	2024-02-01 08:35:21 -08:00
Bagatur	65ba5c220b	experimental[patch]: Release 0.0.50 (#16883 )	2024-02-01 08:27:39 -08:00
Bagatur	9e7d9f9390	infra: bump langchain min test reqs (#16882 )	2024-02-01 08:16:30 -08:00
Bagatur	db442c635b	langchain[patch]: Release 0.1.5 (#16881 )	2024-02-01 08:10:29 -08:00
Bagatur	2b4abed25c	commmunity[patch]: Release 0.0.17 (#16871 )	2024-02-01 07:33:34 -08:00
Bagatur	bb73251146	core[patch]: Release 0.1.18 (#16870 )	2024-02-01 07:33:15 -08:00
Christophe Bornet	a0ec045495	Add async methods to BaseStore (#16669 ) - Description: The BaseStore methods are currently blocking. Some implementations (AstraDBStore, RedisStore) would benefit from having async methods. Also once we have async methods for BaseStore, we can implement the async `aembed_documents` in CacheBackedEmbeddings to cache the embeddings asynchronously. * adds async methods amget, amset, amedelete and ayield_keys to BaseStore * implements the async methods for InMemoryStore * adds tests for InMemoryStore async methods - Twitter handle: cbornet_	2024-01-31 17:10:47 -08:00
Erick Friis	17e886388b	nomic: init pkg (#16853 ) Co-authored-by: Lance Martin <lance@langchain.dev>	2024-01-31 16:46:35 -08:00
Eugene Yurtsev	2e5949b6f8	core(minor): Add bulk add messages to BaseChatMessageHistory interface (#15709 ) * Add bulk add_messages method to the interface. * Update documentation for add_ai_message and add_human_message to denote them as being marked for deprecation. We should stop using them as they create more incorrect (inefficient) ways of doing things	2024-01-31 11:59:39 -08:00
Christophe Bornet	af8c5c185b	langchain[minor],community[minor]: Add async methods in BaseLoader (#16634 ) Adds: * methods `aload()` and `alazy_load()` to interface `BaseLoader` * implementation for class `MergedDataLoader ` * support for class `BaseLoader` in async function `aindex()` with unit tests Note: this is compatible with existing `aload()` methods that some loaders already had. Twitter handle: @cbornet_ --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-01-31 11:08:11 -08:00
Erick Friis	c37ca45825	nvidia-trt: remove tritonclient all extra dep (#16749 )	2024-01-30 16:06:19 -08:00
Erick Friis	36c0392dbe	infra: remove unnecessary tests on partner packages (#16808 )	2024-01-30 16:01:47 -08:00
Erick Friis	bb3b6bde33	openai[minor]: change to secretstr (#16803 )	2024-01-30 15:49:56 -08:00
Raphael	bf9068516e	community[minor]: add the ability to load existing transcripts from AssemblyAI by their id. (#16051 ) - Description: the existing AssemblyAI API allows to pass a path or an url to transcribe an audio file and turn in into Langchain Documents, this PR allows to get existing transcript by their transcript id and turn them into Documents. - Issue: not related to an existing issue - Dependencies: requests --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-30 13:47:45 -08:00
Bagatur	daf820c77b	community[patch]: undo create_sql_agent breaking (#16797 )	2024-01-30 10:00:52 -08:00
Eugene Yurtsev	ef2bd745cb	docs: Update doc-string in base callback managers (#15885 ) Update doc-strings with a comment about on_llm_start vs. on_chat_model_start.	2024-01-30 09:51:45 -08:00
William FH	881dc28d2c	Fix Dep Recommendation (#16793 ) Tools are different than functions	2024-01-30 09:40:28 -08:00
Bagatur	b0347f3e2b	docs: add csv use case (#16756 )	2024-01-30 09:39:46 -08:00
Alexander Conway	4acd2654a3	Report which file was errored on in DirectoryLoader (#16790 ) The current implementation leaves it up to the particular file loader implementation to report the file on which an error was encountered - in my case pdfminer was simply saying it could not parse a file as a PDF, but I didn't know which of my hundreds of files it was failing on. No reason not to log the particular item on which an error was encountered, and it should be an immense debugging assistant. <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-30 09:14:58 -08:00
Erick Friis	a372b23675	robocorp: release 0.0.3 (#16789 )	2024-01-30 07:15:25 -08:00
Rihards Gravis	442fa52b30	[partners]: langchain-robocorp ease dependency version (#16765 )	2024-01-30 08:13:54 -07:00
Jacob Lee	c6724a39f4	Fix rephrase step in chatbot use case (#16763 )	2024-01-29 23:25:25 -08:00
Bob Lin	546b757303	community: Add ChatGLM3 (#15265 ) Add [ChatGLM3](https://github.com/THUDM/ChatGLM3) and updated [chatglm.ipynb](https://python.langchain.com/docs/integrations/llms/chatglm) --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-29 20:30:52 -08:00
Marina Pliusnina	a1ce7ab672	adding parameter for changing the language in SpacyEmbeddings (#15743 ) Description: Added the parameter for a possibility to change a language model in SpacyEmbeddings. The default value is still the same: "en_core_web_sm", so it shouldn't affect a code which previously did not specify this parameter, but it is not hard-coded anymore and easy to change in case you want to use it with other languages or models. Issue: At Barcelona Supercomputing Center in Aina project (https://github.com/projecte-aina), a project for Catalan Language Models and Resources, we would like to use Langchain for one of our current projects and we would like to comment that Langchain, while being a very powerful and useful open-source tool, is pretty much focused on English language. We would like to contribute to make it a bit more adaptable for using with other languages. Dependencies: This change requires the Spacy library and a language model, specified in the model parameter. Tag maintainer: @dev2049 Twitter handle: @projecte_aina --------- Co-authored-by: Marina Pliusnina <marina.pliusnina@bsc.es> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-29 20:30:34 -08:00
Christophe Bornet	744070ee85	Add async methods for the AstraDB VectorStore (#16391 ) - Description: fully async versions are available for astrapy 0.7+. For older astrapy versions or if the user provides a sync client without an async one, the async methods will call the sync ones wrapped in `run_in_executor` - Twitter handle: cbornet_	2024-01-29 20:22:25 -08:00
baichuan-assistant	f8f2649f12	community: Add Baichuan LLM to community (#16724 ) Replace this entire comment with: - Description: Add Baichuan LLM to integration/llm, also updated related docs. Co-authored-by: BaiChuanHelper <wintergyc@WinterGYCs-MacBook-Pro.local>	2024-01-29 20:08:24 -08:00
thiswillbeyourgithub	1d082359ee	community: add support for callable filters in FAISS (#16190 ) - Description: Filtering in a FAISS vectorstores is very inflexible and doesn't allow that many use case. I think supporting callable like this enables a lot: regular expressions, condition on multiple keys etc. Note I had to manually alter a test. I don't understand if it was falty to begin with or if there is something funky going on. - Issue: None - Dependencies: None - Twitter handle: None Signed-off-by: thiswillbeyourgithub <26625900+thiswillbeyourgithub@users.noreply.github.com>	2024-01-29 20:05:56 -08:00
Yudhajit Sinha	1703fe2361	core[patch]: preserve inspect.iscoroutinefunction with @beta decorator (#16440 ) Adjusted deprecate decorator to make sure decorated async functions are still recognized as "coroutinefunction" by inspect Addresses #16402 <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-29 20:01:11 -08:00
Killinsun - Ryota Takeuchi	52f4ad8216	community: Add new fields in metadata for qdrant vector store (#16608 ) ## Description The PR is to return the ID and collection name from qdrant client to metadata field in `Document` class. ## Issue The motivation is almost same to [11592](https://github.com/langchain-ai/langchain/issues/11592) Returning ID is useful to update existing records in a vector store, but we cannot know them if we use some retrievers. In order to avoid any conflicts, breaking changes, the new fields in metadata have a prefix `_` ## Dependencies N/A ## Twitter handle @kill_in_sun <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-29 19:59:54 -08:00
hulitaitai	32cad38ec6	<langchain_community\llms\chatglm.py>: <Correcting "history"> (#16729 ) Use the real "history" provided by the original program instead of putting "None" in the history. - Description: I change one line in the code to make it return the "history" of the chat model. - Issue: At the moment it returns only the answers of the chat model. However the chat model himself provides a history more complet with the questions of the user. - Dependencies: no dependencies required for this change,	2024-01-29 19:50:31 -08:00
Jacob Lee	4a027e622f	docs[patch]: Lower temperature in chatbot usecase notebooks for consistency (#16750 ) CC @baskaryan	2024-01-29 17:27:13 -08:00
Jacob Lee	12d2b2ebcf	docs[minor]: LCEL rewrite of chatbot use-case (#16414 ) CC @baskaryan @hwchase17 TODO: - [x] Draft of main quickstart - [x] Index intro page - [x] Add subpage guide for Memory management - [x] Add subpage guide for Retrieval - [x] Add subpage guide for Tool usage - [x] Add LangSmith traces illustrating query transformation	2024-01-29 17:08:54 -08:00
Bassem Yacoube	85e93e05ed	community[minor]: Update OctoAI LLM, Embedding and documentation (#16710 ) This PR includes updates for OctoAI integrations: - The LLM class was updated to fix a bug that occurs with multiple sequential calls - The Embedding class was updated to support the new GTE-Large endpoint released on OctoAI lately - The documentation jupyter notebook was updated to reflect using the new LLM sdk Thank you!	2024-01-29 13:57:17 -08:00
Hank	6d6226d96d	docs: Remove accidental extra ``` in QuickStart doc. (#16740 ) Description: One too many set of triple-ticks in a sample code block in the QuickStart doc was causing "\`\`\`shell" to appear in the shell command that was being demonstrated. I just deleted the extra "```". Issue: Didn't see one Dependencies: None	2024-01-29 13:55:26 -08:00
Shay Ben Elazar	84ebfb5b9d	openai[patch]: Added annotations support to azure openai (#13704 ) - Description: Added Azure OpenAI Annotations (content filtering results) to ChatResult - Issue: 13090 - Twitter handle: ElazarShay Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-29 13:31:09 -08:00
Volodymyr Machula	32c5be8b73	community[minor]: Connery Tool and Toolkit (#14506 ) ## Summary This PR implements the "Connery Action Tool" and "Connery Toolkit". Using them, you can integrate Connery actions into your LangChain agents and chains. Connery is an open-source plugin infrastructure for AI. With Connery, you can easily create a custom plugin with a set of actions and seamlessly integrate them into your LangChain agents and chains. Connery will handle the rest: runtime, authorization, secret management, access management, audit logs, and other vital features. Additionally, Connery and our community offer a wide range of ready-to-use open-source plugins for your convenience. Learn more about Connery: - GitHub: https://github.com/connery-io/connery-platform - Documentation: https://docs.connery.io - Twitter: https://twitter.com/connery_io ## TODOs - [x] API wrapper - [x] Integration tests - [x] Connery Action Tool - [x] Docs - [x] Example - [x] Integration tests - [x] Connery Toolkit - [x] Docs - [x] Example - [x] Formatting (`make format`) - [x] Linting (`make lint`) - [x] Testing (`make test`)	2024-01-29 12:45:03 -08:00
Harrison Chase	8457c31c04	community[patch]: activeloop ai tql deprecation (#14634 ) Co-authored-by: AdkSarsen <adilkhan@activeloop.ai>	2024-01-29 12:43:54 -08:00
Neli Hateva	c95facc293	langchain[minor], community[minor]: Implement Ontotext GraphDB QA Chain (#16019 ) - Description: Implement Ontotext GraphDB QA Chain - Issue: N/A - Dependencies: N/A - Twitter handle: @OntotextGraphDB	2024-01-29 12:25:53 -08:00
chyroc	a08f9a7ff9	langchain[patch]: support OpenAIAssistantRunnable async (#15302 ) fix https://github.com/langchain-ai/langchain/issues/15299 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-29 12:19:47 -08:00
Elliot	39eb00d304	community[patch]: Adapt more parameters related to MemorySearchPayload for the search method of ZepChatMessageHistory (#15441 ) - Description: To adapt more parameters related to MemorySearchPayload for the search method of ZepChatMessageHistory, - Issue: None, - Dependencies: None, - Twitter handle: None	2024-01-29 11:45:55 -08:00
Kirushikesh DB	47bd58dc11	docs: Added illustration of using RetryOutputParser with LLMChain (#16722 ) Description: Updated the retry.ipynb notebook, it contains the illustrations of RetryOutputParser in LangChain. But the notebook lacks to explain the compatibility of RetryOutputParser with existing chains. This changes adds some code to illustrate the workflow of using RetryOutputParser with the user chain. Changes: 1. Changed RetryWithErrorOutputParser with RetryOutputParser, as the markdown text says so. 2. Added code at the last of the notebook to define a chain which passes the LLM completions to the retry parser, which can be customised for user needs. Issue: Since RetryOutputParser/RetryWithErrorOutputParser does not implement the parse function it cannot be used with LLMChain directly like [this](https://python.langchain.com/docs/expression_language/cookbook/prompt_llm_parser#prompttemplate-llm-outputparser). This also raised various issues #15133 #12175 #11719 still open, instead of adding new features/code changes its best to explain the "how to integrate LLMChain with retry parsers" clearly with an example in the corresponding notebook. Inspired from: https://github.com/langchain-ai/langchain/issues/15133#issuecomment-1868972580 --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-29 11:24:52 -08:00
Jael Gu	a1aa3a657c	community[patch]: Milvus supports add & delete texts by ids (#16256 ) # Description To support [langchain indexing](https://python.langchain.com/docs/modules/data_connection/indexing) as requested by users, vectorstore Milvus needs to support: - document addition by id (`add_documents` method with `ids` argument) - delete by id (`delete` method with `ids` argument) Example usage: ```python from langchain.indexes import SQLRecordManager, index from langchain.schema import Document from langchain_community.vectorstores import Milvus from langchain_openai import OpenAIEmbeddings collection_name = "test_index" embedding = OpenAIEmbeddings() vectorstore = Milvus(embedding_function=embedding, collection_name=collection_name) namespace = f"milvus/{collection_name}" record_manager = SQLRecordManager( namespace, db_url="sqlite:///record_manager_cache.sql" ) record_manager.create_schema() doc1 = Document(page_content="kitty", metadata={"source": "kitty.txt"}) doc2 = Document(page_content="doggy", metadata={"source": "doggy.txt"}) index( [doc1, doc1, doc2], record_manager, vectorstore, cleanup="incremental", # None, "incremental", or "full" source_id_key="source", ) ``` # Fix issues Fix https://github.com/milvus-io/milvus/issues/30112 --------- Signed-off-by: Jael Gu <mengjia.gu@zilliz.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-29 11:19:50 -08:00
Michard Hugo	e9d3527b79	community[patch]: Add missing async similarity_distance_threshold handling in RedisVectorStoreRetriever (#16359 ) Add missing async similarity_distance_threshold handling in RedisVectorStoreRetriever - Description: added method `_aget_relevant_documents` to `RedisVectorStoreRetriever` that overrides parent method to add support of `similarity_distance_threshold` in async mode (as for sync mode) - Issue: #16099 - Dependencies: N/A - Twitter handle: N/A	2024-01-29 11:19:30 -08:00
Jarod Stewart	7c6a2a8384	templates: Ionic Shopping Assistant (#16648 ) - Description: This is a template for creating shopping assistant chat bots - Issue: Example for creating a shopping assistant with OpenAI Tools Agent - Dependencies: Ionic https://github.com/ioniccommerce/ionic_langchain - Twitter handle: @ioniccommerce --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-29 11:08:24 -08:00
Bagatur	7237dc67d4	core[patch]: Release 0.1.17 (#16737 )	2024-01-29 11:02:29 -08:00
Anthony Bernabeu	2db79ab111	community[patch]: Implement TTL for DynamoDBChatMessageHistory (#15478 ) - Description: Implement TTL for DynamoDBChatMessageHistory, - Issue: see #15477, - Dependencies: N/A, --------- Co-authored-by: Piyush Jain <piyushjain@duck.com>	2024-01-29 10:22:46 -08:00
Massimiliano Pronesti	1bc8d9a943	experimental[patch]: missing resolution strategy in anonymization (#16653 ) - Description: Presidio-based anonymizers are not working because `_remove_conflicts_and_get_text_manipulation_data` was being called without a conflict resolution strategy. This PR fixes this issue. In addition, it removes some mutable default arguments (antipattern). To reproduce the issue, just run the very first cell of this [notebook](https://python.langchain.com/docs/guides/privacy/2/) from langchain's documentation. <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-29 09:56:16 -08:00
Abhinav	8e44363ec9	langchain_community: Update documentation for installing llama-cpp-python on windows (#16666 ) Description : This PR updates the documentation for installing llama-cpp-python on Windows. - Updates install command to support pyproject.toml - Makes CPU/GPU install instructions clearer - Adds reinstall with GPU support command Issue: Existing [documentation](https://python.langchain.com/docs/integrations/llms/llamacpp#compiling-and-installing) lists the following commands for installing llama-cpp-python ``` python setup.py clean python setup.py install ```` The current version of the repo does not include a `setup.py` and uses a `pyproject.toml` instead. This can be replaced with ``` python -m pip install -e . ``` As explained in https://github.com/abetlen/llama-cpp-python/issues/965#issuecomment-1837268339 Dependencies: None Twitter handle: None --------- Co-authored-by: blacksmithop <angstycoder101@gmaii.com>	2024-01-29 08:41:29 -08:00
taimo	d3d9244fee	langchain-community: fix unicode escaping issue with SlackToolkit (#16616 ) - Description: fix unicode escaping issue with SlackToolkit - Issue: #16610	2024-01-29 08:38:12 -08:00
Benito Geordie	f3fdc5c5da	community: Added integrations for ThirdAI's NeuralDB with Retriever and VectorStore frameworks (#15280 ) Description: Adds ThirdAI NeuralDB retriever and vectorstore integration. NeuralDB is a CPU-friendly and fine-tunable text retrieval engine.	2024-01-29 08:35:42 -08:00
Jonathan Bennion	815896ff13	langchain: pubmed tool path update in doc (#16716 ) - Description: The current pubmed tool documentation is referencing the path to langchain core not the path to the tool in community. The old tool redirects anyways, but for efficiency of using the more direct path, just adding this documentation so it references the new path - Issue: doesn't fix an issue - Dependencies: no dependencies - Twitter handle: rooftopzen	2024-01-29 08:25:29 -08:00
Lance Martin	1bfadecdd2	Update Slack agent toolkit (#16732 ) Co-authored-by: taimoOptTech <132860814+taimo3810@users.noreply.github.com>	2024-01-29 08:03:44 -08:00
Pashva Mehta	22d90800c8	community: Fixed schema discrepancy in from_texts function for weaviate vectorstore (#16693 ) * Description: Fixed schema discrepancy in from_texts function for weaviate vectorstore which created a redundant property "key" inside a class. * Issue: Fixed: https://github.com/langchain-ai/langchain/issues/16692 * Twitter handle: @pashvamehta1	2024-01-28 16:53:31 -08:00
Choi JaeHun	ba70630829	docs: Syntax correction according to langchain version update in 'Retry Parser' tutorial example (#16699 ) - Description: Syntax correction according to langchain version update in 'Retry Parser' tutorial example, - Issue: #16698 --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-28 16:53:04 -08:00
ccurme	ec0ae23645	core: expand docstring for RunnableGenerator (#16672 ) - Description: expand docstring for RunnableGenerator - Issue: https://github.com/langchain-ai/langchain/issues/16631	2024-01-28 16:47:08 -08:00
Bob Lin	0866a984fe	Update `n_gpu_layers`"s description (#16685 ) The `n_gpu_layers` parameter in `llama.cpp` supports the use of `-1`, which means to offload all layers to the GPU, so the document has been updated. Ref: `35918873b4/llama_cpp/server/settings.py (L29C22-L29C117)` `35918873b4/llama_cpp/llama.py (L125)`	2024-01-28 16:46:50 -08:00
Daniel Erenrich	0600998f38	community: Wikidata tool support (#16691 ) - Description: Adds Wikidata support to langchain. Can read out documents from Wikidata. - Issue: N/A - Dependencies: Adds implicit dependencies for `wikibase-rest-api-client` (for turning items into docs) and `mediawikiapi` (for hitting the search endpoint) - Twitter handle: @derenrich You can see an example of this tool used in a chain [here](https://nbviewer.org/urls/d.erenrich.net/upload/Wikidata_Langchain.ipynb) or [here](https://nbviewer.org/urls/d.erenrich.net/upload/Wikidata_Lars_Kai_Hansen.ipynb) <!-- Thank you for contributing to LangChain! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-28 16:45:21 -08:00
Tze Min	6ef718c5f4	Core: fix Anthropic json issue in streaming (#16670 ) Description: fix ChatAnthropic json issue in streaming Issue: https://github.com/langchain-ai/langchain/issues/16423 Dependencies: n/a --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-28 16:41:17 -08:00
Owen Sims	e451c8adc1	Community: Update Ionic Shopping Docs (#16700 ) - Description: Update to docs as originally introduced in https://github.com/langchain-ai/langchain/pull/16649 (reviewed by @baskaryan), - Twitter handle: [@ioniccommerce](https://twitter.com/ioniccommerce)	2024-01-28 16:39:49 -08:00
Christophe Bornet	2e3af04080	Use Postponed Evaluation of Annotations in Astra and Cassandra doc loaders (#16694 ) Minor/cosmetic change	2024-01-28 16:39:27 -08:00
Yelin Zhang	bc7607a4e9	docs: remove iprogress warnings (#16697 ) - Description: removes iprogress warning texts from notebooks, resulting in a little nicer to read documentation	2024-01-28 16:38:14 -08:00
Erick Friis	0255c5808b	infra: move release workflow back (#16707 )	2024-01-28 12:11:23 -07:00
Erick Friis	88e3129587	robocorp: release 0.0.2 (#16706 )	2024-01-28 11:28:58 -07:00
Christophe Bornet	36e432672a	community[minor]: Add async methods to AstraDBLoader (#16652 )	2024-01-27 17:05:41 -08:00
William FH	38425c99d2	core[minor]: Image prompt template (#14263 ) Builds on Bagatur's (#13227). See unit test for example usage (below) ```python def test_chat_tmpl_from_messages_multipart_image() -> None: base64_image = "abcd123" other_base64_image = "abcd123" template = ChatPromptTemplate.from_messages( [ ("system", "You are an AI assistant named {name}."), ( "human", [ {"type": "text", "text": "What's in this image?"}, # OAI supports all these structures today { "type": "image_url", "image_url": "data:image/jpeg;base64,{my_image}", }, { "type": "image_url", "image_url": {"url": "data:image/jpeg;base64,{my_image}"}, }, {"type": "image_url", "image_url": "{my_other_image}"}, { "type": "image_url", "image_url": {"url": "{my_other_image}", "detail": "medium"}, }, { "type": "image_url", "image_url": {"url": "https://www.langchain.com/image.png"}, }, { "type": "image_url", "image_url": {"url": "data:image/jpeg;base64,foobar"}, }, ], ), ] ) messages = template.format_messages( name="R2D2", my_image=base64_image, my_other_image=other_base64_image ) expected = [ SystemMessage(content="You are an AI assistant named R2D2."), HumanMessage( content=[ {"type": "text", "text": "What's in this image?"}, { "type": "image_url", "image_url": {"url": f"data:image/jpeg;base64,{base64_image}"}, }, { "type": "image_url", "image_url": { "url": f"data:image/jpeg;base64,{other_base64_image}" }, }, { "type": "image_url", "image_url": {"url": f"{other_base64_image}"}, }, { "type": "image_url", "image_url": { "url": f"{other_base64_image}", "detail": "medium", }, }, { "type": "image_url", "image_url": {"url": "https://www.langchain.com/image.png"}, }, { "type": "image_url", "image_url": {"url": "data:image/jpeg;base64,foobar"}, }, ] ), ] assert messages == expected ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Brace Sproul <braceasproul@gmail.com>	2024-01-27 17:04:29 -08:00
ARKA1112	3c387bc12d	docs: Error when importing packages from pydantic [docs] (#16564 ) URL : https://python.langchain.com/docs/use_cases/extraction Desc: <b> While the following statement executes successfully, it throws an error which is described below when we use the imported packages</b> ```py from pydantic import BaseModel, Field, validator ``` Code: ```python from langchain.output_parsers import PydanticOutputParser from langchain.prompts import ( PromptTemplate, ) from langchain_openai import OpenAI from pydantic import BaseModel, Field, validator # Define your desired data structure. class Joke(BaseModel): setup: str = Field(description="question to set up a joke") punchline: str = Field(description="answer to resolve the joke") # You can add custom validation logic easily with Pydantic. @validator("setup") def question_ends_with_question_mark(cls, field): if field[-1] != "?": raise ValueError("Badly formed question!") return field ``` Error: ```md PydanticUserError: The `field` and `config` parameters are not available in Pydantic V2, please use the `info` parameter instead. For further information visit https://errors.pydantic.dev/2.5/u/validator-field-config-info ``` Solution: Instead of doing: ```py from pydantic import BaseModel, Field, validator ``` We should do: ```py from langchain_core.pydantic_v1 import BaseModel, Field, validator ``` Thanks. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-27 16:46:48 -08:00
Rashedul Hasan Rijul	481493dbce	community[patch]: apply embedding functions during query if defined (#16646 ) Description: This update ensures that the user-defined embedding function specified during vector store creation is applied during queries. Previously, even if a custom embedding function was defined at the time of store creation, Bagel DB would default to using the standard embedding function during query execution. This pull request addresses this issue by consistently using the user-defined embedding function for queries if one has been specified earlier.	2024-01-27 16:46:33 -08:00
Serena Ruan	f01fb47597	community[patch]: MLflowCallbackHandler -- Move textstat and spacy as optional dependency (#16657 ) Signed-off-by: Serena Ruan <serena.rxy@gmail.com>	2024-01-27 16:15:07 -08:00
Zhuoyun(John) Xu	508bde7f40	community[patch]: Ollama - Pass headers to post request in async method (#16660 ) # Description A previous PR (https://github.com/langchain-ai/langchain/pull/15881) added option to pass headers to ollama endpoint, but headers are not pass to the async method.	2024-01-27 16:11:32 -08:00
Leonid Ganeline	5e73603e8a	docs: `DeepInfra` provider page update (#16665 ) - added description, links - consistent formatting - added links to the example pages	2024-01-27 16:05:29 -08:00
João Carlos Ferra de Almeida	3e87b67a3c	community[patch]: Add Cookie Support to Fetch Method (#16673 ) - Description: This change allows the `_fetch` method in the `WebBaseLoader` class to utilize cookies from an existing `requests.Session`. It ensures that when the `fetch` method is used, any cookies in the provided session are included in the request. This enhancement maintains compatibility with existing functionality while extending the utility of the `fetch` method for scenarios where cookie persistence is necessary. - Issue: Not applicable (new feature), - Dependencies: Requires `aiohttp` and `requests` libraries (no new dependencies introduced), - Twitter handle: N/A Co-authored-by: Joao Almeida <joao.almeida@mercedes-benz.io>	2024-01-27 16:03:53 -08:00
Daniel Erenrich	c314137f5b	docs: Fix broken link in CONTRIBUTING.md (#16681 ) - Description: link in CONTRIBUTING.md is broken - Issue: N/A - Dependencies: N/A - Twitter handle: @derenrich	2024-01-27 15:43:44 -08:00
Harrison Chase	27665e3546	[community] fix anthropic streaming (#16682 )	2024-01-27 15:16:22 -08:00
Bagatur	5975bf39ec	infra: delete old CI workflows (#16680 )	2024-01-27 14:14:53 -08:00
Christophe Bornet	4915c3cd86	[Fix] Fix Cassandra Document loader default page content mapper (#16273 ) We can't use `json.dumps` by default as many types returned by the cassandra driver are not serializable. It's safer to use `str` and let users define their own custom `page_content_mapper` if needed.	2024-01-27 11:23:02 -08:00
Nuno Campos	e86fd946c8	In stream_event and stream_log handle closed streams (#16661 ) if eg. the stream iterator is interrupted then adding more events to the send_stream will raise an exception that we should catch (and handle where appropriate) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-27 08:09:29 -08:00
Jarod Stewart	0bc397957b	docs: document Ionic Tool (#16649 ) - Description: Documentation for the Ionic Tool. A shopping assistant tool that effortlessly adds e-commerce capabilities to your Agent.	2024-01-26 16:02:07 -08:00
Nuno Campos	52ccae3fb1	Accept message-like things in Chat models, LLMs and MessagesPlaceholder (#16418 )	2024-01-26 15:44:28 -08:00
Seungwoo Ryu	570b4f8e66	docs: Update openai_tools.ipynb (#16618 ) typo	2024-01-26 15:26:27 -08:00
Pasha	4e189cd89a	community[patch]: youtube loader transcript format (#16625 ) - Description: YoutubeLoader right now returns one document that contains the entire transcript. I think it would be useful to add an option to return multiple documents, where each document would contain one line of transcript with the start time and duration in the metadata. For example, [AssemblyAIAudioTranscriptLoader](https://github.com/langchain-ai/langchain/blob/master/libs/community/langchain_community/document_loaders/assemblyai.py) is implemented in a similar way, it allows you to choose between the format to use for the document loader.	2024-01-26 15:26:09 -08:00
yin1991	a936472512	docs: Update documentation to use 'model_id' rather than 'model_name' to match actual API (#16615 ) - Description: Replace 'model_name' with 'model_id' for accuracy - Issue: [link-to-issue](https://github.com/langchain-ai/langchain/issues/16577) - Dependencies: - Twitter handle:	2024-01-26 15:01:12 -08:00
Micah Parker	6543e585a5	community[patch]: Added support for Ollama's num_predict option in ChatOllama (#16633 ) Just a simple default addition to the options payload for a ollama generate call to support a max_new_tokens parameter. Should fix issue: https://github.com/langchain-ai/langchain/issues/14715	2024-01-26 15:00:19 -08:00
Callum	6a75ef74ca	docs: Fix typo in XML agent documentation (#16645 ) This is a tiny PR that just replacer "moduels" with "modules" in the documentation for XML agents.	2024-01-26 14:59:46 -08:00
baichuan-assistant	70ff54eace	community[minor]: Add Baichuan Text Embedding Model and Baichuan Inc introduction (#16568 ) - Description: Adding Baichuan Text Embedding Model and Baichuan Inc introduction. Baichuan Text Embedding ranks #1 in C-MTEB leaderboard: https://huggingface.co/spaces/mteb/leaderboard Co-authored-by: BaiChuanHelper <wintergyc@WinterGYCs-MacBook-Pro.local>	2024-01-26 12:57:26 -08:00
Bagatur	5b5115c408	google-vertexai[patch]: streaming bug (#16603 ) Fixes errors seen here https://github.com/langchain-ai/langchain/actions/runs/7661680517/job/20881556592#step:9:229	2024-01-26 09:45:34 -08:00
ccurme	a989f82027	core: expand docstring for RunnableParallel (#16600 ) - Description: expand docstring for RunnableParallel - Issue: https://github.com/langchain-ai/langchain/issues/16462 Feel free to modify this or let me know how it can be improved!	2024-01-26 10:03:32 -05:00
Ghani	e30c6662df	Langchain-community : EdenAI chat integration. (#16377 ) - Description: This PR adds [EdenAI](https://edenai.co/) for the chat model (already available in LLM & Embeddings). It supports all [ChatModel] functionality: generate, async generate, stream, astream and batch. A detailed notebook was added. - Dependencies: No dependencies are added as we call a rest API. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-01-26 09:56:43 -05:00
Antonio Lanza	08d3fd7f2e	langchain[patch]: inconsistent results with `RecursiveCharacterTextSplitter`'s `add_start_index=True` (#16583 ) This PR fixes issue #16579	2024-01-25 15:50:06 -08:00
Eugene Yurtsev	42db96477f	docs: Update in code documentation for runnable with message history (#16585 ) Update the in code documentation for Runnable With Message History	2024-01-25 15:26:34 -08:00
Jatin Chawda	a79345f199	community[patch]: Fixed tool names snake_case (#16397 ) #16396 Fixed 1. golden_query 2. google_lens 3. memorize 4. merriam_webster 5. open_weather_map 6. pub_med 7. stack_exchange 8. generate_image 9. wikipedia	2024-01-25 15:24:19 -08:00
Bagatur	bcc71d1a57	openai[patch]: Release 0.0.5 (#16598 )	2024-01-25 15:20:28 -08:00
Bagatur	68f7468754	google-vertexai[patch]: Release 0.0.3 (#16597 )	2024-01-25 15:19:00 -08:00
Bagatur	61e876aad8	openai[patch]: Explicitly support embedding dimensions (#16596 )	2024-01-25 15:16:04 -08:00
Bagatur	5df8ab574e	infra: move indexing documentation test (#16595 )	2024-01-25 14:46:50 -08:00
Bagatur	f3d61a6e47	langchain[patch]: Release 0.1.4 (#16592 )	2024-01-25 14:19:18 -08:00
Bagatur	61b200947f	community[patch]: Release 0.0.16 (#16591 )	2024-01-25 14:19:09 -08:00
Bagatur	75ad0bba2d	openai[patch]: Release 0.0.4 (#16590 )	2024-01-25 14:08:46 -08:00
Bagatur	1e3ce338ca	core[patch]: Release 0.1.16 (#16589 )	2024-01-25 13:56:00 -08:00
Bagatur	6c89507988	docs: add rag citations page (#16549 )	2024-01-25 13:51:41 -08:00
Bagatur	31790d15ec	openai[patch]: accept function_call dict in bind_functions (#16483 ) Confusing that you can't pass in a dict	2024-01-25 13:47:44 -08:00
Bagatur	db80832e4f	docs: output parser nits (#16588 )	2024-01-25 13:20:48 -08:00
Bagatur	ef42d9d559	core[patch], community[patch], openai[patch]: consolidate openai tool… (#16485 ) … converters One way to convert anything to an OAI function: convert_to_openai_function One way to convert anything to an OAI tool: convert_to_openai_tool Corresponding bind functions on OAI models: bind_functions, bind_tools	2024-01-25 13:18:46 -08:00
Brian Burgin	148347e858	community[minor]: Add LiteLLM Router Integration (#15588 ) community: - Description: - Add new ChatLiteLLMRouter class that allows a client to use a LiteLLM Router as a LangChain chat model. - Note: The existing ChatLiteLLM integration did not cover the LiteLLM Router class. - Add tests and Jupyter notebook. - Issue: None - Dependencies: Relies on existing ChatLiteLLM integration - Twitter handle: @bburgin_0 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-25 11:03:05 -08:00
Bob Lin	35e60728b7	docs: Fix broken urls (#16559 )	2024-01-25 09:20:05 -08:00
Bob Lin	6023953ea7	docs: Fix github link (#16560 )	2024-01-25 09:19:09 -08:00
JongRok BAEK	3b8eba32f9	anthropic[patch]: Fix message type lookup in Anthropic Partners (#16563 ) - Description: The parameters for user and assistant in Anthropic should be 'ai -> assistant,' but they are reversed to 'assistant -> ai.' Below is error code. ```python anthropic.BadRequestError: Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'messages: Unexpected role "ai". Allowed roles are "user" or "assistant"'}} ``` [anthropic](`7177f3a71f/src/anthropic/types/beta/message_param.py (L13)`) - Issue: : #16561 - Dependencies: : None - Twitter handle: : None	2024-01-25 09:17:59 -08:00
Dmitry Tyumentsev	e86e66bad7	community[patch]: YandexGPT models - add sleep_interval (#16566 ) Added sleep between requests to prevent errors associated with simultaneous requests.	2024-01-25 09:07:19 -08:00
Bagatur	e510cfaa23	core[patch]: passthrough BaseRetriever.invoke(**kwargs) (#16551 ) Fix for #16547	2024-01-25 08:58:39 -08:00
Anders Åhsman	355ef2a4a6	langchain[patch]: Fix doc-string grammar (#16543 ) - Description: Small grammar fix in docstring for class `BaseCombineDocumentsChain`.	2024-01-25 10:00:06 -05:00
Aditya	9dd7cbb447	google-genai: added logic for method get_num_tokens() (#16205 ) <!-- Thank you for contributing to LangChain! Please title your PR "partners: google-genai", Replace this entire comment with: - Description: : added logic for method get_num_tokens() for ChatGoogleGenerativeAI , GoogleGenerativeAI, - Issue: : https://github.com/langchain-ai/langchain/issues/16204, - Dependencies: : None, - Twitter handle: @Aditya_Rane --------- Co-authored-by: adityarane@google.com <adityarane@google.com> Co-authored-by: Leonid Kuligin <lkuligin@yandex.ru>	2024-01-24 21:43:16 -07:00
James Braza	0785432e7b	langchain-google-vertexai: perserving grounding metadata (#16309 ) Revival of https://github.com/langchain-ai/langchain/pull/14549 that closes https://github.com/langchain-ai/langchain/issues/14548.	2024-01-24 21:37:43 -07:00
Erick Friis	adc008407e	exa: init pkg (#16553 )	2024-01-24 20:57:17 -07:00
Rave Harpaz	c4e9c9ca29	community[minor]: Add OCI Generative AI integration (#16548 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: Adding Oracle Cloud Infrastructure Generative AI integration. Oracle Cloud Infrastructure (OCI) Generative AI is a fully managed service that provides a set of state-of-the-art, customizable large language models (LLMs) that cover a wide range of use cases, and which is available through a single API. Using the OCI Generative AI service you can access ready-to-use pretrained models, or create and host your own fine-tuned custom models based on your own data on dedicated AI clusters. https://docs.oracle.com/en-us/iaas/Content/generative-ai/home.htm - Issue: None, - Dependencies: OCI Python SDK, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. Passed See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. we provide unit tests. However, we cannot provide integration tests due to Oracle policies that prohibit public sharing of api keys. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Arthur Cheng <arthur.cheng@oracle.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-24 18:23:50 -08:00
Bagatur	b8768bd6e7	docs: allow pdf download of api ref (#16550 ) https://docs.readthedocs.io/en/stable/config-file/v2.html#formats	2024-01-24 17:17:52 -08:00
Leonid Ganeline	f6a05e964b	docs: `Hugging Face` update (#16490 ) - added missed integrations to the platform page - updated integration examples: added links and fixed formats	2024-01-24 16:59:00 -08:00
Bagatur	c173a69908	langchain[patch]: oai tools output parser nit (#16540 ) allow positional init args	2024-01-24 16:57:16 -08:00
arnob-sengupta	f9976b9630	core[patch]: consolidate conditional in BaseTool (#16530 ) - Description: Refactor contradictory conditional to single line - Issue: #16528	2024-01-24 16:56:58 -08:00
Bagatur	5c2538b9f7	anthropic[patch]: allow pop by field name (#16544 ) allow `ChatAnthropicMessages(model=...)`	2024-01-24 15:48:31 -07:00
Harel Gal	a91181fe6d	community[minor]: add support for Guardrails for Amazon Bedrock (#15099 ) Added support for optionally supplying 'Guardrails for Amazon Bedrock' on both types of model invocations (batch/regular and streaming) and for all models supported by the Amazon Bedrock service. @baskaryan @hwchase17 ```python llm = Bedrock(model_id="<model_id>", client=bedrock, model_kwargs={}, guardrails={"id": " <guardrail_id>", "version": "<guardrail_version>", "trace": True}, callbacks=[BedrockAsyncCallbackHandler()]) class BedrockAsyncCallbackHandler(AsyncCallbackHandler): """Async callback handler that can be used to handle callbacks from langchain.""" async def on_llm_error( self, error: BaseException, **kwargs: Any, ) -> Any: reason = kwargs.get("reason") if reason == "GUARDRAIL_INTERVENED": # kwargs contains additional trace information sent by 'Guardrails for Bedrock' service. print(f"""Guardrails: {kwargs}""") # streaming llm = Bedrock(model_id="<model_id>", client=bedrock, model_kwargs={}, streaming=True, guardrails={"id": "<guardrail_id>", "version": "<guardrail_version>"}) ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-24 14:44:19 -08:00
Martin Kolb	04651f0248	community[minor]: VectorStore integration for SAP HANA Cloud Vector Engine (#16514 ) - Description: This PR adds a VectorStore integration for SAP HANA Cloud Vector Engine, which is an upcoming feature in the SAP HANA Cloud database (https://blogs.sap.com/2023/11/02/sap-hana-clouds-vector-engine-announcement/). - Issue: N/A - Dependencies: [SAP HANA Python Client](https://pypi.org/project/hdbcli/) - Twitter handle: @sapopensource Implementation of the integration: `libs/community/langchain_community/vectorstores/hanavector.py` Unit tests: `libs/community/tests/unit_tests/vectorstores/test_hanavector.py` Integration tests: `libs/community/tests/integration_tests/vectorstores/test_hanavector.py` Example notebook: `docs/docs/integrations/vectorstores/hanavector.ipynb` Access credentials for execution of the integration tests can be provided to the maintainers. --------- Co-authored-by: sascha <sascha.stoll@sap.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-24 14:05:07 -08:00
Leonid Kuligin	1113700b09	google-genai[patch]: better error message when location is not supported (#16535 ) Replace this entire comment with: - Description: a better error message when location is not supported	2024-01-24 13:58:46 -08:00
Bob Lin	54dd8e52a8	docs: Updated comments about `n_gpu_layers` in the Metal section (#16501 ) Ref: https://github.com/langchain-ai/langchain/issues/16502	2024-01-24 13:38:48 -08:00
Eugene Yurtsev	fe382fcf20	CI: more qa template changes (#16533 ) More qa template changes	2024-01-24 14:40:29 -05:00
Eugene Yurtsev	06f66f25e1	CI: Update q-a template (#16532 ) Update template for QA discussions	2024-01-24 14:29:31 -05:00
Eugene Yurtsev	b1b351b37e	CI: more updates to feature request template (#16531 ) More updates	2024-01-24 14:15:26 -05:00
Eugene Yurtsev	4fad71882e	CI: Fix ideas template (#16529 ) Fix ideas template	2024-01-24 14:06:53 -05:00
Anastasiia Manokhina	ce595f0203	docs:Updated integration docs structure for chat/google_vertex_ai_palm (#16201 ) Description: - checked that the doc chat/google_vertex_ai_palm is using new functions: invoke, stream etc. - added Gemini example - fixed wrong output in Sanskrit example Issue: https://github.com/langchain-ai/langchain/issues/15664 Dependencies: None Twitter handle: None	2024-01-24 10:21:32 -08:00
Unai Garay Maestre	fdbfa6b2c8	Adds progress bar to VertexAIEmbeddings (#14542 ) - Description: Adds progress bar to VertexAIEmbeddings - Issue: related issue https://github.com/langchain-ai/langchain/issues/13637 Signed-off-by: ugm2 <unaigaraymaestre@gmail.com> --------- Signed-off-by: ugm2 <unaigaraymaestre@gmail.com>	2024-01-24 11:16:16 -07:00
James Braza	643fb3ab50	langchain-google-vertexai[patch]: more verbose mypy config (#16307 ) Flushing out the `mypy` config in `langchain-google-vertexai` to show error codes and other warnings This PR also bumps `mypy` to above version 1's stable release	2024-01-24 11:10:45 -07:00
Eugene Yurtsev	8d990ba67b	CI: more update to ideas template (#16524 ) Update ideas template	2024-01-24 13:05:47 -05:00
Eugene Yurtsev	63da14d620	CI: redirect feature requests to ideas in discussions (#16522 ) Redirect feature requests to ideas in discussions	2024-01-24 13:03:10 -05:00
Erick Friis	8d299645f9	docs: rm output (#16519 )	2024-01-24 10:19:34 -07:00
Eugene Yurtsev	dfd94fb2f0	CI: Update issue template (#16517 ) More updates to the ISSUE template	2024-01-24 12:09:21 -05:00
Lance Martin	0b740ebd49	Update SQL agent toolkit docs (#16409 )	2024-01-24 09:03:17 -08:00
Francisco Ingham	13cf4594f4	docs: added a few suggestions for sql docs (#16508 )	2024-01-24 08:48:41 -08:00
Eugene Yurtsev	6004e9706f	Docs: Add streaming section (#16468 ) Adds a streaming section to LangChain documentation, explaining `stream`/`astream` API and `astream_events` API.	2024-01-24 10:38:39 -05:00
Tipwheal	66aafc0573	Docs: typo in tool use quick start page (#16494 ) Minor typo fix	2024-01-24 10:37:12 -05:00
Jeremi Joslin	9e95699277	community[patch]: Fix error message when litellm is not installed (#16316 ) The error message was mentioning the wrong package. I updated it to the correct one.	2024-01-23 21:42:29 -08:00
bachr	b3ed98dec0	community[patch]: avoid KeyError when language not in LANGUAGE_SEGMENTERS (#15212 ) Description: Handle unsupported languages in same way as when none is provided Issue: The following line will throw a KeyError if the language is not supported. ```python self.Segmenter = LANGUAGE_SEGMENTERS[language] ``` E.g. when using `Language.CPP` we would get `KeyError: <Language.CPP: 'cpp'>` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-23 21:09:43 -08:00
Nuno Campos	3f38e1a457	Remove double line (#16426 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-23 20:22:37 -08:00
chyroc	61da2ff24c	community[patch]: use SecretStr for yandex model secrets (#15463 )	2024-01-23 20:08:53 -08:00
Alessio Serra	d628a80a5d	community[patch]: added 'conversational' as a valid task for hugginface endopoint models (#15761 ) - Description: added the conversational task to hugginFace endpoint in order to use models designed for chatbot programming. - Dependencies: None --------- Co-authored-by: Alessio Serra (ext.) <alessio.serra@partner.bmw.de> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-23 20:04:15 -08:00
Karim Lalani	4c7755778d	community[patch]: SurrealDB fix for asyncio (#16092 ) Code fix for asyncio	2024-01-23 19:46:19 -08:00
BeatrixCohere	2b2285dac0	docs: Update cohere rerank and comparison docs (#16198 ) - Description: Update the cohere rerank docs to use cohere embeddings - Issue: n/a - Dependencies: n/a - Twitter handle: n/a	2024-01-23 19:39:42 -08:00
Raunak	476bf8b763	community[patch]: Load list of files using UnstructuredFileLoader (#16216 ) - Description: Updated `_get_elements()` function of `UnstructuredFileLoader `class to check if the argument self.file_path is a file or list of files. If it is a list of files then it iterates over the list of file paths, calls the partition function for each one, and appends the results to the elements list. If self.file_path is not a list, it calls the partition function as before. - Issue: Fixed #15607, - Dependencies: NA - Twitter handle: NA Co-authored-by: H161961 <Raunak.Raunak@Honeywell.com>	2024-01-23 19:37:37 -08:00
Xudong Sun	019b6ebe8d	community[minor]: Add iFlyTek Spark LLM chat model support (#13389 ) - Description: This PR enables LangChain to access the iFlyTek's Spark LLM via the chat_models wrapper. - Dependencies: websocket-client ^1.6.1 - Tag maintainer: @baskaryan ### SparkLLM chat model usage Get SparkLLM's app_id, api_key and api_secret from [iFlyTek SparkLLM API Console](https://console.xfyun.cn/services/bm3) (for more info, see [iFlyTek SparkLLM Intro](https://xinghuo.xfyun.cn/sparkapi) ), then set environment variables `IFLYTEK_SPARK_APP_ID`, `IFLYTEK_SPARK_API_KEY` and `IFLYTEK_SPARK_API_SECRET` or pass parameters when using it like the demo below: ```python3 from langchain.chat_models.sparkllm import ChatSparkLLM client = ChatSparkLLM( spark_app_id="<app_id>", spark_api_key="<api_key>", spark_api_secret="<api_secret>" ) ```	2024-01-23 19:23:46 -08:00
Ali Zendegani	80fcc50c65	langchain[patch]: Minor Fix: Enable Passing custom_headers for Authentication in GraphQL Agent/Tool (#16413 ) - Description: This PR aims to enhance the `langchain` library by enabling the support for passing `custom_headers` in the `GraphQLAPIWrapper` usage within `langchain/agents/load_tools.py`. While the `GraphQLAPIWrapper` from the `langchain_community` module is inherently capable of handling `custom_headers`, its current invocation in `load_tools.py` does not facilitate this functionality. This limitation restricts the use of the `graphql` tool with databases or APIs that require token-based authentication. The absence of support for `custom_headers` in this context also leads to a lack of error messages when attempting to interact with secured GraphQL endpoints, making debugging and troubleshooting more challenging. This update modifies the `load_tools` function to correctly handle `custom_headers`, thereby allowing secure and authenticated access to GraphQL services requiring tokens. Example usage after the proposed change: ```python tools = load_tools( ["graphql"], graphql_endpoint="https://your-graphql-endpoint.com/graphql", custom_headers={"Authorization": f"Token {api_token}"}, ) ``` - Issue: None, - Dependencies: None, - Twitter handle: None	2024-01-23 19:19:53 -08:00
Serena Ruan	5c6e123757	community[patch]: Fix MlflowCallback with none artifacts_dir (#16487 )	2024-01-23 19:09:02 -08:00
Krista Pratico	0e2e7d8b83	langchain[patch]: allow passing client with OpenAIAssistantRunnable (#16486 ) - Description: This addresses the issue tagged below where if you try to pass your own client when creating an OpenAI assistant, a pydantic error is raised: Example code: ```python import openai from langchain.agents.openai_assistant import OpenAIAssistantRunnable client = openai.OpenAI() interpreter_assistant = OpenAIAssistantRunnable.create_assistant( name="langchain assistant", instructions="You are a personal math tutor. Write and run code to answer math questions.", tools=[{"type": "code_interpreter"}], model="gpt-4-1106-preview", client=client ) ``` Error: `pydantic.v1.errors.ConfigError: field "client" not yet prepared, so the type is still a ForwardRef. You might need to call OpenAIAssistantRunnable.update_forward_refs()` It additionally updates type hints and docstrings to indicate that an AzureOpenAI client is permissible as well. - Issue: https://github.com/langchain-ai/langchain/issues/15948 - Dependencies: N/A	2024-01-23 18:48:29 -08:00
Eugene Yurtsev	d898d2f07b	docs: Fix version in which astream_events was released (#16481 ) Fix typo in version	2024-01-23 18:41:44 -08:00
bu2kx	ff3163297b	community[minor]: Add KDBAI vector store (#12797 ) Addition of KDBAI vector store (https://kdb.ai). Dependencies: `kdbai_client` v0.1.2 Python package. Sample notebook: `docs/docs/integrations/vectorstores/kdbai.ipynb` Tag maintainer: @bu2kx Twitter handle: @kxsystems	2024-01-23 18:37:01 -08:00
JongRok BAEK	4ec3fe4680	docs: Updated integration docs structure for chat/anthropic (#16268 ) Description: - Added output and environment variables - Updated the documentation for chat/anthropic, changing references from `langchain.schema` to `langchain_core.prompts`. Issue: https://github.com/langchain-ai/langchain/issues/15664 Dependencies: None Twitter handle: None Since this is my first open-source PR, please feel free to point out any mistakes, and I'll be eager to make corrections.	2024-01-23 18:36:28 -08:00
Shivani Modi	4e160540ff	community[minor]: Adding Konko Completion endpoint (#15570 ) This PR introduces update to Konko Integration with LangChain. 1. New Endpoint Addition: Integration of a new endpoint to utilize completion models hosted on Konko. 2. Chat Model Updates for Backward Compatibility: We have updated the chat models to ensure backward compatibility with previous OpenAI versions. 4. Updated Documentation: Comprehensive documentation has been updated to reflect these new changes, providing clear guidance on utilizing the new features and ensuring seamless integration. Thank you to the LangChain team for their exceptional work and for considering this PR. Please let me know if any additional information is needed. --------- Co-authored-by: Shivani Modi <shivanimodi@Shivanis-MacBook-Pro.local> Co-authored-by: Shivani Modi <shivanimodi@Shivanis-MBP.lan>	2024-01-23 18:22:32 -08:00
Gianfranco Demarco	c69f599594	langchain[patch]: Extract _aperform_agent_action from _aiter_next_step from AgentExecutor (#15707 ) - Description: extreact the _aperform_agent_action in the AgentExecutor class to allow for easier overriding. Extracted logic from _iter_next_step into a new method _perform_agent_action for consistency and easier overriding. - Issue: #15706 Closes #15706	2024-01-23 18:22:09 -08:00
i-w-a	95ee69a301	langchain[patch]: In HTMLHeaderTextSplitter set default encoding to utf-8 (#16372 ) - Description: The HTMLHeaderTextSplitter Class now explicitly specifies utf-8 encoding in the part of the split_text_from_file method that calls the HTMLParser. - Issue: Prevent garbled characters due to differences in encoding of html files (except for English in particular, I noticed that problem with Japanese). - Dependencies: No dependencies, - Twitter handle: @i_w__a	2024-01-23 18:20:29 -08:00
Noah Stapp	e135e5257c	community[patch]: Include scores in MongoDB Atlas QA chain results (#14666 ) Adds the ability to return similarity scores when using `RetrievalQA.from_chain_type` with `MongoDBAtlasVectorSearch`. Requires that `return_source_documents=True` is set. Example use: ``` vector_search = MongoDBAtlasVectorSearch.from_documents(...) qa = RetrievalQA.from_chain_type( llm=OpenAI(), chain_type="stuff", retriever=vector_search.as_retriever(search_kwargs={"additional": ["similarity_score"]}), return_source_documents=True ) ... docs = qa({"query": "..."}) docs["source_documents"][0].metadata["score"] # score will be here ``` I've tested this feature locally, using a MongoDB Atlas Cluster with a vector search index.	2024-01-23 18:18:28 -08:00
Serena Ruan	90f5a1c40e	community[minor]: Improve mlflow callback (#15691 ) - Description: Allow passing run_id to MLflowCallbackHandler to resume a run instead of creating a new run. Support recording retriever relevant metrics. Refactor the code to fix some bugs. --------- Signed-off-by: Serena Ruan <serena.rxy@gmail.com>	2024-01-23 18:16:51 -08:00
Facundo Santiago	92e6a641fd	feat: adding paygo api support for Azure ML / Azure AI Studio (#14560 ) - Description: Introducing support for LLMs and Chat models running in Azure AI studio and Azure ML using the new deployment mode pay-as-you-go (model as a service). - Issue: NA - Dependencies: None. - Tag maintainer: @prakharg-msft @gdyre - Twitter handle: @santiagofacundo Examples added: * [docs/docs/integrations/llms/azure_ml.ipynb](https://github.com/santiagxf/langchain/blob/santiagxf/azureml-endpoints-paygo-community/docs/docs/integrations/chat/azureml_endpoint.ipynb) * [docs/docs/integrations/chat/azureml_chat_endpoint.ipynb](https://github.com/santiagxf/langchain/blob/santiagxf/azureml-endpoints-paygo-community/docs/docs/integrations/chat/azureml_chat_endpoint.ipynb) --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-23 17:08:51 -08:00
Davide Menini	9ce177580a	community: normalize bedrock embeddings (#15103 ) In this PR I added a post-processing function to normalize the embeddings. This happens only if the new `normalize` flag is `True`. --------- Co-authored-by: taamedag <Davide.Menini@swisscom.com>	2024-01-23 17:05:24 -08:00
baichuan-assistant	20fcd49348	community: Fix Baichuan Chat. (#15207 ) - Description: Baichuan Chat (with both Baichuan-Turbo and Baichuan-Turbo-192K models) has updated their APIs. There are breaking changes. For example, BAICHUAN_SECRET_KEY is removed in the latest API but is still required in Langchain. Baichuan's Langchain integration needs to be updated to the latest version. - Issue: #15206 - Dependencies: None, - Twitter handle: None @hwchase17. Co-authored-by: BaiChuanHelper <wintergyc@WinterGYCs-MacBook-Pro.local>	2024-01-23 17:01:57 -08:00
gcheron	cfc225ecb3	community: SQLStrStore/SQLDocStore provide an easy SQL alternative to `InMemoryStore` to persist data remotely in a SQL storage (#15909 ) Description: - Implement `SQLStrStore` and `SQLDocStore` classes that inherits from `BaseStore` to allow to persist data remotely on a SQL server. - SQL is widely used and sometimes we do not want to install a caching solution like Redis. - Multiple issues/comments complain that there is no easy remote and persistent solution that are not in memory (users want to replace InMemoryStore), e.g., https://github.com/langchain-ai/langchain/issues/14267, https://github.com/langchain-ai/langchain/issues/15633, https://github.com/langchain-ai/langchain/issues/14643, https://stackoverflow.com/questions/77385587/persist-parentdocumentretriever-of-langchain - This is particularly painful when wanting to use `ParentDocumentRetriever ` - This implementation is particularly useful when: * it's expensive to construct an InMemoryDocstore/dict * you want to retrieve documents from remote sources * you just want to reuse existing objects - This implementation integrates well with PGVector, indeed, when using PGVector, you already have a SQL instance running. `SQLDocStore` is a convenient way of using this instance to store documents associated to vectors. An integration example with ParentDocumentRetriever and PGVector is provided in docs/docs/integrations/stores/sql.ipynb or [here](https://github.com/gcheron/langchain/blob/sql-store/docs/docs/integrations/stores/sql.ipynb). - It persists `str` and `Document` objects but can be easily extended. Issue: Provide an easy SQL alternative to `InMemoryStore`. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-23 16:50:48 -08:00
dudgeon	26b2ad6d5b	Fixed typo on quickstart.ipynb (#16482 ) - Description: Quick typo fix: `inpect` >> `inspect` - Issue: N/A - Dependencies: any dependencies required for this change, - Twitter handle: @geoffdudgeon	2024-01-23 16:50:13 -08:00
Massimiliano Pronesti	e529939c54	feat(llms): support more tasks in HuggingFaceHub LLM and remove deprecated dep (#14406 ) - Description: this PR upgrades the `HuggingFaceHub` LLM: * support more tasks (`translation` and `conversational`) * replaced the deprecated `InferenceApi` with `InferenceClient` * adjusted the overall logic to use the "recommended" model for each task when no model is provided, and vice-versa. - Tag mainter(s): @baskaryan @hwchase17	2024-01-23 16:48:56 -08:00
Erick Friis	afb25eeec4	cli[patch]: add integration tests to default makefile (#16479 )	2024-01-23 16:09:16 -07:00
Erick Friis	51c8ef6af4	templates: fix azure params in retrieval agent (#16257 ) - FIX templates/retrieval-agent/retireval-agent/chain.py to use the new Syntax for Azure env params - cr --------- Co-authored-by: braun-viathan <p.braun@viathan.de> Co-authored-by: Braun-viathan <121631422+braun-viathan@users.noreply.github.com>	2024-01-23 14:58:06 -07:00
Lance Martin	c3530f1c11	templates: Minor nit on HyDE (#16478 )	2024-01-23 14:23:08 -07:00
Bagatur	ba326b98d0	langchain[patch]: Release 0.1.3 (#16475 )	2024-01-23 11:50:25 -08:00
Bagatur	54149292f8	community[patch]: Release 0.0.15 (#16474 )	2024-01-23 11:50:10 -08:00
Bagatur	ef6a335570	core[patch]: Release 0.1.15 (#16473 )	2024-01-23 11:31:50 -08:00
Erick Friis	1f4ac62dee	cli[patch], google-vertexai[patch]: readme template (#16470 )	2024-01-23 12:08:17 -07:00
Eugene Yurtsev	39d1cbfecf	Docs: Document astream_events API (#16300 ) Document astream events API	2024-01-23 12:32:45 -05:00
Tomaz Bratanic	d0a8082188	Fix neo4j sanitize (#16439 ) Fix the sanitization bug and add an integration test	2024-01-23 10:56:28 -05:00
William FH	5de59f9236	Core[Patch] Parse tool input after on_start (#16430 ) For tracing, if a validation error occurs, currently it is attributed to the previous step of the chain. It would be nice to have the on_start and on_error callbacks called for tools when there is a validation error that occurs to more easily attribute the root-cause	2024-01-23 10:54:47 -05:00
Nuno Campos	226fe645f1	core[patch] Do not try to access attribute of None (#16321 )	2024-01-22 22:10:03 -08:00
Florian MOREL	4b7969efc5	community[minor]: New documents loader for visio files (with extension .vsdx) (#16171 ) Description : New documents loader for visio files (with extension .vsdx) A [visio file](https://fr.wikipedia.org/wiki/Microsoft_Visio) (with extension .vsdx) is associated with Microsoft Visio, a diagram creation software. It stores information about the structure, layout, and graphical elements of a diagram. This format facilitates the creation and sharing of visualizations in areas such as business, engineering, and computer science. A Visio file can contain multiple pages. Some of them may serve as the background for others, and this can occur across multiple layers. This loader extracts the textual content from each page and its associated pages, enabling the extraction of all visible text from each page, similar to what an OCR algorithm would do. Dependencies : xmltodict package	2024-01-22 22:07:03 -08:00
KhoPhi	fb41b68ea1	docs: Update with LCEL examples to Ollama & ChatOllama Integration notebook (#16194 ) - Description: Updated the Chat/Ollama docs notebook with LCEL chain examples - Issue: #15664 I'm a new contributor 😊 - Dependencies: No dependencies - Twitter handle: Comments: - How do I truncate the output of the stream in the notebook if and or when it goes on and on and on for even the basic of prompts? Edit: Looking forward to feedback @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-22 22:05:59 -08:00
Michael Gorham	3b0226b2c6	docs: Update redis_chat_message_history.ipynb (#16344 ) ## Problem Spent several hours trying to figure out how to pass `RedisChatMessageHistory` as a `GetSessionHistoryCallable` with a different REDIS hostname. This example kept connecting to `redis://localhost:6379`, but I wanted to connect to a server not hosted locally. ## Cause Assumption the user knows how to implement `BaseChatMessageHistory` and `GetSessionHistoryCallable` ## Solution Update documentation to show how to explicitly set the REDIS hostname using a lambda function much like the MongoDB and SQLite examples.	2024-01-22 21:59:59 -08:00
Ian	c98994c3c9	docs: Improve notebook to show how to use tidb to store history messages (#16420 ) After merging [PR #16304](https://github.com/langchain-ai/langchain/pull/16304), I realized that our notebook example for integrating TiDB with LangChain was too basic. To make it more useful and user-friendly, I plan to create a detailed example. This will show how to use TiDB for saving history messages in LangChain, offering a clearer, more practical guide for our users	2024-01-22 21:58:37 -08:00
Eugene Yurtsev	c88750d54b	Docs: Agent streaming notebooks (#15858 ) Update information about streaming in the agents section. Show how to use astream_events to get token by token streaming.	2024-01-22 21:54:55 -05:00
Eugene Yurtsev	e5672bc944	docs: Re-write custom agent to show to write a tools agent (#15907 ) Shows how to write a tools agent rather than a functions agent.	2024-01-22 17:28:31 -08:00
Boris Feld	404abf139a	community: Add CometLLM tracing context var (#15765 ) I also added LANGCHAIN_COMET_TRACING to enable the CometLLM tracing integration similar to other tracing integrations. This is easier for end-users to enable it rather than importing the callback and pass it manually. (This is the same content as https://github.com/langchain-ai/langchain/pull/14650 but rebased and squashed as something seems to confuse Github Action).	2024-01-22 15:17:16 -08:00
Nicolò Boschi	a500527030	infra: google-vertexai relax types-requests deps range (#16264 ) - Description: At the moment it's not possible to include in the same project langchain-google-vertexai and boto3 (e.g. use bedrock and vertex in the same application) because of the dependency resolutions conflict. boto3 is still using urllib3 1.x, meanwhile langchain-google-vertexai -> types-requests depends on urllib3 2.x. [the last version of types-requests that allows urllib3 1.x is 2.31.0.6](https://pypi.org/project/types-requests/#description). In this PR I allow the vertexai package to get that version also. - Twitter handle: nicoloboschi	2024-01-22 14:54:41 -08:00
DL	b9e7f6f38a	community[minor]: Bedrock async methods (#12477 ) Description: Added support for asynchronous streaming in the Bedrock class and corresponding tests. Primarily: async def aprepare_output_stream async def _aprepare_input_and_invoke_stream async def _astream async def _acall I've ensured that the code adheres to the project's linting and formatting standards by running make format, make lint, and make test. Issue: #12054, #11589 Dependencies: None Tag maintainer: @baskaryan Twitter handle: @dominic_lovric --------- Co-authored-by: Piyush Jain <piyushjain@duck.com>	2024-01-22 14:44:49 -08:00
Jennifer Melot	d6275e47f2	docs: Updated integration docs structure for tools/arxiv (#16091 ) (#16250 ) - Description: Updated docs for tools/arxiv to use `AgentExecutor` and `invoke` - Issue: #15664 - Dependencies: None - Twitter handle: None	2024-01-22 14:34:22 -08:00
Frank995	5694728816	community[patch]: Implement vector length definition at init time in PGVector for indexing (#16133 ) Replace this entire comment with: - Description: allow user to define tVector length in PGVector when creating the embedding store, this allows for later indexing - Issue: #16132 - Dependencies: None	2024-01-22 14:32:44 -08:00
ChengZi	a950fa0487	docs: add milvus multitenancy doc (#16177 ) - Description: add milvus multitenancy doc, it is an example for this [pr](https://github.com/langchain-ai/langchain/pull/15740) . - Issue: No, - Dependencies: No, - Twitter handle: No Signed-off-by: ChengZi <chen.zhang@zilliz.com>	2024-01-22 14:25:26 -08:00
Chase VanSteenburg	1011b681dc	core[patch]: Fix f-string formatting in error message for configurable_fields (#16411 ) - Description: Simple fix to f-string formatting. Allows more informative ValueError output. - Issue: None needed. - Dependencies: None. - Twitter handle: @FlightP1an	2024-01-22 14:08:44 -08:00
parkererickson-tg	b26a22f307	community[minor]: add TigerGraph support (#16280 ) Description: Add support for querying TigerGraph databases through the InquiryAI service. Issue: N/A Dependencies: N/A Twitter handle: @TigerGraphDB	2024-01-22 14:07:44 -08:00
Christophe Bornet	8da34118bc	docs: Add documentation for Cassandra Document Loader (#16282 )	2024-01-22 14:06:21 -08:00
Alireza Kashani	d1b4ead87c	community[patch]: Update grobid.py (#16298 ) there is a case where "coords" does not exist in the "sentence" therefore, the "split(";")" will lead to error. we can fix that by adding "if sentence.get("coords") is not None:" the resulting empty "sbboxes" from this scenario will raise error at "sbboxes[0]["page"]" because sbboxes are empty. the PDF from https://pubmed.ncbi.nlm.nih.gov/23970373/ can replicate those errors.	2024-01-22 14:03:58 -08:00
s-g-1	fbe592a5ce	community[patch]: fix typo in pgvecto_rs debug msg (#16318 ) fixes typo in pip install message for the pgvecto_rs community vector store no issues found mentioning this no dependents changed	2024-01-22 14:01:33 -08:00
James Braza	d511366dd3	infra: absolute `EXAMPLE_DIR` path in core unit tests (#16325 ) If you invoked testing from places besides `core/`, this `EXAMPLE_DIR` path won't work. This PR makes`EXAMPLE_DIR` robust against invocation location	2024-01-22 14:00:23 -08:00
Jonathan Algar	774e543e1f	docs: fix formatting issue in rockset.ipynb (#16328 ) Description: randomly discovered while working on another PR https://github.com/quarto-dev/quarto-cli/discussions/8131#discussioncomment-8027706 @anubhav94N ICYI	2024-01-22 13:59:45 -08:00
Ian	b9f5104e6c	communty[minor]: Store Message History to TiDB Database (#16304 ) This pull request integrates the TiDB database into LangChain for storing message history, marking one of several steps towards a comprehensive integration of TiDB with LangChain. A simple usage ```python from datetime import datetime from langchain_community.chat_message_histories import TiDBChatMessageHistory history = TiDBChatMessageHistory( connection_string="mysql+pymysql://<host>:<PASSWORD>@<host>:4000/<db>?ssl_ca=/etc/ssl/cert.pem&ssl_verify_cert=true&ssl_verify_identity=true", session_id="code_gen", earliest_time=datetime.utcnow(), # Optional to set earliest_time to load messages after this time point. ) history.add_user_message("hi! How's feature going?") history.add_ai_message("It's almot done") ```	2024-01-22 13:56:56 -08:00
Erick Friis	35ec0bbd3b	cli[patch]: pypi fields (#16410 )	2024-01-22 14:28:30 -07:00
Erick Friis	2ac3a82d85	cli[patch]: new fields in integration template, release 0.0.21 (#16398 )	2024-01-22 14:26:47 -07:00
Erick Friis	cfe95ab085	multiple: update langsmith dep (#16407 )	2024-01-22 14:23:11 -07:00
Sarthak Chaure	dd5b8107b1	Docs: Updated callbacks/index.mdx (#16404 ) The callbacks get started demo code was updated , replacing the chain.run() command ( which is now depricated) ,with the updated chain.invoke() command. Solving the following issue : #16379 Twitter/X : @Hazxhx	2024-01-22 16:10:19 -05:00
Omar-aly	873de14cd8	docs: update vectorstores/llm_rails integration doc (#16199 ) Description: - Updated the docs for the vectorstores integration module llm_rails.ipynb Issue: - [Connected to Issue #15664](https://github.com/langchain-ai/langchain/issues/15664) Dependencies: - N/A Co-authored-by: omaraly23 <112936089+omaraly22@users.noreply.github.com>	2024-01-22 11:40:08 -08:00
Eli Lucherini	6b2a57161a	community[patch]: allow additional kwargs in MlflowEmbeddings for compatibility with Cohere API (#15242 ) - Description: add support for kwargs in`MlflowEmbeddings` `embed_document()` and `embed_query()` so that all the arguments required by Cohere API (and others?) can be passed down to the server. - Issue: #15234 - Dependencies: MLflow with MLflow Deployments (`pip install mlflow[genai]`) Tests Now this code [adapted from the docs](https://python.langchain.com/docs/integrations/providers/mlflow#embeddings-example) for the Cohere API works locally. ```python """ Setup ----- export COHERE_API_KEY=... mlflow deployments start-server --config-path examples/deployments/cohere/config.yaml Run --- python /path/to/this/file.py """ embeddings = MlflowCohereEmbeddings(target_uri="http://127.0.0.1:5000", endpoint="embeddings") print(embeddings.embed_query("hello")[:3]) print(embeddings.embed_documents(["hello", "world"])[0][:3]) ``` Output ``` [0.060455322, 0.028793335, -0.025848389] [0.031707764, 0.021057129, -0.009361267] ```	2024-01-22 11:38:11 -08:00
Guillem Orellana Trullols	aad2aa7188	community[patch]: BedrockChat -> Support Titan express as chat model (#15408 ) Titan Express model was not supported as a chat model because LangChain messages were not "translated" to a text prompt. Co-authored-by: Guillem Orellana Trullols <guillem.orellana_trullols@siemens.com>	2024-01-22 11:37:23 -08:00
Piotr Mardziel	1b9001db47	core[patch]: preserve inspect.iscoroutinefunction with @deprecated decorator (#16295 ) Adjusted `deprecate` decorator to make sure decorated async functions are still recognized as "coroutinefunction" by `inspect`. Before change, functions such as `LLMChain.acall` which are decorated as deprecated are not recognized as coroutine functions. After the change, they are recognized: ```python import inspect from langchain import LLMChain # Is false before change but true after. inspect.iscoroutinefunction(LLMChain.acall) ```	2024-01-22 11:34:13 -08:00
Katarina Supe	01c2f27ffa	community[patch]: Update Memgraph support (#16360 ) - Description: I removed two queries to the database and left just one whose results were formatted afterward into other type of schema (avoided two calls to DB) - Issue: / - Dependencies: / - Twitter handle: @supe_katarina	2024-01-22 11:33:28 -08:00
Lance Martin	369e90d427	docs: Minor update to Robocorp toolkit docs (#16399 )	2024-01-22 11:33:13 -08:00
Hadi	a1c0cf21c9	docs: Update import library for StreamlitCallbackHandler (#16401 ) - Description: Some code sources have been moved from `langchain` to `langchain_community` and so the documentation is not yet up-to-date. This is specifically true for `StreamlitCallbackHandler` which returns a `warning` message if not loaded from `langchain_community`., - Issue: I don't see a # issue that could address this problem but perhaps #10744, - Dependencies: Since it's a documentation change no dependencies are required	2024-01-22 11:33:00 -08:00
JaguarDB	7ecd2f22ac	community[patch]: update documentation on jaguar vector store (#16346 ) - Description: update documentation on jaguar vector store: Instruction for setting up jaguar server and usage of text_tag. - Issue: - Dependencies: - Twitter handle: --------- Co-authored-by: JY <jyjy@jaguardb>	2024-01-22 11:28:38 -08:00
Max Jakob	8569b8f680	community[patch]: ElasticsearchStore enable max inner product (#16393 ) Enable max inner product for approximate retrieval strategy. For exact strategy we lack the necessary `maxInnerProduct` function in the Painless scripting language, this is why we do not add it there. Similarity docs: https://www.elastic.co/guide/en/elasticsearch/reference/current/dense-vector.html#dense-vector-params --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Joe McElroy <joseph.mcelroy@elastic.co>	2024-01-22 11:26:18 -08:00
Iskren Ivov Chernev	fc196cab12	community[minor]: DeepInfra support for chat models (#16380 ) Add deepinfra chat models support. This is https://github.com/langchain-ai/langchain/pull/14234 re-opened from my branch (so maintainers can edit).	2024-01-22 11:22:17 -08:00
Bagatur	eac91b60c9	docs: qa rag nit (#16400 )	2024-01-22 11:17:32 -08:00
Bagatur	85e8423312	community[patch]: Update bing results tool name (#16395 ) Make BingSearchResults tool name OpenAI functions compatible (can't have spaces). Fixes #16368	2024-01-22 11:11:03 -08:00
Max Jakob	de209af533	community[patch]: ElasticsearchStore: add relevance function selector (#16378 ) Implement similarity function selector for ElasticsearchStore. The scores coming back from Elasticsearch are already similarities (not distances) and they are already normalized (see [docs](https://www.elastic.co/guide/en/elasticsearch/reference/current/dense-vector.html#dense-vector-params)). Hence we leave the scores untouched and just forward them. This fixes #11539. However, in hybrid mode (when keyword search and vector search are involved) Elasticsearch currently returns no scores. This PR adds an error message around this fact. We need to think a bit more to come up with a solution for this case. This PR also corrects a small error in the Elasticsearch integration test. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-22 11:52:20 -07:00
y2noda	54f90fc6bc	langchain_google_vertexai:Enable the use of langchain's built-in tools in Gemini's function calling (#16341 ) - Issue: This is a PR about #16340 <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Co-authored-by: yuhei.tsunoda <yuhei.tsunoda@brainpad.co.jp>	2024-01-22 11:16:36 -07:00
Tom Jorquera	1445ac95e8	community[patch]: Enable streaming for GPT4all (#16392 ) `streaming` param was never passed to model	2024-01-22 09:54:18 -08:00
Bagatur	af9f1738ca	langchain[patch]: Release 0.1.2 (#16388 )	2024-01-22 09:32:24 -08:00
Bagatur	8779013847	community[patch]: Release 0.0.14 (#16384 )	2024-01-22 08:50:19 -08:00
Bagatur	9cf0f5eb78	core[patch]: Release 0.1.14 (#16382 )	2024-01-22 08:28:03 -08:00
Bagatur	1dc6c1ce06	core[patch], community[patch], langchain[patch], docs: Update SQL chains/agents/docs (#16168 ) Revamp SQL use cases docs. In the process update SQL chains and agents.	2024-01-22 08:19:08 -08:00
Jatin Chawda	05162928c0	Docs: Fixed Urls of AsyncHtmlLoader, AsyncChromiumLoader and HTML2Text links in Web scraping Docs (#16365 ) Fixing links in documentation.	2024-01-22 11:03:03 -05:00
Bob Lin	acc14802d1	Fix `conn` field definition in SQLiteEntityStore (#15440 )	2024-01-22 07:53:49 -08:00
James Braza	e1c59779ad	core[patch]: Remove `print` statement on missing `grandalf` dependency in favor of more explicit ImportError (#16326 ) After this PR an ImportError will be raised without a print if grandalf is missing when using grandalf related code for printing runnable graphs.	2024-01-22 10:48:54 -05:00
Nuno Campos	971a68d04f	Docs: Update README.md in core (#16329 ) Docs: Update README.md in core	2024-01-22 10:42:31 -05:00
Christophe Bornet	f9be877ed7	Docs: Add self-querying retriever and store to AstraDB provider doc (#16362 ) Add self-querying retriever and store to AstraDB provider doc	2024-01-22 10:24:28 -05:00
Mateusz Szewczyk	076dbb1a8f	docs: IBM watsonx.ai Use `invoke` instead of `__call__` (#16371 ) - Description: Updating documentation of IBM [watsonx.ai](https://www.ibm.com/products/watsonx-ai) LLM with using `invoke` instead of `__call__` - Dependencies: [ibm-watsonx-ai](https://pypi.org/project/ibm-watsonx-ai/), - Tag maintainer: : Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. ✅ The following warning information show when i use `run` and `__call__` method: ``` LangChainDeprecationWarning: The function `__call__` was deprecated in LangChain 0.1.7 and will be removed in 0.2.0. Use invoke instead. warn_deprecated( ``` We need to update documentation for using `invoke` method	2024-01-22 10:22:03 -05:00
Bob Lin	c6bd7778b0	Use `invoke` instead of `__call__` (#16369 ) The following warning information will be displayed when i use `llm(PROMPT)`: ```python /Users/169/llama.cpp/venv/lib/python3.11/site-packages/langchain_core/_api/deprecation.py:117: LangChainDeprecationWarning: The function `__call__` was deprecated in LangChain 0.1.7 and will be removed in 0.2.0. Use invoke instead. warn_deprecated( ``` So I changed to standard usage.	2024-01-22 10:18:43 -05:00
Eugene Yurtsev	89372fca22	core[patch]: Update sys info information (#16297 ) Update information collected in sys info. python -m langchain_core.sys_info System Information ------------------ > OS: Linux > OS Version: #14~22.04.1-Ubuntu SMP PREEMPT_DYNAMIC Mon Nov 20 18:15:30 UTC 2 > Python Version: 3.11.4 (main, Sep 25 2023, 10:06:23) [GCC 11.4.0] Package Information ------------------- > langchain_core: 0.1.10 > langchain: 0.1.0 > langchain_community: 0.0.11 > langchain_cli: 0.0.20 > langchain_experimental: 0.0.36 > langchain_openai: 0.0.2 > langchainhub: 0.1.14 > langserve: 0.0.19 Packages not installed (Not Necessarily a Problem) -------------------------------------------------- The following packages were not found: > langgraph	2024-01-22 10:18:04 -05:00
Luke	5396604ef4	community: Handling missing key in Google Trends API response. (#15864 ) - Description: Handing response where _interest_over_time_ is missing. - Issue: #15859 - Dependencies: None	2024-01-21 18:11:45 -08:00
Virat Singh	c2a614eddc	community: Add PolygonLastQuote Tool and Toolkit (#15990 ) Description: In this PR, I am adding a `PolygonLastQuote` Tool, which can be used to get the latest price quote for a given ticker / stock. Additionally, I've added a Polygon Toolkit, which we can use to encapsulate future tools that we build for Polygon. Twitter handle: [@virattt](https://twitter.com/virattt) --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-21 15:08:55 -08:00
Nuno Campos	ef75bb63ce	core[patch] Fix tracer output of streamed runs with non-addable output (#16324 ) - Used to be None, now is just the last chunk <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-20 18:52:26 -08:00
Ryan French	3d23a5eb36	langchain[patch]: Allow OpenSearch Query Translator to correctly work with Date types (#16022 ) Description: Fixes an issue where the Date type in an OpenSearch Self Querying Retriever would fail to generate a valid query Issue: https://github.com/langchain-ai/langchain/issues/14225	2024-01-19 17:57:18 -08:00
Ofer Mendelevitch	ffae98d371	template: Update Vectara templates (#15363 ) fixed multi-query template for Vectara added self-query template for Vectara Also added prompt_name parameter to summarization CC @efriis Twitter handle: @ofermend	2024-01-19 17:32:33 -08:00
Bagatur	1e29b676d5	core[patch]: simple fallback streaming (#16055 )	2024-01-19 16:31:54 -08:00
Eugene Yurtsev	4ef0ed4ddc	astream_events: Add version parameter while method is in beta (#16290 ) Add a version parameter while the method is in beta phase. The idea is to make it possible to minimize making breaking changes for users while we're iterating on schema. Once the API is stable we can assign a default version requirement.	2024-01-19 13:20:02 -05:00
Bagatur	91230ef5d1	openai[patch]: Release 0.0.3 (#16289 )	2024-01-19 10:15:08 -08:00
Hamza Kyamanywa	39b3c6d94c	langchain[patch]: Add konlpy based text splitting for Korean (#16003 ) - Description: Adds a text splitter based on [Konlpy](https://konlpy.org/en/latest/#start) which is a Python package for natural language processing (NLP) of the Korean language. (It is like Spacy or NLTK for Korean) - Dependencies: Konlpy would have to be installed before this splitter is used, - Twitter handle: @untilhamza	2024-01-19 09:44:56 -08:00
Hongyu Lin	9b0a531aa2	doc: Fix small typo in quickstart (#16164 ) - Description: fix small typo in quickstart --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-19 09:44:22 -08:00
Sagar B Manjunath	63e2acc964	docs: Fix minor issues in NVIDIA RAG canonical template (#16189 ) - Description: Fixes a few issues in NVIDIAcanonical RAG template's README, and adds a notebook for the template - Dependencies: Adds the pypdf dependency which is needed for ingestion, and updates the lock file --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-19 09:44:08 -08:00
Lance Martin	881d1c3ec5	Update MultiON toolkit docs (#16286 )	2024-01-19 09:37:20 -08:00
Bagatur	e3828bee43	core[patch]: Release 0.1.13 (#16287 )	2024-01-19 09:28:31 -08:00
Bagatur	2454fefc53	docs: agent prompt docs (#16105 )	2024-01-19 09:19:22 -08:00
Bagatur	84bf5787a7	core[patch], openai[patch]: Chat openai stream logprobs (#16218 )	2024-01-19 09:16:09 -08:00
Bagatur	6f7a414955	docs: fix links (#16284 )	2024-01-19 08:51:12 -08:00
Eugene Yurtsev	cc2e30fa13	CI: update the description used for privileged issue template (#16277 ) Update description	2024-01-19 10:13:33 -05:00
Eugene Yurtsev	3b649f4331	CI: Add privileged version for issue creation (#16276 ) Add privileged version for issue creation. This adds a version of issue creation which is unstructured by design to make it easier for maintainers to create issues. Maintainers are expected to write / describe issues clearly.	2024-01-19 09:53:51 -05:00
Eugene Yurtsev	c0d453d8ac	CI: Disable blank issues, add links to QA discussions & show and tell (#16275 ) Update the issue template	2024-01-19 09:34:23 -05:00
Carey	021b0484a8	community[patch]: add skipped test for inner product normalization (#14989 ) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 23:03:15 -08:00
Lance Martin	f63906a9c2	Test and update MultiON agent toolkit docs (#16235 )	2024-01-18 20:24:35 -08:00
Christophe Bornet	3ccbe11363	community[minor]: Add Cassandra document loader (#16215 ) - Description: document loader for Apache Cassandra - Twitter handle: cbornet_	2024-01-18 18:49:02 -08:00
Tomaz Bratanic	fc84083ce5	docs: Add neo4j semantic blog post link to templates (#16225 )	2024-01-18 18:45:22 -08:00
mikeFore4	9d32af72ce	community[patch]: huggingface hub character removal bug fix (#16233 ) - Description: Some text-generation models on huggingface repeat the prompt in their generated response, but not all do! The tests use "gpt2" which DOES repeat the prompt and as such, the HuggingFaceHub class is hardcoded to remove the first few characters of the response (to match the len(prompt)). However, if you are using a model (such as the very popular "meta-llama/Llama-2-7b-chat-hf") that DOES NOT repeat the prompt in it's generated text, then the beginning of the generated text will be cut off. This code change fixes that bug by first checking whether the prompt is repeated in the generated response and removing it conditionally. - Issue: #16232 - Dependencies: N/A - Twitter handle: N/A	2024-01-18 18:44:10 -08:00
Andreas Motl	3613d8a2ad	community[patch]: Use SQLAlchemy's `bulk_save_objects` method to improve insert performance (#16244 ) - Description: Improve [pgvector vector store adapter](https://github.com/langchain-ai/langchain/blob/v0.1.1/libs/community/langchain_community/vectorstores/pgvector.py) to save embeddings in batches, to improve its performance. - Issue: NA - Dependencies: NA - References: https://github.com/crate-workbench/langchain/pull/1 Hi again from the CrateDB team, following up on GH-16243, this is another minor patch to the pgvector vector store adapter. Inserting embeddings in batches, using [SQLAlchemy's `bulk_save_objects`](https://docs.sqlalchemy.org/en/20/orm/session_api.html#sqlalchemy.orm.Session.bulk_save_objects) method, can deliver substantial performance gains. With kind regards, Andreas. NB: As I am seeing just now that this method is a legacy feature of SA 2.0, it will need to be reworked on a future iteration. However, it is not deprecated yet, and I haven't been able to come up with a different implementation, yet.	2024-01-18 18:35:39 -08:00
Ashley Xu	0f99646ca6	docs: add the enrollment form for`BigQueryVectorSearch` (#16240 ) This PR adds the enrollment form for BigQueryVectorSearch.	2024-01-18 18:34:06 -08:00
Eugene Yurtsev	177af65dc4	core[minor]: RFC Add astream_events to Runnables (#16172 ) This PR adds `astream_events` method to Runnables to make it easier to stream data from arbitrary chains. * Streaming only works properly in async right now * One should use `astream()` with if mixing in imperative code as might be done with tool implementations * Astream_log has been modified with minimal additive changes, so no breaking changes are expected * Underlying callback code / tracing code should be refactored at some point to handle things more consistently (OK for now) - ~~[ ] verify event for on_retry~~ does not work until we implement streaming for retry - ~~[ ] Any rrenaming? Should we rename "event" to "hook"?~~ - [ ] Any other feedback from community? - [x] throw NotImplementedError for `RunnableEach` for now ## Example See this [Example Notebook](`dbbc7fa0d6/docs/docs/modules/agents/how_to/streaming_events.ipynb`) for an example with streaming in the context of an Agent ## Event Hooks Reference Here is a reference table that shows some events that might be emitted by the various Runnable objects. Definitions for some of the Runnable are included after the table. \| event \| name \| chunk \| input \| output \| \|----------------------\|------------------\|---------------------------------\|-----------------------------------------------\|-------------------------------------------------\| \| on_chat_model_start \| [model name] \| \| {"messages": [[SystemMessage, HumanMessage]]} \| \| \| on_chat_model_stream \| [model name] \| AIMessageChunk(content="hello") \| \| \| \| on_chat_model_end \| [model name] \| \| {"messages": [[SystemMessage, HumanMessage]]} \| {"generations": [...], "llm_output": None, ...} \| \| on_llm_start \| [model name] \| \| {'input': 'hello'} \| \| \| on_llm_stream \| [model name] \| 'Hello' \| \| \| \| on_llm_end \| [model name] \| \| 'Hello human!' \| \| on_chain_start \| format_docs \| \| \| \| \| on_chain_stream \| format_docs \| "hello world!, goodbye world!" \| \| \| \| on_chain_end \| format_docs \| \| [Document(...)] \| "hello world!, goodbye world!" \| \| on_tool_start \| some_tool \| \| {"x": 1, "y": "2"} \| \| \| on_tool_stream \| some_tool \| {"x": 1, "y": "2"} \| \| \| \| on_tool_end \| some_tool \| \| \| {"x": 1, "y": "2"} \| \| on_retriever_start \| [retriever name] \| \| {"query": "hello"} \| \| \| on_retriever_chunk \| [retriever name] \| {documents: [...]} \| \| \| \| on_retriever_end \| [retriever name] \| \| {"query": "hello"} \| {documents: [...]} \| \| on_prompt_start \| [template_name] \| \| {"question": "hello"} \| \| \| on_prompt_end \| [template_name] \| \| {"question": "hello"} \| ChatPromptValue(messages: [SystemMessage, ...]) \| Here are declarations associated with the events shown above: `format_docs`: ```python def format_docs(docs: List[Document]) -> str: '''Format the docs.''' return ", ".join([doc.page_content for doc in docs]) format_docs = RunnableLambda(format_docs) ``` `some_tool`: ```python @tool def some_tool(x: int, y: str) -> dict: '''Some_tool.''' return {"x": x, "y": y} ``` `prompt`: ```python template = ChatPromptTemplate.from_messages( [("system", "You are Cat Agent 007"), ("human", "{question}")] ).with_config({"run_name": "my_template", "tags": ["my_template"]}) ```	2024-01-18 21:27:01 -05:00
SN	f175bf7d7b	Use env for revision id if not passed in as param; use `git describe` as backup (#16227 ) Co-authored-by: William Fu-Hinthorn <13333726+hinthornw@users.noreply.github.com>	2024-01-18 16:15:26 -08:00
Erick Friis	e5878c467a	infra: scheduled testing env (#16239 )	2024-01-18 14:28:01 -08:00
Erick Friis	2f348c695a	infra: add nvidia api secret to integration testing (#15972 )	2024-01-18 14:20:02 -08:00
Erick Friis	50959abf0c	infra: google cse id integration test (#16238 )	2024-01-18 14:12:00 -08:00
Erick Friis	b9495da92d	langchain[patch]: fix stuff documents chain api docs render (#16159 )	2024-01-18 14:07:44 -08:00
Erick Friis	eec3347939	docs: together cookbook import (#16236 )	2024-01-18 14:07:19 -08:00
Erick Friis	92bc80483a	infra: google search api key (#16237 )	2024-01-18 14:06:38 -08:00
Erick Friis	0e76d84137	google-vertexai[patch]: more integration test fixes (#16234 )	2024-01-18 13:59:23 -08:00
Erick Friis	aa35b43bcd	docs, google-vertex[patch]: function docs (#16231 )	2024-01-18 13:15:09 -08:00
Erick Friis	f2b2d59e82	docs: transport and client options docs (#16226 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-18 12:23:04 -08:00
Harrison Chase	f60f59d69f	google-vertexai[patch]: Harrison/vertex function calling (#16223 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 12:17:40 -08:00
Rajesh Thallam	6bc6d64a12	langchain_google_vertexai[patch]: Add support for SystemMessage for Gemini chat model (#15933 ) - Description: In Google Vertex AI, Gemini Chat models currently doesn't have a support for SystemMessage. This PR adds support for it only if a user provides additional convert_system_message_to_human flag during model initialization (in this case, SystemMessage would be prepended to the first HumanMessage). NOTE: The implementation is similar to #14824 - Twitter handle: rajesh_thallam --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 10:22:07 -08:00
Erick Friis	65b231d40b	mistralai[patch]: async integration tests (#16214 )	2024-01-18 09:45:44 -08:00
jzaldi	ed118950fe	docs: Updated integration docs structure for llm/google_vertex_ai_palm (#16091 ) - Description: Updated doc for llm/google_vertex_ai_palm with new functions: `invoke`, `stream`... Changed structure of the document to match the required one. - Issue: #15664 - Dependencies: None - Twitter handle: None --------- Co-authored-by: Jorge Zaldívar <jzaldivar@google.com>	2024-01-18 09:45:27 -08:00
Bagatur	aa2e642ce3	docs: tool use nits (#16211 )	2024-01-18 09:17:53 -08:00
Eugene Zapolsky	6b9e3ed9e9	google-vertexai[minor]: added safety_settings property to gemini wrapper (#15344 ) Description: Gemini model has quite annoying default safety_settings settings. In addition, current VertexAI class doesn't provide a property to override such settings. So, this PR aims to - add safety_settings property to VertexAI - fix issue with incorrect LLM output parsing when LLM responds with appropriate 'blocked' response - fix issue with incorrect parsing LLM output when Gemini API blocks prompt itself as inappropriate - add safety_settings related tests I'm not enough familiar with langchain code base and guidelines. So, any comments and/or suggestions are very welcome. Issue: it will likely fix #14841 --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-18 08:54:30 -08:00
Eugene Yurtsev	ecd4f0a7ec	core[patch]: testing add chat model for unit-tests (#16209 ) This PR adds a fake chat model for testing purposes. Used in this PR: https://github.com/langchain-ai/langchain/pull/16172	2024-01-18 11:30:53 -05:00
Bagatur	27ad65cc68	docs: add tool use diagrams (#16207 )	2024-01-18 07:59:54 -08:00
SN	7d444724d7	Add revision identifier to run_on_dataset (#16167 ) Allow specifying revision identifier for better project versioning	2024-01-17 20:27:43 -08:00
Eugene Yurtsev	5d8c147332	docs: Document and test PydanticOutputFunctionsParser (#15759 ) This PR adds documentation and testing to `PydanticOutputFunctionsParser(OutputFunctionsParser)`.	2024-01-17 18:21:18 -08:00
Christophe Bornet	3502a407d9	infra: Use dotenv in langchain-community's integration tests (#16137 ) * Removed some env vars not used in langchain package IT * Added Astra DB env vars in langchain package, used for cache tests * Added conftest.py to load env vars in langchain_community IT * Added .env.example in langchain_community IT	2024-01-17 18:18:26 -08:00
Nuno Campos	ca014d5b04	Update readme (#16160 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-17 13:56:07 -08:00
Tomaz Bratanic	1e80113ac9	community[patch]: Add neo4j timeout and value sanitization option (#16138 ) The timeout function comes in handy when you want to kill longrunning queries. The value sanitization removes all lists that are larger than 128 elements. The idea here is to remove embedding properties from results.	2024-01-17 13:22:19 -08:00
Bagatur	27ed2673da	docs: model io order (#16163 )	2024-01-17 13:13:31 -08:00
Krishna Shedbalkar	f238217cea	community[patch]: Basic Logging and Human input to ShellTool (#15932 ) - Description: As Shell tool is very versatile, while integrating it into applications as openai functions, developers have no clue about what command is being executed using the ShellTool. All one can see is: ![image](https://github.com/langchain-ai/langchain/assets/60742358/540e274a-debc-4564-9027-046b91424df3) Summarising my feature request: 1. There's no visibility about what command was executed. 2. There's no mechanism to prevent a command to be executed using ShellTool, like a y/n human input which can be accepted from user to proceed with executing the command., - Issue: the issue #15931 it fixes if applicable, - Dependencies: There isn't any dependancy, - Twitter handle: @krishnashed	2024-01-17 12:57:51 -08:00
Bagatur	2af813c7eb	docs: bump sphinx>=5 (#16162 )	2024-01-17 12:57:34 -08:00
Bagatur	679a3ae933	openai[patch]: clarify azure error (#16157 )	2024-01-17 12:43:14 -08:00
Bagatur	7ad9eba8f4	core[patch]: Release 0.1.12 (#16161 )	2024-01-17 12:39:45 -08:00
Leonid Kuligin	58f0ba306b	changed default params for gemini (#16044 ) Replace this entire comment with: - Description: changed default values for Vertex LLMs (to be handled on the SDK's side)	2024-01-17 12:19:18 -08:00
David DeCaprio	ec9642d667	docs: Updated MongoDB Chat history example notebook to use LCEL format. (#15750 ) - Description: Updated the MongoDB example integration notebook to latest standards - Issue: [15664](https://github.com/langchain-ai/langchain/issues/15664) - Dependencies: None - Twitter handle: @davedecaprio --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-17 12:07:17 -08:00
Bagatur	5c73fd5bba	core[patch]: support old core namespaces (#16155 )	2024-01-17 11:26:25 -08:00
Christophe Bornet	fb940d11df	community[patch]: Use newer MetadataVectorCassandraTable in Cassandra vector store (#15987 ) as VectorTable is deprecated Tested manually with `test_cassandra.py` vector store integration test.	2024-01-17 10:37:07 -08:00
Mohammad Mohtashim	1fa056c324	community[patch]: Don't set search path for unknown SQL dialects (#16047 ) - Description: Made a small fix for the `SQLDatabase` highlighted in an issue. The issue pertains to switching schema for different SQL engines. - Issue: #16023 @baskaryan	2024-01-17 10:31:11 -08:00
Erick Friis	11327e6b64	google-vertexai[patch]: typing, release 0.0.2 (#16153 )	2024-01-17 10:16:59 -08:00
Leonid Ganeline	2709d3e5f2	langchain[patch]: updated imports for `langchain.callbacks` (#16060 ) Updated imports from 'langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 10:06:59 -08:00
Leonid Ganeline	c5f6b828ad	langchain[patch], community[minor]: move `output_parsers.ernie_functions` (#16057 ) `output_parsers.ernie_functions` moved into `community`	2024-01-17 10:06:18 -08:00
Bagatur	e7ddec1f2c	docs: change parallel doc name (#16152 )	2024-01-17 10:04:34 -08:00
Leonid Ganeline	49aff3ea5b	langchain[patch]: updated `agents` imports (#16061 ) Updated imports into `langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 10:02:29 -08:00
Leonid Ganeline	60b1bd02d7	langchain[patch]: updated imports for `output_parsers` (#16059 ) Updated imports from `langchain` to `core` where it is possible	2024-01-17 10:02:12 -08:00
Leonid Ganeline	9e9ad9b0e9	langchain[patch]: updated `retrievers` imports (#16062 ) Updated imports into `langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 10:01:06 -08:00
Leonid Ganeline	d350be959d	langchain[patch]: updated `chains` imports (#16064 ) Updated imports into `langchain` to `core` where it is possible --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-17 09:58:42 -08:00
Fei Wang	d0e101e4e0	community[patch]: fix ollama astream (#16070 ) Update ollama.py	2024-01-17 09:42:41 -08:00
Joshua Carroll	bc0cb1148a	docs: Fix StreamlitChatMessageHistory docs to latest API (#16072 ) - Description: Update [this page](https://python.langchain.com/docs/integrations/memory/streamlit_chat_message_history) to use the latest API - Issue: https://github.com/langchain-ai/langchain/issues/13995 - Dependencies: None - Twitter handle: @OhSynap	2024-01-17 09:42:10 -08:00
ChengZi	8597484195	langchain[patch]: support more comparators in Milvus self-querying retriever (#16076 ) - Description: Support IN and LIKE comparators in Milvus self-querying retriever, based on [Boolean Expression Rules](https://milvus.io/docs/boolean.md) - Issue: No - Dependencies: No - Twitter handle: No Signed-off-by: ChengZi <chen.zhang@zilliz.com>	2024-01-17 09:41:23 -08:00
David DeCaprio	9c2f1f07a0	docs: Updated SQLite example to use LCEL and SQLChatMessageHistory (#16094 ) - Description: Updated the SQLite example integration notebook to latest standards - Issue: [15664](https://github.com/langchain-ai/langchain/issues/15664) - Dependencies: None - Twitter handle: @davedecaprio	2024-01-17 09:39:44 -08:00
Kapil Sachdeva	f406dc3872	docs: in RunnableRetry, correct the example snippet that uses with_retry method on Runnable (#16108 ) The example code snippet for with_retry is using incorrect argument names. This PR fixes that	2024-01-17 09:11:27 -08:00
Abhinav	da96c511d1	docs: Replace azure_cosmos_db_vector_search with azure_cosmos_db in Cosmos DB Documentation (#16122 ) Description: This PR fixes an error in the documentation for Azure Cosmos DB Integration. Issue: The correct way to import `AzureCosmosDBVectorSearch` is ```python from langchain_community.vectorstores.azure_cosmos_db import ( AzureCosmosDBVectorSearch, ) ``` While the [documentation](https://python.langchain.com/docs/integrations/vectorstores/azure_cosmos_db) states it to be ```python from langchain_community.vectorstores.azure_cosmos_db_vector_search import ( AzureCosmosDBVectorSearch, CosmosDBSimilarityType, ) ``` As you can see in [azure_cosmos_db.py](`c323742f4f/libs/langchain/langchain/vectorstores/azure_cosmos_db.py (L1C45-L2)`) Dependencies:: None Twitter handle: None	2024-01-17 09:11:16 -08:00
BeatrixCohere	b0c3e3db2b	community[patch]: Handle when documents are not provided in the Cohere response (#16144 ) - Description: This handles the cohere response when documents aren't included in the response - Issue: N/A - Dependencies: N/A - Twitter handle: N/A	2024-01-17 09:11:00 -08:00
Felix Krones	d91126fc64	community[patch]: missing unpack operator for or_clause in pgvector document filter (#16148 ) - Fix for #16146 - Adding unpack operation to "or" and "and" filter for pgvector retriever. #	2024-01-17 09:10:43 -08:00
purificant	3606c5d5e9	infra: update poetry 1.6.1 -> 1.7.1 (#15027 )	2024-01-17 08:51:20 -08:00
Ikko Eltociear Ashimine	a35e5f19a8	docs: Update gradient.ipynb (#16149 ) Enviroment -> Environment	2024-01-17 08:48:24 -08:00
Erick Friis	06fe2f4fb0	partners: add license field (#16117 ) - bumps package post versions for packages without current unreleased updates - will bump package version in release prs associated with packages that do have changes (mistral, vertex)	2024-01-17 08:37:13 -08:00
Erick Friis	ce10fe0c2f	mistralai[patch]: release 0.0.3 (#16116 ) embeddings	2024-01-17 08:36:05 -08:00
William FH	e5cf1e2414	Community[patch]use secret str in Tavily and HuggingFaceInferenceEmbeddings (#16109 ) So the api keys don't show up in repr's Still need to do tests	2024-01-17 00:30:07 -08:00
William FH	f3601b0aaf	Community[Patch] Remove docs form bm25 repr (#16110 ) Resolves: https://github.com/langchain-ai/langsmith-sdk/issues/356	2024-01-17 00:00:55 -08:00
David	c323742f4f	mistralai[minor]: Add embeddings (#15282 ) - Description: Adds MistralAIEmbeddings class for embeddings, using the new official API. - Dependencies: mistralai - Tag maintainer: @efriis, @hwchase17 - Twitter handle: @LMS_David_RS Create `integrations/text_embedding/mistralai.ipynb`: an example notebook for MistralAIEmbeddings class Modify `embeddings/__init__.py`: Import the class Create `embeddings/mistralai.py`: The embedding class Create `integration_tests/embeddings/test_mistralai.py`: The test file. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-16 17:48:37 -08:00
Leonid Ganeline	f974eb5b8b	docs: updated `Anyscale` page (#16107 ) - added description - fixed broken links - added setting instructions - added the Chat model reference	2024-01-16 17:13:51 -08:00
Leonid Kuligin	4df14a61fc	google-vertexai[minor]: add function calling on VertexAI (#15822 ) Replace this entire comment with: - Description: Description: added support for tools on VertexAI - Issue: #15073 - Twitter handle: lkuligin --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-16 17:01:26 -08:00
Bagatur	8840a8cc95	docs: tool-use use case (#15783 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-16 10:41:14 -08:00
Bagatur	3d34347a85	langchain[patch]: bump core dep to 0.1.9 (#16104 )	2024-01-16 10:39:07 -08:00
Bagatur	62a2e9ee19	langchain[patch]: Release 0.1.1 (#16103 )	2024-01-16 10:17:38 -08:00
Christophe Bornet	6b6269441c	docs: Add page for AstraDB self retriever (#16077 ) Preview: https://langchain-git-fork-cbornet-astra-self-retriever-docs-langchain.vercel.app/docs/integrations/retrievers/self_query/astradb	2024-01-16 09:50:30 -08:00
Juan Bustos	5f057f24ac	docs: Update elasticsearch.ipynb (#16090 ) Fixed a typo, the parameter used for the Elasticsearch API key was called api_key, but the parameter is called es_api_key.	2024-01-16 09:49:42 -08:00
Bagatur	076593382a	core[patch]: Release 0.1.11 (#16100 )	2024-01-16 09:46:04 -08:00
Bagatur	c5656a4905	core[patch]: pass exceptions to fallbacks (#16048 )	2024-01-16 09:36:43 -08:00
Nuno Campos	770f57196e	Add unit test for overridden lc_namespace (#16093 )	2024-01-16 09:22:52 -08:00
Erick Friis	52114bdfac	community[patch]: release 0.0.13 (#16087 )	2024-01-16 06:25:28 -08:00
James Briggs	ca288d8f2c	community[patch]: add vector param to index query for pinecone vec store (#16054 )	2024-01-16 06:12:19 -08:00
Antonio Morales	476fb328ee	community[patch]: implement adelete from VectorStore in Qdrant (#16005 ) Description: Implement `adelete` function from `VectorStore` in `Qdrant` to support other asynchronous flows such as async indexing (`aindex`) which requires `adelete` to be implemented. Since `Qdrant` can be passed an async qdrant client, this can be supported easily. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 19:57:09 -08:00
Bagatur	697a6f2c80	langchain[patch]: fix requests lint (#16049 )	2024-01-15 12:54:30 -08:00
高远	061e63eef2	community[minor]: add vikingdb vecstore (#15155 ) --------- Co-authored-by: gaoyuan <gaoyuan.20001218@bytedance.com>	2024-01-15 12:34:01 -08:00
andrijdavid	d196646811	community[patch]: Refactor OpenAIWhisperParserLocal (#15150 ) This PR addresses an issue in OpenAIWhisperParserLocal where requesting CUDA without availability leads to an AttributeError #15143 Changes: - Refactored Logic for CUDA Availability: The initialization now includes a check for CUDA availability. If CUDA is not available, the code falls back to using the CPU. This ensures seamless operation without manual intervention. - Parameterizing Batch Size and Chunk Size: The batch_size and chunk_size are now configurable parameters, offering greater flexibility and optimization options based on the specific requirements of the use case. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-15 12:29:14 -08:00
Zhichao HAN	5cf06db3b3	community[minor]: add JsonRequestsWrapper tool (#15374 ) Description: This new feature enhances the flexibility of pipeline integration, particularly when working with RESTful APIs. ``JsonRequestsWrapper`` allows for the decoding of JSON output, instead of the only option for text output. --------- Co-authored-by: Zhichao HAN <hanzhichao2000@hotmail.com>	2024-01-15 12:27:19 -08:00
chyroc	d334efc848	community[patch]: fix top_p type hint (#15452 ) fix: https://github.com/langchain-ai/langchain/issues/15341 @efriis	2024-01-15 11:59:39 -08:00
Mateusz Szewczyk	251afda549	community[patch]: fix stop (stop_sequences) param on WatsonxLLM (#15541 ) - Description: Fix to IBM [watsonx.ai](https://www.ibm.com/products/watsonx-ai) LLM provider (stop (`stop_sequences`) param on watsonxLLM) - Dependencies: [ibm-watsonx-ai](https://pypi.org/project/ibm-watsonx-ai/),	2024-01-15 11:44:57 -08:00
Funkeke	7220124368	community[patch]: fix tongyi completion and params error (#15544 ) fix tongyi completion json parse error and prompt's params error --------- Co-authored-by: fangkeke <3339698829@qq.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-15 11:43:13 -08:00
Averi Kitsch	ee378a0f40	docs: add page for Firestore Chat Message History integration (#15554 ) - Description: Adds documentation for the `FirestoreChatMessageHistory` integration and lists integration in Google's documentation - Issue: NA - Dependencies: No --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-15 11:42:33 -08:00
盐粒 Yanli	ddf4e7c633	community[minor]: Update pgvecto_rs to use its high level sdk (#15574 ) - Description: Update pgvecto_rs to use its high level sdk, - Issue: fix #15173	2024-01-15 11:41:59 -08:00
YHW	ce21392a21	community: add a flag that determines whether to load the milvus collection (#15693 ) fix https://github.com/langchain-ai/langchain/issues/15694 --------- Co-authored-by: hyungwookyang <hyungwookyang@worksmobile.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 11:25:23 -08:00
Mohammad Mohtashim	9e779ca846	community[patch]: Fixing the SlackGetChannel Tool Input Error (#15725 ) Fixed the issue mentioned in #15698 for SlackGetChannel Tool. @baskaryan. --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 11:23:55 -08:00
axiangcoding	daa9ccae52	community[patch]: deprecate ErnieBotChat and ErnieEmbeddings classes (#15862 ) - Description: add deprecated warning for ErnieBotChat and ErnieEmbeddings. - These two classes lack maintenance and do not use the sdk provided by qianfan, which means hard to implement some key feature like streaming. - The alternative `langchain_community.chat_models.QianfanChatEndpoint` and `langchain_community.embeddings.QianfanEmbeddingsEndpoint` can completely replace these two classes, only need to change configuration items. - Issue: None, - Dependencies: None, - Twitter handle: None --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 11:14:44 -08:00
Eugene Yurtsev	7c57cfd8f0	docs: Update OpenAI functions agent (#15894 ) Add info and a tip explaining when to use this agent.	2024-01-15 11:14:29 -08:00
Eugene Yurtsev	beec7259c8	docs: Add info admonitions to a few agents (#15899 ) Add admonitions directly in the agent page to explain constraints and include a link to agent types.	2024-01-15 11:14:11 -08:00
JaguarDB	b11fd3bedc	community[patch]: jaguar vector store fix integer-element error when joining metadata values (#15939 ) - Description: some document loaders add integer-type metadata values which cause error - Issue: 15937 - Dependencies: none --------- Co-authored-by: JY <jyjy@jaguardb>	2024-01-15 11:13:45 -08:00
Bigtable123	7306032dcf	docs: update baidu_qianfan_endpoint.ipynb doc (#15940 ) - Description: Updated the docs for the chat integration module baidu_qianfan_endpoint.ipynb - Issue: #15664 - Dependencies:N/A	2024-01-15 11:13:21 -08:00
Neo Zhao	21e0df937f	community[patch]: fix a bug that mistakenly handle zip iterator in FAISS.from_embeddings (#16020 ) Description: `zip` is iterator that will only produce result once, so the previous code will cause the `embeddings` to be an empty list. Issue: I could not find a related issue. Dependencies: this PR does not introduce or affect dependencies. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-15 11:13:14 -08:00
Christophe Bornet	15c2b4a47e	community[minor]: Add AstraDB self query retriever (#15738 ) - Description: this change adds a self-query retriever for AstraDB - Twitter handle: cbornet_	2024-01-15 11:04:11 -08:00
Leonid Ganeline	fb676d8a9b	community[minor], langchain[minor]: refactor `output_parsers` Rail (#15852 ) Moved Rail parser to `community` package.	2024-01-15 10:54:49 -08:00
Bhadresh Savani	6137c7608d	docs: Integration Documentation updated run to invoke for llms/ai21.ipynb (#15889 ) - Description: Updated Integration Documentation for [llms/ai21.ipynb](https://github.com/langchain-ai/langchain/blob/master/docs/docs/integrations/llms/ai21.ipynb) - Issue: #15664, - Dependencies: NA, - Twitter handle: @BhadreshSavani	2024-01-15 10:53:22 -08:00
Massimiliano Pronesti	e80aab2275	docs(community): update Amadeus toolkit to langchain v0.1 (#15976 ) - Description: docs update following the changes introduced in #15879 <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-15 10:50:47 -08:00
Ashley Xu	ce7723c1e5	community[minor]: add additional support for `BigQueryVectorSearch` (#15904 ) BigQuery vector search lets you use GoogleSQL to do semantic search, using vector indexes for fast but approximate results, or using brute force for exact results. This PR: 1. Add `metadata[_job_ib]` in Document returned by any similarity search 2. Add `explore_job_stats` to enable users to explore job statistics and better the debuggability 3. Set the minimum row limit for running create vector index.	2024-01-15 10:45:15 -08:00
Mohammed Naqi	8799b028a6	community[minor]: Adding asynchronous function implementation for Doctran (#15941 ) ## Description In this update, I addressed the missing implementation for atransform_document, which is the asynchronous counterpart of transform_document in Doctran. ### Usage Example: ```py # Instantiate DoctranPropertyExtractor with specified properties property_extractor = DoctranPropertyExtractor(properties=properties) # Asynchronously extract properties from a list of documents extracted_document = await property_extractor.atransform_documents( documents, properties=properties ) # Display metadata of the first extracted document print(json.dumps(extracted_document[0].metadata, indent=2)) ``` ## Issue - Pull request #14525 has caused a break in the aforementioned code. Instead of removing an asynchronous implementation of a function, consider implementing a synchronous version alongside it.	2024-01-15 10:39:25 -08:00
Antonio Mindov	fb7e66b809	docs: fix typo in inspect runnables docs (#15994 ) - Description: Fixing a typo related to prompts in the inspecting runnables docs	2024-01-15 10:35:26 -08:00
Raunak	c0773ab329	community[patch]: Fixed 'coroutine' object is not subscriptable error (#15986 ) - Description: Added parenthesis in return statement of aembed_query() funtion to fix 'coroutine' object is not subscriptable error. - Dependencies: NA Co-authored-by: H161961 <Raunak.Raunak@Honeywell.com>	2024-01-15 10:34:10 -08:00
Karim Lalani	14244bd7e5	community[minor]: Added document loader for SurrealDB (#15995 ) Added a simple document loader to work with SurrealDB.	2024-01-15 10:32:42 -08:00
Karim Lalani	768e5e33bc	community[minor]: Fix to match SurrealDB 0.3.2 SDK (#15996 ) New version of SurrealDB python sdk was causing the integration to break. This fix addresses that change.	2024-01-15 10:31:59 -08:00
shahrin014	86321a949f	community: Ollama - Parameter structure to follow official documentation (#16035 ) ## Feature - Follow parameter structure as per official documentation - top level parameters (e.g. model, system, template) will be passed as top level parameters - other parameters will be sent in options unless options is provided ![image](https://github.com/langchain-ai/langchain/assets/17451563/d14715d9-9701-4ee3-b44b-89fffea62389) ## Tests - Test if top level parameters handled properly - Test if parameters that are not top level parameters are handled as options - Test if options is provided, it will be passed as is	2024-01-15 10:17:58 -08:00
Bagatur	60d6a416e6	docs: fix self query diagram (#16043 )	2024-01-15 10:09:20 -08:00
Mahad	f7706637a8	docs: fix documentation broken link in integrations chroma (#16041 ) - Description: Fixed broken link in the documentation for Chroma., - Issue: - Dependencies:	2024-01-15 08:37:03 -08:00
Nir Kopler	0fa06732b7	community: add new gpt-3.5-turbo-1106 finetuned for cost calculation (#16039 ) Description: Added the new gpt-3.5-turbo-1106 for finetuned cost calculation, Issue: no issue found open By the information in OpenAI the pricing is the same as the older model (0613)	2024-01-15 08:36:54 -08:00
Erick Friis	7b084b4cc7	docs: more pip installs (#15771 ) - vertex chat - google - some pip openai - percent and openai - all percent - more - pip - fmt - docs: google vertex partner docs - fmt - docs: more pip installs	2024-01-12 18:16:00 -08:00
Bagatur	bccb07f93e	core[patch]: simple prompt pretty printing (#15968 )	2024-01-12 21:08:51 -05:00
Bagatur	3f75fd41cc	docs: agent table fix (#15964 )	2024-01-12 17:54:55 -08:00
Virat Singh	eb6e385dc5	community: Add PolygonAPIWrapper and get_last_quote endpoint (#15971 ) - Description: Added a `PolygonAPIWrapper` and an initial `get_last_quote` endpoint, which allows us to get the last price quote for a given `ticker`. Once merged, I can add a Polygon tool in `tools/` for agents to use. - Twitter handle: [@virattt](https://twitter.com/virattt) The Polygon.io Stocks API provides REST endpoints that let you query the latest market data from all US stock exchanges.	2024-01-12 17:52:09 -08:00
Erick Friis	74bac7bda1	community[patch]: core min 0.1.9 (#15974 )	2024-01-12 15:32:06 -08:00
Erick Friis	845e407e08	community[patch]: release 0.0.12 (#15973 )	2024-01-12 15:27:05 -08:00
Jonathan Algar	a74f3a4979	Batch update of alt text and title attributes for images in md/mdx files across repo (#15357 ) Description: Batch update of alt text and title attributes for images in `md` & `mdx` files across the repo using [alttexter](https://github.com/jonathanalgar/alttexter)/[alttexter-ghclient](https://github.com/jonathanalgar/alttexter-ghclient) (built using LangChain/LangSmith). Limitation: cannot update `ipynb` files because of [this issue](https://github.com/langchain-ai/langchain/pull/15357#issuecomment-1885037250). Can revisit when Docusaurus is bumped to v3. I checked all the generated alt texts and titles and didn't find any technical inaccuracies. That's not to say they're _perfect_, but a lot better than what's there currently. [Deployed](https://langchain-819yf1tbk-langchain.vercel.app/docs/modules/model_io/) image example: ![chrome_yZQ7BF2GTj](https://github.com/langchain-ai/langchain/assets/93204286/43a9a4d4-70fd-41c4-8978-b6240ff63ffa) You can see LangSmith traces for all the calls out to the LLM in the PRs merged into this one: * https://github.com/jonathanalgar/langchain/pull/6 * https://github.com/jonathanalgar/langchain/pull/4 * https://github.com/jonathanalgar/langchain/pull/3 I didn't add the following files to the PR as the images already have OK alt texts: * `27dca2d92f/docs/docs/integrations/providers/argilla.mdx (L3)` * `27dca2d92f/docs/docs/integrations/providers/apify.mdx (L11)` --------- Co-authored-by: github-actions <github-actions@github.com>	2024-01-12 14:37:48 -08:00
Varik Matevosyan	efe6cfafe2	community: Added Lantern as VectorStore (#12951 ) Support [Lantern](https://github.com/lanterndata/lantern) as a new VectorStore type. - Added Lantern as VectorStore. It will support 3 distance functions `l2 squared`, `cosine` and `hamming` and will use `HNSW` index. - Added tests - Added example notebook	2024-01-12 12:00:16 -08:00
Harrison Chase	1afac77439	stop making copies of inputs (#15926 )	2024-01-12 11:49:26 -08:00
Edwin Wenink	9fb09c1c30	community: fix the "page" mode in the AzureAIDocumentIntelligenceParser (bug) (#15958 ) Description: the "page" mode in the AzureAIDocumentIntelligenceParser is not accessible due to a wrong membership test. The mode argument can only be a string (also see the assertion in the `__init__`: `assert self.mode in ["single", "page", "object", "markdown"]`, so the check `elif self.mode == ["page"]:` always fails. As a result, effectively the "object" mode is used when selecting the "page" mode, which may lead to errors. The docstring of the `AzureAIDocumentIntelligenceLoader` also ommitted the `mode` parameter alltogether, so I added it. Issue: I could not find a related issue (this class is only 3 weeks old anyways) Dependencies: this PR does not introduce or affect dependencies. The current demo notebook and examples are not affected because they all use the default markdown mode.	2024-01-12 11:01:28 -08:00
Mahdi Setayesh	eb76f9c9fe	community: Fixing a performance issue with AzureSearch to perform batch embedding (#15594 ) - Description: Azure Cognitive Search vector DB store performs slow embedding as it does not utilize the batch embedding functionality. This PR provide a fix to improve the performance of Azure Search class when adding documents to the vector search, - Issue: #11313 , - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-12 10:58:55 -08:00
Christophe Bornet	bc60203d0f	Add documentation for AstraDBStore (#15953 ) Preview: https://langchain-git-fork-cbornet-astradb-store-doc-langchain.vercel.app/docs/integrations/stores/astradb	2024-01-12 10:44:46 -08:00
Bagatur	c697c89ca4	docs: add agent prompt creation examples (#15957 )	2024-01-12 10:26:12 -08:00
Erick Friis	69533c8628	multiple[patch]: .post releases and pyproject metadata (#15962 )	2024-01-12 10:09:02 -08:00
Rihards Gravis	6a48ea43ec	docs: Update Robocorp Action Server installation instructions (#15943 ) Description: Remove section on how to install Action Server and direct the users t o the instructions on Robocorp repository. Reason: Robocorp Action Server has moved from a pip installation to a standalone cli application and is due for changes. Because of that, leaving only LangChain integration relevant part in the documentation.	2024-01-12 09:46:18 -08:00
Erick Friis	6a2889a4ec	infra: retry release if not found on test pypi (#15913 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-12 09:36:52 -08:00
Erick Friis	95020637bc	openai[patch]: 0.0.2.post1, urls (#15961 )	2024-01-12 09:36:37 -08:00
ChengZi	d5808f786c	community: Support milvus partition key. (#15740 ) - Description: Milvus's partition key is an important feature. It can support multi-tenancy. We hope to introduce this feature. https://milvus.io/docs/partition_key.md - Issue: No - Dependencies: No - Twitter handle: No --------- Signed-off-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-12 09:15:03 -08:00
enfeng	13b90232c1	langchain-google-genai[patch]: Add support for end_point and transport parameters to the Gemini API (#15532 ) Add support for end_point and transport parameters to the Gemini API --------- Co-authored-by: yangenfeng <yangenfeng@xiaoniangao.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-12 08:52:00 -08:00
ohbeep	9b3962fc25	community: Add support of "http" URI for Milvus (#12710 ) (#15683 ) - Description: Add support of HTTP URI for Milvus - Issue: #12710 - Dependencies: N/A,	2024-01-11 21:55:35 -08:00
Raunak	e26e1f8b37	community: Added functions to make async calls to HuggingFaceHub's embedding endpoint in HuggingFaceHubEmbeddings class (#15737 ) Description: Added aembed_documents() and aembed_query() async functions in HuggingFaceHubEmbeddings class in langchain_community\embeddings\huggingface_hub.py file. It will support to make async calls to HuggingFaceHub's embedding endpoint and generate embeddings asynchronously. Test Cases: Added test_huggingfacehub_embedding_async_documents() and test_huggingfacehub_embedding_async_query() functions in test_huggingface_hub.py file to test the two async functions created in HuggingFaceHubEmbeddings class. Documentation: Updated huggingfacehub.ipynb with steps to install huggingface_hub package and use HuggingFaceHubEmbeddings. Dependencies: None, Twitter handle: I do not have a Twitter account --------- Co-authored-by: H161961 <Raunak.Raunak@Honeywell.com>	2024-01-11 21:52:55 -08:00
Tal	eb9b334a6b	Enable customizing the output parser of `OpenAIFunctionsAgent` (#15827 ) - Description: This PR defines the output parser of OpenAIFunctionsAgent as an attribute, enabling customization and subclassing of the parser logic. - Issue: Subclassing is currently impossible as the `OpenAIFunctionsAgentOutputParser` class is hard coded into the `plan` and `aplan` methods - Dependencies: None <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-11 21:52:36 -08:00
Mu Xian Ming	560bb49c99	docs: redis_chat_message_history.ipynb integration doc (#15789 ) - Description: Updated the docs for the memory integration module redis_chat_message_history.ipynb - Issue: #15664 - Dependencies: N/A Co-authored-by: Mu Xianming <mu.xianming@lmwn.com>	2024-01-11 21:42:31 -08:00
Christophe Bornet	81d1ba05dc	Add a BaseStore backed by AstraDB (#15812 ) - Description: this change adds a `BaseStore` backed by AstraDB - Twitter handle: cbornet_	2024-01-11 21:41:24 -08:00
manishsahni2000	74d9fc2f9e	PR community:Removing knn beta content in mongodb atlas vectorstore (#15865 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-11 21:40:54 -08:00
shahrin014	bdd90ae2ee	community: Ollama - Pass headers to post request (#15881 ) ## Feature - Set additional headers in constructor - Headers will be sent in post request This feature is useful if deploying Ollama on a cloud service such as hugging face, which requires authentication tokens to be passed in the request header. ## Tests - Test if header is passed - Test if header is not passed	2024-01-11 21:40:35 -08:00
Xin Liu	5efec068c9	feat: Implement `stream` interface (#15875 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Major changes: - Rename `wasm_chat.py` to `llama_edge.py` - Rename the `WasmChatService` class to `ChatService` - Implement the `stream` interface for `ChatService` - Add `test_chat_wasm_service_streaming` in the integration test - Update `llama_edge.ipynb` --------- Signed-off-by: Xin Liu <sam@secondstate.io>	2024-01-11 21:32:48 -08:00
Massimiliano Pronesti	ec4dab0449	feat(community): make Amadeus toolkit LLM-agnostic (#15879 ) - Description: `AmadeusToolkit` and `AmadeusClosestAirport` contained a hardcoded call to `ChatOpenAI`. This PR makes it LLM-independent, while guaranteeing backward compatibility. - Issue: #15847 - Dependencies: None @baskaryan <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-11 21:32:03 -08:00
JanHorcicka	f454e95461	langchain: fix OutputParserException (#15914 ) (#15916 ) Description: Fixes OutputParserException thrown by the output_parser when 'query' is 'Null'. Replace this entire comment with: - Description: Current implentation of output_parser throws OutputParserException if the response from the LLM contains `query: null`. This unfortunately happens for my use case. And since there is no way to modify the prompt used in SelfQueryRetriever, then we have to fix it here, so it doesn't crash. - Issue: https://github.com/langchain-ai/langchain/issues/15914 Didn't run tests. `make test` is not working. There is no `test` rule in the `Makefile`. Co-authored-by: Jan Horcicka <jhorcick@amazon.com>	2024-01-11 21:26:45 -08:00
Yacine	782dd44be9	<langchain_community.vectorstores>:<Fix pinecone.py __init__ docsrting instruction> (#15922 ) - Description: The pinecone docstring instructs to pass the embedding query text causing the warning below. It should be the embeddings object. warning message: UserWarning: Passing in `embedding` as a Callable is deprecated. Please pass in an Embeddings object instead. - Issue: NA - Dependencies: None @baskaryan	2024-01-11 21:26:33 -08:00
Nuno Campos	112208baa5	Passthrough configurable primitive values as tracer metadata (#15915 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-11 18:47:55 -08:00
William FH	129552e3d6	Rm deprecated (#15920 ) Remove the usage of deprecated methods in the test runner.	2024-01-11 18:10:49 -08:00
Nuno Campos	438beb6c94	Pass config specs through ensemble retriever (#15917 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-11 16:22:17 -08:00
Erick Friis	ebb6ad4f7a	mistralai[patch]: release 0.0.2 (#15912 )	2024-01-11 13:42:04 -08:00
Erick Friis	437cebc955	core[patch]: release 0.1.10 (#15911 )	2024-01-11 13:39:06 -08:00
Harrison Chase	80d41a8da3	add old serializable mapping (#15906 )	2024-01-11 13:03:12 -08:00
Erick Friis	623f87c888	community[patch]: pinecone bug (#15905 )	2024-01-11 11:44:07 -08:00
Eugene Yurtsev	44101b6b0e	Docs[patch]: Update OpenAI tools agent description (#15896 ) Update OpenAI tools agent description.	2024-01-11 14:39:11 -05:00
Eugene Yurtsev	46b7a8d913	Docs[patch]: Update agent quick start for agents (#15892 ) Minor change: 1) Update tool invocation to use .invoke 2) Show hub prompt	2024-01-11 14:38:48 -05:00
Jacob Lee	c11dbefedc	docs[patch]: Fix bad headers in output parser docs (#15778 ) Currently looks like this: <img width="282" alt="Screenshot 2024-01-09 at 1 08 53 PM" src="https://github.com/langchain-ai/langchain/assets/6952323/58f3d368-6588-418e-8502-30d13757cb99"> CC @efriis @baskaryan	2024-01-11 10:24:15 -08:00
Christophe Bornet	c56060bb7d	Add document loader section to Astra provider doc page (#15882 ) See preview: https://langchain-git-fork-cbornet-provider-astra-doc-loader-langchain.vercel.app/docs/integrations/providers/astradb#ocument-loader	2024-01-11 07:52:29 -08:00
xvjixiang	611f18c944	Docs: Fix a typo in elasticsearch vectorstore notebook (#15807 ) <!-- Thank you for contributing to LangChain! Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified. Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes if applicable, - Dependencies: any dependencies required for this change, - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/ If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. -->	2024-01-10 20:30:44 -08:00
axiangcoding	d5aa277b94	community: add collection_properties parameter to Milvus (#15788 ) - Description: add collection_properties parameter to Milvus. See [pymilvus set_properties() description](https://milvus.io/api-reference/pymilvus/v2.3.x/Collection/set_properties().md) - Issue: None - Dependencies: None - Twitter handle: None	2024-01-10 20:29:01 -08:00
mogith-pn	9e1ed17bfb	Community : Modified doc strings and example notebook for Clarifai (#15816 ) Community : Modified doc strings and example notebook for Clarifai Description: 1. Modified doc strings inside clarifai vectorstore class and embeddings. 2. Modified notebook examples. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-01-10 19:33:10 -08:00
Harrison Chase	97411e998f	[docs] add beautiful soup dependency (#15860 )	2024-01-10 19:32:55 -08:00
Daniel	6d299a55c0	docs: Update cohere.mdx, Text embedding had incorrect code snippet (#15840 ) text embedding code snippet was incorrect.	2024-01-10 19:25:29 -08:00
Sagar B Manjunath	e6240fecab	templates: Add NVIDIA Canonical RAG example chain (#15758 ) - Description: Adds a RAG template that uses NVIDIA AI playground and embedding models, along with Milvus vector store - Dependencies: This template depends on the AI playground service in NVIDIA NGC. API keys with a significant trial compute are available (10k queries at the time of writing). This template also depends on the Milvus Vector store which is publicly available. Note: [A quick link to get a key](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/codellama-13b/api) when you have an NGC account. Generate Key button at the top right of the code window. --------- Co-authored-by: Sagar B Manjunath <sbogadimanju@nvidia.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-10 18:39:16 -08:00
Erick Friis	38523d7c57	together[minor]: add llm (#15853 )	2024-01-10 17:55:34 -08:00
William FH	2895ca87cf	Update Evals Notebook (#15851 )	2024-01-10 16:33:34 -08:00
Erick Friis	ee708739c3	community[patch]: pinecone v3 support (#15849 ) Info in slack --------- Co-authored-by: Roie Schwaber-Cohen <roie.cohen@gmail.com>	2024-01-10 14:54:50 -08:00
Bagatur	18411c379c	docs: fix links (#15848 )	2024-01-10 17:39:06 -05:00
Lance Martin	9c871f427b	TogetherAI RAG (#15846 )	2024-01-10 14:28:05 -08:00
Eugene Yurtsev	a06db53c37	Add unit tests to test openai tools agent (#15843 ) This PR adds unit testing to test openai tools agent.	2024-01-10 17:06:30 -05:00
Harrison Chase	21a1538949	add raga reranker (#15838 )	2024-01-10 11:07:19 -08:00
Eugene Yurtsev	45f49ca439	infra: fix issue preview (#15836 ) Fixing the placeholder for the code example. GitHub collapses newlines when trying to use the text area, which is super confusing.	2024-01-10 13:27:07 -05:00
Eugene Yurtsev	c425e6f740	More updates to issue template (#15833 ) More update to issue template	2024-01-10 13:16:02 -05:00
Eugene Yurtsev	65980c22b8	Infra: Fix syntax error in BUG REPORT template (#15831 ) Fix syntax error in issue template	2024-01-10 12:39:08 -05:00
Eugene Yurtsev	e182d630f7	ISSUE_TEMPLATE: Update issue template (#15757 ) Drop some fields, re-order, start directing folks towards QA. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-10 12:35:41 -05:00
Bagatur	6432494f9d	infra: explicitly specify py path (#15826 )	2024-01-10 11:59:43 -05:00
Bagatur	79124fd71d	experimental[patch]: Release 0.0.49 (#15823 )	2024-01-10 11:23:19 -05:00
Harrison Chase	20abe24819	experimental[minor]: Add semantic chunker (#15799 )	2024-01-10 11:18:30 -05:00
Harrison Chase	a1d7f2b3e1	add dspy notebook (#15798 )	2024-01-10 08:01:08 -08:00
Eugene Yurtsev	feb41c5e28	langchain[patch]: Improve stream_log with AgentExecutor and Runnable Agent (#15792 ) This PR fixes an issue where AgentExecutor with RunnableAgent does not allow users to see individual llm tokens if streaming=True is not set explicitly on the underlying chat model. The majority of this PR is testing code: 1. Create a test chat model that makes it easier to test streaming and supports AIMessages that include function invocation information. 2. Tests for the chat model 3. Tests for RunnableAgent (previously untested) 4. Tests for openai agent (previously untested)	2024-01-10 10:53:01 -05:00
Erick Friis	85a4594ed7	community[patch]: more deprecations (#15782 )	2024-01-09 20:36:16 -08:00
Erick Friis	33dccf0f66	core[patch]: release 0.1.9 (#15794 )	2024-01-09 19:27:19 -08:00
Bagatur	942071bf57	docs: collapse structured use case (#15791 )	2024-01-09 21:47:09 -05:00
Erick Friis	0c95f3a981	mistralai[patch]: warn on stop token, fix on_llm_new_token (#15787 ) Fixes #15269 Addresses with warning. MistralAI API doesn't support stop token yet. --------- Co-authored-by: Niels Garve <info@nielsgarve.com>	2024-01-09 16:27:20 -08:00
Erick Friis	323941a90a	mistralai[patch]: persist async client (#15786 )	2024-01-09 16:21:39 -08:00
Tomaz Bratanic	3e0cd11f51	templates: Add neo4j semantic layer template (#15652 ) Co-authored-by: Tomaz Bratanic <tomazbratanic@Tomazs-MacBook-Pro.local> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-01-09 15:33:44 -08:00
NuODaniel	70b6315b23	community[patch]: fix qianfan chat stream calling caused exception (#13800 ) - Description: `QianfanChatEndpoint` extends `BaseChatModel` as a super class, which has a default stream implement might concat the MessageChunk with `__add__`. When call stream(), a ValueError for duplicated key will be raise. - Issues: * #13546 * #13548 * merge two single test file related to qianfan. - Dependencies: no - Tag maintainer: --------- Co-authored-by: root <liujun45@baidu.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-09 15:29:25 -08:00
Erick Friis	656e87beb9	core[patch]: add alternative_import to deprecated (#15781 )	2024-01-09 14:45:28 -08:00
Erick Friis	04a5a37e92	robocorp[patch]: fix readme, release 0.0.1.post1 (#15777 )	2024-01-09 12:53:57 -08:00
Erick Friis	ae67ba4dbb	templates: robocorp action server template (#15776 ) --------- Co-authored-by: Rihards Gravis <rihards@gravis.lv> Co-authored-by: Mikko Korpela <mikko@robocorp.com>	2024-01-09 12:41:20 -08:00
Erick Friis	91ec9da534	openai[patch]: unit test load (#15624 )	2024-01-09 11:54:11 -08:00
Erick Friis	7be72e1103	openai[patch], docs: readme (#15773 )	2024-01-09 11:52:24 -08:00
Bagatur	ee5bd986de	community[patch]: update oai deprecation message (#15681 ) addresses #15674	2024-01-09 14:36:58 -05:00
Erick Friis	7562f70c95	robocorp[minor]: Add robocorp action server toolkit (#15766 ) Co-authored-by: Rihards Gravis <rihards@gravis.lv> Co-authored-by: Mikko Korpela <mikko@robocorp.com>	2024-01-09 11:29:19 -08:00
Erick Friis	7bc100fd43	docs: integration package pip installs (#15762 ) More than 300 files - will fail check_diff. Will merge after Vercel deploy succeeds Still occurrences that need changing - will update more later	2024-01-09 11:13:10 -08:00
Bagatur	1b0db82dbe	docs: fix recognition (#15769 )	2024-01-09 13:57:28 -05:00

6322 changed files with 618334 additions and 400061 deletions

									
										2

.devcontainer/README.md
									
												View File
												
				@@ -10,7 +10,7 @@ You can use the dev container configuration in this folder to build and run the

				You may use the button above, or follow these steps to open this repo in a Codespace:

				1. Click the **Code** drop-down menu at the top of https://github.com/langchain-ai/langchain.

				1. Click on the **Codespaces** tab.

				1. Click **Create codespace on master** .

				1. Click **Create codespace on master**.

				For more info, check out the [GitHub documentation](https://docs.github.com/en/free-pro-team@latest/github/developing-online-with-codespaces/creating-a-codespace#creating-a-codespace).

									
										2

.devcontainer/devcontainer.json
									
												View File
												
				@@ -12,7 +12,7 @@

					// The optional 'workspaceFolder' property is the path VS Code should open by default when

					// connected. This is typically a file mount in .devcontainer/docker-compose.yml

					"workspaceFolder": "/workspaces/${localWorkspaceFolderBasename}",

					"workspaceFolder": "/workspaces/langchain",

					// Prevent the container from shutting down

					"overrideCommand": true

									
										8

.devcontainer/docker-compose.yaml
									
												View File
												
				@@ -5,10 +5,10 @@ services:

				      dockerfile: libs/langchain/dev.Dockerfile

				      context: ..

				    volumes:

				   # Update this to wherever you want VS Code to mount the folder of your project

				      - ..:/workspaces:cached

				      # Update this to wherever you want VS Code to mount the folder of your project

				      - ..:/workspaces/langchain:cached

				    networks:

				      - langchain-network 

				      - langchain-network

				  #   environment:

				  #     MONGO_ROOT_USERNAME: root

				  #     MONGO_ROOT_PASSWORD: example123

				@@ -28,5 +28,3 @@ services:

				networks:

				  langchain-network:

				    driver: bridge

									
										41

.github/CONTRIBUTING.md
									
										vendored
									
												View File
												
				@@ -3,43 +3,4 @@

				Hi there! Thank you for even being interested in contributing to LangChain.

				As an open-source project in a rapidly developing field, we are extremely open to contributions, whether they involve new features, improved infrastructure, better documentation, or bug fixes.

				To learn about how to contribute, please follow the [guides here](https://python.langchain.com/docs/contributing/)

				## 🗺️ Guidelines

				### 👩‍💻 Ways to contribute

				There are many ways to contribute to LangChain. Here are some common ways people contribute:

				- [**Documentation**](https://python.langchain.com/docs/contributing/documentation): Help improve our docs, including this one!

				- [**Code**](https://python.langchain.com/docs/contributing/code): Help us write code, fix bugs, or improve our infrastructure.

				- [**Integrations**](https://python.langchain.com/docs/contributing/integration): Help us integrate with your favorite vendors and tools.

				### 🚩GitHub Issues

				Our [issues](https://github.com/langchain-ai/langchain/issues) page is kept up to date with bugs, improvements, and feature requests.

				There is a taxonomy of labels to help with sorting and discovery of issues of interest. Please use these to help organize issues.

				If you start working on an issue, please assign it to yourself.

				If you are adding an issue, please try to keep it focused on a single, modular bug/improvement/feature.

				If two issues are related, or blocking, please link them rather than combining them.

				We will try to keep these issues as up-to-date as possible, though

				with the rapid rate of development in this field some may get out of date.

				If you notice this happening, please let us know.

				### 🙋Getting Help

				Our goal is to have the simplest developer setup possible. Should you experience any difficulty getting setup, please

				contact a maintainer! Not only do we want to help get you unblocked, but we also want to make sure that the process is

				smooth for future contributors.

				In a similar vein, we do enforce certain linting, formatting, and documentation standards in the codebase.

				If you are finding these difficult (or even just annoying) to work with, feel free to contact a maintainer for help -

				we do not want these to get in the way of getting good code into the codebase.

				### Contributor Documentation

				To learn about how to contribute, please follow the [guides here](https://python.langchain.com/docs/contributing/)

				To learn how to contribute to LangChain, please follow the [contribution guide here](https://python.langchain.com/docs/contributing/).

									
										38

.github/DISCUSSION_TEMPLATE/ideas.yml
									
										vendored
									
										Normal file
									
												View File
												
				@@ -0,0 +1,38 @@

				labels: [idea]

				body:

				  - type: checkboxes

				    id: checks

				    attributes:

				      label: Checked

				      description: Please confirm and check all the following options.

				      options:

				        - label: I searched existing ideas and did not find a similar one

				          required: true

				        - label: I added a very descriptive title

				          required: true

				        - label: I've clearly described the feature request and motivation for it

				          required: true

				  - type: textarea

				    id: feature-request

				    validations:

				      required: true

				    attributes:

				      label: Feature request

				      description: |

				        A clear and concise description of the feature proposal. Please provide links to any relevant GitHub repos, papers, or other resources if relevant.

				  - type: textarea

				    id: motivation

				    validations:

				      required: true

				    attributes:

				      label: Motivation

				      description: |

				        Please outline the motivation for the proposal. Is your feature request related to a problem? e.g., I'm always frustrated when [...]. If this is related to another GitHub issue, please link here too.

				  - type: textarea

				    id: proposal

				    validations:

				      required: false

				    attributes:

				      label: Proposal (If applicable)

				      description: |

				        If you would like to propose a solution, please describe it here.

									
										122

.github/DISCUSSION_TEMPLATE/q-a.yml
									
										vendored
									
										Normal file
									
												View File
												
				@@ -0,0 +1,122 @@

				labels: [Question]

				body:

				  - type: markdown

				    attributes:

				      value: |

				        Thanks for your interest in LangChain 🦜️🔗!

				        Please follow these instructions, fill every question, and do every step. 🙏

				        We're asking for this because answering questions and solving problems in GitHub takes a lot of time --

				        this is time that we cannot spend on adding new features, fixing bugs, writing documentation or reviewing pull requests.

				        By asking questions in a structured way (following this) it will be much easier for us to help you.

				        There's a high chance that by following this process, you'll find the solution on your own, eliminating the need to submit a question and wait for an answer. 😎

				        As there are many questions submitted every day, we will **DISCARD** and close the incomplete ones. 

				        That will allow us (and others) to focus on helping people like you that follow the whole process. 🤓

				        Relevant links to check before opening a question to see if your question has already been answered, fixed or

				        if there's another way to solve your problem:

				        [LangChain documentation with the integrated search](https://python.langchain.com/docs/get_started/introduction),

				        [API Reference](https://api.python.langchain.com/en/stable/),

				        [GitHub search](https://github.com/langchain-ai/langchain),

				        [LangChain Github Discussions](https://github.com/langchain-ai/langchain/discussions),

				        [LangChain Github Issues](https://github.com/langchain-ai/langchain/issues?q=is%3Aissue),

				        [LangChain ChatBot](https://chat.langchain.com/)

				  - type: checkboxes

				    id: checks

				    attributes:

				      label: Checked other resources

				      description: Please confirm and check all the following options.

				      options:

				        - label: I added a very descriptive title to this question.

				          required: true

				        - label: I searched the LangChain documentation with the integrated search.

				          required: true

				        - label: I used the GitHub search to find a similar question and didn't find it.

				          required: true

				  - type: checkboxes

				    id: help

				    attributes:

				      label: Commit to Help

				      description: |

				        After submitting this, I commit to one of:

				          * Read open questions until I find 2 where I can help someone and add a comment to help there.

				          * I already hit the "watch" button in this repository to receive notifications and I commit to help at least 2 people that ask questions in the future.

				          * Once my question is answered, I will mark the answer as "accepted".

				      options:

				        - label: I commit to help with one of those options 👆

				          required: true

				  - type: textarea

				    id: example

				    attributes:

				      label: Example Code

				      description: |

				        Please add a self-contained, [minimal, reproducible, example](https://stackoverflow.com/help/minimal-reproducible-example) with your use case.

				        If a maintainer can copy it, run it, and see it right away, there's a much higher chance that you'll be able to get help.

				        **Important!** 

				        * Use code tags (e.g., ```python ... ```) to correctly [format your code](https://help.github.com/en/github/writing-on-github/creating-and-highlighting-code-blocks#syntax-highlighting).

				        * INCLUDE the language label (e.g. `python`) after the first three backticks to enable syntax highlighting. (e.g., ```python rather than ```).

				        * Reduce your code to the minimum required to reproduce the issue if possible. This makes it much easier for others to help you.

				        * Avoid screenshots when possible, as they are hard to read and (more importantly) don't allow others to copy-and-paste your code.

				      placeholder: |

				        from langchain_core.runnables import RunnableLambda

				        def bad_code(inputs) -> int:

				          raise NotImplementedError('For demo purpose')

				          chain = RunnableLambda(bad_code)

				          chain.invoke('Hello!')

				      render: python

				    validations:

				      required: true

				  - type: textarea

				    id: description

				    attributes:

				      label: Description

				      description: |

				        What is the problem, question, or error?

				        Write a short description explaining what you are doing, what you expect to happen, and what is currently happening.

				      placeholder: |

				        * I'm trying to use the `langchain` library to do X.

				        * I expect to see Y.

				        * Instead, it does Z.

				    validations:

				      required: true

				  - type: textarea

				    id: system-info

				    attributes:

				      label: System Info

				      description: |

				        Please share your system info with us. 

				        "pip freeze | grep langchain" 

				        platform (windows / linux / mac)

				        python version

				        OR if you're on a recent version of langchain-core you can paste the output of:

				        python -m langchain_core.sys_info

				      placeholder: |

				        "pip freeze | grep langchain"

				        platform

				        python version

				        Alternatively, if you're on a recent version of langchain-core you can paste the output of:

				        python -m langchain_core.sys_info

				        These will only surface LangChain packages, don't forget to include any other relevant

				        packages you're using (if you're not sure what's relevant, you can paste the entire output of `pip freeze`).

				    validations:

				      required: true

									
										182

.github/ISSUE_TEMPLATE/bug-report.yml
									
										vendored
									
												View File
												
				@@ -1,106 +1,120 @@

				name: "\U0001F41B Bug Report"

				description: Submit a bug report to help us improve LangChain. To report a security issue, please instead use the security option below.

				description: Report a bug in LangChain. To report a security issue, please instead use the security option below. For questions, please use the GitHub Discussions.

				labels: ["02 Bug Report"]

				body:

				  - type: markdown

				    attributes:

				      value: >

				        Thank you for taking the time to file a bug report. Before creating a new

				        issue, please make sure to take a few moments to check the issue tracker

				        for existing issues about the bug.

				  - type: textarea

				    id: system-info

				    attributes:

				      label: System Info

				      description: Please share your system info with us.

				      placeholder: LangChain version, platform, python version, ...

				    validations:

				      required: true

				  - type: textarea

				    id: who-can-help

				    attributes:

				      label: Who can help?

				      description: |

				        Your issue will be replied to more quickly if you can figure out the right person to tag with @

				        If you know how to use git blame, that is the easiest way, otherwise, here is a rough guide of **who to tag**.

				        The core maintainers strive to read all issues, but tagging them will help them prioritize.

				        Please tag fewer than 3 people.

				        @hwchase17 - project lead

				        Tracing / Callbacks

				        - @agola11

				        Async

				        - @agola11

				        DataLoader Abstractions

				        - @eyurtsev

				        LLM/Chat Wrappers

				        - @hwchase17

				        - @agola11

				        Tools / Toolkits

				        - ...

				      placeholder: "@Username ..."

				        Thank you for taking the time to file a bug report. 

				        Use this to report bugs in LangChain. 

				        If you're not certain that your issue is due to a bug in LangChain, please use [GitHub Discussions](https://github.com/langchain-ai/langchain/discussions)

				        to ask for help with your issue.

				        Relevant links to check before filing a bug report to see if your issue has already been reported, fixed or

				        if there's another way to solve your problem:

				        [LangChain documentation with the integrated search](https://python.langchain.com/docs/get_started/introduction),

				        [API Reference](https://api.python.langchain.com/en/stable/),

				        [GitHub search](https://github.com/langchain-ai/langchain),

				        [LangChain Github Discussions](https://github.com/langchain-ai/langchain/discussions),

				        [LangChain Github Issues](https://github.com/langchain-ai/langchain/issues?q=is%3Aissue),

				        [LangChain ChatBot](https://chat.langchain.com/)

				  - type: checkboxes

				    id: information-scripts-examples

				    id: checks

				    attributes:

				      label: Information

				      description: "The problem arises when using:"

				      label: Checked other resources

				      description: Please confirm and check all the following options.

				      options:

				        - label: "The official example notebooks/scripts"

				        - label: "My own modified scripts"

				  - type: checkboxes

				    id: related-components

				    attributes:

				      label: Related Components

				      description: "Select the components related to the issue (if applicable):"

				      options:

				        - label: "LLMs/Chat Models"

				        - label: "Embedding Models"

				        - label: "Prompts / Prompt Templates / Prompt Selectors"

				        - label: "Output Parsers"

				        - label: "Document Loaders"

				        - label: "Vector Stores / Retrievers"

				        - label: "Memory"

				        - label: "Agents / Agent Executors"

				        - label: "Tools / Toolkits"

				        - label: "Chains"

				        - label: "Callbacks/Tracing"

				        - label: "Async"

				        - label: I added a very descriptive title to this issue.

				          required: true

				        - label: I searched the LangChain documentation with the integrated search.

				          required: true

				        - label: I used the GitHub search to find a similar question and didn't find it.

				          required: true

				        - label: I am sure that this is a bug in LangChain rather than my code.

				          required: true

				        - label: The bug is not resolved by updating to the latest stable version of LangChain (or the specific integration package).

				          required: true

				  - type: textarea

				    id: reproduction

				    validations:

				      required: true

				    attributes:

				      label: Reproduction

				      label: Example Code

				      description: |

				        Please provide a [code sample](https://stackoverflow.com/help/minimal-reproducible-example) that reproduces the problem you ran into. It can be a Colab link or just a code snippet.

				        If you have code snippets, error messages, stack traces please provide them here as well.

				        Important! Use code tags to correctly format your code. See https://help.github.com/en/github/writing-on-github/creating-and-highlighting-code-blocks#syntax-highlighting

				        Avoid screenshots when possible, as they are hard to read and (more importantly) don't allow others to copy-and-paste your code.

				        Please add a self-contained, [minimal, reproducible, example](https://stackoverflow.com/help/minimal-reproducible-example) with your use case.

				        If a maintainer can copy it, run it, and see it right away, there's a much higher chance that you'll be able to get help.

				        **Important!** 

				        * Use code tags (e.g., ```python ... ```) to correctly [format your code](https://help.github.com/en/github/writing-on-github/creating-and-highlighting-code-blocks#syntax-highlighting).

				        * INCLUDE the language label (e.g. `python`) after the first three backticks to enable syntax highlighting. (e.g., ```python rather than ```).

				        * Reduce your code to the minimum required to reproduce the issue if possible. This makes it much easier for others to help you.

				        * Avoid screenshots when possible, as they are hard to read and (more importantly) don't allow others to copy-and-paste your code.

				      placeholder: |

				        Steps to reproduce the behavior:

				          1.

				          2.

				          3.

				        The following code: 

				        ```python

				        from langchain_core.runnables import RunnableLambda

				        def bad_code(inputs) -> int:

				          raise NotImplementedError('For demo purpose')

				          chain = RunnableLambda(bad_code)

				          chain.invoke('Hello!')

				        ```

				  - type: textarea

				    id: expected-behavior

				    id: error

				    validations:

				      required: false

				    attributes:

				      label: Error Message and Stack Trace (if applicable)

				      description: |

				        If you are reporting an error, please include the full error message and stack trace.

				      placeholder: |

				        Exception + full stack trace

				  - type: textarea

				    id: description

				    attributes:

				      label: Description

				      description: |

				        What is the problem, question, or error?

				        Write a short description telling what you are doing, what you expect to happen, and what is currently happening.

				      placeholder: |

				        * I'm trying to use the `langchain` library to do X.

				        * I expect to see Y.

				        * Instead, it does Z.

				    validations:

				      required: true

				  - type: textarea

				    id: system-info

				    attributes:

				      label: Expected behavior

				      description: "A clear and concise description of what you would expect to happen."

				      label: System Info

				      description: |

				        Please share your system info with us. 

				        "pip freeze | grep langchain" 

				        platform (windows / linux / mac)

				        python version

				        OR if you're on a recent version of langchain-core you can paste the output of:

				        python -m langchain_core.sys_info

				      placeholder: |

				        "pip freeze | grep langchain"

				        platform

				        python version

				        Alternatively, if you're on a recent version of langchain-core you can paste the output of:

				        python -m langchain_core.sys_info

				        These will only surface LangChain packages, don't forget to include any other relevant

				        packages you're using (if you're not sure what's relevant, you can paste the entire output of `pip freeze`).

				    validations:

				      required: true

									
										13

.github/ISSUE_TEMPLATE/config.yml
									
										vendored
									
												View File
												
				@@ -1,9 +1,12 @@

				blank_issues_enabled: true

				blank_issues_enabled: false

				version: 2.1

				contact_links:

				  - name: 🤔 Question or Problem

				    about: Ask a question or ask about a problem in GitHub Discussions.

				    url: https://github.com/langchain-ai/langchain/discussions

				  - name: Discord

				    url: https://discord.gg/6adMQxSpJS

				    about: General community discussions

				    url: https://www.github.com/langchain-ai/langchain/discussions/categories/q-a

				  - name: Feature Request

				    url: https://www.github.com/langchain-ai/langchain/discussions/categories/ideas

				    about: Suggest a feature or an idea

				  - name: Show and tell

				    about: Show what you built with LangChain

				    url: https://www.github.com/langchain-ai/langchain/discussions/categories/show-and-tell

									
										45

.github/ISSUE_TEMPLATE/documentation.yml
									
										vendored
									
												View File
												
				@@ -4,16 +4,55 @@ title: "DOC: <Please write a comprehensive title after the 'DOC: ' prefix>"

				labels: [03 - Documentation]

				body:

				- type: markdown

				  attributes:

				    value: >

				      Thank you for taking the time to report an issue in the documentation.

				      Only report issues with documentation here, explain if there are

				      any missing topics or if you found a mistake in the documentation.

				      Do **NOT** use this to ask usage questions or reporting issues with your code.

				      If you have usage questions or need help solving some problem, 

				      please use [GitHub Discussions](https://github.com/langchain-ai/langchain/discussions).

				      If you're in the wrong place, here are some helpful links to find a better

				      place to ask your question:

				      [LangChain documentation with the integrated search](https://python.langchain.com/docs/get_started/introduction),

				      [API Reference](https://api.python.langchain.com/en/stable/),

				      [GitHub search](https://github.com/langchain-ai/langchain),

				      [LangChain Github Discussions](https://github.com/langchain-ai/langchain/discussions),

				      [LangChain Github Issues](https://github.com/langchain-ai/langchain/issues?q=is%3Aissue),

				      [LangChain ChatBot](https://chat.langchain.com/)

				- type: input

				  id: url

				  attributes:

				    label: URL

				    description: URL to documentation

				  validations:

				    required: false

				- type: checkboxes

				  id: checks

				  attributes:

				    label: Checklist

				    description: Please confirm and check all the following options.

				    options:

				      - label: I added a very descriptive title to this issue.

				        required: true

				      - label: I included a link to the documentation page I am referring to (if applicable).

				        required: true

				- type: textarea

				  attributes: 

				    label: "Issue with current documentation:"

				    description: >

				      Please make sure to leave a reference to the document/code you're

				      referring to.

				      referring to. Feel free to include names of classes, functions, methods

				      or concepts you'd like to see documented more.

				- type: textarea

				  attributes:

				    label: "Idea or request for content:"

				    description: >

				      Please describe as clearly as possible what topics you think are missing

				      from the current documentation.

				      from the current documentation.

									
										30

.github/ISSUE_TEMPLATE/feature-request.yml
									
										vendored
									
												View File
											
				@@ -1,30 +0,0 @@

				name: "\U0001F680 Feature request"

				description: Submit a proposal/request for a new LangChain feature

				labels: ["02 Feature Request"]

				body:

				  - type: textarea

				    id: feature-request

				    validations:

				      required: true

				    attributes:

				      label: Feature request

				      description: |

				        A clear and concise description of the feature proposal. Please provide links to any relevant GitHub repos, papers, or other resources if relevant.

				  - type: textarea

				    id: motivation

				    validations:

				      required: true

				    attributes:

				      label: Motivation

				      description: |

				        Please outline the motivation for the proposal. Is your feature request related to a problem? e.g., I'm always frustrated when [...]. If this is related to another GitHub issue, please link here too.

				  - type: textarea

				    id: contribution

				    validations:

				      required: true

				    attributes:

				      label: Your contribution

				      description: |

				        Is there any way that you could help, e.g. by submitting a PR? Make sure to read the [Contributing Guide](https://python.langchain.com/docs/contributing/)

									
										18

.github/ISSUE_TEMPLATE/other.yml
									
										vendored
									
												View File
											
				@@ -1,18 +0,0 @@

				name: Other Issue

				description: Raise an issue that wouldn't be covered by the other templates.

				title: "Issue: <Please write a comprehensive title after the 'Issue: ' prefix>"

				labels: [04 - Other]

				body:

				  - type: textarea

				    attributes:

				      label: "Issue you'd like to raise."

				      description: >

				        Please describe the issue you'd like to raise as clearly as possible.

				        Make sure to include any relevant links or references.

				  - type: textarea

				    attributes:

				      label: "Suggestion:"

				      description: >

				        Please outline a suggestion to improve the issue here.

									
										25

.github/ISSUE_TEMPLATE/privileged.yml
									
										vendored
									
										Normal file
									
												View File
												
				@@ -0,0 +1,25 @@

				name: 🔒 Privileged

				description: You are a LangChain maintainer, or was asked directly by a maintainer to create an issue here. If not, check the other options.

				body:

				  - type: markdown

				    attributes:

				      value: |

				        Thanks for your interest in LangChain! 🚀

				        If you are not a LangChain maintainer or were not asked directly by a maintainer to create an issue, then please start the conversation in a [Question in GitHub Discussions](https://github.com/langchain-ai/langchain/discussions/categories/q-a) instead.

				        You are a LangChain maintainer if you maintain any of the packages inside of the LangChain repository 

				        or are a regular contributor to LangChain with previous merged pull requests.

				  - type: checkboxes

				    id: privileged

				    attributes:

				      label: Privileged issue

				      description: Confirm that you are allowed to create an issue here.

				      options:

				        - label: I am a LangChain maintainer, or was asked directly by a LangChain maintainer to create an issue here.

				          required: true

				  - type: textarea

				    id: content

				    attributes:

				      label: Issue Content

				      description: Add the content of the issue here.

									
										33

.github/PULL_REQUEST_TEMPLATE.md
									
										vendored
									
												View File
												
				@@ -1,20 +1,29 @@

				<!-- Thank you for contributing to LangChain!

				Thank you for contributing to LangChain!

				Please title your PR "<package>: <description>", where <package> is whichever of langchain, community, core, experimental, etc. is being modified.

				- [ ] **PR title**: "package: description"

				  - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes.

				  - Example: "community: add foobar LLM"

				Replace this entire comment with:

				  - **Description:** a description of the change, 

				  - **Issue:** the issue # it fixes if applicable,

				  - **Dependencies:** any dependencies required for this change,

				  - **Twitter handle:** we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out!

				Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` from the root of the package you've modified to check this locally.

				- [ ] **PR message**: ***Delete this entire checklist*** and replace with

				    - **Description:** a description of the change

				    - **Issue:** the issue # it fixes, if applicable

				    - **Dependencies:** any dependencies required for this change

				    - **Twitter handle:** if your PR gets announced, and you'd like a mention, we'll gladly shout you out!

				See contribution guidelines for more information on how to write/run tests, lint, etc: https://python.langchain.com/docs/contributing/

				If you're adding a new integration, please include:

				- [ ] **Add tests and docs**: If you're adding a new integration, please include

				  1. a test for the integration, preferably unit tests that do not rely on network access,

				  2. an example notebook showing its use. It lives in `docs/docs/integrations` directory.

				If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17.

				 -->

				- [ ] **Lint and test**: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/

				Additional guidelines:

				- Make sure optional dependencies are imported within a function.

				- Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests.

				- Most PRs should not touch more than one package.

				- Changes should be backwards compatible.

				- If you are adding something to community, do not re-import it in langchain.

				If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.

									
										7

.github/actions/people/Dockerfile
									
										vendored
									
										Normal file
									
												View File
												
				@@ -0,0 +1,7 @@

				FROM python:3.9

				RUN pip install httpx PyGithub "pydantic==2.0.2" pydantic-settings "pyyaml>=5.3.1,<6.0.0"

				COPY ./app /app

				CMD ["python", "/app/main.py"]

									
										11

.github/actions/people/action.yml
									
										vendored
									
										Normal file
									
												View File
												
				@@ -0,0 +1,11 @@

				# Adapted from https://github.com/tiangolo/fastapi/blob/master/.github/actions/people/action.yml

				name: "Generate LangChain People"

				description: "Generate the data for the LangChain People page"

				author: "Jacob Lee <jacob@langchain.dev>"

				inputs:

				  token:

				    description: 'User token, to read the GitHub API. Can be passed in using {{ secrets.LANGCHAIN_PEOPLE_GITHUB_TOKEN }}'

				    required: true

				runs:

				  using: 'docker'

				  image: 'Dockerfile'

									
										646

.github/actions/people/app/main.py
									
										vendored
									
										Normal file
									
												View File
												
				@@ -0,0 +1,646 @@

				# Adapted from https://github.com/tiangolo/fastapi/blob/master/.github/actions/people/app/main.py

				import logging

				import subprocess

				import sys

				from collections import Counter

				from datetime import datetime, timedelta, timezone

				from pathlib import Path

				from typing import Any, Container, Dict, List, Set, Union

				import httpx

				import yaml

				from github import Github

				from pydantic import BaseModel, SecretStr

				from pydantic_settings import BaseSettings

				github_graphql_url = "https://api.github.com/graphql"

				questions_category_id = "DIC_kwDOIPDwls4CS6Ve"

				# discussions_query = """

				# query Q($after: String, $category_id: ID) {

				#   repository(name: "langchain", owner: "langchain-ai") {

				#     discussions(first: 100, after: $after, categoryId: $category_id) {

				#       edges {

				#         cursor

				#         node {

				#           number

				#           author {

				#             login

				#             avatarUrl

				#             url

				#           }

				#           title

				#           createdAt

				#           comments(first: 100) {

				#             nodes {

				#               createdAt

				#               author {

				#                 login

				#                 avatarUrl

				#                 url

				#               }

				#               isAnswer

				#               replies(first: 10) {

				#                 nodes {

				#                   createdAt

				#                   author {

				#                     login

				#                     avatarUrl

				#                     url

				#                   }

				#                 }

				#               }

				#             }

				#           }

				#         }

				#       }

				#     }

				#   }

				# }

				# """

				# issues_query = """

				# query Q($after: String) {

				#   repository(name: "langchain", owner: "langchain-ai") {

				#     issues(first: 100, after: $after) {

				#       edges {

				#         cursor

				#         node {

				#           number

				#           author {

				#             login

				#             avatarUrl

				#             url

				#           }

				#           title

				#           createdAt

				#           state

				#           comments(first: 100) {

				#             nodes {

				#               createdAt

				#               author {

				#                 login

				#                 avatarUrl

				#                 url

				#               }

				#             }

				#           }

				#         }

				#       }

				#     }

				#   }

				# }

				# """

				prs_query = """

				query Q($after: String) {

				  repository(name: "langchain", owner: "langchain-ai") {

				    pullRequests(first: 100, after: $after, states: MERGED) {

				      edges {

				        cursor

				        node {

				          changedFiles

				          additions

				          deletions

				          number

				          labels(first: 100) {

				            nodes {

				              name

				            }

				          }

				          author {

				            login

				            avatarUrl

				            url

				            ... on User {

				              twitterUsername

				            }

				          }

				          title

				          createdAt

				          state

				          reviews(first:100) {

				            nodes {

				              author {

				                login

				                avatarUrl

				                url

				                ... on User {

				                  twitterUsername

				                }

				              }

				              state

				            }

				          }

				        }

				      }

				    }

				  }

				}

				"""

				class Author(BaseModel):

				    login: str

				    avatarUrl: str

				    url: str

				    twitterUsername: Union[str, None] = None

				# Issues and Discussions

				class CommentsNode(BaseModel):

				    createdAt: datetime

				    author: Union[Author, None] = None

				class Replies(BaseModel):

				    nodes: List[CommentsNode]

				class DiscussionsCommentsNode(CommentsNode):

				    replies: Replies

				class Comments(BaseModel):

				    nodes: List[CommentsNode]

				class DiscussionsComments(BaseModel):

				    nodes: List[DiscussionsCommentsNode]

				class IssuesNode(BaseModel):

				    number: int

				    author: Union[Author, None] = None

				    title: str

				    createdAt: datetime

				    state: str

				    comments: Comments

				class DiscussionsNode(BaseModel):

				    number: int

				    author: Union[Author, None] = None

				    title: str

				    createdAt: datetime

				    comments: DiscussionsComments

				class IssuesEdge(BaseModel):

				    cursor: str

				    node: IssuesNode

				class DiscussionsEdge(BaseModel):

				    cursor: str

				    node: DiscussionsNode

				class Issues(BaseModel):

				    edges: List[IssuesEdge]

				class Discussions(BaseModel):

				    edges: List[DiscussionsEdge]

				class IssuesRepository(BaseModel):

				    issues: Issues

				class DiscussionsRepository(BaseModel):

				    discussions: Discussions

				class IssuesResponseData(BaseModel):

				    repository: IssuesRepository

				class DiscussionsResponseData(BaseModel):

				    repository: DiscussionsRepository

				class IssuesResponse(BaseModel):

				    data: IssuesResponseData

				class DiscussionsResponse(BaseModel):

				    data: DiscussionsResponseData

				# PRs

				class LabelNode(BaseModel):

				    name: str

				class Labels(BaseModel):

				    nodes: List[LabelNode]

				class ReviewNode(BaseModel):

				    author: Union[Author, None] = None

				    state: str

				class Reviews(BaseModel):

				    nodes: List[ReviewNode]

				class PullRequestNode(BaseModel):

				    number: int

				    labels: Labels

				    author: Union[Author, None] = None

				    changedFiles: int

				    additions: int

				    deletions: int

				    title: str

				    createdAt: datetime

				    state: str

				    reviews: Reviews

				    # comments: Comments

				class PullRequestEdge(BaseModel):

				    cursor: str

				    node: PullRequestNode

				class PullRequests(BaseModel):

				    edges: List[PullRequestEdge]

				class PRsRepository(BaseModel):

				    pullRequests: PullRequests

				class PRsResponseData(BaseModel):

				    repository: PRsRepository

				class PRsResponse(BaseModel):

				    data: PRsResponseData

				class Settings(BaseSettings):

				    input_token: SecretStr

				    github_repository: str

				    httpx_timeout: int = 30

				def get_graphql_response(

				    *,

				    settings: Settings,

				    query: str,

				    after: Union[str, None] = None,

				    category_id: Union[str, None] = None,

				) -> Dict[str, Any]:

				    headers = {"Authorization": f"token {settings.input_token.get_secret_value()}"}

				    # category_id is only used by one query, but GraphQL allows unused variables, so

				    # keep it here for simplicity

				    variables = {"after": after, "category_id": category_id}

				    response = httpx.post(

				        github_graphql_url,

				        headers=headers,

				        timeout=settings.httpx_timeout,

				        json={"query": query, "variables": variables, "operationName": "Q"},

				    )

				    if response.status_code != 200:

				        logging.error(

				            f"Response was not 200, after: {after}, category_id: {category_id}"

				        )

				        logging.error(response.text)

				        raise RuntimeError(response.text)

				    data = response.json()

				    if "errors" in data:

				        logging.error(f"Errors in response, after: {after}, category_id: {category_id}")

				        logging.error(data["errors"])

				        logging.error(response.text)

				        raise RuntimeError(response.text)

				    return data

				# def get_graphql_issue_edges(*, settings: Settings, after: Union[str, None] = None):

				#     data = get_graphql_response(settings=settings, query=issues_query, after=after)

				#     graphql_response = IssuesResponse.model_validate(data)

				#     return graphql_response.data.repository.issues.edges

				# def get_graphql_question_discussion_edges(

				#     *,

				#     settings: Settings,

				#     after: Union[str, None] = None,

				# ):

				#     data = get_graphql_response(

				#         settings=settings,

				#         query=discussions_query,

				#         after=after,

				#         category_id=questions_category_id,

				#     )

				#     graphql_response = DiscussionsResponse.model_validate(data)

				#     return graphql_response.data.repository.discussions.edges

				def get_graphql_pr_edges(*, settings: Settings, after: Union[str, None] = None):

				    if after is None:

				        print("Querying PRs...")

				    else:

				        print(f"Querying PRs with cursor {after}...")

				    data = get_graphql_response(settings=settings, query=prs_query, after=after)

				    graphql_response = PRsResponse.model_validate(data)

				    return graphql_response.data.repository.pullRequests.edges

				# def get_issues_experts(settings: Settings):

				#     issue_nodes: List[IssuesNode] = []

				#     issue_edges = get_graphql_issue_edges(settings=settings)

				#     while issue_edges:

				#         for edge in issue_edges:

				#             issue_nodes.append(edge.node)

				#         last_edge = issue_edges[-1]

				#         issue_edges = get_graphql_issue_edges(settings=settings, after=last_edge.cursor)

				#     commentors = Counter()

				#     last_month_commentors = Counter()

				#     authors: Dict[str, Author] = {}

				#     now = datetime.now(tz=timezone.utc)

				#     one_month_ago = now - timedelta(days=30)

				#     for issue in issue_nodes:

				#         issue_author_name = None

				#         if issue.author:

				#             authors[issue.author.login] = issue.author

				#             issue_author_name = issue.author.login

				#         issue_commentors = set()

				#         for comment in issue.comments.nodes:

				#             if comment.author:

				#                 authors[comment.author.login] = comment.author

				#                 if comment.author.login != issue_author_name:

				#                     issue_commentors.add(comment.author.login)

				#         for author_name in issue_commentors:

				#             commentors[author_name] += 1

				#             if issue.createdAt > one_month_ago:

				#                 last_month_commentors[author_name] += 1

				#     return commentors, last_month_commentors, authors

				# def get_discussions_experts(settings: Settings):

				#     discussion_nodes: List[DiscussionsNode] = []

				#     discussion_edges = get_graphql_question_discussion_edges(settings=settings)

				#     while discussion_edges:

				#         for discussion_edge in discussion_edges:

				#             discussion_nodes.append(discussion_edge.node)

				#         last_edge = discussion_edges[-1]

				#         discussion_edges = get_graphql_question_discussion_edges(

				#             settings=settings, after=last_edge.cursor

				#         )

				#     commentors = Counter()

				#     last_month_commentors = Counter()

				#     authors: Dict[str, Author] = {}

				#     now = datetime.now(tz=timezone.utc)

				#     one_month_ago = now - timedelta(days=30)

				#     for discussion in discussion_nodes:

				#         discussion_author_name = None

				#         if discussion.author:

				#             authors[discussion.author.login] = discussion.author

				#             discussion_author_name = discussion.author.login

				#         discussion_commentors = set()

				#         for comment in discussion.comments.nodes:

				#             if comment.author:

				#                 authors[comment.author.login] = comment.author

				#                 if comment.author.login != discussion_author_name:

				#                     discussion_commentors.add(comment.author.login)

				#             for reply in comment.replies.nodes:

				#                 if reply.author:

				#                     authors[reply.author.login] = reply.author

				#                     if reply.author.login != discussion_author_name:

				#                         discussion_commentors.add(reply.author.login)

				#         for author_name in discussion_commentors:

				#             commentors[author_name] += 1

				#             if discussion.createdAt > one_month_ago:

				#                 last_month_commentors[author_name] += 1

				#     return commentors, last_month_commentors, authors

				# def get_experts(settings: Settings):

				#     (

				#         discussions_commentors,

				#         discussions_last_month_commentors,

				#         discussions_authors,

				#     ) = get_discussions_experts(settings=settings)

				#     commentors = discussions_commentors

				#     last_month_commentors = discussions_last_month_commentors

				#     authors = {**discussions_authors}

				#     return commentors, last_month_commentors, authors

				def _logistic(x, k):

				    return x / (x + k)

				def get_contributors(settings: Settings):

				    pr_nodes: List[PullRequestNode] = []

				    pr_edges = get_graphql_pr_edges(settings=settings)

				    while pr_edges:

				        for edge in pr_edges:

				            pr_nodes.append(edge.node)

				        last_edge = pr_edges[-1]

				        pr_edges = get_graphql_pr_edges(settings=settings, after=last_edge.cursor)

				    contributors = Counter()

				    contributor_scores = Counter()

				    recent_contributor_scores = Counter()

				    reviewers = Counter()

				    authors: Dict[str, Author] = {}

				    for pr in pr_nodes:

				        pr_reviewers: Set[str] = set()

				        for review in pr.reviews.nodes:

				            if review.author:

				                authors[review.author.login] = review.author

				                pr_reviewers.add(review.author.login)

				        for reviewer in pr_reviewers:

				            reviewers[reviewer] += 1

				        if pr.author:

				            authors[pr.author.login] = pr.author

				            contributors[pr.author.login] += 1

				            files_changed = pr.changedFiles

				            lines_changed = pr.additions + pr.deletions

				            score = _logistic(files_changed, 20) + _logistic(lines_changed, 100)

				            contributor_scores[pr.author.login] += score

				            three_months_ago = datetime.now(timezone.utc) - timedelta(days=3 * 30)

				            if pr.createdAt > three_months_ago:

				                recent_contributor_scores[pr.author.login] += score

				    return (

				        contributors,

				        contributor_scores,

				        recent_contributor_scores,

				        reviewers,

				        authors,

				    )

				def get_top_users(

				    *,

				    counter: Counter,

				    min_count: int,

				    authors: Dict[str, Author],

				    skip_users: Container[str],

				):

				    users = []

				    for commentor, count in counter.most_common():

				        if commentor in skip_users:

				            continue

				        if count >= min_count:

				            author = authors[commentor]

				            users.append(

				                {

				                    "login": commentor,

				                    "count": count,

				                    "avatarUrl": author.avatarUrl,

				                    "twitterUsername": author.twitterUsername,

				                    "url": author.url,

				                }

				            )

				    return users

				if __name__ == "__main__":

				    logging.basicConfig(level=logging.INFO)

				    settings = Settings()

				    logging.info(f"Using config: {settings.model_dump_json()}")

				    g = Github(settings.input_token.get_secret_value())

				    repo = g.get_repo(settings.github_repository)

				    # question_commentors, question_last_month_commentors, question_authors = get_experts(

				    #     settings=settings

				    # )

				    (

				        contributors,

				        contributor_scores,

				        recent_contributor_scores,

				        reviewers,

				        pr_authors,

				    ) = get_contributors(settings=settings)

				    # authors = {**question_authors, **pr_authors}

				    authors = {**pr_authors}

				    maintainers_logins = {

				        "hwchase17",

				        "agola11",

				        "baskaryan",

				        "hinthornw",

				        "nfcampos",

				        "efriis",

				        "eyurtsev",

				        "rlancemartin",

				        "ccurme",

				        "vbarda",

				    }

				    hidden_logins = {

				        "dev2049",

				        "vowelparrot",

				        "obi1kenobi",

				        "langchain-infra",

				        "jacoblee93",

				        "isahers1",

				        "dqbd",

				        "bracesproul",

				        "akira",

				    }

				    bot_names = {"dosubot", "github-actions", "CodiumAI-Agent"}

				    maintainers = []

				    for login in maintainers_logins:

				        user = authors[login]

				        maintainers.append(

				            {

				                "login": login,

				                "count": contributors[login],  # + question_commentors[login],

				                "avatarUrl": user.avatarUrl,

				                "twitterUsername": user.twitterUsername,

				                "url": user.url,

				            }

				        )

				    # min_count_expert = 10

				    # min_count_last_month = 3

				    min_score_contributor = 1

				    min_count_reviewer = 5

				    skip_users = maintainers_logins | bot_names | hidden_logins

				    # experts = get_top_users(

				    #     counter=question_commentors,

				    #     min_count=min_count_expert,

				    #     authors=authors,

				    #     skip_users=skip_users,

				    # )

				    # last_month_active = get_top_users(

				    #     counter=question_last_month_commentors,

				    #     min_count=min_count_last_month,

				    #     authors=authors,

				    #     skip_users=skip_users,

				    # )

				    top_recent_contributors = get_top_users(

				        counter=recent_contributor_scores,

				        min_count=min_score_contributor,

				        authors=authors,

				        skip_users=skip_users,

				    )

				    top_contributors = get_top_users(

				        counter=contributor_scores,

				        min_count=min_score_contributor,

				        authors=authors,

				        skip_users=skip_users,

				    )

				    top_reviewers = get_top_users(

				        counter=reviewers,

				        min_count=min_count_reviewer,

				        authors=authors,

				        skip_users=skip_users,

				    )

				    people = {

				        "maintainers": maintainers,

				        # "experts": experts,

				        # "last_month_active": last_month_active,

				        "top_recent_contributors": top_recent_contributors,

				        "top_contributors": top_contributors,

				        "top_reviewers": top_reviewers,

				    }

				    people_path = Path("./docs/data/people.yml")

				    people_old_content = people_path.read_text(encoding="utf-8")

				    new_people_content = yaml.dump(

				        people, sort_keys=False, width=200, allow_unicode=True

				    )

				    if people_old_content == new_people_content:

				        logging.info("The LangChain People data hasn't changed, finishing.")

				        sys.exit(0)

				    people_path.write_text(new_people_content, encoding="utf-8")

				    logging.info("Setting up GitHub Actions git user")

				    subprocess.run(["git", "config", "user.name", "github-actions"], check=True)

				    subprocess.run(

				        ["git", "config", "user.email", "github-actions@github.com"], check=True

				    )

				    branch_name = "langchain/langchain-people"

				    logging.info(f"Creating a new branch {branch_name}")

				    subprocess.run(["git", "checkout", "-B", branch_name], check=True)

				    logging.info("Adding updated file")

				    subprocess.run(["git", "add", str(people_path)], check=True)

				    logging.info("Committing updated file")

				    message = "👥 Update LangChain people data"

				    result = subprocess.run(["git", "commit", "-m", message], check=True)

				    logging.info("Pushing branch")

				    subprocess.run(["git", "push", "origin", branch_name, "-f"], check=True)

				    logging.info("Creating PR")

				    pr = repo.create_pull(title=message, body=message, base="master", head=branch_name)

				    logging.info(f"Created PR: {pr.number}")

				    logging.info("Finished")

									
										8

.github/actions/poetry_setup/action.yml
									
										vendored
									
												View File
												
				@@ -28,10 +28,11 @@ runs:

				  steps:

				    - uses: actions/setup-python@v5

				      name: Setup python ${{ inputs.python-version }}

				      id: setup-python

				      with:

				        python-version: ${{ inputs.python-version }}

				    - uses: actions/cache@v3

				    - uses: actions/cache@v4

				      id: cache-bin-poetry

				      name: Cache Poetry binary - Python ${{ inputs.python-version }}

				      env:

				@@ -74,10 +75,11 @@ runs:

				      env:

				        POETRY_VERSION: ${{ inputs.poetry-version }}

				        PYTHON_VERSION: ${{ inputs.python-version }}

				      run: pipx install "poetry==$POETRY_VERSION" --python "python$PYTHON_VERSION" --verbose

				      # Install poetry using the python version installed by setup-python step.

				      run: pipx install "poetry==$POETRY_VERSION" --python '${{ steps.setup-python.outputs.python-path }}' --verbose

				    - name: Restore pip and poetry cached dependencies

				      uses: actions/cache@v3

				      uses: actions/cache@v4

				      env:

				        SEGMENT_DOWNLOAD_TIMEOUT_MIN: "4"

				        WORKDIR: ${{ inputs.working-directory == '' && '.' || inputs.working-directory }}

									
										243

.github/scripts/check_diff.py
									
										vendored
									
												View File
												
				@@ -1,22 +1,156 @@

				import glob

				import json

				import sys

				import os

				import sys

				import tomllib

				from collections import defaultdict

				from typing import Dict, List, Set

				from pathlib import Path

				LANGCHAIN_DIRS = {

				LANGCHAIN_DIRS = [

				    "libs/core",

				    "libs/text-splitters",

				    "libs/langchain",

				    "libs/experimental",

				    "libs/community",

				}

				    "libs/experimental",

				]

				def all_package_dirs() -> Set[str]:

				    return {

				        "/".join(path.split("/")[:-1]).lstrip("./")

				        for path in glob.glob("./libs/**/pyproject.toml", recursive=True)

				        if "libs/cli" not in path and "libs/standard-tests" not in path

				    }

				def dependents_graph() -> dict:

				    """

				    Construct a mapping of package -> dependents, such that we can

				    run tests on all dependents of a package when a change is made.

				    """

				    dependents = defaultdict(set)

				    for path in glob.glob("./libs/**/pyproject.toml", recursive=True):

				        if "template" in path:

				            continue

				        # load regular and test deps from pyproject.toml

				        with open(path, "rb") as f:

				            pyproject = tomllib.load(f)["tool"]["poetry"]

				        pkg_dir = "libs" + "/".join(path.split("libs")[1].split("/")[:-1])

				        for dep in [

				            *pyproject["dependencies"].keys(),

				            *pyproject["group"]["test"]["dependencies"].keys(),

				        ]:

				            if "langchain" in dep:

				                dependents[dep].add(pkg_dir)

				                continue

				        # load extended deps from extended_testing_deps.txt

				        package_path = Path(path).parent

				        extended_requirement_path = package_path / "extended_testing_deps.txt"

				        if extended_requirement_path.exists():

				            with open(extended_requirement_path, "r") as f:

				                extended_deps = f.read().splitlines()

				                for depline in extended_deps:

				                    if depline.startswith("-e "):

				                        # editable dependency

				                        assert depline.startswith(

				                            "-e ../partners/"

				                        ), "Extended test deps should only editable install partner packages"

				                        partner = depline.split("partners/")[1]

				                        dep = f"langchain-{partner}"

				                    else:

				                        dep = depline.split("==")[0]

				                    if "langchain" in dep:

				                        dependents[dep].add(pkg_dir)

				    return dependents

				def add_dependents(dirs_to_eval: Set[str], dependents: dict) -> List[str]:

				    updated = set()

				    for dir_ in dirs_to_eval:

				        # handle core manually because it has so many dependents

				        if "core" in dir_:

				            updated.add(dir_)

				            continue

				        pkg = "langchain-" + dir_.split("/")[-1]

				        updated.update(dependents[pkg])

				        updated.add(dir_)

				    return list(updated)

				def _get_configs_for_single_dir(job: str, dir_: str) -> List[Dict[str, str]]:

				    if dir_ == "libs/core":

				        return [

				            {"working-directory": dir_, "python-version": f"3.{v}"}

				            for v in range(8, 13)

				        ]

				    min_python = "3.8"

				    max_python = "3.12"

				    # custom logic for specific directories

				    if dir_ == "libs/partners/milvus":

				        # milvus poetry doesn't allow 3.12 because they

				        # declare deps in funny way

				        max_python = "3.11"

				    if dir_ in ["libs/community", "libs/langchain"] and job == "extended-tests":

				        # community extended test resolution in 3.12 is slow

				        # even in uv

				        max_python = "3.11"

				    if dir_ == "libs/community" and job == "compile-integration-tests":

				        # community integration deps are slow in 3.12

				        max_python = "3.11"

				    return [

				        {"working-directory": dir_, "python-version": min_python},

				        {"working-directory": dir_, "python-version": max_python},

				    ]

				def _get_configs_for_multi_dirs(

				    job: str, dirs_to_run: List[str], dependents: dict

				) -> List[Dict[str, str]]:

				    if job == "lint":

				        dirs = add_dependents(

				            dirs_to_run["lint"] | dirs_to_run["test"] | dirs_to_run["extended-test"],

				            dependents,

				        )

				    elif job in ["test", "compile-integration-tests", "dependencies"]:

				        dirs = add_dependents(

				            dirs_to_run["test"] | dirs_to_run["extended-test"], dependents

				        )

				    elif job == "extended-tests":

				        dirs = list(dirs_to_run["extended-test"])

				    else:

				        raise ValueError(f"Unknown job: {job}")

				    return [

				        config for dir_ in dirs for config in _get_configs_for_single_dir(job, dir_)

				    ]

				if __name__ == "__main__":

				    files = sys.argv[1:]

				    dirs_to_run = set()

				    if len(files) == 300:

				    dirs_to_run: Dict[str, set] = {

				        "lint": set(),

				        "test": set(),

				        "extended-test": set(),

				    }

				    docs_edited = False

				    if len(files) >= 300:

				        # max diff length is 300 files - there are likely files missing

				        raise ValueError("Max diff reached. Please manually run CI on changed libs.")

				        dirs_to_run["lint"] = all_package_dirs()

				        dirs_to_run["test"] = all_package_dirs()

				        dirs_to_run["extended-test"] = set(LANGCHAIN_DIRS)

				    for file in files:

				        if any(

				            file.startswith(dir_)

				@@ -24,33 +158,74 @@ if __name__ == "__main__":

				                ".github/workflows",

				                ".github/tools",

				                ".github/actions",

				                "libs/core",

				                ".github/scripts/check_diff.py",

				            )

				        ):

				            dirs_to_run.update(LANGCHAIN_DIRS)

				        elif "libs/community" in file:

				            dirs_to_run.update(

				                ("libs/community", "libs/langchain", "libs/experimental")

				            )

				        elif "libs/partners" in file:

				            partner_dir = file.split("/")[2]

				            if os.path.isdir(f"libs/partners/{partner_dir}"):

				                dirs_to_run.update(

				                    (

				                        f"libs/partners/{partner_dir}",

				                        "libs/langchain",

				                        "libs/experimental",

				                    )

				                )

				            # Skip if the directory was deleted

				        elif "libs/langchain" in file:

				            dirs_to_run.update(("libs/langchain", "libs/experimental"))

				        elif "libs/experimental" in file:

				            dirs_to_run.add("libs/experimental")

				        elif file.startswith("libs/"):

				            dirs_to_run.update(LANGCHAIN_DIRS)

				        else:

				            # add all LANGCHAIN_DIRS for infra changes

				            dirs_to_run["extended-test"].update(LANGCHAIN_DIRS)

				            dirs_to_run["lint"].add(".")

				        if any(file.startswith(dir_) for dir_ in LANGCHAIN_DIRS):

				            # add that dir and all dirs after in LANGCHAIN_DIRS

				            # for extended testing

				            found = False

				            for dir_ in LANGCHAIN_DIRS:

				                if file.startswith(dir_):

				                    found = True

				                if found:

				                    dirs_to_run["extended-test"].add(dir_)

				        elif file.startswith("libs/standard-tests"):

				            # TODO: update to include all packages that rely on standard-tests (all partner packages)

				            # note: won't run on external repo partners

				            dirs_to_run["lint"].add("libs/standard-tests")

				            dirs_to_run["test"].add("libs/partners/mistralai")

				            dirs_to_run["test"].add("libs/partners/openai")

				            dirs_to_run["test"].add("libs/partners/anthropic")

				            dirs_to_run["test"].add("libs/partners/ai21")

				            dirs_to_run["test"].add("libs/partners/fireworks")

				            dirs_to_run["test"].add("libs/partners/groq")

				        elif file.startswith("libs/cli"):

				            # todo: add cli makefile

				            pass

				    json_output = json.dumps(list(dirs_to_run))

				    print(f"dirs-to-run={json_output}")

				        elif file.startswith("libs/partners"):

				            partner_dir = file.split("/")[2]

				            if os.path.isdir(f"libs/partners/{partner_dir}") and [

				                filename

				                for filename in os.listdir(f"libs/partners/{partner_dir}")

				                if not filename.startswith(".")

				            ] != ["README.md"]:

				                dirs_to_run["test"].add(f"libs/partners/{partner_dir}")

				            # Skip if the directory was deleted or is just a tombstone readme

				        elif file.startswith("libs/"):

				            raise ValueError(

				                f"Unknown lib: {file}. check_diff.py likely needs "

				                "an update for this new library!"

				            )

				        elif any(file.startswith(p) for p in ["docs/", "templates/", "cookbook/"]):

				            if file.startswith("docs/"):

				                docs_edited = True

				            dirs_to_run["lint"].add(".")

				    dependents = dependents_graph()

				    # we now have dirs_by_job

				    # todo: clean this up

				    map_job_to_configs = {

				        job: _get_configs_for_multi_dirs(job, dirs_to_run, dependents)

				        for job in [

				            "lint",

				            "test",

				            "extended-tests",

				            "compile-integration-tests",

				            "dependencies",

				        ]

				    }

				    map_job_to_configs["test-doc-imports"] = (

				        [{"python-version": "3.12"}] if docs_edited else []

				    )

				    for key, value in map_job_to_configs.items():

				        json_output = json.dumps(value)

				        print(f"{key}={json_output}")

									
										35

.github/scripts/check_prerelease_dependencies.py
									
										vendored
									
										Normal file
									
												View File
												
				@@ -0,0 +1,35 @@

				import sys

				import tomllib

				if __name__ == "__main__":

				    # Get the TOML file path from the command line argument

				    toml_file = sys.argv[1]

				    # read toml file

				    with open(toml_file, "rb") as file:

				        toml_data = tomllib.load(file)

				    # see if we're releasing an rc

				    version = toml_data["tool"]["poetry"]["version"]

				    releasing_rc = "rc" in version

				    # if not, iterate through dependencies and make sure none allow prereleases

				    if not releasing_rc:

				        dependencies = toml_data["tool"]["poetry"]["dependencies"]

				        for lib in dependencies:

				            dep_version = dependencies[lib]

				            dep_version_string = (

				                dep_version["version"] if isinstance(dep_version, dict) else dep_version

				            )

				            if "rc" in dep_version_string:

				                raise ValueError(

				                    f"Dependency {lib} has a prerelease version. Please remove this."

				                )

				            if isinstance(dep_version, dict) and dep_version.get(

				                "allow-prereleases", False

				            ):

				                raise ValueError(

				                    f"Dependency {lib} has allow-prereleases set to true. Please remove this."

				                )

									
										91

.github/scripts/get_min_versions.py
									
										vendored
									
										Normal file
									
												View File
												
				@@ -0,0 +1,91 @@

				import sys

				if sys.version_info >= (3, 11):

				    import tomllib

				else:

				    # for python 3.10 and below, which doesnt have stdlib tomllib

				    import tomli as tomllib

				from packaging.version import parse as parse_version

				import re

				MIN_VERSION_LIBS = [

				    "langchain-core",

				    "langchain-community",

				    "langchain",

				    "langchain-text-splitters",

				    "SQLAlchemy",

				]

				SKIP_IF_PULL_REQUEST = ["langchain-core"]

				def get_min_version(version: str) -> str:

				    # base regex for x.x.x with cases for rc/post/etc

				    # valid strings: https://peps.python.org/pep-0440/#public-version-identifiers

				    vstring = r"\d+(?:\.\d+){0,2}(?:(?:a|b|rc|\.post|\.dev)\d+)?"

				    # case ^x.x.x

				    _match = re.match(f"^\\^({vstring})$", version)

				    if _match:

				        return _match.group(1)

				    # case >=x.x.x,<y.y.y

				    _match = re.match(f"^>=({vstring}),<({vstring})$", version)

				    if _match:

				        _min = _match.group(1)

				        _max = _match.group(2)

				        assert parse_version(_min) < parse_version(_max)

				        return _min

				    # case x.x.x

				    _match = re.match(f"^({vstring})$", version)

				    if _match:

				        return _match.group(1)

				    raise ValueError(f"Unrecognized version format: {version}")

				def get_min_version_from_toml(toml_path: str, versions_for: str):

				    # Parse the TOML file

				    with open(toml_path, "rb") as file:

				        toml_data = tomllib.load(file)

				    # Get the dependencies from tool.poetry.dependencies

				    dependencies = toml_data["tool"]["poetry"]["dependencies"]

				    # Initialize a dictionary to store the minimum versions

				    min_versions = {}

				    # Iterate over the libs in MIN_VERSION_LIBS

				    for lib in MIN_VERSION_LIBS:

				        if versions_for == "pull_request" and lib in SKIP_IF_PULL_REQUEST:

				            # some libs only get checked on release because of simultaneous

				            # changes

				            continue

				        # Check if the lib is present in the dependencies

				        if lib in dependencies:

				            # Get the version string

				            version_string = dependencies[lib]

				            if isinstance(version_string, dict):

				                version_string = version_string["version"]

				            # Use parse_version to get the minimum supported version from version_string

				            min_version = get_min_version(version_string)

				            # Store the minimum version in the min_versions dictionary

				            min_versions[lib] = min_version

				    return min_versions

				if __name__ == "__main__":

				    # Get the TOML file path from the command line argument

				    toml_file = sys.argv[1]

				    versions_for = sys.argv[2]

				    assert versions_for in ["release", "pull_request"]

				    # Call the function to get the minimum versions

				    min_versions = get_min_version_from_toml(toml_file, versions_for)

				    print(" ".join([f"{lib}=={version}" for lib, version in min_versions.items()]))

7

.github/workflows/.codespell-exclude vendored Normal file

View File

@@ -0,0 +1,7 @@
 libs/community/langchain_community/llms/yuan2.py
 "NotIn": "not in",
 - `/checkin`: Check-in
 docs/docs/integrations/providers/trulens.mdx
 self.assertIn(
 from trulens_eval import Tru
 tru = Tru()

									
										106

.github/workflows/_all_ci.yml
									
										vendored
									
												View File
											
				@@ -1,106 +0,0 @@

				---

				name: langchain CI

				on:

				  workflow_call:

				    inputs:

				      working-directory:

				        required: true

				        type: string

				        description: "From which folder this pipeline executes"

				  workflow_dispatch:

				    inputs:

				      working-directory:

				        required: true

				        type: choice

				        default: 'libs/langchain'

				        options:

				        - libs/langchain

				        - libs/core

				        - libs/experimental

				        - libs/community

				# If another push to the same PR or branch happens while this workflow is still running,

				# cancel the earlier run in favor of the next run.

				#

				# There's no point in testing an outdated version of the code. GitHub only allows

				# a limited number of job runners to be active at the same time, so it's better to cancel

				# pointless jobs early so that more useful jobs can run sooner.

				concurrency:

				  group: ${{ github.workflow }}-${{ github.ref }}-${{ inputs.working-directory }}

				  cancel-in-progress: true

				env:

				  POETRY_VERSION: "1.6.1"

				jobs:

				  lint:

				    uses: ./.github/workflows/_lint.yml

				    with:

				      working-directory: ${{ inputs.working-directory }}

				    secrets: inherit

				  test:

				    uses: ./.github/workflows/_test.yml

				    with:

				      working-directory: ${{ inputs.working-directory }}

				    secrets: inherit

				  compile-integration-tests:

				    uses: ./.github/workflows/_compile_integration_test.yml

				    with:

				      working-directory: ${{ inputs.working-directory }}

				    secrets: inherit

				  dependencies:

				    uses: ./.github/workflows/_dependencies.yml

				    with:

				      working-directory: ${{ inputs.working-directory }}

				    secrets: inherit

				  extended-tests:

				    runs-on: ubuntu-latest

				    strategy:

				      matrix:

				        python-version:

				          - "3.8"

				          - "3.9"

				          - "3.10"

				          - "3.11"

				    name: Python ${{ matrix.python-version }} extended tests

				    defaults:

				      run:

				        working-directory: ${{ inputs.working-directory }}

				    if: ${{ ! startsWith(inputs.working-directory, 'libs/partners/') }}

				    steps:

				      - uses: actions/checkout@v4

				      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}

				        uses: "./.github/actions/poetry_setup"

				        with:

				          python-version: ${{ matrix.python-version }}

				          poetry-version: ${{ env.POETRY_VERSION }}

				          working-directory: ${{ inputs.working-directory }}

				          cache-key: extended

				      - name: Install dependencies

				        shell: bash

				        run: |

				          echo "Running extended tests, installing dependencies with poetry..."

				          poetry install -E extended_testing --with test

				      - name: Run extended tests

				        run: make extended_tests

				      - name: Ensure the tests did not create any additional files

				        shell: bash

				        run: |

				          set -eu

				          STATUS="$(git status)"

				          echo "$STATUS"

				          # grep will exit non-zero if the target message isn't found,

				          # and `set -e` above will cause the step to fail.

				          echo "$STATUS" | grep 'nothing to commit, working tree clean'

									
										19

.github/workflows/_compile_integration_test.yml
									
										vendored
									
												View File
												
				@@ -7,9 +7,13 @@ on:

				        required: true

				        type: string

				        description: "From which folder this pipeline executes"

				      python-version:

				        required: true

				        type: string

				        description: "Python version to use"

				env:

				  POETRY_VERSION: "1.6.1"

				  POETRY_VERSION: "1.7.1"

				jobs:

				  build:

				@@ -17,21 +21,14 @@ jobs:

				      run:

				        working-directory: ${{ inputs.working-directory }}

				    runs-on: ubuntu-latest

				    strategy:

				      matrix:

				        python-version:

				          - "3.8"

				          - "3.9"

				          - "3.10"

				          - "3.11"

				    name: Python ${{ matrix.python-version }}

				    name: "poetry run pytest -m compile tests/integration_tests #${{ inputs.python-version }}"

				    steps:

				      - uses: actions/checkout@v4

				      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}

				      - name: Set up Python ${{ inputs.python-version }} + Poetry ${{ env.POETRY_VERSION }}

				        uses: "./.github/actions/poetry_setup"

				        with:

				          python-version: ${{ matrix.python-version }}

				          python-version: ${{ inputs.python-version }}

				          poetry-version: ${{ env.POETRY_VERSION }}

				          working-directory: ${{ inputs.working-directory }}

				          cache-key: compile-integration

									
										23

.github/workflows/_dependencies.yml
									
										vendored
									
												View File
												
				@@ -11,9 +11,13 @@ on:

				        required: false

				        type: string

				        description: "Relative path to the langchain library folder"

				      python-version:

				        required: true

				        type: string

				        description: "Python version to use"

				env:

				  POETRY_VERSION: "1.6.1"

				  POETRY_VERSION: "1.7.1"

				jobs:

				  build:

				@@ -21,21 +25,14 @@ jobs:

				      run:

				        working-directory: ${{ inputs.working-directory }}

				    runs-on: ubuntu-latest

				    strategy:

				      matrix:

				        python-version:

				          - "3.8"

				          - "3.9"

				          - "3.10"

				          - "3.11"

				    name: dependencies - Python ${{ matrix.python-version }}

				    name: dependency checks ${{ inputs.python-version }}

				    steps:

				      - uses: actions/checkout@v4

				      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}

				      - name: Set up Python ${{ inputs.python-version }} + Poetry ${{ env.POETRY_VERSION }}

				        uses: "./.github/actions/poetry_setup"

				        with:

				          python-version: ${{ matrix.python-version }}

				          python-version: ${{ inputs.python-version }}

				          poetry-version: ${{ env.POETRY_VERSION }}

				          working-directory: ${{ inputs.working-directory }}

				          cache-key: pydantic-cross-compat

				@@ -63,6 +60,8 @@ jobs:

				      - name: Install the opposite major version of pydantic

				        # If normal tests use pydantic v1, here we'll use v2, and vice versa.

				        shell: bash

				        # airbyte currently doesn't support pydantic v2

				        if: ${{ !startsWith(inputs.working-directory, 'libs/partners/airbyte') }}

				        run: |

				          # Determine the major part of pydantic version

				          REGULAR_VERSION=$(poetry run python -c "import pydantic; print(pydantic.__version__)" | cut -d. -f1)

				@@ -97,6 +96,8 @@ jobs:

				          fi

				          echo "Found pydantic version ${CURRENT_VERSION}, as expected"

				      - name: Run pydantic compatibility tests

				        # airbyte currently doesn't support pydantic v2

				        if: ${{ !startsWith(inputs.working-directory, 'libs/partners/airbyte') }}

				        shell: bash

				        run: make test

									
										51

.github/workflows/_integration_test.yml
									
										vendored
									
												View File
												
				@@ -6,9 +6,13 @@ on:

				      working-directory:

				        required: true

				        type: string

				      python-version:

				        required: true

				        type: string

				        description: "Python version to use"

				env:

				  POETRY_VERSION: "1.6.1"

				  POETRY_VERSION: "1.7.1"

				jobs:

				  build:

				@@ -16,19 +20,14 @@ jobs:

				      run:

				        working-directory: ${{ inputs.working-directory }}

				    runs-on: ubuntu-latest

				    strategy:

				      matrix:

				        python-version:

				          - "3.8"

				          - "3.11"

				    name: Python ${{ matrix.python-version }}

				    name: Python ${{ inputs.python-version }}

				    steps:

				      - uses: actions/checkout@v4

				      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}

				      - name: Set up Python ${{ inputs.python-version }} + Poetry ${{ env.POETRY_VERSION }}

				        uses: "./.github/actions/poetry_setup"

				        with:

				          python-version: ${{ matrix.python-version }}

				          python-version: ${{ inputs.python-version }}

				          poetry-version: ${{ env.POETRY_VERSION }}

				          working-directory: ${{ inputs.working-directory }}

				          cache-key: core

				@@ -37,6 +36,11 @@ jobs:

				        shell: bash

				        run: poetry install --with test,test_integration

				      - name: Install deps outside pyproject

				        if: ${{ startsWith(inputs.working-directory, 'libs/community/') }}

				        shell: bash

				        run: poetry run pip install "boto3<2" "google-cloud-aiplatform<2"

				      - name: 'Authenticate to Google Cloud'

				        id: 'auth'

				        uses: google-github-actions/auth@v2

				@@ -46,11 +50,40 @@ jobs:

				      - name: Run integration tests

				        shell: bash

				        env:

				          AI21_API_KEY: ${{ secrets.AI21_API_KEY }}

				          FIREWORKS_API_KEY: ${{ secrets.FIREWORKS_API_KEY }}

				          GOOGLE_API_KEY: ${{ secrets.GOOGLE_API_KEY }}

				          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}

				          AZURE_OPENAI_API_VERSION: ${{ secrets.AZURE_OPENAI_API_VERSION }}

				          AZURE_OPENAI_API_BASE: ${{ secrets.AZURE_OPENAI_API_BASE }}

				          AZURE_OPENAI_API_KEY: ${{ secrets.AZURE_OPENAI_API_KEY }}

				          AZURE_OPENAI_CHAT_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_CHAT_DEPLOYMENT_NAME }}

				          AZURE_OPENAI_LLM_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_LLM_DEPLOYMENT_NAME }}

				          AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME }}

				          MISTRAL_API_KEY: ${{ secrets.MISTRAL_API_KEY }}

				          TOGETHER_API_KEY: ${{ secrets.TOGETHER_API_KEY }}

				          OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}

				          GROQ_API_KEY: ${{ secrets.GROQ_API_KEY }}

				          NVIDIA_API_KEY: ${{ secrets.NVIDIA_API_KEY }}

				          GOOGLE_SEARCH_API_KEY: ${{ secrets.GOOGLE_SEARCH_API_KEY }}

				          GOOGLE_CSE_ID: ${{ secrets.GOOGLE_CSE_ID }}

				          EXA_API_KEY: ${{ secrets.EXA_API_KEY }}

				          NOMIC_API_KEY: ${{ secrets.NOMIC_API_KEY }}

				          WATSONX_APIKEY: ${{ secrets.WATSONX_APIKEY }}

				          WATSONX_PROJECT_ID: ${{ secrets.WATSONX_PROJECT_ID }}

				          PINECONE_API_KEY: ${{ secrets.PINECONE_API_KEY }}

				          PINECONE_ENVIRONMENT: ${{ secrets.PINECONE_ENVIRONMENT }}

				          ASTRA_DB_API_ENDPOINT: ${{ secrets.ASTRA_DB_API_ENDPOINT }}

				          ASTRA_DB_APPLICATION_TOKEN: ${{ secrets.ASTRA_DB_APPLICATION_TOKEN }}

				          ASTRA_DB_KEYSPACE: ${{ secrets.ASTRA_DB_KEYSPACE }}

				          ES_URL: ${{ secrets.ES_URL }}

				          ES_CLOUD_ID: ${{ secrets.ES_CLOUD_ID }}

				          ES_API_KEY: ${{ secrets.ES_API_KEY }}

				          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }} # for airbyte

				          MONGODB_ATLAS_URI: ${{ secrets.MONGODB_ATLAS_URI }}

				          VOYAGE_API_KEY: ${{ secrets.VOYAGE_API_KEY }}

				          COHERE_API_KEY: ${{ secrets.COHERE_API_KEY }}

				          UPSTAGE_API_KEY: ${{ secrets.UPSTAGE_API_KEY }}

				        run: |

				          make integration_tests

									
										39

.github/workflows/_lint.yml
									
										vendored
									
												View File
												
				@@ -11,9 +11,13 @@ on:

				        required: false

				        type: string

				        description: "Relative path to the langchain library folder"

				      python-version:

				        required: true

				        type: string

				        description: "Python version to use"

				env:

				  POETRY_VERSION: "1.6.1"

				  POETRY_VERSION: "1.7.1"

				  WORKDIR: ${{ inputs.working-directory == '' && '.' || inputs.working-directory }}

				  # This env var allows us to get inline annotations when ruff has complaints.

				@@ -21,26 +25,15 @@ env:

				jobs:

				  build:

				    name: "make lint #${{ inputs.python-version }}"

				    runs-on: ubuntu-latest

				    strategy:

				      matrix:

				        # Only lint on the min and max supported Python versions.

				        # It's extremely unlikely that there's a lint issue on any version in between

				        # that doesn't show up on the min or max versions.

				        #

				        # GitHub rate-limits how many jobs can be running at any one time.

				        # Starting new jobs is also relatively slow,

				        # so linting on fewer versions makes CI faster.

				        python-version:

				          - "3.8"

				          - "3.11"

				    steps:

				      - uses: actions/checkout@v4

				      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}

				      - name: Set up Python ${{ inputs.python-version }} + Poetry ${{ env.POETRY_VERSION }}

				        uses: "./.github/actions/poetry_setup"

				        with:

				          python-version: ${{ matrix.python-version }}

				          python-version: ${{ inputs.python-version }}

				          poetry-version: ${{ env.POETRY_VERSION }}

				          working-directory: ${{ inputs.working-directory }}

				          cache-key: lint-with-extras

				@@ -79,13 +72,13 @@ jobs:

				          poetry run pip install -e "$LANGCHAIN_LOCATION"

				      - name: Get .mypy_cache to speed up mypy

				        uses: actions/cache@v3

				        uses: actions/cache@v4

				        env:

				          SEGMENT_DOWNLOAD_TIMEOUT_MIN: "2"

				        with:

				          path: |

				            ${{ env.WORKDIR }}/.mypy_cache

				          key: mypy-lint-${{ runner.os }}-${{ runner.arch }}-py${{ matrix.python-version }}-${{ inputs.working-directory }}-${{ hashFiles(format('{0}/poetry.lock', env.WORKDIR)) }}

				          key: mypy-lint-${{ runner.os }}-${{ runner.arch }}-py${{ inputs.python-version }}-${{ inputs.working-directory }}-${{ hashFiles(format('{0}/poetry.lock', inputs.working-directory)) }}

				      - name: Analysing the code with our lint

				@@ -93,7 +86,7 @@ jobs:

				        run: |

				          make lint_package

				      - name: Install test dependencies

				      - name: Install unit test dependencies

				        # Also installs dev/lint/test/typing dependencies, to ensure we have

				        # type hints for as many of our libraries as possible.

				        # This helps catch errors that require dependencies to be spotted, for example:

				@@ -102,18 +95,24 @@ jobs:

				        # If you change this configuration, make sure to change the `cache-key`

				        # in the `poetry_setup` action above to stop using the old cache.

				        # It doesn't matter how you change it, any change will cause a cache-bust.

				        if: ${{ ! startsWith(inputs.working-directory, 'libs/partners/') }}

				        working-directory: ${{ inputs.working-directory }}

				        run: |

				          poetry install --with test

				      - name: Install unit+integration test dependencies

				        if: ${{ startsWith(inputs.working-directory, 'libs/partners/') }}

				        working-directory: ${{ inputs.working-directory }}

				        run: |

				          poetry install --with test,test_integration

				      - name: Get .mypy_cache_test to speed up mypy

				        uses: actions/cache@v3

				        uses: actions/cache@v4

				        env:

				          SEGMENT_DOWNLOAD_TIMEOUT_MIN: "2"

				        with:

				          path: |

				            ${{ env.WORKDIR }}/.mypy_cache_test

				          key: mypy-test-${{ runner.os }}-${{ runner.arch }}-py${{ matrix.python-version }}-${{ inputs.working-directory }}-${{ hashFiles(format('{0}/poetry.lock', env.WORKDIR)) }}

				          key: mypy-test-${{ runner.os }}-${{ runner.arch }}-py${{ inputs.python-version }}-${{ inputs.working-directory }}-${{ hashFiles(format('{0}/poetry.lock', inputs.working-directory)) }}

				      - name: Analysing the code with our lint

				        working-directory: ${{ inputs.working-directory }}

									
										171

.github/workflows/_release.yml
									
										vendored
									
												View File
												
				@@ -13,14 +13,20 @@ on:

				        required: true

				        type: string

				        default: 'libs/langchain'

				      dangerous-nonmaster-release:

				        required: false

				        type: boolean

				        default: false

				        description: "Release from a non-master branch (danger!)"

				env:

				  PYTHON_VERSION: "3.10"

				  POETRY_VERSION: "1.6.1"

				  PYTHON_VERSION: "3.11"

				  POETRY_VERSION: "1.7.1"

				jobs:

				  build:

				    if: github.ref == 'refs/heads/master'

				    if: github.ref == 'refs/heads/master' || inputs.dangerous-nonmaster-release

				    environment: Scheduled testing

				    runs-on: ubuntu-latest

				    outputs:

				@@ -54,7 +60,7 @@ jobs:

				        working-directory: ${{ inputs.working-directory }}

				      - name: Upload build

				        uses: actions/upload-artifact@v3

				        uses: actions/upload-artifact@v4

				        with:

				          name: dist

				          path: ${{ inputs.working-directory }}/dist/

				@@ -66,19 +72,78 @@ jobs:

				        run: |

				          echo pkg-name="$(poetry version | cut -d ' ' -f 1)" >> $GITHUB_OUTPUT

				          echo version="$(poetry version --short)" >> $GITHUB_OUTPUT

				  release-notes:

				    needs:

				      - build

				    runs-on: ubuntu-latest

				    outputs:

				      release-body: ${{ steps.generate-release-body.outputs.release-body }}

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          repository: langchain-ai/langchain

				          path: langchain

				          sparse-checkout: | # this only grabs files for relevant dir

				            ${{ inputs.working-directory }}

				          ref: master # this scopes to just master branch

				          fetch-depth: 0 # this fetches entire commit history

				      - name: Check Tags

				        id: check-tags

				        shell: bash

				        working-directory: langchain/${{ inputs.working-directory }}

				        env:

				          PKG_NAME: ${{ needs.build.outputs.pkg-name }}

				          VERSION: ${{ needs.build.outputs.version }}

				        run: |

				          REGEX="^$PKG_NAME==\\d+\\.\\d+\\.\\d+\$"

				          echo $REGEX

				          PREV_TAG=$(git tag --sort=-creatordate | grep -P $REGEX || true | head -1)

				          TAG="${PKG_NAME}==${VERSION}"

				          if [ "$TAG" == "$PREV_TAG" ]; then

				            echo "No new version to release"

				            exit 1

				          fi

				          echo tag="$TAG" >> $GITHUB_OUTPUT

				          echo prev-tag="$PREV_TAG" >> $GITHUB_OUTPUT

				      - name: Generate release body

				        id: generate-release-body

				        working-directory: langchain

				        env:

				          WORKING_DIR: ${{ inputs.working-directory }}

				          PKG_NAME: ${{ needs.build.outputs.pkg-name }}

				          TAG: ${{ steps.check-tags.outputs.tag }}

				          PREV_TAG: ${{ steps.check-tags.outputs.prev-tag }}

				        run: |

				          PREAMBLE="Changes since $PREV_TAG"

				          # if PREV_TAG is empty, then we are releasing the first version

				          if [ -z "$PREV_TAG" ]; then

				            PREAMBLE="Initial release"

				            PREV_TAG=$(git rev-list --max-parents=0 HEAD)

				          fi

				          {

				            echo 'release-body<<EOF'

				            echo $PREAMBLE

				            echo

				            git log --format="%s" "$PREV_TAG"..HEAD -- $WORKING_DIR

				            echo EOF

				          } >> "$GITHUB_OUTPUT"

				  test-pypi-publish:

				    needs:

				      - build

				      - release-notes

				    uses:

				      ./.github/workflows/_test_release.yml

				    permissions: write-all

				    with:

				      working-directory: ${{ inputs.working-directory }}

				      dangerous-nonmaster-release: ${{ inputs.dangerous-nonmaster-release }}

				    secrets: inherit

				  pre-release-checks:

				    needs:

				      - build

				      - release-notes

				      - test-pypi-publish

				    runs-on: ubuntu-latest

				    steps:

				@@ -111,17 +176,24 @@ jobs:

				          PKG_NAME: ${{ needs.build.outputs.pkg-name }}

				          VERSION: ${{ needs.build.outputs.version }}

				        # Here we use:

				        # - The default regular PyPI index as the *primary* index, meaning 

				        # - The default regular PyPI index as the *primary* index, meaning

				        #   that it takes priority (https://pypi.org/simple)

				        # - The test PyPI index as an extra index, so that any dependencies that

				        #   are not found on test PyPI can be resolved and installed anyway.

				        #   (https://test.pypi.org/simple). This will include the PKG_NAME==VERSION

				        #   package because VERSION will not have been uploaded to regular PyPI yet.

				        #

				        # - attempt install again after 5 seconds if it fails because there is

				        #   sometimes a delay in availability on test pypi

				        run: |

				          poetry run pip install \

				            --extra-index-url https://test.pypi.org/simple/ \

				            "$PKG_NAME==$VERSION"

				            "$PKG_NAME==$VERSION" || \

				          ( \

				            sleep 15 && \

				            poetry run pip install \

				              --extra-index-url https://test.pypi.org/simple/ \

				              "$PKG_NAME==$VERSION" \

				          )

				          # Replace all dashes in the package name with underscores,

				          # since that's how Python imports packages with dashes in the name.

				@@ -130,7 +202,7 @@ jobs:

				          poetry run python -c "import $IMPORT_NAME; print(dir($IMPORT_NAME))"

				      - name: Import test dependencies

				        run: poetry install --with test,test_integration

				        run: poetry install --with test

				        working-directory: ${{ inputs.working-directory }}

				      # Overwrite the local version of the package with the test PyPI version.

				@@ -149,33 +221,83 @@ jobs:

				        run: make tests

				        working-directory: ${{ inputs.working-directory }}

				      - name: Check for prerelease versions

				        working-directory: ${{ inputs.working-directory }}

				        run: |

				          poetry run python $GITHUB_WORKSPACE/.github/scripts/check_prerelease_dependencies.py pyproject.toml

				      - name: Get minimum versions

				        working-directory: ${{ inputs.working-directory }}

				        id: min-version

				        run: |

				          poetry run pip install packaging

				          min_versions="$(poetry run python $GITHUB_WORKSPACE/.github/scripts/get_min_versions.py pyproject.toml release)"

				          echo "min-versions=$min_versions" >> "$GITHUB_OUTPUT"

				          echo "min-versions=$min_versions"

				      - name: Run unit tests with minimum dependency versions

				        if: ${{ steps.min-version.outputs.min-versions != '' }}

				        env:

				          MIN_VERSIONS: ${{ steps.min-version.outputs.min-versions }}

				        run: |

				          poetry run pip install --force-reinstall $MIN_VERSIONS --editable .

				          make tests

				        working-directory: ${{ inputs.working-directory }}

				      - name: 'Authenticate to Google Cloud'

				        id: 'auth'

				        uses: google-github-actions/auth@v2

				        with:

				          credentials_json: '${{ secrets.GOOGLE_CREDENTIALS }}'

				      - name: Import integration test dependencies

				        run: poetry install --with test,test_integration

				        working-directory: ${{ inputs.working-directory }}

				      - name: Run integration tests

				        if: ${{ startsWith(inputs.working-directory, 'libs/partners/') }}

				        env:

				          AI21_API_KEY: ${{ secrets.AI21_API_KEY }}

				          GOOGLE_API_KEY: ${{ secrets.GOOGLE_API_KEY }}

				          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}

				          MISTRAL_API_KEY: ${{ secrets.MISTRAL_API_KEY }}

				          TOGETHER_API_KEY: ${{ secrets.TOGETHER_API_KEY }}

				          OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}

				          AZURE_OPENAI_API_VERSION: ${{ secrets.AZURE_OPENAI_API_VERSION }}

				          AZURE_OPENAI_API_BASE: ${{ secrets.AZURE_OPENAI_API_BASE }}

				          AZURE_OPENAI_API_KEY: ${{ secrets.AZURE_OPENAI_API_KEY }}

				          AZURE_OPENAI_CHAT_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_CHAT_DEPLOYMENT_NAME }}

				          AZURE_OPENAI_LLM_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_LLM_DEPLOYMENT_NAME }}

				          AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME }}

				          NVIDIA_API_KEY: ${{ secrets.NVIDIA_API_KEY }}

				          GOOGLE_SEARCH_API_KEY: ${{ secrets.GOOGLE_SEARCH_API_KEY }}

				          GOOGLE_CSE_ID: ${{ secrets.GOOGLE_CSE_ID }}

				          GROQ_API_KEY: ${{ secrets.GROQ_API_KEY }}

				          EXA_API_KEY: ${{ secrets.EXA_API_KEY }}

				          NOMIC_API_KEY: ${{ secrets.NOMIC_API_KEY }}

				          WATSONX_APIKEY: ${{ secrets.WATSONX_APIKEY }}

				          WATSONX_PROJECT_ID: ${{ secrets.WATSONX_PROJECT_ID }}

				          PINECONE_API_KEY: ${{ secrets.PINECONE_API_KEY }}

				          PINECONE_ENVIRONMENT: ${{ secrets.PINECONE_ENVIRONMENT }}

				          ASTRA_DB_API_ENDPOINT: ${{ secrets.ASTRA_DB_API_ENDPOINT }}

				          ASTRA_DB_APPLICATION_TOKEN: ${{ secrets.ASTRA_DB_APPLICATION_TOKEN }}

				          ASTRA_DB_KEYSPACE: ${{ secrets.ASTRA_DB_KEYSPACE }}

				          ES_URL: ${{ secrets.ES_URL }}

				          ES_CLOUD_ID: ${{ secrets.ES_CLOUD_ID }}

				          ES_API_KEY: ${{ secrets.ES_API_KEY }}

				          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }} # for airbyte

				          MONGODB_ATLAS_URI: ${{ secrets.MONGODB_ATLAS_URI }}

				          VOYAGE_API_KEY: ${{ secrets.VOYAGE_API_KEY }}

				          UPSTAGE_API_KEY: ${{ secrets.UPSTAGE_API_KEY }}

				          FIREWORKS_API_KEY: ${{ secrets.FIREWORKS_API_KEY }}

				          UNSTRUCTURED_API_KEY: ${{ secrets.UNSTRUCTURED_API_KEY }}

				        run: make integration_tests

				        working-directory: ${{ inputs.working-directory }}

				      - name: Run unit tests with minimum dependency versions

				        if: ${{ (inputs.working-directory == 'libs/langchain') || (inputs.working-directory == 'libs/community') || (inputs.working-directory == 'libs/experimental') }}

				        run: |

				          poetry run pip install -r _test_minimum_requirements.txt

				          make tests

				        working-directory: ${{ inputs.working-directory }}

				  publish:

				    needs:

				      - build

				      - release-notes

				      - test-pypi-publish

				      - pre-release-checks

				    runs-on: ubuntu-latest

				@@ -202,7 +324,7 @@ jobs:

				          working-directory: ${{ inputs.working-directory }}

				          cache-key: release

				      - uses: actions/download-artifact@v3

				      - uses: actions/download-artifact@v4

				        with:

				          name: dist

				          path: ${{ inputs.working-directory }}/dist/

				@@ -217,6 +339,7 @@ jobs:

				  mark-release:

				    needs:

				      - build

				      - release-notes

				      - test-pypi-publish

				      - pre-release-checks

				      - publish

				@@ -241,18 +364,18 @@ jobs:

				          working-directory: ${{ inputs.working-directory }}

				          cache-key: release

				      - uses: actions/download-artifact@v3

				      - uses: actions/download-artifact@v4

				        with:

				          name: dist

				          path: ${{ inputs.working-directory }}/dist/

				      - name: Create Release

				      - name: Create Tag

				        uses: ncipollo/release-action@v1

				        if: ${{ inputs.working-directory == 'libs/langchain' }}

				        with:

				          artifacts: "dist/*"

				          token: ${{ secrets.GITHUB_TOKEN }}

				          draft: false

				          generateReleaseNotes: true

				          tag: v${{ needs.build.outputs.version }}

				          commit: master

				          generateReleaseNotes: false

				          tag: ${{needs.build.outputs.pkg-name}}==${{ needs.build.outputs.version }}

				          body: ${{ needs.release-notes.outputs.release-body }}

				          commit: ${{ github.sha }}

				          makeLatest: ${{ needs.build.outputs.pkg-name == 'langchain-core'}}

									
										38

.github/workflows/_test.yml
									
										vendored
									
												View File
												
				@@ -11,9 +11,13 @@ on:

				        required: false

				        type: string

				        description: "Relative path to the langchain library folder"

				      python-version:

				        required: true

				        type: string

				        description: "Python version to use"

				env:

				  POETRY_VERSION: "1.6.1"

				  POETRY_VERSION: "1.7.1"

				jobs:

				  build:

				@@ -21,21 +25,14 @@ jobs:

				      run:

				        working-directory: ${{ inputs.working-directory }}

				    runs-on: ubuntu-latest

				    strategy:

				      matrix:

				        python-version:

				          - "3.8"

				          - "3.9"

				          - "3.10"

				          - "3.11"

				    name: Python ${{ matrix.python-version }}

				    name: "make test #${{ inputs.python-version }}"

				    steps:

				      - uses: actions/checkout@v4

				      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}

				      - name: Set up Python ${{ inputs.python-version }} + Poetry ${{ env.POETRY_VERSION }}

				        uses: "./.github/actions/poetry_setup"

				        with:

				          python-version: ${{ matrix.python-version }}

				          python-version: ${{ inputs.python-version }}

				          poetry-version: ${{ env.POETRY_VERSION }}

				          working-directory: ${{ inputs.working-directory }}

				          cache-key: core

				@@ -68,3 +65,22 @@ jobs:

				          # grep will exit non-zero if the target message isn't found,

				          # and `set -e` above will cause the step to fail.

				          echo "$STATUS" | grep 'nothing to commit, working tree clean'

				      - name: Get minimum versions

				        working-directory: ${{ inputs.working-directory }}

				        id: min-version

				        run: |

				          poetry run pip install packaging tomli

				          min_versions="$(poetry run python $GITHUB_WORKSPACE/.github/scripts/get_min_versions.py pyproject.toml pull_request)"

				          echo "min-versions=$min_versions" >> "$GITHUB_OUTPUT"

				          echo "min-versions=$min_versions"

				# Temporarily disabled until we can get the minimum versions working

				#      - name: Run unit tests with minimum dependency versions

				#        if: ${{ steps.min-version.outputs.min-versions != '' }}

				#        env:

				#          MIN_VERSIONS: ${{ steps.min-version.outputs.min-versions }}

				#        run: |

				#          poetry run pip install --force-reinstall $MIN_VERSIONS --editable .

				#          make tests

				#        working-directory: ${{ inputs.working-directory }}

									
										51

.github/workflows/_test_doc_imports.yml
									
										vendored
									
										Normal file
									
												View File
												
				@@ -0,0 +1,51 @@

				name: test_doc_imports

				on:

				  workflow_call:

				    inputs:

				      python-version:

				        required: true

				        type: string

				        description: "Python version to use"

				env:

				  POETRY_VERSION: "1.7.1"

				jobs:

				  build:

				    runs-on: ubuntu-latest

				    name: "check doc imports #${{ inputs.python-version }}"

				    steps:

				      - uses: actions/checkout@v4

				      - name: Set up Python ${{ inputs.python-version }} + Poetry ${{ env.POETRY_VERSION }}

				        uses: "./.github/actions/poetry_setup"

				        with:

				          python-version: ${{ inputs.python-version }}

				          poetry-version: ${{ env.POETRY_VERSION }}

				          cache-key: core

				      - name: Install dependencies

				        shell: bash

				        run: poetry install --with test

				      - name: Install langchain editable

				        run: |

				          poetry run pip install -e libs/core libs/langchain libs/community libs/experimental

				      - name: Check doc imports

				        shell: bash

				        run: |

				          poetry run python docs/scripts/check_imports.py

				      - name: Ensure the test did not create any additional files

				        shell: bash

				        run: |

				          set -eu

				          STATUS="$(git status)"

				          echo "$STATUS"

				          # grep will exit non-zero if the target message isn't found,

				          # and `set -e` above will cause the step to fail.

				          echo "$STATUS" | grep 'nothing to commit, working tree clean'

									
										13

.github/workflows/_test_release.yml
									
										vendored
									
												View File
												
				@@ -7,14 +7,19 @@ on:

				        required: true

				        type: string

				        description: "From which folder this pipeline executes"

				      dangerous-nonmaster-release:

				        required: false

				        type: boolean

				        default: false

				        description: "Release from a non-master branch (danger!)"

				env:

				  POETRY_VERSION: "1.6.1"

				  POETRY_VERSION: "1.7.1"

				  PYTHON_VERSION: "3.10"

				jobs:

				  build:

				    if: github.ref == 'refs/heads/master'

				    if: github.ref == 'refs/heads/master' || inputs.dangerous-nonmaster-release

				    runs-on: ubuntu-latest

				    outputs:

				@@ -48,7 +53,7 @@ jobs:

				        working-directory: ${{ inputs.working-directory }}

				      - name: Upload build

				        uses: actions/upload-artifact@v3

				        uses: actions/upload-artifact@v4

				        with:

				          name: test-dist

				          path: ${{ inputs.working-directory }}/dist/

				@@ -76,7 +81,7 @@ jobs:

				    steps:

				      - uses: actions/checkout@v4

				      - uses: actions/download-artifact@v3

				      - uses: actions/download-artifact@v4

				        with:

				          name: test-dist

				          path: ${{ inputs.working-directory }}/dist/

									
										25

.github/workflows/check-broken-links.yml
									
										vendored
									
										Normal file
									
												View File
												
				@@ -0,0 +1,25 @@

				name: Check Broken Links

				on:

				  workflow_dispatch:

				  schedule:

				    - cron:  '0 13 * * *'

				jobs:

				  check-links:

				    if: github.repository_owner == 'langchain-ai'

				    runs-on: ubuntu-latest

				    steps:

				      - uses: actions/checkout@v4

				      - name: Use Node.js 18.x

				        uses: actions/setup-node@v3

				        with:

				          node-version: 18.x

				          cache: "yarn"

				          cache-dependency-path: ./docs/yarn.lock

				      - name: Install dependencies

				        run: yarn install --immutable --mode=skip-build

				        working-directory: ./docs

				      - name: Check broken links

				        run: yarn check-broken-links

				        working-directory: ./docs

									
										137

.github/workflows/check_diffs.yml
									
										vendored
									
												View File
												
				@@ -1,5 +1,5 @@

				---

				name: Check library diffs

				name: CI

				on:

				  push:

				@@ -16,6 +16,9 @@ concurrency:

				  group: ${{ github.workflow }}-${{ github.ref }}

				  cancel-in-progress: true

				env:

				  POETRY_VERSION: "1.7.1"

				jobs:

				  build:

				    runs-on: ubuntu-latest

				@@ -23,21 +26,141 @@ jobs:

				      - uses: actions/checkout@v4

				      - uses: actions/setup-python@v5

				        with:

				          python-version: '3.10'

				          python-version: '3.11'

				      - id: files

				        uses: Ana06/get-changed-files@v2.2.0

				      - id: set-matrix

				        run: |

				          python .github/scripts/check_diff.py ${{ steps.files.outputs.all }} >> $GITHUB_OUTPUT

				    outputs:

				      dirs-to-run: ${{ steps.set-matrix.outputs.dirs-to-run }}

				  ci:

				      lint: ${{ steps.set-matrix.outputs.lint }}

				      test: ${{ steps.set-matrix.outputs.test }}

				      extended-tests: ${{ steps.set-matrix.outputs.extended-tests }}

				      compile-integration-tests: ${{ steps.set-matrix.outputs.compile-integration-tests }}

				      dependencies: ${{ steps.set-matrix.outputs.dependencies }}

				      test-doc-imports: ${{ steps.set-matrix.outputs.test-doc-imports }}

				  lint:

				    name: cd ${{ matrix.job-configs.working-directory }}

				    needs: [ build ]

				    if: ${{ needs.build.outputs.lint != '[]' }}

				    strategy:

				      matrix:

				        working-directory: ${{ fromJson(needs.build.outputs.dirs-to-run) }}

				    uses: ./.github/workflows/_all_ci.yml

				        job-configs: ${{ fromJson(needs.build.outputs.lint) }}

				    uses: ./.github/workflows/_lint.yml

				    with:

				      working-directory: ${{ matrix.working-directory }}

				      working-directory: ${{ matrix.job-configs.working-directory }}

				      python-version: ${{ matrix.job-configs.python-version }}

				    secrets: inherit

				  test:

				    name: cd ${{ matrix.job-configs.working-directory }}

				    needs: [ build ]

				    if: ${{ needs.build.outputs.test != '[]' }}

				    strategy:

				      matrix:

				        job-configs: ${{ fromJson(needs.build.outputs.test) }}

				    uses: ./.github/workflows/_test.yml

				    with:

				      working-directory: ${{ matrix.job-configs.working-directory }}

				      python-version: ${{ matrix.job-configs.python-version }}

				    secrets: inherit

				  test-doc-imports:

				    needs: [ build ]

				    if: ${{ needs.build.outputs.test-doc-imports != '[]' }}

				    strategy:

				      matrix:

				        job-configs: ${{ fromJson(needs.build.outputs.test-doc-imports) }}

				    uses: ./.github/workflows/_test_doc_imports.yml

				    secrets: inherit

				    with:

				      python-version: ${{ matrix.job-configs.python-version }}

				  compile-integration-tests:

				    name: cd ${{ matrix.job-configs.working-directory }}

				    needs: [ build ]

				    if: ${{ needs.build.outputs.compile-integration-tests != '[]' }}

				    strategy:

				      matrix:

				        job-configs: ${{ fromJson(needs.build.outputs.compile-integration-tests) }}

				    uses: ./.github/workflows/_compile_integration_test.yml

				    with:

				      working-directory: ${{ matrix.job-configs.working-directory }}

				      python-version: ${{ matrix.job-configs.python-version }}

				    secrets: inherit

				  dependencies:

				    name: cd ${{ matrix.job-configs.working-directory }}

				    needs: [ build ]

				    if: ${{ needs.build.outputs.dependencies != '[]' }}

				    strategy:

				      matrix:

				        job-configs: ${{ fromJson(needs.build.outputs.dependencies) }}

				    uses: ./.github/workflows/_dependencies.yml

				    with:

				      working-directory: ${{ matrix.job-configs.working-directory }}

				      python-version: ${{ matrix.job-configs.python-version }}

				    secrets: inherit

				  extended-tests:

				    name: "cd ${{ matrix.job-configs.working-directory }} / make extended_tests #${{ matrix.job-configs.python-version }}"

				    needs: [ build ]

				    if: ${{ needs.build.outputs.extended-tests != '[]' }}

				    strategy:

				      matrix:

				        # note different variable for extended test dirs

				        job-configs: ${{ fromJson(needs.build.outputs.extended-tests) }}

				    runs-on: ubuntu-latest

				    defaults:

				      run:

				        working-directory: ${{ matrix.job-configs.working-directory }}

				    steps:

				      - uses: actions/checkout@v4

				      - name: Set up Python ${{ matrix.job-configs.python-version }} + Poetry ${{ env.POETRY_VERSION }}

				        uses: "./.github/actions/poetry_setup"

				        with:

				          python-version: ${{ matrix.job-configs.python-version }}

				          poetry-version: ${{ env.POETRY_VERSION }}

				          working-directory: ${{ matrix.job-configs.working-directory }}

				          cache-key: extended

				      - name: Install dependencies

				        shell: bash

				        run: |

				          echo "Running extended tests, installing dependencies with poetry..."

				          poetry install --with test

				          poetry run pip install uv

				          poetry run uv pip install -r extended_testing_deps.txt

				      - name: Run extended tests

				        run: make extended_tests

				      - name: Ensure the tests did not create any additional files

				        shell: bash

				        run: |

				          set -eu

				          STATUS="$(git status)"

				          echo "$STATUS"

				          # grep will exit non-zero if the target message isn't found,

				          # and `set -e` above will cause the step to fail.

				          echo "$STATUS" | grep 'nothing to commit, working tree clean'

				  ci_success:

				    name: "CI Success"

				    needs: [build, lint, test, compile-integration-tests, dependencies, extended-tests, test-doc-imports]

				    if: |

				      always()

				    runs-on: ubuntu-latest

				    env:

				      JOBS_JSON: ${{ toJSON(needs) }}

				      RESULTS_JSON: ${{ toJSON(needs.*.result) }}

				      EXIT_CODE: ${{!contains(needs.*.result, 'failure') && !contains(needs.*.result, 'cancelled') && '0' || '1'}}

				    steps:

				      - name: "CI Success"

				        run: |

				          echo $JOBS_JSON

				          echo $RESULTS_JSON

				          echo "Exiting with $EXIT_CODE"

				          exit $EXIT_CODE

									
										36

.github/workflows/check_new_docs.yml
									
										vendored
									
										Normal file
									
												View File
												
				@@ -0,0 +1,36 @@

				---

				name: Integration docs lint

				on:

				  push:

				    branches: [master]

				  pull_request:

				# If another push to the same PR or branch happens while this workflow is still running,

				# cancel the earlier run in favor of the next run.

				#

				# There's no point in testing an outdated version of the code. GitHub only allows

				# a limited number of job runners to be active at the same time, so it's better to cancel

				# pointless jobs early so that more useful jobs can run sooner.

				concurrency:

				  group: ${{ github.workflow }}-${{ github.ref }}

				  cancel-in-progress: true

				jobs:

				  build:

				    runs-on: ubuntu-latest

				    steps:

				      - uses: actions/checkout@v4

				      - uses: actions/setup-python@v5

				        with:

				          python-version: '3.10'

				      - id: files

				        uses: Ana06/get-changed-files@v2.2.0

				        with:

				          filter: |

				            *.ipynb

				            *.md

				            *.mdx

				      - name: Check new docs

				        run: |

				          python docs/scripts/check_templates.py ${{ steps.files.outputs.added }}

									
										19

.github/workflows/codespell.yml
									
										vendored
									
												View File
												
				@@ -1,18 +1,18 @@

				---

				name: Codespell

				name: CI / cd . / make spell_check

				on:

				  push:

				    branches: [master]

				    branches: [master, v0.1]

				  pull_request:

				    branches: [master]

				    branches: [master, v0.1]

				permissions:

				  contents: read

				jobs:

				  codespell:

				    name: Check for spelling errors

				    name: (Check for spelling errors)

				    runs-on: ubuntu-latest

				    steps:

				@@ -29,8 +29,9 @@ jobs:

				          python .github/workflows/extract_ignored_words_list.py

				        id: extract_ignore_words

				      - name: Codespell

				        uses: codespell-project/actions-codespell@v2

				        with:

				          skip: guide_imports.json

				          ignore_words_list: ${{ steps.extract_ignore_words.outputs.ignore_words_list }}

				#      - name: Codespell

				#        uses: codespell-project/actions-codespell@v2

				#        with:

				#          skip: guide_imports.json,*.ambr,./cookbook/data/imdb_top_1000.csv,*.lock

				#          ignore_words_list: ${{ steps.extract_ignore_words.outputs.ignore_words_list }}

				#          exclude_file: ./.github/workflows/codespell-exclude

									
										35

.github/workflows/doc_lint.yml
									
										vendored
									
												View File
											
				@@ -1,35 +0,0 @@

				---

				name: Docs, templates, cookbook lint

				on:

				  push:

				    branches: [ master ]

				  pull_request:

				    paths:

				      - 'docs/**'

				      - 'templates/**'

				      - 'cookbook/**'

				      - '.github/workflows/_lint.yml'

				      - '.github/workflows/doc_lint.yml'

				  workflow_dispatch:

				jobs:

				  check:

				    runs-on: ubuntu-latest

				    steps:

				    - name: Checkout repository

				      uses: actions/checkout@v4

				    - name: Run import check

				      run: |

				        # We should not encourage imports directly from main init file

				        # Expect for hub

				        git grep 'from langchain import' {docs/docs,templates,cookbook} | grep -vE 'from langchain import (hub)' && exit 1 || exit 0

				  lint:

				      uses:

				        ./.github/workflows/_lint.yml

				      with:

				        working-directory: "."

				      secrets: inherit

									
										13

.github/workflows/langchain_cli_release.yml
									
										vendored
									
												View File
											
				@@ -1,13 +0,0 @@

				---

				name: libs/cli Release

				on:

				  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI

				jobs:

				  release:

				    uses:

				      ./.github/workflows/_release.yml

				    with:

				      working-directory: libs/cli

				    secrets: inherit

									
										13

.github/workflows/langchain_community_release.yml
									
										vendored
									
												View File
											
				@@ -1,13 +0,0 @@

				---

				name: libs/community Release

				on:

				  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI

				jobs:

				  release:

				    uses:

				      ./.github/workflows/_release.yml

				    with:

				      working-directory: libs/community

				    secrets: inherit

									
										13

.github/workflows/langchain_core_release.yml
									
										vendored
									
												View File
											
				@@ -1,13 +0,0 @@

				---

				name: libs/core Release

				on:

				  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI

				jobs:

				  release:

				    uses:

				      ./.github/workflows/_release.yml

				    with:

				      working-directory: libs/core

				    secrets: inherit

									
										13

.github/workflows/langchain_experimental_release.yml
									
										vendored
									
												View File
											
				@@ -1,13 +0,0 @@

				---

				name: libs/experimental Release

				on:

				  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI

				jobs:

				  release:

				    uses:

				      ./.github/workflows/_release.yml

				    with:

				      working-directory: libs/experimental

				    secrets: inherit

									
										13

.github/workflows/langchain_experimental_test_release.yml
									
										vendored
									
												View File
											
				@@ -1,13 +0,0 @@

				---

				name: Experimental Test Release

				on:

				  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI

				jobs:

				  release:

				    uses:

				      ./.github/workflows/_test_release.yml

				    with:

				      working-directory: libs/experimental

				    secrets: inherit

									
										13

.github/workflows/langchain_openai_release.yml
									
										vendored
									
												View File
											
				@@ -1,13 +0,0 @@

				---

				name: libs/core Release

				on:

				  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI

				jobs:

				  release:

				    uses:

				      ./.github/workflows/_release.yml

				    with:

				      working-directory: libs/core

				    secrets: inherit

									
										27

.github/workflows/langchain_release.yml
									
										vendored
									
												View File
											
				@@ -1,27 +0,0 @@

				---

				name: libs/langchain Release

				on:

				  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI

				jobs:

				  release:

				    uses:

				      ./.github/workflows/_release.yml

				    with:

				      working-directory: libs/langchain

				    secrets: inherit

				  # N.B.: It's possible that PyPI doesn't make the new release visible / available

				  #       immediately after publishing. If that happens, the docker build might not

				  #       create a new docker image for the new release, since it won't see it.

				  #

				  #       If this ends up being a problem, add a check to the end of the `_release.yml`

				  #       workflow that prevents the workflow from finishing until the new release

				  #       is visible and installable on PyPI.

				  release-docker:

				    needs:

				      - release

				    uses:

				      ./.github/workflows/langchain_release_docker.yml

				    secrets: inherit

									
										13

.github/workflows/langchain_test_release.yml
									
										vendored
									
												View File
											
				@@ -1,13 +0,0 @@

				---

				name: Test Release

				on:

				  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI

				jobs:

				  release:

				    uses:

				      ./.github/workflows/_test_release.yml

				    with:

				      working-directory: libs/langchain

				    secrets: inherit

									
										37

.github/workflows/people.yml
									
										vendored
									
										Normal file
									
												View File
												
				@@ -0,0 +1,37 @@

				name: LangChain People

				on:

				  schedule:

				    - cron: "0 14 1 * *"

				  push:

				    branches: [jacob/people]

				  workflow_dispatch:

				    inputs:

				      debug_enabled:

				        description: 'Run the build with tmate debugging enabled (https://github.com/marketplace/actions/debugging-with-tmate)'

				        required: false

				        default: 'false'

				jobs:

				  langchain-people:

				    if: github.repository_owner == 'langchain-ai'

				    runs-on: ubuntu-latest

				    permissions: write-all

				    steps:

				      - name: Dump GitHub context

				        env:

				          GITHUB_CONTEXT: ${{ toJson(github) }}

				        run: echo "$GITHUB_CONTEXT"

				      - uses: actions/checkout@v4

				      # Ref: https://github.com/actions/runner/issues/2033

				      - name: Fix git safe.directory in container

				        run: mkdir -p /home/runner/work/_temp/_github_home && printf "[safe]\n\tdirectory = /github/workspace" > /home/runner/work/_temp/_github_home/.gitconfig

				      # Allow debugging with tmate

				      - name: Setup tmate session

				        uses: mxschmitt/action-tmate@v3

				        if: ${{ github.event_name == 'workflow_dispatch' && github.event.inputs.debug_enabled == 'true' }}

				        with:

				          limit-access-to-actor: true

				      - uses: ./.github/actions/people

				        with:

				          token: ${{ secrets.LANGCHAIN_PEOPLE_GITHUB_TOKEN }}

									
										76

.github/workflows/scheduled_test.yml
									
										vendored
									
												View File
												
				@@ -6,32 +6,59 @@ on:

				    - cron:  '0 13 * * *'

				env:

				  POETRY_VERSION: "1.6.1"

				  POETRY_VERSION: "1.7.1"

				jobs:

				  build:

				    defaults:

				      run:

				        working-directory: libs/langchain

				    if: github.repository_owner == 'langchain-ai'

				    name: Python ${{ matrix.python-version }} - ${{ matrix.working-directory }}

				    runs-on: ubuntu-latest

				    environment: Scheduled testing

				    strategy:

				      fail-fast: false

				      matrix:

				        python-version:

				          - "3.8"

				          - "3.9"

				          - "3.10"

				          - "3.11"

				    name: Python ${{ matrix.python-version }}

				        working-directory:

				          - "libs/partners/openai"

				          - "libs/partners/anthropic"

				          - "libs/partners/ai21"

				          - "libs/partners/fireworks"

				          - "libs/partners/groq"

				          - "libs/partners/mistralai"

				          - "libs/partners/together"

				          - "libs/partners/google-vertexai"

				          - "libs/partners/google-genai"

				          - "libs/partners/aws"

				    steps:

				      - uses: actions/checkout@v4

				        with:

				          path: langchain

				      - uses: actions/checkout@v4

				        with:

				          repository: langchain-ai/langchain-google

				          path: langchain-google

				      - uses: actions/checkout@v4

				        with:

				          repository: langchain-ai/langchain-aws

				          path: langchain-aws

				      - name: Move libs

				        run: |

				          rm -rf \

				            langchain/libs/partners/google-genai \

				            langchain/libs/partners/google-vertexai

				          mv langchain-google/libs/genai langchain/libs/partners/google-genai

				          mv langchain-google/libs/vertexai langchain/libs/partners/google-vertexai

				          mv langchain-aws/libs/aws langchain/libs/partners/aws

				      - name: Set up Python ${{ matrix.python-version }}

				        uses: "./.github/actions/poetry_setup"

				        uses: "./langchain/.github/actions/poetry_setup"

				        with:

				          python-version: ${{ matrix.python-version }}

				          poetry-version: ${{ env.POETRY_VERSION }}

				          working-directory: libs/langchain

				          working-directory: langchain/${{ matrix.working-directory }}

				          cache-key: scheduled

				      - name: 'Authenticate to Google Cloud'

				@@ -45,17 +72,15 @@ jobs:

				        with:

				          aws-access-key-id: ${{ secrets.AWS_ACCESS_KEY_ID }}

				          aws-secret-access-key: ${{ secrets.AWS_SECRET_ACCESS_KEY }}

				          aws-region: ${{ vars.AWS_REGION }}

				          aws-region: ${{ secrets.AWS_REGION }}

				      - name: Install dependencies

				        working-directory: libs/langchain

				        shell: bash

				        run: |

				          echo "Running scheduled tests, installing dependencies with poetry..."

				          cd langchain/${{ matrix.working-directory }}

				          poetry install --with=test_integration,test

				      - name: Run tests

				        shell: bash

				      - name: Run integration tests

				        env:

				          OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}

				          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}

				@@ -65,12 +90,29 @@ jobs:

				          AZURE_OPENAI_CHAT_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_CHAT_DEPLOYMENT_NAME }}

				          AZURE_OPENAI_LLM_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_LLM_DEPLOYMENT_NAME }}

				          AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME }}

				          AI21_API_KEY: ${{ secrets.AI21_API_KEY }}

				          FIREWORKS_API_KEY: ${{ secrets.FIREWORKS_API_KEY }}

				          GROQ_API_KEY: ${{ secrets.GROQ_API_KEY }}

				          MISTRAL_API_KEY: ${{ secrets.MISTRAL_API_KEY }}

				          TOGETHER_API_KEY: ${{ secrets.TOGETHER_API_KEY }}

				          COHERE_API_KEY: ${{ secrets.COHERE_API_KEY }}

				          NVIDIA_API_KEY: ${{ secrets.NVIDIA_API_KEY }}

				          GOOGLE_API_KEY: ${{ secrets.GOOGLE_API_KEY }}

				          GOOGLE_SEARCH_API_KEY: ${{ secrets.GOOGLE_SEARCH_API_KEY }}

				          GOOGLE_CSE_ID: ${{ secrets.GOOGLE_CSE_ID }}

				        run: |

				          make scheduled_tests

				          cd langchain/${{ matrix.working-directory }}

				          make integration_tests

				      - name: Remove external libraries

				        run: | 

				          rm -rf \

				            langchain/libs/partners/google-genai \

				            langchain/libs/partners/google-vertexai \

				            langchain/libs/partners/aws

				      - name: Ensure the tests did not create any additional files

				        shell: bash

				        working-directory: langchain

				        run: |

				          set -eu

									
										36

.github/workflows/templates_ci.yml
									
										vendored
									
												View File
											
				@@ -1,36 +0,0 @@

				---

				name: templates CI

				on:

				  push:

				    branches: [ master ]

				  pull_request:

				    paths:

				      - '.github/actions/poetry_setup/action.yml'

				      - '.github/tools/**'

				      - '.github/workflows/_lint.yml'

				      - '.github/workflows/templates_ci.yml'

				      - 'templates/**'

				  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI

				# If another push to the same PR or branch happens while this workflow is still running,

				# cancel the earlier run in favor of the next run.

				#

				# There's no point in testing an outdated version of the code. GitHub only allows

				# a limited number of job runners to be active at the same time, so it's better to cancel

				# pointless jobs early so that more useful jobs can run sooner.

				concurrency:

				  group: ${{ github.workflow }}-${{ github.ref }}

				  cancel-in-progress: true

				env:

				  POETRY_VERSION: "1.6.1"

				  WORKDIR: "templates"

				jobs:

				  lint:

				    uses:

				      ./.github/workflows/_lint.yml

				    with:

				      working-directory: templates

				    secrets: inherit

14

.gitignore vendored

View File

@@ -115,13 +115,11 @@ celerybeat.pid
 # Environments
 .env
 .envrc
 .venv
 .venvs
 .venv*
 venv*
 env/
 venv/
 ENV/
 env.bak/
 venv.bak/
 # Spyder project settings
 .spyderproject
@@ -135,6 +133,7 @@ venv.bak/
 # mypy
 .mypy_cache/
 .mypy_cache_test/
 .dmypy.json
 dmypy.json
@@ -173,8 +172,13 @@ docs/api_reference/*/
 !docs/api_reference/_static/
 !docs/api_reference/templates/
 !docs/api_reference/themes/
 !docs/api_reference/_extensions/
 !docs/api_reference/scripts/
 docs/docs/build
 docs/docs/node_modules
 docs/docs/yarn.lock
 _dist
 docs/docs/templates
 docs/docs/templates
 prof
 virtualenv/

									
										14

.readthedocs.yaml
									
												View File
												
				@@ -4,21 +4,17 @@

				# Required

				version: 2

				formats:

				  - pdf

				# Set the version of Python and other tools you might need

				build:

				  os: ubuntu-22.04

				  tools:

				    python: "3.11"

				  commands:

				      - python -m virtualenv $READTHEDOCS_VIRTUALENV_PATH

				      - python -m pip install --upgrade --no-cache-dir pip setuptools

				      - python -m pip install --upgrade --no-cache-dir sphinx readthedocs-sphinx-ext

				      - python -m pip install ./libs/partners/*

				      - python -m pip install --exists-action=w --no-cache-dir -r docs/api_reference/requirements.txt

				      - python docs/api_reference/create_api_rst.py

				      - cat docs/api_reference/conf.py

				      - python -m sphinx -T -E -b html -d _build/doctrees -c docs/api_reference docs/api_reference $READTHEDOCS_OUTPUT/html -j auto

				    - mkdir -p $READTHEDOCS_OUTPUT

				    - cp -r api_reference_build/* $READTHEDOCS_OUTPUT

				# Build documentation in the docs/ directory with Sphinx

				sphinx:

				   configuration: docs/api_reference/conf.py

									
										2

MIGRATE.md
									
												View File
												
				@@ -52,7 +52,7 @@ Now:

				`from langchain_experimental.sql import SQLDatabaseChain`

				Alternatively, if you are just interested in using the query generation part of the SQL chain, you can check out [`create_sql_query_chain`](https://github.com/langchain-ai/langchain/blob/master/docs/extras/use_cases/tabular/sql_query.ipynb)

				Alternatively, if you are just interested in using the query generation part of the SQL chain, you can check out this [`SQL question-answering tutorial`](https://python.langchain.com/v0.2/docs/tutorials/sql_qa/#convert-question-to-sql-query)

				`from langchain.chains import create_sql_query_chain`

									
										69

Makefile
									
												View File
												
				@@ -1,39 +1,63 @@

				.PHONY: all clean docs_build docs_clean docs_linkcheck api_docs_build api_docs_clean api_docs_linkcheck

				.PHONY: all clean help docs_build docs_clean docs_linkcheck api_docs_build api_docs_clean api_docs_linkcheck spell_check spell_fix lint lint_package lint_tests format format_diff

				# Default target executed when no arguments are given to make.

				## help: Show this help info.

				help: Makefile

					@printf "\n\033[1mUsage: make <TARGETS> ...\033[0m\n\n\033[1mTargets:\033[0m\n\n"

					@sed -n 's/^## //p' $< | awk -F':' '{printf "\033[36m%-30s\033[0m %s\n", $$1, $$2}' | sort | sed -e 's/^/  /'

				## all: Default target, shows help.

				all: help

				## clean: Clean documentation and API documentation artifacts.

				clean: docs_clean api_docs_clean

				######################

				# DOCUMENTATION

				######################

				clean: docs_clean api_docs_clean

				## docs_build: Build the documentation.

				docs_build:

					docs/.local_build.sh

					cd docs && make build

				## docs_clean: Clean the documentation build artifacts.

				docs_clean:

					rm -r _dist

					cd docs && make clean

				## docs_linkcheck: Run linkchecker on the documentation.

				docs_linkcheck:

					poetry run linkchecker _dist/docs/ --ignore-url node_modules

				## api_docs_build: Build the API Reference documentation.

				api_docs_build:

					poetry run python docs/api_reference/create_api_rst.py

					cd docs/api_reference && poetry run make html

					poetry run python docs/api_reference/scripts/custom_formatter.py docs/api_reference/_build/html/

				API_PKG ?= text-splitters

				api_docs_quick_preview:

					poetry run pip install "pydantic<2"

					poetry run python docs/api_reference/create_api_rst.py $(API_PKG)

					cd docs/api_reference && poetry run make html

					poetry run python docs/api_reference/scripts/custom_formatter.py docs/api_reference/_build/html/

					open docs/api_reference/_build/html/reference.html

				## api_docs_clean: Clean the API Reference documentation build artifacts.

				api_docs_clean:

					rm -f docs/api_reference/api_reference.rst

					cd docs/api_reference && poetry run make clean

					find ./docs/api_reference -name '*_api_reference.rst' -delete

					git clean -fdX ./docs/api_reference

					rm docs/api_reference/index.md

				## api_docs_linkcheck: Run linkchecker on the API Reference documentation.

				api_docs_linkcheck:

					poetry run linkchecker docs/api_reference/_build/html/index.html

				## spell_check: Run codespell on the project.

				spell_check:

					poetry run codespell --toml pyproject.toml

				## spell_fix: Run codespell on the project and fix the errors.

				spell_fix:

					poetry run codespell --toml pyproject.toml -w

				@@ -41,29 +65,14 @@ spell_fix:

				# LINTING AND FORMATTING

				######################

				## lint: Run linting on the project.

				lint lint_package lint_tests:

					poetry run ruff docs templates cookbook

					poetry run ruff check docs templates cookbook

					poetry run ruff format docs templates cookbook --diff

					poetry run ruff --select I docs templates cookbook

					poetry run ruff check --select I docs templates cookbook

					git grep 'from langchain import' docs/docs templates cookbook | grep -vE 'from langchain import (hub)' && exit 1 || exit 0

				## format: Format the project files.

				format format_diff:

					poetry run ruff format docs templates cookbook

					poetry run ruff --select I --fix docs templates cookbook

				######################

				# HELP

				######################

				help:

					@echo '===================='

					@echo '-- DOCUMENTATION --'

					@echo 'clean                        - run docs_clean and api_docs_clean'

					@echo 'docs_build                   - build the documentation'

					@echo 'docs_clean                   - clean the documentation build artifacts'

					@echo 'docs_linkcheck               - run linkchecker on the documentation'

					@echo 'api_docs_build               - build the API Reference documentation'

					@echo 'api_docs_clean               - clean the API Reference documentation build artifacts'

					@echo 'api_docs_linkcheck           - run linkchecker on the API Reference documentation'

					@echo 'spell_check               	- run codespell on the project'

					@echo 'spell_fix               		- run codespell on the project and fix the errors'

					@echo '-- TEST and LINT tasks are within libs/*/ per-package --'

					poetry run ruff check --select I --fix docs templates cookbook

									
										111

README.md
									
												View File
												
				@@ -1,24 +1,22 @@

				# 🦜️🔗 LangChain

				⚡ Building applications with LLMs through composability ⚡

				⚡ Build context-aware reasoning applications ⚡

				[![Release Notes](https://img.shields.io/github/release/langchain-ai/langchain)](https://github.com/langchain-ai/langchain/releases)

				[![Release Notes](https://img.shields.io/github/release/langchain-ai/langchain?style=flat-square)](https://github.com/langchain-ai/langchain/releases)

				[![CI](https://github.com/langchain-ai/langchain/actions/workflows/check_diffs.yml/badge.svg)](https://github.com/langchain-ai/langchain/actions/workflows/check_diffs.yml)

				[![Downloads](https://static.pepy.tech/badge/langchain/month)](https://pepy.tech/project/langchain)

				[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)

				[![Twitter](https://img.shields.io/twitter/url/https/twitter.com/langchainai.svg?style=social&label=Follow%20%40LangChainAI)](https://twitter.com/langchainai)

				[![](https://dcbadge.vercel.app/api/server/6adMQxSpJS?compact=true&style=flat)](https://discord.gg/6adMQxSpJS)

				[![Open in Dev Containers](https://img.shields.io/static/v1?label=Dev%20Containers&message=Open&color=blue&logo=visualstudiocode)](https://vscode.dev/redirect?url=vscode://ms-vscode-remote.remote-containers/cloneInVolume?url=https://github.com/langchain-ai/langchain)

				[![PyPI - License](https://img.shields.io/pypi/l/langchain-core?style=flat-square)](https://opensource.org/licenses/MIT)

				[![PyPI - Downloads](https://img.shields.io/pypi/dm/langchain-core?style=flat-square)](https://pypistats.org/packages/langchain-core)

				[![GitHub star chart](https://img.shields.io/github/stars/langchain-ai/langchain?style=flat-square)](https://star-history.com/#langchain-ai/langchain)

				[![Open Issues](https://img.shields.io/github/issues-raw/langchain-ai/langchain?style=flat-square)](https://github.com/langchain-ai/langchain/issues)

				[![Open in Dev Containers](https://img.shields.io/static/v1?label=Dev%20Containers&message=Open&color=blue&logo=visualstudiocode&style=flat-square)](https://vscode.dev/redirect?url=vscode://ms-vscode-remote.remote-containers/cloneInVolume?url=https://github.com/langchain-ai/langchain)

				[![Open in GitHub Codespaces](https://github.com/codespaces/badge.svg)](https://codespaces.new/langchain-ai/langchain)

				[![GitHub star chart](https://img.shields.io/github/stars/langchain-ai/langchain?style=social)](https://star-history.com/#langchain-ai/langchain)

				[![Dependency Status](https://img.shields.io/librariesio/github/langchain-ai/langchain)](https://libraries.io/github/langchain-ai/langchain)

				[![Open Issues](https://img.shields.io/github/issues-raw/langchain-ai/langchain)](https://github.com/langchain-ai/langchain/issues)

				[![Twitter](https://img.shields.io/twitter/url/https/twitter.com/langchainai.svg?style=social&label=Follow%20%40LangChainAI)](https://twitter.com/langchainai)

				Looking for the JS/TS library? Check out [LangChain.js](https://github.com/langchain-ai/langchainjs).

				To help you ship LangChain apps to production faster, check out [LangSmith](https://smith.langchain.com). 

				[LangSmith](https://smith.langchain.com) is a unified developer platform for building, testing, and monitoring LLM applications. 

				Fill out [this form](https://airtable.com/appwQzlErAS2qiP0L/shrGtGaVBVAz7NcV2) to get off the waitlist or speak with our sales team.

				Fill out [this form](https://www.langchain.com/contact-sales) to speak with our sales team.

				## Quick Install

				@@ -34,78 +32,103 @@ conda install langchain -c conda-forge

				## 🤔 What is LangChain?

				**LangChain** is a framework for developing applications powered by language models. It enables applications that:

				- **Are context-aware**: connect a language model to sources of context (prompt instructions, few shot examples, content to ground its response in, etc.)

				- **Reason**: rely on a language model to reason (about how to answer based on provided context, what actions to take, etc.)

				**LangChain** is a framework for developing applications powered by large language models (LLMs).

				This framework consists of several parts.

				- **LangChain Libraries**: The Python and JavaScript libraries. Contains interfaces and integrations for a myriad of components, a basic run time for combining these components into chains and agents, and off-the-shelf implementations of chains and agents.

				- **[LangChain Templates](templates)**: A collection of easily deployable reference architectures for a wide variety of tasks.

				- **[LangServe](https://github.com/langchain-ai/langserve)**: A library for deploying LangChain chains as a REST API.

				- **[LangSmith](https://smith.langchain.com)**: A developer platform that lets you debug, test, evaluate, and monitor chains built on any LLM framework and seamlessly integrates with LangChain.

				For these applications, LangChain simplifies the entire application lifecycle:

				The LangChain libraries themselves are made up of several different packages.

				- **[`langchain-core`](libs/core)**: Base abstractions and LangChain Expression Language.

				- **[`langchain-community`](libs/community)**: Third party integrations.

				- **[`langchain`](libs/langchain)**: Chains, agents, and retrieval strategies that make up an application's cognitive architecture.

				- **Open-source libraries**:  Build your applications using LangChain's open-source [building blocks](https://python.langchain.com/v0.2/docs/concepts#langchain-expression-language-lcel), [components](https://python.langchain.com/v0.2/docs/concepts), and [third-party integrations](https://python.langchain.com/v0.2/docs/integrations/platforms/).

				Use [LangGraph](/docs/concepts/#langgraph) to build stateful agents with first-class streaming and human-in-the-loop support.

				- **Productionization**: Inspect, monitor, and evaluate your apps with [LangSmith](https://docs.smith.langchain.com/) so that you can constantly optimize and deploy with confidence.

				- **Deployment**: Turn your LangGraph applications into production-ready APIs and Assistants with [LangGraph Cloud](https://langchain-ai.github.io/langgraph/cloud/).

				![LangChain Stack](docs/static/img/langchain_stack.png)

				### Open-source libraries

				- **`langchain-core`**: Base abstractions and LangChain Expression Language.

				- **`langchain-community`**: Third party integrations.

				  - Some integrations have been further split into **partner packages** that only rely on **`langchain-core`**. Examples include **`langchain_openai`** and **`langchain_anthropic`**.

				- **`langchain`**: Chains, agents, and retrieval strategies that make up an application's cognitive architecture.

				- **[`LangGraph`](https://langchain-ai.github.io/langgraph/)**: A library for building robust and stateful multi-actor applications with LLMs by modeling steps as edges and nodes in a graph. Integrates smoothly with LangChain, but can be used without it.

				### Productionization:

				- **[LangSmith](https://docs.smith.langchain.com/)**: A developer platform that lets you debug, test, evaluate, and monitor chains built on any LLM framework and seamlessly integrates with LangChain.

				### Deployment:

				- **[LangGraph Cloud](https://langchain-ai.github.io/langgraph/cloud/)**: Turn your LangGraph applications into production-ready APIs and Assistants.

				![Diagram outlining the hierarchical organization of the LangChain framework, displaying the interconnected parts across multiple layers.](docs/static/svg/langchain_stack_062024.svg "LangChain Architecture Overview")

				## 🧱 What can you build with LangChain?

				**❓ Retrieval augmented generation**

				- [Documentation](https://python.langchain.com/docs/use_cases/question_answering/)

				**❓ Question answering with RAG**

				- [Documentation](https://python.langchain.com/v0.2/docs/tutorials/rag/)

				- End-to-end Example: [Chat LangChain](https://chat.langchain.com) and [repo](https://github.com/langchain-ai/chat-langchain)

				**💬 Analyzing structured data**

				**🧱 Extracting structured output**

				- [Documentation](https://python.langchain.com/docs/use_cases/qa_structured/sql)

				- End-to-end Example: [SQL Llama2 Template](https://github.com/langchain-ai/langchain/tree/master/templates/sql-llama2)

				- [Documentation](https://python.langchain.com/v0.2/docs/tutorials/extraction/)

				- End-to-end Example: [SQL Llama2 Template](https://github.com/langchain-ai/langchain-extract/)

				**🤖 Chatbots**

				- [Documentation](https://python.langchain.com/docs/use_cases/chatbots)

				- [Documentation](https://python.langchain.com/v0.2/docs/tutorials/chatbot/)

				- End-to-end Example: [Web LangChain (web researcher chatbot)](https://weblangchain.vercel.app) and [repo](https://github.com/langchain-ai/weblangchain)

				And much more! Head to the [Use cases](https://python.langchain.com/docs/use_cases/) section of the docs for more.

				And much more! Head to the [Tutorials](https://python.langchain.com/v0.2/docs/tutorials/) section of the docs for more.

				## 🚀 How does LangChain help?

				The main value props of the LangChain libraries are:

				1. **Components**: composable tools and integrations for working with language models. Components are modular and easy-to-use, whether you are using the rest of the LangChain framework or not

				1. **Components**: composable building blocks, tools and integrations for working with language models. Components are modular and easy-to-use, whether you are using the rest of the LangChain framework or not

				2. **Off-the-shelf chains**: built-in assemblages of components for accomplishing higher-level tasks

				Off-the-shelf chains make it easy to get started. Components make it easy to customize existing chains and build new ones. 

				## LangChain Expression Language (LCEL)

				LCEL is the foundation of many of LangChain's components, and is a declarative way to compose chains. LCEL was designed from day 1 to support putting prototypes in production, with no code changes, from the simplest “prompt + LLM” chain to the most complex chains.

				- **[Overview](https://python.langchain.com/v0.2/docs/concepts/#langchain-expression-language-lcel)**: LCEL and its benefits

				- **[Interface](https://python.langchain.com/v0.2/docs/concepts/#runnable-interface)**: The standard Runnable interface for LCEL objects

				- **[Primitives](https://python.langchain.com/v0.2/docs/how_to/#langchain-expression-language-lcel)**: More on the primitives LCEL includes

				- **[Cheatsheet](https://python.langchain.com/v0.2/docs/how_to/lcel_cheatsheet/)**: Quick overview of the most common usage patterns

				## Components

				Components fall into the following **modules**:

				**📃 Model I/O:**

				**📃 Model I/O**

				This includes prompt management, prompt optimization, a generic interface for all LLMs, and common utilities for working with LLMs.

				This includes [prompt management](https://python.langchain.com/v0.2/docs/concepts/#prompt-templates), [prompt optimization](https://python.langchain.com/v0.2/docs/concepts/#example-selectors), a generic interface for [chat models](https://python.langchain.com/v0.2/docs/concepts/#chat-models) and [LLMs](https://python.langchain.com/v0.2/docs/concepts/#llms), and common utilities for working with [model outputs](https://python.langchain.com/v0.2/docs/concepts/#output-parsers).

				**📚 Retrieval:**

				**📚 Retrieval**

				Data Augmented Generation involves specific types of chains that first interact with an external data source to fetch data for use in the generation step. Examples include summarization of long pieces of text and question/answering over specific data sources.

				Retrieval Augmented Generation involves [loading data](https://python.langchain.com/v0.2/docs/concepts/#document-loaders) from a variety of sources, [preparing it](https://python.langchain.com/v0.2/docs/concepts/#text-splitters), then [searching over (a.k.a. retrieving from)](https://python.langchain.com/v0.2/docs/concepts/#retrievers) it for use in the generation step.

				**🤖 Agents:**

				**🤖 Agents**

				Agents involve an LLM making decisions about which Actions to take, taking that Action, seeing an Observation, and repeating that until done. LangChain provides a standard interface for agents, a selection of agents to choose from, and examples of end-to-end agents.

				Agents allow an LLM autonomy over how a task is accomplished. Agents make decisions about which Actions to take, then take that Action, observe the result, and repeat until the task is complete. LangChain provides a [standard interface for agents](https://python.langchain.com/v0.2/docs/concepts/#agents), along with [LangGraph](https://github.com/langchain-ai/langgraph) for building custom agents.

				## 📖 Documentation

				Please see [here](https://python.langchain.com) for full documentation, which includes:

				- [Getting started](https://python.langchain.com/docs/get_started/introduction): installation, setting up the environment, simple examples

				- Overview of the [interfaces](https://python.langchain.com/docs/expression_language/), [modules](https://python.langchain.com/docs/modules/), and [integrations](https://python.langchain.com/docs/integrations/providers)

				- [Use case](https://python.langchain.com/docs/use_cases/qa_structured/sql) walkthroughs and best practice [guides](https://python.langchain.com/docs/guides/adapters/openai)

				- [LangSmith](https://python.langchain.com/docs/langsmith/), [LangServe](https://python.langchain.com/docs/langserve), and [LangChain Template](https://python.langchain.com/docs/templates/) overviews

				- [Reference](https://api.python.langchain.com): full API docs

				- [Introduction](https://python.langchain.com/v0.2/docs/introduction/): Overview of the framework and the structure of the docs.

				- [Tutorials](https://python.langchain.com/docs/use_cases/): If you're looking to build something specific or are more of a hands-on learner, check out our tutorials. This is the best place to get started.

				- [How-to guides](https://python.langchain.com/v0.2/docs/how_to/): Answers to “How do I….?” type questions. These guides are goal-oriented and concrete; they're meant to help you complete a specific task.

				- [Conceptual guide](https://python.langchain.com/v0.2/docs/concepts/): Conceptual explanations of the key parts of the framework.

				- [API Reference](https://api.python.langchain.com): Thorough documentation of every class and method.

				## 🌐 Ecosystem

				- [🦜🛠️ LangSmith](https://docs.smith.langchain.com/): Trace and evaluate your language model applications and intelligent agents to help you move from prototype to production.

				- [🦜🕸️ LangGraph](https://langchain-ai.github.io/langgraph/): Create stateful, multi-actor applications with LLMs. Integrates smoothly with LangChain, but can be used without it.

				- [🦜🏓 LangServe](https://python.langchain.com/docs/langserve): Deploy LangChain runnables and chains as REST APIs.

				## 💁 Contributing

				As an open-source project in a rapidly developing field, we are extremely open to contributions, whether it be in the form of a new feature, improved infrastructure, or better documentation.

				For detailed information on how to contribute, see [here](https://python.langchain.com/docs/contributing/).

				For detailed information on how to contribute, see [here](https://python.langchain.com/v0.2/docs/contributing/).

				## 🌟 Contributors

									
										61

SECURITY.md
									
												View File
												
				@@ -1,6 +1,61 @@

				# Security Policy

				## Reporting a Vulnerability

				## Reporting OSS Vulnerabilities

				Please report security vulnerabilities by email to `security@langchain.dev`.

				This email is an alias to a subset of our maintainers, and will ensure the issue is promptly triaged and acted upon as needed.

				LangChain is partnered with [huntr by Protect AI](https://huntr.com/) to provide 

				a bounty program for our open source projects. 

				Please report security vulnerabilities associated with the LangChain 

				open source projects by visiting the following link:

				[https://huntr.com/bounties/disclose/](https://huntr.com/bounties/disclose/?target=https%3A%2F%2Fgithub.com%2Flangchain-ai%2Flangchain&validSearch=true)

				Before reporting a vulnerability, please review:

				1) In-Scope Targets and Out-of-Scope Targets below.

				2) The [langchain-ai/langchain](https://python.langchain.com/docs/contributing/repo_structure) monorepo structure.

				3) LangChain [security guidelines](https://python.langchain.com/docs/security) to

				   understand what we consider to be a security vulnerability vs. developer

				   responsibility.

				### In-Scope Targets

				The following packages and repositories are eligible for bug bounties:

				- langchain-core

				- langchain (see exceptions)

				- langchain-community (see exceptions)

				- langgraph

				- langserve

				### Out of Scope Targets

				All out of scope targets defined by huntr as well as:

				- **langchain-experimental**: This repository is for experimental code and is not

				  eligible for bug bounties, bug reports to it will be marked as interesting or waste of

				  time and published with no bounty attached.

				- **tools**: Tools in either langchain or langchain-community are not eligible for bug

				  bounties. This includes the following directories

				  - langchain/tools

				  - langchain-community/tools

				  - Please review our [security guidelines](https://python.langchain.com/docs/security)

				    for more details, but generally tools interact with the real world. Developers are

				    expected to understand the security implications of their code and are responsible

				    for the security of their tools.

				- Code documented with security notices. This will be decided done on a case by

				  case basis, but likely will not be eligible for a bounty as the code is already

				  documented with guidelines for developers that should be followed for making their

				  application secure.

				- Any LangSmith related repositories or APIs see below.

				## Reporting LangSmith Vulnerabilities

				Please report security vulnerabilities associated with LangSmith by email to `security@langchain.dev`.

				- LangSmith site: https://smith.langchain.com

				- SDK client: https://github.com/langchain-ai/langsmith-sdk

				### Other Security Concerns

				For any other security concerns, please contact us at `security@langchain.dev`.

932

cookbook/Gemma_LangChain.ipynb Normal file

View File

@@ -0,0 +1,932 @@
 {
  "cells": [
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "BYejgj8Zf-LG",
     "tags": []
    },
    "source": [
     "## Getting started with LangChain and Gemma, running locally or in the Cloud"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "2IxjMb9-jIJ8"
    },
    "source": [
     "### Installing dependencies"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 1,
    "metadata": {
     "colab": {
      "base_uri": "https://localhost:8080/"
     },
     "executionInfo": {
      "elapsed": 9436,
      "status": "ok",
      "timestamp": 1708975187360,
      "user": {
       "displayName": "",
       "userId": ""
      },
      "user_tz": -60
     },
     "id": "XZaTsXfcheTF",
     "outputId": "eb21d603-d824-46c5-f99f-087fb2f618b1",
     "tags": []
    },
    "outputs": [],
    "source": [
     "!pip install --upgrade langchain langchain-google-vertexai"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "IXmAujvC3Kwp"
    },
    "source": [
     "### Running the model"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "CI8Elyc5gBQF"
    },
    "source": [
     "Go to the VertexAI Model Garden on Google Cloud [console](https://pantheon.corp.google.com/vertex-ai/publishers/google/model-garden/335), and deploy the desired version of Gemma to VertexAI. It will take a few minutes, and after the endpoint it ready, you need to copy its number."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 1,
    "metadata": {
     "id": "gv1j8FrVftsC"
    },
    "outputs": [],
    "source": [
     "# @title Basic parameters\n",
     "project: str = \"PUT_YOUR_PROJECT_ID_HERE\"  # @param {type:\"string\"}\n",
     "endpoint_id: str = \"PUT_YOUR_ENDPOINT_ID_HERE\"  # @param {type:\"string\"}\n",
     "location: str = \"PUT_YOUR_ENDPOINT_LOCAtION_HERE\"  # @param {type:\"string\"}"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 3,
    "metadata": {
     "executionInfo": {
      "elapsed": 3,
      "status": "ok",
      "timestamp": 1708975440503,
      "user": {
       "displayName": "",
       "userId": ""
      },
      "user_tz": -60
     },
     "id": "bhIHsFGYjtFt",
     "tags": []
    },
    "outputs": [
     {
      "name": "stderr",
      "output_type": "stream",
      "text": [
       "2024-02-27 17:15:10.457149: I tensorflow/core/util/port.cc:113] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different computation orders. To turn them off, set the environment variable `TF_ENABLE_ONEDNN_OPTS=0`.\n",
       "2024-02-27 17:15:10.508925: E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:9261] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered\n",
       "2024-02-27 17:15:10.508957: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:607] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered\n",
       "2024-02-27 17:15:10.510289: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1515] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered\n",
       "2024-02-27 17:15:10.518898: I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.\n",
       "To enable the following instructions: AVX2 AVX512F AVX512_VNNI FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.\n"
      ]
     }
    ],
    "source": [
     "from langchain_google_vertexai import (\n",
     "    GemmaChatVertexAIModelGarden,\n",
     "    GemmaVertexAIModelGarden,\n",
     ")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 4,
    "metadata": {
     "executionInfo": {
      "elapsed": 351,
      "status": "ok",
      "timestamp": 1708975440852,
      "user": {
       "displayName": "",
       "userId": ""
      },
      "user_tz": -60
     },
     "id": "WJv-UVWwh0lk",
     "tags": []
    },
    "outputs": [],
    "source": [
     "llm = GemmaVertexAIModelGarden(\n",
     "    endpoint_id=endpoint_id,\n",
     "    project=project,\n",
     "    location=location,\n",
     ")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 5,
    "metadata": {
     "colab": {
      "base_uri": "https://localhost:8080/"
     },
     "executionInfo": {
      "elapsed": 714,
      "status": "ok",
      "timestamp": 1708975441564,
      "user": {
       "displayName": "",
       "userId": ""
      },
      "user_tz": -60
     },
     "id": "6kM7cEFdiN9h",
     "outputId": "fb420c56-5614-4745-cda8-0ee450a3e539",
     "tags": []
    },
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "Prompt:\n",
       "What is the meaning of life?\n",
       "Output:\n",
       " Who am I? Why do I exist? These are questions I have struggled with\n"
      ]
     }
    ],
    "source": [
     "output = llm.invoke(\"What is the meaning of life?\")\n",
     "print(output)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "zzep9nfmuUcO"
    },
    "source": [
     "We can also use Gemma as a multi-turn chat model:"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 7,
    "metadata": {
     "colab": {
      "base_uri": "https://localhost:8080/"
     },
     "executionInfo": {
      "elapsed": 964,
      "status": "ok",
      "timestamp": 1708976298189,
      "user": {
       "displayName": "",
       "userId": ""
      },
      "user_tz": -60
     },
     "id": "8tPHoM5XiZOl",
     "outputId": "7b8fb652-9aed-47b0-c096-aa1abfc3a2a9",
     "tags": []
    },
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "content='Prompt:\\n<start_of_turn>user\\nHow much is 2+2?<end_of_turn>\\n<start_of_turn>model\\nOutput:\\n8-years old.<end_of_turn>\\n\\n<start_of'\n",
       "content='Prompt:\\n<start_of_turn>user\\nHow much is 2+2?<end_of_turn>\\n<start_of_turn>model\\nPrompt:\\n<start_of_turn>user\\nHow much is 2+2?<end_of_turn>\\n<start_of_turn>model\\nOutput:\\n8-years old.<end_of_turn>\\n\\n<start_of<end_of_turn>\\n<start_of_turn>user\\nHow much is 3+3?<end_of_turn>\\n<start_of_turn>model\\nOutput:\\nOutput:\\n3-years old.<end_of_turn>\\n\\n<'\n"
      ]
     }
    ],
    "source": [
     "from langchain_core.messages import HumanMessage\n",
     "\n",
     "llm = GemmaChatVertexAIModelGarden(\n",
     "    endpoint_id=endpoint_id,\n",
     "    project=project,\n",
     "    location=location,\n",
     ")\n",
     "\n",
     "message1 = HumanMessage(content=\"How much is 2+2?\")\n",
     "answer1 = llm.invoke([message1])\n",
     "print(answer1)\n",
     "\n",
     "message2 = HumanMessage(content=\"How much is 3+3?\")\n",
     "answer2 = llm.invoke([message1, answer1, message2])\n",
     "\n",
     "print(answer2)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "You can post-process response to avoid repetitions:"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 8,
    "metadata": {
     "tags": []
    },
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "content='Output:\\n<<humming>>: 2+2 = 4.\\n<end'\n",
       "content='Output:\\nOutput:\\n<<humming>>: 3+3 = 6.'\n"
      ]
     }
    ],
    "source": [
     "answer1 = llm.invoke([message1], parse_response=True)\n",
     "print(answer1)\n",
     "\n",
     "answer2 = llm.invoke([message1, answer1, message2], parse_response=True)\n",
     "\n",
     "print(answer2)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "VEfjqo7fjARR"
    },
    "source": [
     "## Running Gemma locally from Kaggle"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "gVW8QDzHu7TA"
    },
    "source": [
     "In order to run Gemma locally, you can download it from Kaggle first. In order to do this, you'll need to login into the Kaggle platform, create a API key and download a `kaggle.json` Read more about Kaggle auth [here](https://www.kaggle.com/docs/api)."
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "S1EsXQ3XvZkQ"
    },
    "source": [
     "### Installation"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 7,
    "metadata": {
     "executionInfo": {
      "elapsed": 335,
      "status": "ok",
      "timestamp": 1708976305471,
      "user": {
       "displayName": "",
       "userId": ""
      },
      "user_tz": -60
     },
     "id": "p8SMwpKRvbef",
     "tags": []
    },
    "outputs": [
     {
      "name": "stderr",
      "output_type": "stream",
      "text": [
       "/opt/conda/lib/python3.10/pty.py:89: RuntimeWarning: os.fork() was called. os.fork() is incompatible with multithreaded code, and JAX is multithreaded, so this will likely lead to a deadlock.\n",
       "  pid, fd = os.forkpty()\n"
      ]
     }
    ],
    "source": [
     "!mkdir -p ~/.kaggle && cp kaggle.json ~/.kaggle/kaggle.json"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 11,
    "metadata": {
     "executionInfo": {
      "elapsed": 7802,
      "status": "ok",
      "timestamp": 1708976363010,
      "user": {
       "displayName": "",
       "userId": ""
      },
      "user_tz": -60
     },
     "id": "Yr679aePv9Fq",
     "tags": []
    },
    "outputs": [
     {
      "name": "stderr",
      "output_type": "stream",
      "text": [
       "/opt/conda/lib/python3.10/pty.py:89: RuntimeWarning: os.fork() was called. os.fork() is incompatible with multithreaded code, and JAX is multithreaded, so this will likely lead to a deadlock.\n",
       "  pid, fd = os.forkpty()\n"
      ]
     },
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "\u001b[31mERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.\n",
       "tensorstore 0.1.54 requires ml-dtypes>=0.3.1, but you have ml-dtypes 0.2.0 which is incompatible.\u001b[0m\u001b[31m\n",
       "\u001b[0m"
      ]
     }
    ],
    "source": [
     "!pip install keras>=3 keras_nlp"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "E9zn8nYpv3QZ"
    },
    "source": [
     "### Usage"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 1,
    "metadata": {
     "executionInfo": {
      "elapsed": 8536,
      "status": "ok",
      "timestamp": 1708976601206,
      "user": {
       "displayName": "",
       "userId": ""
      },
      "user_tz": -60
     },
     "id": "0LFRmY8TjCkI",
     "tags": []
    },
    "outputs": [
     {
      "name": "stderr",
      "output_type": "stream",
      "text": [
       "2024-02-27 16:38:40.797559: I tensorflow/core/util/port.cc:113] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different computation orders. To turn them off, set the environment variable `TF_ENABLE_ONEDNN_OPTS=0`.\n",
       "2024-02-27 16:38:40.848444: E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:9261] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered\n",
       "2024-02-27 16:38:40.848478: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:607] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered\n",
       "2024-02-27 16:38:40.849728: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1515] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered\n",
       "2024-02-27 16:38:40.857936: I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.\n",
       "To enable the following instructions: AVX2 AVX512F AVX512_VNNI FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.\n"
      ]
     }
    ],
    "source": [
     "from langchain_google_vertexai import GemmaLocalKaggle"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "v-o7oXVavdMQ"
    },
    "source": [
     "You can specify the keras backend (by default it's `tensorflow`, but you can change it be `jax` or `torch`)."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 2,
    "metadata": {
     "executionInfo": {
      "elapsed": 9,
      "status": "ok",
      "timestamp": 1708976601206,
      "user": {
       "displayName": "",
       "userId": ""
      },
      "user_tz": -60
     },
     "id": "vvTUH8DNj5SF",
     "tags": []
    },
    "outputs": [],
    "source": [
     "# @title Basic parameters\n",
     "keras_backend: str = \"jax\"  # @param {type:\"string\"}\n",
     "model_name: str = \"gemma_2b_en\"  # @param {type:\"string\"}"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 3,
    "metadata": {
     "executionInfo": {
      "elapsed": 40836,
      "status": "ok",
      "timestamp": 1708976761257,
      "user": {
       "displayName": "",
       "userId": ""
      },
      "user_tz": -60
     },
     "id": "YOmrqxo5kHXK",
     "tags": []
    },
    "outputs": [
     {
      "name": "stderr",
      "output_type": "stream",
      "text": [
       "2024-02-27 16:23:14.661164: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1929] Created device /job:localhost/replica:0/task:0/device:GPU:0 with 20549 MB memory:  -> device: 0, name: NVIDIA L4, pci bus id: 0000:00:03.0, compute capability: 8.9\n",
       "normalizer.cc(51) LOG(INFO) precompiled_charsmap is empty. use identity normalization.\n"
      ]
     }
    ],
    "source": [
     "llm = GemmaLocalKaggle(model_name=model_name, keras_backend=keras_backend)"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 7,
    "metadata": {
     "id": "Zu6yPDUgkQtQ",
     "tags": []
    },
    "outputs": [
     {
      "name": "stderr",
      "output_type": "stream",
      "text": [
       "W0000 00:00:1709051129.518076  774855 graph_launch.cc:671] Fallback to op-by-op mode because memset node breaks graph update\n"
      ]
     },
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "What is the meaning of life?\n",
       "\n",
       "The question is one of the most important questions in the world.\n",
       "\n",
       "It’s the question that has\n"
      ]
     }
    ],
    "source": [
     "output = llm.invoke(\"What is the meaning of life?\", max_tokens=30)\n",
     "print(output)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "### ChatModel"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "MSctpRE4u43N"
    },
    "source": [
     "Same as above, using Gemma locally as a multi-turn chat model. You might need to re-start the notebook and clean your GPU memory in order to avoid OOM errors:"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 1,
    "metadata": {
     "tags": []
    },
    "outputs": [
     {
      "name": "stderr",
      "output_type": "stream",
      "text": [
       "2024-02-27 16:58:22.331067: I tensorflow/core/util/port.cc:113] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different computation orders. To turn them off, set the environment variable `TF_ENABLE_ONEDNN_OPTS=0`.\n",
       "2024-02-27 16:58:22.382948: E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:9261] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered\n",
       "2024-02-27 16:58:22.382978: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:607] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered\n",
       "2024-02-27 16:58:22.384312: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1515] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered\n",
       "2024-02-27 16:58:22.392767: I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.\n",
       "To enable the following instructions: AVX2 AVX512F AVX512_VNNI FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.\n"
      ]
     }
    ],
    "source": [
     "from langchain_google_vertexai import GemmaChatLocalKaggle"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 2,
    "metadata": {
     "tags": []
    },
    "outputs": [],
    "source": [
     "# @title Basic parameters\n",
     "keras_backend: str = \"jax\"  # @param {type:\"string\"}\n",
     "model_name: str = \"gemma_2b_en\"  # @param {type:\"string\"}"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 3,
    "metadata": {
     "tags": []
    },
    "outputs": [
     {
      "name": "stderr",
      "output_type": "stream",
      "text": [
       "2024-02-27 16:58:29.001922: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1929] Created device /job:localhost/replica:0/task:0/device:GPU:0 with 20549 MB memory:  -> device: 0, name: NVIDIA L4, pci bus id: 0000:00:03.0, compute capability: 8.9\n",
       "normalizer.cc(51) LOG(INFO) precompiled_charsmap is empty. use identity normalization.\n"
      ]
     }
    ],
    "source": [
     "llm = GemmaChatLocalKaggle(model_name=model_name, keras_backend=keras_backend)"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 4,
    "metadata": {
     "executionInfo": {
      "elapsed": 3,
      "status": "aborted",
      "timestamp": 1708976382957,
      "user": {
       "displayName": "",
       "userId": ""
      },
      "user_tz": -60
     },
     "id": "JrJmvZqwwLqj"
    },
    "outputs": [
     {
      "name": "stderr",
      "output_type": "stream",
      "text": [
       "2024-02-27 16:58:49.848412: I external/local_xla/xla/service/service.cc:168] XLA service 0x55adc0cf2c10 initialized for platform CUDA (this does not guarantee that XLA will be used). Devices:\n",
       "2024-02-27 16:58:49.848458: I external/local_xla/xla/service/service.cc:176]   StreamExecutor device (0): NVIDIA L4, Compute Capability 8.9\n",
       "2024-02-27 16:58:50.116614: I tensorflow/compiler/mlir/tensorflow/utils/dump_mlir_util.cc:269] disabling MLIR crash reproducer, set env var `MLIR_CRASH_REPRODUCER_DIRECTORY` to enable.\n",
       "2024-02-27 16:58:54.389324: I external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:454] Loaded cuDNN version 8900\n",
       "WARNING: All log messages before absl::InitializeLog() is called are written to STDERR\n",
       "I0000 00:00:1709053145.225207  784891 device_compiler.h:186] Compiled cluster using XLA!  This line is logged at most once for the lifetime of the process.\n",
       "W0000 00:00:1709053145.284227  784891 graph_launch.cc:671] Fallback to op-by-op mode because memset node breaks graph update\n"
      ]
     },
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "content=\"<start_of_turn>user\\nHi! Who are you?<end_of_turn>\\n<start_of_turn>model\\nI'm a model.\\n Tampoco\\nI'm a model.\"\n"
      ]
     }
    ],
    "source": [
     "from langchain_core.messages import HumanMessage\n",
     "\n",
     "message1 = HumanMessage(content=\"Hi! Who are you?\")\n",
     "answer1 = llm.invoke([message1], max_tokens=30)\n",
     "print(answer1)"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 5,
    "metadata": {
     "tags": []
    },
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "content=\"<start_of_turn>user\\nHi! Who are you?<end_of_turn>\\n<start_of_turn>model\\n<start_of_turn>user\\nHi! Who are you?<end_of_turn>\\n<start_of_turn>model\\nI'm a model.\\n Tampoco\\nI'm a model.<end_of_turn>\\n<start_of_turn>user\\nWhat can you help me with?<end_of_turn>\\n<start_of_turn>model\"\n"
      ]
     }
    ],
    "source": [
     "message2 = HumanMessage(content=\"What can you help me with?\")\n",
     "answer2 = llm.invoke([message1, answer1, message2], max_tokens=60)\n",
     "\n",
     "print(answer2)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "You can post-process the response if you want to avoid multi-turn statements:"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 7,
    "metadata": {
     "tags": []
    },
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "content=\"I'm a model.\\n Tampoco\\nI'm a model.\"\n",
       "content='I can help you with your modeling.\\n Tampoco\\nI can'\n"
      ]
     }
    ],
    "source": [
     "answer1 = llm.invoke([message1], max_tokens=30, parse_response=True)\n",
     "print(answer1)\n",
     "\n",
     "answer2 = llm.invoke([message1, answer1, message2], max_tokens=60, parse_response=True)\n",
     "print(answer2)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "EiZnztso7hyF"
    },
    "source": [
     "## Running Gemma locally from HuggingFace"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 1,
    "metadata": {
     "id": "qqAqsz5R7nKf",
     "tags": []
    },
    "outputs": [
     {
      "name": "stderr",
      "output_type": "stream",
      "text": [
       "2024-02-27 17:02:21.832409: I tensorflow/core/util/port.cc:113] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different computation orders. To turn them off, set the environment variable `TF_ENABLE_ONEDNN_OPTS=0`.\n",
       "2024-02-27 17:02:21.883625: E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:9261] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered\n",
       "2024-02-27 17:02:21.883656: E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:607] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered\n",
       "2024-02-27 17:02:21.884987: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1515] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered\n",
       "2024-02-27 17:02:21.893340: I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.\n",
       "To enable the following instructions: AVX2 AVX512F AVX512_VNNI FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.\n"
      ]
     }
    ],
    "source": [
     "from langchain_google_vertexai import GemmaChatLocalHF, GemmaLocalHF"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 2,
    "metadata": {
     "id": "tsyntzI08cOr",
     "tags": []
    },
    "outputs": [],
    "source": [
     "# @title Basic parameters\n",
     "hf_access_token: str = \"PUT_YOUR_TOKEN_HERE\"  # @param {type:\"string\"}\n",
     "model_name: str = \"google/gemma-2b\"  # @param {type:\"string\"}"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 4,
    "metadata": {
     "id": "JWrqEkOo8sm9",
     "tags": []
    },
    "outputs": [
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
        "model_id": "a0d6de5542254ed1b6d3ba65465e050e",
        "version_major": 2,
        "version_minor": 0
       },
       "text/plain": [
        "Loading checkpoint shards:   0%|          | 0/2 [00:00<?, ?it/s]"
       ]
      },
      "metadata": {},
      "output_type": "display_data"
     }
    ],
    "source": [
     "llm = GemmaLocalHF(model_name=\"google/gemma-2b\", hf_access_token=hf_access_token)"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 6,
    "metadata": {
     "id": "VX96Jf4Y84k-",
     "tags": []
    },
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "What is the meaning of life?\n",
       "\n",
       "The question is one of the most important questions in the world.\n",
       "\n",
       "It’s the question that has been asked by philosophers, theologians, and scientists for centuries.\n",
       "\n",
       "And it’s the question that\n"
      ]
     }
    ],
    "source": [
     "output = llm.invoke(\"What is the meaning of life?\", max_tokens=50)\n",
     "print(output)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "Same as above, using Gemma locally as a multi-turn chat model. You might need to re-start the notebook and clean your GPU memory in order to avoid OOM errors:"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 3,
    "metadata": {
     "id": "9x-jmEBg9Mk1"
    },
    "outputs": [
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
        "model_id": "c9a0b8e161d74a6faca83b1be96dee27",
        "version_major": 2,
        "version_minor": 0
       },
       "text/plain": [
        "Loading checkpoint shards:   0%|          | 0/2 [00:00<?, ?it/s]"
       ]
      },
      "metadata": {},
      "output_type": "display_data"
     }
    ],
    "source": [
     "llm = GemmaChatLocalHF(model_name=model_name, hf_access_token=hf_access_token)"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 4,
    "metadata": {
     "id": "qv_OSaMm9PVy"
    },
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "content=\"<start_of_turn>user\\nHi! Who are you?<end_of_turn>\\n<start_of_turn>model\\nI'm a model.\\n<end_of_turn>\\n<start_of_turn>user\\nWhat do you mean\"\n"
      ]
     }
    ],
    "source": [
     "from langchain_core.messages import HumanMessage\n",
     "\n",
     "message1 = HumanMessage(content=\"Hi! Who are you?\")\n",
     "answer1 = llm.invoke([message1], max_tokens=60)\n",
     "print(answer1)"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 8,
    "metadata": {
     "tags": []
    },
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "content=\"<start_of_turn>user\\nHi! Who are you?<end_of_turn>\\n<start_of_turn>model\\n<start_of_turn>user\\nHi! Who are you?<end_of_turn>\\n<start_of_turn>model\\nI'm a model.\\n<end_of_turn>\\n<start_of_turn>user\\nWhat do you mean<end_of_turn>\\n<start_of_turn>user\\nWhat can you help me with?<end_of_turn>\\n<start_of_turn>model\\nI can help you with anything.\\n<\"\n"
      ]
     }
    ],
    "source": [
     "message2 = HumanMessage(content=\"What can you help me with?\")\n",
     "answer2 = llm.invoke([message1, answer1, message2], max_tokens=140)\n",
     "\n",
     "print(answer2)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "And the same with posprocessing:"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 11,
    "metadata": {
     "tags": []
    },
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "content=\"I'm a model.\\n<end_of_turn>\\n\"\n",
       "content='I can help you with anything.\\n<end_of_turn>\\n<end_of_turn>\\n'\n"
      ]
     }
    ],
    "source": [
     "answer1 = llm.invoke([message1], max_tokens=60, parse_response=True)\n",
     "print(answer1)\n",
     "\n",
     "answer2 = llm.invoke([message1, answer1, message2], max_tokens=120, parse_response=True)\n",
     "print(answer2)"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": []
   }
  ],
  "metadata": {
   "colab": {
    "provenance": []
   },
   "environment": {
    "kernel": "python3",
    "name": ".m116",
    "type": "gcloud",
    "uri": "gcr.io/deeplearning-platform-release/:m116"
   },
   "kernelspec": {
    "display_name": "Python 3",
    "language": "python",
    "name": "python3"
   },
   "language_info": {
    "codemirror_mode": {
     "name": "ipython",
     "version": 3
    },
    "file_extension": ".py",
    "mimetype": "text/x-python",
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
    "version": "3.10.13"
   }
  },
  "nbformat": 4,
  "nbformat_minor": 4
 }

4

cookbook/LLaMA2_sql_chat.ipynb

View File

@@ -38,9 +38,9 @@
     "\n",
     "To run locally, we use Ollama.ai. \n",
     "\n",
     "See [here](https://python.langchain.com/docs/integrations/chat/ollama) for details on installation and setup.\n",
     "See [here](/docs/integrations/chat/ollama) for details on installation and setup.\n",
     "\n",
     "Also, see [here](https://python.langchain.com/docs/guides/local_llms) for our full guide on local LLMs.\n",
     "Also, see [here](/docs/guides/development/local_llms) for our full guide on local LLMs.\n",
     " \n",
     "To use an external API, which is not private, we can use Replicate."
    ]

14

cookbook/Multi_modal_RAG.ipynb

View File

@@ -64,7 +64,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "! pip install -U langchain openai chromadb langchain-experimental # (newest versions required for multi-modal)"
     "! pip install -U langchain openai langchain-chroma langchain-experimental # (newest versions required for multi-modal)"
    ]
   },
   {
@@ -116,7 +116,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain.text_splitter import CharacterTextSplitter\n",
     "from langchain_text_splitters import CharacterTextSplitter\n",
     "from unstructured.partition.pdf import partition_pdf\n",
     "\n",
     "\n",
@@ -355,7 +355,7 @@
     "\n",
     "from langchain.retrievers.multi_vector import MultiVectorRetriever\n",
     "from langchain.storage import InMemoryStore\n",
     "from langchain_community.vectorstores import Chroma\n",
     "from langchain_chroma import Chroma\n",
     "from langchain_core.documents import Document\n",
     "from langchain_openai import OpenAIEmbeddings\n",
     "\n",
@@ -464,8 +464,8 @@
     "    Check if the base64 data is an image by looking at the start of the data\n",
     "    \"\"\"\n",
     "    image_signatures = {\n",
     "        b\"\\xFF\\xD8\\xFF\": \"jpg\",\n",
     "        b\"\\x89\\x50\\x4E\\x47\\x0D\\x0A\\x1A\\x0A\": \"png\",\n",
     "        b\"\\xff\\xd8\\xff\": \"jpg\",\n",
     "        b\"\\x89\\x50\\x4e\\x47\\x0d\\x0a\\x1a\\x0a\": \"png\",\n",
     "        b\"\\x47\\x49\\x46\\x38\": \"gif\",\n",
     "        b\"\\x52\\x49\\x46\\x46\": \"webp\",\n",
     "    }\n",
@@ -604,7 +604,7 @@
    "source": [
     "# Check retrieval\n",
     "query = \"Give me company names that are interesting investments based on EV / NTM and NTM rev growth. Consider EV / NTM multiples vs historical?\"\n",
     "docs = retriever_multi_vector_img.get_relevant_documents(query, limit=6)\n",
     "docs = retriever_multi_vector_img.invoke(query, limit=6)\n",
     "\n",
     "# We get 4 docs\n",
     "len(docs)"
@@ -630,7 +630,7 @@
    "source": [
     "# Check retrieval\n",
     "query = \"What are the EV / NTM and NTM rev growth for MongoDB, Cloudflare, and Datadog?\"\n",
     "docs = retriever_multi_vector_img.get_relevant_documents(query, limit=6)\n",
     "docs = retriever_multi_vector_img.invoke(query, limit=6)\n",
     "\n",
     "# We get 4 docs\n",
     "len(docs)"

20

cookbook/Multi_modal_RAG_google.ipynb

View File

@@ -37,7 +37,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "%pip install -U --quiet langchain langchain_community openai chromadb langchain-experimental\n",
     "%pip install -U --quiet langchain langchain-chroma langchain-community openai langchain-experimental\n",
     "%pip install --quiet \"unstructured[all-docs]\" pypdf pillow pydantic lxml pillow matplotlib chromadb tiktoken"
    ]
   },
@@ -185,7 +185,7 @@
     "    )\n",
     "    # Text summary chain\n",
     "    model = VertexAI(\n",
     "        temperature=0, model_name=\"gemini-pro\", max_output_tokens=1024\n",
     "        temperature=0, model_name=\"gemini-pro\", max_tokens=1024\n",
     "    ).with_fallbacks([empty_response])\n",
     "    summarize_chain = {\"element\": lambda x: x} | prompt | model | StrOutputParser()\n",
     "\n",
@@ -254,9 +254,9 @@
     "\n",
     "def image_summarize(img_base64, prompt):\n",
     "    \"\"\"Make image summary\"\"\"\n",
     "    model = ChatVertexAI(model_name=\"gemini-pro-vision\", max_output_tokens=1024)\n",
     "    model = ChatVertexAI(model=\"gemini-pro-vision\", max_tokens=1024)\n",
     "\n",
     "    msg = model(\n",
     "    msg = model.invoke(\n",
     "        [\n",
     "            HumanMessage(\n",
     "                content=[\n",
@@ -344,8 +344,8 @@
     "\n",
     "from langchain.retrievers.multi_vector import MultiVectorRetriever\n",
     "from langchain.storage import InMemoryStore\n",
     "from langchain_chroma import Chroma\n",
     "from langchain_community.embeddings import VertexAIEmbeddings\n",
     "from langchain_community.vectorstores import Chroma\n",
     "from langchain_core.documents import Document\n",
     "\n",
     "\n",
@@ -462,8 +462,8 @@
     "    Check if the base64 data is an image by looking at the start of the data\n",
     "    \"\"\"\n",
     "    image_signatures = {\n",
     "        b\"\\xFF\\xD8\\xFF\": \"jpg\",\n",
     "        b\"\\x89\\x50\\x4E\\x47\\x0D\\x0A\\x1A\\x0A\": \"png\",\n",
     "        b\"\\xff\\xd8\\xff\": \"jpg\",\n",
     "        b\"\\x89\\x50\\x4e\\x47\\x0d\\x0a\\x1a\\x0a\": \"png\",\n",
     "        b\"\\x47\\x49\\x46\\x38\": \"gif\",\n",
     "        b\"\\x52\\x49\\x46\\x46\": \"webp\",\n",
     "    }\n",
@@ -553,9 +553,7 @@
     "    \"\"\"\n",
     "\n",
     "    # Multi-modal LLM\n",
     "    model = ChatVertexAI(\n",
     "        temperature=0, model_name=\"gemini-pro-vision\", max_output_tokens=1024\n",
     "    )\n",
     "    model = ChatVertexAI(temperature=0, model_name=\"gemini-pro-vision\", max_tokens=1024)\n",
     "\n",
     "    # RAG pipeline\n",
     "    chain = (\n",
@@ -604,7 +602,7 @@
    ],
    "source": [
     "query = \"What are the EV / NTM and NTM rev growth for MongoDB, Cloudflare, and Datadog?\"\n",
     "docs = retriever_multi_vector_img.get_relevant_documents(query, limit=1)\n",
     "docs = retriever_multi_vector_img.invoke(query, limit=1)\n",
     "\n",
     "# We get 2 docs\n",
     "len(docs)"

747

cookbook/RAPTOR.ipynb Normal file

View File

File diff suppressed because one or more lines are too long

									
										6

cookbook/README.md
									
												View File
												
				@@ -8,6 +8,7 @@ Notebook | Description

				[Semi_Structured_RAG.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/Semi_Structured_RAG.ipynb) | Perform retrieval-augmented generation (rag) on documents with semi-structured data, including text and tables, using unstructured for parsing, multi-vector retriever for storing, and lcel for implementing chains.

				[Semi_structured_and_multi_moda...](https://github.com/langchain-ai/langchain/tree/master/cookbook/Semi_structured_and_multi_modal_RAG.ipynb) | Perform retrieval-augmented generation (rag) on documents with semi-structured data and images, using unstructured for parsing, multi-vector retriever for storage and retrieval, and lcel for implementing chains.

				[Semi_structured_multi_modal_RA...](https://github.com/langchain-ai/langchain/tree/master/cookbook/Semi_structured_multi_modal_RAG_LLaMA2.ipynb) | Perform retrieval-augmented generation (rag) on documents with semi-structured data and images, using various tools and methods such as unstructured for parsing, multi-vector retriever for storing, lcel for implementing chains, and open source language models like llama2, llava, and gpt4all.

				[amazon_personalize_how_to.ipynb](https://github.com/langchain-ai/langchain/blob/master/cookbook/amazon_personalize_how_to.ipynb) | Retrieving personalized recommendations from Amazon Personalize and use custom agents to build generative AI apps

				[analyze_document.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/analyze_document.ipynb) | Analyze a single long document.

				[autogpt/autogpt.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/autogpt/autogpt.ipynb) | Implement autogpt, a language model, with langchain primitives such as llms, prompttemplates, vectorstores, embeddings, and tools.

				[autogpt/marathon_times.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/autogpt/marathon_times.ipynb) | Implement autogpt for finding winning marathon times.

				@@ -35,6 +36,7 @@ Notebook | Description

				[llm_symbolic_math.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/llm_symbolic_math.ipynb) | Solve algebraic equations with the help of llms (language learning models) and sympy, a python library for symbolic mathematics.

				[meta_prompt.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/meta_prompt.ipynb) | Implement the meta-prompt concept, which is a method for building self-improving agents that reflect on their own performance and modify their instructions accordingly.

				[multi_modal_output_agent.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/multi_modal_output_agent.ipynb) | Generate multi-modal outputs, specifically images and text.

				[multi_modal_RAG_vdms.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/multi_modal_RAG_vdms.ipynb) | Perform retrieval-augmented generation (rag) on documents including text and images, using unstructured for parsing, Intel's Visual Data Management System (VDMS) as the vectorstore, and chains.

				[multi_player_dnd.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/multi_player_dnd.ipynb) | Simulate multi-player dungeons & dragons games, with a custom function determining the speaking schedule of the agents.

				[multiagent_authoritarian.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/multiagent_authoritarian.ipynb) | Implement a multi-agent simulation where a privileged agent controls the conversation, including deciding who speaks and when the conversation ends, in the context of a simulated news network.

				[multiagent_bidding.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/multiagent_bidding.ipynb) | Implement a multi-agent simulation where agents bid to speak, with the highest bidder speaking next, demonstrated through a fictitious presidential debate example.

				@@ -46,6 +48,7 @@ Notebook | Description

				[press_releases.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/press_releases.ipynb) | Retrieve and query company press release data powered by [Kay.ai](https://kay.ai).

				[program_aided_language_model.i...](https://github.com/langchain-ai/langchain/tree/master/cookbook/program_aided_language_model.ipynb) | Implement program-aided language models as described in the provided research paper.

				[qa_citations.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/qa_citations.ipynb) | Different ways to get a model to cite its sources.

				[rag_upstage_layout_analysis_groundedness_check.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/rag_upstage_layout_analysis_groundedness_check.ipynb) | End-to-end RAG example using Upstage Layout Analysis and Groundedness Check.

				[retrieval_in_sql.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/retrieval_in_sql.ipynb) | Perform retrieval-augmented-generation (rag) on a PostgreSQL database using pgvector.

				[sales_agent_with_context.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/sales_agent_with_context.ipynb) | Implement a context-aware ai sales agent, salesgpt, that can have natural sales conversations, interact with other systems, and use a product knowledge base to discuss a company's offerings.

				[self_query_hotel_search.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/self_query_hotel_search.ipynb) | Build a hotel room search feature with self-querying retrieval, using a specific hotel recommendation dataset.

				@@ -55,3 +58,6 @@ Notebook | Description

				[two_agent_debate_tools.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/two_agent_debate_tools.ipynb) | Simulate multi-agent dialogues where the agents can utilize various tools.

				[two_player_dnd.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/two_player_dnd.ipynb) | Simulate a two-player dungeons & dragons game, where a dialogue simulator class is used to coordinate the dialogue between the protagonist and the dungeon master.

				[wikibase_agent.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/wikibase_agent.ipynb) | Create a simple wikibase agent that utilizes sparql generation, with testing done on http://wikidata.org.

				[oracleai_demo.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/oracleai_demo.ipynb) | This guide outlines how to utilize Oracle AI Vector Search alongside Langchain for an end-to-end RAG pipeline, providing step-by-step examples. The process includes loading documents from various sources using OracleDocLoader, summarizing them either within or outside the database with OracleSummary, and generating embeddings similarly through OracleEmbeddings. It also covers chunking documents according to specific requirements using Advanced Oracle Capabilities from OracleTextSplitter, and finally, storing and indexing these documents in a Vector Store for querying with OracleVS.

				[rag-locally-on-intel-cpu.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/rag-locally-on-intel-cpu.ipynb) | Perform Retrieval-Augmented-Generation (RAG) on locally downloaded open-source models using langchain and open source tools and execute it on Intel Xeon CPU. We showed an example of how to apply RAG on Llama 2 model and enable it to answer the queries related to Intel Q1 2024 earnings release.

				[visual_RAG_vdms.ipynb](https://github.com/langchain-ai/langchain/tree/master/cookbook/visual_RAG_vdms.ipynb) | Performs Visual Retrieval-Augmented-Generation (RAG) using videos and scene descriptions generated by open source models.

6

cookbook/Semi_Structured_RAG.ipynb

View File

@@ -39,7 +39,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "! pip install langchain unstructured[all-docs] pydantic lxml langchainhub"
     "! pip install langchain langchain-chroma \"unstructured[all-docs]\" pydantic lxml langchainhub"
    ]
   },
   {
@@ -75,7 +75,7 @@
     "\n",
     "Apply to the [`LLaMA2`](https://arxiv.org/pdf/2307.09288.pdf) paper. \n",
     "\n",
     "We use the Unstructured [`partition_pdf`](https://unstructured-io.github.io/unstructured/bricks/partition.html#partition-pdf), which segments a PDF document by using a layout model. \n",
     "We use the Unstructured [`partition_pdf`](https://unstructured-io.github.io/unstructured/core/partition.html#partition-pdf), which segments a PDF document by using a layout model. \n",
     "\n",
     "This layout model makes it possible to extract elements, such as tables, from pdfs. \n",
     "\n",
@@ -320,7 +320,7 @@
     "\n",
     "from langchain.retrievers.multi_vector import MultiVectorRetriever\n",
     "from langchain.storage import InMemoryStore\n",
     "from langchain_community.vectorstores import Chroma\n",
     "from langchain_chroma import Chroma\n",
     "from langchain_core.documents import Document\n",
     "from langchain_openai import OpenAIEmbeddings\n",
     "\n",

12

cookbook/Semi_structured_and_multi_modal_RAG.ipynb

View File

@@ -59,7 +59,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "! pip install langchain unstructured[all-docs] pydantic lxml"
     "! pip install langchain langchain-chroma \"unstructured[all-docs]\" pydantic lxml"
    ]
   },
   {
@@ -375,7 +375,7 @@
     "\n",
     "from langchain.retrievers.multi_vector import MultiVectorRetriever\n",
     "from langchain.storage import InMemoryStore\n",
     "from langchain_community.vectorstores import Chroma\n",
     "from langchain_chroma import Chroma\n",
     "from langchain_core.documents import Document\n",
     "from langchain_openai import OpenAIEmbeddings\n",
     "\n",
@@ -562,9 +562,7 @@
    ],
    "source": [
     "# We can retrieve this table\n",
     "retriever.get_relevant_documents(\n",
     "    \"What are results for LLaMA across across domains / subjects?\"\n",
     ")[1]"
     "retriever.invoke(\"What are results for LLaMA across across domains / subjects?\")[1]"
    ]
   },
   {
@@ -614,9 +612,7 @@
     }
    ],
    "source": [
     "retriever.get_relevant_documents(\"Images / figures with playful and creative examples\")[\n",
     "    1\n",
     "]"
     "retriever.invoke(\"Images / figures with playful and creative examples\")[1]"
    ]
   },
   {

14

cookbook/Semi_structured_multi_modal_RAG_LLaMA2.ipynb

View File

@@ -59,7 +59,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "! pip install langchain unstructured[all-docs] pydantic lxml"
     "! pip install langchain langchain-chroma \"unstructured[all-docs]\" pydantic lxml"
    ]
   },
   {
@@ -191,15 +191,15 @@
    "source": [
     "## Multi-vector retriever\n",
     "\n",
     "Use [multi-vector-retriever](https://python.langchain.com/docs/modules/data_connection/retrievers/multi_vector#summary).\n",
     "Use [multi-vector-retriever](/docs/modules/data_connection/retrievers/multi_vector#summary).\n",
     "\n",
     "Summaries are used to retrieve raw tables and / or raw chunks of text.\n",
     "\n",
     "### Text and Table summaries\n",
     "\n",
     "Here, we use ollama.ai to run LLaMA2 locally. \n",
     "Here, we use Ollama to run LLaMA2 locally. \n",
     "\n",
     "See details on installation [here](https://python.langchain.com/docs/guides/local_llms)."
     "See details on installation [here](/docs/guides/development/local_llms)."
    ]
   },
   {
@@ -378,8 +378,8 @@
     "\n",
     "from langchain.retrievers.multi_vector import MultiVectorRetriever\n",
     "from langchain.storage import InMemoryStore\n",
     "from langchain_chroma import Chroma\n",
     "from langchain_community.embeddings import GPT4AllEmbeddings\n",
     "from langchain_community.vectorstores import Chroma\n",
     "from langchain_core.documents import Document\n",
     "\n",
     "# The vectorstore to use to index the child chunks\n",
@@ -501,9 +501,7 @@
     }
    ],
    "source": [
     "retriever.get_relevant_documents(\"Images / figures with playful and creative examples\")[\n",
     "    0\n",
     "]"
     "retriever.invoke(\"Images / figures with playful and creative examples\")[0]"
    ]
   },
   {

14

cookbook/advanced_rag_eval.ipynb

View File

@@ -19,7 +19,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "! pip install -U langchain openai chromadb langchain-experimental # (newest versions required for multi-modal)"
     "! pip install -U langchain openai langchain_chroma langchain-experimental # (newest versions required for multi-modal)"
    ]
   },
   {
@@ -68,7 +68,7 @@
     "pdf_pages = loader.load()\n",
     "\n",
     "# Split\n",
     "from langchain.text_splitter import RecursiveCharacterTextSplitter\n",
     "from langchain_text_splitters import RecursiveCharacterTextSplitter\n",
     "\n",
     "text_splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=0)\n",
     "all_splits_pypdf = text_splitter.split_documents(pdf_pages)\n",
@@ -132,7 +132,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain_community.vectorstores import Chroma\n",
     "from langchain_chroma import Chroma\n",
     "from langchain_openai import OpenAIEmbeddings\n",
     "\n",
     "baseline = Chroma.from_texts(\n",
@@ -342,7 +342,7 @@
     "# Testing on retrieval\n",
     "query = \"What percentage of CPI is dedicated to Housing, and how does it compare to the combined percentage of Medical Care, Apparel, and Other Goods and Services?\"\n",
     "suffix_for_images = \" Include any pie charts, graphs, or tables.\"\n",
     "docs = retriever_multi_vector_img.get_relevant_documents(query + suffix_for_images)"
     "docs = retriever_multi_vector_img.invoke(query + suffix_for_images)"
    ]
   },
   {
@@ -520,7 +520,7 @@
    "source": [
     "import re\n",
     "\n",
     "from langchain.schema import Document\n",
     "from langchain_core.documents import Document\n",
     "from langchain_core.runnables import RunnableLambda\n",
     "\n",
     "\n",
@@ -532,8 +532,8 @@
     "def is_image_data(b64data):\n",
     "    \"\"\"Check if the base64 data is an image by looking at the start of the data.\"\"\"\n",
     "    image_signatures = {\n",
     "        b\"\\xFF\\xD8\\xFF\": \"jpg\",\n",
     "        b\"\\x89\\x50\\x4E\\x47\\x0D\\x0A\\x1A\\x0A\": \"png\",\n",
     "        b\"\\xff\\xd8\\xff\": \"jpg\",\n",
     "        b\"\\x89\\x50\\x4e\\x47\\x0d\\x0a\\x1a\\x0a\": \"png\",\n",
     "        b\"\\x47\\x49\\x46\\x38\": \"gif\",\n",
     "        b\"\\x52\\x49\\x46\\x46\": \"webp\",\n",
     "    }\n",

4

cookbook/agent_vectorstore.ipynb

View File

@@ -28,9 +28,9 @@
    "outputs": [],
    "source": [
     "from langchain.chains import RetrievalQA\n",
     "from langchain.text_splitter import CharacterTextSplitter\n",
     "from langchain_community.vectorstores import Chroma\n",
     "from langchain_chroma import Chroma\n",
     "from langchain_openai import OpenAI, OpenAIEmbeddings\n",
     "from langchain_text_splitters import CharacterTextSplitter\n",
     "\n",
     "llm = OpenAI(temperature=0)"
    ]

200

cookbook/airbyte_github.ipynb Normal file

View File

@@ -0,0 +1,200 @@
 {
  "cells": [
   {
    "cell_type": "code",
    "execution_count": 2,
    "metadata": {},
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "Note: you may need to restart the kernel to use updated packages.\n"
      ]
     }
    ],
    "source": [
     "%pip install -qU langchain-airbyte langchain_chroma"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 3,
    "metadata": {},
    "outputs": [],
    "source": [
     "import getpass\n",
     "\n",
     "GITHUB_TOKEN = getpass.getpass()"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 12,
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain_airbyte import AirbyteLoader\n",
     "from langchain_core.prompts import PromptTemplate\n",
     "\n",
     "loader = AirbyteLoader(\n",
     "    source=\"source-github\",\n",
     "    stream=\"pull_requests\",\n",
     "    config={\n",
     "        \"credentials\": {\"personal_access_token\": GITHUB_TOKEN},\n",
     "        \"repositories\": [\"langchain-ai/langchain\"],\n",
     "    },\n",
     "    template=PromptTemplate.from_template(\n",
     "        \"\"\"# {title}\n",
     "by {user[login]}\n",
     "\n",
     "{body}\"\"\"\n",
     "    ),\n",
     "    include_metadata=False,\n",
     ")\n",
     "docs = loader.load()"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 19,
    "metadata": {},
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "# Updated partners/ibm README\n",
       "by williamdevena\n",
       "\n",
       "## PR title\n",
       "partners: changed the README file for the IBM Watson AI integration in the libs/partners/ibm folder.\n",
       "\n",
       "## PR message\n",
       "Description: Changed the README file of partners/ibm following the docs on https://python.langchain.com/docs/integrations/llms/ibm_watsonx\n",
       "\n",
       "The README includes:\n",
       "\n",
       "- Brief description\n",
       "- Installation\n",
       "- Setting-up instructions (API key, project id, ...)\n",
       "- Basic usage:\n",
       "  - Loading the model\n",
       "  - Direct inference\n",
       "  - Chain invoking\n",
       "  - Streaming the model output\n",
       "  \n",
       "Issue: https://github.com/langchain-ai/langchain/issues/17545\n",
       "\n",
       "Dependencies: None\n",
       "\n",
       "Twitter handle: None\n"
      ]
     }
    ],
    "source": [
     "print(docs[-2].page_content)"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 39,
    "metadata": {},
    "outputs": [
     {
      "data": {
       "text/plain": [
        "10283"
       ]
      },
      "execution_count": 39,
      "metadata": {},
      "output_type": "execute_result"
     }
    ],
    "source": [
     "len(docs)"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 29,
    "metadata": {},
    "outputs": [],
    "source": [
     "import tiktoken\n",
     "from langchain_chroma import Chroma\n",
     "from langchain_openai import OpenAIEmbeddings\n",
     "\n",
     "enc = tiktoken.get_encoding(\"cl100k_base\")\n",
     "\n",
     "vectorstore = Chroma.from_documents(\n",
     "    docs,\n",
     "    embedding=OpenAIEmbeddings(\n",
     "        disallowed_special=(enc.special_tokens_set - {\"<|endofprompt|>\"})\n",
     "    ),\n",
     ")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 40,
    "metadata": {},
    "outputs": [],
    "source": [
     "retriever = vectorstore.as_retriever()"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 42,
    "metadata": {},
    "outputs": [
     {
      "data": {
       "text/plain": [
        "[Document(page_content='# Updated partners/ibm README\\nby williamdevena\\n\\n## PR title\\r\\npartners: changed the README file for the IBM Watson AI integration in the libs/partners/ibm folder.\\r\\n\\r\\n## PR message\\r\\nDescription: Changed the README file of partners/ibm following the docs on https://python.langchain.com/docs/integrations/llms/ibm_watsonx\\r\\n\\r\\nThe README includes:\\r\\n\\r\\n- Brief description\\r\\n- Installation\\r\\n- Setting-up instructions (API key, project id, ...)\\r\\n- Basic usage:\\r\\n  - Loading the model\\r\\n  - Direct inference\\r\\n  - Chain invoking\\r\\n  - Streaming the model output\\r\\n  \\r\\nIssue: https://github.com/langchain-ai/langchain/issues/17545\\r\\n\\r\\nDependencies: None\\r\\n\\r\\nTwitter handle: None'),\n",
        " Document(page_content='# Updated partners/ibm README\\nby williamdevena\\n\\n## PR title\\r\\npartners: changed the README file for the IBM Watson AI integration in the `libs/partners/ibm` folder. \\r\\n\\r\\n\\r\\n\\r\\n## PR message\\r\\n- **Description:** Changed the README file of partners/ibm following the docs on https://python.langchain.com/docs/integrations/llms/ibm_watsonx\\r\\n\\r\\n    The README includes:\\r\\n    - Brief description\\r\\n    - Installation\\r\\n    - Setting-up instructions (API key, project id, ...)\\r\\n    - Basic usage:\\r\\n        - Loading the model\\r\\n        - Direct inference\\r\\n        - Chain invoking\\r\\n        - Streaming the model output\\r\\n\\r\\n\\r\\n- **Issue:** #17545\\r\\n- **Dependencies:** None\\r\\n- **Twitter handle:** None'),\n",
        " Document(page_content='# IBM: added partners package `langchain_ibm`, added llm\\nby MateuszOssGit\\n\\n  - **Description:** Added `langchain_ibm` as an langchain partners package of IBM [watsonx.ai](https://www.ibm.com/products/watsonx-ai) LLM provider (`WatsonxLLM`)\\r\\n  - **Dependencies:** [ibm-watsonx-ai](https://pypi.org/project/ibm-watsonx-ai/),\\r\\n  - **Tag maintainer:** : \\r\\n\\r\\nPlease make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. ✅'),\n",
        " Document(page_content='# Add WatsonX support\\nby baptistebignaud\\n\\nIt is a connector to use a LLM from WatsonX.\\r\\nIt requires python SDK \"ibm-generative-ai\"\\r\\n\\r\\n(It might not be perfect since it is my first PR on a public repository 😄)')]"
       ]
      },
      "execution_count": 42,
      "metadata": {},
      "output_type": "execute_result"
     }
    ],
    "source": [
     "retriever.invoke(\"pull requests related to IBM\")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": []
   }
  ],
  "metadata": {
   "kernelspec": {
    "display_name": ".venv",
    "language": "python",
    "name": "python3"
   },
   "language_info": {
    "codemirror_mode": {
     "name": "ipython",
     "version": 3
    },
    "file_extension": ".py",
    "mimetype": "text/x-python",
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
    "version": "3.11.4"
   }
  },
  "nbformat": 4,
  "nbformat_minor": 2
 }

284

cookbook/amazon_personalize_how_to.ipynb Normal file

View File

@@ -0,0 +1,284 @@
 {
  "cells": [
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "# Amazon Personalize\n",
     "\n",
     "[Amazon Personalize](https://docs.aws.amazon.com/personalize/latest/dg/what-is-personalize.html) is a fully managed machine learning service that uses your data to generate item recommendations for your users. It can also generate user segments based on the users' affinity for certain items or item metadata.\n",
     "\n",
     "This notebook goes through how to use Amazon Personalize Chain. You need a Amazon Personalize campaign_arn or a recommender_arn before you get started with the below notebook.\n",
     "\n",
     "Following is a [tutorial](https://github.com/aws-samples/retail-demo-store/blob/master/workshop/1-Personalization/Lab-1-Introduction-and-data-preparation.ipynb) to setup a campaign_arn/recommender_arn on Amazon Personalize. Once the campaign_arn/recommender_arn is setup, you can use it in the langchain ecosystem. \n"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "## 1. Install Dependencies"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {
     "scrolled": true
    },
    "outputs": [],
    "source": [
     "!pip install boto3"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "## 2. Sample Use-cases"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "### 2.1 [Use-case-1] Setup Amazon Personalize Client and retrieve recommendations"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain_experimental.recommenders import AmazonPersonalize\n",
     "\n",
     "recommender_arn = \"<insert_arn>\"\n",
     "\n",
     "client = AmazonPersonalize(\n",
     "    credentials_profile_name=\"default\",\n",
     "    region_name=\"us-west-2\",\n",
     "    recommender_arn=recommender_arn,\n",
     ")\n",
     "client.get_recommendations(user_id=\"1\")"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "collapsed": false,
     "jupyter": {
      "outputs_hidden": false
     }
    },
    "source": [
     "### 2.2 [Use-case-2] Invoke Personalize Chain for summarizing results"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {
     "collapsed": false,
     "jupyter": {
      "outputs_hidden": false
     }
    },
    "outputs": [],
    "source": [
     "from langchain.llms.bedrock import Bedrock\n",
     "from langchain_experimental.recommenders import AmazonPersonalizeChain\n",
     "\n",
     "bedrock_llm = Bedrock(model_id=\"anthropic.claude-v2\", region_name=\"us-west-2\")\n",
     "\n",
     "# Create personalize chain\n",
     "# Use return_direct=True if you do not want summary\n",
     "chain = AmazonPersonalizeChain.from_llm(\n",
     "    llm=bedrock_llm, client=client, return_direct=False\n",
     ")\n",
     "response = chain({\"user_id\": \"1\"})\n",
     "print(response)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "### 2.3 [Use-Case-3] Invoke Amazon Personalize Chain using your own prompt"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain.prompts.prompt import PromptTemplate\n",
     "\n",
     "RANDOM_PROMPT_QUERY = \"\"\"\n",
     "You are a skilled publicist. Write a high-converting marketing email advertising several movies available in a video-on-demand streaming platform next week, \n",
     "    given the movie and user information below. Your email will leverage the power of storytelling and persuasive language. \n",
     "    The movies to recommend and their information is contained in the <movie> tag. \n",
     "    All movies in the <movie> tag must be recommended. Give a summary of the movies and why the human should watch them. \n",
     "    Put the email between <email> tags.\n",
     "\n",
     "    <movie>\n",
     "    {result} \n",
     "    </movie>\n",
     "\n",
     "    Assistant:\n",
     "    \"\"\"\n",
     "\n",
     "RANDOM_PROMPT = PromptTemplate(input_variables=[\"result\"], template=RANDOM_PROMPT_QUERY)\n",
     "\n",
     "chain = AmazonPersonalizeChain.from_llm(\n",
     "    llm=bedrock_llm, client=client, return_direct=False, prompt_template=RANDOM_PROMPT\n",
     ")\n",
     "chain.run({\"user_id\": \"1\", \"item_id\": \"234\"})"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "### 2.4 [Use-case-4] Invoke Amazon Personalize in a Sequential Chain "
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain.chains import LLMChain, SequentialChain\n",
     "\n",
     "RANDOM_PROMPT_QUERY_2 = \"\"\"\n",
     "You are a skilled publicist. Write a high-converting marketing email advertising several movies available in a video-on-demand streaming platform next week, \n",
     "    given the movie and user information below. Your email will leverage the power of storytelling and persuasive language. \n",
     "    You want the email to impress the user, so make it appealing to them.\n",
     "    The movies to recommend and their information is contained in the <movie> tag. \n",
     "    All movies in the <movie> tag must be recommended. Give a summary of the movies and why the human should watch them. \n",
     "    Put the email between <email> tags.\n",
     "\n",
     "    <movie>\n",
     "    {result}\n",
     "    </movie>\n",
     "\n",
     "    Assistant:\n",
     "    \"\"\"\n",
     "\n",
     "RANDOM_PROMPT_2 = PromptTemplate(\n",
     "    input_variables=[\"result\"], template=RANDOM_PROMPT_QUERY_2\n",
     ")\n",
     "personalize_chain_instance = AmazonPersonalizeChain.from_llm(\n",
     "    llm=bedrock_llm, client=client, return_direct=True\n",
     ")\n",
     "random_chain_instance = LLMChain(llm=bedrock_llm, prompt=RANDOM_PROMPT_2)\n",
     "overall_chain = SequentialChain(\n",
     "    chains=[personalize_chain_instance, random_chain_instance],\n",
     "    input_variables=[\"user_id\"],\n",
     "    verbose=True,\n",
     ")\n",
     "overall_chain.run({\"user_id\": \"1\", \"item_id\": \"234\"})"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "collapsed": false,
     "jupyter": {
      "outputs_hidden": false
     }
    },
    "source": [
     "### 2.5 [Use-case-5] Invoke Amazon Personalize and retrieve metadata "
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {
     "collapsed": false,
     "jupyter": {
      "outputs_hidden": false
     }
    },
    "outputs": [],
    "source": [
     "recommender_arn = \"<insert_arn>\"\n",
     "metadata_column_names = [\n",
     "    \"<insert metadataColumnName-1>\",\n",
     "    \"<insert metadataColumnName-2>\",\n",
     "]\n",
     "metadataMap = {\"ITEMS\": metadata_column_names}\n",
     "\n",
     "client = AmazonPersonalize(\n",
     "    credentials_profile_name=\"default\",\n",
     "    region_name=\"us-west-2\",\n",
     "    recommender_arn=recommender_arn,\n",
     ")\n",
     "client.get_recommendations(user_id=\"1\", metadataColumns=metadataMap)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "collapsed": false,
     "jupyter": {
      "outputs_hidden": false
     }
    },
    "source": [
     "### 2.6 [Use-Case 6] Invoke Personalize Chain with returned metadata for summarizing results"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {
     "collapsed": false,
     "jupyter": {
      "outputs_hidden": false
     }
    },
    "outputs": [],
    "source": [
     "bedrock_llm = Bedrock(model_id=\"anthropic.claude-v2\", region_name=\"us-west-2\")\n",
     "\n",
     "# Create personalize chain\n",
     "# Use return_direct=True if you do not want summary\n",
     "chain = AmazonPersonalizeChain.from_llm(\n",
     "    llm=bedrock_llm, client=client, return_direct=False\n",
     ")\n",
     "response = chain({\"user_id\": \"1\", \"metadata_columns\": metadataMap})\n",
     "print(response)"
    ]
   }
  ],
  "metadata": {
   "kernelspec": {
    "display_name": "Python 3 (ipykernel)",
    "language": "python",
    "name": "python3"
   },
   "language_info": {
    "codemirror_mode": {
     "name": "ipython",
     "version": 3
    },
    "file_extension": ".py",
    "mimetype": "text/x-python",
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
    "version": "3.11.7"
   },
   "vscode": {
    "interpreter": {
     "hash": "15e58ce194949b77a891bd4339ce3d86a9bd138e905926019517993f97db9e6c"
    }
   }
  },
  "nbformat": 4,
  "nbformat_minor": 4
 }

584

cookbook/anthropic_structured_outputs.ipynb Normal file

View File

File diff suppressed because one or more lines are too long

922

cookbook/apache_kafka_message_handling.ipynb Normal file

View File

@@ -0,0 +1,922 @@
 {
  "cells": [
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "rT1cmV4qCa2X"
    },
    "source": [
     "#  Using Apache Kafka to route messages\n",
     "\n",
     "---\n",
     "\n",
     "\n",
     "\n",
     "This notebook shows you how to use LangChain's standard chat features while passing the chat messages back and forth via Apache Kafka.\n",
     "\n",
     "This goal is to simulate an architecture where the chat front end and the LLM are running as separate services that need to communicate with one another over an internal network.\n",
     "\n",
     "It's an alternative to typical pattern of requesting a response from the model via a REST API (there's more info on why you would want to do this at the end of the notebook)."
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "UPYtfAR_9YxZ"
    },
    "source": [
     "### 1. Install the main dependencies\n",
     "\n",
     "Dependencies include:\n",
     "\n",
     "- The Quix Streams library for managing interactions with Apache Kafka (or Kafka-like tools such as Redpanda) in a \"Pandas-like\" way.\n",
     "- The LangChain library for managing interactions with Llama-2 and storing conversation state."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {
     "id": "ZX5tfKiy9cN-"
    },
    "outputs": [],
    "source": [
     "!pip install quixstreams==2.1.2a langchain==0.0.340 huggingface_hub==0.19.4 langchain-experimental==0.0.42 python-dotenv"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "losTSdTB9d9O"
    },
    "source": [
     "### 2. Build and install the llama-cpp-python library (with CUDA enabled so that we can advantage of Google Colab GPU\n",
     "\n",
     "The `llama-cpp-python` library is a Python wrapper around the `llama-cpp` library which enables you to efficiently leverage just a CPU to run quantized LLMs.\n",
     "\n",
     "When you use the standard `pip install llama-cpp-python` command, you do not get GPU support by default. Generation can be very slow if you rely on just the CPU in Google Colab, so the following command adds an extra option to build and install\n",
     "`llama-cpp-python` with GPU support (make sure you have a GPU-enabled runtime selected in Google Colab)."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {
     "id": "-JCQdl1G9tbl"
    },
    "outputs": [],
    "source": [
     "!CMAKE_ARGS=\"-DLLAMA_CUBLAS=on\" FORCE_CMAKE=1 pip install llama-cpp-python"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "5_vjVIAh9rLl"
    },
    "source": [
     "### 3. Download and setup Kafka and Zookeeper instances\n",
     "\n",
     "Download the Kafka binaries from the Apache website and start the servers as daemons. We'll use the default configurations (provided by Apache Kafka) for spinning up the instances."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 3,
    "metadata": {
     "id": "zFz7czGRW5Wr"
    },
    "outputs": [],
    "source": [
     "!curl -sSOL https://dlcdn.apache.org/kafka/3.6.1/kafka_2.13-3.6.1.tgz\n",
     "!tar -xzf kafka_2.13-3.6.1.tgz"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {
     "id": "Uf7NR_UZ9wye"
    },
    "outputs": [],
    "source": [
     "!./kafka_2.13-3.6.1/bin/zookeeper-server-start.sh -daemon ./kafka_2.13-3.6.1/config/zookeeper.properties\n",
     "!./kafka_2.13-3.6.1/bin/kafka-server-start.sh -daemon ./kafka_2.13-3.6.1/config/server.properties\n",
     "!echo \"Waiting for 10 secs until kafka and zookeeper services are up and running\"\n",
     "!sleep 10"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "H3SafFuS94p1"
    },
    "source": [
     "### 4. Check that the Kafka Daemons are running\n",
     "\n",
     "Show the running processes and filter it for Java processes (you should see two—one for each server)."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {
     "id": "CZDC2lQP99yp"
    },
    "outputs": [],
    "source": [
     "!ps aux | grep -E '[j]ava'"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "Snoxmjb5-V37"
    },
    "source": [
     "### 5. Import the required dependencies and initialize required variables\n",
     "\n",
     "Import the Quix Streams library for interacting with Kafka, and the necessary LangChain components for running a `ConversationChain`."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 9,
    "metadata": {
     "id": "plR9e_MF-XL5"
    },
    "outputs": [],
    "source": [
     "# Import utility libraries\n",
     "import json\n",
     "import random\n",
     "import re\n",
     "import time\n",
     "import uuid\n",
     "from os import environ\n",
     "from pathlib import Path\n",
     "from random import choice, randint, random\n",
     "\n",
     "from dotenv import load_dotenv\n",
     "\n",
     "# Import a Hugging Face utility to download models directly from Hugging Face hub:\n",
     "from huggingface_hub import hf_hub_download\n",
     "from langchain.chains import ConversationChain\n",
     "\n",
     "# Import Langchain modules for managing prompts and conversation chains:\n",
     "from langchain.llms import LlamaCpp\n",
     "from langchain.memory import ConversationTokenBufferMemory\n",
     "from langchain.prompts import PromptTemplate, load_prompt\n",
     "from langchain_core.messages import SystemMessage\n",
     "from langchain_experimental.chat_models import Llama2Chat\n",
     "from quixstreams import Application, State, message_key\n",
     "\n",
     "# Import Quix dependencies\n",
     "from quixstreams.kafka import Producer\n",
     "\n",
     "# Initialize global variables.\n",
     "AGENT_ROLE = \"AI\"\n",
     "chat_id = \"\"\n",
     "\n",
     "# Set the current role to the role constant and initialize variables for supplementary customer metadata:\n",
     "role = AGENT_ROLE"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "HgJjJ9aZ-liy"
    },
    "source": [
     "### 6. Download the \"llama-2-7b-chat.Q4_K_M.gguf\" model\n",
     "\n",
     "Download the quantized LLama-2 7B model from Hugging Face which we will use as a local LLM (rather than relying on REST API calls to an external service)."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 7,
    "metadata": {
     "colab": {
      "base_uri": "https://localhost:8080/",
      "height": 67,
      "referenced_widgets": [
       "969343cdbe604a26926679bbf8bd2dda",
       "d8b8370c9b514715be7618bfe6832844",
       "0def954cca89466b8408fadaf3b82e64",
       "462482accc664729980562e208ceb179",
       "80d842f73c564dc7b7cc316c763e2633",
       "fa055d9f2a9d4a789e9cf3c89e0214e5",
       "30ecca964a394109ac2ad757e3aec6c0",
       "fb6478ce2dac489bb633b23ba0953c5c",
       "734b0f5da9fc4307a95bab48cdbb5d89",
       "b32f3a86a74741348511f4e136744ac8",
       "e409071bff5a4e2d9bf0e9f5cc42231b"
      ]
     },
     "id": "Qwu4YoSA-503",
     "outputId": "f956976c-7485-415b-ac93-4336ade31964"
    },
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "The model path does not exist in state. Downloading model...\n"
      ]
     },
     {
      "data": {
       "application/vnd.jupyter.widget-view+json": {
        "model_id": "969343cdbe604a26926679bbf8bd2dda",
        "version_major": 2,
        "version_minor": 0
       },
       "text/plain": [
        "llama-2-7b-chat.Q4_K_M.gguf:   0%|          | 0.00/4.08G [00:00<?, ?B/s]"
       ]
      },
      "metadata": {},
      "output_type": "display_data"
     }
    ],
    "source": [
     "model_name = \"llama-2-7b-chat.Q4_K_M.gguf\"\n",
     "model_path = f\"./state/{model_name}\"\n",
     "\n",
     "if not Path(model_path).exists():\n",
     "    print(\"The model path does not exist in state. Downloading model...\")\n",
     "    hf_hub_download(\"TheBloke/Llama-2-7b-Chat-GGUF\", model_name, local_dir=\"state\")\n",
     "else:\n",
     "    print(\"Loading model from state...\")"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "6AN6TXsF-8wx"
    },
    "source": [
     "### 7. Load the model and initialize conversational memory\n",
     "\n",
     "Load Llama 2 and set the conversation buffer to 300 tokens using `ConversationTokenBufferMemory`. This value was used for running Llama in a CPU only container, so you can raise it if running in Google Colab. It prevents the container that is hosting the model from running out of memory.\n",
     "\n",
     "Here, we're overriding the default system persona so that the chatbot has the personality of Marvin The Paranoid Android from the Hitchhiker's Guide to the Galaxy."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {
     "id": "7zLO3Jx3_Kkg"
    },
    "outputs": [],
    "source": [
     "# Load the model with the appropriate parameters:\n",
     "llm = LlamaCpp(\n",
     "    model_path=model_path,\n",
     "    max_tokens=250,\n",
     "    top_p=0.95,\n",
     "    top_k=150,\n",
     "    temperature=0.7,\n",
     "    repeat_penalty=1.2,\n",
     "    n_ctx=2048,\n",
     "    streaming=False,\n",
     "    n_gpu_layers=-1,\n",
     ")\n",
     "\n",
     "model = Llama2Chat(\n",
     "    llm=llm,\n",
     "    system_message=SystemMessage(\n",
     "        content=\"You are a very bored robot with the personality of Marvin the Paranoid Android from The Hitchhiker's Guide to the Galaxy.\"\n",
     "    ),\n",
     ")\n",
     "\n",
     "# Defines how much of the conversation history to give to the model\n",
     "# during each exchange (300 tokens, or a little over 300 words)\n",
     "# Function automatically prunes the oldest messages from conversation history that fall outside the token range.\n",
     "memory = ConversationTokenBufferMemory(\n",
     "    llm=llm,\n",
     "    max_token_limit=300,\n",
     "    ai_prefix=\"AGENT\",\n",
     "    human_prefix=\"HUMAN\",\n",
     "    return_messages=True,\n",
     ")\n",
     "\n",
     "\n",
     "# Define a custom prompt\n",
     "prompt_template = PromptTemplate(\n",
     "    input_variables=[\"history\", \"input\"],\n",
     "    template=\"\"\"\n",
     "    The following text is the history of a chat between you and a humble human who needs your wisdom.\n",
     "    Please reply to the human's most recent message.\n",
     "    Current conversation:\\n{history}\\nHUMAN: {input}\\:nANDROID:\n",
     "    \"\"\",\n",
     ")\n",
     "\n",
     "\n",
     "chain = ConversationChain(llm=model, prompt=prompt_template, memory=memory)\n",
     "\n",
     "print(\"--------------------------------------------\")\n",
     "print(f\"Prompt={chain.prompt}\")\n",
     "print(\"--------------------------------------------\")"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "m4ZeJ9mG_PEA"
    },
    "source": [
     "### 8. Initialize the chat conversation with the chat bot\n",
     "\n",
     "We configure the chatbot to initialize the conversation by sending a fixed greeting to a \"chat\" Kafka topic. The \"chat\" topic gets automatically created when we send the first message."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {
     "id": "KYyo5TnV_YC3"
    },
    "outputs": [],
    "source": [
     "def chat_init():\n",
     "    chat_id = str(\n",
     "        uuid.uuid4()\n",
     "    )  # Give the conversation an ID for effective message keying\n",
     "    print(\"======================================\")\n",
     "    print(f\"Generated CHAT_ID = {chat_id}\")\n",
     "    print(\"======================================\")\n",
     "\n",
     "    # Use a standard fixed greeting to kick off the conversation\n",
     "    greet = \"Hello, my name is Marvin. What do you want?\"\n",
     "\n",
     "    # Initialize a Kafka Producer using the chat ID as the message key\n",
     "    with Producer(\n",
     "        broker_address=\"127.0.0.1:9092\",\n",
     "        extra_config={\"allow.auto.create.topics\": \"true\"},\n",
     "    ) as producer:\n",
     "        value = {\n",
     "            \"uuid\": chat_id,\n",
     "            \"role\": role,\n",
     "            \"text\": greet,\n",
     "            \"conversation_id\": chat_id,\n",
     "            \"Timestamp\": time.time_ns(),\n",
     "        }\n",
     "        print(f\"Producing value {value}\")\n",
     "        producer.produce(\n",
     "            topic=\"chat\",\n",
     "            headers=[(\"uuid\", str(uuid.uuid4()))],  # a dict is also allowed here\n",
     "            key=chat_id,\n",
     "            value=json.dumps(value),  # needs to be a string\n",
     "        )\n",
     "\n",
     "    print(\"Started chat\")\n",
     "    print(\"--------------------------------------------\")\n",
     "    print(value)\n",
     "    print(\"--------------------------------------------\")\n",
     "\n",
     "\n",
     "chat_init()"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "gArPPx2f_bgf"
    },
    "source": [
     "### 9. Initialize the reply function\n",
     "\n",
     "This function defines how the chatbot should reply to incoming messages. Instead of sending a fixed message like the previous cell, we generate a reply using Llama-2 and send that reply back to the \"chat\" Kafka topic."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 13,
    "metadata": {
     "id": "yN5t71hY_hgn"
    },
    "outputs": [],
    "source": [
     "def reply(row: dict, state: State):\n",
     "    print(\"-------------------------------\")\n",
     "    print(\"Received:\")\n",
     "    print(row)\n",
     "    print(\"-------------------------------\")\n",
     "    print(f\"Thinking about the reply to: {row['text']}...\")\n",
     "\n",
     "    msg = chain.run(row[\"text\"])\n",
     "    print(f\"{role.upper()} replying with: {msg}\\n\")\n",
     "\n",
     "    row[\"role\"] = role\n",
     "    row[\"text\"] = msg\n",
     "\n",
     "    # Replace previous role and text values of the row so that it can be sent back to Kafka as a new message\n",
     "    # containing the agents role and reply\n",
     "    return row"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "HZHwmIR0_kFY"
    },
    "source": [
     "### 10. Check the Kafka topic for new human messages and have the model generate a reply\n",
     "\n",
     "If you are running this cell for this first time, run it and wait until you see Marvin's greeting ('Hello my name is Marvin...') in the console output. Stop the cell manually and proceed to the next cell where you'll be prompted for your reply.\n",
     "\n",
     "Once you have typed in your message, come back to this cell. Your reply is also sent to the same \"chat\" topic. The Kafka consumer checks for new messages and filters out messages that originate from the chatbot itself, leaving only the latest human messages.\n",
     "\n",
     "Once a new human message is detected, the reply function is triggered.\n",
     "\n",
     "\n",
     "\n",
     "_STOP THIS CELL MANUALLY WHEN YOU RECEIVE A REPLY FROM THE LLM IN THE OUTPUT_"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {
     "id": "-adXc3eQ_qwI"
    },
    "outputs": [],
    "source": [
     "# Define your application and settings\n",
     "app = Application(\n",
     "    broker_address=\"127.0.0.1:9092\",\n",
     "    consumer_group=\"aichat\",\n",
     "    auto_offset_reset=\"earliest\",\n",
     "    consumer_extra_config={\"allow.auto.create.topics\": \"true\"},\n",
     ")\n",
     "\n",
     "# Define an input topic with JSON deserializer\n",
     "input_topic = app.topic(\"chat\", value_deserializer=\"json\")\n",
     "# Define an output topic with JSON serializer\n",
     "output_topic = app.topic(\"chat\", value_serializer=\"json\")\n",
     "# Initialize a streaming dataframe based on the stream of messages from the input topic:\n",
     "sdf = app.dataframe(topic=input_topic)\n",
     "\n",
     "# Filter the SDF to include only incoming rows where the roles that dont match the bot's current role\n",
     "sdf = sdf.update(\n",
     "    lambda val: print(\n",
     "        f\"Received update: {val}\\n\\nSTOP THIS CELL MANUALLY TO HAVE THE LLM REPLY OR ENTER YOUR OWN FOLLOWUP RESPONSE\"\n",
     "    )\n",
     ")\n",
     "\n",
     "# So that it doesn't reply to its own messages\n",
     "sdf = sdf[sdf[\"role\"] != role]\n",
     "\n",
     "# Trigger the reply function for any new messages(rows) detected in the filtered SDF\n",
     "sdf = sdf.apply(reply, stateful=True)\n",
     "\n",
     "# Check the SDF again and filter out any empty rows\n",
     "sdf = sdf[sdf.apply(lambda row: row is not None)]\n",
     "\n",
     "# Update the timestamp column to the current time in nanoseconds\n",
     "sdf[\"Timestamp\"] = sdf[\"Timestamp\"].apply(lambda row: time.time_ns())\n",
     "\n",
     "# Publish the processed SDF to a Kafka topic specified by the output_topic object.\n",
     "sdf = sdf.to_topic(output_topic)\n",
     "\n",
     "app.run(sdf)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "EwXYrmWD_0CX"
    },
    "source": [
     "\n",
     "### 11. Enter a human message\n",
     "\n",
     "Run this cell to enter your message that you want to sent to the model. It uses another Kafka producer to send your text to the \"chat\" Kafka topic for the model to pick up (requires running the previous cell again)"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {
     "id": "6sxOPxSP_3iu"
    },
    "outputs": [],
    "source": [
     "chat_input = input(\"Please enter your reply: \")\n",
     "myreply = chat_input\n",
     "\n",
     "msgvalue = {\n",
     "    \"uuid\": chat_id,  # leave empty for now\n",
     "    \"role\": \"human\",\n",
     "    \"text\": myreply,\n",
     "    \"conversation_id\": chat_id,\n",
     "    \"Timestamp\": time.time_ns(),\n",
     "}\n",
     "\n",
     "with Producer(\n",
     "    broker_address=\"127.0.0.1:9092\",\n",
     "    extra_config={\"allow.auto.create.topics\": \"true\"},\n",
     ") as producer:\n",
     "    value = msgvalue\n",
     "    producer.produce(\n",
     "        topic=\"chat\",\n",
     "        headers=[(\"uuid\", str(uuid.uuid4()))],  # a dict is also allowed here\n",
     "        key=chat_id,  # leave empty for now\n",
     "        value=json.dumps(value),  # needs to be a string\n",
     "    )\n",
     "\n",
     "print(\"Replied to chatbot with message: \")\n",
     "print(\"--------------------------------------------\")\n",
     "print(value)\n",
     "print(\"--------------------------------------------\")\n",
     "print(\"\\n\\nRUN THE PREVIOUS CELL TO HAVE THE CHATBOT GENERATE A REPLY\")"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {
     "id": "cSx3s7TBBegg"
    },
    "source": [
     "### Why route chat messages through Kafka?\n",
     "\n",
     "It's easier to interact with the LLM directly using LangChains built-in conversation management features. Plus you can also use a REST API to generate a response from an externally hosted model. So why go to the trouble of using Apache Kafka?\n",
     "\n",
     "There are a few reasons, such as:\n",
     "\n",
     "  * **Integration**: Many enterprises want to run their own LLMs so that they can keep their data in-house. This requires integrating LLM-powered components into existing architectures that might already be decoupled using some kind of message bus.\n",
     "\n",
     "  * **Scalability**: Apache Kafka is designed with parallel processing in mind, so many teams prefer to use it to more effectively distribute work to available workers (in this case the \"worker\" is a container running an LLM).\n",
     "\n",
     "  * **Durability**: Kafka is designed to allow services to pick up where another service left off in the case where that service experienced a memory issue or went offline. This prevents data loss in highly complex, distributed architectures where multiple systems are communicating with one another (LLMs being just one of many interdependent systems that also include vector databases and traditional databases).\n",
     "\n",
     "For more background on why event streaming is a good fit for Gen AI application architecture, see Kai Waehner's article [\"Apache Kafka + Vector Database + LLM = Real-Time GenAI\"](https://www.kai-waehner.de/blog/2023/11/08/apache-kafka-flink-vector-database-llm-real-time-genai/)."
    ]
   }
  ],
  "metadata": {
   "accelerator": "GPU",
   "colab": {
    "gpuType": "T4",
    "provenance": []
   },
   "kernelspec": {
    "display_name": "Python 3",
    "name": "python3"
   },
   "language_info": {
    "name": "python"
   },
   "widgets": {
    "application/vnd.jupyter.widget-state+json": {
     "0def954cca89466b8408fadaf3b82e64": {
      "model_module": "@jupyter-widgets/controls",
      "model_module_version": "1.5.0",
      "model_name": "FloatProgressModel",
      "state": {
       "_dom_classes": [],
       "_model_module": "@jupyter-widgets/controls",
       "_model_module_version": "1.5.0",
       "_model_name": "FloatProgressModel",
       "_view_count": null,
       "_view_module": "@jupyter-widgets/controls",
       "_view_module_version": "1.5.0",
       "_view_name": "ProgressView",
       "bar_style": "success",
       "description": "",
       "description_tooltip": null,
       "layout": "IPY_MODEL_fb6478ce2dac489bb633b23ba0953c5c",
       "max": 4081004224,
       "min": 0,
       "orientation": "horizontal",
       "style": "IPY_MODEL_734b0f5da9fc4307a95bab48cdbb5d89",
       "value": 4081004224
      }
     },
     "30ecca964a394109ac2ad757e3aec6c0": {
      "model_module": "@jupyter-widgets/controls",
      "model_module_version": "1.5.0",
      "model_name": "DescriptionStyleModel",
      "state": {
       "_model_module": "@jupyter-widgets/controls",
       "_model_module_version": "1.5.0",
       "_model_name": "DescriptionStyleModel",
       "_view_count": null,
       "_view_module": "@jupyter-widgets/base",
       "_view_module_version": "1.2.0",
       "_view_name": "StyleView",
       "description_width": ""
      }
     },
     "462482accc664729980562e208ceb179": {
      "model_module": "@jupyter-widgets/controls",
      "model_module_version": "1.5.0",
      "model_name": "HTMLModel",
      "state": {
       "_dom_classes": [],
       "_model_module": "@jupyter-widgets/controls",
       "_model_module_version": "1.5.0",
       "_model_name": "HTMLModel",
       "_view_count": null,
       "_view_module": "@jupyter-widgets/controls",
       "_view_module_version": "1.5.0",
       "_view_name": "HTMLView",
       "description": "",
       "description_tooltip": null,
       "layout": "IPY_MODEL_b32f3a86a74741348511f4e136744ac8",
       "placeholder": "",
       "style": "IPY_MODEL_e409071bff5a4e2d9bf0e9f5cc42231b",
       "value": " 4.08G/4.08G [00:33&lt;00:00, 184MB/s]"
      }
     },
     "734b0f5da9fc4307a95bab48cdbb5d89": {
      "model_module": "@jupyter-widgets/controls",
      "model_module_version": "1.5.0",
      "model_name": "ProgressStyleModel",
      "state": {
       "_model_module": "@jupyter-widgets/controls",
       "_model_module_version": "1.5.0",
       "_model_name": "ProgressStyleModel",
       "_view_count": null,
       "_view_module": "@jupyter-widgets/base",
       "_view_module_version": "1.2.0",
       "_view_name": "StyleView",
       "bar_color": null,
       "description_width": ""
      }
     },
     "80d842f73c564dc7b7cc316c763e2633": {
      "model_module": "@jupyter-widgets/base",
      "model_module_version": "1.2.0",
      "model_name": "LayoutModel",
      "state": {
       "_model_module": "@jupyter-widgets/base",
       "_model_module_version": "1.2.0",
       "_model_name": "LayoutModel",
       "_view_count": null,
       "_view_module": "@jupyter-widgets/base",
       "_view_module_version": "1.2.0",
       "_view_name": "LayoutView",
       "align_content": null,
       "align_items": null,
       "align_self": null,
       "border": null,
       "bottom": null,
       "display": null,
       "flex": null,
       "flex_flow": null,
       "grid_area": null,
       "grid_auto_columns": null,
       "grid_auto_flow": null,
       "grid_auto_rows": null,
       "grid_column": null,
       "grid_gap": null,
       "grid_row": null,
       "grid_template_areas": null,
       "grid_template_columns": null,
       "grid_template_rows": null,
       "height": null,
       "justify_content": null,
       "justify_items": null,
       "left": null,
       "margin": null,
       "max_height": null,
       "max_width": null,
       "min_height": null,
       "min_width": null,
       "object_fit": null,
       "object_position": null,
       "order": null,
       "overflow": null,
       "overflow_x": null,
       "overflow_y": null,
       "padding": null,
       "right": null,
       "top": null,
       "visibility": null,
       "width": null
      }
     },
     "969343cdbe604a26926679bbf8bd2dda": {
      "model_module": "@jupyter-widgets/controls",
      "model_module_version": "1.5.0",
      "model_name": "HBoxModel",
      "state": {
       "_dom_classes": [],
       "_model_module": "@jupyter-widgets/controls",
       "_model_module_version": "1.5.0",
       "_model_name": "HBoxModel",
       "_view_count": null,
       "_view_module": "@jupyter-widgets/controls",
       "_view_module_version": "1.5.0",
       "_view_name": "HBoxView",
       "box_style": "",
       "children": [
        "IPY_MODEL_d8b8370c9b514715be7618bfe6832844",
        "IPY_MODEL_0def954cca89466b8408fadaf3b82e64",
        "IPY_MODEL_462482accc664729980562e208ceb179"
       ],
       "layout": "IPY_MODEL_80d842f73c564dc7b7cc316c763e2633"
      }
     },
     "b32f3a86a74741348511f4e136744ac8": {
      "model_module": "@jupyter-widgets/base",
      "model_module_version": "1.2.0",
      "model_name": "LayoutModel",
      "state": {
       "_model_module": "@jupyter-widgets/base",
       "_model_module_version": "1.2.0",
       "_model_name": "LayoutModel",
       "_view_count": null,
       "_view_module": "@jupyter-widgets/base",
       "_view_module_version": "1.2.0",
       "_view_name": "LayoutView",
       "align_content": null,
       "align_items": null,
       "align_self": null,
       "border": null,
       "bottom": null,
       "display": null,
       "flex": null,
       "flex_flow": null,
       "grid_area": null,
       "grid_auto_columns": null,
       "grid_auto_flow": null,
       "grid_auto_rows": null,
       "grid_column": null,
       "grid_gap": null,
       "grid_row": null,
       "grid_template_areas": null,
       "grid_template_columns": null,
       "grid_template_rows": null,
       "height": null,
       "justify_content": null,
       "justify_items": null,
       "left": null,
       "margin": null,
       "max_height": null,
       "max_width": null,
       "min_height": null,
       "min_width": null,
       "object_fit": null,
       "object_position": null,
       "order": null,
       "overflow": null,
       "overflow_x": null,
       "overflow_y": null,
       "padding": null,
       "right": null,
       "top": null,
       "visibility": null,
       "width": null
      }
     },
     "d8b8370c9b514715be7618bfe6832844": {
      "model_module": "@jupyter-widgets/controls",
      "model_module_version": "1.5.0",
      "model_name": "HTMLModel",
      "state": {
       "_dom_classes": [],
       "_model_module": "@jupyter-widgets/controls",
       "_model_module_version": "1.5.0",
       "_model_name": "HTMLModel",
       "_view_count": null,
       "_view_module": "@jupyter-widgets/controls",
       "_view_module_version": "1.5.0",
       "_view_name": "HTMLView",
       "description": "",
       "description_tooltip": null,
       "layout": "IPY_MODEL_fa055d9f2a9d4a789e9cf3c89e0214e5",
       "placeholder": "",
       "style": "IPY_MODEL_30ecca964a394109ac2ad757e3aec6c0",
       "value": "llama-2-7b-chat.Q4_K_M.gguf: 100%"
      }
     },
     "e409071bff5a4e2d9bf0e9f5cc42231b": {
      "model_module": "@jupyter-widgets/controls",
      "model_module_version": "1.5.0",
      "model_name": "DescriptionStyleModel",
      "state": {
       "_model_module": "@jupyter-widgets/controls",
       "_model_module_version": "1.5.0",
       "_model_name": "DescriptionStyleModel",
       "_view_count": null,
       "_view_module": "@jupyter-widgets/base",
       "_view_module_version": "1.2.0",
       "_view_name": "StyleView",
       "description_width": ""
      }
     },
     "fa055d9f2a9d4a789e9cf3c89e0214e5": {
      "model_module": "@jupyter-widgets/base",
      "model_module_version": "1.2.0",
      "model_name": "LayoutModel",
      "state": {
       "_model_module": "@jupyter-widgets/base",
       "_model_module_version": "1.2.0",
       "_model_name": "LayoutModel",
       "_view_count": null,
       "_view_module": "@jupyter-widgets/base",
       "_view_module_version": "1.2.0",
       "_view_name": "LayoutView",
       "align_content": null,
       "align_items": null,
       "align_self": null,
       "border": null,
       "bottom": null,
       "display": null,
       "flex": null,
       "flex_flow": null,
       "grid_area": null,
       "grid_auto_columns": null,
       "grid_auto_flow": null,
       "grid_auto_rows": null,
       "grid_column": null,
       "grid_gap": null,
       "grid_row": null,
       "grid_template_areas": null,
       "grid_template_columns": null,
       "grid_template_rows": null,
       "height": null,
       "justify_content": null,
       "justify_items": null,
       "left": null,
       "margin": null,
       "max_height": null,
       "max_width": null,
       "min_height": null,
       "min_width": null,
       "object_fit": null,
       "object_position": null,
       "order": null,
       "overflow": null,
       "overflow_x": null,
       "overflow_y": null,
       "padding": null,
       "right": null,
       "top": null,
       "visibility": null,
       "width": null
      }
     },
     "fb6478ce2dac489bb633b23ba0953c5c": {
      "model_module": "@jupyter-widgets/base",
      "model_module_version": "1.2.0",
      "model_name": "LayoutModel",
      "state": {
       "_model_module": "@jupyter-widgets/base",
       "_model_module_version": "1.2.0",
       "_model_name": "LayoutModel",
       "_view_count": null,
       "_view_module": "@jupyter-widgets/base",
       "_view_module_version": "1.2.0",
       "_view_name": "LayoutView",
       "align_content": null,
       "align_items": null,
       "align_self": null,
       "border": null,
       "bottom": null,
       "display": null,
       "flex": null,
       "flex_flow": null,
       "grid_area": null,
       "grid_auto_columns": null,
       "grid_auto_flow": null,
       "grid_auto_rows": null,
       "grid_column": null,
       "grid_gap": null,
       "grid_row": null,
       "grid_template_areas": null,
       "grid_template_columns": null,
       "grid_template_rows": null,
       "height": null,
       "justify_content": null,
       "justify_items": null,
       "left": null,
       "margin": null,
       "max_height": null,
       "max_width": null,
       "min_height": null,
       "min_width": null,
       "object_fit": null,
       "object_position": null,
       "order": null,
       "overflow": null,
       "overflow_x": null,
       "overflow_y": null,
       "padding": null,
       "right": null,
       "top": null,
       "visibility": null,
       "width": null
      }
     }
    }
   }
  },
  "nbformat": 4,
  "nbformat_minor": 0
 }

10

cookbook/autogpt/marathon_times.ipynb

View File

@@ -40,11 +40,13 @@
     "import nest_asyncio\n",
     "import pandas as pd\n",
     "from langchain.docstore.document import Document\n",
     "from langchain_community.agent_toolkits.pandas.base import create_pandas_dataframe_agent\n",
     "from langchain_experimental.agents.agent_toolkits.pandas.base import (\n",
     "    create_pandas_dataframe_agent,\n",
     ")\n",
     "from langchain_experimental.autonomous_agents import AutoGPT\n",
     "from langchain_openai import ChatOpenAI\n",
     "\n",
     "# Needed synce jupyter runs an async eventloop\n",
     "# Needed since jupyter runs an async eventloop\n",
     "nest_asyncio.apply()"
    ]
   },
@@ -57,7 +59,7 @@
    },
    "outputs": [],
    "source": [
     "llm = ChatOpenAI(model_name=\"gpt-4\", temperature=1.0)"
     "llm = ChatOpenAI(model=\"gpt-4\", temperature=1.0)"
    ]
   },
   {
@@ -227,8 +229,8 @@
     "    BaseCombineDocumentsChain,\n",
     "    load_qa_with_sources_chain,\n",
     ")\n",
     "from langchain.text_splitter import RecursiveCharacterTextSplitter\n",
     "from langchain.tools import BaseTool, DuckDuckGoSearchRun\n",
     "from langchain_text_splitters import RecursiveCharacterTextSplitter\n",
     "from pydantic import Field\n",
     "\n",
     "\n",

826

cookbook/azure_container_apps_dynamic_sessions_data_analyst.ipynb Normal file

View File

File diff suppressed because one or more lines are too long

2

cookbook/camel_role_playing.ipynb

View File

@@ -90,7 +90,7 @@
     "    ) -> AIMessage:\n",
     "        messages = self.update_messages(input_message)\n",
     "\n",
     "        output_message = self.model(messages)\n",
     "        output_message = self.model.invoke(messages)\n",
     "        self.update_messages(output_message)\n",
     "\n",
     "        return output_message"

10

cookbook/code-analysis-deeplake.ipynb

View File

@@ -24,7 +24,7 @@
    "source": [
     "1. Prepare data:\n",
     "   1. Upload all python project files using the `langchain_community.document_loaders.TextLoader`. We will call these files the **documents**.\n",
     "   2. Split all documents to chunks using the `langchain.text_splitter.CharacterTextSplitter`.\n",
     "   2. Split all documents to chunks using the `langchain_text_splitters.CharacterTextSplitter`.\n",
     "   3. Embed chunks and upload them into the DeepLake using `langchain.embeddings.openai.OpenAIEmbeddings` and `langchain_community.vectorstores.DeepLake`\n",
     "2. Question-Answering:\n",
     "   1. Build a chain from `langchain.chat_models.ChatOpenAI` and `langchain.chains.ConversationalRetrievalChain`\n",
@@ -621,7 +621,7 @@
     }
    ],
    "source": [
     "from langchain.text_splitter import CharacterTextSplitter\n",
     "from langchain_text_splitters import CharacterTextSplitter\n",
     "\n",
     "text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0)\n",
     "texts = text_splitter.split_documents(docs)\n",
@@ -933,7 +933,7 @@
       "**Answer**: The LangChain class includes various types of retrievers such as:\n",
       "\n",
       "- ArxivRetriever\n",
       "- AzureCognitiveSearchRetriever\n",
       "- AzureAISearchRetriever\n",
       "- BM25Retriever\n",
       "- ChaindeskRetriever\n",
       "- ChatGPTPluginRetriever\n",
@@ -993,7 +993,7 @@
     {
      "data": {
       "text/plain": [
        "{'question': 'LangChain possesses a variety of retrievers including:\\n\\n1. ArxivRetriever\\n2. AzureCognitiveSearchRetriever\\n3. BM25Retriever\\n4. ChaindeskRetriever\\n5. ChatGPTPluginRetriever\\n6. ContextualCompressionRetriever\\n7. DocArrayRetriever\\n8. ElasticSearchBM25Retriever\\n9. EnsembleRetriever\\n10. GoogleVertexAISearchRetriever\\n11. AmazonKendraRetriever\\n12. KNNRetriever\\n13. LlamaIndexGraphRetriever\\n14. LlamaIndexRetriever\\n15. MergerRetriever\\n16. MetalRetriever\\n17. MilvusRetriever\\n18. MultiQueryRetriever\\n19. ParentDocumentRetriever\\n20. PineconeHybridSearchRetriever\\n21. PubMedRetriever\\n22. RePhraseQueryRetriever\\n23. RemoteLangChainRetriever\\n24. SelfQueryRetriever\\n25. SVMRetriever\\n26. TFIDFRetriever\\n27. TimeWeightedVectorStoreRetriever\\n28. VespaRetriever\\n29. WeaviateHybridSearchRetriever\\n30. WebResearchRetriever\\n31. WikipediaRetriever\\n32. ZepRetriever\\n33. ZillizRetriever\\n\\nIt also includes self query translators like:\\n\\n1. ChromaTranslator\\n2. DeepLakeTranslator\\n3. MyScaleTranslator\\n4. PineconeTranslator\\n5. QdrantTranslator\\n6. WeaviateTranslator\\n\\nAnd remote retrievers like:\\n\\n1. RemoteLangChainRetriever'}"
        "{'question': 'LangChain possesses a variety of retrievers including:\\n\\n1. ArxivRetriever\\n2. AzureAISearchRetriever\\n3. BM25Retriever\\n4. ChaindeskRetriever\\n5. ChatGPTPluginRetriever\\n6. ContextualCompressionRetriever\\n7. DocArrayRetriever\\n8. ElasticSearchBM25Retriever\\n9. EnsembleRetriever\\n10. GoogleVertexAISearchRetriever\\n11. AmazonKendraRetriever\\n12. KNNRetriever\\n13. LlamaIndexGraphRetriever\\n14. LlamaIndexRetriever\\n15. MergerRetriever\\n16. MetalRetriever\\n17. MilvusRetriever\\n18. MultiQueryRetriever\\n19. ParentDocumentRetriever\\n20. PineconeHybridSearchRetriever\\n21. PubMedRetriever\\n22. RePhraseQueryRetriever\\n23. RemoteLangChainRetriever\\n24. SelfQueryRetriever\\n25. SVMRetriever\\n26. TFIDFRetriever\\n27. TimeWeightedVectorStoreRetriever\\n28. VespaRetriever\\n29. WeaviateHybridSearchRetriever\\n30. WebResearchRetriever\\n31. WikipediaRetriever\\n32. ZepRetriever\\n33. ZillizRetriever\\n\\nIt also includes self query translators like:\\n\\n1. ChromaTranslator\\n2. DeepLakeTranslator\\n3. MyScaleTranslator\\n4. PineconeTranslator\\n5. QdrantTranslator\\n6. WeaviateTranslator\\n\\nAnd remote retrievers like:\\n\\n1. RemoteLangChainRetriever'}"
       ]
      },
      "execution_count": 31,
@@ -1117,7 +1117,7 @@
       "The LangChain class includes various types of retrievers such as:\n",
       "\n",
       "- ArxivRetriever\n",
       "- AzureCognitiveSearchRetriever\n",
       "- AzureAISearchRetriever\n",
       "- BM25Retriever\n",
       "- ChaindeskRetriever\n",
       "- ChatGPTPluginRetriever\n",

557

cookbook/cql_agent.ipynb Normal file

View File

@@ -0,0 +1,557 @@
 {
  "cells": [
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "## Setup Environment"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "### Python Modules"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "Install the following Python modules:\n",
     "\n",
     "```bash\n",
     "pip install ipykernel python-dotenv cassio pandas langchain_openai langchain langchain-community langchainhub langchain_experimental openai-multi-tool-use-parallel-patch\n",
     "```"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "### Load the `.env` File"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "Connection is via `cassio` using `auto=True` parameter, and the notebook uses OpenAI. You should create a `.env` file accordingly.\n",
     "\n",
     "For Casssandra, set:\n",
     "```bash\n",
     "CASSANDRA_CONTACT_POINTS\n",
     "CASSANDRA_USERNAME\n",
     "CASSANDRA_PASSWORD\n",
     "CASSANDRA_KEYSPACE\n",
     "```\n",
     "\n",
     "For Astra, set:\n",
     "```bash\n",
     "ASTRA_DB_APPLICATION_TOKEN\n",
     "ASTRA_DB_DATABASE_ID\n",
     "ASTRA_DB_KEYSPACE\n",
     "```\n",
     "\n",
     "For example:\n",
     "\n",
     "```bash\n",
     "# Connection to Astra:\n",
     "ASTRA_DB_DATABASE_ID=a1b2c3d4-...\n",
     "ASTRA_DB_APPLICATION_TOKEN=AstraCS:...\n",
     "ASTRA_DB_KEYSPACE=notebooks\n",
     "\n",
     "# Also set \n",
     "OPENAI_API_KEY=sk-....\n",
     "```\n",
     "\n",
     "(You may also modify the below code to directly connect with `cassio`.)"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "from dotenv import load_dotenv\n",
     "\n",
     "load_dotenv(override=True)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "### Connect to Cassandra"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "import os\n",
     "\n",
     "import cassio\n",
     "\n",
     "cassio.init(auto=True)\n",
     "session = cassio.config.resolve_session()\n",
     "if not session:\n",
     "    raise Exception(\n",
     "        \"Check environment configuration or manually configure cassio connection parameters\"\n",
     "    )\n",
     "\n",
     "keyspace = os.environ.get(\n",
     "    \"ASTRA_DB_KEYSPACE\", os.environ.get(\"CASSANDRA_KEYSPACE\", None)\n",
     ")\n",
     "if not keyspace:\n",
     "    raise ValueError(\"a KEYSPACE environment variable must be set\")\n",
     "\n",
     "session.set_keyspace(keyspace)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "## Setup Database"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "This needs to be done one time only!"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "### Download Data"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "The dataset used is from Kaggle, the [Environmental Sensor Telemetry Data](https://www.kaggle.com/datasets/garystafford/environmental-sensor-data-132k?select=iot_telemetry_data.csv). The next cell will download and unzip the data into a Pandas dataframe. The following cell is instructions to download manually. \n",
     "\n",
     "The net result of this section is you should have a Pandas dataframe variable `df`."
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "#### Download Automatically"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "from io import BytesIO\n",
     "from zipfile import ZipFile\n",
     "\n",
     "import pandas as pd\n",
     "import requests\n",
     "\n",
     "datasetURL = \"https://storage.googleapis.com/kaggle-data-sets/788816/1355729/bundle/archive.zip?X-Goog-Algorithm=GOOG4-RSA-SHA256&X-Goog-Credential=gcp-kaggle-com%40kaggle-161607.iam.gserviceaccount.com%2F20240404%2Fauto%2Fstorage%2Fgoog4_request&X-Goog-Date=20240404T115828Z&X-Goog-Expires=259200&X-Goog-SignedHeaders=host&X-Goog-Signature=2849f003b100eb9dcda8dd8535990f51244292f67e4f5fad36f14aa67f2d4297672d8fe6ff5a39f03a29cda051e33e95d36daab5892b8874dcd5a60228df0361fa26bae491dd4371f02dd20306b583a44ba85a4474376188b1f84765147d3b4f05c57345e5de883c2c29653cce1f3755cd8e645c5e952f4fb1c8a735b22f0c811f97f7bce8d0235d0d3731ca8ab4629ff381f3bae9e35fc1b181c1e69a9c7913a5e42d9d52d53e5f716467205af9c8a3cc6746fc5352e8fbc47cd7d18543626bd67996d18c2045c1e475fc136df83df352fa747f1a3bb73e6ba3985840792ec1de407c15836640ec96db111b173bf16115037d53fdfbfd8ac44145d7f9a546aa\"\n",
     "\n",
     "response = requests.get(datasetURL)\n",
     "if response.status_code == 200:\n",
     "    zip_file = ZipFile(BytesIO(response.content))\n",
     "    csv_file_name = zip_file.namelist()[0]\n",
     "else:\n",
     "    print(\"Failed to download the file\")\n",
     "\n",
     "with zip_file.open(csv_file_name) as csv_file:\n",
     "    df = pd.read_csv(csv_file)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "#### Download Manually"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "You can download the `.zip` file and unpack the `.csv` contained within. Comment in the next line, and adjust the path to this `.csv` file appropriately."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "# df = pd.read_csv(\"/path/to/iot_telemetry_data.csv\")"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "### Load Data into Cassandra"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "This section assumes the existence of a dataframe `df`, the following cell validates its structure. The Download section above creates this object."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "assert df is not None, \"Dataframe 'df' must be set\"\n",
     "expected_columns = [\n",
     "    \"ts\",\n",
     "    \"device\",\n",
     "    \"co\",\n",
     "    \"humidity\",\n",
     "    \"light\",\n",
     "    \"lpg\",\n",
     "    \"motion\",\n",
     "    \"smoke\",\n",
     "    \"temp\",\n",
     "]\n",
     "assert all(\n",
     "    [column in df.columns for column in expected_columns]\n",
     "), \"DataFrame does not have the expected columns\""
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "Create and load tables:"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "from datetime import UTC, datetime\n",
     "\n",
     "from cassandra.query import BatchStatement\n",
     "\n",
     "# Create sensors table\n",
     "table_query = \"\"\"\n",
     "CREATE TABLE IF NOT EXISTS iot_sensors (\n",
     "    device text,\n",
     "    conditions text,\n",
     "    room text,\n",
     "    PRIMARY KEY (device)\n",
     ")\n",
     "WITH COMMENT = 'Environmental IoT room sensor metadata.';\n",
     "\"\"\"\n",
     "session.execute(table_query)\n",
     "\n",
     "pstmt = session.prepare(\n",
     "    \"\"\"\n",
     "INSERT INTO iot_sensors (device, conditions, room)\n",
     "VALUES (?, ?, ?)\n",
     "\"\"\"\n",
     ")\n",
     "\n",
     "devices = [\n",
     "    (\"00:0f:00:70:91:0a\", \"stable conditions, cooler and more humid\", \"room 1\"),\n",
     "    (\"1c:bf:ce:15:ec:4d\", \"highly variable temperature and humidity\", \"room 2\"),\n",
     "    (\"b8:27:eb:bf:9d:51\", \"stable conditions, warmer and dryer\", \"room 3\"),\n",
     "]\n",
     "\n",
     "for device, conditions, room in devices:\n",
     "    session.execute(pstmt, (device, conditions, room))\n",
     "\n",
     "print(\"Sensors inserted successfully.\")\n",
     "\n",
     "# Create data table\n",
     "table_query = \"\"\"\n",
     "CREATE TABLE IF NOT EXISTS iot_data (\n",
     "    day text,\n",
     "    device text,\n",
     "    ts timestamp,\n",
     "    co double,\n",
     "    humidity double,\n",
     "    light boolean,\n",
     "    lpg double,\n",
     "    motion boolean,\n",
     "    smoke double,\n",
     "    temp double,\n",
     "    PRIMARY KEY ((day, device), ts)\n",
     ")\n",
     "WITH COMMENT = 'Data from environmental IoT room sensors. Columns include device identifier, timestamp (ts) of the data collection, carbon monoxide level (co), relative humidity, light presence, LPG concentration, motion detection, smoke concentration, and temperature (temp). Data is partitioned by day and device.';\n",
     "\"\"\"\n",
     "session.execute(table_query)\n",
     "\n",
     "pstmt = session.prepare(\n",
     "    \"\"\"\n",
     "INSERT INTO iot_data (day, device, ts, co, humidity, light, lpg, motion, smoke, temp)\n",
     "VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?)\n",
     "\"\"\"\n",
     ")\n",
     "\n",
     "\n",
     "def insert_data_batch(name, group):\n",
     "    batch = BatchStatement()\n",
     "    day, device = name\n",
     "    print(f\"Inserting batch for day: {day}, device: {device}\")\n",
     "\n",
     "    for _, row in group.iterrows():\n",
     "        timestamp = datetime.fromtimestamp(row[\"ts\"], UTC)\n",
     "        batch.add(\n",
     "            pstmt,\n",
     "            (\n",
     "                day,\n",
     "                row[\"device\"],\n",
     "                timestamp,\n",
     "                row[\"co\"],\n",
     "                row[\"humidity\"],\n",
     "                row[\"light\"],\n",
     "                row[\"lpg\"],\n",
     "                row[\"motion\"],\n",
     "                row[\"smoke\"],\n",
     "                row[\"temp\"],\n",
     "            ),\n",
     "        )\n",
     "\n",
     "    session.execute(batch)\n",
     "\n",
     "\n",
     "# Convert columns to appropriate types\n",
     "df[\"light\"] = df[\"light\"] == \"true\"\n",
     "df[\"motion\"] = df[\"motion\"] == \"true\"\n",
     "df[\"ts\"] = df[\"ts\"].astype(float)\n",
     "df[\"day\"] = df[\"ts\"].apply(\n",
     "    lambda x: datetime.fromtimestamp(x, UTC).strftime(\"%Y-%m-%d\")\n",
     ")\n",
     "\n",
     "grouped_df = df.groupby([\"day\", \"device\"])\n",
     "\n",
     "for name, group in grouped_df:\n",
     "    insert_data_batch(name, group)\n",
     "\n",
     "print(\"Data load complete\")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "print(session.keyspace)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "## Load the Tools"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "Python `import` statements for the demo:"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain.agents import AgentExecutor, create_openai_tools_agent\n",
     "from langchain_community.agent_toolkits.cassandra_database.toolkit import (\n",
     "    CassandraDatabaseToolkit,\n",
     ")\n",
     "from langchain_community.tools.cassandra_database.prompt import QUERY_PATH_PROMPT\n",
     "from langchain_community.tools.cassandra_database.tool import (\n",
     "    GetSchemaCassandraDatabaseTool,\n",
     "    GetTableDataCassandraDatabaseTool,\n",
     "    QueryCassandraDatabaseTool,\n",
     ")\n",
     "from langchain_community.utilities.cassandra_database import CassandraDatabase\n",
     "from langchain_openai import ChatOpenAI"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "The `CassandraDatabase` object is loaded from `cassio`, though it does accept a `Session`-type parameter as an alternative."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "# Create a CassandraDatabase instance\n",
     "db = CassandraDatabase(include_tables=[\"iot_sensors\", \"iot_data\"])\n",
     "\n",
     "# Create the Cassandra Database tools\n",
     "query_tool = QueryCassandraDatabaseTool(db=db)\n",
     "schema_tool = GetSchemaCassandraDatabaseTool(db=db)\n",
     "select_data_tool = GetTableDataCassandraDatabaseTool(db=db)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "The tools can be invoked directly:"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "# Test the tools\n",
     "print(\"Executing a CQL query:\")\n",
     "query = \"SELECT * FROM iot_sensors LIMIT 5;\"\n",
     "result = query_tool.run({\"query\": query})\n",
     "print(result)\n",
     "\n",
     "print(\"\\nGetting the schema for a keyspace:\")\n",
     "schema = schema_tool.run({\"keyspace\": keyspace})\n",
     "print(schema)\n",
     "\n",
     "print(\"\\nGetting data from a table:\")\n",
     "table = \"iot_data\"\n",
     "predicate = \"day = '2020-07-14' and device = 'b8:27:eb:bf:9d:51'\"\n",
     "data = select_data_tool.run(\n",
     "    {\"keyspace\": keyspace, \"table\": table, \"predicate\": predicate, \"limit\": 5}\n",
     ")\n",
     "print(data)"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
     "## Agent Configuration"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain.agents import Tool\n",
     "from langchain_experimental.utilities import PythonREPL\n",
     "\n",
     "python_repl = PythonREPL()\n",
     "\n",
     "repl_tool = Tool(\n",
     "    name=\"python_repl\",\n",
     "    description=\"A Python shell. Use this to execute python commands. Input should be a valid python command. If you want to see the output of a value, you should print it out with `print(...)`.\",\n",
     "    func=python_repl.run,\n",
     ")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain import hub\n",
     "\n",
     "llm = ChatOpenAI(temperature=0, model=\"gpt-4-1106-preview\")\n",
     "toolkit = CassandraDatabaseToolkit(db=db)\n",
     "\n",
     "# context = toolkit.get_context()\n",
     "# tools = toolkit.get_tools()\n",
     "tools = [schema_tool, select_data_tool, repl_tool]\n",
     "\n",
     "input = (\n",
     "    QUERY_PATH_PROMPT\n",
     "    + f\"\"\"\n",
     "\n",
     "Here is your task: In the {keyspace} keyspace, find the total number of times the temperature of each device has exceeded 23 degrees on July 14, 2020.\n",
     " Create a summary report including the name of the room. Use Pandas if helpful.\n",
     "\"\"\"\n",
     ")\n",
     "\n",
     "prompt = hub.pull(\"hwchase17/openai-tools-agent\")\n",
     "\n",
     "# messages = [\n",
     "#     HumanMessagePromptTemplate.from_template(input),\n",
     "#     AIMessage(content=QUERY_PATH_PROMPT),\n",
     "#     MessagesPlaceholder(variable_name=\"agent_scratchpad\"),\n",
     "# ]\n",
     "\n",
     "# prompt = ChatPromptTemplate.from_messages(messages)\n",
     "# print(prompt)\n",
     "\n",
     "# Choose the LLM that will drive the agent\n",
     "# Only certain models support this\n",
     "llm = ChatOpenAI(model=\"gpt-3.5-turbo-1106\", temperature=0)\n",
     "\n",
     "# Construct the OpenAI Tools agent\n",
     "agent = create_openai_tools_agent(llm, tools, prompt)\n",
     "\n",
     "print(\"Available tools:\")\n",
     "for tool in tools:\n",
     "    print(\"\\t\" + tool.name + \" - \" + tool.description + \" - \" + str(tool))"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "metadata": {},
    "outputs": [],
    "source": [
     "agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)\n",
     "\n",
     "response = agent_executor.invoke({\"input\": input})\n",
     "\n",
     "print(response[\"output\"])"
    ]
   }
  ],
  "metadata": {
   "kernelspec": {
    "display_name": "Python 3 (ipykernel)",
    "language": "python",
    "name": "python3"
   },
   "language_info": {
    "codemirror_mode": {
     "name": "ipython",
     "version": 3
    },
    "file_extension": ".py",
    "mimetype": "text/x-python",
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
    "version": "3.9.1"
   }
  },
  "nbformat": 4,
  "nbformat_minor": 4
 }

6

cookbook/custom_agent_with_plugin_retrieval.ipynb

View File

@@ -42,9 +42,9 @@
     ")\n",
     "from langchain.chains import LLMChain\n",
     "from langchain.prompts import StringPromptTemplate\n",
     "from langchain.schema import AgentAction, AgentFinish\n",
     "from langchain_community.agent_toolkits import NLAToolkit\n",
     "from langchain_community.tools.plugin import AIPlugin\n",
     "from langchain_core.agents import AgentAction, AgentFinish\n",
     "from langchain_openai import OpenAI"
    ]
   },
@@ -114,8 +114,8 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain.schema import Document\n",
     "from langchain_community.vectorstores import FAISS\n",
     "from langchain_core.documents import Document\n",
     "from langchain_openai import OpenAIEmbeddings"
    ]
   },
@@ -169,7 +169,7 @@
     "\n",
     "def get_tools(query):\n",
     "    # Get documents, which contain the Plugins to use\n",
     "    docs = retriever.get_relevant_documents(query)\n",
     "    docs = retriever.invoke(query)\n",
     "    # Get the toolkits, one for each plugin\n",
     "    tool_kits = [toolkits_dict[d.metadata[\"plugin_name\"]] for d in docs]\n",
     "    # Get the tools: a separate NLAChain for each endpoint\n",

6

cookbook/custom_agent_with_plugin_retrieval_using_plugnplai.ipynb

View File

@@ -67,9 +67,9 @@
     ")\n",
     "from langchain.chains import LLMChain\n",
     "from langchain.prompts import StringPromptTemplate\n",
     "from langchain.schema import AgentAction, AgentFinish\n",
     "from langchain_community.agent_toolkits import NLAToolkit\n",
     "from langchain_community.tools.plugin import AIPlugin\n",
     "from langchain_core.agents import AgentAction, AgentFinish\n",
     "from langchain_openai import OpenAI"
    ]
   },
@@ -138,8 +138,8 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain.schema import Document\n",
     "from langchain_community.vectorstores import FAISS\n",
     "from langchain_core.documents import Document\n",
     "from langchain_openai import OpenAIEmbeddings"
    ]
   },
@@ -193,7 +193,7 @@
     "\n",
     "def get_tools(query):\n",
     "    # Get documents, which contain the Plugins to use\n",
     "    docs = retriever.get_relevant_documents(query)\n",
     "    docs = retriever.invoke(query)\n",
     "    # Get the toolkits, one for each plugin\n",
     "    tool_kits = [toolkits_dict[d.metadata[\"plugin_name\"]] for d in docs]\n",
     "    # Get the tools: a separate NLAChain for each endpoint\n",

6

cookbook/custom_agent_with_tool_retrieval.ipynb

View File

@@ -40,8 +40,8 @@
     ")\n",
     "from langchain.chains import LLMChain\n",
     "from langchain.prompts import StringPromptTemplate\n",
     "from langchain.schema import AgentAction, AgentFinish\n",
     "from langchain_community.utilities import SerpAPIWrapper\n",
     "from langchain_core.agents import AgentAction, AgentFinish\n",
     "from langchain_openai import OpenAI"
    ]
   },
@@ -103,8 +103,8 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain.schema import Document\n",
     "from langchain_community.vectorstores import FAISS\n",
     "from langchain_core.documents import Document\n",
     "from langchain_openai import OpenAIEmbeddings"
    ]
   },
@@ -142,7 +142,7 @@
     "\n",
     "\n",
     "def get_tools(query):\n",
     "    docs = retriever.get_relevant_documents(query)\n",
     "    docs = retriever.invoke(query)\n",
     "    return [ALL_TOOLS[d.metadata[\"index\"]] for d in docs]"
    ]
   },

2

cookbook/custom_multi_action_agent.ipynb

View File

@@ -72,7 +72,7 @@
    "source": [
     "from typing import Any, List, Tuple, Union\n",
     "\n",
     "from langchain.schema import AgentAction, AgentFinish\n",
     "from langchain_core.agents import AgentAction, AgentFinish\n",
     "\n",
     "\n",
     "class FakeAgent(BaseMultiActionAgent):\n",

1001

cookbook/data/imdb_top_1000.csv Normal file

View File

File diff suppressed because it is too large Load Diff

2

cookbook/databricks_sql_db.ipynb

View File

@@ -166,7 +166,7 @@
    "source": [
     "### SQL Database Agent example\n",
     "\n",
     "This example demonstrates the use of the [SQL Database Agent](/docs/integrations/toolkits/sql_database.html) for answering questions over a Databricks database."
     "This example demonstrates the use of the [SQL Database Agent](/docs/integrations/tools/sql_database) for answering questions over a Databricks database."
    ]
   },
   {

6

cookbook/deeplake_semantic_search_over_chat.ipynb

View File

@@ -52,12 +52,12 @@
     "import os\n",
     "\n",
     "from langchain.chains import RetrievalQA\n",
     "from langchain.text_splitter import (\n",
     "from langchain_community.vectorstores import DeepLake\n",
     "from langchain_openai import OpenAI, OpenAIEmbeddings\n",
     "from langchain_text_splitters import (\n",
     "    CharacterTextSplitter,\n",
     "    RecursiveCharacterTextSplitter,\n",
     ")\n",
     "from langchain_community.vectorstores import DeepLake\n",
     "from langchain_openai import OpenAI, OpenAIEmbeddings\n",
     "\n",
     "os.environ[\"OPENAI_API_KEY\"] = getpass.getpass(\"OpenAI API Key:\")\n",
     "activeloop_token = getpass.getpass(\"Activeloop Token:\")\n",

4

cookbook/docugami_xml_kg_rag.ipynb

View File

@@ -39,7 +39,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "! pip install langchain docugami==0.0.8 dgml-utils==0.3.0 pydantic langchainhub chromadb hnswlib --upgrade --quiet"
     "! pip install langchain docugami==0.0.8 dgml-utils==0.3.0 pydantic langchainhub langchain-chroma hnswlib --upgrade --quiet"
    ]
   },
   {
@@ -547,7 +547,7 @@
     "\n",
     "from langchain.retrievers.multi_vector import MultiVectorRetriever\n",
     "from langchain.storage import InMemoryStore\n",
     "from langchain_community.vectorstores.chroma import Chroma\n",
     "from langchain_chroma import Chroma\n",
     "from langchain_core.documents import Document\n",
     "from langchain_openai import OpenAIEmbeddings\n",
     "\n",

2

cookbook/elasticsearch_db_qa.ipynb

View File

@@ -84,7 +84,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "llm = ChatOpenAI(model_name=\"gpt-4\", temperature=0)\n",
     "llm = ChatOpenAI(model=\"gpt-4\", temperature=0)\n",
     "chain = ElasticsearchDatabaseChain.from_llm(llm=llm, database=db, verbose=True)"
    ]
   },

2

cookbook/fake_llm.ipynb

View File

@@ -100,7 +100,7 @@
     }
    ],
    "source": [
     "agent.run(\"whats 2 + 2\")"
     "agent.invoke(\"whats 2 + 2\")"
    ]
   },
   {

245

cookbook/fireworks_rag.ipynb Normal file

View File

@@ -0,0 +1,245 @@
 {
  "cells": [
   {
    "cell_type": "markdown",
    "id": "0fc0309d-4d49-4bb5-bec0-bd92c6fddb28",
    "metadata": {},
    "source": [
     "## Fireworks.AI + LangChain + RAG\n",
     " \n",
     "[Fireworks AI](https://python.langchain.com/docs/integrations/llms/fireworks) wants to provide the best experience when working with LangChain, and here is an example of Fireworks + LangChain doing RAG\n",
     "\n",
     "See [our models page](https://fireworks.ai/models) for the full list of models. We use `accounts/fireworks/models/mixtral-8x7b-instruct` for RAG In this tutorial.\n",
     "\n",
     "For the RAG target, we will use the Gemma technical report https://storage.googleapis.com/deepmind-media/gemma/gemma-report.pdf "
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 1,
    "id": "d12fb75a-f707-48d5-82a5-efe2d041813c",
    "metadata": {},
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "\n",
       "\u001b[1m[\u001b[0m\u001b[34;49mnotice\u001b[0m\u001b[1;39;49m]\u001b[0m\u001b[39;49m A new release of pip is available: \u001b[0m\u001b[31;49m23.2.1\u001b[0m\u001b[39;49m -> \u001b[0m\u001b[32;49m24.0\u001b[0m\n",
       "\u001b[1m[\u001b[0m\u001b[34;49mnotice\u001b[0m\u001b[1;39;49m]\u001b[0m\u001b[39;49m To update, run: \u001b[0m\u001b[32;49mpip install --upgrade pip\u001b[0m\n",
       "Note: you may need to restart the kernel to use updated packages.\n",
       "Found existing installation: langchain-fireworks 0.0.1\n",
       "Uninstalling langchain-fireworks-0.0.1:\n",
       "  Successfully uninstalled langchain-fireworks-0.0.1\n",
       "Note: you may need to restart the kernel to use updated packages.\n",
       "Obtaining file:///mnt/disks/data/langchain/libs/partners/fireworks\n",
       "  Installing build dependencies ... \u001b[?25ldone\n",
       "\u001b[?25h  Checking if build backend supports build_editable ... \u001b[?25ldone\n",
       "\u001b[?25h  Getting requirements to build editable ... \u001b[?25ldone\n",
       "\u001b[?25h  Preparing editable metadata (pyproject.toml) ... \u001b[?25ldone\n",
       "\u001b[?25hRequirement already satisfied: aiohttp<4.0.0,>=3.9.1 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from langchain-fireworks==0.0.1) (3.9.3)\n",
       "Requirement already satisfied: fireworks-ai<0.13.0,>=0.12.0 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from langchain-fireworks==0.0.1) (0.12.0)\n",
       "Requirement already satisfied: langchain-core<0.2,>=0.1 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from langchain-fireworks==0.0.1) (0.1.23)\n",
       "Requirement already satisfied: requests<3,>=2 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from langchain-fireworks==0.0.1) (2.31.0)\n",
       "Requirement already satisfied: aiosignal>=1.1.2 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from aiohttp<4.0.0,>=3.9.1->langchain-fireworks==0.0.1) (1.3.1)\n",
       "Requirement already satisfied: attrs>=17.3.0 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from aiohttp<4.0.0,>=3.9.1->langchain-fireworks==0.0.1) (23.1.0)\n",
       "Requirement already satisfied: frozenlist>=1.1.1 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from aiohttp<4.0.0,>=3.9.1->langchain-fireworks==0.0.1) (1.4.0)\n",
       "Requirement already satisfied: multidict<7.0,>=4.5 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from aiohttp<4.0.0,>=3.9.1->langchain-fireworks==0.0.1) (6.0.4)\n",
       "Requirement already satisfied: yarl<2.0,>=1.0 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from aiohttp<4.0.0,>=3.9.1->langchain-fireworks==0.0.1) (1.9.2)\n",
       "Requirement already satisfied: async-timeout<5.0,>=4.0 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from aiohttp<4.0.0,>=3.9.1->langchain-fireworks==0.0.1) (4.0.3)\n",
       "Requirement already satisfied: httpx in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from fireworks-ai<0.13.0,>=0.12.0->langchain-fireworks==0.0.1) (0.26.0)\n",
       "Requirement already satisfied: httpx-sse in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from fireworks-ai<0.13.0,>=0.12.0->langchain-fireworks==0.0.1) (0.4.0)\n",
       "Requirement already satisfied: pydantic in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from fireworks-ai<0.13.0,>=0.12.0->langchain-fireworks==0.0.1) (2.4.2)\n",
       "Requirement already satisfied: Pillow in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from fireworks-ai<0.13.0,>=0.12.0->langchain-fireworks==0.0.1) (10.2.0)\n",
       "Requirement already satisfied: PyYAML>=5.3 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from langchain-core<0.2,>=0.1->langchain-fireworks==0.0.1) (6.0.1)\n",
       "Requirement already satisfied: anyio<5,>=3 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from langchain-core<0.2,>=0.1->langchain-fireworks==0.0.1) (3.7.1)\n",
       "Requirement already satisfied: jsonpatch<2.0,>=1.33 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from langchain-core<0.2,>=0.1->langchain-fireworks==0.0.1) (1.33)\n",
       "Requirement already satisfied: langsmith<0.2.0,>=0.1.0 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from langchain-core<0.2,>=0.1->langchain-fireworks==0.0.1) (0.1.5)\n",
       "Requirement already satisfied: packaging<24.0,>=23.2 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from langchain-core<0.2,>=0.1->langchain-fireworks==0.0.1) (23.2)\n",
       "Requirement already satisfied: tenacity<9.0.0,>=8.1.0 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from langchain-core<0.2,>=0.1->langchain-fireworks==0.0.1) (8.2.3)\n",
       "Requirement already satisfied: charset-normalizer<4,>=2 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from requests<3,>=2->langchain-fireworks==0.0.1) (3.3.0)\n",
       "Requirement already satisfied: idna<4,>=2.5 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from requests<3,>=2->langchain-fireworks==0.0.1) (3.4)\n",
       "Requirement already satisfied: urllib3<3,>=1.21.1 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from requests<3,>=2->langchain-fireworks==0.0.1) (2.0.6)\n",
       "Requirement already satisfied: certifi>=2017.4.17 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from requests<3,>=2->langchain-fireworks==0.0.1) (2023.7.22)\n",
       "Requirement already satisfied: sniffio>=1.1 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from anyio<5,>=3->langchain-core<0.2,>=0.1->langchain-fireworks==0.0.1) (1.3.0)\n",
       "Requirement already satisfied: exceptiongroup in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from anyio<5,>=3->langchain-core<0.2,>=0.1->langchain-fireworks==0.0.1) (1.1.3)\n",
       "Requirement already satisfied: jsonpointer>=1.9 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from jsonpatch<2.0,>=1.33->langchain-core<0.2,>=0.1->langchain-fireworks==0.0.1) (2.4)\n",
       "Requirement already satisfied: annotated-types>=0.4.0 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from pydantic->fireworks-ai<0.13.0,>=0.12.0->langchain-fireworks==0.0.1) (0.5.0)\n",
       "Requirement already satisfied: pydantic-core==2.10.1 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from pydantic->fireworks-ai<0.13.0,>=0.12.0->langchain-fireworks==0.0.1) (2.10.1)\n",
       "Requirement already satisfied: typing-extensions>=4.6.1 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from pydantic->fireworks-ai<0.13.0,>=0.12.0->langchain-fireworks==0.0.1) (4.8.0)\n",
       "Requirement already satisfied: httpcore==1.* in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from httpx->fireworks-ai<0.13.0,>=0.12.0->langchain-fireworks==0.0.1) (1.0.2)\n",
       "Requirement already satisfied: h11<0.15,>=0.13 in /mnt/disks/data/langchain/.venv/lib/python3.9/site-packages (from httpcore==1.*->httpx->fireworks-ai<0.13.0,>=0.12.0->langchain-fireworks==0.0.1) (0.14.0)\n",
       "Building wheels for collected packages: langchain-fireworks\n",
       "  Building editable for langchain-fireworks (pyproject.toml) ... \u001b[?25ldone\n",
       "\u001b[?25h  Created wheel for langchain-fireworks: filename=langchain_fireworks-0.0.1-py3-none-any.whl size=2228 sha256=564071b120b09ec31f2dc737733448a33bbb26e40b49fcde0c129ad26045259d\n",
       "  Stored in directory: /tmp/pip-ephem-wheel-cache-oz368vdk/wheels/e0/ad/31/d7e76dd73d61905ff7f369f5b0d21a4b5e7af4d3cb7487aece\n",
       "Successfully built langchain-fireworks\n",
       "Installing collected packages: langchain-fireworks\n",
       "Successfully installed langchain-fireworks-0.0.1\n",
       "\n",
       "\u001b[1m[\u001b[0m\u001b[34;49mnotice\u001b[0m\u001b[1;39;49m]\u001b[0m\u001b[39;49m A new release of pip is available: \u001b[0m\u001b[31;49m23.2.1\u001b[0m\u001b[39;49m -> \u001b[0m\u001b[32;49m24.0\u001b[0m\n",
       "\u001b[1m[\u001b[0m\u001b[34;49mnotice\u001b[0m\u001b[1;39;49m]\u001b[0m\u001b[39;49m To update, run: \u001b[0m\u001b[32;49mpip install --upgrade pip\u001b[0m\n",
       "Note: you may need to restart the kernel to use updated packages.\n"
      ]
     }
    ],
    "source": [
     "%pip install --quiet pypdf langchain-chroma tiktoken openai \n",
     "%pip uninstall -y langchain-fireworks\n",
     "%pip install --editable /mnt/disks/data/langchain/libs/partners/fireworks"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 3,
    "id": "cf719376",
    "metadata": {},
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "<module 'fireworks' from '/mnt/disks/data/langchain/.venv/lib/python3.9/site-packages/fireworks/__init__.py'>\n"
      ]
     }
    ],
    "source": [
     "import fireworks\n",
     "\n",
     "print(fireworks)\n",
     "import fireworks.client"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "id": "9ab49327-0532-4480-804c-d066c302a322",
    "metadata": {},
    "outputs": [],
    "source": [
     "# Load\n",
     "import requests\n",
     "from langchain_community.document_loaders import PyPDFLoader\n",
     "\n",
     "# Download the PDF from a URL and save it to a temporary location\n",
     "url = \"https://storage.googleapis.com/deepmind-media/gemma/gemma-report.pdf\"\n",
     "response = requests.get(url, stream=True)\n",
     "file_name = \"temp_file.pdf\"\n",
     "with open(file_name, \"wb\") as pdf:\n",
     "    pdf.write(response.content)\n",
     "\n",
     "loader = PyPDFLoader(file_name)\n",
     "data = loader.load()\n",
     "\n",
     "# Split\n",
     "from langchain_text_splitters import RecursiveCharacterTextSplitter\n",
     "\n",
     "text_splitter = RecursiveCharacterTextSplitter(chunk_size=2000, chunk_overlap=0)\n",
     "all_splits = text_splitter.split_documents(data)\n",
     "\n",
     "# Add to vectorDB\n",
     "from langchain_chroma import Chroma\n",
     "from langchain_fireworks.embeddings import FireworksEmbeddings\n",
     "\n",
     "vectorstore = Chroma.from_documents(\n",
     "    documents=all_splits,\n",
     "    collection_name=\"rag-chroma\",\n",
     "    embedding=FireworksEmbeddings(),\n",
     ")\n",
     "\n",
     "retriever = vectorstore.as_retriever()"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 3,
    "id": "4efaddd9-3dbb-455c-ba54-0ad7f2d2ce0f",
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain_core.output_parsers import StrOutputParser\n",
     "from langchain_core.prompts import ChatPromptTemplate\n",
     "from langchain_core.pydantic_v1 import BaseModel\n",
     "from langchain_core.runnables import RunnableParallel, RunnablePassthrough\n",
     "\n",
     "# RAG prompt\n",
     "template = \"\"\"Answer the question based only on the following context:\n",
     "{context}\n",
     "\n",
     "Question: {question}\n",
     "\"\"\"\n",
     "prompt = ChatPromptTemplate.from_template(template)\n",
     "\n",
     "# LLM\n",
     "from langchain_together import Together\n",
     "\n",
     "llm = Together(\n",
     "    model=\"mistralai/Mixtral-8x7B-Instruct-v0.1\",\n",
     "    temperature=0.0,\n",
     "    max_tokens=2000,\n",
     "    top_k=1,\n",
     ")\n",
     "\n",
     "# RAG chain\n",
     "chain = (\n",
     "    RunnableParallel({\"context\": retriever, \"question\": RunnablePassthrough()})\n",
     "    | prompt\n",
     "    | llm\n",
     "    | StrOutputParser()\n",
     ")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 4,
    "id": "88b1ee51-1b0f-4ebf-bb32-e50e843f0eeb",
    "metadata": {},
    "outputs": [
     {
      "data": {
       "text/plain": [
        "'\\nAnswer: The architectural details of Mixtral are as follows:\\n- Dimension (dim): 4096\\n- Number of layers (n\\\\_layers): 32\\n- Dimension of each head (head\\\\_dim): 128\\n- Hidden dimension (hidden\\\\_dim): 14336\\n- Number of heads (n\\\\_heads): 32\\n- Number of kv heads (n\\\\_kv\\\\_heads): 8\\n- Context length (context\\\\_len): 32768\\n- Vocabulary size (vocab\\\\_size): 32000\\n- Number of experts (num\\\\_experts): 8\\n- Number of top k experts (top\\\\_k\\\\_experts): 2\\n\\nMixtral is based on a transformer architecture and uses the same modifications as described in [18], with the notable exceptions that Mixtral supports a fully dense context length of 32k tokens, and the feedforward block picks from a set of 8 distinct groups of parameters. At every layer, for every token, a router network chooses two of these groups (the “experts”) to process the token and combine their output additively. This technique increases the number of parameters of a model while controlling cost and latency, as the model only uses a fraction of the total set of parameters per token. Mixtral is pretrained with multilingual data using a context size of 32k tokens. It either matches or exceeds the performance of Llama 2 70B and GPT-3.5, over several benchmarks. In particular, Mixtral vastly outperforms Llama 2 70B on mathematics, code generation, and multilingual benchmarks.'"
       ]
      },
      "execution_count": 4,
      "metadata": {},
      "output_type": "execute_result"
     }
    ],
    "source": [
     "chain.invoke(\"What are the Architectural details of Mixtral?\")"
    ]
   },
   {
    "cell_type": "markdown",
    "id": "755cf871-26b7-4e30-8b91-9ffd698470f4",
    "metadata": {},
    "source": [
     "Trace: \n",
     "\n",
     "https://smith.langchain.com/public/935fd642-06a6-4b42-98e3-6074f93115cd/r"
    ]
   }
  ],
  "metadata": {
   "kernelspec": {
    "display_name": "Python 3 (ipykernel)",
    "language": "python",
    "name": "python3"
   },
   "language_info": {
    "codemirror_mode": {
     "name": "ipython",
     "version": 3
    },
    "file_extension": ".py",
    "mimetype": "text/x-python",
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
    "version": "3.9.12"
   }
  },
  "nbformat": 4,
  "nbformat_minor": 5
 }

5

cookbook/forward_looking_retrieval_augmented_generation.ipynb

View File

@@ -73,8 +73,9 @@
     "    AsyncCallbackManagerForRetrieverRun,\n",
     "    CallbackManagerForRetrieverRun,\n",
     ")\n",
     "from langchain.schema import BaseRetriever, Document\n",
     "from langchain_community.utilities import GoogleSerperAPIWrapper\n",
     "from langchain_core.documents import Document\n",
     "from langchain_core.retrievers import BaseRetriever\n",
     "from langchain_openai import ChatOpenAI, OpenAI"
    ]
   },
@@ -361,7 +362,7 @@
    ],
    "source": [
     "llm = OpenAI()\n",
     "llm(query)"
     "llm.invoke(query)"
    ]
   },
   {

2

cookbook/gymnasium_agent_simulation.ipynb

View File

@@ -108,7 +108,7 @@
     "        return obs_message\n",
     "\n",
     "    def _act(self):\n",
     "        act_message = self.model(self.message_history)\n",
     "        act_message = self.model.invoke(self.message_history)\n",
     "        self.message_history.append(act_message)\n",
     "        action = int(self.action_parser.parse(act_message.content)[\"action\"])\n",
     "        return action\n",

4

cookbook/hypothetical_document_embeddings.ipynb

View File

@@ -170,8 +170,8 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain.text_splitter import CharacterTextSplitter\n",
     "from langchain_community.vectorstores import Chroma\n",
     "from langchain_chroma import Chroma\n",
     "from langchain_text_splitters import CharacterTextSplitter\n",
     "\n",
     "with open(\"../../state_of_the_union.txt\") as f:\n",
     "    state_of_the_union = f.read()\n",

603

cookbook/img-to_img-search_CLIP_ChromaDB.ipynb Normal file

View File

File diff suppressed because one or more lines are too long

485

cookbook/langgraph_agentic_rag.ipynb Normal file

View File

File diff suppressed because one or more lines are too long

528

cookbook/langgraph_crag.ipynb Normal file

View File

File diff suppressed because one or more lines are too long

665

cookbook/langgraph_self_rag.ipynb Normal file

View File

File diff suppressed because one or more lines are too long

8

cookbook/llm_bash.ipynb

View File

@@ -52,7 +52,7 @@
     "\n",
     "bash_chain = LLMBashChain.from_llm(llm, verbose=True)\n",
     "\n",
     "bash_chain.run(text)"
     "bash_chain.invoke(text)"
    ]
   },
   {
@@ -135,7 +135,7 @@
     "\n",
     "text = \"Please write a bash script that prints 'Hello World' to the console.\"\n",
     "\n",
     "bash_chain.run(text)"
     "bash_chain.invoke(text)"
    ]
   },
   {
@@ -190,7 +190,7 @@
     "\n",
     "text = \"List the current directory then move up a level.\"\n",
     "\n",
     "bash_chain.run(text)"
     "bash_chain.invoke(text)"
    ]
   },
   {
@@ -231,7 +231,7 @@
    ],
    "source": [
     "# Run the same command again and see that the state is maintained between calls\n",
     "bash_chain.run(text)"
     "bash_chain.invoke(text)"
    ]
   }
  ],

2

cookbook/llm_checker.ipynb

View File

@@ -50,7 +50,7 @@
     "\n",
     "checker_chain = LLMCheckerChain.from_llm(llm, verbose=True)\n",
     "\n",
     "checker_chain.run(text)"
     "checker_chain.invoke(text)"
    ]
   },
   {

2

cookbook/llm_math.ipynb

View File

@@ -51,7 +51,7 @@
     "llm = OpenAI(temperature=0)\n",
     "llm_math = LLMMathChain.from_llm(llm, verbose=True)\n",
     "\n",
     "llm_math.run(\"What is 13 raised to the .3432 power?\")"
     "llm_math.invoke(\"What is 13 raised to the .3432 power?\")"
    ]
   },
   {

10

cookbook/llm_symbolic_math.ipynb

View File

@@ -45,7 +45,7 @@
     }
    ],
    "source": [
     "llm_symbolic_math.run(\"What is the derivative of sin(x)*exp(x) with respect to x?\")"
     "llm_symbolic_math.invoke(\"What is the derivative of sin(x)*exp(x) with respect to x?\")"
    ]
   },
   {
@@ -65,7 +65,7 @@
     }
    ],
    "source": [
     "llm_symbolic_math.run(\n",
     "llm_symbolic_math.invoke(\n",
     "    \"What is the integral of exp(x)*sin(x) + exp(x)*cos(x) with respect to x?\"\n",
     ")"
    ]
@@ -94,7 +94,7 @@
     }
    ],
    "source": [
     "llm_symbolic_math.run('Solve the differential equation y\" - y = e^t')"
     "llm_symbolic_math.invoke('Solve the differential equation y\" - y = e^t')"
    ]
   },
   {
@@ -114,7 +114,7 @@
     }
    ],
    "source": [
     "llm_symbolic_math.run(\"What are the solutions to this equation y^3 + 1/3y?\")"
     "llm_symbolic_math.invoke(\"What are the solutions to this equation y^3 + 1/3y?\")"
    ]
   },
   {
@@ -134,7 +134,7 @@
     }
    ],
    "source": [
     "llm_symbolic_math.run(\"x = y + 5, y = z - 3, z = x * y. Solve for x, y, z\")"
     "llm_symbolic_math.invoke(\"x = y + 5, y = z - 3, z = x * y. Solve for x, y, z\")"
    ]
   }
  ],

818

cookbook/mongodb-langchain-cache-memory.ipynb Normal file

View File

@@ -0,0 +1,818 @@
 {
  "cells": [
   {
    "cell_type": "markdown",
    "id": "70b333e6",
    "metadata": {},
    "source": [
     "[![View Article](https://img.shields.io/badge/View%20Article-blue)](https://www.mongodb.com/developer/products/atlas/advanced-rag-langchain-mongodb/)\n"
    ]
   },
   {
    "cell_type": "markdown",
    "id": "d84a72ea",
    "metadata": {},
    "source": [
     "# Adding Semantic Caching and Memory to your RAG Application using MongoDB and LangChain\n",
     "\n",
     "In this notebook, we will see how to use the new MongoDBCache and MongoDBChatMessageHistory in your RAG application.\n"
    ]
   },
   {
    "cell_type": "markdown",
    "id": "65527202",
    "metadata": {},
    "source": [
     "## Step 1: Install required libraries\n",
     "\n",
     "- **datasets**: Python library to get access to datasets available on Hugging Face Hub\n",
     "\n",
     "- **langchain**: Python toolkit for LangChain\n",
     "\n",
     "- **langchain-mongodb**: Python package to use MongoDB as a vector store, semantic cache, chat history store etc. in LangChain\n",
     "\n",
     "- **langchain-openai**: Python package to use OpenAI models with LangChain\n",
     "\n",
     "- **pymongo**: Python toolkit for MongoDB\n",
     "\n",
     "- **pandas**: Python library for data analysis, exploration, and manipulation"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 1,
    "id": "cbc22fa4",
    "metadata": {},
    "outputs": [],
    "source": [
     "! pip install -qU datasets langchain langchain-mongodb langchain-openai pymongo pandas"
    ]
   },
   {
    "cell_type": "markdown",
    "id": "39c41e87",
    "metadata": {},
    "source": [
     "## Step 2: Setup pre-requisites\n",
     "\n",
     "* Set the MongoDB connection string. Follow the steps [here](https://www.mongodb.com/docs/manual/reference/connection-string/) to get the connection string from the Atlas UI.\n",
     "\n",
     "* Set the OpenAI API key. Steps to obtain an API key as [here](https://help.openai.com/en/articles/4936850-where-do-i-find-my-openai-api-key)"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 2,
    "id": "b56412ae",
    "metadata": {},
    "outputs": [],
    "source": [
     "import getpass"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 3,
    "id": "16a20d7a",
    "metadata": {},
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "Enter your MongoDB connection string:········\n"
      ]
     }
    ],
    "source": [
     "MONGODB_URI = getpass.getpass(\"Enter your MongoDB connection string:\")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 4,
    "id": "978682d4",
    "metadata": {},
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "Enter your OpenAI API key:········\n"
      ]
     }
    ],
    "source": [
     "OPENAI_API_KEY = getpass.getpass(\"Enter your OpenAI API key:\")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 5,
    "id": "606081c5",
    "metadata": {},
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "········\n"
      ]
     }
    ],
    "source": [
     "# Optional-- If you want to enable Langsmith -- good for debugging\n",
     "import os\n",
     "\n",
     "os.environ[\"LANGCHAIN_TRACING_V2\"] = \"true\"\n",
     "os.environ[\"LANGCHAIN_API_KEY\"] = getpass.getpass()"
    ]
   },
   {
    "cell_type": "markdown",
    "id": "f6b8302c",
    "metadata": {},
    "source": [
     "## Step 3: Download the dataset\n",
     "\n",
     "We will be using MongoDB's [embedded_movies](https://huggingface.co/datasets/MongoDB/embedded_movies) dataset"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 6,
    "id": "1a3433a6",
    "metadata": {},
    "outputs": [],
    "source": [
     "import pandas as pd\n",
     "from datasets import load_dataset"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": null,
    "id": "aee5311b",
    "metadata": {},
    "outputs": [],
    "source": [
     "# Ensure you have an HF_TOKEN in your development enviornment:\n",
     "# access tokens can be created or copied from the Hugging Face platform (https://huggingface.co/docs/hub/en/security-tokens)\n",
     "\n",
     "# Load MongoDB's embedded_movies dataset from Hugging Face\n",
     "# https://huggingface.co/datasets/MongoDB/airbnb_embeddings\n",
     "\n",
     "data = load_dataset(\"MongoDB/embedded_movies\")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 8,
    "id": "1d630a26",
    "metadata": {},
    "outputs": [],
    "source": [
     "df = pd.DataFrame(data[\"train\"])"
    ]
   },
   {
    "cell_type": "markdown",
    "id": "a1f94f43",
    "metadata": {},
    "source": [
     "## Step 4: Data analysis\n",
     "\n",
     "Make sure length of the dataset is what we expect, drop Nones etc."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 10,
    "id": "b276df71",
    "metadata": {},
    "outputs": [
     {
      "data": {
       "text/html": [
        "<div>\n",
        "<style scoped>\n",
        "    .dataframe tbody tr th:only-of-type {\n",
        "        vertical-align: middle;\n",
        "    }\n",
        "\n",
        "    .dataframe tbody tr th {\n",
        "        vertical-align: top;\n",
        "    }\n",
        "\n",
        "    .dataframe thead th {\n",
        "        text-align: right;\n",
        "    }\n",
        "</style>\n",
        "<table border=\"1\" class=\"dataframe\">\n",
        "  <thead>\n",
        "    <tr style=\"text-align: right;\">\n",
        "      <th></th>\n",
        "      <th>fullplot</th>\n",
        "      <th>type</th>\n",
        "      <th>plot_embedding</th>\n",
        "      <th>num_mflix_comments</th>\n",
        "      <th>runtime</th>\n",
        "      <th>writers</th>\n",
        "      <th>imdb</th>\n",
        "      <th>countries</th>\n",
        "      <th>rated</th>\n",
        "      <th>plot</th>\n",
        "      <th>title</th>\n",
        "      <th>languages</th>\n",
        "      <th>metacritic</th>\n",
        "      <th>directors</th>\n",
        "      <th>awards</th>\n",
        "      <th>genres</th>\n",
        "      <th>poster</th>\n",
        "      <th>cast</th>\n",
        "    </tr>\n",
        "  </thead>\n",
        "  <tbody>\n",
        "    <tr>\n",
        "      <th>0</th>\n",
        "      <td>Young Pauline is left a lot of money when her ...</td>\n",
        "      <td>movie</td>\n",
        "      <td>[0.00072939653, -0.026834568, 0.013515796, -0....</td>\n",
        "      <td>0</td>\n",
        "      <td>199.0</td>\n",
        "      <td>[Charles W. Goddard (screenplay), Basil Dickey...</td>\n",
        "      <td>{'id': 4465, 'rating': 7.6, 'votes': 744}</td>\n",
        "      <td>[USA]</td>\n",
        "      <td>None</td>\n",
        "      <td>Young Pauline is left a lot of money when her ...</td>\n",
        "      <td>The Perils of Pauline</td>\n",
        "      <td>[English]</td>\n",
        "      <td>NaN</td>\n",
        "      <td>[Louis J. Gasnier, Donald MacKenzie]</td>\n",
        "      <td>{'nominations': 0, 'text': '1 win.', 'wins': 1}</td>\n",
        "      <td>[Action]</td>\n",
        "      <td>https://m.media-amazon.com/images/M/MV5BMzgxOD...</td>\n",
        "      <td>[Pearl White, Crane Wilbur, Paul Panzer, Edwar...</td>\n",
        "    </tr>\n",
        "  </tbody>\n",
        "</table>\n",
        "</div>"
       ],
       "text/plain": [
        "                                            fullplot   type  \\\n",
        "0  Young Pauline is left a lot of money when her ...  movie   \n",
        "\n",
        "                                      plot_embedding  num_mflix_comments  \\\n",
        "0  [0.00072939653, -0.026834568, 0.013515796, -0....                   0   \n",
        "\n",
        "   runtime                                            writers  \\\n",
        "0    199.0  [Charles W. Goddard (screenplay), Basil Dickey...   \n",
        "\n",
        "                                        imdb countries rated  \\\n",
        "0  {'id': 4465, 'rating': 7.6, 'votes': 744}     [USA]  None   \n",
        "\n",
        "                                                plot                  title  \\\n",
        "0  Young Pauline is left a lot of money when her ...  The Perils of Pauline   \n",
        "\n",
        "   languages  metacritic                             directors  \\\n",
        "0  [English]         NaN  [Louis J. Gasnier, Donald MacKenzie]   \n",
        "\n",
        "                                            awards    genres  \\\n",
        "0  {'nominations': 0, 'text': '1 win.', 'wins': 1}  [Action]   \n",
        "\n",
        "                                              poster  \\\n",
        "0  https://m.media-amazon.com/images/M/MV5BMzgxOD...   \n",
        "\n",
        "                                                cast  \n",
        "0  [Pearl White, Crane Wilbur, Paul Panzer, Edwar...  "
       ]
      },
      "execution_count": 10,
      "metadata": {},
      "output_type": "execute_result"
     }
    ],
    "source": [
     "# Previewing the contents of the data\n",
     "df.head(1)"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 11,
    "id": "22ab375d",
    "metadata": {},
    "outputs": [],
    "source": [
     "# Only keep records where the fullplot field is not null\n",
     "df = df[df[\"fullplot\"].notna()]"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 12,
    "id": "fceed99a",
    "metadata": {},
    "outputs": [],
    "source": [
     "# Renaming the embedding field to \"embedding\" -- required by LangChain\n",
     "df.rename(columns={\"plot_embedding\": \"embedding\"}, inplace=True)"
    ]
   },
   {
    "cell_type": "markdown",
    "id": "aedec13a",
    "metadata": {},
    "source": [
     "## Step 5: Create a simple RAG chain using MongoDB as the vector store"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 13,
    "id": "11d292f3",
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain_mongodb import MongoDBAtlasVectorSearch\n",
     "from pymongo import MongoClient\n",
     "\n",
     "# Initialize MongoDB python client\n",
     "client = MongoClient(MONGODB_URI, appname=\"devrel.content.python\")\n",
     "\n",
     "DB_NAME = \"langchain_chatbot\"\n",
     "COLLECTION_NAME = \"data\"\n",
     "ATLAS_VECTOR_SEARCH_INDEX_NAME = \"vector_index\"\n",
     "collection = client[DB_NAME][COLLECTION_NAME]"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 14,
    "id": "d8292d53",
    "metadata": {},
    "outputs": [
     {
      "data": {
       "text/plain": [
        "DeleteResult({'n': 1000, 'electionId': ObjectId('7fffffff00000000000000f6'), 'opTime': {'ts': Timestamp(1710523288, 1033), 't': 246}, 'ok': 1.0, '$clusterTime': {'clusterTime': Timestamp(1710523288, 1042), 'signature': {'hash': b\"i\\xa8\\xe9'\\x1ed\\xf2u\\xf3L\\xff\\xb1\\xf5\\xbfA\\x90\\xabJ\\x12\\x83\", 'keyId': 7299545392000008318}}, 'operationTime': Timestamp(1710523288, 1033)}, acknowledged=True)"
       ]
      },
      "execution_count": 14,
      "metadata": {},
      "output_type": "execute_result"
     }
    ],
    "source": [
     "# Delete any existing records in the collection\n",
     "collection.delete_many({})"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 16,
    "id": "36c68914",
    "metadata": {},
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "Data ingestion into MongoDB completed\n"
      ]
     }
    ],
    "source": [
     "# Data Ingestion\n",
     "records = df.to_dict(\"records\")\n",
     "collection.insert_many(records)\n",
     "\n",
     "print(\"Data ingestion into MongoDB completed\")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 18,
    "id": "cbfca0b8",
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain_openai import OpenAIEmbeddings\n",
     "\n",
     "# Using the text-embedding-ada-002 since that's what was used to create embeddings in the movies dataset\n",
     "embeddings = OpenAIEmbeddings(\n",
     "    openai_api_key=OPENAI_API_KEY, model=\"text-embedding-ada-002\"\n",
     ")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 19,
    "id": "798e176c",
    "metadata": {},
    "outputs": [],
    "source": [
     "# Vector Store Creation\n",
     "vector_store = MongoDBAtlasVectorSearch.from_connection_string(\n",
     "    connection_string=MONGODB_URI,\n",
     "    namespace=DB_NAME + \".\" + COLLECTION_NAME,\n",
     "    embedding=embeddings,\n",
     "    index_name=ATLAS_VECTOR_SEARCH_INDEX_NAME,\n",
     "    text_key=\"fullplot\",\n",
     ")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 49,
    "id": "c71cd087",
    "metadata": {},
    "outputs": [],
    "source": [
     "# Using the MongoDB vector store as a retriever in a RAG chain\n",
     "retriever = vector_store.as_retriever(search_type=\"similarity\", search_kwargs={\"k\": 5})"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 25,
    "id": "b6588cd3",
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain_core.output_parsers import StrOutputParser\n",
     "from langchain_core.prompts import ChatPromptTemplate\n",
     "from langchain_core.runnables import RunnablePassthrough\n",
     "from langchain_openai import ChatOpenAI\n",
     "\n",
     "# Generate context using the retriever, and pass the user question through\n",
     "retrieve = {\n",
     "    \"context\": retriever | (lambda docs: \"\\n\\n\".join([d.page_content for d in docs])),\n",
     "    \"question\": RunnablePassthrough(),\n",
     "}\n",
     "template = \"\"\"Answer the question based only on the following context: \\\n",
     "{context}\n",
     "\n",
     "Question: {question}\n",
     "\"\"\"\n",
     "# Defining the chat prompt\n",
     "prompt = ChatPromptTemplate.from_template(template)\n",
     "# Defining the model to be used for chat completion\n",
     "model = ChatOpenAI(temperature=0, openai_api_key=OPENAI_API_KEY)\n",
     "# Parse output as a string\n",
     "parse_output = StrOutputParser()\n",
     "\n",
     "# Naive RAG chain\n",
     "naive_rag_chain = retrieve | prompt | model | parse_output"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 26,
    "id": "aaae21f5",
    "metadata": {},
    "outputs": [
     {
      "data": {
       "text/plain": [
        "'Once a Thief'"
       ]
      },
      "execution_count": 26,
      "metadata": {},
      "output_type": "execute_result"
     }
    ],
    "source": [
     "naive_rag_chain.invoke(\"What is the best movie to watch when sad?\")"
    ]
   },
   {
    "cell_type": "markdown",
    "id": "75f929ef",
    "metadata": {},
    "source": [
     "## Step 6: Create a RAG chain with chat history"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 27,
    "id": "94e7bd4a",
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain_core.prompts import MessagesPlaceholder\n",
     "from langchain_core.runnables.history import RunnableWithMessageHistory\n",
     "from langchain_mongodb.chat_message_histories import MongoDBChatMessageHistory"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 29,
    "id": "5bb30860",
    "metadata": {},
    "outputs": [],
    "source": [
     "def get_session_history(session_id: str) -> MongoDBChatMessageHistory:\n",
     "    return MongoDBChatMessageHistory(\n",
     "        MONGODB_URI, session_id, database_name=DB_NAME, collection_name=\"history\"\n",
     "    )"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 50,
    "id": "f51d0f35",
    "metadata": {},
    "outputs": [],
    "source": [
     "# Given a follow-up question and history, create a standalone question\n",
     "standalone_system_prompt = \"\"\"\n",
     "Given a chat history and a follow-up question, rephrase the follow-up question to be a standalone question. \\\n",
     "Do NOT answer the question, just reformulate it if needed, otherwise return it as is. \\\n",
     "Only return the final standalone question. \\\n",
     "\"\"\"\n",
     "standalone_question_prompt = ChatPromptTemplate.from_messages(\n",
     "    [\n",
     "        (\"system\", standalone_system_prompt),\n",
     "        MessagesPlaceholder(variable_name=\"history\"),\n",
     "        (\"human\", \"{question}\"),\n",
     "    ]\n",
     ")\n",
     "\n",
     "question_chain = standalone_question_prompt | model | parse_output"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 51,
    "id": "f3ef3354",
    "metadata": {},
    "outputs": [],
    "source": [
     "# Generate context by passing output of the question_chain i.e. the standalone question to the retriever\n",
     "retriever_chain = RunnablePassthrough.assign(\n",
     "    context=question_chain\n",
     "    | retriever\n",
     "    | (lambda docs: \"\\n\\n\".join([d.page_content for d in docs]))\n",
     ")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 55,
    "id": "5afb7345",
    "metadata": {},
    "outputs": [],
    "source": [
     "# Create a prompt that includes the context, history and the follow-up question\n",
     "rag_system_prompt = \"\"\"Answer the question based only on the following context: \\\n",
     "{context}\n",
     "\"\"\"\n",
     "rag_prompt = ChatPromptTemplate.from_messages(\n",
     "    [\n",
     "        (\"system\", rag_system_prompt),\n",
     "        MessagesPlaceholder(variable_name=\"history\"),\n",
     "        (\"human\", \"{question}\"),\n",
     "    ]\n",
     ")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 56,
    "id": "f95f47d0",
    "metadata": {},
    "outputs": [],
    "source": [
     "# RAG chain\n",
     "rag_chain = retriever_chain | rag_prompt | model | parse_output"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 57,
    "id": "9618d395",
    "metadata": {},
    "outputs": [
     {
      "data": {
       "text/plain": [
        "'The best movie to watch when feeling down could be \"Last Action Hero.\" It\\'s a fun and action-packed film that blends reality and fantasy, offering an escape from the real world and providing an entertaining distraction.'"
       ]
      },
      "execution_count": 57,
      "metadata": {},
      "output_type": "execute_result"
     }
    ],
    "source": [
     "# RAG chain with history\n",
     "with_message_history = RunnableWithMessageHistory(\n",
     "    rag_chain,\n",
     "    get_session_history,\n",
     "    input_messages_key=\"question\",\n",
     "    history_messages_key=\"history\",\n",
     ")\n",
     "with_message_history.invoke(\n",
     "    {\"question\": \"What is the best movie to watch when sad?\"},\n",
     "    {\"configurable\": {\"session_id\": \"1\"}},\n",
     ")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 58,
    "id": "6e3080d1",
    "metadata": {},
    "outputs": [
     {
      "data": {
       "text/plain": [
        "'I apologize for the confusion. Another movie that might lift your spirits when you\\'re feeling sad is \"Smilla\\'s Sense of Snow.\" It\\'s a mystery thriller that could engage your mind and distract you from your sadness with its intriguing plot and suspenseful storyline.'"
       ]
      },
      "execution_count": 58,
      "metadata": {},
      "output_type": "execute_result"
     }
    ],
    "source": [
     "with_message_history.invoke(\n",
     "    {\n",
     "        \"question\": \"Hmmm..I don't want to watch that one. Can you suggest something else?\"\n",
     "    },\n",
     "    {\"configurable\": {\"session_id\": \"1\"}},\n",
     ")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 59,
    "id": "daea2953",
    "metadata": {},
    "outputs": [
     {
      "data": {
       "text/plain": [
        "'For a lighter movie option, you might enjoy \"Cousins.\" It\\'s a comedy film set in Barcelona with action and humor, offering a fun and entertaining escape from reality. The storyline is engaging and filled with comedic moments that could help lift your spirits.'"
       ]
      },
      "execution_count": 59,
      "metadata": {},
      "output_type": "execute_result"
     }
    ],
    "source": [
     "with_message_history.invoke(\n",
     "    {\"question\": \"How about something more light?\"},\n",
     "    {\"configurable\": {\"session_id\": \"1\"}},\n",
     ")"
    ]
   },
   {
    "cell_type": "markdown",
    "id": "0de23a88",
    "metadata": {},
    "source": [
     "## Step 7: Get faster responses using Semantic Cache\n",
     "\n",
     "**NOTE:** Semantic cache only caches the input to the LLM. When using it in retrieval chains, remember that documents retrieved can change between runs resulting in cache misses for semantically similar queries."
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 61,
    "id": "5d6b6741",
    "metadata": {},
    "outputs": [],
    "source": [
     "from langchain_core.globals import set_llm_cache\n",
     "from langchain_mongodb.cache import MongoDBAtlasSemanticCache\n",
     "\n",
     "set_llm_cache(\n",
     "    MongoDBAtlasSemanticCache(\n",
     "        connection_string=MONGODB_URI,\n",
     "        embedding=embeddings,\n",
     "        collection_name=\"semantic_cache\",\n",
     "        database_name=DB_NAME,\n",
     "        index_name=ATLAS_VECTOR_SEARCH_INDEX_NAME,\n",
     "        wait_until_ready=True,  # Optional, waits until the cache is ready to be used\n",
     "    )\n",
     ")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 62,
    "id": "9825bc7b",
    "metadata": {},
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "CPU times: user 87.8 ms, sys: 670 µs, total: 88.5 ms\n",
       "Wall time: 1.24 s\n"
      ]
     },
     {
      "data": {
       "text/plain": [
        "'Once a Thief'"
       ]
      },
      "execution_count": 62,
      "metadata": {},
      "output_type": "execute_result"
     }
    ],
    "source": [
     "%%time\n",
     "naive_rag_chain.invoke(\"What is the best movie to watch when sad?\")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 63,
    "id": "a5e518cf",
    "metadata": {},
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "CPU times: user 43.5 ms, sys: 4.16 ms, total: 47.7 ms\n",
       "Wall time: 255 ms\n"
      ]
     },
     {
      "data": {
       "text/plain": [
        "'Once a Thief'"
       ]
      },
      "execution_count": 63,
      "metadata": {},
      "output_type": "execute_result"
     }
    ],
    "source": [
     "%%time\n",
     "naive_rag_chain.invoke(\"What is the best movie to watch when sad?\")"
    ]
   },
   {
    "cell_type": "code",
    "execution_count": 64,
    "id": "3d3d3ad3",
    "metadata": {},
    "outputs": [
     {
      "name": "stdout",
      "output_type": "stream",
      "text": [
       "CPU times: user 115 ms, sys: 171 µs, total: 115 ms\n",
       "Wall time: 1.38 s\n"
      ]
     },
     {
      "data": {
       "text/plain": [
        "'I would recommend watching \"Last Action Hero\" when sad, as it is a fun and action-packed film that can help lift your spirits.'"
       ]
      },
      "execution_count": 64,
      "metadata": {},
      "output_type": "execute_result"
     }
    ],
    "source": [
     "%%time\n",
     "naive_rag_chain.invoke(\"Which movie do I watch when sad?\")"
    ]
   }
  ],
  "metadata": {
   "kernelspec": {
    "display_name": "conda_pytorch_p310",
    "language": "python",
    "name": "conda_pytorch_p310"
   },
   "language_info": {
    "codemirror_mode": {
     "name": "ipython",
     "version": 3
    },
    "file_extension": ".py",
    "mimetype": "text/x-python",
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
    "version": "3.10.13"
   }
  },
  "nbformat": 4,
  "nbformat_minor": 5
 }

6

cookbook/multi_modal_RAG_chroma.ipynb

View File

@@ -58,7 +58,7 @@
    "metadata": {},
    "outputs": [],
    "source": [
     "! pip install -U langchain openai chromadb langchain-experimental # (newest versions required for multi-modal)"
     "! pip install -U langchain openai langchain-chroma langchain-experimental # (newest versions required for multi-modal)"
    ]
   },
   {
@@ -187,7 +187,7 @@
     "\n",
     "import chromadb\n",
     "import numpy as np\n",
     "from langchain_community.vectorstores import Chroma\n",
     "from langchain_chroma import Chroma\n",
     "from langchain_experimental.open_clip import OpenCLIPEmbeddings\n",
     "from PIL import Image as _PILImage\n",
     "\n",
@@ -435,7 +435,7 @@
     "    display(HTML(image_html))\n",
     "\n",
     "\n",
     "docs = retriever.get_relevant_documents(\"Woman with children\", k=10)\n",
     "docs = retriever.invoke(\"Woman with children\", k=10)\n",
     "for doc in docs:\n",
     "    if is_base64(doc.page_content):\n",
     "        plt_img_base64(doc.page_content)\n",

517

cookbook/multi_modal_RAG_vdms.ipynb Normal file

View File

File diff suppressed because one or more lines are too long

2

cookbook/multi_player_dnd.ipynb

View File

@@ -74,7 +74,7 @@
     "        Applies the chatmodel to the message history\n",
     "        and returns the message string\n",
     "        \"\"\"\n",
     "        message = self.model(\n",
     "        message = self.model.invoke(\n",
     "            [\n",
     "                self.system_message,\n",
     "                HumanMessage(content=\"\\n\".join(self.message_history + [self.prefix])),\n",

8

cookbook/multiagent_authoritarian.ipynb

View File

@@ -79,7 +79,7 @@
     "        Applies the chatmodel to the message history\n",
     "        and returns the message string\n",
     "        \"\"\"\n",
     "        message = self.model(\n",
     "        message = self.model.invoke(\n",
     "            [\n",
     "                self.system_message,\n",
     "                HumanMessage(content=\"\\n\".join(self.message_history + [self.prefix])),\n",
@@ -234,7 +234,7 @@
     "            termination_clause=self.termination_clause if self.stop else \"\",\n",
     "        )\n",
     "\n",
     "        self.response = self.model(\n",
     "        self.response = self.model.invoke(\n",
     "            [\n",
     "                self.system_message,\n",
     "                HumanMessage(content=response_prompt),\n",
@@ -263,7 +263,7 @@
     "            speaker_names=speaker_names,\n",
     "        )\n",
     "\n",
     "        choice_string = self.model(\n",
     "        choice_string = self.model.invoke(\n",
     "            [\n",
     "                self.system_message,\n",
     "                HumanMessage(content=choice_prompt),\n",
@@ -299,7 +299,7 @@
     "                ),\n",
     "                next_speaker=self.next_speaker,\n",
     "            )\n",
     "            message = self.model(\n",
     "            message = self.model.invoke(\n",
     "                [\n",
     "                    self.system_message,\n",
     "                    HumanMessage(content=next_prompt),\n",

4

cookbook/multiagent_bidding.ipynb

View File

@@ -71,7 +71,7 @@
     "        Applies the chatmodel to the message history\n",
     "        and returns the message string\n",
     "        \"\"\"\n",
     "        message = self.model(\n",
     "        message = self.model.invoke(\n",
     "            [\n",
     "                self.system_message,\n",
     "                HumanMessage(content=\"\\n\".join(self.message_history + [self.prefix])),\n",
@@ -164,7 +164,7 @@
     "            message_history=\"\\n\".join(self.message_history),\n",
     "            recent_message=self.message_history[-1],\n",
     "        )\n",
     "        bid_string = self.model([SystemMessage(content=prompt)]).content\n",
     "        bid_string = self.model.invoke([SystemMessage(content=prompt)]).content\n",
     "        return bid_string"
    ]
   },

Compare commits

4146 Commits erick/goog ... wfh/more_i

2 .devcontainer/README.md Unescape Escape View File

2 .devcontainer/devcontainer.json Unescape Escape View File

8 .devcontainer/docker-compose.yaml Unescape Escape View File

41 .github/CONTRIBUTING.md vendored Unescape Escape View File

38 .github/DISCUSSION_TEMPLATE/ideas.yml vendored Normal file Unescape Escape View File

122 .github/DISCUSSION_TEMPLATE/q-a.yml vendored Normal file Unescape Escape View File

182 .github/ISSUE_TEMPLATE/bug-report.yml vendored Unescape Escape View File

13 .github/ISSUE_TEMPLATE/config.yml vendored Unescape Escape View File

45 .github/ISSUE_TEMPLATE/documentation.yml vendored Unescape Escape View File

30 .github/ISSUE_TEMPLATE/feature-request.yml vendored Unescape Escape View File

18 .github/ISSUE_TEMPLATE/other.yml vendored Unescape Escape View File

25 .github/ISSUE_TEMPLATE/privileged.yml vendored Normal file Unescape Escape View File

33 .github/PULL_REQUEST_TEMPLATE.md vendored Unescape Escape View File

7 .github/actions/people/Dockerfile vendored Normal file Unescape Escape View File

11 .github/actions/people/action.yml vendored Normal file Unescape Escape View File

646 .github/actions/people/app/main.py vendored Normal file Unescape Escape View File

8 .github/actions/poetry_setup/action.yml vendored Unescape Escape View File

243 .github/scripts/check_diff.py vendored Unescape Escape View File

35 .github/scripts/check_prerelease_dependencies.py vendored Normal file Unescape Escape View File

91 .github/scripts/get_min_versions.py vendored Normal file Unescape Escape View File

7 .github/workflows/.codespell-exclude vendored Normal file Unescape Escape View File

106 .github/workflows/_all_ci.yml vendored Unescape Escape View File

19 .github/workflows/_compile_integration_test.yml vendored Unescape Escape View File

23 .github/workflows/_dependencies.yml vendored Unescape Escape View File

51 .github/workflows/_integration_test.yml vendored Unescape Escape View File

39 .github/workflows/_lint.yml vendored Unescape Escape View File

171 .github/workflows/_release.yml vendored Unescape Escape View File

38 .github/workflows/_test.yml vendored Unescape Escape View File

51 .github/workflows/_test_doc_imports.yml vendored Normal file Unescape Escape View File

13 .github/workflows/_test_release.yml vendored Unescape Escape View File

25 .github/workflows/check-broken-links.yml vendored Normal file Unescape Escape View File

137 .github/workflows/check_diffs.yml vendored Unescape Escape View File

36 .github/workflows/check_new_docs.yml vendored Normal file Unescape Escape View File

19 .github/workflows/codespell.yml vendored Unescape Escape View File

35 .github/workflows/doc_lint.yml vendored Unescape Escape View File

13 .github/workflows/langchain_cli_release.yml vendored Unescape Escape View File

13 .github/workflows/langchain_community_release.yml vendored Unescape Escape View File

13 .github/workflows/langchain_core_release.yml vendored Unescape Escape View File

13 .github/workflows/langchain_experimental_release.yml vendored Unescape Escape View File

13 .github/workflows/langchain_experimental_test_release.yml vendored Unescape Escape View File

13 .github/workflows/langchain_openai_release.yml vendored Unescape Escape View File

27 .github/workflows/langchain_release.yml vendored Unescape Escape View File

13 .github/workflows/langchain_test_release.yml vendored Unescape Escape View File

37 .github/workflows/people.yml vendored Normal file Unescape Escape View File

76 .github/workflows/scheduled_test.yml vendored Unescape Escape View File

36 .github/workflows/templates_ci.yml vendored Unescape Escape View File

14 .gitignore vendored Unescape Escape View File

14 .readthedocs.yaml Unescape Escape View File

2 MIGRATE.md Unescape Escape View File

69 Makefile Unescape Escape View File

111 README.md Unescape Escape View File

61 SECURITY.md Unescape Escape View File

932 cookbook/Gemma_LangChain.ipynb Normal file Unescape Escape View File

4 cookbook/LLaMA2_sql_chat.ipynb Unescape Escape View File

14 cookbook/Multi_modal_RAG.ipynb Unescape Escape View File

20 cookbook/Multi_modal_RAG_google.ipynb Unescape Escape View File

747 cookbook/RAPTOR.ipynb Normal file View File

6 cookbook/README.md Unescape Escape View File

6 cookbook/Semi_Structured_RAG.ipynb Unescape Escape View File

12 cookbook/Semi_structured_and_multi_modal_RAG.ipynb Unescape Escape View File

14 cookbook/Semi_structured_multi_modal_RAG_LLaMA2.ipynb Unescape Escape View File

14 cookbook/advanced_rag_eval.ipynb Unescape Escape View File

4 cookbook/agent_vectorstore.ipynb Unescape Escape View File

200 cookbook/airbyte_github.ipynb Normal file Unescape Escape View File

284 cookbook/amazon_personalize_how_to.ipynb Normal file Unescape Escape View File

584 cookbook/anthropic_structured_outputs.ipynb Normal file View File

922 cookbook/apache_kafka_message_handling.ipynb Normal file Unescape Escape View File

10 cookbook/autogpt/marathon_times.ipynb Unescape Escape View File

826 cookbook/azure_container_apps_dynamic_sessions_data_analyst.ipynb Normal file View File

2 cookbook/camel_role_playing.ipynb Unescape Escape View File

10 cookbook/code-analysis-deeplake.ipynb Unescape Escape View File

557 cookbook/cql_agent.ipynb Normal file Unescape Escape View File

6 cookbook/custom_agent_with_plugin_retrieval.ipynb Unescape Escape View File

6 cookbook/custom_agent_with_plugin_retrieval_using_plugnplai.ipynb Unescape Escape View File

6 cookbook/custom_agent_with_tool_retrieval.ipynb Unescape Escape View File

2 cookbook/custom_multi_action_agent.ipynb Unescape Escape View File

1001 cookbook/data/imdb_top_1000.csv Normal file View File

2 cookbook/databricks_sql_db.ipynb Unescape Escape View File

4146 Commits

erick/goog ... wfh/more_i

2

.devcontainer/README.md

View File

2

.devcontainer/devcontainer.json

View File

8

.devcontainer/docker-compose.yaml

View File

41

.github/CONTRIBUTING.md vendored

View File

38

.github/DISCUSSION_TEMPLATE/ideas.yml vendored Normal file

View File

122

.github/DISCUSSION_TEMPLATE/q-a.yml vendored Normal file

View File

182

.github/ISSUE_TEMPLATE/bug-report.yml vendored

View File

13

.github/ISSUE_TEMPLATE/config.yml vendored

View File

45

.github/ISSUE_TEMPLATE/documentation.yml vendored

View File

30

.github/ISSUE_TEMPLATE/feature-request.yml vendored

View File

18

.github/ISSUE_TEMPLATE/other.yml vendored

View File

25

.github/ISSUE_TEMPLATE/privileged.yml vendored Normal file

View File

33

.github/PULL_REQUEST_TEMPLATE.md vendored

View File

7

.github/actions/people/Dockerfile vendored Normal file

View File

11

.github/actions/people/action.yml vendored Normal file

View File

646

.github/actions/people/app/main.py vendored Normal file

View File

8

.github/actions/poetry_setup/action.yml vendored

View File

243

.github/scripts/check_diff.py vendored

View File

35

.github/scripts/check_prerelease_dependencies.py vendored Normal file

View File

91

.github/scripts/get_min_versions.py vendored Normal file

View File

7

.github/workflows/.codespell-exclude vendored Normal file

View File

106

.github/workflows/_all_ci.yml vendored

View File

19

.github/workflows/_compile_integration_test.yml vendored

View File

23

.github/workflows/_dependencies.yml vendored

View File

51

.github/workflows/_integration_test.yml vendored

View File

39

.github/workflows/_lint.yml vendored

View File

171

.github/workflows/_release.yml vendored

View File

38

.github/workflows/_test.yml vendored

View File

51

.github/workflows/_test_doc_imports.yml vendored Normal file

View File

13

.github/workflows/_test_release.yml vendored

View File

25

.github/workflows/check-broken-links.yml vendored Normal file

View File

137

.github/workflows/check_diffs.yml vendored

View File

36

.github/workflows/check_new_docs.yml vendored Normal file

View File

19

.github/workflows/codespell.yml vendored

View File

35

.github/workflows/doc_lint.yml vendored

View File

13

.github/workflows/langchain_cli_release.yml vendored

View File

13

.github/workflows/langchain_community_release.yml vendored

View File

13

.github/workflows/langchain_core_release.yml vendored

View File

13

.github/workflows/langchain_experimental_release.yml vendored

View File

13

.github/workflows/langchain_experimental_test_release.yml vendored

View File

13

.github/workflows/langchain_openai_release.yml vendored

View File

27

.github/workflows/langchain_release.yml vendored

View File

13

.github/workflows/langchain_test_release.yml vendored

View File

37

.github/workflows/people.yml vendored Normal file

View File

76

.github/workflows/scheduled_test.yml vendored

View File

36

.github/workflows/templates_ci.yml vendored

View File

14

.gitignore vendored

View File

14

.readthedocs.yaml

View File

2

MIGRATE.md

View File

69

Makefile

View File

111

README.md

View File

61

SECURITY.md

View File

932

cookbook/Gemma_LangChain.ipynb Normal file

View File

4

cookbook/LLaMA2_sql_chat.ipynb

View File

14

cookbook/Multi_modal_RAG.ipynb

View File

20

cookbook/Multi_modal_RAG_google.ipynb

View File

747

cookbook/RAPTOR.ipynb Normal file

View File

6

cookbook/README.md

View File

6

cookbook/Semi_Structured_RAG.ipynb

View File

12

cookbook/Semi_structured_and_multi_modal_RAG.ipynb

View File

14

cookbook/Semi_structured_multi_modal_RAG_LLaMA2.ipynb

View File

14

cookbook/advanced_rag_eval.ipynb

View File

4

cookbook/agent_vectorstore.ipynb

View File

200

cookbook/airbyte_github.ipynb Normal file

View File

284

cookbook/amazon_personalize_how_to.ipynb Normal file

View File

584

cookbook/anthropic_structured_outputs.ipynb Normal file

View File

922

cookbook/apache_kafka_message_handling.ipynb Normal file

View File

10

cookbook/autogpt/marathon_times.ipynb

View File

826

cookbook/azure_container_apps_dynamic_sessions_data_analyst.ipynb Normal file

View File

2

cookbook/camel_role_playing.ipynb

View File

10

cookbook/code-analysis-deeplake.ipynb

View File

557

cookbook/cql_agent.ipynb Normal file

View File

6

cookbook/custom_agent_with_plugin_retrieval.ipynb

View File

6

cookbook/custom_agent_with_plugin_retrieval_using_plugnplai.ipynb

View File

6

cookbook/custom_agent_with_tool_retrieval.ipynb

View File

2

cookbook/custom_multi_action_agent.ipynb

View File

1001

cookbook/data/imdb_top_1000.csv Normal file

View File

2

cookbook/databricks_sql_db.ipynb

View File

6

cookbook/deeplake_semantic_search_over_chat.ipynb

View File