langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-02-21 14:43:07 +00:00

Author	SHA1	Message	Date
ccurme	732af24313	together: bump langchain-core (#22616 ) langchain-together depends on langchain-openai ^0.1.8 langchain-openai 0.1.8 has langchain-core >= 0.2.2 Here we bump langchain-core to 0.2.2, just to pass minimum dependency version tests.	2024-06-20 13:52:22 -07:00
ccurme	7397b7f20a	together[patch]: Release 0.1.3 (#22615 )	2024-06-20 13:52:22 -07:00
Asi Greenholts	b7c552506c	docs: Fix typo (#22596 ) Fix typo	2024-06-20 13:52:22 -07:00
CharlesCNorton	078cce9292	fix: typo in Agents section of README (#22599 ) Corrected the phrase "complete done" to "completely done" for better grammatical accuracy and clarity in the Agents section of the README. Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-06-20 13:52:22 -07:00
Kirushikesh DB	42286e31fd	docs: Removed unwanted cell in refine segment (#22604 ) Description: There is one unwanted duplicate cell in refine section of summarization documentation, i have removed it.	2024-06-20 13:52:22 -07:00
andyjessen	96da77cb26	docs: Fix typo (#22603 ) This commit changes minor typo in the field description.	2024-06-20 13:52:21 -07:00
Isaac Francisco	e8de5f9178	community[patch]: recursive url loader fix and unit tests (#22521 ) Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-20 13:52:21 -07:00
Jacob Lee	d003350322	docs[minor]: Add "Build a PDF ingestion and Question/Answering system" tutorial (#22570 ) More direct entrypoint for a common use-case. Meant to give people a more hands-on intro to document loaders/loading data from different data sources as well. Some duplicate content for RAG and extraction (to show what you can do with the loaded documents), but defers to the appropriate sections rather than going too in-depth. @baskaryan @hwchase17	2024-06-20 13:52:21 -07:00
Jeffrey Mak	e37d7ad66e	community[patch]:Support filter for AzureAISearchRetriever (#22303 ) Description: The AzureAISearchRetriever does not support the "$filter" argument offered in the AISearch API: https://learn.microsoft.com/en-us/rest/api/searchservice/documents/search-get?view=rest-searchservice-2023-11-01&tabs=HTTP The $filter allows filtering of indexes based on values in metadata. Issue: https://github.com/langchain-ai/langchain/issues/19885 Dependencies: No Twitter handle: @Jeffreym9M - [ ] Add tests and docs: Not relevant - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-06-20 13:52:21 -07:00
Isaac Francisco	035992e8fc	docs: duckduckgosearch options listed (#22568 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-20 13:52:21 -07:00
Mikhail Khludnev	1b410bb6e5	docs: mentioning query_instruction with regards to BGE-M3 (#22405 ) see https://github.com/langchain-ai/langchain/pull/18017#issuecomment-2143942760 https://huggingface.co/BAAI/bge-m3#faq Co-authored-by: mikhail-khludnev <mikhail_khludnev@rntgroup.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-20 13:52:21 -07:00
X-HAN	c517ce0c8d	community[minor]: add DashScope Rerank (#22403 ) Description: this PR adds DashScope Rerank capability to Langchain, you can find DashScope Rerank API from [here](https://help.aliyun.com/document_detail/2780058.html?spm=a2c4g.2780059.0.0.6d995024FlrJ12) & [here](https://help.aliyun.com/document_detail/2780059.html?spm=a2c4g.2780058.0.0.63f75024cr11N9). [DashScope](https://dashscope.aliyun.com/) is the generative AI service from Alibaba Cloud (Aliyun). You can create DashScope API key from [here](https://bailian.console.aliyun.com/?apiKey=1#/api-key). Dependencies: DashScopeRerank depends on `dashscope` python package. Twitter handle: my twitter/x account is https://x.com/LastMonopoly and I'd like a mention, thanks you! Tests and docs 1. integration test: `test_dashscope_rerank.py` 2. example notebook: `dashscope_rerank.ipynb` Lint and test: I have run `make format`, `make lint` and `make test` from the root of the package I've modified. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-20 13:52:21 -07:00
Ethan Yang	1f85b55db2	[Community]add option to delete the prompt from HF output (#22225 ) This will help to solve pattern mismatching issue when parsing the output in Agent. https://github.com/langchain-ai/langchain/issues/21912	2024-06-20 13:52:21 -07:00
Jacob Lee	a07f3ac0a6	docs[patch]: Adds heading keywords to concepts page (#22577 ) @efriis @baskaryan	2024-06-20 13:52:21 -07:00
Erick Friis	b41d805992	docs: update agentexecutor title to legacy (#22575 )	2024-06-20 13:52:21 -07:00
Bagatur	bbc8819d0c	community[patch]: AzureSearch async functions (#22075 )	2024-06-20 13:52:21 -07:00
Bagatur	a1df71ad8e	langchain[minor]: add universal init_model (#22039 ) decisions to discuss - only chat models - model_provider isn't based on any existing values like llm-type, package names, class names - implemented as function not as a wrapper ChatModel - function name (init_model) - in langchain as opposed to community or core - marked beta	2024-06-20 13:52:21 -07:00
Isaac Francisco	edca3e33dd	docs: deprecation of max_length parameter used in Exa search (#22567 )	2024-06-20 13:52:21 -07:00
ccurme	15450bdef5	community: update how OpenAIAssistantV2Runnable creates threads with tool_resources (#22549 ) https://github.com/langchain-ai/langchain/issues/22503	2024-06-20 13:52:21 -07:00
Bagatur	24abcc60e8	community[patch]: Release 0.2.3 (#22562 )	2024-06-20 13:52:21 -07:00
Bagatur	9837bc92b3	nomic[patch]: Release 0.1.2 (#22561 )	2024-06-20 13:52:21 -07:00
Zach Nussbaum	4edd6af4fb	embeddings: nomic embed vision (#22482 ) Thank you for contributing to LangChain! Description: Adds Langchain support for Nomic Embed Vision Twitter handle: nomic_ai,zach_nussbaum - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Lance Martin <122662504+rlancemartin@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-20 13:52:21 -07:00
leila-messallem	3d1fff0cdd	community[patch]: improve test setup to accurately test filtering of labels in neo4j (#22531 ) Description: This PR addresses an issue with an existing test that was not effectively testing the intended functionality. The previous test setup did not adequately validate the filtering of the labels in neo4j, because the nodes and relationship in the test data did not have any properties set. Without properties these labels would not have been returned, regardless of the filtering. --------- Co-authored-by: Oskar Hane <oh@oskarhane.com>	2024-06-20 13:52:21 -07:00
Mohammad Mohtashim	f4c6b05497	[Experimental]: Async agenerate method ollama functions (#21682 ) - Description: : Added Async method for Generate for OllamaFunctions which was missing and was raising errors for the users. - Issue: #21422	2024-06-20 13:52:21 -07:00
Stefano Lottini	db9a7df552	community[minor]: Add support for metadata indexing policy in Cassandra vector store (#22548 ) This PR adds a constructor `metadata_indexing` parameter to the Cassandra vector store to allow optional fine-tuning of which fields of the metadata are to be indexed. This is a feature supported by the underlying CassIO library. Indexing mode of "all", "none" or deny- and allow-list based choices are available. The rationale is, in some cases it's advisable to programmatically exclude some portions of the metadata from the index if one knows in advance they won't ever be used at search-time. this keeps the index more lightweight and performant and avoids limitations on the length of _indexed_ strings. I added a integration test of the feature. I also added the possibility of running the integration test with Cassandra on an arbitrary IP address (e.g. Dockerized), via `CASSANDRA_CONTACT_POINTS=10.1.1.5,10.1.1.6 poetry run pytest [...]` or similar. While I was at it, I added a line to the `.gitignore` since the mypy _test_ cache was not ignored yet. My X (Twitter) handle: @rsprrs.	2024-06-20 13:52:21 -07:00
Emilien Chauvet	09492f78fa	community[minor]: add user agent for web scraping loaders (#22480 ) Description: This PR adds a `USER_AGENT` env variable that is to be used for web scraping. It creates a util to get that user agent and uses it in the classes used for scraping in [this piece of doc](https://python.langchain.com/v0.1/docs/use_cases/web_scraping/). Identifying your scraper is considered a good politeness practice, this PR aims at easing it. Issue: `None` Dependencies: `None` Twitter handle: `None`	2024-06-20 13:52:21 -07:00
Philippe PRADOS	f71ce8fd76	community[minor]: Add native async support to SQLChatMessageHistory (#22065 ) # package community: Fix SQLChatMessageHistory ## Description Here is a rewrite of `SQLChatMessageHistory` to properly implement the asynchronous approach. The code circumvents [issue 22021](https://github.com/langchain-ai/langchain/issues/22021) by accepting a synchronous call to `def add_messages()` in an asynchronous scenario. This bypasses the bug. For the same reasons as in [PR 22](https://github.com/langchain-ai/langchain-postgres/pull/32) of `langchain-postgres`, we use a lazy strategy for table creation. Indeed, the promise of the constructor cannot be fulfilled without this. It is not possible to invoke a synchronous call in a constructor. We compensate for this by waiting for the next asynchronous method call to create the table. The goal of the `PostgresChatMessageHistory` class (in `langchain-postgres`) is, among other things, to be able to recycle database connections. The implementation of the class is problematic, as we have demonstrated in [issue 22021](https://github.com/langchain-ai/langchain/issues/22021). Our new implementation of `SQLChatMessageHistory` achieves this by using a singleton of type (`Async`)`Engine` for the database connection. The connection pool is managed by this singleton, and the code is then reentrant. We also accept the type `str` (optionally complemented by `async_mode`. I know you don't like this much, but it's the only way to allow an asynchronous connection string). In order to unify the different classes handling database connections, we have renamed `connection_string` to `connection`, and `Session` to `session_maker`. Now, a single transaction is used to add a list of messages. Thus, a crash during this write operation will not leave the database in an unstable state with a partially added message list. This makes the code resilient. We believe that the `PostgresChatMessageHistory` class is no longer necessary and can be replaced by: ``` PostgresChatMessageHistory = SQLChatMessageHistory ``` This also fixes the bug. ## Issue - [issue 22021](https://github.com/langchain-ai/langchain/issues/22021) - Bug in _exit_history() - Bugs in PostgresChatMessageHistory and sync usage - Bugs in PostgresChatMessageHistory and async usage - [issue 36](https://github.com/langchain-ai/langchain-postgres/issues/36) ## Twitter handle: pprados ## Tests - libs/community/tests/unit_tests/chat_message_histories/test_sql.py (add async test) @baskaryan, @eyurtsev or @hwchase17 can you check this PR ? And, I've been waiting a long time for validation from other PRs. Can you take a look? - [PR 32](https://github.com/langchain-ai/langchain-postgres/pull/32) - [PR 15575](https://github.com/langchain-ai/langchain/pull/15575) - [PR 13200](https://github.com/langchain-ai/langchain/pull/13200) --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-06-20 13:52:21 -07:00
Vincent Min	d17efe37f1	community[minor]: Improve InMemoryVectorStore with ability to persist to disk and filter on metadata. (#22186 ) - Description: The InMemoryVectorStore is a nice and simple vector store implementation for quick development and debugging. The current implementation is quite limited in its functionalities. This PR extends the functionalities by adding utility function to persist the vector store to a json file and to load it from a json file. We choose the json file format because it allows inspection of the database contents in a text editor, which is great for debugging. Furthermore, it adds a `filter` keyword that can be used to filter out documents on their `page_content` or `metadata`. - Issue: - - Dependencies: - - Twitter handle: @Vincent_Min	2024-06-20 13:52:21 -07:00
Christophe Bornet	dacd50d0b9	core[patch]: Improve VectorStore API doc (#22547 )	2024-06-20 13:52:21 -07:00
maang-h	882b6cdbca	community[patch]: add detailed paragraph and example for BaichuanTextEmbeddings (#22031 ) - Description: add detailed paragraph and example for BaichuanTextEmbeddings - Issue: the issue #21983	2024-06-20 13:52:21 -07:00
Anthony Bernabeu	dd8fdfa375	community[minor]: Added filter search for LanceDB (#22461 ) - [ ] community: "vectorstore: added filtering support for LanceDB vector store" - [ ] This PR adds filtering capabilities to LanceDB: - Description: In LanceDB filtering can be applied when searching for data into the vectorstore. It is using the SQL language as mentioned in the LanceDB documentation. - Issue: #18235 - Dependencies: No - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-06-20 13:52:21 -07:00
Erick Friis	a3ba5c0048	huggingface: remove text-generation dep (#22543 )	2024-06-20 13:52:21 -07:00
Erick Friis	dcf10b7a7f	ai21: fix core version (#22544 )	2024-06-20 13:52:21 -07:00
Asaf Joseph Gardin	758fad6d03	ai21: fix ai21 unittests (#22526 ) Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-20 13:52:21 -07:00
Erick Friis	bc2f874835	community: fix huggingface deprecations (#22522 )	2024-06-20 13:52:21 -07:00
Jacob Lee	1ee9926af3	docs[patch]: Adds links to deprecations page (#22514 ) @baskaryan	2024-06-20 13:52:21 -07:00
William FH	704a9d4955	[Docs] Structured output Keywords (#22511 )	2024-06-20 13:52:21 -07:00
Christophe Bornet	7e28598358	core[patch]: Add similarity_score_threshold to VectorStore search types (#22477 )	2024-06-20 13:52:21 -07:00
Eugene Yurtsev	a46ac08183	core[patch]: Deduplicate of callback handlers in merge_configs (#22478 ) This PR adds deduplication of callback handlers in merge_configs. Fix for this issue: https://github.com/langchain-ai/langchain/issues/22227 The issue appears when the code is: 1) running python >=3.11 2) invokes a runnable from within a runnable 3) binds the callbacks to the child runnable from the parent runnable using with_config In this case, the same callbacks end up appearing twice: (1) the first time from with_config, (2) the second time with langchain automatically propagating them on behalf of the user. Prior to this PR this will emit duplicate events: ```python @tool async def get_items(question: str, callbacks: Callbacks): # <--- Accept callbacks """Ask question""" template = ChatPromptTemplate.from_messages( [ ( "human", "'{question}" ) ] ) chain = template \| chat_model.with_config( { "callbacks": callbacks, # <-- Propagate callbacks } ) return await chain.ainvoke({"question": question}) ``` Prior to this PR this will work work correctly (no duplicate events): ```python @tool async def get_items(question: str, callbacks: Callbacks): # <--- Accept callbacks """Ask question""" template = ChatPromptTemplate.from_messages( [ ( "human", "'{question}" ) ] ) chain = template \| chat_model return await chain.ainvoke({"question": question}, {"callbacks": callbacks}) ``` This will also work (as long as the user is using python >= 3.11) -- as langchain will automatically propagate callbacks ```python @tool async def get_items(question: str,): """Ask question""" template = ChatPromptTemplate.from_messages( [ ( "human", "'{question}" ) ] ) chain = template \| chat_model return await chain.ainvoke({"question": question}) ```	2024-06-20 13:52:21 -07:00
Jacob Lee	1b56ca3f84	docs[patch]: Update quickstart tutorial (#22504 ) Mentions LCEL more, hopefully flags it to more people as a simple entrypoint @baskaryan @hwchase17	2024-06-20 13:52:21 -07:00
Ofer Mendelevitch	b12e1ae568	community[minor]: Vectara Integration Update - Streaming, FCS, Chat, updates to documentation and example notebooks (#21334 ) Thank you for contributing to LangChain! Description: update to the Vectara / Langchain integration to integrate new Vectara capabilities: - Full RAG implemented as a Runnable with as_rag() - Vectara chat supported with as_chat() - Both support streaming response - Updated documentation and example notebook to reflect all the changes - Updated Vectara templates Twitter handle: ofermend Add tests and docs: no new tests or docs, but updated both existing tests and existing docs	2024-06-20 13:52:21 -07:00
Bagatur	95e8cc361e	docs: update anthropic chat model (#22483 ) Related to #22296 And update anthropic to accept base_url	2024-06-20 13:52:21 -07:00
Erick Friis	6cb5075ce2	robocorp: typo (#22509 )	2024-06-20 13:52:21 -07:00
Erick Friis	73308012dc	robocorp: release 0.0.9.post1 (#22507 )	2024-06-20 13:52:21 -07:00
Erick Friis	6a8b77d30a	ai21: release 0.1.6 (#22508 )	2024-06-20 13:52:21 -07:00
ccurme	80977fa0bd	together, upstage: bump minimum langchain-openai version (#22505 )	2024-06-20 13:52:21 -07:00
Erick Friis	450e4af347	docs: fix api ref link generation (#22438 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-06-20 13:52:21 -07:00
Bagatur	607ea7da83	mongodb[patch]: Release 0.1.6 (#22501 )	2024-06-20 13:52:20 -07:00
Bagatur	605fc224ba	groq[patch]: Release 0.1.5 (#22500 )	2024-06-20 13:52:20 -07:00
Bagatur	612558b251	milvus[patch]: Release 0.1.1 (#22499 )	2024-06-20 13:52:20 -07:00

1 2 3 4 5 ...

9713 Commits