langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-02-21 06:33:41 +00:00

Author	SHA1	Message	Date
Nuno Campos	8329cae709	core: Two updates to chat model interface (#19684 ) - .stream() and .astream() call on_llm_new_token, removing the need for subclasses to do so. Backwards compatible because now we don't pass run_manager into ._stream and ._astream - .generate() and .agenerate() now handle `stream: bool` kwarg for _generate and _agenerate. Subclasses handle this arg by delegating to ._stream(), now one less thing they need to do. Backwards compat because this is an optional arg that we now never pass to the subclasses - .generate() and .agenerate() now inspect callback handlers to decide on a default value for stream:bool if not passed in. This auto enables streaming when using astream_events and astream_log - as a result of these three changes any usage of .astream_events and .astream_log should now yield chat model stream events - In future PRs we can update all subclasses to reflect these two things now handled by base class, but in meantime all will continue to work	2024-04-25 17:39:32 -07:00
Kahlil Wehmeyer	cf4ca8d2f2	core[patch]: ToolException docs/exception message (#17590 ) Description: This PR adds a slightly more helpful message to a Tool Exception ``` # current state langchain_core.tools.ToolException: Too many arguments to single-input tool # proposed state langchain_core.tools.ToolException: Too many arguments to single-input tool. Consider using a StructuredTool instead. ``` Issue: Somewhat discussed here 👉 #6197 Dependencies: None Twitter handle: N/A --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-25 17:39:32 -07:00
Christophe Bornet	eafa9124af	core[minor]: Add async methods to MaxMarginalRelevanceExampleSelector (#19639 )	2024-04-25 17:39:32 -07:00
Jan Nissen	34fd605cac	core[minor]: support pydantic v2 models in PydanticOutputParser (#18811 ) As mentioned in #18322, the current PydanticOutputParser won't work for anyone trying to parse to pydantic v2 models. This PR adds a separate `PydanticV2OutputParser`, as well as a `langchain_core.pydantic_v2` namespace that will fail on import to any projects using pydantic<2. Happy to update the docs for output parsers if this is something we're interested in adding. On a separate note, I also updated `check_pydantic.sh` to detect pydantic imports with leading whitespace and excluded the internal namespaces. That change can be separated into its own PR if needed. --------- Co-authored-by: Jan Nissen <jan23@gmail.com>	2024-04-25 17:39:32 -07:00
jhicks2306	cb6891c796	docs: Improve docstring for Runnable bind method (#19659 ) Added example to the docstring of the "bind" method of Runnable. This makes it easier to understand the purpose of the method when reviewing in code editors, e.g. VS Code. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-25 17:39:32 -07:00
Christophe Bornet	5b9b20b543	core: Add async methods to LengthBasedExampleSelector (#19640 )	2024-04-25 17:39:32 -07:00
Bagatur	9cf95f3c24	core[patch]: Release 0.1.35 (#19660 )	2024-04-25 17:39:32 -07:00
Eugene Yurtsev	fdaf96cac3	core[patch]: Patch XML vulnerability in XMLOutputParser (CVE-2024-1455) (#19653 ) Patch potential XML vulnerability CVE-2024-1455 This patches a potential XML vulnerability in the XMLOutputParser in langchain-core. The vulnerability in some situations could lead to a denial of service attack. At risk are users that: 1) Are running older distributions of Python that have an older version of libexpat 2) Are using XMLOutputParser with an agent 3) Accept inputs from untrusted sources with this agent (e.g., endpoint on the web that allows an untrusted user to interact with the parser)	2024-04-25 17:39:32 -07:00
Eugene Yurtsev	69530b0afc	core[patch]: XMLOutputParser fix to handle changes to xml standard library (#19612 ) Newest python micro releases broke streaming in the XMLOutputParser. This fixes the parsing code to work with trailing junk after the XML content.	2024-04-25 17:39:32 -07:00
jhicks2306	900329d505	docs: Update docstring for MessagesPlaceholder (#19601 ) Update to docstring for MessagesPlaceholder so that it shows helpful information in code editors, e.g. VS Code. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 17:39:31 -07:00
Bagatur	ba51e354a8	core[patch]: Release 0.1.34 (#19609 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-25 17:39:31 -07:00
Nuno Campos	38ec48a7a5	load: Optionally disable reading secrets from env (#19596 )	2024-04-25 17:39:31 -07:00
Eugene Yurtsev	45ca3d6264	core[patch]: Temporarily disable test for streaming xml parser (#19610 ) Test is failing due to micro version bump in python interpreter which changed something about how std xml parser works	2024-04-25 17:39:31 -07:00
Eugene Yurtsev	fc489a3e71	core[patch]: Reverting changes with defusedXML (#19604 ) DefusedXML is causing parsing errors on previously functional code with the 0.7.x versions. These do not seem to support newer versions of Python well. 0.8.x has only been released as rc, so we're not going to use it in the core package	2024-04-25 17:39:31 -07:00
Eugene Yurtsev	8d973dd3c2	core[patch]: Remove anyio dependency (#19583 ) The dependency isn't used anymore	2024-04-25 17:39:31 -07:00
Christophe Bornet	d3914842f6	core[minor]: Use BaseChatMessageHistory async methods in RunnableWithMessageHistory (#19565 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-25 17:39:31 -07:00
Christophe Bornet	34e7a344b8	core: Add async methods to BaseExampleSelector and SemanticSimilarityExampleSelector (#19399 ) Few-Shot prompt template may use a `SemanticSimilarityExampleSelector` that in turn uses a `VectorStore` that does I/O operations. So to work correctly on the event loop, we need: * async methods for the `VectorStore` (OK) * async methods for the `SemanticSimilarityExampleSelector` (this PR) * async methods for `BasePromptTemplate` and `BaseChatPromptTemplate` (future work)	2024-04-25 17:39:31 -07:00
Christophe Bornet	08f17961fb	core[minor]: Add default implementations to amax_marginal_relevance_search_by_vector and adelete (#19269 )	2024-04-25 17:39:31 -07:00
Guangdong Liu	d912472113	docs: Add in code documentation to core Runnable map methods (docs only) (#19517 ) - Issue: #18804 - @baskaryan, @eyurtsev	2024-04-25 17:39:31 -07:00
Eugene Yurtsev	3e2a7f3289	core[patch]: fix xml output parser transform (#19530 ) Previous PR passed _parser attribute which apparently is not meant to be used by user code and causes non-deterministic failures on CI when testing the transform and atransform methods. Reverting this change temporarily.	2024-04-25 17:39:30 -07:00
aditya thomas	c9860ec689	core[runnables]: docstring for class runnable, method with_listeners() (#19515 ) Description: Docstring for method with_listeners() of class Runnable Issue: [Add in code documentation to core Runnable methods #18804](https://github.com/langchain-ai/langchain/issues/18804) Dependencies: None	2024-04-25 17:39:30 -07:00
Eugene Yurtsev	b747dbb9ab	core[patch]: Use defusedxml in XMLOutputParser (#19526 ) This mitigates a security concern for users still using older versions of libexpat, which could allow an attacker to compromise the availability of the system if the attacker manages to surface a malicious payload to this XMLParser.	2024-04-25 17:39:30 -07:00
Guangdong Liu	1abe811c24	docs: docstring Runnable `pipe` and `pick` methods (docs only) (#19395 ) - Issue: #18804 - @eyurtsev @ccurme PTAL --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-25 17:39:30 -07:00
Harrison Chase	361b3d513a	core[minor]: Add utility code to create tool examples (#18602 ) Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-25 17:39:30 -07:00
William FH	1f92eaf90d	core[patch]: allow "placeholder" type in from_messages tuples (#19152 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-25 17:39:30 -07:00
Bagatur	492d31237c	core[patch]: Release 0.1.33 (#19348 )	2024-04-25 17:39:29 -07:00
William FH	b9b20f6304	[Feat] Accept non-dict if only 1 prompt input variable (#19156 ) For prompt templates with only 1 variable (common in e.g., MessageGraph), it's convenient to wrap the incoming object in the variable before formatting. The downside of this, of course, would be that some number of invocations will successfully format when the user may have intended to format it properly before	2024-04-25 17:39:29 -07:00
Devesh Rahatekar	623907640d	core: Updated docstring for RunnablePick (#18832 ) Description: Updated the docstring for RunnablePick. Added Overview and an Example for RunnablePick class. Issue: #18803	2024-04-25 17:39:29 -07:00
Leonid Ganeline	8ecd9d643f	core[patch]: Update `messages` namespace to fix API reference docs (#19161 ) Classes and functions defined in __init__.py are not parsed into the API Reference. For example: - libs/core/langchain_core/messages/__init__.py : AnyMessage, MessageLikeRepresentation, get_buffer_string(), messages_from_dict(), ... Opinionated: __init__.py is not a typical place to define artifacts. Moved artifacts from __init__ into utils.py. Added `MessageLikeRepresentation` to __all__ since it is used outside of `messages`, for example, in `libs/core/langchain_core/language_models/base.py` Added `_message_from_dict` to __all__ since it is used outside of `messages`(???) I would add `message_from_dict` (without underscore) as an alias. Please, advise.	2024-04-25 17:39:29 -07:00
Christophe Bornet	621ff05f94	core: Simplify astream logic in BaseChatModel and BaseLLM (#19332 ) Covered by tests in `libs/core/tests/unit_tests/language_models/chat_models/test_base.py`, `libs/core/tests/unit_tests/language_models/llms/test_base.py` and `libs/core/tests/unit_tests/runnables/test_runnable_events.py`	2024-04-25 17:39:29 -07:00
Chris Papademetrious	a2de40b283	core: implement a batch_size parameter for CacheBackedEmbeddings (#18070 ) Description: Currently, `CacheBackedEmbeddings` computes vectors for all uncached documents before updating the store. This pull request updates the embedding computation loop to compute embeddings in batches, updating the store after each batch. I noticed this when I tried `CacheBackedEmbeddings` on our 30k document set and the cache directory hadn't appeared on disk after 30 minutes. The motivation is to minimize compute/data loss when problems occur: * If there is a transient embedding failure (e.g. a network outage at the embedding endpoint triggers an exception), at least the completed vectors are written to the store instead of being discarded. * If there is an issue with the store (e.g. no write permissions), the condition is detected early without computing (and discarding!) all the vectors. Issue: Implements enhancement #18026. Testing: I was unable to run unit tests; details in [this post](https://github.com/langchain-ai/langchain/discussions/15019#discussioncomment-8576684). --------- Signed-off-by: chrispy <chrispy@synopsys.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-25 17:39:29 -07:00
Guangdong Liu	ba85d67e91	code[patch]: Add in code documentation to core Runnable with_retry method (docs only) (#19192 ) - Description: Add in code documentation to core Runnable with_retry method (docs only) - Issue: #18804 @baskaryan @eyurtsev PTAL --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-04-25 17:39:29 -07:00
Eugene Yurtsev	4bb670becf	core[patch]: Pass sync run manager for sync stream fallback in astream (#19280 ) This PR patches the fallback in chat models and language models to pass in the appropriate version of the run manager (sync vs. async)	2024-04-25 17:39:29 -07:00
Leonid Ganeline	127b852633	core[patch]: Move `globals` to a module instead of a package (non breaking change) (#19159 ) Classes and functions defined in __init__.py are not parsed into the API Reference. For example: libs/core/langchain_core/globals/__init__.py : `set_verbose` `get_llm_cache`, `set_llm_cache`, ... And the whole `langchain_core.globals` namespace is not visible in the API Reference. The refactoring is just file renaming.	2024-04-25 17:39:29 -07:00
Al-Ekram Elahee Hridoy	668c74c82d	core[minor]: Enhance cache flexibility in BaseChatModel (#17386 ) - Description: Enhanced the `BaseChatModel` to support an `Optional[Union[bool, BaseCache]]` type for the `cache` attribute, allowing for both boolean flags and custom cache implementations. Implemented logic within chat model methods to utilize the provided custom cache implementation effectively. This change aims to provide more flexibility in caching strategies for chat models. - Issue: Implements enhancement request #17242. - Dependencies: No additional dependencies required for this change. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-25 17:39:29 -07:00
Roshan Santhosh	576f9e0c31	core: update _rm_titles to account for title argument name bug (#19036 ) Issue: For functions which have an argument with the name 'title', the convert_pydantic_to_openai_function generates an incorrect output and omits the argument altogether. This is because the _rm_titles function removes all instances of the key 'title' from the output. Description: Updates the _rm_titles function to check the presence of the 'type' key as well before removing the 'title' key, as the title key that we wish to omit always has a type key along with it. Potential gap if there is a function defined which has both title and key as argument names, in which case this would fail. Maybe we could set a filter on the function argument names and reject those with keyword argument names. No dependencies. Passed all tests. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-04-25 17:39:29 -07:00
Aaron Jimenez	0d657e5a7f	core: Updated docstring for Context class (#19079 ) - Description: Improves the docstring for `class Context` by providing an overview and an example. - Issue: #18803	2024-04-25 17:39:29 -07:00
Kangmoon Seo	44b6f08469	core: Fix Exception handling in XMLOutputParser (#19126 ) - Description: - Exception handling in `XMLOutputParser` 1. Add Exception handling at `root = ET.fromstring(text)` // raises `ET.ParseError` 2. Fix Exception class (commonly used in `BaseOutputParser` class) - AS-IS: raise `ValueError`, `ET.ParseError` without handling ```python # langchain_core/output_parsers/xml.py text = text.strip() if (text.startswith("<") or text.startswith("\n<")) and ( text.endswith(">") or text.endswith(">\n") ): root = ET.fromstring(text) return self._root_to_dict(root) else: raise ValueError(f"Could not parse output: {text}") ``` - TO-BE: raise `OutputParserException` ```python # langchain_core/output_parsers/xml.py text = text.strip() if (text.startswith("<") or text.startswith("\n<")) and ( text.endswith(">") or text.endswith(">\n") ): try: root = ET.fromstring(text) return self._root_to_dict(root) except ET.ParseError: raise OutputParserException(f"Could not parse output: {text}") else: raise OutputParserException(f"Could not parse output: {text}") ``` - Issue: #19107 - Dependencies: None	2024-04-25 17:39:29 -07:00
William FH	3637f89868	[Enhancement] Add support for directly providing a run_id (#18990 ) The root run id (~trace id's) is useful for assigning feedback, but the current recommended approach is to use callbacks to retrieve it, which has some drawbacks: 1. Doesn't work for streaming until after the first event 2. Doesn't let you call other endpoints with the same trace ID in parallel (since you have to wait until the call is completed/started to use it). This PR lets you provide a "run_id" in the runnable config. A couple of considerations: 1. For batch calls, we split the trace up into separate trees (to permit better rendering). We keep the provided run ID for the first one and generate a unique one for other elements of the batch. 2. For nested calls, the provided ID is ONLY used on the top root/trace. ``` chain.invoke("foo", {"run_id": uuid.uuid4()}) ```	2024-04-25 17:39:27 -07:00
Jacob Lee	ab38250f55	core[patch]: Add LLM output to message response_metadata (#19158 ) This will more easily expose token usage information. CC @baskaryan --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 17:39:13 -07:00
Leonid Kuligin	80e1339d9d	core[minor]: moved fake llms and embeddings to core (#19226 ) - [ ] PR title: "core: moved fake llms and embeddings to core" - [ ] PR message: - Description: moved fake llms and embeddings to core"	2024-04-25 17:39:13 -07:00
Nikhil Kumar	e48073a9b8	docs: Add docs for RouterRunnable (#19191 ) - [x] Docs for `RouterRunnable`: core: Add docs for `RouterRunnable` - [x] Add docs for `RouterRunnable`: - Description: Add docs for `RouterRunnable`, which was previously missing documentation - Issue: #18803 - Dependencies: N/A - Twitter handle: None	2024-04-25 17:39:13 -07:00
Guangdong Liu	5b47b26a32	docs: Add in code documentation to core Runnable with_fallbacks method (docs only) (#19104 ) - Description: Add in code documentation to core Runnable with_fallbacks method (docs only) - Issue: #18804 @eyurtsev PTAL	2024-04-25 17:39:12 -07:00
Maxime Perrin	3fc3e075bd	core[minor]: allow LLMs async streaming to fallback on sync streaming (#18960 ) - Description: Handling fallbacks when calling async streaming for a LLM that doesn't support it. - Issue: #18920 - Twitter handle:@maximeperrin_ --------- Co-authored-by: Maxime Perrin <mperrin@doing.fr>	2024-04-25 17:39:12 -07:00
Erick Friis	3dec36a543	community, langchain, infra: revert store extended test deps outside of poetry (#19153 ) Reverts langchain-ai/langchain#18995 Because it makes installing dependencies in python 3.11 extended testing take 80 minutes	2024-04-25 17:39:12 -07:00
Erick Friis	e1231fdd2e	community, langchain, infra: store extended test deps outside of poetry (#18995 ) poetry can't reliably handle resolving the number of optional "extended test" dependencies we have. If we instead just rely on pip to install extended test deps in CI, this isn't an issue.	2024-04-25 17:39:12 -07:00
Bagatur	a88e19e197	core[patch]: rc release 0.1.33-rc.1 (#19103 )	2024-04-25 17:39:12 -07:00
Nuno Campos	84bccc4e7a	core[patch]: Change structured prompt lc id to match js (#19099 )	2024-04-25 17:39:12 -07:00
Eugene Yurtsev	0da204b457	core[patch]: RunnablePassthrough transform to autoupgrade to AddableDict (#19051 ) Follow up on https://github.com/langchain-ai/langchain/pull/18743 which missed RunnablePassthrough Issues: https://github.com/langchain-ai/langchain/issues/18741 https://github.com/langchain-ai/langgraph/issues/136 https://github.com/langchain-ai/langserve/issues/504	2024-04-25 17:39:12 -07:00
Guangdong Liu	03d32fbf0e	code[patch]: Add in code documentation to core Runnable assign method (docs only) (#18951 ) - Description: docs: Add in code documentation to core Runnable assign method - Issue: #18804	2024-04-25 17:39:12 -07:00
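
Commit a2de40b283 above describes computing embeddings in batches and writing each batch to the store before starting the next, so that a mid-run failure keeps the vectors already computed. The following is a minimal sketch of that idea only, not the `CacheBackedEmbeddings` implementation: `embed_with_cache`, `batched`, `fake_embed`, and the plain `dict` store are hypothetical names chosen for illustration.

```python
from typing import Callable, Dict, Iterator, List, Sequence

Vector = List[float]


def batched(items: Sequence[str], batch_size: int) -> Iterator[Sequence[str]]:
    """Yield consecutive slices of `items` with at most `batch_size` elements."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def embed_with_cache(
    texts: Sequence[str],
    embed_fn: Callable[[List[str]], List[Vector]],  # hypothetical embedding call
    store: Dict[str, Vector],                       # hypothetical key-value cache
    batch_size: int,
) -> List[Vector]:
    """Embed only uncached texts, persisting results after every batch."""
    # De-duplicate while preserving order, then keep only cache misses.
    missing = [t for t in dict.fromkeys(texts) if t not in store]
    for batch in batched(missing, batch_size):
        vectors = embed_fn(list(batch))
        # Write this batch immediately; a later failure cannot discard it.
        store.update(zip(batch, vectors))
    return [store[t] for t in texts]


def fake_embed(batch: List[str]) -> List[Vector]:
    """Stand-in embedding function: one-dimensional vector of the text length."""
    return [[float(len(text))] for text in batch]


store: Dict[str, Vector] = {}
vectors = embed_with_cache(["a", "bb", "ccc"], fake_embed, store, batch_size=2)
```

If `embed_fn` raises on a later batch, the exception still propagates, but every batch completed before it is already in `store`, so a retry only has to embed the remaining texts.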

358 Commits
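
The `_rm_titles` fix in commit 576f9e0c31 is described as removing a `title` key only when a `type` key sits beside it, so that a function argument that happens to be named `title` survives. A rough sketch of that rule follows, assuming a JSON-schema-like nested dict; `rm_titles_sketch` and the sample schema are hypothetical and are not the code in `langchain_core`.

```python
from typing import Any, Dict


def rm_titles_sketch(schema: Dict[str, Any]) -> Dict[str, Any]:
    """Drop schema-generated 'title' entries, keeping properties named 'title'."""
    cleaned: Dict[str, Any] = {}
    for key, value in schema.items():
        # A generated title lives next to a 'type' key; an argument named
        # 'title' lives inside 'properties', where no 'type' key is present.
        if key == "title" and "type" in schema:
            continue
        cleaned[key] = rm_titles_sketch(value) if isinstance(value, dict) else value
    return cleaned


# Hypothetical schema for a function with arguments `title` and `pages`.
schema = {
    "title": "Book",
    "type": "object",
    "properties": {
        "title": {"title": "Title", "type": "string"},
        "pages": {"title": "Pages", "type": "integer"},
    },
}
result = rm_titles_sketch(schema)
```

As the commit message notes, a rule like this still has a gap: in this sketch, if the arguments included both `title` and `type`, the `properties` dict would itself contain a `type` key and the `title` argument would be dropped.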