- Support thinking blocks in core's `convert_to_openai_messages` (pass
through instead of error)
- Ignore thinking blocks in ChatOpenAI (instead of error)
- Support Anthropic-style image blocks in ChatOpenAI
---
Standard integration tests include a `supports_anthropic_inputs`
property which is currently enabled only for tests on `ChatAnthropic`.
This test enforces compatibility with message histories of the form:
```
- system message
- human message
- AI message with tool calls specified only through `tool_use` content blocks
- human message containing `tool_result` and an additional `text` block
```
It additionally checks support for Anthropic-style image inputs if
`supports_image_inputs` is enabled.
Here we change this test, such that if you enable
`supports_anthropic_inputs`:
- You support AI messages with text and `tool_use` content blocks
- You support Anthropic-style image inputs (if `supports_image_inputs`
is enabled)
- You support thinking content blocks.
That is, we add a test case for thinking content blocks, but we also
remove the requirement of handling tool results within HumanMessages
(motivated by existing agent abstractions, which should all return
ToolMessage). We move that requirement to a ChatAnthropic-specific test.
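For reference, a minimal sketch of the kind of message history the updated test exercises (content-block shapes follow Anthropic's documented format; the exact test payloads may differ):
```python
from langchain_core.messages import AIMessage, HumanMessage, SystemMessage

messages = [
    SystemMessage("You are a helpful assistant."),
    HumanMessage("What is 2 + 2?"),
    # AI message mixing thinking, text, and tool_use content blocks
    AIMessage(
        [
            {"type": "thinking", "thinking": "Simple arithmetic.", "signature": "..."},
            {"type": "text", "text": "Let me check."},
            {"type": "tool_use", "id": "toolu_01", "name": "add", "input": {"a": 2, "b": 2}},
        ]
    ),
]
```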
See https://docs.astral.sh/ruff/rules/#flake8-type-checking-tc
Some fixes were made for TC001, TC002, and TC003, but these rules are excluded since they don't play well with Pydantic.
---------
Co-authored-by: Chester Curme <chester.curme@gmail.com>
Resolves https://github.com/langchain-ai/langchain/issues/29003,
https://github.com/langchain-ai/langchain/issues/27264
Related: https://github.com/langchain-ai/langchain-redis/issues/52
```python
from langchain.chat_models import init_chat_model
from langchain.globals import set_llm_cache
from langchain_community.cache import SQLiteCache
from pydantic import BaseModel
cache = SQLiteCache()
set_llm_cache(cache)
class Temperature(BaseModel):
    value: int
    city: str
llm = init_chat_model("openai:gpt-4o-mini")
structured_llm = llm.with_structured_output(Temperature)
```
```python
# First call (cache miss): 681 ms
response = structured_llm.invoke("What is the average temperature of Rome in May?")
```
```python
# Second call (cache hit): 6.98 ms
response = structured_llm.invoke("What is the average temperature of Rome in May?")
```
See https://docs.astral.sh/ruff/rules/#flake8-annotations-ann
The advantage over mypy alone is that ruff is very fast at detecting missing annotations.
ANN101 and ANN102 are deprecated, so we ignore them.
ANN401 (no `Any` type) is ignored to stay in sync with the mypy config.
---------
Co-authored-by: ccurme <chester.curme@gmail.com>
- **Description:** Add tests for respecting max_concurrency and implement it for abatch_as_completed so that the test passes
- **Issue:** #29425
- **Dependencies:** none
- **Twitter handle:** keenanpepper
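A quick sketch of the behavior under test (illustrative; the actual test lives in the standard test suite):
```python
import asyncio

from langchain_core.runnables import RunnableLambda


async def main() -> None:
    runnable = RunnableLambda(lambda x: x * 2)
    # max_concurrency caps how many inputs are processed at once;
    # results are yielded as (index, output) pairs as they complete.
    async for idx, output in runnable.abatch_as_completed(
        [1, 2, 3, 4, 5], config={"max_concurrency": 2}
    ):
        print(idx, output)


asyncio.run(main())
```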
**Description**
Currently, when parsing a partial JSON, if a string ends with the escape
character, the whole key/value is removed. For example:
```
>>> from langchain_core.utils.json import parse_partial_json
>>> my_str = '{"foo": "bar", "baz": "qux\\'
>>>
>>> parse_partial_json(my_str)
{'foo': 'bar'}
```
My expectation (and with this fix) would be for `parse_partial_json()`
to return:
```
>>> from langchain_core.utils.json import parse_partial_json
>>>
>>> my_str = '{"foo": "bar", "baz": "qux\\'
>>> parse_partial_json(my_str)
{'foo': 'bar', 'baz': 'qux'}
```
Notes:
1. It could be argued that current behavior is still desired.
2. I have experienced this issue when streaming output from an LLM and a chunk happens to end with `\\`
3. I haven't included tests. Will do if change is accepted.
4. This is especially troublesome when this function is used by
187131c55c/libs/core/langchain_core/output_parsers/transform.py (L111)
since, for example, if the received sequence of chunks is `{"foo": "b`, `ar\\`:
Then, the result of calling `self.parse_result()` is:
```
{"foo": "b"}
```
and the second time:
```
{}
```
Co-authored-by: Erick Friis <erick@langchain.dev>
This PR uses the [blockbuster](https://github.com/cbornet/blockbuster)
library in langchain-core to detect blocking calls made in the asyncio
event loop during unit tests.
Avoiding blocking calls is hard as these can be deeply buried in the
code or made in 3rd party libraries.
Blockbuster makes it easier to detect them by raising an exception when a call is made to a known blocking function (e.g. `time.sleep`).
Adding blockbuster allowed us to find a blocking call in `aconfig_with_context` (it ends up calling `get_function_nonlocals`, which loads function code).
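For reference, a minimal sketch of how such a fixture can look, based on blockbuster's documented context manager (the exact wiring in this PR may differ):
```python
import time

import pytest
from blockbuster import blockbuster_ctx


@pytest.fixture(autouse=True)
def blockbuster():
    # Any known blocking call made in the event loop now raises an error
    with blockbuster_ctx() as bb:
        yield bb


async def test_event_loop_is_not_blocked() -> None:
    time.sleep(0.1)  # blockbuster raises here instead of silently blocking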
**Dependencies:**
- blockbuster (test)
**Twitter handle:** cbornet_
TRY004 ("use TypeError rather than ValueError") existing errors are
marked as ignore to preserve backward compatibility.
LMK if you prefer to fix some of them.
Co-authored-by: Erick Friis <erick@langchain.dev>
Add an option to return content and artifacts, so the full info of the retrieved documents can also be accessed.
They are returned as a list of dicts in the `artifacts` property if the `response_format` parameter is set to `"content_and_artifact"`.
Defaults to `"content"` to keep the current behavior.
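A hedged sketch of the option (assuming this lands on `create_retriever_tool`, with `retriever` already built):
```python
from langchain_core.tools import create_retriever_tool

tool = create_retriever_tool(
    retriever,  # an existing retriever, assumed to be in scope
    name="search_docs",
    description="Search the project documentation.",
    response_format="content_and_artifact",  # default remains "content"
)
```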
---------
Co-authored-by: Erick Friis <erick@langchain.dev>
When using `create_xml_agent` or `create_json_chat_agent` to create an agent, if the function corresponding to the tool is a parameterless function, the `XMLAgentOutputParser` or `JSONAgentOutputParser` will parse the tool input into an empty string, and `BaseTool` will parse it into a positional argument.
So the program ultimately crashes because we invoke a parameterless function with a positional argument. Specifically, the code below raises `StopIteration` in
[_parse_input](https://github.com/langchain-ai/langchain/blob/master/libs/core/langchain_core/tools/base.py#L419):
```python
from langchain import hub
from langchain.agents import AgentExecutor, create_json_chat_agent, create_xml_agent
from langchain_core.tools import tool
from langchain_openai import ChatOpenAI


@tool
def get_time() -> str:
    """A parameterless tool that triggers the bug (added here for illustration)."""
    return "noon"


tools = [get_time]
prompt = hub.pull("hwchase17/react-chat-json")
llm = ChatOpenAI()
# agent = create_xml_agent(llm, tools, prompt)
agent = create_json_chat_agent(llm, tools, prompt)
agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)
agent_executor.invoke(...)  # input elided in the original report
```
---------
Co-authored-by: Erick Friis <erick@langchain.dev>
Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>
Co-authored-by: Chester Curme <chester.curme@gmail.com>
- Convert developer openai messages to SystemMessage
- store additional_kwargs={"__openai_role__": "developer"} so that the
correct role can be reconstructed if needed
- update ChatOpenAI to read in openai_role
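For example (illustrative message; the sentinel key comes from this change):
```python
from langchain_core.messages import SystemMessage

# A "developer" message round-trips as a SystemMessage carrying the original role
msg = SystemMessage(
    "Always answer in haiku.",
    additional_kwargs={"__openai_role__": "developer"},
)
```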
---------
Co-authored-by: Erick Friis <erick@langchain.dev>
Description:
Improved the `_parse_google_docstring` function in `langchain/core` to
support parsing multi-paragraph descriptions before the `Args:` section
while maintaining compliance with Google-style docstring guidelines.
This change ensures better handling of docstrings with detailed function
descriptions.
Issue:
Fixes #28628
Dependencies:
None.
Twitter handle:
@isatyamks
---------
Co-authored-by: Erick Friis <erick@langchain.dev>
Co-authored-by: Chester Curme <chester.curme@gmail.com>
The delete methods in the VectorStore and DocumentIndex interfaces
return a status indicating the result. Therefore, we can assume that
their implementations don't throw exceptions but instead return a result
indicating whether the delete operations have failed. The current
implementation doesn't check the returned value, so I modified it to
throw an exception when the operation fails.
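A minimal sketch of the new behavior (illustrative; `vector_store` is assumed in scope):
```python
ids_to_delete = ["doc-1", "doc-2"]  # illustrative ids

# VectorStore.delete returns Optional[bool]; fail loudly instead of ignoring it
if vector_store.delete(ids_to_delete) is False:
    raise ValueError(f"The delete operation failed for ids: {ids_to_delete}")
```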
---------
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
~Note that this PR is now Draft, so I didn't add change to `aindex`
function and didn't add test codes for my change.
After we have an agreement on the direction, I will add commits.~
`batch_size` is very difficult to decide because setting a large number
like >10000 will impact VectorDB and RecordManager, while setting a
small number will delete records unnecessarily, leading to redundant
work, as the `IMPORTANT` section says.
On the other hand, we can't use `full` because the loader returns just a
subset of the dataset in our use case.
I guess many people are in the same situation as us.
So, as one of the possible solutions for it, I would like to introduce a
new argument, `scoped_full_cleanup`.
This argument will be valid only when `cleanup` is full. If True, full cleanup deletes all documents that haven't been updated AND that are associated with source ids that were seen during indexing. Default is False.
This change keeps backward compatibility.
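A hedged sketch of the proposed usage (the argument name comes from this description; the final API may differ):
```python
from langchain_core.indexing import index

# docs_from_one_source, record_manager, and vector_store are assumed to exist
result = index(
    docs_from_one_source,  # a loader that yields only a subset of the dataset
    record_manager,
    vector_store,
    cleanup="full",
    scoped_full_cleanup=True,  # clean up only within source ids seen in this run
    source_id_key="source",
)
```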
---------
Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
I reported the bug 2 weeks ago here:
https://github.com/langchain-ai/langchain/issues/28447
I believe this is a critical bug for the indexer, so I submitted a PR to
revert the change and added unit tests to prevent similar bugs from
being introduced in the future.
@eyurtsev Could you check this?
- [ ] **PR title**: "core: google docstring parsing fix"
- [x] **PR message**:
- **Description:** Added a solution for invalid parsing of Google docstrings such as:

      Args:
          net_annual_income (float): The user's net annual income (in current year dollars).

- **Issue:** The previous code would return arg = "net_annual_income (float)", which would cause an exception in `_validate_docstring_args_against_annotations`
- **Dependencies:** None
Co-authored-by: Erick Friis <erick@langchain.dev>
- **Description:**
- Trim functions were incorrectly deleting nodes with more than one outgoing/incoming edge, so an extra condition was added to check for this directly. A unit test, "test_trim_multi_edge", was written to cover this case specifically.
- **Issue:**
- Fixes #28411
- Fixes https://github.com/langchain-ai/langgraph/issues/1676
- **Dependencies:**
- No changes were made to the dependencies
- [x] Unit tests were added to verify the changes.
- [x] Updated documentation where necessary.
- [x] Ran make format, make lint, and make test to ensure compliance
with project standards.
---------
Co-authored-by: Tasif Hussain <tasif006@gmail.com>
We have a test
[test_structured_few_shot_examples](ad4333ca03/libs/standard-tests/langchain_tests/integration_tests/chat_models.py (L546))
in standard integration tests that implements a version of tool-calling
few shot examples that works with ~all tested providers. The formulation
supported by ~all providers is: `human message, tool call, tool message,
AI response`.
Here we update
`langchain_core.utils.function_calling.tool_example_to_messages` to
support this formulation.
The `tool_example_to_messages` util is undocumented outside of our API
reference. IMO, if we are testing that this function works across all
providers, it can be helpful to feature it in our guides. The structured
few-shot examples we document at the moment require users to implement
this function and can be simplified.
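A hedged sketch of the updated util (the final-response argument name is assumed from this description):
```python
from pydantic import BaseModel

from langchain_core.utils.function_calling import tool_example_to_messages


class Add(BaseModel):
    """Add two integers."""

    a: int
    b: int


# Produces: HumanMessage, AIMessage with tool calls, ToolMessage, final AIMessage
messages = tool_example_to_messages(
    "What is 2 + 3?",
    [Add(a=2, b=3)],
    tool_outputs=["5"],
    ai_response="2 + 3 = 5",
)
```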
Last week Anthropic released version 0.39.0 of its Python SDK, which
enabled support for Python 3.13. This release deleted a legacy
`client.count_tokens` method, which we currently access during init of
the `Anthropic` LLM. Anthropic has replaced this functionality with the
[client.beta.messages.count_tokens()
API](https://github.com/anthropics/anthropic-sdk-python/pull/726).
To enable support for `anthropic >= 0.39.0` and Python 3.13, here we
drop support for the legacy token counting method, and add support for
the new method via `ChatAnthropic.get_num_tokens_from_messages`.
To fully support the token counting API, we update the signature of `get_num_tokens_from_messages` to accept tools everywhere.
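A minimal sketch of the new method (model name illustrative):
```python
from langchain_anthropic import ChatAnthropic
from langchain_core.messages import HumanMessage
from langchain_core.tools import tool


@tool
def get_weather(location: str) -> str:
    """Get the weather at a location."""
    return "sunny"


llm = ChatAnthropic(model="claude-3-5-sonnet-20240620")
n_tokens = llm.get_num_tokens_from_messages(
    [HumanMessage("What is the weather in LA?")],
    tools=[get_weather],  # tools can now be included in the token count
)
```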
---------
Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>
- **Description:**
Currently `CommaSeparatedListOutputParser` can't handle strings that contain commas within a quoted value; it parses any comma as the delimiter.
Ex.
`"foo, foo2", "bar", "baz"`
is parsed into 4 items: `"foo"`, `"foo2"`, `"bar"`, `"baz"`
when it should be 3 items:
`"foo, foo2"`, `"bar"`, `"baz"`
- **Dependencies:**
Added 2 additional imports (`import csv` and `from io import StringIO`), but they are built-in Python modules.
- **Twitter handle:** @jkyamog
- **Add tests and docs**: added a simple unit test, `test_multiple_items_with_comma`
---------
Co-authored-by: Erick Friis <erick@langchain.dev>
Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>
Co-authored-by: Bagatur <baskaryan@gmail.com>
- [x] **PR title**: "core: use friendlier names for duplicated nodes in
mermaid output"
- **Description:** When generating the Mermaid visualization of a chain,
if the chain had multiple nodes of the same type, the reid function
would replace their names with the UUID node_id. This made the generated
graph difficult to understand. This change deduplicates the nodes in a
chain by appending an index to their names.
- **Issue:** None
- **Discussion:**
https://github.com/langchain-ai/langchain/discussions/27714
- **Dependencies:** None
- **Add tests and docs**: currently this functionality is not covered by unit tests; happy to add tests if you'd like
# Example Code:
```python
from langchain_core.runnables import RunnablePassthrough


def fake_llm(prompt: str) -> str:  # Fake LLM for the example
    return "completion"


runnable = {
    'llm1': fake_llm,
    'llm2': fake_llm,
} | RunnablePassthrough.assign(
    total_chars=lambda inputs: len(inputs['llm1'] + inputs['llm2'])
)

print(runnable.get_graph().draw_mermaid(with_styles=False))
```
# Before
```mermaid
graph TD;
Parallel_llm1_llm2_Input --> 0b01139db5ed4587ad37964e3a40c0ec;
0b01139db5ed4587ad37964e3a40c0ec --> Parallel_llm1_llm2_Output;
Parallel_llm1_llm2_Input --> a98d4b56bd294156a651230b9293347f;
a98d4b56bd294156a651230b9293347f --> Parallel_llm1_llm2_Output;
Parallel_total_chars_Input --> Lambda;
Lambda --> Parallel_total_chars_Output;
Parallel_total_chars_Input --> Passthrough;
Passthrough --> Parallel_total_chars_Output;
Parallel_llm1_llm2_Output --> Parallel_total_chars_Input;
```
# After
```mermaid
graph TD;
Parallel_llm1_llm2_Input --> fake_llm_1;
fake_llm_1 --> Parallel_llm1_llm2_Output;
Parallel_llm1_llm2_Input --> fake_llm_2;
fake_llm_2 --> Parallel_llm1_llm2_Output;
Parallel_total_chars_Input --> Lambda;
Lambda --> Parallel_total_chars_Output;
Parallel_total_chars_Input --> Passthrough;
Passthrough --> Parallel_total_chars_Output;
Parallel_llm1_llm2_Output --> Parallel_total_chars_Input;
```
**Description:**
This PR fixes an issue where non-ASCII characters in Pydantic field
descriptions were being escaped to their Unicode representations when
using `JsonOutputParser`. The change allows non-ASCII characters to be
preserved in the output, which is especially important for multilingual
support and when working with non-English languages.
**Issue:** Fixes #27256
**Example Code:**
```python
from pydantic import BaseModel, Field
from langchain_core.output_parsers import JsonOutputParser


class Article(BaseModel):
    title: str = Field(description="科学文章的标题")


output_data_structure = Article
parser = JsonOutputParser(pydantic_object=output_data_structure)
print(parser.get_format_instructions())
```
**Previous Output**:
```... "title": {"description": "\\u79d1\\u5b66\\u6587\\u7ae0\\u7684\\u6807\\u9898", "title": "Title", "type": "string"}} ...```
**Current Output**:
```... "title": {"description": "科学文章的标题", "title": "Title", "type":
"string"}} ...```
**Changes made**:
- Modified `json.dumps()` call in
`langchain_core/output_parsers/json.py` to use `ensure_ascii=False`
- Added a unit test to verify Unicode handling
Co-authored-by: Harsimran-19 <harsimran1869@gmail.com>
Fixes #27411
**Description:** Adds `template_format` to the `ImagePromptTemplate` class and passes the `template_format` parameter through from `ChatPromptTemplate` instead of hardcoding "f-string".
Also updated docs and typing related to `template_format` to be more up-to-date and specific.
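A hedged sketch of what this enables (mustache-formatted image templates; exact payloads may differ):
```python
from langchain_core.prompts import ChatPromptTemplate

prompt = ChatPromptTemplate.from_messages(
    [
        ("human", [{"type": "image_url", "image_url": {"url": "{{image_url}}"}}]),
    ],
    template_format="mustache",  # previously image templates were always f-string
)
messages = prompt.invoke({"image_url": "https://example.com/cat.png"}).to_messages()
```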
**Dependencies:** None
**Add tests and docs**: Added unit tests to validate fix. Needed to
update `test_chat` snapshot due to adding new attribute
`template_format` in `ImagePromptTemplate`.
---------
Co-authored-by: Vadym Barda <vadym@langchain.dev>
## Description
This PR fixes the context loss issue in `AsyncCallbackManager`,
specifically in `on_llm_start` and `on_chat_model_start` methods. It
properly honors the `run_inline` attribute of callback handlers,
preventing race conditions and ordering issues.
Key changes:
1. Separate handlers into inline and non-inline groups.
2. Execute inline handlers sequentially for each prompt.
3. Execute non-inline handlers concurrently across all prompts.
4. Preserve context for stateful handlers.
5. Maintain performance benefits for non-inline handlers.
**These changes are implemented in `AsyncCallbackManager` rather than
`ahandle_event` because the issue occurs at the prompt and message_list
levels, not within individual events.**
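A minimal sketch of the distinction (illustrative handlers only):
```python
from langchain_core.callbacks import AsyncCallbackHandler


class OrderedTracer(AsyncCallbackHandler):
    """Stateful handler that must see events in order."""

    run_inline = True  # now executed sequentially, preserving context


class FireAndForgetLogger(AsyncCallbackHandler):
    """Stateless handler where ordering doesn't matter."""

    run_inline = False  # still executed concurrently for performance
```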
## Testing
- Test case implemented in #26857 now passes, verifying execution order
for inline handlers.
## Related Issues
- Fixes issue discussed in #23909
## Dependencies
No new dependencies are required.
---
@eyurtsev: This PR implements the discussed changes to respect
`run_inline` in `AsyncCallbackManager`. Please review and advise on any
needed changes.
Twitter handle: @parambharat
---------
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
Added `**kwargs` parameters to the `index` and `aindex` functions in
`libs/core/langchain_core/indexing/api.py`. This allows users to pass
additional arguments to the `add_documents` and `aadd_documents`
methods, enabling the specification of a custom `vector_field`. For
example, users can now use `vector_field="embedding"` when indexing
documents in `OpenSearchVectorStore`.
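A hedged usage sketch (assuming `docs`, `record_manager`, and `vector_store` are already set up):
```python
from langchain_core.indexing import index

index(
    docs,
    record_manager,
    vector_store,
    cleanup="incremental",
    source_id_key="source",
    vector_field="embedding",  # forwarded to add_documents via **kwargs
)
```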
---------
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
Given the current erroring behavior, every time we've moved a kwarg out of `model_kwargs` and made it its own field, that was a breaking change. This updates the behavior to support the old instantiations / serializations.
This assumes `build_extra_kwargs` is not itself used externally and does not need to be kept backwards compatible.
This adds support for injected tool args that are arbitrary types when used with pydantic 2.
We'll need to add similar logic on the v1 path, and potentially mirror
the config from the original model when we're doing the subset.
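A minimal sketch of what now works on the pydantic 2 path (names are illustrative):
```python
from typing import Annotated

from langchain_core.tools import InjectedToolArg, tool


class Session:  # an arbitrary, non-serializable type
    ...


@tool
def search(query: str, session: Annotated[Session, InjectedToolArg]) -> str:
    """Search using an injected session object."""
    return f"results for {query}"


# The injected arg is supplied at call time, not by the model
search.invoke({"query": "cats", "session": Session()})
```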
- **Description:** prevent the index function from re-indexing an entire source document even if nothing has changed.
- **Issue:** #22135
I worked on a solution to this issue that is a compromise between being cheap and being fast.
In the previous code, when `batch_size` is greater than the number of docs from a certain source, almost the entire source is deleted (all documents from that source except for the documents in the first batch).
My solution deletes documents from the vector store and record manager only if at least one document has changed for that source.
Hope this can help!
---------
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
`template_format` is an init argument on `ChatPromptTemplate` but not an attribute on the object, so it was getting shoved into `StructuredPrompt.structured_output_kwargs`.
This prevents `trim_messages` from raising an `IndexError` when invoked
with `include_system=True`, `strategy="last"`, and an empty message
list.
Fixes #26895
Dependencies: none
- **Description:** fix "template" not allowed as prompt param
- **Issue:** #26058
- **Dependencies:** none
---------
Co-authored-by: Erick Friis <erick@langchain.dev>
Ruff doesn't know about the python version in
`[tool.poetry.dependencies]`. It can get it from
`project.requires-python`.
Notes:
* poetry seems to have issues getting the python constraints from `requires-python` and using `python` in per-dependency constraints, so I had to duplicate the info. I will open an issue on poetry.
* `inspect.isclass()` doesn't work correctly with `GenericAlias` (`list[...]`, `dict[..., ...]`) on Python <3.11, so I added some `not isinstance(type, GenericAlias)` checks:
Python 3.11
```pycon
>>> import inspect
>>> inspect.isclass(list)
True
>>> inspect.isclass(list[str])
False
```
Python 3.9
```pycon
>>> import inspect
>>> inspect.isclass(list)
True
>>> inspect.isclass(list[str])
True
```
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
Support using an additional import mapping. This allows users to override old mappings or add new imports to the `loads` function.
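A hedged sketch (the `additional_import_mappings` parameter name is assumed from this change):
```python
from langchain_core.load import dumps, loads
from langchain_core.prompts import ChatPromptTemplate

serialized = dumps(ChatPromptTemplate.from_messages([("human", "hi")]))

# Map a serialized namespace path to the import path used to load it
deserialized = loads(
    serialized,
    additional_import_mappings={
        ("langchain", "prompts", "chat", "ChatPromptTemplate"): (
            "langchain_core", "prompts", "chat", "ChatPromptTemplate",
        ),
    },
)
```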
- [x] **Add tests and docs**
- [x] **Lint and test**: Ran `make format`, `make lint` and `make test` from the root of the modified package(s).
Hello.
First of all, thank you for maintaining such a great project.
## Description
In https://github.com/langchain-ai/langchain/pull/25123, support for structured output was added. However, `"additionalProperties": false` needs to be included at all levels when a nested object is generated.
Error from the current code:
https://gist.github.com/fufufukakaka/e9b475300e6934853d119428e390f204
```
BadRequestError: Error code: 400 - {'error': {'message': "Invalid schema for response_format 'JokeWithEvaluation': In context=('properties', 'self_evaluation'), 'additionalProperties' is required to be supplied and to be false", 'type': 'invalid_request_error', 'param': 'response_format', 'code': None}}
```
Reference: [Introducing Structured Outputs in the
API](https://openai.com/index/introducing-structured-outputs-in-the-api/)
```json
{
"model": "gpt-4o-2024-08-06",
"messages": [
{
"role": "system",
"content": "You are a helpful math tutor."
},
{
"role": "user",
"content": "solve 8x + 31 = 2"
}
],
"response_format": {
"type": "json_schema",
"json_schema": {
"name": "math_response",
"strict": true,
"schema": {
"type": "object",
"properties": {
"steps": {
"type": "array",
"items": {
"type": "object",
"properties": {
"explanation": {
"type": "string"
},
"output": {
"type": "string"
}
},
"required": ["explanation", "output"],
"additionalProperties": false
}
},
"final_answer": {
"type": "string"
}
},
"required": ["steps", "final_answer"],
"additionalProperties": false
}
}
}
}
```
In the current code, `"additionalProperties": false` is only added at
the last level.
This PR introduces the `_add_additional_properties_key` function, which
recursively adds `"additionalProperties": false` to the entire JSON
schema for the request.
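A minimal sketch of the recursive idea (not the exact implementation):
```python
def add_additional_properties_key(schema: object) -> object:
    """Recursively set "additionalProperties": false on every object schema."""
    if isinstance(schema, dict):
        if "properties" in schema:
            schema["additionalProperties"] = False
        for value in schema.values():
            add_additional_properties_key(value)
    elif isinstance(schema, list):
        for item in schema:
            add_additional_properties_key(item)
    return schema
```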
Twitter handle: `@fukkaa1225`
Thank you!
---------
Co-authored-by: Bagatur <baskaryan@gmail.com>
Previously the code could only handle a single level of nesting for subgraphs in mermaid. This change adds support for arbitrary nesting of subgraphs.
**Description:**
An LLM will stop generating text, even in the middle of a sentence, if `finish_reason` is `length` (for OpenAI) or `stop_reason` is `max_tokens` (for Anthropic).
To obtain longer outputs from the LLM, we should call the message generation API multiple times and merge the results into the text to circumvent the API's output token limit.
The extra line breaks forced by the `merge_message_runs` function when
seamlessly merging messages can be annoying, so I added the option to
specify the chunk separator.
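A minimal sketch of the new option (the `chunk_separator` name comes from this change):
```python
from langchain_core.messages import AIMessage, merge_message_runs

chunks = [AIMessage("The story begins mid-sen"), AIMessage("tence and continues.")]
# Pass an empty separator to join runs seamlessly instead of with a newline
merged = merge_message_runs(chunks, chunk_separator="")
```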
**Issue:**
No corresponding issues.
**Dependencies:**
No dependencies required.
**Twitter handle:**
@hanama_chem
https://x.com/hanama_chem
---------
Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>
Co-authored-by: Bagatur <baskaryan@gmail.com>
[langchain_core] Fix `UnionType` type var replacement
- Added a `types.UnionType` to `typing.Union` mapping
Type replacement caused `TypeError: 'type' object is not subscriptable` if any of the union types arrived as `types.UnionType`, because `_py_38_safe_origin` returned `types.UnionType` instead of `typing.Union`:
```python
>>> from types import UnionType
>>> from typing import Union, get_origin
>>> type_ = get_origin(str | None)
>>> type_
<class 'types.UnionType'>
>>> UnionType[(str, None)]
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
TypeError: 'type' object is not subscriptable
>>> Union[(str, None)]
typing.Optional[str]
```
---------
Co-authored-by: Chester Curme <chester.curme@gmail.com>
- **Description:**
This PR fixes the `ValueError` raised when using a model with history.
Details in #24660.
#22933 caused `langchain_core.runnables.history.RunnableWithMessageHistory._get_output_messages` to miss the type check of `output_val` when `output_val` is `False`. After running `RunnableWithMessageHistory._is_not_async`, `output` is `False`.
249945a572/libs/core/langchain_core/runnables/history.py (L323-L334)
15a36dd0a2/libs/core/langchain_core/runnables/history.py (L461-L471)
~~I suggest that `_get_output_messages` return an empty list when `output_val == False`.~~
- **Issue**:
- #24660
- **Dependencies:** No change.
---------
Co-authored-by: Bagatur <baskaryan@gmail.com>
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
This PR deprecates the beta upsert APIs in vectorstore.
We'll introduce them in a V2 abstraction instead to keep the existing vectorstore implementations lighter weight.
The main problem with the existing APIs is that it's a bit more challenging to implement the correct behavior with respect to IDs, since the ID can be present both in the function signature and as an optional attribute on the document object.
But VectorStores that pass the standard tests should have implemented the semantics properly!
---------
Co-authored-by: Erick Friis <erick@langchain.dev>
This PR gets rid of the `root_validators(allow_reuse=True)` logic used in the EdenAI Tool in preparation for the pydantic 2 upgrade.
- add another test to secret_from_env_factory
Add a utility that can be used as a default factory.
The goal is to start migrating some of the pydantic models to use `from_env` as a default factory where possible.
```python
from pydantic import Field, BaseModel
from langchain_core.utils import from_env
class Foo(BaseModel):
    name: str = Field(default_factory=from_env('HELLO'))
```
- **Description:** This includes Pydantic field metadata in `_create_subset_model_v2` so that it gets included in the final serialized form that gets sent out.
- **Issue:** #25031
- **Dependencies:** n/a
- **Twitter handle:** @gramliu
---------
Co-authored-by: Bagatur <baskaryan@gmail.com>
Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>
This PR adds a minimal document indexer abstraction.
The goal of this abstraction is to allow developers to create custom
retrievers that also have a standard indexing API and allow updating the
document content in them.
The abstraction comes with a test suite that can verify that the indexer
implements the correct semantics.
This is an iteration over a previous PR (https://github.com/langchain-ai/langchain/pull/24364). The main difference is that we're sub-classing from BaseRetriever in this iteration and so have consolidated the sync and async interfaces.
The main problem with the current design is that run-time search configuration has to be specified at init rather than provided at run time.
We will likely resolve this issue in one of two ways:
(1) Define a method (`get_retriever`) that will allow creating a retriever at run time with a specific configuration. If we do this, we will likely break the subclassing of BaseRetriever.
(2) Generalize the base retriever so it can support structured queries.
---------
Co-authored-by: Erick Friis <erick@langchain.dev>
This PR introduces a module with some helper utilities for the pydantic
1 -> 2 migration.
They're meant to be used in the following way:
1) Use the utility code to get unit tests pass without requiring
modification to the unit tests
2) (If desired) upgrade the unit tests to match pydantic 2 output
3) (If desired) stop using the utility code
Currently, this module contains a way to map `schema()` generated by
pydantic 2 to (mostly) match the output from pydantic v1.
Add compatibility for pydantic 2 for a utility function.
This will help push some small changes to master, so they don't have to
be kept track of on a separate branch.