langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-09-22 02:50:31 +00:00

Author	SHA1	Message	Date
Jun Yamog	830cad7bc0	core: fix CommaSeparatedListOutputParser to handle columns that may contain commas in it (#26365) - Description: Currently CommaSeparatedListOutputParser can't handle strings that contain commas within a column; it parses every comma as a delimiter. For example, `"foo, foo2", "bar", "baz"` is parsed into 4 columns ("foo", "foo2", "bar", "baz") when it should be 3 ("foo, foo2", "bar", "baz"). - Dependencies: Added 2 imports, both from the Python standard library: `import csv` and `from io import StringIO`. - Twitter handle: @jkyamog - Add tests and docs: added a simple unit test, `test_multiple_items_with_comma`. --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-11-01 22:42:24 +00:00
Ant White	e3ea365725	core: use friendlier names for duplicated nodes in mermaid output (#27747) - Description: When generating the Mermaid visualization of a chain, if the chain had multiple nodes of the same type, the reid function would replace their names with the UUID node_id. This made the generated graph difficult to understand. This change deduplicates the nodes in a chain by appending an index to their names. - Issue: None - Discussion: https://github.com/langchain-ai/langchain/discussions/27714 - Dependencies: None - Add tests and docs: Currently this functionality is not covered by unit tests, happy to add tests if you'd like
# Example Code:
```python
from langchain_core.runnables import RunnablePassthrough

def fake_llm(prompt: str) -> str:
    # Fake LLM for the example
    return "completion"

runnable = {
    'llm1': fake_llm,
    'llm2': fake_llm,
} | RunnablePassthrough.assign(
    total_chars=lambda inputs: len(inputs['llm1'] + inputs['llm2'])
)

print(runnable.get_graph().draw_mermaid(with_styles=False))
```
# Before
```mermaid
graph TD;
    Parallel_llm1_llm2_Input --> 0b01139db5ed4587ad37964e3a40c0ec;
    0b01139db5ed4587ad37964e3a40c0ec --> Parallel_llm1_llm2_Output;
    Parallel_llm1_llm2_Input --> a98d4b56bd294156a651230b9293347f;
    a98d4b56bd294156a651230b9293347f --> Parallel_llm1_llm2_Output;
    Parallel_total_chars_Input --> Lambda;
    Lambda --> Parallel_total_chars_Output;
    Parallel_total_chars_Input --> Passthrough;
    Passthrough --> Parallel_total_chars_Output;
    Parallel_llm1_llm2_Output --> Parallel_total_chars_Input;
```
# After
```mermaid
graph TD;
    Parallel_llm1_llm2_Input --> fake_llm_1;
    fake_llm_1 --> Parallel_llm1_llm2_Output;
    Parallel_llm1_llm2_Input --> fake_llm_2;
    fake_llm_2 --> Parallel_llm1_llm2_Output;
    Parallel_total_chars_Input --> Lambda;
    Lambda --> Parallel_total_chars_Output;
    Parallel_total_chars_Input --> Passthrough;
    Passthrough --> Parallel_total_chars_Output;
    Parallel_llm1_llm2_Output --> Parallel_total_chars_Input;
```
	2024-10-31 16:52:00 -04:00
Bagatur	c1e742347f	core[patch]: rm image loading (#27797 )	2024-10-31 10:34:51 -07:00
Bagatur	5d337326b0	core[patch]: make get_all_basemodel_annotations public (#27761 )	2024-10-30 14:43:29 -07:00
Bagatur	94ea950c6c	core[patch]: support bedrock converse -> openai tool (#27754 )	2024-10-30 12:20:39 -07:00
William FH	5a2cfb49e0	Support message trimming on single messages (#27729 ) Permit trimming message lists of length 1	2024-10-30 04:27:52 +00:00
Harsimran-19	c1d8c33df6	core: JsonOutputParser UTF characters bug (#27306)
Description: This PR fixes an issue where non-ASCII characters in Pydantic field descriptions were being escaped to their Unicode representations when using `JsonOutputParser`. The change allows non-ASCII characters to be preserved in the output, which is especially important for multilingual support and when working with non-English languages.
Issue: Fixes #27256
Example Code:
```python
from pydantic import BaseModel, Field
from langchain_core.output_parsers import JsonOutputParser

class Article(BaseModel):
    title: str = Field(description="科学文章的标题")

output_data_structure = Article
parser = JsonOutputParser(pydantic_object=output_data_structure)
print(parser.get_format_instructions())
```
Previous Output:
```
... "title": {"description": "\\u79d1\\u5b66\\u6587\\u7ae0\\u7684\\u6807\\u9898", "title": "Title", "type": "string"}} ...
```
Current Output:
```
... "title": {"description": "科学文章的标题", "title": "Title", "type": "string"}} ...
```
Changes made:
- Modified `json.dumps()` call in `langchain_core/output_parsers/json.py` to use `ensure_ascii=False`
- Added a unit test to verify Unicode handling
Co-authored-by: Harsimran-19 <harsimran1869@gmail.com>	2024-10-29 14:48:53 +00:00
Erick Friis	600b7bdd61	all: test 3.13 ci (#27197 ) Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-10-25 12:56:58 -07:00
Eugene Yurtsev	7667ee126f	core: remove mustache in extended deps (#27629 ) Remove mustache from extended deps -- we vendor the mustache implementation	2024-10-24 22:12:49 -04:00
Tibor Reiss	20b56a0233	core[patch]: fix repr and str for Serializable (#26786 ) Fixes #26499 --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-10-24 08:36:35 -07:00
Bagatur	968dccee04	core[patch]: convert_to_openai_tool Anthropic support (#27591 )	2024-10-23 12:27:06 -07:00
Chun Kang Lu	380449a7a9	core: fix Image prompt template hardcoded template format (#27495 ) Fixes #27411 Description: Adds `template_format` to the `ImagePromptTemplate` class and updates passing in the `template_format` parameter from ChatPromptTemplate instead of the hardcoded "f-string". Also updated docs and typing related to `template_format` to be more up-to-date and specific. Dependencies: None Add tests and docs: Added unit tests to validate fix. Needed to update `test_chat` snapshot due to adding new attribute `template_format` in `ImagePromptTemplate`. --------- Co-authored-by: Vadym Barda <vadym@langchain.dev>	2024-10-21 17:31:40 -04:00
Bagatur	a4392b070d	core[patch]: add convert_to_openai_messages util (#27263 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-10-16 17:10:10 +00:00
Erick Friis	92ae61bcc8	multiple: rely on asyncio_mode auto in tests (#27200 )	2024-10-15 16:26:38 +00:00
Erick Friis	7264fb254c	core: release 0.3.10 (#27209 )	2024-10-08 16:21:42 -07:00
Bagatur	e3e9ee8398	core[patch]: utils for adding/subtracting usage metadata (#27203 )	2024-10-08 13:15:33 -07:00
Vadym Barda	8d27325dbc	core[patch]: support ValidationError from pydantic v1 in tools (#27194 )	2024-10-08 10:19:04 -04:00
Christophe Bornet	16f5fdb38b	core: Add various ruff rules (#26836 ) Adds - ASYNC - COM - DJ - EXE - FLY - FURB - ICN - INT - LOG - NPY - PD - Q - RSE - SLOT - T10 - TID - YTT Co-authored-by: Erick Friis <erick@langchain.dev>	2024-10-07 22:30:27 +00:00
Christophe Bornet	d31ec8810a	core: Add ruff rules for error messages (EM) (#26965 ) All auto-fixes Co-authored-by: Erick Friis <erick@langchain.dev>	2024-10-07 22:12:28 +00:00
Christophe Bornet	c4ebccfec2	core[minor]: Improve support for id in VectorStore (#26660 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-10-07 15:01:08 -04:00
Bharat Ramanathan	931ce8d026	core[patch]: Update `AsyncCallbackManager` to honor `run_inline` attribute and prevent context loss (#26885 ) ## Description This PR fixes the context loss issue in `AsyncCallbackManager`, specifically in `on_llm_start` and `on_chat_model_start` methods. It properly honors the `run_inline` attribute of callback handlers, preventing race conditions and ordering issues. Key changes: 1. Separate handlers into inline and non-inline groups. 2. Execute inline handlers sequentially for each prompt. 3. Execute non-inline handlers concurrently across all prompts. 4. Preserve context for stateful handlers. 5. Maintain performance benefits for non-inline handlers. These changes are implemented in `AsyncCallbackManager` rather than `ahandle_event` because the issue occurs at the prompt and message_list levels, not within individual events. ## Testing - Test case implemented in #26857 now passes, verifying execution order for inline handlers. ## Related Issues - Fixes issue discussed in #23909 ## Dependencies No new dependencies are required. --- @eyurtsev: This PR implements the discussed changes to respect `run_inline` in `AsyncCallbackManager`. Please review and advise on any needed changes. Twitter handle: @parambharat --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-10-07 14:59:29 -04:00
João Carlos Ferra de Almeida	780ce00dea	core[minor]: add kwargs to index and aindex functions for custom vector_field support (#26998 ) Added `kwargs` parameters to the `index` and `aindex` functions in `libs/core/langchain_core/indexing/api.py`. This allows users to pass additional arguments to the `add_documents` and `aadd_documents` methods, enabling the specification of a custom `vector_field`. For example, users can now use `vector_field="embedding"` when indexing documents in `OpenSearchVectorStore` --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-10-07 14:52:50 -04:00
Bagatur	4935a14314	core,integrations[minor]: Dont error on fields in model_kwargs (#27110) Given the current erroring behavior, every time we've moved a kwarg out of model_kwargs and made it its own field, that was a breaking change. This updates the behavior to support the old instantiations / serializations. It assumes build_extra_kwargs is not itself used externally and so does not need to be kept backwards compatible.	2024-10-04 11:30:27 -07:00
Erick Friis	ab4dab9a0c	core: fix batch race condition in FakeListChatModel (#26924 ) fixed #26273	2024-10-03 23:14:31 +00:00
Bagatur	87fc5ce688	core[patch]: exclude model cache from ser (#27086 )	2024-10-03 22:00:31 +00:00
Bagatur	546dc44da5	core[patch]: add UsageMetadata details (#27072 )	2024-10-03 20:36:17 +00:00
Eugene Yurtsev	74bf620e97	core[patch]: Support injected tool args that are arbitrary types (#27045) This adds support for injected tool args that are arbitrary types when used with pydantic 2. We'll need to add similar logic on the v1 path, and potentially mirror the config from the original model when we're doing the subset.	2024-10-02 12:50:58 -04:00
ccurme	9d10151123	core[patch]: fix init of RunnableAssign (#26903 ) Example in API ref currently raises ValidationError. Resolves https://github.com/langchain-ai/langchain/issues/26862	2024-10-01 14:21:54 -04:00
federico-pisanu	2538963945	core[patch]: improve index/aindex api when batch_size<n_docs (#25754) - Description: prevent the index function from re-indexing an entire source document when nothing has changed. - Issue: #22135 I worked on a solution to this issue that is a compromise between being cheap and being fast. In the previous code, when batch_size is smaller than the number of docs from a given source, almost the entire source is deleted (all documents from that source except those in the first batch). My solution deletes documents from the vector store and record manager only if at least one document has changed for that source. Hope this can help! --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-09-30 20:57:41 +00:00
Eugene Yurtsev	7fde2791dc	core[patch]: Add kwargs to Runnable (#27008 ) Fixes #26685 --------- Co-authored-by: Tibor Reiss <tibor.reiss@gmail.com>	2024-09-30 16:45:29 -04:00
Bagatur	248be02259	core[patch]: fix structured prompt template format (#27003) template_format is an init argument on ChatPromptTemplate but not an attribute on the object, so it was getting shoved into StructuredPrompt.structured_output_kwargs	2024-09-30 11:47:46 -07:00
Christophe Bornet	db8845a62a	core: Add ruff rules for pycodestyle Warning (W) (#26964 ) All auto-fixes.	2024-09-30 09:31:43 -04:00
Christophe Bornet	7809b31b95	core[patch]: Add ruff rules for flake8-simplify (SIM) (#26848 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-09-27 20:13:23 +00:00
Christophe Bornet	f4e738bb40	core: Add ruff rules for PIE (#26939 ) All auto-fixes.	2024-09-27 12:08:35 -04:00
Julius Stopforth	121e79b1f0	core: Fix `IndexError` when `trim_messages` invoked with empty list (#26896 ) This prevents `trim_messages` from raising an `IndexError` when invoked with `include_system=True`, `strategy="last"`, and an empty message list. Fixes #26895 Dependencies: none	2024-09-26 11:29:58 -04:00
Christophe Bornet	3a1b9259a7	core: Add ruff rules for comprehensions (C4) (#26829 )	2024-09-25 09:34:17 -04:00
William FH	82b5b77940	[Core] Add more interops tests (#26841 ) To test that the client propagates both ways	2024-09-24 20:18:20 -07:00
William FH	9b6ac41442	[Core] Inherit tracing metadata & tags (#26838 )	2024-09-24 19:33:12 -07:00
William FH	864020e592	[Tracer] add project name to run from tracer (#26736 )	2024-09-20 16:48:37 -07:00
Alejandro Rodríguez	4ac9a6f52c	core: fix "template" not allowed as prompt param (#26060) - Description: fix "template" not allowed as prompt param - Issue: #26058 - Dependencies: none --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-20 23:33:06 +00:00
William FH	19ce95d3c9	Avoid copying runs (#26689 ) Also, re-unify run trees. Use a single shared client.	2024-09-20 10:57:41 -07:00
Erick Friis	311f861547	core, community: move graph vectorstores to community (#26678 ) remove beta namespace from core, add to community	2024-09-19 11:38:14 -07:00
Christophe Bornet	fd21ffe293	core: Add N(naming) ruff rules (#25362 ) Public classes/functions are not renamed and rule is ignored for them. Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-19 05:09:39 +00:00
Christophe Bornet	a47b332841	core: Put Python version as a project requirement so it is considered by ruff (#26608)
Ruff doesn't know about the python version in `[tool.poetry.dependencies]`. It can get it from `project.requires-python`.
Notes:
* poetry seems to have issues getting the python constraints from `requires-python` and using `python` in per dependency constraints. So I had to duplicate the info. I will open an issue on poetry.
* `inspect.isclass()` doesn't work correctly with `GenericAlias` (`list[...]`, `dict[..., ...]`) on Python <3.11 so I added some `not isinstance(type, GenericAlias)` checks:
Python 3.11
```pycon
>>> import inspect
>>> inspect.isclass(list)
True
>>> inspect.isclass(list[str])
False
```
Python 3.9
```pycon
>>> import inspect
>>> inspect.isclass(list)
True
>>> inspect.isclass(list[str])
True
```
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-09-18 14:37:57 +00:00
Christophe Bornet	3a99467ccb	core[patch]: Add ruff rule UP006(use PEP585 annotations) (#26574) * Added rule `UP006` now that Pydantic is v2+ --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-09-17 21:22:50 +00:00
Erick Friis	c2a3021bb0	multiple: pydantic 2 compatibility, v0.3 (#26443 ) Signed-off-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Dan O'Donovan <dan.odonovan@gmail.com> Co-authored-by: Tom Daniel Grande <tomdgrande@gmail.com> Co-authored-by: Grande <Tom.Daniel.Grande@statsbygg.no> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Tomaz Bratanic <bratanic.tomaz@gmail.com> Co-authored-by: ZhangShenao <15201440436@163.com> Co-authored-by: Friso H. Kingma <fhkingma@gmail.com> Co-authored-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: Morgante Pell <morgantep@google.com>	2024-09-13 14:38:45 -07:00
langchain-infra	8a02fd9c01	core: add additional import mappings to loads (#26406) Support using additional import mapping. This allows users to override old mappings/add new imports to the loads function.	2024-09-13 09:39:58 -07:00
Bagatur	feb351737c	core[patch]: fix empty OpenAI tools when strict=True (#26287 ) Fix #26232 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-09-11 16:06:03 -07:00
ccurme	398718e1cb	core[patch]: fix regression in convert_to_openai_tool with instances of Tool (#26327)
```python
from langchain_core.tools import Tool
from langchain_core.utils.function_calling import convert_to_openai_tool

def my_function(x: int) -> int:
    return x + 2

tool = Tool(
    name="tool_name",
    func=my_function,
    description="test description",
)
convert_to_openai_tool(tool)
```
Current:
```
{'type': 'function', 'function': {'name': 'tool_name', 'description': 'test description', 'parameters': {'type': 'object', 'properties': {'args': {'type': 'array', 'items': {}}, 'config': {'type': 'object', 'properties': {'tags': {'type': 'array', 'items': {'type': 'string'}}, 'metadata': {'type': 'object'}, 'callbacks': {'anyOf': [{'type': 'array', 'items': {}}, {}]}, 'run_name': {'type': 'string'}, 'max_concurrency': {'type': 'integer'}, 'recursion_limit': {'type': 'integer'}, 'configurable': {'type': 'object'}, 'run_id': {'type': 'string', 'format': 'uuid'}}}, 'kwargs': {'type': 'object'}}, 'required': ['config']}}}
```
Here:
```
{'type': 'function', 'function': {'name': 'tool_name', 'description': 'test description', 'parameters': {'properties': {'__arg1': {'title': '__arg1', 'type': 'string'}}, 'required': ['__arg1'], 'type': 'object'}}}
```
	2024-09-11 15:51:10 -04:00
Nuno Campos	212c688ee0	core[minor]: Remove serialized manifest from tracing requests for non-llm runs (#26270) - This takes a long time to compute, isn't used, and is currently called on every invocation of every chain/retriever/etc	2024-09-10 12:58:24 -07:00
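
The quoted-comma fix described in commit 830cad7bc0 (#26365) can be illustrated with the two standard-library imports that the commit message names, `csv` and `StringIO`. The sketch below is not the code from that commit: `split_quoted_list` is a hypothetical helper name, and `skipinitialspace=True` is an assumption made here so that a quote following `, ` is still recognized as opening a quoted field.

```python
import csv
from io import StringIO


def split_quoted_list(text: str) -> list[str]:
    """Split a comma-separated string, keeping commas inside double quotes."""
    # Hypothetical helper for illustration; not the langchain implementation.
    reader = csv.reader(StringIO(text), skipinitialspace=True)
    # A single-line input yields one row; flatten to handle any row count.
    return [item for row in reader for item in row]


items = split_quoted_list('"foo, foo2", "bar", "baz"')
print(items)  # ['foo, foo2', 'bar', 'baz']
```

Splitting on every comma would give four items for this input; delegating to `csv.reader` keeps the quoted column intact.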


411 Commits
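
The `GenericAlias` note in commit a47b332841 (#26608) can be sketched as a small guard. `is_plain_class` is a hypothetical name chosen here, not a function from the commit; the check follows the `not isinstance(type, GenericAlias)` pattern the message describes, so `list[str]` is rejected regardless of what `inspect.isclass` returns on the running interpreter.

```python
import inspect
from types import GenericAlias


def is_plain_class(tp: object) -> bool:
    """Return True for real classes, False for parameterized generics."""
    # Hypothetical helper; mirrors the guard described in the commit message.
    return inspect.isclass(tp) and not isinstance(tp, GenericAlias)


print(is_plain_class(list))       # a real class
print(is_plain_class(list[str]))  # a GenericAlias, rejected by the guard
```

The explicit `isinstance` check makes the result the same on Python 3.9 and 3.11, which is the version difference the commit message demonstrates.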