langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-03-18 11:07:36 +00:00

Author	SHA1	Message	Date
Mason Daugherty	557eddfd51	refactor(core): add warning for fallback GPT-2 tokenizer usage (#34621 )	2026-01-06 19:11:10 -05:00
Mason Daugherty	aa9c63b96a	release(langchain): 1.2.1 (#34622 ) langchain==1.2.1	2026-01-06 19:10:49 -05:00
Mason Daugherty	8aeff95341	fix(core,langchain): use `get_buffer_string` for message summarization (#34607 ) Fixes #34517 Supersedes #34557, #34570 Fixes token inflation in `SummarizationMiddleware` that caused context window overflow during summarization. Root cause: When formatting messages for the summary prompt, `str(messages)` was implicitly called, which includes all Pydantic metadata fields (`usage_metadata`, `response_metadata`, `additional_kwargs`, etc.). This caused the stringified representation to use ~2.5x more tokens than `count_tokens_approximately` estimates. Problem: - Summarization triggers at 85% of context window based on `count_tokens_approximately` - But `str(messages)` in the prompt uses 2.5x more tokens - Results in `ContextLengthExceeded` Fix: Use `get_buffer_string()` to format messages, which produces compact output: ``` Human: What's the weather? AI: Let me check...[tool_calls] Tool: 72°F and sunny ``` Instead of verbose Pydantic repr: ```python [HumanMessage(content='What's the weather?', additional_kwargs={}, response_metadata={}), ...] ```	2026-01-06 19:05:03 -05:00
Christophe Bornet	0438f8c277	chore(langchain): fix types in test_model_fallback (#34615 )	2026-01-06 13:07:18 -05:00
Christophe Bornet	7f4f130479	chore(langchain): fix types in test_pii (#34617 )	2026-01-06 13:06:25 -05:00
ccurme	6537939f53	chore(langchain): add admonition around redaction_rules (#34618 )	2026-01-06 13:01:09 -05:00
Ademola Balogun	a2529cd805	fix(langchain): correct typo 'langchain experiment' to 'langchain_experimental' in error messages (#34608 ) Fixed typo in ImportError messages where "langchain experiment" should be "langchain_experimental" for consistency with the actual package name. This helps improve clarity for users who encounter these error messages when trying to use deprecated tools that have moved to the langchain_experimental package. Related issues: #13858, #13859 Co-authored-by: Ademola <ademicho@gmail>	2026-01-05 18:10:06 -05:00
ccurme	c1f1641018	fix(anthropic): fix version (#34606 ) langchain-anthropic==1.3.1	2026-01-05 16:03:20 -05:00
ccurme	225e0fa8c9	release(anthropic): 1.3.1 (#34605 )	2026-01-05 15:55:15 -05:00
Loganaden Velvindron	f021e899dc	fix(anthropic): CVE-2025-68664 (#34563 )	2026-01-05 15:51:25 -05:00
lwtaiyty	578cef9622	fix(anthropic): skip cache_control for code_execution blocks (#34579 )	2026-01-05 15:40:59 -05:00
Christophe Bornet	7979fd3d9f	chore(langchain): fix types in test_composition (#34580 )	2026-01-05 14:49:34 -05:00
Christophe Bornet	3b65985551	chore(langchain): fix types in test_decorators (#34583 )	2026-01-05 14:47:10 -05:00
Christophe Bornet	c4babed5c6	chore(langchain): fix types in test_wrap_tool_call (#34600 )	2026-01-05 14:38:31 -05:00
Christophe Bornet	5ae53fdfb3	chore(langchain): fix types in test_model_call_limit_types (#34601 )	2026-01-05 14:37:03 -05:00
Christophe Bornet	901690ceec	chore(langchain): fix types in test_file_search and test_human_in_the_loop (#34602 )	2026-01-05 14:34:35 -05:00
ゆり	be2c7f1aa8	test(core): add tests for formatting utils and merge functions (#34511 ) ## Summary Add comprehensive test coverage for previously untested utilities in `langchain-core`. ## Changes ### New file: `test_formatting.py` (18 tests) Tests for `StrictFormatter` class: - `test_vformat_with_keyword_args` - basic functionality - `test_vformat_with_multiple_keyword_args` - multiple placeholders - `test_vformat_with_empty_string` - edge case - `test_vformat_with_no_placeholders` - literal strings - `test_vformat_raises_on_positional_args` - error handling - `test_vformat_raises_on_multiple_positional_args` - error handling - `test_vformat_with_special_characters` - newlines, tabs - `test_vformat_with_unicode` - emoji, CJK characters - `test_vformat_with_format_spec` - format specifications - `test_vformat_with_nested_braces` - escaped braces Tests for `validate_input_variables`: - `test_validate_input_variables_success` - valid input - `test_validate_input_variables_with_extra_variables` - extra vars allowed - `test_validate_input_variables_with_missing_variable` - KeyError - `test_validate_input_variables_empty_format` - edge case - `test_validate_input_variables_no_placeholders` - edge case Tests for `formatter` singleton: - `test_formatter_is_strict_formatter` - type check - `test_formatter_format_works` - functionality - `test_formatter_rejects_positional_args` - error handling ### Extended `test_utils.py` (14 new tests) Tests for `merge_lists`: - Parametrized tests covering None handling, simple merge, empty lists, index-based merging - `test_merge_lists_multiple_others` - merging 3+ lists - `test_merge_lists_all_none` - all None inputs Tests for `merge_obj`: - Parametrized tests for None, strings, dicts, lists, equal values - `test_merge_obj_type_mismatch` - TypeError on type mismatch - `test_merge_obj_unmergeable_values` - ValueError on different values - `test_merge_obj_tuple_raises` - ValueError for tuples ## Test plan - [x] Tests follow existing patterns in the codebase - [x] All tests are unit tests (no network calls) - [x] Tests cover happy paths and error conditions - [x] Tests verify no mutation of input data ## AI Disclosure This contribution was developed with AI assistance (Claude Code). 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: yurekami <yurekami@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2026-01-05 14:20:11 -05:00
ccurme	b5c5ba0a5f	release(xai): 1.2.1 (#34604 ) langchain-xai==1.2.1	2026-01-05 13:55:38 -05:00
ccurme	944b43dd25	fix(xai): count reasoning tokens in output total (#34603 )	2026-01-05 13:25:30 -05:00
aroun-coumar	730a3676f8	fix(core): strip message IDs from cache keys using `model_copy` (#33915 ) Description: Closes #[33883](https://github.com/langchain-ai/langchain/issues/33883) Chat model cache keys are generated by serializing messages via `dumps(messages)`. The optional `BaseMessage.id` field (a UUID used solely for tracing/threading) is included in this serialization, causing functionally identical messages to produce different cache keys. This results in repeated API calls, cache bloat, and degraded performance in production workloads (e.g., agents, RAG chains, long conversations). This change normalizes messages only for cache key generation by stripping the nonsemantic `id` field using Pydantic V2’s `model_copy(update={"id": None})`. The normalization is applied in both synchronous and asynchronous cache paths (`_generate_with_cache` / `_agenerate_with_cache`) immediately before `dumps()`. ```python normalized_messages = [ msg.model_copy(update={"id": None}) if getattr(msg, "id", None) is not None else msg for msg in messages ] prompt = dumps(normalized_messages) --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2026-01-05 10:37:10 -05:00
Julia (Juli) Huang	cd5b36456a	fix(text-splitters): HTMLSemanticPreservingSplitter nested preserved … (#34587 ) Summary Fixes an issue where HTMLSemanticPreservingSplitter failed to preserve elements nested inside non-container tags. With these changes, preserved elements are now correctly detected and handled at any nesting depth. Root Cause `_process_element()` only recursed into a small set of hard-coded container tags (`html`, `body`, `div`, `main`). For other tags, the subtree was flattened into text, preventing nested preserved elements (inside `<p>`, `<section>`, `<article>`, etc.) from being detected. Fix - Updated traversal logic in _process_element (html.py) to recursively process child elements for any tag that contains nested elements - Avoided duplicate text extraction - Preserved correct placeholder ordering - Treated leaf nodes as text only Tests Adds regression tests covering preserved elements nested inside non-container tags, including: - table inside section - nested divs - code inside paragraph All existing tests pass (make lint, format, test, etc). Breaking changes None. Fixes Fixes #31569 Disclaimer GitHub Copilot was used to assist with test case design in test_text_splitters.py and documentation comments; all code logic was manually implemented and reviewed. --------- Co-authored-by: julih <julih@julihs-MacBook-Pro.local> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2026-01-05 10:28:27 -05:00
Mohan Kumar S	13cfdf1676	fix(core): exclude injected args from tool schema (#34582 )	2026-01-05 09:59:59 -05:00
Andre Roelofs	c25f3847d0	refactor(core): select chunk_id via ranking and remove extra allocation (#34588 )	2026-01-05 09:13:05 -05:00
Christophe Bornet	7ca0efde04	chore(langchain): fix types in test_diagram and test_sync_async_wrappers (#34591 )	2026-01-05 09:05:24 -05:00
repeat-Q	9495eb348d	docs: add LangChain Academy link to Additional resources (#34597 )	2026-01-05 08:55:46 -05:00
Christophe Bornet	e5d4acf681	style(langchain): add ruff rule PLC0415 (#34559 )	2026-01-04 01:26:04 -05:00
ccurme	659eab2607	release(core): 1.2.6 (#34586 ) langchain-core==1.2.6	2026-01-02 16:20:20 -05:00
Angus Jelinek	458a186540	chore(core): Update LangChainTracer to use Pydantic v2 methods (#34541 )	2026-01-02 16:02:13 -05:00
ccurme	a7aad60989	fix(xai): ensure citations are streamed just once (#34556 ) langchain-xai==1.2.0	2025-12-31 18:01:41 -05:00
ccurme	9da28bac86	release(xai): 1.2.0 (#34555 )	2025-12-31 16:37:21 -05:00
ccurme	0b91774263	fix(xai): stream usage metadata by default (#34531 )	2025-12-31 16:30:52 -05:00
weiii668	5517ef37fb	docs(core): add docstrings to internal helper functions (#34525 ) Co-authored-by: weiii668 <your-email@example.com> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-30 21:58:00 -06:00
Mason Daugherty	2bbe4216e0	docs(core): refresh `content.py` docstrings (#34546 ) minor formatting improvements and increased disambiguation between `id` and `file_id` for `FileContentBlock` in response to https://github.com/langchain-ai/langchain-google/pull/1477	2025-12-30 20:44:47 -06:00
Pádraic Slattery	fcc02f78e4	chore(deps): Update outdated GitHub Actions versions (#34544 ) This PR updates an outdated GitHub Action version. - Updated `astral-sh/setup-uv` from `v6` to `v7` in `.github/actions/uv_setup/action.yml` Looks like this was missed as part of https://github.com/langchain-ai/langchain/pull/33457 so hopefully safe to bring it into alignment.	2025-12-30 20:42:22 -06:00
Mason Daugherty	721bf15430	fix(langchain): resolve race condition in `ShellSession.execute()` (#34535 ) Addresses a flaky test When executing `exit 1` as a startup command, the shell process terminates immediately. The code then tries to write a marker command (`printf '...'`) to stdin, but the pipe is already broken because the shell has exited, causing `BrokenPipeError`.	2025-12-29 18:16:08 -06:00
Mason Daugherty	dcfd9c0e04	fix(infra): use `langchain_v1` for dev container deps (#34534 )	2025-12-29 18:10:40 -06:00
Christophe Bornet	e03d6b80d5	chore(deps): bump mypy to v1.19 and ruff to v1.14 (#34521 ) * Set mypy to >=1.19.1,<1.20 * Set ruff to >=0.14.10,<0.15	2025-12-29 18:07:55 -06:00
Mason Daugherty	33378f16fb	feat(infra): add `.dockerignore` for codespaces (#34533 )	2025-12-29 17:58:28 -06:00
Christophe Bornet	ea25f5ebdd	chore(text-splitters): bump dependency locks for python 3.14 (#34522 ) * Support sentence-transformers optional dep on python 3.14 * Bump some dep locks to use pre-built wheels instead of building them (murmurhash, cymem, preshed, thinc, srsly, blis) * Still not possible to use spacy: even though there are wheels available, spacy depends on Pydantic v1 which doesn't work on Python 3.14. * Speeds up installation and CI. Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-29 17:55:34 -06:00
Christophe Bornet	04c0c1bdc3	chore(langchain-classic): bump markupsafe lock for python 3.14 (#34523 ) Bump lock of MarkupSafe to 3.0.3 which has Python 3.14 pre-built wheels. Speeds up installation and CI. Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-29 17:55:26 -06:00
Efe Çelik	c1f5d0963d	fix: typo: saved the world 'wether' -> 'whether' (#34524 ) Changed "wether" to "whether" in test comments.	2025-12-29 17:28:09 -06:00
Mason Daugherty	e81f00fb29	docs(standard-tests): remove autodoc comment (#34532 )	2025-12-29 17:25:52 -06:00
Mason Daugherty	9ecf6360af	feat(infra): add more pre-commit hooks (#34519 )	2025-12-29 02:14:20 -06:00
JJ	7ce68f27da	fix(docs): correct Code of Conduct link in README (#34518 ) The Code of Conduct link was pointing to a non-existent file path. Updated to use GitHub's community standards tab URL which correctly displays the Code of Conduct. Changed from: https://github.com/langchain-ai/langchain/blob/master/.github/CODE_OF_CONDUCT.md To: https://github.com/langchain-ai/langchain/?tab=coc-ov-file (Replace this entire block of text) Read the full contributing guidelines: https://docs.langchain.com/oss/python/contributing/overview Thank you for contributing to LangChain! Follow these steps to have your pull request considered as ready for review. 1. PR title: Should follow the format: TYPE(SCOPE): DESCRIPTION - Examples: - fix(anthropic): resolve flag parsing error - feat(core): add multi-tenant support - test(openai): update API usage tests - Allowed TYPE and SCOPE values: https://github.com/langchain-ai/langchain/blob/master/.github/workflows/pr_lint.yml#L15-L33 2. PR description: - Write 1-2 sentences summarizing the change. - If this PR addresses a specific issue, please include "Fixes #ISSUE_NUMBER" in the description to automatically close the issue when the PR is merged. - If there are any breaking changes, please clearly describe them. - If this PR depends on another PR being merged first, please include "Depends on #PR_NUMBER" inthe description. 3. Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. - We will not consider a PR unless these three are passing in CI. Additional guidelines: - We ask that if you use generative AI for your contribution, you include a disclaimer. - PRs should not touch more than one package unless absolutely necessary. - Do not update the `uv.lock` files unless or add dependencies to `pyproject.toml` files (even optional ones) unless you have explicit permission to do so by a maintainer.	2025-12-29 01:47:25 -06:00
Christophe Bornet	03ae39747b	refactor(core): fix some missing generic types (#31658 ) See https://mypy.readthedocs.io/en/stable/config_file.html#confval-disallow_any_generics --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-12-27 16:53:08 -06:00
Sarah Clark	10de0a5364	fix(langchain-classic): pass default to `config.getoption` (#34034 ) Fixes #34033 --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-27 16:36:51 -06:00
Mason Daugherty	30ac1da0de	release(standard-tests): 1.1.2 (#34507 ) langchain-tests==1.1.2	2025-12-27 03:01:56 -06:00
Dragos Bobolea	6d447f89d9	fix(fireworks): `bind_tools(strict: bool)` and `reasoning_content` (#34343 ) Extract strict from kwargs and pass it to convert_to_openai_tool when converting tools. This ensures that when strict is provided, it's properly used during tool conversion and removed from kwargs before calling the parent bind method. Also extract reasoning_content from API responses and store it in additional_kwargs for AIMessage objects. Fixes https://github.com/langchain-ai/langchain/issues/34341 and https://github.com/langchain-ai/langchain/issues/34342 --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-27 02:42:06 -06:00
Christophe Bornet	5ef9f6e036	style(core): add ruff RUF012 rule (#34492 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-27 02:36:28 -06:00
Connor Hyatt	e3939ade5a	fix(core): support `(message class, template)` tuples in `ChatPromptTemplate.from_messages` (#33989 ) ### Description `ChatPromptTemplate.from_messages` supports multiple tuple formats for defining message templates. One documented format is `(message class, template)`, which allows users to specify the message type using the class directly: ```python ChatPromptTemplate.from_messages([ (SystemMessage, "You are a helpful assistant named {name}."), (HumanMessage, "{input}"), ]) ``` However, this syntax was broken. Passing a tuple like `(HumanMessage, "{input}")` would raise a Pydantic validation error because the conversion logic in `_convert_to_message_template` didn't handle `BaseMessage` subclasses—it only recognized string-based role identifiers like `"human"` or `"system"`. This PR adds the missing branch to detect when the first element of a tuple is a message class (by checking for the `type` class attribute) and routes it through `_create_template_from_message_type`, which already knows how to create the appropriate `MessagePromptTemplate` for each message type. ### Changes - Updated `_convert_to_message_template` to properly support `(message class, template)` tuples ### Testing Added 16 comprehensive unit tests covering: - Basic usage with `HumanMessage`, `AIMessage`, and `SystemMessage` classes - Integration with `invoke()` method - Mixed syntax (message class tuples alongside string tuples) - Multiple template variables - Edge cases: empty templates, static text (no variables) - Correct extraction of `input_variables` - Partial variables support - Combination with `MessagesPlaceholder` - Mustache template format - Template operations: `append()`, `extend()`, concatenation, and slicing - Special characters and unicode in templates ### Issue Fixes #33791 ### Dependencies None --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-12-27 02:20:33 -06:00

1 2 3 4 5 ...

15048 Commits