langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-21 10:26:57 +00:00

Author	SHA1	Message	Date
Christophe Bornet	f896bcdb1d	chore(langchain): add mypy pydantic plugin (#32610 )	2025-08-19 16:59:59 -04:00
Christophe Bornet	73a7de63aa	chore(text-splitters): add mypy pydantic plugin (#32611 )	2025-08-19 16:58:12 -04:00
Christophe Bornet	02d6b9106b	chore(core): add mypy pydantic plugin (#32604 ) This helps to remove a bunch of mypy false positives.	2025-08-19 09:39:53 -04:00
William FH	b470c79f1d	refactor(core): Use duck typing for `_StreamingCallbackHandler` (#32535 ) It's used in langgraph and maybe elsewhere, so would be preferable if it could just be duck-typed	2025-08-19 05:41:07 -07:00
Mohammad Mohtashim	00259b0061	fix(deepseek): Deep Seek Model for LS Tracing (#32575 ) - Description: Fix for LS Tracing for Provider for DeepSeek. - Issue: #32484 --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-08-18 18:48:30 +00:00
Mason Daugherty	a6690eb9fd	release(anthropic): 0.3.19 (#32595 )	2025-08-18 14:25:03 -04:00
Mason Daugherty	f69f9598f5	chore: update references to use the latest version of Claude-3.5 Sonnet (#32594 )	2025-08-18 14:11:15 -04:00
Mason Daugherty	8d0fb2d04b	fix(anthropic): correct `input_token` count for streaming (#32591 ) * Create usage metadata on [`message_delta`](https://docs.anthropic.com/en/docs/build-with-claude/streaming#event-types) instead of at the beginning. Consequently, token counts are not included during streaming but instead at the end. This allows for accurate reporting of server-side tool usage (important for billing) * Add some clarifying comments * Fix some outstanding Pylance warnings * Remove unnecessary `text` popping in thinking blocks * Also now correctly reports `input_cache_read`/`input_cache_creation` as a result	2025-08-18 17:51:47 +00:00
Mason Daugherty	8042b04da6	fix(anthropic): clean up null `file_id` fields in citations during message formatting (#32592 ) When citations are returned from streaming, they include a `file_id: null` field in their `content_block_location` structure. When these citations are passed back to the API in subsequent messages, the API rejects them with "Extra inputs are not permitted" for the `file_id` field.	2025-08-18 13:01:52 -04:00
Keyu Chen	03138f41a0	feat(text-splitters): add optional custom header pattern support (#31887 ) ## Description This PR adds support for custom header patterns in `MarkdownHeaderTextSplitter`, allowing users to define non-standard Markdown header formats (like `Header`) and specify their hierarchy levels. Issue: Fixes #22738 Dependencies: None - this change has no new dependencies Key Changes: - Added optional `custom_header_patterns` parameter to support non-standard header formats - Enable splitting on patterns like `Header` and `*Header` - Maintain full backward compatibility with existing usage - Added comprehensive tests for custom and mixed header scenarios ## Example Usage ```python from langchain_text_splitters import MarkdownHeaderTextSplitter headers_to_split_on = [ ("", "Chapter"), ("", "Section"), ] custom_header_patterns = { "": 1, # Level 1 headers "*": 2, # Level 2 headers } splitter = MarkdownHeaderTextSplitter( headers_to_split_on=headers_to_split_on, custom_header_patterns=custom_header_patterns, ) # Now Chapter 1 is treated as a level 1 header # And Section 1.1** is treated as a level 2 header ``` ## Testing - ✅ Added unit tests for custom header patterns - ✅ Added tests for mixed standard and custom headers - ✅ All existing tests pass (backward compatibility maintained) - ✅ Linting and formatting checks pass --- The implementation provides a flexible solution while maintaining the simplicity of the existing API. Users can continue using the splitter exactly as before, with the new functionality being entirely opt-in through the `custom_header_patterns` parameter. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Claude <noreply@anthropic.com>	2025-08-18 10:10:49 -04:00
Mason Daugherty	fd891ee3d4	revert(anthropic): streaming token counting to defer input tokens until completion (#32587 ) Reverts langchain-ai/langchain#32518	2025-08-18 09:48:33 -04:00
ccurme	b8cdbc4eca	fix(anthropic): sanitize tool use block when taking directly from content (#32574 )	2025-08-18 09:06:57 -04:00
Christophe Bornet	791d309c06	chore(langchain): add mypy `warn_unreachable` setting (#32529 ) See https://mypy.readthedocs.io/en/stable/config_file.html#confval-warn_unreachable --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-08-15 23:03:53 +00:00
Mason Daugherty	d3d23e2372	fix(anthropic): streaming token counting to defer input tokens until completion (#32518 ) Supersedes #32461 Fixed incorrect input token reporting during streaming when tools are used. Previously, input tokens were counted at `message_start` before tool execution, leading to inaccurate counts. Now input tokens are properly deferred until `message_delta` (completion), aligning with Anthropic's billing model and SDK expectations. Before Fix: - Streaming with tools: Input tokens = 0 ❌ - Non-streaming with tools: Input tokens = 472 ✅ After Fix: - Streaming with tools: Input tokens = 472 ✅ - Non-streaming with tools: Input tokens = 472 ✅ Aligns with Anthropic's SDK expectations. The SDK handles input token updates in `message_delta` events: ```python # https://github.com/anthropics/anthropic-sdk-python/blob/main/src/anthropic/lib/streaming/_messages.py if event.usage.input_tokens is not None: current_snapshot.usage.input_tokens = event.usage.input_tokens ```	2025-08-15 17:49:46 -04:00
Christophe Bornet	4656f727da	chore(text-splitters): add mypy `warn_unreachable` (#32558 )	2025-08-15 09:45:20 -04:00
Mason Daugherty	34800332bf	chore: update integrations table (#32556 ) Enhance the integrations table by adding the `js: '@langchain/community'` reference for several packages and updating the titles of specific integrations to avoid improper capitalization	2025-08-14 22:37:36 -04:00
Mason Daugherty	a0331285d7	fix(core): Support no-args tools by defaulting args to empty dict (#32530 ) Supersedes #32408 Description: This PR ensures that tool calls without explicitly provided `args` will default to an empty dictionary (`{}`), allowing tools with no parameters (e.g. `def foo() -> str`) to be registered and invoked without validation errors. This change improves compatibility with agent frameworks that may omit the `args` field when generating tool calls. Issue: See [langgraph#5722](https://github.com/langchain-ai/langgraph/issues/5722) – LangGraph currently emits tool calls without `args`, which leads to validation errors when tools with no parameters are invoked. This PR ensures compatibility by defaulting `args` to `{}` when missing. Dependencies: None --------- Thank you for contributing to LangChain! Follow these steps to mark your pull request as ready for review. If any of these steps are not completed, your PR will not be considered for review. - [ ] PR title: Follows the format: {TYPE}({SCOPE}): {DESCRIPTION} - Examples: - feat(core): add multi-tenant support - fix(cli): resolve flag parsing error - docs(openai): update API usage examples - Allowed `{TYPE}` values: - feat, fix, docs, style, refactor, perf, test, build, ci, chore, revert, release - Allowed `{SCOPE}` values (optional): - core, cli, langchain, standard-tests, docs, anthropic, chroma, deepseek, exa, fireworks, groq, huggingface, mistralai, nomic, ollama, openai, perplexity, prompty, qdrant, xai - Note: the `{DESCRIPTION}` must not start with an uppercase letter. - Once you've written the title, please delete this checklist item; do not include it in the PR. - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change. Include a [closing keyword](https://docs.github.com/en/issues/tracking-your-work-with-issues/using-issues/linking-a-pull-request-to-an-issue#linking-a-pull-request-to-an-issue-using-a-keyword) if applicable to a relevant issue. - Issue: the issue # it fixes, if applicable (e.g. Fixes #123) - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, you must include: 1. A test for the integration, preferably unit tests that do not rely on network access, 2. An example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. We will not consider a PR unless these three are passing in CI. See [contribution guidelines](https://python.langchain.com/docs/contributing/) for more. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to `pyproject.toml` files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. --------- Signed-off-by: jitokim <pigberger70@gmail.com> Co-authored-by: jito <pigberger70@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-08-14 20:28:36 +00:00
Mason Daugherty	397cd89988	docs: update outdated `README.md` content (#32540 )	2025-08-13 22:19:38 +00:00
RecallIO	4f71c35eb0	docs(docs): Add RecallIO.AI as a memory provider (#32331 ) Add requested files to add RecallIO as a memory provider. --------- Co-authored-by: Frey <gfreyburger@gmail.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-08-13 15:09:56 +00:00
Mason Daugherty	5b701b5189	fix(tests): add `anthropic_proxy` to configurable test parameters (for v1)	2025-08-12 18:33:21 -04:00
Mason Daugherty	8848b3e018	fix(tests): add `anthropic_proxy` to configurable test parameters	2025-08-12 18:27:35 -04:00
Mason Daugherty	80068432ed	chore(core): bump lock	2025-08-12 17:32:24 -04:00
Jack	b9dcce95be	fix(anthropic): Add proxy (#32409 ) Thank you for contributing to LangChain! Follow these steps to mark your pull request as ready for review. If any of these steps are not completed, your PR will not be considered for review. - [x] PR title: Follows the format: {TYPE}({SCOPE}): {DESCRIPTION} - [x] PR message: *Delete this entire checklist* and replace with fix #30146 - [x] Add tests and docs: If you're adding a new integration, you must include: - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. We will not consider a PR unless these three are passing in CI. See [contribution guidelines](https://python.langchain.com/docs/contributing/) for more. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to `pyproject.toml` files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-08-12 21:21:26 +00:00
ccurme	be83ce74a7	feat(anthropic): support `cache_control` as a kwarg (#31523 ) ```python from langchain_anthropic import ChatAnthropic llm = ChatAnthropic(model="claude-3-5-haiku-latest") caching_llm = llm.bind(cache_control={"type": "ephemeral"}) caching_llm.invoke( [ HumanMessage("..."), AIMessage("..."), HumanMessage("..."), # <-- final message / content block gets cache annotation ] ) ``` Potentially useful given's Anthropic's [incremental caching](https://docs.anthropic.com/en/docs/build-with-claude/prompt-caching#continuing-a-multi-turn-conversation) capabilities: > During each turn, we mark the final block of the final message with cache_control so the conversation can be incrementally cached. The system will automatically lookup and use the longest previously cached prefix for follow-up messages. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-08-12 16:18:24 -04:00
Mason Daugherty	1167e7458e	fix(anthropic): update test model names and adjust token count assertions in integration tests (#32422 )	2025-08-12 19:39:35 +00:00
Mason Daugherty	d5fd0bca35	docs(anthropic): add documentation for extended context windows in Claude Sonnet 4 (#32517 )	2025-08-12 19:16:26 +00:00
Mason Daugherty	262c83763f	release(openai): 0.3.30 (#32515 )	2025-08-12 16:06:17 +00:00
Mason Daugherty	0024dffa68	feat(openai): officially support `verbosity` (#32470 )	2025-08-12 16:00:30 +00:00
Christophe Bornet	1563099f3f	chore(langchain): select ALL rules with exclusions (#31930 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-08-12 11:51:31 -04:00
Christophe Bornet	cf2b4bbe09	chore(cli): select ALL rules with exclusions (#31936 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-08-11 22:43:11 +00:00
Christophe Bornet	09a616fe85	chore(standard-tests): add ruff rules D (#32347 ) See https://docs.astral.sh/ruff/rules/#pydocstyle-d Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-08-11 22:26:11 +00:00
Christophe Bornet	46bbd52e81	chore(cli): add ruff rules D1 (#32350 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-08-11 22:25:30 +00:00
Christophe Bornet	8b663ed6c6	chore(text-splitters): bump mypy version to 1.17 (#32387 ) Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-08-11 22:24:49 +00:00
Anderson	166c027434	docs: add scrapeless integration documentation (#32081 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "core: add foobar LLM" - Description: Integrated the Scrapeless package to enable Langchain users to seamlessly incorporate Scrapeless into their agents. - Dependencies: None - Twitter handle: [Scrapelessteam](https://x.com/Scrapelessteam) - [x] Add tests and docs: If you're adding a new integration, you must include: 1. A test for the integration, preferably unit tests that do not rely on network access, 2. An example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See [contribution guidelines](https://python.langchain.com/docs/contributing/) for more. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to `pyproject.toml` files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-08-11 22:16:15 +00:00
GDanksAnchor	4a2a3fcd43	docs: add anchorbrowser (#32494 ) # Description This PR updates the docs for the [langchain-anchorbrowser](https://pypi.org/project/langchain-anchorbrowser/) package. It adds a few tools [Anchor Browser](https://anchorbrowser.io/?utm=langchain) is the platform for AI Agentic browser automation, which solves the challenge of automating workflows for web applications that lack APIs or have limited API coverage. It simplifies the creation, deployment, and management of browser-based automations, transforming complex web interactions into simple API endpoints. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-08-11 21:48:10 +00:00
Anubhav Dhawan	1e38fd2ce3	docs: add integration guide for MCP Toolbox (#32344 ) This PR introduces a new integration guide for MCP Toolbox. The primary goal of this new documentation is to enhance the discoverability of MCP Toolbox for developers working within the LangChain ecosystem, providing them with a clear and direct path to using our tools. This approach was chosen to provide users with a practical, hands-on example that they can easily follow. > [!NOTE] > The page added in this PR is linked to from a section in Google partners page added in #32356. --------- Co-authored-by: Lauren Hirata Singh <lauren@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-08-11 21:03:38 +00:00
Mason Daugherty	5ccdcd7b7b	feat(ollama): docs updates (#32507 )	2025-08-11 15:39:44 -04:00
Mason Daugherty	ee4c2510eb	feat: port various nit changes from `wip-v0.4` (#32506 ) Lots of work that wasn't directly related to core improvements/messages/testing functionality	2025-08-11 15:09:08 -04:00
Mason Daugherty	e5d0a4e4d6	feat(standard-tests): formatting (#32504 ) Not touching `pyproject.toml` or chat model related items as to not interfere with work in wip0.4 branch	2025-08-11 13:30:30 -04:00
Mason Daugherty	457ce9c4b0	feat(text-splitters): ruff fixes and rules (#32502 )	2025-08-11 13:28:22 -04:00
Mason Daugherty	27b6b53f20	feat(xai): ruff fixes and rules (#32501 )	2025-08-11 13:03:07 -04:00
Christophe Bornet	f55186b38f	fix(core): fix beta decorator for properties (#32497 )	2025-08-11 12:43:53 -04:00
Mason Daugherty	374f414c91	feat(qdrant): ruff fixes and rules (#32500 )	2025-08-11 12:43:41 -04:00
ccurme	9259eea846	fix(docs): use pepy for integration package download badges (#32491 ) pypi stats has been down for some time.	2025-08-10 18:41:36 -04:00
ccurme	afcb097ef5	fix(docs): DigitalOcean Gradient: link to correct provider page and update page title (#32490 )	2025-08-10 17:29:44 -04:00
ccurme	088095b663	release(openai): release 0.3.29 (#32463 )	2025-08-08 11:04:33 -04:00
Mason Daugherty	c31236264e	chore: formatting across codebase (#32466 )	2025-08-08 10:20:10 -04:00
ccurme	02001212b0	fix(openai): revert some changes (#32462 ) Keep coverage on `output_version="v0"` (increasing coverage is being managed in v0.4 branch).	2025-08-08 08:51:18 -04:00
Mason Daugherty	00244122bd	feat(openai): `minimal` and `verbosity` (#32455 )	2025-08-08 02:24:21 +00:00
ccurme	6727d6e8c8	release(core): 0.3.74 (#32454 )	2025-08-07 16:39:01 -04:00

1 2 3 4 5 ...

7431 Commits