Merge branch 'master' into pg/python-3.12

2026-04-13 16:02:46 +00:00 · 2023-10-04 14:24:35 -04:00 · 2023-10-03 15:20:29 +00:00 · 2023-10-02 17:21:43 -04:00 · 2023-10-02 17:29:51 +00:00
5696 changed files with 502372 additions and 423101 deletions
--- a/.devcontainer/README.md
+++ b/.devcontainer/README.md
@@ -5,33 +5,25 @@ This project includes a [dev container](https://containers.dev/), which lets you
 You can use the dev container configuration in this folder to build and run the app without needing to install any of its tools locally! You can use it in [GitHub Codespaces](https://github.com/features/codespaces) or the [VS Code Dev Containers extension](https://marketplace.visualstudio.com/items?itemName=ms-vscode-remote.remote-containers).

 ## GitHub Codespaces
-
 [![Open in GitHub Codespaces](https://github.com/codespaces/badge.svg)](https://codespaces.new/langchain-ai/langchain)

 You may use the button above, or follow these steps to open this repo in a Codespace:
-
-1. Click the **Code** drop-down menu at the top of <https://github.com/langchain-ai/langchain>.
+1. Click the **Code** drop-down menu at the top of https://github.com/langchain-ai/langchain.
 1. Click on the **Codespaces** tab.
-1. Click **Create codespace on master**.
+1. Click **Create codespace on master** .

 For more info, check out the [GitHub documentation](https://docs.github.com/en/free-pro-team@latest/github/developing-online-with-codespaces/creating-a-codespace#creating-a-codespace).
-
+  
 ## VS Code Dev Containers
-
 [![Open in Dev Containers](https://img.shields.io/static/v1?label=Dev%20Containers&message=Open&color=blue&logo=visualstudiocode)](https://vscode.dev/redirect?url=vscode://ms-vscode-remote.remote-containers/cloneInVolume?url=https://github.com/langchain-ai/langchain)

-> [!NOTE]
-> If you click the link above you will open the main repo (`langchain-ai/langchain`) and *not* your local cloned repo. This is fine if you only want to run and test the library, but if you want to contribute you can use the link below and replace with your username and cloned repo name:
+Note: If you click this link, you will open the main repo and not your local cloned repo. To open your own clone instead, use the link below, replacing the placeholders with your username and cloned repo name:
+https://vscode.dev/redirect?url=vscode://ms-vscode-remote.remote-containers/cloneInVolume?url=https://github.com/<yourusername>/<yourclonedreponame>

-```txt
-https://vscode.dev/redirect?url=vscode://ms-vscode-remote.remote-containers/cloneInVolume?url=https://github.com/&lt;YOUR_USERNAME&gt;/&lt;YOUR_CLONED_REPO_NAME&gt;
-```

-Then you will have a local cloned repo where you can contribute and then create pull requests.
+If you already have VS Code and Docker installed, you can use the button above to get started. This will cause VS Code to automatically install the Dev Containers extension if needed, clone the source code into a container volume, and spin up a dev container for use.

-If you already have VS Code and Docker installed, you can use the button above to get started. This will use VSCode to automatically install the Dev Containers extension if needed, clone the source code into a container volume, and spin up a dev container for use.
-
-Alternatively you can also follow these steps to open this repo in a container using the VS Code Dev Containers extension:
+You can also follow these steps to open this repo in a container using the VS Code Dev Containers extension:

 1. If this is your first time using a development container, please ensure your system meets the pre-reqs (i.e. have Docker installed) in the [getting started steps](https://aka.ms/vscode-remote/containers/getting-started).

@@ -45,5 +37,5 @@ You can learn more in the [Dev Containers documentation](https://code.visualstud

 ## Tips and tricks

- If you are working with the same repository folder in a container and Windows, you'll want consistent line endings (otherwise you may see hundreds of changes in the SCM view). The `.gitattributes` file in the root of this repo will disable line ending conversion and should prevent this. See [tips and tricks](https://code.visualstudio.com/docs/devcontainers/tips-and-tricks#_resolving-git-line-ending-issues-in-containers-resulting-in-many-modified-files) for more info.
- If you'd like to review the contents of the image used in this dev container, you can check it out in the [devcontainers/images](https://github.com/devcontainers/images/tree/main/src/python) repo.
+* If you are working with the same repository folder in a container and Windows, you'll want consistent line endings (otherwise you may see hundreds of changes in the SCM view). The `.gitattributes` file in the root of this repo will disable line ending conversion and should prevent this. See [tips and tricks](https://code.visualstudio.com/docs/devcontainers/tips-and-tricks#_resolving-git-line-ending-issues-in-containers-resulting-in-many-modified-files) for more info.
+* If you'd like to review the contents of the image used in this dev container, you can check it out in the [devcontainers/images](https://github.com/devcontainers/images/tree/main/src/python) repo.
--- a/.devcontainer/devcontainer.json
+++ b/.devcontainer/devcontainer.json
@@ -1,58 +1,36 @@
 // For format details, see https://aka.ms/devcontainer.json. For config options, see the
 // README at: https://github.com/devcontainers/templates/tree/main/src/docker-existing-docker-compose
 {
-  // Name for the dev container
-  "name": "langchain",
-  // Point to a Docker Compose file
-  "dockerComposeFile": "./docker-compose.yaml",
-  // Required when using Docker Compose. The name of the service to connect to once running
-  "service": "langchain",
-  // The optional 'workspaceFolder' property is the path VS Code should open by default when
-  // connected. This is typically a file mount in .devcontainer/docker-compose.yml
-  "workspaceFolder": "/workspaces/langchain",
-  "mounts": [
-    "source=langchain-workspaces,target=/workspaces/langchain,type=volume"
-  ],
-  // Prevent the container from shutting down
-  "overrideCommand": true,
-  // Features to add to the dev container. More info: https://containers.dev/features
-  "features": {
-    "ghcr.io/devcontainers/features/git:1": {},
-    "ghcr.io/devcontainers/features/github-cli:1": {}
-  },
-  "containerEnv": {
-    "UV_LINK_MODE": "copy"
-  },
-  // Use 'forwardPorts' to make a list of ports inside the container available locally.
-  // "forwardPorts": [],
-  // Run commands after the container is created
-  "postCreateCommand": "cd libs/langchain_v1 && uv sync && echo 'LangChain (Python) dev environment ready!'",
-  // Configure tool-specific properties.
-  "customizations": {
-    "vscode": {
-      "extensions": [
-        "ms-python.python",
-        "ms-python.debugpy",
-        "ms-python.mypy-type-checker",
-        "ms-python.isort",
-        "unifiedjs.vscode-mdx",
-        "davidanson.vscode-markdownlint",
-        "ms-toolsai.jupyter",
-        "GitHub.copilot",
-        "GitHub.copilot-chat"
-      ],
-      "settings": {
-        "python.defaultInterpreterPath": "libs/langchain_v1/.venv/bin/python",
-        "python.formatting.provider": "none",
-        "[python]": {
-          "editor.formatOnSave": true,
-          "editor.codeActionsOnSave": {
-            "source.organizeImports": true
-          }
-        }
-      }
-    }
-  }
-  // Uncomment to connect as root instead. More info: https://aka.ms/dev-containers-non-root.
-  // "remoteUser": "root"
+	// Name for the dev container
+	"name": "langchain",
+
+	// Point to a Docker Compose file
+	"dockerComposeFile": "./docker-compose.yaml",
+
+	// Required when using Docker Compose. The name of the service to connect to once running
+	"service": "langchain",
+
+	// The optional 'workspaceFolder' property is the path VS Code should open by default when
+	// connected. This is typically a file mount in .devcontainer/docker-compose.yml
+	"workspaceFolder": "/workspaces/${localWorkspaceFolderBasename}",
+
+	// Prevent the container from shutting down
+	"overrideCommand": true
+
+	// Features to add to the dev container. More info: https://containers.dev/features
+	// "features": {
+	// 	"ghcr.io/devcontainers-contrib/features/poetry:2": {}
+	// }
+
+	// Use 'forwardPorts' to make a list of ports inside the container available locally.
+	// "forwardPorts": [],
+
+	// Uncomment the next line to run commands after the container is created.
+	// "postCreateCommand": "cat /etc/os-release",
+
+	// Configure tool-specific properties.
+	// "customizations": {},
+
+	// Uncomment to connect as root instead. More info: https://aka.ms/dev-containers-non-root.
+	// "remoteUser": "root"
 }
--- a/.devcontainer/docker-compose.yaml
+++ b/.devcontainer/docker-compose.yaml
@@ -4,10 +4,29 @@ services:
    build:
      dockerfile: libs/langchain/dev.Dockerfile
      context: ..
-
+    volumes:
+      # Update this to wherever you want VS Code to mount the folder of your project
+      - ..:/workspaces:cached
    networks:
-      - langchain-network
+      - langchain-network 
+  #   environment:
+  #     MONGO_ROOT_USERNAME: root
+  #     MONGO_ROOT_PASSWORD: example123
+  #   depends_on:
+  #     - mongo   
+  # mongo:
+  #   image: mongo
+  #   restart: unless-stopped
+  #   environment:
+  #     MONGO_INITDB_ROOT_USERNAME: root
+  #     MONGO_INITDB_ROOT_PASSWORD: example123
+  #   ports:
+  #     - "27017:27017"
+  #   networks:
+  #     - langchain-network

 networks:
  langchain-network:
    driver: bridge
+    
+    
--- a/.dockerignore
+++ b/.dockerignore
@@ -1,34 +0,0 @@
-# Git
-.git
-.github
-
-# Python
-__pycache__
-*.pyc
-*.pyo
-.venv
-.mypy_cache
-.pytest_cache
-.ruff_cache
-*.egg-info
-.tox
-
-# IDE
-.idea
-.vscode
-
-# Worktree
-worktree
-
-# Test artifacts
-.coverage
-htmlcov
-coverage.xml
-
-# Build artifacts
-dist
-build
-
-# Misc
-*.log
-.DS_Store
--- a/.editorconfig
+++ b/.editorconfig
@@ -1,52 +0,0 @@
-# top-most EditorConfig file
-root = true
-
-# All files
-[*]
-charset = utf-8
-end_of_line = lf
-insert_final_newline = true
-trim_trailing_whitespace = true
-
-# Python files
-[*.py]
-indent_style = space
-indent_size = 4
-max_line_length = 88
-
-# JSON files
-[*.json]
-indent_style = space
-indent_size = 2
-
-# YAML files
-[*.{yml,yaml}]
-indent_style = space
-indent_size = 2
-
-# Markdown files
-[*.md]
-indent_style = space
-indent_size = 2
-trim_trailing_whitespace = false
-
-# Configuration files
-[*.{toml,ini,cfg}]
-indent_style = space
-indent_size = 4
-
-# Shell scripts
-[*.sh]
-indent_style = space
-indent_size = 2
-
-# Makefile
-[Makefile]
-indent_style = tab
-indent_size = 4
-
-# Jupyter notebooks
-[*.ipynb]
-# Jupyter may include trailing whitespace in cell
-# outputs that's semantically meaningful
-trim_trailing_whitespace = false
--- a/.github/CODEOWNERS
+++ b/.github/CODEOWNERS
@@ -1,3 +0,0 @@
-/.github/   @ccurme @eyurtsev @mdrxy
-/libs/core/ @eyurtsev
-/libs/partners/ @ccurme @mdrxy
--- a/.github/CONTRIBUTING.md
+++ b/.github/CONTRIBUTING.md
@@ -0,0 +1,303 @@
+# Contributing to LangChain
+
+Hi there! Thank you for even being interested in contributing to LangChain.
+As an open source project in a rapidly developing field, we are extremely open
+to contributions, whether they be in the form of new features, improved infra, better documentation, or bug fixes.
+
+## 🗺️ Guidelines
+
+### 👩‍💻 Contributing Code
+
+To contribute to this project, please follow a ["fork and pull request"](https://docs.github.com/en/get-started/quickstart/contributing-to-projects) workflow.
+Please do not try to push directly to this repo unless you are a maintainer.
+
+Please follow the checked-in pull request template when opening pull requests. Note related issues and tag relevant
+maintainers.
+
+Pull requests cannot land without passing the formatting, linting and testing checks first. See [Testing](#testing) and
+[Formatting and Linting](#formatting-and-linting) for how to run these checks locally.
+
+It's essential that we maintain great documentation and testing. If you:
+- Fix a bug
+  - Add a relevant unit or integration test when possible. These live in `tests/unit_tests` and `tests/integration_tests`.
+- Make an improvement
+  - Update any affected example notebooks and documentation. These live in `docs`.
+  - Update unit and integration tests when relevant.
+- Add a feature
+  - Add a demo notebook in `docs/modules`.
+  - Add unit and integration tests.
+
+We're a small, building-oriented team. If there's something you'd like to add or change, opening a pull request is the
+best way to get our attention.
+
+### 🚩GitHub Issues
+
+Our [issues](https://github.com/langchain-ai/langchain/issues) page is kept up to date
+with bugs, improvements, and feature requests.
+
+There is a taxonomy of labels to help with sorting and discovery of issues of interest. Please use these to help
+organize issues.
+
+If you start working on an issue, please assign it to yourself.
+
+If you are adding an issue, please try to keep it focused on a single, modular bug/improvement/feature.
+If two issues are related, or blocking, please link them rather than combining them.
+
+We will try to keep these issues as up-to-date as possible, though
+with the rapid rate of development in this field some may get out of date.
+If you notice this happening, please let us know.
+
+### 🙋Getting Help
+
+Our goal is to have the simplest developer setup possible. Should you experience any difficulty getting set up, please
+contact a maintainer! Not only do we want to help get you unblocked, but we also want to make sure that the process is
+smooth for future contributors.
+
+In a similar vein, we do enforce certain linting, formatting, and documentation standards in the codebase.
+If you are finding these difficult (or even just annoying) to work with, feel free to contact a maintainer for help -
+we do not want these to get in the way of getting good code into the codebase.
+
+## 🚀 Quick Start
+
+This quick start describes running the repository locally.
+For a [development container](https://containers.dev/), see the [.devcontainer folder](https://github.com/langchain-ai/langchain/tree/master/.devcontainer).
+
+### Dependency Management: Poetry and other env/dependency managers
+
+This project uses [Poetry](https://python-poetry.org/) v1.6.1+ as a dependency manager.
+
+❗Note: *Before installing Poetry*, if you use `Conda`, create and activate a new Conda env (e.g. `conda create -n langchain python=3.9`)
+
+Install Poetry: **[documentation on how to install it](https://python-poetry.org/docs/#installation)**.
+
+❗Note: If you use `Conda` or `Pyenv` as your environment/package manager, after installing Poetry,
+tell Poetry to use the virtualenv python environment (`poetry config virtualenvs.prefer-active-python true`)
+
+### Core vs. Experimental
+
+There are two separate projects in this repository:
+- `langchain`: core langchain code, abstractions, and use cases
+- `langchain.experimental`: see the [Experimental README](../libs/experimental/README.md) for more information.
+
+Each of these has its own development environment. Docs are run from the top-level makefile, but development
+is split across separate test & release flows.
+
+For this quickstart, start with langchain core:
+
+```bash
+cd libs/langchain
+```
+
+### Local Development Dependencies
+
+Install langchain development requirements (for running langchain, running examples, linting, formatting, tests, and coverage):
+
+```bash
+poetry install --with test
+```
+
+Then verify dependency installation:
+
+```bash
+make test
+```
+
+If the tests don't pass, you may need to pip install additional dependencies, such as `numexpr` and `openapi_schema_pydantic`.
+
+If during installation you receive a `WheelFileValidationError` for `debugpy`, please make sure you are running
+Poetry v1.6.1+. This bug was present in older versions of Poetry (e.g. 1.4.1) and has been resolved in newer releases.
+If you are still seeing this bug on v1.6.1, you may also try disabling "modern installation"
+(`poetry config installer.modern-installation false`) and re-installing requirements.
+See [this `debugpy` issue](https://github.com/microsoft/debugpy/issues/1246) for more details.
+
+### Testing
+
+_Some test dependencies are optional; see [Working with Optional Dependencies](#working-with-optional-dependencies)._
+
+Unit tests cover modular logic that does not require calls to outside APIs.
+If you add new logic, please add a unit test.
+
+To run unit tests:
+
+```bash
+make test
+```
+
+To run unit tests in Docker:
+
+```bash
+make docker_tests
+```
+
+There are also [integration tests and code-coverage](../libs/langchain/tests/README.md) available.
+
+### Formatting and Linting
+
+Run these locally before submitting a PR; the CI system will also run them.
+
+#### Code Formatting
+
+Formatting for this project is done via a combination of [Black](https://black.readthedocs.io/en/stable/) and [ruff](https://docs.astral.sh/ruff/rules/).
+
+To run formatting for this project:
+
+```bash
+make format
+```
+
+Additionally, you can run the formatter only on the files that have been modified in your current branch as compared to the master branch using the format_diff command:
+
+```bash
+make format_diff
+```
+
+This is especially useful when you have made changes to a subset of the project and want to ensure your changes are properly formatted without affecting the rest of the codebase.
+
+#### Linting
+
+Linting for this project is done via a combination of [Black](https://black.readthedocs.io/en/stable/), [ruff](https://docs.astral.sh/ruff/rules/), and [mypy](http://mypy-lang.org/).
+
+To run linting for this project:
+
+```bash
+make lint
+```
+
+In addition, you can run the linter only on the files that have been modified in your current branch as compared to the master branch using the lint_diff command:
+
+```bash
+make lint_diff
+```
+
+This can be very helpful when you've made changes to only certain parts of the project and want to ensure your changes meet the linting standards without having to check the entire codebase.
+
+We recognize linting can be annoying - if you do not want to do it, please contact a project maintainer, and they can help you with it. We do not want this to be a blocker for good code getting contributed.
+
+#### Spellcheck
+
+Spellchecking for this project is done via [codespell](https://github.com/codespell-project/codespell).
+Note that `codespell` only looks for common typos, so it can produce false positives (correctly spelled but rarely used words) and false negatives (misspelled words it does not catch).
+
+To check spelling for this project:
+
+```bash
+make spell_check
+```
+
+To fix spelling in place:
+
+```bash
+make spell_fix
+```
+
+If codespell is incorrectly flagging a word, you can skip spellcheck for that word by adding it to the codespell config in the `pyproject.toml` file.
+
+```toml
+[tool.codespell]
+...
+# Add here:
+ignore-words-list = 'momento,collison,ned,foor,reworkd,parth,whats,aapply,mysogyny,unsecure'
+```
+
+## Working with Optional Dependencies
+
+Langchain relies heavily on optional dependencies to keep the Langchain package lightweight.
+
+If you're adding a new dependency to Langchain, assume that it will be an optional dependency, and
+that most users won't have it installed.
+
+Users who do not have the dependency installed should be able to **import** your code without
+any side effects (no warnings, no errors, no exceptions).
+
+To introduce the dependency to the pyproject.toml file correctly, please do the following:
+
+1. Add the dependency to the main group as an optional dependency
+   ```bash
+   poetry add --optional [package_name]
+   ```
+2. Open pyproject.toml and add the dependency to the `extended_testing` extra
+3. Relock the poetry file to update the extra.
+   ```bash
+   poetry lock --no-update
+   ```
+4. Add a unit test that at the very least attempts to import the new code. Ideally, the unit
+test makes use of lightweight fixtures to test the logic of the code.
+5. Please use the `@pytest.mark.requires(package_name)` decorator for any tests that require the dependency.
+
+## Adding a Jupyter Notebook
+
+If you are adding a Jupyter Notebook example, you'll want to install the optional `dev` dependencies.
+
+To install dev dependencies:
+
+```bash
+poetry install --with dev
+```
+
+Launch a notebook:
+
+```bash
+poetry run jupyter notebook
+```
+
+When you run `poetry install`, the `langchain` package is installed as editable in the virtualenv, so your new logic can be imported into the notebook.
+
+## Documentation
+
+While the code is split between `langchain` and `langchain.experimental`, the documentation is maintained as a single whole.
+This covers how to get started contributing to documentation.
+
+From the top-level of this repo, install documentation dependencies:
+
+```bash
+poetry install
+```
+
+### Contribute Documentation
+
+The docs directory contains Documentation and API Reference.
+
+Documentation is built using [Docusaurus 2](https://docusaurus.io/).
+
+The API Reference is largely autogenerated from the code by [Sphinx](https://www.sphinx-doc.org/en/master/).
+For that reason, we ask that you add good documentation to all classes and methods.
+
+Similar to linting, we recognize documentation can be annoying. If you do not want to do it, please contact a project maintainer, and they can help you with it. We do not want this to be a blocker for good code getting contributed.
+
+### Build Documentation Locally
+
+In the following commands, the prefix `api_` indicates that those are operations for the API Reference.
+
+Before building the documentation, it is always a good idea to clean the build directory:
+
+```bash
+make docs_clean
+make api_docs_clean
+```
+
+Next, you can build the documentation as outlined below:
+
+```bash
+make docs_build
+make api_docs_build
+```
+
+Finally, you can run the linkchecker to make sure all links are valid:
+
+```bash
+make docs_linkcheck
+make api_docs_linkcheck
+```
+
+## 🏭 Release Process
+
+As of now, LangChain has an ad hoc release process: releases are cut with high frequency by
+a developer and published to [PyPI](https://pypi.org/project/langchain/).
+
+LangChain follows the [semver](https://semver.org/) versioning standard. However, as pre-1.0 software,
+even patch releases may contain [non-backwards-compatible changes](https://semver.org/#spec-item-4).
+
+### 🌟 Recognition
+
+If your contribution has made its way into a release, we would like to give you credit on Twitter (only if you want it)!
+If you have a Twitter account you would like us to mention, please let us know in the PR or in another manner.
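Review note on the "Working with Optional Dependencies" section added in `CONTRIBUTING.md` above: the requirement that code stay importable without the optional package could be illustrated with a deferred import. The sketch below is only an assumption about how a contributor might satisfy it. The names `import_optional` and `ExampleLoader` are hypothetical and not part of LangChain, and the standard-library `json` module stands in for a real optional package so the snippet runs anywhere.

```python
import importlib
from types import ModuleType
from typing import Any, Optional


def import_optional(module_name: str, pip_name: Optional[str] = None) -> ModuleType:
    """Import a module at call time and raise a helpful error if it is missing.

    Hypothetical helper for illustration; not an existing LangChain function.
    """
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        raise ImportError(
            f"Could not import `{module_name}`. "
            f"Install it with `pip install {pip_name or module_name}`."
        ) from exc


class ExampleLoader:
    """Hypothetical integration that needs an optional package only when used."""

    def __init__(self, module_name: str = "json") -> None:
        # Only the module name is stored here, so importing and constructing
        # this class never touches the optional dependency.
        self._module_name = module_name

    def load(self, text: str) -> Any:
        # The optional import happens only when the feature is exercised.
        module = import_optional(self._module_name)
        return module.loads(text)
```

With this shape, importing the file that defines `ExampleLoader` produces no warnings or errors for users who lack the dependency; the `ImportError`, with an install hint, surfaces only when `load` is called.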
--- a/.github/ISSUE_TEMPLATE/bug-report.yml
+++ b/.github/ISSUE_TEMPLATE/bug-report.yml
@@ -1,153 +1,106 @@
 name: "\U0001F41B Bug Report"
-description: Report a bug in LangChain. To report a security issue, please instead use the security option (below). For questions, please use the LangChain forum (below).
-labels: ["bug"]
-type: bug
+description: Submit a bug report to help us improve LangChain. To report a security issue, please instead use the security option below.
+labels: ["02 Bug Report"]
 body:
  - type: markdown
    attributes:
-      value: |
-        > **All contributions must be in English.** See the [language policy](https://docs.langchain.com/oss/python/contributing/overview#language-policy).
+      value: >
+        Thank you for taking the time to file a bug report. Before creating a new
+        issue, please make sure to take a few moments to check the issue tracker
+        for existing issues about the bug.

-        Thank you for taking the time to file a bug report.
-
-        For usage questions, feature requests and general design questions, please use the [LangChain Forum](https://forum.langchain.com/).
-
-        Check these before submitting to see if your issue has already been reported, fixed or if there's another way to solve your problem:
-
-        * [Documentation](https://docs.langchain.com/oss/python/langchain/overview),
-        * [API Reference Documentation](https://reference.langchain.com/python/),
-        * [LangChain ChatBot](https://chat.langchain.com/)
-        * [GitHub search](https://github.com/langchain-ai/langchain),
-        * [LangChain Forum](https://forum.langchain.com/),
-  - type: checkboxes
-    id: checks
-    attributes:
-      label: Checked other resources
-      description: Please confirm and check all the following options.
-      options:
-        - label: This is a bug, not a usage question.
-          required: true
-        - label: I added a clear and descriptive title that summarizes this issue.
-          required: true
-        - label: I used the GitHub search to find a similar question and didn't find it.
-          required: true
-        - label: I am sure that this is a bug in LangChain rather than my code.
-          required: true
-        - label: The bug is not resolved by updating to the latest stable version of LangChain (or the specific integration package).
-          required: true
-        - label: This is not related to the langchain-community package.
-          required: true
-        - label: I posted a self-contained, minimal, reproducible example. A maintainer can copy it and run it AS IS.
-          required: true
-  - type: checkboxes
-    id: package
-    attributes:
-      label: Package (Required)
-      description: |
-        Which `langchain` package(s) is this bug related to? Select at least one.
-
-        Note that if the package you are reporting for is not listed here, it is not in this repository (e.g. `langchain-google-genai` is in [`langchain-ai/langchain-google`](https://github.com/langchain-ai/langchain-google/)).
-
-        Please report issues for other packages to their respective repositories.
-      options:
-        - label: langchain
-        - label: langchain-openai
-        - label: langchain-anthropic
-        - label: langchain-classic
-        - label: langchain-core
-        - label: langchain-model-profiles
-        - label: langchain-tests
-        - label: langchain-text-splitters
-        - label: langchain-chroma
-        - label: langchain-deepseek
-        - label: langchain-exa
-        - label: langchain-fireworks
-        - label: langchain-groq
-        - label: langchain-huggingface
-        - label: langchain-mistralai
-        - label: langchain-nomic
-        - label: langchain-ollama
-        - label: langchain-openrouter
-        - label: langchain-perplexity
-        - label: langchain-qdrant
-        - label: langchain-xai
-        - label: Other / not sure / general
  - type: textarea
-    id: related
-    validations:
-      required: false
+    id: system-info
    attributes:
-      label: Related Issues / PRs
+      label: System Info
+      description: Please share your system info with us.
+      placeholder: LangChain version, platform, python version, ...
+    validations:
+      required: true
+
+  - type: textarea
+    id: who-can-help
+    attributes:
+      label: Who can help?
      description: |
-        If this bug is related to any existing issues or pull requests, please link them here.
-      placeholder: |
-        * e.g. #123, #456
+        Your issue will be replied to more quickly if you can figure out the right person to tag with @.
+        If you know how to use git blame, that is the easiest way; otherwise, here is a rough guide of **who to tag**.
+
+        The core maintainers strive to read all issues, but tagging them will help them prioritize.
+
+        Please tag fewer than 3 people.
+
+        @hwchase17 - project lead
+
+        Tracing / Callbacks
+        - @agola11
+
+        Async
+        - @agola11
+
+        DataLoader Abstractions
+        - @eyurtsev
+
+        LLM/Chat Wrappers
+        - @hwchase17
+        - @agola11
+
+        Tools / Toolkits
+        - ...
+
+      placeholder: "@Username ..."
+
+  - type: checkboxes
+    id: information-scripts-examples
+    attributes:
+      label: Information
+      description: "The problem arises when using:"
+      options:
+        - label: "The official example notebooks/scripts"
+        - label: "My own modified scripts"
+
+  - type: checkboxes
+    id: related-components
+    attributes:
+      label: Related Components
+      description: "Select the components related to the issue (if applicable):"
+      options:
+        - label: "LLMs/Chat Models"
+        - label: "Embedding Models"
+        - label: "Prompts / Prompt Templates / Prompt Selectors"
+        - label: "Output Parsers"
+        - label: "Document Loaders"
+        - label: "Vector Stores / Retrievers"
+        - label: "Memory"
+        - label: "Agents / Agent Executors"
+        - label: "Tools / Toolkits"
+        - label: "Chains"
+        - label: "Callbacks/Tracing"
+        - label: "Async"
+
  - type: textarea
    id: reproduction
    validations:
      required: true
    attributes:
-      label: Reproduction Steps / Example Code (Python)
+      label: Reproduction
      description: |
-        Please add a self-contained, [minimal, reproducible, example](https://stackoverflow.com/help/minimal-reproducible-example) with your use case.
+        Please provide a [code sample](https://stackoverflow.com/help/minimal-reproducible-example) that reproduces the problem you ran into. It can be a Colab link or just a code snippet.
+        If you have code snippets, error messages, or stack traces, please provide them here as well.
+        Important! Use code tags to correctly format your code. See https://help.github.com/en/github/writing-on-github/creating-and-highlighting-code-blocks#syntax-highlighting
+        Avoid screenshots when possible, as they are hard to read and (more importantly) don't allow others to copy-and-paste your code.

-        If a maintainer can copy it, run it, and see it right away, there's a much higher chance that you'll be able to get help.
-
-        **Important!**
-
-        * Avoid screenshots, as they are hard to read and (more importantly) don't allow others to copy-and-paste your code.
-        * Reduce your code to the minimum required to reproduce the issue if possible.
-
-        (This will be automatically formatted into code, so no need for backticks.)
-      render: python
      placeholder: |
-        from langchain_core.runnables import RunnableLambda
+        Steps to reproduce the behavior:

-        def bad_code(inputs) -> int:
-          raise NotImplementedError('For demo purpose')
+          1.
+          2.
+          3.

-          chain = RunnableLambda(bad_code)
-          chain.invoke('Hello!')
  - type: textarea
-    attributes:
-      label: Error Message and Stack Trace (if applicable)
-      description: |
-        If you are reporting an error, please copy and paste the full error message and
-        stack trace.
-        (This will be automatically formatted into code, so no need for backticks.)
-      render: shell
-  - type: textarea
-    id: description
-    attributes:
-      label: Description
-      description: |
-        What is the problem, question, or error?
-
-        Write a short description telling what you are doing, what you expect to happen, and what is currently happening.
-      placeholder: |
-        * I'm trying to use the `langchain` library to do X.
-        * I expect to see Y.
-        * Instead, it does Z.
+    id: expected-behavior
    validations:
      required: true
-  - type: textarea
-    id: system-info
    attributes:
-      label: System Info
-      description: |
-        Please share your system info with us.
-
-        Run the following command in your terminal and paste the output here:
-
-        `python -m langchain_core.sys_info`
-
-        or if you have an existing python interpreter running:
-
-        ```python
-        from langchain_core import sys_info
-        sys_info.print_sys_info()
-        ```
-      placeholder: |
-        python -m langchain_core.sys_info
-    validations:
-      required: true
+      label: Expected behavior
+      description: "A clear and concise description of what you would expect to happen."
--- a/.github/ISSUE_TEMPLATE/config.yml
+++ b/.github/ISSUE_TEMPLATE/config.yml
@@ -1,15 +1,6 @@
-blank_issues_enabled: false
+blank_issues_enabled: true
 version: 2.1
 contact_links:
-  - name: 💬 LangChain Forum
-    url:  https://forum.langchain.com/
-    about: General community discussions and support
-  - name: 📚 LangChain Documentation
-    url: https://docs.langchain.com/oss/python/langchain/overview
-    about: View the official LangChain documentation
-  - name: 📚 API Reference Documentation
-    url: https://reference.langchain.com/python/
-    about: View the official LangChain API reference documentation
-  - name: 📚 Documentation issue
-    url: https://github.com/langchain-ai/docs/issues/new?template=01-langchain.yml
-    about: Report an issue related to the LangChain documentation
+  - name: Discord
+    url: https://discord.gg/6adMQxSpJS
+    about: General community discussions
--- a/.github/ISSUE_TEMPLATE/documentation.yml
+++ b/.github/ISSUE_TEMPLATE/documentation.yml
@@ -0,0 +1,19 @@
+name: Documentation
+description: Report an issue related to the LangChain documentation.
+title: "DOC: <Please write a comprehensive title after the 'DOC: ' prefix>"
+labels: [03 - Documentation]
+
+body:
+- type: textarea
+  attributes: 
+    label: "Issue with current documentation:"
+    description: >
+      Please make sure to leave a reference to the document/code you're
+      referring to.
+
+- type: textarea
+  attributes:
+    label: "Idea or request for content:"
+    description: >
+      Please describe as clearly as possible what topics you think are missing
+      from the current documentation.
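As a rough illustration of what a form like the one above has to satisfy, the sketch below checks a parsed issue form for the top-level keys and per-element labels that GitHub's issue-form syntax requires. The helper name `check_issue_form` and the dict literal are hypothetical; the form is written as the dict a YAML parser would be expected to produce, since the standard library has no YAML loader.

```python
from typing import Any, Dict, List

# Element types that carry an `attributes.label` in GitHub's issue-form syntax.
LABELED_TYPES = {"textarea", "input", "dropdown", "checkboxes"}


def check_issue_form(form: Dict[str, Any]) -> List[str]:
    """Return a list of problems found in a parsed issue form (empty if none)."""
    problems = []
    for key in ("name", "description", "body"):
        if key not in form:
            problems.append(f"missing top-level key: {key}")
    seen_ids = set()
    for index, element in enumerate(form.get("body", [])):
        kind = element.get("type")
        if kind in LABELED_TYPES and not element.get("attributes", {}).get("label"):
            problems.append(f"body[{index}] ({kind}) has no attributes.label")
        element_id = element.get("id")
        if element_id is not None:
            if element_id in seen_ids:
                problems.append(f"duplicate id: {element_id}")
            seen_ids.add(element_id)
    return problems


# The documentation template, transcribed by hand into a dict (an assumption
# about how the YAML parses, not output from a real parser).
documentation_form = {
    "name": "Documentation",
    "description": "Report an issue related to the LangChain documentation.",
    "title": "DOC: <Please write a comprehensive title after the 'DOC: ' prefix>",
    "labels": ["03 - Documentation"],
    "body": [
        {"type": "textarea", "attributes": {"label": "Issue with current documentation:"}},
        {"type": "textarea", "attributes": {"label": "Idea or request for content:"}},
    ],
}

print(check_issue_form(documentation_form))
```

A check like this only covers structure; GitHub performs its own validation when the template is loaded.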
--- a/.github/ISSUE_TEMPLATE/feature-request.yml
+++ b/.github/ISSUE_TEMPLATE/feature-request.yml
@@ -1,155 +1,30 @@
-name: "✨ Feature Request"
-description: Request a new feature or enhancement for LangChain. For questions, please use the LangChain forum (below).
-labels: ["feature request"]
-type: feature
+name: "\U0001F680 Feature request"
+description: Submit a proposal/request for a new LangChain feature
+labels: ["02 Feature Request"]
 body:
-  - type: markdown
-    attributes:
-      value: |
-        > **All contributions must be in English.** See the [language policy](https://docs.langchain.com/oss/python/contributing/overview#language-policy).
-
-        Thank you for taking the time to request a new feature.
-
-        Use this to request NEW FEATURES or ENHANCEMENTS in LangChain. For bug reports, please use the bug report template. For usage questions and general design questions, please use the [LangChain Forum](https://forum.langchain.com/).
-
-        Relevant links to check before filing a feature request to see if your request has already been made or
-        if there's another way to achieve what you want:
-
-        * [Documentation](https://docs.langchain.com/oss/python/langchain/overview),
-        * [API Reference Documentation](https://reference.langchain.com/python/),
-        * [LangChain ChatBot](https://chat.langchain.com/)
-        * [GitHub search](https://github.com/langchain-ai/langchain),
-        * [LangChain Forum](https://forum.langchain.com/),
-
-        **Note:** Do not begin work on a PR unless explicitly assigned to this issue by a maintainer.
-  - type: checkboxes
-    id: checks
-    attributes:
-      label: Checked other resources
-      description: Please confirm and check all the following options.
-      options:
-        - label: This is a feature request, not a bug report or usage question.
-          required: true
-        - label: I added a clear and descriptive title that summarizes the feature request.
-          required: true
-        - label: I used the GitHub search to find a similar feature request and didn't find it.
-          required: true
-        - label: I checked the LangChain documentation and API reference to see if this feature already exists.
-          required: true
-        - label: This is not related to the langchain-community package.
-          required: true
-  - type: checkboxes
-    id: package
-    attributes:
-      label: Package (Required)
-      description: |
-        Which `langchain` package(s) is this request related to? Select at least one.
-
-        Note that if the package your request concerns is not listed here, it is not in this repository (e.g. `langchain-google-genai` is in `langchain-ai/langchain-google`).
-
-        Please submit feature requests for other packages to their respective repositories.
-      options:
-        - label: langchain
-        - label: langchain-openai
-        - label: langchain-anthropic
-        - label: langchain-classic
-        - label: langchain-core
-        - label: langchain-model-profiles
-        - label: langchain-tests
-        - label: langchain-text-splitters
-        - label: langchain-chroma
-        - label: langchain-deepseek
-        - label: langchain-exa
-        - label: langchain-fireworks
-        - label: langchain-groq
-        - label: langchain-huggingface
-        - label: langchain-mistralai
-        - label: langchain-nomic
-        - label: langchain-ollama
-        - label: langchain-openrouter
-        - label: langchain-perplexity
-        - label: langchain-qdrant
-        - label: langchain-xai
-        - label: Other / not sure / general
  - type: textarea
-    id: feature-description
+    id: feature-request
    validations:
      required: true
    attributes:
-      label: Feature Description
+      label: Feature request
      description: |
-        Please provide a clear and concise description of the feature you would like to see added to LangChain.
+        A clear and concise description of the feature proposal. Please provide links to any relevant GitHub repos, papers, or other resources if relevant.

-        What specific functionality are you requesting? Be as detailed as possible.
-      placeholder: |
-        I would like LangChain to support...
-
-        This feature would allow users to...
  - type: textarea
-    id: use-case
+    id: motivation
    validations:
      required: true
    attributes:
-      label: Use Case
+      label: Motivation
      description: |
-        Describe the specific use case or problem this feature would solve.
+        Please outline the motivation for the proposal. Is your feature request related to a problem? e.g., I'm always frustrated when [...]. If this is related to another GitHub issue, please link here too.

-        Why do you need this feature? What problem does it solve for you or other users?
-      placeholder: |
-        I'm trying to build an application that...
-
-        Currently, I have to work around this by...
-
-        This feature would help me/users to...
  - type: textarea
-    id: proposed-solution
+    id: contribution
    validations:
-      required: false
+      required: true
    attributes:
-      label: Proposed Solution
+      label: Your contribution
      description: |
-        If you have ideas about how this feature could be implemented, please describe them here.
-
-        This is optional but can be helpful for maintainers to understand your vision.
-      placeholder: |
-        I think this could be implemented by...
-
-        The API could look like...
-
-        ```python
-        # Example of how the feature might work
-        ```
-  - type: textarea
-    id: alternatives
-    validations:
-      required: false
-    attributes:
-      label: Alternatives Considered
-      description: |
-        Have you considered any alternative solutions or workarounds?
-
-        What other approaches have you tried or considered?
-      placeholder: |
-        I've tried using...
-
-        Alternative approaches I considered:
-        1. ...
-        2. ...
-
-        But these don't work because...
-  - type: textarea
-    id: additional-context
-    validations:
-      required: false
-    attributes:
-      label: Additional Context
-      description: |
-        Add any other context, screenshots, examples, or references that would help explain your feature request.
-      placeholder: |
-        Related issues: #...
-
-        Similar features in other libraries:
-        - ...
-
-        Additional context or examples:
-        - ...
+        Is there any way that you could help, e.g. by submitting a PR? Make sure to read the CONTRIBUTING.MD [readme](https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md)
--- a/.github/ISSUE_TEMPLATE/other.yml
+++ b/.github/ISSUE_TEMPLATE/other.yml
@@ -0,0 +1,18 @@
+name: Other Issue
+description: Raise an issue that wouldn't be covered by the other templates.
+title: "Issue: <Please write a comprehensive title after the 'Issue: ' prefix>"
+labels: [04 - Other]
+
+body:
+  - type: textarea
+    attributes:
+      label: "Issue you'd like to raise."
+      description: >
+        Please describe the issue you'd like to raise as clearly as possible.
+        Make sure to include any relevant links or references.
+
+  - type: textarea
+    attributes:
+      label: "Suggestion:"
+      description: >
+        Please outline a suggestion to improve the issue here.
--- a/.github/ISSUE_TEMPLATE/privileged.yml
+++ b/.github/ISSUE_TEMPLATE/privileged.yml
@@ -1,49 +0,0 @@
-name: 🔒 Privileged
-description: You are a LangChain maintainer, or were asked directly by a maintainer to create an issue here. If not, check the other options.
-body:
-  - type: markdown
-    attributes:
-      value: |
-        If you are not a LangChain maintainer, employee, or were not asked directly by a maintainer to create an issue, then please start the conversation on the [LangChain Forum](https://forum.langchain.com/) instead.
-  - type: checkboxes
-    id: privileged
-    attributes:
-      label: Privileged issue
-      description: Confirm that you are allowed to create an issue here.
-      options:
-        - label: I am a LangChain maintainer, or was asked directly by a LangChain maintainer to create an issue here.
-          required: true
-  - type: textarea
-    id: content
-    attributes:
-      label: Issue Content
-      description: Add the content of the issue here.
-  - type: checkboxes
-    id: package
-    attributes:
-      label: Package (Required)
-      description: |
-        Please select package(s) that this issue is related to.
-      options:
-        - label: langchain
-        - label: langchain-openai
-        - label: langchain-anthropic
-        - label: langchain-classic
-        - label: langchain-core
-        - label: langchain-model-profiles
-        - label: langchain-tests
-        - label: langchain-text-splitters
-        - label: langchain-chroma
-        - label: langchain-deepseek
-        - label: langchain-exa
-        - label: langchain-fireworks
-        - label: langchain-groq
-        - label: langchain-huggingface
-        - label: langchain-mistralai
-        - label: langchain-nomic
-        - label: langchain-ollama
-        - label: langchain-openrouter
-        - label: langchain-perplexity
-        - label: langchain-qdrant
-        - label: langchain-xai
-        - label: Other / not sure / general
--- a/.github/ISSUE_TEMPLATE/task.yml
+++ b/.github/ISSUE_TEMPLATE/task.yml
@@ -1,120 +0,0 @@
-name: "📋 Task"
-description: Create a task for project management and tracking by LangChain maintainers. If you are not a maintainer, please use other templates or the forum.
-labels: ["task"]
-type: task
-body:
-  - type: markdown
-    attributes:
-      value: |
-        Thanks for creating a task to help organize LangChain development.
-
-        This template is for **maintainer tasks** such as project management, development planning, refactoring, documentation updates, and other organizational work.
-
-        If you are not a LangChain maintainer or were not asked directly by a maintainer to create a task, then please start the conversation on the [LangChain Forum](https://forum.langchain.com/) instead or use the appropriate bug report or feature request templates on the previous page.
-  - type: checkboxes
-    id: maintainer
-    attributes:
-      label: Maintainer task
-      description: Confirm that you are allowed to create a task here.
-      options:
-        - label: I am a LangChain maintainer, or was asked directly by a LangChain maintainer to create a task here.
-          required: true
-  - type: textarea
-    id: task-description
-    attributes:
-      label: Task Description
-      description: |
-        Provide a clear and detailed description of the task.
-
-        What needs to be done? Be specific about the scope and requirements.
-      placeholder: |
-        This task involves...
-
-        The goal is to...
-
-        Specific requirements:
-        - ...
-        - ...
-    validations:
-      required: true
-  - type: textarea
-    id: acceptance-criteria
-    attributes:
-      label: Acceptance Criteria
-      description: |
-        Define the criteria that must be met for this task to be considered complete.
-
-        What are the specific deliverables or outcomes expected?
-      placeholder: |
-        This task will be complete when:
-        - [ ] ...
-        - [ ] ...
-        - [ ] ...
-    validations:
-      required: true
-  - type: textarea
-    id: context
-    attributes:
-      label: Context and Background
-      description: |
-        Provide any relevant context, background information, or links to related issues/PRs.
-
-        Why is this task needed? What problem does it solve?
-      placeholder: |
-        Background:
-        - ...
-
-        Related issues/PRs:
-        - #...
-
-        Additional context:
-        - ...
-    validations:
-      required: false
-  - type: textarea
-    id: dependencies
-    attributes:
-      label: Dependencies
-      description: |
-        List any dependencies or blockers for this task.
-
-        Are there other tasks, issues, or external factors that need to be completed first?
-      placeholder: |
-        This task depends on:
-        - [ ] Issue #...
-        - [ ] PR #...
-        - [ ] External dependency: ...
-
-        Blocked by:
-        - ...
-    validations:
-      required: false
-  - type: checkboxes
-    id: package
-    attributes:
-      label: Package (Required)
-      description: |
-        Please select package(s) that this task is related to.
-      options:
-        - label: langchain
-        - label: langchain-openai
-        - label: langchain-anthropic
-        - label: langchain-classic
-        - label: langchain-core
-        - label: langchain-model-profiles
-        - label: langchain-tests
-        - label: langchain-text-splitters
-        - label: langchain-chroma
-        - label: langchain-deepseek
-        - label: langchain-exa
-        - label: langchain-fireworks
-        - label: langchain-groq
-        - label: langchain-huggingface
-        - label: langchain-mistralai
-        - label: langchain-nomic
-        - label: langchain-ollama
-        - label: langchain-openrouter
-        - label: langchain-perplexity
-        - label: langchain-qdrant
-        - label: langchain-xai
-        - label: Other / not sure / general
--- a/.github/PULL_REQUEST_TEMPLATE.md
+++ b/.github/PULL_REQUEST_TEMPLATE.md
@@ -1,43 +1,20 @@
-Fixes #
+<!-- Thank you for contributing to LangChain!

-<!-- Replace everything above this line with a 1-2 sentence description of your change. Keep the "Fixes #xx" keyword and update the issue number. -->
+Replace this entire comment with:
+  - **Description:** a description of the change, 
+  - **Issue:** the issue # it fixes (if applicable),
+  - **Dependencies:** any dependencies required for this change,
+  - **Tag maintainer:** for a quicker response, tag the relevant maintainer (see below),
+  - **Twitter handle:** we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out!

-Read the full contributing guidelines: https://docs.langchain.com/oss/python/contributing/overview
+Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally.

-> **All contributions must be in English.** See the [language policy](https://docs.langchain.com/oss/python/contributing/overview#language-policy).
+See contribution guidelines for more information on how to write/run tests, lint, etc: 
+https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md

-If you paste a large clearly AI generated description here your PR may be IGNORED or CLOSED!
+If you're adding a new integration, please include:
+  1. a test for the integration, preferably unit tests that do not rely on network access,
+  2. an example notebook showing its use. It lives in `docs/extras` directory.

-Thank you for contributing to LangChain! Follow these steps to have your pull request considered as ready for review.
-
-1. PR title: Should follow the format: TYPE(SCOPE): DESCRIPTION
-
-  - Examples:
-    - fix(anthropic): resolve flag parsing error
-    - feat(core): add multi-tenant support
-    - test(openai): update API usage tests
-  - Allowed TYPE and SCOPE values: https://github.com/langchain-ai/langchain/blob/master/.github/workflows/pr_lint.yml#L15-L33
-
-2. PR description:
-
-  - Write 1-2 sentences summarizing the change.
-  - The `Fixes #xx` line at the top is **required** for external contributions — update the issue number and keep the keyword. This links your PR to the approved issue and auto-closes it on merge.
-  - If there are any breaking changes, please clearly describe them.
-  - If this PR depends on another PR being merged first, please include "Depends on #PR_NUMBER" in the description.
-
-3. Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified.
-
-  - We will not consider a PR unless these three are passing in CI.
-
-4. How did you verify your code works?
-
-Additional guidelines:
-
-  - All external PRs must link to an issue or discussion where a solution has been approved by a maintainer, and you must be assigned to that issue. PRs without prior approval will be closed.
-  - PRs should not touch more than one package unless absolutely necessary.
-  - Do not update the `uv.lock` files or add dependencies to `pyproject.toml` files (even optional ones) unless you have explicit permission to do so by a maintainer.
-
-## Social handles (optional)
-<!-- If you'd like a shoutout on release, add your socials below -->
-Twitter: @
-LinkedIn: https://linkedin.com/in/
+If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17.
+ -->
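The title convention in the removed template, `TYPE(SCOPE): DESCRIPTION`, can be sketched as a small check. This is a minimal illustration, not the project's linter: the helper name `parse_pr_title` and the character classes in the pattern are assumptions, and the authoritative TYPE and SCOPE lists live in the `pr_lint.yml` workflow referenced by the template.

```python
import re

# TYPE(SCOPE): DESCRIPTION, e.g. "fix(anthropic): resolve flag parsing error".
# Assumption: lowercase type, and a scope of lowercase letters, digits, "_" or "-".
# The real allowed values are defined in pr_lint.yml and are not reproduced here.
TITLE_PATTERN = re.compile(
    r"^(?P<type>[a-z]+)\((?P<scope>[a-z0-9_-]+)\): (?P<description>\S.*)$"
)


def parse_pr_title(title):
    """Return the title's parts as a dict, or None if it does not fit the format."""
    match = TITLE_PATTERN.match(title)
    return match.groupdict() if match else None


# The three example titles quoted in the template, plus one that does not fit.
for title in (
    "fix(anthropic): resolve flag parsing error",
    "feat(core): add multi-tenant support",
    "test(openai): update API usage tests",
    "Update README",
):
    print(title, "->", parse_pr_title(title))
```

A shape check like this says nothing about whether the type or scope is one the workflow accepts; that would require the lists from the workflow file.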
--- a/.github/THREAT_MODEL_CORE.md
+++ b/.github/THREAT_MODEL_CORE.md
@@ -1,400 +0,0 @@
-# Threat Model: langchain-core
-
-> Generated: 2026-04-08 | Commit: d3e60f5c03 | Scope: libs/core/ (langchain-core v1.2.27) | Visibility: Open Source | Mode: deep
-
-> **Disclaimer:** This threat model is automatically generated to help developers and security researchers understand where trust is placed in this system and where boundaries exist. It is experimental, subject to change, and not an authoritative security reference — findings should be validated before acting on them. The analysis may be incomplete or contain inaccuracies. We welcome suggestions and corrections to improve this document.
-
-For vulnerability reporting, see [GitHub Security Advisories](https://github.com/langchain-ai/langchain/security/advisories/new).
-
-See also: the [langchain_v1 threat model](THREAT_MODEL_V1.md) for the agent middleware layer.
-
---
-
-## Scope
-
-### In Scope
-
-- `libs/core/langchain_core/load/` — Serialization/deserialization system (`loads`, `load`, `dumpd`, `dumps`, `Reviver`, allowlists, secret handling)
-- `libs/core/langchain_core/_security/` — SSRF protection (`validate_safe_url`, `is_safe_url`, `SSRFProtected*` annotated types)
-- `libs/core/langchain_core/prompts/` — Prompt templates, template formatting, deprecated prompt loading from files
-- `libs/core/langchain_core/tools/` — Tool base classes, argument validation, schema generation
-- `libs/core/langchain_core/output_parsers/` — JSON, XML, Pydantic output parsers
-- `libs/core/langchain_core/runnables/` — Composable pipeline primitives (`RunnableLambda`, `RunnableSequence`, etc.)
-- `libs/core/langchain_core/callbacks/` — Callback manager, handler invocation
-- `libs/core/langchain_core/messages/` — Message types, content blocks, message utilities
-- `libs/core/langchain_core/language_models/` — Abstract base classes for LLMs and chat models
-- `libs/core/langchain_core/utils/` — Environment variable access, formatting, function schema extraction
-
-### Out of Scope
-
-- `libs/langchain_v1/` — Agent middleware, execution policies, file search middleware (separate package; see [THREAT_MODEL_V1.md](THREAT_MODEL_V1.md))
-- `libs/partners/` — Partner integration packages (separate packages, each with their own threat surface)
-- `libs/text-splitters/` — Document chunking (separate package)
-- `libs/standard-tests/` — Test harnesses; not shipped as attack surface
-- `tests/` — Unit and integration tests (read during analysis for understanding; not threat-modeled)
-- User application code, model selection, custom tools, custom callbacks — user-controlled
-- LLM model behavior — the project cannot guarantee model safety across all models users may select
-- Deployment infrastructure — users control hosting, network topology, and secrets management
-- LangSmith, LangGraph — separate products and repositories
-
-### Assumptions
-
-1. The project is used as a library/framework — users control their own application code, model selection, and deployment infrastructure.
-2. API keys are sourced from environment variables or passed explicitly; the framework does not store them persistently.
-3. Users are responsible for validating that serialized LangChain objects (passed to `loads()`/`load()`) come from trusted sources.
-4. The `langchain-core` serialization allowlist (`allowed_objects='core'`) is the default and correct choice for untrusted data.
-5. `defusedxml` is not a required dependency of langchain-core; users who need `XMLOutputParser` must install it separately or accept reduced XML security.
-6. Jinja2 template format is blocked in deserialization and file-based prompt loading but available at runtime construction with `SandboxedEnvironment` — users who opt in accept the residual sandbox bypass risk.
-
---
-
-## System Overview
-
-`langchain-core` is the foundational Python library for the LangChain ecosystem. It provides base abstractions for building LLM-powered applications: messages, prompts, tools, runnables (composable pipelines), callbacks, output parsers, serialization, and SSRF protection. It does not serve HTTP traffic, store user data persistently, or communicate with external services directly — it is a library that processes data on behalf of user applications. Concrete LLM provider integrations live in separate partner packages.
-
-### Architecture Diagram
-
-```
-┌───────────────────────────────────────────────────────────────────┐
-│                        User Application                           │
-│                                                                   │
-│  User Code ────┬──► Prompt Templates (C3) ──► Messages (C8)      │
-│                │         │                                        │
-│                │    Template vars                                 │
-│                │                                                  │
-│                ├──► Tools Framework (C4) ──► Tool execution       │
-│                │    (arg schema validation)    (user-defined)     │
-│                │                                                  │
-│                ├──► Runnables (C6) ──► Composition pipeline       │
-│                │                                                  │
-│                ├──► Callbacks (C7) ──► User callback handlers     │
-│                │                                                  │
-│                └──► Output Parsers (C5) ◄── LLM output text      │
-│                                                                   │
-│ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ TB1 ─ ─ ─ ─ │
-│                                                                   │
-│  Serialization System (C1)                                        │
-│    loads()/load() ──► Reviver ──► importlib + cls(**kwargs)       │
-│         │               │                                         │
-│    ─ ─ ─│─ ─ ─ TB2 ─ ─ │                                         │
-│         │               ├──► Allowlist check                      │
-│         │               ├──► Namespace validation                 │
-│         │               ├──► Jinja2 blocking                      │
-│         │               └──► secrets_from_env ──► os.environ      │
-│                                                                   │
-│  SSRF Protection (C2)                                             │
-│    validate_safe_url() ──► DNS resolution ──► IP validation       │
-│    ─ ─ ─ ─ ─ ─ ─ ─ ─ TB6 ─ ─ ─ ─ ─ ─ ─                        │
-│                                                                   │
-│  Deprecated Prompt Loading (C10)                                  │
-│    load_prompt() ──► _validate_path() ──► filesystem              │
-│    ─ ─ ─ ─ ─ ─ ─ TB5 ─ ─ ─ ─ ─ ─ ─                             │
-└───────────────────────────────────────────────────────────────────┘
-```
-
---
-
-## Components
-
-| ID | Component | Description | Trust Level | Default? | Entry Points |
-|----|-----------|-------------|-------------|----------|--------------|
-| C1 | Serialization System | Allowlist-based JSON (de)serialization for LangChain objects; blocks jinja2, validates namespaces, handles secrets | framework-controlled | Yes (when `load`/`loads` called) | `langchain_core/load/load.py:loads`, `langchain_core/load/load.py:load`, `langchain_core/load/dump.py:dumpd`, `langchain_core/load/dump.py:dumps` |
-| C2 | SSRF Protection | URL validation utility blocking private IPs, localhost, and cloud metadata endpoints | framework-controlled | No (explicit opt-in via `validate_safe_url` or `SSRFProtected*` Pydantic types) | `langchain_core/_security/_ssrf_protection.py:validate_safe_url`, `langchain_core/_security/_ssrf_protection.py:is_safe_url` |
-| C3 | Prompt Templates | Template rendering for LLM prompts; supports f-string (default, safe) and mustache; jinja2 allowed at runtime with SandboxedEnvironment | framework-controlled | Yes | `langchain_core/prompts/prompt.py:PromptTemplate.from_template`, `langchain_core/prompts/chat.py:ChatPromptTemplate.from_template`, `langchain_core/prompts/chat.py:ChatPromptTemplate.format_messages` |
-| C4 | Tools Framework | Base tool classes with Pydantic schema validation for arguments; user-defined tool functions execute without sandboxing | framework-controlled (validation) / user-controlled (execution) | Yes | `langchain_core/tools/base.py:BaseTool.invoke`, `langchain_core/tools/base.py:BaseTool.ainvoke`, `langchain_core/tools/convert.py:tool` (decorator) |
-| C5 | Output Parsers | Parse LLM text output into structured formats (JSON, XML, Pydantic models) | framework-controlled | Yes | `langchain_core/output_parsers/json.py:JsonOutputParser.parse_result`, `langchain_core/output_parsers/xml.py:XMLOutputParser.parse`, `langchain_core/output_parsers/pydantic.py:PydanticOutputParser.parse_result` |
-| C6 | Runnables | Composable pipeline primitives; wraps arbitrary user functions via `RunnableLambda` | framework-controlled (composition) / user-controlled (lambda bodies) | Yes | `langchain_core/runnables/base.py:Runnable.invoke`, `langchain_core/runnables/base.py:RunnableLambda` |
-| C7 | Callbacks System | Invokes user-provided callback handlers with run data (inputs, outputs, errors, metadata) | framework-controlled (invocation) / user-controlled (handler code) | Yes | `langchain_core/callbacks/manager.py:CallbackManager`, `langchain_core/callbacks/manager.py:handle_event` |
-| C8 | Messages | Pydantic-validated message types (Human, AI, System, Tool) and content blocks (text, image, audio, file) | framework-controlled | Yes | `langchain_core/messages/utils.py:convert_to_messages`, `langchain_core/messages/utils.py:messages_from_dict` |
-| C9 | Language Model Abstractions | Abstract base classes for chat models and LLMs; define interfaces for partner implementations | framework-controlled | Yes | `langchain_core/language_models/chat_models.py:BaseChatModel.invoke` |
-| C10 | Prompt Loading (deprecated) | File-based prompt loading with path validation; deprecated since v1.2.21, removal target v2.0.0 | framework-controlled | Yes (when `load_prompt` called) | `langchain_core/prompts/loading.py:load_prompt`, `langchain_core/prompts/loading.py:_validate_path` |
-| C11 | Utility Layer | Environment variable access, string formatting, function schema extraction | framework-controlled | Yes | `langchain_core/utils/env.py:get_from_dict_or_env`, `langchain_core/utils/formatting.py:StrictFormatter` |
-
---
-
-## Data Classification
-
-Classifies all sensitive data types found in the codebase with their sensitivity levels, storage locations, and retention policies.
-
-| ID | PII Category | Specific Fields | Sensitivity | Storage Location(s) | Encrypted at Rest | Retention | Regulatory |
-|----|-------------|----------------|-------------|---------------------|-------------------|-----------|------------|
-| DC1 | API/service credentials | `SecretStr` fields in partner model constructors (e.g., `openai_api_key`), `langchain_core/load/serializable.py:Serializable.lc_secrets` property values | Critical | In-memory (`SecretStr`); OS environment variables | N/A (in-memory only; `SecretStr` masks values in repr/logs) | Process lifetime | All |
-| DC2 | LLM conversation data | `HumanMessage.content`, `AIMessage.content`, `SystemMessage.content`, `ToolMessage.content`, prompt template variables | High | In-memory (transient) | N/A (not persisted by framework) | Transient (garbage-collected) | GDPR, CCPA (when containing PII) |
-| DC3 | Serialized LangChain objects | JSON payloads to `loads()`/`load()`; includes `kwargs` for any allowed class, `secret` type entries | High | User-application storage (not framework responsibility) | N/A (framework does not store) | User-controlled | N/A |
-| DC4 | OS environment variables | Arbitrary `os.environ` values accessible via `secrets_from_env=True` in `Reviver` | Critical | Host OS environment | N/A | Process lifetime | All (secrets may include credentials, tokens, database URLs) |
-| DC5 | Prompt template content | Template strings, template file contents, interpolated variables | Medium | In-memory; filesystem (for file-based loading) | N/A | Process lifetime | N/A |
-| DC6 | Tool call arguments | LLM-generated function call arguments passed to `BaseTool.invoke` | High | In-memory (transient) | N/A | Transient | N/A (may contain user PII depending on tool) |
-| DC7 | Callback/tracer data | Run inputs, outputs, errors, metadata, tags passed to callback handlers | Medium | In-memory; LangSmith (if tracer enabled, user-configured) | N/A (framework does not persist) | User-controlled | GDPR, CCPA (when containing PII) |
-
-### Data Classification Details
-
-#### DC1: API/service credentials
-
-- **Fields**: API key fields across all partner integrations inheriting from `langchain_core/load/serializable.py:Serializable.lc_secrets`. Common pattern: `{field_name: "ENV_VAR_NAME"}`.
-- **Storage**: In-memory only, wrapped in Pydantic `SecretStr`. Sourced from environment variables via `langchain_core/utils/utils.py:secret_from_env` at instantiation.
-- **Access**: Read by partner SDK constructors at instantiation. `SecretStr` wrapper prevents accidental logging via `repr`/`str`.
-- **Encryption**: In-memory only; no at-rest encryption needed (not persisted). In-transit via HTTPS to each provider API.
-- **Retention**: Process lifetime. Credentials released when the model object is garbage-collected.
-- **Logging exposure**: Protected by `SecretStr`; direct access requires `.get_secret_value()`. Risk exists if users log message contents that embed credentials (user responsibility).
-- **Cross-border**: Transmitted to respective provider APIs over HTTPS; users choose which provider and thus which jurisdiction.
-- **Gaps**: None identified in framework code.
-
-#### DC2: LLM conversation data
-
- **Fields**: `HumanMessage.content`, `AIMessage.content`, `SystemMessage.content`, `ToolMessage.content`, `ChatMessage.content`, prompt template input variables (arbitrary user-supplied kwargs).
- **Storage**: In-memory only within langchain-core. No persistence layer in core — persistence is user-application responsibility (e.g., chat history databases, LangSmith tracing).
- **Access**: Read by prompt templates (`langchain_core/prompts/chat.py:ChatPromptTemplate.format_messages`), output parsers, callback handlers, and tracers. Passed as kwargs through runnables.
- **Encryption**: N/A (not persisted by framework; in-transit encryption depends on user's tracing/logging configuration).
- **Retention**: Transient — garbage-collected when message objects go out of scope. No framework-level caching.
- **Logging exposure**: Message content is passed to callback handlers via `langchain_core/callbacks/manager.py:handle_event`. If a user registers a logging callback (e.g., `StdOutCallbackHandler`), message content appears in logs.
- **Gaps**: No framework-level PII detection or redaction. Users who pass PII in messages are responsible for downstream handling.
-
-#### DC3: Serialized LangChain objects
-
- **Fields**: JSON payloads to `loads()`/`load()` containing `{"lc": 1, "type": "constructor", "id": [...], "kwargs": {...}}` structures, including `secret` type entries (`{"lc": 1, "type": "secret", "id": [env_var_name]}`).
- **Storage**: Not stored by langchain-core. Users provide serialized payloads from their own storage (databases, files, APIs).
- **Access**: Consumed by `langchain_core/load/load.py:Reviver.__call__` during deserialization. The `kwargs` within become constructor arguments for instantiated classes.
- **Encryption**: N/A (framework does not store; encryption of serialized data at rest is user responsibility).
- **Retention**: Transient during deserialization; final objects retained per user's reference management.
- **Logging exposure**: Serialized payloads may contain secret references that, if logged before deserialization, reveal environment variable names (not values). Post-deserialization, values depend on whether `secrets_from_env=True`.
- **Gaps**: No integrity validation (signing/MAC) on serialized payloads. The allowlist prevents arbitrary class instantiation but does not verify payload authenticity.
-
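The DC3 gap above (no signing or MAC on serialized payloads) is something an application could close on its own side. The following is a minimal sketch, assuming the application controls both the writer and the reader of the stored payload. The envelope format and the `sign_payload` / `verify_payload` helpers are hypothetical and are not part of langchain-core.

```python
import hashlib
import hmac
import json


def sign_payload(payload: dict, key: bytes) -> dict:
    """Wrap a serialized payload in a hypothetical envelope with an HMAC-SHA256 tag."""
    # Canonical JSON so the same payload always produces the same bytes.
    body = json.dumps(payload, sort_keys=True, separators=(",", ":"))
    tag = hmac.new(key, body.encode("utf-8"), hashlib.sha256).hexdigest()
    return {"body": body, "tag": tag}


def verify_payload(envelope: dict, key: bytes) -> dict:
    """Return the payload only if the tag matches; raise ValueError otherwise."""
    body = envelope["body"]
    expected = hmac.new(key, body.encode("utf-8"), hashlib.sha256).hexdigest()
    # compare_digest avoids leaking the mismatch position through timing.
    if not hmac.compare_digest(expected, envelope["tag"]):
        raise ValueError("payload failed integrity check")
    return json.loads(body)
```

Only a payload that passes `verify_payload` would then be handed to the deserializer. This authenticates the payload's origin; it does not replace the allowlist checks.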
-#### DC6: Tool call arguments
-
- **Fields**: LLM-generated `ToolCall.args` dict — keys and values determined by the LLM based on the tool's Pydantic schema. Passed to `langchain_core/tools/base.py:BaseTool._parse_input` then to user-defined `_run()`.
- **Storage**: In-memory only. Passed through the tool invocation chain; not persisted by the framework.
- **Access**: Validated by Pydantic in `BaseTool._parse_input`, then passed as `**kwargs` to user-defined tool functions via `langchain_core/tools/structured.py:StructuredTool._run`. Also passed to callback handlers via `langchain_core/callbacks/manager.py:CallbackManager.on_tool_start`.
- **Encryption**: N/A (in-memory only).
- **Retention**: Transient.
- **Logging exposure**: Tool arguments are passed to `on_tool_start` callback handlers. Default `StdOutCallbackHandler` prints tool inputs.
- **Gaps**: Pydantic validates types but not semantic content of string fields. An LLM can generate tool arguments containing adversarial content (prompt injection payloads, exfiltration URLs) that pass type validation.
-
-#### DC4: OS environment variables
-
- **Fields**: Any `os.environ` key named in a serialized payload's `secret` field, when `secrets_from_env=True` is passed to `loads()`/`load()`.
- **Storage**: Host OS environment.
- **Access**: `langchain_core/load/load.py:Reviver.__call__` (line 417) reads `os.environ[key]` directly when `secrets_from_env=True`. An attacker who controls the serialized payload can name any environment variable — there is no allowlist on variable names.
- **Encryption**: N/A (environment variables are plaintext at the OS level).
- **Retention**: Process lifetime.
- **Logging exposure**: Values returned as deserialized constructor kwargs; exposure depends on user logging.
- **Gaps**: **Critical gap**: When `secrets_from_env=True`, any environment variable can be read by a crafted payload. Default is `False`. The escape mechanism (`langchain_core/load/_validation.py:_is_escaped_dict`) prevents injection through the normal serialization round-trip, but direct `loads()` of attacker-controlled JSON bypasses this protection.
-
---
-
-## Trust Boundaries
-
-| ID | Boundary | Description | Controls (Inside) | Does NOT Control (Outside) |
-|----|----------|-------------|-------------------|---------------------------|
-| TB1 | User application ↔ langchain-core public API | Entry point for all user-provided inputs to the framework | Pydantic model validation on all public classes, default configurations (`template_format='f-string'`, `allowed_objects='core'`, `secrets_from_env=False`), tool argument schema validation | Model selection, custom tool implementations, custom callback handlers, application-level input sanitization, deployment topology |
-| TB2 | Untrusted payload ↔ serialization engine | JSON deserialization entry point via `loads()`/`load()` | Namespace allowlist (`DEFAULT_NAMESPACES`), class path allowlist (`allowed_objects='core'` default), jinja2 blocking (`langchain_core/load/load.py:default_init_validator`), Bedrock SSRF blocking (`langchain_core/load/validators.py:_bedrock_validator`), `__lc_escaped__` injection protection (`langchain_core/load/_validation.py:_is_escaped_dict`), `Serializable` subclass enforcement | Trustworthiness of the serialized payload; whether `secrets_from_env=True` is used; whether `allowed_objects='all'` is used |
-| TB3 | LLM output ↔ output parsers / tool invocation | Boundary where untrusted LLM-generated content enters framework processing | Pydantic schema validation for tool arguments (`langchain_core/tools/base.py:BaseTool._parse_input`), JSON/XML structural parsing, `defusedxml` for XML parsing (when installed) | LLM response content, model behavior, semantic meaning of tool arguments |
-| TB4 | Framework ↔ user-provided callbacks/tools | Boundary where user-authored code is invoked by the framework | Callback invocation protocol (`langchain_core/callbacks/manager.py:handle_event`), tool argument schema validation, exception handling around callback calls | What callback/tool code does, side effects, network calls, file I/O performed by user code |
-| TB5 | Framework ↔ filesystem | File access via deprecated `load_prompt` and `_load_template` | Path traversal prevention (`langchain_core/prompts/loading.py:_validate_path`): rejects absolute paths and `..` components; file type restriction to `.txt`; symlink resolution before suffix check | Content of files on disk; filesystem permissions; symbolic link targets outside validated paths |
-| TB6 | URL input ↔ SSRF protection | URL validation before external HTTP requests | Private IP range blocking (RFC 1918), cloud metadata endpoint blocking (AWS/GCP/Azure/Alibaba/Oracle), localhost blocking, DNS resolution validation (`langchain_core/_security/_ssrf_protection.py:validate_safe_url`) | DNS infrastructure behavior (rebinding); whether calling code actually uses `validate_safe_url` before fetching |
-
-### Boundary Details
-
-#### TB1: User application ↔ langchain-core public API
-
- **Inside**: Pydantic model validation on all public classes. Default configurations: `template_format='f-string'`, `allowed_objects='core'`, `secrets_from_env=False`. `StrictFormatter` (`langchain_core/utils/formatting.py:StrictFormatter`) blocks positional arguments. Template variable validation (`langchain_core/prompts/string.py:get_template_variables`) blocks attribute access (`.`) and indexing (`[`, `]`) in f-string variables.
- **Outside**: What users pass as tool implementations, callback handlers, and model configurations. Users may register tools that perform arbitrary operations; the framework validates tool argument schemas but not tool behavior.
- **Crossing mechanism**: Python function calls to public API methods.
-
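The f-string restrictions listed for TB1 can be illustrated with the standard library alone. This is a rough sketch of the idea, not the code in `get_template_variables` or `StrictFormatter`. The function name `safe_template_variables` is hypothetical, and nested format specs are not inspected.

```python
from string import Formatter


def safe_template_variables(template: str) -> list[str]:
    """Collect f-string style variable names, rejecting risky forms.

    Illustrative only: rejects positional fields, attribute access, and indexing.
    """
    names = []
    for _literal, field_name, _spec, _conversion in Formatter().parse(template):
        if field_name is None:
            continue  # literal text with no replacement field
        if field_name == "" or field_name.isdigit():
            raise ValueError("positional fields are not allowed")
        if any(ch in field_name for ch in ".[]"):
            raise ValueError(f"attribute access or indexing in {field_name!r}")
        names.append(field_name)
    return names
```

For example, `safe_template_variables("Hello {name}")` returns `["name"]`, while `"{input.__class__}"` and `"{0}"` both raise `ValueError`.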
-#### TB2: Untrusted payload ↔ serialization engine
-
- **Inside**: `langchain_core/load/load.py:Reviver.__init__` builds the class path allowlist. `langchain_core/load/load.py:Reviver.__call__` enforces: (1) namespace validation against `DEFAULT_NAMESPACES`, (2) allowlist check against `allowed_class_paths`, (3) `DISALLOW_LOAD_FROM_PATH` blocks for `langchain_community` and `langchain`, (4) class-specific validators via `CLASS_INIT_VALIDATORS`, (5) general init validator (jinja2 blocking), (6) `Serializable` subclass check, (7) `importlib.import_module()` with validated namespace. `langchain_core/load/_validation.py:_is_escaped_dict` prevents user data dicts from being treated as LC objects.
- **Outside**: The content of the JSON payload; whether the caller passes trusted or untrusted data; whether the caller enables `secrets_from_env=True` or broadens `allowed_objects`.
- **Crossing mechanism**: `json.loads(text)` + `Reviver` object hook.
-
-#### TB3: LLM output ↔ output parsers / tool invocation
-
- **Inside**: `langchain_core/tools/base.py:BaseTool._parse_input` validates tool arguments against Pydantic schemas. `langchain_core/output_parsers/json.py:JsonOutputParser.parse_result` uses `json.loads()` (safe). `langchain_core/output_parsers/xml.py:XMLOutputParser.parse` uses `defusedxml` by default. Tool names matched against registered tool list.
- **Outside**: LLM response content is untrusted — it may contain prompt injection, malicious tool call arguments, or unexpected structured data. No semantic sanitization of free-form text.
- **Crossing mechanism**: Python function calls from LLM response processing to parser/tool invocation.
-
-#### TB4: Framework ↔ user-provided callbacks/tools
-
- **Inside**: `langchain_core/callbacks/manager.py:handle_event` invokes handler methods with exception handling. `langchain_core/tools/base.py:BaseTool.run` passes validated arguments to user-defined `_run()`.
- **Outside**: What callback/tool code does — arbitrary Python execution, side effects, network calls.
- **Crossing mechanism**: Python method calls to user-provided handler/tool instances.
-
-#### TB5: Framework ↔ filesystem
-
- **Inside**: `langchain_core/prompts/loading.py:_validate_path` rejects absolute paths (line 30) and `..` directory traversal (line 38). `langchain_core/prompts/loading.py:_load_template` resolves symlinks before checking file suffix (line 101), restricts to `.txt` files only (line 103).
- **Outside**: Filesystem contents within validated paths; OS-level file permissions.
- **Crossing mechanism**: Python `Path.read_text()`, `json.load()`, `yaml.safe_load()`.
-
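The ordering of the TB5 checks matters: the suffix test runs on the symlink-resolved path, so a `.txt` link pointing at another file type is rejected. A minimal sketch of that sequence follows. `validate_template_path` and its `base_dir` parameter are assumptions made for illustration and do not reproduce `_validate_path` or `_load_template`.

```python
from pathlib import Path


def validate_template_path(raw: str, base_dir: Path) -> Path:
    """Apply the checks described above, in order (hypothetical helper)."""
    candidate = Path(raw)
    if candidate.is_absolute():
        raise ValueError("absolute paths are not allowed")
    if ".." in candidate.parts:
        raise ValueError("'..' components are not allowed")
    # Resolve symlinks first so the suffix check sees the real target.
    resolved = (base_dir / candidate).resolve()
    if resolved.suffix != ".txt":
        raise ValueError("only .txt templates are allowed")
    return resolved
```

As noted under "Outside", a symlink whose target is a `.txt` file elsewhere on disk would still pass this sketch; confining the resolved path to `base_dir` would be an additional application-level check.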
-#### TB6: URL input ↔ SSRF protection
-
- **Inside**: `langchain_core/_security/_ssrf_protection.py:validate_safe_url` validates URL scheme (http/https only), resolves DNS via `socket.getaddrinfo()`, checks each resolved IP against private ranges (RFC 1918), cloud metadata IPs/hostnames (169.254.169.254, metadata.google.internal, etc.), and localhost. Cloud metadata is ALWAYS blocked, even with `allow_private=True`. Fails closed on DNS errors.
- **Outside**: DNS infrastructure behavior; whether downstream code actually calls `validate_safe_url` before making HTTP requests.
- **Crossing mechanism**: URL string passed in, validated string returned.
-
---
-
-## Data Flows
-
-| ID | Source | Destination | Data Type | Classification | Crosses Boundary | Protocol |
-|----|--------|-------------|-----------|----------------|------------------|----------|
-| DF1 | User application | C1 Serialization (`loads`/`load`) | JSON serialized LC object payload (may contain secret refs for DC1) | DC1, DC3 | TB2 | Python function call |
-| DF2 | User application | C3 Prompt Templates → C8 Messages | Prompt template strings and variables, producing formatted messages | DC5, DC2 | TB1 | Python function call |
-| DF3 | User application | C10 `load_prompt` → filesystem | Config file path, template file path | DC5 | TB5 | Python file I/O |
-| DF4 | User application / partner code | C2 SSRF Protection | URL string for validation | — | TB6 | Python function call |
-| DF5 | LLM output (C8 Messages via partner) | C5 Output Parsers | LLM-generated text (JSON, XML, structured) | DC2, DC6 | TB3 | Python function call |
-| DF6 | LLM output (C8 Messages via partner) | C4 Tools Framework (`BaseTool.invoke`) | Tool call arguments (name, args dict) | DC6 | TB3, TB4 | Python function call |
-| DF7 | C6 Runnables / C4 Tools | C7 Callbacks (`CallbackManager`) | Run data: inputs, outputs, errors, metadata | DC7, DC2 | TB4 | Python function call |
-| DF8 | C1 Serialized payload (secret type) | OS environment (`os.environ`) | Environment variable name from payload | DC4 | TB2 | `os.environ[key]` |
-| DF9 | C1 Serialized payload (constructor type) | Python runtime (`importlib`) | Module path, class name, kwargs | DC3 | TB2 | `importlib.import_module()` |
-| DF10 | User application | C6 Runnables (`RunnableLambda`) | Arbitrary user function + input data | DC2 | TB4 | Python function call |
-
-### Flow Details
-
-#### DF1: User application → Serialization API (`loads`/`load`)
-
- **Data**: JSON string or dict representing serialized LangChain objects. Sensitivity depends on whether it contains `secret` fields (DC3/DC4).
- **Validation**: `langchain_core/load/load.py:Reviver.__call__` enforces namespace allowlist, class path allowlist, jinja2 blocking, Bedrock endpoint blocking, `Serializable` subclass check, and `__lc_escaped__` injection protection.
- **Trust assumption**: Caller ensures the payload comes from a trusted source. If `secrets_from_env=True`, caller trusts the payload completely with access to all OS environment variables.
-
-#### DF5: LLM output → Output Parsers
-
- **Data**: LLM-generated text intended to be parsed as JSON, XML, or Pydantic-structured data.
- **Validation**: `langchain_core/output_parsers/json.py:JsonOutputParser` uses `json.loads()` (safe). `langchain_core/output_parsers/xml.py:XMLOutputParser` defaults to `defusedxml`; if `defusedxml` is not installed and the user switches to `parser="xml"`, the standard library parser is used instead (see T3). `langchain_core/output_parsers/pydantic.py:PydanticOutputParser` validates against user-defined Pydantic schema.
- **Trust assumption**: LLM output is untrusted. Parsers extract structure but do not sanitize semantic content.
-
-#### DF6: LLM output → Tool invocation
-
- **Data**: Tool call arguments — function name and arguments dict generated by the LLM.
- **Validation**: Tool names matched against registered tool list. Arguments validated via `langchain_core/tools/base.py:BaseTool._parse_input` using Pydantic schema. Type validation only — no semantic sanitization of string field contents.
- **Trust assumption**: LLM output is untrusted. Schema validation prevents type errors but not adversarial content in text fields.
-
-#### DF8: Serialized secret → `os.environ`
-
- **Data**: Environment variable name extracted from `{"lc": 1, "type": "secret", "id": ["VAR_NAME"]}` in deserialized payload.
- **Validation**: None on variable name — any `os.environ` key can be read.
- **Trust assumption**: Only activated when `secrets_from_env=True` (default `False`). Caller trusts the payload not to name sensitive environment variables.
-
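Because DF8 applies no validation to the variable name, one application-side mitigation is to resolve secret references against an explicit allowlist instead of passing `secrets_from_env=True`. The sketch below is hypothetical: `resolve_secret` and `ALLOWED_SECRET_NAMES` are illustrative names and are not part of `Reviver`.

```python
import os
from typing import Mapping, Optional

# Assumption: the application knows in advance which secrets a payload may name.
ALLOWED_SECRET_NAMES = frozenset({"OPENAI_API_KEY", "ANTHROPIC_API_KEY"})


def resolve_secret(
    node: dict,
    environ: Optional[Mapping[str, str]] = None,
    allowed: frozenset = ALLOWED_SECRET_NAMES,
) -> Optional[str]:
    """Resolve {"lc": 1, "type": "secret", "id": [NAME]} only for allowlisted names."""
    env = os.environ if environ is None else environ
    if node.get("lc") != 1 or node.get("type") != "secret":
        raise ValueError("not a secret node")
    ids = node.get("id")
    if not isinstance(ids, list) or len(ids) != 1 or not isinstance(ids[0], str):
        raise ValueError("secret node must name exactly one variable")
    name = ids[0]
    if name not in allowed:
        raise PermissionError(f"secret {name!r} is not on the allowlist")
    return env.get(name)
```

With this shape, a payload naming `AWS_SECRET_ACCESS_KEY` fails with `PermissionError` rather than returning the value.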
-#### DF9: Serialized constructor → `importlib`
-
- **Data**: Module path and class name from `{"lc": 1, "type": "constructor", "id": ["namespace", ..., "ClassName"]}`.
- **Validation**: Namespace validated against `DEFAULT_NAMESPACES` (lines 456-462, 480-482). Class path checked against allowlist. Imported class must be `Serializable` subclass. Init validators run before instantiation.
- **Trust assumption**: Allowlist constrains which classes can be instantiated. Side effects in allowed classes' `__init__` are accepted risk.
-
---
-
-## Threats
-
-| ID | Data Flow | Classification | Threat | Boundary | Severity | Validation | Code Reference |
-|----|-----------|----------------|--------|----------|----------|------------|----------------|
-| T1 | DF8 | DC4 | Arbitrary OS environment variable exfiltration via crafted serialized payload when `secrets_from_env=True` | TB2 | High | Verified | `langchain_core/load/load.py:Reviver.__call__` (line 417) |
-| T2 | DF9 | DC3 | Side effects in allowed class `__init__` during deserialization when using `allowed_objects='all'` | TB2 | Medium | Likely | `langchain_core/load/load.py:Reviver.__call__` (line 506) |
-| T3 | DF5 | DC2 | XML entity expansion (DTD bomb) via `XMLOutputParser` when `defusedxml` not installed and `parser='xml'` | TB3 | Medium | Verified | `langchain_core/output_parsers/xml.py:XMLOutputParser.parse` (line 246) |
-| T4 | DF4 | — | DNS rebinding SSRF bypass in `validate_safe_url` due to TOCTOU between DNS validation and downstream HTTP request | TB6 | Medium | Likely | `langchain_core/_security/_ssrf_protection.py:validate_safe_url` (lines 251-280) |
-| T5 | DF6 | DC6 | Prompt injection via LLM-generated tool call arguments influencing subsequent LLM context in agentic workflows | TB3 | Medium | Unverified | `langchain_core/tools/base.py:BaseTool.invoke` |
-| T6 | DF2 | DC5 | Jinja2 sandbox escape via runtime `PromptTemplate(template_format='jinja2')` using SandboxedEnvironment bypass | TB1 | Medium | Unverified | `langchain_core/prompts/string.py:jinja2_formatter` (line 71) |
-
-### Threat Details
-
-#### T1: Arbitrary OS environment variable exfiltration via `secrets_from_env=True`
-
- **Flow**: DF8 (Serialized payload → `os.environ` via `Reviver.__call__`)
- **Description**: When `secrets_from_env=True` is passed to `loads()`/`load()`, a crafted serialized payload can name any OS environment variable in its `secret` fields (e.g., `{"lc":1,"type":"secret","id":["AWS_SECRET_ACCESS_KEY"]}`). The `Reviver.__call__` method reads that key from `os.environ` and returns it as a constructor `kwarg`. There is no allowlist or validation on which environment variable names can be read. The escape mechanism (`__lc_escaped__`) prevents injection through the normal `dumpd`/`dumps` round-trip, but direct `loads()` of attacker-controlled JSON bypasses this protection entirely.
- **Preconditions**: (1) User passes `secrets_from_env=True` to `loads()`/`load()`; AND (2) user passes attacker-controlled serialized data that did not originate from `dumpd()`/`dumps()`. Both conditions must be true simultaneously.
- **Historical context**: GHSA-c67j-w6g6-q2cm covers this pattern.
-
-#### T2: Side effects in allowed class `__init__` during deserialization
-
- **Flow**: DF9 (Serialized constructor → `importlib` → `cls(**kwargs)`)
- **Description**: When `allowed_objects='all'` is used, the allowlist includes partner integrations such as `ChatOpenAI`. If an allowed class performs side effects during `__init__` (e.g., API validation calls, network probes), those side effects trigger on deserializing a crafted payload. The allowlist prevents instantiation of classes outside the list, but does not sandbox `__init__` of allowed classes.
- **Preconditions**: (1) User uses `allowed_objects='all'`; AND (2) user passes attacker-controlled serialized data. Default `allowed_objects='core'` limits to core langchain-core types (messages, documents, prompts) that have no network side effects.
-
-#### T3: XML entity expansion via `XMLOutputParser`
-
- **Flow**: DF5 (LLM output → `XMLOutputParser.parse`)
- **Description**: `XMLOutputParser` defaults to `parser="defusedxml"` but `defusedxml` is not a required dependency of langchain-core. If `defusedxml` is not installed, users encounter an `ImportError` that steers them toward setting `parser="xml"`. With `parser="xml"`, the standard library `xml.etree.ElementTree.fromstring()` processes internal DTD entity declarations, allowing expansion up to ~300KB from a small input (limited by libexpat's built-in amplification limit in Python 3.9.8+/3.10.1+). External entity resolution (classic XXE file read) is blocked by modern expat defaults. A reduced DTD bomb (5 levels or fewer) succeeds silently; 6+ levels are blocked by libexpat.
- **Preconditions**: (1) `defusedxml` is not installed; AND (2) user sets `parser="xml"`; AND (3) LLM output containing DTD declarations reaches the parser; AND (4) non-streaming `parse()` path is used (the streaming parser happens to strip the DTD preamble).
-
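Where `defusedxml` cannot be installed, an application could refuse any input that declares a DTD before handing it to the standard library parser. The following is a coarse sketch under that assumption. `parse_xml_without_dtd` is a hypothetical helper, and the pattern match can also reject harmless documents that mention these tokens inside comments or CDATA.

```python
import re
import xml.etree.ElementTree as ET

# Coarse filter: any DOCTYPE or ENTITY declaration is treated as hostile.
_DTD_PATTERN = re.compile(r"<!(DOCTYPE|ENTITY)", re.IGNORECASE)


def parse_xml_without_dtd(text: str) -> ET.Element:
    """Parse XML with the standard library, rejecting DTD declarations up front."""
    if _DTD_PATTERN.search(text):
        raise ValueError("DTD and entity declarations are not accepted")
    return ET.fromstring(text)
```

This is not a substitute for `defusedxml`; it only blocks the internal entity expansion path described in T3.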
-#### T4: DNS rebinding SSRF bypass in `validate_safe_url`
-
- **Flow**: DF4 (URL → `validate_safe_url` → downstream HTTP request)
- **Description**: `validate_safe_url` performs DNS resolution via `socket.getaddrinfo` at validation time and validates each resolved IP. However, the calling code typically performs a second DNS resolution when making the actual HTTP request (e.g., via `httpx.get()`). An attacker with DNS control can set a short TTL, return a public IP during validation, and switch to an internal address (e.g., the cloud metadata IP 169.254.169.254) for the actual request. Cloud metadata IPs are always blocked at validation time, but that check does not cover the second resolution, so the TOCTOU window between validation and request remains.
- **Preconditions**: (1) Calling code uses `validate_safe_url` but does not pin the resolved IP; AND (2) attacker controls a domain's DNS with short TTL; AND (3) the URL reaches an HTTP client that re-resolves DNS.
- **Historical context**: GHSA-2g6r-c272-w58r; SSRF protection was added post-advisory.
-
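The first T4 precondition suggests the usual mitigation: resolve once, validate the addresses, and connect to a validated address instead of re-resolving the hostname. The sketch below covers only the resolve-and-validate half, using the standard library. `resolve_public_ips` is a hypothetical helper, its use of `is_global` is a simplification, and it does not reproduce the rules in `validate_safe_url`.

```python
import ipaddress
import socket
from urllib.parse import urlsplit


def resolve_public_ips(url: str) -> list[str]:
    """Resolve a URL's host once and return its IPs only if all are public."""
    parts = urlsplit(url)
    if parts.scheme not in ("http", "https") or not parts.hostname:
        raise ValueError("only http/https URLs with a host are accepted")
    port = parts.port or (443 if parts.scheme == "https" else 80)
    try:
        infos = socket.getaddrinfo(parts.hostname, port, type=socket.SOCK_STREAM)
    except socket.gaierror as exc:
        # Fail closed when DNS resolution fails.
        raise ValueError("DNS resolution failed") from exc
    ips = []
    for info in infos:
        ip = ipaddress.ip_address(info[4][0])
        # is_global is False for private, loopback, and link-local addresses.
        if not ip.is_global:
            raise ValueError(f"{ip} is not a public address")
        ips.append(str(ip))
    return ips
```

A caller would then open the connection to one of the returned addresses (sending the original hostname for TLS and the `Host` header), so a later DNS change cannot redirect the request.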
-#### T5: Prompt injection via LLM-generated tool call arguments
-
- **Flow**: DF6 (LLM output → `BaseTool.invoke`)
- **Description**: In agentic workflows, LLM-generated tool call arguments are validated against Pydantic schemas by `BaseTool._parse_input`, but free-text string fields are not sanitized. A malicious instruction in a retrieved document, tool output, or environment variable can cause the LLM to emit tool calls designed to exfiltrate data or manipulate downstream behavior. Pydantic validates types but not semantic content.
- **Preconditions**: An agent processes untrusted external content containing adversarial instructions; the model follows those instructions; a tool with side effects is registered.
-
-#### T6: Jinja2 sandbox escape via runtime `PromptTemplate`
-
- **Flow**: DF2 (User app → `PromptTemplate` with `template_format='jinja2'`)
- **Description**: While jinja2 is blocked in deserialization (`_block_jinja2_templates`) and file-based prompt loading, it is available at runtime construction via `PromptTemplate(template_format='jinja2')`. The framework uses Jinja2's `SandboxedEnvironment` (`langchain_core/prompts/string.py:jinja2_formatter`, line 71), which blocks dunder attribute access but allows regular attribute/method calls. The docstring explicitly warns this is "best-effort" sandboxing, not a security guarantee. Known sandbox bypass techniques exist for `SandboxedEnvironment`.
- **Preconditions**: (1) User explicitly sets `template_format='jinja2'`; AND (2) user passes attacker-controlled template content; AND (3) a `SandboxedEnvironment` bypass is achievable in the deployed Jinja2 version.
-
-### Chain Analysis
-
-**T1 + T2 combined**: If an attacker controls a serialized payload and the user enables both `secrets_from_env=True` and `allowed_objects='all'`, the attacker can both exfiltrate arbitrary environment variables (T1) and trigger network side effects from allowed class constructors (T2). The exfiltrated credentials could then be sent to an attacker-controlled endpoint via a side-effecting `__init__`. However, both `secrets_from_env=True` and `allowed_objects='all'` must be explicitly enabled by the user — the default configuration prevents both attacks.
-
-No other threat chains identified within langchain-core alone. Cross-package chains (e.g., core serialization + partner init side effects) may exist but are outside this document's scope.
-
---
-
-## Input Source Coverage
-
-Maps each input source category to its data flows, threats, and validation. The "Responsibility" column reflects that users control many input paths in this open source library.
-
-| Input Source | Data Flows | Threats | Validation Points | Responsibility | Gaps |
-|-------------|-----------|---------|-------------------|----------------|------|
-| Serialized payloads (`loads`/`load`) | DF1, DF8, DF9 | T1, T2 | `langchain_core/load/load.py:Reviver.__call__`: namespace + allowlist + jinja2 blocker + Bedrock validator + escape protection | Project (framework controls allowlist defaults) | `secrets_from_env=True` with untrusted data; `allowed_objects='all'` with untrusted data |
-| User direct input (prompts, tool defs) | DF2, DF10 | T5, T6 | `langchain_core/utils/formatting.py:StrictFormatter` (blocks positional args); `langchain_core/prompts/string.py:get_template_variables` (blocks `.` and `[]` in f-string vars); Pydantic schema validation for tools | User | Users responsible for template content trust and tool implementation safety |
-| LLM output (tool calls, structured) | DF5, DF6 | T3, T5 | `langchain_core/tools/base.py:BaseTool._parse_input` (Pydantic schema); `langchain_core/output_parsers/xml.py:XMLOutputParser.parse` (defusedxml default) | User/shared | No semantic sanitization of free-text; XML DTD not blocked without defusedxml |
-| URL-sourced content | DF4 | T4 | `langchain_core/_security/_ssrf_protection.py:validate_safe_url` | Project (framework provides validation utility) | DNS rebinding TOCTOU; validation is opt-in, not automatic |
-| Configuration (env vars) | DF8 | T1 | `SecretStr` wrapper for credentials | Shared | `secrets_from_env=True` reads arbitrary env vars |
-| Filesystem paths (prompt loading) | DF3 | — | `langchain_core/prompts/loading.py:_validate_path` | Project (framework validates paths) | Deprecated; symlink resolution before suffix check mitigates bypass |
-
---
-
-## Out-of-Scope Threats
-
-Threats that appear valid in isolation but fall outside project responsibility because they depend on conditions the project does not control.
-
-| Pattern | Why Out of Scope | Project Responsibility Ends At |
-|---------|-----------------|-------------------------------|
-| Prompt injection leading to arbitrary code execution via user-registered tools | The project does not control which tools users register. A user who registers a code execution tool and uses a jailbreakable model accepts the risk. | Providing correct tool argument schemas (`langchain_core/tools/base.py:BaseTool._parse_input`); validating argument types via Pydantic |
-| API key leakage via user application logs | The project wraps API keys in `SecretStr` to prevent accidental logging by the framework itself. User logging behavior is outside the project's control. | `SecretStr` wrapping; `langchain_core/load/serializable.py:Serializable.lc_secrets` property; `langchain_core/utils/utils.py:secret_from_env` helper |
-| Malicious custom callback handler execution | Callback handlers are user-provided code. A malicious callback can do anything the Python process allows. | Providing a well-defined `BaseCallbackHandler` interface; exception handling in `langchain_core/callbacks/manager.py:handle_event` |
-| Model output containing harmful content | The project does not control model behavior, alignment, or safety filtering. | Correctly forwarding model responses without modification; providing output parser framework for structured validation |
-| Supply chain attacks on dependencies (Pydantic, PyYAML, tenacity, jsonpatch) | The project depends on these packages. Compromise of those packages is outside the project's control. | Pinning dependency versions in `pyproject.toml` and `uv.lock` |
-| Exfiltration via tool calls in agentic workflows | An agent equipped with network-capable tools (user-registered) can exfiltrate data if prompted to do so. Tool capabilities are user-controlled. | Not providing dangerous default tools (no PythonREPL, shell, or HTTP fetch tool in langchain-core) |
-| Arbitrary code execution via `RunnableLambda` with user functions | `RunnableLambda` wraps arbitrary Python callables. The wrapped function can do anything. | Providing composition primitives (`langchain_core/runnables/base.py:RunnableLambda`); users control what functions they wrap |
-| YAML deserialization attacks via prompt loading | `langchain_core/prompts/loading.py:_load_examples` uses `yaml.safe_load()` (not `yaml.load()`), preventing unsafe YAML deserialization. | Using `yaml.safe_load()` exclusively (`langchain_core/prompts/loading.py:_load_examples`, line 121) |
-
-### Rationale
-
-**Prompt injection as out-of-scope**: langchain-core is a library; users choose which models and tools to compose. The framework provides correct Pydantic schemas for tool arguments (`langchain_core/tools/base.py:BaseTool._parse_input`) and validates argument types, but cannot prevent a model from being manipulated into misusing legitimate tools. This is consistent with the industry-wide understanding that prompt injection is an application-layer concern when deploying LLM agents.
-
-**Runtime Jinja2 as a boundary case**: The project blocks jinja2 during *deserialization* (`langchain_core/load/load.py:_block_jinja2_templates`) and *file-based prompt loading* (`langchain_core/prompts/loading.py:_load_prompt`) because these are the paths where untrusted data is most likely to arrive. Runtime construction via `PromptTemplate(template_format='jinja2')` is a deliberate user choice — the framework uses `SandboxedEnvironment` and warns in docstrings that this is best-effort. This is classified as T6 (in-scope, Medium) rather than out-of-scope because the framework does provide the jinja2 execution path.
-
-**Callback data exposure**: Callback handlers receive run inputs, outputs, and metadata via `langchain_core/callbacks/manager.py:handle_event`. This data may include user PII. However, the framework's callback system is designed to pass this data: that is the feature, not a bug. Users who register callback handlers accept that those handlers receive run data.
-
---
-
-## Investigated and Dismissed
-
-Threats investigated during flaw validation that were found to be non-exploitable in the current version.
-
-| ID | Original Threat | Investigation | Evidence | Conclusion |
-|----|----------------|---------------|----------|------------|
-| D1 | Jinja2 SSTI via deserialized `PromptTemplate` (CVE path: GHSA-6qv9-48xg-fc7f) | Traced full deserialization path: `loads()` → `Reviver.__call__()` → `init_validator` → `default_init_validator` → `_block_jinja2_templates`. Checked whether `init_validator=None` could be passed to bypass. | `langchain_core/load/load.py:_block_jinja2_templates` (line 177); `langchain_core/load/load.py:default_init_validator` (line 208); default `init_validator=default_init_validator` in function signatures | Jinja2 check fires before `cls(**kwargs)` is called. Overriding with `init_validator=None` removes the check but requires the caller to explicitly opt out. Non-exploitable with default settings. |
-| D2 | Path traversal in `load_prompt()` via `template_path` (GHSA-qh6h-p6c9-ff54) | Reviewed `langchain_core/prompts/loading.py:_load_template`, `_validate_path`. Both `load_prompt` and `load_prompt_from_config` deprecated since v1.2.21 with `allow_dangerous_paths=False` default. | `langchain_core/prompts/loading.py:_validate_path` (line 21); `langchain_core/prompts/loading.py:load_prompt` (deprecated since 1.2.21); `_load_template` resolves symlinks at line 101 before suffix check | Patched in v1.2.21. Current code raises `ValueError` for absolute paths and `..` traversal by default. Symlink resolution happens before suffix validation. Not exploitable with default settings. |
-| D3 | F-string template injection via attribute access (e.g., `{input.__class__}`) | Reviewed `langchain_core/prompts/string.py:get_template_variables` and `langchain_core/utils/formatting.py:StrictFormatter`. | `langchain_core/prompts/string.py:get_template_variables` (lines 284-306): blocks variables containing `.`, `[`, `]`, and all-digit names. `langchain_core/utils/formatting.py:StrictFormatter.vformat` (lines 23-48): rejects positional arguments. | F-string attribute access, indexing, and positional arguments are all blocked. Not exploitable. |
-| D4 | XXE (external entity file read) via `XMLOutputParser` with `parser='xml'` | Tested standard library `xml.etree.ElementTree.fromstring()` with `<!ENTITY xxe SYSTEM "file:///etc/passwd">` payload. | Modern Python expat (3.9.8+/3.10.1+) does not resolve `SYSTEM` external entities in `fromstring()`. Returns `ParseError: undefined entity`. | External entity resolution is blocked by default in modern expat. Not exploitable for file read. Internal entity expansion (T3) remains a separate, verified concern. |
-
---
-
-## Revision History
-
-| Date | Author | Changes |
-|------|--------|---------|
-| 2026-04-08 | langster-threat-model (deep mode, commit d3e60f5c03) | Initial langchain-core focused threat model: 11 components, 7 data classifications (2 Critical, 3 High, 1 Medium, 1 Low; details for all Critical/High entries), 6 trust boundaries, 10 data flows, 6 threats (1 High verified, 5 Medium), 8 out-of-scope patterns, 4 investigated and dismissed. |
--- a/.github/THREAT_MODEL_V1.md
+++ b/.github/THREAT_MODEL_V1.md
@@ -1,333 +0,0 @@
-# Threat Model: langchain (langchain_v1)
-
-> Generated: 2026-04-08 | Commit: d3e60f5c03 | Scope: libs/langchain_v1/ (langchain v1.2.15) | Visibility: Open Source | Mode: deep
-
-> **Disclaimer:** This threat model is automatically generated to help developers and security researchers understand where trust is placed in this system and where boundaries exist. It is experimental, subject to change, and not an authoritative security reference -- findings should be validated before acting on them. The analysis may be incomplete or contain inaccuracies. We welcome suggestions and corrections to improve this document.
-
-For vulnerability reporting, see [GitHub Security Advisories](https://github.com/langchain-ai/langchain/security/advisories/new).
-
-See also: the [langchain-core threat model](THREAT_MODEL_CORE.md) for the base abstractions layer.
-
---
-
-## Scope
-
-### In Scope
-
- `libs/langchain_v1/langchain/agents/` -- Agent factory (`create_agent`), agent middleware framework, middleware types
- `libs/langchain_v1/langchain/agents/middleware/` -- All shipped middleware: `ShellToolMiddleware`, `FilesystemFileSearchMiddleware`, `PIIMiddleware`, `HumanInTheLoopMiddleware`, `ContextEditingMiddleware`, `SummarizationMiddleware`, `LLMToolEmulator`, `TodoListMiddleware`, `LLMToolSelectorMiddleware`, `ToolCallLimitMiddleware`, `ModelCallLimitMiddleware`, `ModelFallbackMiddleware`, `ModelRetryMiddleware`, `ToolRetryMiddleware`
- `libs/langchain_v1/langchain/agents/middleware/_execution.py` -- Execution policies: `HostExecutionPolicy`, `DockerExecutionPolicy`, `CodexSandboxExecutionPolicy`
- `libs/langchain_v1/langchain/agents/middleware/_redaction.py` -- PII detection and redaction engine
- `libs/langchain_v1/langchain/chat_models/base.py` -- `init_chat_model` factory with dynamic `importlib` loading
- `libs/langchain_v1/langchain/embeddings/base.py` -- `init_embeddings` factory with dynamic `importlib` loading
- `libs/langchain_v1/langchain/agents/structured_output.py` -- Structured output strategies (ToolStrategy, ProviderStrategy)
- `libs/langchain_v1/langchain/tools/tool_node.py` -- Tool node re-exports from LangGraph
-
-### Out of Scope
-
- `libs/core/` -- langchain-core base abstractions (separate threat model at `.github/THREAT_MODEL_CORE.md`)
- `libs/partners/` -- Partner integration packages (separate per-partner threat surface)
- `libs/langchain/` -- langchain-classic legacy package
- `libs/text-splitters/` -- Document chunking utilities
- `libs/standard-tests/` -- Test harnesses; not shipped attack surface
- `tests/` -- Unit and integration tests (read during analysis; not threat-modeled)
- User application code, model selection, custom tools, custom callbacks -- user-controlled
- LLM model behavior -- the project cannot guarantee model safety across all models users may select
- LangGraph internals -- separate product and repository; langchain_v1 depends on LangGraph but does not own its code
- Deployment infrastructure -- users control hosting, network topology, and secrets management
-
-### Assumptions
-
-1. The project is used as a library/framework -- users control their own application code, model selection, and deployment infrastructure.
-2. `ShellToolMiddleware` is an opt-in middleware that grants the agent explicit shell access by design; users who add it accept that the agent can execute arbitrary commands within the configured execution policy.
-3. `FilesystemFileSearchMiddleware` is an opt-in middleware; the `root_path` is set by the deployer to confine filesystem access.
-4. `HumanInTheLoopMiddleware` assumes the interrupt/resume boundary (LangGraph `interrupt()`) is trusted infrastructure; the human reviewer is a trusted party.
-5. API keys are managed by partner integrations in langchain-core via `SecretStr`; langchain_v1 does not directly handle credentials.
-6. The `create_agent` function delegates to LangGraph for graph compilation and execution; LangGraph's own security properties are inherited, not verified here.
-
---
-
-## System Overview
-
-`langchain` (v1.2.15, published as the `langchain` PyPI package from `libs/langchain_v1/`) is the actively maintained implementation layer of the LangChain Python ecosystem. It provides `create_agent` -- a high-level factory for building LLM-powered tool-calling agents -- along with a composable middleware system that intercepts and modifies agent behavior at model call, tool call, and lifecycle boundaries. Key shipped middleware includes shell command execution, filesystem search, PII redaction, human-in-the-loop approval, context window management, and rate limiting. The package depends on `langchain-core` (base abstractions) and `langgraph` (graph execution engine).
-
-### Architecture Diagram
-
-```
-+----------------------------------------------------------------------+
-|                          User Application                            |
-|                                                                      |
-|  User Code ---> create_agent(model, tools, middleware)               |
-|                      |                                               |
-|                      v                                               |
-|               LangGraph StateGraph                                   |
-|                      |                                               |
-|          +-----------+-----------+                                   |
-|          |                       |                                   |
-|          v                       v                                   |
-|    [model node]            [tools node]                              |
-|    Middleware hooks:       ToolNode dispatch:                         |
-|    before_model            wrap_tool_call                            |
-|    wrap_model_call         tool execution                            |
-|    after_model                   |                                   |
-|          |              +--------+--------+                          |
-|          |              |        |        |                          |
-| - - - - -|- - - - - - -|- - - - | - - - -|- - - - - TB1 - - - - -  |
-|          v              v        v        v                          |
-|    External LLM    ShellSession  FS     HITL                         |
-|    Provider API    (C2 via C3)  Search  interrupt                    |
-|                         |       (C4)    (C5)                         |
-|                  - - - -|- - - TB3 - - - - - -                       |
-|                         v                                            |
-|                   OS subprocess                                      |
-|                   (HostExec / Docker / Codex)                        |
-+----------------------------------------------------------------------+
-```
-
-> The diagram marks only TB1 and TB3; all five trust boundaries (TB1-TB5) are described in the Trust Boundaries section below.
-
---
-
-## Components
-
-| ID | Component | Description | Trust Level | Default? | Entry Points |
-|----|-----------|-------------|-------------|----------|--------------|
-| C1 | Agent Factory | `create_agent` -- assembles a LangGraph `StateGraph` from model, tools, and middleware; composes middleware hooks into chained handlers | framework-controlled | Yes (when `create_agent` called) | `factory.py:create_agent` |
-| C2 | Shell Tool Middleware | Persistent interactive bash session with configurable execution policies; writes LLM-generated commands to bash stdin | framework-controlled (shell infra) / user-controlled (execution policy selection) | No (opt-in middleware) | `shell_tool.py:ShellToolMiddleware.__init__`, `shell_tool.py:ShellSession.execute` |
-| C3 | Execution Policies | `HostExecutionPolicy` (bare subprocess), `DockerExecutionPolicy` (container isolation), `CodexSandboxExecutionPolicy` (Codex sandbox) | user-controlled (policy selection) | No (opt-in; `HostExecutionPolicy` is default when `ShellToolMiddleware` is used without specifying a policy) | `_execution.py:HostExecutionPolicy.spawn`, `_execution.py:DockerExecutionPolicy.spawn`, `_execution.py:CodexSandboxExecutionPolicy.spawn` |
-| C4 | File Search Middleware | Glob and grep search over local filesystem within a user-configured `root_path`; uses ripgrep with Python fallback | user-controlled (root_path, patterns) | No (opt-in middleware) | `file_search.py:FilesystemFileSearchMiddleware.__init__` (creates `glob_search` and `grep_search` tools) |
-| C5 | Human-in-the-Loop Middleware | Interrupts agent execution for human review of tool calls; supports approve/edit/reject decisions | framework-controlled (interrupt mechanism) / user-controlled (decision content) | No (opt-in middleware) | `human_in_the_loop.py:HumanInTheLoopMiddleware.after_model` |
-| C6 | PII Middleware | Detects and redacts PII (email, credit card, IP, MAC, URL) in message content using regex-based detectors; supports redact/mask/hash/block strategies | framework-controlled | No (opt-in middleware) | `pii.py:PIIMiddleware.before_model`, `pii.py:PIIMiddleware.after_model` |
-| C7 | Context Editing Middleware | Prunes tool use history from conversation when token limits are exceeded; operates on deep copies | framework-controlled | No (opt-in middleware) | `context_editing.py:ContextEditingMiddleware.wrap_model_call` |
-| C8 | Summarization Middleware | Summarizes older conversation history when token/message limits are approached; replaces old messages with a summary | framework-controlled | No (opt-in middleware) | `summarization.py:SummarizationMiddleware.before_model` |
-| C9 | Chat Model Factory | `init_chat_model` -- dynamic provider loading via `importlib.import_module` from a hardcoded provider registry | framework-controlled | Yes (when string model names used) | `chat_models/base.py:init_chat_model` |
-| C10 | Embeddings Factory | `init_embeddings` -- dynamic provider loading via `importlib.import_module` from a hardcoded provider registry | framework-controlled | Yes (when string model names used) | `embeddings/base.py:init_embeddings` |
-| C11 | Middleware Type System | Base `AgentMiddleware` class, `ModelRequest`/`ModelResponse`/`ToolCallRequest` data types, hook decorators, state schemas | framework-controlled | Yes | `types.py:AgentMiddleware`, `types.py:ModelRequest`, `types.py:AgentState` |
-| C12 | Structured Output | `ToolStrategy`, `ProviderStrategy`, `AutoStrategy` for enforcing structured LLM responses; Pydantic-based parsing | framework-controlled | No (opt-in via `response_format`) | `structured_output.py:ToolStrategy`, `structured_output.py:ProviderStrategy` |
-
---
-
-## Data Classification
-
-Classifies all sensitive data types found in the codebase with their sensitivity levels, storage locations, and retention policies.
-
-| ID | Data Category | Specific Fields | Sensitivity | Storage Location(s) | Encrypted at Rest | Retention | Regulatory |
-|----|-------------|----------------|-------------|---------------------|-------------------|-----------|------------|
-| DC1 | Shell commands and output | `_ShellToolInput.command`, `CommandExecutionResult.output` | High | In-memory (transient); written to bash stdin pipe | N/A | Process lifetime; output returned to LLM | N/A (may contain arbitrary data) |
-| DC2 | Filesystem paths and content | `FilesystemFileSearchMiddleware.root_path`, glob/grep results including file content | Medium | Host filesystem (read-only by middleware); in-memory results | N/A | Transient | N/A (depends on file content) |
-| DC3 | LLM conversation state | `AgentState.messages` (HumanMessage, AIMessage, ToolMessage content) | High | In-memory; LangGraph checkpointer (if configured) | N/A (framework does not persist) | Checkpointer-dependent | GDPR, CCPA (when containing PII) |
-| DC4 | HITL decision payloads | `HITLRequest`, `HITLResponse`, `EditDecision.edited_action` (tool name + args) | Medium | In-memory; LangGraph interrupt/resume state | N/A | Transient | N/A |
-| DC5 | PII detection results | `PIIMatch.value` (matched PII content), redacted output | High | In-memory (transient) | N/A | Transient | GDPR, CCPA |
-| DC6 | Subprocess environment | `env` dict passed to execution policies; may contain API keys or secrets | Critical | OS process environment; Docker `-e` flags | N/A | Process lifetime | All |
-| DC7 | Agent execution metadata | Tool call counts (`ToolCallLimitState`), model call counts (`ModelCallLimitState`), conversation summaries | Low | LangGraph state (checkpointer-dependent) | N/A | Checkpointer-dependent | N/A |
-
-### Data Classification Details
-
-#### DC1: Shell commands and output
-
- **Fields**: `_ShellToolInput.command` (LLM-generated string), `CommandExecutionResult.output` (shell stdout/stderr).
- **Storage**: In-memory only within langchain_v1. Commands are written directly to bash stdin; output is collected via pipe reader threads and returned as `ToolMessage` content.
- **Access**: Read by `shell_tool.py:ShellToolMiddleware._run_shell_tool` (dispatches command); `shell_tool.py:ShellSession.execute` (writes to stdin). Output read by `_collect_output`.
- **Encryption**: N/A (in-memory, piped to subprocess).
- **Retention**: Transient -- garbage-collected when `ToolMessage` goes out of scope or conversation is pruned.
- **Logging exposure**: `shell_tool.py:ShellToolMiddleware._run_shell_tool` logs the raw command string at INFO level. Output is logged only if operator configures verbose logging.
- **Gaps**: Commands are logged in plaintext. If commands contain secrets (e.g., `export API_KEY=...`), they appear in application logs. Redaction rules apply to output only, not to command input.
-
-#### DC6: Subprocess environment
-
- **Fields**: `env` dict passed to `ShellToolMiddleware.__init__`, forwarded to `BaseExecutionPolicy.spawn`.
- **Storage**: OS process environment for `HostExecutionPolicy`; Docker `-e K=V` flags for `DockerExecutionPolicy`.
- **Access**: `_execution.py:HostExecutionPolicy.spawn` passes `env` to `subprocess.Popen`. `_execution.py:DockerExecutionPolicy.spawn` iterates env as `-e` flags. No filtering or sanitization of keys or values.
- **Encryption**: N/A (environment variables are plaintext).
- **Retention**: Process lifetime of the subprocess.
- **Logging exposure**: Not logged by default. However, commands executed within the shell can read and exfiltrate env vars (e.g., `env`, `printenv`).
- **Gaps**: **Critical**: If the operator passes API keys or secrets in the `env` dict, any command executing in the shell can read them. The framework does not filter, warn, or redact environment variable content. For `DockerExecutionPolicy`, `_execution.py:DockerExecutionPolicy.spawn` also copies `os.environ` for the Docker CLI process itself (the host process running `docker run`).
-
---
-
-## Trust Boundaries
-
-| ID | Boundary | Description | Controls (Inside) | Does NOT Control (Outside) |
-|----|----------|-------------|-------------------|---------------------------|
-| TB1 | User application / deployer <-> agent framework | Configuration boundary where the deployer selects model, tools, middleware, and policies | Middleware composition, execution policy enforcement, tool registration, structured output validation, model provider loading from hardcoded registry | Which middleware the user enables, what tools the user registers, what execution policy the user selects, what `root_path` or `env` the user configures |
-| TB2 | Framework <-> external LLM provider API | HTTPS API boundary; inherited from langchain-core partner integrations | Request formatting via `init_chat_model` (C9); model is bound via provider registry; API key handling delegated to partner packages | Model behavior, LLM response content, tool call argument semantics |
-| TB3 | Framework <-> shell subprocess | Process boundary between the Python agent and the bash shell session | Execution policy selection (`_execution.py`), command timeout enforcement, output line/byte truncation, process group management, output redaction (post-execution) | Content of commands written to bash stdin (no validation); behavior of executed commands; filesystem/network access within the policy's isolation scope |
-| TB4 | Framework <-> filesystem (file search) | Filesystem access boundary via `FilesystemFileSearchMiddleware` | Path traversal prevention (`file_search.py:FilesystemFileSearchMiddleware._validate_and_resolve_path`): `..` and `~` blocking, `resolve()` + `relative_to()` containment check on user-supplied base path; file size limits; ripgrep subprocess with no `--follow` flag | Content of files within `root_path`; symbolic link targets discovered during glob/rglob traversal (per-file containment not checked in Python fallback); filesystem permissions |
-| TB5 | Framework <-> human reviewer (HITL) | LangGraph `interrupt()` boundary where agent execution pauses for human decision | Interrupt trigger (which tools require review), decision type gating (`allowed_decisions`), decision count validation | Content of human edit decisions (tool name and args are unconstrained); whether the edited tool name exists in the agent's tool registry; schema validity of edited args |
-
-### Boundary Details
-
-#### TB1: User application / deployer <-> agent framework
-
- **Inside**: `factory.py:create_agent` composes the middleware stack, binds tools to `ToolNode`, validates no duplicate middleware. `chat_models/base.py:init_chat_model` loads providers only from `_BUILTIN_PROVIDERS` hardcoded registry via `importlib.import_module`. `embeddings/base.py:init_embeddings` uses the same pattern.
- **Outside**: All middleware is opt-in. The deployer chooses which middleware to enable and how to configure it. Dangerous middleware (`ShellToolMiddleware`) with a permissive default policy (`HostExecutionPolicy`) is the deployer's explicit choice.
- **Crossing mechanism**: Python function calls to `create_agent` and middleware constructors.
-
-#### TB3: Framework <-> shell subprocess
-
- **Inside**: `_execution.py:_launch_subprocess` uses `subprocess.Popen` with list arguments (no `shell=True`). `HostExecutionPolicy` optionally applies CPU/memory `prlimit`. `DockerExecutionPolicy` adds `--network none`, `--rm`, optional `--read-only`, workspace bind-mount. `shell_tool.py:ShellSession` enforces command timeout with session restart, output truncation via `max_output_lines`/`max_output_bytes`.
- **Outside**: Commands written to bash stdin are not validated, escaped, filtered, or allowlisted. The bash process interprets all shell metacharacters (`;`, `&&`, `||`, `|`, `$()`, backticks, redirects). `HostExecutionPolicy` provides no filesystem or network sandboxing. Output redaction via PII rules is post-execution only.
- **Crossing mechanism**: `shell_tool.py:ShellSession.execute` writes command string to `self._stdin` (pipe to bash process).
-
-#### TB4: Framework <-> filesystem (file search)
-
- **Inside**: `file_search.py:FilesystemFileSearchMiddleware._validate_and_resolve_path` resolves the user-supplied path with `Path.resolve()` (follows symlinks), then checks `resolved.relative_to(self.root_path)`. The `root_path` itself is resolved at init time. `..` and `~` are blocked in the raw path string. Ripgrep subprocess uses `--` to prevent flag injection and does not pass `--follow` (no symlink following).
- **Outside**: When the Python fallback (`_python_search`) is active, `Path.rglob("*")` follows directory symlinks by default. Individual files discovered by rglob are not re-validated through `_validate_and_resolve_path`. `file_path.read_text()` follows symlinks to read content of files whose targets may be outside `root_path`.
- **Crossing mechanism**: Python `Path.glob()`, `Path.rglob()`, `Path.read_text()`, `subprocess.run(["rg", ...])`.
-
-#### TB5: Framework <-> human reviewer (HITL)
-
- **Inside**: `human_in_the_loop.py:HumanInTheLoopMiddleware.after_model` checks tool calls against `self.interrupt_on`, builds `HITLRequest` with `ActionRequest` and `ReviewConfig`, calls `langgraph.types.interrupt()`. Validates that `len(decisions) == len(interrupted_tool_calls)`. Validates decision type is in `allowed_decisions`.
- **Outside**: The `EditDecision.edited_action` allows the human to set any `name` (string) and any `args` (dict). No validation checks the edited name against the agent's registered tool list. The `args_schema` field in `InterruptOnConfig` is declared but never read or enforced. The policy lookup for edit processing uses the *original* tool name's config, not the edited tool name's config.
- **Crossing mechanism**: LangGraph `interrupt()` suspend/resume.
-
---
-
-## Data Flows
-
-| ID | Source | Destination | Data Type | Classification | Crosses Boundary | Protocol |
-|----|--------|-------------|-----------|----------------|------------------|----------|
-| DF1 | User application | C1 Agent Factory (`create_agent`) | Model config, tools, middleware, system prompt | -- | TB1 | Python function call |
-| DF2 | C1 Agent Factory -> C9/C10 | External LLM provider (via partner SDK) | Messages (DC3), API credentials | DC3 | TB2 | HTTPS (via partner SDK) |
-| DF3 | External LLM provider | C1 Agent Factory (model node) | LLM response, tool call arguments | DC3 | TB2 | HTTPS (via partner SDK) |
-| DF4 | C1 Agent Factory (model node) | C2 Shell Tool Middleware -> C3 Execution Policy | LLM-generated shell command string, env dict | DC1, DC6 | TB3 | Python -> stdin pipe |
-| DF5 | C3 Execution Policy (bash process) | C2 Shell Tool Middleware | Command stdout/stderr, exit code | DC1 | TB3 | stdout/stderr pipe |
-| DF6 | C1 Agent Factory (model node) | C4 File Search Middleware -> filesystem | Glob/grep patterns, base path | DC2 | TB4 | Python/ripgrep |
-| DF7 | Filesystem | C4 File Search Middleware | File paths, file content (grep results) | DC2 | TB4 | Python file I/O, ripgrep JSON |
-| DF8 | C1 Agent Factory (after_model hook) | C5 HITL Middleware -> human reviewer | HITLRequest (tool calls for review) | DC4 | TB5 | LangGraph interrupt |
-| DF9 | Human reviewer | C5 HITL Middleware -> C1 Agent Factory | HITLResponse (approve/edit/reject decisions) | DC4 | TB5 | LangGraph resume |
-| DF10 | C6 PII Middleware | Agent state (messages) | Redacted message content, PIIMatch results | DC5 | -- | In-memory state update |
-| DF11 | C7/C8 Context/Summarization Middleware | Agent state (messages) | Pruned/summarized conversation history | DC3, DC7 | -- | In-memory state update |
-
-### Flow Details
-
-#### DF4: LLM-generated command -> Shell subprocess
-
- **Data**: Raw command string from `_ShellToolInput.command`; env dict from middleware configuration.
- **Validation**: `_ShellToolInput.validate_payload` checks mutual exclusion of `command`/`restart` only. `shell_tool.py:ShellToolMiddleware._run_shell_tool` checks `not command or not isinstance(command, str)`. **No content validation, escaping, allowlisting, or denylisting.** The string is written verbatim to bash stdin.
- **Trust assumption**: The command is generated by the LLM and is therefore **untrusted**. The execution policy is the sole isolation mechanism.
-
-#### DF7: Filesystem -> File Search Middleware (Python fallback)
-
- **Data**: File paths discovered by `Path.rglob("*")`, file content read by `Path.read_text()`.
- **Validation**: Base path is validated via `_validate_and_resolve_path`. Individual files from rglob are **not** validated -- their path strings are children of the validated base, but symlink targets may be outside `root_path`.
- **Trust assumption**: Files within `root_path` are assumed safe to read. The middleware does not anticipate symlinks inside `root_path` that point to targets outside it.
-
-#### DF9: Human reviewer -> HITL Middleware
-
- **Data**: `HITLResponse` containing `Decision` objects. `EditDecision` carries `edited_action` with `name` (str) and `args` (dict).
- **Validation**: Decision count is validated. Decision type is checked against `allowed_decisions`. **No validation of edited tool name or args content.**
- **Trust assumption**: The human reviewer is a trusted party. However, the middleware does not distinguish between a legitimate human edit and a compromised/malicious client submitting the resume payload.
-
---
-
-## Threats
-
-| ID | Data Flow | Classification | Threat | Boundary | Severity | Validation | Code Reference |
-|----|-----------|----------------|--------|----------|----------|------------|----------------|
-| T1 | DF4 | DC1, DC6 | Unrestricted shell command execution via `HostExecutionPolicy` default -- LLM-generated commands are written verbatim to bash stdin with no validation, escaping, or sandboxing | TB3 | High | Verified | `shell_tool.py:ShellSession.execute`, `shell_tool.py:ShellToolMiddleware._run_shell_tool`, `_execution.py:HostExecutionPolicy.spawn` |
-| T2 | DF4 | DC6 | Environment variable exfiltration from shell subprocess -- commands can read all env vars passed to the subprocess; operator-supplied secrets in `env` dict are accessible | TB3 | Medium | Verified | `_execution.py:HostExecutionPolicy.spawn`, `shell_tool.py:ShellToolMiddleware.__init__` |
-| T3 | DF7 | DC2 | Symlink-following file read outside `root_path` in Python fallback search -- `_python_search` uses `rglob("*")` which follows symlinks; `read_text()` reads content without per-file containment check | TB4 | Medium | Verified | `file_search.py:FilesystemFileSearchMiddleware._python_search` |
-| T4 | DF6, DF7 | DC2 | Filesystem structure disclosure via symlink following in `glob_search` -- `Path.glob()` follows directory symlinks, disclosing filenames and metadata outside `root_path` | TB4 | Low | Verified | `file_search.py:FilesystemFileSearchMiddleware.__init__` (glob_search closure) |
-| T5 | DF9 | DC4 | HITL edit decision allows arbitrary tool redirection -- edited tool name and args are not validated against the agent's registered tool list or any schema | TB5 | Medium | Verified | `human_in_the_loop.py:HumanInTheLoopMiddleware._process_decision`, `human_in_the_loop.py:HumanInTheLoopMiddleware.after_model` |
-| T6 | DF6 | -- | ReDoS via user/LLM-supplied regex in `grep_search` Python fallback and custom PII detectors -- no timeout or complexity limit on regex patterns | TB4 | Low | Likely | `file_search.py:FilesystemFileSearchMiddleware.__init__` (grep_search closure), `_redaction.py:resolve_detector` |
-| T7 | DF4 | DC1 | Shell command logging in plaintext -- `_run_shell_tool` logs raw command at INFO level; commands containing secrets appear in application logs | TB3 | Low | Verified | `shell_tool.py:ShellToolMiddleware._run_shell_tool` |
-| T8 | DF3 -> DF4 | DC1 | Prompt injection escalation via shell tool -- LLM processes untrusted content (retrieved documents, tool outputs) that instructs it to execute malicious shell commands | TB2, TB3 | High | Unverified | `shell_tool.py:ShellSession.execute` (sink), `factory.py:create_agent` (agent loop) |
-
-### Threat Details
-
-#### T1: Unrestricted shell command execution via `HostExecutionPolicy`
-
- **Flow**: DF4 (LLM tool call -> `_run_shell_tool` -> `ShellSession.execute` -> bash stdin)
- **Description**: When `ShellToolMiddleware` is used with the default `HostExecutionPolicy`, LLM-generated commands are written verbatim to bash stdin. The complete validation surface is: (1) `_ShellToolInput.validate_payload` checks mutual exclusion of `command`/`restart`; (2) `_run_shell_tool` checks `not command or not isinstance(command, str)`. No content inspection occurs. Shell metacharacters (`;`, `&&`, `||`, `|`, `$()`, backticks, redirects, embedded newlines) are passed directly to bash. The bash binary is launched as `/bin/bash` with no restricted-mode flags (`-r`). `HostExecutionPolicy` provides no filesystem or network sandboxing; only optional CPU/memory `prlimit` limits (off by default).
- **Preconditions**: (1) User enables `ShellToolMiddleware` (opt-in); (2) user uses `HostExecutionPolicy` (default when no policy specified); (3) the LLM generates a command with shell metacharacters or malicious intent.
-
-#### T2: Environment variable exfiltration from shell subprocess
-
- **Flow**: DF4 (env dict -> execution policy -> subprocess environment)
- **Description**: The `env` dict passed to `ShellToolMiddleware.__init__` is forwarded to `BaseExecutionPolicy.spawn` without filtering. For `HostExecutionPolicy`, it becomes the subprocess environment via `subprocess.Popen(env=...)`. For `DockerExecutionPolicy`, each key-value pair becomes a `-e K=V` Docker flag. Commands executing in the shell can read all environment variables (e.g., `env`, `printenv`, `echo $SECRET_KEY`). If the operator passes API keys or secrets in the env dict, any LLM-generated or agent-executed command can access them.
- **Preconditions**: (1) User passes secrets in the `env` dict to `ShellToolMiddleware`; (2) an LLM-generated command reads environment variables.
-
-#### T3: Symlink-following file read outside `root_path` in Python fallback
-
- **Flow**: DF7 (filesystem -> `_python_search` -> `rglob` -> `read_text`)
- **Description**: `_python_search` validates only the user-supplied base path via `_validate_and_resolve_path`. Individual files discovered by `Path.rglob("*")` are not re-validated. Python's `rglob` follows directory symlinks by default. `Path.read_text()` follows file symlinks. If a symlink inside `root_path` points to a file or directory outside `root_path`, the target's content is read and returned to the agent. The ripgrep backend is not affected (no `--follow` flag), so this only occurs when: (a) `use_ripgrep=False`, (b) ripgrep is not installed, or (c) ripgrep times out (triggering the Python fallback).
- **Preconditions**: (1) A symlink inside `root_path` points outside; (2) the Python fallback search is active (ripgrep unavailable, disabled, or timed out); (3) the agent issues a grep/glob pattern that matches the symlink.
-
-#### T5: HITL edit decision allows arbitrary tool redirection
-
- **Flow**: DF9 (human reviewer -> `_process_decision` -> revised `ToolCall`)
- **Description**: When a human returns an `EditDecision`, the middleware constructs a new `ToolCall` from `edited_action["name"]` and `edited_action["args"]` with no validation. The `name` field is an unconstrained `str` -- it is not checked against `self.interrupt_on`, the agent's registered tool list, or any allowlist. The `args` field is `dict[str, Any]` with no schema validation. The `args_schema` field in `InterruptOnConfig` is declared in the type definition but never read or enforced in the implementation. The policy lookup at `after_model` uses the *original* tool name's config, not the edited name's config.
- **Preconditions**: (1) `HumanInTheLoopMiddleware` is configured with `"edit"` in `allowed_decisions` for at least one tool; (2) the human (or a compromised client submitting the resume payload) provides an `EditDecision` with a different tool name.
-
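The missing validation in T5 could take roughly the following shape. This is a sketch over plain dicts under stated assumptions: `validate_edited_action` is a hypothetical function, and the policy it enforces (the edit may change arguments but not the tool) is one possible choice, not the behavior of `HumanInTheLoopMiddleware`.

```python
from typing import Any


def validate_edited_action(
    edited_action: dict[str, Any],
    original_name: str,
    registered_tools: set[str],
) -> None:
    """Raise ValueError if an edited tool call is malformed or redirected."""
    name = edited_action.get("name")
    args = edited_action.get("args")
    if not isinstance(name, str) or name not in registered_tools:
        raise ValueError(f"edited tool {name!r} is not a registered tool")
    if name != original_name:
        # Assumed policy: an edit may revise arguments, not switch tools.
        raise ValueError(
            f"edit changes tool from {original_name!r} to {name!r}"
        )
    if not isinstance(args, dict):
        raise ValueError("edited args must be a dict")


# Example: an argument-only edit passes, a redirect to another tool fails.
tools = {"search", "shell"}
validate_edited_action({"name": "search", "args": {"q": "docs"}}, "search", tools)
try:
    validate_edited_action({"name": "shell", "args": {"cmd": "id"}}, "search", tools)
except ValueError as exc:
    print(f"rejected: {exc}")
```

A stricter variant would also validate `args` against the target tool's schema, which is what the unused `args_schema` field suggests was intended.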
-#### T8: Prompt injection escalation via shell tool
-
- **Flow**: DF3 -> DF4 (LLM processes untrusted content -> generates shell command)
- **Description**: In agentic workflows, the LLM may process untrusted external content (retrieved documents, tool outputs, web pages) that contains adversarial instructions. If the agent has `ShellToolMiddleware` enabled, a successful prompt injection can escalate to arbitrary shell command execution. This is the standard prompt injection escalation path for agents with shell access, amplified by the lack of command validation at TB3.
- **Preconditions**: (1) Agent processes untrusted external content; (2) the model follows adversarial instructions; (3) `ShellToolMiddleware` is enabled. All three conditions must be true.
-
-### Chain Analysis
-
-**T8 = T1 + prompt injection**: The combination of unrestricted shell access (T1) with prompt injection via untrusted content creates a critical escalation path. Individually, T1 is Medium-to-High (requires LLM to generate malicious commands) and prompt injection is an inherent LLM risk. Together, they form a path from untrusted document content to arbitrary code execution with full host access when `HostExecutionPolicy` is used.
-
-**T3 + T6**: If an attacker can cause ripgrep to time out (e.g., via a very large directory tree or a slow filesystem), the Python fallback activates, enabling symlink-following file reads (T3). A separate ReDoS attack (T6) in the Python fallback could cause additional denial of service. However, these compose to DoS + information disclosure rather than escalating severity.
-
-No other threat chains identified within langchain_v1 scope.
-
---
-
-## Input Source Coverage
-
-Maps each input source category to its data flows, threats, and validation. The "Responsibility" column reflects that users control many input paths in this open source library.
-
-| Input Source | Data Flows | Threats | Validation Points | Responsibility | Gaps |
-|-------------|-----------|---------|-------------------|----------------|------|
-| LLM output (tool call arguments) | DF3, DF4, DF6 | T1, T2, T8 | `_ShellToolInput.validate_payload` (presence check only); `_validate_and_resolve_path` (file search paths); Pydantic schema on tool args (type only, no semantic validation) | User (chooses model, registers tools) / Project (provides shell tool with no command validation) | No command content validation in shell tool; no semantic validation of LLM-generated tool args |
-| Filesystem content (symlink targets) | DF7 | T3, T4 | `_validate_and_resolve_path` (base path only); ripgrep no-follow default | Project (provides file search with containment check) | Python fallback `rglob` follows symlinks without per-file containment check |
-| Human reviewer decisions (HITL) | DF9 | T5 | Decision count validation; decision type check (`allowed_decisions`) | Shared (project provides gating; human controls content) | No validation of edited tool name or args; `args_schema` declared but not enforced |
-| User/LLM-supplied regex patterns | DF6 | T6 | `re.compile()` for syntax validation; ripgrep has built-in regex engine limits | User (supplies patterns) | No complexity/timeout limit on Python regex in fallback path; custom PII detector regex not validated for backtracking |
-| Deployer configuration (env dict) | DF4 | T2 | `_normalize_env()` coerces values to str; no content filtering | User (controls env dict content) | No warning or filtering of secret-like env vars |
-| Deployer configuration (model string) | DF2 | -- | `_BUILTIN_PROVIDERS` hardcoded registry allowlist in `init_chat_model` and `init_embeddings` | Project (controls provider registry) | None -- provider names are hardcoded; `importlib.import_module` only loads from known module paths |
-
---
-
-## Out-of-Scope Threats
-
-Threats that appear valid in isolation but fall outside project responsibility because they depend on conditions the project does not control.
-
-| Pattern | Why Out of Scope | Project Responsibility Ends At |
-|---------|-----------------|-------------------------------|
-| Arbitrary code execution via LLM-directed shell commands when `ShellToolMiddleware` is explicitly enabled | `ShellToolMiddleware` is opt-in and designed to give the agent shell access. Users who enable it accept that the LLM can execute commands. The project's responsibility is providing execution policy options with documented isolation guarantees. | Providing `DockerExecutionPolicy` (container isolation) and `CodexSandboxExecutionPolicy` (syscall filtering) as alternatives to `HostExecutionPolicy`; documenting that `HostExecutionPolicy` provides no sandboxing |
-| Prompt injection leading to tool misuse in agentic workflows | The project does not control model selection, prompt construction, or what tools users register. Prompt injection is an inherent LLM risk. | Providing `HumanInTheLoopMiddleware` for tool call approval; providing `ToolCallLimitMiddleware` and `ModelCallLimitMiddleware` for execution limits; Pydantic schema validation on tool arguments |
-| Data exfiltration via user-registered tools | Users register custom tools with `create_agent`. A tool with network access can exfiltrate data if the LLM is manipulated. Tool capabilities are user-controlled. | Not shipping dangerous default tools; providing middleware hooks (`wrap_tool_call`) for custom tool call interception |
-| PII leakage via user application logging of message content | The framework passes message content through middleware hooks. Users who log message content in callbacks or external systems control their own logging behavior. | Providing `PIIMiddleware` for optional PII detection and redaction; providing `SummarizationMiddleware` and `ContextEditingMiddleware` for reducing conversation history |
-| LLM tool emulator generating incorrect/malicious content | `LLMToolEmulator` replaces real tool execution with LLM-generated fiction. It is explicitly designed for testing, not production. | Documenting that emulated responses are not real tool outputs; the middleware is opt-in |
-| Supply chain attacks on LangGraph or partner SDKs | langchain_v1 depends on `langgraph` and dynamically loads partner packages via `importlib`. Compromise of these dependencies is outside the project's control. | Pinning dependency versions in `pyproject.toml` and `uv.lock`; loading only from hardcoded `_BUILTIN_PROVIDERS` registries |
-| Docker container escape via `DockerExecutionPolicy` | Container security depends on host Docker daemon, kernel version, and container configuration. `DockerExecutionPolicy` is a best-effort isolation layer. | `DockerExecutionPolicy` defaults (`--network none`, `--rm`); documentation of host security requirements |
-
-### Rationale
-
-**Shell tool as opt-in accepted risk**: `ShellToolMiddleware` is explicitly designed to grant the LLM shell access. This is a deliberate, visible choice by the deployer -- analogous to giving a user SSH access. The project's responsibility is providing isolation options (`DockerExecutionPolicy`, `CodexSandboxExecutionPolicy`) and documenting the security properties of each policy. The `HostExecutionPolicy` docstring explicitly states: "best suited for trusted or single-tenant environments (CI jobs, developer workstations, pre-sandboxed containers)." In-scope threats (T1, T2) document the specific risks of the default policy; the out-of-scope pattern covers the broader "LLM runs commands" design decision.
-
-**HITL as a shared-responsibility boundary**: `HumanInTheLoopMiddleware` is designed to add a human approval gate. The design assumes the human reviewer is trusted and the interrupt/resume infrastructure is secure. T5 documents the specific gap (no edit content validation), but the broader pattern of "malicious human reviewer" is out of scope because the middleware's purpose is to empower the human, not to constrain them.
-
---
-
-## Investigated and Dismissed
-
-Threats investigated during flaw validation that were found to be non-exploitable or already mitigated.
-
-| ID | Original Threat | Investigation | Evidence | Conclusion |
-|----|----------------|---------------|----------|------------|
-| D1 | Shell injection via `subprocess.Popen` args list in `_launch_subprocess` | Traced `_execution.py:_launch_subprocess` -- uses `subprocess.Popen(list(command), ...)` with list arguments, not a string. The `# noqa: S603` suppression is appropriate. Shell injection via `Popen` args is not possible with list form when no shell is involved. | `_execution.py:_launch_subprocess` -- `list(command)` passed to `Popen`; `shell=False` (the `Popen` default, which the call does not override) | Not exploitable. The injection risk is via stdin content (T1), not via the Popen args list. Bandit suppression is correct. |
-| D2 | Flag injection in ripgrep subprocess via pattern argument | Traced `file_search.py:FilesystemFileSearchMiddleware._ripgrep_search` -- the `--` separator is placed before `pattern` in the command list. Ripgrep stops option parsing at `--`. | `file_search.py:FilesystemFileSearchMiddleware._ripgrep_search` -- `cmd.extend(["--", pattern, str(base_full)])` | Not exploitable. The `--` separator prevents the pattern from being interpreted as a ripgrep flag. |
-| D3 | Provider registry injection via `init_chat_model` or `init_embeddings` | Traced `chat_models/base.py:init_chat_model` and `embeddings/base.py:init_embeddings` -- both use a hardcoded `_BUILTIN_PROVIDERS` dict. The `importlib.import_module` call only loads module paths from this registry. User-supplied `model_provider` is validated against the dict keys before any import. | `chat_models/base.py:_get_chat_model_creator` -- `if provider not in _BUILTIN_PROVIDERS: raise ValueError`; `embeddings/base.py:_get_embeddings_class_creator` -- same pattern | Not exploitable. Arbitrary module loading is prevented by the allowlist check before `importlib.import_module`. |
-| D4 | Symlink file read via ripgrep backend in file search | Tested ripgrep symlink behavior -- `rg` does not follow symlinks by default (requires `--follow`/`-L` flag). The ripgrep command construction in `_ripgrep_search` does not include `--follow`. | `file_search.py:FilesystemFileSearchMiddleware._ripgrep_search` -- `cmd = ["rg", "--json"]` with no `--follow` flag | Not exploitable via ripgrep path. Symlink content read is limited to the Python fallback (`_python_search`), documented as T3. |
-
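The reasoning behind D1 can be checked with a small experiment: when a command is passed as a list and no shell is involved, each element reaches the child process as a single argument and shell metacharacters are not interpreted. The sketch below uses a child Python interpreter as a stand-in for the real command; `echo_argument` is a hypothetical helper, not code from `_execution.py`.

```python
import subprocess
import sys


def echo_argument(value: str) -> str:
    """Pass value as one argv element to a child process and return what it saw."""
    result = subprocess.run(
        [sys.executable, "-c", "import sys; print(sys.argv[1])", value],
        capture_output=True,
        text=True,
        check=True,  # shell defaults to False, so no shell parses `value`
    )
    return result.stdout.rstrip("\n")


# The metacharacters arrive as literal text; nothing after ';' is executed.
payload = "hello; echo pwned && $(id)"
print(echo_argument(payload) == payload)
```

The same property does not hold for data written to the shell's stdin, which is why T1 remains in scope while D1 is dismissed.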
---
-
-## Revision History
-
-| Date | Author | Changes |
-|------|--------|---------|
-| 2026-04-08 | langster-threat-model (deep mode, commit d3e60f5c03) | Initial langchain_v1 focused threat model -- 12 components, 7 data classifications (1 Critical, 3 High, 2 Medium, 1 Low), 5 trust boundaries, 11 data flows, 8 threats (2 High, 3 Medium, 3 Low; 6 Verified, 1 Likely, 1 Unverified), 7 out-of-scope patterns, 4 investigated and dismissed. Based on langchain-core THREAT_MODEL_CORE.md (2026-04-08). |
--- a/.github/actions/poetry_setup/action.yml
+++ b/.github/actions/poetry_setup/action.yml
@@ -0,0 +1,91 @@
+# An action for setting up poetry install with caching.
+# Using a custom action since the default action does not
+# take poetry install groups into account.
+# Action code from:
+# https://github.com/actions/setup-python/issues/505#issuecomment-1273013236
+name: poetry-install-with-caching
+description: Poetry install with support for caching of dependency groups.
+
+inputs:
+  python-version:
+    description: Python version, supporting MAJOR.MINOR only
+    required: true
+
+  poetry-version:
+    description: Poetry version
+    required: true
+
+  cache-key:
+    description: Cache key to use for manual handling of caching
+    required: true
+
+  working-directory:
+    description: Directory whose poetry.lock file should be cached
+    required: true
+
+runs:
+  using: composite
+  steps:
+    - uses: actions/setup-python@v4
+      name: Setup python ${{ inputs.python-version }}
+      with:
+        python-version: ${{ inputs.python-version }}
+
+    - uses: actions/cache@v3
+      id: cache-bin-poetry
+      name: Cache Poetry binary - Python ${{ inputs.python-version }}
+      env:
+        SEGMENT_DOWNLOAD_TIMEOUT_MIN: "1"
+      with:
+        path: |
+          /opt/pipx/venvs/poetry
+        # This step caches the poetry installation, so make sure it's keyed on the poetry version as well.
+        key: bin-poetry-${{ runner.os }}-${{ runner.arch }}-py-${{ inputs.python-version }}-${{ inputs.poetry-version }}
+
+    - name: Refresh shell hashtable and fixup softlinks
+      if: steps.cache-bin-poetry.outputs.cache-hit == 'true'
+      shell: bash
+      env:
+        POETRY_VERSION: ${{ inputs.poetry-version }}
+        PYTHON_VERSION: ${{ inputs.python-version }}
+      run: |
+        set -eux
+
+        # Refresh the shell hashtable, to ensure correct `which` output.
+        hash -r
+
+        # `actions/cache@v3` doesn't always seem able to correctly unpack softlinks.
+        # Delete and recreate the softlinks pipx expects to have.
+        rm /opt/pipx/venvs/poetry/bin/python
+        cd /opt/pipx/venvs/poetry/bin
+        ln -s "$(which "python$PYTHON_VERSION")" python
+        chmod +x python
+        cd /opt/pipx_bin/
+        ln -s /opt/pipx/venvs/poetry/bin/poetry poetry
+        chmod +x poetry
+
+        # Ensure everything got set up correctly.
+        /opt/pipx/venvs/poetry/bin/python --version
+        /opt/pipx_bin/poetry --version
+
+    - name: Install poetry
+      if: steps.cache-bin-poetry.outputs.cache-hit != 'true'
+      shell: bash
+      env:
+        POETRY_VERSION: ${{ inputs.poetry-version }}
+        PYTHON_VERSION: ${{ inputs.python-version }}
+      run: pipx install "poetry==$POETRY_VERSION" --python "python$PYTHON_VERSION" --verbose
+
+    - name: Restore pip and poetry cached dependencies
+      uses: actions/cache@v3
+      env:
+        SEGMENT_DOWNLOAD_TIMEOUT_MIN: "4"
+        WORKDIR: ${{ inputs.working-directory == '' && '.' || inputs.working-directory }}
+      with:
+        path: |
+          ~/.cache/pip
+          ~/.cache/pypoetry/virtualenvs
+          ~/.cache/pypoetry/cache
+          ~/.cache/pypoetry/artifacts
+          ${{ env.WORKDIR }}/.venv
+        key: py-deps-${{ runner.os }}-${{ runner.arch }}-py-${{ inputs.python-version }}-poetry-${{ inputs.poetry-version }}-${{ inputs.cache-key }}-${{ hashFiles(format('{0}/**/poetry.lock', env.WORKDIR)) }}
--- a/.github/actions/uv_setup/action.yml
+++ b/.github/actions/uv_setup/action.yml
@@ -1,39 +0,0 @@
-# Helper to set up Python and uv with caching
-
-name: uv-install
-description: Set up Python and uv with caching
-
-inputs:
-  python-version:
-    description: Python version, supporting MAJOR.MINOR only
-    required: true
-  enable-cache:
-    description: Enable caching for uv dependencies
-    required: false
-    default: "true"
-  cache-suffix:
-    description: Custom cache key suffix for cache invalidation
-    required: false
-    default: ""
-  working-directory:
-    description: Working directory for cache glob scoping
-    required: false
-    default: "**"
-
-env:
-  UV_VERSION: "0.5.25"
-
-runs:
-  using: composite
-  steps:
-    - name: Install uv and set the python version
-      uses: astral-sh/setup-uv@0ca8f610542aa7f4acaf39e65cf4eb3c35091883 # v7
-      with:
-        version: ${{ env.UV_VERSION }}
-        python-version: ${{ inputs.python-version }}
-        enable-cache: ${{ inputs.enable-cache }}
-        cache-dependency-glob: |
-          ${{ inputs.working-directory }}/pyproject.toml
-          ${{ inputs.working-directory }}/uv.lock
-          ${{ inputs.working-directory }}/requirements*.txt
-        cache-suffix: ${{ inputs.cache-suffix }}
--- a/.github/dependabot.yml
+++ b/.github/dependabot.yml
@@ -1,95 +0,0 @@
-# Please see the documentation for all configuration options:
-# https://docs.github.com/github/administering-a-repository/configuration-options-for-dependency-updates
-# and
-# https://docs.github.com/code-security/dependabot/dependabot-version-updates/configuration-options-for-the-dependabot.yml-file
-
-version: 2
-updates:
-  - package-ecosystem: "github-actions"
-    directory: "/"
-    schedule:
-      interval: "monthly"
-    groups:
-      minor-and-patch:
-        patterns:
-          - "*"
-        update-types:
-          - "minor"
-          - "patch"
-      major:
-        patterns:
-          - "*"
-        update-types:
-          - "major"
-
-  - package-ecosystem: "uv"
-    directories:
-      - "/libs/core/"
-      - "/libs/langchain/"
-      - "/libs/langchain_v1/"
-    schedule:
-      interval: "monthly"
-    groups:
-      minor-and-patch:
-        patterns:
-          - "*"
-        update-types:
-          - "minor"
-          - "patch"
-      major:
-        patterns:
-          - "*"
-        update-types:
-          - "major"
-
-  - package-ecosystem: "uv"
-    directories:
-      - "/libs/partners/anthropic/"
-      - "/libs/partners/chroma/"
-      - "/libs/partners/deepseek/"
-      - "/libs/partners/exa/"
-      - "/libs/partners/fireworks/"
-      - "/libs/partners/groq/"
-      - "/libs/partners/huggingface/"
-      - "/libs/partners/mistralai/"
-      - "/libs/partners/nomic/"
-      - "/libs/partners/ollama/"
-      - "/libs/partners/openai/"
-      - "/libs/partners/openrouter/"
-      - "/libs/partners/perplexity/"
-      - "/libs/partners/qdrant/"
-      - "/libs/partners/xai/"
-    schedule:
-      interval: "monthly"
-    groups:
-      minor-and-patch:
-        patterns:
-          - "*"
-        update-types:
-          - "minor"
-          - "patch"
-      major:
-        patterns:
-          - "*"
-        update-types:
-          - "major"
-
-  - package-ecosystem: "uv"
-    directories:
-      - "/libs/text-splitters/"
-      - "/libs/standard-tests/"
-      - "/libs/model-profiles/"
-    schedule:
-      interval: "monthly"
-    groups:
-      minor-and-patch:
-        patterns:
-          - "*"
-        update-types:
-          - "minor"
-          - "patch"
-      major:
-        patterns:
-          - "*"
-        update-types:
-          - "major"
--- a/.github/images/logo-dark.svg
+++ b/.github/images/logo-dark.svg
@@ -1,6 +0,0 @@
-<svg width="472" height="100" viewBox="0 0 472 100" fill="none" xmlns="http://www.w3.org/2000/svg">
-<rect width="100" height="100" rx="20" fill="#161F34"/>
-<path d="M54.2612 54.2583L63.1942 45.3253C67.8979 40.6215 67.8979 32.9952 63.1942 28.2914C58.4904 23.5877 50.8641 23.5877 46.1603 28.2914L37.2273 37.2244" stroke="#7FC8FF" stroke-width="12.0389"/>
-<path d="M45.7427 45.7412L36.8098 54.6742C32.106 59.3779 32.106 67.0042 36.8098 71.708C41.5135 76.4118 49.1398 76.4118 53.8436 71.708L62.7766 62.775" stroke="#7FC8FF" stroke-width="12.0389"/>
-<path d="M142.427 70.248V65.748H153.227V32.748H142.427V28.248H158.147V65.748H168.947V70.248H142.427ZM189.174 70.608C182.454 70.608 177.894 67.248 177.894 61.668C177.894 55.548 182.154 52.128 190.194 52.128H199.194V50.028C199.194 46.068 196.374 43.668 191.574 43.668C187.254 43.668 184.374 45.708 183.774 48.828H178.854C179.574 42.828 184.434 39.288 191.814 39.288C199.614 39.288 204.114 43.188 204.114 50.328V63.708C204.114 65.328 204.714 65.748 206.094 65.748H207.654V70.248H204.954C200.874 70.248 199.494 68.508 199.434 65.508C197.514 68.268 194.454 70.608 189.174 70.608ZM189.534 66.408C195.654 66.408 199.194 62.868 199.194 57.768V56.268H189.714C185.334 56.268 182.874 57.888 182.874 61.368C182.874 64.368 185.454 66.408 189.534 66.408ZM216.601 70.248V39.648H220.861L221.521 43.788C223.321 41.448 226.321 39.288 231.121 39.288C237.601 39.288 243.001 42.948 243.001 52.848V70.248H238.081V53.148C238.081 47.028 235.201 43.788 230.281 43.788C224.941 43.788 221.521 47.928 221.521 53.988V70.248H216.601ZM266.348 82.608C258.548 82.608 253.088 78.948 252.308 72.228H257.348C258.188 76.068 261.608 78.228 266.708 78.228C273.128 78.228 276.608 75.228 276.608 68.568V64.968C274.568 68.448 271.268 70.608 266.108 70.608C257.648 70.608 251.408 64.908 251.408 54.948C251.408 45.588 257.648 39.288 266.108 39.288C271.268 39.288 274.688 41.508 276.608 44.928L277.268 39.648H281.528V68.748C281.528 77.568 276.848 82.608 266.348 82.608ZM266.588 66.228C272.588 66.228 276.668 61.608 276.668 55.068C276.668 48.348 272.588 43.668 266.588 43.668C260.528 43.668 256.448 48.288 256.448 54.948C256.448 61.608 260.528 66.228 266.588 66.228ZM304.875 70.608C295.935 70.608 290.055 64.548 290.055 55.008C290.055 45.648 296.115 39.288 304.995 39.288C312.495 39.288 317.235 43.488 318.495 50.208H313.335C312.435 46.128 309.435 43.668 304.935 43.668C299.055 43.668 295.095 48.348 295.095 55.008C295.095 61.668 299.055 66.228 304.935 66.228C309.315 66.228 312.315 63.708 313.275 59.808H318.495C317.295 66.408 312.315 70.608 
304.875 70.608ZM328.042 70.248V28.248H332.962V43.788C335.242 40.968 338.782 39.288 342.742 39.288C350.422 39.288 354.802 44.388 354.802 53.208V70.248H349.882V53.508C349.882 47.268 347.002 43.788 341.902 43.788C336.442 43.788 332.962 48.108 332.962 54.948V70.248H328.042ZM375.209 70.608C368.489 70.608 363.929 67.248 363.929 61.668C363.929 55.548 368.189 52.128 376.229 52.128H385.229V50.028C385.229 46.068 382.409 43.668 377.609 43.668C373.289 43.668 370.409 45.708 369.809 48.828H364.889C365.609 42.828 370.469 39.288 377.849 39.288C385.649 39.288 390.149 43.188 390.149 50.328V63.708C390.149 65.328 390.749 65.748 392.129 65.748H393.689V70.248H390.989C386.909 70.248 385.529 68.508 385.469 65.508C383.549 68.268 380.489 70.608 375.209 70.608ZM375.569 66.408C381.689 66.408 385.229 62.868 385.229 57.768V56.268H375.749C371.369 56.268 368.909 57.888 368.909 61.368C368.909 64.368 371.489 66.408 375.569 66.408ZM403.476 70.248V65.748H414.276V44.148H403.476V39.648H419.196V65.748H429.996V70.248H403.476ZM416.796 34.248C414.576 34.248 412.836 32.568 412.836 30.288C412.836 28.068 414.576 26.388 416.796 26.388C419.016 26.388 420.756 28.068 420.756 30.288C420.756 32.568 419.016 34.248 416.796 34.248ZM439.843 70.248V39.648H444.103L444.763 43.788C446.563 41.448 449.563 39.288 454.363 39.288C460.843 39.288 466.243 42.948 466.243 52.848V70.248H461.323V53.148C461.323 47.028 458.443 43.788 453.523 43.788C448.183 43.788 444.763 47.928 444.763 53.988V70.248H439.843Z" fill="white"/>
-</svg>
--- a/.github/images/logo-light.svg
+++ b/.github/images/logo-light.svg
@@ -1,6 +0,0 @@
-<svg width="472" height="100" viewBox="0 0 472 100" fill="none" xmlns="http://www.w3.org/2000/svg">
-<rect width="100" height="100" rx="20" fill="#161F34"/>
-<path d="M54.2612 54.2583L63.1942 45.3253C67.8979 40.6215 67.8979 32.9952 63.1942 28.2914C58.4904 23.5877 50.8641 23.5877 46.1603 28.2914L37.2273 37.2244" stroke="#7FC8FF" stroke-width="12.0389"/>
-<path d="M45.7427 45.7411L36.8098 54.6741C32.106 59.3779 32.106 67.0042 36.8098 71.7079C41.5135 76.4117 49.1398 76.4117 53.8436 71.7079L62.7766 62.775" stroke="#7FC8FF" stroke-width="12.0389"/>
-<path d="M142.427 70.248V65.748H153.227V32.748H142.427V28.248H158.147V65.748H168.947V70.248H142.427ZM189.174 70.608C182.454 70.608 177.894 67.248 177.894 61.668C177.894 55.548 182.154 52.128 190.194 52.128H199.194V50.028C199.194 46.068 196.374 43.668 191.574 43.668C187.254 43.668 184.374 45.708 183.774 48.828H178.854C179.574 42.828 184.434 39.288 191.814 39.288C199.614 39.288 204.114 43.188 204.114 50.328V63.708C204.114 65.328 204.714 65.748 206.094 65.748H207.654V70.248H204.954C200.874 70.248 199.494 68.508 199.434 65.508C197.514 68.268 194.454 70.608 189.174 70.608ZM189.534 66.408C195.654 66.408 199.194 62.868 199.194 57.768V56.268H189.714C185.334 56.268 182.874 57.888 182.874 61.368C182.874 64.368 185.454 66.408 189.534 66.408ZM216.601 70.248V39.648H220.861L221.521 43.788C223.321 41.448 226.321 39.288 231.121 39.288C237.601 39.288 243.001 42.948 243.001 52.848V70.248H238.081V53.148C238.081 47.028 235.201 43.788 230.281 43.788C224.941 43.788 221.521 47.928 221.521 53.988V70.248H216.601ZM266.348 82.608C258.548 82.608 253.088 78.948 252.308 72.228H257.348C258.188 76.068 261.608 78.228 266.708 78.228C273.128 78.228 276.608 75.228 276.608 68.568V64.968C274.568 68.448 271.268 70.608 266.108 70.608C257.648 70.608 251.408 64.908 251.408 54.948C251.408 45.588 257.648 39.288 266.108 39.288C271.268 39.288 274.688 41.508 276.608 44.928L277.268 39.648H281.528V68.748C281.528 77.568 276.848 82.608 266.348 82.608ZM266.588 66.228C272.588 66.228 276.668 61.608 276.668 55.068C276.668 48.348 272.588 43.668 266.588 43.668C260.528 43.668 256.448 48.288 256.448 54.948C256.448 61.608 260.528 66.228 266.588 66.228ZM304.875 70.608C295.935 70.608 290.055 64.548 290.055 55.008C290.055 45.648 296.115 39.288 304.995 39.288C312.495 39.288 317.235 43.488 318.495 50.208H313.335C312.435 46.128 309.435 43.668 304.935 43.668C299.055 43.668 295.095 48.348 295.095 55.008C295.095 61.668 299.055 66.228 304.935 66.228C309.315 66.228 312.315 63.708 313.275 59.808H318.495C317.295 66.408 312.315 70.608 
304.875 70.608ZM328.042 70.248V28.248H332.962V43.788C335.242 40.968 338.782 39.288 342.742 39.288C350.422 39.288 354.802 44.388 354.802 53.208V70.248H349.882V53.508C349.882 47.268 347.002 43.788 341.902 43.788C336.442 43.788 332.962 48.108 332.962 54.948V70.248H328.042ZM375.209 70.608C368.489 70.608 363.929 67.248 363.929 61.668C363.929 55.548 368.189 52.128 376.229 52.128H385.229V50.028C385.229 46.068 382.409 43.668 377.609 43.668C373.289 43.668 370.409 45.708 369.809 48.828H364.889C365.609 42.828 370.469 39.288 377.849 39.288C385.649 39.288 390.149 43.188 390.149 50.328V63.708C390.149 65.328 390.749 65.748 392.129 65.748H393.689V70.248H390.989C386.909 70.248 385.529 68.508 385.469 65.508C383.549 68.268 380.489 70.608 375.209 70.608ZM375.569 66.408C381.689 66.408 385.229 62.868 385.229 57.768V56.268H375.749C371.369 56.268 368.909 57.888 368.909 61.368C368.909 64.368 371.489 66.408 375.569 66.408ZM403.476 70.248V65.748H414.276V44.148H403.476V39.648H419.196V65.748H429.996V70.248H403.476ZM416.796 34.248C414.576 34.248 412.836 32.568 412.836 30.288C412.836 28.068 414.576 26.388 416.796 26.388C419.016 26.388 420.756 28.068 420.756 30.288C420.756 32.568 419.016 34.248 416.796 34.248ZM439.843 70.248V39.648H444.103L444.763 43.788C446.563 41.448 449.563 39.288 454.363 39.288C460.843 39.288 466.243 42.948 466.243 52.848V70.248H461.323V53.148C461.323 47.028 458.443 43.788 453.523 43.788C448.183 43.788 444.763 47.928 444.763 53.988V70.248H439.843Z" fill="#161F34"/>
-</svg>
--- a/.github/scripts/check_diff.py
+++ b/.github/scripts/check_diff.py
@@ -1,357 +0,0 @@
-"""Analyze git diffs to determine which directories need to be tested.
-
-Intelligently determines which LangChain packages and directories need to be tested,
-linted, or built based on the changes. Handles dependency relationships between
-packages, maps file changes to appropriate CI job configurations, and outputs JSON
-configurations for GitHub Actions.
-
- Maps changed files to affected package directories (libs/core, libs/partners/*, etc.)
- Builds dependency graph to include dependent packages when core components change
- Generates test matrix configurations with appropriate Python versions
- Handles special cases for Pydantic version testing and performance benchmarks
-
-Used as part of the check_diffs workflow.
-"""
-
-import glob
-import json
-import os
-import sys
-from collections import defaultdict
-from pathlib import Path
-from typing import Dict, List, Set
-
-import tomllib
-from get_min_versions import get_min_version_from_toml
-from packaging.requirements import Requirement
-
-LANGCHAIN_DIRS = [
-    "libs/core",
-    "libs/text-splitters",
-    "libs/langchain",
-    "libs/langchain_v1",
-    "libs/model-profiles",
-]
-
-# Packages with VCR cassette-backed integration tests.
-# These get a playback-only CI check to catch stale cassettes.
-VCR_PACKAGES = {
-    "libs/partners/openai",
-}
-
-# When set to True, we are ignoring core dependents
-# in order to be able to get CI to pass for each individual
-# package that depends on core
-# e.g. if you touch core, we don't then add textsplitters/etc to CI
-IGNORE_CORE_DEPENDENTS = False
-
-# Ignored partners are removed from dependents but still run if directly edited
-IGNORED_PARTNERS = [
-    # remove huggingface from dependents because of CI instability
-    # specifically in huggingface jobs
-    "huggingface",
-]
-
-
-def all_package_dirs() -> Set[str]:
-    return {
-        "/".join(path.split("/")[:-1]).lstrip("./")
-        for path in glob.glob("./libs/**/pyproject.toml", recursive=True)
-        if "libs/standard-tests" not in path
-    }
-
-
-def dependents_graph() -> dict:
-    """Construct a mapping of package -> dependents
-
-    Done such that we can run tests on all dependents of a package when a change is made.
-    """
-    dependents = defaultdict(set)
-
-    for path in glob.glob("./libs/**/pyproject.toml", recursive=True):
-        if "template" in path:
-            continue
-
-        # load regular and test deps from pyproject.toml
-        with open(path, "rb") as f:
-            pyproject = tomllib.load(f)
-
-        pkg_dir = "libs" + "/".join(path.split("libs")[1].split("/")[:-1])
-        for dep in [
-            *pyproject["project"]["dependencies"],
-            *pyproject["dependency-groups"]["test"],
-        ]:
-            requirement = Requirement(dep)
-            package_name = requirement.name
-            if "langchain" in dep:
-                dependents[package_name].add(pkg_dir)
-                continue
-
-        # load extended deps from extended_testing_deps.txt
-        package_path = Path(path).parent
-        extended_requirement_path = package_path / "extended_testing_deps.txt"
-        if extended_requirement_path.exists():
-            with open(extended_requirement_path, "r") as f:
-                extended_deps = f.read().splitlines()
-                for depline in extended_deps:
-                    if depline.startswith("-e "):
-                        # editable dependency
-                        assert depline.startswith("-e ../partners/"), (
-                            "Extended test deps should only editable install partner packages"
-                        )
-                        partner = depline.split("partners/")[1]
-                        dep = f"langchain-{partner}"
-                    else:
-                        dep = depline.split("==")[0]
-
-                    if "langchain" in dep:
-                        dependents[dep].add(pkg_dir)
-
-    for k in dependents:
-        for partner in IGNORED_PARTNERS:
-            if f"libs/partners/{partner}" in dependents[k]:
-                dependents[k].remove(f"libs/partners/{partner}")
-    return dependents
-
-
-def add_dependents(dirs_to_eval: Set[str], dependents: dict) -> List[str]:
-    updated = set()
-    for dir_ in dirs_to_eval:
-        # handle core manually because it has so many dependents
-        if "core" in dir_:
-            updated.add(dir_)
-            continue
-        pkg = "langchain-" + dir_.split("/")[-1]
-        updated.update(dependents[pkg])
-        updated.add(dir_)
-    return list(updated)
-
-
-def _get_configs_for_single_dir(job: str, dir_: str) -> List[Dict[str, str]]:
-    if job == "test-pydantic":
-        return _get_pydantic_test_configs(dir_)
-
-    if job == "codspeed":
-        # CPU simulation (<1% variance, Valgrind-based) is the default.
-        # Partners with heavy SDK inits use walltime instead to keep CI fast.
-        CODSPEED_WALLTIME_DIRS = {
-            "libs/core",
-            "libs/partners/fireworks",  # ~328s under simulation
-            "libs/partners/openai",  # 6 benchmarks, ~6 min under simulation
-        }
-        mode = "walltime" if dir_ in CODSPEED_WALLTIME_DIRS else "simulation"
-        return [
-            {
-                "working-directory": dir_,
-                "python-version": "3.13",
-                "codspeed-mode": mode,
-            }
-        ]
-    if dir_ == "libs/core":
-        py_versions = ["3.10", "3.11", "3.12", "3.13", "3.14"]
-    else:
-        py_versions = ["3.10", "3.14"]
-
-    return [{"working-directory": dir_, "python-version": py_v} for py_v in py_versions]
-
-
-def _get_pydantic_test_configs(
-    dir_: str, *, python_version: str = "3.12"
-) -> List[Dict[str, str]]:
-    with open("./libs/core/uv.lock", "rb") as f:
-        core_uv_lock_data = tomllib.load(f)
-    for package in core_uv_lock_data["package"]:
-        if package["name"] == "pydantic":
-            core_max_pydantic_minor = package["version"].split(".")[1]
-            break
-
-    with open(f"./{dir_}/uv.lock", "rb") as f:
-        dir_uv_lock_data = tomllib.load(f)
-
-    for package in dir_uv_lock_data["package"]:
-        if package["name"] == "pydantic":
-            dir_max_pydantic_minor = package["version"].split(".")[1]
-            break
-
-    core_min_pydantic_version = get_min_version_from_toml(
-        "./libs/core/pyproject.toml", "release", python_version, include=["pydantic"]
-    )["pydantic"]
-    core_min_pydantic_minor = (
-        core_min_pydantic_version.split(".")[1]
-        if "." in core_min_pydantic_version
-        else "0"
-    )
-    dir_min_pydantic_version = get_min_version_from_toml(
-        f"./{dir_}/pyproject.toml", "release", python_version, include=["pydantic"]
-    ).get("pydantic", "0.0.0")
-    dir_min_pydantic_minor = (
-        dir_min_pydantic_version.split(".")[1]
-        if "." in dir_min_pydantic_version
-        else "0"
-    )
-
-    max_pydantic_minor = min(
-        int(dir_max_pydantic_minor),
-        int(core_max_pydantic_minor),
-    )
-    min_pydantic_minor = max(
-        int(dir_min_pydantic_minor),
-        int(core_min_pydantic_minor),
-    )
-
-    configs = [
-        {
-            "working-directory": dir_,
-            "pydantic-version": f"2.{v}.0",
-            "python-version": python_version,
-        }
-        for v in range(min_pydantic_minor, max_pydantic_minor + 1)
-    ]
-    return configs
-
-
-def _get_configs_for_multi_dirs(
-    job: str, dirs_to_run: Dict[str, Set[str]], dependents: dict
-) -> List[Dict[str, str]]:
-    if job == "lint":
-        dirs = add_dependents(
-            dirs_to_run["lint"] | dirs_to_run["test"] | dirs_to_run["extended-test"],
-            dependents,
-        )
-    elif job in ["test", "compile-integration-tests", "dependencies", "test-pydantic"]:
-        dirs = add_dependents(
-            dirs_to_run["test"] | dirs_to_run["extended-test"], dependents
-        )
-    elif job == "extended-tests":
-        dirs = list(dirs_to_run["extended-test"])
-    elif job == "codspeed":
-        dirs = list(dirs_to_run["codspeed"])
-    elif job == "vcr-tests":
-        # Only run VCR tests for packages that have cassettes and are affected
-        all_affected = set(
-            add_dependents(
-                dirs_to_run["test"] | dirs_to_run["extended-test"], dependents
-            )
-        )
-        dirs = [d for d in VCR_PACKAGES if d in all_affected]
-    else:
-        raise ValueError(f"Unknown job: {job}")
-
-    return [
-        config for dir_ in dirs for config in _get_configs_for_single_dir(job, dir_)
-    ]
-
-
-if __name__ == "__main__":
-    files = sys.argv[1:]
-
-    dirs_to_run: Dict[str, set] = {
-        "lint": set(),
-        "test": set(),
-        "extended-test": set(),
-        "codspeed": set(),
-    }
-    docs_edited = False
-
-    if len(files) >= 300:
-        # max diff length is 300 files - there are likely files missing
-        dirs_to_run["lint"] = all_package_dirs()
-        dirs_to_run["test"] = all_package_dirs()
-        dirs_to_run["extended-test"] = set(LANGCHAIN_DIRS)
-
-    for file in files:
-        if any(
-            file.startswith(dir_)
-            for dir_ in (
-                ".github/workflows",
-                ".github/tools",
-                ".github/actions",
-                ".github/scripts/check_diff.py",
-            )
-        ):
-            # Infrastructure changes (workflows, actions, CI scripts) trigger tests on
-            # all core packages as a safety measure. This ensures that changes to CI/CD
-            # infrastructure don't inadvertently break package testing, even if the change
-            # appears unrelated (e.g., documentation build workflows). This is intentionally
-            # conservative to catch unexpected side effects from workflow modifications.
-            #
-            # Example: A PR modifying .github/workflows/api_doc_build.yml will trigger
-            # lint/test jobs for libs/core, libs/text-splitters, libs/langchain, and
-            # libs/langchain_v1, even though the workflow may only affect documentation.
-            dirs_to_run["extended-test"].update(LANGCHAIN_DIRS)
-
-        if file.startswith("libs/core"):
-            dirs_to_run["codspeed"].add("libs/core")
-        if any(file.startswith(dir_) for dir_ in LANGCHAIN_DIRS):
-            # add that dir and all dirs after in LANGCHAIN_DIRS
-            # for extended testing
-
-            found = False
-            for dir_ in LANGCHAIN_DIRS:
-                if dir_ == "libs/core" and IGNORE_CORE_DEPENDENTS:
-                    dirs_to_run["extended-test"].add(dir_)
-                    continue
-                if file.startswith(dir_):
-                    found = True
-                if found:
-                    dirs_to_run["extended-test"].add(dir_)
-        elif file.startswith("libs/standard-tests"):
-            # TODO: update to include all packages that rely on standard-tests (all partner packages)
-            # Note: won't run on external repo partners
-            dirs_to_run["lint"].add("libs/standard-tests")
-            dirs_to_run["test"].add("libs/standard-tests")
-            dirs_to_run["test"].add("libs/partners/mistralai")
-            dirs_to_run["test"].add("libs/partners/openai")
-            dirs_to_run["test"].add("libs/partners/anthropic")
-            dirs_to_run["test"].add("libs/partners/fireworks")
-            dirs_to_run["test"].add("libs/partners/groq")
-
-        elif file.startswith("libs/partners"):
-            partner_dir = file.split("/")[2]
-            if os.path.isdir(f"libs/partners/{partner_dir}") and [
-                filename
-                for filename in os.listdir(f"libs/partners/{partner_dir}")
-                if not filename.startswith(".")
-            ] != ["README.md"]:
-                dirs_to_run["test"].add(f"libs/partners/{partner_dir}")
-                # Skip codspeed for partners without benchmarks or in IGNORED_PARTNERS
-                if partner_dir not in IGNORED_PARTNERS:
-                    dirs_to_run["codspeed"].add(f"libs/partners/{partner_dir}")
-            # Skip if the directory was deleted or is just a tombstone readme
-        elif file.startswith("libs/"):
-            # Check if this is a root-level file in libs/ (e.g., libs/README.md)
-            file_parts = file.split("/")
-            if len(file_parts) == 2:
-                # Root-level file in libs/, skip it (no tests needed)
-                continue
-            raise ValueError(
-                f"Unknown lib: {file}. check_diff.py likely needs "
-                "an update for this new library!"
-            )
-        elif file in [
-            "pyproject.toml",
-            "uv.lock",
-        ]:  # root uv files
-            docs_edited = True
-
-    dependents = dependents_graph()
-
-    # dirs_to_run now maps each job category to the package dirs it should cover
-    # TODO: clean this up
-    map_job_to_configs = {
-        job: _get_configs_for_multi_dirs(job, dirs_to_run, dependents)
-        for job in [
-            "lint",
-            "test",
-            "extended-tests",
-            "compile-integration-tests",
-            "dependencies",
-            "test-pydantic",
-            "codspeed",
-            "vcr-tests",
-        ]
-    }
-
-    for key, value in map_job_to_configs.items():
-        json_output = json.dumps(value)
-        print(f"{key}={json_output}")
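Note on `_get_pydantic_test_configs` in the removed script: the tested range is the overlap of the two packages' supported pydantic minors, running from the larger of the two minimums up to the smaller of the two locked maximums. Below is a minimal sketch of just that arithmetic. The names `minor_of` and `pydantic_minor_range` are hypothetical, and the version strings in the usage lines are made up for illustration rather than read from any lock file.

```python
from typing import List


def minor_of(version: str) -> int:
    """Return the minor component of a version string, or 0 if there is none."""
    return int(version.split(".")[1]) if "." in version else 0


def pydantic_minor_range(
    core_min: str, core_max: str, dir_min: str, dir_max: str
) -> List[str]:
    """List the 2.x.0 pydantic versions that both packages claim to support."""
    lowest = max(minor_of(core_min), minor_of(dir_min))
    highest = min(minor_of(core_max), minor_of(dir_max))
    # range() is empty when lowest > highest, so disjoint ranges yield no configs
    return [f"2.{v}.0" for v in range(lowest, highest + 1)]


# Illustrative inputs only (not taken from the repository)
print(pydantic_minor_range("2.7.4", "2.11.3", "2.5.0", "2.10.6"))
```

When the two ranges do not overlap, the list is empty and no `test-pydantic` matrix entry would be produced for that directory.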
--- a/.github/scripts/check_prerelease_dependencies.py
+++ b/.github/scripts/check_prerelease_dependencies.py
@@ -1,36 +0,0 @@
-"""Check that no dependencies allow prereleases unless we're releasing a prerelease."""
-
-import sys
-
-import tomllib
-
-if __name__ == "__main__":
-    # Get the TOML file path from the command line argument
-    toml_file = sys.argv[1]
-
-    with open(toml_file, "rb") as file:
-        toml_data = tomllib.load(file)
-
-    # See if we're releasing an rc or dev version
-    version = toml_data["project"]["version"]
-    releasing_rc = "rc" in version or "dev" in version
-
-    # If not, iterate through dependencies and make sure none allow prereleases
-    if not releasing_rc:
-        dependencies = toml_data["project"]["dependencies"]
-        for dep_version in dependencies:
-            dep_version_string = (
-                dep_version["version"] if isinstance(dep_version, dict) else dep_version
-            )
-
-            if "rc" in dep_version_string:
-                raise ValueError(
-                    f"Dependency {dep_version} has a prerelease version. Please remove this."
-                )
-
-            if isinstance(dep_version, dict) and dep_version.get(
-                "allow-prereleases", False
-            ):
-                raise ValueError(
-                    f"Dependency {dep_version} has allow-prereleases set to true. Please remove this."
-                )
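Note on the removed prerelease check: it tests `"rc" in dep_version_string` against the whole dependency string, so a distribution whose name happens to contain `rc` (for example `opensearch-py`) would presumably be flagged as well. The sketch below is a stricter variant and not the script's own method. It strips the name, extras, and environment marker first and then looks for a prerelease segment in the version part only. The helper names and both regular expressions are assumptions made for illustration, and the dependency strings are invented.

```python
import re

# Distribution name plus optional extras at the start of a PEP 508 string
_NAME = re.compile(r"^\s*[A-Za-z0-9][A-Za-z0-9._-]*(\[[^\]]*\])?")
# A digit followed by a pre/dev-release segment such as 0rc1, 0b2 or 0.dev3
_PRE = re.compile(r"\d\.?(?:a|b|rc|dev)\d+")


def spec_part(dep: str) -> str:
    """Drop the environment marker, then the leading name and extras."""
    without_marker = dep.split(";", 1)[0]
    return _NAME.sub("", without_marker, count=1)


def has_prerelease_pin(dep: str) -> bool:
    """True if the version specifier itself mentions a prerelease."""
    return bool(_PRE.search(spec_part(dep)))


for dep in ("opensearch-py>=2.0", "langchain-core>=1.0.0rc1", "foo==1.0.dev3"):
    print(dep, "rc" in dep, has_prerelease_pin(dep))
```

A production check would more likely parse the string with `packaging.requirements.Requirement` and inspect the specifier versions; the regex here only keeps the sketch dependency-free.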
--- a/.github/scripts/get_min_versions.py
+++ b/.github/scripts/get_min_versions.py
@@ -1,199 +0,0 @@
-"""Get minimum versions of dependencies from a pyproject.toml file."""
-
-import sys
-from collections import defaultdict
-
-if sys.version_info >= (3, 11):
-    import tomllib
-else:
-    # For Python 3.10 and below, which doesn't have stdlib tomllib
-    import tomli as tomllib
-
-import re
-from typing import List
-
-import requests
-from packaging.requirements import Requirement
-from packaging.specifiers import SpecifierSet
-from packaging.version import Version, parse
-
-MIN_VERSION_LIBS = [
-    "langchain-core",
-    "langchain",
-    "langchain-text-splitters",
-    "numpy",
-    "SQLAlchemy",
-]
-
-# some libs only get checked on release because of simultaneous changes in
-# multiple libs
-SKIP_IF_PULL_REQUEST = [
-    "langchain-core",
-    "langchain-text-splitters",
-    "langchain",
-]
-
-
-def get_pypi_versions(package_name: str) -> List[str]:
-    """Fetch all available versions for a package from PyPI.
-
-    Args:
-        package_name: Name of the package
-
-    Returns:
-        List of all available versions
-
-    Raises:
-        requests.exceptions.RequestException: If PyPI API request fails
-        KeyError: If package not found or response format unexpected
-    """
-    pypi_url = f"https://pypi.org/pypi/{package_name}/json"
-    response = requests.get(pypi_url, timeout=10.0)
-    response.raise_for_status()
-    return list(response.json()["releases"].keys())
-
-
-def get_minimum_version(package_name: str, spec_string: str) -> str | None:
-    """Find the minimum published version that satisfies the given constraints.
-
-    Args:
-        package_name: Name of the package
-        spec_string: Version specification string (e.g., ">=0.2.43,<0.4.0,!=0.3.0")
-
-    Returns:
-        Minimum compatible version or None if no compatible version found
-    """
-    # Rewrite occurrences of ^0.0.z to 0.0.z (can be anywhere in constraint string)
-    spec_string = re.sub(r"\^0\.0\.(\d+)", r"0.0.\1", spec_string)
-    # Rewrite occurrences of ^0.y.z to >=0.y.z,<0.y+1 (can be anywhere in constraint string)
-    for y in range(1, 10):
-        spec_string = re.sub(
-            rf"\^0\.{y}\.(\d+)", rf">=0.{y}.\1,<0.{y + 1}", spec_string
-        )
-    # Rewrite occurrences of ^x.y.z to >=x.y.z,<x+1.0.0 (can be anywhere in constraint string)
-    for x in range(1, 10):
-        spec_string = re.sub(
-            rf"\^{x}\.(\d+)\.(\d+)", rf">={x}.\1.\2,<{x + 1}", spec_string
-        )
-
-    spec_set = SpecifierSet(spec_string)
-    all_versions = get_pypi_versions(package_name)
-
-    valid_versions = []
-    for version_str in all_versions:
-        try:
-            version = parse(version_str)
-            if spec_set.contains(version):
-                valid_versions.append(version)
-        except ValueError:
-            continue
-
-    return str(min(valid_versions)) if valid_versions else None
-
-
-def _check_python_version_from_requirement(
-    requirement: Requirement, python_version: str
-) -> bool:
-    if not requirement.marker:
-        return True
-    else:
-        marker_str = str(requirement.marker)
-        if "python_version" in marker_str or "python_full_version" in marker_str:
-            python_version_str = "".join(
-                char
-                for char in marker_str
-                if char.isdigit() or char in (".", "<", ">", "=", ",")
-            )
-            return check_python_version(python_version, python_version_str)
-        return True
-
-
-def get_min_version_from_toml(
-    toml_path: str,
-    versions_for: str,
-    python_version: str,
-    *,
-    include: list | None = None,
-):
-    # Parse the TOML file
-    with open(toml_path, "rb") as file:
-        toml_data = tomllib.load(file)
-
-    dependencies = defaultdict(list)
-    for dep in toml_data["project"]["dependencies"]:
-        requirement = Requirement(dep)
-        dependencies[requirement.name].append(requirement)
-
-    # Initialize a dictionary to store the minimum versions
-    min_versions = {}
-
-    # Iterate over the libs in MIN_VERSION_LIBS
-    for lib in set(MIN_VERSION_LIBS + (include or [])):
-        if versions_for == "pull_request" and lib in SKIP_IF_PULL_REQUEST:
-            # some libs only get checked on release because of simultaneous
-            # changes in multiple libs
-            continue
-        # Check if the lib is present in the dependencies
-        if lib in dependencies:
-            if include and lib not in include:
-                continue
-            requirements = dependencies[lib]
-            for requirement in requirements:
-                if _check_python_version_from_requirement(requirement, python_version):
-                    version_string = str(requirement.specifier)
-                    break
-
-            # Use get_minimum_version to find the lowest published version allowed by version_string
-            min_version = get_minimum_version(lib, version_string)
-
-            # Store the minimum version in the min_versions dictionary
-            min_versions[lib] = min_version
-
-    return min_versions
-
-
-def check_python_version(version_string, constraint_string):
-    """Check if the given Python version matches the given constraints.
-
-    Args:
-        version_string: A string representing the Python version (e.g. "3.8.5").
-        constraint_string: A string representing the package's Python version
-            constraints (e.g. ">=3.6, <4.0").
-
-    Returns:
-        True if the version matches the constraints
-    """
-
-    # Rewrite occurrences of ^0.0.z to 0.0.z (can be anywhere in constraint string)
-    constraint_string = re.sub(r"\^0\.0\.(\d+)", r"0.0.\1", constraint_string)
-    # Rewrite occurrences of ^0.y.z to >=0.y.z,<0.y+1.0 (can be anywhere in constraint string)
-    for y in range(1, 10):
-        constraint_string = re.sub(
-            rf"\^0\.{y}\.(\d+)", rf">=0.{y}.\1,<0.{y + 1}.0", constraint_string
-        )
-    # Rewrite occurrences of ^x.y.z to >=x.y.z,<x+1.0.0 (can be anywhere in constraint string)
-    for x in range(1, 10):
-        constraint_string = re.sub(
-            rf"\^{x}\.(\d+)\.(\d+)", rf">={x}.\1.\2,<{x + 1}.0.0", constraint_string
-        )
-
-    try:
-        version = Version(version_string)
-        constraints = SpecifierSet(constraint_string)
-        return version in constraints
-    except Exception as e:
-        print(f"Error: {e}")
-        return False
-
-
-if __name__ == "__main__":
-    # Get the TOML file path from the command line argument
-    toml_file = sys.argv[1]
-    versions_for = sys.argv[2]
-    python_version = sys.argv[3]
-    assert versions_for in ["release", "pull_request"]
-
-    # Call the function to get the minimum versions
-    min_versions = get_min_version_from_toml(toml_file, versions_for, python_version)
-
-    print(" ".join([f"{lib}=={version}" for lib, version in min_versions.items()]))
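Note on the caret handling in the removed `get_min_versions.py`: both `get_minimum_version` and `check_python_version` expand Poetry-style `^` constraints with one `re.sub` per digit in `range(1, 10)`, which only covers single-digit major or minor numbers. A single substitution with a callback could cover any number of digits. The sketch below assumes the usual caret semantics (the upper bound bumps the leftmost non-zero component); the name `expand_caret` is hypothetical, and unlike the script, which rewrites `^0.0.z` to a bare `0.0.z`, it emits an explicit range for that case.

```python
import re

_CARET = re.compile(r"\^(\d+)\.(\d+)\.(\d+)")


def expand_caret(spec: str) -> str:
    """Rewrite every ^x.y.z in spec as an explicit >=,< range."""

    def repl(match: "re.Match[str]") -> str:
        major, minor, patch = (int(g) for g in match.groups())
        lower = f">={major}.{minor}.{patch}"
        if major > 0:
            upper = f"<{major + 1}.0.0"
        elif minor > 0:
            upper = f"<0.{minor + 1}.0"
        else:
            upper = f"<0.0.{patch + 1}"
        return f"{lower},{upper}"

    return _CARET.sub(repl, spec)


for spec in ("^1.2.3", "^0.2.3", "^0.0.3", "^12.4.0", ">=0.2.43,<0.4.0"):
    print(spec, "->", expand_caret(spec))
```

Constraints without a caret pass through unchanged, so the result can still be handed to a specifier parser afterwards.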
--- a/.github/scripts/pr-labeler-config.json
+++ b/.github/scripts/pr-labeler-config.json
@@ -1,84 +0,0 @@
-{
-  "trustedThreshold": 5,
-  "labelColor": "b76e79",
-  "sizeThresholds": [
-    { "label": "size: XS", "max": 50 },
-    { "label": "size: S", "max": 200 },
-    { "label": "size: M", "max": 500 },
-    { "label": "size: L", "max": 1000 },
-    { "label": "size: XL" }
-  ],
-  "excludedFiles": ["uv.lock"],
-  "excludedPaths": ["docs/"],
-  "typeToLabel": {
-    "feat": "feature",
-    "fix": "fix",
-    "docs": "documentation",
-    "style": "linting",
-    "refactor": "refactor",
-    "perf": "performance",
-    "test": "tests",
-    "build": "infra",
-    "ci": "infra",
-    "chore": "infra",
-    "revert": "revert",
-    "release": "release",
-    "hotfix": "hotfix",
-    "breaking": "breaking"
-  },
-  "scopeToLabel": {
-    "core": "core",
-    "langchain": "langchain",
-    "langchain-classic": "langchain-classic",
-    "model-profiles": "model-profiles",
-    "standard-tests": "standard-tests",
-    "text-splitters": "text-splitters",
-    "anthropic": "anthropic",
-    "chroma": "chroma",
-    "deepseek": "deepseek",
-    "exa": "exa",
-    "fireworks": "fireworks",
-    "groq": "groq",
-    "huggingface": "huggingface",
-    "mistralai": "mistralai",
-    "nomic": "nomic",
-    "ollama": "ollama",
-    "openai": "openai",
-    "openrouter": "openrouter",
-    "perplexity": "perplexity",
-    "qdrant": "qdrant",
-    "xai": "xai",
-    "deps": "dependencies",
-    "docs": "documentation",
-    "infra": "infra"
-  },
-  "fileRules": [
-    { "label": "core", "prefix": "libs/core/", "skipExcludedFiles": true },
-    { "label": "langchain-classic", "prefix": "libs/langchain/", "skipExcludedFiles": true },
-    { "label": "langchain", "prefix": "libs/langchain_v1/", "skipExcludedFiles": true },
-    { "label": "standard-tests", "prefix": "libs/standard-tests/", "skipExcludedFiles": true },
-    { "label": "model-profiles", "prefix": "libs/model-profiles/", "skipExcludedFiles": true },
-    { "label": "text-splitters", "prefix": "libs/text-splitters/", "skipExcludedFiles": true },
-    { "label": "integration", "prefix": "libs/partners/", "skipExcludedFiles": true },
-    { "label": "anthropic", "prefix": "libs/partners/anthropic/", "skipExcludedFiles": true },
-    { "label": "chroma", "prefix": "libs/partners/chroma/", "skipExcludedFiles": true },
-    { "label": "deepseek", "prefix": "libs/partners/deepseek/", "skipExcludedFiles": true },
-    { "label": "exa", "prefix": "libs/partners/exa/", "skipExcludedFiles": true },
-    { "label": "fireworks", "prefix": "libs/partners/fireworks/", "skipExcludedFiles": true },
-    { "label": "groq", "prefix": "libs/partners/groq/", "skipExcludedFiles": true },
-    { "label": "huggingface", "prefix": "libs/partners/huggingface/", "skipExcludedFiles": true },
-    { "label": "mistralai", "prefix": "libs/partners/mistralai/", "skipExcludedFiles": true },
-    { "label": "nomic", "prefix": "libs/partners/nomic/", "skipExcludedFiles": true },
-    { "label": "ollama", "prefix": "libs/partners/ollama/", "skipExcludedFiles": true },
-    { "label": "openai", "prefix": "libs/partners/openai/", "skipExcludedFiles": true },
-    { "label": "openrouter", "prefix": "libs/partners/openrouter/", "skipExcludedFiles": true },
-    { "label": "perplexity", "prefix": "libs/partners/perplexity/", "skipExcludedFiles": true },
-    { "label": "qdrant", "prefix": "libs/partners/qdrant/", "skipExcludedFiles": true },
-    { "label": "xai", "prefix": "libs/partners/xai/", "skipExcludedFiles": true },
-    { "label": "github_actions", "prefix": ".github/workflows/" },
-    { "label": "github_actions", "prefix": ".github/actions/" },
-    { "label": "dependencies", "suffix": "pyproject.toml" },
-    { "label": "dependencies", "exact": "uv.lock" },
-    { "label": "dependencies", "pattern": "(?:^|/)requirements[^/]*\\.txt$" }
-  ]
-}
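Note on `sizeThresholds` in the removed config: `getSizeLabel` in `pr-labeler.js` compares with a strict `<`, so each `max` is exclusive and the entry without a `max` is the catch-all. The following is a Python rendering of that lookup, written only to make the boundaries easy to check. `size_label` is a hypothetical name, and the threshold values are copied from the JSON above.

```python
from typing import List, Optional, Tuple

# (label, exclusive upper bound); None marks the catch-all entry
SIZE_THRESHOLDS: List[Tuple[str, Optional[int]]] = [
    ("size: XS", 50),
    ("size: S", 200),
    ("size: M", 500),
    ("size: L", 1000),
    ("size: XL", None),
]


def size_label(total_changed: int) -> str:
    """Return the first label whose bound is strictly above total_changed."""
    for label, upper in SIZE_THRESHOLDS:
        if upper is not None and total_changed < upper:
            return label
    # No bound matched, so fall through to the last (unbounded) entry
    return SIZE_THRESHOLDS[-1][0]


for n in (0, 49, 50, 999, 1000):
    print(n, size_label(n))
```

A PR with exactly 50 changed lines therefore lands in `size: S`, not `size: XS`.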
--- a/.github/scripts/pr-labeler.js
+++ b/.github/scripts/pr-labeler.js
@@ -1,278 +0,0 @@
-// Shared helpers for pr_labeler.yml and tag-external-issues.yml.
-//
-// Usage from actions/github-script (requires actions/checkout first):
-//   const { h } = require('./.github/scripts/pr-labeler.js').loadAndInit(github, owner, repo, core);
-
-const fs = require('fs');
-const path = require('path');
-
-function loadConfig() {
-  const configPath = path.join(__dirname, 'pr-labeler-config.json');
-  let raw;
-  try {
-    raw = fs.readFileSync(configPath, 'utf8');
-  } catch (e) {
-    throw new Error(`Failed to read ${configPath}: ${e.message}`);
-  }
-  let config;
-  try {
-    config = JSON.parse(raw);
-  } catch (e) {
-    throw new Error(`Failed to parse pr-labeler-config.json: ${e.message}`);
-  }
-  const required = [
-    'labelColor', 'sizeThresholds', 'fileRules',
-    'typeToLabel', 'scopeToLabel', 'trustedThreshold',
-    'excludedFiles', 'excludedPaths',
-  ];
-  const missing = required.filter(k => !(k in config));
-  if (missing.length > 0) {
-    throw new Error(`pr-labeler-config.json missing required keys: ${missing.join(', ')}`);
-  }
-  return config;
-}
-
-function init(github, owner, repo, config, core) {
-  if (!core) {
-    throw new Error('init() requires a `core` parameter (e.g., from actions/github-script)');
-  }
-  const {
-    trustedThreshold,
-    labelColor,
-    sizeThresholds,
-    scopeToLabel,
-    typeToLabel,
-    fileRules: fileRulesDef,
-    excludedFiles,
-    excludedPaths,
-  } = config;
-
-  const sizeLabels = sizeThresholds.map(t => t.label);
-  const allTypeLabels = [...new Set(Object.values(typeToLabel))];
-  const tierLabels = ['new-contributor', 'trusted-contributor'];
-
-  // ── Label management ──────────────────────────────────────────────
-
-  async function ensureLabel(name, color = labelColor) {
-    try {
-      await github.rest.issues.getLabel({ owner, repo, name });
-    } catch (e) {
-      if (e.status !== 404) throw e;
-      try {
-        await github.rest.issues.createLabel({ owner, repo, name, color });
-      } catch (createErr) {
-        // 422 = label created by a concurrent run between our get and create
-        if (createErr.status !== 422) throw createErr;
-        core.info(`Label "${name}" creation returned 422 (likely already exists)`);
-      }
-    }
-  }
-
-  // ── Size calculation ──────────────────────────────────────────────
-
-  function getSizeLabel(totalChanged) {
-    for (const t of sizeThresholds) {
-      if (t.max != null && totalChanged < t.max) return t.label;
-    }
-    // Last entry has no max — it's the catch-all
-    return sizeThresholds[sizeThresholds.length - 1].label;
-  }
-
-  function computeSize(files) {
-    const excluded = new Set(excludedFiles);
-    const totalChanged = files.reduce((sum, f) => {
-      const p = f.filename ?? '';
-      const base = p.split('/').pop();
-      if (excluded.has(base)) return sum;
-      for (const prefix of excludedPaths) {
-        if (p.startsWith(prefix)) return sum;
-      }
-      return sum + (f.additions ?? 0) + (f.deletions ?? 0);
-    }, 0);
-    return { totalChanged, sizeLabel: getSizeLabel(totalChanged) };
-  }
-
-  // ── File-based labels ─────────────────────────────────────────────
-
-  function buildFileRules() {
-    return fileRulesDef.map((rule, i) => {
-      let test;
-      if (rule.prefix) test = p => p.startsWith(rule.prefix);
-      else if (rule.suffix) test = p => p.endsWith(rule.suffix);
-      else if (rule.exact) test = p => p === rule.exact;
-      else if (rule.pattern) {
-        const re = new RegExp(rule.pattern);
-        test = p => re.test(p);
-      } else {
-        throw new Error(
-          `fileRules[${i}] (label: "${rule.label}") has no recognized matcher ` +
-          `(expected one of: prefix, suffix, exact, pattern)`
-        );
-      }
-      return { label: rule.label, test, skipExcluded: !!rule.skipExcludedFiles };
-    });
-  }
-
-  function matchFileLabels(files, fileRules) {
-    const rules = fileRules || buildFileRules();
-    const excluded = new Set(excludedFiles);
-    const labels = new Set();
-    for (const rule of rules) {
-      // skipExcluded: ignore files whose basename is in the top-level
-      // "excludedFiles" list (e.g. uv.lock) so lockfile-only changes
-      // don't trigger package labels.
-      const candidates = rule.skipExcluded
-        ? files.filter(f => !excluded.has((f.filename ?? '').split('/').pop()))
-        : files;
-      if (candidates.some(f => rule.test(f.filename ?? ''))) {
-        labels.add(rule.label);
-      }
-    }
-    return labels;
-  }
-
-  // ── Title-based labels ────────────────────────────────────────────
-
-  function matchTitleLabels(title) {
-    const labels = new Set();
-    const m = (title ?? '').match(/^(\w+)(?:\(([^)]+)\))?(!)?:/);
-    if (!m) return { labels, type: null, typeLabel: null, scopes: [], breaking: false };
-
-    const type = m[1].toLowerCase();
-    const scopeStr = m[2] ?? '';
-    const breaking = !!m[3];
-
-    const typeLabel = typeToLabel[type] || null;
-    if (typeLabel) labels.add(typeLabel);
-    if (breaking) labels.add('breaking');
-
-    const scopes = scopeStr.split(',').map(s => s.trim()).filter(Boolean);
-    for (const scope of scopes) {
-      const sl = scopeToLabel[scope];
-      if (sl) labels.add(sl);
-    }
-
-    return { labels, type, typeLabel, scopes, breaking };
-  }
-
-  // ── Org membership ────────────────────────────────────────────────
-
-  async function checkMembership(author, userType) {
-    if (userType === 'Bot') {
-      console.log(`${author} is a Bot — treating as internal`);
-      return { isExternal: false };
-    }
-
-    try {
-      const membership = await github.rest.orgs.getMembershipForUser({
-        org: 'langchain-ai',
-        username: author,
-      });
-      const isExternal = membership.data.state !== 'active';
-      console.log(
-        isExternal
-          ? `${author} has pending membership — treating as external`
-          : `${author} is an active member of langchain-ai`,
-      );
-      return { isExternal };
-    } catch (e) {
-      if (e.status === 404) {
-        console.log(`${author} is not a member of langchain-ai`);
-        return { isExternal: true };
-      }
-      // Non-404 errors (rate limit, auth failure, server error) must not
-      // silently default to external — rethrow to fail the step.
-      throw new Error(
-        `Membership check failed for ${author} (${e.status}): ${e.message}`,
-      );
-    }
-  }
-
-  // ── Contributor analysis ──────────────────────────────────────────
-
-  async function getContributorInfo(contributorCache, author, userType) {
-    if (contributorCache.has(author)) return contributorCache.get(author);
-
-    const { isExternal } = await checkMembership(author, userType);
-
-    let mergedCount = null;
-    if (isExternal) {
-      try {
-        const result = await github.rest.search.issuesAndPullRequests({
-          q: `repo:${owner}/${repo} is:pr is:merged author:"${author}"`,
-          per_page: 1,
-        });
-        mergedCount = result?.data?.total_count ?? null;
-      } catch (e) {
-        if (e?.status !== 422) throw e;
-        core.warning(`Search failed for ${author}; skipping tier.`);
-      }
-    }
-
-    const info = { isExternal, mergedCount };
-    contributorCache.set(author, info);
-    return info;
-  }
-
-  // ── Tier label resolution ───────────────────────────────────────────
-
-  async function applyTierLabel(issueNumber, author, { skipNewContributor = false } = {}) {
-    let mergedCount;
-    try {
-      const result = await github.rest.search.issuesAndPullRequests({
-        q: `repo:${owner}/${repo} is:pr is:merged author:"${author}"`,
-        per_page: 1,
-      });
-      mergedCount = result?.data?.total_count;
-    } catch (error) {
-      if (error?.status !== 422) throw error;
-      core.warning(`Search failed for ${author}; skipping tier label.`);
-      return;
-    }
-
-    if (mergedCount == null) {
-      core.warning(`Search response missing total_count for ${author}; skipping tier label.`);
-      return;
-    }
-
-    let tierLabel = null;
-    if (mergedCount >= trustedThreshold) tierLabel = 'trusted-contributor';
-    else if (mergedCount === 0 && !skipNewContributor) tierLabel = 'new-contributor';
-
-    if (tierLabel) {
-      await ensureLabel(tierLabel);
-      await github.rest.issues.addLabels({
-        owner, repo, issue_number: issueNumber, labels: [tierLabel],
-      });
-      console.log(`Applied '${tierLabel}' to #${issueNumber} (${mergedCount} merged PRs)`);
-    } else {
-      console.log(`No tier label for ${author} (${mergedCount} merged PRs)`);
-    }
-
-    return tierLabel;
-  }
-
-  return {
-    ensureLabel,
-    getSizeLabel,
-    computeSize,
-    buildFileRules,
-    matchFileLabels,
-    matchTitleLabels,
-    allTypeLabels,
-    checkMembership,
-    getContributorInfo,
-    applyTierLabel,
-    sizeLabels,
-    tierLabels,
-    trustedThreshold,
-    labelColor,
-  };
-}
-
-function loadAndInit(github, owner, repo, core) {
-  const config = loadConfig();
-  return { config, h: init(github, owner, repo, config, core) };
-}
-
-module.exports = { loadConfig, init, loadAndInit };
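Note on `matchTitleLabels` in the removed `pr-labeler.js`: the title regex accepts `type`, an optional comma-separated `(scope)` list, an optional `!` for breaking changes, and then a colon. Below is a Python sketch of the same parsing, kept in Python so the examples in these notes share one language. `parse_title` and `title_labels` are hypothetical names, the two mapping tables are small subsets of the config shown earlier, and `re.ASCII` is used so that `\w` matches what it does in the JavaScript original.

```python
import re
from typing import Dict, Optional, Set

TITLE_RE = re.compile(r"^(\w+)(?:\(([^)]+)\))?(!)?:", re.ASCII)

# Subsets of typeToLabel / scopeToLabel from pr-labeler-config.json
TYPE_TO_LABEL: Dict[str, str] = {"feat": "feature", "fix": "fix", "ci": "infra"}
SCOPE_TO_LABEL: Dict[str, str] = {
    "core": "core",
    "openai": "openai",
    "deps": "dependencies",
}


def parse_title(title: Optional[str]) -> Optional[dict]:
    """Split a conventional-commit style title into type, scopes, breaking."""
    m = TITLE_RE.match(title or "")
    if m is None:
        return None
    scopes = [s.strip() for s in (m.group(2) or "").split(",") if s.strip()]
    return {
        "type": m.group(1).lower(),
        "scopes": scopes,
        "breaking": m.group(3) is not None,
    }


def title_labels(title: Optional[str]) -> Set[str]:
    """Map a parsed title onto labels; unknown types and scopes are ignored."""
    parsed = parse_title(title)
    if parsed is None:
        return set()
    labels: Set[str] = set()
    if parsed["type"] in TYPE_TO_LABEL:
        labels.add(TYPE_TO_LABEL[parsed["type"]])
    if parsed["breaking"]:
        labels.add("breaking")
    labels.update(SCOPE_TO_LABEL[s] for s in parsed["scopes"] if s in SCOPE_TO_LABEL)
    return labels


print(sorted(title_labels("feat(core, deps)!: add streaming")))
print(sorted(title_labels("Bump version")))
```

Titles that do not start with `type:` or `type(scope):` produce no labels at all, which matches the early return in the JavaScript helper.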
--- a/.github/scripts/test_release_options.py
+++ b/.github/scripts/test_release_options.py
@@ -1,48 +0,0 @@
-"""Verify _release.yml dropdown options match actual package directories."""
-
-from pathlib import Path
-
-import yaml
-
-REPO_ROOT = Path(__file__).resolve().parents[2]
-
-
-def _get_release_options() -> list[str]:
-    workflow = REPO_ROOT / ".github" / "workflows" / "_release.yml"
-    with open(workflow) as f:
-        data = yaml.safe_load(f)
-    try:
-        # PyYAML (YAML 1.1) parses the bare key `on` as boolean True
-        return data[True]["workflow_dispatch"]["inputs"]["working-directory"]["options"]
-    except (KeyError, TypeError) as e:
-        msg = f"Could not find workflow_dispatch options in {workflow}: {e}"
-        raise AssertionError(msg) from e
-
-
-def _get_package_dirs() -> set[str]:
-    libs = REPO_ROOT / "libs"
-    dirs: set[str] = set()
-    # Top-level packages (libs/core, libs/langchain, etc.)
-    for p in libs.iterdir():
-        if p.is_dir() and (p / "pyproject.toml").exists():
-            dirs.add(f"libs/{p.name}")
-    # Partner packages (libs/partners/*)
-    partners = libs / "partners"
-    if partners.exists():
-        for p in partners.iterdir():
-            if p.is_dir() and (p / "pyproject.toml").exists():
-                dirs.add(f"libs/partners/{p.name}")
-    return dirs
-
-
-def test_release_options_match_packages() -> None:
-    options = set(_get_release_options())
-    packages = _get_package_dirs()
-    missing_from_dropdown = packages - options
-    extra_in_dropdown = options - packages
-    assert not missing_from_dropdown, (
-        f"Packages on disk missing from _release.yml dropdown: {missing_from_dropdown}"
-    )
-    assert not extra_in_dropdown, (
-        f"Dropdown options with no matching package directory: {extra_in_dropdown}"
-    )
--- a/.github/tools/git-restore-mtime
+++ b/.github/tools/git-restore-mtime
@@ -81,93 +81,56 @@ import time
 __version__ = "2022.12+dev"

 # Update symlinks only if the platform supports not following them
-UPDATE_SYMLINKS = bool(os.utime in getattr(os, "supports_follow_symlinks", []))
+UPDATE_SYMLINKS = bool(os.utime in getattr(os, 'supports_follow_symlinks', []))

 # Call os.path.normpath() only if not in a POSIX platform (Windows)
-NORMALIZE_PATHS = os.path.sep != "/"
+NORMALIZE_PATHS = (os.path.sep != '/')

 # How many files to process in each batch when re-trying merge commits
 STEPMISSING = 100

 # (Extra) keywords for the os.utime() call performed by touch()
-UTIME_KWS = {} if not UPDATE_SYMLINKS else {"follow_symlinks": False}
+UTIME_KWS = {} if not UPDATE_SYMLINKS else {'follow_symlinks': False}


 # Command-line interface ######################################################

-
 def parse_args():
-    parser = argparse.ArgumentParser(description=__doc__.split("\n---")[0])
+    parser = argparse.ArgumentParser(
+        description=__doc__.split('\n---')[0])

    group = parser.add_mutually_exclusive_group()
-    group.add_argument(
-        "--quiet",
-        "-q",
-        dest="loglevel",
-        action="store_const",
-        const=logging.WARNING,
-        default=logging.INFO,
-        help="Suppress informative messages and summary statistics.",
-    )
-    group.add_argument(
-        "--verbose",
-        "-v",
-        action="count",
-        help="""
+    group.add_argument('--quiet', '-q', dest='loglevel',
+        action="store_const", const=logging.WARNING, default=logging.INFO,
+        help="Suppress informative messages and summary statistics.")
+    group.add_argument('--verbose', '-v', action="count", help="""
        Print additional information for each processed file.
        Specify twice to further increase verbosity.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "--cwd",
-        "-C",
-        metavar="DIRECTORY",
-        help="""
+    parser.add_argument('--cwd', '-C', metavar="DIRECTORY", help="""
        Run as if %(prog)s was started in directory %(metavar)s.
        This affects how --work-tree, --git-dir and PATHSPEC arguments are handled.
        See 'man 1 git' or 'git --help' for more information.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "--git-dir",
-        dest="gitdir",
-        metavar="GITDIR",
-        help="""
+    parser.add_argument('--git-dir', dest='gitdir', metavar="GITDIR", help="""
        Path to the git repository, by default auto-discovered by searching
        the current directory and its parents for a .git/ subdirectory.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "--work-tree",
-        dest="workdir",
-        metavar="WORKTREE",
-        help="""
+    parser.add_argument('--work-tree', dest='workdir', metavar="WORKTREE", help="""
        Path to the work tree root, by default the parent of GITDIR if it's
        automatically discovered, or the current directory if GITDIR is set.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "--force",
-        "-f",
-        default=False,
-        action="store_true",
-        help="""
+    parser.add_argument('--force', '-f', default=False, action="store_true", help="""
        Force updating files with uncommitted modifications.
        Untracked files and uncommitted deletions, renames and additions are
        always ignored.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "--merge",
-        "-m",
-        default=False,
-        action="store_true",
-        help="""
+    parser.add_argument('--merge', '-m', default=False, action="store_true", help="""
        Include merge commits.
        Leads to more recent times and more files per commit, thus with the same
        time, which may or may not be what you want.
@@ -175,130 +138,71 @@ def parse_args():
        are found sooner, which can improve performance, sometimes substantially.
        But as merge commits are usually huge, processing them may also take longer.
        By default, merge commits are only used for files missing from regular commits.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "--first-parent",
-        default=False,
-        action="store_true",
-        help="""
+    parser.add_argument('--first-parent', default=False, action="store_true", help="""
        Consider only the first parent, the "main branch", when evaluating merge commits.
        Only effective when merge commits are processed, either when --merge is
        used or when finding missing files after the first regular log search.
        See --skip-missing.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "--skip-missing",
-        "-s",
-        dest="missing",
-        default=True,
-        action="store_false",
-        help="""
+    parser.add_argument('--skip-missing', '-s', dest="missing", default=True,
+        action="store_false", help="""
        Do not try to find missing files.
        If merge commits were not evaluated with --merge and some files were
        not found in regular commits, by default %(prog)s searches for these
        files again in the merge commits.
        This option disables this retry, so files found only in merge commits
        will not have their timestamp updated.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "--no-directories",
-        "-D",
-        dest="dirs",
-        default=True,
-        action="store_false",
-        help="""
+    parser.add_argument('--no-directories', '-D', dest='dirs', default=True,
+        action="store_false", help="""
        Do not update directory timestamps.
        By default, use the time of its most recently created, renamed or deleted file.
        Note that just modifying a file will NOT update its directory time.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "--test",
-        "-t",
-        default=False,
-        action="store_true",
-        help="Test run: do not actually update any file timestamp.",
-    )
+    parser.add_argument('--test', '-t', default=False, action="store_true",
+        help="Test run: do not actually update any file timestamp.")

-    parser.add_argument(
-        "--commit-time",
-        "-c",
-        dest="commit_time",
-        default=False,
-        action="store_true",
-        help="Use commit time instead of author time.",
-    )
+    parser.add_argument('--commit-time', '-c', dest='commit_time', default=False,
+        action='store_true', help="Use commit time instead of author time.")

-    parser.add_argument(
-        "--oldest-time",
-        "-o",
-        dest="reverse_order",
-        default=False,
-        action="store_true",
-        help="""
+    parser.add_argument('--oldest-time', '-o', dest='reverse_order', default=False,
+        action='store_true', help="""
        Update times based on the oldest, instead of the most recent commit of a file.
        This reverses the order in which the git log is processed to emulate a
        file "creation" date. Note this will be inaccurate for files deleted and
        re-created at later dates.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "--skip-older-than",
-        metavar="SECONDS",
-        type=int,
-        help="""
+    parser.add_argument('--skip-older-than', metavar='SECONDS', type=int, help="""
        Ignore files that are currently older than %(metavar)s.
        Useful in workflows that assume such files already have a correct timestamp,
        as it may improve performance by processing fewer files.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "--skip-older-than-commit",
-        "-N",
-        default=False,
-        action="store_true",
-        help="""
+    parser.add_argument('--skip-older-than-commit', '-N', default=False,
+        action='store_true', help="""
        Ignore files older than the timestamp it would be updated to.
        Such files may be considered "original", likely in the author's repository.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "--unique-times",
-        default=False,
-        action="store_true",
-        help="""
+    parser.add_argument('--unique-times', default=False, action="store_true", help="""
        Set the microseconds to a unique value per commit.
        Allows telling apart changes that would otherwise have identical timestamps,
        as git's time accuracy is in seconds.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "pathspec",
-        nargs="*",
-        metavar="PATHSPEC",
-        help="""
+    parser.add_argument('pathspec', nargs='*', metavar='PATHSPEC', help="""
        Only modify paths matching %(metavar)s, relative to current directory.
        By default, update all but untracked files and submodules.
-        """,
-    )
+        """)

-    parser.add_argument(
-        "--version",
-        "-V",
-        action="version",
-        version="%(prog)s version {version}".format(version=get_version()),
-    )
+    parser.add_argument('--version', '-V', action='version',
+        version='%(prog)s version {version}'.format(version=get_version()))

    args_ = parser.parse_args()
    if args_.verbose:
@@ -308,18 +212,17 @@ def parse_args():


 def get_version(version=__version__):
-    if not version.endswith("+dev"):
+    if not version.endswith('+dev'):
        return version
    try:
        cwd = os.path.dirname(os.path.realpath(__file__))
-        return Git(cwd=cwd, errors=False).describe().lstrip("v")
+        return Git(cwd=cwd, errors=False).describe().lstrip('v')
    except Git.Error:
-        return "-".join((version, "unknown"))
+        return '-'.join((version, "unknown"))


 # Helper functions ############################################################

-
 def setup_logging():
    """Add TRACE logging level and corresponding method, return the root logger"""
    logging.TRACE = TRACE = logging.DEBUG // 2
@@ -352,13 +255,11 @@ def normalize(path):
    if path and path[0] == '"':
        # Python 2: path = path[1:-1].decode("string-escape")
        # Python 3: https://stackoverflow.com/a/46650050/624066
-        path = (
-            path[1:-1]  # Remove enclosing double quotes
-            .encode("latin1")  # Convert to bytes, required by 'unicode-escape'
-            .decode("unicode-escape")  # Perform the actual octal-escaping decode
-            .encode("latin1")  # 1:1 mapping to bytes, UTF-8 encoded
-            .decode("utf8", "surrogateescape")
-        )  # Decode from UTF-8
+        path = (path[1:-1]                 # Remove enclosing double quotes
+                .encode('latin1')          # Convert to bytes, required by 'unicode-escape'
+                .decode('unicode-escape')  # Perform the actual octal-escaping decode
+                .encode('latin1')          # 1:1 mapping to bytes, UTF-8 encoded
+                .decode('utf8', 'surrogateescape'))  # Decode from UTF-8
    if NORMALIZE_PATHS:
        # Make sure the slash matches the OS; for Windows we need a backslash
        path = os.path.normpath(path)
@@ -381,12 +282,12 @@ def touch_ns(path, mtime_ns):

 def isodate(secs: int):
    # time.localtime() accepts floats, but discards fractional part
-    return time.strftime("%Y-%m-%d %H:%M:%S", time.localtime(secs))
+    return time.strftime('%Y-%m-%d %H:%M:%S', time.localtime(secs))


 def isodate_ns(ns: int):
    # for integers fromtimestamp() is equivalent and ~16% slower than isodate()
-    return datetime.datetime.fromtimestamp(ns / 1000000000).isoformat(sep=" ")
+    return datetime.datetime.fromtimestamp(ns / 1000000000).isoformat(sep=' ')


 def get_mtime_ns(secs: int, idx: int):
@@ -404,49 +305,35 @@ def get_mtime_path(path):

 # Git class and parse_log(), the heart of the script ##########################

-
 class Git:
    def __init__(self, workdir=None, gitdir=None, cwd=None, errors=True):
-        self.gitcmd = ["git"]
+        self.gitcmd = ['git']
        self.errors = errors
        self._proc = None
-        if workdir:
-            self.gitcmd.extend(("--work-tree", workdir))
-        if gitdir:
-            self.gitcmd.extend(("--git-dir", gitdir))
-        if cwd:
-            self.gitcmd.extend(("-C", cwd))
+        if workdir: self.gitcmd.extend(('--work-tree', workdir))
+        if gitdir:  self.gitcmd.extend(('--git-dir',   gitdir))
+        if cwd:     self.gitcmd.extend(('-C',          cwd))
        self.workdir, self.gitdir = self._get_repo_dirs()

    def ls_files(self, paths: list = None):
-        return (normalize(_) for _ in self._run("ls-files --full-name", paths))
+        return (normalize(_) for _ in self._run('ls-files --full-name', paths))

    def ls_dirty(self, force=False):
-        return (
-            normalize(_[3:].split(" -> ", 1)[-1])
-            for _ in self._run("status --porcelain")
-            if _[:2] != "??" and (not force or (_[0] in ("R", "A") or _[1] == "D"))
-        )
+        return (normalize(_[3:].split(' -> ', 1)[-1])
+                for _ in self._run('status --porcelain')
+                if _[:2] != '??' and (not force or (_[0] in ('R', 'A')
+                                                    or _[1] == 'D')))

-    def log(
-        self,
-        merge=False,
-        first_parent=False,
-        commit_time=False,
-        reverse_order=False,
-        paths: list = None,
-    ):
-        cmd = "whatchanged --pretty={}".format("%ct" if commit_time else "%at")
-        if merge:
-            cmd += " -m"
-        if first_parent:
-            cmd += " --first-parent"
-        if reverse_order:
-            cmd += " --reverse"
+    def log(self, merge=False, first_parent=False, commit_time=False,
+            reverse_order=False, paths: list = None):
+        cmd = 'whatchanged --pretty={}'.format('%ct' if commit_time else '%at')
+        if merge:         cmd += ' -m'
+        if first_parent:  cmd += ' --first-parent'
+        if reverse_order: cmd += ' --reverse'
        return self._run(cmd, paths)

    def describe(self):
-        return self._run("describe --tags", check=True)[0]
+        return self._run('describe --tags', check=True)[0]

    def terminate(self):
        if self._proc is None:
@@ -458,22 +345,18 @@ class Git:
            pass

    def _get_repo_dirs(self):
-        return (
-            os.path.normpath(_)
-            for _ in self._run(
-                "rev-parse --show-toplevel --absolute-git-dir", check=True
-            )
-        )
+        return (os.path.normpath(_) for _ in
+            self._run('rev-parse --show-toplevel --absolute-git-dir', check=True))

    def _run(self, cmdstr: str, paths: list = None, output=True, check=False):
        cmdlist = self.gitcmd + shlex.split(cmdstr)
        if paths:
-            cmdlist.append("--")
+            cmdlist.append('--')
            cmdlist.extend(paths)
-        popen_args = dict(universal_newlines=True, encoding="utf8")
+        popen_args = dict(universal_newlines=True, encoding='utf8')
        if not self.errors:
-            popen_args["stderr"] = subprocess.DEVNULL
-        log.trace("Executing: %s", " ".join(cmdlist))
+            popen_args['stderr'] = subprocess.DEVNULL
+        log.trace("Executing: %s", ' '.join(cmdlist))
        if not output:
            return subprocess.call(cmdlist, **popen_args)
        if check:
@@ -496,26 +379,30 @@ def parse_log(filelist, dirlist, stats, git, merge=False, filterlist=None):
    mtime = 0
    datestr = isodate(0)
    for line in git.log(
-        merge, args.first_parent, args.commit_time, args.reverse_order, filterlist
+            merge,
+            args.first_parent,
+            args.commit_time,
+            args.reverse_order,
+            filterlist
    ):
-        stats["loglines"] += 1
+        stats['loglines'] += 1

        # Blank line between Date and list of files
        if not line:
            continue

        # Date line
-        if line[0] != ":":  # Faster than `not line.startswith(':')`
-            stats["commits"] += 1
+        if line[0] != ':':  # Faster than `not line.startswith(':')`
+            stats['commits'] += 1
            mtime = int(line)
            if args.unique_times:
-                mtime = get_mtime_ns(mtime, stats["commits"])
+                mtime = get_mtime_ns(mtime, stats['commits'])
            if args.debug:
                datestr = isodate(mtime)
            continue

        # File line: three tokens if it describes a renaming, otherwise two
-        tokens = line.split("\t")
+        tokens = line.split('\t')

        # Possible statuses:
        # M: Modified (content changed)
@@ -524,7 +411,7 @@ def parse_log(filelist, dirlist, stats, git, merge=False, filterlist=None):
        # T: Type changed: to/from regular file, symlinks, submodules
        # R099: Renamed (moved), with % of unchanged content. 100 = pure rename
        # Not possible in log: C=Copied, U=Unmerged, X=Unknown, B=pairing Broken
-        status = tokens[0].split(" ")[-1]
+        status = tokens[0].split(' ')[-1]
        file = tokens[-1]

        # Handles non-ASCII chars and OS path separator
@@ -532,76 +419,56 @@ def parse_log(filelist, dirlist, stats, git, merge=False, filterlist=None):

        def do_file():
            if args.skip_older_than_commit and get_mtime_path(file) <= mtime:
-                stats["skip"] += 1
+                stats['skip'] += 1
                return
            if args.debug:
-                log.debug(
-                    "%d\t%d\t%d\t%s\t%s",
-                    stats["loglines"],
-                    stats["commits"],
-                    stats["files"],
-                    datestr,
-                    file,
-                )
+                log.debug("%d\t%d\t%d\t%s\t%s",
+                          stats['loglines'], stats['commits'], stats['files'],
+                          datestr, file)
            try:
                touch(os.path.join(git.workdir, file), mtime)
-                stats["touches"] += 1
+                stats['touches'] += 1
            except Exception as e:
                log.error("ERROR: %s: %s", e, file)
-                stats["errors"] += 1
+                stats['errors'] += 1

        def do_dir():
            if args.debug:
-                log.debug(
-                    "%d\t%d\t-\t%s\t%s",
-                    stats["loglines"],
-                    stats["commits"],
-                    datestr,
-                    "{}/".format(dirname or "."),
-                )
+                log.debug("%d\t%d\t-\t%s\t%s",
+                          stats['loglines'], stats['commits'],
+                          datestr, "{}/".format(dirname or '.'))
            try:
                touch(os.path.join(git.workdir, dirname), mtime)
-                stats["dirtouches"] += 1
+                stats['dirtouches'] += 1
            except Exception as e:
                log.error("ERROR: %s: %s", e, dirname)
-                stats["direrrors"] += 1
+                stats['direrrors'] += 1

        if file in filelist:
-            stats["files"] -= 1
+            stats['files'] -= 1
            filelist.remove(file)
            do_file()

-        if args.dirs and status in ("A", "D"):
+        if args.dirs and status in ('A', 'D'):
            dirname = os.path.dirname(file)
            if dirname in dirlist:
                dirlist.remove(dirname)
                do_dir()

        # All files done?
-        if not stats["files"]:
+        if not stats['files']:
            git.terminate()
            return


 # Main Logic ##################################################################

-
 def main():
    start = time.time()  # yes, Wall time. CPU time is not realistic for users.
-    stats = {
-        _: 0
-        for _ in (
-            "loglines",
-            "commits",
-            "touches",
-            "skip",
-            "errors",
-            "dirtouches",
-            "direrrors",
-        )
-    }
+    stats = {_: 0 for _ in ('loglines', 'commits', 'touches', 'skip', 'errors',
+                            'dirtouches', 'direrrors')}

-    logging.basicConfig(level=args.loglevel, format="%(message)s")
+    logging.basicConfig(level=args.loglevel, format='%(message)s')
    log.trace("Arguments: %s", args)

    # First things first: Where and Who are we?
@@ -632,16 +499,13 @@ def main():

            # Symlink (to file, to dir or broken - git handles the same way)
            if not UPDATE_SYMLINKS and os.path.islink(fullpath):
-                log.warning(
-                    "WARNING: Skipping symlink, no OS support for updates: %s", path
-                )
+                log.warning("WARNING: Skipping symlink, no OS support for updates: %s",
+                            path)
                continue

            # skip files which are older than given threshold
-            if (
-                args.skip_older_than
-                and start - get_mtime_path(fullpath) > args.skip_older_than
-            ):
+            if (args.skip_older_than
+                    and start - get_mtime_path(fullpath) > args.skip_older_than):
                continue

            # Always add files relative to worktree root
@@ -655,17 +519,15 @@ def main():
    else:
        dirty = set(git.ls_dirty())
        if dirty:
-            log.warning(
-                "WARNING: Modified files in the working directory were ignored."
-                "\nTo include such files, commit your changes or use --force."
-            )
+            log.warning("WARNING: Modified files in the working directory were ignored."
+                "\nTo include such files, commit your changes or use --force.")
            filelist -= dirty

    # Build dir list to be processed
    dirlist = set(os.path.dirname(_) for _ in filelist) if args.dirs else set()

-    stats["totalfiles"] = stats["files"] = len(filelist)
-    log.info("{0:,} files to be processed in work dir".format(stats["totalfiles"]))
+    stats['totalfiles'] = stats['files'] = len(filelist)
+    log.info("{0:,} files to be processed in work dir".format(stats['totalfiles']))

    if not filelist:
        # Nothing to do. Exit silently and without errors, just like git does
@@ -682,18 +544,10 @@ def main():
        if args.missing and not args.merge:
            filterlist = list(filelist)
            missing = len(filterlist)
-            log.info(
-                "{0:,} files not found in log, trying merge commits".format(missing)
-            )
+            log.info("{0:,} files not found in log, trying merge commits".format(missing))
            for i in range(0, missing, STEPMISSING):
-                parse_log(
-                    filelist,
-                    dirlist,
-                    stats,
-                    git,
-                    merge=True,
-                    filterlist=filterlist[i : i + STEPMISSING],
-                )
+                parse_log(filelist, dirlist, stats, git,
+                          merge=True, filterlist=filterlist[i:i + STEPMISSING])

        # Still missing some?
        for file in filelist:
@@ -702,33 +556,29 @@ def main():
    # Final statistics
    # Suggestion: use git-log --before=mtime to brag about skipped log entries
    def log_info(msg, *a, width=13):
-        ifmt = "{:%d,}" % (width,)  # not using 'n' for consistency with ffmt
-        ffmt = "{:%d,.2f}" % (width,)
+        ifmt = '{:%d,}'    % (width,)  # not using 'n' for consistency with ffmt
+        ffmt = '{:%d,.2f}' % (width,)
        # %-formatting lacks a thousand separator, must pre-render with .format()
-        log.info(msg.replace("%d", ifmt).replace("%f", ffmt).format(*a))
+        log.info(msg.replace('%d', ifmt).replace('%f', ffmt).format(*a))

    log_info(
-        "Statistics:\n%f seconds\n%d log lines processed\n%d commits evaluated",
-        time.time() - start,
-        stats["loglines"],
-        stats["commits"],
-    )
+        "Statistics:\n"
+        "%f seconds\n"
+        "%d log lines processed\n"
+        "%d commits evaluated",
+        time.time() - start, stats['loglines'], stats['commits'])

    if args.dirs:
-        if stats["direrrors"]:
-            log_info("%d directory update errors", stats["direrrors"])
-        log_info("%d directories updated", stats["dirtouches"])
+        if stats['direrrors']: log_info("%d directory update errors", stats['direrrors'])
+        log_info("%d directories updated", stats['dirtouches'])

-    if stats["touches"] != stats["totalfiles"]:
-        log_info("%d files", stats["totalfiles"])
-    if stats["skip"]:
-        log_info("%d files skipped", stats["skip"])
-    if stats["files"]:
-        log_info("%d files missing", stats["files"])
-    if stats["errors"]:
-        log_info("%d file update errors", stats["errors"])
+    if stats['touches'] != stats['totalfiles']:
+                        log_info("%d files",              stats['totalfiles'])
+    if stats['skip']:   log_info("%d files skipped",      stats['skip'])
+    if stats['files']:  log_info("%d files missing",      stats['files'])
+    if stats['errors']: log_info("%d file update errors", stats['errors'])

-    log_info("%d files updated", stats["touches"])
+    log_info("%d files updated", stats['touches'])

    if args.test:
        log.info("TEST RUN - No files modified!")
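
The `normalize()` hunk above decodes paths that git prints in its quoted form, where non-ASCII bytes appear as octal escapes inside double quotes. As a rough standalone illustration of that encode/decode chain (the function name `unquote_git_path` and the sample path are assumptions made for this sketch, not part of the script):

```python
def unquote_git_path(path: str) -> str:
    """Decode a path in git's quoted, octal-escaped form; return plain paths unchanged."""
    if not (path and path[0] == '"'):
        return path
    return (
        path[1:-1]                          # drop the enclosing double quotes
        .encode("latin1")                   # to bytes, as 'unicode-escape' expects
        .decode("unicode-escape")           # turn \303 style escapes into code points
        .encode("latin1")                   # map code points 1:1 back to raw bytes
        .decode("utf8", "surrogateescape")  # interpret those bytes as UTF-8
    )


# Two octal escapes, \303 and \251, are the UTF-8 bytes of "é".
print(unquote_git_path('"caf\\303\\251.txt"'))  # prints café.txt
```

The `surrogateescape` error handler keeps the decode from raising on byte sequences that are not valid UTF-8, which matters because file names on disk are not guaranteed to be well-formed text.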
--- a/.github/workflows/_compile_integration_test.yml
+++ b/.github/workflows/_compile_integration_test.yml
@@ -1,65 +0,0 @@
-# Validates that a package's integration tests compile without syntax or import errors.
-#
-# (If an integration test fails to compile, it won't run.)
-#
-# Called as part of check_diffs.yml workflow
-#
-# Runs pytest with compile marker to check syntax/imports.
-
-name: "🔗 Compile Integration Tests"
-
-on:
-  workflow_call:
-    inputs:
-      working-directory:
-        required: true
-        type: string
-        description: "From which folder this pipeline executes"
-      python-version:
-        required: true
-        type: string
-        description: "Python version to use"
-
-permissions:
-  contents: read
-
-env:
-  UV_FROZEN: "true"
-
-jobs:
-  build:
-    defaults:
-      run:
-        working-directory: ${{ inputs.working-directory }}
-    runs-on: ubuntu-latest
-    timeout-minutes: 20
-    name: "Python ${{ inputs.python-version }}"
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: "🐍 Set up Python ${{ inputs.python-version }} + UV"
-        uses: "./.github/actions/uv_setup"
-        with:
-          python-version: ${{ inputs.python-version }}
-          cache-suffix: compile-integration-tests-${{ inputs.working-directory }}
-          working-directory: ${{ inputs.working-directory }}
-
-      - name: "📦 Install Integration Dependencies"
-        shell: bash
-        run: uv sync --group test --group test_integration
-
-      - name: "🔗 Check Integration Tests Compile"
-        shell: bash
-        run: uv run pytest -m compile tests/integration_tests
-
-      - name: "🧹 Verify Clean Working Directory"
-        shell: bash
-        run: |
-          set -eu
-
-          STATUS="$(git status)"
-          echo "$STATUS"
-
-          # grep will exit non-zero if the target message isn't found,
-          # and `set -e` above will cause the step to fail.
-          echo "$STATUS" | grep 'nothing to commit, working tree clean'
--- a/.github/workflows/_lint.yml
+++ b/.github/workflows/_lint.yml
@@ -1,11 +1,4 @@
-# Runs linting.
-#
-# Uses the package's Makefile to run the checks, specifically the
-# `lint_package` and `lint_tests` targets.
-#
-# Called as part of check_diffs.yml workflow.
-
-name: "🧹 Linting"
+name: lint

 on:
  workflow_call:
@@ -14,68 +7,144 @@ on:
        required: true
        type: string
        description: "From which folder this pipeline executes"
-      python-version:
-        required: true
-        type: string
-        description: "Python version to use"
-
-permissions:
-  contents: read

 env:
+  POETRY_VERSION: "1.6.1"
  WORKDIR: ${{ inputs.working-directory == '' && '.' || inputs.working-directory }}

-  # This env var allows us to get inline annotations when ruff has complaints.
-  RUFF_OUTPUT_FORMAT: github
-
-  UV_FROZEN: "true"
-
 jobs:
-  # Linting job - runs quality checks on package and test code
  build:
-    name: "Python ${{ inputs.python-version }}"
    runs-on: ubuntu-latest
-    timeout-minutes: 20
+    env:
+      # This number is set "by eye": we want it to be big enough
+      # so that it's bigger than the number of commits in any reasonable PR,
+      # and also as small as possible since increasing the number makes
+      # the initial `git fetch` slower.
+      FETCH_DEPTH: 50
+    strategy:
+      matrix:
+        # Only lint on the min and max supported Python versions.
+        # It's extremely unlikely that there's a lint issue on any version in between
+        # that doesn't show up on the min or max versions.
+        #
+        # GitHub rate-limits how many jobs can be running at any one time.
+        # Starting new jobs is also relatively slow,
+        # so linting on fewer versions makes CI faster.
+        python-version:
+          - "3.8"
+          - "3.12"
    steps:
-      - name: "📋 Checkout Code"
-        uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: "🐍 Set up Python ${{ inputs.python-version }} + UV"
-        uses: "./.github/actions/uv_setup"
+      - uses: actions/checkout@v3
        with:
-          python-version: ${{ inputs.python-version }}
-          cache-suffix: lint-${{ inputs.working-directory }}
+          # Fetch the last FETCH_DEPTH commits, so the mtime-changing script
+          # can accurately set the mtimes of files modified in the last FETCH_DEPTH commits.
+          fetch-depth: ${{ env.FETCH_DEPTH }}
+      - name: Restore workdir file mtimes to last-edited commit date
+        id: restore-mtimes
+        # This is needed to make black caching work.
+        # Black's cache uses file (mtime, size) to check whether a lookup is a cache hit.
+        # Without this command, files in the repo would have the current time as the modified time,
+        # since the previous action step just created them.
+        # This command resets the mtime to the last time the files were modified in git instead,
+        # which is a high-quality and stable representation of the last modification date.
+        run: |
+          # Important considerations:
+          # - These commands run at the base of the repo, since we never `cd` to the `WORKDIR`.
+          # - We only want to alter mtimes for Python files, since that's all black checks.
+          # - We don't need to alter mtimes for directories, since black doesn't look at those.
+          # - We also only alter mtimes inside the `WORKDIR` since that's all we'll lint.
+          # - This should run before `poetry install`, because poetry's venv also contains
+          #   Python files, and we don't want to alter their mtimes since they aren't linted.
+
+          # Ensure we fail on non-zero exits and on undefined variables.
+          # Also print executed commands, for easier debugging.
+          set -eux
+
+          # Restore the mtimes of Python files in the workdir based on git history.
+          .github/tools/git-restore-mtime --no-directories "$WORKDIR/**/*.py"
+
+          # Since CI only does a partial fetch (to `FETCH_DEPTH`) for efficiency,
+          # the local git repo doesn't have full history. There are probably files
+          # that were last modified in a commit *older than* the oldest fetched commit.
+          # After `git-restore-mtime`, such files have a mtime set to the oldest fetched commit.
+          #
+          # As new commits get added, that timestamp will keep moving forward.
+          # If left unchanged, this will make `black` think that the files were edited
+          # more recently than its cache suggests. Instead, we can set their mtime
+          # to a fixed date in the far past that won't change and won't cause cache misses in black.
+          #
+          # For all workdir Python files whose mtime is at or before the oldest fetched commit's time,
+          # make their mtime be 2000-01-01 00:00:00.
+          OLDEST_COMMIT="$(git log --reverse '--pretty=format:%H' | head -1)"
+          OLDEST_COMMIT_TIME="$(git show -s '--format=%ai' "$OLDEST_COMMIT")"
+          find "$WORKDIR" -name '*.py' -type f -not -newermt "$OLDEST_COMMIT_TIME" -exec touch -c -m -t '200001010000' '{}' '+'
+
+          echo "oldest-commit=$OLDEST_COMMIT" >> "$GITHUB_OUTPUT"
+
+      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}
+        uses: "./.github/actions/poetry_setup"
+        with:
+          python-version: ${{ matrix.python-version }}
+          poetry-version: ${{ env.POETRY_VERSION }}
          working-directory: ${{ inputs.working-directory }}
+          cache-key: lint-with-extras

-      # - name: "🔒 Verify Lockfile is Up-to-Date"
-      #   working-directory: ${{ inputs.working-directory }}
-      #   run: |
-      #     unset UV_FROZEN
-      #     uv lock --check
-
-      - name: "📦 Install Lint & Typing Dependencies"
+      - name: Check Poetry File
+        shell: bash
        working-directory: ${{ inputs.working-directory }}
        run: |
-          uv sync --group lint --group typing
+          poetry check

-      - name: "🔍 Analyze Package Code with Linters"
+      - name: Check lock file
+        shell: bash
        working-directory: ${{ inputs.working-directory }}
        run: |
-          make lint_package
+          poetry lock --check

-      - name: "📦 Install Test Dependencies (non-partners)"
-        # (For directories NOT starting with libs/partners/)
-        if: ${{ ! startsWith(inputs.working-directory, 'libs/partners/') }}
+      - name: Install dependencies
+        # Also installs dev/lint/test/typing dependencies, to ensure we have
+        # type hints for as many of our libraries as possible.
+        # This helps catch errors that require dependencies to be spotted, for example:
+        # https://github.com/langchain-ai/langchain/pull/10249/files#diff-935185cd488d015f026dcd9e19616ff62863e8cde8c0bee70318d3ccbca98341
+        #
+        # If you change this configuration, make sure to change the `cache-key`
+        # in the `poetry_setup` action above to stop using the old cache.
+        # It doesn't matter how you change it: any change will cause a cache-bust.
        working-directory: ${{ inputs.working-directory }}
        run: |
-          uv sync --inexact --group test
-      - name: "📦 Install Test Dependencies"
-        if: ${{ startsWith(inputs.working-directory, 'libs/partners/') }}
-        working-directory: ${{ inputs.working-directory }}
-        run: |
-          uv sync --inexact --group test --group test_integration
+          poetry install --with dev,lint,test,typing

-      - name: "🔍 Analyze Test Code with Linters"
+      - name: Install langchain editable
        working-directory: ${{ inputs.working-directory }}
+        if: ${{ inputs.working-directory != 'libs/langchain' }}
        run: |
-          make lint_tests
+          pip install -e ../langchain
+
+      - name: Restore black cache
+        uses: actions/cache@v3
+        env:
+          CACHE_BASE: black-${{ runner.os }}-${{ runner.arch }}-py${{ matrix.python-version }}-${{ inputs.working-directory }}-${{ hashFiles(format('{0}/poetry.lock', env.WORKDIR)) }}
+          SEGMENT_DOWNLOAD_TIMEOUT_MIN: "1"
+        with:
+          path: |
+            ${{ env.WORKDIR }}/.black_cache
+          key: ${{ env.CACHE_BASE }}-${{ steps.restore-mtimes.outputs.oldest-commit }}
+          restore-keys:
+            # If we can't find an exact match for our cache key, accept any with this prefix.
+            ${{ env.CACHE_BASE }}-
+
+      - name: Get .mypy_cache to speed up mypy
+        uses: actions/cache@v3
+        env:
+          SEGMENT_DOWNLOAD_TIMEOUT_MIN: "2"
+        with:
+          path: |
+            ${{ env.WORKDIR }}/.mypy_cache
+          key: mypy-${{ runner.os }}-${{ runner.arch }}-py${{ matrix.python-version }}-${{ inputs.working-directory }}-${{ hashFiles(format('{0}/poetry.lock', env.WORKDIR)) }}
+
+      - name: Analysing the code with our lint
+        working-directory: ${{ inputs.working-directory }}
+        env:
+          BLACK_CACHE_DIR: .black_cache
+        run: |
+          make lint
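
The mtime clamping performed in the `restore-mtimes` step above can be tried in isolation. The following is a minimal sketch, assuming GNU `find` and `touch`; the function name `clamp_old_mtimes` and its two arguments are hypothetical and are not part of the workflow.

```shell
# Hypothetical helper (not in the workflow): set the mtime of every *.py file
# under DIR whose mtime is at or before CUTOFF to 2000-01-01 00:00 local time.
# Mirrors the `find ... -not -newermt ... -exec touch ...` line in the step.
clamp_old_mtimes() {
  dir="$1"     # directory to scan, e.g. "$WORKDIR"
  cutoff="$2"  # timestamp string, e.g. the oldest fetched commit's time

  # -not -newermt: keep files that are NOT newer than the cutoff.
  # touch -c: never create files; -m: change only the modification time.
  find "$dir" -name '*.py' -type f -not -newermt "$cutoff" \
    -exec touch -c -m -t '200001010000' '{}' '+'
}
```

Because the target date is fixed, the clamped files keep the same mtime from run to run even as the oldest fetched commit moves forward, which is what the step's comments rely on to avoid cache misses in black.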
--- a/.github/workflows/_pydantic_compatibility.yml
+++ b/.github/workflows/_pydantic_compatibility.yml
@@ -0,0 +1,94 @@
+name: pydantic v1/v2 compatibility
+
+on:
+  workflow_call:
+    inputs:
+      working-directory:
+        required: true
+        type: string
+        description: "From which folder this pipeline executes"
+
+env:
+  POETRY_VERSION: "1.6.1"
+
+jobs:
+  build:
+    defaults:
+      run:
+        working-directory: ${{ inputs.working-directory }}
+    runs-on: ubuntu-latest
+    strategy:
+      matrix:
+        python-version:
+          - "3.8"
+          - "3.9"
+          - "3.10"
+          - "3.11"
+          - "3.12"
+    name: Pydantic v1/v2 compatibility - Python ${{ matrix.python-version }}
+    steps:
+      - uses: actions/checkout@v3
+
+      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}
+        uses: "./.github/actions/poetry_setup"
+        with:
+          python-version: ${{ matrix.python-version }}
+          poetry-version: ${{ env.POETRY_VERSION }}
+          working-directory: ${{ inputs.working-directory }}
+          cache-key: pydantic-cross-compat
+
+      - name: Install dependencies
+        shell: bash
+        run: poetry install
+
+      - name: Install the opposite major version of pydantic
+        # If normal tests use pydantic v1, here we'll use v2, and vice versa.
+        shell: bash
+        run: |
+          # Determine the major part of pydantic version
+          REGULAR_VERSION=$(poetry run python -c "import pydantic; print(pydantic.__version__)" | cut -d. -f1)
+
+          if [[ "$REGULAR_VERSION" == "1" ]]; then
+            PYDANTIC_DEP=">=2.1,<3"
+            TEST_WITH_VERSION="2"
+          elif [[ "$REGULAR_VERSION" == "2" ]]; then
+            PYDANTIC_DEP="<2"
+            TEST_WITH_VERSION="1"
+          else
+            echo "Unexpected pydantic major version '$REGULAR_VERSION', cannot determine which version to use for cross-compatibility test."
+            exit 1
+          fi
+
+          # Install via `pip` instead of `poetry add` to avoid changing lockfile,
+          # which would prevent caching from working: the cache would get saved
+          # to a different key than where it gets loaded from.
+          poetry run pip install "pydantic${PYDANTIC_DEP}"
+
+          # Ensure that the correct pydantic is installed now.
+          echo "Checking pydantic version... Expecting ${TEST_WITH_VERSION}"
+
+          # Determine the major part of pydantic version
+          CURRENT_VERSION=$(poetry run python -c "import pydantic; print(pydantic.__version__)" | cut -d. -f1)
+
+          # Check that the major part of the pydantic version is as expected;
+          # if not, raise an error.
+          if [[ "$CURRENT_VERSION" != "$TEST_WITH_VERSION" ]]; then
+            echo "Error: expected pydantic version ${TEST_WITH_VERSION} to have been installed, but found: ${CURRENT_VERSION}"
+            exit 1
+          fi
+          echo "Found pydantic version ${CURRENT_VERSION}, as expected"
+      - name: Run pydantic compatibility tests
+        shell: bash
+        run: make test
+
+      - name: Ensure the tests did not create any additional files
+        shell: bash
+        run: |
+          set -eu
+
+          STATUS="$(git status)"
+          echo "$STATUS"
+
+          # grep will exit non-zero if the target message isn't found,
+          # and `set -e` above will cause the step to fail.
+          echo "$STATUS" | grep 'nothing to commit, working tree clean'
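
The version flip in the "Install the opposite major version of pydantic" step can be expressed as a small function, which makes the mapping easy to check without installing anything. This is a sketch only; the function name `opposite_pydantic_spec` is hypothetical, and the version specifiers are copied from the step above.

```shell
# Hypothetical helper (not in the workflow): given the pydantic major version
# used by the regular tests, print the pip version specifier for the opposite
# major version. Returns non-zero for any other input.
opposite_pydantic_spec() {
  case "$1" in
    1) echo '>=2.1,<3' ;;  # regular tests use v1, so cross-test with v2
    2) echo '<2' ;;        # regular tests use v2, so cross-test with v1
    *)
      echo "Unexpected pydantic major version '$1'" >&2
      return 1
      ;;
  esac
}

# Possible usage inside the step (assumes REGULAR_VERSION is already set):
#   PYDANTIC_DEP="$(opposite_pydantic_spec "$REGULAR_VERSION")"
#   poetry run pip install "pydantic${PYDANTIC_DEP}"
```

Assigning the result to a variable first, rather than expanding it inline in the `pip install` argument, lets a non-zero return from the function stop the script when `set -e` is in effect.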
--- a/.github/workflows/_refresh_model_profiles.yml
+++ b/.github/workflows/_refresh_model_profiles.yml
@@ -1,202 +0,0 @@
-# Reusable workflow: refreshes model profile data for any repo that uses the
-# `langchain-profiles` CLI. Creates (or updates) a pull request with the
-# resulting changes.
-#
-# Callers MUST set `permissions: { contents: write, pull-requests: write }` —
-# reusable workflows cannot escalate the caller's token permissions.
-#
-# ── Example: external repo (langchain-google) ──────────────────────────
-#
-#   jobs:
-#     refresh-profiles:
-#       uses: langchain-ai/langchain/.github/workflows/_refresh_model_profiles.yml@master
-#       with:
-#         providers: >-
-#           [
-#             {"provider":"google",        "data_dir":"libs/genai/langchain_google_genai/data"},
-#           ]
-#       secrets:
-#         MODEL_PROFILE_BOT_APP_ID:      ${{ secrets.MODEL_PROFILE_BOT_APP_ID }}
-#         MODEL_PROFILE_BOT_PRIVATE_KEY: ${{ secrets.MODEL_PROFILE_BOT_PRIVATE_KEY }}
-
-name: "Refresh Model Profiles (reusable)"
-
-on:
-  workflow_call:
-    inputs:
-      providers:
-        description: >-
-          JSON array of objects, each with `provider` (models.dev provider ID)
-          and `data_dir` (path relative to repo root where `_profiles.py` and
-          `profile_augmentations.toml` live).
-        required: true
-        type: string
-      cli-path:
-        description: >-
-          Path (relative to workspace) to an existing `libs/model-profiles`
-          checkout.  When set the workflow skips cloning the langchain repo and
-          uses this directory for the CLI instead.  Useful when the caller IS
-          the langchain monorepo.
-        required: false
-        type: string
-        default: ""
-      cli-ref:
-        description: >-
-          Git ref of langchain-ai/langchain to checkout for the CLI.
-          Ignored when `cli-path` is set.
-        required: false
-        type: string
-        default: master
-      add-paths:
-        description: "Glob for files to stage in the PR commit."
-        required: false
-        type: string
-        default: "**/_profiles.py"
-      pr-branch:
-        description: "Branch name for the auto-created PR."
-        required: false
-        type: string
-        default: bot/refresh-model-profiles
-      pr-title:
-        description: "PR / commit title."
-        required: false
-        type: string
-        default: "chore(model-profiles): refresh model profile data"
-      pr-body:
-        description: "PR body."
-        required: false
-        type: string
-        default: |
-          Automated refresh of model profile data via `langchain-profiles refresh`.
-
-          🤖 Generated by the `refresh_model_profiles` workflow.
-      pr-labels:
-        description: "Comma-separated labels to apply to the PR."
-        required: false
-        type: string
-        default: bot
-    secrets:
-      MODEL_PROFILE_BOT_APP_ID:
-        required: true
-      MODEL_PROFILE_BOT_PRIVATE_KEY:
-        required: true
-
-permissions:
-  contents: write
-  pull-requests: write
-
-jobs:
-  refresh-profiles:
-    name: refresh model profiles
-    runs-on: ubuntu-latest
-    steps:
-      - name: "📋 Checkout"
-        uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: "📋 Checkout langchain-profiles CLI"
-        if: inputs.cli-path == ''
-        uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-        with:
-          repository: langchain-ai/langchain
-          ref: ${{ inputs.cli-ref }}
-          sparse-checkout: libs/model-profiles
-          path: _langchain-cli
-
-      - name: "🔧 Resolve CLI directory"
-        id: cli
-        env:
-          CLI_PATH: ${{ inputs.cli-path }}
-        run: |
-          if [ -n "${CLI_PATH}" ]; then
-            resolved="${GITHUB_WORKSPACE}/${CLI_PATH}"
-            if [ ! -d "${resolved}" ]; then
-              echo "::error::cli-path '${CLI_PATH}' does not exist at ${resolved}"
-              exit 1
-            fi
-            echo "dir=${CLI_PATH}" >> "$GITHUB_OUTPUT"
-          else
-            echo "dir=_langchain-cli/libs/model-profiles" >> "$GITHUB_OUTPUT"
-          fi
-
-      - name: "🐍 Set up Python + uv"
-        uses: astral-sh/setup-uv@0ca8f610542aa7f4acaf39e65cf4eb3c35091883 # v7
-        with:
-          version: "0.5.25"
-          python-version: "3.12"
-          enable-cache: true
-          cache-dependency-glob: "**/model-profiles/uv.lock"
-
-      - name: "📦 Install langchain-profiles CLI"
-        working-directory: ${{ steps.cli.outputs.dir }}
-        run: uv sync --frozen --no-group test --no-group dev --no-group lint
-
-      - name: "✅ Validate providers input"
-        env:
-          PROVIDERS_JSON: ${{ inputs.providers }}
-        run: |
-          echo "${PROVIDERS_JSON}" | jq -e 'type == "array" and length > 0' > /dev/null || {
-            echo "::error::providers input must be a non-empty JSON array"
-            exit 1
-          }
-          echo "${PROVIDERS_JSON}" | jq -e 'all(has("provider") and has("data_dir"))' > /dev/null || {
-            echo "::error::every entry in providers must have 'provider' and 'data_dir' keys"
-            exit 1
-          }
-
-      - name: "🔄 Refresh profiles"
-        env:
-          PROVIDERS_JSON: ${{ inputs.providers }}
-        run: |
-          cli_dir="${GITHUB_WORKSPACE}/${{ steps.cli.outputs.dir }}"
-          failed=""
-          mapfile -t rows < <(echo "${PROVIDERS_JSON}" | jq -c '.[]')
-          for row in "${rows[@]}"; do
-            provider=$(echo "${row}" | jq -r '.provider')
-            data_dir=$(echo "${row}" | jq -r '.data_dir')
-            echo "--- Refreshing ${provider} -> ${data_dir} ---"
-            if ! echo y | uv run --frozen --project "${cli_dir}" \
-              langchain-profiles refresh \
-              --provider "${provider}" \
-              --data-dir "${GITHUB_WORKSPACE}/${data_dir}"; then
-              echo "::error::Failed to refresh provider: ${provider}"
-              failed="${failed} ${provider}"
-            fi
-          done
-          if [ -n "${failed}" ]; then
-            echo "::error::The following providers failed:${failed}"
-            exit 1
-          fi
-
-      - name: "🔑 Generate GitHub App token"
-        id: app-token
-        uses: actions/create-github-app-token@f8d387b68d61c58ab83c6c016672934102569859 # v3
-        with:
-          app-id: ${{ secrets.MODEL_PROFILE_BOT_APP_ID }}
-          private-key: ${{ secrets.MODEL_PROFILE_BOT_PRIVATE_KEY }}
-
-      - name: "🔀 Create pull request"
-        id: create-pr
-        uses: peter-evans/create-pull-request@c0f553fe549906ede9cf27b5156039d195d2ece0 # v8
-        with:
-          token: ${{ steps.app-token.outputs.token }}
-          branch: ${{ inputs.pr-branch }}
-          commit-message: ${{ inputs.pr-title }}
-          title: ${{ inputs.pr-title }}
-          body: ${{ inputs.pr-body }}
-          labels: ${{ inputs.pr-labels }}
-          add-paths: ${{ inputs.add-paths }}
-
-      - name: "📝 Summary"
-        if: always()
-        env:
-          PR_OP: ${{ steps.create-pr.outputs.pull-request-operation }}
-          PR_URL: ${{ steps.create-pr.outputs.pull-request-url }}
-          JOB_STATUS: ${{ job.status }}
-        run: |
-          if [ "${PR_OP}" = "created" ] || [ "${PR_OP}" = "updated" ]; then
-            echo "### ✅ PR ${PR_OP}: ${PR_URL}" >> "$GITHUB_STEP_SUMMARY"
-          elif [ -z "${PR_OP}" ] && [ "${JOB_STATUS}" = "success" ]; then
-            echo "### ⏭️ Skipped: profiles already up to date" >> "$GITHUB_STEP_SUMMARY"
-          elif [ "${JOB_STATUS}" = "failure" ]; then
-            echo "### ❌ Job failed — check step logs for details" >> "$GITHUB_STEP_SUMMARY"
-          fi
--- a/.github/workflows/_release.yml
+++ b/.github/workflows/_release.yml
@@ -1,11 +1,5 @@
-# Builds and publishes LangChain packages to PyPI.
-#
-# Manually triggered, though can be used as a reusable workflow (workflow_call).
-#
-# Handles version bumping, building, and publishing to PyPI with authentication.
+name: release

-name: "🚀 Package Release"
-run-name: "Release ${{ inputs.working-directory }} ${{ inputs.release-version }}"
 on:
  workflow_call:
    inputs:
@@ -13,216 +7,14 @@ on:
        required: true
        type: string
        description: "From which folder this pipeline executes"
-  workflow_dispatch:
-    inputs:
-      working-directory:
-        required: true
-        type: choice
-        description: "From which folder this pipeline executes"
-        default: "libs/langchain_v1"
-        options:
-          - libs/core
-          - libs/langchain
-          - libs/langchain_v1
-          - libs/text-splitters
-          - libs/standard-tests
-          - libs/model-profiles
-          - libs/partners/anthropic
-          - libs/partners/chroma
-          - libs/partners/deepseek
-          - libs/partners/exa
-          - libs/partners/fireworks
-          - libs/partners/groq
-          - libs/partners/huggingface
-          - libs/partners/mistralai
-          - libs/partners/nomic
-          - libs/partners/ollama
-          - libs/partners/openai
-          - libs/partners/openrouter
-          - libs/partners/perplexity
-          - libs/partners/qdrant
-          - libs/partners/xai
-      release-version:
-        required: true
-        type: string
-        default: "0.1.0"
-        description: "New version of package being released"
-      dangerous-nonmaster-release:
-        required: false
-        type: boolean
-        default: false
-        description: "Release from a non-master branch (danger!) - Only use for hotfixes"

 env:
-  PYTHON_VERSION: "3.11"
-  UV_FROZEN: "true"
-  UV_NO_SYNC: "true"
-
-permissions:
-  contents: read # Job-level overrides grant write only where needed (mark-release)
+  POETRY_VERSION: "1.6.1"

 jobs:
-  # Build the distribution package and extract version info
-  # Runs in isolated environment with minimal permissions for security
-  build:
-    if: github.ref == 'refs/heads/master' || inputs.dangerous-nonmaster-release
-    environment: Scheduled testing
-    runs-on: ubuntu-latest
-    permissions:
-      contents: read
-
-    outputs:
-      pkg-name: ${{ steps.check-version.outputs.pkg-name }}
-      version: ${{ steps.check-version.outputs.version }}
-
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: Set up Python + uv
-        uses: "./.github/actions/uv_setup"
-        with:
-          python-version: ${{ env.PYTHON_VERSION }}
-
-      # We want to keep this build stage *separate* from the release stage,
-      # so that there's no sharing of permissions between them.
-      # (Release stage has trusted publishing and GitHub repo contents write access,
-      # which the build stage must not have access to.)
-      #
-      # Otherwise, a malicious `build` step (e.g. via a compromised dependency)
-      # could get access to our GitHub or PyPI credentials.
-      #
-      # Per the trusted publishing GitHub Action:
-      # > It is strongly advised to separate jobs for building [...]
-      # > from the publish job.
-      # https://github.com/pypa/gh-action-pypi-publish#non-goals
-      - name: Build project for distribution
-        run: uv build
-        working-directory: ${{ inputs.working-directory }}
-
-      - name: Upload build
-        uses: actions/upload-artifact@bbbca2ddaa5d8feaa63e36b76fdaad77386f024f # v7
-        with:
-          name: dist
-          path: ${{ inputs.working-directory }}/dist/
-
-      - name: Check version
-        id: check-version
-        shell: python
-        working-directory: ${{ inputs.working-directory }}
-        run: |
-          import os
-          import tomllib
-          with open("pyproject.toml", "rb") as f:
-              data = tomllib.load(f)
-          pkg_name = data["project"]["name"]
-          version = data["project"]["version"]
-          with open(os.environ["GITHUB_OUTPUT"], "a") as f:
-              f.write(f"pkg-name={pkg_name}\n")
-              f.write(f"version={version}\n")
-  release-notes:
-    # release-notes must run before publishing because its check-tags step
-    # validates version/tag state — do not remove this dependency.
-    needs:
-      - build
-    runs-on: ubuntu-latest
-    permissions:
-      contents: read
-    outputs:
-      release-body: ${{ steps.generate-release-body.outputs.release-body }}
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-        with:
-          repository: langchain-ai/langchain
-          path: langchain
-          sparse-checkout: | # this only grabs files for relevant dir
-            ${{ inputs.working-directory }}
-          ref: ${{ github.ref }} # this scopes to just ref'd branch
-          fetch-depth: 0 # this fetches entire commit history
-      - name: Check tags
-        id: check-tags
-        shell: bash
-        working-directory: langchain/${{ inputs.working-directory }}
-        env:
-          PKG_NAME: ${{ needs.build.outputs.pkg-name }}
-          VERSION: ${{ needs.build.outputs.version }}
-        run: |
-          # Handle regular versions and pre-release versions differently
-          if [[ "$VERSION" == *"-"* ]]; then
-            # This is a pre-release version (contains a hyphen)
-            # Extract the base version without the pre-release suffix
-            BASE_VERSION=${VERSION%%-*}
-            # Look for the latest release of the same base version
-            REGEX="^$PKG_NAME==$BASE_VERSION\$"
-            PREV_TAG=$(git tag --sort=-creatordate | (grep -P "$REGEX" || true) | head -1)
-
-            # If no exact base version match, look for the latest release of any kind
-            if [ -z "$PREV_TAG" ]; then
-              REGEX="^$PKG_NAME==\\d+\\.\\d+\\.\\d+\$"
-              PREV_TAG=$(git tag --sort=-creatordate | (grep -P "$REGEX" || true) | head -1)
-            fi
-          else
-            # Regular version handling
-            PREV_TAG="$PKG_NAME==${VERSION%.*}.$(( ${VERSION##*.} - 1 ))"; [[ "${VERSION##*.}" -eq 0 ]] && PREV_TAG=""
-
-            # backup case if releasing e.g. 0.3.0, looks up last release
-            # note if last release (chronologically) was e.g. 0.1.47 it will get
-            # that instead of the last 0.2 release
-            if [ -z "$PREV_TAG" ]; then
-              REGEX="^$PKG_NAME==\\d+\\.\\d+\\.\\d+\$"
-              echo $REGEX
-              PREV_TAG=$(git tag --sort=-creatordate | (grep -P $REGEX || true) | head -1)
-            fi
-          fi
-
-          # if PREV_TAG is empty or came out to 0.0.0, let it be empty
-          if [ -z "$PREV_TAG" ] || [ "$PREV_TAG" = "$PKG_NAME==0.0.0" ]; then
-            echo "No previous tag found - first release"
-          else
-            # confirm prev-tag actually exists in git repo with git tag
-            GIT_TAG_RESULT=$(git tag -l "$PREV_TAG")
-            if [ -z "$GIT_TAG_RESULT" ]; then
-              echo "Previous tag $PREV_TAG not found in git repo"
-              exit 1
-            fi
-          fi
-
-
-          TAG="${PKG_NAME}==${VERSION}"
-          if [ "$TAG" == "$PREV_TAG" ]; then
-            echo "No new version to release"
-            exit 1
-          fi
-          echo tag="$TAG" >> $GITHUB_OUTPUT
-          echo prev-tag="$PREV_TAG" >> $GITHUB_OUTPUT
-      - name: Generate release body
-        id: generate-release-body
-        working-directory: langchain
-        env:
-          WORKING_DIR: ${{ inputs.working-directory }}
-          PKG_NAME: ${{ needs.build.outputs.pkg-name }}
-          TAG: ${{ steps.check-tags.outputs.tag }}
-          PREV_TAG: ${{ steps.check-tags.outputs.prev-tag }}
-        run: |
-          PREAMBLE="Changes since $PREV_TAG"
-          # if PREV_TAG is empty or 0.0.0, then we are releasing the first version
-          if [ -z "$PREV_TAG" ] || [ "$PREV_TAG" = "$PKG_NAME==0.0.0" ]; then
-            PREAMBLE="Initial release"
-            PREV_TAG=$(git rev-list --max-parents=0 HEAD)
-          fi
-          {
-            echo 'release-body<<EOF'
-            echo $PREAMBLE
-            echo
-            git log --format="%s" "$PREV_TAG"..HEAD -- $WORKING_DIR
-            echo EOF
-          } >> "$GITHUB_OUTPUT"
-
-  test-pypi-publish:
-    # release-notes must run before publishing because its check-tags step
-    # validates version/tag state — do not remove this dependency.
-    needs:
-      - build
-      - release-notes
+  if_release:
+    # Disallow publishing from branches that aren't `master`.
+    if: github.ref == 'refs/heads/master'
    runs-on: ubuntu-latest
    permissions:
      # This permission is used for trusted publishing:
@@ -232,413 +24,41 @@ jobs:
      # https://docs.pypi.org/trusted-publishers/adding-a-publisher/
      id-token: write

-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - uses: actions/download-artifact@3e5f45b2cfb9172054b4087a40e8e0b5a5461e7c # v8
-        with:
-          name: dist
-          path: ${{ inputs.working-directory }}/dist/
-
-      - name: Publish to test PyPI
-        uses: pypa/gh-action-pypi-publish@ed0c53931b1dc9bd32cbe73a98c7f6766f8a527e # release/v1
-        with:
-          packages-dir: ${{ inputs.working-directory }}/dist/
-          verbose: true
-          print-hash: true
-          repository-url: https://test.pypi.org/legacy/
-          # We overwrite any existing distributions with the same name and version.
-          # This is *only for CI use* and is *extremely dangerous* otherwise!
-          # https://github.com/pypa/gh-action-pypi-publish#tolerating-release-package-file-duplicates
-          skip-existing: true
-          # Temp workaround since attestations are on by default as of gh-action-pypi-publish v1.11.0
-          attestations: false
-
-  pre-release-checks:
-    needs:
-      - build
-      - release-notes
-      - test-pypi-publish
-    runs-on: ubuntu-latest
-    permissions:
-      contents: read
-    timeout-minutes: 20
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      # We explicitly *don't* set up caching here. This ensures our tests are
-      # maximally sensitive to catching breakage.
-      #
-      # For example, here's a way that caching can cause a falsely-passing test:
-      # - Make the langchain package manifest no longer list a dependency package
-      #   as a requirement. This means it won't be installed by `pip install`,
-      #   and attempting to use it would cause a crash.
-      # - That dependency used to be required, so it may have been cached.
-      #   When restoring the venv packages from cache, that dependency gets included.
-      # - Tests pass, because the dependency is present even though it wasn't specified.
-      # - The package is published, and it breaks on the missing dependency when
-      #   used in the real world.
-
-      - name: Set up Python + uv
-        uses: "./.github/actions/uv_setup"
-        id: setup-python
-        with:
-          python-version: ${{ env.PYTHON_VERSION }}
-
-      - uses: actions/download-artifact@3e5f45b2cfb9172054b4087a40e8e0b5a5461e7c # v8
-        with:
-          name: dist
-          path: ${{ inputs.working-directory }}/dist/
-
-      - name: Import dist package
-        shell: bash
-        working-directory: ${{ inputs.working-directory }}
-        env:
-          PKG_NAME: ${{ needs.build.outputs.pkg-name }}
-          VERSION: ${{ needs.build.outputs.version }}
-        # Install directly from the locally-built wheel (no index resolution needed)
-        run: |
-          uv venv
-          VIRTUAL_ENV=.venv uv pip install dist/*.whl
-
-          # Replace all dashes in the package name with underscores,
-          # since that's how Python imports packages with dashes in the name.
-          # also remove _official suffix
-          IMPORT_NAME="$(echo "$PKG_NAME" | sed s/-/_/g | sed s/_official//g)"
-
-          uv run python -c "import $IMPORT_NAME; print(dir($IMPORT_NAME))"
-
-      - name: Import test dependencies
-        run: uv sync --group test
-        working-directory: ${{ inputs.working-directory }}
-
-      # Overwrite the local version of the package with the built version
-      - name: Import published package (again)
-        working-directory: ${{ inputs.working-directory }}
-        shell: bash
-        env:
-          PKG_NAME: ${{ needs.build.outputs.pkg-name }}
-          VERSION: ${{ needs.build.outputs.version }}
-        run: |
-          VIRTUAL_ENV=.venv uv pip install dist/*.whl
-
-      - name: Check for prerelease versions
-        # Block release if any dependencies allow prerelease versions
-        # (unless this is itself a prerelease version)
-        working-directory: ${{ inputs.working-directory }}
-        run: |
-          uv run python $GITHUB_WORKSPACE/.github/scripts/check_prerelease_dependencies.py pyproject.toml
-
-      - name: Run unit tests
-        run: make tests
-        working-directory: ${{ inputs.working-directory }}
-
-      - name: Get minimum versions
-        # Find the minimum published versions that satisfies the given constraints
-        working-directory: ${{ inputs.working-directory }}
-        id: min-version
-        run: |
-          VIRTUAL_ENV=.venv uv pip install packaging requests
-          python_version="$(uv run python --version | awk '{print $2}')"
-          min_versions="$(uv run python $GITHUB_WORKSPACE/.github/scripts/get_min_versions.py pyproject.toml release $python_version)"
-          echo "min-versions=$min_versions" >> "$GITHUB_OUTPUT"
-          echo "min-versions=$min_versions"
-
-      - name: Run unit tests with minimum dependency versions
-        if: ${{ steps.min-version.outputs.min-versions != '' }}
-        env:
-          MIN_VERSIONS: ${{ steps.min-version.outputs.min-versions }}
-        run: |
-          VIRTUAL_ENV=.venv uv pip install --force-reinstall --editable .
-          VIRTUAL_ENV=.venv uv pip install --force-reinstall $MIN_VERSIONS
-          make tests
-        working-directory: ${{ inputs.working-directory }}
-
-      - name: Import integration test dependencies
-        run: uv sync --group test --group test_integration
-        working-directory: ${{ inputs.working-directory }}
-
-      - name: Run integration tests
-        # Uses the Makefile's `integration_tests` target for the specified package
-        if: ${{ startsWith(inputs.working-directory, 'libs/partners/') }}
-        env:
-          AI21_API_KEY: ${{ secrets.AI21_API_KEY }}
-          GOOGLE_API_KEY: ${{ secrets.GOOGLE_API_KEY }}
-          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
-          MISTRAL_API_KEY: ${{ secrets.MISTRAL_API_KEY }}
-          TOGETHER_API_KEY: ${{ secrets.TOGETHER_API_KEY }}
-          OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
-          AZURE_OPENAI_API_VERSION: ${{ secrets.AZURE_OPENAI_API_VERSION }}
-          AZURE_OPENAI_API_BASE: ${{ secrets.AZURE_OPENAI_API_BASE }}
-          AZURE_OPENAI_API_KEY: ${{ secrets.AZURE_OPENAI_API_KEY }}
-          AZURE_OPENAI_CHAT_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_CHAT_DEPLOYMENT_NAME }}
-          AZURE_OPENAI_LEGACY_CHAT_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_LEGACY_CHAT_DEPLOYMENT_NAME }}
-          AZURE_OPENAI_LLM_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_LLM_DEPLOYMENT_NAME }}
-          AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME }}
-          NVIDIA_API_KEY: ${{ secrets.NVIDIA_API_KEY }}
-          GOOGLE_SEARCH_API_KEY: ${{ secrets.GOOGLE_SEARCH_API_KEY }}
-          GOOGLE_CSE_ID: ${{ secrets.GOOGLE_CSE_ID }}
-          GROQ_API_KEY: ${{ secrets.GROQ_API_KEY }}
-          HUGGINGFACEHUB_API_TOKEN: ${{ secrets.HUGGINGFACEHUB_API_TOKEN }}
-          EXA_API_KEY: ${{ secrets.EXA_API_KEY }}
-          NOMIC_API_KEY: ${{ secrets.NOMIC_API_KEY }}
-          WATSONX_APIKEY: ${{ secrets.WATSONX_APIKEY }}
-          WATSONX_PROJECT_ID: ${{ secrets.WATSONX_PROJECT_ID }}
-          ASTRA_DB_API_ENDPOINT: ${{ secrets.ASTRA_DB_API_ENDPOINT }}
-          ASTRA_DB_APPLICATION_TOKEN: ${{ secrets.ASTRA_DB_APPLICATION_TOKEN }}
-          ASTRA_DB_KEYSPACE: ${{ secrets.ASTRA_DB_KEYSPACE }}
-          ES_URL: ${{ secrets.ES_URL }}
-          ES_CLOUD_ID: ${{ secrets.ES_CLOUD_ID }}
-          ES_API_KEY: ${{ secrets.ES_API_KEY }}
-          MONGODB_ATLAS_URI: ${{ secrets.MONGODB_ATLAS_URI }}
-          UPSTAGE_API_KEY: ${{ secrets.UPSTAGE_API_KEY }}
-          FIREWORKS_API_KEY: ${{ secrets.FIREWORKS_API_KEY }}
-          XAI_API_KEY: ${{ secrets.XAI_API_KEY }}
-          DEEPSEEK_API_KEY: ${{ secrets.DEEPSEEK_API_KEY }}
-          PPLX_API_KEY: ${{ secrets.PPLX_API_KEY }}
-          OLLAMA_API_KEY: ${{ secrets.OLLAMA_API_KEY }}
-          OPENROUTER_API_KEY: ${{ secrets.OPENROUTER_API_KEY }}
-          LANGCHAIN_TESTS_USER_AGENT: ${{ secrets.LANGCHAIN_TESTS_USER_AGENT }}
-        run: make integration_tests
-        working-directory: ${{ inputs.working-directory }}
-
-  # Test select published packages against new core
-  # Done when code changes are made to langchain-core
-  test-prior-published-packages-against-new-core:
-    # Installs the new, unreleased core alongside the previously published
-    # partner packages and runs their integration tests
-    needs:
-      - build
-      - release-notes
-      - test-pypi-publish
-      - pre-release-checks
-    runs-on: ubuntu-latest
-    permissions:
-      contents: read
-    if: false # temporarily skip
-    strategy:
-      matrix:
-        partner: [anthropic]
-      fail-fast: false # Continue testing other partners if one fails
-    env:
-      ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
-      ANTHROPIC_FILES_API_IMAGE_ID: ${{ secrets.ANTHROPIC_FILES_API_IMAGE_ID }}
-      ANTHROPIC_FILES_API_PDF_ID: ${{ secrets.ANTHROPIC_FILES_API_PDF_ID }}
-      OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
-      AZURE_OPENAI_API_VERSION: ${{ secrets.AZURE_OPENAI_API_VERSION }}
-      AZURE_OPENAI_API_BASE: ${{ secrets.AZURE_OPENAI_API_BASE }}
-      AZURE_OPENAI_API_KEY: ${{ secrets.AZURE_OPENAI_API_KEY }}
-      AZURE_OPENAI_CHAT_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_CHAT_DEPLOYMENT_NAME }}
-      AZURE_OPENAI_LEGACY_CHAT_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_LEGACY_CHAT_DEPLOYMENT_NAME }}
-      AZURE_OPENAI_LLM_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_LLM_DEPLOYMENT_NAME }}
-      AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME }}
-      LANGCHAIN_TESTS_USER_AGENT: ${{ secrets.LANGCHAIN_TESTS_USER_AGENT }}
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      # We implement this conditional as GitHub Actions does not have good support
-      # for conditionally needing steps. https://github.com/actions/runner/issues/491
-      # TODO: this seems to be resolved upstream, so we can probably remove this workaround
-      - name: Check if libs/core
-        run: |
-          if [ "${{ startsWith(inputs.working-directory, 'libs/core') }}" != "true" ]; then
-            echo "Not in libs/core. Exiting successfully."
-            exit 0
-          fi
-
-      - name: Set up Python + uv
-        if: startsWith(inputs.working-directory, 'libs/core')
-        uses: "./.github/actions/uv_setup"
-        with:
-          python-version: ${{ env.PYTHON_VERSION }}
-
-      - uses: actions/download-artifact@3e5f45b2cfb9172054b4087a40e8e0b5a5461e7c # v8
-        if: startsWith(inputs.working-directory, 'libs/core')
-        with:
-          name: dist
-          path: ${{ inputs.working-directory }}/dist/
-
-      - name: Test against ${{ matrix.partner }}
-        if: startsWith(inputs.working-directory, 'libs/core')
-        run: |
-          # Identify latest tag, excluding pre-releases
-          LATEST_PACKAGE_TAG="$(
-            git ls-remote --tags origin "langchain-${{ matrix.partner }}*" \
-            | awk '{print $2}' \
-            | sed 's|refs/tags/||' \
-            | grep -E '[0-9]+\.[0-9]+\.[0-9]+$' \
-            | sort -Vr \
-            | head -n 1
-          )"
-          echo "Latest package tag: $LATEST_PACKAGE_TAG"
-
-          # Shallow-fetch just that single tag
-          git fetch --depth=1 origin tag "$LATEST_PACKAGE_TAG"
-
-          # Checkout the latest package files
-          rm -rf $GITHUB_WORKSPACE/libs/partners/${{ matrix.partner }}/*
-          rm -rf $GITHUB_WORKSPACE/libs/standard-tests/*
-          cd $GITHUB_WORKSPACE/libs/
-          git checkout "$LATEST_PACKAGE_TAG" -- standard-tests/
-          git checkout "$LATEST_PACKAGE_TAG" -- partners/${{ matrix.partner }}/
-          cd partners/${{ matrix.partner }}
-
-          # Print as a sanity check
-          echo "Version number from pyproject.toml: "
-          cat pyproject.toml | grep "version = "
-
-          # Run tests
-          uv sync --group test --group test_integration
-          uv pip install ../../core/dist/*.whl
-          make integration_tests
-
-  # Test external packages that depend on langchain-core/langchain against the new release
-  # Only runs for core and langchain_v1 releases to catch breaking changes before publish
-  test-dependents:
-    name: "🐍 Python ${{ matrix.python-version }}: ${{ matrix.package.path }}"
-    needs:
-      - build
-      - release-notes
-      - test-pypi-publish
-      - pre-release-checks
-    runs-on: ubuntu-latest
-    permissions:
-      contents: read
-    # Only run for core or langchain_v1 releases
-    if: startsWith(inputs.working-directory, 'libs/core') || startsWith(inputs.working-directory, 'libs/langchain_v1')
-    strategy:
-      fail-fast: false
-      matrix:
-        python-version: ["3.11", "3.13"]
-        package:
-          - name: deepagents
-            repo: langchain-ai/deepagents
-            path: libs/deepagents
-    # No API keys needed for now - deepagents `make test` only runs unit tests
-
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-        with:
-          path: langchain
-
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-        with:
-          repository: ${{ matrix.package.repo }}
-          path: ${{ matrix.package.name }}
-
-      - name: Set up Python + uv
-        uses: "./langchain/.github/actions/uv_setup"
-        with:
-          python-version: ${{ matrix.python-version }}
-
-      - uses: actions/download-artifact@3e5f45b2cfb9172054b4087a40e8e0b5a5461e7c # v8
-        with:
-          name: dist
-          path: dist/
-
-      - name: Install ${{ matrix.package.name }} with local packages
-        # External dependents don't have [tool.uv.sources] pointing to this repo,
-        # so we install the package normally then override with the built wheel.
-        run: |
-          cd ${{ matrix.package.name }}/${{ matrix.package.path }}
-
-          # Install the package with test dependencies
-          uv sync --group test
-
-          # Override with the built wheel from this release
-          uv pip install $GITHUB_WORKSPACE/dist/*.whl
-
-      - name: Run ${{ matrix.package.name }} tests
-        run: |
-          cd ${{ matrix.package.name }}/${{ matrix.package.path }}
-          make test
-
-  publish:
-    # Publishes the package to PyPI
-    needs:
-      - build
-      - release-notes
-      - test-pypi-publish
-      - pre-release-checks
-      - test-dependents
-      # - test-prior-published-packages-against-new-core
-    # Run if all needed jobs succeeded or were skipped (test-dependents only runs for core/langchain_v1)
-    if: ${{ !cancelled() && !failure() }}
-    runs-on: ubuntu-latest
-    permissions:
-      # This permission is used for trusted publishing:
-      # https://blog.pypi.org/posts/2023-04-20-introducing-trusted-publishers/
-      #
-      # Trusted publishing has to also be configured on PyPI for each package:
-      # https://docs.pypi.org/trusted-publishers/adding-a-publisher/
-      id-token: write
-
-    defaults:
-      run:
-        working-directory: ${{ inputs.working-directory }}
-
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: Set up Python + uv
-        uses: "./.github/actions/uv_setup"
-        with:
-          python-version: ${{ env.PYTHON_VERSION }}
-
-      - uses: actions/download-artifact@3e5f45b2cfb9172054b4087a40e8e0b5a5461e7c # v8
-        with:
-          name: dist
-          path: ${{ inputs.working-directory }}/dist/
-
-      - name: Publish package distributions to PyPI
-        uses: pypa/gh-action-pypi-publish@ed0c53931b1dc9bd32cbe73a98c7f6766f8a527e # release/v1
-        with:
-          packages-dir: ${{ inputs.working-directory }}/dist/
-          verbose: true
-          print-hash: true
-          # Temp workaround since attestations are on by default as of gh-action-pypi-publish v1.11.0
-          attestations: false
-
-  mark-release:
-    # Marks the GitHub release with the new version tag
-    needs:
-      - build
-      - release-notes
-      - test-pypi-publish
-      - pre-release-checks
-      - publish
-    # Run if all needed jobs succeeded or were skipped
-    if: ${{ !cancelled() && !failure() }}
-    runs-on: ubuntu-latest
-    permissions:
-      # This permission is needed by `ncipollo/release-action` to
-      # create the GitHub release/tag
+      # This permission is needed by `ncipollo/release-action` to create the GitHub release.
      contents: write
-
    defaults:
      run:
        working-directory: ${{ inputs.working-directory }}
-
    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
+      - uses: actions/checkout@v3

-      - name: Set up Python + uv
-        uses: "./.github/actions/uv_setup"
+      - name: Set up Python + Poetry ${{ env.POETRY_VERSION }}
+        uses: "./.github/actions/poetry_setup"
        with:
-          python-version: ${{ env.PYTHON_VERSION }}
+          python-version: "3.10"
+          poetry-version: ${{ env.POETRY_VERSION }}
+          working-directory: ${{ inputs.working-directory }}
+          cache-key: release

-      - uses: actions/download-artifact@3e5f45b2cfb9172054b4087a40e8e0b5a5461e7c # v8
-        with:
-          name: dist
-          path: ${{ inputs.working-directory }}/dist/
-
-      - name: Create Tag
-        uses: ncipollo/release-action@339a81892b84b4eeb0f6e744e4574d79d0d9b8dd # v1
+      - name: Build project for distribution
+        run: poetry build
+      - name: Check Version
+        id: check-version
+        run: |
+          echo version=$(poetry version --short) >> $GITHUB_OUTPUT
+      - name: Create Release
+        uses: ncipollo/release-action@v1
+        if: ${{ inputs.working-directory == 'libs/langchain' }}
        with:
          artifacts: "dist/*"
          token: ${{ secrets.GITHUB_TOKEN }}
-          generateReleaseNotes: false
-          tag: ${{needs.build.outputs.pkg-name}}==${{ needs.build.outputs.version }}
-          body: ${{ needs.release-notes.outputs.release-body }}
-          commit: ${{ github.sha }}
-          makeLatest: ${{ needs.build.outputs.pkg-name == 'langchain-core'}}
+          draft: false
+          generateReleaseNotes: true
+          tag: v${{ steps.check-version.outputs.version }}
+          commit: master
+      - name: Publish package distributions to PyPI
+        uses: pypa/gh-action-pypi-publish@release/v1
+        with:
+          packages-dir: ${{ inputs.working-directory }}/dist/
+          verbose: true
+          print-hash: true
--- a/.github/workflows/_release_docker.yml
+++ b/.github/workflows/_release_docker.yml
@@ -0,0 +1,62 @@
+name: release_docker
+
+on:
+  workflow_call:
+    inputs:
+      dockerfile:
+        required: true
+        type: string
+        description: "Path to the Dockerfile to build"
+      image:
+        required: true
+        type: string
+        description: "Name of the image to build"
+
+env:
+  TEST_TAG: ${{ inputs.image }}:test
+  LATEST_TAG: ${{ inputs.image }}:latest
+
+jobs:
+  docker:
+    runs-on: ubuntu-latest
+    steps:
+      - name: Checkout
+        uses: actions/checkout@v4
+      - name: Get git tag
+        uses: actions-ecosystem/action-get-latest-tag@v1
+        id: get-latest-tag
+      - name: Set docker tag
+        env:
+          VERSION: ${{ steps.get-latest-tag.outputs.tag }}
+        run: |
+          echo "VERSION_TAG=${{ inputs.image }}:${VERSION#v}" >> $GITHUB_ENV
+      - name: Set up QEMU
+        uses: docker/setup-qemu-action@v3
+      - name: Set up Docker Buildx
+        uses: docker/setup-buildx-action@v3
+      - name: Login to Docker Hub
+        uses: docker/login-action@v3
+        with:
+          username: ${{ secrets.DOCKERHUB_USERNAME }}
+          password: ${{ secrets.DOCKERHUB_TOKEN }}
+      - name: Build for Test
+        uses: docker/build-push-action@v5
+        with:
+          context: .
+          file: ${{ inputs.dockerfile }}
+          load: true
+          tags: ${{ env.TEST_TAG }}
+      - name: Test
+        run: |
+          docker run --rm ${{ env.TEST_TAG }} python -c "import langchain"
+      - name: Build and Push to Docker Hub
+        uses: docker/build-push-action@v5
+        with:
+          context: .
+          file: ${{ inputs.dockerfile }}
+          # We can only build for the intersection of platforms supported by
+          # QEMU and the base Python image; for now, build only for
+          # linux/amd64 and linux/arm64.
+          platforms: linux/amd64,linux/arm64
+          tags: ${{ env.LATEST_TAG }},${{ env.VERSION_TAG }}
+          push: true
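
Note on the tag handling in `_release_docker.yml` above: the `Set docker tag` step strips a leading `v` from the latest git tag with `${VERSION#v}`, and the final step pushes both a `latest` tag and a version tag. A minimal Python sketch of that naming rule follows. The function name `docker_tags` and the image name in the usage note are hypothetical and are not part of the workflow.

```python
def docker_tags(image, git_tag):
    """Return the image tags the workflow would push for a given git tag.

    Mirrors the shell expansion ${VERSION#v}: only a single leading "v"
    is removed, and tags without that prefix are used unchanged.
    """
    version = git_tag[1:] if git_tag.startswith("v") else git_tag
    return [f"{image}:latest", f"{image}:{version}"]
```

For example, `docker_tags("example/image", "v0.0.300")` yields a `latest` tag plus `example/image:0.0.300`.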
--- a/.github/workflows/_test.yml
+++ b/.github/workflows/_test.yml
@@ -1,7 +1,4 @@
-# Runs unit tests with both current and minimum supported dependency versions
-# to ensure compatibility across the supported range.
-
-name: "🧪 Unit Testing"
+name: test

 on:
  workflow_call:
@@ -10,69 +7,45 @@ on:
        required: true
        type: string
        description: "From which folder this pipeline executes"
-      python-version:
-        required: true
-        type: string
-        description: "Python version to use"
-
-permissions:
-  contents: read

 env:
-  UV_FROZEN: "true"
-  UV_NO_SYNC: "true"
+  POETRY_VERSION: "1.6.1"

 jobs:
-  # Main test job - runs unit tests with current deps, then retests with minimum versions
  build:
    defaults:
      run:
        working-directory: ${{ inputs.working-directory }}
    runs-on: ubuntu-latest
-    timeout-minutes: 20
-    name: "Python ${{ inputs.python-version }}"
+    strategy:
+      matrix:
+        python-version:
+          - "3.8"
+          - "3.9"
+          - "3.10"
+          - "3.11"
+          - "3.12"
+    name: Python ${{ matrix.python-version }}
    steps:
-      - name: "📋 Checkout Code"
-        uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
+      - uses: actions/checkout@v3

-      - name: "🐍 Set up Python ${{ inputs.python-version }} + UV"
-        uses: "./.github/actions/uv_setup"
-        id: setup-python
+      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}
+        uses: "./.github/actions/poetry_setup"
        with:
-          python-version: ${{ inputs.python-version }}
-          cache-suffix: test-${{ inputs.working-directory }}
+          python-version: ${{ matrix.python-version }}
+          poetry-version: ${{ env.POETRY_VERSION }}
          working-directory: ${{ inputs.working-directory }}
+          cache-key: core

-      - name: "📦 Install Test Dependencies"
+      - name: Install dependencies
        shell: bash
-        run: uv sync --group test --dev
+        run: poetry install

-      - name: "🧪 Run Core Unit Tests"
+      - name: Run core tests
        shell: bash
-        run: |
-          make test PYTEST_EXTRA=-q
+        run: make test

-      - name: "🔍 Calculate Minimum Dependency Versions"
-        working-directory: ${{ inputs.working-directory }}
-        id: min-version
-        shell: bash
-        run: |
-          VIRTUAL_ENV=.venv uv pip install packaging tomli requests
-          python_version="$(uv run python --version | awk '{print $2}')"
-          min_versions="$(uv run python $GITHUB_WORKSPACE/.github/scripts/get_min_versions.py pyproject.toml pull_request $python_version)"
-          echo "min-versions=$min_versions" >> "$GITHUB_OUTPUT"
-          echo "min-versions=$min_versions"
-
-      - name: "🧪 Run Tests with Minimum Dependencies"
-        if: ${{ steps.min-version.outputs.min-versions != '' }}
-        env:
-          MIN_VERSIONS: ${{ steps.min-version.outputs.min-versions }}
-        run: |
-          VIRTUAL_ENV=.venv uv pip install $MIN_VERSIONS
-          make tests PYTEST_EXTRA=-q
-        working-directory: ${{ inputs.working-directory }}
-
-      - name: "🧹 Verify Clean Working Directory"
+      - name: Ensure the tests did not create any additional files
        shell: bash
        run: |
          set -eu
--- a/.github/workflows/_test_pydantic.yml
+++ b/.github/workflows/_test_pydantic.yml
@@ -1,73 +0,0 @@
-# Facilitate unit testing against different Pydantic versions for a provided package.
-
-name: "🐍 Pydantic Version Testing"
-
-on:
-  workflow_call:
-    inputs:
-      working-directory:
-        required: true
-        type: string
-        description: "From which folder this pipeline executes"
-      python-version:
-        required: false
-        type: string
-        description: "Python version to use"
-        default: "3.12"
-      pydantic-version:
-        required: true
-        type: string
-        description: "Pydantic version to test."
-
-permissions:
-  contents: read
-
-env:
-  UV_FROZEN: "true"
-  UV_NO_SYNC: "true"
-
-jobs:
-  build:
-    defaults:
-      run:
-        working-directory: ${{ inputs.working-directory }}
-    runs-on: ubuntu-latest
-    timeout-minutes: 20
-    name: "Pydantic ~=${{ inputs.pydantic-version }}"
-    steps:
-      - name: "📋 Checkout Code"
-        uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: "🐍 Set up Python ${{ inputs.python-version }} + UV"
-        uses: "./.github/actions/uv_setup"
-        with:
-          python-version: ${{ inputs.python-version }}
-          cache-suffix: test-pydantic-${{ inputs.working-directory }}
-          working-directory: ${{ inputs.working-directory }}
-
-      - name: "📦 Install Test Dependencies"
-        shell: bash
-        run: uv sync --group test
-
-      - name: "🔄 Install Specific Pydantic Version"
-        shell: bash
-        env:
-          PYDANTIC_VERSION: ${{ inputs.pydantic-version }}
-        run: VIRTUAL_ENV=.venv uv pip install "pydantic~=$PYDANTIC_VERSION"
-
-      - name: "🧪 Run Core Tests"
-        shell: bash
-        run: |
-          make test
-
-      - name: "🧹 Verify Clean Working Directory"
-        shell: bash
-        run: |
-          set -eu
-
-          STATUS="$(git status)"
-          echo "$STATUS"
-
-          # grep will exit non-zero if the target message isn't found,
-          # and `set -e` above will cause the step to fail.
-          echo "$STATUS" | grep 'nothing to commit, working tree clean'
--- a/.github/workflows/_test_vcr.yml
+++ b/.github/workflows/_test_vcr.yml
@@ -1,66 +0,0 @@
-# Runs VCR cassette-backed integration tests in playback-only mode.
-#
-# No API keys needed — catches stale cassettes caused by test input
-# changes without re-recording.
-#
-# Called as part of check_diffs.yml workflow.
-
-name: "📼 VCR Cassette Tests"
-
-on:
-  workflow_call:
-    inputs:
-      working-directory:
-        required: true
-        type: string
-        description: "From which folder this pipeline executes"
-      python-version:
-        required: true
-        type: string
-        description: "Python version to use"
-
-permissions:
-  contents: read
-
-env:
-  UV_FROZEN: "true"
-
-jobs:
-  build:
-    defaults:
-      run:
-        working-directory: ${{ inputs.working-directory }}
-    runs-on: ubuntu-latest
-    timeout-minutes: 20
-    name: "Python ${{ inputs.python-version }}"
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: "🐍 Set up Python ${{ inputs.python-version }} + UV"
-        uses: "./.github/actions/uv_setup"
-        with:
-          python-version: ${{ inputs.python-version }}
-          cache-suffix: test-vcr-${{ inputs.working-directory }}
-          working-directory: ${{ inputs.working-directory }}
-
-      - name: "📦 Install Test Dependencies"
-        shell: bash
-        run: uv sync --group test
-
-      - name: "📼 Run VCR Cassette Tests (playback-only)"
-        shell: bash
-        env:
-          OPENAI_API_KEY: sk-fake
-        run: make test_vcr
-
-      - name: "🧹 Verify Clean Working Directory"
-        shell: bash
-        run: |
-          set -eu
-
-          STATUS="$(git status)"
-          echo "$STATUS"
-
-          # grep will exit non-zero if the target message isn't found,
-          # and `set -e` above will cause the step to fail.
-          echo "$STATUS" | grep 'nothing to commit, working tree clean'
--- a/.github/workflows/auto-label-by-package.yml
+++ b/.github/workflows/auto-label-by-package.yml
@@ -1,115 +0,0 @@
-name: Auto Label Issues by Package
-
-on:
-  issues:
-    types: [opened, edited]
-
-permissions:
-  contents: read
-
-jobs:
-  label-by-package:
-    permissions:
-      issues: write
-    runs-on: ubuntu-latest
-
-    steps:
-      - name: Sync package labels
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          script: |
-            const body = context.payload.issue.body || "";
-
-            // Extract text under "## Package" or "### Package" (handles " (Required)" suffix and being last section)
-            const match = body.match(/#{2,3} Package[^\n]*\n([\s\S]*?)(?:\n#{2,3} |$)/i);
-            if (!match) {
-              core.setFailed(
-                `Could not find "## Package" section in issue #${context.issue.number} body. ` +
-                `The issue template may have changed — update the regex in this workflow.`
-              );
-              return;
-            }
-
-            const packageSection = match[1].trim();
-
-            // Mapping table for package names to labels
-            const mapping = {
-              "langchain": "langchain",
-              "langchain-openai": "openai",
-              "langchain-anthropic": "anthropic",
-              "langchain-classic": "langchain-classic",
-              "langchain-core": "core",
-              "langchain-model-profiles": "model-profiles",
-              "langchain-tests": "standard-tests",
-              "langchain-text-splitters": "text-splitters",
-              "langchain-chroma": "chroma",
-              "langchain-deepseek": "deepseek",
-              "langchain-exa": "exa",
-              "langchain-fireworks": "fireworks",
-              "langchain-groq": "groq",
-              "langchain-huggingface": "huggingface",
-              "langchain-mistralai": "mistralai",
-              "langchain-nomic": "nomic",
-              "langchain-ollama": "ollama",
-              "langchain-openrouter": "openrouter",
-              "langchain-perplexity": "perplexity",
-              "langchain-qdrant": "qdrant",
-              "langchain-xai": "xai",
-            };
-
-            // All possible package labels we manage
-            const allPackageLabels = Object.values(mapping);
-            const selectedLabels = [];
-
-            // Check if this is checkbox format (multiple selection)
-            const checkboxMatches = packageSection.match(/- \[x\]\s+([^\n\r]+)/gi);
-            if (checkboxMatches) {
-              // Handle checkbox format
-              for (const match of checkboxMatches) {
-                const packageName = match.replace(/- \[x\]\s+/i, '').trim();
-                const label = mapping[packageName];
-                if (label && !selectedLabels.includes(label)) {
-                  selectedLabels.push(label);
-                }
-              }
-            } else {
-              // Handle dropdown format (single selection)
-              const label = mapping[packageSection];
-              if (label) {
-                selectedLabels.push(label);
-              }
-            }
-
-            // Get current issue labels
-            const issue = await github.rest.issues.get({
-              owner: context.repo.owner,
-              repo: context.repo.repo,
-              issue_number: context.issue.number
-            });
-
-            const currentLabels = issue.data.labels.map(label => label.name);
-            const currentPackageLabels = currentLabels.filter(label => allPackageLabels.includes(label));
-
-            // Determine labels to add and remove
-            const labelsToAdd = selectedLabels.filter(label => !currentPackageLabels.includes(label));
-            const labelsToRemove = currentPackageLabels.filter(label => !selectedLabels.includes(label));
-
-            // Add new labels
-            if (labelsToAdd.length > 0) {
-              await github.rest.issues.addLabels({
-                owner: context.repo.owner,
-                repo: context.repo.repo,
-                issue_number: context.issue.number,
-                labels: labelsToAdd
-              });
-            }
-
-            // Remove old labels
-            for (const label of labelsToRemove) {
-              await github.rest.issues.removeLabel({
-                owner: context.repo.owner,
-                repo: context.repo.repo,
-                issue_number: context.issue.number,
-                name: label
-              });
-            }
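
Note on the label-sync logic in the removed `auto-label-by-package.yml` above: the script extracts the "Package" section of the issue body, maps the selected package names to labels, and then computes which managed labels to add and which to remove. The sketch below restates that logic in Python so it can be run locally without the GitHub API. It is an illustration under assumptions: the function names are hypothetical, the mapping is a small subset of the workflow's table, and `\Z` stands in for the end-of-string `$` of the JavaScript regex.

```python
import re

# Illustrative subset of the workflow's package-name -> label table.
PACKAGE_LABELS = {
    "langchain": "langchain",
    "langchain-core": "core",
    "langchain-openai": "openai",
    "langchain-anthropic": "anthropic",
}

# Text under "## Package" / "### Package", up to the next heading or end of body.
SECTION_RE = re.compile(
    r"#{2,3} Package[^\n]*\n([\s\S]*?)(?:\n#{2,3} |\Z)", re.IGNORECASE
)
# Checked checkbox entries such as "- [x] langchain-core".
CHECKBOX_RE = re.compile(r"- \[x\]\s+([^\n\r]+)", re.IGNORECASE)


def selected_labels(body):
    """Return the labels selected in the issue body, in order, without duplicates."""
    match = SECTION_RE.search(body)
    if match is None:
        raise ValueError('Could not find a "Package" section in the issue body')
    section = match.group(1).strip()

    checked = CHECKBOX_RE.findall(section)
    # Checkbox format allows several packages; dropdown format is a single name.
    names = [name.strip() for name in checked] if checked else [section]

    labels = []
    for name in names:
        label = PACKAGE_LABELS.get(name)
        if label and label not in labels:
            labels.append(label)
    return labels


def plan_label_changes(body, current_labels):
    """Return (labels_to_add, labels_to_remove), touching only managed labels."""
    managed = set(PACKAGE_LABELS.values())
    selected = selected_labels(body)
    current_managed = [label for label in current_labels if label in managed]
    to_add = [label for label in selected if label not in current_managed]
    to_remove = [label for label in current_managed if label not in selected]
    return to_add, to_remove
```

Labels outside the mapping (for example a `bug` label) are never removed, because only labels that appear as values in the table are treated as managed.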
--- a/.github/workflows/check_agents_sync.yml
+++ b/.github/workflows/check_agents_sync.yml
@@ -1,42 +0,0 @@
-# Ensures CLAUDE.md and AGENTS.md stay synchronized.
-#
-# These files contain the same development guidelines but are named differently
-# for compatibility with different AI coding assistants (Claude Code uses CLAUDE.md,
-# other tools may use AGENTS.md).
-
-name: "🔄 Check CLAUDE.md / AGENTS.md Sync"
-
-on:
-  push:
-    branches: [master]
-    paths:
-      - "CLAUDE.md"
-      - "AGENTS.md"
-  pull_request:
-    paths:
-      - "CLAUDE.md"
-      - "AGENTS.md"
-
-permissions:
-  contents: read
-
-jobs:
-  check-sync:
-    name: "verify files are identical"
-    runs-on: ubuntu-latest
-    steps:
-      - name: "📋 Checkout Code"
-        uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: "🔍 Check CLAUDE.md and AGENTS.md are in sync"
-        run: |
-          if ! diff -q CLAUDE.md AGENTS.md > /dev/null 2>&1; then
-            echo "❌ CLAUDE.md and AGENTS.md are out of sync!"
-            echo ""
-            echo "These files must contain identical content."
-            echo "Differences:"
-            echo ""
-            diff --color=always CLAUDE.md AGENTS.md || true
-            exit 1
-          fi
-          echo "✅ CLAUDE.md and AGENTS.md are in sync"
--- a/.github/workflows/check_core_versions.yml
+++ b/.github/workflows/check_core_versions.yml
@@ -1,67 +0,0 @@
-# Ensures version numbers in pyproject.toml and version.py stay in sync.
-#
-# (Prevents releases with mismatched version numbers)
-
-name: "🔍 Check Version Equality"
-
-on:
-  pull_request:
-    paths:
-      - "libs/core/pyproject.toml"
-      - "libs/core/langchain_core/version.py"
-      - "libs/partners/anthropic/pyproject.toml"
-      - "libs/partners/anthropic/langchain_anthropic/_version.py"
-
-permissions:
-  contents: read
-
-jobs:
-  check_version_equality:
-    runs-on: ubuntu-latest
-
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: "✅ Verify pyproject.toml & version.py Match"
-        run: |
-          # Check core versions
-          CORE_PYPROJECT_VERSION=$(grep -Po '(?<=^version = ")[^"]*' libs/core/pyproject.toml)
-          CORE_VERSION_PY_VERSION=$(grep -Po '(?<=^VERSION = ")[^"]*' libs/core/langchain_core/version.py)
-
-          # Compare core versions
-          if [ "$CORE_PYPROJECT_VERSION" != "$CORE_VERSION_PY_VERSION" ]; then
-            echo "langchain-core versions in pyproject.toml and version.py do not match!"
-            echo "pyproject.toml version: $CORE_PYPROJECT_VERSION"
-            echo "version.py version: $CORE_VERSION_PY_VERSION"
-            exit 1
-          else
-            echo "Core versions match: $CORE_PYPROJECT_VERSION"
-          fi
-
-          # Check langchain_v1 versions
-          LANGCHAIN_PYPROJECT_VERSION=$(grep -Po '(?<=^version = ")[^"]*' libs/langchain_v1/pyproject.toml)
-          LANGCHAIN_INIT_PY_VERSION=$(grep -Po '(?<=^__version__ = ")[^"]*' libs/langchain_v1/langchain/__init__.py)
-
-          # Compare langchain_v1 versions
-          if [ "$LANGCHAIN_PYPROJECT_VERSION" != "$LANGCHAIN_INIT_PY_VERSION" ]; then
-            echo "langchain_v1 versions in pyproject.toml and __init__.py do not match!"
-            echo "pyproject.toml version: $LANGCHAIN_PYPROJECT_VERSION"
-            echo "__init__.py version: $LANGCHAIN_INIT_PY_VERSION"
-            exit 1
-          else
-            echo "Langchain v1 versions match: $LANGCHAIN_PYPROJECT_VERSION"
-          fi
-
-          # Check langchain-anthropic versions
-          ANTHROPIC_PYPROJECT_VERSION=$(grep -Po '(?<=^version = ")[^"]*' libs/partners/anthropic/pyproject.toml)
-          ANTHROPIC_VERSION_PY_VERSION=$(grep -Po '(?<=^__version__ = ")[^"]*' libs/partners/anthropic/langchain_anthropic/_version.py)
-
-          # Compare langchain-anthropic versions
-          if [ "$ANTHROPIC_PYPROJECT_VERSION" != "$ANTHROPIC_VERSION_PY_VERSION" ]; then
-            echo "langchain-anthropic versions in pyproject.toml and _version.py do not match!"
-            echo "pyproject.toml version: $ANTHROPIC_PYPROJECT_VERSION"
-            echo "_version.py version: $ANTHROPIC_VERSION_PY_VERSION"
-            exit 1
-          else
-            echo "Langchain-anthropic versions match: $ANTHROPIC_PYPROJECT_VERSION"
-          fi
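
Note on the version check in the removed `check_core_versions.yml` above: each comparison reads a quoted value from the start of a line in `pyproject.toml` and from a version variable in a Python module, then fails if the two differ. A minimal Python sketch of the same comparison is shown below. The helper names are hypothetical, the functions take file contents as strings rather than paths, and treating a missing version as a mismatch is a choice made for this sketch.

```python
import re


def extract_version(text, variable):
    """Return the quoted value assigned to `variable` at the start of a line, or None."""
    pattern = rf'^{re.escape(variable)} = "([^"]*)"'
    match = re.search(pattern, text, re.MULTILINE)
    return match.group(1) if match else None


def versions_match(pyproject_text, module_text, module_variable):
    """Compare the `version` in pyproject.toml with a module-level version variable."""
    declared = extract_version(pyproject_text, "version")
    in_code = extract_version(module_text, module_variable)
    # A version missing from either file counts as a mismatch in this sketch.
    return declared is not None and declared == in_code
```

The `module_variable` argument covers the three spellings used by the workflow: `VERSION` for `langchain_core/version.py`, and `__version__` for the `langchain_v1` `__init__.py` and the `langchain_anthropic` `_version.py`.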
--- a/.github/workflows/check_diffs.yml
+++ b/.github/workflows/check_diffs.yml
@@ -1,230 +0,0 @@
-# Primary CI workflow.
-#
-# Only runs against packages that have changed files.
-#
-# Runs:
-# - Linting (_lint.yml)
-# - Unit Tests (_test.yml)
-# - Pydantic compatibility tests (_test_pydantic.yml)
-# - Integration test compilation checks (_compile_integration_test.yml)
-# - Extended test suites that require additional dependencies
-#
-# Reports status to GitHub checks and PR status.
-
-name: "🔧 CI"
-
-on:
-  push:
-    branches: [master]
-  pull_request:
-  merge_group:
-
-# Optimizes CI performance by canceling redundant workflow runs.
-# If another push to the same PR or branch happens while this workflow is still running,
-# cancel the earlier run in favor of the next run.
-#
-# There's no point in testing an outdated version of the code. GitHub only allows
-# a limited number of job runners to be active at the same time, so it's better to
-# cancel pointless jobs early so that more useful jobs can run sooner.
-concurrency:
-  group: ${{ github.workflow }}-${{ github.ref }}
-  cancel-in-progress: true
-
-permissions:
-  contents: read
-
-env:
-  UV_FROZEN: "true"
-  UV_NO_SYNC: "true"
-
-jobs:
-  # This job analyzes which files changed and creates a dynamic test matrix
-  # to only run tests/lints for the affected packages, improving CI efficiency
-  build:
-    name: "Detect Changes & Set Matrix"
-    runs-on: ubuntu-latest
-    if: ${{ !contains(github.event.pull_request.labels.*.name, 'ci-ignore') }}
-    steps:
-      - name: "📋 Checkout Code"
-        uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-      - name: "🐍 Setup Python 3.11"
-        uses: actions/setup-python@a309ff8b426b58ec0e2a45f0f869d46889d02405 # v6
-        with:
-          python-version: "3.11"
-      - name: "📂 Get Changed Files"
-        id: files
-        uses: Ana06/get-changed-files@25f79e676e7ea1868813e21465014798211fad8c # v2.3.0
-      - name: "🔍 Analyze Changed Files & Generate Build Matrix"
-        id: set-matrix
-        run: |
-          python -m pip install packaging requests
-          python .github/scripts/check_diff.py ${{ steps.files.outputs.all }} >> $GITHUB_OUTPUT
-    outputs:
-      lint: ${{ steps.set-matrix.outputs.lint }}
-      test: ${{ steps.set-matrix.outputs.test }}
-      extended-tests: ${{ steps.set-matrix.outputs.extended-tests }}
-      compile-integration-tests: ${{ steps.set-matrix.outputs.compile-integration-tests }}
-      dependencies: ${{ steps.set-matrix.outputs.dependencies }}
-      test-pydantic: ${{ steps.set-matrix.outputs.test-pydantic }}
-      vcr-tests: ${{ steps.set-matrix.outputs.vcr-tests }}
-  # Run linting only on packages that have changed files
-  lint:
-    needs: [build]
-    if: ${{ needs.build.outputs.lint != '[]' }}
-    strategy:
-      matrix:
-        job-configs: ${{ fromJson(needs.build.outputs.lint) }}
-      fail-fast: false
-    uses: ./.github/workflows/_lint.yml
-    with:
-      working-directory: ${{ matrix.job-configs.working-directory }}
-      python-version: ${{ matrix.job-configs.python-version }}
-    secrets: inherit
-
-  # Run unit tests only on packages that have changed files
-  test:
-    needs: [build]
-    if: ${{ needs.build.outputs.test != '[]' }}
-    strategy:
-      matrix:
-        job-configs: ${{ fromJson(needs.build.outputs.test) }}
-      fail-fast: false
-    uses: ./.github/workflows/_test.yml
-    with:
-      working-directory: ${{ matrix.job-configs.working-directory }}
-      python-version: ${{ matrix.job-configs.python-version }}
-    secrets: inherit
-
-  # Test compatibility with different Pydantic versions for affected packages
-  test-pydantic:
-    needs: [build]
-    if: ${{ needs.build.outputs.test-pydantic != '[]' }}
-    strategy:
-      matrix:
-        job-configs: ${{ fromJson(needs.build.outputs.test-pydantic) }}
-      fail-fast: false
-    uses: ./.github/workflows/_test_pydantic.yml
-    with:
-      working-directory: ${{ matrix.job-configs.working-directory }}
-      pydantic-version: ${{ matrix.job-configs.pydantic-version }}
-    secrets: inherit
-
-  # Verify integration tests compile without actually running them (faster feedback)
-  compile-integration-tests:
-    name: "Compile Integration Tests"
-    needs: [build]
-    if: ${{ needs.build.outputs.compile-integration-tests != '[]' }}
-    strategy:
-      matrix:
-        job-configs: ${{ fromJson(needs.build.outputs.compile-integration-tests) }}
-      fail-fast: false
-    uses: ./.github/workflows/_compile_integration_test.yml
-    with:
-      working-directory: ${{ matrix.job-configs.working-directory }}
-      python-version: ${{ matrix.job-configs.python-version }}
-    secrets: inherit
-
-  # Run VCR cassette-backed integration tests in playback-only mode (no API keys)
-  vcr-tests:
-    name: "VCR Cassette Tests"
-    needs: [build]
-    if: ${{ needs.build.outputs.vcr-tests != '[]' }}
-    strategy:
-      matrix:
-        job-configs: ${{ fromJson(needs.build.outputs.vcr-tests) }}
-      fail-fast: false
-    uses: ./.github/workflows/_test_vcr.yml
-    with:
-      working-directory: ${{ matrix.job-configs.working-directory }}
-      python-version: ${{ matrix.job-configs.python-version }}
-    secrets: inherit
-
-  # Run extended test suites that require additional dependencies
-  extended-tests:
-    name: "Extended Tests"
-    needs: [build]
-    if: ${{ needs.build.outputs.extended-tests != '[]' }}
-    strategy:
-      matrix:
-        # note different variable for extended test dirs
-        job-configs: ${{ fromJson(needs.build.outputs.extended-tests) }}
-      fail-fast: false
-    runs-on: ubuntu-latest
-    timeout-minutes: 20
-    defaults:
-      run:
-        working-directory: ${{ matrix.job-configs.working-directory }}
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: "🐍 Set up Python ${{ matrix.job-configs.python-version }} + UV"
-        uses: "./.github/actions/uv_setup"
-        with:
-          python-version: ${{ matrix.job-configs.python-version }}
-          cache-suffix: extended-tests-${{ matrix.job-configs.working-directory }}
-          working-directory: ${{ matrix.job-configs.working-directory }}
-
-      - name: "📦 Install Dependencies & Run Extended Tests"
-        shell: bash
-        run: |
-          echo "Running extended tests, installing dependencies with uv..."
-          uv venv
-          uv sync --group test
-          VIRTUAL_ENV=.venv uv pip install -r extended_testing_deps.txt
-          VIRTUAL_ENV=.venv make extended_tests
-
-      - name: "🧹 Verify Clean Working Directory"
-        shell: bash
-        run: |
-          set -eu
-
-          STATUS="$(git status)"
-          echo "$STATUS"
-
-          # grep will exit non-zero if the target message isn't found,
-          # and `set -e` above will cause the step to fail.
-          echo "$STATUS" | grep 'nothing to commit, working tree clean'
-
-  # Verify _release.yml dropdown options stay in sync with package directories
-  check-release-options:
-    name: "Validate Release Options"
-    runs-on: ubuntu-latest
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-      - name: "🐍 Setup Python 3.11"
-        uses: actions/setup-python@a309ff8b426b58ec0e2a45f0f869d46889d02405 # v6
-        with:
-          python-version: "3.11"
-      - name: "📦 Install Dependencies"
-        run: python -m pip install pyyaml pytest
-      - name: "🔍 Check release dropdown matches packages"
-        run: python -m pytest .github/scripts/test_release_options.py -v
-
-  # Final status check - ensures all required jobs passed before allowing merge
-  ci_success:
-    name: "✅ CI Success"
-    needs:
-      [
-        build,
-        lint,
-        test,
-        compile-integration-tests,
-        vcr-tests,
-        extended-tests,
-        test-pydantic,
-        check-release-options,
-      ]
-    if: |
-      always()
-    runs-on: ubuntu-latest
-    env:
-      JOBS_JSON: ${{ toJSON(needs) }}
-      RESULTS_JSON: ${{ toJSON(needs.*.result) }}
-      EXIT_CODE: ${{!contains(needs.*.result, 'failure') && !contains(needs.*.result, 'cancelled') && '0' || '1'}}
-    steps:
-      - name: "🎉 All Checks Passed"
-        run: |
-          echo $JOBS_JSON
-          echo $RESULTS_JSON
-          echo "Exiting with $EXIT_CODE"
-          exit $EXIT_CODE
--- a/.github/workflows/close_unchecked_issues.yml
+++ b/.github/workflows/close_unchecked_issues.yml
@@ -1,106 +0,0 @@
-# Auto-close issues that bypass or ignore the issue template checkboxes.
-#
-# GitHub issue forms enforce `required: true` checkboxes in the web UI,
-# but the API bypasses form validation entirely — bots/scripts can open
-# issues with every box unchecked or skip the template altogether.
-#
-# Rules:
-#   1. Checkboxes present, none checked → close
-#   2. No checkboxes at all → close unless author is an org member or bot
-#
-# Org membership check reuses the shared helper from pr-labeler.js and
-# the same GitHub App used by tag-external-issues.yml.
-
-name: Close Unchecked Issues
-
-on:
-  issues:
-    types: [opened]
-
-permissions:
-  contents: read
-
-concurrency:
-  group: ${{ github.workflow }}-${{ github.event.issue.number }}
-  cancel-in-progress: true
-
-jobs:
-  check-boxes:
-    runs-on: ubuntu-latest
-    permissions:
-      contents: read
-      issues: write
-
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: Generate GitHub App token
-        id: app-token
-        uses: actions/create-github-app-token@f8d387b68d61c58ab83c6c016672934102569859 # v3
-        with:
-          app-id: ${{ secrets.ORG_MEMBERSHIP_APP_ID }}
-          private-key: ${{ secrets.ORG_MEMBERSHIP_APP_PRIVATE_KEY }}
-
-      - name: Validate issue checkboxes
-        if: steps.app-token.outcome == 'success'
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          github-token: ${{ steps.app-token.outputs.token }}
-          script: |
-            const body = context.payload.issue.body ?? '';
-            const checked = (body.match(/- \[x\]/gi) || []).length;
-
-            if (checked > 0) {
-              console.log(`Found ${checked} checked checkbox(es) — OK`);
-              return;
-            }
-
-            const unchecked = (body.match(/- \[ \]/g) || []).length;
-
-            // No checkboxes at all — allow org members and bots, close everyone else
-            if (unchecked === 0) {
-              const { owner, repo } = context.repo;
-              const { h } = require('./.github/scripts/pr-labeler.js').loadAndInit(github, owner, repo, core);
-
-              const author = context.payload.sender.login;
-              const { isExternal } = await h.checkMembership(
-                author, context.payload.sender.type,
-              );
-
-              if (!isExternal) {
-                console.log(`No checkboxes, but ${author} is internal — OK`);
-                return;
-              }
-              console.log(`No checkboxes and ${author} is external — closing`);
-            } else {
-              console.log(`Found 0 checked and ${unchecked} unchecked checkbox(es) — closing`);
-            }
-
-            const { owner, repo } = context.repo;
-            const issue_number = context.payload.issue.number;
-
-            const reason = unchecked > 0
-              ? 'none of the required checkboxes were checked'
-              : 'no issue template was used';
-
-            // Close before commenting — a closed issue without a comment is
-            // less confusing than an open issue with a false "auto-closed" message
-            // if the second API call fails.
-            await github.rest.issues.update({
-              owner,
-              repo,
-              issue_number,
-              state: 'closed',
-              state_reason: 'not_planned',
-            });
-
-            await github.rest.issues.createComment({
-              owner,
-              repo,
-              issue_number,
-              body: [
-                `This issue was automatically closed because ${reason}.`,
-                '',
-                `Please use one of the [issue templates](https://github.com/${owner}/${repo}/issues/new/choose) and complete the checklist.`,
-              ].join('\n'),
-            });
--- a/.github/workflows/codespell.yml
+++ b/.github/workflows/codespell.yml
@@ -0,0 +1,36 @@
+---
+name: Codespell
+
+on:
+  push:
+    branches: [master]
+  pull_request:
+    branches: [master]
+
+permissions:
+  contents: read
+
+jobs:
+  codespell:
+    name: Check for spelling errors
+    runs-on: ubuntu-latest
+
+    steps:
+      - name: Checkout
+        uses: actions/checkout@v3
+
+      - name: Install Dependencies
+        run: |
+          pip install toml
+
+      - name: Extract Ignore Words List
+        run: |
+          # Use a Python script to extract the ignore words list from pyproject.toml
+          python .github/workflows/extract_ignored_words_list.py
+        id: extract_ignore_words
+
+      - name: Codespell
+        uses: codespell-project/actions-codespell@v2
+        with:
+          skip: guide_imports.json
+          ignore_words_list: ${{ steps.extract_ignore_words.outputs.ignore_words_list }}
--- a/.github/workflows/codspeed.yml
+++ b/.github/workflows/codspeed.yml
@@ -1,85 +0,0 @@
-# CodSpeed performance benchmarks.
-#
-# Runs benchmarks on changed packages and uploads results to CodSpeed.
-# Separated from the main CI workflow so that push-to-master baseline runs
-# are never cancelled by subsequent merges (cancel-in-progress is only
-# enabled for pull_request events).
-
-name: "⚡ CodSpeed"
-
-on:
-  push:
-    branches: [master]
-  pull_request:
-
-# On PRs, cancel stale runs when new commits are pushed.
-# On push-to-master, never cancel — these runs populate CodSpeed baselines.
-concurrency:
-  group: ${{ github.workflow }}-${{ github.event_name == 'push' && github.sha || github.ref }}
-  cancel-in-progress: ${{ github.event_name == 'pull_request' }}
-
-permissions:
-  contents: read
-
-env:
-  UV_FROZEN: "true"
-  UV_NO_SYNC: "true"
-
-jobs:
-  build:
-    name: "Detect Changes"
-    runs-on: ubuntu-latest
-    if: ${{ !contains(github.event.pull_request.labels.*.name, 'codspeed-ignore') }}
-    steps:
-      - name: "📋 Checkout Code"
-        uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-      - name: "🐍 Setup Python 3.11"
-        uses: actions/setup-python@a309ff8b426b58ec0e2a45f0f869d46889d02405 # v6
-        with:
-          python-version: "3.11"
-      - name: "📂 Get Changed Files"
-        id: files
-        uses: Ana06/get-changed-files@25f79e676e7ea1868813e21465014798211fad8c # v2.3.0
-      - name: "🔍 Analyze Changed Files"
-        id: set-matrix
-        run: |
-          python -m pip install packaging requests
-          python .github/scripts/check_diff.py ${{ steps.files.outputs.all }} >> $GITHUB_OUTPUT
-    outputs:
-      codspeed: ${{ steps.set-matrix.outputs.codspeed }}
-
-  benchmarks:
-    name: "⚡ CodSpeed Benchmarks"
-    needs: [build]
-    if: ${{ needs.build.outputs.codspeed != '[]' }}
-    runs-on: ubuntu-latest
-    strategy:
-      matrix:
-        job-configs: ${{ fromJson(needs.build.outputs.codspeed) }}
-      fail-fast: false
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: "📦 Install UV Package Manager"
-        uses: astral-sh/setup-uv@0ca8f610542aa7f4acaf39e65cf4eb3c35091883 # v7
-        with:
-          # Pinned to 3.13.11 to work around CodSpeed walltime segfault on 3.13.12+
-          # See: https://github.com/CodSpeedHQ/pytest-codspeed/issues/106
-          python-version: "3.13.11"
-
-      - name: "📦 Install Test Dependencies"
-        run: uv sync --group test
-        working-directory: ${{ matrix.job-configs.working-directory }}
-
-      - name: "⚡ Run Benchmarks: ${{ matrix.job-configs.working-directory }}"
-        uses: CodSpeedHQ/action@a50965600eafa04edcd6717761f55b77e52aafbd # v4
-        with:
-          token: ${{ secrets.CODSPEED_TOKEN }}
-          run: |
-            cd ${{ matrix.job-configs.working-directory }}
-            if [ "${{ matrix.job-configs.working-directory }}" = "libs/core" ]; then
-              uv run --no-sync pytest ./tests/benchmarks --codspeed
-            else
-              uv run --no-sync pytest ./tests/unit_tests/ -m benchmark --codspeed
-            fi
-          mode: ${{ matrix.job-configs.codspeed-mode }}
--- a/.github/workflows/doc_lint.yml
+++ b/.github/workflows/doc_lint.yml
@@ -0,0 +1,22 @@
+---
+name: Documentation Lint
+
+on:
+  push:
+    branches: [master]
+  pull_request:
+    branches: [master]
+
+jobs:
+  check:
+    runs-on: ubuntu-latest
+
+    steps:
+    - name: Checkout repository
+      uses: actions/checkout@v2
+
+    - name: Run import check
+      run: |
+        # We should not encourage imports directly from the main init file
+        # Except for hub
+        git grep 'from langchain import' docs/{extras,docs_skeleton,snippets} | grep -vE 'from langchain import (hub)' && exit 1 || exit 0
--- a/.github/workflows/extract_ignored_words_list.py
+++ b/.github/workflows/extract_ignored_words_list.py
@@ -0,0 +1,8 @@
+import toml
+
+pyproject_toml = toml.load("pyproject.toml")
+
+# Extract the ignore words list (adjust the key as per your TOML structure)
+ignore_words_list = pyproject_toml.get("tool", {}).get("codespell", {}).get("ignore-words-list")
+
+print(f"::set-output name=ignore_words_list::{ignore_words_list}")
--- a/.github/workflows/integration_tests.yml
+++ b/.github/workflows/integration_tests.yml
@@ -1,271 +0,0 @@
-# Routine integration tests against partner libraries with live API credentials.
-#
-# Uses `make integration_tests` within each library being tested.
-#
-# Runs daily with the option to trigger manually.
-
-name: "⏰ Integration Tests"
-run-name: "Run Integration Tests - ${{ inputs.working-directory-force || 'all libs' }} (Python ${{ inputs.python-version-force || '3.10, 3.13' }})"
-
-on:
-  workflow_dispatch:
-    inputs:
-      working-directory-force:
-        type: string
-        description: "From which folder this pipeline executes - defaults to all in matrix - example value: libs/partners/anthropic"
-      python-version-force:
-        type: string
-        description: "Python version to use - defaults to 3.10 and 3.13 in matrix - example value: 3.11"
-  schedule:
-    - cron: "0 13 * * *" # Runs daily at 1PM UTC (9AM EDT/6AM PDT)
-
-permissions:
-  contents: read
-
-env:
-  UV_FROZEN: "true"
-  DEFAULT_LIBS: >-
-    ["libs/partners/openai",
-    "libs/partners/anthropic",
-    "libs/partners/fireworks",
-    "libs/partners/groq",
-    "libs/partners/mistralai",
-    "libs/partners/xai",
-    "libs/partners/google-vertexai",
-    "libs/partners/google-genai",
-    "libs/partners/aws"]
-
-jobs:
-  # Generate dynamic test matrix based on input parameters or defaults
-  # Only runs on the main repo (for scheduled runs) or when manually triggered
-  compute-matrix:
-    # Defend against forks running scheduled jobs, but allow manual runs from forks
-    if: github.repository_owner == 'langchain-ai' || github.event_name != 'schedule'
-
-    runs-on: ubuntu-latest
-    name: "📋 Compute Test Matrix"
-    outputs:
-      matrix: ${{ steps.set-matrix.outputs.matrix }}
-      python-version-min-3-11: ${{ steps.set-matrix.outputs.python-version-min-3-11 }}
-    steps:
-      - name: "🔢 Generate Python & Library Matrix"
-        id: set-matrix
-        env:
-          DEFAULT_LIBS: ${{ env.DEFAULT_LIBS }}
-          WORKING_DIRECTORY_FORCE: ${{ github.event.inputs.working-directory-force || '' }}
-          PYTHON_VERSION_FORCE: ${{ github.event.inputs.python-version-force || '' }}
-        run: |
-          # echo "matrix=..." where matrix is a json formatted str with keys python-version and working-directory
-          # python-version should default to 3.10 and 3.13, but is overridden to [PYTHON_VERSION_FORCE] if set
-          # working-directory should default to DEFAULT_LIBS, but is overridden to [WORKING_DIRECTORY_FORCE] if set
-          python_version='["3.10", "3.13"]'
-          python_version_min_3_11='["3.11", "3.13"]'
-          working_directory="$DEFAULT_LIBS"
-          if [ -n "$PYTHON_VERSION_FORCE" ]; then
-            python_version="[\"$PYTHON_VERSION_FORCE\"]"
-            # Bound forced version to >= 3.11 for packages requiring it
-            if [ "$(echo "$PYTHON_VERSION_FORCE >= 3.11" | bc -l)" -eq 1 ]; then
-              python_version_min_3_11="[\"$PYTHON_VERSION_FORCE\"]"
-            else
-              python_version_min_3_11='["3.11"]'
-            fi
-          fi
-          if [ -n "$WORKING_DIRECTORY_FORCE" ]; then
-            working_directory="[\"$WORKING_DIRECTORY_FORCE\"]"
-          fi
-          matrix="{\"python-version\": $python_version, \"working-directory\": $working_directory}"
-          echo $matrix
-          echo "matrix=$matrix" >> $GITHUB_OUTPUT
-          echo "python-version-min-3-11=$python_version_min_3_11" >> $GITHUB_OUTPUT
-
-  # Run integration tests against partner libraries with live API credentials
-  integration-tests:
-    if: github.repository_owner == 'langchain-ai' || github.event_name != 'schedule'
-    name: "🐍 Python ${{ matrix.python-version }}: ${{ matrix.working-directory }}"
-    runs-on: ubuntu-latest
-    needs: [compute-matrix]
-    timeout-minutes: 30
-    strategy:
-      fail-fast: false
-      matrix:
-        python-version: ${{ fromJSON(needs.compute-matrix.outputs.matrix).python-version }}
-        working-directory: ${{ fromJSON(needs.compute-matrix.outputs.matrix).working-directory }}
-
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-        with:
-          path: langchain
-
-      # These libraries exist outside of the monorepo and need to be checked out separately
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-        with:
-          repository: langchain-ai/langchain-google
-          path: langchain-google
-      - name: "🔐 Authenticate to Google Cloud"
-        id: "auth"
-        uses: google-github-actions/auth@7c6bc770dae815cd3e89ee6cdf493a5fab2cc093 # v3
-        with:
-          credentials_json: "${{ secrets.GOOGLE_CREDENTIALS }}"
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-        with:
-          repository: langchain-ai/langchain-aws
-          path: langchain-aws
-      - name: "🔐 Configure AWS Credentials"
-        uses: aws-actions/configure-aws-credentials@8df5847569e6427dd6c4fb1cf565c83acfa8afa7 # v6
-        with:
-          aws-access-key-id: ${{ secrets.AWS_ACCESS_KEY_ID }}
-          aws-secret-access-key: ${{ secrets.AWS_SECRET_ACCESS_KEY }}
-          aws-region: ${{ secrets.AWS_REGION }}
-      - name: "📦 Organize External Libraries"
-        run: |
-          rm -rf \
-            langchain/libs/partners/google-genai \
-            langchain/libs/partners/google-vertexai
-          mv langchain-google/libs/genai langchain/libs/partners/google-genai
-          mv langchain-google/libs/vertexai langchain/libs/partners/google-vertexai
-          mv langchain-aws/libs/aws langchain/libs/partners/aws
-
-      - name: "🐍 Set up Python ${{ matrix.python-version }} + UV"
-        uses: "./langchain/.github/actions/uv_setup"
-        with:
-          python-version: ${{ matrix.python-version }}
-
-      - name: "📦 Install Dependencies"
-        # Partner packages use [tool.uv.sources] in their pyproject.toml to resolve
-        # langchain-core/langchain to local editable installs, so `uv sync` automatically
-        # tests against the versions from the current branch (not published releases).
-
-        # TODO: external google/aws don't have local resolution since they live in
-        # separate repos, so they pull `core`/`langchain_v1` from PyPI. We should update
-        # their dev groups to use git source dependencies pointing to the current
-        # branch's latest commit SHA to fully test against local langchain changes.
-        run: |
-          echo "Running scheduled tests, installing dependencies with uv..."
-          cd langchain/${{ matrix.working-directory }}
-          uv sync --group test --group test_integration
-
-      - name: "🚀 Run Integration Tests"
-        # WARNING: All secrets below are available to every matrix job regardless of
-        # which package is being tested. This is intentional for simplicity, but means
-        # any test file could technically access any key. Only use for trusted code.
-        env:
-          LANGCHAIN_TESTS_USER_AGENT: ${{ secrets.LANGCHAIN_TESTS_USER_AGENT }}
-
-          AI21_API_KEY: ${{ secrets.AI21_API_KEY }}
-          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
-          ANTHROPIC_FILES_API_IMAGE_ID: ${{ secrets.ANTHROPIC_FILES_API_IMAGE_ID }}
-          ANTHROPIC_FILES_API_PDF_ID: ${{ secrets.ANTHROPIC_FILES_API_PDF_ID }}
-          ASTRA_DB_API_ENDPOINT: ${{ secrets.ASTRA_DB_API_ENDPOINT }}
-          ASTRA_DB_APPLICATION_TOKEN: ${{ secrets.ASTRA_DB_APPLICATION_TOKEN }}
-          ASTRA_DB_KEYSPACE: ${{ secrets.ASTRA_DB_KEYSPACE }}
-          AZURE_OPENAI_API_VERSION: ${{ secrets.AZURE_OPENAI_API_VERSION }}
-          AZURE_OPENAI_API_BASE: ${{ secrets.AZURE_OPENAI_API_BASE }}
-          AZURE_OPENAI_API_KEY: ${{ secrets.AZURE_OPENAI_API_KEY }}
-          AZURE_OPENAI_CHAT_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_CHAT_DEPLOYMENT_NAME }}
-          AZURE_OPENAI_LEGACY_CHAT_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_LEGACY_CHAT_DEPLOYMENT_NAME }}
-          AZURE_OPENAI_LLM_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_LLM_DEPLOYMENT_NAME }}
-          AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME }}
-          COHERE_API_KEY: ${{ secrets.COHERE_API_KEY }}
-          DEEPSEEK_API_KEY: ${{ secrets.DEEPSEEK_API_KEY }}
-          ES_URL: ${{ secrets.ES_URL }}
-          ES_CLOUD_ID: ${{ secrets.ES_CLOUD_ID }}
-          ES_API_KEY: ${{ secrets.ES_API_KEY }}
-          EXA_API_KEY: ${{ secrets.EXA_API_KEY }}
-          FIREWORKS_API_KEY: ${{ secrets.FIREWORKS_API_KEY }}
-          GOOGLE_API_KEY: ${{ secrets.GOOGLE_API_KEY }}
-          GOOGLE_SEARCH_API_KEY: ${{ secrets.GOOGLE_SEARCH_API_KEY }}
-          GOOGLE_CSE_ID: ${{ secrets.GOOGLE_CSE_ID }}
-          GROQ_API_KEY: ${{ secrets.GROQ_API_KEY }}
-          HUGGINGFACEHUB_API_TOKEN: ${{ secrets.HUGGINGFACEHUB_API_TOKEN }}
-          MISTRAL_API_KEY: ${{ secrets.MISTRAL_API_KEY }}
-          MONGODB_ATLAS_URI: ${{ secrets.MONGODB_ATLAS_URI }}
-          NOMIC_API_KEY: ${{ secrets.NOMIC_API_KEY }}
-          NVIDIA_API_KEY: ${{ secrets.NVIDIA_API_KEY }}
-          OLLAMA_API_KEY: ${{ secrets.OLLAMA_API_KEY }}
-          OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
-          OPENROUTER_API_KEY: ${{ secrets.OPENROUTER_API_KEY }}
-          PPLX_API_KEY: ${{ secrets.PPLX_API_KEY }}
-          TOGETHER_API_KEY: ${{ secrets.TOGETHER_API_KEY }}
-          UPSTAGE_API_KEY: ${{ secrets.UPSTAGE_API_KEY }}
-          WATSONX_APIKEY: ${{ secrets.WATSONX_APIKEY }}
-          WATSONX_PROJECT_ID: ${{ secrets.WATSONX_PROJECT_ID }}
-          XAI_API_KEY: ${{ secrets.XAI_API_KEY }}
-        run: |
-          cd langchain/${{ matrix.working-directory }}
-          make integration_tests
-
-      - name: "🧹 Clean up External Libraries"
-        # Clean up external libraries to avoid affecting the following git status check
-        run: |
-          rm -rf \
-            langchain/libs/partners/google-genai \
-            langchain/libs/partners/google-vertexai \
-            langchain/libs/partners/aws
-
-      - name: "🧹 Verify Clean Working Directory"
-        working-directory: langchain
-        run: |
-          set -eu
-
-          STATUS="$(git status)"
-          echo "$STATUS"
-
-          # grep will exit non-zero if the target message isn't found,
-          # and `set -e` above will cause the step to fail.
-          echo "$STATUS" | grep 'nothing to commit, working tree clean'
-
-  # Test dependent packages against local packages to catch breaking changes
-  test-dependents:
-    # Defend against forks running scheduled jobs, but allow manual runs from forks
-    if: github.repository_owner == 'langchain-ai' || github.event_name != 'schedule'
-
-    name: "🐍 Python ${{ matrix.python-version }}: ${{ matrix.package.path }}"
-    runs-on: ubuntu-latest
-    needs: [compute-matrix]
-    timeout-minutes: 30
-    strategy:
-      fail-fast: false
-      matrix:
-        # deepagents requires Python >= 3.11, use bounded version from compute-matrix
-        python-version: ${{ fromJSON(needs.compute-matrix.outputs.python-version-min-3-11) }}
-        package:
-          - name: deepagents
-            repo: langchain-ai/deepagents
-            path: libs/deepagents
-
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-        with:
-          path: langchain
-
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-        with:
-          repository: ${{ matrix.package.repo }}
-          path: ${{ matrix.package.name }}
-
-      - name: "🐍 Set up Python ${{ matrix.python-version }} + UV"
-        uses: "./langchain/.github/actions/uv_setup"
-        with:
-          python-version: ${{ matrix.python-version }}
-
-      - name: "📦 Install ${{ matrix.package.name }} with Local"
-        # Unlike partner packages (which use [tool.uv.sources] for local resolution),
-        # external dependents live in separate repos and need explicit overrides to
-        # test against the langchain versions from the current branch, as their
-        # pyproject.toml files point to released versions.
-        run: |
-          cd ${{ matrix.package.name }}/${{ matrix.package.path }}
-
-          # Install the package with test dependencies
-          uv sync --group test
-
-          # Override langchain packages with local versions
-          uv pip install \
-            -e $GITHUB_WORKSPACE/langchain/libs/core \
-            -e $GITHUB_WORKSPACE/langchain/libs/langchain_v1
-
-      # No API keys needed for now - deepagents `make test` only runs unit tests
-      - name: "🚀 Run ${{ matrix.package.name }} Tests"
-        run: |
-          cd ${{ matrix.package.name }}/${{ matrix.package.path }}
-          make test
--- a/.github/workflows/langchain_ci.yml
+++ b/.github/workflows/langchain_ci.yml
@@ -0,0 +1,98 @@
+---
+name: libs/langchain CI
+
+on:
+  push:
+    branches: [ master ]
+  pull_request:
+    paths:
+      - '.github/actions/poetry_setup/action.yml'
+      - '.github/tools/**'
+      - '.github/workflows/_lint.yml'
+      - '.github/workflows/_test.yml'
+      - '.github/workflows/_pydantic_compatibility.yml'
+      - '.github/workflows/langchain_ci.yml'
+      - 'libs/langchain/**'
+  workflow_dispatch:  # Allows triggering the workflow manually in the GitHub UI
+
+# If another push to the same PR or branch happens while this workflow is still running,
+# cancel the earlier run in favor of the next run.
+#
+# There's no point in testing an outdated version of the code. GitHub only allows
+# a limited number of job runners to be active at the same time, so it's better to cancel
+# pointless jobs early so that more useful jobs can run sooner.
+concurrency:
+  group: ${{ github.workflow }}-${{ github.ref }}
+  cancel-in-progress: true
+
+env:
+  POETRY_VERSION: "1.6.1"
+  WORKDIR: "libs/langchain"
+
+jobs:
+  lint:
+    uses:
+      ./.github/workflows/_lint.yml
+    with:
+      working-directory: libs/langchain
+    secrets: inherit
+
+  test:
+    uses:
+      ./.github/workflows/_test.yml
+    with:
+      working-directory: libs/langchain
+    secrets: inherit
+
+  pydantic-compatibility:
+    uses:
+      ./.github/workflows/_pydantic_compatibility.yml
+    with:
+      working-directory: libs/langchain
+    secrets: inherit
+
+  extended-tests:
+    runs-on: ubuntu-latest
+    defaults:
+      run:
+        working-directory: ${{ env.WORKDIR }}
+    strategy:
+      matrix:
+        python-version:
+          - "3.8"
+          - "3.9"
+          - "3.10"
+          - "3.11"
+          - "3.12"
+    name: Python ${{ matrix.python-version }} extended tests
+    steps:
+      - uses: actions/checkout@v3
+
+      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}
+        uses: "./.github/actions/poetry_setup"
+        with:
+          python-version: ${{ matrix.python-version }}
+          poetry-version: ${{ env.POETRY_VERSION }}
+          working-directory: libs/langchain
+          cache-key: extended
+
+      - name: Install dependencies
+        shell: bash
+        run: |
+          echo "Running extended tests, installing dependencies with poetry..."
+          poetry install -E extended_testing
+
+      - name: Run extended tests
+        run: make extended_tests
+
+      - name: Ensure the tests did not create any additional files
+        shell: bash
+        run: |
+          set -eu
+
+          STATUS="$(git status)"
+          echo "$STATUS"
+
+          # grep will exit non-zero if the target message isn't found,
+          # and `set -e` above will cause the step to fail.
+          echo "$STATUS" | grep 'nothing to commit, working tree clean'
--- a/.github/workflows/langchain_experimental_ci.yml
+++ b/.github/workflows/langchain_experimental_ci.yml
@@ -0,0 +1,131 @@
+---
+name: libs/experimental CI
+
+on:
+  push:
+    branches: [ master ]
+  pull_request:
+    paths:
+      - '.github/actions/poetry_setup/action.yml'
+      - '.github/tools/**'
+      - '.github/workflows/_lint.yml'
+      - '.github/workflows/_test.yml'
+      - '.github/workflows/langchain_experimental_ci.yml'
+      - 'libs/langchain/**'
+      - 'libs/experimental/**'
+  workflow_dispatch:  # Allows triggering the workflow manually from the GitHub UI
+
+# If another push to the same PR or branch happens while this workflow is still running,
+# cancel the earlier run in favor of the next run.
+#
+# There's no point in testing an outdated version of the code. GitHub only allows
+# a limited number of job runners to be active at the same time, so it's better to cancel
+# pointless jobs early so that more useful jobs can run sooner.
+concurrency:
+  group: ${{ github.workflow }}-${{ github.ref }}
+  cancel-in-progress: true
+
+env:
+  POETRY_VERSION: "1.6.1"
+  WORKDIR: "libs/experimental"
+
+jobs:
+  lint:
+    uses:
+      ./.github/workflows/_lint.yml
+    with:
+      working-directory: libs/experimental
+    secrets: inherit
+
+  test:
+    uses:
+      ./.github/workflows/_test.yml
+    with:
+      working-directory: libs/experimental
+    secrets: inherit
+
+  # It's possible that langchain-experimental works fine with the latest *published* langchain,
+  # but is broken with the langchain on `master`.
+  #
+  # We want to catch situations like that *before* releasing a new langchain, hence this test.
+  test-with-latest-langchain:
+    runs-on: ubuntu-latest
+    defaults:
+      run:
+        working-directory: ${{ env.WORKDIR }}
+    strategy:
+      matrix:
+        python-version:
+          - "3.8"
+          - "3.9"
+          - "3.10"
+          - "3.11"
+          - "3.12"
+    name: test with unpublished langchain - Python ${{ matrix.python-version }}
+    steps:
+      - uses: actions/checkout@v3
+
+      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}
+        uses: "./.github/actions/poetry_setup"
+        with:
+          python-version: ${{ matrix.python-version }}
+          poetry-version: ${{ env.POETRY_VERSION }}
+          working-directory: ${{ env.WORKDIR }}
+          cache-key: unpublished-langchain
+
+      - name: Install dependencies
+        shell: bash
+        run: |
+          echo "Running tests with unpublished langchain, installing dependencies with poetry..."
+          poetry install
+
+          echo "Editably installing langchain outside of poetry, to avoid messing up lockfile..."
+          poetry run pip install -e ../langchain
+
+      - name: Run tests
+        run: make test
+  extended-tests:
+    runs-on: ubuntu-latest
+    defaults:
+      run:
+        working-directory: ${{ env.WORKDIR }}
+    strategy:
+      matrix:
+        python-version:
+          - "3.8"
+          - "3.9"
+          - "3.10"
+          - "3.11"
+          - "3.12"
+    name: Python ${{ matrix.python-version }} extended tests
+    steps:
+      - uses: actions/checkout@v3
+
+      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}
+        uses: "./.github/actions/poetry_setup"
+        with:
+          python-version: ${{ matrix.python-version }}
+          poetry-version: ${{ env.POETRY_VERSION }}
+          working-directory: libs/experimental
+          cache-key: extended
+
+      - name: Install dependencies
+        shell: bash
+        run: |
+          echo "Running extended tests, installing dependencies with poetry..."
+          poetry install -E extended_testing
+
+      - name: Run extended tests
+        run: make extended_tests
+
+      - name: Ensure the tests did not create any additional files
+        shell: bash
+        run: |
+          set -eu
+
+          STATUS="$(git status)"
+          echo "$STATUS"
+
+          # grep will exit non-zero if the target message isn't found,
+          # and `set -e` above will cause the step to fail.
+          echo "$STATUS" | grep 'nothing to commit, working tree clean'
--- a/.github/workflows/langchain_experimental_release.yml
+++ b/.github/workflows/langchain_experimental_release.yml
@@ -0,0 +1,13 @@
+---
+name: libs/experimental Release
+
+on:
+  workflow_dispatch:  # Allows triggering the workflow manually from the GitHub UI
+
+jobs:
+  release:
+    uses:
+      ./.github/workflows/_release.yml
+    with:
+      working-directory: libs/experimental
+    secrets: inherit
--- a/.github/workflows/langchain_release.yml
+++ b/.github/workflows/langchain_release.yml
@@ -0,0 +1,26 @@
+---
+name: libs/langchain Release
+
+on:
+  workflow_dispatch:  # Allows triggering the workflow manually from the GitHub UI
+
+jobs:
+  release:
+    uses:
+      ./.github/workflows/_release.yml
+    with:
+      working-directory: libs/langchain
+    secrets: inherit
+
+  # N.B.: It's possible that PyPI doesn't make the new release visible / available
+  #       immediately after publishing. If that happens, the docker build might not
+  #       create a new docker image for the new release, since it won't see it.
+  #
+  #       If this ends up being a problem, add a check to the end of the `_release.yml`
+  #       workflow that prevents the workflow from finishing until the new release
+  #       is visible and installable on PyPI.
+  release-docker:
+    needs:
+      - release
+    uses:
+      ./.github/workflows/langchain_release_docker.yml
--- a/.github/workflows/langchain_release_docker.yml
+++ b/.github/workflows/langchain_release_docker.yml
@@ -0,0 +1,14 @@
+---
+name: docker/langchain/langchain Release
+
+on:
+  workflow_dispatch: # Allows triggering the workflow manually from the GitHub UI
+  workflow_call: # Allows triggering from another workflow
+
+jobs:
+  release:
+    uses: ./.github/workflows/_release_docker.yml
+    with:
+      dockerfile: docker/Dockerfile.base
+      image: langchain/langchain
+    secrets: inherit
--- a/.github/workflows/pr_labeler.yml
+++ b/.github/workflows/pr_labeler.yml
@@ -1,213 +0,0 @@
-# Unified PR labeler — applies size, file-based, title-based, and
-# contributor classification labels in a single sequential workflow.
-#
-# Consolidates pr_labeler_file.yml, pr_labeler_title.yml,
-# pr_size_labeler.yml, and PR-handling from tag-external-contributions.yml
-# into one workflow to eliminate race conditions from concurrent label
-# mutations. tag-external-issues.yml remains active for issue-only
-# labeling. Backfill lives in pr_labeler_backfill.yml.
-#
-# Config and shared logic live in .github/scripts/pr-labeler-config.json
-# and .github/scripts/pr-labeler.js — update those when adding partners.
-#
-# Setup Requirements:
-# 1. Create a GitHub App with permissions:
-#    - Repository: Pull requests (write)
-#    - Repository: Issues (write)
-#    - Organization: Members (read)
-# 2. Install the app on your organization and this repository
-# 3. Add these repository secrets:
-#    - ORG_MEMBERSHIP_APP_ID: Your app's ID
-#    - ORG_MEMBERSHIP_APP_PRIVATE_KEY: Your app's private key
-#
-# The GitHub App token is required to check private organization membership
-# and to propagate label events to downstream workflows.
-
-name: "🏷️ PR Labeler"
-
-on:
-  # Safe since we're not checking out or running the PR's code.
-  # NEVER CHECK OUT UNTRUSTED CODE FROM A PR's HEAD IN A pull_request_target JOB.
-  # Doing so would allow attackers to execute arbitrary code in the context of your repository.
-  pull_request_target:
-    types: [opened, synchronize, reopened, edited]
-
-permissions:
-  contents: read
-
-concurrency:
-  # Separate opened events so external/tier labels are never lost to cancellation
-  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.run_id }}-${{ github.event.action == 'opened' && 'opened' || 'update' }}
-  cancel-in-progress: ${{ github.event.action != 'opened' }}
-
-jobs:
-  label:
-    runs-on: ubuntu-latest
-    permissions:
-      contents: read
-      pull-requests: write
-      issues: write
-
-    steps:
-      # Checks out the BASE branch (safe for pull_request_target — never
-      # the PR head). Needed to load .github/scripts/pr-labeler*.
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: Generate GitHub App token
-        if: github.event.action == 'opened'
-        id: app-token
-        uses: actions/create-github-app-token@f8d387b68d61c58ab83c6c016672934102569859 # v3
-        with:
-          app-id: ${{ secrets.ORG_MEMBERSHIP_APP_ID }}
-          private-key: ${{ secrets.ORG_MEMBERSHIP_APP_PRIVATE_KEY }}
-
-      - name: Verify App token
-        if: github.event.action == 'opened'
-        run: |
-          if [ -z "${{ steps.app-token.outputs.token }}" ]; then
-            echo "::error::GitHub App token generation failed — cannot classify contributor"
-            exit 1
-          fi
-
-      - name: Check org membership
-        if: github.event.action == 'opened'
-        id: check-membership
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          github-token: ${{ steps.app-token.outputs.token }}
-          script: |
-            const { owner, repo } = context.repo;
-            const { h } = require('./.github/scripts/pr-labeler.js').loadAndInit(github, owner, repo, core);
-
-            const author = context.payload.sender.login;
-            const { isExternal } = await h.checkMembership(
-              author, context.payload.sender.type,
-            );
-            core.setOutput('is-external', isExternal ? 'true' : 'false');
-
-      - name: Apply PR labels
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        env:
-          IS_EXTERNAL: ${{ steps.check-membership.outputs.is-external }}
-        with:
-          github-token: ${{ secrets.GITHUB_TOKEN }}
-          script: |
-            const { owner, repo } = context.repo;
-            const { h } = require('./.github/scripts/pr-labeler.js').loadAndInit(github, owner, repo, core);
-
-            const pr = context.payload.pull_request;
-            if (!pr) return;
-            const prNumber = pr.number;
-            const action = context.payload.action;
-
-            const toAdd = new Set();
-            const toRemove = new Set();
-
-            const currentLabels = (await github.paginate(
-              github.rest.issues.listLabelsOnIssue,
-              { owner, repo, issue_number: prNumber, per_page: 100 },
-            )).map(l => l.name ?? '');
-
-            // ── Size + file labels (skip on 'edited' — files unchanged) ──
-            if (action !== 'edited') {
-              for (const sl of h.sizeLabels) await h.ensureLabel(sl);
-
-              const files = await github.paginate(github.rest.pulls.listFiles, {
-                owner, repo, pull_number: prNumber, per_page: 100,
-              });
-
-              const { totalChanged, sizeLabel } = h.computeSize(files);
-              toAdd.add(sizeLabel);
-              for (const sl of h.sizeLabels) {
-                if (currentLabels.includes(sl) && sl !== sizeLabel) toRemove.add(sl);
-              }
-              console.log(`Size: ${totalChanged} changed lines → ${sizeLabel}`);
-
-              for (const label of h.matchFileLabels(files)) {
-                toAdd.add(label);
-              }
-            }
-
-            // ── Title-based labels ──
-            const { labels: titleLabels, typeLabel } = h.matchTitleLabels(pr.title || '');
-            for (const label of titleLabels) toAdd.add(label);
-
-            // Remove stale type labels only when a type was detected
-            if (typeLabel) {
-              for (const tl of h.allTypeLabels) {
-                if (currentLabels.includes(tl) && !titleLabels.has(tl)) toRemove.add(tl);
-              }
-            }
-
-            // ── Internal label (only on open, non-external contributors) ──
-            // IS_EXTERNAL is empty string on non-opened events (step didn't
-            // run), so this guard is only true for opened + internal.
-            if (action === 'opened' && process.env.IS_EXTERNAL === 'false') {
-              toAdd.add('internal');
-            }
-
-            // ── Apply changes ──
-            // Ensure all labels we're about to add exist (addLabels returns
-            // 422 if any label in the batch is missing, which would prevent
-            // ALL labels from being applied).
-            for (const name of toAdd) {
-              await h.ensureLabel(name);
-            }
-
-            for (const name of toRemove) {
-              if (toAdd.has(name)) continue;
-              try {
-                await github.rest.issues.removeLabel({
-                  owner, repo, issue_number: prNumber, name,
-                });
-              } catch (e) {
-                if (e.status !== 404) throw e;
-              }
-            }
-
-            const addList = [...toAdd];
-            if (addList.length > 0) {
-              await github.rest.issues.addLabels({
-                owner, repo, issue_number: prNumber, labels: addList,
-              });
-            }
-
-            const removed = [...toRemove].filter(r => !toAdd.has(r));
-            console.log(`PR #${prNumber}: +[${addList.join(', ')}] -[${removed.join(', ')}]`);
-
-      # Apply tier label BEFORE the external label so that
-      # "trusted-contributor" is already present when the "external" labeled
-      # event fires and triggers require_issue_link.yml.
-      - name: Apply contributor tier label
-        if: github.event.action == 'opened' && steps.check-membership.outputs.is-external == 'true'
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          github-token: ${{ steps.app-token.outputs.token }}
-          script: |
-            const { owner, repo } = context.repo;
-            const { h } = require('./.github/scripts/pr-labeler.js').loadAndInit(github, owner, repo, core);
-
-            const pr = context.payload.pull_request;
-            await h.applyTierLabel(pr.number, pr.user.login);
-
-      - name: Add external label
-        if: github.event.action == 'opened' && steps.check-membership.outputs.is-external == 'true'
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          # Use App token so the "labeled" event propagates to downstream
-          # workflows (e.g. require_issue_link.yml). Events created by the
-          # default GITHUB_TOKEN do not trigger additional workflow runs.
-          github-token: ${{ steps.app-token.outputs.token }}
-          script: |
-            const { owner, repo } = context.repo;
-            const prNumber = context.payload.pull_request.number;
-
-            const { h } = require('./.github/scripts/pr-labeler.js').loadAndInit(github, owner, repo, core);
-
-            await h.ensureLabel('external');
-            await github.rest.issues.addLabels({
-              owner, repo,
-              issue_number: prNumber,
-              labels: ['external'],
-            });
-            console.log(`Added 'external' label to PR #${prNumber}`);
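Review note on the removed `pr_labeler.yml`: the "Apply PR labels" step reconciles size labels by adding the freshly computed one and removing any other size label already on the PR. A minimal sketch of that set logic follows. It assumes a hypothetical `SIZE_LABELS` list (the real values come from `h.sizeLabels` in `.github/scripts/pr-labeler.js`, which is not shown in this diff), and `planSizeLabels` is an illustrative name, not a function from the repository.

```javascript
// Hypothetical label names; the workflow reads the real list from h.sizeLabels.
const SIZE_LABELS = ['size: XS', 'size: S', 'size: M', 'size: L'];

// Decide which labels to add and remove, given the labels currently on the PR
// and the size label computed for the current diff.
function planSizeLabels(currentLabels, sizeLabel, sizeLabels = SIZE_LABELS) {
  const toAdd = new Set([sizeLabel]);
  const toRemove = new Set();

  // Any other size label already present is stale.
  for (const sl of sizeLabels) {
    if (currentLabels.includes(sl) && sl !== sizeLabel) toRemove.add(sl);
  }

  // Same guard as the workflow: never remove a label that is also being added.
  const remove = [...toRemove].filter((name) => !toAdd.has(name));
  return { add: [...toAdd], remove };
}

console.log(planSizeLabels(['size: S', 'bug'], 'size: M'));
```

Labels outside the managed size set (such as `bug` above) are left untouched, which matches the workflow only removing labels it owns.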
--- a/.github/workflows/pr_labeler_backfill.yml
+++ b/.github/workflows/pr_labeler_backfill.yml
@@ -1,130 +0,0 @@
-# Backfill PR labels on all open PRs.
-#
-# Manual-only workflow that applies the same labels as pr_labeler.yml
-# (size, file, title, contributor classification) to existing open PRs.
-# Reuses shared logic from .github/scripts/pr-labeler.js.
-
-name: "🏷️ PR Labeler Backfill"
-
-on:
-  workflow_dispatch:
-    inputs:
-      max_items:
-        description: "Maximum number of open PRs to process"
-        default: "100"
-        type: string
-
-permissions:
-  contents: read
-
-jobs:
-  backfill:
-    runs-on: ubuntu-latest
-    permissions:
-      contents: read
-      pull-requests: write
-      issues: write
-
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: Generate GitHub App token
-        id: app-token
-        uses: actions/create-github-app-token@f8d387b68d61c58ab83c6c016672934102569859 # v3
-        with:
-          app-id: ${{ secrets.ORG_MEMBERSHIP_APP_ID }}
-          private-key: ${{ secrets.ORG_MEMBERSHIP_APP_PRIVATE_KEY }}
-
-      - name: Backfill labels on open PRs
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          github-token: ${{ steps.app-token.outputs.token }}
-          script: |
-            const { owner, repo } = context.repo;
-            const rawMax = '${{ inputs.max_items }}';
-            const maxItems = parseInt(rawMax, 10);
-            if (isNaN(maxItems) || maxItems <= 0) {
-              core.setFailed(`Invalid max_items: "${rawMax}" — must be a positive integer`);
-              return;
-            }
-
-            const { h } = require('./.github/scripts/pr-labeler.js').loadAndInit(github, owner, repo, core);
-
-            for (const name of [...h.sizeLabels, ...h.tierLabels]) {
-              await h.ensureLabel(name);
-            }
-
-            const contributorCache = new Map();
-            const fileRules = h.buildFileRules();
-
-            const prs = await github.paginate(github.rest.pulls.list, {
-              owner, repo, state: 'open', per_page: 100,
-            });
-
-            let processed = 0;
-            let failures = 0;
-            for (const pr of prs) {
-              if (processed >= maxItems) break;
-              try {
-                const author = pr.user.login;
-                const info = await h.getContributorInfo(contributorCache, author, pr.user.type);
-                const labels = new Set();
-
-                labels.add(info.isExternal ? 'external' : 'internal');
-                if (info.isExternal && info.mergedCount != null && info.mergedCount >= h.trustedThreshold) {
-                  labels.add('trusted-contributor');
-                } else if (info.isExternal && info.mergedCount === 0) {
-                  labels.add('new-contributor');
-                }
-
-                // Size + file labels
-                const files = await github.paginate(github.rest.pulls.listFiles, {
-                  owner, repo, pull_number: pr.number, per_page: 100,
-                });
-                const { sizeLabel } = h.computeSize(files);
-                labels.add(sizeLabel);
-
-                for (const label of h.matchFileLabels(files, fileRules)) {
-                  labels.add(label);
-                }
-
-                // Title labels
-                const { labels: titleLabels } = h.matchTitleLabels(pr.title ?? '');
-                for (const tl of titleLabels) labels.add(tl);
-
-                // Ensure all labels exist before batch add
-                for (const name of labels) {
-                  await h.ensureLabel(name);
-                }
-
-                // Remove stale managed labels
-                const currentLabels = (await github.paginate(
-                  github.rest.issues.listLabelsOnIssue,
-                  { owner, repo, issue_number: pr.number, per_page: 100 },
-                )).map(l => l.name ?? '');
-
-                const managed = [...h.sizeLabels, ...h.tierLabels, ...h.allTypeLabels];
-                for (const name of currentLabels) {
-                  if (managed.includes(name) && !labels.has(name)) {
-                    try {
-                      await github.rest.issues.removeLabel({
-                        owner, repo, issue_number: pr.number, name,
-                      });
-                    } catch (e) {
-                      if (e.status !== 404) throw e;
-                    }
-                  }
-                }
-
-                await github.rest.issues.addLabels({
-                  owner, repo, issue_number: pr.number, labels: [...labels],
-                });
-                console.log(`PR #${pr.number} (${author}): ${[...labels].join(', ')}`);
-                processed++;
-              } catch (e) {
-                failures++;
-                core.warning(`Failed to process PR #${pr.number}: ${e.message}`);
-              }
-            }
-
-            console.log(`\nBackfill complete. Processed ${processed} PRs, ${failures} failures. ${contributorCache.size} unique authors.`);
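Review note on the removed `pr_labeler_backfill.yml`: the contributor classification inside the loop can be read as a small pure function. The sketch below mirrors the branch structure from the script. `TRUSTED_THRESHOLD` is a hypothetical value (the workflow reads `h.trustedThreshold`, whose value is not visible in this diff), and `contributorLabels` is an illustrative name.

```javascript
// Hypothetical threshold; the real value lives in h.trustedThreshold.
const TRUSTED_THRESHOLD = 5;

// info has the shape used in the backfill script: { isExternal, mergedCount }.
function contributorLabels(info, trustedThreshold = TRUSTED_THRESHOLD) {
  const labels = new Set();
  labels.add(info.isExternal ? 'external' : 'internal');

  if (info.isExternal && info.mergedCount != null && info.mergedCount >= trustedThreshold) {
    labels.add('trusted-contributor');
  } else if (info.isExternal && info.mergedCount === 0) {
    labels.add('new-contributor');
  }
  // External authors with an unknown count, or a count between 1 and the
  // threshold, get no tier label.
  return [...labels];
}

console.log(contributorLabels({ isExternal: true, mergedCount: 0 }));
```

Note that a `null` merged count (for example, when the lookup failed) yields only `external`, so a failed lookup never marks someone as a new contributor.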
--- a/.github/workflows/pr_lint.yml
+++ b/.github/workflows/pr_lint.yml
@@ -1,128 +0,0 @@
-# PR title linting.
-#
-# FORMAT (Conventional Commits 1.0.0):
-#
-#   <type>[optional scope]: <description>
-#   [optional body]
-#   [optional footer(s)]
-#
-# Examples:
-#     feat(core): add multi‐tenant support
-#     fix(langchain): resolve error
-#     docs: update API usage examples
-#     docs(openai): update API usage examples
-#
-# Allowed Types:
-#   * feat       — a new feature (MINOR)
-#   * fix        — a bug fix (PATCH)
-#   * docs       — documentation only changes
-#   * style      — formatting, linting, etc.; no code change or typing refactors
-#   * refactor   — code change that neither fixes a bug nor adds a feature
-#   * perf       — code change that improves performance
-#   * test       — adding tests or correcting existing
-#   * build      — changes that affect the build system/external dependencies
-#   * ci         — continuous integration/configuration changes
-#   * chore      — other changes that don't modify source or test files
-#   * revert     — reverts a previous commit
-#   * release    — prepare a new release
-#   * hotfix     — urgent fix
-#
-# Allowed Scope(s) (optional):
-#   core, langchain, langchain-classic, model-profiles,
-#   standard-tests, text-splitters, docs, anthropic, chroma, deepseek, exa,
-#   fireworks, groq, huggingface, mistralai, nomic, ollama, openai,
-#   perplexity, qdrant, xai, infra, deps, partners
-#
-# Multiple scopes can be used by separating them with a comma. For example:
-#
-#   feat(core,langchain): add multi‐tenant support to core and langchain
-#
-# Note: PRs touching the langchain package should use the 'langchain' scope. It is not
-#   acceptable to omit the scope for changes to the langchain package, despite it being
-#   the main package & name of the repo.
-#
-# Rules:
-#   1. The 'Type' must start with a lowercase letter.
-#   2. Breaking changes: append "!" after type/scope (e.g., feat!: drop x support)
-#   3. When releasing (updating the pyproject.toml and uv.lock), the commit message
-#      should be: `release(scope): x.y.z` (e.g., `release(core): 1.2.0` with no
-#      body, footer, or preceding/following text).
-#
-# Enforces Conventional Commits format for pull request titles to maintain a clear and
-# machine-readable change history.
-
-name: "🏷️ PR Title Lint"
-
-permissions:
-  pull-requests: read
-
-on:
-  pull_request:
-    types: [opened, edited, synchronize]
-
-jobs:
-  # Validates that PR title follows Conventional Commits 1.0.0 specification
-  lint-pr-title:
-    name: "validate format"
-    runs-on: ubuntu-latest
-    steps:
-      - name: "🚫 Reject empty scope"
-        env:
-          PR_TITLE: ${{ github.event.pull_request.title }}
-        run: |
-          if [[ "$PR_TITLE" =~ ^[a-z]+\(\)[!]?: ]]; then
-            echo "::error::PR title has empty scope parentheses: '$PR_TITLE'"
-            echo "Either remove the parentheses or provide a scope (e.g., 'fix(core): ...')."
-            exit 1
-          fi
-      - name: "✅ Validate Conventional Commits Format"
-        uses: amannn/action-semantic-pull-request@48f256284bd46cdaab1048c3721360e808335d50 # v6
-        env:
-          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
-        with:
-          types: |
-            feat
-            fix
-            docs
-            style
-            refactor
-            perf
-            test
-            build
-            ci
-            chore
-            revert
-            release
-            hotfix
-          scopes: |
-            core
-            langchain
-            langchain-classic
-            model-profiles
-            standard-tests
-            text-splitters
-            docs
-            anthropic
-            chroma
-            deepseek
-            exa
-            fireworks
-            groq
-            huggingface
-            mistralai
-            nomic
-            ollama
-            openai
-            openrouter
-            perplexity
-            qdrant
-            xai
-            infra
-            deps
-            partners
-          requireScope: false
-          disallowScopes: |
-            release
-            [A-Z]+
-          ignoreLabels: |
-            ignore-lint-pr-title
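Review note on the removed `pr_lint.yml`: the "Reject empty scope" step uses a bash regex to catch titles such as `fix(): ...` before the semantic-PR action runs. The sketch below is a JavaScript rendering of the same pattern, handy for checking titles locally. `hasEmptyScope` is an illustrative name and is not part of the workflow.

```javascript
// JavaScript equivalent of the bash test: [[ "$PR_TITLE" =~ ^[a-z]+\(\)[!]?: ]]
// Matches a lowercase type, literal empty parentheses, an optional "!", then ":".
const EMPTY_SCOPE = /^[a-z]+\(\)!?:/;

function hasEmptyScope(title) {
  return EMPTY_SCOPE.test(title);
}

for (const title of [
  'fix(): resolve error',
  'feat()!: drop x support',
  'fix(core): resolve error',
  'docs: update API usage examples',
]) {
  console.log(`${title} -> ${hasEmptyScope(title) ? 'rejected' : 'ok'}`);
}
```

Only the empty-parentheses case is rejected here. Whether the type and scope are in the allowed lists is left to the second step of the workflow.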
--- a/.github/workflows/refresh_model_profiles.yml
+++ b/.github/workflows/refresh_model_profiles.yml
@@ -1,45 +0,0 @@
-# Refreshes model profile data for all in-monorepo partner integrations by
-# pulling the latest metadata from models.dev via the `langchain-profiles` CLI.
-#
-# Creates a pull request with any changes. Runs daily and can be triggered
-# manually from the Actions UI. Uses a fixed branch so each run supersedes
-# any stale PR from a previous run.
-
-name: "🔄 Refresh Model Profiles"
-
-on:
-  schedule:
-    - cron: "0 8 * * *" # daily at 08:00 UTC
-  workflow_dispatch:
-
-permissions:
-  contents: write
-  pull-requests: write
-
-jobs:
-  refresh-profiles:
-    uses: ./.github/workflows/_refresh_model_profiles.yml
-    with:
-      providers: >-
-        [
-          {"provider":"anthropic",    "data_dir":"libs/partners/anthropic/langchain_anthropic/data"},
-          {"provider":"deepseek",     "data_dir":"libs/partners/deepseek/langchain_deepseek/data"},
-          {"provider":"fireworks-ai", "data_dir":"libs/partners/fireworks/langchain_fireworks/data"},
-          {"provider":"groq",         "data_dir":"libs/partners/groq/langchain_groq/data"},
-          {"provider":"huggingface",  "data_dir":"libs/partners/huggingface/langchain_huggingface/data"},
-          {"provider":"mistral",      "data_dir":"libs/partners/mistralai/langchain_mistralai/data"},
-          {"provider":"openai",       "data_dir":"libs/partners/openai/langchain_openai/data"},
-          {"provider":"openrouter",   "data_dir":"libs/partners/openrouter/langchain_openrouter/data"},
-          {"provider":"perplexity",   "data_dir":"libs/partners/perplexity/langchain_perplexity/data"},
-          {"provider":"xai",          "data_dir":"libs/partners/xai/langchain_xai/data"}
-        ]
-      cli-path: libs/model-profiles
-      add-paths: libs/partners/**/data/_profiles.py
-      pr-body: |
-        Automated refresh of model profile data for all in-monorepo partner
-        integrations via `langchain-profiles refresh`.
-
-        🤖 Generated by the `refresh_model_profiles` workflow.
-    secrets:
-      MODEL_PROFILE_BOT_APP_ID: ${{ secrets.MODEL_PROFILE_BOT_APP_ID }}
-      MODEL_PROFILE_BOT_PRIVATE_KEY: ${{ secrets.MODEL_PROFILE_BOT_PRIVATE_KEY }}
--- a/.github/workflows/reopen_on_assignment.yml
+++ b/.github/workflows/reopen_on_assignment.yml
@@ -1,195 +0,0 @@
-# Reopen PRs that were auto-closed by require_issue_link.yml when the
-# contributor was not assigned to the linked issue. When a maintainer
-# assigns the contributor to the issue, this workflow finds matching
-# closed PRs, verifies the issue link, and reopens them.
-#
-# Uses the default GITHUB_TOKEN (not a PAT or app token) so that the
-# reopen and label-removal events do NOT re-trigger other workflows.
-# GitHub suppresses events created by the default GITHUB_TOKEN within
-# workflow runs to prevent infinite loops.
-
-name: Reopen PR on Issue Assignment
-
-on:
-  issues:
-    types: [assigned]
-
-permissions:
-  contents: read
-
-jobs:
-  reopen-linked-prs:
-    runs-on: ubuntu-latest
-    permissions:
-      actions: write
-      pull-requests: write
-
-    steps:
-      - name: Find and reopen matching PRs
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          script: |
-            const { owner, repo } = context.repo;
-            const issueNumber = context.payload.issue.number;
-            const assignee = context.payload.assignee.login;
-
-            console.log(
-              `Issue #${issueNumber} assigned to ${assignee} — searching for closed PRs to reopen`,
-            );
-
-            const q = [
-              `is:pr`,
-              `is:closed`,
-              `author:${assignee}`,
-              `label:missing-issue-link`,
-              `repo:${owner}/${repo}`,
-            ].join(' ');
-
-            let data;
-            try {
-              ({ data } = await github.rest.search.issuesAndPullRequests({
-                q,
-                per_page: 30,
-              }));
-            } catch (e) {
-              throw new Error(
-                `Failed to search for closed PRs to reopen after assigning ${assignee} ` +
-                `to #${issueNumber} (HTTP ${e.status ?? 'unknown'}): ${e.message}`,
-              );
-            }
-
-            if (data.total_count === 0) {
-              console.log('No matching closed PRs found');
-              return;
-            }
-
-            console.log(`Found ${data.total_count} candidate PR(s)`);
-
-            // Must stay in sync with the identical pattern in require_issue_link.yml
-            const pattern = /(?:close[sd]?|fix(?:e[sd])?|resolve[sd]?)\s*#(\d+)/gi;
-
-            for (const item of data.items) {
-              const prNumber = item.number;
-              const body = item.body || '';
-              const matches = [...body.matchAll(pattern)];
-              const referencedIssues = matches.map(m => parseInt(m[1], 10));
-
-              if (!referencedIssues.includes(issueNumber)) {
-                console.log(`PR #${prNumber} does not reference #${issueNumber} — skipping`);
-                continue;
-              }
-
-              // Skip if already bypassed
-              const labels = item.labels.map(l => l.name);
-              if (labels.includes('bypass-issue-check')) {
-                console.log(`PR #${prNumber} already has bypass-issue-check — skipping`);
-                continue;
-              }
-
-              // Reopen first, remove label second — a closed PR that still has
-              // missing-issue-link is recoverable; a closed PR with the label
-              // stripped is invisible to both workflows.
-              try {
-                await github.rest.pulls.update({
-                  owner,
-                  repo,
-                  pull_number: prNumber,
-                  state: 'open',
-                });
-                console.log(`Reopened PR #${prNumber}`);
-              } catch (e) {
-                if (e.status === 422) {
-                  // Head branch deleted — PR is unrecoverable. Notify the
-                  // contributor so they know to open a new PR.
-                  core.warning(`Cannot reopen PR #${prNumber}: head branch was likely deleted`);
-                  try {
-                    await github.rest.issues.createComment({
-                      owner,
-                      repo,
-                      issue_number: prNumber,
-                      body:
-                        `You have been assigned to #${issueNumber}, but this PR could not be ` +
-                        `reopened because the head branch has been deleted. Please open a new ` +
-                        `PR referencing the issue.`,
-                    });
-                  } catch (commentErr) {
-                    core.warning(
-                      `Also failed to post comment on PR #${prNumber}: ${commentErr.message}`,
-                    );
-                  }
-                  continue;
-                }
-                // Transient errors (rate limit, 5xx) should fail the job so
-                // the label is NOT removed and the run can be retried.
-                throw e;
-              }
-
-              // Remove missing-issue-link label only after successful reopen
-              try {
-                await github.rest.issues.removeLabel({
-                  owner,
-                  repo,
-                  issue_number: prNumber,
-                  name: 'missing-issue-link',
-                });
-                console.log(`Removed missing-issue-link from PR #${prNumber}`);
-              } catch (e) {
-                if (e.status !== 404) throw e;
-              }
-
-              // Minimize stale enforcement comment (best-effort;
-              // sync w/ require_issue_link.yml minimize blocks)
-              try {
-                const marker = '<!-- require-issue-link -->';
-                const comments = await github.paginate(
-                  github.rest.issues.listComments,
-                  { owner, repo, issue_number: prNumber, per_page: 100 },
-                );
-                const stale = comments.find(c => c.body && c.body.includes(marker));
-                if (stale) {
-                  await github.graphql(`
-                    mutation($id: ID!) {
-                      minimizeComment(input: {subjectId: $id, classifier: OUTDATED}) {
-                        minimizedComment { isMinimized }
-                      }
-                    }
-                  `, { id: stale.node_id });
-                  console.log(`Minimized stale enforcement comment ${stale.id} as outdated`);
-                }
-              } catch (e) {
-                core.warning(`Could not minimize stale comment on PR #${prNumber}: ${e.message}`);
-              }
-
-              // Re-run the failed require_issue_link check so it picks up the
-              // new assignment.  The re-run uses the original event payload but
-              // fetches live issue data, so the assignment check will pass.
-              //
-              // Limitation: we look up runs by the PR's current head SHA.  If the
-              // contributor pushed new commits while the PR was closed, head.sha
-              // won't match the SHA of the original failed run and the query will
-              // return 0 results.  This is acceptable because any push after reopen
-              // triggers a fresh require_issue_link run against the new SHA.
-              try {
-                const { data: pr } = await github.rest.pulls.get({
-                  owner, repo, pull_number: prNumber,
-                });
-                const { data: runs } = await github.rest.actions.listWorkflowRuns({
-                  owner, repo,
-                  workflow_id: 'require_issue_link.yml',
-                  head_sha: pr.head.sha,
-                  status: 'failure',
-                  per_page: 1,
-                });
-                if (runs.workflow_runs.length > 0) {
-                  await github.rest.actions.reRunWorkflowFailedJobs({
-                    owner, repo,
-                    run_id: runs.workflow_runs[0].id,
-                  });
-                  console.log(`Re-ran failed require_issue_link run ${runs.workflow_runs[0].id} for PR #${prNumber}`);
-                } else {
-                  console.log(`No failed require_issue_link runs found for PR #${prNumber} — skipping re-run`);
-                }
-              } catch (e) {
-                core.warning(`Could not re-run require_issue_link check for PR #${prNumber} (HTTP ${e.status ?? 'unknown'}): ${e.message}`);
-              }
-            }
--- a/.github/workflows/require_issue_link.yml
+++ b/.github/workflows/require_issue_link.yml
@@ -1,467 +0,0 @@
-# Require external PRs to reference an approved issue (e.g. Fixes #NNN) and
-# the PR author to be assigned to that issue. On failure the PR is
-# labeled "missing-issue-link", commented on, and closed.
-#
-# Maintainer override: an org member can reopen the PR or remove
-# "missing-issue-link" — both add "bypass-issue-check" and reopen.
-#
-# Dependency: pr_labeler.yml must apply the "external" label first. This
-# workflow does NOT trigger on "opened" (new PRs have no labels yet, so the
-# gate would always skip).
-
-name: Require Issue Link
-
-on:
-  pull_request_target:
-    # NEVER CHECK OUT UNTRUSTED CODE FROM A PR's HEAD IN A pull_request_target JOB.
-    # Doing so would allow attackers to execute arbitrary code in the context of your repository.
-    types: [edited, reopened, labeled, unlabeled]
-
-# ──────────────────────────────────────────────────────────────────────────────
-# Enforcement gate: set to 'true' to activate the issue link requirement.
-# When 'false', the workflow still runs the check logic (useful for dry-run
-# visibility) but will NOT label, comment, close, or fail PRs.
-# ──────────────────────────────────────────────────────────────────────────────
-env:
-  ENFORCE_ISSUE_LINK: "true"
-
-permissions:
-  contents: read
-
-jobs:
-  check-issue-link:
-    # Run when the "external" label is added, on edit/reopen if already labeled,
-    # or when "missing-issue-link" is removed (triggers maintainer override check).
-    # Skip entirely when the PR already carries "trusted-contributor" or
-    # "bypass-issue-check".
-    if: >-
-      !contains(github.event.pull_request.labels.*.name, 'trusted-contributor') &&
-      !contains(github.event.pull_request.labels.*.name, 'bypass-issue-check') &&
-      (
-        (github.event.action == 'labeled' && github.event.label.name == 'external') ||
-        (github.event.action == 'unlabeled' && github.event.label.name == 'missing-issue-link' && contains(github.event.pull_request.labels.*.name, 'external')) ||
-        (github.event.action != 'labeled' && github.event.action != 'unlabeled' && contains(github.event.pull_request.labels.*.name, 'external'))
-      )
-    runs-on: ubuntu-latest
-    permissions:
-      actions: write
-      pull-requests: write
-
-    steps:
-      - name: Check for issue link and assignee
-        id: check-link
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          script: |
-            const { owner, repo } = context.repo;
-            const prNumber = context.payload.pull_request.number;
-            const action = context.payload.action;
-
-            // ── Helper: ensure a label exists, then add it to the PR ────────
-            async function ensureAndAddLabel(labelName, color) {
-              try {
-                await github.rest.issues.getLabel({ owner, repo, name: labelName });
-              } catch (e) {
-                if (e.status !== 404) throw e;
-                try {
-                  await github.rest.issues.createLabel({ owner, repo, name: labelName, color });
-                } catch (createErr) {
-                  // 422 = label was created by a concurrent run between our
-                  // GET and POST — safe to ignore.
-                  if (createErr.status !== 422) throw createErr;
-                }
-              }
-              await github.rest.issues.addLabels({
-                owner, repo, issue_number: prNumber, labels: [labelName],
-              });
-            }
-
-            // ── Helper: check if the user who triggered this event (reopened
-            // the PR / removed the label) has write+ access on the repo ───
-            // Uses the repo collaborator permission endpoint instead of the
-            // org membership endpoint. The org endpoint requires the caller
-            // to be an org member, which GITHUB_TOKEN (an app installation
-            // token) never is — so it always returns 403.
-            async function senderIsOrgMember() {
-              const sender = context.payload.sender?.login;
-              if (!sender) {
-                throw new Error('Event has no sender — cannot check permissions');
-              }
-              try {
-                const { data } = await github.rest.repos.getCollaboratorPermissionLevel({
-                  owner, repo, username: sender,
-                });
-                const perm = data.permission;
-                if (['admin', 'maintain', 'write'].includes(perm)) {
-                  console.log(`${sender} has ${perm} permission — treating as maintainer`);
-                  return { isMember: true, login: sender };
-                }
-                console.log(`${sender} has ${perm} permission — not a maintainer`);
-                return { isMember: false, login: sender };
-              } catch (e) {
-                if (e.status === 404) {
-                  console.log(`Cannot check permissions for ${sender} — treating as non-maintainer`);
-                  return { isMember: false, login: sender };
-                }
-                const status = e.status ?? 'unknown';
-                throw new Error(
-                  `Permission check failed for ${sender} (HTTP ${status}): ${e.message}`,
-                );
-              }
-            }
-
-            // ── Helper: apply maintainer bypass (shared by both override paths) ──
-            async function applyMaintainerBypass(reason) {
-              console.log(reason);
-
-              // Remove missing-issue-link if present
-              try {
-                await github.rest.issues.removeLabel({
-                  owner, repo, issue_number: prNumber, name: 'missing-issue-link',
-                });
-              } catch (e) {
-                if (e.status !== 404) throw e;
-              }
-
-              // Reopen before adding bypass label — a failed reopen is more
-              // actionable than a closed PR with a bypass label stuck on it.
-              if (context.payload.pull_request.state === 'closed') {
-                try {
-                  await github.rest.pulls.update({
-                    owner, repo, pull_number: prNumber, state: 'open',
-                  });
-                  console.log(`Reopened PR #${prNumber}`);
-                } catch (e) {
-                  // 422 if head branch deleted; 403 if permissions insufficient.
-                  // Bypass labels still apply — maintainer can reopen manually.
-                  core.warning(
-                    `Could not reopen PR #${prNumber} (HTTP ${e.status ?? 'unknown'}): ${e.message}. ` +
-                    `Bypass labels were applied — a maintainer may need to reopen manually.`,
-                  );
-                }
-              }
-
-              // Add bypass-issue-check so future triggers skip enforcement
-              await ensureAndAddLabel('bypass-issue-check', '0e8a16');
-
-              // Minimize stale enforcement comment (best-effort; must not
-              // abort bypass — sync w/ reopen_on_assignment.yml & step below)
-              try {
-                const marker = '<!-- require-issue-link -->';
-                const comments = await github.paginate(
-                  github.rest.issues.listComments,
-                  { owner, repo, issue_number: prNumber, per_page: 100 },
-                );
-                const stale = comments.find(c => c.body && c.body.includes(marker));
-                if (stale) {
-                  await github.graphql(`
-                    mutation($id: ID!) {
-                      minimizeComment(input: {subjectId: $id, classifier: OUTDATED}) {
-                        minimizedComment { isMinimized }
-                      }
-                    }
-                  `, { id: stale.node_id });
-                  console.log(`Minimized stale enforcement comment ${stale.id} as outdated`);
-                }
-              } catch (e) {
-                core.warning(`Could not minimize stale comment on PR #${prNumber}: ${e.message}`);
-              }
-
-              core.setOutput('has-link', 'true');
-              core.setOutput('is-assigned', 'true');
-            }
-
-            // ── Maintainer override: removed "missing-issue-link" label ─────
-            if (action === 'unlabeled') {
-              const { isMember, login } = await senderIsOrgMember();
-              if (isMember) {
-                await applyMaintainerBypass(
-                  `Maintainer ${login} removed missing-issue-link from PR #${prNumber} — bypassing enforcement`,
-                );
-                return;
-              }
-              // Non-member removed the label — re-add it defensively and
-              // set failure outputs so downstream steps (comment, close) fire.
-              // NOTE: addLabels fires a "labeled" event, but the job-level gate
-              // only matches labeled events for "external", so no re-trigger.
-              console.log(`Non-member ${login} removed missing-issue-link — re-adding`);
-              try {
-                await ensureAndAddLabel('missing-issue-link', 'b76e79');
-              } catch (e) {
-                core.warning(
-                  `Failed to re-add missing-issue-link (HTTP ${e.status ?? 'unknown'}): ${e.message}. ` +
-                  `Downstream step will retry.`,
-                );
-              }
-              core.setOutput('has-link', 'false');
-              core.setOutput('is-assigned', 'false');
-              return;
-            }
-
-            // ── Maintainer override: reopened PR with "missing-issue-link" ──
-            const prLabels = context.payload.pull_request.labels.map(l => l.name);
-            if (action === 'reopened' && prLabels.includes('missing-issue-link')) {
-              const { isMember, login } = await senderIsOrgMember();
-              if (isMember) {
-                await applyMaintainerBypass(
-                  `Maintainer ${login} reopened PR #${prNumber} — bypassing enforcement`,
-                );
-                return;
-              }
-              console.log(`Non-member ${login} reopened PR — proceeding with check`);
-            }
-
-            // ── Fetch live labels (race guard) ──────────────────────────────
-            const { data: liveLabels } = await github.rest.issues.listLabelsOnIssue({
-              owner, repo, issue_number: prNumber,
-            });
-            const liveNames = liveLabels.map(l => l.name);
-            if (liveNames.includes('trusted-contributor') || liveNames.includes('bypass-issue-check')) {
-              console.log('PR has trusted-contributor or bypass-issue-check label — bypassing');
-              core.setOutput('has-link', 'true');
-              core.setOutput('is-assigned', 'true');
-              return;
-            }
-
-            const body = context.payload.pull_request.body || '';
-            const pattern = /(?:close[sd]?|fix(?:e[sd])?|resolve[sd]?)\s*#(\d+)/gi;
-            const matches = [...body.matchAll(pattern)];
-
-            if (matches.length === 0) {
-              console.log('No issue link found in PR body');
-              core.setOutput('has-link', 'false');
-              core.setOutput('is-assigned', 'false');
-              return;
-            }
-
-            const issues = matches.map(m => `#${m[1]}`).join(', ');
-            console.log(`Found issue link(s): ${issues}`);
-            core.setOutput('has-link', 'true');
-
-            // Check whether the PR author is assigned to at least one linked issue
-            const prAuthor = context.payload.pull_request.user.login;
-            const MAX_ISSUES = 5;
-            const allIssueNumbers = [...new Set(matches.map(m => parseInt(m[1], 10)))];
-            const issueNumbers = allIssueNumbers.slice(0, MAX_ISSUES);
-            if (allIssueNumbers.length > MAX_ISSUES) {
-              core.warning(
-                `PR references ${allIssueNumbers.length} issues — only checking the first ${MAX_ISSUES}`,
-              );
-            }
-
-            let assignedToAny = false;
-            for (const num of issueNumbers) {
-              try {
-                const { data: issue } = await github.rest.issues.get({
-                  owner, repo, issue_number: num,
-                });
-                const assignees = issue.assignees.map(a => a.login.toLowerCase());
-                if (assignees.includes(prAuthor.toLowerCase())) {
-                  console.log(`PR author "${prAuthor}" is assigned to #${num}`);
-                  assignedToAny = true;
-                  break;
-                } else {
-                  console.log(`PR author "${prAuthor}" is NOT assigned to #${num} (assignees: ${assignees.join(', ') || 'none'})`);
-                }
-              } catch (error) {
-                if (error.status === 404) {
-                  console.log(`Issue #${num} not found — skipping`);
-                } else {
-                  // Non-404 errors (rate limit, server error) must not be
-                  // silently skipped — they could cause false enforcement
-                  // (closing a legitimate PR whose assignment can't be verified).
-                  throw new Error(
-                    `Cannot verify assignee for issue #${num} (${error.status}): ${error.message}`,
-                  );
-                }
-              }
-            }
-
-            core.setOutput('is-assigned', assignedToAny ? 'true' : 'false');
-
-      - name: Add missing-issue-link label
-        if: >-
-          env.ENFORCE_ISSUE_LINK == 'true' &&
-          (steps.check-link.outputs.has-link != 'true' || steps.check-link.outputs.is-assigned != 'true')
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          script: |
-            const { owner, repo } = context.repo;
-            const prNumber = context.payload.pull_request.number;
-            const labelName = 'missing-issue-link';
-
-            // Ensure the label exists (no checkout/shared helper available)
-            try {
-              await github.rest.issues.getLabel({ owner, repo, name: labelName });
-            } catch (e) {
-              if (e.status !== 404) throw e;
-              try {
-                await github.rest.issues.createLabel({
-                  owner, repo, name: labelName, color: 'b76e79',
-                });
-              } catch (createErr) {
-                if (createErr.status !== 422) throw createErr;
-              }
-            }
-
-            await github.rest.issues.addLabels({
-              owner, repo, issue_number: prNumber, labels: [labelName],
-            });
-
-      - name: Remove missing-issue-link label and reopen PR
-        if: >-
-          env.ENFORCE_ISSUE_LINK == 'true' &&
-          steps.check-link.outputs.has-link == 'true' && steps.check-link.outputs.is-assigned == 'true'
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          script: |
-            const { owner, repo } = context.repo;
-            const prNumber = context.payload.pull_request.number;
-            try {
-              await github.rest.issues.removeLabel({
-                owner, repo, issue_number: prNumber, name: 'missing-issue-link',
-              });
-            } catch (error) {
-              if (error.status !== 404) throw error;
-            }
-
-            // Reopen if this workflow previously closed the PR. We check the
-            // event payload labels (not live labels) because we already removed
-            // missing-issue-link above; the payload still reflects pre-step state.
-            const labels = context.payload.pull_request.labels.map(l => l.name);
-            if (context.payload.pull_request.state === 'closed' && labels.includes('missing-issue-link')) {
-              await github.rest.pulls.update({
-                owner,
-                repo,
-                pull_number: prNumber,
-                state: 'open',
-              });
-              console.log(`Reopened PR #${prNumber}`);
-            }
-
-            // Minimize stale enforcement comment (best-effort;
-            // sync w/ applyMaintainerBypass above & reopen_on_assignment.yml)
-            try {
-              const marker = '<!-- require-issue-link -->';
-              const comments = await github.paginate(
-                github.rest.issues.listComments,
-                { owner, repo, issue_number: prNumber, per_page: 100 },
-              );
-              const stale = comments.find(c => c.body && c.body.includes(marker));
-              if (stale) {
-                await github.graphql(`
-                  mutation($id: ID!) {
-                    minimizeComment(input: {subjectId: $id, classifier: OUTDATED}) {
-                      minimizedComment { isMinimized }
-                    }
-                  }
-                `, { id: stale.node_id });
-                console.log(`Minimized stale enforcement comment ${stale.id} as outdated`);
-              }
-            } catch (e) {
-              core.warning(`Could not minimize stale comment on PR #${prNumber}: ${e.message}`);
-            }
-
-      - name: Post comment, close PR, and fail
-        if: >-
-          env.ENFORCE_ISSUE_LINK == 'true' &&
-          (steps.check-link.outputs.has-link != 'true' || steps.check-link.outputs.is-assigned != 'true')
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          script: |
-            const { owner, repo } = context.repo;
-            const prNumber = context.payload.pull_request.number;
-            const hasLink = '${{ steps.check-link.outputs.has-link }}' === 'true';
-            const isAssigned = '${{ steps.check-link.outputs.is-assigned }}' === 'true';
-            const marker = '<!-- require-issue-link -->';
-
-            let lines;
-            if (!hasLink) {
-              lines = [
-                marker,
-                '**This PR has been automatically closed** because it does not link to an approved issue.',
-                '',
-                'All external contributions must reference an approved issue or discussion. Please:',
-                '1. Find or [open an issue](https://github.com/' + owner + '/' + repo + '/issues/new/choose) describing the change',
-                '2. Wait for a maintainer to approve and assign you',
-                '3. Add `Fixes #<issue_number>`, `Closes #<issue_number>`, or `Resolves #<issue_number>` to your PR description and the PR will be reopened automatically',
-                '',
-                '*Maintainers: reopen this PR or remove the `missing-issue-link` label to bypass this check.*',
-              ];
-            } else {
-              lines = [
-                marker,
-                '**This PR has been automatically closed** because you are not assigned to the linked issue.',
-                '',
-                'External contributors must be assigned to an issue before opening a PR for it. Please:',
-                '1. Comment on the linked issue to request assignment from a maintainer',
-                '2. Once assigned, your PR will be reopened automatically',
-                '',
-                '*Maintainers: reopen this PR or remove the `missing-issue-link` label to bypass this check.*',
-              ];
-            }
-
-            const body = lines.join('\n');
-
-            // Deduplicate: check for existing comment with the marker
-            const comments = await github.paginate(
-              github.rest.issues.listComments,
-              { owner, repo, issue_number: prNumber, per_page: 100 },
-            );
-            const existing = comments.find(c => c.body && c.body.includes(marker));
-
-            if (!existing) {
-              await github.rest.issues.createComment({
-                owner,
-                repo,
-                issue_number: prNumber,
-                body,
-              });
-              console.log('Posted requirement comment');
-            } else if (existing.body !== body) {
-              await github.rest.issues.updateComment({
-                owner,
-                repo,
-                comment_id: existing.id,
-                body,
-              });
-              console.log('Updated existing comment with new message');
-            } else {
-              console.log('Comment already exists — skipping');
-            }
-
-            // Close the PR
-            if (context.payload.pull_request.state === 'open') {
-              await github.rest.pulls.update({
-                owner,
-                repo,
-                pull_number: prNumber,
-                state: 'closed',
-              });
-              console.log(`Closed PR #${prNumber}`);
-            }
-
-            // Cancel all other in-progress and queued workflow runs for this PR
-            const headSha = context.payload.pull_request.head.sha;
-            for (const status of ['in_progress', 'queued']) {
-              const runs = await github.paginate(
-                github.rest.actions.listWorkflowRunsForRepo,
-                { owner, repo, head_sha: headSha, status, per_page: 100 },
-              );
-              for (const run of runs) {
-                if (run.id === context.runId) continue;
-                try {
-                  await github.rest.actions.cancelWorkflowRun({
-                    owner, repo, run_id: run.id,
-                  });
-                  console.log(`Cancelled ${status} run ${run.id} (${run.name})`);
-                } catch (err) {
-                  console.log(`Could not cancel run ${run.id}: ${err.message}`);
-                }
-              }
-            }
-
-            const reason = !hasLink
-              ? 'PR must reference an issue using auto-close keywords (e.g., "Fixes #123").'
-              : 'PR author must be assigned to the linked issue.';
-            core.setFailed(reason);
--- a/.github/workflows/scheduled_test.yml
+++ b/.github/workflows/scheduled_test.yml
@@ -0,0 +1,78 @@
+name: Scheduled tests
+
+on:
+  workflow_dispatch:  # Allows triggering the workflow manually from the GitHub UI
+  schedule:
+    - cron:  '0 13 * * *'
+
+env:
+  POETRY_VERSION: "1.6.1"
+
+jobs:
+  build:
+    defaults:
+      run:
+        working-directory: libs/langchain
+    runs-on: ubuntu-latest
+    environment: Scheduled testing
+    strategy:
+      matrix:
+        python-version:
+          - "3.8"
+          - "3.9"
+          - "3.10"
+          - "3.11"
+          - "3.12"
+    name: Python ${{ matrix.python-version }}
+    steps:
+      - uses: actions/checkout@v3
+
+      - name: Set up Python ${{ matrix.python-version }}
+        uses: "./.github/actions/poetry_setup"
+        with:
+          python-version: ${{ matrix.python-version }}
+          poetry-version: ${{ env.POETRY_VERSION }}
+          working-directory: libs/langchain
+          cache-key: scheduled
+
+      - name: 'Authenticate to Google Cloud'
+        id: 'auth'
+        uses: 'google-github-actions/auth@v1'
+        with:
+          credentials_json: '${{ secrets.GOOGLE_CREDENTIALS }}'
+
+      - name: Configure AWS Credentials
+        uses: aws-actions/configure-aws-credentials@v4
+        with:
+          aws-access-key-id: ${{ secrets.AWS_ACCESS_KEY_ID }}
+          aws-secret-access-key: ${{ secrets.AWS_SECRET_ACCESS_KEY }}
+          aws-region: ${{ vars.AWS_REGION }}
+
+      - name: Install dependencies
+        working-directory: libs/langchain
+        shell: bash
+        run: |
+          echo "Running scheduled tests, installing dependencies with poetry..."
+          poetry install --with=test_integration
+          poetry run pip install google-cloud-aiplatform
+          poetry run pip install "boto3>=1.28.57"
+
+      - name: Run tests
+        shell: bash
+        env:
+          OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
+          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
+        run: |
+          make scheduled_tests
+
+      - name: Ensure the tests did not create any additional files
+        shell: bash
+        run: |
+          set -eu
+
+          STATUS="$(git status)"
+          echo "$STATUS"
+
+          # grep will exit non-zero if the target message isn't found,
+          # and `set -e` above will cause the step to fail.
+          echo "$STATUS" | grep 'nothing to commit, working tree clean'
--- a/.github/workflows/tag-external-issues.yml
+++ b/.github/workflows/tag-external-issues.yml
@@ -1,205 +0,0 @@
-# Automatically tag issues as "external" or "internal" based on whether
-# the author is a member of the langchain-ai GitHub organization, and
-# apply contributor tier labels to external contributors based on their
-# merged PR history.
-#
-# NOTE: PR labeling (including external/internal, tier, size, file, and
-# title labels) is handled by pr_labeler.yml. This workflow handles
-# issues only.
-#
-# Config (trustedThreshold, labelColor) is read from
-# .github/scripts/pr-labeler-config.json to stay in sync with
-# pr_labeler.yml.
-#
-# Setup Requirements:
-# 1. Create a GitHub App with permissions:
-#    - Repository: Issues (write)
-#    - Organization: Members (read)
-# 2. Install the app on your organization and this repository
-# 3. Add these repository secrets:
-#    - ORG_MEMBERSHIP_APP_ID: Your app's ID
-#    - ORG_MEMBERSHIP_APP_PRIVATE_KEY: Your app's private key
-#
-# The GitHub App token is required to check private organization membership.
-# Without it, the workflow will fail.
-
-name: Tag External Issues
-
-on:
-  issues:
-    types: [opened]
-  workflow_dispatch:
-    inputs:
-      max_items:
-        description: "Maximum number of open issues to process"
-        default: "100"
-        type: string
-
-permissions:
-  contents: read
-
-concurrency:
-  group: ${{ github.workflow }}-${{ github.event.issue.number || github.run_id }}
-  cancel-in-progress: true
-
-jobs:
-  tag-external:
-    if: github.event_name != 'workflow_dispatch'
-    runs-on: ubuntu-latest
-    permissions:
-      contents: read
-      issues: write
-
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: Generate GitHub App token
-        id: app-token
-        uses: actions/create-github-app-token@f8d387b68d61c58ab83c6c016672934102569859 # v3
-        with:
-          app-id: ${{ secrets.ORG_MEMBERSHIP_APP_ID }}
-          private-key: ${{ secrets.ORG_MEMBERSHIP_APP_PRIVATE_KEY }}
-
-      - name: Check if contributor is external
-        if: steps.app-token.outcome == 'success'
-        id: check-membership
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          github-token: ${{ steps.app-token.outputs.token }}
-          script: |
-            const { owner, repo } = context.repo;
-            const { h } = require('./.github/scripts/pr-labeler.js').loadAndInit(github, owner, repo, core);
-
-            const author = context.payload.sender.login;
-            const { isExternal } = await h.checkMembership(
-              author, context.payload.sender.type,
-            );
-            core.setOutput('is-external', isExternal ? 'true' : 'false');
-
-      - name: Apply contributor tier label
-        if: steps.check-membership.outputs.is-external == 'true'
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          # GITHUB_TOKEN is fine here — no downstream workflow chains
-          # off tier labels on issues (unlike PRs where App token is
-          # needed for require_issue_link.yml).
-          github-token: ${{ secrets.GITHUB_TOKEN }}
-          script: |
-            const { owner, repo } = context.repo;
-            const { h } = require('./.github/scripts/pr-labeler.js').loadAndInit(github, owner, repo, core);
-
-            const issue = context.payload.issue;
-            // new-contributor is only meaningful on PRs, not issues
-            await h.applyTierLabel(issue.number, issue.user.login, { skipNewContributor: true });
-
-      - name: Add external/internal label
-        if: steps.check-membership.outputs.is-external != ''
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          github-token: ${{ secrets.GITHUB_TOKEN }}
-          script: |
-            const { owner, repo } = context.repo;
-            const issue_number = context.payload.issue.number;
-
-            const { h } = require('./.github/scripts/pr-labeler.js').loadAndInit(github, owner, repo, core);
-
-            const label = '${{ steps.check-membership.outputs.is-external }}' === 'true'
-              ? 'external' : 'internal';
-            await h.ensureLabel(label);
-            await github.rest.issues.addLabels({
-              owner, repo, issue_number, labels: [label],
-            });
-            console.log(`Added '${label}' label to issue #${issue_number}`);
-
-  backfill:
-    if: github.event_name == 'workflow_dispatch'
-    runs-on: ubuntu-latest
-    permissions:
-      contents: read
-      issues: write
-
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-
-      - name: Generate GitHub App token
-        id: app-token
-        uses: actions/create-github-app-token@f8d387b68d61c58ab83c6c016672934102569859 # v3
-        with:
-          app-id: ${{ secrets.ORG_MEMBERSHIP_APP_ID }}
-          private-key: ${{ secrets.ORG_MEMBERSHIP_APP_PRIVATE_KEY }}
-
-      - name: Backfill labels on open issues
-        uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8
-        with:
-          github-token: ${{ steps.app-token.outputs.token }}
-          script: |
-            const { owner, repo } = context.repo;
-            const rawMax = '${{ inputs.max_items }}';
-            const maxItems = parseInt(rawMax, 10);
-            if (isNaN(maxItems) || maxItems <= 0) {
-              core.setFailed(`Invalid max_items: "${rawMax}" — must be a positive integer`);
-              return;
-            }
-
-            const { h } = require('./.github/scripts/pr-labeler.js').loadAndInit(github, owner, repo, core);
-
-            const tierLabels = ['trusted-contributor'];
-            for (const name of tierLabels) {
-              await h.ensureLabel(name);
-            }
-
-            const contributorCache = new Map();
-
-            const issues = await github.paginate(github.rest.issues.listForRepo, {
-              owner, repo, state: 'open', per_page: 100,
-            });
-
-            let processed = 0;
-            let failures = 0;
-            for (const issue of issues) {
-              if (processed >= maxItems) break;
-              if (issue.pull_request) continue;
-
-              try {
-                const author = issue.user.login;
-                const info = await h.getContributorInfo(contributorCache, author, issue.user.type);
-
-                const labels = [info.isExternal ? 'external' : 'internal'];
-                if (info.isExternal && info.mergedCount != null && info.mergedCount >= h.trustedThreshold) {
-                  labels.push('trusted-contributor');
-                }
-
-                // Ensure all labels exist before batch add
-                for (const name of labels) {
-                  await h.ensureLabel(name);
-                }
-
-                // Remove stale tier labels
-                const currentLabels = (await github.paginate(
-                  github.rest.issues.listLabelsOnIssue,
-                  { owner, repo, issue_number: issue.number, per_page: 100 },
-                )).map(l => l.name ?? '');
-                for (const name of currentLabels) {
-                  if (tierLabels.includes(name) && !labels.includes(name)) {
-                    try {
-                      await github.rest.issues.removeLabel({
-                        owner, repo, issue_number: issue.number, name,
-                      });
-                    } catch (e) {
-                      if (e.status !== 404) throw e;
-                    }
-                  }
-                }
-
-                await github.rest.issues.addLabels({
-                  owner, repo, issue_number: issue.number, labels,
-                });
-                console.log(`Issue #${issue.number} (${author}): ${labels.join(', ')}`);
-                processed++;
-              } catch (e) {
-                failures++;
-                core.warning(`Failed to process issue #${issue.number}: ${e.message}`);
-              }
-            }
-
-            console.log(`\nBackfill complete. Processed ${processed} issues, ${failures} failures. ${contributorCache.size} unique authors.`);
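Review note: the label decision made inside the removed backfill loop can be restated as two pure functions. This is a minimal sketch, not the workflow's actual code. `ContributorInfo`, `desired_labels`, and `stale_tier_labels` are hypothetical names, and the threshold is passed in because the value of `h.trustedThreshold` is not shown in this diff.

```python
from dataclasses import dataclass
from typing import Optional

# Mirrors `tierLabels` in the removed workflow script.
TIER_LABELS = ("trusted-contributor",)


@dataclass(frozen=True)
class ContributorInfo:
    """Hypothetical stand-in for the object returned by `h.getContributorInfo`."""

    is_external: bool
    merged_count: Optional[int]  # None when the merged-PR count is unknown


def desired_labels(info: ContributorInfo, trusted_threshold: int) -> list[str]:
    """Return the labels the backfill would add to one issue."""
    labels = ["external" if info.is_external else "internal"]
    # Only external authors with a known, sufficiently high merged count get the tier label.
    if (
        info.is_external
        and info.merged_count is not None
        and info.merged_count >= trusted_threshold
    ):
        labels.append("trusted-contributor")
    return labels


def stale_tier_labels(current: list[str], desired: list[str]) -> list[str]:
    """Return tier labels currently on the issue that should be removed."""
    return [name for name in current if name in TIER_LABELS and name not in desired]
```

Separating the decision from the API calls would let the rule be unit tested without a GitHub token; the threshold used in any such test is a placeholder, not the repository's real setting.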
--- a/.github/workflows/v03_api_doc_build.yml
+++ b/.github/workflows/v03_api_doc_build.yml
@@ -1,167 +0,0 @@
-# Build the API reference documentation for v0.3 branch.
-#
-# Manual trigger only.
-#
-# Built HTML pushed to langchain-ai/langchain-api-docs-html.
-#
-# Looks for langchain-ai org repos in packages.yml and checks them out.
-# Calls prep_api_docs_build.py.
-
-name: "📚 API Docs (v0.3)"
-run-name: "Build & Deploy API Reference (v0.3)"
-
-on:
-  workflow_dispatch:
-
-permissions:
-  contents: read
-
-env:
-  PYTHON_VERSION: "3.11"
-
-jobs:
-  build:
-    if: github.repository == 'langchain-ai/langchain' || github.event_name != 'schedule'
-    runs-on: ubuntu-latest
-    permissions:
-      contents: read
-    steps:
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-        with:
-          ref: v0.3
-          path: langchain
-
-      - uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6
-        with:
-          repository: langchain-ai/langchain-api-docs-html
-          path: langchain-api-docs-html
-          token: ${{ secrets.TOKEN_GITHUB_API_DOCS_HTML }}
-
-      - name: "📋 Extract Repository List with yq"
-        id: get-unsorted-repos
-        uses: mikefarah/yq@17f66dc6c6a177fafd8b71a6abea6d6340aa1e16 # master
-        with:
-          cmd: |
-            # Extract repos from packages.yml that are in the langchain-ai org
-            # (excluding 'langchain' itself)
-            yq '
-              .packages[]
-              | select(
-                  (
-                    (.repo | test("^langchain-ai/"))
-                    and
-                    (.repo != "langchain-ai/langchain")
-                  )
-                  or
-                  (.include_in_api_ref // false)
-                )
-              | .repo
-            ' langchain/libs/packages.yml
-
-      - name: "📋 Parse YAML & Checkout Repositories"
-        env:
-          REPOS_UNSORTED: ${{ steps.get-unsorted-repos.outputs.result }}
-          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
-        run: |
-          # Get unique repositories
-          REPOS=$(echo "$REPOS_UNSORTED" | sort -u)
-          # Checkout each unique repository
-          for repo in $REPOS; do
-            # Validate repository format (allow any org with proper format)
-            if [[ ! "$repo" =~ ^[a-zA-Z0-9_.-]+/[a-zA-Z0-9_.-]+$ ]]; then
-              echo "Error: Invalid repository format: $repo"
-              exit 1
-            fi
-
-            REPO_NAME=$(echo $repo | cut -d'/' -f2)
-
-            # Additional validation for repo name
-            if [[ ! "$REPO_NAME" =~ ^[a-zA-Z0-9_.-]+$ ]]; then
-              echo "Error: Invalid repository name: $REPO_NAME"
-              exit 1
-            fi
-            echo "Checking out $repo to $REPO_NAME"
-
-            # Special handling for langchain-tavily: checkout by commit hash
-            if [[ "$REPO_NAME" == "langchain-tavily" ]]; then
-              git clone https://github.com/$repo.git $REPO_NAME
-              cd $REPO_NAME
-              git checkout f3515654724a9e87bdfe2c2f509d6cdde646e563
-              cd ..
-            else
-              git clone --depth 1 --branch v0.3 https://github.com/$repo.git $REPO_NAME
-            fi
-          done
-
-      - name: "🐍 Setup Python ${{ env.PYTHON_VERSION }}"
-        uses: actions/setup-python@a309ff8b426b58ec0e2a45f0f869d46889d02405 # v6
-        id: setup-python
-        with:
-          python-version: ${{ env.PYTHON_VERSION }}
-
-      - name: "📦 Install Initial Python Dependencies using uv"
-        working-directory: langchain
-        run: |
-          python -m pip install -U uv
-          python -m uv pip install --upgrade --no-cache-dir pip setuptools pyyaml
-
-      - name: "📦 Organize Library Directories"
-        # Places cloned partner packages into libs/partners structure
-        run: python langchain/.github/scripts/prep_api_docs_build.py
-
-      - name: "🧹 Clear Prior Build"
-        run:
-          # Remove artifacts from prior docs build
-          rm -rf langchain-api-docs-html/api_reference_build/html
-
-      - name: "📦 Install Documentation Dependencies using uv"
-        working-directory: langchain
-        run: |
-          # Install all partner packages in editable mode with overrides
-          python -m uv pip install $(ls ./libs/partners | grep -v azure-ai | xargs -I {} echo "./libs/partners/{}") --overrides ./docs/vercel_overrides.txt --prerelease=allow
-
-          # Install langchain-azure-ai with tools extra
-          python -m uv pip install "./libs/partners/azure-ai[tools]" --overrides ./docs/vercel_overrides.txt --prerelease=allow
-
-          # Install core langchain and other main packages
-          python -m uv pip install libs/core libs/langchain libs/text-splitters libs/community libs/experimental libs/standard-tests
-
-          # Install Sphinx and related packages for building docs
-          python -m uv pip install -r docs/api_reference/requirements.txt
-
-      - name: "🔧 Configure Git Settings"
-        working-directory: langchain
-        run: |
-          git config --local user.email "actions@github.com"
-          git config --local user.name "Github Actions"
-
-      - name: "📚 Build API Documentation"
-        working-directory: langchain
-        run: |
-          # Generate the API reference RST files
-          python docs/api_reference/create_api_rst.py
-
-          # Build the HTML documentation using Sphinx
-          # -T: show full traceback on exception
-          # -E: don't use cached environment (force rebuild, ignore cached doctrees)
-          # -b html: build HTML docs (vs PDF, etc.)
-          # -d: path for the cached environment (parsed document trees / doctrees)
-          #     - Separate from output dir for faster incremental builds
-          # -c: path to conf.py
-          # -j auto: parallel build using all available CPU cores
-          python -m sphinx -T -E -b html -d ../langchain-api-docs-html/_build/doctrees -c docs/api_reference docs/api_reference ../langchain-api-docs-html/api_reference_build/html -j auto
-
-          # Post-process the generated HTML
-          python docs/api_reference/scripts/custom_formatter.py ../langchain-api-docs-html/api_reference_build/html
-
-          # Default index page is blank so we copy in the actual home page.
-          cp ../langchain-api-docs-html/api_reference_build/html/{reference,index}.html
-
-          # Removes Sphinx's intermediate build artifacts after the build is complete.
-          rm -rf ../langchain-api-docs-html/_build/
-
-      # Commit and push changes to langchain-api-docs-html repo
-      - uses: EndBug/add-and-commit@290ea2c423ad77ca9c62ae0f5b224379612c0321 # v10.0.0
-        with:
-          cwd: langchain-api-docs-html
-          message: "Update API docs build from v0.3 branch"
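Review note: the repository selection (the `yq` filter) and the format check (the bash regex) in the removed workflow can be approximated in Python as below. This is a sketch under assumptions: `select_repos` and `checkout_dir` are hypothetical names, the input is assumed to be the already-parsed `packages` list from `packages.yml`, and Python's `sorted` may order entries differently from `sort -u` under some locales.

```python
import re

# Same character classes as the bash check in the workflow step.
REPO_PATTERN = re.compile(r"[a-zA-Z0-9_.-]+/[a-zA-Z0-9_.-]+")


def select_repos(packages: list[dict]) -> list[str]:
    """Pick langchain-ai repos (except the monorepo) plus opted-in packages."""
    selected = set()
    for pkg in packages:
        repo = pkg.get("repo", "")
        in_org = repo.startswith("langchain-ai/") and repo != "langchain-ai/langchain"
        # `include_in_api_ref` defaults to false when missing or null.
        if in_org or bool(pkg.get("include_in_api_ref")):
            selected.add(repo)
    return sorted(selected)  # de-duplicated, like `sort -u`


def checkout_dir(repo: str) -> str:
    """Validate `owner/name` and return the directory name used for the clone."""
    if not REPO_PATTERN.fullmatch(repo):
        raise ValueError(f"Invalid repository format: {repo}")
    return repo.split("/", 1)[1]
```

For example, an entry such as `{"repo": "acme/widgets", "include_in_api_ref": True}` (an invented package) would be selected even though it lives outside the `langchain-ai` org, which matches the `or` branch of the filter.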
--- a/.gitignore
+++ b/.gitignore
@@ -1,8 +1,6 @@
 .vs/
-.claude/
+.vscode/
 .idea/
-#Emacs backup
-*~
 # Byte-compiled / optimized / DLL files
 __pycache__/
 *.py[cod]
@@ -61,7 +59,6 @@ coverage.xml
 *.py,cover
 .hypothesis/
 .pytest_cache/
-.codspeed/

 # Translations
 *.mo
@@ -80,6 +77,10 @@ instance/
 # Scrapy stuff:
 .scrapy

+# Sphinx documentation
+docs/_build/
+docs/docs/_build/
+
 # PyBuilder
 target/

@@ -114,11 +115,13 @@ celerybeat.pid
 # Environments
 .env
 .envrc
-.venv*
-venv*
+.venv
+.venvs
 env/
+venv/
 ENV/
 env.bak/
+venv.bak/

 # Spyder project settings
 .spyderproject
@@ -132,7 +135,6 @@ env.bak/

 # mypy
 .mypy_cache/
-.mypy_cache_test/
 .dmypy.json
 dmypy.json

@@ -160,9 +162,18 @@ data_map*
 *replit*

 node_modules
-
-prof
-virtualenv/
-scratch/
-
-.langgraph_api/
+docs/.yarn/
+docs/node_modules/
+docs/.docusaurus/
+docs/.cache-loader/
+docs/_dist
+docs/api_reference/api_reference.rst
+docs/api_reference/experimental_api_reference.rst
+docs/api_reference/_build
+docs/api_reference/*/
+!docs/api_reference/_static/
+!docs/api_reference/templates/
+!docs/api_reference/themes/
+docs/docs_skeleton/build
+docs/docs_skeleton/node_modules
+docs/docs_skeleton/yarn.lock
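Review note: replacing the `.venv*` and `venv*` globs with exact names changes which environment directories are ignored. The sketch below approximates this with `fnmatch`. It is a simplification that only compares a single path component and strips trailing slashes; real gitignore matching has further rules (negation, anchoring, `**`) that are not modeled. `is_ignored` and the sample directory names are illustrative.

```python
from fnmatch import fnmatchcase

# Environment-related patterns before and after this hunk (other entries omitted).
OLD_PATTERNS = [".venv*", "venv*"]
NEW_PATTERNS = [".venv", ".venvs", "venv/", "venv.bak/"]


def is_ignored(name: str, patterns: list[str]) -> bool:
    """Approximate gitignore matching for a single directory name."""
    return any(fnmatchcase(name, pattern.rstrip("/")) for pattern in patterns)


for candidate in (".venv", ".venv-3.12", "venv311", "venv.bak"):
    print(candidate, is_ignored(candidate, OLD_PATTERNS), is_ignored(candidate, NEW_PATTERNS))
```

Under this approximation, suffixed environments such as `.venv-3.12` are covered only by the glob form, so they would show up as untracked with the exact-name patterns.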
--- a/.gitmodules
+++ b/.gitmodules
@@ -0,0 +1,4 @@
+[submodule "docs/_docs_skeleton"]
+	path = docs/_docs_skeleton
+	url = https://github.com/langchain-ai/langchain-shared-docs
+	branch = main
--- a/.markdownlint.json
+++ b/.markdownlint.json
@@ -1,14 +0,0 @@
-{
-  "MD013": false,
-  "MD024": {
-    "siblings_only": true
-  },
-  "MD025": false,
-  "MD033": false,
-  "MD034": false,
-  "MD036": false,
-  "MD041": false,
-  "MD046": {
-    "style": "fenced"
-  }
-}
--- a/.mcp.json
+++ b/.mcp.json
@@ -1,12 +0,0 @@
-{
-  "mcpServers": {
-    "docs-langchain": {
-      "type": "http",
-      "url": "https://docs.langchain.com/mcp"
-    },
-    "reference-langchain": {
-      "type": "http",
-      "url": "https://reference.langchain.com/mcp"
-    }
-  }
-}
--- a/.pre-commit-config.yaml
+++ b/.pre-commit-config.yaml
@@ -1,125 +0,0 @@
-repos:
-  - repo: https://github.com/pre-commit/pre-commit-hooks
-    rev: v4.3.0
-    hooks:
-      - id: no-commit-to-branch # prevent direct commits to protected branches
-        args: ["--branch", "master"]
-      - id: check-yaml # validate YAML syntax
-        args: ["--unsafe"] # allow custom tags
-      - id: check-toml # validate TOML syntax
-      - id: end-of-file-fixer # ensure files end with a newline
-      - id: trailing-whitespace # remove trailing whitespace from lines
-        exclude: \.ambr$
-
-  # Text normalization hooks for consistent formatting
-  - repo: https://github.com/sirosen/texthooks
-    rev: 0.6.8
-    hooks:
-      - id: fix-smartquotes # replace curly quotes with straight quotes
-      - id: fix-spaces # replace non-standard spaces (e.g., non-breaking) with regular spaces
-
-  # Per-package format and lint hooks for the monorepo
-  - repo: local
-    hooks:
-      - id: core
-        name: format and lint core
-        language: system
-        entry: make -C libs/core format lint
-        files: ^libs/core/
-        pass_filenames: false
-      - id: langchain
-        name: format and lint langchain
-        language: system
-        entry: make -C libs/langchain format lint
-        files: ^libs/langchain/
-        pass_filenames: false
-      - id: standard-tests
-        name: format and lint standard-tests
-        language: system
-        entry: make -C libs/standard-tests format lint
-        files: ^libs/standard-tests/
-        pass_filenames: false
-      - id: text-splitters
-        name: format and lint text-splitters
-        language: system
-        entry: make -C libs/text-splitters format lint
-        files: ^libs/text-splitters/
-        pass_filenames: false
-      - id: anthropic
-        name: format and lint partners/anthropic
-        language: system
-        entry: make -C libs/partners/anthropic format lint
-        files: ^libs/partners/anthropic/
-        pass_filenames: false
-      - id: chroma
-        name: format and lint partners/chroma
-        language: system
-        entry: make -C libs/partners/chroma format lint
-        files: ^libs/partners/chroma/
-        pass_filenames: false
-      - id: exa
-        name: format and lint partners/exa
-        language: system
-        entry: make -C libs/partners/exa format lint
-        files: ^libs/partners/exa/
-        pass_filenames: false
-      - id: fireworks
-        name: format and lint partners/fireworks
-        language: system
-        entry: make -C libs/partners/fireworks format lint
-        files: ^libs/partners/fireworks/
-        pass_filenames: false
-      - id: groq
-        name: format and lint partners/groq
-        language: system
-        entry: make -C libs/partners/groq format lint
-        files: ^libs/partners/groq/
-        pass_filenames: false
-      - id: huggingface
-        name: format and lint partners/huggingface
-        language: system
-        entry: make -C libs/partners/huggingface format lint
-        files: ^libs/partners/huggingface/
-        pass_filenames: false
-      - id: mistralai
-        name: format and lint partners/mistralai
-        language: system
-        entry: make -C libs/partners/mistralai format lint
-        files: ^libs/partners/mistralai/
-        pass_filenames: false
-      - id: nomic
-        name: format and lint partners/nomic
-        language: system
-        entry: make -C libs/partners/nomic format lint
-        files: ^libs/partners/nomic/
-        pass_filenames: false
-      - id: ollama
-        name: format and lint partners/ollama
-        language: system
-        entry: make -C libs/partners/ollama format lint
-        files: ^libs/partners/ollama/
-        pass_filenames: false
-      - id: openai
-        name: format and lint partners/openai
-        language: system
-        entry: make -C libs/partners/openai format lint
-        files: ^libs/partners/openai/
-        pass_filenames: false
-      - id: qdrant
-        name: format and lint partners/qdrant
-        language: system
-        entry: make -C libs/partners/qdrant format lint
-        files: ^libs/partners/qdrant/
-        pass_filenames: false
-      - id: core-version
-        name: check core version consistency
-        language: system
-        entry: make -C libs/core check_version
-        files: ^libs/core/(pyproject\.toml|langchain_core/version\.py)$
-        pass_filenames: false
-      - id: langchain-v1-version
-        name: check langchain version consistency
-        language: system
-        entry: make -C libs/langchain_v1 check_version
-        files: ^libs/langchain_v1/(pyproject\.toml|langchain/__init__\.py)$
-        pass_filenames: false
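Review note: each local hook in the removed pre-commit config is scoped by a `files` regex. The sketch below shows how such patterns select hooks for a set of changed paths. It assumes patterns are applied with `re.search`, which is how pre-commit documents `files` matching; `triggered_hooks` is a hypothetical helper, only four of the hooks are reproduced, and the sample paths are illustrative.

```python
import re

# A subset of the hook ids and `files` patterns from the removed config.
HOOK_PATTERNS = {
    "core": r"^libs/core/",
    "langchain": r"^libs/langchain/",
    "openai": r"^libs/partners/openai/",
    "core-version": r"^libs/core/(pyproject\.toml|langchain_core/version\.py)$",
}


def triggered_hooks(changed_files: list[str]) -> list[str]:
    """Return hook ids whose pattern matches at least one changed file."""
    return [
        hook
        for hook, pattern in HOOK_PATTERNS.items()
        if any(re.search(pattern, path) for path in changed_files)
    ]


print(triggered_hooks(["libs/core/pyproject.toml"]))
print(triggered_hooks(["libs/langchain_v1/langchain/__init__.py"]))
```

The trailing slash in `^libs/langchain/` matters: a change under `libs/langchain_v1/` does not trigger the `langchain` hook, because the pattern requires the directory name to end at `langchain`.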
--- a/.readthedocs.yaml
+++ b/.readthedocs.yaml
@@ -0,0 +1,29 @@
+# Read the Docs configuration file
+# See https://docs.readthedocs.io/en/stable/config-file/v2.html for details
+
+# Required
+version: 2
+
+# Set the version of Python and other tools you might need
+build:
+  os: ubuntu-22.04
+  tools:
+    python: "3.11"
+  jobs:
+    pre_build:
+      - python docs/api_reference/create_api_rst.py
+
+# Build documentation in the docs/ directory with Sphinx
+sphinx:
+   configuration: docs/api_reference/conf.py
+
+# If using Sphinx, optionally build your docs in additional formats such as PDF
+# formats:
+#    - pdf
+
+# Optionally declare the Python requirements required to build your docs
+python:
+   install:
+   - requirements: docs/api_reference/requirements.txt
+   - method: pip
+     path: .
--- a/.vscode/extensions.json
+++ b/.vscode/extensions.json
@@ -1,19 +0,0 @@
-{
-  "recommendations": [
-    "ms-python.python",
-    "charliermarsh.ruff",
-    "ms-python.mypy-type-checker",
-    "ms-toolsai.jupyter",
-    "ms-toolsai.jupyter-keymap",
-    "ms-toolsai.jupyter-renderers",
-    "yzhang.markdown-all-in-one",
-    "davidanson.vscode-markdownlint",
-    "bierner.markdown-mermaid",
-    "bierner.markdown-preview-github-styles",
-    "eamodio.gitlens",
-    "github.vscode-pull-request-github",
-    "github.vscode-github-actions",
-    "redhat.vscode-yaml",
-    "editorconfig.editorconfig",
-  ],
-}
--- a/.vscode/settings.json
+++ b/.vscode/settings.json
@@ -1,78 +0,0 @@
-{
-  "python.analysis.include": [
-    "libs/**",
-  ],
-  "python.analysis.exclude": [
-    "**/node_modules",
-    "**/__pycache__",
-    "**/.pytest_cache",
-    "**/.*",
-  ],
-  "python.analysis.autoImportCompletions": true,
-  "python.analysis.typeCheckingMode": "basic",
-  "python.testing.cwd": "${workspaceFolder}",
-  "python.linting.enabled": true,
-  "python.linting.ruffEnabled": true,
-  "[python]": {
-    "editor.formatOnSave": true,
-    "editor.codeActionsOnSave": {
-      "source.organizeImports.ruff": "explicit",
-      "source.fixAll": "explicit"
-    },
-    "editor.defaultFormatter": "charliermarsh.ruff"
-  },
-  "editor.rulers": [
-    88
-  ],
-  "editor.tabSize": 4,
-  "editor.insertSpaces": true,
-  "editor.trimAutoWhitespace": true,
-  "files.trimTrailingWhitespace": true,
-  "files.insertFinalNewline": true,
-  "files.exclude": {
-    "**/__pycache__": true,
-    "**/.pytest_cache": true,
-    "**/*.pyc": true,
-    "**/.mypy_cache": true,
-    "**/.ruff_cache": true,
-    "_dist/**": true,
-    "**/node_modules": true,
-    "**/.git": false
-  },
-  "search.exclude": {
-    "**/__pycache__": true,
-    "**/*.pyc": true,
-    "_dist/**": true,
-    "**/node_modules": true,
-    "**/.git": true,
-    "uv.lock": true,
-    "yarn.lock": true
-  },
-  "git.autofetch": true,
-  "git.enableSmartCommit": true,
-  "jupyter.askForKernelRestart": false,
-  "jupyter.interactiveWindow.textEditor.executeSelection": true,
-  "[markdown]": {
-    "editor.wordWrap": "on",
-    "editor.quickSuggestions": {
-      "comments": "off",
-      "strings": "off",
-      "other": "off"
-    }
-  },
-  "[yaml]": {
-    "editor.tabSize": 2,
-    "editor.insertSpaces": true
-  },
-  "[json]": {
-    "editor.tabSize": 2,
-    "editor.insertSpaces": true
-  },
-  "python.terminal.activateEnvironment": false,
-  "python.defaultInterpreterPath": "./.venv/bin/python",
-  "github.copilot.chat.commitMessageGeneration.instructions": [
-    {
-      "file": ".github/workflows/pr_lint.yml"
-    }
-  ]
-}
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -1,272 +0,0 @@
-# Global development guidelines for the LangChain monorepo
-
-This document provides context to understand the LangChain Python project and assist with development.
-
-## Project architecture and context
-
-### Monorepo structure
-
-This is a Python monorepo with multiple independently versioned packages that use `uv`.
-
-```txt
-langchain/
-├── libs/
-│   ├── core/             # `langchain-core` primitives and base abstractions
-│   ├── langchain/        # `langchain-classic` (legacy, no new features)
-│   ├── langchain_v1/     # Actively maintained `langchain` package
-│   ├── partners/         # Third-party integrations
-│   │   ├── openai/       # OpenAI models and embeddings
-│   │   ├── anthropic/    # Anthropic (Claude) integration
-│   │   ├── ollama/       # Local model support
-│   │   └── ... (other integrations maintained by the LangChain team)
-│   ├── text-splitters/   # Document chunking utilities
-│   ├── standard-tests/   # Shared test suite for integrations
-│   ├── model-profiles/   # Model configuration profiles
-├── .github/              # CI/CD workflows and templates
-├── .vscode/              # VSCode IDE standard settings and recommended extensions
-└── README.md             # Information about LangChain
-```
-
- **Core layer** (`langchain-core`): Base abstractions, interfaces, and protocols. Users should not need to know about this layer directly.
- **Implementation layer** (`langchain`): Concrete implementations and high-level public utilities
- **Integration layer** (`partners/`): Third-party service integrations. Note that this monorepo is not exhaustive of all LangChain integrations; some are maintained in separate repos, such as `langchain-ai/langchain-google` and `langchain-ai/langchain-aws`. Usually these repos are cloned at the same level as this monorepo, so if needed, you can refer to their code directly by navigating to `../langchain-google/` from this monorepo.
- **Testing layer** (`standard-tests/`): Standardized integration tests for partner integrations
-
-### Development tools & commands
-
- `uv` – Fast Python package installer and resolver (replaces pip/poetry)
- `make` – Task runner for common development commands. Feel free to look at the `Makefile` for available commands and usage patterns.
- `ruff` – Fast Python linter and formatter
- `mypy` – Static type checking
- `pytest` – Testing framework
-
-This monorepo uses `uv` for dependency management. Local development uses editable installs: `[tool.uv.sources]`
-
-Each package in `libs/` has its own `pyproject.toml` and `uv.lock`.
-
-Before running your tests, set up all packages by running:
-
-```bash
-# For all groups
-uv sync --all-groups
-
-# or, to install a specific group only:
-uv sync --group test
-```
-
-```bash
-# Run unit tests (no network)
-make test
-
-# Run specific test file
-uv run --group test pytest tests/unit_tests/test_specific.py
-```
-
-```bash
-# Lint code
-make lint
-
-# Format code
-make format
-
-# Type checking
-uv run --group lint mypy .
-```
-
-#### Key config files
-
- pyproject.toml: Main workspace configuration with dependency groups
- uv.lock: Locked dependencies for reproducible builds
- Makefile: Development tasks
-
-#### Commit standards
-
-Suggest PR titles that follow the Conventional Commits format. Refer to `.github/workflows/pr_lint.yml` for allowed types and scopes. Note that all commit/PR titles should be in lowercase with the exception of proper nouns/named entities. All PR titles should include a scope with no exceptions. For example:
-
-```txt
-feat(langchain): add new chat completion feature
-fix(core): resolve type hinting issue in vector store
-chore(anthropic): update infrastructure dependencies
-```
-
-Note how `feat(langchain)` includes a scope even though it is the main package and name of the repo.
-
-#### Pull request guidelines
-
- Always add a disclaimer to the PR description mentioning how AI agents are involved with the contribution.
- Describe the "why" of the changes, why the proposed solution is the right one. Limit prose.
- Highlight areas of the proposed changes that require careful review.
-
-## Core development principles
-
-### Maintain stable public interfaces
-
-CRITICAL: Always attempt to preserve function signatures, argument positions, and names for exported/public methods. Do not make breaking changes.
-You should warn the developer for any function signature changes, regardless of whether they look breaking or not.
-
-**Before making ANY changes to public APIs:**
-
- Check if the function/class is exported in `__init__.py`
- Look for existing usage patterns in tests and examples
- Use keyword-only arguments for new parameters: `*, new_param: str = "default"`
- Mark experimental features clearly with docstring warnings (using MkDocs Material admonitions, like `!!! warning`)
-
-Ask: "Would this change break someone's code if they used it last week?"
-
-### Code quality standards
-
-All Python code MUST include type hints and return types.
-
-```python title="Example"
-def filter_unknown_users(users: list[str], known_users: set[str]) -> list[str]:
-    """Single line description of the function.
-
-    Any additional context about the function can go here.
-
-    Args:
-        users: List of user identifiers to filter.
-        known_users: Set of known/valid user identifiers.
-
-    Returns:
-        List of users that are not in the `known_users` set.
-    """
-```
-
- Use descriptive, self-explanatory variable names.
- Follow existing patterns in the codebase you're modifying
- Attempt to break up complex functions (>20 lines) into smaller, focused functions where it makes sense
-
-### Testing requirements
-
-Every new feature or bugfix MUST be covered by unit tests.
-
- Unit tests: `tests/unit_tests/` (no network calls allowed)
- Integration tests: `tests/integration_tests/` (network calls permitted)
- We use `pytest` as the testing framework; if in doubt, check other existing tests for examples.
- The testing file structure should mirror the source code structure.
-
-**Checklist:**
-
- [ ] Tests fail when your new logic is broken
- [ ] Happy path is covered
- [ ] Edge cases and error conditions are tested
- [ ] Use fixtures/mocks for external dependencies
- [ ] Tests are deterministic (no flaky tests)
- [ ] Does the test suite fail if your new logic is broken?
-
-### Security and risk assessment
-
- No `eval()`, `exec()`, or `pickle` on user-controlled input
- Proper exception handling (no bare `except:`) and use a `msg` variable for error messages
- Remove unreachable/commented code before committing
- Check for race conditions or resource leaks (file handles, sockets, threads).
- Ensure proper resource cleanup (file handles, connections)
-
-For threat models documenting trust boundaries, data flows, and known threats:
-
- [`.github/THREAT_MODEL_CORE.md`](.github/THREAT_MODEL_CORE.md) — langchain-core (serialization, SSRF protection, prompts, tools, output parsers)
- [`.github/THREAT_MODEL_V1.md`](.github/THREAT_MODEL_V1.md) — langchain v1 (agent middleware, shell tool, file search, HITL, execution policies)
-
-### Documentation standards
-
-Use Google-style docstrings with Args section for all public functions.
-
-```python title="Example"
-def send_email(to: str, msg: str, *, priority: str = "normal") -> bool:
-    """Send an email to a recipient with specified priority.
-
-    Any additional context about the function can go here.
-
-    Args:
-        to: The email address of the recipient.
-        msg: The message body to send.
-        priority: Email priority level.
-
-    Returns:
-        `True` if email was sent successfully, `False` otherwise.
-
-    Raises:
-        InvalidEmailError: If the email address format is invalid.
-        SMTPConnectionError: If unable to connect to email server.
-    """
-```
-
- Types go in function signatures, NOT in docstrings
-  - If a default is present, DO NOT repeat it in the docstring unless there is post-processing or it is set conditionally.
- Focus on "why" rather than "what" in descriptions
- Document all parameters, return values, and exceptions
- Keep descriptions concise but clear
- Ensure American English spelling (e.g., "behavior", not "behaviour")
- Do NOT use Sphinx-style double backtick formatting (` ``code`` `). Use single backticks (`` `code` ``) for inline code references in docstrings and comments.
-
-#### Model references in docs and examples
-
-Always use the latest generally available (GA) models when referencing LLMs in docstrings and illustrative code snippets. Avoid preview or beta identifiers unless the model has no GA equivalent. Outdated model names signal stale code and confuse users.
-
-Before writing or updating model references, verify current model IDs against the provider's official docs. Do not rely on memorized or cached model names — they go stale quickly.
-
-Changing **shipped default parameter values** in code (e.g., a `model=` kwarg default in a class constructor) may constitute a breaking change — see "Maintain stable public interfaces" above. This guidance applies to documentation and examples, not code defaults.
-
-For model *profile data* (capability flags, context windows), use the `langchain-profiles` CLI described below.
-
-## Model profiles
-
-Model profiles are generated using the `langchain-profiles` CLI in `libs/model-profiles`. The `--data-dir` must point to the directory containing `profile_augmentations.toml`, not the top-level package directory.
-
-```bash
-# Run from libs/model-profiles
-cd libs/model-profiles
-
-# Refresh profiles for a partner in this repo
-uv run langchain-profiles refresh --provider openai --data-dir ../partners/openai/langchain_openai/data
-
-# Refresh profiles for a partner in an external repo (requires echo y to confirm)
-echo y | uv run langchain-profiles refresh --provider google --data-dir /path/to/langchain-google/libs/genai/langchain_google_genai/data
-```
-
-Example partners with profiles in this repo:
-
- `libs/partners/openai/langchain_openai/data/` (provider: `openai`)
- `libs/partners/anthropic/langchain_anthropic/data/` (provider: `anthropic`)
- `libs/partners/perplexity/langchain_perplexity/data/` (provider: `perplexity`)
-
-The `echo y |` pipe is required when `--data-dir` is outside the `libs/model-profiles` working directory.
-
-## CI/CD infrastructure
-
-### Release process
-
-Releases are triggered manually via `.github/workflows/_release.yml` with `working-directory` and `release-version` inputs.
-
-### PR labeling and linting
-
-**Title linting** (`.github/workflows/pr_lint.yml`)
-
-**Auto-labeling:**
-
- `.github/workflows/pr_labeler.yml` – Unified PR labeler (size, file, title, external/internal, contributor tier)
- `.github/workflows/pr_labeler_backfill.yml` – Manual backfill of PR labels on open PRs
- `.github/workflows/auto-label-by-package.yml` – Issue labeling by package
- `.github/workflows/tag-external-issues.yml` – Issue external/internal classification
-
-### Adding a new partner to CI
-
-When adding a new partner package, update these files:
-
- `.github/ISSUE_TEMPLATE/*.yml` – Add to package dropdown
- `.github/dependabot.yml` – Add dependency update entry
- `.github/scripts/pr-labeler-config.json` – Add file rule and scope-to-label mapping
- `.github/workflows/_release.yml` – Add API key secrets if needed
- `.github/workflows/auto-label-by-package.yml` – Add package label
- `.github/workflows/check_diffs.yml` – Add to change detection
- `.github/workflows/integration_tests.yml` – Add integration test config
- `.github/workflows/pr_lint.yml` – Add to allowed scopes
-
-## GitHub Actions & Workflows
-
-This repository requires actions to be pinned to a full-length commit SHA. Attempting to use a tag will fail. Use the `gh` CLI to look up the SHA for a tag. Verify tags are not annotated tag objects (which would need dereferencing).
-
-## Additional resources
-
- **Documentation:** https://docs.langchain.com/oss/python/langchain/overview and source at https://github.com/langchain-ai/docs or `../docs/`. Prefer the local install and use file search tools for best results. If needed, use the docs MCP server as defined in `.mcp.json` for programmatic access.
- **Contributing Guide:** [Contributing Guide](https://docs.langchain.com/oss/python/contributing/overview)
--- a/CLAUDE.md
+++ b/CLAUDE.md
@@ -1,272 +0,0 @@
-# Global development guidelines for the LangChain monorepo
-
-This document provides context to understand the LangChain Python project and assist with development.
-
-## Project architecture and context
-
-### Monorepo structure
-
-This is a Python monorepo with multiple independently versioned packages that use `uv`.
-
-```txt
-langchain/
-├── libs/
-│   ├── core/             # `langchain-core` primitives and base abstractions
-│   ├── langchain/        # `langchain-classic` (legacy, no new features)
-│   ├── langchain_v1/     # Actively maintained `langchain` package
-│   ├── partners/         # Third-party integrations
-│   │   ├── openai/       # OpenAI models and embeddings
-│   │   ├── anthropic/    # Anthropic (Claude) integration
-│   │   ├── ollama/       # Local model support
-│   │   └── ... (other integrations maintained by the LangChain team)
-│   ├── text-splitters/   # Document chunking utilities
-│   ├── standard-tests/   # Shared test suite for integrations
-│   ├── model-profiles/   # Model configuration profiles
-├── .github/              # CI/CD workflows and templates
-├── .vscode/              # VSCode IDE standard settings and recommended extensions
-└── README.md             # Information about LangChain
-```
-
- **Core layer** (`langchain-core`): Base abstractions, interfaces, and protocols. Users should not need to know about this layer directly.
- **Implementation layer** (`langchain`): Concrete implementations and high-level public utilities
- **Integration layer** (`partners/`): Third-party service integrations. Note that this monorepo is not exhaustive of all LangChain integrations; some are maintained in separate repos, such as `langchain-ai/langchain-google` and `langchain-ai/langchain-aws`. Usually these repos are cloned at the same level as this monorepo, so if needed, you can refer to their code directly by navigating to `../langchain-google/` from this monorepo.
- **Testing layer** (`standard-tests/`): Standardized integration tests for partner integrations
-
-### Development tools & commands
-
- `uv` – Fast Python package installer and resolver (replaces pip/poetry)
- `make` – Task runner for common development commands. Feel free to look at the `Makefile` for available commands and usage patterns.
- `ruff` – Fast Python linter and formatter
- `mypy` – Static type checking
- `pytest` – Testing framework
-
-This monorepo uses `uv` for dependency management. Local development uses editable installs, configured under `[tool.uv.sources]`.
-
-Each package in `libs/` has its own `pyproject.toml` and `uv.lock`.
-
-Before running your tests, set up all packages by running:
-
-```bash
-# For all groups
-uv sync --all-groups
-
-# or, to install a specific group only:
-uv sync --group test
-```
-
-```bash
-# Run unit tests (no network)
-make test
-
-# Run specific test file
-uv run --group test pytest tests/unit_tests/test_specific.py
-```
-
-```bash
-# Lint code
-make lint
-
-# Format code
-make format
-
-# Type checking
-uv run --group lint mypy .
-```
-
-#### Key config files
-
- pyproject.toml: Main workspace configuration with dependency groups
- uv.lock: Locked dependencies for reproducible builds
- Makefile: Development tasks
-
-#### Commit standards
-
-Suggest PR titles that follow Conventional Commits format. Refer to `.github/workflows/pr_lint.yml` for allowed types and scopes. Note that all commit/PR titles should be in lowercase with the exception of proper nouns/named entities. All PR titles should include a scope with no exceptions. For example:
-
-```txt
-feat(langchain): add new chat completion feature
-fix(core): resolve type hinting issue in vector store
-chore(anthropic): update infrastructure dependencies
-```
-
-Note how `feat(langchain)` includes a scope even though it is the main package and name of the repo.
-
-#### Pull request guidelines
-
- Always add a disclaimer to the PR description mentioning how AI agents are involved with the contribution.
- Describe the "why" of the changes, why the proposed solution is the right one. Limit prose.
- Highlight areas of the proposed changes that require careful review.
-
-## Core development principles
-
-### Maintain stable public interfaces
-
-CRITICAL: Always attempt to preserve function signatures, argument positions, and names for exported/public methods. Do not make breaking changes.
-You should warn the developer for any function signature changes, regardless of whether they look breaking or not.
-
-**Before making ANY changes to public APIs:**
-
- Check if the function/class is exported in `__init__.py`
- Look for existing usage patterns in tests and examples
- Use keyword-only arguments for new parameters: `*, new_param: str = "default"`
- Mark experimental features clearly with docstring warnings (using MkDocs Material admonitions, like `!!! warning`)
-
-Ask: "Would this change break someone's code if they used it last week?"
-
-### Code quality standards
-
-All Python code MUST include type hints and return types.
-
-```python title="Example"
-def filter_unknown_users(users: list[str], known_users: set[str]) -> list[str]:
-    """Single line description of the function.
-
-    Any additional context about the function can go here.
-
-    Args:
-        users: List of user identifiers to filter.
-        known_users: Set of known/valid user identifiers.
-
-    Returns:
-        List of users that are not in the `known_users` set.
-    """
-```
-
- Use descriptive, self-explanatory variable names.
- Follow existing patterns in the codebase you're modifying
- Attempt to break up complex functions (>20 lines) into smaller, focused functions where it makes sense
-
-### Testing requirements
-
-Every new feature or bugfix MUST be covered by unit tests.
-
- Unit tests: `tests/unit_tests/` (no network calls allowed)
- Integration tests: `tests/integration_tests/` (network calls permitted)
- We use `pytest` as the testing framework; if in doubt, check other existing tests for examples.
- The testing file structure should mirror the source code structure.
-
-**Checklist:**
-
- [ ] Tests fail when your new logic is broken
- [ ] Happy path is covered
- [ ] Edge cases and error conditions are tested
- [ ] Use fixtures/mocks for external dependencies
- [ ] Tests are deterministic (no flaky tests)
- [ ] Does the test suite fail if your new logic is broken?
-
-### Security and risk assessment
-
- No `eval()`, `exec()`, or `pickle` on user-controlled input
- Proper exception handling (no bare `except:`) and use a `msg` variable for error messages
- Remove unreachable/commented code before committing
- Check for race conditions or resource leaks (file handles, sockets, threads).
- Ensure proper resource cleanup (file handles, connections)
-
-For threat models documenting trust boundaries, data flows, and known threats:
-
- [`.github/THREAT_MODEL_CORE.md`](.github/THREAT_MODEL_CORE.md) — langchain-core (serialization, SSRF protection, prompts, tools, output parsers)
- [`.github/THREAT_MODEL_V1.md`](.github/THREAT_MODEL_V1.md) — langchain v1 (agent middleware, shell tool, file search, HITL, execution policies)
-
-### Documentation standards
-
-Use Google-style docstrings with Args section for all public functions.
-
-```python title="Example"
-def send_email(to: str, msg: str, *, priority: str = "normal") -> bool:
-    """Send an email to a recipient with specified priority.
-
-    Any additional context about the function can go here.
-
-    Args:
-        to: The email address of the recipient.
-        msg: The message body to send.
-        priority: Email priority level.
-
-    Returns:
-        `True` if email was sent successfully, `False` otherwise.
-
-    Raises:
-        InvalidEmailError: If the email address format is invalid.
-        SMTPConnectionError: If unable to connect to email server.
-    """
-```
-
- Types go in function signatures, NOT in docstrings
-  - If a default is present, DO NOT repeat it in the docstring unless there is post-processing or it is set conditionally.
- Focus on "why" rather than "what" in descriptions
- Document all parameters, return values, and exceptions
- Keep descriptions concise but clear
- Ensure American English spelling (e.g., "behavior", not "behaviour")
- Do NOT use Sphinx-style double backtick formatting (` ``code`` `). Use single backticks (`` `code` ``) for inline code references in docstrings and comments.
-
-#### Model references in docs and examples
-
-Always use the latest generally available (GA) models when referencing LLMs in docstrings and illustrative code snippets. Avoid preview or beta identifiers unless the model has no GA equivalent. Outdated model names signal stale code and confuse users.
-
-Before writing or updating model references, verify current model IDs against the provider's official docs. Do not rely on memorized or cached model names — they go stale quickly.
-
-Changing **shipped default parameter values** in code (e.g., a `model=` kwarg default in a class constructor) may constitute a breaking change — see "Maintain stable public interfaces" above. This guidance applies to documentation and examples, not code defaults.
-
-For model *profile data* (capability flags, context windows), use the `langchain-profiles` CLI described below.
-
-## Model profiles
-
-Model profiles are generated using the `langchain-profiles` CLI in `libs/model-profiles`. The `--data-dir` must point to the directory containing `profile_augmentations.toml`, not the top-level package directory.
-
-```bash
-# Run from libs/model-profiles
-cd libs/model-profiles
-
-# Refresh profiles for a partner in this repo
-uv run langchain-profiles refresh --provider openai --data-dir ../partners/openai/langchain_openai/data
-
-# Refresh profiles for a partner in an external repo (requires echo y to confirm)
-echo y | uv run langchain-profiles refresh --provider google --data-dir /path/to/langchain-google/libs/genai/langchain_google_genai/data
-```
-
-Example partners with profiles in this repo:
-
- `libs/partners/openai/langchain_openai/data/` (provider: `openai`)
- `libs/partners/anthropic/langchain_anthropic/data/` (provider: `anthropic`)
- `libs/partners/perplexity/langchain_perplexity/data/` (provider: `perplexity`)
-
-The `echo y |` pipe is required when `--data-dir` is outside the `libs/model-profiles` working directory.
-
-## CI/CD infrastructure
-
-### Release process
-
-Releases are triggered manually via `.github/workflows/_release.yml` with `working-directory` and `release-version` inputs.
-
-### PR labeling and linting
-
-**Title linting** (`.github/workflows/pr_lint.yml`)
-
-**Auto-labeling:**
-
- `.github/workflows/pr_labeler.yml` – Unified PR labeler (size, file, title, external/internal, contributor tier)
- `.github/workflows/pr_labeler_backfill.yml` – Manual backfill of PR labels on open PRs
- `.github/workflows/auto-label-by-package.yml` – Issue labeling by package
- `.github/workflows/tag-external-issues.yml` – Issue external/internal classification
-
-### Adding a new partner to CI
-
-When adding a new partner package, update these files:
-
- `.github/ISSUE_TEMPLATE/*.yml` – Add to package dropdown
- `.github/dependabot.yml` – Add dependency update entry
- `.github/scripts/pr-labeler-config.json` – Add file rule and scope-to-label mapping
- `.github/workflows/_release.yml` – Add API key secrets if needed
- `.github/workflows/auto-label-by-package.yml` – Add package label
- `.github/workflows/check_diffs.yml` – Add to change detection
- `.github/workflows/integration_tests.yml` – Add integration test config
- `.github/workflows/pr_lint.yml` – Add to allowed scopes
-
-## GitHub Actions & Workflows
-
-This repository requires actions to be pinned to a full-length commit SHA. Attempting to use a tag will fail. Use the `gh` CLI to look up the SHA for a tag. Verify tags are not annotated tag objects (which would need dereferencing).
-
-## Additional resources
-
- **Documentation:** https://docs.langchain.com/oss/python/langchain/overview and source at https://github.com/langchain-ai/docs or `../docs/`. Prefer the local install and use file search tools for best results. If needed, use the docs MCP server as defined in `.mcp.json` for programmatic access.
- **Contributing Guide:** [Contributing Guide](https://docs.langchain.com/oss/python/contributing/overview)
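The "Maintain stable public interfaces" guidance in the removed `CLAUDE.md` recommends keyword-only arguments for new parameters (`*, new_param: str = "default"`). A minimal sketch of why that keeps existing callers working is shown below. The function `greet`, its `punctuation` parameter, and the helper `accepts_positional_punctuation` are hypothetical names invented for illustration; they are not part of any LangChain package.

```python
def greet(name: str, *, punctuation: str = "!") -> str:
    """Return a greeting for `name`.

    `punctuation` stands in for a parameter added after the first release.
    Because it is keyword-only and has a default, every call written
    against the original one-argument signature keeps working.
    """
    return f"Hello, {name}{punctuation}"


def accepts_positional_punctuation() -> bool:
    """Report whether the new parameter can be passed positionally."""
    try:
        greet("Ada", "?")  # type: ignore[misc]
    except TypeError:
        # Keyword-only parameters reject positional use, so the argument
        # order of the public signature can never be relied on by accident.
        return False
    return True


if __name__ == "__main__":
    print(greet("Ada"))                   # original call style still works
    print(greet("Ada", punctuation="?"))  # new behavior is opt-in by name
    print(accepts_positional_punctuation())
```

The design choice here is that new behavior is opt-in and named, so adding further parameters later does not shift the position of existing ones.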
--- a/12
+++ b/12
@@ -1,6 +1,6 @@
-MIT License
+The MIT License

-Copyright (c) LangChain, Inc.
+Copyright (c) Harrison Chase

 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal
@@ -9,13 +9,13 @@ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
 copies of the Software, and to permit persons to whom the Software is
 furnished to do so, subject to the following conditions:

-The above copyright notice and this permission notice shall be included in all
-copies or substantial portions of the Software.
+The above copyright notice and this permission notice shall be included in
+all copies or substantial portions of the Software.

 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
-OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
-SOFTWARE.
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
+THE SOFTWARE.
--- a/MIGRATE.md
+++ b/MIGRATE.md
@@ -0,0 +1,61 @@
+# Migrating to `langchain_experimental`
+
+We are moving any experimental components of LangChain, or components with vulnerability issues, into `langchain_experimental`.
+This guide covers how to migrate.
+
+## Installation
+
+Previously:
+
+`pip install -U langchain`
+
+Now (only if you want to access things in experimental):
+
+`pip install -U langchain langchain_experimental`
+
+## Things in `langchain.experimental`
+
+Previously:
+
+`from langchain.experimental import ...`
+
+Now:
+
+`from langchain_experimental import ...`
+
+## PALChain
+
+Previously:
+
+`from langchain.chains import PALChain`
+
+Now:
+
+`from langchain_experimental.pal_chain import PALChain`
+
+## SQLDatabaseChain
+
+Previously:
+
+`from langchain.chains import SQLDatabaseChain`
+
+Now:
+
+`from langchain_experimental.sql import SQLDatabaseChain`
+
+Alternatively, if you are just interested in using the query generation part of the SQL chain, you can check out [`create_sql_query_chain`](https://github.com/langchain-ai/langchain/blob/master/docs/extras/use_cases/tabular/sql_query.ipynb)
+
+`from langchain.chains import create_sql_query_chain`
+
+## `load_prompt` for Python files
+
+Note: this only applies if you want to load Python files as prompts.
+If you want to load json/yaml files, no change is needed.
+
+Previously:
+
+`from langchain.prompts import load_prompt`
+
+Now:
+
+`from langchain_experimental.prompts import load_prompt`
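The import moves listed in `MIGRATE.md` above can be sketched as a small text rewrite. This is an illustrative helper under stated assumptions, not a tool shipped with LangChain: the names `IMPORT_REWRITES` and `rewrite_imports` are hypothetical, and the mapping covers only the exact single-name import lines the guide shows. The `load_prompt` move is left out on purpose, because the guide says it applies only when loading Python files as prompts.

```python
# Old import line -> new import line, copied from the migration guide.
# Hypothetical helper: only exact, single-name imports are handled.
IMPORT_REWRITES: dict[str, str] = {
    "from langchain.experimental import": "from langchain_experimental import",
    "from langchain.chains import PALChain": (
        "from langchain_experimental.pal_chain import PALChain"
    ),
    "from langchain.chains import SQLDatabaseChain": (
        "from langchain_experimental.sql import SQLDatabaseChain"
    ),
}


def rewrite_imports(source: str) -> str:
    """Return `source` with the guide's old import lines replaced."""
    for old, new in IMPORT_REWRITES.items():
        source = source.replace(old, new)
    return source


if __name__ == "__main__":
    before = (
        "from langchain.chains import PALChain\n"
        "from langchain.chains import LLMChain\n"
    )
    # Only the PALChain line changes; the LLMChain import is untouched.
    print(rewrite_imports(before))
```

Combined imports such as `from langchain.chains import PALChain, LLMChain` are not matched by this sketch and would need to be split by hand.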
--- a/56
+++ b/56
@@ -0,0 +1,56 @@
+.PHONY: all clean docs_build docs_clean docs_linkcheck api_docs_build api_docs_clean api_docs_linkcheck spell_check spell_fix help
+
+# Default target executed when no arguments are given to make.
+all: help
+
+
+######################
+# DOCUMENTATION
+######################
+
+clean: docs_clean api_docs_clean
+
+
+docs_build:
+	docs/.local_build.sh
+
+docs_clean:
+	rm -r docs/_dist
+
+docs_linkcheck:
+	poetry run linkchecker docs/_dist/docs_skeleton/ --ignore-url node_modules
+
+api_docs_build:
+	poetry run python docs/api_reference/create_api_rst.py
+	cd docs/api_reference && poetry run make html
+
+api_docs_clean:
+	rm -f docs/api_reference/api_reference.rst
+	cd docs/api_reference && poetry run make clean
+
+api_docs_linkcheck:
+	poetry run linkchecker docs/api_reference/_build/html/index.html
+
+spell_check:
+	poetry run codespell --toml pyproject.toml
+
+spell_fix:
+	poetry run codespell --toml pyproject.toml -w
+
+######################
+# HELP
+######################
+
+help:
+	@echo '===================='
+	@echo '-- DOCUMENTATION --'
+	@echo 'clean                        - run docs_clean and api_docs_clean'
+	@echo 'docs_build                   - build the documentation'
+	@echo 'docs_clean                   - clean the documentation build artifacts'
+	@echo 'docs_linkcheck               - run linkchecker on the documentation'
+	@echo 'api_docs_build               - build the API Reference documentation'
+	@echo 'api_docs_clean               - clean the API Reference documentation build artifacts'
+	@echo 'api_docs_linkcheck           - run linkchecker on the API Reference documentation'
+	@echo 'spell_check                  - run codespell on the project'
+	@echo 'spell_fix                    - run codespell on the project and fix the errors'
+	@echo '-- TEST and LINT tasks are within libs/*/ per-package --'
--- a/README.md
+++ b/README.md
@@ -1,84 +1,103 @@
-<div align="center">
-  <a href="https://docs.langchain.com/oss/python/langchain/overview">
-    <picture>
-      <source media="(prefers-color-scheme: dark)" srcset=".github/images/logo-dark.svg">
-      <source media="(prefers-color-scheme: light)" srcset=".github/images/logo-light.svg">
-      <img alt="LangChain Logo" src=".github/images/logo-dark.svg" width="50%">
-    </picture>
-  </a>
-</div>
+# 🦜️🔗 LangChain

-<div align="center">
-  <h3>The agent engineering platform.</h3>
-</div>
+⚡ Building applications with LLMs through composability ⚡

-<div align="center">
-  <a href="https://opensource.org/licenses/MIT" target="_blank"><img src="https://img.shields.io/pypi/l/langchain" alt="PyPI - License"></a>
-  <a href="https://pypistats.org/packages/langchain" target="_blank"><img src="https://img.shields.io/pepy/dt/langchain" alt="PyPI - Downloads"></a>
-  <a href="https://pypi.org/project/langchain/#history" target="_blank"><img src="https://img.shields.io/pypi/v/langchain?label=%20" alt="Version"></a>
-  <a href="https://x.com/langchain" target="_blank"><img src="https://img.shields.io/twitter/url/https/twitter.com/langchain.svg?style=social&label=Follow%20%40LangChain" alt="Twitter / X"></a>
-</div>
+[![Release Notes](https://img.shields.io/github/release/langchain-ai/langchain)](https://github.com/langchain-ai/langchain/releases)
+[![CI](https://github.com/langchain-ai/langchain/actions/workflows/langchain_ci.yml/badge.svg)](https://github.com/langchain-ai/langchain/actions/workflows/langchain_ci.yml)
+[![Experimental CI](https://github.com/langchain-ai/langchain/actions/workflows/langchain_experimental_ci.yml/badge.svg)](https://github.com/langchain-ai/langchain/actions/workflows/langchain_experimental_ci.yml)
+[![Downloads](https://static.pepy.tech/badge/langchain/month)](https://pepy.tech/project/langchain)
+[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
+[![Twitter](https://img.shields.io/twitter/url/https/twitter.com/langchainai.svg?style=social&label=Follow%20%40LangChainAI)](https://twitter.com/langchainai)
+[![](https://dcbadge.vercel.app/api/server/6adMQxSpJS?compact=true&style=flat)](https://discord.gg/6adMQxSpJS)
+[![Open in Dev Containers](https://img.shields.io/static/v1?label=Dev%20Containers&message=Open&color=blue&logo=visualstudiocode)](https://vscode.dev/redirect?url=vscode://ms-vscode-remote.remote-containers/cloneInVolume?url=https://github.com/langchain-ai/langchain)
+[![Open in GitHub Codespaces](https://github.com/codespaces/badge.svg)](https://codespaces.new/langchain-ai/langchain)
+[![GitHub star chart](https://img.shields.io/github/stars/langchain-ai/langchain?style=social)](https://star-history.com/#langchain-ai/langchain)
+[![Dependency Status](https://img.shields.io/librariesio/github/langchain-ai/langchain)](https://libraries.io/github/langchain-ai/langchain)
+[![Open Issues](https://img.shields.io/github/issues-raw/langchain-ai/langchain)](https://github.com/langchain-ai/langchain/issues)

-<br>

-LangChain is a framework for building agents and LLM-powered applications. It helps you chain together interoperable components and third-party integrations to simplify AI application development — all while future-proofing decisions as the underlying technology evolves.
+Looking for the JS/TS version? Check out [LangChain.js](https://github.com/langchain-ai/langchainjs).

-> [!NOTE]
-> Looking for the JS/TS library? Check out [LangChain.js](https://github.com/langchain-ai/langchainjs).
+**Production Support:** As you move your LangChains into production, we'd love to offer more hands-on support.
+Fill out [this form](https://airtable.com/appwQzlErAS2qiP0L/shrGtGaVBVAz7NcV2) to share more about what you're building, and our team will get in touch.

-## Quickstart
+## 🚨Breaking Changes for select chains (SQLDatabase) on 7/28/23

-```bash
-pip install langchain
-# or
-uv add langchain
-```
+In an effort to make `langchain` leaner and safer, we are moving select chains to `langchain_experimental`.
+This migration has already started, but we are remaining backwards compatible until 7/28.
+On that date, we will remove functionality from `langchain`.
+Read more about the motivation and the progress [here](https://github.com/langchain-ai/langchain/discussions/8043).
+Read how to migrate your code [here](MIGRATE.md).

-```python
-from langchain.chat_models import init_chat_model
+## Quick Install

-model = init_chat_model("openai:gpt-5.4")
-result = model.invoke("Hello, world!")
-```
+`pip install langchain`
+or
+`pip install langsmith && conda install langchain -c conda-forge`

-If you're looking for more advanced customization or agent orchestration, check out [LangGraph](https://docs.langchain.com/oss/python/langgraph/overview), our framework for building controllable agent workflows.
+## 🤔 What is this?

-> [!TIP]
-> For developing, debugging, and deploying AI agents and LLM applications, see [LangSmith](https://docs.langchain.com/langsmith/home).
+Large language models (LLMs) are emerging as a transformative technology, enabling developers to build applications that they previously could not. However, using these LLMs in isolation is often insufficient for creating a truly powerful app - the real power comes when you can combine them with other sources of computation or knowledge.

-## LangChain ecosystem
+This library aims to assist in the development of those types of applications. Common examples of these applications include:

-While the LangChain framework can be used standalone, it also integrates seamlessly with any LangChain product, giving developers a full suite of tools when building LLM applications.
+**❓ Question Answering over specific documents**

- **[Deep Agents](https://github.com/langchain-ai/deepagents)** — Build agents that can plan, use subagents, and leverage file systems for complex tasks
- **[LangGraph](https://docs.langchain.com/oss/python/langgraph/overview)** — Build agents that can reliably handle complex tasks with our low-level agent orchestration framework
- **[Integrations](https://docs.langchain.com/oss/python/integrations/providers/overview)** — Chat & embedding models, tools & toolkits, and more
- **[LangSmith](https://www.langchain.com/langsmith)** — Agent evals, observability, and debugging for LLM apps
- **[LangSmith Deployment](https://docs.langchain.com/langsmith/deployments)** — Deploy and scale agents with a purpose-built platform for long-running, stateful workflows
+- [Documentation](https://python.langchain.com/docs/use_cases/question_answering/)
+- End-to-end Example: [Question Answering over Notion Database](https://github.com/hwchase17/notion-qa)

-## Why use LangChain?
+**💬 Chatbots**

-LangChain helps developers build applications powered by LLMs through a standard interface for models, embeddings, vector stores, and more.
+- [Documentation](https://python.langchain.com/docs/use_cases/chatbots/)
+- End-to-end Example: [Chat-LangChain](https://github.com/langchain-ai/chat-langchain)

- **Real-time data augmentation** — Easily connect LLMs to diverse data sources and external/internal systems, drawing from LangChain's vast library of integrations with model providers, tools, vector stores, retrievers, and more
- **Model interoperability** — Swap models in and out as your engineering team experiments to find the best choice for your application's needs. As the industry frontier evolves, adapt quickly — LangChain's abstractions keep you moving without losing momentum
- **Rapid prototyping** — Quickly build and iterate on LLM applications with LangChain's modular, component-based architecture. Test different approaches and workflows without rebuilding from scratch, accelerating your development cycle
- **Production-ready features** — Deploy reliable applications with built-in support for monitoring, evaluation, and debugging through integrations like LangSmith. Scale with confidence using battle-tested patterns and best practices
- **Vibrant community and ecosystem** — Leverage a rich ecosystem of integrations, templates, and community-contributed components. Benefit from continuous improvements and stay up-to-date with the latest AI developments through an active open-source community
- **Flexible abstraction layers** — Work at the level of abstraction that suits your needs — from high-level chains for quick starts to low-level components for fine-grained control. LangChain grows with your application's complexity
+**🤖 Agents**

---
+- [Documentation](https://python.langchain.com/docs/modules/agents/)
+- End-to-end Example: [GPT+WolframAlpha](https://huggingface.co/spaces/JavaFXpert/Chat-GPT-LangChain)

-## Documentation
+## 📖 Documentation

- [docs.langchain.com](https://docs.langchain.com/oss/python/langchain/overview) – Comprehensive documentation, including conceptual overviews and guides
- [reference.langchain.com/python](https://reference.langchain.com/python) – API reference docs for LangChain packages
- [Chat LangChain](https://chat.langchain.com/) – Chat with the LangChain documentation and get answers to your questions
+Please see [here](https://python.langchain.com) for full documentation on:

-**Discussions**: Visit the [LangChain Forum](https://forum.langchain.com) to connect with the community and share all of your technical questions, ideas, and feedback.
+- Getting started (installation, setting up the environment, simple examples)
+- How-To examples (demos, integrations, helper functions)
+- Reference (full API docs)
+- Resources (high-level explanation of core concepts)

-## Additional resources
+## 🚀 What can this help with?

- [Contributing Guide](https://docs.langchain.com/oss/python/contributing/overview) – Learn how to contribute to LangChain projects and find good first issues.
- [Code of Conduct](https://github.com/langchain-ai/langchain/?tab=coc-ov-file) – Our community guidelines and standards for participation.
- [LangChain Academy](https://academy.langchain.com/) – Comprehensive, free courses on LangChain libraries and products, made by the LangChain team.
+There are six main areas that LangChain is designed to help with.
+These are, in increasing order of complexity:
+
+**📃 LLMs and Prompts:**
+
+This includes prompt management, prompt optimization, a generic interface for all LLMs, and common utilities for working with LLMs.
+
+**🔗 Chains:**
+
+Chains go beyond a single LLM call and involve sequences of calls (whether to an LLM or a different utility). LangChain provides a standard interface for chains, lots of integrations with other tools, and end-to-end chains for common applications.
+
+**📚 Data Augmented Generation:**
+
+Data Augmented Generation involves specific types of chains that first interact with an external data source to fetch data for use in the generation step. Examples include summarization of long pieces of text and question/answering over specific data sources.
+
+**🤖 Agents:**
+
+Agents involve an LLM making decisions about which Actions to take, taking that Action, seeing an Observation, and repeating that until done. LangChain provides a standard interface for agents, a selection of agents to choose from, and examples of end-to-end agents.
+
+**🧠 Memory:**
+
+Memory refers to persisting state between calls of a chain/agent. LangChain provides a standard interface for memory, a collection of memory implementations, and examples of chains/agents that use memory.
+
+**🧐 Evaluation:**
+
+[BETA] Generative models are notoriously hard to evaluate with traditional metrics. One new way of evaluating them is using language models themselves to do the evaluation. LangChain provides some prompts/chains for assisting in this.
+
+For more information on these concepts, please see our [full documentation](https://python.langchain.com).
+
+## 💁 Contributing
+
+As an open-source project in a rapidly developing field, we are extremely open to contributions, whether it be in the form of a new feature, improved infrastructure, or better documentation.
+
+For detailed information on how to contribute, see [here](.github/CONTRIBUTING.md).
--- a/SECURITY.md
+++ b/SECURITY.md
@@ -0,0 +1,6 @@
+# Security Policy
+
+## Reporting a Vulnerability
+
+Please report security vulnerabilities by email to `security@langchain.dev`.
+This email is an alias to a subset of our maintainers, and will ensure the issue is promptly triaged and acted upon as needed.
--- a/docker/Dockerfile.base
+++ b/docker/Dockerfile.base
@@ -0,0 +1,3 @@
+FROM python:latest
+
+RUN pip install langchain
--- a/docs/.local_build.sh
+++ b/docs/.local_build.sh
@@ -0,0 +1,18 @@
+#!/usr/bin/env bash
+
+set -o errexit
+set -o nounset
+set -o pipefail
+set -o xtrace
+
+SCRIPT_DIR="$(cd "$(dirname "$0")"; pwd)"
+cd "${SCRIPT_DIR}"
+
+mkdir -p _dist/docs_skeleton
+cp -r {docs_skeleton,snippets} _dist
+cp -r extras/* _dist/docs_skeleton/docs
+cd _dist/docs_skeleton
+poetry run nbdoc_build
+poetry run python generate_api_reference_links.py
+yarn install
+yarn start
--- a/docs/_scripts/model_feat_table.py
+++ b/docs/_scripts/model_feat_table.py
@@ -0,0 +1,150 @@
+import os
+from pathlib import Path
+
+from langchain import chat_models, llms
+from langchain.chat_models.base import BaseChatModel, SimpleChatModel
+from langchain.llms.base import BaseLLM, LLM
+
+INTEGRATIONS_DIR = (
+    Path(os.path.abspath(__file__)).parents[1] / "extras" / "integrations"
+)
+LLM_IGNORE = ("FakeListLLM", "OpenAIChat", "PromptLayerOpenAIChat")
+LLM_FEAT_TABLE_CORRECTION = {
+    "TextGen": {"_astream": False, "_agenerate": False},
+    "Ollama": {
+        "_stream": False,
+    },
+    "PromptLayerOpenAI": {"batch_generate": False, "batch_agenerate": False},
+}
+CHAT_MODEL_IGNORE = ("FakeListChatModel", "HumanInputChatModel")
+CHAT_MODEL_FEAT_TABLE_CORRECTION = {
+    "ChatMLflowAIGateway": {"_agenerate": False},
+    "PromptLayerChatOpenAI": {"_stream": False, "_astream": False},
+    "ChatKonko": {"_astream": False, "_agenerate": False},
+}
+
+LLM_TEMPLATE = """\
+---
+sidebar_position: 0
+sidebar_class_name: hidden
+---
+
+# LLMs
+
+import DocCardList from "@theme/DocCardList";
+
+## Features (natively supported)
+All LLMs implement the Runnable interface, which comes with default implementations of all methods, i.e. `ainvoke`, `batch`, `abatch`, `stream`, `astream`. This gives all LLMs basic support for async, streaming and batch, which by default are implemented as follows:
+- *Async* support defaults to calling the respective sync method in asyncio's default thread pool executor. This lets other async functions in your application make progress while the LLM is being executed, by moving this call to a background thread.
+- *Streaming* support defaults to returning an `Iterator` (or `AsyncIterator` in the case of async streaming) of a single value, the final result returned by the underlying LLM provider. This obviously doesn't give you token-by-token streaming, which requires native support from the LLM provider, but ensures your code that expects an iterator of tokens can work for any of our LLM integrations.
+- *Batch* support defaults to calling the underlying LLM in parallel for each input by making use of a thread pool executor (in the sync batch case) or `asyncio.gather` (in the async batch case). The concurrency can be controlled with the `max_concurrency` key in `RunnableConfig`.
+
+Each LLM integration can optionally provide native implementations for async, streaming or batch, which, for providers that support it, can be more efficient. The table shows, for each integration, which features have been implemented with native support.
+
+{table}
+
+<DocCardList />
+"""
+
+CHAT_MODEL_TEMPLATE = """\
+---
+sidebar_position: 1
+sidebar_class_name: hidden
+---
+
+# Chat models
+
+import DocCardList from "@theme/DocCardList";
+
+## Features (natively supported)
+All ChatModels implement the Runnable interface, which comes with default implementations of all methods, i.e. `ainvoke`, `batch`, `abatch`, `stream`, `astream`. This gives all ChatModels basic support for async, streaming and batch, which by default are implemented as follows:
+- *Async* support defaults to calling the respective sync method in asyncio's default thread pool executor. This lets other async functions in your application make progress while the ChatModel is being executed, by moving this call to a background thread.
+- *Streaming* support defaults to returning an `Iterator` (or `AsyncIterator` in the case of async streaming) of a single value, the final result returned by the underlying ChatModel provider. This obviously doesn't give you token-by-token streaming, which requires native support from the ChatModel provider, but ensures your code that expects an iterator of tokens can work for any of our ChatModel integrations.
+- *Batch* support defaults to calling the underlying ChatModel in parallel for each input by making use of a thread pool executor (in the sync batch case) or `asyncio.gather` (in the async batch case). The concurrency can be controlled with the `max_concurrency` key in `RunnableConfig`.
+
+Each ChatModel integration can optionally provide native implementations to truly enable async or streaming.
+The table shows, for each integration, which features have been implemented with native support.
+
+{table}
+
+<DocCardList />
+"""
+
+
+def get_llm_table():
+    llm_feat_table = {}
+    for cm in llms.__all__:
+        llm_feat_table[cm] = {}
+        cls = getattr(llms, cm)
+        if issubclass(cls, LLM):
+            for feat in ("_stream", "_astream", ("_acall", "_agenerate")):
+                if isinstance(feat, tuple):
+                    feat, name = feat
+                else:
+                    feat, name = feat, feat
+                llm_feat_table[cm][name] = getattr(cls, feat) != getattr(LLM, feat)
+        else:
+            for feat in [
+                "_stream",
+                "_astream",
+                ("_generate", "batch_generate"),
+                "_agenerate",
+                ("_agenerate", "batch_agenerate"),
+            ]:
+                if isinstance(feat, tuple):
+                    feat, name = feat
+                else:
+                    feat, name = feat, feat
+                llm_feat_table[cm][name] = getattr(cls, feat) != getattr(BaseLLM, feat)
+    final_feats = {
+        k: v
+        for k, v in {**llm_feat_table, **LLM_FEAT_TABLE_CORRECTION}.items()
+        if k not in LLM_IGNORE
+    }
+
+    header = [
+        "model",
+        "_agenerate",
+        "_stream",
+        "_astream",
+        "batch_generate",
+        "batch_agenerate",
+    ]
+    title = ["Model", "Invoke", "Async invoke", "Stream", "Async stream", "Batch", "Async batch"]
+    rows = [title, [":-"] + [":-:"] * (len(title) - 1)]
+    for llm, feats in sorted(final_feats.items()):
+        rows += [[llm, "✅"] + ["✅" if feats.get(h) else "❌" for h in header[1:]]]
+    return "\n".join(["|".join(row) for row in rows])
+
+
+def get_chat_model_table():
+    feat_table = {}
+    for cm in chat_models.__all__:
+        feat_table[cm] = {}
+        cls = getattr(chat_models, cm)
+        if issubclass(cls, SimpleChatModel):
+            comparison_cls = SimpleChatModel
+        else:
+            comparison_cls = BaseChatModel
+        for feat in ("_stream", "_astream", "_agenerate"):
+            feat_table[cm][feat] = getattr(cls, feat) != getattr(comparison_cls, feat)
+    final_feats = {
+        k: v
+        for k, v in {**feat_table, **CHAT_MODEL_FEAT_TABLE_CORRECTION}.items()
+        if k not in CHAT_MODEL_IGNORE
+    }
+    header = ["model", "_agenerate", "_stream", "_astream"]
+    title = ["Model", "Invoke", "Async invoke", "Stream", "Async stream"]
+    rows = [title, [":-"] + [":-:"] * (len(title) - 1)]
+    for llm, feats in sorted(final_feats.items()):
+        rows += [[llm, "✅"] + ["✅" if feats.get(h) else "❌" for h in header[1:]]]
+    return "\n".join(["|".join(row) for row in rows])
+
+
+if __name__ == "__main__":
+    llm_page = LLM_TEMPLATE.format(table=get_llm_table())
+    with open(INTEGRATIONS_DIR / "llms" / "index.mdx", "w") as f:
+        f.write(llm_page)
+    chat_model_page = CHAT_MODEL_TEMPLATE.format(table=get_chat_model_table())
+    with open(INTEGRATIONS_DIR / "chat" / "index.mdx", "w") as f:
+        f.write(chat_model_page)
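
Reviewer sketch (not part of the patch): the script above decides that an integration supports a feature "natively" when the class attribute differs from the one inherited from the base class, then renders the result as a pipe-separated Markdown table. The snippet below is a minimal, self-contained illustration of that idea. `Base`, `Native`, `native_features`, and `to_markdown` are hypothetical names made up for this example; they are not LangChain APIs.

```python
class Base:
    """Hypothetical base class providing default implementations."""

    def _stream(self):
        raise NotImplementedError

    def _agenerate(self):
        return "default"


class Native(Base):
    """Hypothetical integration that overrides only `_stream`."""

    def _stream(self):
        yield "token"


def native_features(cls, base, feats):
    # A feature counts as native when the subclass attribute is not the
    # very same function object that the base class defines.
    return {feat: getattr(cls, feat) != getattr(base, feat) for feat in feats}


def to_markdown(table, header):
    # First row is the title, second row the alignment markers, then one
    # row per model, sorted by name.
    rows = [["Model"] + list(header), [":-"] + [":-:"] * len(header)]
    for name, feats in sorted(table.items()):
        rows.append([name] + ["✅" if feats.get(h) else "❌" for h in header])
    return "\n".join("|".join(row) for row in rows)


FEATS = ("_stream", "_agenerate")
table = {"Native": native_features(Native, Base, FEATS)}
print(to_markdown(table, FEATS))
```

Comparing attributes this way only detects that a method was overridden, not that the override works, which is presumably why the script keeps manual correction tables alongside the automatic check.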
--- a/docs/api_reference/Makefile
+++ b/docs/api_reference/Makefile
@@ -0,0 +1,21 @@
+# Minimal makefile for Sphinx documentation
+#
+
+# You can set these variables from the command line, and also
+# from the environment for the first two.
+SPHINXOPTS    ?= 
+SPHINXBUILD   ?= sphinx-build
+SPHINXAUTOBUILD   ?= sphinx-autobuild
+SOURCEDIR     = .
+BUILDDIR      = _build
+
+# Put it first so that "make" without argument is like "make help".
+help:
+	@$(SPHINXBUILD) -M help "$(SOURCEDIR)" "$(BUILDDIR)" $(SPHINXOPTS) $(O)
+
+.PHONY: help Makefile
+
+# Catch-all target: route all unknown targets to Sphinx using the new
+# "make mode" option.  $(O) is meant as a shortcut for $(SPHINXOPTS).
+%: Makefile
+	@$(SPHINXBUILD) -M $@ "$(SOURCEDIR)" "$(BUILDDIR)" $(SPHINXOPTS) $(O)
--- a/docs/api_reference/_static/css/custom.css
+++ b/docs/api_reference/_static/css/custom.css
@@ -0,0 +1,17 @@
+pre {
+  white-space: break-spaces;
+}
+
+@media (min-width: 1200px) {
+  .container,
+  .container-lg,
+  .container-md,
+  .container-sm,
+  .container-xl {
+    max-width: 2560px !important;
+  }
+}
+
+#my-component-root *, #headlessui-portal-root * {
+  z-index: 10000;
+}
--- a/docs/api_reference/conf.py
+++ b/docs/api_reference/conf.py
@@ -0,0 +1,168 @@
+"""Configuration file for the Sphinx documentation builder."""
+# Configuration file for the Sphinx documentation builder.
+#
+# This file only contains a selection of the most common options. For a full
+# list see the documentation:
+# https://www.sphinx-doc.org/en/master/usage/configuration.html
+
+# -- Path setup --------------------------------------------------------------
+
+import json
+import os
+import sys
+from pathlib import Path
+
+import toml
+from docutils import nodes
+from sphinx.util.docutils import SphinxDirective
+
+# If extensions (or modules to document with autodoc) are in another directory,
+# add these directories to sys.path here. If the directory is relative to the
+# documentation root, use os.path.abspath to make it absolute, like shown here.
+
+_DIR = Path(__file__).parent.absolute()
+sys.path.insert(0, os.path.abspath("."))
+sys.path.insert(0, os.path.abspath("../../libs/langchain"))
+sys.path.insert(0, os.path.abspath("../../libs/experimental"))
+
+with (_DIR.parents[1] / "libs" / "langchain" / "pyproject.toml").open("r") as f:
+    data = toml.load(f)
+with (_DIR / "guide_imports.json").open("r") as f:
+    imported_classes = json.load(f)
+
+
+class ExampleLinksDirective(SphinxDirective):
+    """Directive to generate a list of links to examples.
+
+    We have a script that extracts links to API reference docs
+    from our notebook examples. This directive uses that information
+    to backlink to the examples from the API reference docs."""
+
+    has_content = False
+    required_arguments = 1
+
+    def run(self):
+        """Run the directive.
+
+        Called any time ``.. example_links:: ClassName`` is used
+        in the template *.rst files."""
+        class_or_func_name = self.arguments[0]
+        links = imported_classes.get(class_or_func_name, {})
+        list_node = nodes.bullet_list()
+        for doc_name, link in links.items():
+            item_node = nodes.list_item()
+            para_node = nodes.paragraph()
+            link_node = nodes.reference()
+            link_node["refuri"] = link
+            link_node.append(nodes.Text(doc_name))
+            para_node.append(link_node)
+            item_node.append(para_node)
+            list_node.append(item_node)
+        if list_node.children:
+            title_node = nodes.title()
+            title_node.append(nodes.Text(f"Examples using {class_or_func_name}"))
+            return [title_node, list_node]
+        return [list_node]
+
+
+def setup(app):
+    app.add_directive("example_links", ExampleLinksDirective)
+
+
+# -- Project information -----------------------------------------------------
+
+project = "🦜🔗 LangChain"
+copyright = "2023, Harrison Chase"
+author = "Harrison Chase"
+
+version = data["tool"]["poetry"]["version"]
+release = version
+
+html_title = project + " " + version
+html_last_updated_fmt = "%b %d, %Y"
+
+
+# -- General configuration ---------------------------------------------------
+
+# Add any Sphinx extension module names here, as strings. They can be
+# extensions coming with Sphinx (named 'sphinx.ext.*') or your custom
+# ones.
+extensions = [
+    "sphinx.ext.autodoc",
+    "sphinx.ext.autodoc.typehints",
+    "sphinx.ext.autosummary",
+    "sphinx.ext.napoleon",
+    "sphinx.ext.viewcode",
+    "sphinxcontrib.autodoc_pydantic",
+    "sphinx_copybutton",
+    "sphinx_panels",
+    "IPython.sphinxext.ipython_console_highlighting",
+]
+source_suffix = [".rst"]
+
+# Some autodoc_pydantic options are repeated in the actual template.
+# This is potentially user error, but there may be bugs in the Sphinx extension
+# with options not being passed through correctly from either location.
+autodoc_pydantic_model_show_json = False
+autodoc_pydantic_field_list_validators = False
+autodoc_pydantic_config_members = False
+autodoc_pydantic_model_show_config_summary = False
+autodoc_pydantic_model_show_validator_members = False
+autodoc_pydantic_model_show_validator_summary = False
+autodoc_pydantic_model_signature_prefix = "class"
+autodoc_pydantic_field_signature_prefix = "param"
+autodoc_member_order = "groupwise"
+autoclass_content = "both"
+autodoc_typehints_format = "short"
+
+# autodoc_typehints = "description"
+# Add any paths that contain templates here, relative to this directory.
+templates_path = ["templates"]
+
+# List of patterns, relative to source directory, that match files and
+# directories to ignore when looking for source files.
+# This pattern also affects html_static_path and html_extra_path.
+exclude_patterns = ["_build", "Thumbs.db", ".DS_Store"]
+
+
+# -- Options for HTML output -------------------------------------------------
+
+# The theme to use for HTML and HTML Help pages.  See the documentation for
+# a list of builtin themes.
+#
+html_theme = "scikit-learn-modern"
+html_theme_path = ["themes"]
+
+# redirects dictionary maps from old links to new links
+html_additional_pages = {}
+redirects = {
+    "index": "api_reference",
+}
+for old_link in redirects:
+    html_additional_pages[old_link] = "redirects.html"
+
+html_context = {
+    "display_github": True,  # Integrate GitHub
+    "github_user": "hwchase17",  # Username
+    "github_repo": "langchain",  # Repo name
+    "github_version": "master",  # Version
+    "conf_py_path": "/docs/api_reference",  # Path in the checkout to the docs root
+    "redirects": redirects,
+}
+
+# Add any paths that contain custom static files (such as style sheets) here,
+# relative to this directory. They are copied after the builtin static files,
+# so a file named "default.css" will overwrite the builtin "default.css".
+html_static_path = ["_static"]
+
+# These paths are either relative to html_static_path
+# or fully qualified paths (e.g. https://...)
+html_css_files = [
+    "css/custom.css",
+]
+html_use_index = False
+
+myst_enable_extensions = ["colon_fence"]
+
+# generate autosummary even if no references
+autosummary_generate = True
--- a/docs/api_reference/create_api_rst.py
+++ b/docs/api_reference/create_api_rst.py
@@ -0,0 +1,304 @@
+"""Script for auto-generating api_reference.rst."""
+import importlib
+import inspect
+import typing
+from pathlib import Path
+from typing import TypedDict, Sequence, List, Dict, Literal, Union, Optional
+from enum import Enum
+
+from pydantic import BaseModel
+
+ROOT_DIR = Path(__file__).parents[2].absolute()
+HERE = Path(__file__).parent
+
+PKG_DIR = ROOT_DIR / "libs" / "langchain" / "langchain"
+EXP_DIR = ROOT_DIR / "libs" / "experimental" / "langchain_experimental"
+WRITE_FILE = HERE / "api_reference.rst"
+EXP_WRITE_FILE = HERE / "experimental_api_reference.rst"
+
+
+ClassKind = Literal["TypedDict", "Regular", "Pydantic", "enum"]
+
+
+class ClassInfo(TypedDict):
+    """Information about a class."""
+
+    name: str
+    """The name of the class."""
+    qualified_name: str
+    """The fully qualified name of the class."""
+    kind: ClassKind
+    """The kind of the class."""
+    is_public: bool
+    """Whether the class is public or not."""
+
+
+class FunctionInfo(TypedDict):
+    """Information about a function."""
+
+    name: str
+    """The name of the function."""
+    qualified_name: str
+    """The fully qualified name of the function."""
+    is_public: bool
+    """Whether the function is public or not."""
+
+
+class ModuleMembers(TypedDict):
+    """A dictionary of module members."""
+
+    classes_: Sequence[ClassInfo]
+    functions: Sequence[FunctionInfo]
+
+
+def _load_module_members(module_path: str, namespace: str) -> ModuleMembers:
+    """Load all members of a module.
+
+    Args:
+        module_path: Path to the module.
+        namespace: the namespace of the module.
+
+    Returns:
+        ModuleMembers: The classes and functions defined in the module.
+    """
+    classes_: List[ClassInfo] = []
+    functions: List[FunctionInfo] = []
+    module = importlib.import_module(module_path)
+    for name, type_ in inspect.getmembers(module):
+        if not hasattr(type_, "__module__"):
+            continue
+        if type_.__module__ != module_path:
+            continue
+
+        if inspect.isclass(type_):
+            if type(type_) == typing._TypedDictMeta:  # type: ignore
+                kind: ClassKind = "TypedDict"
+            elif issubclass(type_, Enum):
+                kind = "enum"
+            elif issubclass(type_, BaseModel):
+                kind = "Pydantic"
+            else:
+                kind = "Regular"
+
+            classes_.append(
+                ClassInfo(
+                    name=name,
+                    qualified_name=f"{namespace}.{name}",
+                    kind=kind,
+                    is_public=not name.startswith("_"),
+                )
+            )
+        elif inspect.isfunction(type_):
+            functions.append(
+                FunctionInfo(
+                    name=name,
+                    qualified_name=f"{namespace}.{name}",
+                    is_public=not name.startswith("_"),
+                )
+            )
+        else:
+            continue
+
+    return ModuleMembers(
+        classes_=classes_,
+        functions=functions,
+    )
+
+
+def _merge_module_members(
+    module_members: Sequence[ModuleMembers],
+) -> ModuleMembers:
+    """Merge module members."""
+    classes_: List[ClassInfo] = []
+    functions: List[FunctionInfo] = []
+    for module in module_members:
+        classes_.extend(module["classes_"])
+        functions.extend(module["functions"])
+
+    return ModuleMembers(
+        classes_=classes_,
+        functions=functions,
+    )
+
+
+def _load_package_modules(
+    package_directory: Union[str, Path],
+    submodule: Optional[str] = None
+) -> Dict[str, ModuleMembers]:
+    """Recursively load modules of a package based on the file system.
+
+    Traversal based on the file system makes it easy to determine which
+    of the modules/packages are part of the package vs. 3rd party or built-in.
+
+    Args:
+        package_directory: Path to the package directory.
+        submodule: Optional name of submodule to load.
+
+    Returns:
+        Dict[str, ModuleMembers]: Module members keyed by top-level namespace.
+    """
+    package_path = (
+        Path(package_directory)
+        if isinstance(package_directory, str)
+        else package_directory
+    )
+    modules_by_namespace = {}
+
+    # Get the high level package name
+    package_name = package_path.name
+
+    # If we are loading a submodule, add it in
+    if submodule is not None:
+        package_path = package_path / submodule
+
+    for file_path in package_path.rglob("*.py"):
+        if file_path.name.startswith("_"):
+            continue
+
+        relative_module_name = file_path.relative_to(package_path)
+
+        # Skip if any module part starts with an underscore
+        if any(part.startswith("_") for part in relative_module_name.parts):
+            continue
+
+        # Get the full namespace of the module
+        namespace = ".".join(relative_module_name.with_suffix("").parts)
+        # Keep only the top level namespace
+        top_namespace = namespace.split(".")[0]
+
+        try:
+            # If submodule is present, we need to construct the paths in a slightly
+            # different way
+            if submodule is not None:
+                module_members = _load_module_members(
+                    f"{package_name}.{submodule}.{namespace}", f"{submodule}.{namespace}"
+                )
+            else:
+                module_members = _load_module_members(
+                    f"{package_name}.{namespace}", namespace
+                )
+            # Merge module members if the namespace already exists
+            if top_namespace in modules_by_namespace:
+                existing_module_members = modules_by_namespace[top_namespace]
+                _module_members = _merge_module_members(
+                    [existing_module_members, module_members]
+                )
+            else:
+                _module_members = module_members
+
+            modules_by_namespace[top_namespace] = _module_members
+
+        except ImportError as e:
+            print(f"Error: Unable to import module '{namespace}' with error: {e}")
+
+    return modules_by_namespace
+
+
+def _construct_doc(pkg: str, members_by_namespace: Dict[str, ModuleMembers]) -> str:
+    """Construct the contents of the reference.rst file for the given package.
+
+    Args:
+        pkg: The package name
+        members_by_namespace: The members of the package, keyed by top-level
+                              module; each value holds the classes and functions
+                              found inside that top-level namespace.
+
+    Returns:
+        The contents of the reference.rst file.
+    """
+    full_doc = f"""\
+=======================
+``{pkg}`` API Reference
+=======================
+
+"""
+    namespaces = sorted(members_by_namespace)
+
+    for module in namespaces:
+        _members = members_by_namespace[module]
+        classes = _members["classes_"]
+        functions = _members["functions"]
+        if not (classes or functions):
+            continue
+        section = f":mod:`{pkg}.{module}`"
+        underline = "=" * (len(section) + 1)
+        full_doc += f"""\
+{section}
+{underline}
+
+.. automodule:: {pkg}.{module}
+    :no-members:
+    :no-inherited-members:
+
+"""
+
+        if classes:
+            full_doc += f"""\
+Classes
+--------------
+.. currentmodule:: {pkg}
+
+.. autosummary::
+    :toctree: {module}
+"""
+
+            for class_ in sorted(classes, key=lambda c: c["qualified_name"]):
+                if not class_["is_public"]:
+                    continue
+
+                if class_["kind"] == "TypedDict":
+                    template = "typeddict.rst"
+                elif class_["kind"] == "enum":
+                    template = "enum.rst"
+                elif class_["kind"] == "Pydantic":
+                    template = "pydantic.rst"
+                else:
+                    template = "class.rst"
+
+                full_doc += f"""\
+    :template: {template}
+    
+    {class_["qualified_name"]}
+    
+"""
+
+        if functions:
+            _functions = [f["qualified_name"] for f in functions if f["is_public"]]
+            fstring = "\n    ".join(sorted(_functions))
+            full_doc += f"""\
+Functions
+--------------
+.. currentmodule:: {pkg}
+
+.. autosummary::
+    :toctree: {module}
+    :template: function.rst
+
+    {fstring}
+
+"""
+    return full_doc
+
+
+def main() -> None:
+    """Generate the reference.rst file for each package."""
+    lc_members = _load_package_modules(PKG_DIR)
+    # Put some packages at top level
+    tools = _load_package_modules(PKG_DIR, "tools")
+    lc_members['tools.render'] = tools['render']
+    agents = _load_package_modules(PKG_DIR, "agents")
+    lc_members['agents.output_parsers'] = agents['output_parsers']
+    lc_members['agents.format_scratchpad'] = agents['format_scratchpad']
+    lc_doc = ".. _api_reference:\n\n" + _construct_doc("langchain", lc_members)
+    with open(WRITE_FILE, "w") as f:
+        f.write(lc_doc)
+    exp_members = _load_package_modules(EXP_DIR)
+    exp_doc = ".. _experimental_api_reference:\n\n" + _construct_doc(
+        "langchain_experimental", exp_members
+    )
+    with open(EXP_WRITE_FILE, "w") as f:
+        f.write(exp_doc)
+
+
+if __name__ == "__main__":
+    main()
--- a/docs/api_reference/guide_imports.json
+++ b/docs/api_reference/guide_imports.json
--- a/docs/api_reference/index.rst
+++ b/docs/api_reference/index.rst
@@ -0,0 +1,8 @@
+=============
+LangChain API
+=============
+
+.. toctree::
+    :maxdepth: 2
+
+    api_reference.rst
--- a/docs/api_reference/make.bat
+++ b/docs/api_reference/make.bat
@@ -0,0 +1,35 @@
+@ECHO OFF
+
+pushd %~dp0
+
+REM Command file for Sphinx documentation
+
+if "%SPHINXBUILD%" == "" (
+	set SPHINXBUILD=sphinx-build
+)
+set SOURCEDIR=.
+set BUILDDIR=_build
+
+if "%1" == "" goto help
+
+%SPHINXBUILD% >NUL 2>NUL
+if errorlevel 9009 (
+	echo.
+	echo.The 'sphinx-build' command was not found. Make sure you have Sphinx
+	echo.installed, then set the SPHINXBUILD environment variable to point
+	echo.to the full path of the 'sphinx-build' executable. Alternatively you
+	echo.may add the Sphinx directory to PATH.
+	echo.
+	echo.If you don't have Sphinx installed, grab it from
+	echo.http://sphinx-doc.org/
+	exit /b 1
+)
+
+%SPHINXBUILD% -M %1 %SOURCEDIR% %BUILDDIR% %SPHINXOPTS% %O%
+goto end
+
+:help
+%SPHINXBUILD% -M help %SOURCEDIR% %BUILDDIR% %SPHINXOPTS% %O%
+
+:end
+popd
--- a/docs/api_reference/requirements.txt
+++ b/docs/api_reference/requirements.txt
@@ -0,0 +1,15 @@
+-e libs/langchain
+-e libs/experimental
+pydantic<2
+autodoc_pydantic==1.8.0
+myst_parser
+nbsphinx==0.8.9
+sphinx==4.5.0
+sphinx-autobuild==2021.3.14
+sphinx_rtd_theme==1.0.0
+sphinx-typlog-theme==0.8.0
+sphinx-panels
+toml
+myst_nb
+sphinx_copybutton
+pydata-sphinx-theme==0.13.1
--- a/docs/api_reference/templates/COPYRIGHT.txt
+++ b/docs/api_reference/templates/COPYRIGHT.txt
@@ -0,0 +1,27 @@
+Copyright (c) 2007-2023 The scikit-learn developers.
+All rights reserved.
+
+Redistribution and use in source and binary forms, with or without
+modification, are permitted provided that the following conditions are met:
+
+* Redistributions of source code must retain the above copyright notice, this
+  list of conditions and the following disclaimer.
+
+* Redistributions in binary form must reproduce the above copyright notice,
+  this list of conditions and the following disclaimer in the documentation
+  and/or other materials provided with the distribution.
+
+* Neither the name of the copyright holder nor the names of its
+  contributors may be used to endorse or promote products derived from
+  this software without specific prior written permission.
+
+THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS"
+AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
+IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
+DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE
+FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
+DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR
+SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
+CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
+OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
+OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
--- a/docs/api_reference/templates/class.rst
+++ b/docs/api_reference/templates/class.rst
@@ -0,0 +1,36 @@
+:mod:`{{module}}`.{{objname}}
+{{ underline }}==============
+
+.. currentmodule:: {{ module }}
+
+.. autoclass:: {{ objname }}
+
+   {% block attributes %}
+   {% if attributes %}
+   .. rubric:: {{ _('Attributes') }}
+
+   .. autosummary::
+   {% for item in attributes %}
+      ~{{ name }}.{{ item }}
+   {%- endfor %}
+   {% endif %}
+   {% endblock %}
+
+   {% block methods %}
+   {% if methods %}
+   .. rubric:: {{ _('Methods') }}
+
+   .. autosummary::
+   {% for item in methods %}
+      ~{{ name }}.{{ item }}
+   {%- endfor %}
+
+   {% for item in methods %}
+   .. automethod:: {{ name }}.{{ item }}
+   {%- endfor %}
+
+   {% endif %}
+   {% endblock %}
+
+
+.. example_links:: {{ objname }}
--- a/docs/api_reference/templates/enum.rst
+++ b/docs/api_reference/templates/enum.rst
@@ -0,0 +1,14 @@
+:mod:`{{module}}`.{{objname}}
+{{ underline }}==============
+
+.. currentmodule:: {{ module }}
+
+.. autoclass:: {{ objname }}
+
+    {% block attributes %}
+    {% for item in attributes %}
+    .. autoattribute:: {{ item }}
+    {% endfor %}
+    {% endblock %}
+
+.. example_links:: {{ objname }}
--- a/docs/api_reference/templates/function.rst
+++ b/docs/api_reference/templates/function.rst
@@ -0,0 +1,8 @@
+:mod:`{{module}}`.{{objname}}
+{{ underline }}==============
+
+.. currentmodule:: {{ module }}
+
+.. autofunction:: {{ objname }}
+
+.. example_links:: {{ objname }}
--- a/docs/api_reference/templates/pydantic.rst
+++ b/docs/api_reference/templates/pydantic.rst
@@ -0,0 +1,22 @@
+:mod:`{{module}}`.{{objname}}
+{{ underline }}==============
+
+.. currentmodule:: {{ module }}
+
+.. autopydantic_model:: {{ objname }}
+    :model-show-json: False
+    :model-show-config-summary: False
+    :model-show-validator-members: False
+    :model-show-field-summary: False
+    :field-signature-prefix: param
+    :members:
+    :undoc-members:
+    :inherited-members:
+    :member-order: groupwise
+    :show-inheritance: True
+    :special-members: __call__
+
+    {% block attributes %}
+    {% endblock %}
+
+.. example_links:: {{ objname }}
--- a/docs/api_reference/templates/redirects.html
+++ b/docs/api_reference/templates/redirects.html
@@ -0,0 +1,16 @@
+{% set redirect = pathto(redirects[pagename]) %}
+<!DOCTYPE html>
+<html>
+  <head>
+    <meta charset="utf-8">
+    <meta name="viewport" content="width=device-width, initial-scale=1.0">
+    <meta http-equiv="Refresh" content="0; url={{ redirect }}" />
+    <meta name="robots" content="follow, index">
+    <meta name="Description" content="Python API reference for LangChain.">
+    <link rel="canonical" href="{{ redirect }}" />
+    <title>LangChain Python API Reference Documentation.</title>
+  </head>
+  <body>
+    <p>You will be automatically redirected to the <a href="{{ redirect }}">new location of this page</a>.</p>
+  </body>
+</html>
--- a/docs/api_reference/templates/typeddict.rst
+++ b/docs/api_reference/templates/typeddict.rst
@@ -0,0 +1,14 @@
+:mod:`{{module}}`.{{objname}}
+{{ underline }}==============
+
+.. currentmodule:: {{ module }}
+
+.. autoclass:: {{ objname }}
+
+    {% block attributes %}
+    {% for item in attributes %}
+    .. autoattribute:: {{ item }}
+    {% endfor %}
+    {% endblock %}
+
+.. example_links:: {{ objname }}
--- a/docs/api_reference/themes/COPYRIGHT.txt
+++ b/docs/api_reference/themes/COPYRIGHT.txt
@@ -0,0 +1,27 @@
+Copyright (c) 2007-2023 The scikit-learn developers.
+All rights reserved.
+
+Redistribution and use in source and binary forms, with or without
+modification, are permitted provided that the following conditions are met:
+
+* Redistributions of source code must retain the above copyright notice, this
+  list of conditions and the following disclaimer.
+
+* Redistributions in binary form must reproduce the above copyright notice,
+  this list of conditions and the following disclaimer in the documentation
+  and/or other materials provided with the distribution.
+
+* Neither the name of the copyright holder nor the names of its
+  contributors may be used to endorse or promote products derived from
+  this software without specific prior written permission.
+
+THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS"
+AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
+IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
+DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE
+FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
+DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR
+SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
+CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
+OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
+OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
--- a/docs/api_reference/themes/scikit-learn-modern/javascript.html
+++ b/docs/api_reference/themes/scikit-learn-modern/javascript.html
@@ -0,0 +1,67 @@
+<script>
+$(document).ready(function() {
+    /* Add a [>>>] button on the top-right corner of code samples to hide
+     * the >>> and ... prompts and the output and thus make the code
+     * copyable. */
+    var div = $('.highlight-python .highlight,' +
+                '.highlight-python3 .highlight,' +
+                '.highlight-pycon .highlight,' +
+                '.highlight-default .highlight');
+    var pre = div.find('pre');
+
+    // make each code block's container a positioning context for the button
+    pre.parent().parent().css('position', 'relative');
+    var hide_text = 'Hide prompts and outputs';
+    var show_text = 'Show prompts and outputs';
+
+    // create and add the button to all the code blocks that contain >>>
+    div.each(function(index) {
+        var jthis = $(this);
+        if (jthis.find('.gp').length > 0) {
+            var button = $('<span class="copybutton">&gt;&gt;&gt;</span>');
+            button.attr('title', hide_text);
+            button.data('hidden', 'false');
+            jthis.prepend(button);
+        }
+        // tracebacks (.gt) contain bare text elements that need to be
+        // wrapped in a span to work with .nextUntil() (see later)
+        jthis.find('pre:has(.gt)').contents().filter(function() {
+            return ((this.nodeType === 3) && (this.data.trim().length > 0));
+        }).wrap('<span>');
+    });
+
+    // define the behavior of the button when it's clicked
+    $('.copybutton').click(function(e){
+        e.preventDefault();
+        var button = $(this);
+        if (button.data('hidden') === 'false') {
+            // hide the code output
+            button.parent().find('.go, .gp, .gt').hide();
+            button.next('pre').find('.gt').nextUntil('.gp, .go').css('visibility', 'hidden');
+            button.css('text-decoration', 'line-through');
+            button.attr('title', show_text);
+            button.data('hidden', 'true');
+        } else {
+            // show the code output
+            button.parent().find('.go, .gp, .gt').show();
+            button.next('pre').find('.gt').nextUntil('.gp, .go').css('visibility', 'visible');
+            button.css('text-decoration', 'none');
+            button.attr('title', hide_text);
+            button.data('hidden', 'false');
+        }
+    });
+
+	/*** Add permalink buttons next to glossary terms ***/
+	$('dl.glossary > dt[id]').append(function() {
+		return ('<a class="headerlink" href="#' +
+			    this.getAttribute('id') +
+			    '" title="Permalink to this term">¶</a>');
+	});
+});
+
+</script>
+{%- if pagename != 'index' and pagename != 'documentation' %}
+    {% if theme_mathjax_path %}
+<script id="MathJax-script" async src="{{ theme_mathjax_path }}"></script>
+    {% endif %}
+{%- endif %}
--- a/docs/api_reference/themes/scikit-learn-modern/layout.html
+++ b/docs/api_reference/themes/scikit-learn-modern/layout.html
@@ -0,0 +1,142 @@
+{# TEMPLATE VAR SETTINGS #}
+{%- set url_root = pathto('', 1) %}
+{%- if url_root == '#' %}{% set url_root = '' %}{% endif %}
+{%- if not embedded and docstitle %}
+  {%- set titlesuffix = " &mdash; "|safe + docstitle|e %}
+{%- else %}
+  {%- set titlesuffix = "" %}
+{%- endif %}
+{%- set lang_attr = 'en' %}
+
+<!DOCTYPE html>
+<!--[if IE 8]><html class="no-js lt-ie9" lang="{{ lang_attr }}" > <![endif]-->
+<!--[if gt IE 8]><!--> <html class="no-js" lang="{{ lang_attr }}" > <!--<![endif]-->
+<head>
+  <meta charset="utf-8">
+  {{ metatags }}
+  <meta name="viewport" content="width=device-width, initial-scale=1.0">
+
+  {% block htmltitle %}
+  <title>{{ title|striptags|e }}{{ titlesuffix }}</title>
+  {% endblock %}
+  <link rel="canonical" href="https://api.python.langchain.com/en/latest/{{pagename}}.html" />
+
+  {% if favicon_url %}
+  <link rel="shortcut icon" href="{{ favicon_url|e }}"/>
+  {% endif %}
+
+  <link rel="stylesheet" href="{{ pathto('_static/css/vendor/bootstrap.min.css', 1) }}" type="text/css" />
+  {%- for css in css_files %}
+    {%- if css|attr("rel") %}
+  <link rel="{{ css.rel }}" href="{{ pathto(css.filename, 1) }}" type="text/css"{% if css.title is not none %} title="{{ css.title }}"{% endif %} />
+    {%- else %}
+  <link rel="stylesheet" href="{{ pathto(css, 1) }}" type="text/css" />
+    {%- endif %}
+  {%- endfor %}
+  <link rel="stylesheet" href="{{ pathto('_static/' + style, 1) }}" type="text/css" />
+<script id="documentation_options" data-url_root="{{ pathto('', 1) }}" src="{{ pathto('_static/documentation_options.js', 1) }}"></script>
+<script src="{{ pathto('_static/jquery.js', 1) }}"></script>
+{%- block extrahead %} {% endblock %}
+</head>
+<body>
+{% include "nav.html" %}
+{%- block content %}
+<div class="d-flex" id="sk-doc-wrapper">
+    <input type="checkbox" name="sk-toggle-checkbox" id="sk-toggle-checkbox">
+    <label id="sk-sidemenu-toggle" class="sk-btn-toggle-toc btn sk-btn-primary" for="sk-toggle-checkbox">Toggle Menu</label>
+    <div id="sk-sidebar-wrapper" class="border-right">
+      <div class="sk-sidebar-toc-wrapper">
+        <div class="btn-group w-100 mb-2" role="group" aria-label="rellinks">
+          {%- if prev %}
+            <a href="{{ prev.link|e }}" role="button" class="btn sk-btn-rellink py-1" sk-rellink-tooltip="{{ prev.title|striptags }}">Prev</a>
+          {%- else %}
+            <a href="#" role="button" class="btn sk-btn-rellink py-1 disabled">Prev</a>
+          {%- endif %}
+          {%- if parents -%}
+            <a href="{{ parents[-1].link|e }}" role="button" class="btn sk-btn-rellink py-1" sk-rellink-tooltip="{{ parents[-1].title|striptags }}">Up</a>
+          {%- else %}
+            <a href="#" role="button" class="btn sk-btn-rellink disabled py-1">Up</a>
+          {%- endif %}
+          {%- if next %}
+            <a href="{{ next.link|e }}" role="button" class="btn sk-btn-rellink py-1" sk-rellink-tooltip="{{ next.title|striptags }}">Next</a>
+          {%- else %}
+            <a href="#" role="button" class="btn sk-btn-rellink py-1 disabled">Next</a>
+          {%- endif %}
+        </div>
+        {%- if pagename != "install" %}
+        <div class="alert alert-warning p-1 mb-2" role="alert">
+          <p class="text-center mb-0">
+          <strong>LangChain {{ release }}</strong><br/>
+          </p>
+        </div>
+        {%- endif %}
+            {%- if meta and meta['parenttoc']|tobool %}
+            <div class="sk-sidebar-toc">
+            {% set nav = get_nav_object(maxdepth=3, collapse=True, numbered=True) %}
+              <ul>
+              {% for main_nav_item in nav %}
+              {% if main_nav_item.active %}
+              <li>
+                <a href="{{ main_nav_item.url }}" class="sk-toc-active">{{ main_nav_item.title }}</a>
+              </li>
+              <ul>
+              {% for nav_item in main_nav_item.children %}
+                <li>
+                  <a href="{{ nav_item.url }}" class="{% if nav_item.active %}sk-toc-active{% endif %}">{{ nav_item.title }}</a>
+                  {% if nav_item.children %}
+                  <ul>
+                    {% for inner_child in nav_item.children %}
+                      <li class="sk-toctree-l3">
+                        <a href="{{ inner_child.url }}">{{ inner_child.title }}</a>
+                      </li>
+                    {% endfor %}
+                  </ul>
+                  {% endif %}
+                </li>
+              {% endfor %}
+              </ul>
+              {% endif %}
+              {% endfor %}
+              </ul>
+            </div>
+            {%- elif meta and meta['globalsidebartoc']|tobool %}
+            <div class="sk-sidebar-toc sk-sidebar-global-toc">
+              {{ toctree(maxdepth=2, titles_only=True) }}
+            </div>
+            {%- else %}
+            <div class="sk-sidebar-toc">
+              {{ toc }}
+            </div>
+            {%- endif %}
+      </div>
+    </div>
+    <div id="sk-page-content-wrapper">
+      <div class="sk-page-content container-fluid body px-md-3" role="main">
+        {% block body %}{% endblock %}
+      </div>
+    <div class="container">
+      <footer class="sk-content-footer">
+        {%- if pagename != 'index' %}
+        {%- if show_copyright %}
+          {%- if hasdoc('copyright') %}
+            {% trans path=pathto('copyright'), copyright=copyright|e %}&copy; {{ copyright }}.{% endtrans %}
+          {%- else %}
+            {% trans copyright=copyright|e %}&copy; {{ copyright }}.{% endtrans %}
+          {%- endif %}
+        {%- endif %}
+        {%- if last_updated %}
+          {% trans last_updated=last_updated|e %}Last updated on {{ last_updated }}.{% endtrans %}
+        {%- endif %}
+        {%- if show_source and has_source and sourcename %}
+          <a href="{{ pathto('_sources/' + sourcename, true)|e }}" rel="nofollow">{{ _('Show this page source') }}</a>
+        {%- endif %}
+        {%- endif %}
+      </footer>
+    </div>
+  </div>
+</div>
+{%- endblock %}
+<script src="{{ pathto('_static/js/vendor/bootstrap.min.js', 1) }}"></script>
+{% include "javascript.html" %}
+</body>
+</html>
--- a/docs/api_reference/themes/scikit-learn-modern/nav.html
+++ b/docs/api_reference/themes/scikit-learn-modern/nav.html
@@ -0,0 +1,61 @@
+{%- if pagename != 'index' and pagename != 'documentation' %}
+  {%- set nav_bar_class = "sk-docs-navbar" %}
+  {%- set top_container_cls = "sk-docs-container" %}
+{%- else %}
+  {%- set nav_bar_class = "sk-landing-navbar" %}
+  {%- set top_container_cls = "sk-landing-container" %}
+{%- endif %}
+
+<nav id="navbar" class="{{ nav_bar_class }} navbar navbar-expand-md navbar-light bg-light py-0">
+  <div class="container-fluid {{ top_container_cls }} px-0">
+    {%- if logo_url %}
+      <a class="navbar-brand py-0" href="{{ pathto('index') }}">
+        <img
+          class="sk-brand-img"
+          src="{{ logo_url|e }}"
+          alt="logo"/>
+      </a>
+    {%- endif %}
+    <button
+      id="sk-navbar-toggler"
+      class="navbar-toggler"
+      type="button"
+      data-toggle="collapse"
+      data-target="#navbarSupportedContent"
+      aria-controls="navbarSupportedContent"
+      aria-expanded="false"
+      aria-label="Toggle navigation"
+    >
+      <span class="navbar-toggler-icon"></span>
+    </button>
+
+    <div class="sk-navbar-collapse collapse navbar-collapse" id="navbarSupportedContent">
+      <ul class="navbar-nav mr-auto">
+        <li class="nav-item">
+          <a class="sk-nav-link nav-link" href="{{ pathto('api_reference') }}">API</a>
+        </li>
+        <li class="nav-item">
+          <a class="sk-nav-link nav-link" href="{{ pathto('experimental_api_reference') }}">Experimental</a>
+        </li>
+        <li class="nav-item">
+          <a class="sk-nav-link nav-link" target="_blank" rel="noopener noreferrer" href="https://python.langchain.com/">Python Docs</a>
+        </li>
+        {%- for title, link, link_attrs in drop_down_navigation %}
+        <li class="nav-item">
+          <a class="sk-nav-link nav-link nav-more-item-mobile-items" href="{{ link }}" {{ link_attrs }}>{{ title }}</a>
+        </li>
+        {%- endfor %}
+      </ul>
+      {%- if pagename != "search" %}
+      <div id="searchbox" role="search">
+          <div class="searchformwrapper">
+          <form class="search" action="{{ pathto('search') }}" method="get">
+            <input class="sk-search-text-input" type="text" name="q" aria-labelledby="searchlabel" />
+            <input class="sk-search-text-btn" type="submit" value="{{ _('Go') }}" />
+          </form>
+          </div>
+      </div>
+      {%- endif %}
+    </div>
+  </div>
+</nav>
--- a/docs/api_reference/themes/scikit-learn-modern/search.html
+++ b/docs/api_reference/themes/scikit-learn-modern/search.html
@@ -0,0 +1,16 @@
+{%- extends "basic/search.html" %}
+{% block extrahead %}
+  <script type="text/javascript" src="{{ pathto('_static/underscore.js', 1) }}"></script>
+  <script type="text/javascript" src="{{ pathto('searchindex.js', 1) }}" defer></script>
+  <script type="text/javascript" src="{{ pathto('_static/doctools.js', 1) }}"></script>
+  <script type="text/javascript" src="{{ pathto('_static/language_data.js', 1) }}"></script>
+  <script type="text/javascript" src="{{ pathto('_static/searchtools.js', 1) }}"></script>
+  <!-- <script type="text/javascript" src="{{ pathto('_static/sphinx_highlight.js', 1) }}"></script> -->
+  <script type="text/javascript">
+    $(document).ready(function() {
+      if (!Search.out) {
+        Search.init();
+      }
+    });
+  </script>
+{% endblock %}
Author	SHA1	Message	Date
Predrag Gruevski	3bf39ca635	Merge branch 'master' into pg/python-3.12	2023-10-04 14:24:35 -04:00
Predrag Gruevski	8d7acc94ba	Merge branch 'master' into pg/python-3.12	2023-10-03 15:20:29 +00:00
Predrag Gruevski	f6a1a1c517	Merge branch 'master' into pg/python-3.12	2023-10-02 17:21:43 -04:00
Predrag Gruevski	df1594cbb6	Add Python 3.12 to CI testing matrix.	2023-10-02 17:29:51 +00:00