Mirror of https://github.com/hwchase17/langchain.git (synced 2025-08-16 16:11:02 +00:00)

Commit b19010c769: Merge branch 'langchain-ai:master' into master
**.devcontainer/README.md**

@@ -5,26 +5,31 @@ This project includes a [dev container](https://containers.dev/), which lets you
You can use the dev container configuration in this folder to build and run the app without needing to install any of its tools locally! You can use it in [GitHub Codespaces](https://github.com/features/codespaces) or the [VS Code Dev Containers extension](https://marketplace.visualstudio.com/items?itemName=ms-vscode-remote.remote-containers).

## GitHub Codespaces

[](https://codespaces.new/langchain-ai/langchain)

You may use the button above, or follow these steps to open this repo in a Codespace:

-1. Click the **Code** drop-down menu at the top of https://github.com/langchain-ai/langchain.
+1. Click the **Code** drop-down menu at the top of <https://github.com/langchain-ai/langchain>.
1. Click on the **Codespaces** tab.
1. Click **Create codespace on master**.

For more info, check out the [GitHub documentation](https://docs.github.com/en/free-pro-team@latest/github/developing-online-with-codespaces/creating-a-codespace#creating-a-codespace).

## VS Code Dev Containers

[](https://vscode.dev/redirect?url=vscode://ms-vscode-remote.remote-containers/cloneInVolume?url=https://github.com/langchain-ai/langchain)

-Note: If you click the link above you will open the main repo (langchain-ai/langchain) and not your local cloned repo. This is fine if you only want to run and test the library, but if you want to contribute you can use the link below and replace with your username and cloned repo name:
-```
-https://vscode.dev/redirect?url=vscode://ms-vscode-remote.remote-containers/cloneInVolume?url=https://github.com/<yourusername>/<yourclonedreponame>
+> [!NOTE]
+> If you click the link above you will open the main repo (`langchain-ai/langchain`) and *not* your local cloned repo. This is fine if you only want to run and test the library, but if you want to contribute you can use the link below and replace with your username and cloned repo name:
+
+```txt
+https://vscode.dev/redirect?url=vscode://ms-vscode-remote.remote-containers/cloneInVolume?url=https://github.com/<YOUR_USERNAME>/<YOUR_CLONED_REPO_NAME>
```

Then you will have a local cloned repo where you can contribute and then create pull requests.

-If you already have VS Code and Docker installed, you can use the button above to get started. This will cause VS Code to automatically install the Dev Containers extension if needed, clone the source code into a container volume, and spin up a dev container for use.
+If you already have VS Code and Docker installed, you can use the button above to get started. This will use VSCode to automatically install the Dev Containers extension if needed, clone the source code into a container volume, and spin up a dev container for use.

Alternatively you can also follow these steps to open this repo in a container using the VS Code Dev Containers extension:

@@ -40,5 +45,5 @@ You can learn more in the [Dev Containers documentation](https://code.visualstudio.com/docs/devcontainers/containers)

## Tips and tricks

-* If you are working with the same repository folder in a container and Windows, you'll want consistent line endings (otherwise you may see hundreds of changes in the SCM view). The `.gitattributes` file in the root of this repo will disable line ending conversion and should prevent this. See [tips and tricks](https://code.visualstudio.com/docs/devcontainers/tips-and-tricks#_resolving-git-line-ending-issues-in-containers-resulting-in-many-modified-files) for more info.
-* If you'd like to review the contents of the image used in this dev container, you can check it out in the [devcontainers/images](https://github.com/devcontainers/images/tree/main/src/python) repo.
+- If you are working with the same repository folder in a container and Windows, you'll want consistent line endings (otherwise you may see hundreds of changes in the SCM view). The `.gitattributes` file in the root of this repo will disable line ending conversion and should prevent this. See [tips and tricks](https://code.visualstudio.com/docs/devcontainers/tips-and-tricks#_resolving-git-line-ending-issues-in-containers-resulting-in-many-modified-files) for more info.
+- If you'd like to review the contents of the image used in this dev container, you can check it out in the [devcontainers/images](https://github.com/devcontainers/images/tree/main/src/python) repo.
**.devcontainer/devcontainer.json**

@@ -1,36 +1,58 @@
// For format details, see https://aka.ms/devcontainer.json. For config options, see the
// README at: https://github.com/devcontainers/templates/tree/main/src/docker-existing-docker-compose
{
-  // Name for the dev container
-  "name": "langchain",
-
-  // Point to a Docker Compose file
-  "dockerComposeFile": "./docker-compose.yaml",
-
-  // Required when using Docker Compose. The name of the service to connect to once running
-  "service": "langchain",
-
-  // The optional 'workspaceFolder' property is the path VS Code should open by default when
-  // connected. This is typically a file mount in .devcontainer/docker-compose.yml
-  "workspaceFolder": "/workspaces/langchain",
-
-  // Prevent the container from shutting down
-  "overrideCommand": true
-
-  // Features to add to the dev container. More info: https://containers.dev/features
-  // "features": {
-  //   "ghcr.io/devcontainers-contrib/features/poetry:2": {}
-  // }
-
-  // Use 'forwardPorts' to make a list of ports inside the container available locally.
-  // "forwardPorts": [],
-
-  // Uncomment the next line to run commands after the container is created.
-  // "postCreateCommand": "cat /etc/os-release",
-
-  // Configure tool-specific properties.
-  // "customizations": {},
-
-  // Uncomment to connect as root instead. More info: https://aka.ms/dev-containers-non-root.
-  // "remoteUser": "root"
+  // Name for the dev container
+  "name": "langchain",
+  // Point to a Docker Compose file
+  "dockerComposeFile": "./docker-compose.yaml",
+  // Required when using Docker Compose. The name of the service to connect to once running
+  "service": "langchain",
+  // The optional 'workspaceFolder' property is the path VS Code should open by default when
+  // connected. This is typically a file mount in .devcontainer/docker-compose.yml
+  "workspaceFolder": "/workspaces/langchain",
+  "mounts": [
+    "source=langchain-workspaces,target=/workspaces/langchain,type=volume"
+  ],
+  // Prevent the container from shutting down
+  "overrideCommand": true,
+  // Features to add to the dev container. More info: https://containers.dev/features
+  "features": {
+    "ghcr.io/devcontainers/features/git:1": {},
+    "ghcr.io/devcontainers/features/github-cli:1": {}
+  },
+  "containerEnv": {
+    "UV_LINK_MODE": "copy"
+  },
+  // Use 'forwardPorts' to make a list of ports inside the container available locally.
+  // "forwardPorts": [],
+  // Run commands after the container is created
+  "postCreateCommand": "uv sync && echo 'LangChain (Python) dev environment ready!'",
+  // Configure tool-specific properties.
+  "customizations": {
+    "vscode": {
+      "extensions": [
+        "ms-python.python",
+        "ms-python.debugpy",
+        "ms-python.mypy-type-checker",
+        "ms-python.isort",
+        "unifiedjs.vscode-mdx",
+        "davidanson.vscode-markdownlint",
+        "ms-toolsai.jupyter",
+        "GitHub.copilot",
+        "GitHub.copilot-chat"
+      ],
+      "settings": {
+        "python.defaultInterpreterPath": ".venv/bin/python",
+        "python.formatting.provider": "none",
+        "[python]": {
+          "editor.formatOnSave": true,
+          "editor.codeActionsOnSave": {
+            "source.organizeImports": true
+          }
+        }
+      }
+    }
+  },
+  // Uncomment to connect as root instead. More info: https://aka.ms/dev-containers-non-root.
+  // "remoteUser": "root"
}
**.devcontainer/docker-compose.yaml**

@@ -4,26 +4,9 @@ services:
    build:
      dockerfile: libs/langchain/dev.Dockerfile
      context: ..
    volumes:
      # Update this to wherever you want VS Code to mount the folder of your project
      - ..:/workspaces/langchain:cached

    networks:
      - langchain-network
-    # environment:
-    #   MONGO_ROOT_USERNAME: root
-    #   MONGO_ROOT_PASSWORD: example123
-    # depends_on:
-    #   - mongo
-  # mongo:
-  #   image: mongo
-  #   restart: unless-stopped
-  #   environment:
-  #     MONGO_INITDB_ROOT_USERNAME: root
-  #     MONGO_INITDB_ROOT_PASSWORD: example123
-  #   ports:
-  #     - "27017:27017"
-  #   networks:
-  #     - langchain-network

networks:
  langchain-network:
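The new devcontainer.json above mounts a named volume (`langchain-workspaces`) over `/workspaces/langchain`, while this compose file keeps its bind mount. For comparison, the same named volume expressed on the compose side would look roughly like the sketch below. This is an illustration, not part of the commit; only the volume name, mount target, and build settings come from the hunks above.

```yaml
# Illustrative only: a compose-side equivalent of the devcontainer "mounts" entry.
# The named volume declaration is the assumed addition.
services:
  langchain:
    build:
      dockerfile: libs/langchain/dev.Dockerfile
      context: ..
    volumes:
      - langchain-workspaces:/workspaces/langchain
volumes:
  langchain-workspaces:
```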
**.editorconfig** (new file, 52 lines)

@@ -0,0 +1,52 @@
+# top-most EditorConfig file
+root = true
+
+# All files
+[*]
+charset = utf-8
+end_of_line = lf
+insert_final_newline = true
+trim_trailing_whitespace = true
+
+# Python files
+[*.py]
+indent_style = space
+indent_size = 4
+max_line_length = 88
+
+# JSON files
+[*.json]
+indent_style = space
+indent_size = 2
+
+# YAML files
+[*.{yml,yaml}]
+indent_style = space
+indent_size = 2
+
+# Markdown files
+[*.md]
+indent_style = space
+indent_size = 2
+trim_trailing_whitespace = false
+
+# Configuration files
+[*.{toml,ini,cfg}]
+indent_style = space
+indent_size = 4
+
+# Shell scripts
+[*.sh]
+indent_style = space
+indent_size = 2
+
+# Makefile
+[Makefile]
+indent_style = tab
+indent_size = 4
+
+# Jupyter notebooks
+[*.ipynb]
+# Jupyter may include trailing whitespace in cell
+# outputs that's semantically meaningful
+trim_trailing_whitespace = false
**.github/workflows/_compile_integration_test.yml** (vendored, 12 lines changed)

@@ -1,4 +1,4 @@
-name: compile-integration-test
+name: '🔗 Compile Integration Tests'

on:
  workflow_call:
@@ -25,24 +25,24 @@ jobs:
        working-directory: ${{ inputs.working-directory }}
    runs-on: ubuntu-latest
    timeout-minutes: 20
-    name: "uv run pytest -m compile tests/integration_tests #${{ inputs.python-version }}"
+    name: 'Python ${{ inputs.python-version }}'
    steps:
      - uses: actions/checkout@v4

-      - name: Set up Python ${{ inputs.python-version }} + uv
+      - name: '🐍 Set up Python ${{ inputs.python-version }} + UV'
        uses: "./.github/actions/uv_setup"
        with:
          python-version: ${{ inputs.python-version }}

-      - name: Install integration dependencies
+      - name: '📦 Install Integration Dependencies'
        shell: bash
        run: uv sync --group test --group test_integration

-      - name: Check integration tests compile
+      - name: '🔗 Check Integration Tests Compile'
        shell: bash
        run: uv run pytest -m compile tests/integration_tests

-      - name: Ensure the tests did not create any additional files
+      - name: '🧹 Verify Clean Working Directory'
        shell: bash
        run: |
          set -eu
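Several of these workflows end with the same "🧹 Verify Clean Working Directory" step, which the diff viewer cuts off after `set -eu`. Based on the fuller copy visible later in the check_diffs.yml hunks, the step presumably reads something like the sketch below; everything after `set -eu` is assembled from that later hunk rather than from this one.

```yaml
# Sketch of the cleanliness check; body beyond `set -eu` is taken from the
# check_diffs.yml hunk further down, not from this truncated hunk.
- name: '🧹 Verify Clean Working Directory'
  shell: bash
  run: |
    set -eu
    STATUS="$(git status)"
    echo "$STATUS"
    # grep exits non-zero when the tree is dirty, and `set -e` fails the step
    echo "$STATUS" | grep 'nothing to commit, working tree clean'
```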
**.github/workflows/_integration_test.yml** (vendored, 10 lines changed)

@@ -1,4 +1,4 @@
-name: Integration Tests
+name: '🚀 Integration Tests'

on:
  workflow_dispatch:
@@ -24,20 +24,20 @@ jobs:
      run:
        working-directory: ${{ inputs.working-directory }}
    runs-on: ubuntu-latest
-    name: Python ${{ inputs.python-version }}
+    name: '🚀 Integration Tests (Python ${{ inputs.python-version }})'
    steps:
      - uses: actions/checkout@v4

-      - name: Set up Python ${{ inputs.python-version }} + uv
+      - name: '🐍 Set up Python ${{ inputs.python-version }} + UV'
        uses: "./.github/actions/uv_setup"
        with:
          python-version: ${{ inputs.python-version }}

-      - name: Install dependencies
+      - name: '📦 Install Integration Dependencies'
        shell: bash
        run: uv sync --group test --group test_integration

-      - name: Run integration tests
+      - name: '🚀 Run Integration Tests'
        shell: bash
        env:
          AI21_API_KEY: ${{ secrets.AI21_API_KEY }}
**.github/workflows/_lint.yml** (vendored, 22 lines changed)

@@ -1,4 +1,6 @@
-name: lint
+name: '🧹 Code Linting'
+# Runs code quality checks using ruff, mypy, and other linting tools
+# Checks both package code and test code for consistency

on:
  workflow_call:
@@ -24,19 +26,21 @@ env:
  UV_FROZEN: "true"

jobs:
+  # Linting job - runs quality checks on package and test code
  build:
-    name: "make lint #${{ inputs.python-version }}"
+    name: 'Python ${{ inputs.python-version }}'
    runs-on: ubuntu-latest
    timeout-minutes: 20
    steps:
-      - uses: actions/checkout@v4
+      - name: '📋 Checkout Code'
+        uses: actions/checkout@v4

-      - name: Set up Python ${{ inputs.python-version }} + uv
+      - name: '🐍 Set up Python ${{ inputs.python-version }} + UV'
        uses: "./.github/actions/uv_setup"
        with:
          python-version: ${{ inputs.python-version }}

-      - name: Install dependencies
+      - name: '📦 Install Lint & Typing Dependencies'
        # Also installs dev/lint/test/typing dependencies, to ensure we have
        # type hints for as many of our libraries as possible.
        # This helps catch errors that require dependencies to be spotted, for example:
@@ -49,12 +53,12 @@ jobs:
        run: |
          uv sync --group lint --group typing

-      - name: Analysing the code with our lint
+      - name: '🔍 Analyze Package Code with Linters'
        working-directory: ${{ inputs.working-directory }}
        run: |
          make lint_package

-      - name: Install unit test dependencies
+      - name: '📦 Install Unit Test Dependencies'
        # Also installs dev/lint/test/typing dependencies, to ensure we have
        # type hints for as many of our libraries as possible.
        # This helps catch errors that require dependencies to be spotted, for example:
@@ -67,13 +71,13 @@ jobs:
        working-directory: ${{ inputs.working-directory }}
        run: |
          uv sync --inexact --group test
-      - name: Install unit+integration test dependencies
+      - name: '📦 Install Unit + Integration Test Dependencies'
        if: ${{ startsWith(inputs.working-directory, 'libs/partners/') }}
        working-directory: ${{ inputs.working-directory }}
        run: |
          uv sync --inexact --group test --group test_integration

-      - name: Analysing the code with our lint
+      - name: '🔍 Analyze Test Code with Linters'
        working-directory: ${{ inputs.working-directory }}
        run: |
          make lint_tests
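Like the other underscore-prefixed files here, _lint.yml is a reusable workflow (`on: workflow_call`). For reference, a caller invokes it roughly as below; the calling pattern mirrors the check_diffs.yml hunks further down, while the concrete input values are illustrative assumptions.

```yaml
# Hypothetical caller job; input names follow the hunks above.
jobs:
  lint:
    uses: ./.github/workflows/_lint.yml
    with:
      working-directory: libs/core   # example value, assumed
      python-version: "3.11"         # example value, assumed
    secrets: inherit
```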
**.github/workflows/_release.yml** (vendored, 8 lines changed)

@@ -1,5 +1,5 @@
-name: Release
-run-name: Release ${{ inputs.working-directory }} by @${{ github.actor }}
+name: '🚀 Package Release'
+run-name: '🚀 Release ${{ inputs.working-directory }} by @${{ github.actor }}'
on:
  workflow_call:
    inputs:
@@ -18,7 +18,7 @@ on:
      required: false
      type: boolean
      default: false
-      description: "Release from a non-master branch (danger!)"
+      description: "Release from a non-master branch (danger!) - Only use for hotfixes"

env:
  PYTHON_VERSION: "3.11"
@@ -26,6 +26,8 @@ env:
  UV_NO_SYNC: "true"

jobs:
+  # Build the distribution package and extract version info
+  # Runs in isolated environment with minimal permissions for security
  build:
    if: github.ref == 'refs/heads/master' || inputs.dangerous-nonmaster-release
    environment: Scheduled testing
**.github/workflows/_test.yml** (vendored, 22 lines changed)

@@ -1,4 +1,6 @@
-name: test
+name: '🧪 Unit Testing'
+# Runs unit tests with both current and minimum supported dependency versions
+# to ensure compatibility across the supported range

on:
  workflow_call:
@@ -20,31 +22,33 @@ env:
  UV_NO_SYNC: "true"

jobs:
+  # Main test job - runs unit tests with current deps, then retests with minimum versions
  build:
    defaults:
      run:
        working-directory: ${{ inputs.working-directory }}
    runs-on: ubuntu-latest
    timeout-minutes: 20
-    name: "make test #${{ inputs.python-version }}"
+    name: 'Python ${{ inputs.python-version }}'
    steps:
-      - uses: actions/checkout@v4
+      - name: '📋 Checkout Code'
+        uses: actions/checkout@v4

-      - name: Set up Python ${{ inputs.python-version }} + uv
+      - name: '🐍 Set up Python ${{ inputs.python-version }} + UV'
        uses: "./.github/actions/uv_setup"
        id: setup-python
        with:
          python-version: ${{ inputs.python-version }}
-      - name: Install dependencies
+      - name: '📦 Install Test Dependencies'
        shell: bash
        run: uv sync --group test --dev

-      - name: Run core tests
+      - name: '🧪 Run Core Unit Tests'
        shell: bash
        run: |
          make test

-      - name: Get minimum versions
+      - name: '🔍 Calculate Minimum Dependency Versions'
        working-directory: ${{ inputs.working-directory }}
        id: min-version
        shell: bash
@@ -55,7 +59,7 @@ jobs:
          echo "min-versions=$min_versions" >> "$GITHUB_OUTPUT"
          echo "min-versions=$min_versions"

-      - name: Run unit tests with minimum dependency versions
+      - name: '🧪 Run Tests with Minimum Dependencies'
        if: ${{ steps.min-version.outputs.min-versions != '' }}
        env:
          MIN_VERSIONS: ${{ steps.min-version.outputs.min-versions }}
@@ -64,7 +68,7 @@ jobs:
          make tests
        working-directory: ${{ inputs.working-directory }}

-      - name: Ensure the tests did not create any additional files
+      - name: '🧹 Verify Clean Working Directory'
        shell: bash
        run: |
          set -eu
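Pieced together from the hunks above, the minimum-version flow is: compute a `min-versions` string, export it as the `min-version` step's output, then reinstall pinned versions and rerun the suite. A sketch of the retest step, with the truncated `run` body filled in as an assumption (the hunk shows only `make tests` and the `MIN_VERSIONS` env wiring):

```yaml
- name: '🧪 Run Tests with Minimum Dependencies'
  if: ${{ steps.min-version.outputs.min-versions != '' }}
  env:
    MIN_VERSIONS: ${{ steps.min-version.outputs.min-versions }}
  run: |
    # Assumed body: pin every dependency to its oldest supported version,
    # then rerun the suite (only `make tests` is visible in the hunk).
    VIRTUAL_ENV=.venv uv pip install $MIN_VERSIONS
    make tests
  working-directory: ${{ inputs.working-directory }}
```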
**.github/workflows/_test_doc_imports.yml** (vendored, 17 lines changed)

@@ -1,4 +1,4 @@
-name: test_doc_imports
+name: '📑 Documentation Import Testing'

on:
  workflow_call:
@@ -18,29 +18,30 @@ jobs:
  build:
    runs-on: ubuntu-latest
    timeout-minutes: 20
-    name: "check doc imports #${{ inputs.python-version }}"
+    name: '🔍 Check Doc Imports (Python ${{ inputs.python-version }})'
    steps:
-      - uses: actions/checkout@v4
+      - name: '📋 Checkout Code'
+        uses: actions/checkout@v4

-      - name: Set up Python ${{ inputs.python-version }} + uv
+      - name: '🐍 Set up Python ${{ inputs.python-version }} + UV'
        uses: "./.github/actions/uv_setup"
        with:
          python-version: ${{ inputs.python-version }}

-      - name: Install dependencies
+      - name: '📦 Install Test Dependencies'
        shell: bash
        run: uv sync --group test

-      - name: Install langchain editable
+      - name: '📦 Install LangChain in Editable Mode'
        run: |
          VIRTUAL_ENV=.venv uv pip install langchain-experimental langchain-community -e libs/core libs/langchain

-      - name: Check doc imports
+      - name: '🔍 Validate Documentation Import Statements'
        shell: bash
        run: |
          uv run python docs/scripts/check_imports.py

-      - name: Ensure the test did not create any additional files
+      - name: '🧹 Verify Clean Working Directory'
        shell: bash
        run: |
          set -eu
**.github/workflows/_test_pydantic.yml** (vendored, 17 lines changed)

@@ -1,4 +1,4 @@
-name: test pydantic intermediate versions
+name: '🐍 Pydantic Version Testing'

on:
  workflow_call:
@@ -31,29 +31,30 @@ jobs:
        working-directory: ${{ inputs.working-directory }}
    runs-on: ubuntu-latest
    timeout-minutes: 20
-    name: "make test # pydantic: ~=${{ inputs.pydantic-version }}, python: ${{ inputs.python-version }}, "
+    name: 'Pydantic ~=${{ inputs.pydantic-version }}'
    steps:
-      - uses: actions/checkout@v4
+      - name: '📋 Checkout Code'
+        uses: actions/checkout@v4

-      - name: Set up Python ${{ inputs.python-version }} + uv
+      - name: '🐍 Set up Python ${{ inputs.python-version }} + UV'
        uses: "./.github/actions/uv_setup"
        with:
          python-version: ${{ inputs.python-version }}

-      - name: Install dependencies
+      - name: '📦 Install Test Dependencies'
        shell: bash
        run: uv sync --group test

-      - name: Overwrite pydantic version
+      - name: '🔄 Install Specific Pydantic Version'
        shell: bash
        run: VIRTUAL_ENV=.venv uv pip install pydantic~=${{ inputs.pydantic-version }}

-      - name: Run core tests
+      - name: '🧪 Run Core Tests'
        shell: bash
        run: |
          make test

-      - name: Ensure the tests did not create any additional files
+      - name: '🧹 Verify Clean Working Directory'
        shell: bash
        run: |
          set -eu
**.github/workflows/_test_release.yml** (vendored, 10 lines changed)

@@ -1,4 +1,4 @@
-name: test-release
+name: '🧪 Test Release Package'

on:
  workflow_call:
@@ -29,7 +29,7 @@ jobs:
    steps:
      - uses: actions/checkout@v4

-      - name: Set up Python + uv
+      - name: '🐍 Set up Python + UV'
        uses: "./.github/actions/uv_setup"
        with:
          python-version: ${{ env.PYTHON_VERSION }}
@@ -45,17 +45,17 @@ jobs:
      # > It is strongly advised to separate jobs for building [...]
      # > from the publish job.
      # https://github.com/pypa/gh-action-pypi-publish#non-goals
-      - name: Build project for distribution
+      - name: '📦 Build Project for Distribution'
        run: uv build
        working-directory: ${{ inputs.working-directory }}

-      - name: Upload build
+      - name: '⬆️ Upload Build Artifacts'
        uses: actions/upload-artifact@v4
        with:
          name: test-dist
          path: ${{ inputs.working-directory }}/dist/

-      - name: Check Version
+      - name: '🔍 Extract Version Information'
        id: check-version
        shell: python
        working-directory: ${{ inputs.working-directory }}
**.github/workflows/api_doc_build.yml** (vendored, 24 lines changed)

@@ -1,13 +1,15 @@
-name: API Docs Build
+name: '📚 API Documentation Build'
+# Runs daily or can be triggered manually for immediate updates

on:
  workflow_dispatch:
  schedule:
-    - cron: '0 13 * * *'
+    - cron: '0 13 * * *'  # Daily at 1PM UTC
env:
  PYTHON_VERSION: "3.11"

jobs:
+  # Only runs on main repository to prevent unnecessary builds on forks
  build:
    if: github.repository == 'langchain-ai/langchain' || github.event_name != 'schedule'
    runs-on: ubuntu-latest
@@ -23,7 +25,7 @@ jobs:
          path: langchain-api-docs-html
          token: ${{ secrets.TOKEN_GITHUB_API_DOCS_HTML }}

-      - name: Get repos with yq
+      - name: '📋 Extract Repository List with yq'
        id: get-unsorted-repos
        uses: mikefarah/yq@master
        with:
@@ -42,7 +44,7 @@ jobs:
            | .repo
          ' langchain/libs/packages.yml

-      - name: Parse YAML and checkout repos
+      - name: '📋 Parse YAML & Checkout Repositories'
        env:
          REPOS_UNSORTED: ${{ steps.get-unsorted-repos.outputs.result }}
          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
@@ -70,39 +72,39 @@ jobs:
            git clone --depth 1 https://github.com/$repo.git $REPO_NAME
          done

-      - name: Setup Python ${{ env.PYTHON_VERSION }}
+      - name: '🐍 Setup Python ${{ env.PYTHON_VERSION }}'
        uses: actions/setup-python@v5
        id: setup-python
        with:
          python-version: ${{ env.PYTHON_VERSION }}

-      - name: Install initial py deps
+      - name: '📦 Install Initial Python Dependencies'
        working-directory: langchain
        run: |
          python -m pip install -U uv
          python -m uv pip install --upgrade --no-cache-dir pip setuptools pyyaml

-      - name: Move libs
+      - name: '📦 Organize Library Directories'
        run: python langchain/.github/scripts/prep_api_docs_build.py

-      - name: Rm old html
+      - name: '🧹 Remove Old HTML Files'
        run:
          rm -rf langchain-api-docs-html/api_reference_build/html

-      - name: Install dependencies
+      - name: '📦 Install Documentation Dependencies'
        working-directory: langchain
        run: |
          python -m uv pip install $(ls ./libs/partners | xargs -I {} echo "./libs/partners/{}") --overrides ./docs/vercel_overrides.txt
          python -m uv pip install libs/core libs/langchain libs/text-splitters libs/community libs/experimental libs/standard-tests
          python -m uv pip install -r docs/api_reference/requirements.txt

-      - name: Set Git config
+      - name: '🔧 Configure Git Settings'
        working-directory: langchain
        run: |
          git config --local user.email "actions@github.com"
          git config --local user.name "Github Actions"

-      - name: Build docs
+      - name: '📚 Build API Documentation'
        working-directory: langchain
        run: |
          python docs/api_reference/create_api_rst.py
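The yq step above pulls a repository list out of `langchain/libs/packages.yml` (its query ends in `.repo`). The file's exact schema is not shown in the hunks, but for orientation it presumably looks something like the sketch below; everything except the `repo` field is an assumption, and the entries are hypothetical.

```yaml
# Assumed shape of libs/packages.yml; only the `repo` key is confirmed by the
# yq query in the hunk above.
packages:
  - name: langchain-google   # hypothetical entry
    repo: langchain-ai/langchain-google
  - name: langchain-aws      # hypothetical entry
    repo: langchain-ai/langchain-aws
```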
**.github/workflows/check-broken-links.yml** (vendored, 8 lines changed)

@@ -1,4 +1,4 @@
-name: Check Broken Links
+name: '🔗 Check Broken Links'

on:
  workflow_dispatch:
@@ -14,15 +14,15 @@ jobs:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
-      - name: Use Node.js 18.x
+      - name: '🟢 Setup Node.js 18.x'
        uses: actions/setup-node@v4
        with:
          node-version: 18.x
          cache: "yarn"
          cache-dependency-path: ./docs/yarn.lock
-      - name: Install dependencies
+      - name: '📦 Install Node Dependencies'
        run: yarn install --immutable --mode=skip-build
        working-directory: ./docs
-      - name: Check broken links
+      - name: '🔍 Scan Documentation for Broken Links'
        run: yarn check-broken-links
        working-directory: ./docs
**.github/workflows/check_core_versions.yml** (vendored, 6 lines changed)

@@ -1,4 +1,6 @@
-name: Check `core` Version Equality
+name: '🔍 Check `core` Version Equality'
+# Ensures version numbers in pyproject.toml and version.py stay in sync
+# Prevents releases with mismatched version numbers

on:
  pull_request:
@@ -16,7 +18,7 @@ jobs:
    steps:
      - uses: actions/checkout@v4

-      - name: Check version equality
+      - name: '✅ Verify pyproject.toml & version.py Match'
        run: |
          PYPROJECT_VERSION=$(grep -Po '(?<=^version = ")[^"]*' libs/core/pyproject.toml)
          VERSION_PY_VERSION=$(grep -Po '(?<=^VERSION = ")[^"]*' libs/core/langchain_core/version.py)
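The hunk cuts off after the two `grep` extractions. The comparison that presumably follows would look something like this sketch; the error message and exit handling are assumptions, not quoted from the commit.

```yaml
- name: '✅ Verify pyproject.toml & version.py Match'
  run: |
    PYPROJECT_VERSION=$(grep -Po '(?<=^version = ")[^"]*' libs/core/pyproject.toml)
    VERSION_PY_VERSION=$(grep -Po '(?<=^VERSION = ")[^"]*' libs/core/langchain_core/version.py)
    # Assumed comparison: fail the job when the two version strings disagree
    if [ "$PYPROJECT_VERSION" != "$VERSION_PY_VERSION" ]; then
      echo "Version mismatch: pyproject.toml has $PYPROJECT_VERSION, version.py has $VERSION_PY_VERSION"
      exit 1
    fi
```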
**.github/workflows/check_diffs.yml** (vendored, 42 lines changed)

@@ -1,4 +1,4 @@
-name: CI
+name: '🔧 CI'

on:
  push:
@@ -6,6 +6,7 @@ on:
  pull_request:
  merge_group:

+# Optimizes CI performance by canceling redundant workflow runs
# If another push to the same PR or branch happens while this workflow is still running,
# cancel the earlier run in favor of the next run.
#
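The comment block above describes run cancellation, but the `concurrency` stanza itself falls outside the hunk. The standard GitHub Actions pattern it refers to looks like this; the exact group key used in this workflow is an assumption.

```yaml
concurrency:
  # Assumed group key: one active run per workflow and ref, with newer runs
  # canceling older in-progress ones
  group: ${{ github.workflow }}-${{ github.ref }}
  cancel-in-progress: true
```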
@@ -24,16 +25,23 @@ env:
  UV_NO_SYNC: "true"

jobs:
+  # This job analyzes which files changed and creates a dynamic test matrix
+  # to only run tests/lints for the affected packages, improving CI efficiency
  build:
+    name: 'Detect Changes & Set Matrix'
    runs-on: ubuntu-latest
    steps:
-      - uses: actions/checkout@v4
-      - uses: actions/setup-python@v5
+      - name: '📋 Checkout Code'
+        uses: actions/checkout@v4
+      - name: '🐍 Setup Python 3.11'
+        uses: actions/setup-python@v5
        with:
          python-version: '3.11'
-      - id: files
+      - name: '📂 Get Changed Files'
+        id: files
        uses: Ana06/get-changed-files@v2.3.0
-      - id: set-matrix
+      - name: '🔍 Analyze Changed Files & Generate Build Matrix'
+        id: set-matrix
        run: |
          python -m pip install packaging requests
          python .github/scripts/check_diff.py ${{ steps.files.outputs.all }} >> $GITHUB_OUTPUT
@@ -45,8 +53,8 @@ jobs:
      dependencies: ${{ steps.set-matrix.outputs.dependencies }}
      test-doc-imports: ${{ steps.set-matrix.outputs.test-doc-imports }}
      test-pydantic: ${{ steps.set-matrix.outputs.test-pydantic }}
+  # Run linting only on packages that have changed files
  lint:
    name: cd ${{ matrix.job-configs.working-directory }}
    needs: [ build ]
    if: ${{ needs.build.outputs.lint != '[]' }}
    strategy:
@@ -59,8 +67,8 @@ jobs:
      python-version: ${{ matrix.job-configs.python-version }}
    secrets: inherit

+  # Run unit tests only on packages that have changed files
  test:
    name: cd ${{ matrix.job-configs.working-directory }}
    needs: [ build ]
    if: ${{ needs.build.outputs.test != '[]' }}
    strategy:
@@ -73,8 +81,8 @@ jobs:
      python-version: ${{ matrix.job-configs.python-version }}
    secrets: inherit

+  # Test compatibility with different Pydantic versions for affected packages
  test-pydantic:
    name: cd ${{ matrix.job-configs.working-directory }}
    needs: [ build ]
    if: ${{ needs.build.outputs.test-pydantic != '[]' }}
    strategy:
@@ -95,12 +103,12 @@ jobs:
        job-configs: ${{ fromJson(needs.build.outputs.test-doc-imports) }}
      fail-fast: false
    uses: ./.github/workflows/_test_doc_imports.yml
-    secrets: inherit
    with:
      python-version: ${{ matrix.job-configs.python-version }}
+    secrets: inherit

+  # Verify integration tests compile without actually running them (faster feedback)
  compile-integration-tests:
    name: cd ${{ matrix.job-configs.working-directory }}
    needs: [ build ]
    if: ${{ needs.build.outputs.compile-integration-tests != '[]' }}
    strategy:
@@ -113,8 +121,9 @@ jobs:
      python-version: ${{ matrix.job-configs.python-version }}
    secrets: inherit

+  # Run extended test suites that require additional dependencies
  extended-tests:
-    name: "cd ${{ matrix.job-configs.working-directory }} / make extended_tests #${{ matrix.job-configs.python-version }}"
+    name: 'Extended Tests'
    needs: [ build ]
    if: ${{ needs.build.outputs.extended-tests != '[]' }}
    strategy:
@@ -130,12 +139,12 @@ jobs:
    steps:
      - uses: actions/checkout@v4

-      - name: Set up Python ${{ matrix.job-configs.python-version }} + uv
+      - name: '🐍 Set up Python ${{ matrix.job-configs.python-version }} + UV'
        uses: "./.github/actions/uv_setup"
        with:
          python-version: ${{ matrix.job-configs.python-version }}

-      - name: Install dependencies and run extended tests
+      - name: '📦 Install Dependencies & Run Extended Tests'
        shell: bash
        run: |
          echo "Running extended tests, installing dependencies with uv..."
@@ -144,7 +153,7 @@ jobs:
          VIRTUAL_ENV=.venv uv pip install -r extended_testing_deps.txt
          VIRTUAL_ENV=.venv make extended_tests

-      - name: Ensure the tests did not create any additional files
+      - name: '🧹 Verify Clean Working Directory'
        shell: bash
        run: |
          set -eu
@@ -156,8 +165,9 @@ jobs:
          # and `set -e` above will cause the step to fail.
          echo "$STATUS" | grep 'nothing to commit, working tree clean'

+  # Final status check - ensures all required jobs passed before allowing merge
  ci_success:
-    name: "CI Success"
+    name: '✅ CI Success'
    needs: [build, lint, test, compile-integration-tests, extended-tests, test-doc-imports, test-pydantic]
    if: |
      always()
@@ -167,7 +177,7 @@ jobs:
      RESULTS_JSON: ${{ toJSON(needs.*.result) }}
      EXIT_CODE: ${{!contains(needs.*.result, 'failure') && !contains(needs.*.result, 'cancelled') && '0' || '1'}}
    steps:
-      - name: "CI Success"
+      - name: '🎉 All Checks Passed'
        run: |
          echo $JOBS_JSON
          echo $RESULTS_JSON
**.github/workflows/check_new_docs.yml** (vendored, 4 lines changed)

@@ -1,4 +1,4 @@
-name: Integration Docs Lint
+name: '📑 Integration Docs Lint'

on:
  push:
@@ -33,6 +33,6 @@ jobs:
            *.ipynb
            *.md
            *.mdx
-      - name: Check new docs
+      - name: '🔍 Check New Documentation Templates'
        run: |
          python docs/scripts/check_templates.py ${{ steps.files.outputs.added }}
**.github/workflows/codespell.yml** (vendored, deleted, 35 lines)

@@ -1,35 +0,0 @@
-name: CI / cd . / make spell_check
-
-on:
-  push:
-    branches: [master, v0.1, v0.2]
-  pull_request:
-
-permissions:
-  contents: read
-
-jobs:
-  codespell:
-    name: (Check for spelling errors)
-    runs-on: ubuntu-latest
-
-    steps:
-      - name: Checkout
-        uses: actions/checkout@v4
-
-      - name: Install Dependencies
-        run: |
-          pip install toml
-
-      - name: Extract Ignore Words List
-        run: |
-          # Use a Python script to extract the ignore words list from pyproject.toml
-          python .github/workflows/extract_ignored_words_list.py
-        id: extract_ignore_words
-
-      # - name: Codespell
-      #   uses: codespell-project/actions-codespell@v2
-      #   with:
-      #     skip: guide_imports.json,*.ambr,./cookbook/data/imdb_top_1000.csv,*.lock
-      #     ignore_words_list: ${{ steps.extract_ignore_words.outputs.ignore_words_list }}
-      #     exclude_file: ./.github/workflows/codespell-exclude
**.github/workflows/codspeed.yml** (vendored, 10 lines changed)

@@ -1,4 +1,4 @@
-name: CodSpeed
+name: '⚡ CodSpeed'

on:
  push:
@@ -18,7 +18,7 @@ env:

jobs:
  codspeed:
-    name: Run benchmarks
+    name: 'Benchmark'
    runs-on: ubuntu-latest
    strategy:
      matrix:
@@ -38,7 +38,7 @@ jobs:
      - uses: actions/checkout@v4

      # We have to use 3.12 as 3.13 is not yet supported
-      - name: Install uv
+      - name: '📦 Install UV Package Manager'
        uses: astral-sh/setup-uv@v6
        with:
          python-version: "3.12"
@@ -47,11 +47,11 @@ jobs:
        with:
          python-version: "3.12"

-      - name: Install dependencies
+      - name: '📦 Install Test Dependencies'
        run: uv sync --group test
        working-directory: ${{ matrix.working-directory }}

-      - name: Run benchmarks ${{ matrix.working-directory }}
+      - name: '⚡ Run Benchmarks: ${{ matrix.working-directory }}'
        uses: CodSpeedHQ/action@v3
        with:
          token: ${{ secrets.CODSPEED_TOKEN }}
**.github/workflows/people.yml** (vendored, 6 lines changed)

@@ -1,4 +1,4 @@
-name: LangChain People
+name: '👥 LangChain People'

on:
  schedule:
@@ -14,13 +14,13 @@ jobs:
    permissions:
      contents: write
    steps:
-      - name: Dump GitHub context
+      - name: '📋 Dump GitHub Context'
        env:
          GITHUB_CONTEXT: ${{ toJson(github) }}
        run: echo "$GITHUB_CONTEXT"
      - uses: actions/checkout@v4
      # Ref: https://github.com/actions/runner/issues/2033
-      - name: Fix git safe.directory in container
+      - name: '🔧 Fix Git Safe Directory in Container'
        run: mkdir -p /home/runner/work/_temp/_github_home && printf "[safe]\n\tdirectory = /github/workspace" > /home/runner/work/_temp/_github_home/.gitconfig
      - uses: ./.github/actions/people
        with:
**.github/workflows/pr_lint.yml** (vendored, 8 lines changed)

@@ -4,6 +4,7 @@
# Purpose:
# Enforces Conventional Commits format for pull request titles to maintain a
# clear, consistent, and machine-readable change history across our repository.
+# This helps with automated changelog generation and semantic versioning.
#
# Enforced Commit Message Format (Conventional Commits 1.0.0):
#   <type>[optional scope]: <description>
@@ -45,7 +46,7 @@
# • Conventional Commits spec: https://www.conventionalcommits.org/en/v1.0.0/
# -----------------------------------------------------------------------------

-name: PR Title Lint
+name: '🏷️ PR Title Lint'

permissions:
  pull-requests: read
@@ -55,11 +56,12 @@ on:
  types: [opened, edited, synchronize]

jobs:
+  # Validates that PR title follows Conventional Commits specification
  lint-pr-title:
-    name: Validate PR Title
+    name: 'Validate PR Title Format'
    runs-on: ubuntu-latest
    steps:
-      - name: Validate PR Title
+      - name: '✅ Validate Conventional Commits Format'
        uses: amannn/action-semantic-pull-request@v5
        env:
          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
**.github/workflows/run_notebooks.yml** (vendored, 18 lines changed)

@@ -1,4 +1,4 @@
-name: Run Notebooks
+name: '📝 Run Documentation Notebooks'

on:
  workflow_dispatch:
@@ -24,43 +24,43 @@ jobs:
  build:
    runs-on: ubuntu-latest
    if: github.repository == 'langchain-ai/langchain' || github.event_name != 'schedule'
-    name: "Test docs"
+    name: '📑 Test Documentation Notebooks'
    steps:
      - uses: actions/checkout@v4

-      - name: Set up Python + uv
+      - name: '🐍 Set up Python + UV'
        uses: "./.github/actions/uv_setup"
        with:
          python-version: ${{ github.event.inputs.python_version || '3.11' }}

-      - name: 'Authenticate to Google Cloud'
+      - name: '🔐 Authenticate to Google Cloud'
        id: 'auth'
        uses: google-github-actions/auth@v2
        with:
          credentials_json: '${{ secrets.GOOGLE_CREDENTIALS }}'

-      - name: Configure AWS Credentials
+      - name: '🔐 Configure AWS Credentials'
        uses: aws-actions/configure-aws-credentials@v4
        with:
          aws-access-key-id: ${{ secrets.AWS_ACCESS_KEY_ID }}
          aws-secret-access-key: ${{ secrets.AWS_SECRET_ACCESS_KEY }}
          aws-region: ${{ secrets.AWS_REGION }}

-      - name: Install dependencies
+      - name: '📦 Install Dependencies'
        run: |
          uv sync --group dev --group test

-      - name: Pre-download files
+      - name: '📦 Pre-download Test Files'
        run: |
          uv run python docs/scripts/cache_data.py
          curl -s https://raw.githubusercontent.com/lerocha/chinook-database/master/ChinookDatabase/DataSources/Chinook_Sqlite.sql | sqlite3 docs/docs/how_to/Chinook.db
          cp docs/docs/how_to/Chinook.db docs/docs/tutorials/Chinook.db

-      - name: Prepare notebooks
+      - name: '🔧 Prepare Notebooks for CI'
        run: |
          uv run python docs/scripts/prepare_notebooks_for_ci.py --comment-install-cells --working-directory ${{ github.event.inputs.working-directory || 'all' }}

-      - name: Run notebooks
+      - name: '🚀 Execute Notebooks'
        env:
          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
          FIREWORKS_API_KEY: ${{ secrets.FIREWORKS_API_KEY }}
**.github/workflows/scheduled_test.yml** (vendored, 37 lines changed)

@@ -1,7 +1,7 @@
-name: Scheduled Tests
+name: '⏰ Scheduled Integration Tests'

on:
-  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI
+  workflow_dispatch:  # Allows maintainers to trigger the workflow manually in GitHub UI
    inputs:
      working-directory-force:
        type: string
@@ -10,7 +10,7 @@ on:
        type: string
        description: "Python version to use - defaults to 3.9 and 3.11 in matrix - example value: 3.9"
  schedule:
-    - cron: '0 13 * * *'
+    - cron: '0 13 * * *'  # Runs daily at 1PM UTC (9AM EDT/6AM PDT)

permissions:
  contents: read
@@ -22,14 +22,16 @@ env:
  POETRY_LIBS: ("libs/partners/google-vertexai" "libs/partners/google-genai" "libs/partners/aws")

jobs:
+  # Generate dynamic test matrix based on input parameters or defaults
+  # Only runs on the main repo (for scheduled runs) or when manually triggered
  compute-matrix:
    if: github.repository_owner == 'langchain-ai' || github.event_name != 'schedule'
    runs-on: ubuntu-latest
-    name: Compute matrix
+    name: '📋 Compute Test Matrix'
    outputs:
      matrix: ${{ steps.set-matrix.outputs.matrix }}
    steps:
-      - name: Set matrix
+      - name: '🔢 Generate Python & Library Matrix'
        id: set-matrix
        env:
          DEFAULT_LIBS: ${{ env.DEFAULT_LIBS }}
@@ -50,9 +52,11 @@ jobs:
          matrix="{\"python-version\": $python_version, \"working-directory\": $working_directory}"
          echo $matrix
          echo "matrix=$matrix" >> $GITHUB_OUTPUT
+  # Run integration tests against partner libraries with live API credentials
+  # Tests are run with both Poetry and UV depending on the library's setup
  build:
    if: github.repository_owner == 'langchain-ai' || github.event_name != 'schedule'
-    name: Python ${{ matrix.python-version }} - ${{ matrix.working-directory }}
+    name: '🐍 Python ${{ matrix.python-version }}: ${{ matrix.working-directory }}'
    runs-on: ubuntu-latest
    needs: [compute-matrix]
    timeout-minutes: 20
@@ -75,7 +79,7 @@ jobs:
          repository: langchain-ai/langchain-aws
          path: langchain-aws

-      - name: Move libs
+      - name: '📦 Organize External Libraries'
        run: |
          rm -rf \
            langchain/libs/partners/google-genai \
@@ -84,7 +88,7 @@ jobs:
          mv langchain-google/libs/vertexai langchain/libs/partners/google-vertexai
          mv langchain-aws/libs/aws langchain/libs/partners/aws

-      - name: Set up Python ${{ matrix.python-version }} with poetry
+      - name: '🐍 Set up Python ${{ matrix.python-version }} + Poetry'
        if: contains(env.POETRY_LIBS, matrix.working-directory)
        uses: "./langchain/.github/actions/poetry_setup"
        with:
@@ -93,40 +97,40 @@ jobs:
          working-directory: langchain/${{ matrix.working-directory }}
          cache-key: scheduled

-      - name: Set up Python ${{ matrix.python-version }} + uv
+      - name: '🐍 Set up Python ${{ matrix.python-version }} + UV'
        if: "!contains(env.POETRY_LIBS, matrix.working-directory)"
        uses: "./langchain/.github/actions/uv_setup"
        with:
          python-version: ${{ matrix.python-version }}

-      - name: 'Authenticate to Google Cloud'
+      - name: '🔐 Authenticate to Google Cloud'
        id: 'auth'
        uses: google-github-actions/auth@v2
        with:
          credentials_json: '${{ secrets.GOOGLE_CREDENTIALS }}'

-      - name: Configure AWS Credentials
+      - name: '🔐 Configure AWS Credentials'
        uses: aws-actions/configure-aws-credentials@v4
        with:
          aws-access-key-id: ${{ secrets.AWS_ACCESS_KEY_ID }}
          aws-secret-access-key: ${{ secrets.AWS_SECRET_ACCESS_KEY }}
          aws-region: ${{ secrets.AWS_REGION }}

-      - name: Install dependencies (poetry)
+      - name: '📦 Install Dependencies (Poetry)'
        if: contains(env.POETRY_LIBS, matrix.working-directory)
        run: |
          echo "Running scheduled tests, installing dependencies with poetry..."
          cd langchain/${{ matrix.working-directory }}
          poetry install --with=test_integration,test

-      - name: Install dependencies (uv)
+      - name: '📦 Install Dependencies (UV)'
        if: "!contains(env.POETRY_LIBS, matrix.working-directory)"
        run: |
          echo "Running scheduled tests, installing dependencies with uv..."
          cd langchain/${{ matrix.working-directory }}
          uv sync --group test --group test_integration

-      - name: Run integration tests
+      - name: '🚀 Run Integration Tests'
        env:
          OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
@@ -155,14 +159,15 @@ jobs:
          cd langchain/${{ matrix.working-directory }}
          make integration_tests

-      - name: Remove external libraries
+      - name: '🧹 Clean up External Libraries'
        # Clean up external libraries to avoid affecting git status check
        run: |
          rm -rf \
            langchain/libs/partners/google-genai \
            langchain/libs/partners/google-vertexai \
            langchain/libs/partners/aws

-      - name: Ensure tests did not create additional files
+      - name: '🧹 Verify Clean Working Directory'
        working-directory: langchain
        run: |
          set -eu
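The compute-matrix job above emits a JSON object with `python-version` and `working-directory` arrays and exposes it as a job output. The hunks show the `needs: [compute-matrix]` wiring on the build job but not its `strategy` block, so the sketch below fills that in as an assumption about how the output is consumed.

```yaml
# Sketch only: the strategy block is assumed; needs/outputs come from the hunks above.
build:
  needs: [compute-matrix]
  runs-on: ubuntu-latest
  strategy:
    fail-fast: false  # assumed setting
    # Expands the JSON emitted by the '🔢 Generate Python & Library Matrix' step
    matrix: ${{ fromJSON(needs.compute-matrix.outputs.matrix) }}
```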
**.gitignore** (vendored, 1 line changed)

@@ -1,5 +1,4 @@
.vs/
-.vscode/
.idea/
# Byte-compiled / optimized / DLL files
__pycache__/
**.markdownlint.json** (new file, 14 lines)

@@ -0,0 +1,14 @@
+{
+  "MD013": false,
+  "MD024": {
+    "siblings_only": true
+  },
+  "MD025": false,
+  "MD033": false,
+  "MD034": false,
+  "MD036": false,
+  "MD041": false,
+  "MD046": {
+    "style": "fenced"
+  }
+}
@ -1,111 +1,111 @@
|
||||
repos:
|
||||
- repo: local
|
||||
hooks:
|
||||
- id: core
|
||||
name: format core
|
||||
language: system
|
||||
entry: make -C libs/core format
|
||||
files: ^libs/core/
|
||||
pass_filenames: false
|
||||
- id: langchain
|
||||
name: format langchain
|
||||
language: system
|
||||
entry: make -C libs/langchain format
|
||||
files: ^libs/langchain/
|
||||
pass_filenames: false
|
||||
- id: standard-tests
|
||||
name: format standard-tests
|
||||
language: system
|
||||
entry: make -C libs/standard-tests format
|
||||
files: ^libs/standard-tests/
|
||||
pass_filenames: false
|
||||
- id: text-splitters
|
||||
name: format text-splitters
|
||||
language: system
|
||||
entry: make -C libs/text-splitters format
|
||||
files: ^libs/text-splitters/
|
||||
pass_filenames: false
|
||||
- id: anthropic
|
||||
name: format partners/anthropic
|
||||
language: system
|
||||
entry: make -C libs/partners/anthropic format
|
||||
files: ^libs/partners/anthropic/
|
||||
pass_filenames: false
|
||||
- id: chroma
|
||||
name: format partners/chroma
|
||||
language: system
|
||||
entry: make -C libs/partners/chroma format
|
||||
files: ^libs/partners/chroma/
|
||||
pass_filenames: false
|
||||
- id: couchbase
|
||||
name: format partners/couchbase
|
||||
language: system
|
||||
entry: make -C libs/partners/couchbase format
|
||||
files: ^libs/partners/couchbase/
|
||||
pass_filenames: false
|
||||
- id: exa
|
||||
name: format partners/exa
|
||||
language: system
|
||||
entry: make -C libs/partners/exa format
|
||||
files: ^libs/partners/exa/
|
||||
pass_filenames: false
|
||||
- id: fireworks
|
||||
name: format partners/fireworks
|
||||
language: system
|
||||
entry: make -C libs/partners/fireworks format
|
||||
files: ^libs/partners/fireworks/
|
||||
pass_filenames: false
|
||||
- id: groq
|
||||
name: format partners/groq
|
||||
language: system
|
||||
entry: make -C libs/partners/groq format
|
||||
files: ^libs/partners/groq/
|
||||
pass_filenames: false
|
||||
- id: huggingface
|
||||
name: format partners/huggingface
|
||||
language: system
|
||||
entry: make -C libs/partners/huggingface format
|
||||
files: ^libs/partners/huggingface/
|
||||
pass_filenames: false
|
||||
- id: mistralai
|
||||
name: format partners/mistralai
|
||||
language: system
|
||||
entry: make -C libs/partners/mistralai format
|
||||
files: ^libs/partners/mistralai/
|
||||
pass_filenames: false
|
||||
- id: nomic
|
||||
name: format partners/nomic
|
||||
language: system
|
||||
entry: make -C libs/partners/nomic format
|
||||
files: ^libs/partners/nomic/
|
||||
pass_filenames: false
|
||||
- id: ollama
|
||||
name: format partners/ollama
|
||||
language: system
|
||||
entry: make -C libs/partners/ollama format
|
||||
files: ^libs/partners/ollama/
|
||||
pass_filenames: false
|
||||
- id: openai
|
||||
name: format partners/openai
|
||||
language: system
|
||||
entry: make -C libs/partners/openai format
|
||||
files: ^libs/partners/openai/
|
||||
pass_filenames: false
|
||||
- id: prompty
|
||||
name: format partners/prompty
|
||||
language: system
|
||||
entry: make -C libs/partners/prompty format
|
||||
files: ^libs/partners/prompty/
|
||||
pass_filenames: false
|
||||
- id: qdrant
|
||||
name: format partners/qdrant
|
||||
language: system
|
||||
entry: make -C libs/partners/qdrant format
|
||||
files: ^libs/partners/qdrant/
|
||||
pass_filenames: false
|
||||
- id: root
|
||||
name: format docs, cookbook
|
||||
language: system
|
||||
entry: make format
|
||||
files: ^(docs|cookbook)/
|
||||
pass_filenames: false
|
||||
- repo: local
|
||||
hooks:
|
||||
- id: core
|
||||
name: format core
|
||||
language: system
|
||||
entry: make -C libs/core format
|
||||
files: ^libs/core/
|
||||
pass_filenames: false
|
||||
- id: langchain
|
||||
name: format langchain
|
||||
language: system
|
||||
entry: make -C libs/langchain format
|
||||
files: ^libs/langchain/
|
||||
pass_filenames: false
|
||||
- id: standard-tests
|
||||
name: format standard-tests
|
||||
language: system
|
||||
entry: make -C libs/standard-tests format
|
||||
files: ^libs/standard-tests/
|
||||
pass_filenames: false
|
||||
- id: text-splitters
|
||||
name: format text-splitters
|
||||
language: system
|
||||
entry: make -C libs/text-splitters format
|
||||
files: ^libs/text-splitters/
|
||||
pass_filenames: false
|
||||
- id: anthropic
|
||||
name: format partners/anthropic
|
||||
language: system
|
||||
entry: make -C libs/partners/anthropic format
|
||||
files: ^libs/partners/anthropic/
|
||||
pass_filenames: false
|
||||
- id: chroma
|
||||
name: format partners/chroma
|
||||
language: system
|
||||
entry: make -C libs/partners/chroma format
|
||||
files: ^libs/partners/chroma/
|
||||
pass_filenames: false
|
||||
- id: couchbase
|
||||
name: format partners/couchbase
|
||||
language: system
|
||||
entry: make -C libs/partners/couchbase format
|
||||
files: ^libs/partners/couchbase/
|
||||
pass_filenames: false
|
||||
- id: exa
|
||||
name: format partners/exa
|
||||
language: system
|
||||
entry: make -C libs/partners/exa format
|
||||
files: ^libs/partners/exa/
|
||||
pass_filenames: false
|
||||
- id: fireworks
|
||||
name: format partners/fireworks
|
||||
language: system
|
||||
entry: make -C libs/partners/fireworks format
|
||||
files: ^libs/partners/fireworks/
|
||||
pass_filenames: false
|
||||
- id: groq
|
||||
name: format partners/groq
|
||||
language: system
|
||||
entry: make -C libs/partners/groq format
|
||||
files: ^libs/partners/groq/
|
||||
pass_filenames: false
|
||||
- id: huggingface
|
||||
name: format partners/huggingface
|
||||
language: system
|
||||
entry: make -C libs/partners/huggingface format
|
||||
files: ^libs/partners/huggingface/
|
||||
pass_filenames: false
|
||||
- id: mistralai
|
||||
name: format partners/mistralai
|
||||
language: system
|
||||
entry: make -C libs/partners/mistralai format
|
||||
files: ^libs/partners/mistralai/
|
||||
pass_filenames: false
|
||||
- id: nomic
|
||||
name: format partners/nomic
|
||||
language: system
|
||||
entry: make -C libs/partners/nomic format
|
||||
files: ^libs/partners/nomic/
|
||||
pass_filenames: false
|
||||
- id: ollama
|
||||
name: format partners/ollama
|
||||
language: system
|
||||
entry: make -C libs/partners/ollama format
|
||||
files: ^libs/partners/ollama/
|
||||
pass_filenames: false
|
||||
- id: openai
|
||||
name: format partners/openai
|
||||
language: system
|
||||
entry: make -C libs/partners/openai format
|
||||
files: ^libs/partners/openai/
|
||||
pass_filenames: false
|
||||
- id: prompty
|
||||
name: format partners/prompty
|
||||
language: system
|
||||
entry: make -C libs/partners/prompty format
|
||||
files: ^libs/partners/prompty/
|
||||
pass_filenames: false
|
||||
- id: qdrant
|
||||
name: format partners/qdrant
|
||||
language: system
|
||||
entry: make -C libs/partners/qdrant format
|
||||
files: ^libs/partners/qdrant/
|
||||
pass_filenames: false
|
||||
- id: root
|
||||
name: format docs, cookbook
|
||||
language: system
|
||||
entry: make format
|
||||
files: ^(docs|cookbook)/
|
||||
pass_filenames: false
|
||||
|
**.readthedocs.yaml** (whitespace-only changes; the replaced lines read identically in this extraction)

@@ -13,7 +13,7 @@ build:

# Build documentation in the docs/ directory with Sphinx
sphinx:
-  configuration: docs/api_reference/conf.py
+  configuration: docs/api_reference/conf.py

# If using Sphinx, optionally build your docs in additional formats such as PDF
formats:
@@ -21,5 +21,5 @@ formats:

# Optionally declare the Python requirements required to build your docs
python:
-  install:
-    - requirements: docs/api_reference/requirements.txt
+  install:
+    - requirements: docs/api_reference/requirements.txt
**.vscode/extensions.json** (vendored, new file, 21 lines)

@@ -0,0 +1,21 @@
+{
+  "recommendations": [
+    "ms-python.python",
+    "charliermarsh.ruff",
+    "ms-python.mypy-type-checker",
+    "ms-toolsai.jupyter",
+    "ms-toolsai.jupyter-keymap",
+    "ms-toolsai.jupyter-renderers",
+    "ms-toolsai.vscode-jupyter-cell-tags",
+    "ms-toolsai.vscode-jupyter-slideshow",
+    "yzhang.markdown-all-in-one",
+    "davidanson.vscode-markdownlint",
+    "bierner.markdown-mermaid",
+    "bierner.markdown-preview-github-styles",
+    "eamodio.gitlens",
+    "github.vscode-pull-request-github",
+    "github.vscode-github-actions",
+    "redhat.vscode-yaml",
+    "editorconfig.editorconfig",
+  ],
+}
**.vscode/settings.json** (vendored, new file, 80 lines)

@@ -0,0 +1,80 @@
+{
+  "python.analysis.include": [
+    "libs/**",
+    "docs/**",
+    "cookbook/**"
+  ],
+  "python.analysis.exclude": [
+    "**/node_modules",
+    "**/__pycache__",
+    "**/.pytest_cache",
+    "**/.*",
+    "_dist/**",
+    "docs/_build/**",
+    "docs/api_reference/_build/**"
+  ],
+  "python.analysis.autoImportCompletions": true,
+  "python.analysis.typeCheckingMode": "basic",
+  "python.testing.cwd": "${workspaceFolder}",
+  "python.linting.enabled": true,
+  "python.linting.ruffEnabled": true,
+  "[python]": {
+    "editor.formatOnSave": true,
+    "editor.codeActionsOnSave": {
+      "source.organizeImports": "explicit",
+      "source.fixAll": "explicit"
+    },
+    "editor.defaultFormatter": "charliermarsh.ruff"
+  },
+  "editor.rulers": [
+    88
+  ],
+  "editor.tabSize": 4,
+  "editor.insertSpaces": true,
+  "editor.trimAutoWhitespace": true,
+  "files.trimTrailingWhitespace": true,
+  "files.insertFinalNewline": true,
+  "files.exclude": {
+    "**/__pycache__": true,
+    "**/.pytest_cache": true,
+    "**/*.pyc": true,
+    "**/.mypy_cache": true,
+    "**/.ruff_cache": true,
+    "_dist/**": true,
+    "docs/_build/**": true,
+    "docs/api_reference/_build/**": true,
+    "**/node_modules": true,
+    "**/.git": false
+  },
+  "search.exclude": {
+    "**/__pycache__": true,
+    "**/*.pyc": true,
+    "_dist/**": true,
+    "docs/_build/**": true,
+    "docs/api_reference/_build/**": true,
+    "**/node_modules": true,
+    "**/.git": true,
+    "uv.lock": true,
+    "yarn.lock": true
+  },
+  "git.autofetch": true,
+  "git.enableSmartCommit": true,
+  "jupyter.askForKernelRestart": false,
+  "jupyter.interactiveWindow.textEditor.executeSelection": true,
+  "[markdown]": {
+    "editor.wordWrap": "on",
+    "editor.quickSuggestions": {
+      "comments": "off",
+      "strings": "off",
+      "other": "off"
+    }
+  },
+  "[yaml]": {
+    "editor.tabSize": 2,
+    "editor.insertSpaces": true
+  },
+  "[json]": {
+    "editor.tabSize": 2,
+    "editor.insertSpaces": true
+  },
+}
@@ -7,5 +7,5 @@ Please see the following guides for migrating LangChain code:

* Migrating from [LangChain 0.0.x Chains](https://python.langchain.com/docs/versions/migrating_chains/)
* Upgrade to [LangGraph Memory](https://python.langchain.com/docs/versions/migrating_memory/)

The [LangChain CLI](https://python.langchain.com/docs/versions/v0_3/#migrate-using-langchain-cli) can help you automatically upgrade your code to use non-deprecated imports.
This will be especially helpful if you're still on either version 0.0.x or 0.1.x of LangChain.
Makefile (1 changed line)
@@ -41,6 +41,7 @@ docs_linkcheck:

## api_docs_build: Build the API Reference documentation.
api_docs_build: clean
	@echo "📖 Building API Reference documentation..."
	uv pip install -e libs/cli
	uv run --group docs python docs/api_reference/create_api_rst.py
	cd docs/api_reference && uv run --group docs make html
	uv run --group docs python docs/api_reference/scripts/custom_formatter.py docs/api_reference/_build/html/
SECURITY.md (35 changed lines)
@@ -11,6 +11,7 @@ When building such applications developers should remember to follow good security practices:

* [**Defense in Depth**](https://en.wikipedia.org/wiki/Defense_in_depth_(computing)): No security technique is perfect. Fine-tuning and good chain design can reduce, but not eliminate, the odds that a Large Language Model (LLM) may make a mistake. It's best to combine multiple layered security approaches rather than relying on any single layer of defense to ensure security. For example: use both read-only permissions and sandboxing to ensure that LLMs are only able to access data that is explicitly meant for them to use.

Risks of not doing so include, but are not limited to:

* Data corruption or loss.
* Unauthorized access to confidential information.
* Compromised performance or availability of critical resources.
@@ -27,10 +28,10 @@ design and secure your applications.

## Reporting OSS Vulnerabilities

LangChain is partnered with [huntr by Protect AI](https://huntr.com/) to provide
a bounty program for our open source projects.

Please report security vulnerabilities associated with the LangChain
open source projects [here](https://huntr.com/bounties/disclose/?target=https%3A%2F%2Fgithub.com%2Flangchain-ai%2Flangchain&validSearch=true).

Before reporting a vulnerability, please review:
@@ -45,39 +46,39 @@ Before reporting a vulnerability, please review:

The following packages and repositories are eligible for bug bounties:

* langchain-core
* langchain (see exceptions)
* langchain-community (see exceptions)
* langgraph
* langserve

### Out of Scope Targets

All out of scope targets defined by huntr as well as:

* **langchain-experimental**: This repository is for experimental code and is not
  eligible for bug bounties (see [package warning](https://pypi.org/project/langchain-experimental/)); bug reports to it will be marked as interesting or a waste of
  time and published with no bounty attached.
* **tools**: Tools in either langchain or langchain-community are not eligible for bug
  bounties. This includes the following directories:
  * libs/langchain/langchain/tools
  * libs/community/langchain_community/tools
  * Please review the [Best Practices](#best-practices)
    for more details, but generally tools interact with the real world. Developers are
    expected to understand the security implications of their code and are responsible
    for the security of their tools.
* Code documented with security notices. This will be decided on a case by
  case basis, but likely will not be eligible for a bounty as the code is already
  documented with guidelines for developers that should be followed for making their
  application secure.
* Any LangSmith related repositories or APIs (see [Reporting LangSmith Vulnerabilities](#reporting-langsmith-vulnerabilities)).

## Reporting LangSmith Vulnerabilities

Please report security vulnerabilities associated with LangSmith by email to `security@langchain.dev`.

* LangSmith site: [https://smith.langchain.com](https://smith.langchain.com)
* SDK client: [https://github.com/langchain-ai/langsmith-sdk](https://github.com/langchain-ai/langsmith-sdk)

### Other Security Concerns
@@ -34,7 +34,7 @@

"tools = [multiply, exponentiate, add]\n",
"\n",
"gpt35 = ChatOpenAI(model=\"gpt-3.5-turbo-0125\", temperature=0).bind_tools(tools)\n",
"claude3 = ChatAnthropic(model=\"claude-3-sonnet-20240229\").bind_tools(tools)\n",
"claude3 = ChatAnthropic(model=\"claude-3-7-sonnet-20250219\").bind_tools(tools)\n",
"llm_with_tools = gpt35.configurable_alternatives(\n",
" ConfigurableField(id=\"llm\"), default_key=\"gpt35\", claude3=claude3\n",
")"
@@ -113,14 +113,15 @@

{
"data": {
"text/plain": [
"{'messages': [HumanMessage(content=\"what's 3 plus 5 raised to the 2.743. also what's 17.24 - 918.1241\"),\n",
" AIMessage(content='', additional_kwargs={'tool_calls': [{'id': 'call_6yMU2WsS4Bqgi1WxFHxtfJRc', 'function': {'arguments': '{\"x\": 8, \"y\": 2.743}', 'name': 'exponentiate'}, 'type': 'function'}, {'id': 'call_GAL3dQiKFF9XEV0RrRLPTvVp', 'function': {'arguments': '{\"x\": 17.24, \"y\": -918.1241}', 'name': 'add'}, 'type': 'function'}]}, response_metadata={'token_usage': {'completion_tokens': 58, 'prompt_tokens': 168, 'total_tokens': 226}, 'model_name': 'gpt-3.5-turbo-0125', 'system_fingerprint': 'fp_b28b39ffa8', 'finish_reason': 'tool_calls', 'logprobs': None}, id='run-528302fc-7acf-4c11-82c4-119ccf40c573-0', tool_calls=[{'name': 'exponentiate', 'args': {'x': 8, 'y': 2.743}, 'id': 'call_6yMU2WsS4Bqgi1WxFHxtfJRc'}, {'name': 'add', 'args': {'x': 17.24, 'y': -918.1241}, 'id': 'call_GAL3dQiKFF9XEV0RrRLPTvVp'}]),\n",
" ToolMessage(content='300.03770462067547', tool_call_id='call_6yMU2WsS4Bqgi1WxFHxtfJRc'),\n",
" ToolMessage(content='-900.8841', tool_call_id='call_GAL3dQiKFF9XEV0RrRLPTvVp'),\n",
" AIMessage(content='The result of \\\\(3 + 5^{2.743}\\\\) is approximately 300.04, and the result of \\\\(17.24 - 918.1241\\\\) is approximately -900.88.', response_metadata={'token_usage': {'completion_tokens': 44, 'prompt_tokens': 251, 'total_tokens': 295}, 'model_name': 'gpt-3.5-turbo-0125', 'system_fingerprint': 'fp_b28b39ffa8', 'finish_reason': 'stop', 'logprobs': None}, id='run-d1161669-ed09-4b18-94bd-6d8530df5aa8-0')]}"
"{'messages': [HumanMessage(content=\"what's 3 plus 5 raised to the 2.743. also what's 17.24 - 918.1241\", additional_kwargs={}, response_metadata={}),\n",
" AIMessage(content='', additional_kwargs={'tool_calls': [{'id': 'call_xuNXwm2P6U2Pp2pAbC1sdIBz', 'function': {'arguments': '{\"x\": 3, \"y\": 5}', 'name': 'add'}, 'type': 'function'}, {'id': 'call_0pImUJUDlYa5zfBcxxuvWyYS', 'function': {'arguments': '{\"x\": 8, \"y\": 2.743}', 'name': 'exponentiate'}, 'type': 'function'}, {'id': 'call_yaownQ9TZK0dkqD1KSFyax4H', 'function': {'arguments': '{\"x\": 17.24, \"y\": -918.1241}', 'name': 'add'}, 'type': 'function'}], 'refusal': None}, response_metadata={'token_usage': {'completion_tokens': 75, 'prompt_tokens': 131, 'total_tokens': 206, 'completion_tokens_details': {'accepted_prediction_tokens': 0, 'audio_tokens': 0, 'reasoning_tokens': 0, 'rejected_prediction_tokens': 0}, 'prompt_tokens_details': {'audio_tokens': 0, 'cached_tokens': 0}}, 'model_name': 'gpt-3.5-turbo-0125', 'system_fingerprint': None, 'id': 'chatcmpl-ByJm2qxSWU3oTTSZQv64J4XQKZhA6', 'service_tier': 'default', 'finish_reason': 'tool_calls', 'logprobs': None}, id='run--35fad027-47f7-44d3-aa8b-99f4fc24098c-0', tool_calls=[{'name': 'add', 'args': {'x': 3, 'y': 5}, 'id': 'call_xuNXwm2P6U2Pp2pAbC1sdIBz', 'type': 'tool_call'}, {'name': 'exponentiate', 'args': {'x': 8, 'y': 2.743}, 'id': 'call_0pImUJUDlYa5zfBcxxuvWyYS', 'type': 'tool_call'}, {'name': 'add', 'args': {'x': 17.24, 'y': -918.1241}, 'id': 'call_yaownQ9TZK0dkqD1KSFyax4H', 'type': 'tool_call'}], usage_metadata={'input_tokens': 131, 'output_tokens': 75, 'total_tokens': 206, 'input_token_details': {'audio': 0, 'cache_read': 0}, 'output_token_details': {'audio': 0, 'reasoning': 0}}),\n",
" ToolMessage(content='8.0', tool_call_id='call_xuNXwm2P6U2Pp2pAbC1sdIBz'),\n",
" ToolMessage(content='300.03770462067547', tool_call_id='call_0pImUJUDlYa5zfBcxxuvWyYS'),\n",
" ToolMessage(content='-900.8841', tool_call_id='call_yaownQ9TZK0dkqD1KSFyax4H'),\n",
" AIMessage(content='The results are:\\n1. 3 plus 5 is 8.\\n2. 5 raised to the power of 2.743 is approximately 300.04.\\n3. 17.24 minus 918.1241 is approximately -900.88.', additional_kwargs={'refusal': None}, response_metadata={'token_usage': {'completion_tokens': 55, 'prompt_tokens': 236, 'total_tokens': 291, 'completion_tokens_details': {'accepted_prediction_tokens': 0, 'audio_tokens': 0, 'reasoning_tokens': 0, 'rejected_prediction_tokens': 0}, 'prompt_tokens_details': {'audio_tokens': 0, 'cached_tokens': 0}}, 'model_name': 'gpt-3.5-turbo-0125', 'system_fingerprint': None, 'id': 'chatcmpl-ByJm345MYnpowGS90iAZAlSs7haed', 'service_tier': 'default', 'finish_reason': 'stop', 'logprobs': None}, id='run--5fa66d47-d80e-45d0-9c32-31348c735d72-0', usage_metadata={'input_tokens': 236, 'output_tokens': 55, 'total_tokens': 291, 'input_token_details': {'audio': 0, 'cache_read': 0}, 'output_token_details': {'audio': 0, 'reasoning': 0}})]}"
]
},
"execution_count": 4,
"execution_count": 3,
"metadata": {},
"output_type": "execute_result"
}
@@ -146,17 +147,17 @@

{
"data": {
"text/plain": [
"{'messages': [HumanMessage(content=\"what's 3 plus 5 raised to the 2.743. also what's 17.24 - 918.1241\"),\n",
" AIMessage(content=[{'text': \"Okay, let's break this down into two parts:\", 'type': 'text'}, {'id': 'toolu_01DEhqcXkXTtzJAiZ7uMBeDC', 'input': {'x': 3, 'y': 5}, 'name': 'add', 'type': 'tool_use'}], response_metadata={'id': 'msg_01AkLGH8sxMHaH15yewmjwkF', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'tool_use', 'stop_sequence': None, 'usage': {'input_tokens': 450, 'output_tokens': 81}}, id='run-f35bfae8-8ded-4f8a-831b-0940d6ad16b6-0', tool_calls=[{'name': 'add', 'args': {'x': 3, 'y': 5}, 'id': 'toolu_01DEhqcXkXTtzJAiZ7uMBeDC'}]),\n",
" ToolMessage(content='8.0', tool_call_id='toolu_01DEhqcXkXTtzJAiZ7uMBeDC'),\n",
" AIMessage(content=[{'id': 'toolu_013DyMLrvnrto33peAKMGMr1', 'input': {'x': 8.0, 'y': 2.743}, 'name': 'exponentiate', 'type': 'tool_use'}], response_metadata={'id': 'msg_015Fmp8aztwYcce2JDAFfce3', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'tool_use', 'stop_sequence': None, 'usage': {'input_tokens': 545, 'output_tokens': 75}}, id='run-48aaeeeb-a1e5-48fd-a57a-6c3da2907b47-0', tool_calls=[{'name': 'exponentiate', 'args': {'x': 8.0, 'y': 2.743}, 'id': 'toolu_013DyMLrvnrto33peAKMGMr1'}]),\n",
" ToolMessage(content='300.03770462067547', tool_call_id='toolu_013DyMLrvnrto33peAKMGMr1'),\n",
" AIMessage(content=[{'text': 'So 3 plus 5 raised to the 2.743 power is 300.04.\\n\\nFor the second part:', 'type': 'text'}, {'id': 'toolu_01UTmMrGTmLpPrPCF1rShN46', 'input': {'x': 17.24, 'y': -918.1241}, 'name': 'add', 'type': 'tool_use'}], response_metadata={'id': 'msg_015TkhfRBENPib2RWAxkieH6', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'tool_use', 'stop_sequence': None, 'usage': {'input_tokens': 638, 'output_tokens': 105}}, id='run-45fb62e3-d102-4159-881d-241c5dbadeed-0', tool_calls=[{'name': 'add', 'args': {'x': 17.24, 'y': -918.1241}, 'id': 'toolu_01UTmMrGTmLpPrPCF1rShN46'}]),\n",
" ToolMessage(content='-900.8841', tool_call_id='toolu_01UTmMrGTmLpPrPCF1rShN46'),\n",
" AIMessage(content='Therefore, 17.24 - 918.1241 = -900.8841', response_metadata={'id': 'msg_01LgKnRuUcSyADCpxv9tPoYD', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 759, 'output_tokens': 24}}, id='run-1008254e-ccd1-497c-8312-9550dd77bd08-0')]}"
"{'messages': [HumanMessage(content=\"what's 3 plus 5 raised to the 2.743. also what's 17.24 - 918.1241\", additional_kwargs={}, response_metadata={}),\n",
" AIMessage(content=[{'text': \"I'll solve these calculations for you.\\n\\nFor the first part, I need to calculate 3 plus 5 raised to the power of 2.743.\\n\\nLet me break this down:\\n1) First, I'll calculate 5 raised to the power of 2.743\\n2) Then add 3 to the result\", 'type': 'text'}, {'id': 'toolu_01L1mXysBQtpPLQ2AZTaCGmE', 'input': {'x': 5, 'y': 2.743}, 'name': 'exponentiate', 'type': 'tool_use'}], additional_kwargs={}, response_metadata={'id': 'msg_01HCbDmuzdg9ATMyKbnecbEE', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'tool_use', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 563, 'output_tokens': 146, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'}, id='run--9f6469fb-bcbb-4c1c-9eec-79f6979c38e6-0', tool_calls=[{'name': 'exponentiate', 'args': {'x': 5, 'y': 2.743}, 'id': 'toolu_01L1mXysBQtpPLQ2AZTaCGmE', 'type': 'tool_call'}], usage_metadata={'input_tokens': 563, 'output_tokens': 146, 'total_tokens': 709, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}}),\n",
" ToolMessage(content='82.65606421491815', tool_call_id='toolu_01L1mXysBQtpPLQ2AZTaCGmE'),\n",
" AIMessage(content=[{'text': \"Now I'll add 3 to this result:\", 'type': 'text'}, {'id': 'toolu_01NARC83e9obV35mZ6jYzBiN', 'input': {'x': 3, 'y': 82.65606421491815}, 'name': 'add', 'type': 'tool_use'}], additional_kwargs={}, response_metadata={'id': 'msg_01ELwyCtVLeGC685PUFqmdz2', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'tool_use', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 727, 'output_tokens': 87, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'}, id='run--d5af3d7c-e8b7-4cc2-997a-ad2dafd08751-0', tool_calls=[{'name': 'add', 'args': {'x': 3, 'y': 82.65606421491815}, 'id': 'toolu_01NARC83e9obV35mZ6jYzBiN', 'type': 'tool_call'}], usage_metadata={'input_tokens': 727, 'output_tokens': 87, 'total_tokens': 814, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}}),\n",
" ToolMessage(content='85.65606421491815', tool_call_id='toolu_01NARC83e9obV35mZ6jYzBiN'),\n",
" AIMessage(content=[{'text': \"For the second part, you asked for 17.24 - 918.1241. I don't have a subtraction function available, but I can rewrite this as adding a negative number: 17.24 + (-918.1241)\", 'type': 'text'}, {'id': 'toolu_01Q6fLcZkBWZpMPCZ55WXR3N', 'input': {'x': 17.24, 'y': -918.1241}, 'name': 'add', 'type': 'tool_use'}], additional_kwargs={}, response_metadata={'id': 'msg_01WkmDwUxWjjaKGnTtdLGJnN', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'tool_use', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 832, 'output_tokens': 130, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'}, id='run--39a6fbda-4c81-47a6-b361-524bd4ee5823-0', tool_calls=[{'name': 'add', 'args': {'x': 17.24, 'y': -918.1241}, 'id': 'toolu_01Q6fLcZkBWZpMPCZ55WXR3N', 'type': 'tool_call'}], usage_metadata={'input_tokens': 832, 'output_tokens': 130, 'total_tokens': 962, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}}),\n",
" ToolMessage(content='-900.8841', tool_call_id='toolu_01Q6fLcZkBWZpMPCZ55WXR3N'),\n",
" AIMessage(content='So, the answers are:\\n1) 3 plus 5 raised to the 2.743 = 85.65606421491815\\n2) 17.24 - 918.1241 = -900.8841', additional_kwargs={}, response_metadata={'id': 'msg_015Yoc62CvdJbANGFouiQ6AQ', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 978, 'output_tokens': 58, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'}, id='run--174c0882-6180-47ea-8f63-d7b747302327-0', usage_metadata={'input_tokens': 978, 'output_tokens': 58, 'total_tokens': 1036, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}})]}"
]
},
"execution_count": 5,
"execution_count": 4,
"metadata": {},
"output_type": "execute_result"
}
@@ -177,7 +178,7 @@

],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"display_name": "langchain",
"language": "python",
"name": "python3"
},
@@ -191,7 +192,7 @@

"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.4"
"version": "3.10.16"
}
},
"nbformat": 4,
@@ -202,6 +202,12 @@ def _load_package_modules(
        if file_path.name.startswith("_"):
            continue

        if "integration_template" in file_path.parts:
            continue

        if "project_template" in file_path.parts:
            continue

        relative_module_name = file_path.relative_to(package_path)

        # Skip if any module part starts with an underscore
@@ -495,16 +501,7 @@ def _package_namespace(package_name: str) -> str:

def _package_dir(package_name: str = "langchain") -> Path:
    """Return the path to the directory containing the documentation."""
    if package_name in (
        "langchain",
        "langchain_v1",
        "experimental",
        "community",
        "core",
        "cli",
        "text-splitters",
        "standard-tests",
    ):
    if (ROOT_DIR / "libs" / package_name).exists():
        return ROOT_DIR / "libs" / package_name / _package_namespace(package_name)
    else:
        return (
@@ -666,18 +663,12 @@ def main(dirs: Optional[list] = None) -> None:
    print("Starting to build API reference files.")
    if not dirs:
        dirs = [
            dir_
            for dir_ in os.listdir(ROOT_DIR / "libs")
            if dir_ not in ("cli", "partners", "packages.yml")
            and "pyproject.toml" in os.listdir(ROOT_DIR / "libs" / dir_)
            p.parent.name
            for p in (ROOT_DIR / "libs").rglob("pyproject.toml")
            # Exclude packages that are not directly under libs/ or libs/partners/
            if p.parent.parent.name in ("libs", "partners")
        ]
        dirs += [
            dir_
            for dir_ in os.listdir(ROOT_DIR / "libs" / "partners")
            if os.path.isdir(ROOT_DIR / "libs" / "partners" / dir_)
            and "pyproject.toml" in os.listdir(ROOT_DIR / "libs" / "partners" / dir_)
        ]
    for dir_ in dirs:
    for dir_ in sorted(dirs):
        # Skip any hidden directories
        # Some of these could be present by mistake in the code base
        # e.g., .pytest_cache from running tests from the wrong location.
@@ -688,7 +679,7 @@ def main(dirs: Optional[list] = None) -> None:
        print("Building package:", dir_)
        _build_rst_file(package_name=dir_)

    _build_index(dirs)
    _build_index(sorted(dirs))
    print("API reference files built.")
@@ -1,10 +1,10 @@

# arXiv

LangChain implements the latest research in the field of Natural Language Processing.
This page contains `arXiv` papers referenced in the LangChain Documentation, API Reference,
Templates, and Cookbooks.

From the opposite direction, scientists use `LangChain` in research and reference it in the research papers.

`arXiv` papers with references to:
[LangChain](https://arxiv.org/search/?query=langchain&searchtype=all&source=header) | [LangGraph](https://arxiv.org/search/?query=langgraph&searchtype=all&source=header) | [LangSmith](https://arxiv.org/search/?query=langsmith&searchtype=all&source=header)

@@ -83,7 +83,7 @@ a set of open-domain QA datasets, covering multiple query complexities, and
show that ours enhances the overall efficiency and accuracy of QA systems,
compared to relevant baselines including the adaptive retrieval approaches.
Code is available at: https://github.com/starsuzi/Adaptive-RAG.

## Self-Discover: Large Language Models Self-Compose Reasoning Structures

- **Authors:** Pei Zhou, Jay Pujara, Xiang Ren, et al.

@@ -106,7 +106,7 @@ than 20%, while requiring 10-40x fewer inference compute. Finally, we show that
the self-discovered reasoning structures are universally applicable across
model families: from PaLM 2-L to GPT-4, and from GPT-4 to Llama2, and share
commonalities with human reasoning patterns.

## RAG-Fusion: a New Take on Retrieval-Augmented Generation

- **Authors:** Zackary Rackauckas

@@ -129,7 +129,7 @@ the generated queries' relevance to the original query is insufficient. This
research marks significant progress in artificial intelligence (AI) and natural
language processing (NLP) applications and demonstrates transformations in a
global and multi-industry context.

## RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

- **Authors:** Parth Sarthi, Salman Abdullah, Aditi Tuli, et al.

@@ -152,7 +152,7 @@ tasks. On question-answering tasks that involve complex, multi-step reasoning,
we show state-of-the-art results; for example, by coupling RAPTOR retrieval
with the use of GPT-4, we can improve the best performance on the QuALITY
benchmark by 20% in absolute accuracy.

## Corrective Retrieval Augmented Generation

- **Authors:** Shi-Qi Yan, Jia-Chen Gu, Yun Zhu, et al.

@@ -180,7 +180,7 @@ them. CRAG is plug-and-play and can be seamlessly coupled with various
RAG-based approaches. Experiments on four datasets covering short- and
long-form generation tasks show that CRAG can significantly improve the
performance of RAG-based approaches.

## Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering

- **Authors:** Tal Ridnik, Dedy Kredo, Itamar Friedman

@@ -206,7 +206,7 @@ to 44% with the AlphaCodium flow. Many of the principles and best practices
acquired in this work, we believe, are broadly applicable to general code
generation tasks. Full implementation is available at:
https://github.com/Codium-ai/AlphaCodium

## Mixtral of Experts

- **Authors:** Albert Q. Jiang, Alexandre Sablayrolles, Antoine Roux, et al.

@@ -229,7 +229,7 @@ multilingual benchmarks. We also provide a model fine-tuned to follow
instructions, Mixtral 8x7B - Instruct, that surpasses GPT-3.5 Turbo,
Claude-2.1, Gemini Pro, and Llama 2 70B - chat model on human benchmarks. Both
the base and instruct models are released under the Apache 2.0 license.

## Dense X Retrieval: What Retrieval Granularity Should We Use?

- **Authors:** Tong Chen, Hongwei Wang, Sihao Chen, et al.

@@ -255,7 +255,7 @@ also enhances the performance of downstream QA tasks, since the retrieved texts
are more condensed with question-relevant information, reducing the need for
lengthy input tokens and minimizing the inclusion of extraneous, irrelevant
information.

## Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models

- **Authors:** Wenhao Yu, Hongming Zhang, Xiaoman Pan, et al.

@@ -286,7 +286,7 @@ with CoN significantly outperform standard RALMs. Notably, CoN achieves an
average improvement of +7.9 in EM score given entirely noisy retrieved
documents and +10.5 in rejection rates for real-time questions that fall
outside the pre-training knowledge scope.

## Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

- **Authors:** Akari Asai, Zeqiu Wu, Yizhong Wang, et al.

@@ -317,7 +317,7 @@ outperforms ChatGPT and retrieval-augmented Llama2-chat on Open-domain QA,
reasoning and fact verification tasks, and it shows significant gains in
improving factuality and citation accuracy for long-form generations relative
to these models.

## Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models

- **Authors:** Huaixiu Steven Zheng, Swaroop Mishra, Xinyun Chen, et al.

@@ -338,7 +338,7 @@ substantial performance gains on various challenging reasoning-intensive tasks
including STEM, Knowledge QA, and Multi-Hop Reasoning. For instance, Step-Back
Prompting improves PaLM-2L performance on MMLU (Physics and Chemistry) by 7%
and 11% respectively, TimeQA by 27%, and MuSiQue by 7%.

## Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation

- **Authors:** Xuefei Ning, Zinan Lin, Zixuan Zhou, et al.

@@ -359,7 +359,7 @@ potentially improve the answer quality on several question categories. SoT is
an initial attempt at data-centric optimization for inference efficiency, and
showcases the potential of eliciting high-quality answers by explicitly
planning the answer structure in language.

## Llama 2: Open Foundation and Fine-Tuned Chat Models

- **Authors:** Hugo Touvron, Louis Martin, Kevin Stone, et al.

@@ -377,7 +377,7 @@ safety, may be a suitable substitute for closed-source models. We provide a
detailed description of our approach to fine-tuning and safety improvements of
Llama 2-Chat in order to enable the community to build on our work and
contribute to the responsible development of LLMs.

## Lost in the Middle: How Language Models Use Long Contexts

- **Authors:** Nelson F. Liu, Kevin Lin, John Hewitt, et al.

@@ -399,7 +399,7 @@ significantly degrades when models must access relevant information in the
middle of long contexts, even for explicitly long-context models. Our analysis
provides a better understanding of how language models use their input context
and provides new evaluation protocols for future long-context language models.

## Query Rewriting for Retrieval-Augmented Large Language Models

- **Authors:** Xinbei Ma, Yeyun Gong, Pengcheng He, et al.

@@ -426,7 +426,7 @@ Evaluation is conducted on downstream tasks, open-domain QA and multiple-choice
QA. Experiments results show consistent performance improvement, indicating
that our framework is proven effective and scalable, and brings a new framework
for retrieval-augmented LLM.

## Large Language Model Guided Tree-of-Thought

- **Authors:** Jieyi Long

@@ -452,7 +452,7 @@ the effectiveness of the proposed technique, we implemented a ToT-based solver
for the Sudoku Puzzle. Experimental results show that the ToT framework can
significantly increase the success rate of Sudoku puzzle solving. Our
implementation of the ToT-based Sudoku solver is available on [GitHub](https://github.com/jieyilong/tree-of-thought-puzzle-solver).

## Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models

- **Authors:** Lei Wang, Wanyu Xu, Yihuai Lan, et al.

@@ -482,7 +482,7 @@ by a large margin, is comparable to or exceeds Zero-shot-Program-of-Thought
Prompting, and has comparable performance with 8-shot CoT prompting on the math
reasoning problem. The code can be found at
https://github.com/AGI-Edgerunners/Plan-and-Solve-Prompting.

## Zero-Shot Listwise Document Reranking with a Large Language Model

- **Authors:** Xueguang Ma, Xinyu Zhang, Ronak Pradeep, et al.

@@ -506,7 +506,7 @@ results, but can also act as a final-stage reranker to improve the top-ranked
results of a pointwise method for improved efficiency. Additionally, we apply
our approach to subsets of MIRACL, a recent multilingual retrieval dataset,
with results showing its potential to generalize across different languages.

## Visual Instruction Tuning

- **Authors:** Haotian Liu, Chunyuan Li, Qingyang Wu, et al.

@@ -530,7 +530,7 @@ instruction-following dataset. When fine-tuned on Science QA, the synergy of
LLaVA and GPT-4 achieves a new state-of-the-art accuracy of 92.53%. We make
GPT-4 generated visual instruction tuning data, our model and code base
publicly available.

## Generative Agents: Interactive Simulacra of Human Behavior

- **Authors:** Joon Sung Park, Joseph C. O'Brien, Carrie J. Cai, et al.

@@ -563,7 +563,7 @@ architecture--observation, planning, and reflection--each contribute critically
to the believability of agent behavior. By fusing large language models with
computational, interactive agents, this work introduces architectural and
interaction patterns for enabling believable simulations of human behavior.

## CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society

- **Authors:** Guohao Li, Hasan Abed Al Kader Hammoud, Hani Itani, et al.

@@ -590,7 +590,7 @@ include introducing a novel communicative agent framework, offering a scalable
approach for studying the cooperative behaviors and capabilities of multi-agent
systems, and open-sourcing our library to support research on communicative
agents and beyond: https://github.com/camel-ai/camel.

## HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face

- **Authors:** Yongliang Shen, Kaitao Song, Xu Tan, et al.

@@ -619,7 +619,7 @@ HuggingGPT can tackle a wide range of sophisticated AI tasks spanning different
modalities and domains and achieve impressive results in language, vision,
speech, and other challenging tasks, which paves a new way towards the
realization of artificial general intelligence.

## A Watermark for Large Language Models

- **Authors:** John Kirchenbauer, Jonas Geiping, Yuxin Wen, et al.

@@ -641,7 +641,7 @@ interpretable p-values, and derive an information-theoretic framework for
analyzing the sensitivity of the watermark. We test the watermark using a
multi-billion parameter model from the Open Pretrained Transformer (OPT)
family, and discuss robustness and security.

## Precise Zero-Shot Dense Retrieval without Relevance Labels

- **Authors:** Luyu Gao, Xueguang Ma, Jimmy Lin, et al.

@@ -670,7 +670,7 @@ details. Our experiments show that HyDE significantly outperforms the
state-of-the-art unsupervised dense retriever Contriever and shows strong
performance comparable to fine-tuned retrievers, across various tasks (e.g. web
search, QA, fact verification) and languages (e.g. sw, ko, ja).

## Constitutional AI: Harmlessness from AI Feedback

- **Authors:** Yuntao Bai, Saurav Kadavath, Sandipan Kundu, et al.

@@ -697,7 +697,7 @@ and RL methods can leverage chain-of-thought style reasoning to improve the
human-judged performance and transparency of AI decision making. These methods
make it possible to control AI behavior more precisely and with far fewer human
labels.

## Robust and Explainable Identification of Logical Fallacies in Natural Language Arguments

- **Authors:** Zhivar Sourati, Vishnu Priya Prasanna Venkatesh, Darshan Deshpande, et al.

@@ -727,7 +727,7 @@ components and fallacy classes, indicating that fallacy identification is a
challenging task that may require specialized forms of reasoning to capture
various classes. We share our open-source code and data on GitHub to support
further work on logical fallacy identification.

## Complementary Explanations for Effective In-Context Learning

- **Authors:** Xi Ye, Srinivasan Iyer, Asli Celikyilmaz, et al.

@@ -752,7 +752,7 @@ performance. Therefore, we propose a maximal marginal relevance-based exemplar
selection approach for constructing exemplar sets that are both relevant as
well as complementary, which successfully improves the in-context learning
performance across three real-world tasks on multiple LLMs.

## PAL: Program-aided Language Models

- **Authors:** Luyu Gao, Aman Madaan, Shuyan Zhou, et al.

@@ -784,7 +784,7 @@ larger models. For example, PAL using Codex achieves state-of-the-art few-shot
accuracy on the GSM8K benchmark of math word problems, surpassing PaLM-540B
which uses chain-of-thought by absolute 15% top-1. Our code and data are
publicly available at http://reasonwithpal.com/ .

## An Analysis of Fusion Functions for Hybrid Retrieval

- **Authors:** Sebastian Bruch, Siyu Gai, Amir Ingber

@@ -803,7 +803,7 @@ learning of a CC fusion is generally agnostic to the choice of score
normalization; that CC outperforms RRF in in-domain and out-of-domain settings;
and finally, that CC is sample efficient, requiring only a small set of
training examples to tune its only parameter to a target domain.

## ReAct: Synergizing Reasoning and Acting in Language Models

- **Authors:** Shunyu Yao, Jeffrey Zhao, Dian Yu, et al.

@@ -835,7 +835,7 @@ benchmarks (ALFWorld and WebShop), ReAct outperforms imitation and
reinforcement learning methods by an absolute success rate of 34% and 10%
respectively, while being prompted with only one or two in-context examples.
Project site with code: https://react-lm.github.io

## Deep Lake: a Lakehouse for Deep Learning

- **Authors:** Sasun Hambardzumyan, Abhinav Tuli, Levon Ghukasyan, et al.

@@ -860,7 +860,7 @@ streams the data over the network to (a) Tensor Query Language, (b) in-browser
visualization engine, or (c) deep learning frameworks without sacrificing GPU
utilization. Datasets stored in Deep Lake can be accessed from PyTorch,
TensorFlow, JAX, and integrate with numerous MLOps tools.

## Matryoshka Representation Learning

- **Authors:** Aditya Kusupati, Gantavya Bhatt, Aniket Rege, et al.

@@ -891,7 +891,7 @@ representations. Finally, we show that MRL extends seamlessly to web-scale
datasets (ImageNet, JFT) across various modalities -- vision (ViT, ResNet),
vision + language (ALIGN) and language (BERT). MRL code and pretrained models
are open-sourced at https://github.com/RAIVNLab/MRL.

## Bitext Mining Using Distilled Sentence Representations for Low-Resource Languages

- **Authors:** Kevin Heffernan, Onur Çelebi, Holger Schwenk

@@ -917,7 +917,7 @@ which is valuable in the low-resource setting.
very low-resource languages and handle 50 African languages, many of which are
not covered by any other model. For these languages, we train sentence
encoders, mine bitexts, and validate the bitexts by training NMT systems.

## Evaluating the Text-to-SQL Capabilities of Large Language Models

- **Authors:** Nitarshan Rajkumar, Raymond Li, Dzmitry Bahdanau

@@ -934,7 +934,7 @@ this setting. Furthermore, we demonstrate on the GeoQuery and Scholar
benchmarks that a small number of in-domain examples provided in the prompt
enables Codex to perform better than state-of-the-art models finetuned on such
few-shot examples.

## Locally Typical Sampling

- **Authors:** Clara Meister, Tiago Pimentel, Gian Wiher, et al.

@@ -963,7 +963,7 @@ human evaluations show that, in comparison to nucleus and top-k sampling,
locally typical sampling offers competitive performance (in both abstractive
summarization and story generation) in terms of quality while consistently
reducing degenerate repetitions.

## ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction

- **Authors:** Keshav Santhanam, Omar Khattab, Jon Saad-Falcon, et al.

@@ -985,7 +985,7 @@ improve the quality and space footprint of late interaction. We evaluate
ColBERTv2 across a wide range of benchmarks, establishing state-of-the-art
quality within and outside the training domain while reducing the space
footprint of late interaction models by 6--10$\times$.

## Learning Transferable Visual Models From Natural Language Supervision

- **Authors:** Alec Radford, Jong Wook Kim, Chris Hallacy, et al.

@@ -1014,7 +1014,7 @@ For instance, we match the accuracy of the original ResNet-50 on ImageNet
zero-shot without needing to use any of the 1.28 million training examples it
was trained on. We release our code and pre-trained model weights at
https://github.com/OpenAI/CLIP.

## Language Models are Few-Shot Learners

- **Authors:** Tom B. Brown, Benjamin Mann, Nick Ryder, et al.

@@ -1047,7 +1047,7 @@ training on large web corpora. Finally, we find that GPT-3 can generate samples
of news articles which human evaluators have difficulty distinguishing from
articles written by humans. We discuss broader societal impacts of this finding
and of GPT-3 in general.

## Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

- **Authors:** Patrick Lewis, Ethan Perez, Aleksandra Piktus, et al.

@@ -1078,7 +1078,7 @@ parametric seq2seq models and task-specific retrieve-and-extract architectures.
For language generation tasks, we find that RAG models generate more specific,
diverse and factual language than a state-of-the-art parametric-only seq2seq
baseline.

## CTRL: A Conditional Transformer Language Model for Controllable Generation

- **Authors:** Nitish Shirish Keskar, Bryan McCann, Lav R. Varshney, et al.

@@ -1098,4 +1098,3 @@ codes also allow CTRL to predict which parts of the training data are most
likely given a sequence. This provides a potential method for analyzing large
amounts of data via model-based source attribution. We have released multiple
full-sized, pretrained versions of CTRL at https://github.com/salesforce/ctrl.
@@ -7,4 +7,4 @@

- `BaseChatModel` methods `__call__`, `call_as_llm`, `predict`, `predict_messages`. Will be removed in 0.2.0. Use `BaseChatModel.invoke` instead.
- `BaseChatModel` methods `apredict`, `apredict_messages`. Will be removed in 0.2.0. Use `BaseChatModel.ainvoke` instead.
- `BaseLLM` methods `__call__`, `predict`, `predict_messages`. Will be removed in 0.2.0. Use `BaseLLM.invoke` instead.
- `BaseLLM` methods `apredict`, `apredict_messages`. Will be removed in 0.2.0. Use `BaseLLM.ainvoke` instead.
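For illustration, migrating one of these calls is mechanical (a sketch, assuming `langchain-openai` is installed; any chat model works the same way):

```python
from langchain_openai import ChatOpenAI

model = ChatOpenAI()

# Deprecated, removed in 0.2.0:
# model.predict("Tell me a joke")

# Replacement:
model.invoke("Tell me a joke")
```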
@@ -90,4 +90,4 @@ Deprecated classes and methods will be removed in 0.2.0

| OpenAIMultiFunctionsAgent | create_openai_tools_agent | Use LCEL builder over a class |
| SelfAskWithSearchAgent | create_self_ask_with_search | Use LCEL builder over a class |
| StructuredChatAgent | create_structured_chat_agent | Use LCEL builder over a class |
| XMLAgent | create_xml_agent | Use LCEL builder over a class |
@@ -11,8 +11,8 @@ Please see the following resources for more information:

## Legacy agent concept: AgentExecutor

LangChain previously introduced the `AgentExecutor` as a runtime for agents.
While it served as an excellent starting point, its limitations became apparent when dealing with more sophisticated and customized agents.
As a result, we're gradually phasing out `AgentExecutor` in favor of more flexible solutions in LangGraph.

### Transitioning from AgentExecutor to LangGraph
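A minimal sketch of the LangGraph replacement, assuming `langgraph` is installed and that `model` and `tools` are already defined elsewhere:

```python
from langgraph.prebuilt import create_react_agent

# Build a tool-calling agent loop without AgentExecutor.
agent = create_react_agent(model, tools)
result = agent.invoke({"messages": [("user", "What's 3 plus 5?")]})
```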
@@ -70,4 +70,4 @@ This is a common reason why you may fail to see events being emitted from custom
runnables or tools.
:::

For specifics on how to use callbacks, see the [relevant how-to guides here](/docs/how_to/#callbacks).
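As a quick orientation, a minimal custom handler might look like this (an illustrative sketch; see the how-to guides above for the full API):

```python
from langchain_core.callbacks import BaseCallbackHandler

class PrintingHandler(BaseCallbackHandler):
    """Print a line when a chat model run starts and ends."""

    def on_chat_model_start(self, serialized, messages, **kwargs):
        print("Chat model started")

    def on_llm_end(self, response, **kwargs):
        print("Chat model ended")

# Pass the handler at invocation time so it also propagates to child runs:
# model.invoke("Hello", config={"callbacks": [PrintingHandler()]})
```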
@@ -26,7 +26,7 @@ A full conversation often involves a combination of two patterns of alternating

Since chat models have a maximum limit on input size, it's important to manage chat history and trim it as needed to avoid exceeding the [context window](/docs/concepts/chat_models/#context-window), as sketched below.
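A minimal trimming sketch using `trim_messages` (here the toy `token_counter=len` counts each message as one "token"; real applications would count model tokens):

```python
from langchain_core.messages import (
    AIMessage,
    HumanMessage,
    SystemMessage,
    trim_messages,
)

messages = [
    SystemMessage(content="You are a helpful assistant."),
    HumanMessage(content="Hi, I'm Bob."),
    AIMessage(content="Hello Bob!"),
    HumanMessage(content="What's my name?"),
]

# Keep the system message plus the most recent messages that fit the budget,
# making sure the kept history starts on a human turn.
trimmed = trim_messages(
    messages,
    max_tokens=3,
    strategy="last",
    token_counter=len,
    include_system=True,
    start_on="human",
)
```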
While processing chat history, it's essential to preserve a correct conversation structure.

Key guidelines for managing chat history:
@@ -127,7 +127,7 @@ If the input exceeds the context window, the model may not be able to process th

The size of the input is measured in [tokens](/docs/concepts/tokens), which are the unit of processing that the model uses.

## Advanced topics

### Rate-limiting

Many chat model providers impose a limit on the number of requests that can be made in a given time period. A client-side rate limiter, as sketched below, can help you stay under these limits.
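A limiter sketch using `InMemoryRateLimiter` (the parameter values are illustrative):

```python
from langchain_core.rate_limiters import InMemoryRateLimiter

# Roughly one request every 10 seconds, polling for an available slot
# every 100 ms, and allowing bursts of up to 10 requests.
rate_limiter = InMemoryRateLimiter(
    requests_per_second=0.1,
    check_every_n_seconds=0.1,
    max_bucket_size=10,
)

# Chat model integrations that accept `rate_limiter` can then use it, e.g.:
# model = ChatOpenAI(model="gpt-4o", rate_limiter=rate_limiter)
```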
@@ -15,9 +15,9 @@ Embedding models can also be [multimodal](/docs/concepts/multimodality) though s

Imagine being able to capture the essence of any text - a tweet, document, or book - in a single, compact representation.
This is the power of embedding models, which lie at the heart of many retrieval systems.
Embedding models transform human language into a format that machines can understand and compare with speed and accuracy.
These models take text as input and produce a fixed-length array of numbers, a numerical fingerprint of the text's semantic meaning.
Embeddings allow search systems to find relevant documents not just based on keyword matches, but on semantic understanding.

## Key concepts

@@ -27,16 +27,16 @@ Embeddings allow search systems to find relevant documents not just based on keyw

(2) **Measure similarity**: Embedding vectors can be compared using simple mathematical operations.

## Embedding

### Historical context

The landscape of embedding models has evolved significantly over the years.
A pivotal moment came in 2018 when Google introduced [BERT (Bidirectional Encoder Representations from Transformers)](https://www.nvidia.com/en-us/glossary/bert/).
BERT applied transformer models to embed text as a simple vector representation, which led to unprecedented performance across various NLP tasks.
However, BERT wasn't optimized for generating sentence embeddings efficiently.
This limitation spurred the creation of [SBERT (Sentence-BERT)](https://www.sbert.net/examples/training/sts/README.html), which adapted the BERT architecture to generate semantically rich sentence embeddings, easily comparable via similarity metrics like cosine similarity, dramatically reducing the computational overhead for tasks like finding similar sentences.
Today, the embedding model ecosystem is diverse, with numerous providers offering their own implementations.
To navigate this variety, researchers and practitioners often turn to benchmarks like the [Massive Text Embedding Benchmark (MTEB)](https://huggingface.co/blog/mteb) for objective comparisons.

:::info[Further reading]

@@ -93,9 +93,9 @@ LangChain offers many embedding model integrations which you can find [on the em

## Measure similarity

Each embedding is essentially a set of coordinates, often in a high-dimensional space.
In this space, the position of each point (embedding) reflects the meaning of its corresponding text.
Just as similar words might be close to each other in a thesaurus, similar concepts end up close to each other in this embedding space.
This allows for intuitive comparisons between different pieces of text.
By reducing text to these numerical representations, we can use simple mathematical operations to quickly measure how alike two pieces of text are, regardless of their original length or structure.
Some common similarity metrics include:
@@ -118,7 +118,7 @@ def cosine_similarity(vec1, vec2):

similarity = cosine_similarity(query_result, document_result)
print("Cosine Similarity:", similarity)
```
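For reference, a self-contained version of the snippet above might look like this (the toy vectors stand in for real embedding-model outputs):

```python
import numpy as np

def cosine_similarity(vec1, vec2):
    """Cosine of the angle between two embedding vectors."""
    v1, v2 = np.asarray(vec1), np.asarray(vec2)
    return float(np.dot(v1, v2) / (np.linalg.norm(v1) * np.linalg.norm(v2)))

query_result = [0.1, 0.2, 0.3]       # stand-in for an embedded query
document_result = [0.1, 0.2, 0.25]   # stand-in for an embedded document

similarity = cosine_similarity(query_result, document_result)
print("Cosine Similarity:", similarity)
```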
:::info[Further reading]

@@ -127,4 +127,4 @@ print("Cosine Similarity:", similarity)

* See Pinecone's [blog post](https://www.pinecone.io/learn/vector-similarity/) on similarity metrics.
* See OpenAI's [FAQ](https://platform.openai.com/docs/guides/embeddings/faq) on what similarity metric to use with OpenAI embeddings.

:::
@@ -14,4 +14,3 @@ This process is vital for building reliable applications.

- It allows you to track results over time and automatically run your evaluators on a schedule or as part of CI/CD

To learn more, check out [this LangSmith guide](https://docs.smith.langchain.com/concepts/evaluation).
@@ -17,4 +17,4 @@ Sometimes these examples are hardcoded into the prompt, but for more advanced si
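For the more advanced case, examples can be selected dynamically by semantic similarity; a sketch, assuming OpenAI embeddings and the in-memory vector store (both choices are illustrative):

```python
from langchain_core.example_selectors import SemanticSimilarityExampleSelector
from langchain_core.vectorstores import InMemoryVectorStore
from langchain_openai import OpenAIEmbeddings

examples = [
    {"input": "happy", "output": "sad"},
    {"input": "tall", "output": "short"},
    {"input": "fast", "output": "slow"},
]

# Index the examples by embedding similarity and pick the closest match.
selector = SemanticSimilarityExampleSelector.from_examples(
    examples,
    OpenAIEmbeddings(),
    InMemoryVectorStore,
    k=1,
)
print(selector.select_examples({"input": "cheerful"}))
```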
## Related resources

* [Example selector how-to guides](/docs/how_to/#example-selectors)
@@ -14,7 +14,7 @@

* [Chat models](/docs/concepts/chat_models)
* [Messages](/docs/concepts/messages)
:::

LangChain supports multimodal data as input to chat models:

1. Following provider-specific formats
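For instance, an image URL can be passed alongside text as content blocks (a sketch; the exact block format varies by provider, and the URL here is a placeholder):

```python
from langchain_core.messages import HumanMessage

message = HumanMessage(
    content=[
        {"type": "text", "text": "Describe the weather in this image:"},
        {"type": "image_url", "image_url": {"url": "https://example.com/weather.png"}},
    ]
)
# response = model.invoke([message])
```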
@@ -8,7 +8,7 @@

## Overview

Retrieval Augmented Generation (RAG) is a powerful technique that enhances [language models](/docs/concepts/chat_models/) by combining them with external knowledge bases.
RAG addresses [a key limitation of models](https://www.glean.com/blog/how-to-build-an-ai-assistant-for-the-enterprise): models rely on fixed training datasets, which can lead to outdated or incomplete information.
When given a query, RAG systems first search a knowledge base for relevant information.
The system then incorporates this retrieved information into the model's prompt.
@@ -44,7 +44,7 @@ See our conceptual guide on [retrieval](/docs/concepts/retrieval/).

## Adding external knowledge

With a retrieval system in place, we need to pass knowledge from this system to the model.
A RAG pipeline typically achieves this by following these steps:

- Receive an input query.
@@ -59,12 +59,12 @@ from langchain_openai import ChatOpenAI
from langchain_core.messages import SystemMessage, HumanMessage

# Define a system prompt that tells the model how to use the retrieved context
system_prompt = """You are an assistant for question-answering tasks.
Use the following pieces of retrieved context to answer the question.
If you don't know the answer, just say that you don't know.
Use three sentences maximum and keep the answer concise.
Context: {context}:"""

# Define a question
question = """What are the main components of an LLM-powered autonomous agent system?"""

@@ -78,7 +78,7 @@ docs_text = "".join(d.page_content for d in docs)
system_prompt_fmt = system_prompt.format(context=docs_text)

# Create a model
model = ChatOpenAI(model="gpt-4o", temperature=0)

# Generate a response
questions = model.invoke([SystemMessage(content=system_prompt_fmt),
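The invocation above is truncated by the hunk boundary; presumably it continues by passing the user's question as a `HumanMessage`, along these lines (an illustrative sketch, not the file's exact continuation):

```python
questions = model.invoke([
    SystemMessage(content=system_prompt_fmt),
    HumanMessage(content=question),
])
```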
@@ -10,28 +10,28 @@

:::

:::danger[Security]

Some of the concepts reviewed here utilize models to generate queries (e.g., for SQL or graph databases).
There are inherent risks in doing this.
Make sure that your database connection permissions are scoped as narrowly as possible for your application's needs.
This will mitigate, though not eliminate, the risks of building a model-driven system capable of querying databases.
For more on general security best practices, see our [security guide](/docs/security/).

:::

## Overview

Retrieval systems are fundamental to many AI applications, efficiently identifying relevant information from large datasets.
These systems accommodate various data formats:

- Unstructured text (e.g., documents) is often stored in vector stores or lexical search indexes.
- Structured data is typically housed in relational or graph databases with defined schemas.

Despite the growing diversity in data formats, modern AI applications increasingly aim to make all types of data accessible through natural language interfaces.
Models play a crucial role in this process by translating natural language queries into formats compatible with the underlying search index or database.
This translation enables more intuitive and flexible interactions with complex data structures.

## Key concepts

![retrieval](/img/retrieval_concept.png)

@@ -39,20 +39,20 @@ This translation enables more intuitive and flexible interactions with complex d

(2) **Information retrieval**: Search queries are used to fetch information from various retrieval systems.

## Query analysis

While users typically prefer to interact with retrieval systems using natural language, these systems may require specific query syntax or benefit from certain keywords.
Query analysis serves as a bridge between raw user input and optimized search queries. Some common applications of query analysis include:

1. **Query Re-writing**: Queries can be re-written or expanded to improve semantic or lexical searches.
2. **Query Construction**: Search indexes may require structured queries (e.g., SQL for databases).

Query analysis employs models to transform or construct optimized search queries from raw user input.

### Query re-writing

Retrieval systems should ideally handle a wide spectrum of user inputs, from simple and poorly worded queries to complex, multi-faceted questions.
To achieve this versatility, a popular approach is to use models to transform raw user queries into more effective search queries.
This transformation can range from simple keyword extraction to sophisticated query expansion and reformulation.
Here are some key benefits of using models for query analysis in unstructured data retrieval:

```python
from typing import List

from pydantic import BaseModel, Field
from langchain_openai import ChatOpenAI


class Questions(BaseModel):
    """Sub-questions derived from the user's input query."""
    questions: List[str] = Field(
        description="A list of sub-questions related to the input query."
    )


# Create an instance of the model and enforce the output structure
model = ChatOpenAI(model="gpt-4o", temperature=0)
structured_model = model.with_structured_output(Questions)

# Define the system prompt (the prompt text here is illustrative)
system = """You are a helpful assistant that generates multiple sub-questions related to an input question."""
```

See our RAG from Scratch videos for a few different specific approaches.

### Query construction

Query analysis can also focus on translating natural language queries into specialized query languages or filters.
This translation is crucial for effectively interacting with various types of databases that house structured or semi-structured data.

1. **Structured Data examples**: For relational and graph databases, Domain-Specific Languages (DSLs) are used to query data.

These approaches leverage models to bridge the gap between user intent and the specific query languages and filters required by different data stores.

| Name | When to use | Description |
|------|-------------|-------------|
| [Text to SQL](/docs/tutorials/sql_qa/) | If users are asking questions that require information housed in a relational database, accessible via SQL. | This uses an LLM to transform user input into a SQL query. |
| [Text-to-Cypher](/docs/tutorials/graph/) | If users are asking questions that require information housed in a graph database, accessible via Cypher. | This uses an LLM to transform user input into a Cypher query. |

As an example, here is how to use the `SelfQueryRetriever` to convert natural language queries into metadata filters.

```python
```python
# `schema_for_metadata` and `vectorstore` are assumed to be defined elsewhere
metadata_field_info = schema_for_metadata
document_content_description = "Brief summary of a movie"
llm = ChatOpenAI(temperature=0)
retriever = SelfQueryRetriever.from_llm(
    llm,
    vectorstore,
    document_content_description,
    metadata_field_info,
)
```

:::info[Further reading]

* See our [blog post overview](https://blog.langchain.dev/query-construction/).
* See our RAG from Scratch video on [query construction](https://youtu.be/kl6NwWYxvbM?feature=shared).

:::

## Information retrieval

### Common retrieval systems

#### Lexical search indexes

Many search engines are based upon matching words in a query to the words in each document.
This approach is called lexical retrieval, using search [algorithms that are typically based upon word frequencies](https://cameronrwolfe.substack.com/p/the-basics-of-ai-powered-vector-search?utm_source=profile&utm_medium=reader2).
The intuition is simple: if a word appears frequently both in the user's query and in a particular document, then this document is likely a good match.

The particular data structure used to implement this is often an [*inverted index*](https://www.geeksforgeeks.org/inverted-index/).
This type of index contains a list of words and a mapping of each word to a list of locations at which it occurs in various documents.
Using this data structure, it is possible to efficiently match the words in search queries to the documents in which they appear.
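
As a toy illustration of the idea (not LangChain code, just the bare data structure; the documents are made up):

```python
from collections import defaultdict

docs = {0: "the cat sat", 1: "the dog sat"}

# Map each word to the set of document ids in which it appears
inverted_index = defaultdict(set)
for doc_id, text in docs.items():
    for word in text.split():
        inverted_index[word].add(doc_id)

# Matching query words to documents is then a cheap set intersection
matches = set.intersection(*(inverted_index[w] for w in "cat sat".split()))  # {0}
```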
[BM25](https://en.wikipedia.org/wiki/Okapi_BM25#:~:text=BM25%20is%20a%20bag%2Dof,slightly%20different%20components%20and%20parameters.) and [TF-IDF](https://en.wikipedia.org/wiki/Tf%E2%80%93idf) are [two popular lexical search algorithms](https://cameronrwolfe.substack.com/p/the-basics-of-ai-powered-vector-search?utm_source=profile&utm_medium=reader2).
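
For example, LangChain's BM25 integration can be used as follows (a minimal sketch; the texts are illustrative, and the `rank_bm25` package must be installed):

```python
from langchain_community.retrievers import BM25Retriever

retriever = BM25Retriever.from_texts([
    "BM25 ranks documents using term frequencies",
    "Vector stores rank documents using embeddings",
])
docs = retriever.invoke("How does BM25 rank documents?")
```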

:::info[Further reading]

* See the [BM25](/docs/integrations/retrievers/bm25/) retriever integration.
* See the [Elasticsearch](/docs/integrations/retrievers/elasticsearch_retriever/) retriever integration.

:::

#### Vector indexes

Vector indexes are an alternative way to index and store unstructured data.
See our conceptual guide on [vectorstores](/docs/concepts/vectorstores/) for a detailed overview.
In short, rather than using word frequencies, vectorstores use an [embedding model](/docs/concepts/embedding_models/) to compress documents into high-dimensional vector representations.
This allows for efficient similarity search over embedding vectors using simple mathematical operations like cosine similarity.
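
As a rough sketch of the underlying math (plain NumPy, independent of any particular vector store):

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    # Dot product of the vectors, normalized by their magnitudes
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))
```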
#### Relational databases

Relational databases are a fundamental type of structured data storage used in many applications.
They organize data into tables with predefined schemas, where each table represents an entity or relationship.
Data is stored in rows (records) and columns (attributes), allowing for efficient querying and manipulation through SQL (Structured Query Language).
Relational databases excel at maintaining data integrity, supporting complex queries, and handling relationships between different data entities.

#### Graph databases

Graph databases are a specialized type of database designed to store and manage highly interconnected data.
Unlike traditional relational databases, graph databases use a flexible structure consisting of nodes (entities), edges (relationships), and properties.
This structure allows for efficient representation and querying of complex, interconnected data.
They are particularly useful for storing and querying complex relationships between data points, such as social networks, supply-chain management, fraud detection, and recommendation services.

:::info[Further reading]

* See our [tutorial](/docs/tutorials/graph/) for working with graph databases.
* See our [list of graph database integrations](/docs/integrations/graphs/).
* See Neo4j's [starter kit for LangChain](https://neo4j.com/developer-blog/langchain-neo4j-starter-kit/).

:::

### Retriever

LangChain provides a unified interface for interacting with various retrieval systems through the [retriever](/docs/concepts/retrievers/) concept. The interface is straightforward.

## Key concept



All retrievers implement a simple interface for retrieving documents using natural language queries.

## Interface

The only requirement for a retriever is the ability to accept a query and return documents.
In particular, [LangChain's retriever class](https://python.langchain.com/api_reference/core/retrievers/langchain_core.retrievers.BaseRetriever.html#) only requires that the `_get_relevant_documents` method is implemented, which takes a `query: str` and returns a list of [Document](https://python.langchain.com/api_reference/core/documents/langchain_core.documents.base.Document.html) objects that are most relevant to the query.
The underlying logic used to get relevant documents is specified by the retriever and can be whatever is most useful for the application.

A LangChain retriever is a [runnable](/docs/how_to/lcel_cheatsheet/), which is a standard interface for LangChain components.
This means that it has a few common methods, including `invoke`, that are used to interact with it. A retriever can be invoked with a query:

```python
docs = retriever.invoke(query)
```

Retrievers return a list of [Document](https://python.langchain.com/api_reference/core/documents/langchain_core.documents.base.Document.html) objects, which have two attributes:

* `page_content`: The content of this document. Currently a string.
* `metadata`: Arbitrary metadata associated with this document (e.g., document id, file name, source, etc).

:::info[Further reading]

* See our [how-to guide](/docs/how_to/custom_retriever/) on building your own custom retriever.

:::

## Common types

Despite the flexibility of the retriever interface, a few common types of retrieval systems are frequently used.

### Search APIs

It's important to note that retrievers don't need to actually *store* documents.
For example, we can build retrievers on top of search APIs that simply return search results!
See our retriever integrations with [Amazon Kendra](/docs/integrations/retrievers/amazon_kendra_retriever/) or [Wikipedia Search](/docs/integrations/retrievers/wikipedia/).
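
A minimal sketch using the Wikipedia integration (requires the `wikipedia` package; the query is illustrative):

```python
from langchain_community.retrievers import WikipediaRetriever

retriever = WikipediaRetriever()
docs = retriever.invoke("LangChain")
```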
### Relational or graph database

For example, you can build a retriever for a SQL database using text-to-SQL conversion.

### Lexical search

As discussed in our conceptual review of [retrieval](/docs/concepts/retrieval/), many search engines are based upon matching words in a query to the words in each document.
[BM25](https://en.wikipedia.org/wiki/Okapi_BM25#:~:text=BM25%20is%20a%20bag%2Dof,slightly%20different%20components%20and%20parameters.) and [TF-IDF](https://en.wikipedia.org/wiki/Tf%E2%80%93idf) are [two popular lexical search algorithms](https://cameronrwolfe.substack.com/p/the-basics-of-ai-powered-vector-search?utm_source=profile&utm_medium=reader2).
LangChain has retrievers for many popular lexical search algorithms / engines.

:::info[Further reading]

* See the [TF-IDF](/docs/integrations/retrievers/tf_idf/) retriever integration.
* See the [Elasticsearch](/docs/integrations/retrievers/elasticsearch_retriever/) retriever integration.

:::

### Vector store

[Vector stores](/docs/concepts/vectorstores/) are a powerful and efficient way to index and retrieve unstructured data.
A vectorstore can be used as a retriever by calling the `as_retriever()` method.

```python
retriever = vectorstore.as_retriever()
```

## Advanced retrieval patterns

### Ensemble

Because the retriever interface is so simple, returning a list of `Document` objects given a search query, it is possible to combine multiple retrievers using ensembling.
This is particularly useful when you have multiple retrievers that are good at finding different types of relevant documents.

```python
from langchain.retrievers import EnsembleRetriever

# `bm25_retriever` and `vector_retriever` are assumed to be defined
# elsewhere; the weights are illustrative.
ensemble_retriever = EnsembleRetriever(
    retrievers=[bm25_retriever, vector_retriever],
    weights=[0.5, 0.5],
)
```

When ensembling, how do we combine search results from many retrievers?
This motivates the concept of re-ranking, which takes the output of multiple retrievers and combines them using a more sophisticated algorithm such as [Reciprocal Rank Fusion (RRF)](https://plg.uwaterloo.ca/~gvcormac/cormacksigir09-rrf.pdf).
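
A toy sketch of RRF (the constant `k=60` follows the paper; inputs are ranked lists of document ids):

```python
from collections import defaultdict

def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    # Each document's score is the sum of 1 / (k + rank) over every list it appears in
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)
```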

### Source document retention

Many retrievers utilize some kind of index to make documents easily searchable.
The process of indexing can include a transformation step (e.g., vectorstores often use document splitting).
Whatever transformation is used, it can be very useful to retain a link between the *transformed document* and the original, giving the retriever the ability to return the *original* document.



This is particularly useful in AI applications, because it ensures no loss in document context for the model.
For example, you may use a small chunk size for indexing documents in a vectorstore.
If you return *only* the chunks as the retrieval result, then the model will have lost the original document context for the chunks.

LangChain has two different retrievers that can be used to address this challenge.
The [Multi-Vector](/docs/how_to/multi_vector/) retriever allows the user to use any document transformation (e.g., use an LLM to write a summary of the document) for indexing while retaining linkage to the source document.
The [ParentDocument](/docs/how_to/parent_document_retriever/) retriever links document chunks from a text-splitter transformation for indexing while retaining linkage to the source document.

| Name | Index Type | Uses an LLM | When to Use | Description |
|------|------------|-------------|-------------|-------------|
| [Multi-Vector](/docs/how_to/multi_vector/) | Vector store + document store | Sometimes, during indexing (e.g., to write a summary) | When you want to index a transformed version of a document while returning the original. | Indexes any document transformation (e.g., an LLM-written summary) while retaining a link to the source document. |
| [ParentDocument](/docs/how_to/parent_document_retriever/) | Vector store + document store | No | When you want to index small chunks but return the full source document. | Indexes document chunks from a text-splitter transformation while retaining a link to the source document. |

The Runnable interface provides methods to get the [JSON Schema](https://json-schema.org/) of a Runnable's input and output types.
These APIs are mostly used internally for unit-testing and by [LangServe](/docs/concepts/architecture#langserve) which uses the APIs for input validation and generation of [OpenAPI documentation](https://www.openapis.org/).

In addition to the input and output types, some Runnables have been set up with additional run time configuration options.
There are corresponding APIs to get the Pydantic Schema and JSON Schema of the configuration options for the Runnable.
Please see the [Configurable Runnables](#configurable-runnables) section for more information.

Passing `config` to the `invoke` method is done like so:

```python
some_runnable.invoke(
    some_input,
    config={
        'run_name': 'my_run',
        'tags': ['tag1', 'tag2'],
        'metadata': {'key': 'value'}
    }
)
```

There are two main patterns by which new `Runnables` are created:

```python
# A custom function that invokes another runnable imperatively,
# wrapped in a RunnableLambda.
def foo(input):
    return bar_runnable.invoke(input)

foo_runnable = RunnableLambda(foo)
```

LangChain will try to propagate `RunnableConfig` automatically for both of the patterns.

For handling the second pattern, LangChain relies on Python's [contextvars](https://docs.python.org/3/library/contextvars.html).

In Python 3.11 and above, this works out of the box, and you do not need to do anything special to propagate the `RunnableConfig` to the sub-calls.

In Python 3.9 and 3.10, if you are using **async code**, you need to manually pass the `RunnableConfig` through to the `Runnable` when invoking it.

This is due to a limitation in [asyncio's tasks](https://docs.python.org/3/library/asyncio-task.html#asyncio.create_task) in Python 3.9 and 3.10 which did not accept a `context` argument.
Propagating the `RunnableConfig` manually is done like so:

```python
async def foo(input, config): # <-- Note the config argument
    return await bar_runnable.ainvoke(input, config=config)

foo_runnable = RunnableLambda(foo)
```

The attributes will also be propagated to [callbacks](/docs/concepts/callbacks).

:::note
This is an advanced feature that is unnecessary for most users.
:::

You may need to set a custom `run_id` for a given run, in case you want
to reference it later or correlate it with other systems.

The `run_id` MUST be a valid UUID string and **unique** for each run. It is used to identify the run.

```python
import uuid

run_id = uuid.uuid4()

some_runnable.invoke(
    some_input,
    config={
        'run_id': run_id
    }
)
```

In addition, you can use it to specify any custom configuration options to pass to the runnable.

### Setting callbacks

Use this option to configure [callbacks](/docs/concepts/callbacks) for the runnable at runtime. The callbacks will be passed to all sub-calls made by the runnable.

```python
some_runnable.invoke(
    some_input,
    config={
        # `SomeCallbackHandler` is a placeholder for your own handler class
        'callbacks': [SomeCallbackHandler()]
    }
)
```

The `stream()` method returns an iterator that yields chunks of output synchronously as they are produced. You can use a `for` loop to process each chunk in real-time. For example, when using an LLM, this allows the output to be streamed incrementally as it is generated, reducing the wait time for users.

The type of chunk yielded by the `stream()` and `astream()` methods depends on the component being streamed. For example, when streaming from an [LLM](/docs/concepts/chat_models) each component will be an [AIMessageChunk](/docs/concepts/messages#aimessagechunk); however, for other components, the chunk may be different.

The `stream()` method returns an iterator that yields these chunks as they are produced.
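
Streaming from a chat model looks like this (a sketch; `model` is any chat model that supports streaming):

```python
for chunk in model.stream("What color is the sky?"):
    print(chunk.content, end="", flush=True)
```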

<span data-heading-keywords="astream_events,stream_events,stream events"></span>

:::tip
Use the `astream_events` API to access custom data and intermediate outputs from LLM applications built entirely with [LCEL](/docs/concepts/lcel).

While this API is available for use with [LangGraph](/docs/concepts/architecture#langgraph) as well, it is usually not necessary when working with LangGraph, as the `stream` and `astream` methods provide comprehensive streaming capabilities for LangGraph graphs.
:::

```python
from langchain_core.output_parsers import StrOutputParser
from langchain_core.prompts import ChatPromptTemplate
from langchain_anthropic import ChatAnthropic

model = ChatAnthropic(model="claude-3-7-sonnet-20250219")

prompt = ChatPromptTemplate.from_template("tell me a joke about {topic}")
parser = StrOutputParser()

# Compose the components with LCEL
chain = prompt | model | parser
```

LangChain simplifies streaming from [chat models](/docs/concepts/chat_models) by automatically enabling streaming mode in certain cases, even when you're not explicitly calling the streaming methods.

### How It Works

When you call the `invoke` (or `ainvoke`) method on a chat model, LangChain will automatically switch to streaming mode if it detects that you are trying to stream the overall application.

Under the hood, it'll have `invoke` (or `ainvoke`) use the `stream` (or `astream`) method to generate its output. The result of the invocation will be the same as far as the code that was using `invoke` is concerned; however, while the chat model is being streamed, LangChain will take care of invoking `on_llm_new_token` events in LangChain's [callback system](/docs/concepts/callbacks). These callback events allow LangGraph `stream`/`astream` and `astream_events` to surface the chat model's output in real-time.

Example:

```python
def node(state):
    ...
    # The code below uses the invoke method, but LangChain will
    # automatically switch to streaming mode
    # when it detects that the overall
    # application is being streamed.
    ai_message = model.invoke(state["messages"])
    ...

for chunk in compiled_graph.stream(..., mode="messages"):
    ...
```

## Async Programming

# Structured outputs

## Overview

For many applications, such as chatbots, models need to respond to users directly in natural language.
However, there are scenarios where we need models to output in a *structured format*.
For example, we might want to store the model output in a database and ensure that the output conforms to the database schema.
This need motivates the concept of structured output, where models can be instructed to respond with a particular output structure.



## Key concepts

1. **Schema definition:** The output structure is represented as a schema, which can be defined in several ways.<br/>
2. **Returning structured output:** The model is given this schema, and is instructed to return output that conforms to it.

This pseudocode illustrates the recommended workflow when using structured output.
LangChain provides a method, [`with_structured_output()`](/docs/how_to/structured_output/#the-with_structured_output-method), that automates the process of binding the schema to the [model](/docs/concepts/chat_models/) and parsing the output.
This helper function is available for all model providers that support structured output.

```python
# Define schema
schema = {"foo": "bar"}
# Bind schema to model
model_with_structure = model.with_structured_output(schema)
# Invoke the model to produce structured output that matches the schema
structured_output = model_with_structure.invoke(user_input)
```

## Schema definition

The central concept is that the output structure of model responses needs to be represented in some way.
While the types of objects you can use depend on the model you're working with, there are common types of objects that are typically allowed or recommended for structured output in Python.

The simplest and most common format for structured output is a JSON-like structure, which in Python can be represented as a dictionary (dict) or list (list).
JSON objects (or dicts in Python) are often used directly when the tool requires raw, flexible, and minimal-overhead structured data.

```json
{
  "answer": "The answer to the user's question",
  "followup_question": "A followup question the user could ask"
}
```

As a second example, [Pydantic](https://docs.pydantic.dev/latest/) is particularly useful for defining structured output schemas because it offers type hints and validation.
Here's an example of a Pydantic schema:

```python
from pydantic import BaseModel, Field

class ResponseFormatter(BaseModel):
    """Always use this tool to structure your response to the user."""
    answer: str = Field(description="The answer to the user's question")
    followup_question: str = Field(description="A followup question the user could ask")
```

## Returning structured output

With a schema defined, we need a way to instruct the model to use it.
While one approach is to include this schema in the prompt and *ask nicely* for the model to use it, this is not recommended.
Several more powerful methods that utilize native features in the model provider's API are available.

### Using tool calling

```python
# Bind the ResponseFormatter schema as a tool to the model
model_with_tools = model.bind_tools([ResponseFormatter])
# Invoke the model
ai_msg = model_with_tools.invoke("What is the powerhouse of the cell?")
```

The arguments of the tool call are already extracted as a dictionary.
This dictionary can be optionally parsed into a Pydantic object, matching our original `ResponseFormatter` schema.

```python
# Parse the tool call arguments into the original Pydantic schema
pydantic_object = ResponseFormatter.model_validate(ai_msg.tool_calls[0]["args"])
```

### JSON mode

In addition to tool calling, some model providers support a feature called `JSON mode`.
This supports JSON schema definition as input and enforces that the model produces a conforming JSON output.
You can find a table of model providers that support JSON mode [here](/docs/integrations/chat/).
Here is an example of how to use JSON mode with OpenAI:

```python
from langchain_openai import ChatOpenAI

# A sketch: `model_kwargs` passes the provider's JSON-mode flag through to the API
model = ChatOpenAI(model="gpt-4o", model_kwargs={"response_format": {"type": "json_object"}})
ai_msg = model.invoke("Return a JSON object with key 'random_ints' and a value of 10 random ints in [0-99]")
ai_msg
# {'random_ints': [45, 67, 12, 34, 89, 23, 78, 56, 90, 11]}
```

## Structured output method

There are a few challenges when producing structured output with the above methods:

1. When tool calling is used, tool call arguments need to be parsed from a dictionary back to the original schema.<br/>
2. In addition, the model needs to be instructed to *always* use the tool when we want to enforce structured output, which is a provider-specific setting.<br/>
3. When JSON mode is used, the output needs to be parsed into a JSON object.

With these challenges in mind, LangChain provides a helper function (`with_structured_output()`) to streamline the process.



This both binds the schema to the model as a tool and parses the output to the specified output schema.

```python
# Bind the schema to the model
model_with_structure = model.with_structured_output(ResponseFormatter)
# Invoke the model; the output is parsed into a ResponseFormatter object
structured_output = model_with_structure.invoke("What is the powerhouse of the cell?")
```

```python
from langchain_core.messages import AIMessage, ToolCall, convert_to_openai_messages


def test_convert_to_openai_messages():
    ai_message = AIMessage(
        content="",
        tool_calls=[
            ToolCall(name='parrot_multiply_tool', id='1', args={'a': 2, 'b': 3}),
        ],
    )

    result = convert_to_openai_messages(ai_message)

    expected = {
        "role": "assistant",
        "tool_calls": [
            # OpenAI's wire format nests the tool name and JSON-encoded
            # arguments under a "function" key.
            {
                "type": "function",
                "id": "1",
                "function": {
                    "name": "parrot_multiply_tool",
                    "arguments": '{"a": 2, "b": 3}',
                },
            },
        ],
    }
    assert result == expected
```

You are probably looking for the [Chat Model Concept Guide](/docs/concepts/chat_models/).

LangChain has implementations for older language models that take a string as input and return a string as output. These models are typically named without the "Chat" prefix (e.g., `Ollama`, `Anthropic`, `OpenAI`, etc.), and may include the "LLM" suffix (e.g., `OllamaLLM`, `AnthropicLLM`, `OpenAILLM`, etc.). These models implement the [BaseLLM](https://python.langchain.com/api_reference/core/language_models/langchain_core.language_models.llms.BaseLLM.html#langchain_core.language_models.llms.BaseLLM) interface.

Users should be using almost exclusively the newer [Chat Models](/docs/concepts/chat_models) as most model providers have adopted a chat-like interface for interacting with language models.

### Text-structured based

Text is naturally organized into hierarchical units such as paragraphs, sentences, and words.
We can leverage this inherent structure to inform our splitting strategy, creating splits that maintain natural language flow, preserve semantic coherence within each split, and adapt to varying levels of text granularity.
LangChain's [`RecursiveCharacterTextSplitter`](/docs/how_to/recursive_text_splitter/) implements this concept:
- The `RecursiveCharacterTextSplitter` attempts to keep larger units (e.g., paragraphs) intact.

```python
from langchain_text_splitters import RecursiveCharacterTextSplitter

# Chunk sizes are illustrative; `document` is a string defined elsewhere
text_splitter = RecursiveCharacterTextSplitter(chunk_size=100, chunk_overlap=10)
texts = text_splitter.split_text(document)
```

### Document-structured based

Some documents have an inherent structure, such as HTML, Markdown, or JSON files.
In these cases, it's beneficial to split the document based on its structure, as it often naturally groups semantically related text.
Key benefits of structure-based splitting:

- Preserves the logical organization of the document

Examples of structure-based splitting:

- Markdown: split on headers (e.g., `#`, `##`, `###`)
- HTML: split using tags
- JSON: split by object or array elements

### Semantic meaning based

Unlike the previous methods, semantic-based splitting actually considers the *content* of the text.
While other approaches use document or text structure as proxies for semantic meaning, this method directly analyzes the text's semantics.
There are several ways to implement this, but conceptually the approach is to split text when there are significant changes in text *meaning*.
As an example, we can use a sliding window approach to generate embeddings, and compare the embeddings to find significant differences:
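
A minimal sketch, assuming an `embeddings` model with an `embed_documents` method and a `cosine_similarity` helper (both placeholders):

```python
sentences = text.split(".")
vectors = embeddings.embed_documents(sentences)

# Close a split whenever adjacent sentence embeddings diverge sharply
splits, current = [], [sentences[0]]
for prev_vec, vec, sentence in zip(vectors, vectors[1:], sentences[1:]):
    if cosine_similarity(prev_vec, vec) < 0.8:  # threshold is illustrative
        splits.append(". ".join(current))
        current = []
    current.append(sentence)
splits.append(". ".join(current))
```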

According to the OpenAI post, the approximate token counts for English text are as follows:

* 1 token ~= 4 chars in English
* 1 token ~= ¾ words
* 100 tokens ~= 75 words

## Overview

Many AI applications interact directly with humans. In these cases, it is appropriate for models to respond in natural language.
But what about cases where we want a model to also interact *directly* with systems, such as databases or an API?
These systems often have a particular input schema; for example, APIs frequently have a required payload structure.
This need motivates the concept of *tool calling*. You can use [tool calling](https://platform.openai.com/docs/guides/function-calling/example-use-cases) to request model responses that match a particular schema.

:::info
You will sometimes hear the term `function calling`. We use this term interchangeably with `tool calling`.
:::



## Key concepts

1. **Tool Creation:** Use the [@tool](https://python.langchain.com/api_reference/core/tools/langchain_core.tools.convert.tool.html) decorator to create a [tool](/docs/concepts/tools). A tool is an association between a function and its schema.<br/>
2. **Tool Binding:** The tool needs to be connected to a model that supports tool calling. This gives the model awareness of the tool and the associated input schema required by the tool.<br/>

The tool call arguments can be passed directly to the tool.

```python
# Tool creation
tools = [my_tool]
# Tool binding
model_with_tools = model.bind_tools(tools)
# Tool calling
response = model_with_tools.invoke(user_input)
```

## Tool creation

The recommended way to create a tool is using the [@tool](https://python.langchain.com/api_reference/core/tools/langchain_core.tools.convert.tool.html) decorator:

```python
from langchain_core.tools import tool


@tool
def multiply(a: int, b: int) -> int:
    """Multiply a and b."""
    return a * b
```

## Tool binding

[Many](https://platform.openai.com/docs/guides/function-calling) [model providers](https://platform.openai.com/docs/guides/function-calling) support tool calling.

:::tip
See our [model integration page](/docs/integrations/chat/) for a list of providers that support tool calling.
:::

The central concept to understand is that LangChain provides a standardized interface for connecting tools to models.
The `.bind_tools()` method can be used to specify which tools are available for a model to call.

```python
model_with_tools = model.bind_tools(tools_list)
```

However, if we pass an input *relevant to the tool*, the model should choose to call it:

```python
result = llm_with_tools.invoke("What is 2 multiplied by 3?")
```

As before, the output `result` will be an `AIMessage`.
But, if the tool was called, `result` will have a `tool_calls` [attribute](https://python.langchain.com/api_reference/core/messages/langchain_core.messages.ai.AIMessage.html#langchain_core.messages.ai.AIMessage.tool_calls).
This attribute includes everything needed to execute the tool, including the tool name and input arguments:
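
For example (the `id` value shown is illustrative):

```python
result.tool_calls
# [{'name': 'multiply', 'args': {'a': 2, 'b': 3}, 'id': 'call_abc123', 'type': 'tool_call'}]
```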

## Overview

The **tool** abstraction in LangChain associates a Python **function** with a **schema** that defines the function's **name**, **description** and **expected arguments**.

**Tools** can be passed to [chat models](/docs/concepts/chat_models) that support [tool calling](/docs/concepts/tool_calling), allowing the model to request the execution of a specific function with specific inputs.

You can also inspect the tool's schema and other properties:

```python
print(multiply.name) # multiply
print(multiply.description) # Multiply two numbers.
print(multiply.args)
# {
# 'type': 'object',
# 'properties': {'a': {'type': 'integer'}, 'b': {'type': 'integer'}},
# 'required': ['a', 'b']
# }
```

```python
def some_tool(...) -> Tuple[str, Any]:
    """Tool that does something."""
    ...
    return 'Message for chat model', some_artifact
```

See [how to return artifacts from tools](/docs/how_to/tool_artifacts/) for more details.

```python
from langchain_core.tools import tool, InjectedToolArg

@tool
def user_specific_tool(input_data: str, user_id: InjectedToolArg) -> str:
    """Tool that processes input data."""
    return f"User {user_id} processed {input_data}"
```

Annotating the `user_id` argument with `InjectedToolArg` tells LangChain that this argument should not be exposed as part of the tool's schema.

See [how to pass run time values to tools](/docs/how_to/tool_runtime/) for more details on how to use `InjectedToolArg`.

### RunnableConfig

:::info[Note]

This conceptual overview focuses on text-based indexing and retrieval for simplicity.
However, embedding models can be [multi-modal](https://cloud.google.com/vertex-ai/generative-ai/docs/embeddings/get-multimodal-embeddings)
and vector stores can be used to store and retrieve a variety of data types beyond text.
:::

Given a similarity metric to measure the distance between the embedded query and any embedded document, we need an algorithm to efficiently search over *all* the embedded documents to find the most similar ones.
There are various ways to do this. As an example, many vectorstores implement [HNSW (Hierarchical Navigable Small World)](https://www.pinecone.io/learn/series/faiss/hnsw/), a graph-based index structure that allows for efficient similarity search.
Regardless of the search algorithm used under the hood, the LangChain vectorstore interface has a `similarity_search` method for all integrations.
This will take the search query, create an embedding, find similar documents, and return them as a list of [Documents](https://python.langchain.com/api_reference/core/documents/langchain_core.documents.base.Document.html).

```python
vectorstore.similarity_search(
    query,  # a natural language search query string
    k=2,
    filter={"source": "tweet"},
)
```

While algorithms like HNSW provide the foundation for efficient similarity search in many cases, additional techniques can be employed to improve search quality and diversity.
For example, [maximal marginal relevance](https://python.langchain.com/v0.1/docs/modules/model_io/prompts/example_selectors/mmr/) is a re-ranking algorithm used to diversify search results, which is applied after the initial similarity search to ensure a more diverse set of results.
As a second example, some [vector stores](/docs/integrations/retrievers/pinecone_hybrid_search/) offer built-in [hybrid-search](https://docs.pinecone.io/guides/data/understanding-hybrid-search) to combine keyword and semantic similarity search, which marries the benefits of both approaches.
At the moment, there is no unified way to perform hybrid search using LangChain vectorstores, but it is generally exposed as a keyword argument that is passed in with `similarity_search`.
See this [how-to guide on hybrid search](/docs/how_to/hybrid/) for more details.

| Name | When to use | Description |
|------|-------------|-------------|
| [Hybrid search](/docs/integrations/retrievers/pinecone_hybrid_search/) | When combining keyword-based and semantic similarity. | Hybrid search combines keyword and semantic similarity, marrying the benefits of both approaches. [Paper](https://arxiv.org/abs/2210.11934). |
| [Maximal Marginal Relevance (MMR)](https://python.langchain.com/api_reference/pinecone/vectorstores/langchain_pinecone.vectorstores.PineconeVectorStore.html#langchain_pinecone.vectorstores.PineconeVectorStore.max_marginal_relevance_search) | When needing to diversify search results. | MMR attempts to diversify the results of a search to avoid returning similar and redundant documents. |

LangChain exposes a standard interface for key components, making it easy to switch between providers.

3. **Observability and evaluation:** As applications become more complex, it becomes increasingly difficult to understand what is happening within them.
Furthermore, the pace of development can become rate-limited by the [paradox of choice](https://en.wikipedia.org/wiki/Paradox_of_choice).
For example, developers often wonder how to engineer their prompt or which LLM best balances accuracy, latency, and cost.
[Observability](https://en.wikipedia.org/wiki/Observability) and evaluations can help developers monitor their applications and rapidly answer these types of questions with confidence.

As an example, all [chat models](/docs/concepts/chat_models/) implement the [BaseChatModel](https://python.langchain.com/api_reference/core/language_models/langchain_core.language_models.chat_models.BaseChatModel.html) interface.
This provides a standard way to interact with chat models, supporting important but often provider-specific features like [tool calling](/docs/concepts/tool_calling/) and [structured outputs](/docs/concepts/structured_outputs/).

### Example: chat models

Many [model providers](/docs/concepts/chat_models/) support [tool calling](/docs/concepts/tool_calling/), a critical feature for many applications (e.g., [agents](https://langchain-ai.github.io/langgraph/concepts/agentic_concepts/)), that allows a developer to request model responses that match a particular schema.
The APIs for each provider differ.
LangChain's [chat model](/docs/concepts/chat_models/) interface provides a common way to bind [tools](/docs/concepts/tools) to a model in order to support [tool calling](/docs/concepts/tool_calling/):

```python
# Tool creation
tools = [my_tool]
# Tool binding
model_with_tools = model.bind_tools(tools)
```

Similarly, getting models to produce [structured outputs](/docs/concepts/structured_outputs/) is an extremely common use case.
Providers support different approaches for this, including [JSON mode or tool calling](https://platform.openai.com/docs/guides/structured-outputs), with different APIs.
LangChain's [chat model](/docs/concepts/chat_models/) interface provides a common way to produce structured outputs using the `with_structured_output()` method:
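
A minimal sketch (`MySchema` stands in for your own Pydantic model or dict schema):

```python
model_with_structure = model.with_structured_output(MySchema)
structured_output = model_with_structure.invoke(user_input)
```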

The underlying implementation of the retriever depends on the type of data store.

```python
documents = my_retriever.invoke("What is the meaning of life?")
```

## Orchestration

While standardization for individual components is useful, we've increasingly seen that developers want to *combine* components into more complex applications.
This motivates the need for [orchestration](https://en.wikipedia.org/wiki/Orchestration_(computing)).
There are several common characteristics of LLM applications that this orchestration layer should support:

The recommended way to orchestrate components for complex applications is [LangGraph](https://langchain-ai.github.io/langgraph/concepts/high_level/).
LangGraph is a library that gives developers a high degree of control by expressing the flow of the application as a set of nodes and edges.
LangGraph comes with built-in support for [persistence](https://langchain-ai.github.io/langgraph/concepts/persistence/), [human-in-the-loop](https://langchain-ai.github.io/langgraph/concepts/human_in_the_loop/), [memory](https://langchain-ai.github.io/langgraph/concepts/memory/), and other features.
It's particularly well suited for building [agents](https://langchain-ai.github.io/langgraph/concepts/agentic_concepts/) or [multi-agent](https://langchain-ai.github.io/langgraph/concepts/multi_agent/) applications.
Importantly, individual LangChain components can be used as LangGraph nodes, but you can also use LangGraph **without** using LangChain components.

:::info[Further reading]

Have a look at our free course, [Introduction to LangGraph](https://academy.langchain.com/courses/intro-to-langgraph).

:::

## Observability and evaluation

The pace of AI application development is often rate-limited by high-quality evaluations because there is a paradox of choice.
Developers often wonder how to engineer their prompt or which LLM best balances accuracy, latency, and cost.
High quality tracing and evaluations can help you rapidly answer these types of questions with confidence.
[LangSmith](https://docs.smith.langchain.com/) is our platform that supports observability and evaluation for AI applications.
See our conceptual guides on [evaluations](https://docs.smith.langchain.com/concepts/evaluation) and [tracing](https://docs.smith.langchain.com/concepts/tracing) for more details.

# Contribute documentation

Documentation is a vital part of LangChain. We welcome both new documentation for new features and community improvements to our current documentation. Please read the resources below before getting started:

- [Documentation style guide](style_guide.mdx)

Some examples include:

- [Build a Retrieval Augmented Generation (RAG) App](/docs/tutorials/rag/)

A good structural rule of thumb is to follow the structure of this [example from Numpy](https://numpy.org/numpy-tutorials/content/tutorial-svd.html).

Here are some high-level tips on writing a good tutorial:

- Focus on guiding the user to get something done, but keep in mind the end-goal is more to impart principles than to create a perfect production system.
- The first time you mention a LangChain concept, use its full name (e.g. "LangChain Expression Language (LCEL)"), and link to its conceptual/other documentation page.
- It's also helpful to add a prerequisite callout that links to any pages with necessary background information.
- End with a recap/next steps section summarizing what the tutorial covered and future reading, such as related how-to guides.

### How-to guides

A how-to guide, as the name implies, demonstrates how to do something discrete and specific.

The `langchain-community` package is in `libs/community`.

It can be installed with `pip install langchain-community`, and exported members can be imported with code like

```python
from langchain_community.chat_models import ChatParrotLink
from langchain_community.llms import ParrotLinkLLM
from langchain_community.vectorstores import ParrotLinkVectorStore
```

The `community` package relies on manually-installed dependent packages, so you will see errors
if you try to import a package that is not installed. In our fake example, if you tried to import `ParrotLinkLLM` without installing `parrot-link-sdk`, you will see an `ImportError` telling you to install it when trying to use it.

Let's say we wanted to implement a chat model for Parrot Link AI. We would create a new file in `libs/community/langchain_community/chat_models/parrot_link.py` with the following code:
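
A minimal sketch of what that file might contain (the guide's full implementation defines the required chat-model methods):

```python
from langchain_core.language_models.chat_models import BaseChatModel


class ChatParrotLink(BaseChatModel):
    """Chat model for the fictional Parrot Link AI API."""
    # The real file implements the required methods, e.g. `_generate` and `_llm_type`.
    ...
```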
In this guide, we will create a `libs/langchain-parrot-link` folder, simulating the creation
of a partner package for a fake company, "Parrot Link AI".

A package is installed by users with `pip install langchain-{partner}`, and the package members can be imported with code like:

```python
from langchain_parrot_link import ChatParrotLink
```

## (If Necessary) Deprecate community integration

Note: this is only necessary if you're migrating an existing community integration into a partner package. If the component you're integrating is net-new to LangChain (i.e. not already in the `community` package), you can skip this step.

Let's pretend we migrated our `ChatParrotLink` chat model from the community package to the partner package. We would need to deprecate the old model in the community package.

We would do that by adding a `@deprecated` decorator to the old model as follows, in `libs/community/langchain_community/chat_models/parrot_link.py`.

After our change, it would look like this:

```python
from langchain_core._api.deprecation import deprecated

@deprecated(
    since="0.0.<next community version>",
    removal="1.0.0",
    alternative_import="langchain_parrot_link.ChatParrotLink"
)
class ChatParrotLink(BaseChatModel):
    ...
```

---
pagination_prev: contributing/how_to/integrations/index
---

# How to implement an integration package

This guide walks through the process of implementing a LangChain integration
package.

Integration packages are just Python packages that can be installed with `pip install <your-package>`.

We will cover:

1. (Optional) How to bootstrap a new integration package
2. How to implement components, such as [chat models](/docs/concepts/chat_models/) and [vector stores](/docs/concepts/vectorstores/), that adhere to the LangChain interface.

## (Optional) bootstrapping a new integration package
|
||||
|
||||
In this section, we will outline 2 options for bootstrapping a new integration package,
|
||||
In this section, we will outline 2 options for bootstrapping a new integration package,
|
||||
and you're welcome to use other tools if you prefer!
|
||||
|
||||
1. **langchain-cli**: This is a command-line tool that can be used to bootstrap a new integration package with a template for LangChain components and Poetry for dependency management.

@ -132,7 +132,7 @@ We will also add some `test` dependencies in a separate poetry dependency group.
you are not using Poetry, we recommend adding these in a way that won't package them
with your published package, or just installing them separately when you run tests.

`langchain-tests` will provide the [standard tests](../standard_tests) we will use later.
We recommend pinning these to the latest version: <img src="https://img.shields.io/pypi/v/langchain-tests" style={{position:"relative",top:4,left:3}} />

Note: Replace `<latest_version>` with the latest version of `langchain-tests` below.

@ -168,8 +168,8 @@ langchain-parrot-link/
└── README.md
```

All of these files should already exist from step 1, except for
`chat_models.py` and `test_chat_models.py`! We will implement `test_chat_models.py`
later, following the [standard tests](../standard_tests) guide.

For `chat_models.py`, simply paste the contents of the chat model implementation
@ -202,7 +202,7 @@ import CodeBlock from '@theme/CodeBlock';
<Tabs>

<TabItem value="chat_models" label="Chat models">

Refer to the [Custom Chat Model Guide](/docs/how_to/custom_chat_model) for
detail on a starter chat model [implementation](/docs/how_to/custom_chat_model/#implementation).

@ -244,7 +244,7 @@ import ChatModelSource from '../../../../src/theme/integration_template/integrat
base class. This interface consists of methods for writing, deleting and searching
for documents in the vector store.

`VectorStore` supports a variety of synchronous and asynchronous search types (e.g.,
nearest-neighbor or maximum marginal relevance), as well as interfaces for adding
documents to the store. See the [API Reference](https://python.langchain.com/api_reference/core/vectorstores/langchain_core.vectorstores.base.VectorStore.html)
for all supported methods. The required methods are tabulated below:
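
The table itself falls outside this hunk. As a rough illustration only (a toy, in-memory sketch under the assumption that `add_texts`, `similarity_search`, and `from_texts` cover the core of the interface; the "search" here is deliberately trivial and not a real similarity ranking):

```python
from typing import Any, Iterable, List, Optional

from langchain_core.documents import Document
from langchain_core.embeddings import Embeddings
from langchain_core.vectorstores import VectorStore


class ParrotLinkVectorStore(VectorStore):
    """Toy in-memory store; a real implementation would index embeddings."""

    def __init__(self, embedding: Embeddings) -> None:
        self._embedding = embedding
        self._docs: List[Document] = []

    def add_texts(
        self,
        texts: Iterable[str],
        metadatas: Optional[List[dict]] = None,
        **kwargs: Any,
    ) -> List[str]:
        texts = list(texts)
        metadatas = metadatas or [{} for _ in texts]
        ids = []
        for text, meta in zip(texts, metadatas):
            self._docs.append(Document(page_content=text, metadata=meta))
            ids.append(str(len(self._docs) - 1))
        return ids

    def similarity_search(
        self, query: str, k: int = 4, **kwargs: Any
    ) -> List[Document]:
        # Placeholder ranking: a real store embeds `query` and ranks by distance.
        return self._docs[:k]

    @classmethod
    def from_texts(
        cls,
        texts: List[str],
        embedding: Embeddings,
        metadatas: Optional[List[dict]] = None,
        **kwargs: Any,
    ) -> "ParrotLinkVectorStore":
        store = cls(embedding)
        store.add_texts(texts, metadatas)
        return store
```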

@ -331,7 +331,7 @@ or parameters to call the tool with.
2. To take a "tool call" as generated above, and take some action and return a response
   that can be passed back to the chat model as a ToolMessage.

The `Tools` class must inherit from the [BaseTool](https://python.langchain.com/api_reference/core/tools/langchain_core.tools.base.BaseTool.html#langchain_core.tools.base.BaseTool) base class. This interface has 3 properties and 2 methods that should be implemented in a
subclass.

| Method/Property | Description |

@ -355,7 +355,7 @@ important for the initial user experience of the tool.
arguments. This is used to validate the input arguments to the tool, and to provide
a schema for the LLM to fill out when calling the tool. Similar to the `name` and
`description` of the overall Tool class, the fields' names (the variable name) and
description (part of `Field(..., description="description")`) are passed to the LLM,
and the values in these fields should be concise and LLM-usable.
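
Putting these pieces together, a minimal sketch (the tool itself is hypothetical; the `name`/`description`/`args_schema` properties and the `_run` method are the pieces described above):

```python
from typing import Optional, Type

from langchain_core.callbacks import CallbackManagerForToolRun
from langchain_core.tools import BaseTool
from pydantic import BaseModel, Field


class MultiplyInput(BaseModel):
    """Input schema; field names and descriptions are passed to the LLM."""

    a: int = Field(..., description="First number to multiply")
    b: int = Field(..., description="Second number to multiply")


class MultiplyTool(BaseTool):
    name: str = "multiply"
    description: str = "Multiply two integers."
    args_schema: Type[BaseModel] = MultiplyInput

    def _run(
        self,
        a: int,
        b: int,
        run_manager: Optional[CallbackManagerForToolRun] = None,
    ) -> str:
        # The synchronous run method: takes the validated arguments and
        # returns a result that can be wrapped in a ToolMessage.
        return str(a * b)
```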

### Run methods

@ -15,7 +15,7 @@ First, let's install 2 dependencies:

:::note

Because added tests in new versions of `langchain-tests` can break your CI/CD pipelines, we recommend pinning the
version of `langchain-tests` to avoid unexpected changes.

:::

@ -45,7 +45,7 @@ pip install --editable .

## Add and configure standard tests

There are 2 namespaces in the `langchain-tests` package:

- [unit tests](../../../concepts/testing.mdx#unit-tests) (`langchain_tests.unit_tests`): designed to be used to test the component in isolation and without access to external services (a minimal wiring sketch follows this list)
- [integration tests](../../../concepts/testing.mdx#integration-tests) (`langchain_tests.integration_tests`): designed to be used to test the component with access to external services (in particular, the external service that the component is designed to interact with).
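
As a rough illustration of the unit-test wiring (class and property names follow the pattern documented for `langchain-tests`; the `ChatParrotLink` import assumes the package layout from this guide):

```python
"""tests/unit_tests/test_chat_models.py -- a minimal sketch, not verbatim repo code."""
from typing import Type

from langchain_parrot_link.chat_models import ChatParrotLink
from langchain_tests.unit_tests import ChatModelUnitTests


class TestChatParrotLinkUnit(ChatModelUnitTests):
    @property
    def chat_model_class(self) -> Type[ChatParrotLink]:
        # The standard tests instantiate and exercise this class.
        return ChatParrotLink

    @property
    def chat_model_params(self) -> dict:
        # Constructor kwargs used by the suite; values are illustrative.
        return {"model": "bird-brain-001", "temperature": 0}
```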

@ -283,7 +283,7 @@ to specify the tool to be tested and the tool's configuration:
| `tool_constructor_params` | The parameters to pass to the tool (optional). |
| `tool_invoke_params_example` | An example of the parameters to pass to the tool's `invoke` method. |

If you are testing a tool class and pass a class like `MyTool` to `tool_constructor`, you can pass the parameters to the constructor in `tool_constructor_params`.

If you are testing an instantiated tool, you can pass the instantiated tool to `tool_constructor` and do not
override `tool_constructor_params`.
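
For example, a sketch assuming a hypothetical `MyTool` class (the three properties are the ones tabulated above):

```python
from typing import Type

from langchain_tests.unit_tests import ToolsUnitTests

from my_package.tools import MyTool  # hypothetical tool class


class TestMyToolUnit(ToolsUnitTests):
    @property
    def tool_constructor(self) -> Type[MyTool]:
        return MyTool

    @property
    def tool_constructor_params(self) -> dict:
        # Only needed when passing a class above; values are illustrative.
        return {"api_key": "test"}

    @property
    def tool_invoke_params_example(self) -> dict:
        # Example payload for the tool's `invoke` method.
        return {"query": "hello"}
```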

@ -13,7 +13,7 @@ necessary before merging it. Oftentimes, it is more efficient for the
maintainers to make these changes themselves before merging, rather than asking you
to do so in code review.

By default, most pull requests will have a
`✅ Maintainers are allowed to edit this pull request.`
badge in the right-hand sidebar.

@ -2,4 +2,4 @@

- [**Repository Structure**](repo_structure.mdx): Understand the high level structure of the repository.
- [**Review Process**](review_process.mdx): Learn about the review process for pull requests.
- [**Frequently Asked Questions (FAQ)**](faq.mdx): Get answers to common questions about contributing.

@ -8,7 +8,7 @@ This document outlines the process used by the LangChain maintainers for reviewi

We categorize PRs using three main statuses, which are marked as project item statuses in the right sidebar and can be viewed in detail [here](https://github.com/orgs/langchain-ai/projects/12/views/1).

- **Triage**:
  - Initial status for all newly submitted PRs.
  - Requires a maintainer to categorize it into one of the other statuses.

@ -56,32 +56,13 @@
"text": [
"zzzz....\n",
"Hi! I just woke up. Your llm is starting\n",
"Sync handler being called in a `thread_pool_executor`: token: Here\n",
"Sync handler being called in a `thread_pool_executor`: token: 's\n",
"Sync handler being called in a `thread_pool_executor`: token: a\n",
"Sync handler being called in a `thread_pool_executor`: token: little\n",
"Sync handler being called in a `thread_pool_executor`: token: joke\n",
"Sync handler being called in a `thread_pool_executor`: token: for\n",
"Sync handler being called in a `thread_pool_executor`: token: you\n",
"Sync handler being called in a `thread_pool_executor`: token: :\n",
"Sync handler being called in a `thread_pool_executor`: token: \n",
"Sync handler being called in a `thread_pool_executor`: token: Why\n",
"Sync handler being called in a `thread_pool_executor`: token: don't scientists trust atoms?\n",
"\n",
"Why\n",
"Sync handler being called in a `thread_pool_executor`: token: can\n",
"Sync handler being called in a `thread_pool_executor`: token: 't\n",
"Sync handler being called in a `thread_pool_executor`: token: a\n",
"Sync handler being called in a `thread_pool_executor`: token: bicycle\n",
"Sync handler being called in a `thread_pool_executor`: token: stan\n",
"Sync handler being called in a `thread_pool_executor`: token: d up\n",
"Sync handler being called in a `thread_pool_executor`: token: by\n",
"Sync handler being called in a `thread_pool_executor`: token: itself\n",
"Sync handler being called in a `thread_pool_executor`: token: ?\n",
"Sync handler being called in a `thread_pool_executor`: token: Because\n",
"Sync handler being called in a `thread_pool_executor`: token: it\n",
"Sync handler being called in a `thread_pool_executor`: token: 's\n",
"Sync handler being called in a `thread_pool_executor`: token: two\n",
"Sync handler being called in a `thread_pool_executor`: token: -\n",
"Sync handler being called in a `thread_pool_executor`: token: tire\n",
"Because they make up\n",
"Sync handler being called in a `thread_pool_executor`: token: everything!\n",
"Sync handler being called in a `thread_pool_executor`: token: \n",
"zzzz....\n",
"Hi! I just woke up. Your llm is ending\n"
]
@ -89,10 +70,10 @@
{
"data": {
"text/plain": [
"LLMResult(generations=[[ChatGeneration(text=\"Here's a little joke for you:\\n\\nWhy can't a bicycle stand up by itself? Because it's two-tire\", message=AIMessage(content=\"Here's a little joke for you:\\n\\nWhy can't a bicycle stand up by itself? Because it's two-tire\", id='run-8afc89e8-02c0-4522-8480-d96977240bd4-0'))]], llm_output={}, run=[RunInfo(run_id=UUID('8afc89e8-02c0-4522-8480-d96977240bd4'))])"
"LLMResult(generations=[[ChatGeneration(text=\"Why don't scientists trust atoms?\\n\\nBecause they make up everything!\", message=AIMessage(content=\"Why don't scientists trust atoms?\\n\\nBecause they make up everything!\", additional_kwargs={}, response_metadata={'model_name': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None}, id='run--a596349d-8a7c-45fe-8691-bb1f9cfd6c08-0', usage_metadata={'input_tokens': 11, 'output_tokens': 17, 'total_tokens': 28, 'input_token_details': {'cache_creation': 0, 'cache_read': 0}}))]], llm_output={}, run=[RunInfo(run_id=UUID('a596349d-8a7c-45fe-8691-bb1f9cfd6c08'))], type='LLMResult')"
]
},
"execution_count": 2,
"execution_count": 1,
"metadata": {},
"output_type": "execute_result"
}
@ -134,7 +115,7 @@
"# To enable streaming, we pass in `streaming=True` to the ChatModel constructor\n",
"# Additionally, we pass in a list with our custom handler\n",
"chat = ChatAnthropic(\n",
" model=\"claude-3-sonnet-20240229\",\n",
" model=\"claude-3-7-sonnet-20250219\",\n",
" max_tokens=25,\n",
" streaming=True,\n",
" callbacks=[MyCustomSyncHandler(), MyCustomAsyncHandler()],\n",
@ -157,7 +138,7 @@
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"display_name": "langchain",
"language": "python",
"name": "python3"
},
@ -171,7 +152,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.6"
"version": "3.10.16"
}
},
"nbformat": 4,

@ -49,22 +49,28 @@
"execution_count": 1,
"metadata": {},
"outputs": [
{
"name": "stderr",
"output_type": "stream",
"text": [
"Error in LoggingHandler.on_chain_start callback: AttributeError(\"'NoneType' object has no attribute 'get'\")\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"Chain RunnableSequence started\n",
"Chain ChatPromptTemplate started\n",
"Chain ended, outputs: messages=[HumanMessage(content='What is 1 + 2?')]\n",
"Chain ended, outputs: messages=[HumanMessage(content='What is 1 + 2?', additional_kwargs={}, response_metadata={})]\n",
"Chat model started\n",
"Chat model ended, response: generations=[[ChatGeneration(text='1 + 2 = 3', message=AIMessage(content='1 + 2 = 3', response_metadata={'id': 'msg_01NTYMsH9YxkoWsiPYs4Lemn', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 16, 'output_tokens': 13}}, id='run-d6bcfd72-9c94-466d-bac0-f39e456ad6e3-0'))]] llm_output={'id': 'msg_01NTYMsH9YxkoWsiPYs4Lemn', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 16, 'output_tokens': 13}} run=None\n",
"Chain ended, outputs: content='1 + 2 = 3' response_metadata={'id': 'msg_01NTYMsH9YxkoWsiPYs4Lemn', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 16, 'output_tokens': 13}} id='run-d6bcfd72-9c94-466d-bac0-f39e456ad6e3-0'\n"
"Chat model ended, response: generations=[[ChatGeneration(text='The sum of 1 + 2 is 3.', message=AIMessage(content='The sum of 1 + 2 is 3.', additional_kwargs={}, response_metadata={'id': 'msg_01F1qPrmBD9igfzHdqVipmKX', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 16, 'output_tokens': 17, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'}, id='run--71edddf3-2474-42dc-ad43-fadb4882c3c8-0', usage_metadata={'input_tokens': 16, 'output_tokens': 17, 'total_tokens': 33, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}}))]] llm_output={'id': 'msg_01F1qPrmBD9igfzHdqVipmKX', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 16, 'output_tokens': 17, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'} run=None type='LLMResult'\n",
"Chain ended, outputs: content='The sum of 1 + 2 is 3.' additional_kwargs={} response_metadata={'id': 'msg_01F1qPrmBD9igfzHdqVipmKX', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 16, 'output_tokens': 17, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'} id='run--71edddf3-2474-42dc-ad43-fadb4882c3c8-0' usage_metadata={'input_tokens': 16, 'output_tokens': 17, 'total_tokens': 33, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}}\n"
]
},
{
"data": {
"text/plain": [
"AIMessage(content='1 + 2 = 3', response_metadata={'id': 'msg_01NTYMsH9YxkoWsiPYs4Lemn', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 16, 'output_tokens': 13}}, id='run-d6bcfd72-9c94-466d-bac0-f39e456ad6e3-0')"
"AIMessage(content='The sum of 1 + 2 is 3.', additional_kwargs={}, response_metadata={'id': 'msg_01F1qPrmBD9igfzHdqVipmKX', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 16, 'output_tokens': 17, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'}, id='run--71edddf3-2474-42dc-ad43-fadb4882c3c8-0', usage_metadata={'input_tokens': 16, 'output_tokens': 17, 'total_tokens': 33, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}})"
]
},
"execution_count": 1,
@ -101,7 +107,7 @@
"\n",
"\n",
"callbacks = [LoggingHandler()]\n",
"llm = ChatAnthropic(model=\"claude-3-sonnet-20240229\")\n",
"llm = ChatAnthropic(model=\"claude-3-7-sonnet-20250219\")\n",
"prompt = ChatPromptTemplate.from_template(\"What is 1 + {number}?\")\n",
"\n",
"chain = prompt | llm\n",
@ -127,7 +133,7 @@
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"display_name": "langchain",
"language": "python",
"name": "python3"
},
@ -141,7 +147,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.4"
"version": "3.10.16"
}
},
"nbformat": 4,

@ -52,16 +52,16 @@
"output_type": "stream",
"text": [
"Chat model started\n",
"Chat model ended, response: generations=[[ChatGeneration(text='1 + 2 = 3', message=AIMessage(content='1 + 2 = 3', response_metadata={'id': 'msg_01CdKsRmeS9WRb8BWnHDEHm7', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 16, 'output_tokens': 13}}, id='run-2d7fdf2a-7405-4e17-97c0-67e6b2a65305-0'))]] llm_output={'id': 'msg_01CdKsRmeS9WRb8BWnHDEHm7', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 16, 'output_tokens': 13}} run=None\n"
"Chat model ended, response: generations=[[ChatGeneration(text='1 + 2 = 3', message=AIMessage(content='1 + 2 = 3', additional_kwargs={}, response_metadata={'id': 'msg_01DQMbSk263KpY2vouHM5gsC', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 16, 'output_tokens': 13, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'}, id='run--ab896e4e-c3fd-48b1-a41a-b6b525cbc041-0', usage_metadata={'input_tokens': 16, 'output_tokens': 13, 'total_tokens': 29, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}}))]] llm_output={'id': 'msg_01DQMbSk263KpY2vouHM5gsC', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 16, 'output_tokens': 13, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'} run=None type='LLMResult'\n"
]
},
{
"data": {
"text/plain": [
"AIMessage(content='1 + 2 = 3', response_metadata={'id': 'msg_01CdKsRmeS9WRb8BWnHDEHm7', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 16, 'output_tokens': 13}}, id='run-2d7fdf2a-7405-4e17-97c0-67e6b2a65305-0')"
"AIMessage(content='1 + 2 = 3', additional_kwargs={}, response_metadata={'id': 'msg_01DQMbSk263KpY2vouHM5gsC', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 16, 'output_tokens': 13, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'}, id='run--ab896e4e-c3fd-48b1-a41a-b6b525cbc041-0', usage_metadata={'input_tokens': 16, 'output_tokens': 13, 'total_tokens': 29, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}})"
]
},
"execution_count": 18,
"execution_count": 1,
"metadata": {},
"output_type": "execute_result"
}
@ -95,7 +95,7 @@
"\n",
"\n",
"callbacks = [LoggingHandler()]\n",
"llm = ChatAnthropic(model=\"claude-3-sonnet-20240229\", callbacks=callbacks)\n",
"llm = ChatAnthropic(model=\"claude-3-7-sonnet-20250219\", callbacks=callbacks)\n",
"prompt = ChatPromptTemplate.from_template(\"What is 1 + {number}?\")\n",
"\n",
"chain = prompt | llm\n",
@ -119,7 +119,7 @@
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"display_name": "langchain",
"language": "python",
"name": "python3"
},
@ -133,7 +133,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.4"
"version": "3.10.16"
}
},
"nbformat": 4,

@ -42,25 +42,31 @@
"execution_count": 4,
"metadata": {},
"outputs": [
{
"name": "stderr",
"output_type": "stream",
"text": [
"Error in LoggingHandler.on_chain_start callback: AttributeError(\"'NoneType' object has no attribute 'get'\")\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"Chain RunnableSequence started\n",
"Chain ChatPromptTemplate started\n",
"Chain ended, outputs: messages=[HumanMessage(content='What is 1 + 2?')]\n",
"Chain ended, outputs: messages=[HumanMessage(content='What is 1 + 2?', additional_kwargs={}, response_metadata={})]\n",
"Chat model started\n",
"Chat model ended, response: generations=[[ChatGeneration(text='1 + 2 = 3', message=AIMessage(content='1 + 2 = 3', response_metadata={'id': 'msg_01D8Tt5FdtBk5gLTfBPm2tac', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 16, 'output_tokens': 13}}, id='run-bb0dddd8-85f3-4e6b-8553-eaa79f859ef8-0'))]] llm_output={'id': 'msg_01D8Tt5FdtBk5gLTfBPm2tac', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 16, 'output_tokens': 13}} run=None\n",
"Chain ended, outputs: content='1 + 2 = 3' response_metadata={'id': 'msg_01D8Tt5FdtBk5gLTfBPm2tac', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 16, 'output_tokens': 13}} id='run-bb0dddd8-85f3-4e6b-8553-eaa79f859ef8-0'\n"
"Chat model ended, response: generations=[[ChatGeneration(text='1 + 2 = 3', message=AIMessage(content='1 + 2 = 3', additional_kwargs={}, response_metadata={'id': 'msg_019ieJt8K32iC77qBbQmSa9m', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 16, 'output_tokens': 13, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'}, id='run--2f596356-99c9-45ef-94ff-fb170072abdf-0', usage_metadata={'input_tokens': 16, 'output_tokens': 13, 'total_tokens': 29, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}}))]] llm_output={'id': 'msg_019ieJt8K32iC77qBbQmSa9m', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 16, 'output_tokens': 13, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'} run=None type='LLMResult'\n",
"Chain ended, outputs: content='1 + 2 = 3' additional_kwargs={} response_metadata={'id': 'msg_019ieJt8K32iC77qBbQmSa9m', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 16, 'output_tokens': 13, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'} id='run--2f596356-99c9-45ef-94ff-fb170072abdf-0' usage_metadata={'input_tokens': 16, 'output_tokens': 13, 'total_tokens': 29, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}}\n"
]
},
{
"data": {
"text/plain": [
"AIMessage(content='1 + 2 = 3', response_metadata={'id': 'msg_01D8Tt5FdtBk5gLTfBPm2tac', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 16, 'output_tokens': 13}}, id='run-bb0dddd8-85f3-4e6b-8553-eaa79f859ef8-0')"
"AIMessage(content='1 + 2 = 3', additional_kwargs={}, response_metadata={'id': 'msg_019ieJt8K32iC77qBbQmSa9m', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 16, 'output_tokens': 13, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'}, id='run--2f596356-99c9-45ef-94ff-fb170072abdf-0', usage_metadata={'input_tokens': 16, 'output_tokens': 13, 'total_tokens': 29, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}})"
]
},
"execution_count": 4,
"execution_count": 1,
"metadata": {},
"output_type": "execute_result"
}
@ -94,7 +100,7 @@
"\n",
"\n",
"callbacks = [LoggingHandler()]\n",
"llm = ChatAnthropic(model=\"claude-3-sonnet-20240229\")\n",
"llm = ChatAnthropic(model=\"claude-3-7-sonnet-20250219\")\n",
"prompt = ChatPromptTemplate.from_template(\"What is 1 + {number}?\")\n",
"\n",
"chain = prompt | llm\n",
@ -118,7 +124,7 @@
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"display_name": "langchain",
"language": "python",
"name": "python3"
},
@ -132,7 +138,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.5"
"version": "3.10.16"
}
},
"nbformat": 4,

@ -49,33 +49,13 @@
"name": "stdout",
"output_type": "stream",
"text": [
"My custom handler, token: Here\n",
"My custom handler, token: 's\n",
"My custom handler, token: a\n",
"My custom handler, token: bear\n",
"My custom handler, token: joke\n",
"My custom handler, token: for\n",
"My custom handler, token: you\n",
"My custom handler, token: :\n",
"My custom handler, token: \n",
"My custom handler, token: Why\n",
"My custom handler, token: don't bears wear shoes?\n",
"\n",
"Why\n",
"My custom handler, token: di\n",
"My custom handler, token: d the\n",
"My custom handler, token: bear\n",
"My custom handler, token: dissol\n",
"My custom handler, token: ve\n",
"My custom handler, token: in\n",
"My custom handler, token: water\n",
"My custom handler, token: ?\n",
"My custom handler, token: \n",
"Because\n",
"My custom handler, token: it\n",
"My custom handler, token: was\n",
"My custom handler, token: a\n",
"My custom handler, token: polar\n",
"My custom handler, token: bear\n",
"My custom handler, token: !\n"
"Because they\n",
"My custom handler, token: prefer to go bear-foot!\n",
"My custom handler, token: \n"
]
}
],
@ -95,7 +75,7 @@
"# To enable streaming, we pass in `streaming=True` to the ChatModel constructor\n",
"# Additionally, we pass in our custom handler as a list to the callbacks parameter\n",
"model = ChatAnthropic(\n",
" model=\"claude-3-sonnet-20240229\", streaming=True, callbacks=[MyCustomHandler()]\n",
" model=\"claude-3-7-sonnet-20250219\", streaming=True, callbacks=[MyCustomHandler()]\n",
")\n",
"\n",
"chain = prompt | model\n",
@ -119,7 +99,7 @@
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"display_name": "langchain",
"language": "python",
"name": "python3"
},
@ -133,7 +113,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.5"
"version": "3.10.16"
}
},
"nbformat": 4,
@ -4,14 +4,14 @@
|
||||
|
||||
[JSON Lines](https://jsonlines.org/) is a file format where each line is a valid JSON value.
|
||||
|
||||
LangChain implements a [JSONLoader](https://python.langchain.com/api_reference/community/document_loaders/langchain_community.document_loaders.json_loader.JSONLoader.html)
|
||||
to convert JSON and JSONL data into LangChain [Document](https://python.langchain.com/api_reference/core/documents/langchain_core.documents.base.Document.html#langchain_core.documents.base.Document)
|
||||
objects. It uses a specified [jq schema](https://en.wikipedia.org/wiki/Jq_(programming_language)) to parse the JSON files, allowing for the extraction of specific fields into the content
|
||||
LangChain implements a [JSONLoader](https://python.langchain.com/api_reference/community/document_loaders/langchain_community.document_loaders.json_loader.JSONLoader.html)
|
||||
to convert JSON and JSONL data into LangChain [Document](https://python.langchain.com/api_reference/core/documents/langchain_core.documents.base.Document.html#langchain_core.documents.base.Document)
|
||||
objects. It uses a specified [jq schema](https://en.wikipedia.org/wiki/Jq_(programming_language)) to parse the JSON files, allowing for the extraction of specific fields into the content
|
||||
and metadata of the LangChain Document.
|
||||
|
||||
It uses the `jq` python package. Check out this [manual](https://stedolan.github.io/jq/manual/#Basicfilters) for a detailed documentation of the `jq` syntax.
|
||||
|
||||
Here we will demonstrate:
|
||||
Here we will demonstrate:
|
||||
|
||||
- How to load JSON and JSONL data into the content of a LangChain `Document`;
|
||||
- How to load JSON and JSONL data into metadata associated with a `Document`.
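
For instance, a minimal invocation might look like this (the file path and jq filter are illustrative; the loader also requires the `jq` package to be installed):

```python
from langchain_community.document_loaders import JSONLoader

loader = JSONLoader(
    file_path="./example.json",  # illustrative path
    jq_schema=".messages[].content",  # jq filter selecting the text to load
    text_content=False,  # allow non-string values in the selected field
)
docs = loader.load()
```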

@ -194,7 +194,7 @@ pprint(data)

### JSON file with jq schema `content_key`

To load documents from a JSON file using the `content_key` within the jq schema, set `is_content_key_jq_parsable=True`.
Ensure that `content_key` is compatible and can be parsed using the jq schema.

```python

@ -2,14 +2,14 @@

The [Microsoft Office](https://www.office.com/) suite of productivity software includes Microsoft Word, Microsoft Excel, Microsoft PowerPoint, Microsoft Outlook, and Microsoft OneNote. It is available for Microsoft Windows and macOS operating systems. It is also available on Android and iOS.

This covers how to load commonly used file formats including `DOCX`, `XLSX` and `PPTX` documents into a LangChain
[Document](https://python.langchain.com/api_reference/core/documents/langchain_core.documents.base.Document.html#langchain_core.documents.base.Document)
object that we can use downstream.

## Loading DOCX, XLSX, PPTX with AzureAIDocumentIntelligenceLoader

[Azure AI Document Intelligence](https://aka.ms/doc-intelligence) (formerly known as `Azure Form Recognizer`) is a machine-learning
based service that extracts texts (including handwriting), tables, document structures (e.g., titles, section headings, etc.) and key-value-pairs from
digital or scanned PDFs, images, Office and HTML files. Document Intelligence supports `PDF`, `JPEG/JPG`, `PNG`, `BMP`, `TIFF`, `HEIF`, `DOCX`, `XLSX`, `PPTX` and `HTML`.
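
A typical invocation looks roughly like the following sketch (the endpoint, key, and file path are placeholders you would supply from your own Azure resource):

```python
from langchain_community.document_loaders import AzureAIDocumentIntelligenceLoader

loader = AzureAIDocumentIntelligenceLoader(
    api_endpoint="<endpoint>",  # your Document Intelligence endpoint
    api_key="<key>",  # your Document Intelligence key
    file_path="example.docx",  # illustrative path; PDFs, XLSX, PPTX, etc. also work
    api_model="prebuilt-layout",
)
documents = loader.load()
```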

@ -35,7 +35,7 @@
"\n",
"from langchain_anthropic import ChatAnthropic\n",
"\n",
"llm = ChatAnthropic(model=\"claude-3-sonnet-20240229\")"
"llm = ChatAnthropic(model=\"claude-3-7-sonnet-20250219\")"
]
},
{
@ -47,10 +47,10 @@
{
"data": {
"text/plain": [
"\"According to the context provided, Egypt's population in 2024 is estimated to be about 111 million.\""
"\"Egypt's population in 2024 is about 111 million.\""
]
},
"execution_count": 10,
"execution_count": 2,
"metadata": {},
"output_type": "execute_result"
}
@ -142,13 +142,10 @@
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"What\n",
" is\n",
" the\n",
" population\n",
" of\n",
" Egypt\n",
"?\n"
" is the population of Egypt?\n",
"\n"
]
}
],
@ -168,9 +165,9 @@
],
"metadata": {
"kernelspec": {
"display_name": "poetry-venv-2",
"display_name": "langchain",
"language": "python",
"name": "poetry-venv-2"
"name": "python3"
},
"language_info": {
"codemirror_mode": {
@ -182,7 +179,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.1"
"version": "3.10.16"
}
},
"nbformat": 4,

@ -42,7 +42,7 @@
"\n",
"from langchain_anthropic.chat_models import ChatAnthropic\n",
"\n",
"model = ChatAnthropic(model_name=\"claude-3-sonnet-20240229\", temperature=0)"
"model = ChatAnthropic(model_name=\"claude-3-7-sonnet-20250219\", temperature=0)"
]
},
{
@ -200,7 +200,7 @@
{
"data": {
"text/plain": [
"People(people=[Person(name='Anna', height_in_meters=1.83)])"
"People(people=[Person(name='Anna', height_in_meters=1.8288)])"
]
},
"execution_count": 5,
@ -242,7 +242,7 @@
},
{
"cell_type": "code",
"execution_count": 6,
"execution_count": null,
"id": "b1f11912-c1bb-4a2a-a482-79bf3996961f",
"metadata": {
"execution": {
@ -359,14 +359,6 @@
}
},
"outputs": [
{
"name": "stderr",
"output_type": "stream",
"text": [
"/Users/bagatur/langchain/.venv/lib/python3.11/site-packages/pydantic/_internal/_fields.py:201: UserWarning: Field name \"schema\" in \"PromptInput\" shadows an attribute in parent \"BaseModel\"\n",
" warnings.warn(\n"
]
},
{
"data": {
"text/plain": [
@ -397,7 +389,7 @@
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"display_name": "langchain",
"language": "python",
"name": "python3"
},
@ -411,7 +403,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.9"
"version": "3.10.16"
}
},
"nbformat": 4,

@ -23,11 +23,11 @@
{
"data": {
"text/plain": [
"[HumanMessage(content='example input', name='example_user', id='2'),\n",
" HumanMessage(content='real input', name='bob', id='4')]"
"[HumanMessage(content='example input', additional_kwargs={}, response_metadata={}, name='example_user', id='2'),\n",
" HumanMessage(content='real input', additional_kwargs={}, response_metadata={}, name='bob', id='4')]"
]
},
"execution_count": 1,
"execution_count": 2,
"metadata": {},
"output_type": "execute_result"
}
@ -60,12 +60,12 @@
{
"data": {
"text/plain": [
"[SystemMessage(content='you are a good assistant', id='1'),\n",
" HumanMessage(content='real input', name='bob', id='4'),\n",
" AIMessage(content='real output', name='alice', id='5')]"
"[SystemMessage(content='you are a good assistant', additional_kwargs={}, response_metadata={}, id='1'),\n",
" HumanMessage(content='real input', additional_kwargs={}, response_metadata={}, name='bob', id='4'),\n",
" AIMessage(content='real output', additional_kwargs={}, response_metadata={}, name='alice', id='5')]"
]
},
"execution_count": 2,
"execution_count": 3,
"metadata": {},
"output_type": "execute_result"
}
@ -83,12 +83,12 @@
{
"data": {
"text/plain": [
"[HumanMessage(content='example input', name='example_user', id='2'),\n",
" HumanMessage(content='real input', name='bob', id='4'),\n",
" AIMessage(content='real output', name='alice', id='5')]"
"[HumanMessage(content='example input', additional_kwargs={}, response_metadata={}, name='example_user', id='2'),\n",
" HumanMessage(content='real input', additional_kwargs={}, response_metadata={}, name='bob', id='4'),\n",
" AIMessage(content='real output', additional_kwargs={}, response_metadata={}, name='alice', id='5')]"
]
},
"execution_count": 3,
"execution_count": 4,
"metadata": {},
"output_type": "execute_result"
}
@ -126,10 +126,10 @@
{
"data": {
"text/plain": [
"AIMessage(content=[], response_metadata={'id': 'msg_01Wz7gBHahAwkZ1KCBNtXmwA', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 16, 'output_tokens': 3}}, id='run-b5d8a3fe-004f-4502-a071-a6c025031827-0', usage_metadata={'input_tokens': 16, 'output_tokens': 3, 'total_tokens': 19})"
"AIMessage(content=[], additional_kwargs={}, response_metadata={'id': 'msg_01At8GtCiJ79M29yvNwCiQaB', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 16, 'output_tokens': 3, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'}, id='run--b3db2b91-0edf-4c48-99e7-35e641b8229d-0', usage_metadata={'input_tokens': 16, 'output_tokens': 3, 'total_tokens': 19, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}})"
]
},
"execution_count": 4,
"execution_count": 5,
"metadata": {},
"output_type": "execute_result"
}
@ -137,7 +137,7 @@
"source": [
"from langchain_anthropic import ChatAnthropic\n",
"\n",
"llm = ChatAnthropic(model=\"claude-3-sonnet-20240229\", temperature=0)\n",
"llm = ChatAnthropic(model=\"claude-3-7-sonnet-20250219\", temperature=0)\n",
"# Notice we don't pass in messages. This creates\n",
"# a RunnableLambda that takes messages as input\n",
"filter_ = filter_messages(exclude_names=[\"example_user\", \"example_assistant\"])\n",
@ -164,8 +164,9 @@
{
"data": {
"text/plain": [
"[HumanMessage(content='real input', name='bob', id='4'),\n",
" AIMessage(content='real output', name='alice', id='5')]"
"[SystemMessage(content='you are a good assistant', additional_kwargs={}, response_metadata={}, id='1'),\n",
" HumanMessage(content='real input', additional_kwargs={}, response_metadata={}, name='bob', id='4'),\n",
" AIMessage(content='real output', additional_kwargs={}, response_metadata={}, name='alice', id='5')]"
]
},
"execution_count": 6,
@ -190,9 +191,9 @@
],
"metadata": {
"kernelspec": {
"display_name": "poetry-venv-2",
"display_name": "langchain",
"language": "python",
"name": "poetry-venv-2"
"name": "python3"
},
"language_info": {
"codemirror_mode": {
@ -204,7 +205,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.1"
"version": "3.10.16"
}
},
"nbformat": 4,

@ -172,7 +172,7 @@ Indexing is the process of keeping your vectorstore in-sync with the underlying

### Tools

LangChain [Tools](/docs/concepts/tools) contain a description of the tool (to pass to the language model) as well as the implementation of the function to call. Refer [here](/docs/integrations/tools/) for a list of pre-built tools; a minimal custom tool is sketched after this list.

- [How to: create tools](/docs/how_to/custom_tools)
- [How to: use built-in tools and toolkits](/docs/how_to/tools_builtin)
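
For orientation, the `@tool` decorator from `langchain_core` (covered in the create-tools guide linked above) turns a plain function into a tool, deriving the name, description, and argument schema from the signature and docstring:

```python
from langchain_core.tools import tool


@tool
def multiply(a: int, b: int) -> int:
    """Multiply two integers."""
    return a * b


# Tools are invoked with a dict matching the derived argument schema.
print(multiply.invoke({"a": 2, "b": 3}))  # -> 6
```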

@ -97,20 +97,13 @@
"id": "6d5a0283-11f8-435b-b27b-7b18f7693592",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Note: you may need to restart the kernel to use updated packages.\n"
]
},
{
"data": {
"text/plain": [
"AIMessage(content=[], additional_kwargs={}, response_metadata={'id': 'msg_01KNGUMTuzBVfwNouLDpUMwf', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 84, 'output_tokens': 3}}, id='run-b908b198-9c24-450b-9749-9d4a8182937b-0', usage_metadata={'input_tokens': 84, 'output_tokens': 3, 'total_tokens': 87})"
"AIMessage(content='\\n\\nAs for the actual answer, LangChain is named for connecting (chaining) language models together with other components. And Harrison Chase is one of the co-founders of LangChain, not someone being chased! \\n\\nBut I like to think he\\'s running after runaway tokens that escaped from the embedding space. \"Come back here, you vectors!\"', additional_kwargs={}, response_metadata={'id': 'msg_018MF8xBrM1ztw69XTx3Uxcy', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 84, 'output_tokens': 80, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'}, id='run--caa1b9d6-a554-40ad-95cd-268938d8223b-0', usage_metadata={'input_tokens': 84, 'output_tokens': 80, 'total_tokens': 164, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}})"
]
},
"execution_count": 9,
"execution_count": 2,
"metadata": {},
"output_type": "execute_result"
}
@ -118,7 +111,7 @@
"source": [
"from langchain_anthropic import ChatAnthropic\n",
"\n",
"llm = ChatAnthropic(model=\"claude-3-sonnet-20240229\", temperature=0)\n",
"llm = ChatAnthropic(model=\"claude-3-7-sonnet-20250219\", temperature=0)\n",
"# Notice we don't pass in messages. This creates\n",
"# a RunnableLambda that takes messages as input\n",
"merger = merge_message_runs()\n",
@ -150,7 +143,7 @@
" AIMessage(content='Well, I guess they thought \"WordRope\" and \"SentenceString\" just didn\\'t have the same ring to it!\\nWhy, he\\'s probably chasing after the last cup of coffee in the office!', additional_kwargs={}, response_metadata={})]"
]
},
"execution_count": 10,
"execution_count": 3,
"metadata": {},
"output_type": "execute_result"
}
@ -176,10 +169,10 @@
{
"data": {
"text/plain": [
"AIMessage(content='A convergent series is an infinite series whose partial sums approach a finite value as more terms are added. In other words, the sequence of partial sums has a limit.\\n\\nMore formally, an infinite series Σ an (where an are the terms of the series) is said to be convergent if the sequence of partial sums:\\n\\nS1 = a1\\nS2 = a1 + a2 \\nS3 = a1 + a2 + a3\\n...\\nSn = a1 + a2 + a3 + ... + an\\n...\\n\\nconverges to some finite number S as n goes to infinity. We write:\\n\\nlim n→∞ Sn = S\\n\\nThe finite number S is called the sum of the convergent infinite series.\\n\\nIf the sequence of partial sums does not approach any finite limit, the infinite series is said to be divergent.\\n\\nSome key properties:\\n- A series converges if and only if the sequence of its partial sums is a Cauchy sequence.\\n- Absolute/conditional convergence criteria help determine if a given series converges.\\n- Convergent series have many important applications in mathematics, physics, engineering etc.', additional_kwargs={}, response_metadata={'id': 'msg_01MfV6y2hep7ZNvDz24A36U4', 'model': 'claude-3-sonnet-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 29, 'output_tokens': 267}}, id='run-9d925f58-021e-4bd0-94fc-f8f5e91010a4-0', usage_metadata={'input_tokens': 29, 'output_tokens': 267, 'total_tokens': 296})"
"AIMessage(content=\"# Definition of a Convergent Series\\n\\nA series is a sum of terms in a sequence, typically written as:\\n\\n$$\\\\sum_{n=1}^{\\\\infty} a_n = a_1 + a_2 + a_3 + \\\\ldots$$\\n\\nA series is called **convergent** if the sequence of partial sums approaches a finite limit.\\n\\n## Formal Definition\\n\\nLet's define the sequence of partial sums:\\n$$S_N = \\\\sum_{n=1}^{N} a_n = a_1 + a_2 + \\\\ldots + a_N$$\\n\\nA series $\\\\sum_{n=1}^{\\\\infty} a_n$ is convergent if and only if:\\n- The limit of the partial sums exists and is finite\\n- That is, there exists a finite number $S$ such that $\\\\lim_{N \\\\to \\\\infty} S_N = S$\\n\\nIf this limit exists, we say the series converges to $S$, and we write:\\n$$\\\\sum_{n=1}^{\\\\infty} a_n = S$$\\n\\nIf the limit doesn't exist or is infinite, the series is called divergent.\", additional_kwargs={}, response_metadata={'id': 'msg_018ypyi2MTjV6S7jCydSqDn9', 'model': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 0, 'cache_read_input_tokens': 0, 'input_tokens': 29, 'output_tokens': 273, 'server_tool_use': None, 'service_tier': 'standard'}, 'model_name': 'claude-3-7-sonnet-20250219'}, id='run--5de0ca29-d031-48f7-bc75-671eade20b74-0', usage_metadata={'input_tokens': 29, 'output_tokens': 273, 'total_tokens': 302, 'input_token_details': {'cache_read': 0, 'cache_creation': 0}})"
]
},
"execution_count": 14,
"execution_count": 4,
"metadata": {},
"output_type": "execute_result"
}
@ -203,7 +196,7 @@
"id": "51ba533a-43c7-4e5f-bd91-a4ec23ceeb34",
"metadata": {},
"source": [
"LangSmith Trace: https://smith.langchain.com/public/432150b6-9909-40a7-8ae7-944b7e657438/r/f4ad5fb2-4d38-42a6-b780-25f62617d53f"
"[LangSmith Trace](https://smith.langchain.com/public/432150b6-9909-40a7-8ae7-944b7e657438/r/f4ad5fb2-4d38-42a6-b780-25f62617d53f)"
]
},
{
@ -213,15 +206,15 @@
"source": [
"## API reference\n",
"\n",
"For a complete description of all arguments head to the API reference: https://python.langchain.com/api_reference/core/messages/langchain_core.messages.utils.merge_message_runs.html"
"For a complete description of all arguments head to the [API reference](https://python.langchain.com/api_reference/core/messages/langchain_core.messages.utils.merge_message_runs.html)"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "poetry-venv-2",
"display_name": "langchain",
"language": "python",
"name": "poetry-venv-2"
"name": "python3"
},
"language_info": {
"codemirror_mode": {
@ -233,7 +226,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.9"
"version": "3.10.16"
}
},
"nbformat": 4,

@ -23,17 +23,18 @@
{
"data": {
"text/plain": [
"{'token_usage': {'completion_tokens': 110,\n",
"{'token_usage': {'completion_tokens': 93,\n",
" 'prompt_tokens': 16,\n",
" 'total_tokens': 126,\n",
" 'total_tokens': 109,\n",
" 'completion_tokens_details': {'accepted_prediction_tokens': 0,\n",
" 'audio_tokens': 0,\n",
" 'reasoning_tokens': 0,\n",
" 'rejected_prediction_tokens': 0},\n",
" 'prompt_tokens_details': {'audio_tokens': 0, 'cached_tokens': 0}},\n",
" 'model_name': 'gpt-4o-mini-2024-07-18',\n",
" 'system_fingerprint': 'fp_b8bc95a0ac',\n",
" 'id': 'chatcmpl-BDrISvLar6AqcZngBmhajFZXVc2u9',\n",
" 'system_fingerprint': 'fp_34a54ae93c',\n",
" 'id': 'chatcmpl-ByJtse6I3U1lmVyPscLCjzydCvfDO',\n",
" 'service_tier': 'default',\n",
" 'finish_reason': 'stop',\n",
" 'logprobs': None}"
]
@ -68,15 +69,17 @@
{
"data": {
"text/plain": [
"{'id': 'msg_01JHnvPqgERY7MZwrvfkmq52',\n",
" 'model': 'claude-3-5-sonnet-20241022',\n",
"{'id': 'msg_017S9H7GMwA5RdZ1wHxzXoeX',\n",
" 'model': 'claude-3-7-sonnet-20250219',\n",
" 'stop_reason': 'end_turn',\n",
" 'stop_sequence': None,\n",
" 'usage': {'cache_creation_input_tokens': 0,\n",
" 'cache_read_input_tokens': 0,\n",
" 'input_tokens': 17,\n",
" 'output_tokens': 221},\n",
" 'model_name': 'claude-3-5-sonnet-20241022'}"
" 'output_tokens': 180,\n",
" 'server_tool_use': None,\n",
" 'service_tier': 'standard'},\n",
" 'model_name': 'claude-3-7-sonnet-20250219'}"
]
},
"execution_count": 2,
@ -87,7 +90,7 @@
"source": [
"from langchain_anthropic import ChatAnthropic\n",
"\n",
"llm = ChatAnthropic(model=\"claude-3-5-sonnet-latest\")\n",
"llm = ChatAnthropic(model=\"claude-3-7-sonnet-20250219\")\n",
"msg = llm.invoke(\"What's the oldest known example of cuneiform\")\n",
"msg.response_metadata"
]
@ -173,7 +176,7 @@
"source": [
"from langchain_aws import ChatBedrockConverse\n",
"\n",
"llm = ChatBedrockConverse(model=\"anthropic.claude-3-sonnet-20240229-v1:0\")\n",
"llm = ChatBedrockConverse(model=\"anthropic.claude-3-7-sonnet-20250219-v1:0\")\n",
"msg = llm.invoke(\"What's the oldest known example of cuneiform\")\n",
"msg.response_metadata"
]
@ -301,7 +304,7 @@
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"display_name": "langchain",
"language": "python",
"name": "python3"
},
@ -315,7 +318,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.4"
"version": "3.10.16"
}
},
"nbformat": 4,
@ -61,7 +61,7 @@
|
||||
"if \"ANTHROPIC_API_KEY\" not in os.environ:\n",
|
||||
" os.environ[\"ANTHROPIC_API_KEY\"] = getpass()\n",
|
||||
"\n",
|
||||
"model = ChatAnthropic(model=\"claude-3-sonnet-20240229\", temperature=0)"
|
||||
"model = ChatAnthropic(model=\"claude-3-7-sonnet-20250219\", temperature=0)"
|
||||
]
|
||||
},
|
||||
{
|
||||
@ -93,7 +93,7 @@
|
||||
{
|
||||
"data": {
|
||||
"text/plain": [
|
||||
"\"Here's a bear joke for you:\\n\\nWhy did the bear dissolve in water?\\nBecause it was a polar bear!\""
|
||||
"\"Why don't bears wear shoes?\\n\\nBecause they prefer to go bear-foot!\""
|
||||
]
|
||||
},
|
||||
"execution_count": 3,
|
||||
@ -128,7 +128,7 @@
|
||||
{
|
||||
"data": {
|
||||
"text/plain": [
|
||||
"'Haha, that\\'s a clever play on words! Using \"polar\" to imply the bear dissolved or became polar/polarized when put in water. Not the most hilarious joke ever, but it has a cute, groan-worthy pun that makes it mildly amusing. I appreciate a good pun or wordplay joke.'"
|
||||
"'Yes, that\\'s a funny joke! It\\'s a classic pun that plays on the homophone pair \"bare-foot\" and \"bear-foot.\" The humor comes from:\\n\\n1. The wordplay between \"barefoot\" (not wearing shoes) and \"bear-foot\" (the foot of a bear)\\n2. The logical connection to the setup (bears don\\'t wear shoes)\\n3. It\\'s family-friendly and accessible\\n4. It\\'s a simple, clean pun that creates an unexpected but satisfying punchline\\n\\nIt\\'s the kind of joke that might make you groan and smile at the same time - what people often call a \"dad joke.\"'"
|
||||
]
|
||||
},
|
||||
"execution_count": 4,
|
||||
@ -161,7 +161,7 @@
|
||||
{
|
||||
"data": {
|
||||
"text/plain": [
|
||||
"\"Haha, that's a cute and punny joke! I like how it plays on the idea of beets blushing or turning red like someone blushing. Food puns can be quite amusing. While not a total knee-slapper, it's a light-hearted, groan-worthy dad joke that would make me chuckle and shake my head. Simple vegetable humor!\""
|
||||
"'Yes, that\\'s a cute and funny joke! It works well because:\\n\\n1. It plays on the double meaning of \"roots\" - both the literal roots of the beet plant and the metaphorical sense of knowing one\\'s origins or foundation\\n2. It\\'s a simple, clean pun that doesn\\'t rely on offensive content\\n3. It has a satisfying logical connection (beets are root vegetables)\\n\\nIt\\'s the kind of wholesome food pun that might make people groan a little but also smile. Perfect for sharing in casual conversation or with kids!'"
|
||||
]
|
||||
},
|
||||
"execution_count": 5,
|
||||
@ -205,7 +205,7 @@
|
||||
{
|
||||
"data": {
|
||||
"text/plain": [
|
||||
"\"I cannot reproduce any copyrighted material verbatim, but I can try to analyze the humor in the joke you provided without quoting it directly.\\n\\nThe joke plays on the idea that the Cylon raiders, who are the antagonists in the Battlestar Galactica universe, failed to locate the human survivors after attacking their home planets (the Twelve Colonies) due to using an outdated and poorly performing operating system (Windows Vista) for their targeting systems.\\n\\nThe humor stems from the juxtaposition of a futuristic science fiction setting with a relatable real-world frustration – the use of buggy, slow, or unreliable software or technology. It pokes fun at the perceived inadequacies of Windows Vista, which was widely criticized for its performance issues and other problems when it was released.\\n\\nBy attributing the Cylons' failure to locate the humans to their use of Vista, the joke creates an amusing and unexpected connection between a fictional advanced race of robots and a familiar technological annoyance experienced by many people in the real world.\\n\\nOverall, the joke relies on incongruity and relatability to generate humor, but without reproducing any copyrighted material directly.\""
|
||||
"\"This joke is moderately funny! It plays on Battlestar Galactica lore where Cylons are robots with 12 different models trying to infiltrate human society. The humor comes from the idea of a Cylon accidentally revealing their non-human nature through a pickup line that references their artificial origins. It's a decent nerd-culture joke that would land well with fans of the show, though someone unfamiliar with Battlestar Galactica might not get the reference. The punchline effectively highlights the contradiction in a Cylon trying to blend in while simultaneously revealing their true identity.\""
|
||||
]
|
||||
},
|
||||
"execution_count": 6,
|
||||
@ -256,7 +256,7 @@
|
||||
],
|
||||
"metadata": {
|
||||
"kernelspec": {
|
||||
"display_name": "Python 3 (ipykernel)",
|
||||
"display_name": "langchain",
|
||||
"language": "python",
|
||||
"name": "python3"
|
||||
},
|
||||
@ -270,7 +270,7 @@
|
||||
"name": "python",
|
||||
"nbconvert_exporter": "python",
|
||||
"pygments_lexer": "ipython3",
|
||||
"version": "3.9.1"
|
||||
"version": "3.10.16"
|
||||
}
|
||||
},
|
||||
"nbformat": 4,
|
||||
|
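The joke-analysis strings recorded above are consistent with the pattern this notebook demonstrates: composing two LCEL chains so that the first chain's output (a joke) feeds a second prompt that critiques it. A minimal sketch of that pattern, assuming the `ChatAnthropic` model from the updated cell; the exact prompt wording is an assumption:

```python
from langchain_anthropic import ChatAnthropic
from langchain_core.output_parsers import StrOutputParser
from langchain_core.prompts import ChatPromptTemplate

model = ChatAnthropic(model="claude-3-7-sonnet-20250219", temperature=0)

# First chain: generate a joke about a topic.
prompt = ChatPromptTemplate.from_template("tell me a joke about {topic}")
chain = prompt | model | StrOutputParser()

# Second chain: pipe the generated joke into an analysis prompt.
# The dict is coerced into a RunnableParallel that fills the {joke} variable.
analysis_prompt = ChatPromptTemplate.from_template("is this a funny joke? {joke}")
composed_chain = {"joke": chain} | analysis_prompt | model | StrOutputParser()

print(composed_chain.invoke({"topic": "bears"}))
```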
@ -79,18 +79,7 @@
|
||||
"execution_count": 1,
|
||||
"id": "f123bdcb-8c8b-440c-9bbd-aa5ed4e9cd17",
|
||||
"metadata": {},
|
||||
"outputs": [
|
||||
{
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"\n",
|
||||
"\u001b[1m[\u001b[0m\u001b[34;49mnotice\u001b[0m\u001b[1;39;49m]\u001b[0m\u001b[39;49m A new release of pip is available: \u001b[0m\u001b[31;49m23.2.1\u001b[0m\u001b[39;49m -> \u001b[0m\u001b[32;49m24.0\u001b[0m\n",
|
||||
"\u001b[1m[\u001b[0m\u001b[34;49mnotice\u001b[0m\u001b[1;39;49m]\u001b[0m\u001b[39;49m To update, run: \u001b[0m\u001b[32;49mpython -m pip install --upgrade pip\u001b[0m\n",
|
||||
"Note: you may need to restart the kernel to use updated packages.\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
"outputs": [],
|
||||
"source": [
|
||||
"# | output: false\n",
|
||||
"# | echo: false\n",
|
||||
@ -110,7 +99,7 @@
|
||||
"\n",
|
||||
"from langchain_anthropic import ChatAnthropic\n",
|
||||
"\n",
|
||||
"model = ChatAnthropic(model=\"claude-3-sonnet-20240229\", temperature=0)"
|
||||
"model = ChatAnthropic(model=\"claude-3-7-sonnet-20250219\", temperature=0)"
|
||||
]
|
||||
},
|
||||
{
|
||||
@ -131,7 +120,14 @@
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"The| sky| appears| blue| during| the| day|.|"
|
||||
"|The sky typically| appears blue during the day due to a phenomenon| called Rayleigh scattering, where| air molecules scatter sunlight, with| blue light being scattered more than other colors. However|, the sky's color can vary|:\n",
|
||||
"\n",
|
||||
"- At sunrise/sunset:| Red, orange, pink, or purple\n",
|
||||
"-| During storms: Gray or dark blue|\n",
|
||||
"- At night: Dark| blue to black\n",
|
||||
"- In certain| atmospheric conditions: White, yellow, or green| (rare)\n",
|
||||
"\n",
|
||||
"The color we perceive depends| on weather conditions, time of day, pollution| levels, and our viewing angle.||"
|
||||
]
|
||||
}
|
||||
],
|
||||
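The token-by-token text above, with `|` between fragments, is the shape produced by printing each streamed chunk with a `|` separator. A sketch of the loop that produces it, assuming the same `model` as in the updated cell:

```python
from langchain_anthropic import ChatAnthropic

model = ChatAnthropic(model="claude-3-7-sonnet-20250219", temperature=0)

# Each chunk is an AIMessageChunk; end="|" reproduces the separators
# visible in the recorded output.
for chunk in model.stream("what color is the sky?"):
    print(chunk.content, end="|", flush=True)
```

The async variant is identical except for `async for chunk in model.astream(...)`.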
@ -160,7 +156,7 @@
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"The| sky| appears| blue| during| the| day|.|"
|
||||
"|The sky typically| appears blue during the day due to a phenomenon called Rayleigh| scattering, where air molecules scatter sunlight,| with blue light being scattered more than other colors. However|, the sky's color can vary - appearing re|d, orange, or pink during sunrise and sunset,| gray on cloudy days, and black at night.| The color you see depends on the time of| day, weather conditions, and your location.||"
|
||||
]
|
||||
}
|
||||
],
|
||||
@ -188,10 +184,10 @@
|
||||
{
|
||||
"data": {
|
||||
"text/plain": [
|
||||
"AIMessageChunk(content='The', id='run-b36bea64-5511-4d7a-b6a3-a07b3db0c8e7')"
|
||||
"AIMessageChunk(content='', additional_kwargs={}, response_metadata={'model_name': 'claude-3-7-sonnet-20250219'}, id='run--c4f01565-8bb4-4f07-9b23-acfe46cbeca3', usage_metadata={'input_tokens': 13, 'output_tokens': 0, 'total_tokens': 13, 'input_token_details': {'cache_creation': 0, 'cache_read': 0}})"
|
||||
]
|
||||
},
|
||||
"execution_count": 4,
|
||||
"execution_count": 5,
|
||||
"metadata": {},
|
||||
"output_type": "execute_result"
|
||||
}
|
||||
@ -219,10 +215,10 @@
|
||||
{
|
||||
"data": {
|
||||
"text/plain": [
|
||||
"AIMessageChunk(content='The sky appears blue during', id='run-b36bea64-5511-4d7a-b6a3-a07b3db0c8e7')"
|
||||
"AIMessageChunk(content='The sky typically appears blue during the day due to a phenomenon called Rayleigh scattering, where air molecules scatter sunlight, with blue light being scattered more than other colors. However', additional_kwargs={}, response_metadata={'model_name': 'claude-3-7-sonnet-20250219'}, id='run--c4f01565-8bb4-4f07-9b23-acfe46cbeca3', usage_metadata={'input_tokens': 13, 'output_tokens': 0, 'total_tokens': 13, 'input_token_details': {'cache_creation': 0, 'cache_read': 0}})"
|
||||
]
|
||||
},
|
||||
"execution_count": 5,
|
||||
"execution_count": 6,
|
||||
"metadata": {},
|
||||
"output_type": "execute_result"
|
||||
}
|
||||
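The `AIMessageChunk` results above illustrate that chunks are additive: summing a prefix of the stream yields a partial message whose `content` is the concatenated text so far. A sketch, to be run in an async context such as a notebook cell:

```python
from langchain_anthropic import ChatAnthropic

model = ChatAnthropic(model="claude-3-7-sonnet-20250219", temperature=0)

chunks = []
async for chunk in model.astream("what color is the sky?"):
    chunks.append(chunk)

# AIMessageChunk supports "+", so a prefix of the stream can be merged
# back into a single partial message.
partial = chunks[0] + chunks[1] + chunks[2]
print(partial.content)
```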
@ -259,17 +255,9 @@
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"Here|'s| a| joke| about| a| par|rot|:|\n",
|
||||
"|Why| don't parrots use the internet?\n",
|
||||
"\n",
|
||||
"A man| goes| to| a| pet| shop| to| buy| a| par|rot|.| The| shop| owner| shows| him| two| stunning| pa|rr|ots| with| beautiful| pl|um|age|.|\n",
|
||||
"\n",
|
||||
"\"|There|'s| a| talking| par|rot| an|d a| non|-|talking| par|rot|,\"| the| owner| says|.| \"|The| talking| par|rot| costs| $|100|,| an|d the| non|-|talking| par|rot| is| $|20|.\"|\n",
|
||||
"\n",
|
||||
"The| man| says|,| \"|I|'ll| take| the| non|-|talking| par|rot| at| $|20|.\"|\n",
|
||||
"\n",
|
||||
"He| pays| an|d leaves| with| the| par|rot|.| As| he|'s| walking| down| the| street|,| the| par|rot| looks| up| at| him| an|d says|,| \"|You| know|,| you| really| are| a| stupi|d man|!\"|\n",
|
||||
"\n",
|
||||
"The| man| is| stun|ne|d an|d looks| at| the| par|rot| in| dis|bel|ief|.| The| par|rot| continues|,| \"|Yes|,| you| got| r|ippe|d off| big| time|!| I| can| talk| just| as| well| as| that| other| par|rot|,| an|d you| only| pai|d $|20| |for| me|!\"|"
|
||||
"They|'re afraid of getting a virus from all the tweets|!||"
|
||||
]
|
||||
}
|
||||
],
|
||||
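The parrot-joke stream above runs through an entire chain rather than a bare model: every LCEL component exposes `.astream()`, so the prompt, model, and output parser each pass chunks along as they arrive. A sketch, with the prompt wording assumed:

```python
from langchain_anthropic import ChatAnthropic
from langchain_core.output_parsers import StrOutputParser
from langchain_core.prompts import ChatPromptTemplate

model = ChatAnthropic(model="claude-3-7-sonnet-20250219", temperature=0)
prompt = ChatPromptTemplate.from_template("tell me a joke about {topic}")

# The parser converts message chunks to plain strings as they stream.
chain = prompt | model | StrOutputParser()

async for chunk in chain.astream({"topic": "parrot"}):
    print(chunk, end="|", flush=True)
```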
@ -337,26 +325,14 @@
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"{}\n",
|
||||
"{'countries': []}\n",
|
||||
"{'countries': [{}]}\n",
|
||||
"{'countries': [{'name': ''}]}\n",
|
||||
"{'countries': [{'name': 'France'}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413000}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413000}, {}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413000}, {'name': ''}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain'}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain', 'population': 47}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain', 'population': 47351}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain', 'population': 47351567}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain', 'population': 47351567}, {}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain', 'population': 47351567}, {'name': ''}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain', 'population': 47351567}, {'name': 'Japan'}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain', 'population': 47351567}, {'name': 'Japan', 'population': 125}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain', 'population': 47351567}, {'name': 'Japan', 'population': 125584}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain', 'population': 47351567}, {'name': 'Japan', 'population': 125584000}]}\n"
|
||||
"{'countries': [{'name': 'France', 'population': 67750}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67750000}, {}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain'}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}, {}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}, {'name': 'Japan'}]}\n",
|
||||
"{'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}, {'name': 'Japan', 'population': 125700000}]}\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
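The incrementally growing `{'countries': [...]}` dicts above come from streaming model output through `JsonOutputParser`, which operates on the input stream and emits a best-effort parse of the partial JSON after each chunk. A sketch using the same prompt text as the recorded run:

```python
from langchain_anthropic import ChatAnthropic
from langchain_core.output_parsers import JsonOutputParser

model = ChatAnthropic(model="claude-3-7-sonnet-20250219", temperature=0)
chain = model | JsonOutputParser()

question = (
    "output a list of the countries france, spain and japan and their "
    "populations in JSON format. Use a dict with an outer key of "
    '"countries" which contains a list of countries. Each country should '
    "have the key `name` and `population`"
)

# Each iteration yields the parser's current view of the partial JSON.
async for partial_json in chain.astream(question):
    print(partial_json)
```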
@ -539,11 +515,11 @@
|
||||
{
|
||||
"data": {
|
||||
"text/plain": [
|
||||
"[[Document(page_content='harrison worked at kensho'),\n",
|
||||
" Document(page_content='harrison likes spicy food')]]"
|
||||
"[[Document(id='2740a247-9738-48c4-8c8f-d879d4ed39f7', metadata={}, page_content='harrison worked at kensho'),\n",
|
||||
" Document(id='1d3d012f-1cb0-4bee-928a-c8bf0f8b1b92', metadata={}, page_content='harrison likes spicy food')]]"
|
||||
]
|
||||
},
|
||||
"execution_count": 10,
|
||||
"execution_count": 12,
|
||||
"metadata": {},
|
||||
"output_type": "execute_result"
|
||||
}
|
||||
@ -614,15 +590,15 @@
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"Base|d on| the| given| context|,| Harrison| worke|d at| K|ens|ho|.|\n",
|
||||
"|Base|d on the provided context, Harrison worked at Kens|ho.\n",
|
||||
"\n",
|
||||
"Here| are| |3| |made| up| sentences| about| this| place|:|\n",
|
||||
"Three made up sentences about Kens|ho:\n",
|
||||
"\n",
|
||||
"1|.| K|ens|ho| was| a| cutting|-|edge| technology| company| known| for| its| innovative| solutions| in| artificial| intelligence| an|d data| analytics|.|\n",
|
||||
"1. Kensho is a| cutting-edge technology company that specializes in| AI and data analytics for financial institutions.\n",
|
||||
"\n",
|
||||
"2|.| The| modern| office| space| at| K|ens|ho| feature|d open| floor| plans|,| collaborative| work|sp|aces|,| an|d a| vib|rant| atmosphere| that| fos|tere|d creativity| an|d team|work|.|\n",
|
||||
"2|. The Kensho office features| an open floor plan with panoramic views of the city| skyline, creating an inspiring environment for its| employees.\n",
|
||||
"\n",
|
||||
"3|.| With| its| prime| location| in| the| heart| of| the| city|,| K|ens|ho| attracte|d top| talent| from| aroun|d the| worl|d,| creating| a| diverse| an|d dynamic| work| environment|.|"
|
||||
"3. At Kensho,| team members often collaborate in innovative brainstorming sessions while| enjoying complimentary gourmet coffee from| their in-house café.||"
|
||||
]
|
||||
}
|
||||
],
|
||||
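The `Document` lists and the streamed Kensho answer above are typical of a small retrieval chain. A sketch of that pattern; the vector store and embedding model here are assumptions chosen for illustration, not necessarily what the notebook itself uses:

```python
from langchain_anthropic import ChatAnthropic
from langchain_core.output_parsers import StrOutputParser
from langchain_core.prompts import ChatPromptTemplate
from langchain_core.runnables import RunnablePassthrough
from langchain_core.vectorstores import InMemoryVectorStore
from langchain_openai import OpenAIEmbeddings  # assumed embedding backend

model = ChatAnthropic(model="claude-3-7-sonnet-20250219", temperature=0)

vectorstore = InMemoryVectorStore.from_texts(
    ["harrison worked at kensho", "harrison likes spicy food"],
    embedding=OpenAIEmbeddings(),
)
retriever = vectorstore.as_retriever()

prompt = ChatPromptTemplate.from_template(
    "Answer the question based only on the following context:\n"
    "{context}\n\nQuestion: {question}"
)

retrieval_chain = (
    {"context": retriever, "question": RunnablePassthrough()}
    | prompt
    | model
    | StrOutputParser()
)

async for chunk in retrieval_chain.astream(
    "Where did harrison work? Write 3 made up sentences about this place."
):
    print(chunk, end="|", flush=True)
```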
@ -763,38 +739,38 @@
|
||||
" 'data': {'input': 'hello'},\n",
|
||||
" 'name': 'ChatAnthropic',\n",
|
||||
" 'tags': [],\n",
|
||||
" 'run_id': 'b18d016d-8b9b-49e7-a555-44db498fcf66',\n",
|
||||
" 'run_id': 'c35a72be-a5af-4bd5-bd9b-206135c28ef6',\n",
|
||||
" 'metadata': {'ls_provider': 'anthropic',\n",
|
||||
" 'ls_model_name': 'claude-3-sonnet-20240229',\n",
|
||||
" 'ls_model_name': 'claude-3-7-sonnet-20250219',\n",
|
||||
" 'ls_model_type': 'chat',\n",
|
||||
" 'ls_temperature': 0.0,\n",
|
||||
" 'ls_max_tokens': 1024},\n",
|
||||
" 'parent_ids': []},\n",
|
||||
" {'event': 'on_chat_model_stream',\n",
|
||||
" 'run_id': 'b18d016d-8b9b-49e7-a555-44db498fcf66',\n",
|
||||
" 'run_id': 'c35a72be-a5af-4bd5-bd9b-206135c28ef6',\n",
|
||||
" 'name': 'ChatAnthropic',\n",
|
||||
" 'tags': [],\n",
|
||||
" 'metadata': {'ls_provider': 'anthropic',\n",
|
||||
" 'ls_model_name': 'claude-3-sonnet-20240229',\n",
|
||||
" 'ls_model_name': 'claude-3-7-sonnet-20250219',\n",
|
||||
" 'ls_model_type': 'chat',\n",
|
||||
" 'ls_temperature': 0.0,\n",
|
||||
" 'ls_max_tokens': 1024},\n",
|
||||
" 'data': {'chunk': AIMessageChunk(content='', additional_kwargs={}, response_metadata={}, id='run-b18d016d-8b9b-49e7-a555-44db498fcf66', usage_metadata={'input_tokens': 8, 'output_tokens': 4, 'total_tokens': 12, 'input_token_details': {'cache_creation': 0, 'cache_read': 0}})},\n",
|
||||
" 'data': {'chunk': AIMessageChunk(content='', additional_kwargs={}, response_metadata={'model_name': 'claude-3-7-sonnet-20250219'}, id='run--c35a72be-a5af-4bd5-bd9b-206135c28ef6', usage_metadata={'input_tokens': 8, 'output_tokens': 0, 'total_tokens': 8, 'input_token_details': {'cache_creation': 0, 'cache_read': 0}})},\n",
|
||||
" 'parent_ids': []},\n",
|
||||
" {'event': 'on_chat_model_stream',\n",
|
||||
" 'run_id': 'b18d016d-8b9b-49e7-a555-44db498fcf66',\n",
|
||||
" 'run_id': 'c35a72be-a5af-4bd5-bd9b-206135c28ef6',\n",
|
||||
" 'name': 'ChatAnthropic',\n",
|
||||
" 'tags': [],\n",
|
||||
" 'metadata': {'ls_provider': 'anthropic',\n",
|
||||
" 'ls_model_name': 'claude-3-sonnet-20240229',\n",
|
||||
" 'ls_model_name': 'claude-3-7-sonnet-20250219',\n",
|
||||
" 'ls_model_type': 'chat',\n",
|
||||
" 'ls_temperature': 0.0,\n",
|
||||
" 'ls_max_tokens': 1024},\n",
|
||||
" 'data': {'chunk': AIMessageChunk(content='Hello! How can', additional_kwargs={}, response_metadata={}, id='run-b18d016d-8b9b-49e7-a555-44db498fcf66')},\n",
|
||||
" 'data': {'chunk': AIMessageChunk(content='Hello! How', additional_kwargs={}, response_metadata={}, id='run--c35a72be-a5af-4bd5-bd9b-206135c28ef6')},\n",
|
||||
" 'parent_ids': []}]"
|
||||
]
|
||||
},
|
||||
"execution_count": 14,
|
||||
"execution_count": 17,
|
||||
"metadata": {},
|
||||
"output_type": "execute_result"
|
||||
}
|
||||
@ -813,30 +789,30 @@
|
||||
"data": {
|
||||
"text/plain": [
|
||||
"[{'event': 'on_chat_model_stream',\n",
|
||||
" 'run_id': 'b18d016d-8b9b-49e7-a555-44db498fcf66',\n",
|
||||
" 'run_id': 'c35a72be-a5af-4bd5-bd9b-206135c28ef6',\n",
|
||||
" 'name': 'ChatAnthropic',\n",
|
||||
" 'tags': [],\n",
|
||||
" 'metadata': {'ls_provider': 'anthropic',\n",
|
||||
" 'ls_model_name': 'claude-3-sonnet-20240229',\n",
|
||||
" 'ls_model_name': 'claude-3-7-sonnet-20250219',\n",
|
||||
" 'ls_model_type': 'chat',\n",
|
||||
" 'ls_temperature': 0.0,\n",
|
||||
" 'ls_max_tokens': 1024},\n",
|
||||
" 'data': {'chunk': AIMessageChunk(content='', additional_kwargs={}, response_metadata={'stop_reason': 'end_turn', 'stop_sequence': None}, id='run-b18d016d-8b9b-49e7-a555-44db498fcf66', usage_metadata={'input_tokens': 0, 'output_tokens': 12, 'total_tokens': 12, 'input_token_details': {}})},\n",
|
||||
" 'data': {'chunk': AIMessageChunk(content='', additional_kwargs={}, response_metadata={'stop_reason': 'end_turn', 'stop_sequence': None}, id='run--c35a72be-a5af-4bd5-bd9b-206135c28ef6', usage_metadata={'input_tokens': 0, 'output_tokens': 40, 'total_tokens': 40})},\n",
|
||||
" 'parent_ids': []},\n",
|
||||
" {'event': 'on_chat_model_end',\n",
|
||||
" 'data': {'output': AIMessageChunk(content='Hello! How can I assist you today?', additional_kwargs={}, response_metadata={'stop_reason': 'end_turn', 'stop_sequence': None}, id='run-b18d016d-8b9b-49e7-a555-44db498fcf66', usage_metadata={'input_tokens': 8, 'output_tokens': 16, 'total_tokens': 24, 'input_token_details': {'cache_creation': 0, 'cache_read': 0}})},\n",
|
||||
" 'run_id': 'b18d016d-8b9b-49e7-a555-44db498fcf66',\n",
|
||||
" 'data': {'output': AIMessageChunk(content=\"Hello! How can I assist you today? Whether you have questions, need information, or just want to chat, I'm here to help. What would you like to talk about?\", additional_kwargs={}, response_metadata={'model_name': 'claude-3-7-sonnet-20250219', 'stop_reason': 'end_turn', 'stop_sequence': None}, id='run--c35a72be-a5af-4bd5-bd9b-206135c28ef6', usage_metadata={'input_tokens': 8, 'output_tokens': 40, 'total_tokens': 48, 'input_token_details': {'cache_creation': 0, 'cache_read': 0}})},\n",
|
||||
" 'run_id': 'c35a72be-a5af-4bd5-bd9b-206135c28ef6',\n",
|
||||
" 'name': 'ChatAnthropic',\n",
|
||||
" 'tags': [],\n",
|
||||
" 'metadata': {'ls_provider': 'anthropic',\n",
|
||||
" 'ls_model_name': 'claude-3-sonnet-20240229',\n",
|
||||
" 'ls_model_name': 'claude-3-7-sonnet-20250219',\n",
|
||||
" 'ls_model_type': 'chat',\n",
|
||||
" 'ls_temperature': 0.0,\n",
|
||||
" 'ls_max_tokens': 1024},\n",
|
||||
" 'parent_ids': []}]"
|
||||
]
|
||||
},
|
||||
"execution_count": 15,
|
||||
"execution_count": 18,
|
||||
"metadata": {},
|
||||
"output_type": "execute_result"
|
||||
}
|
||||
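The `on_chat_model_start` / `on_chat_model_stream` dicts above are items from the events API; collecting them into a list makes the schema easy to inspect, and the `parent_ids` field marks these as `v2`-format events. A sketch, to be run in an async context:

```python
from langchain_anthropic import ChatAnthropic

model = ChatAnthropic(model="claude-3-7-sonnet-20250219", temperature=0)

# Gather every event emitted while the model streams a reply to "hello".
events = [
    event async for event in model.astream_events("hello", version="v2")
]

events[:3]   # the start event and the first stream chunks
events[-2:]  # the last stream chunk and the end event
```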
@ -903,23 +879,34 @@
|
||||
" 'data': {'input': 'output a list of the countries france, spain and japan and their populations in JSON format. Use a dict with an outer key of \"countries\" which contains a list of countries. Each country should have the key `name` and `population`'},\n",
|
||||
" 'name': 'RunnableSequence',\n",
|
||||
" 'tags': [],\n",
|
||||
" 'run_id': '4765006b-16e2-4b1d-a523-edd9fd64cb92',\n",
|
||||
" 'metadata': {}},\n",
|
||||
" 'run_id': 'f859e56f-a760-4670-a24e-040e11bcd7fc',\n",
|
||||
" 'metadata': {},\n",
|
||||
" 'parent_ids': []},\n",
|
||||
" {'event': 'on_chat_model_start',\n",
|
||||
" 'data': {'input': {'messages': [[HumanMessage(content='output a list of the countries france, spain and japan and their populations in JSON format. Use a dict with an outer key of \"countries\" which contains a list of countries. Each country should have the key `name` and `population`')]]}},\n",
|
||||
" 'data': {'input': {'messages': [[HumanMessage(content='output a list of the countries france, spain and japan and their populations in JSON format. Use a dict with an outer key of \"countries\" which contains a list of countries. Each country should have the key `name` and `population`', additional_kwargs={}, response_metadata={})]]}},\n",
|
||||
" 'name': 'ChatAnthropic',\n",
|
||||
" 'tags': ['seq:step:1'],\n",
|
||||
" 'run_id': '0320c234-7b52-4a14-ae4e-5f100949e589',\n",
|
||||
" 'metadata': {}},\n",
|
||||
" 'run_id': '2aa8c9e6-a5cd-4e94-b994-cb0e9bd8ab21',\n",
|
||||
" 'metadata': {'ls_provider': 'anthropic',\n",
|
||||
" 'ls_model_name': 'claude-3-7-sonnet-20250219',\n",
|
||||
" 'ls_model_type': 'chat',\n",
|
||||
" 'ls_temperature': 0.0,\n",
|
||||
" 'ls_max_tokens': 1024},\n",
|
||||
" 'parent_ids': ['f859e56f-a760-4670-a24e-040e11bcd7fc']},\n",
|
||||
" {'event': 'on_chat_model_stream',\n",
|
||||
" 'data': {'chunk': AIMessageChunk(content='{', id='run-0320c234-7b52-4a14-ae4e-5f100949e589')},\n",
|
||||
" 'run_id': '0320c234-7b52-4a14-ae4e-5f100949e589',\n",
|
||||
" 'data': {'chunk': AIMessageChunk(content='', additional_kwargs={}, response_metadata={'model_name': 'claude-3-7-sonnet-20250219'}, id='run--2aa8c9e6-a5cd-4e94-b994-cb0e9bd8ab21', usage_metadata={'input_tokens': 56, 'output_tokens': 0, 'total_tokens': 56, 'input_token_details': {'cache_creation': 0, 'cache_read': 0}})},\n",
|
||||
" 'run_id': '2aa8c9e6-a5cd-4e94-b994-cb0e9bd8ab21',\n",
|
||||
" 'name': 'ChatAnthropic',\n",
|
||||
" 'tags': ['seq:step:1'],\n",
|
||||
" 'metadata': {}}]"
|
||||
" 'metadata': {'ls_provider': 'anthropic',\n",
|
||||
" 'ls_model_name': 'claude-3-7-sonnet-20250219',\n",
|
||||
" 'ls_model_type': 'chat',\n",
|
||||
" 'ls_temperature': 0.0,\n",
|
||||
" 'ls_max_tokens': 1024},\n",
|
||||
" 'parent_ids': ['f859e56f-a760-4670-a24e-040e11bcd7fc']}]"
|
||||
]
|
||||
},
|
||||
"execution_count": 18,
|
||||
"execution_count": 20,
|
||||
"metadata": {},
|
||||
"output_type": "execute_result"
|
||||
}
|
||||
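For a chain, the same events API interleaves events from every component, so model tokens and parser output arrive in one stream; dispatching on `event["event"]` separates them, which is how the `Chat model chunk:` / `Parser chunk:` lines in the following hunk are produced. A sketch:

```python
from langchain_anthropic import ChatAnthropic
from langchain_core.output_parsers import JsonOutputParser

model = ChatAnthropic(model="claude-3-7-sonnet-20250219", temperature=0)
chain = model | JsonOutputParser()

question = (
    "output a list of the countries france, spain and japan and their "
    "populations in JSON format. Use a dict with an outer key of "
    '"countries" which contains a list of countries. Each country should '
    "have the key `name` and `population`"
)

async for event in chain.astream_events(question, version="v2"):
    kind = event["event"]
    if kind == "on_chat_model_stream":
        print(f"Chat model chunk: {event['data']['chunk'].content!r}")
    elif kind == "on_parser_stream":
        print(f"Parser chunk: {event['data']['chunk']}")
```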
@ -955,25 +942,25 @@
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"Chat model chunk: ''\n",
|
||||
"Chat model chunk: '{'\n",
|
||||
"Parser chunk: {}\n",
|
||||
"Chat model chunk: '\\n \"countries'\n",
|
||||
"Chat model chunk: '\": [\\n '\n",
|
||||
"Chat model chunk: '```'\n",
|
||||
"Chat model chunk: 'json\\n{\\n \"countries\": ['\n",
|
||||
"Parser chunk: {'countries': []}\n",
|
||||
"Chat model chunk: '{\\n \"'\n",
|
||||
"Parser chunk: {'countries': [{}]}\n",
|
||||
"Chat model chunk: 'name\": \"France'\n",
|
||||
"Chat model chunk: '\\n {\\n \"name\": \"France\",'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France'}]}\n",
|
||||
"Chat model chunk: '\",\\n \"'\n",
|
||||
"Chat model chunk: 'population\": 67'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67}]}\n",
|
||||
"Chat model chunk: '413'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67413}]}\n",
|
||||
"Chat model chunk: '000\\n },'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67413000}]}\n",
|
||||
"Chat model chunk: '\\n {'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67413000}, {}]}\n",
|
||||
"Chat model chunk: '\\n \"name\":'\n",
|
||||
"Chat model chunk: '\\n \"population\": 67750000\\n },'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67750000}]}\n",
|
||||
"Chat model chunk: '\\n {\\n \"name\": \"Spain\",'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain'}]}\n",
|
||||
"Chat model chunk: '\\n \"population\": 47350'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350}]}\n",
|
||||
"Chat model chunk: '000\\n },\\n {\\n \"'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}, {}]}\n",
|
||||
"Chat model chunk: 'name\": \"Japan\",\\n \"population\":'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}, {'name': 'Japan'}]}\n",
|
||||
"Chat model chunk: ' 125700000\\n }'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}, {'name': 'Japan', 'population': 125700000}]}\n",
|
||||
"Chat model chunk: '\\n ]\\n}\\n```'\n",
|
||||
"Chat model chunk: ''\n",
|
||||
"...\n"
|
||||
]
|
||||
}
|
||||
@ -1033,18 +1020,16 @@
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"{'event': 'on_parser_start', 'data': {'input': 'output a list of the countries france, spain and japan and their populations in JSON format. Use a dict with an outer key of \"countries\" which contains a list of countries. Each country should have the key `name` and `population`'}, 'name': 'my_parser', 'tags': ['seq:step:2'], 'run_id': '37ee9e85-481c-415e-863b-c9e132d24948', 'metadata': {}, 'parent_ids': ['5a0bc625-09fd-4bdf-9932-54909a9a8c29']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '37ee9e85-481c-415e-863b-c9e132d24948', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {}}, 'parent_ids': ['5a0bc625-09fd-4bdf-9932-54909a9a8c29']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '37ee9e85-481c-415e-863b-c9e132d24948', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': []}}, 'parent_ids': ['5a0bc625-09fd-4bdf-9932-54909a9a8c29']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '37ee9e85-481c-415e-863b-c9e132d24948', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{}]}}, 'parent_ids': ['5a0bc625-09fd-4bdf-9932-54909a9a8c29']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '37ee9e85-481c-415e-863b-c9e132d24948', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France'}]}}, 'parent_ids': ['5a0bc625-09fd-4bdf-9932-54909a9a8c29']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '37ee9e85-481c-415e-863b-c9e132d24948', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France', 'population': 67}]}}, 'parent_ids': ['5a0bc625-09fd-4bdf-9932-54909a9a8c29']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '37ee9e85-481c-415e-863b-c9e132d24948', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France', 'population': 67413}]}}, 'parent_ids': ['5a0bc625-09fd-4bdf-9932-54909a9a8c29']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '37ee9e85-481c-415e-863b-c9e132d24948', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France', 'population': 67413000}]}}, 'parent_ids': ['5a0bc625-09fd-4bdf-9932-54909a9a8c29']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '37ee9e85-481c-415e-863b-c9e132d24948', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France', 'population': 67413000}, {}]}}, 'parent_ids': ['5a0bc625-09fd-4bdf-9932-54909a9a8c29']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '37ee9e85-481c-415e-863b-c9e132d24948', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain'}]}}, 'parent_ids': ['5a0bc625-09fd-4bdf-9932-54909a9a8c29']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '37ee9e85-481c-415e-863b-c9e132d24948', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain', 'population': 47}]}}, 'parent_ids': ['5a0bc625-09fd-4bdf-9932-54909a9a8c29']}\n",
|
||||
"...\n"
|
||||
"{'event': 'on_parser_start', 'data': {'input': 'output a list of the countries france, spain and japan and their populations in JSON format. Use a dict with an outer key of \"countries\" which contains a list of countries. Each country should have the key `name` and `population`'}, 'name': 'my_parser', 'tags': ['seq:step:2'], 'run_id': '781af9b6-31f8-47f2-ab79-52d17b000857', 'metadata': {}, 'parent_ids': ['82c918c6-d5f6-4d2d-b710-4668509fe2f0']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '781af9b6-31f8-47f2-ab79-52d17b000857', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': []}}, 'parent_ids': ['82c918c6-d5f6-4d2d-b710-4668509fe2f0']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '781af9b6-31f8-47f2-ab79-52d17b000857', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France'}]}}, 'parent_ids': ['82c918c6-d5f6-4d2d-b710-4668509fe2f0']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '781af9b6-31f8-47f2-ab79-52d17b000857', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France', 'population': 67750}]}}, 'parent_ids': ['82c918c6-d5f6-4d2d-b710-4668509fe2f0']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '781af9b6-31f8-47f2-ab79-52d17b000857', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France', 'population': 67750000}, {}]}}, 'parent_ids': ['82c918c6-d5f6-4d2d-b710-4668509fe2f0']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '781af9b6-31f8-47f2-ab79-52d17b000857', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain'}]}}, 'parent_ids': ['82c918c6-d5f6-4d2d-b710-4668509fe2f0']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '781af9b6-31f8-47f2-ab79-52d17b000857', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}]}}, 'parent_ids': ['82c918c6-d5f6-4d2d-b710-4668509fe2f0']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '781af9b6-31f8-47f2-ab79-52d17b000857', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}, {'name': 'Japan'}]}}, 'parent_ids': ['82c918c6-d5f6-4d2d-b710-4668509fe2f0']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '781af9b6-31f8-47f2-ab79-52d17b000857', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}, {'name': 'Japan', 'population': 125700000}]}}, 'parent_ids': ['82c918c6-d5f6-4d2d-b710-4668509fe2f0']}\n",
|
||||
"{'event': 'on_parser_end', 'data': {'output': {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}, {'name': 'Japan', 'population': 125700000}]}}, 'run_id': '781af9b6-31f8-47f2-ab79-52d17b000857', 'name': 'my_parser', 'tags': ['seq:step:2'], 'metadata': {}, 'parent_ids': ['82c918c6-d5f6-4d2d-b710-4668509fe2f0']}\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
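The run above contains only `my_parser` events. That is achieved by naming components with `with_config` and then filtering the event stream on the name. A sketch, reusing the countries prompt:

```python
from langchain_anthropic import ChatAnthropic
from langchain_core.output_parsers import JsonOutputParser

model = ChatAnthropic(model="claude-3-7-sonnet-20250219", temperature=0)

# Assign run names so events can be filtered per component.
chain = model.with_config({"run_name": "model"}) | JsonOutputParser().with_config(
    {"run_name": "my_parser"}
)

question = (
    "output a list of the countries france, spain and japan and their "
    "populations in JSON format. Use a dict with an outer key of "
    '"countries" which contains a list of countries. Each country should '
    "have the key `name` and `population`"
)

async for event in chain.astream_events(
    question, version="v2", include_names=["my_parser"]
):
    print(event)
```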
@ -1086,17 +1071,17 @@
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"{'event': 'on_chat_model_start', 'data': {'input': 'output a list of the countries france, spain and japan and their populations in JSON format. Use a dict with an outer key of \"countries\" which contains a list of countries. Each country should have the key `name` and `population`'}, 'name': 'model', 'tags': ['seq:step:1'], 'run_id': '156c3e40-82fb-49ff-8e41-9e998061be8c', 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['7b927055-bc1b-4b50-a34c-10d3cfcb3899']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='', additional_kwargs={}, response_metadata={}, id='run-156c3e40-82fb-49ff-8e41-9e998061be8c', usage_metadata={'input_tokens': 56, 'output_tokens': 1, 'total_tokens': 57, 'input_token_details': {'cache_creation': 0, 'cache_read': 0}})}, 'run_id': '156c3e40-82fb-49ff-8e41-9e998061be8c', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['7b927055-bc1b-4b50-a34c-10d3cfcb3899']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='{', additional_kwargs={}, response_metadata={}, id='run-156c3e40-82fb-49ff-8e41-9e998061be8c')}, 'run_id': '156c3e40-82fb-49ff-8e41-9e998061be8c', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['7b927055-bc1b-4b50-a34c-10d3cfcb3899']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='\\n \"countries', additional_kwargs={}, response_metadata={}, id='run-156c3e40-82fb-49ff-8e41-9e998061be8c')}, 'run_id': '156c3e40-82fb-49ff-8e41-9e998061be8c', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['7b927055-bc1b-4b50-a34c-10d3cfcb3899']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='\": [\\n ', additional_kwargs={}, response_metadata={}, id='run-156c3e40-82fb-49ff-8e41-9e998061be8c')}, 'run_id': '156c3e40-82fb-49ff-8e41-9e998061be8c', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['7b927055-bc1b-4b50-a34c-10d3cfcb3899']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='{\\n \"', additional_kwargs={}, response_metadata={}, id='run-156c3e40-82fb-49ff-8e41-9e998061be8c')}, 'run_id': '156c3e40-82fb-49ff-8e41-9e998061be8c', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['7b927055-bc1b-4b50-a34c-10d3cfcb3899']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='name\": \"France', additional_kwargs={}, response_metadata={}, id='run-156c3e40-82fb-49ff-8e41-9e998061be8c')}, 'run_id': '156c3e40-82fb-49ff-8e41-9e998061be8c', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['7b927055-bc1b-4b50-a34c-10d3cfcb3899']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='\",\\n \"', additional_kwargs={}, response_metadata={}, id='run-156c3e40-82fb-49ff-8e41-9e998061be8c')}, 'run_id': '156c3e40-82fb-49ff-8e41-9e998061be8c', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['7b927055-bc1b-4b50-a34c-10d3cfcb3899']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='population\": 67', additional_kwargs={}, response_metadata={}, id='run-156c3e40-82fb-49ff-8e41-9e998061be8c')}, 'run_id': '156c3e40-82fb-49ff-8e41-9e998061be8c', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['7b927055-bc1b-4b50-a34c-10d3cfcb3899']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='413', additional_kwargs={}, response_metadata={}, id='run-156c3e40-82fb-49ff-8e41-9e998061be8c')}, 'run_id': '156c3e40-82fb-49ff-8e41-9e998061be8c', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['7b927055-bc1b-4b50-a34c-10d3cfcb3899']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='000\\n },', additional_kwargs={}, response_metadata={}, id='run-156c3e40-82fb-49ff-8e41-9e998061be8c')}, 'run_id': '156c3e40-82fb-49ff-8e41-9e998061be8c', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['7b927055-bc1b-4b50-a34c-10d3cfcb3899']}\n",
|
||||
"{'event': 'on_chat_model_start', 'data': {'input': 'output a list of the countries france, spain and japan and their populations in JSON format. Use a dict with an outer key of \"countries\" which contains a list of countries. Each country should have the key `name` and `population`'}, 'name': 'model', 'tags': ['seq:step:1'], 'run_id': 'b7a08416-a629-4b42-b5d5-dbe48566e5d5', 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['116a6506-5a19-4f60-a8c2-7b728d4b8248']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='', additional_kwargs={}, response_metadata={'model_name': 'claude-3-7-sonnet-20250219'}, id='run--b7a08416-a629-4b42-b5d5-dbe48566e5d5', usage_metadata={'input_tokens': 56, 'output_tokens': 0, 'total_tokens': 56, 'input_token_details': {'cache_creation': 0, 'cache_read': 0}})}, 'run_id': 'b7a08416-a629-4b42-b5d5-dbe48566e5d5', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['116a6506-5a19-4f60-a8c2-7b728d4b8248']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='```', additional_kwargs={}, response_metadata={}, id='run--b7a08416-a629-4b42-b5d5-dbe48566e5d5')}, 'run_id': 'b7a08416-a629-4b42-b5d5-dbe48566e5d5', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['116a6506-5a19-4f60-a8c2-7b728d4b8248']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='json\\n{\\n \"countries\": [', additional_kwargs={}, response_metadata={}, id='run--b7a08416-a629-4b42-b5d5-dbe48566e5d5')}, 'run_id': 'b7a08416-a629-4b42-b5d5-dbe48566e5d5', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['116a6506-5a19-4f60-a8c2-7b728d4b8248']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='\\n {\\n \"name\": \"France\",', additional_kwargs={}, response_metadata={}, id='run--b7a08416-a629-4b42-b5d5-dbe48566e5d5')}, 'run_id': 'b7a08416-a629-4b42-b5d5-dbe48566e5d5', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['116a6506-5a19-4f60-a8c2-7b728d4b8248']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='\\n \"population\": 67750', additional_kwargs={}, response_metadata={}, id='run--b7a08416-a629-4b42-b5d5-dbe48566e5d5')}, 'run_id': 'b7a08416-a629-4b42-b5d5-dbe48566e5d5', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['116a6506-5a19-4f60-a8c2-7b728d4b8248']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='000\\n },\\n {\\n \"', additional_kwargs={}, response_metadata={}, id='run--b7a08416-a629-4b42-b5d5-dbe48566e5d5')}, 'run_id': 'b7a08416-a629-4b42-b5d5-dbe48566e5d5', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['116a6506-5a19-4f60-a8c2-7b728d4b8248']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='name\": \"Spain\",\\n \"population\":', additional_kwargs={}, response_metadata={}, id='run--b7a08416-a629-4b42-b5d5-dbe48566e5d5')}, 'run_id': 'b7a08416-a629-4b42-b5d5-dbe48566e5d5', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['116a6506-5a19-4f60-a8c2-7b728d4b8248']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content=' 47350000\\n },', additional_kwargs={}, response_metadata={}, id='run--b7a08416-a629-4b42-b5d5-dbe48566e5d5')}, 'run_id': 'b7a08416-a629-4b42-b5d5-dbe48566e5d5', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['116a6506-5a19-4f60-a8c2-7b728d4b8248']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='\\n {\\n \"name\": \"Japan\",', additional_kwargs={}, response_metadata={}, id='run--b7a08416-a629-4b42-b5d5-dbe48566e5d5')}, 'run_id': 'b7a08416-a629-4b42-b5d5-dbe48566e5d5', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['116a6506-5a19-4f60-a8c2-7b728d4b8248']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='\\n \"population\": 125700', additional_kwargs={}, response_metadata={}, id='run--b7a08416-a629-4b42-b5d5-dbe48566e5d5')}, 'run_id': 'b7a08416-a629-4b42-b5d5-dbe48566e5d5', 'name': 'model', 'tags': ['seq:step:1'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['116a6506-5a19-4f60-a8c2-7b728d4b8248']}\n",
|
||||
"...\n"
|
||||
]
|
||||
}
|
||||
@ -1144,17 +1129,17 @@
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"{'event': 'on_chain_start', 'data': {'input': 'output a list of the countries france, spain and japan and their populations in JSON format. Use a dict with an outer key of \"countries\" which contains a list of countries. Each country should have the key `name` and `population`'}, 'name': 'RunnableSequence', 'tags': ['my_chain'], 'run_id': '58d1302e-36ce-4df7-a3cb-47cb73d57e44', 'metadata': {}, 'parent_ids': []}\n",
|
||||
"{'event': 'on_chat_model_start', 'data': {'input': {'messages': [[HumanMessage(content='output a list of the countries france, spain and japan and their populations in JSON format. Use a dict with an outer key of \"countries\" which contains a list of countries. Each country should have the key `name` and `population`', additional_kwargs={}, response_metadata={})]]}}, 'name': 'ChatAnthropic', 'tags': ['seq:step:1', 'my_chain'], 'run_id': '8222e8a1-d978-4f30-87fc-b2dba838774b', 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['58d1302e-36ce-4df7-a3cb-47cb73d57e44']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='', additional_kwargs={}, response_metadata={}, id='run-8222e8a1-d978-4f30-87fc-b2dba838774b', usage_metadata={'input_tokens': 56, 'output_tokens': 1, 'total_tokens': 57, 'input_token_details': {'cache_creation': 0, 'cache_read': 0}})}, 'run_id': '8222e8a1-d978-4f30-87fc-b2dba838774b', 'name': 'ChatAnthropic', 'tags': ['seq:step:1', 'my_chain'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['58d1302e-36ce-4df7-a3cb-47cb73d57e44']}\n",
|
||||
"{'event': 'on_parser_start', 'data': {}, 'name': 'JsonOutputParser', 'tags': ['seq:step:2', 'my_chain'], 'run_id': '75604c84-e1e6-494a-8b2a-950f45d932e8', 'metadata': {}, 'parent_ids': ['58d1302e-36ce-4df7-a3cb-47cb73d57e44']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='{', additional_kwargs={}, response_metadata={}, id='run-8222e8a1-d978-4f30-87fc-b2dba838774b')}, 'run_id': '8222e8a1-d978-4f30-87fc-b2dba838774b', 'name': 'ChatAnthropic', 'tags': ['seq:step:1', 'my_chain'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['58d1302e-36ce-4df7-a3cb-47cb73d57e44']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '75604c84-e1e6-494a-8b2a-950f45d932e8', 'name': 'JsonOutputParser', 'tags': ['seq:step:2', 'my_chain'], 'metadata': {}, 'data': {'chunk': {}}, 'parent_ids': ['58d1302e-36ce-4df7-a3cb-47cb73d57e44']}\n",
|
||||
"{'event': 'on_chain_stream', 'run_id': '58d1302e-36ce-4df7-a3cb-47cb73d57e44', 'name': 'RunnableSequence', 'tags': ['my_chain'], 'metadata': {}, 'data': {'chunk': {}}, 'parent_ids': []}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='\\n \"countries', additional_kwargs={}, response_metadata={}, id='run-8222e8a1-d978-4f30-87fc-b2dba838774b')}, 'run_id': '8222e8a1-d978-4f30-87fc-b2dba838774b', 'name': 'ChatAnthropic', 'tags': ['seq:step:1', 'my_chain'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['58d1302e-36ce-4df7-a3cb-47cb73d57e44']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='\": [\\n ', additional_kwargs={}, response_metadata={}, id='run-8222e8a1-d978-4f30-87fc-b2dba838774b')}, 'run_id': '8222e8a1-d978-4f30-87fc-b2dba838774b', 'name': 'ChatAnthropic', 'tags': ['seq:step:1', 'my_chain'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-sonnet-20240229', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['58d1302e-36ce-4df7-a3cb-47cb73d57e44']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '75604c84-e1e6-494a-8b2a-950f45d932e8', 'name': 'JsonOutputParser', 'tags': ['seq:step:2', 'my_chain'], 'metadata': {}, 'data': {'chunk': {'countries': []}}, 'parent_ids': ['58d1302e-36ce-4df7-a3cb-47cb73d57e44']}\n",
|
||||
"{'event': 'on_chain_stream', 'run_id': '58d1302e-36ce-4df7-a3cb-47cb73d57e44', 'name': 'RunnableSequence', 'tags': ['my_chain'], 'metadata': {}, 'data': {'chunk': {'countries': []}}, 'parent_ids': []}\n",
|
||||
"{'event': 'on_chain_start', 'data': {'input': 'output a list of the countries france, spain and japan and their populations in JSON format. Use a dict with an outer key of \"countries\" which contains a list of countries. Each country should have the key `name` and `population`'}, 'name': 'RunnableSequence', 'tags': ['my_chain'], 'run_id': '3e4f8c37-a44a-46b7-a7e5-75182d1cca31', 'metadata': {}, 'parent_ids': []}\n",
|
||||
"{'event': 'on_chat_model_start', 'data': {'input': {'messages': [[HumanMessage(content='output a list of the countries france, spain and japan and their populations in JSON format. Use a dict with an outer key of \"countries\" which contains a list of countries. Each country should have the key `name` and `population`', additional_kwargs={}, response_metadata={})]]}}, 'name': 'ChatAnthropic', 'tags': ['seq:step:1', 'my_chain'], 'run_id': '778846c9-acd3-43b7-b9c0-ac718761b2bc', 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['3e4f8c37-a44a-46b7-a7e5-75182d1cca31']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='', additional_kwargs={}, response_metadata={'model_name': 'claude-3-7-sonnet-20250219'}, id='run--778846c9-acd3-43b7-b9c0-ac718761b2bc', usage_metadata={'input_tokens': 56, 'output_tokens': 0, 'total_tokens': 56, 'input_token_details': {'cache_creation': 0, 'cache_read': 0}})}, 'run_id': '778846c9-acd3-43b7-b9c0-ac718761b2bc', 'name': 'ChatAnthropic', 'tags': ['seq:step:1', 'my_chain'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['3e4f8c37-a44a-46b7-a7e5-75182d1cca31']}\n",
|
||||
"{'event': 'on_parser_start', 'data': {}, 'name': 'JsonOutputParser', 'tags': ['seq:step:2', 'my_chain'], 'run_id': '2c46d24f-231c-4062-a7ab-b7954840986d', 'metadata': {}, 'parent_ids': ['3e4f8c37-a44a-46b7-a7e5-75182d1cca31']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='```', additional_kwargs={}, response_metadata={}, id='run--778846c9-acd3-43b7-b9c0-ac718761b2bc')}, 'run_id': '778846c9-acd3-43b7-b9c0-ac718761b2bc', 'name': 'ChatAnthropic', 'tags': ['seq:step:1', 'my_chain'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['3e4f8c37-a44a-46b7-a7e5-75182d1cca31']}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='json\\n{\\n \"countries\": [', additional_kwargs={}, response_metadata={}, id='run--778846c9-acd3-43b7-b9c0-ac718761b2bc')}, 'run_id': '778846c9-acd3-43b7-b9c0-ac718761b2bc', 'name': 'ChatAnthropic', 'tags': ['seq:step:1', 'my_chain'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['3e4f8c37-a44a-46b7-a7e5-75182d1cca31']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '2c46d24f-231c-4062-a7ab-b7954840986d', 'name': 'JsonOutputParser', 'tags': ['seq:step:2', 'my_chain'], 'metadata': {}, 'data': {'chunk': {'countries': []}}, 'parent_ids': ['3e4f8c37-a44a-46b7-a7e5-75182d1cca31']}\n",
|
||||
"{'event': 'on_chain_stream', 'run_id': '3e4f8c37-a44a-46b7-a7e5-75182d1cca31', 'name': 'RunnableSequence', 'tags': ['my_chain'], 'metadata': {}, 'data': {'chunk': {'countries': []}}, 'parent_ids': []}\n",
|
||||
"{'event': 'on_chat_model_stream', 'data': {'chunk': AIMessageChunk(content='\\n {\\n \"name\": \"France\",', additional_kwargs={}, response_metadata={}, id='run--778846c9-acd3-43b7-b9c0-ac718761b2bc')}, 'run_id': '778846c9-acd3-43b7-b9c0-ac718761b2bc', 'name': 'ChatAnthropic', 'tags': ['seq:step:1', 'my_chain'], 'metadata': {'ls_provider': 'anthropic', 'ls_model_name': 'claude-3-7-sonnet-20250219', 'ls_model_type': 'chat', 'ls_temperature': 0.0, 'ls_max_tokens': 1024}, 'parent_ids': ['3e4f8c37-a44a-46b7-a7e5-75182d1cca31']}\n",
|
||||
"{'event': 'on_parser_stream', 'run_id': '2c46d24f-231c-4062-a7ab-b7954840986d', 'name': 'JsonOutputParser', 'tags': ['seq:step:2', 'my_chain'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France'}]}}, 'parent_ids': ['3e4f8c37-a44a-46b7-a7e5-75182d1cca31']}\n",
|
||||
"{'event': 'on_chain_stream', 'run_id': '3e4f8c37-a44a-46b7-a7e5-75182d1cca31', 'name': 'RunnableSequence', 'tags': ['my_chain'], 'metadata': {}, 'data': {'chunk': {'countries': [{'name': 'France'}]}}, 'parent_ids': []}\n",
|
||||
"...\n"
|
||||
]
|
||||
}
|
||||
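The `'tags': ['my_chain']` entries above show the third filtering option: tags attached to a runnable are inherited by its child runs, so filtering on a tag keeps events from the whole subtree. A sketch:

```python
from langchain_anthropic import ChatAnthropic
from langchain_core.output_parsers import JsonOutputParser

model = ChatAnthropic(model="claude-3-7-sonnet-20250219", temperature=0)

# The sequence, the model, and the parser all inherit the "my_chain" tag.
chain = (model | JsonOutputParser()).with_config({"tags": ["my_chain"]})

question = (
    "output a list of the countries france, spain and japan and their "
    "populations in JSON format. Use a dict with an outer key of "
    '"countries" which contains a list of countries. Each country should '
    "have the key `name` and `population`"
)

async for event in chain.astream_events(
    question, version="v2", include_tags=["my_chain"]
):
    print(event)
```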
@ -1271,32 +1256,27 @@
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"Chat model chunk: ''\n",
|
||||
"Chat model chunk: '{'\n",
|
||||
"Parser chunk: {}\n",
|
||||
"Chat model chunk: '\\n \"countries'\n",
|
||||
"Chat model chunk: '\": [\\n '\n",
|
||||
"Chat model chunk: '```'\n",
|
||||
"Chat model chunk: 'json\\n{\\n \"countries\": ['\n",
|
||||
"Parser chunk: {'countries': []}\n",
|
||||
"Chat model chunk: '{\\n \"'\n",
|
||||
"Parser chunk: {'countries': [{}]}\n",
|
||||
"Chat model chunk: 'name\": \"France'\n",
|
||||
"Chat model chunk: '\\n {\\n \"name\": \"France\",'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France'}]}\n",
|
||||
"Chat model chunk: '\",\\n \"'\n",
|
||||
"Chat model chunk: 'population\": 67'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67}]}\n",
|
||||
"Chat model chunk: '413'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67413}]}\n",
|
||||
"Chat model chunk: '000\\n },'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67413000}]}\n",
|
||||
"Chat model chunk: '\\n {'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67413000}, {}]}\n",
|
||||
"Chat model chunk: '\\n \"name\":'\n",
|
||||
"Chat model chunk: ' \"Spain\",'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain'}]}\n",
|
||||
"Chat model chunk: '\\n \"population\":'\n",
|
||||
"Chat model chunk: ' 47'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain', 'population': 47}]}\n",
|
||||
"Chat model chunk: '351'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67413000}, {'name': 'Spain', 'population': 47351}]}\n",
|
||||
"Chat model chunk: '\\n \"population\": 67750'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67750}]}\n",
|
||||
"Chat model chunk: '000\\n },\\n {\\n \"'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67750000}, {}]}\n",
|
||||
"Chat model chunk: 'name\": \"Spain\",\\n \"population\":'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain'}]}\n",
|
||||
"Chat model chunk: ' 47350000\\n },'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}]}\n",
|
||||
"Chat model chunk: '\\n {\\n \"name\": \"Japan\",'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}, {'name': 'Japan'}]}\n",
|
||||
"Chat model chunk: '\\n \"population\": 125700'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}, {'name': 'Japan', 'population': 125700}]}\n",
|
||||
"Chat model chunk: '000\\n }\\n ]\\n}'\n",
|
||||
"Parser chunk: {'countries': [{'name': 'France', 'population': 67750000}, {'name': 'Spain', 'population': 47350000}, {'name': 'Japan', 'population': 125700000}]}\n",
|
||||
"Chat model chunk: '\\n```'\n",
|
||||
"Chat model chunk: ''\n",
|
||||
"...\n"
|
||||
]
|
||||
}
|
||||
@ -1350,10 +1330,10 @@
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"{'event': 'on_tool_start', 'data': {'input': 'hello'}, 'name': 'bad_tool', 'tags': [], 'run_id': 'ea900472-a8f7-425d-b627-facdef936ee8', 'metadata': {}}\n",
|
||||
"{'event': 'on_chain_start', 'data': {'input': 'hello'}, 'name': 'reverse_word', 'tags': [], 'run_id': '77b01284-0515-48f4-8d7c-eb27c1882f86', 'metadata': {}}\n",
|
||||
"{'event': 'on_chain_end', 'data': {'output': 'olleh', 'input': 'hello'}, 'run_id': '77b01284-0515-48f4-8d7c-eb27c1882f86', 'name': 'reverse_word', 'tags': [], 'metadata': {}}\n",
|
||||
"{'event': 'on_tool_end', 'data': {'output': 'olleh'}, 'run_id': 'ea900472-a8f7-425d-b627-facdef936ee8', 'name': 'bad_tool', 'tags': [], 'metadata': {}}\n"
|
||||
"{'event': 'on_tool_start', 'data': {'input': 'hello'}, 'name': 'bad_tool', 'tags': [], 'run_id': 'b1c6b79d-f94b-432f-a289-1ea68a7c3cea', 'metadata': {}, 'parent_ids': []}\n",
|
||||
"{'event': 'on_chain_start', 'data': {'input': 'hello'}, 'name': 'reverse_word', 'tags': [], 'run_id': 'e661c1ec-e6d2-4f9a-9620-b50645f2b194', 'metadata': {}, 'parent_ids': ['b1c6b79d-f94b-432f-a289-1ea68a7c3cea']}\n",
|
||||
"{'event': 'on_chain_end', 'data': {'output': 'olleh', 'input': 'hello'}, 'run_id': 'e661c1ec-e6d2-4f9a-9620-b50645f2b194', 'name': 'reverse_word', 'tags': [], 'metadata': {}, 'parent_ids': ['b1c6b79d-f94b-432f-a289-1ea68a7c3cea']}\n",
|
||||
"{'event': 'on_tool_end', 'data': {'output': 'olleh'}, 'run_id': 'b1c6b79d-f94b-432f-a289-1ea68a7c3cea', 'name': 'bad_tool', 'tags': [], 'metadata': {}, 'parent_ids': []}\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
@ -1397,10 +1377,10 @@
|
||||
"name": "stdout",
|
||||
"output_type": "stream",
|
||||
"text": [
|
||||
"{'event': 'on_tool_start', 'data': {'input': 'hello'}, 'name': 'correct_tool', 'tags': [], 'run_id': 'd5ea83b9-9278-49cc-9f1d-aa302d671040', 'metadata': {}}\n",
|
||||
"{'event': 'on_chain_start', 'data': {'input': 'hello'}, 'name': 'reverse_word', 'tags': [], 'run_id': '44dafbf4-2f87-412b-ae0e-9f71713810df', 'metadata': {}}\n",
|
||||
"{'event': 'on_chain_end', 'data': {'output': 'olleh', 'input': 'hello'}, 'run_id': '44dafbf4-2f87-412b-ae0e-9f71713810df', 'name': 'reverse_word', 'tags': [], 'metadata': {}}\n",
|
||||
"{'event': 'on_tool_end', 'data': {'output': 'olleh'}, 'run_id': 'd5ea83b9-9278-49cc-9f1d-aa302d671040', 'name': 'correct_tool', 'tags': [], 'metadata': {}}\n"
|
||||
"{'event': 'on_tool_start', 'data': {'input': 'hello'}, 'name': 'correct_tool', 'tags': [], 'run_id': '399c91f5-a40b-4173-943f-a9c583a04003', 'metadata': {}, 'parent_ids': []}\n",
|
||||
"{'event': 'on_chain_start', 'data': {'input': 'hello'}, 'name': 'reverse_word', 'tags': [], 'run_id': 'e9cc7db1-4587-40af-9c35-2d787b3f0956', 'metadata': {}, 'parent_ids': ['399c91f5-a40b-4173-943f-a9c583a04003']}\n",
|
||||
"{'event': 'on_chain_end', 'data': {'output': 'olleh', 'input': 'hello'}, 'run_id': 'e9cc7db1-4587-40af-9c35-2d787b3f0956', 'name': 'reverse_word', 'tags': [], 'metadata': {}, 'parent_ids': ['399c91f5-a40b-4173-943f-a9c583a04003']}\n",
|
||||
"{'event': 'on_tool_end', 'data': {'output': 'olleh'}, 'run_id': '399c91f5-a40b-4173-943f-a9c583a04003', 'name': 'correct_tool', 'tags': [], 'metadata': {}, 'parent_ids': []}\n"
|
||||
]
|
||||
}
|
||||
],
|
||||
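The `bad_tool` / `correct_tool` traces above concern callback propagation from a custom tool into the runnable it invokes. On recent Python versions callbacks propagate automatically through context variables, which is why even `bad_tool`'s inner `reverse_word` run now carries a `parent_ids` entry; passing `callbacks` explicitly, as `correct_tool` does, works regardless of the interpreter version. A sketch of both tools; the `Callbacks` import path is my assumption:

```python
from langchain_core.callbacks import Callbacks
from langchain_core.runnables import RunnableLambda
from langchain_core.tools import tool

def reverse_word(word: str) -> str:
    return word[::-1]

reverse_word = RunnableLambda(reverse_word)

@tool
def bad_tool(word: str) -> str:
    """Custom tool that doesn't explicitly propagate callbacks."""
    return reverse_word.invoke(word)

@tool
def correct_tool(word: str, callbacks: Callbacks) -> str:
    """A tool that correctly propagates callbacks to the inner runnable."""
    return reverse_word.invoke(word, {"callbacks": callbacks})

async for event in correct_tool.astream_events("hello", version="v2"):
    print(event)
```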
@@ -1433,11 +1413,9 @@
"name": "stdout",
"output_type": "stream",
"text": [
"{'event': 'on_chain_start', 'data': {'input': '1234'}, 'name': 'reverse_and_double', 'tags': [], 'run_id': '03b0e6a1-3e60-42fc-8373-1e7829198d80', 'metadata': {}}\n",
"{'event': 'on_chain_start', 'data': {'input': '1234'}, 'name': 'reverse_word', 'tags': [], 'run_id': '5cf26fc8-840b-4642-98ed-623dda28707a', 'metadata': {}}\n",
"{'event': 'on_chain_end', 'data': {'output': '4321', 'input': '1234'}, 'run_id': '5cf26fc8-840b-4642-98ed-623dda28707a', 'name': 'reverse_word', 'tags': [], 'metadata': {}}\n",
"{'event': 'on_chain_stream', 'data': {'chunk': '43214321'}, 'run_id': '03b0e6a1-3e60-42fc-8373-1e7829198d80', 'name': 'reverse_and_double', 'tags': [], 'metadata': {}}\n",
"{'event': 'on_chain_end', 'data': {'output': '43214321'}, 'run_id': '03b0e6a1-3e60-42fc-8373-1e7829198d80', 'name': 'reverse_and_double', 'tags': [], 'metadata': {}}\n"
"{'event': 'on_chain_start', 'data': {'input': '1234'}, 'name': 'reverse_and_double', 'tags': [], 'run_id': '04726e2e-f508-4f90-9d4f-f88e588f0b39', 'metadata': {}, 'parent_ids': []}\n",
"{'event': 'on_chain_stream', 'run_id': '04726e2e-f508-4f90-9d4f-f88e588f0b39', 'name': 'reverse_and_double', 'tags': [], 'metadata': {}, 'data': {'chunk': '43214321'}, 'parent_ids': []}\n",
"{'event': 'on_chain_end', 'data': {'output': '43214321'}, 'run_id': '04726e2e-f508-4f90-9d4f-f88e588f0b39', 'name': 'reverse_and_double', 'tags': [], 'metadata': {}, 'parent_ids': []}\n"
]
}
],
@@ -1475,11 +1453,9 @@
"name": "stdout",
"output_type": "stream",
"text": [
"{'event': 'on_chain_start', 'data': {'input': '1234'}, 'name': 'reverse_and_double', 'tags': [], 'run_id': '1bfcaedc-f4aa-4d8e-beee-9bba6ef17008', 'metadata': {}}\n",
"{'event': 'on_chain_start', 'data': {'input': '1234'}, 'name': 'reverse_word', 'tags': [], 'run_id': '64fc99f0-5d7d-442b-b4f5-4537129f67d1', 'metadata': {}}\n",
"{'event': 'on_chain_end', 'data': {'output': '4321', 'input': '1234'}, 'run_id': '64fc99f0-5d7d-442b-b4f5-4537129f67d1', 'name': 'reverse_word', 'tags': [], 'metadata': {}}\n",
"{'event': 'on_chain_stream', 'data': {'chunk': '43214321'}, 'run_id': '1bfcaedc-f4aa-4d8e-beee-9bba6ef17008', 'name': 'reverse_and_double', 'tags': [], 'metadata': {}}\n",
"{'event': 'on_chain_end', 'data': {'output': '43214321'}, 'run_id': '1bfcaedc-f4aa-4d8e-beee-9bba6ef17008', 'name': 'reverse_and_double', 'tags': [], 'metadata': {}}\n"
"{'event': 'on_chain_start', 'data': {'input': '1234'}, 'name': 'reverse_and_double', 'tags': [], 'run_id': '25f72976-aa79-408d-bb42-6d0f038cde52', 'metadata': {}, 'parent_ids': []}\n",
"{'event': 'on_chain_stream', 'run_id': '25f72976-aa79-408d-bb42-6d0f038cde52', 'name': 'reverse_and_double', 'tags': [], 'metadata': {}, 'data': {'chunk': '43214321'}, 'parent_ids': []}\n",
"{'event': 'on_chain_end', 'data': {'output': '43214321'}, 'run_id': '25f72976-aa79-408d-bb42-6d0f038cde52', 'name': 'reverse_and_double', 'tags': [], 'metadata': {}, 'parent_ids': []}\n"
]
}
],
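These hunks refresh the `reverse_and_double` outputs in the same way. A sketch of the composition being streamed (again assuming the v2 events API; defining it with `@chain` over `reverse_word` mirrors the notebook's pattern and is an assumption here):

```python
import asyncio

from langchain_core.runnables import RunnableLambda, chain

reverse_word = RunnableLambda(lambda word: word[::-1])


@chain
async def reverse_and_double(word: str) -> str:
    # The awaited child run corresponds to the nested reverse_word
    # events in the recorded outputs above.
    return await reverse_word.ainvoke(word) * 2


async def main() -> None:
    async for event in reverse_and_double.astream_events("1234", version="v2"):
        print(event)  # "1234" -> "4321" reversed, then doubled to "43214321"


asyncio.run(main())
```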
@@ -1513,7 +1489,7 @@
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"display_name": "langchain",
"language": "python",
"name": "python3"
},
@@ -1527,7 +1503,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.4"
"version": "3.10.16"
}
},
"nbformat": 4,

@@ -94,7 +94,7 @@
"\n",
"from langchain_anthropic import ChatAnthropic\n",
"\n",
"llm = ChatAnthropic(model=\"claude-3-sonnet-20240229\", temperature=0)"
"llm = ChatAnthropic(model=\"claude-3-7-sonnet-20250219\", temperature=0)"
]
},
{
@@ -108,11 +108,12 @@
"text/plain": [
"[{'name': 'count_emails',\n",
" 'args': {'last_n_days': 5},\n",
" 'id': 'toolu_01QYZdJ4yPiqsdeENWHqioFW',\n",
" 'id': 'toolu_01XrE4AU9QLo4imbriDDkmXm',\n",
" 'type': 'tool_call',\n",
" 'output': 10}]"
]
},
"execution_count": 3,
"execution_count": 2,
"metadata": {},
"output_type": "execute_result"
}
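The hunk above updates a recorded tool-call result: a new tool-call `id`, the `'type': 'tool_call'` key, and a re-executed cell count. A sketch of the pattern that yields such a list, assuming `langchain-anthropic` is installed and `ANTHROPIC_API_KEY` is set; the `count_emails` stub and the result-attaching loop are assumptions modeled on the output above:

```python
from langchain_anthropic import ChatAnthropic
from langchain_core.tools import tool


@tool
def count_emails(last_n_days: int) -> int:
    """Count emails received in the last n days."""
    return last_n_days * 2  # stub: 5 days -> 10, matching the output above


llm = ChatAnthropic(model="claude-3-7-sonnet-20250219", temperature=0)
llm_with_tools = llm.bind_tools([count_emails])

msg = llm_with_tools.invoke("How many emails did I get in the last 5 days?")
calls = []
for call in msg.tool_calls:
    # Each entry already has 'name', 'args', 'id', and 'type'; attaching the
    # tool's result under 'output' reproduces the structure shown above.
    calls.append({**call, "output": count_emails.invoke(call["args"])})
print(calls)
```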
@@ -299,7 +300,7 @@
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"display_name": "langchain",
"language": "python",
"name": "python3"
},
@@ -313,7 +314,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.4"
"version": "3.10.16"
}
},
"nbformat": 4,

@@ -37,4 +37,4 @@ import { CategoryTable, IndexTable } from "@theme/FeatureTables";

## All chat models

<IndexTable />
<IndexTable />
@@ -1,9 +1,9 @@
# TigerGraph

>[TigerGraph](https://www.tigergraph.com/tigergraph-db/) is a natively distributed and high-performance graph database.
> The storage of data in a graph format of vertices and edges leads to rich relationships,
> The storage of data in a graph format of vertices and edges leads to rich relationships,
> ideal for grounding LLM responses.

A big example of the `TigerGraph` and `LangChain` integration is [presented here](https://github.com/tigergraph/graph-ml-notebooks/blob/main/applications/large_language_models/TigerGraph_LangChain_Demo.ipynb).

## Installation and Setup
@@ -82,4 +82,4 @@ layerup_security = LayerupSecurity(
response = layerup_security.invoke(
    "Summarize this message: my name is Bob Dylan. My SSN is 123-45-6789."
)
```
```
@@ -4,7 +4,7 @@

## Installation and Setup

No installation is required.
No installation is required.

## Document Loader
@@ -1,6 +1,6 @@
# Activeloop Deep Lake

>[Activeloop Deep Lake](https://docs.activeloop.ai/) is a data lake for Deep Learning applications, allowing you to use it
>[Activeloop Deep Lake](https://docs.activeloop.ai/) is a data lake for Deep Learning applications, allowing you to use it
> as a vector store.

## Why Deep Lake?
@@ -1,6 +1,6 @@
# ADS4GPTs

> [ADS4GPTs](https://www.ads4gpts.com/) is building the open monetization backbone of the AI-Native internet. It helps AI applications monetize through advertising with a UX and Privacy first approach.
> [ADS4GPTs](https://www.ads4gpts.com/) is building the open monetization backbone of the AI-Native internet. It helps AI applications monetize through advertising with a UX and Privacy first approach.

## Installation and Setup
@@ -1,7 +1,7 @@
# AI21 Labs

>[AI21 Labs](https://www.ai21.com/about) is a company specializing in Natural
> Language Processing (NLP), which develops AI systems
>[AI21 Labs](https://www.ai21.com/about) is a company specializing in Natural
> Language Processing (NLP), which develops AI systems
> that can understand and generate natural language.

This page covers how to use the `AI21` ecosystem within `LangChain`.
@@ -17,7 +17,7 @@ pip install langchain-ai21

## Chat models

### AI21 Chat
### AI21 Chat

See a [usage example](/docs/integrations/chat/ai21).

@@ -27,7 +27,7 @@ from langchain_ai21 import ChatAI21

## Deprecated features

:::caution The following features are deprecated.
:::caution The following features are deprecated.
:::

### AI21 LLM
@@ -48,4 +48,4 @@ from langchain_ai21 import AI21ContextualAnswers

```python
from langchain_ai21 import AI21SemanticTextSplitter
```
```

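For context on the splitter imported above, a brief sketch of typical usage (assuming `langchain-ai21` is installed and `AI21_API_KEY` is set; the sample text is illustrative):

```python
from langchain_ai21 import AI21SemanticTextSplitter

splitter = AI21SemanticTextSplitter()
chunks = splitter.split_text(
    "AI21 Labs builds NLP systems. "
    "The semantic splitter groups text into topically coherent chunks."
)
print(chunks)  # list of semantically grouped text segments
```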
@@ -1,7 +1,7 @@
# AINetwork

>[AI Network](https://www.ainetwork.ai/build-on-ain) is a layer 1 blockchain designed to accommodate
> large-scale AI models, utilizing a decentralized GPU network powered by the
>[AI Network](https://www.ainetwork.ai/build-on-ain) is a layer 1 blockchain designed to accommodate
> large-scale AI models, utilizing a decentralized GPU network powered by the
> [$AIN token](https://www.ainetwork.ai/token), enriching AI-driven `NFTs` (`AINFTs`).

@@ -1,6 +1,6 @@
# Airbyte

>[Airbyte](https://github.com/airbytehq/airbyte) is a data integration platform for ELT pipelines from APIs,
>[Airbyte](https://github.com/airbytehq/airbyte) is a data integration platform for ELT pipelines from APIs,
> databases & files to warehouses & lakes. It has the largest catalog of ELT connectors to data warehouses and databases.

## Installation and Setup
@@ -1,15 +1,15 @@
# Alibaba Cloud

>[Alibaba Group Holding Limited (Wikipedia)](https://en.wikipedia.org/wiki/Alibaba_Group), or `Alibaba`
> (Chinese: 阿里巴巴), is a Chinese multinational technology company specializing in e-commerce, retail,
> (Chinese: 阿里巴巴), is a Chinese multinational technology company specializing in e-commerce, retail,
> Internet, and technology.
>
>
> [Alibaba Cloud (Wikipedia)](https://en.wikipedia.org/wiki/Alibaba_Cloud), also known as `Aliyun`
> (Chinese: 阿里云; pinyin: Ālǐyún; lit. 'Ali Cloud'), is a cloud computing company, a subsidiary
> of `Alibaba Group`. `Alibaba Cloud` provides cloud computing services to online businesses and
> (Chinese: 阿里云; pinyin: Ālǐyún; lit. 'Ali Cloud'), is a cloud computing company, a subsidiary
> of `Alibaba Group`. `Alibaba Cloud` provides cloud computing services to online businesses and
> Alibaba's own e-commerce ecosystem.

## LLMs

### Alibaba Cloud PAI EAS
@@ -104,4 +104,4 @@ See [installation instructions and a usage example](/docs/integrations/vectorsto

```python
from langchain_community.vectorstores import TablestoreVectorStore
```
```

@@ -1,15 +1,15 @@
# AnalyticDB

>[AnalyticDB for PostgreSQL](https://www.alibabacloud.com/help/en/analyticdb-for-postgresql/latest/product-introduction-overview)
> is a massively parallel processing (MPP) data warehousing service
>[AnalyticDB for PostgreSQL](https://www.alibabacloud.com/help/en/analyticdb-for-postgresql/latest/product-introduction-overview)
> is a massively parallel processing (MPP) data warehousing service
> from [Alibaba Cloud](https://www.alibabacloud.com/)
> that is designed to analyze large volumes of data online.

>`AnalyticDB for PostgreSQL` is developed based on the open-source `Greenplum Database`
> project and is enhanced with in-depth extensions by `Alibaba Cloud`. AnalyticDB
> for PostgreSQL is compatible with the ANSI SQL 2003 syntax and the PostgreSQL and
> Oracle database ecosystems. AnalyticDB for PostgreSQL also supports row store and
> column store. AnalyticDB for PostgreSQL processes petabytes of data offline at a
>`AnalyticDB for PostgreSQL` is developed based on the open-source `Greenplum Database`
> project and is enhanced with in-depth extensions by `Alibaba Cloud`. AnalyticDB
> for PostgreSQL is compatible with the ANSI SQL 2003 syntax and the PostgreSQL and
> Oracle database ecosystems. AnalyticDB for PostgreSQL also supports row store and
> column store. AnalyticDB for PostgreSQL processes petabytes of data offline at a
> high performance level and supports high concurrency.

This page covers how to use the AnalyticDB ecosystem within LangChain.

@@ -1,9 +1,9 @@
# Annoy

> [Annoy](https://github.com/spotify/annoy) (`Approximate Nearest Neighbors Oh Yeah`)
> is a C++ library with Python bindings to search for points in space that are
> close to a given query point. It also creates large read-only file-based data
> structures that are mapped into memory so that many processes may share the same data.
> [Annoy](https://github.com/spotify/annoy) (`Approximate Nearest Neighbors Oh Yeah`)
> is a C++ library with Python bindings to search for points in space that are
> close to a given query point. It also creates large read-only file-based data
> structures that are mapped into memory so that many processes may share the same data.

## Installation and Setup

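A tiny sketch of the library described above (assuming `pip install annoy`; dimensions and vectors are illustrative):

```python
from annoy import AnnoyIndex

dim = 3  # vector dimensionality
index = AnnoyIndex(dim, "angular")
index.add_item(0, [1.0, 0.0, 0.0])
index.add_item(1, [0.0, 1.0, 0.0])
index.build(10)  # 10 trees; more trees improve recall at the cost of size
index.save("demo.ann")  # file-based index that processes can memory-map
print(index.get_nns_by_vector([0.9, 0.1, 0.0], 1))  # -> [0], the closest item
```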
@@ -31,7 +31,7 @@ model = ChatAnthropic(model='claude-3-opus-20240229')

### [Legacy] AnthropicLLM

**NOTE**: `AnthropicLLM` only supports legacy `Claude 2` models.
**NOTE**: `AnthropicLLM` only supports legacy `Claude 2` models.
To use the newest `Claude 3` models, please use `ChatAnthropic` instead.

See a [usage example](/docs/integrations/llms/anthropic).

@@ -3,12 +3,12 @@

>[Anyscale](https://www.anyscale.com) is a platform to run, fine-tune and scale LLMs via production-ready APIs.
> [Anyscale Endpoints](https://docs.anyscale.com/endpoints/overview) serve many open-source models in a cost-effective way.

`Anyscale` also provides [an example](https://docs.anyscale.com/endpoints/model-serving/examples/langchain-integration)
`Anyscale` also provides [an example](https://docs.anyscale.com/endpoints/model-serving/examples/langchain-integration)
of how to set up `LangChain` with `Anyscale` for advanced chat agents.

## Installation and Setup

- Get an Anyscale Service URL, route, and API key and set them as environment variables (`ANYSCALE_SERVICE_URL`, `ANYSCALE_SERVICE_ROUTE`, `ANYSCALE_SERVICE_TOKEN`).
- Get an Anyscale Service URL, route, and API key and set them as environment variables (`ANYSCALE_SERVICE_URL`, `ANYSCALE_SERVICE_ROUTE`, `ANYSCALE_SERVICE_TOKEN`).
- Please see [the Anyscale docs](https://www.anyscale.com/get-started) for more details.

We have to install the `openai` package:

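A short sketch of the setup step described above (all values are placeholders; that the community `Anyscale` wrapper reads these three variables is an assumption based on this page, not a confirmed API contract):

```python
import os

# Placeholders: substitute the URL, route, and token from your Anyscale service.
os.environ["ANYSCALE_SERVICE_URL"] = "https://service.example.anyscale.example"
os.environ["ANYSCALE_SERVICE_ROUTE"] = "/route"
os.environ["ANYSCALE_SERVICE_TOKEN"] = "your-api-key"

from langchain_community.llms import Anyscale

llm = Anyscale()  # assumption: picks up the variables set above
```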
@@ -1,63 +1,63 @@
# Apache Software Foundation

>[The Apache Software Foundation (Wikipedia)](https://en.wikipedia.org/wiki/The_Apache_Software_Foundation)
> is a decentralized open source community of developers. The software they
> produce is distributed under the terms of the Apache License, a permissive
> open-source license for free and open-source software (FOSS). The Apache projects
> are characterized by a collaborative, consensus-based development process
> and an open and pragmatic software license, which is to say that it
> allows developers, who receive the software freely, to redistribute
> it under non-free terms. Each project is managed by a self-selected
>[The Apache Software Foundation (Wikipedia)](https://en.wikipedia.org/wiki/The_Apache_Software_Foundation)
> is a decentralized open source community of developers. The software they
> produce is distributed under the terms of the Apache License, a permissive
> open-source license for free and open-source software (FOSS). The Apache projects
> are characterized by a collaborative, consensus-based development process
> and an open and pragmatic software license, which is to say that it
> allows developers, who receive the software freely, to redistribute
> it under non-free terms. Each project is managed by a self-selected
> team of technical experts who are active contributors to the project.

## Apache AGE

>[Apache AGE](https://age.apache.org/) is a `PostgreSQL` extension that provides
> graph database functionality. `AGE` is an acronym for `A Graph Extension`, and
> is inspired by Bitnine’s fork of `PostgreSQL 10`, `AgensGraph`, which is
> a multimodal database. The goal of the project is to create a single
> storage that can handle both relational and graph model data so that users
> can use standard ANSI SQL along with `openCypher`, the Graph query language.
> The data elements `Apache AGE` stores are nodes, edges connecting them, and
>[Apache AGE](https://age.apache.org/) is a `PostgreSQL` extension that provides
> graph database functionality. `AGE` is an acronym for `A Graph Extension`, and
> is inspired by Bitnine’s fork of `PostgreSQL 10`, `AgensGraph`, which is
> a multimodal database. The goal of the project is to create a single
> storage that can handle both relational and graph model data so that users
> can use standard ANSI SQL along with `openCypher`, the Graph query language.
> The data elements `Apache AGE` stores are nodes, edges connecting them, and
> attributes of nodes and edges.

See more about [integrating with Apache AGE](/docs/integrations/graphs/apache_age).

## Apache Cassandra

>[Apache Cassandra](https://cassandra.apache.org/) is a NoSQL, row-oriented,
> highly scalable and highly available database. Starting with version 5.0,
>[Apache Cassandra](https://cassandra.apache.org/) is a NoSQL, row-oriented,
> highly scalable and highly available database. Starting with version 5.0,
> the database ships with vector search capabilities.

See more about [integrating with Apache Cassandra](/docs/integrations/providers/cassandra/).

## Apache Doris

>[Apache Doris](https://doris.apache.org/) is a modern data warehouse for
>[Apache Doris](https://doris.apache.org/) is a modern data warehouse for
> real-time analytics. It delivers lightning-fast analytics on real-time data at scale.
>
>Usually `Apache Doris` is categorized into OLAP, and it has shown excellent
> performance in ClickBench — a Benchmark For Analytical DBMS. Since it has
>Usually `Apache Doris` is categorized into OLAP, and it has shown excellent
> performance in ClickBench — a Benchmark For Analytical DBMS. Since it has
> a super-fast vectorized execution engine, it could also be used as a fast vectordb.

See more about [integrating with Apache Doris](/docs/integrations/providers/apache_doris/).

## Apache Kafka

>[Apache Kafka](https://github.com/apache/kafka) is a distributed messaging system
>[Apache Kafka](https://github.com/apache/kafka) is a distributed messaging system
> that is used to publish and subscribe to streams of records.

See more about [integrating with Apache Kafka](/docs/integrations/memory/kafka_chat_message_history).

## Apache Spark

>[Apache Spark](https://spark.apache.org/) is a unified analytics engine for
> large-scale data processing. It provides high-level APIs in Scala, Java,
> Python, and R, and an optimized engine that supports general computation
> graphs for data analysis. It also supports a rich set of higher-level
> tools including `Spark SQL` for SQL and DataFrames, `pandas API on Spark`
> for pandas workloads, `MLlib` for machine learning,
>[Apache Spark](https://spark.apache.org/) is a unified analytics engine for
> large-scale data processing. It provides high-level APIs in Scala, Java,
> Python, and R, and an optimized engine that supports general computation
> graphs for data analysis. It also supports a rich set of higher-level
> tools including `Spark SQL` for SQL and DataFrames, `pandas API on Spark`
> for pandas workloads, `MLlib` for machine learning,
> `GraphX` for graph processing, and `Structured Streaming` for stream processing.

See more about [integrating with Apache Spark](/docs/integrations/providers/spark).

@@ -3,8 +3,8 @@
>[Apache Doris](https://doris.apache.org/) is a modern data warehouse for real-time analytics.
It delivers lightning-fast analytics on real-time data at scale.

>Usually `Apache Doris` is categorized into OLAP, and it has shown excellent performance
> in [ClickBench — a Benchmark For Analytical DBMS](https://benchmark.clickhouse.com/).
>Usually `Apache Doris` is categorized into OLAP, and it has shown excellent performance
> in [ClickBench — a Benchmark For Analytical DBMS](https://benchmark.clickhouse.com/).
> Since it has a super-fast vectorized execution engine, it could also be used as a fast vectordb.

## Installation and Setup