Keep imports

keep imports
bumped required sdk version
2026-02-16 18:24:31 +00:00 · 2023-12-13 10:20:59 -08:00 · 2023-12-13 10:18:50 -08:00 · 2023-12-13 15:07:43 +01:00 · 2023-12-13 08:13:22 +01:00 · 2023-12-13 07:12:14 +01:00
3276 changed files with 245105 additions and 178560 deletions
--- a/.github/CONTRIBUTING.md
+++ b/.github/CONTRIBUTING.md
@@ -23,7 +23,7 @@ It's essential that we maintain great documentation and testing. If you:
  - Update any affected example notebooks and documentation. These live in `docs`.
  - Update unit and integration tests when relevant.
 - Add a feature
-  - Add a demo notebook in `docs/modules`.
+  - Add a demo notebook in `docs/docs/`.
  - Add unit and integration tests.

 We are a small, progress-oriented team. If there's something you'd like to add or change, opening a pull request is the
@@ -70,16 +70,18 @@ Install Poetry: **[documentation on how to install it](https://python-poetry.org
 ❗Note: If you use `Conda` or `Pyenv` as your environment/package manager, after installing Poetry,
 tell Poetry to use the virtualenv python environment (`poetry config virtualenvs.prefer-active-python true`)

-### Core vs. Experimental
+### Different packages

-This repository contains two separate projects:
- `langchain`: core langchain code, abstractions, and use cases.
- `langchain.experimental`: see the [Experimental README](https://github.com/langchain-ai/langchain/tree/master/libs/experimental/README.md) for more information.
+This repository contains multiple packages:
+- `langchain-core`: Base interfaces for key abstractions as well as logic for combining them in chains (LangChain Expression Language).
+- `langchain-community`: Third-party integrations of various components.
+- `langchain`: Chains, agents, and retrieval logic that makes up the cognitive architecture of your applications.
+- `langchain-experimental`: Components and chains that are experimental, either in the sense that the techniques are novel and still being tested, or they require giving the LLM more access than would be possible in most production systems.

 Each of these has its own development environment. Docs are run from the top-level makefile, but development
 is split across separate test & release flows.

-For this quickstart, start with langchain core:
+For this quickstart, start with langchain:

 ```bash
 cd libs/langchain
@@ -128,6 +130,24 @@ make docker_tests

 There are also [integration tests and code-coverage](https://github.com/langchain-ai/langchain/tree/master/libs/langchain/tests/README.md) available.

+### Only develop langchain_core or langchain_experimental
+
+If you are only developing `langchain_core` or `langchain_experimental`, you can simply install the dependencies for the respective projects and run tests:
+
+```bash
+cd libs/core
+poetry install --with test
+make test
+```
+
+Or:
+
+```bash
+cd libs/experimental
+poetry install --with test
+make test
+```
+
 ### Formatting and Linting

 Run these locally before submitting a PR; the CI system will check also.
@@ -214,6 +234,10 @@ ignore-words-list = 'momento,collison,ned,foor,reworkd,parth,whats,aapply,mysogy

 Langchain relies heavily on optional dependencies to keep the Langchain package lightweight.

+You only need to add a new dependency if a **unit test** relies on the package.
+If your package is only required for **integration tests**, then you can skip these
+steps and leave all pyproject.toml and poetry.lock files alone.
+
 If you're adding a new dependency to Langchain, assume that it will be an optional dependency, and
 that most users won't have it installed.

@@ -307,15 +331,50 @@ what you wanted by clicking the `View deployment` or `Visit Preview` buttons on
 This will take you to a preview of the documentation changes.
 This preview is created by [Vercel](https://vercel.com/docs/getting-started-with-vercel).

-## 🏭 Release Process
+## 📕 Releases & Versioning

 As of now, LangChain has an ad hoc release process: releases are cut with high frequency by
-a developer and published to [PyPI](https://pypi.org/project/langchain/).
+a maintainer and published to [PyPI](https://pypi.org/). 
+The different packages are versioned slightly differently.

-LangChain follows the [semver](https://semver.org/) versioning standard. However, as pre-1.0 software,
-even patch releases may contain [non-backwards-compatible changes](https://semver.org/#spec-item-4).
+### `langchain-core`

-### 🌟 Recognition
+`langchain-core` is currently on version `0.1.x`. 
+
+As `langchain-core` contains the base abstractions and runtime for the whole LangChain ecosystem, we will communicate any breaking changes with advance notice and version bumps. The exception for this is anything in `langchain_core.beta`. The reason for `langchain_core.beta` is that given the rate of change of the field, being able to move quickly is still a priority, and this module is our attempt to do so.
+
+Minor version increases will occur for:
+
+- Breaking changes for any public interfaces NOT in `langchain_core.beta`
+
+Patch version increases will occur for:
+
+- Bug fixes
+- New features
+- Any changes to private interfaces
+- Any changes to `langchain_core.beta`
+
+### `langchain`
+
+`langchain` is currently on version `0.0.x`
+
+All changes will be accompanied by a patch version increase. Any changes to public interfaces are nearly always done in a backwards compatible way and will be communicated ahead of time when they are not backwards compatible.
+
+We are targeting January 2024 for a release of `langchain` v0.1, at which point `langchain` will adopt the same versioning policy as `langchain-core`.
+
+### `langchain-community`
+
+`langchain-community` is currently on version `0.0.x`
+
+All changes will be accompanied by a patch version increase.
+
+### `langchain-experimental`
+
+`langchain-experimental` is currently on version `0.0.x`
+
+All changes will be accompanied by a patch version increase.
+
+## 🌟 Recognition

 If your contribution has made its way into a release, we will want to give you credit on Twitter (only if you want though)!
 If you have a Twitter account you would like us to mention, please let us know in the PR or through another means.
--- a/.github/scripts/check_diff.py
+++ b/.github/scripts/check_diff.py
@@ -0,0 +1,46 @@
+import json
+import sys
+
+ALL_DIRS = {
+    "libs/core",
+    "libs/langchain",
+    "libs/experimental",
+    "libs/community",
+}
+
+if __name__ == "__main__":
+    files = sys.argv[1:]
+    dirs_to_run = set()
+
+    for file in files:
+        if any(
+            file.startswith(dir_)
+            for dir_ in (
+                ".github/workflows",
+                ".github/tools",
+                ".github/actions",
+                "libs/core",
+                ".github/scripts/check_diff.py",
+            )
+        ):
+            dirs_to_run = ALL_DIRS
+            break
+        elif "libs/community" in file:
+            dirs_to_run.update(
+                ("libs/community", "libs/langchain", "libs/experimental")
+            )
+        elif "libs/partners" in file:
+            partner_dir = file.split("/")[2]
+            dirs_to_run.update(
+                (f"libs/partners/{partner_dir}", "libs/langchain", "libs/experimental")
+            )
+        elif "libs/langchain" in file:
+            dirs_to_run.update(("libs/langchain", "libs/experimental"))
+        elif "libs/experimental" in file:
+            dirs_to_run.add("libs/experimental")
+        elif file.startswith("libs/"):
+            dirs_to_run = ALL_DIRS
+            break
+        else:
+            pass
+    print(json.dumps(list(dirs_to_run)))
--- a/.github/workflows/_all_ci.yml
+++ b/.github/workflows/_all_ci.yml
@@ -0,0 +1,105 @@
+---
+name: langchain CI
+
+on:
+  workflow_call:
+    inputs:
+      working-directory:
+        required: true
+        type: string
+        description: "From which folder this pipeline executes"
+  workflow_dispatch:
+    inputs:
+      working-directory:
+        required: true
+        type: choice
+        default: 'libs/langchain'
+        options:
+        - libs/langchain
+        - libs/core
+        - libs/experimental
+        - libs/community
+
+
+# If another push to the same PR or branch happens while this workflow is still running,
+# cancel the earlier run in favor of the next run.
+#
+# There's no point in testing an outdated version of the code. GitHub only allows
+# a limited number of job runners to be active at the same time, so it's better to cancel
+# pointless jobs early so that more useful jobs can run sooner.
+concurrency:
+  group: ${{ github.workflow }}-${{ github.ref }}-${{ inputs.working-directory }}
+  cancel-in-progress: true
+
+env:
+  POETRY_VERSION: "1.6.1"
+
+jobs:
+  lint:
+    uses: ./.github/workflows/_lint.yml
+    with:
+      working-directory: ${{ inputs.working-directory }}
+    secrets: inherit
+
+  test:
+    uses: ./.github/workflows/_test.yml
+    with:
+      working-directory: ${{ inputs.working-directory }}
+    secrets: inherit
+
+  compile-integration-tests:
+    uses: ./.github/workflows/_compile_integration_test.yml
+    with:
+      working-directory: ${{ inputs.working-directory }}
+    secrets: inherit
+
+  dependencies:
+    uses: ./.github/workflows/_dependencies.yml
+    with:
+      working-directory: ${{ inputs.working-directory }}
+    secrets: inherit
+
+  extended-tests:
+    runs-on: ubuntu-latest
+    strategy:
+      matrix:
+        python-version:
+          - "3.8"
+          - "3.9"
+          - "3.10"
+          - "3.11"
+    name: Python ${{ matrix.python-version }} extended tests
+    defaults:
+      run:
+        working-directory: ${{ inputs.working-directory }}
+    steps:
+      - uses: actions/checkout@v4
+
+      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}
+        uses: "./.github/actions/poetry_setup"
+        with:
+          python-version: ${{ matrix.python-version }}
+          poetry-version: ${{ env.POETRY_VERSION }}
+          working-directory: ${{ inputs.working-directory }}
+          cache-key: extended
+
+      - name: Install dependencies
+        shell: bash
+        run: |
+          echo "Running extended tests, installing dependencies with poetry..."
+          poetry install -E extended_testing --with test
+
+      - name: Run extended tests
+        run: make extended_tests
+
+      - name: Ensure the tests did not create any additional files
+        shell: bash
+        run: |
+          set -eu
+
+          STATUS="$(git status)"
+          echo "$STATUS"
+
+          # grep will exit non-zero if the target message isn't found,
+          # and `set -e` above will cause the step to fail.
+          echo "$STATUS" | grep 'nothing to commit, working tree clean'
--- a/.github/workflows/_compile_integration_test.yml
+++ b/.github/workflows/_compile_integration_test.yml
@@ -7,10 +7,6 @@ on:
        required: true
        type: string
        description: "From which folder this pipeline executes"
-      langchain-core-location:
-        required: false
-        type: string
-        description: "Relative path to the langchain core library folder"

 env:
  POETRY_VERSION: "1.6.1"
@@ -42,15 +38,7 @@ jobs:

      - name: Install integration dependencies
        shell: bash
-        run: poetry install --with=test_integration
-
-      - name: Install langchain core editable
-        working-directory: ${{ inputs.working-directory }}
-        if: ${{ inputs.langchain-core-location }}
-        env:
-          LANGCHAIN_CORE_LOCATION: ${{ inputs.langchain-core-location }}
-        run: |
-          poetry run pip install -e "$LANGCHAIN_CORE_LOCATION"
+        run: poetry install --with=test_integration,test

      - name: Check integration tests compile
        shell: bash
--- a/.github/workflows/_pydantic_compatibility.yml
+++ b/.github/workflows/_pydantic_compatibility.yml
@@ -1,4 +1,4 @@
-name: pydantic v1/v2 compatibility
+name: dependencies

 on:
  workflow_call:
@@ -11,10 +11,6 @@ on:
        required: false
        type: string
        description: "Relative path to the langchain library folder"
-      langchain-core-location:
-        required: false
-        type: string
-        description: "Relative path to the langchain core library folder"

 env:
  POETRY_VERSION: "1.6.1"
@@ -32,7 +28,7 @@ jobs:
          - "3.9"
          - "3.10"
          - "3.11"
-    name: Pydantic v1/v2 compatibility - Python ${{ matrix.python-version }}
+    name: dependencies - Python ${{ matrix.python-version }}
    steps:
      - uses: actions/checkout@v4

@@ -48,6 +44,14 @@ jobs:
        shell: bash
        run: poetry install

+      - name: Check imports with base dependencies
+        shell: bash
+        run: poetry run make check_imports
+
+      - name: Install test dependencies
+        shell: bash
+        run: poetry install --with test
+
      - name: Install langchain editable
        working-directory: ${{ inputs.working-directory }}
        if: ${{ inputs.langchain-location }}
@@ -56,14 +60,6 @@ jobs:
        run: |
          poetry run pip install -e "$LANGCHAIN_LOCATION"

-      - name: Install langchain core editable
-        working-directory: ${{ inputs.working-directory }}
-        if: ${{ inputs.langchain-core-location }}
-        env:
-          LANGCHAIN_CORE_LOCATION: ${{ inputs.langchain-core-location }}
-        run: |
-          poetry run pip install -e "$LANGCHAIN_CORE_LOCATION"
-
      - name: Install the opposite major version of pydantic
        # If normal tests use pydantic v1, here we'll use v2, and vice versa.
        shell: bash
--- a/.github/workflows/_lint.yml
+++ b/.github/workflows/_lint.yml
@@ -11,10 +11,6 @@ on:
        required: false
        type: string
        description: "Relative path to the langchain library folder"
-      langchain-core-location:
-        required: false
-        type: string
-        description: "Relative path to the langchain core library folder"

 env:
  POETRY_VERSION: "1.6.1"
@@ -72,7 +68,7 @@ jobs:
        # It doesn't matter how you change it, any change will cause a cache-bust.
        working-directory: ${{ inputs.working-directory }}
        run: |
-          poetry install --with dev,lint,test,typing
+          poetry install --with lint,typing

      - name: Install langchain editable
        working-directory: ${{ inputs.working-directory }}
@@ -82,14 +78,6 @@ jobs:
        run: |
          poetry run pip install -e "$LANGCHAIN_LOCATION"

-      - name: Install langchain core editable
-        working-directory: ${{ inputs.working-directory }}
-        if: ${{ inputs.langchain-core-location }}
-        env:
-          LANGCHAIN_CORE_LOCATION: ${{ inputs.langchain-core-location }}
-        run: |
-          poetry run pip install -e "$LANGCHAIN_CORE_LOCATION"
-
      - name: Get .mypy_cache to speed up mypy
        uses: actions/cache@v3
        env:
@@ -97,9 +85,37 @@ jobs:
        with:
          path: |
            ${{ env.WORKDIR }}/.mypy_cache
-          key: mypy-${{ runner.os }}-${{ runner.arch }}-py${{ matrix.python-version }}-${{ inputs.working-directory }}-${{ hashFiles(format('{0}/poetry.lock', env.WORKDIR)) }}
+          key: mypy-lint-${{ runner.os }}-${{ runner.arch }}-py${{ matrix.python-version }}-${{ inputs.working-directory }}-${{ hashFiles(format('{0}/poetry.lock', env.WORKDIR)) }}
+

      - name: Analysing the code with our lint
        working-directory: ${{ inputs.working-directory }}
        run: |
-          make lint
+          make lint_package
+
+      - name: Install test dependencies
+        # Also installs dev/lint/test/typing dependencies, to ensure we have
+        # type hints for as many of our libraries as possible.
+        # This helps catch errors that require dependencies to be spotted, for example:
+        # https://github.com/langchain-ai/langchain/pull/10249/files#diff-935185cd488d015f026dcd9e19616ff62863e8cde8c0bee70318d3ccbca98341
+        #
+        # If you change this configuration, make sure to change the `cache-key`
+        # in the `poetry_setup` action above to stop using the old cache.
+        # It doesn't matter how you change it, any change will cause a cache-bust.
+        working-directory: ${{ inputs.working-directory }}
+        run: |
+          poetry install --with test
+
+      - name: Get .mypy_cache_test to speed up mypy
+        uses: actions/cache@v3
+        env:
+          SEGMENT_DOWNLOAD_TIMEOUT_MIN: "2"
+        with:
+          path: |
+            ${{ env.WORKDIR }}/.mypy_cache_test
+          key: mypy-test-${{ runner.os }}-${{ runner.arch }}-py${{ matrix.python-version }}-${{ inputs.working-directory }}-${{ hashFiles(format('{0}/poetry.lock', env.WORKDIR)) }}
+
+      - name: Analysing the code with our lint
+        working-directory: ${{ inputs.working-directory }}
+        run: |
+          make lint_tests
--- a/.github/workflows/_release.yml
+++ b/.github/workflows/_release.yml
@@ -7,6 +7,17 @@ on:
        required: true
        type: string
        description: "From which folder this pipeline executes"
+  workflow_dispatch:
+    inputs:
+      working-directory:
+        required: true
+        type: choice
+        default: 'libs/langchain'
+        options:
+          - libs/langchain
+          - libs/core
+          - libs/experimental
+          - libs/community

 env:
  PYTHON_VERSION: "3.10"
--- a/.github/workflows/_test.yml
+++ b/.github/workflows/_test.yml
@@ -11,10 +11,6 @@ on:
        required: false
        type: string
        description: "Relative path to the langchain library folder"
-      langchain-core-location:
-        required: false
-        type: string
-        description: "Relative path to the langchain core library folder"

 env:
  POETRY_VERSION: "1.6.1"
@@ -46,7 +42,7 @@ jobs:

      - name: Install dependencies
        shell: bash
-        run: poetry install
+        run: poetry install --with test

      - name: Install langchain editable
        working-directory: ${{ inputs.working-directory }}
@@ -56,14 +52,6 @@ jobs:
        run: |
          poetry run pip install -e "$LANGCHAIN_LOCATION"

-      - name: Install langchain core editable
-        working-directory: ${{ inputs.working-directory }}
-        if: ${{ inputs.langchain-core-location }}
-        env:
-          LANGCHAIN_CORE_LOCATION: ${{ inputs.langchain-core-location }}
-        run: |
-          poetry run pip install -e "$LANGCHAIN_CORE_LOCATION"
-
      - name: Run core tests
        shell: bash
        run: |
--- a/.github/workflows/check_diffs.yml
+++ b/.github/workflows/check_diffs.yml
@@ -0,0 +1,47 @@
+---
+name: Check library diffs
+
+on:
+  push:
+    branches: [master]
+  pull_request:
+    paths:
+      - ".github/actions/**"
+      - ".github/tools/**"
+      - ".github/workflows/**"
+      - "libs/**"
+
+# If another push to the same PR or branch happens while this workflow is still running,
+# cancel the earlier run in favor of the next run.
+#
+# There's no point in testing an outdated version of the code. GitHub only allows
+# a limited number of job runners to be active at the same time, so it's better to cancel
+# pointless jobs early so that more useful jobs can run sooner.
+concurrency:
+  group: ${{ github.workflow }}-${{ github.ref }}
+  cancel-in-progress: true
+
+jobs:
+  build:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+      - uses: actions/setup-python@v4
+        with:
+          python-version: '3.10'
+      - id: files
+        uses: Ana06/get-changed-files@v2.2.0
+      - id: set-matrix
+        run: echo "dirs-to-run=$(python .github/scripts/check_diff.py ${{ steps.files.outputs.all }})" >> $GITHUB_OUTPUT
+    outputs:
+      dirs-to-run: ${{ steps.set-matrix.outputs.dirs-to-run }}
+  ci:
+    needs: [ build ]
+    strategy:
+      matrix:
+        working-directory: ${{ fromJson(needs.build.outputs.dirs-to-run) }}
+    uses: ./.github/workflows/_all_ci.yml
+    with:
+      working-directory: ${{ matrix.working-directory }}
+
+
--- a/.github/workflows/langchain_ci.yml
+++ b/.github/workflows/langchain_ci.yml
@@ -1,154 +0,0 @@
---
-name: libs/langchain CI
-
-on:
-  push:
-    branches: [ master ]
-  pull_request:
-    paths:
-      - '.github/actions/poetry_setup/action.yml'
-      - '.github/tools/**'
-      - '.github/workflows/_lint.yml'
-      - '.github/workflows/_test.yml'
-      - '.github/workflows/_pydantic_compatibility.yml'
-      - '.github/workflows/langchain_ci.yml'
-      - 'libs/*'
-      - 'libs/langchain/**'
-  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI
-
-# If another push to the same PR or branch happens while this workflow is still running,
-# cancel the earlier run in favor of the next run.
-#
-# There's no point in testing an outdated version of the code. GitHub only allows
-# a limited number of job runners to be active at the same time, so it's better to cancel
-# pointless jobs early so that more useful jobs can run sooner.
-concurrency:
-  group: ${{ github.workflow }}-${{ github.ref }}
-  cancel-in-progress: true
-
-env:
-  POETRY_VERSION: "1.6.1"
-  WORKDIR: "libs/langchain"
-
-jobs:
-  lint:
-    uses:
-      ./.github/workflows/_lint.yml
-    with:
-      working-directory: libs/langchain
-      langchain-core-location: ../core
-    secrets: inherit
-
-  test:
-    uses:
-      ./.github/workflows/_test.yml
-    with:
-      working-directory: libs/langchain
-      langchain-core-location: ../core
-    secrets: inherit
-
-  compile-integration-tests:
-    uses:
-      ./.github/workflows/_compile_integration_test.yml
-    with:
-      working-directory: libs/langchain
-      langchain-core-location: ../core
-    secrets: inherit
-
-  pydantic-compatibility:
-    uses:
-      ./.github/workflows/_pydantic_compatibility.yml
-    with:
-      working-directory: libs/langchain
-      langchain-core-location: ../core
-    secrets: inherit
-
-  # It's possible that langchain works fine with the latest *published* langchain-core,
-  # but is broken with the langchain-core on `master`.
-  #
-  # We want to catch situations like that *before* releasing a new langchain-core, hence this test.
-  test-with-latest-langchain-core:
-    runs-on: ubuntu-latest
-    defaults:
-      run:
-        working-directory: ${{ env.WORKDIR }}
-    strategy:
-      matrix:
-        python-version:
-          - "3.8"
-          - "3.9"
-          - "3.10"
-          - "3.11"
-    name: test with unpublished langchain-core - Python ${{ matrix.python-version }}
-    steps:
-      - uses: actions/checkout@v4
-
-      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}
-        uses: "./.github/actions/poetry_setup"
-        with:
-          python-version: ${{ matrix.python-version }}
-          poetry-version: ${{ env.POETRY_VERSION }}
-          working-directory: ${{ env.WORKDIR }}
-          cache-key: unpublished-langchain-core
-
-      - name: Install dependencies
-        shell: bash
-        run: |
-          echo "Running tests with unpublished langchain, installing dependencies with poetry..."
-          poetry install
-
-          echo "Editably installing langchain-core outside of poetry, to avoid messing up lockfile..."
-          poetry run pip install -e ../core
-
-      - name: Run tests
-        run: make test
-
-  extended-tests:
-    runs-on: ubuntu-latest
-    defaults:
-      run:
-        working-directory: ${{ env.WORKDIR }}
-    strategy:
-      matrix:
-        python-version:
-          - "3.8"
-          - "3.9"
-          - "3.10"
-          - "3.11"
-    name: Python ${{ matrix.python-version }} extended tests
-    steps:
-      - uses: actions/checkout@v4
-
-      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}
-        uses: "./.github/actions/poetry_setup"
-        with:
-          python-version: ${{ matrix.python-version }}
-          poetry-version: ${{ env.POETRY_VERSION }}
-          working-directory: libs/langchain
-          cache-key: extended
-
-      - name: Install dependencies
-        shell: bash
-        run: |
-          echo "Running extended tests, installing dependencies with poetry..."
-          poetry install -E extended_testing
-
-      - name: Install langchain core editable
-        shell: bash
-        run: |
-          poetry run pip install -e ../core
-
-      - name: Run extended tests
-        run: make extended_tests
-
-      - name: Ensure the tests did not create any additional files
-        shell: bash
-        run: |
-          set -eu
-
-          STATUS="$(git status)"
-          echo "$STATUS"
-
-          # grep will exit non-zero if the target message isn't found,
-          # and `set -e` above will cause the step to fail.
-          echo "$STATUS" | grep 'nothing to commit, working tree clean'
--- a/.github/workflows/langchain_cli_ci.yml
+++ b/.github/workflows/langchain_cli_ci.yml
@@ -1,47 +0,0 @@
---
-name: libs/cli CI
-
-on:
-  push:
-    branches: [ master ]
-  pull_request:
-    paths:
-      - '.github/actions/poetry_setup/action.yml'
-      - '.github/tools/**'
-      - '.github/workflows/_lint.yml'
-      - '.github/workflows/_test.yml'
-      - '.github/workflows/_pydantic_compatibility.yml'
-      - '.github/workflows/langchain_cli_ci.yml'
-      - 'libs/cli/**'
-      - 'libs/*'
-  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI
-
-# If another push to the same PR or branch happens while this workflow is still running,
-# cancel the earlier run in favor of the next run.
-#
-# There's no point in testing an outdated version of the code. GitHub only allows
-# a limited number of job runners to be active at the same time, so it's better to cancel
-# pointless jobs early so that more useful jobs can run sooner.
-concurrency:
-  group: ${{ github.workflow }}-${{ github.ref }}
-  cancel-in-progress: true
-
-env:
-  POETRY_VERSION: "1.6.1"
-  WORKDIR: "libs/cli"
-
-jobs:
-  lint:
-    uses:
-      ./.github/workflows/_lint.yml
-    with:
-      working-directory: libs/cli
-      langchain-location: ../langchain
-    secrets: inherit
-
-  test:
-    uses:
-      ./.github/workflows/_test.yml
-    with:
-      working-directory: libs/cli
-    secrets: inherit
--- a/.github/workflows/langchain_community_release.yml
+++ b/.github/workflows/langchain_community_release.yml
@@ -0,0 +1,13 @@
+---
+name: libs/community Release
+
+on:
+  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI
+
+jobs:
+  release:
+    uses:
+      ./.github/workflows/_release.yml
+    with:
+      working-directory: libs/community
+    secrets: inherit
--- a/.github/workflows/langchain_core_ci.yml
+++ b/.github/workflows/langchain_core_ci.yml
@@ -1,52 +0,0 @@
---
-name: libs/langchain core CI
-
-on:
-  push:
-    branches: [ master ]
-  pull_request:
-    paths:
-      - '.github/actions/poetry_setup/action.yml'
-      - '.github/tools/**'
-      - '.github/workflows/_lint.yml'
-      - '.github/workflows/_test.yml'
-      - '.github/workflows/_pydantic_compatibility.yml'
-      - '.github/workflows/langchain_core_ci.yml'
-      - 'libs/core/**'
-  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI
-
-# If another push to the same PR or branch happens while this workflow is still running,
-# cancel the earlier run in favor of the next run.
-#
-# There's no point in testing an outdated version of the code. GitHub only allows
-# a limited number of job runners to be active at the same time, so it's better to cancel
-# pointless jobs early so that more useful jobs can run sooner.
-concurrency:
-  group: ${{ github.workflow }}-${{ github.ref }}
-  cancel-in-progress: true
-
-env:
-  POETRY_VERSION: "1.6.1"
-  WORKDIR: "libs/core"
-
-jobs:
-  lint:
-    uses:
-      ./.github/workflows/_lint.yml
-    with:
-      working-directory: libs/core
-    secrets: inherit
-
-  test:
-    uses:
-      ./.github/workflows/_test.yml
-    with:
-      working-directory: libs/core
-    secrets: inherit
-
-  pydantic-compatibility:
-    uses:
-      ./.github/workflows/_pydantic_compatibility.yml
-    with:
-      working-directory: libs/core
-    secrets: inherit
--- a/.github/workflows/langchain_experimental_ci.yml
+++ b/.github/workflows/langchain_experimental_ci.yml
@@ -1,141 +0,0 @@
---
-name: libs/experimental CI
-
-on:
-  push:
-    branches: [ master ]
-  pull_request:
-    paths:
-      - '.github/actions/poetry_setup/action.yml'
-      - '.github/tools/**'
-      - '.github/workflows/_lint.yml'
-      - '.github/workflows/_test.yml'
-      - '.github/workflows/langchain_experimental_ci.yml'
-      - 'libs/*'
-      - 'libs/experimental/**'
-  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI
-
-# If another push to the same PR or branch happens while this workflow is still running,
-# cancel the earlier run in favor of the next run.
-#
-# There's no point in testing an outdated version of the code. GitHub only allows
-# a limited number of job runners to be active at the same time, so it's better to cancel
-# pointless jobs early so that more useful jobs can run sooner.
-concurrency:
-  group: ${{ github.workflow }}-${{ github.ref }}
-  cancel-in-progress: true
-
-env:
-  POETRY_VERSION: "1.6.1"
-  WORKDIR: "libs/experimental"
-
-jobs:
-  lint:
-    uses:
-      ./.github/workflows/_lint.yml
-    with:
-      working-directory: libs/experimental
-      langchain-location: ../langchain
-      langchain-core-location: ../core
-    secrets: inherit
-
-  test:
-    uses:
-      ./.github/workflows/_test.yml
-    with:
-      working-directory: libs/experimental
-      langchain-location: ../langchain
-      langchain-core-location: ../core
-    secrets: inherit
-
-  compile-integration-tests:
-    uses:
-      ./.github/workflows/_compile_integration_test.yml
-    with:
-      working-directory: libs/experimental
-    secrets: inherit
-
-  # It's possible that langchain-experimental works fine with the latest *published* langchain,
-  # but is broken with the langchain on `master`.
-  #
-  # We want to catch situations like that *before* releasing a new langchain, hence this test.
-  test-with-latest-langchain:
-    runs-on: ubuntu-latest
-    defaults:
-      run:
-        working-directory: ${{ env.WORKDIR }}
-    strategy:
-      matrix:
-        python-version:
-          - "3.8"
-          - "3.9"
-          - "3.10"
-          - "3.11"
-    name: test with unpublished langchain - Python ${{ matrix.python-version }}
-    steps:
-      - uses: actions/checkout@v4
-
-      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}
-        uses: "./.github/actions/poetry_setup"
-        with:
-          python-version: ${{ matrix.python-version }}
-          poetry-version: ${{ env.POETRY_VERSION }}
-          working-directory: ${{ env.WORKDIR }}
-          cache-key: unpublished-langchain
-
-      - name: Install dependencies
-        shell: bash
-        run: |
-          echo "Running tests with unpublished langchain, installing dependencies with poetry..."
-          poetry install
-
-          echo "Editably installing langchain outside of poetry, to avoid messing up lockfile..."
-          poetry run pip install -e ../langchain
-          poetry run pip install -e ../core
-
-      - name: Run tests
-        run: make test
-  extended-tests:
-    runs-on: ubuntu-latest
-    defaults:
-      run:
-        working-directory: ${{ env.WORKDIR }}
-    strategy:
-      matrix:
-        python-version:
-          - "3.8"
-          - "3.9"
-          - "3.10"
-          - "3.11"
-    name: Python ${{ matrix.python-version }} extended tests
-    steps:
-      - uses: actions/checkout@v4
-
-      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}
-        uses: "./.github/actions/poetry_setup"
-        with:
-          python-version: ${{ matrix.python-version }}
-          poetry-version: ${{ env.POETRY_VERSION }}
-          working-directory: libs/experimental
-          cache-key: extended
-
-      - name: Install dependencies
-        shell: bash
-        run: |
-          echo "Running extended tests, installing dependencies with poetry..."
-          poetry install -E extended_testing
-
-      - name: Run extended tests
-        run: make extended_tests
-
-      - name: Ensure the tests did not create any additional files
-        shell: bash
-        run: |
-          set -eu
-
-          STATUS="$(git status)"
-          echo "$STATUS"
-
-          # grep will exit non-zero if the target message isn't found,
-          # and `set -e` above will cause the step to fail.
-          echo "$STATUS" | grep 'nothing to commit, working tree clean'
--- a/.github/workflows/langchain_openai_release.yml
+++ b/.github/workflows/langchain_openai_release.yml
@@ -0,0 +1,13 @@
+---
+name: libs/core Release
+
+on:
+  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI
+
+jobs:
+  release:
+    uses:
+      ./.github/workflows/_release.yml
+    with:
+      working-directory: libs/core
+    secrets: inherit
--- a/.github/workflows/scheduled_test.yml
+++ b/.github/workflows/scheduled_test.yml
@@ -52,13 +52,7 @@ jobs:
        shell: bash
        run: |
          echo "Running scheduled tests, installing dependencies with poetry..."
-          poetry install --with=test_integration
-          poetry run pip install google-cloud-aiplatform
-          poetry run pip install "boto3>=1.28.57"
-          if [[ ${{ matrix.python-version }} != "3.8" ]]
-          then
-            poetry run pip install fireworks-ai
-          fi
+          poetry install --with=test_integration,test

      - name: Run tests
        shell: bash
@@ -68,7 +62,9 @@ jobs:
          AZURE_OPENAI_API_VERSION: ${{ secrets.AZURE_OPENAI_API_VERSION }}
          AZURE_OPENAI_API_BASE: ${{ secrets.AZURE_OPENAI_API_BASE }}
          AZURE_OPENAI_API_KEY: ${{ secrets.AZURE_OPENAI_API_KEY }}
-          AZURE_OPENAI_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_DEPLOYMENT_NAME }}
+          AZURE_OPENAI_CHAT_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_CHAT_DEPLOYMENT_NAME }}
+          AZURE_OPENAI_LLM_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_LLM_DEPLOYMENT_NAME }}
+          AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME: ${{ secrets.AZURE_OPENAI_EMBEDDINGS_DEPLOYMENT_NAME }}
          FIREWORKS_API_KEY: ${{ secrets.FIREWORKS_API_KEY }}
        run: |
          make scheduled_tests
--- a/.github/workflows/templates_ci.yml
+++ b/.github/workflows/templates_ci.yml
@@ -33,5 +33,4 @@ jobs:
      ./.github/workflows/_lint.yml
    with:
      working-directory: templates
-      langchain-location: ../libs/langchain
    secrets: inherit
--- a/.gitignore
+++ b/.gitignore
@@ -167,8 +167,7 @@ docs/node_modules/
 docs/.docusaurus/
 docs/.cache-loader/
 docs/_dist
-docs/api_reference/api_reference.rst
-docs/api_reference/experimental_api_reference.rst
+docs/api_reference/*api_reference.rst
 docs/api_reference/_build
 docs/api_reference/*/
 !docs/api_reference/_static/
--- a/12
+++ b/12
@@ -1,6 +1,6 @@
-The MIT License
+MIT License

-Copyright (c) Harrison Chase
+Copyright (c) LangChain, Inc.

 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal
@@ -9,13 +9,13 @@ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
 copies of the Software, and to permit persons to whom the Software is
 furnished to do so, subject to the following conditions:

-The above copyright notice and this permission notice shall be included in
-all copies or substantial portions of the Software.
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.

 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
-OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
-THE SOFTWARE.
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
--- a/3
+++ b/3
@@ -41,9 +41,10 @@ spell_fix:
 # LINTING AND FORMATTING
 ######################

-lint:
+lint lint_package lint_tests:
 	poetry run ruff docs templates cookbook
 	poetry run ruff format docs templates cookbook --diff
+	poetry run ruff --select I docs templates cookbook

 format format_diff:
 	poetry run ruff format docs templates cookbook
--- a/README.md
+++ b/README.md
@@ -3,8 +3,7 @@
 ⚡ Building applications with LLMs through composability ⚡

 [![Release Notes](https://img.shields.io/github/release/langchain-ai/langchain)](https://github.com/langchain-ai/langchain/releases)
-[![CI](https://github.com/langchain-ai/langchain/actions/workflows/langchain_ci.yml/badge.svg)](https://github.com/langchain-ai/langchain/actions/workflows/langchain_ci.yml)
-[![Experimental CI](https://github.com/langchain-ai/langchain/actions/workflows/langchain_experimental_ci.yml/badge.svg)](https://github.com/langchain-ai/langchain/actions/workflows/langchain_experimental_ci.yml)
+[![CI](https://github.com/langchain-ai/langchain/actions/workflows/check_diffs.yml/badge.svg)](https://github.com/langchain-ai/langchain/actions/workflows/check_diffs.yml)
 [![Downloads](https://static.pepy.tech/badge/langchain/month)](https://pepy.tech/project/langchain)
 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
 [![Twitter](https://img.shields.io/twitter/url/https/twitter.com/langchainai.svg?style=social&label=Follow%20%40LangChainAI)](https://twitter.com/langchainai)
@@ -30,7 +29,7 @@ pip install langchain

 With conda:
 ```bash
-pip install langsmith && conda install langchain -c conda-forge
+conda install langchain -c conda-forge
 ```

 ## 🤔 What is LangChain?
@@ -45,7 +44,10 @@ This framework consists of several parts.
 - **[LangServe](https://github.com/langchain-ai/langserve)**: A library for deploying LangChain chains as a REST API.
 - **[LangSmith](https://smith.langchain.com)**: A developer platform that lets you debug, test, evaluate, and monitor chains built on any LLM framework and seamlessly integrates with LangChain.

-**This repo contains the `langchain` ([here](libs/langchain)), `langchain-experimental` ([here](libs/experimental)), and `langchain-cli` ([here](libs/cli)) Python packages, as well as [LangChain Templates](templates).**
+The LangChain libraries themselves are made up of several different packages.
+- **[`langchain-core`](libs/core)**: Base abstractions and LangChain Expression Language.
+- **[`langchain-community`](libs/community)**: Third party integrations.
+- **[`langchain`](libs/langchain)**: Chains, agents, and retrieval strategies that make up an application's cognitive architecture.

 ![LangChain Stack](docs/static/img/langchain_stack.png)

@@ -104,3 +106,7 @@ Please see [here](https://python.langchain.com) for full documentation, which in
 As an open-source project in a rapidly developing field, we are extremely open to contributions, whether it be in the form of a new feature, improved infrastructure, or better documentation.

 For detailed information on how to contribute, see [here](.github/CONTRIBUTING.md).
+
+## 🌟 Contributors
+
+[![langchain contributors](https://contrib.rocks/image?repo=langchain-ai/langchain&max=2000)](https://github.com/langchain-ai/langchain/graphs/contributors)
--- a/cookbook/LLaMA2_sql_chat.ipynb
+++ b/cookbook/LLaMA2_sql_chat.ipynb
@@ -164,8 +164,8 @@
    ")\n",
    "\n",
    "# Chain to query\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.schema.runnable import RunnablePassthrough\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnablePassthrough\n",
    "\n",
    "sql_response = (\n",
    "    RunnablePassthrough.assign(schema=get_schema)\n",
@@ -293,7 +293,7 @@
    "memory = ConversationBufferMemory(return_messages=True)\n",
    "\n",
    "# Chain to query with memory\n",
-    "from langchain.schema.runnable import RunnableLambda\n",
+    "from langchain_core.runnables import RunnableLambda\n",
    "\n",
    "sql_chain = (\n",
    "    RunnablePassthrough.assign(\n",
--- a/cookbook/Multi_modal_RAG.ipynb
+++ b/cookbook/Multi_modal_RAG.ipynb
@@ -200,7 +200,7 @@
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
    "\n",
    "\n",
    "# Generate summaries of text elements\n",
@@ -270,7 +270,7 @@
    "import base64\n",
    "import os\n",
    "\n",
-    "from langchain.schema.messages import HumanMessage\n",
+    "from langchain_core.messages import HumanMessage\n",
    "\n",
    "\n",
    "def encode_image(image_path):\n",
@@ -355,9 +355,9 @@
    "\n",
    "from langchain.embeddings import OpenAIEmbeddings\n",
    "from langchain.retrievers.multi_vector import MultiVectorRetriever\n",
-    "from langchain.schema.document import Document\n",
    "from langchain.storage import InMemoryStore\n",
    "from langchain.vectorstores import Chroma\n",
+    "from langchain_core.documents import Document\n",
    "\n",
    "\n",
    "def create_multi_vector_retriever(\n",
@@ -442,7 +442,7 @@
    "import re\n",
    "\n",
    "from IPython.display import HTML, display\n",
-    "from langchain.schema.runnable import RunnableLambda, RunnablePassthrough\n",
+    "from langchain_core.runnables import RunnableLambda, RunnablePassthrough\n",
    "from PIL import Image\n",
    "\n",
    "\n",
--- a/cookbook/Semi_Structured_RAG.ipynb
+++ b/cookbook/Semi_Structured_RAG.ipynb
@@ -237,7 +237,7 @@
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser"
+    "from langchain_core.output_parsers import StrOutputParser"
   ]
  },
  {
@@ -320,9 +320,9 @@
    "\n",
    "from langchain.embeddings import OpenAIEmbeddings\n",
    "from langchain.retrievers.multi_vector import MultiVectorRetriever\n",
-    "from langchain.schema.document import Document\n",
    "from langchain.storage import InMemoryStore\n",
    "from langchain.vectorstores import Chroma\n",
+    "from langchain_core.documents import Document\n",
    "\n",
    "# The vectorstore to use to index the child chunks\n",
    "vectorstore = Chroma(collection_name=\"summaries\", embedding_function=OpenAIEmbeddings())\n",
@@ -374,7 +374,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.schema.runnable import RunnablePassthrough\n",
+    "from langchain_core.runnables import RunnablePassthrough\n",
    "\n",
    "# Prompt template\n",
    "template = \"\"\"Answer the question based only on the following context, which can include text and tables:\n",
--- a/cookbook/Semi_structured_and_multi_modal_RAG.ipynb
+++ b/cookbook/Semi_structured_and_multi_modal_RAG.ipynb
@@ -213,7 +213,7 @@
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser"
+    "from langchain_core.output_parsers import StrOutputParser"
   ]
  },
  {
@@ -375,9 +375,9 @@
    "\n",
    "from langchain.embeddings import OpenAIEmbeddings\n",
    "from langchain.retrievers.multi_vector import MultiVectorRetriever\n",
-    "from langchain.schema.document import Document\n",
    "from langchain.storage import InMemoryStore\n",
    "from langchain.vectorstores import Chroma\n",
+    "from langchain_core.documents import Document\n",
    "\n",
    "# The vectorstore to use to index the child chunks\n",
    "vectorstore = Chroma(collection_name=\"summaries\", embedding_function=OpenAIEmbeddings())\n",
@@ -646,7 +646,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.schema.runnable import RunnablePassthrough\n",
+    "from langchain_core.runnables import RunnablePassthrough\n",
    "\n",
    "# Prompt template\n",
    "template = \"\"\"Answer the question based only on the following context, which can include text and tables:\n",
--- a/cookbook/Semi_structured_multi_modal_RAG_LLaMA2.ipynb
+++ b/cookbook/Semi_structured_multi_modal_RAG_LLaMA2.ipynb
@@ -211,7 +211,7 @@
   "source": [
    "from langchain.chat_models import ChatOllama\n",
    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser"
+    "from langchain_core.output_parsers import StrOutputParser"
   ]
  },
  {
@@ -378,9 +378,9 @@
    "\n",
    "from langchain.embeddings import GPT4AllEmbeddings\n",
    "from langchain.retrievers.multi_vector import MultiVectorRetriever\n",
-    "from langchain.schema.document import Document\n",
    "from langchain.storage import InMemoryStore\n",
    "from langchain.vectorstores import Chroma\n",
+    "from langchain_core.documents import Document\n",
    "\n",
    "# The vectorstore to use to index the child chunks\n",
    "vectorstore = Chroma(\n",
@@ -532,7 +532,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.schema.runnable import RunnablePassthrough\n",
+    "from langchain_core.runnables import RunnablePassthrough\n",
    "\n",
    "# Prompt template\n",
    "template = \"\"\"Answer the question based only on the following context, which can include text and tables:\n",
--- a/cookbook/advanced_rag_eval.ipynb
+++ b/cookbook/advanced_rag_eval.ipynb
@@ -162,7 +162,7 @@
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
    "\n",
    "# Prompt\n",
    "prompt_text = \"\"\"You are an assistant tasked with summarizing tables and text for retrieval. \\\n",
@@ -202,7 +202,7 @@
    "import os\n",
    "from io import BytesIO\n",
    "\n",
-    "from langchain.schema.messages import HumanMessage\n",
+    "from langchain_core.messages import HumanMessage\n",
    "from PIL import Image\n",
    "\n",
    "\n",
@@ -273,8 +273,8 @@
    "from base64 import b64decode\n",
    "\n",
    "from langchain.retrievers.multi_vector import MultiVectorRetriever\n",
-    "from langchain.schema.document import Document\n",
    "from langchain.storage import InMemoryStore\n",
+    "from langchain_core.documents import Document\n",
    "\n",
    "\n",
    "def create_multi_vector_retriever(\n",
@@ -475,7 +475,7 @@
   "source": [
    "from operator import itemgetter\n",
    "\n",
-    "from langchain.schema.runnable import RunnablePassthrough\n",
+    "from langchain_core.runnables import RunnablePassthrough\n",
    "\n",
    "# Prompt\n",
    "template = \"\"\"Answer the question based only on the following context, which can include text and tables:\n",
@@ -521,7 +521,7 @@
    "import re\n",
    "\n",
    "from langchain.schema import Document\n",
-    "from langchain.schema.runnable import RunnableLambda\n",
+    "from langchain_core.runnables import RunnableLambda\n",
    "\n",
    "\n",
    "def looks_like_base64(sb):\n",
--- a/cookbook/code-analysis-deeplake.ipynb
+++ b/cookbook/code-analysis-deeplake.ipynb
@@ -648,7 +648,7 @@
    {
     "data": {
      "text/plain": [
-       "OpenAIEmbeddings(client=<class 'openai.api_resources.embedding.Embedding'>, model='text-embedding-ada-002', deployment='text-embedding-ada-002', openai_api_version='', openai_api_base='', openai_api_type='', openai_proxy='', embedding_ctx_length=8191, openai_api_key='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', openai_organization='', allowed_special=set(), disallowed_special='all', chunk_size=1000, max_retries=6, request_timeout=None, headers=None, tiktoken_model_name=None, show_progress_bar=False, model_kwargs={})"
+       "OpenAIEmbeddings(client=<class 'openai.api_resources.embedding.Embedding'>, model='text-embedding-ada-002', deployment='text-embedding-ada-002', openai_api_version='', openai_api_base='', openai_api_type='', openai_proxy='', embedding_ctx_length=8191, openai_api_key='', openai_organization='', allowed_special=set(), disallowed_special='all', chunk_size=1000, max_retries=6, request_timeout=None, headers=None, tiktoken_model_name=None, show_progress_bar=False, model_kwargs={})"
      ]
     },
     "execution_count": 13,
--- a/cookbook/docugami_xml_kg_rag.ipynb
+++ b/cookbook/docugami_xml_kg_rag.ipynb
@@ -34,12 +34,12 @@
  },
  {
   "cell_type": "code",
-   "execution_count": null,
+   "execution_count": 16,
   "id": "5740fc70-c513-4ff4-9d72-cfc098f85fef",
   "metadata": {},
   "outputs": [],
   "source": [
-    "! pip install langchain docugami==0.0.4 dgml-utils==0.2.0 pydantic langchainhub chromadb --upgrade --quiet"
+    "! pip install langchain docugami==0.0.8 dgml-utils==0.3.0 pydantic langchainhub chromadb hnswlib --upgrade --quiet"
   ]
  },
  {
@@ -52,6 +52,7 @@
  },
  {
   "cell_type": "markdown",
+   "id": "c6fb4903-f845-4907-ae14-df305891b0ff",
   "metadata": {},
   "source": [
    "## Data Loading\n",
@@ -62,7 +63,7 @@
    "1. Create an access token via the Developer Playground for your workspace. [Detailed instructions](https://help.docugami.com/home/docugami-api).\n",
    "1. Add your documents (PDF \\[scanned or digital\\], DOC or DOCX) to Docugami for processing. There are two ways to do this:\n",
    "    1. Use the simple Docugami web experience. [Detailed instructions](https://help.docugami.com/home/adding-documents).\n",
-    "    1. Use the [Docugami API](https://api-docs.docugami.com), specifically the [documents](https://api-docs.docugami.com/#tag/documents/operation/upload-document) endpoint. Code samples are available for [python](../upload_file/) and [JavaScript](../../js/upload-file/) or you can use the [docugami](https://pypi.org/project/docugami/) python library.\n",
+    "    1. Use the [Docugami API](https://api-docs.docugami.com), specifically the [documents](https://api-docs.docugami.com/#tag/documents/operation/upload-document) endpoint. You can also use the [docugami python library](https://pypi.org/project/docugami/) as a convenient wrapper.\n",
    "\n",
    "Once your documents are in Docugami, they are processed and organized into sets of similar documents, e.g. NDAs, Lease Agreements, and Service Agreements. Docugami is not limited to any particular types of documents, and the clusters created depend on your particular documents. You can [change the docset assignments](https://help.docugami.com/home/working-with-the-doc-sets-view) later if you wish. You can monitor file status in the simple Docugami webapp, or use a [webhook](https://api-docs.docugami.com/#tag/webhooks) to be informed when your documents are done processing.\n",
    "\n",
@@ -75,115 +76,30 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 45,
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "import os\n",
-    "from pathlib import Path\n",
-    "from pprint import pprint\n",
-    "import requests\n",
-    "import tempfile\n",
-    "from time import sleep\n",
-    "from typing import Dict, List\n",
-    "\n",
-    "from docugami import Docugami\n",
-    "from docugami.types import Document as DocugamiDocument\n",
-    "\n",
-    "api_key = os.environ.get(\"DOCUGAMI_API_KEY\")\n",
-    "if not api_key:\n",
-    "    raise Exception(\"Please set Docugami API key environment variable\")\n",
-    "\n",
-    "client = Docugami()\n",
-    "\n",
-    "\n",
-    "def upload_files(local_paths: List[str], docset_name: str) -> List[DocugamiDocument]:\n",
-    "    docset_list_response = client.docsets.list(name=docset_name)\n",
-    "    if docset_list_response and docset_list_response.docsets:\n",
-    "        # Docset already exists with this name\n",
-    "        docset_id = docset_list_response.docsets[0]\n",
-    "    else:\n",
-    "        dg_docset = client.docsets.create(name=docset_name)\n",
-    "        docset_id = dg_docset.id\n",
-    "\n",
-    "    document_list_response = client.documents.list(limit=int(1e5))\n",
-    "    dg_docs: List[DocugamiDocument] = []\n",
-    "    if document_list_response and document_list_response.documents:\n",
-    "        new_names = [Path(f).name for f in local_paths]\n",
-    "\n",
-    "        dg_docs = [\n",
-    "            d\n",
-    "            for d in document_list_response.documents\n",
-    "            if Path(d.name).name in new_names\n",
-    "        ]\n",
-    "        existing_names = [Path(d.name).name for d in dg_docs]\n",
-    "\n",
-    "        # Upload any files not previously uploaded\n",
-    "        for f in local_paths:\n",
-    "            if Path(f).name not in existing_names:\n",
-    "                dg_docs.append(\n",
-    "                    client.documents.contents.upload(\n",
-    "                        file=Path(f).absolute(),\n",
-    "                        docset_id=docset_id,\n",
-    "                    )\n",
-    "                )\n",
-    "    return dg_docs\n",
-    "\n",
-    "\n",
-    "def wait_for_xml(dg_docs: List[DocugamiDocument]) -> dict[str, str]:\n",
-    "    dgml_paths: dict[str, str] = {}\n",
-    "    while len(dgml_paths) < len(dg_docs):\n",
-    "        for doc in dg_docs:\n",
-    "            doc = client.documents.retrieve(doc.id)  # update with latest\n",
-    "            current_status = doc.status\n",
-    "            if current_status == \"Error\":\n",
-    "                raise Exception(\n",
-    "                    \"Document could not be processed, please confirm it is not a zero length, corrupt or password protected file\"\n",
-    "                )\n",
-    "            elif current_status == \"Ready\":\n",
-    "                dgml_url = doc.docset.url + f\"/documents/{doc.id}/dgml\"\n",
-    "                headers = {\"Authorization\": f\"Bearer {api_key}\"}\n",
-    "                dgml_response = requests.get(dgml_url, headers=headers)\n",
-    "                if not dgml_response.ok:\n",
-    "                    raise Exception(\n",
-    "                        f\"Could not download DGML artifact {dgml_url}: {dgml_response.status_code}\"\n",
-    "                    )\n",
-    "                dgml_contents = dgml_response.text\n",
-    "                with tempfile.NamedTemporaryFile(delete=False, mode=\"w\") as temp_file:\n",
-    "                    temp_file.write(dgml_contents)\n",
-    "                    temp_file_path = temp_file.name\n",
-    "                    dgml_paths[doc.name] = temp_file_path\n",
-    "\n",
-    "        print(f\"{len(dgml_paths)} docs done processing out of {len(dg_docs)}...\")\n",
-    "\n",
-    "        if len(dgml_paths) == len(dg_docs):\n",
-    "            # done\n",
-    "            return dgml_paths\n",
-    "        else:\n",
-    "            sleep(30)  # try again in a bit"
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 46,
+   "execution_count": 3,
+   "id": "ce0b2b21-7623-46e7-ae2c-3a9f67e8b9b9",
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      "6 docs done processing out of 6...\n",
-      "{'Report_CEN23LA277_192541.pdf': '/var/folders/0h/6cchx4k528bdj8cfcsdm0dqr0000gn/T/tmpel3o0rpg',\n",
-      " 'Report_CEN23LA338_192753.pdf': '/var/folders/0h/6cchx4k528bdj8cfcsdm0dqr0000gn/T/tmpgugb9ut1',\n",
-      " 'Report_CEN23LA363_192876.pdf': '/var/folders/0h/6cchx4k528bdj8cfcsdm0dqr0000gn/T/tmp3_gf2sky',\n",
-      " 'Report_CEN23LA394_192995.pdf': '/var/folders/0h/6cchx4k528bdj8cfcsdm0dqr0000gn/T/tmpwmfgoxkl',\n",
-      " 'Report_ERA23LA114_106615.pdf': '/var/folders/0h/6cchx4k528bdj8cfcsdm0dqr0000gn/T/tmptibrz2yu',\n",
-      " 'Report_WPR23LA254_192532.pdf': '/var/folders/0h/6cchx4k528bdj8cfcsdm0dqr0000gn/T/tmpvazrbbsi'}\n"
+      "{'Report_CEN23LA277_192541.pdf': '/tmp/tmpa0c77x46',\n",
+      " 'Report_CEN23LA338_192753.pdf': '/tmp/tmpaftfld2w',\n",
+      " 'Report_CEN23LA363_192876.pdf': '/tmp/tmpn7gp6be2',\n",
+      " 'Report_CEN23LA394_192995.pdf': '/tmp/tmp9udymprf',\n",
+      " 'Report_ERA23LA114_106615.pdf': '/tmp/tmpxdjbh4r_',\n",
+      " 'Report_WPR23LA254_192532.pdf': '/tmp/tmpz6h75a0h'}\n"
     ]
    }
   ],
   "source": [
-    "#### START DOCSET INFO (please change)\n",
+    "from pprint import pprint\n",
+    "\n",
+    "from docugami import Docugami\n",
+    "from docugami.lib.upload import upload_to_named_docset, wait_for_dgml\n",
+    "\n",
+    "#### START DOCSET INFO (please change this values as needed)\n",
    "DOCSET_NAME = \"NTSB Aviation Incident Reports\"\n",
    "FILE_PATHS = [\n",
    "    \"/Users/tjaffri/ntsb/Report_CEN23LA277_192541.pdf\",\n",
@@ -194,19 +110,22 @@
    "    \"/Users/tjaffri/ntsb/Report_WPR23LA254_192532.pdf\",\n",
    "]\n",
    "\n",
-    "assert (\n",
-    "    len(FILE_PATHS) > 5\n",
-    ")  # Please specify ~6 (or more!) similar files to process together as a document set\n",
+    "# Note: Please specify ~6 (or more!) similar files to process together as a document set\n",
+    "#       This is currently a requirement for Docugami to automatically detect motifs\n",
+    "#       across the document set to generate a semantic XML Knowledge Graph.\n",
+    "assert len(FILE_PATHS) > 5, \"Please provide at least 6 files\"\n",
    "#### END DOCSET INFO\n",
    "\n",
-    "dg_docs = upload_files(FILE_PATHS, DOCSET_NAME)\n",
-    "dgml_paths = wait_for_xml(dg_docs)\n",
+    "dg_client = Docugami()\n",
+    "dg_docs = upload_to_named_docset(dg_client, FILE_PATHS, DOCSET_NAME)\n",
+    "dgml_paths = wait_for_dgml(dg_client, dg_docs)\n",
    "\n",
    "pprint(dgml_paths)"
   ]
  },
  {
   "cell_type": "markdown",
+   "id": "01f035e5-c3f8-4d23-9d1b-8d2babdea8e9",
   "metadata": {},
   "source": [
    "If you are on the free Docugami tier, your files should be done in ~15 minutes or less depending on the number of pages uploaded and available resources (please contact Docugami for paid plans for faster processing). You can re-run the code above without reprocessing your files to continue waiting if your notebook is not continuously running (it does not re-upload)."
@@ -224,7 +143,8 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 47,
+   "execution_count": 4,
+   "id": "05fcdd57-090f-44bf-a1fb-2c3609c80e34",
   "metadata": {},
   "outputs": [
    {
@@ -232,13 +152,13 @@
     "output_type": "stream",
     "text": [
      "found 30 chunks, here are the first few\n",
-      "Aviation Investigation Final Report\n",
-      "<table><tbody><tr><td>Location: </td> <td><Location><TownName>Elbert</TownName>, <USState>Colorado </USState></Location></td> <td>Accident Number: </td> <td><AccidentNumber>CEN23LA277 </AccidentNumber></td></tr> <tr><td><LocationDateTime>Date &amp; Time: </LocationDateTime></td> <td><DateTime><EventDate>June 26, 2023</EventDate>, <EventTime>11:00 Local </EventTime></DateTime></td> <td><DateTimeAccidentNumber>Registration: </DateTimeAccidentNumber></td> <td><Registration>N23161 </Registration></td></tr> <tr><td><LocationAircraft>Aircraft: </LocationAircraft></td> <td><Aircraft>Piper <AircraftType>J3C-50 </AircraftType></Aircraft></td> <td><AircraftAccidentNumber>Aircraft Damage: </AircraftAccidentNumber></td> <td><AircraftDamage>Substantial </AircraftDamage></td></tr> <tr><td><LocationDefiningEvent>Defining Event: </LocationDefiningEvent></td> <td><DefiningEvent>Nose over/nose down </DefiningEvent></td> <td><DefiningEventAccidentNumber>Injuries: </DefiningEventAccidentNumber></td> <td><Injuries><Minor>1 </Minor>Minor </Injuries></td></tr> <tr><td><LocationFlightConductedUnder>Flight Conducted Under: </LocationFlightConductedUnder></td> <td><Part91-cell>Part <RegulationPart>91</RegulationPart>: General aviation - Personal </Part91-cell></td><td/><td><FlightConductedUnderCEN23LA277/></td></tr></tbody></table>\n",
+      "<AviationInvestigationFinalReport-section>Aviation </AviationInvestigationFinalReport-section>Investigation Final Report\n",
+      "<table><tbody><tr><td>Location: </td> <td><Location><TownName>Elbert</TownName>, <USState>Colorado </USState></Location></td> <td>Accident Number: </td> <td><AccidentNumber>CEN23LA277 </AccidentNumber></td></tr> <tr><td><LocationDateTime>Date &amp; Time: </LocationDateTime></td> <td><DateTime><EventDate>June 26, 2023</EventDate>, <EventTime>11:00 Local </EventTime></DateTime></td> <td><DateTimeAccidentNumber>Registration: </DateTimeAccidentNumber></td> <td><Registration>N23161 </Registration></td></tr> <tr><td><LocationAircraft>Aircraft: </LocationAircraft></td> <td><AircraftType>Piper <AircraftType>J3C-50 </AircraftType></AircraftType></td> <td><AircraftAccidentNumber>Aircraft Damage: </AircraftAccidentNumber></td> <td><AircraftDamage>Substantial </AircraftDamage></td></tr> <tr><td><LocationDefiningEvent>Defining Event: </LocationDefiningEvent></td> <td><DefiningEvent>Nose over/nose down </DefiningEvent></td> <td><DefiningEventAccidentNumber>Injuries: </DefiningEventAccidentNumber></td> <td><Injuries><Minor>1 </Minor>Minor </Injuries></td></tr> <tr><td><LocationFlightConductedUnder>Flight Conducted Under: </LocationFlightConductedUnder></td> <td><FlightConductedUnder><Part91-cell>Part <RegulationPart>91</RegulationPart>: General aviation - Personal </Part91-cell></FlightConductedUnder></td><td/><td><FlightConductedUnderCEN23LA277/></td></tr></tbody></table>\n",
      "Analysis\n",
-      "<TakeoffAccident> The pilot reported that, as the tail lifted during takeoff, the airplane veered left. He attempted to correct with full right rudder and full brakes. However, the airplane subsequently nosed over resulting in substantial damage to the fuselage, lift struts, rudder, and vertical stabilizer. </TakeoffAccident>\n",
+      "<TakeoffAccident> <Analysis>The pilot reported that, as the tail lifted during takeoff, the airplane veered left. He attempted to correct with full right rudder and full brakes. However, the airplane subsequently nosed over resulting in substantial damage to the fuselage, lift struts, rudder, and vertical stabilizer. </Analysis></TakeoffAccident>\n",
      "<AircraftCondition> The pilot reported that there were no preaccident mechanical malfunctions or anomalies with the airplane that would have precluded normal operation. </AircraftCondition>\n",
      "<WindConditions> At about the time of the accident, wind was from <WindDirection>180</WindDirection>° at <WindConditions>5 </WindConditions>knots. The pilot decided to depart on runway <Runway>35 </Runway>due to the prevailing airport traffic. He stated that departing with “more favorable wind conditions” may have prevented the accident. </WindConditions>\n",
-      "Probable Cause and Findings\n",
+      "<ProbableCauseAndFindings-section>Probable Cause and Findings </ProbableCauseAndFindings-section>\n",
      "<ProbableCause> The <ProbableCause>National Transportation Safety Board </ProbableCause>determines the probable cause(s) of this accident to be: </ProbableCause>\n",
      "<AccidentCause> The pilot's loss of directional control during takeoff and subsequent excessive use of brakes which resulted in a nose-over. Contributing to the accident was his decision to takeoff downwind. </AccidentCause>\n",
      "Page 1 of <PageNumber>5 </PageNumber>\n"
@@ -246,6 +166,8 @@
    }
   ],
   "source": [
+    "from pathlib import Path\n",
+    "\n",
    "from dgml_utils.segmentation import get_chunks_str\n",
    "\n",
    "# Here we just read the first file, you can do the same for others\n",
@@ -268,6 +190,7 @@
  },
  {
   "cell_type": "markdown",
+   "id": "bfc1f2c9-e6d4-4d98-a799-6bc30bc61661",
   "metadata": {},
   "source": [
    "The file processed by Docugami in the example above was [this one](https://data.ntsb.gov/carol-repgen/api/Aviation/ReportMain/GenerateNewestReport/192541/pdf) from the NTSB and you can look at the PDF side by side to compare the XML chunks above. \n",
@@ -277,7 +200,8 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 48,
+   "execution_count": 5,
+   "id": "8a4b49e0-de78-4790-a930-ad7cf324697a",
   "metadata": {},
   "outputs": [
    {
@@ -326,6 +250,7 @@
  },
  {
   "cell_type": "markdown",
+   "id": "1cfc06bc-67d2-46dd-b04d-95efa3619d0a",
   "metadata": {},
   "source": [
    "## Docugami XML Deep Dive: Jane Doe NDA Example\n",
@@ -335,7 +260,8 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 109,
+   "execution_count": 6,
+   "id": "7b697d30-1e94-47f0-87e8-f81d4b180da2",
   "metadata": {},
   "outputs": [
    {
@@ -344,12 +270,14 @@
       "39"
      ]
     },
-     "execution_count": 109,
+     "execution_count": 6,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
+    "import requests\n",
+    "\n",
    "# Download XML from known URL\n",
    "dgml = requests.get(\n",
    "    \"https://raw.githubusercontent.com/docugami/dgml-utils/main/python/tests/test_data/article/Jane%20Doe.xml\"\n",
@@ -360,7 +288,8 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 98,
+   "execution_count": 7,
+   "id": "14714576-6e1d-499b-bcc8-39140bb2fd78",
   "metadata": {},
   "outputs": [
    {
@@ -369,7 +298,7 @@
       "{'h1': 9, 'div': 12, 'p': 3, 'lim h1': 9, 'lim': 1, 'table': 1, 'h1 div': 4}"
      ]
     },
-     "execution_count": 98,
+     "execution_count": 7,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -390,7 +319,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 99,
+   "execution_count": 8,
   "id": "5462f29e-fd59-4e0e-9493-ea3b560e523e",
   "metadata": {},
   "outputs": [
@@ -415,6 +344,7 @@
  },
  {
   "cell_type": "markdown",
+   "id": "dc09ba64-4973-4471-9501-54294c1143fc",
   "metadata": {},
   "source": [
    "The Docugami XML contains extremely detailed semantics and visual bounding boxes for all elements. The `dgml-utils` library parses text and non-text elements into formats appropriate to pass into LLMs (chunked text with XML semantic labels)"
@@ -422,7 +352,8 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 100,
+   "execution_count": 9,
+   "id": "2b4ece00-2e43-4254-adc9-66dbb79139a6",
   "metadata": {},
   "outputs": [
    {
@@ -459,7 +390,8 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 101,
+   "execution_count": 10,
+   "id": "08350119-aa22-4ec1-8f65-b1316a0d4123",
   "metadata": {},
   "outputs": [
    {
@@ -476,6 +408,7 @@
  },
  {
   "cell_type": "markdown",
+   "id": "dca87b46-c0c2-4973-94ec-689c18075653",
   "metadata": {},
   "source": [
    "The XML markup contains structural as well as semantic tags, which provide additional semantics to the LLM for improved retrieval and generation.\n",
@@ -485,7 +418,8 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 112,
+   "execution_count": 11,
+   "id": "bcac8294-c54a-4b6e-af9d-3911a69620b2",
   "metadata": {},
   "outputs": [
    {
@@ -531,7 +465,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 113,
+   "execution_count": 12,
   "id": "8e275736-3408-4d7a-990e-4362c88e81f8",
   "metadata": {},
   "outputs": [],
@@ -539,10 +473,10 @@
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.prompts import (\n",
    "    ChatPromptTemplate,\n",
-    "    SystemMessagePromptTemplate,\n",
    "    HumanMessagePromptTemplate,\n",
+    "    SystemMessagePromptTemplate,\n",
    ")\n",
-    "from langchain.schema.output_parser import StrOutputParser"
+    "from langchain_core.output_parsers import StrOutputParser"
   ]
  },
  {
@@ -562,7 +496,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 114,
+   "execution_count": 13,
   "id": "1b12536a-1303-41ad-9948-4eb5a5f32614",
   "metadata": {},
   "outputs": [],
@@ -579,7 +513,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 115,
+   "execution_count": 14,
   "id": "8d8b567c-b442-4bf0-b639-04bd89effc62",
   "metadata": {},
   "outputs": [],
@@ -604,17 +538,18 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 116,
+   "execution_count": 17,
   "id": "346c3a02-8fea-4f75-a69e-fc9542b99dbc",
   "metadata": {},
   "outputs": [],
   "source": [
    "import uuid\n",
-    "from langchain.vectorstores.chroma import Chroma\n",
-    "from langchain.storage import InMemoryStore\n",
-    "from langchain.schema.document import Document\n",
+    "\n",
    "from langchain.embeddings import OpenAIEmbeddings\n",
    "from langchain.retrievers.multi_vector import MultiVectorRetriever\n",
+    "from langchain.storage import InMemoryStore\n",
+    "from langchain.vectorstores.chroma import Chroma\n",
+    "from langchain_core.documents import Document\n",
    "\n",
    "\n",
    "def build_retriever(text_elements, tables, table_summaries):\n",
@@ -665,12 +600,12 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 117,
+   "execution_count": 18,
   "id": "f2489de4-51e3-48b4-bbcd-ed9171deadf3",
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.schema.runnable import RunnablePassthrough\n",
+    "from langchain_core.runnables import RunnablePassthrough\n",
    "\n",
    "system_prompt = SystemMessagePromptTemplate.from_template(\n",
    "    \"You are a helpful assistant that answers questions based on provided context. Your provided context can include text or tables, \"\n",
@@ -709,9 +644,17 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 120,
+   "execution_count": 19,
+   "id": "636e992f-823b-496b-a082-8b4fcd479de5",
   "metadata": {},
   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "Number of requested results 4 is greater than number of elements in index 1, updating n_results = 1\n"
+     ]
+    },
    {
     "name": "stdout",
     "output_type": "stream",
@@ -743,6 +686,7 @@
  },
  {
   "cell_type": "markdown",
+   "id": "86cad5db-81fe-4ae6-a20e-550b85fcbe96",
   "metadata": {},
   "source": [
    "# RAG on Llama2 paper\n",
@@ -752,7 +696,8 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 121,
+   "execution_count": 20,
+   "id": "0e4a2f43-dd48-4ae3-8e27-7e87d169965f",
   "metadata": {},
   "outputs": [
    {
@@ -761,7 +706,7 @@
       "669"
      ]
     },
-     "execution_count": 121,
+     "execution_count": 20,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -776,7 +721,8 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 124,
+   "execution_count": 21,
+   "id": "56b78fb3-603d-4343-ae72-be54a3c5dd72",
   "metadata": {},
   "outputs": [
    {
@@ -800,7 +746,8 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 125,
+   "execution_count": 22,
+   "id": "d3cc5ba9-8553-4eda-a5d1-b799751186af",
   "metadata": {},
   "outputs": [],
   "source": [
@@ -811,7 +758,8 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 126,
+   "execution_count": 23,
+   "id": "d7c73faf-74cb-400d-8059-b69e2493de38",
   "metadata": {},
   "outputs": [],
   "source": [
@@ -822,7 +770,8 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 127,
+   "execution_count": 24,
+   "id": "4c553722-be42-42ce-83b8-76a17f323f1c",
   "metadata": {},
   "outputs": [],
   "source": [
@@ -831,7 +780,8 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 128,
+   "execution_count": 25,
+   "id": "65dce40b-f1c3-494a-949e-69a9c9544ddb",
   "metadata": {},
   "outputs": [
    {
@@ -840,7 +790,7 @@
       "'The number of training tokens for LLaMA2 is 2.0T for all parameter sizes.'"
      ]
     },
-     "execution_count": 128,
+     "execution_count": 25,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -851,6 +801,7 @@
  },
  {
   "cell_type": "markdown",
+   "id": "59877edf-9a02-45db-95cb-b7f4234abfa3",
   "metadata": {},
   "source": [
    "We can check the [trace](https://smith.langchain.com/public/5de100c3-bb40-4234-bf02-64bc708686a1/r) to see what chunks were retrieved.\n",
@@ -934,13 +885,51 @@
    "        </tr>\n",
    "    </tbody>\n",
    "</table>\n",
-    "``"
+    "```"
   ]
  },
  {
   "cell_type": "markdown",
+   "id": "867f8e11-384c-4aa1-8b3e-c59fb8d5fd7d",
   "metadata": {},
-   "source": []
+   "source": [
+    "Finally, you can ask other questions that rely on more subtle parsing of the table, e.g.:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 26,
+   "id": "d38f1459-7d2b-40df-8dcd-e747f85eb144",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "'The learning rate for LLaMA2 was 3.0 × 10−4 for the 7B and 13B models, and 1.5 × 10−4 for the 34B and 70B models.'"
+      ]
+     },
+     "execution_count": 26,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "llama2_chain.invoke(\"What was the learning rate for LLaMA2?\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "94826165",
+   "metadata": {},
+   "source": [
+    "## Docugami KG-RAG Template\n",
+    "\n",
+    "Docugami also provides a [langchain template](https://github.com/docugami/langchain-template-docugami-kg-rag) that you can integrate into your langchain projects.\n",
+    "\n",
+    "Here's a walkthrough of how you can do this.\n",
+    "\n",
+    "[![Docugami KG-RAG Walkthrough](https://img.youtube.com/vi/xOHOmL1NFMg/0.jpg)](https://www.youtube.com/watch?v=xOHOmL1NFMg)\n"
+   ]
  }
 ],
 "metadata": {
--- a/cookbook/extraction_openai_tools.ipynb
+++ b/cookbook/extraction_openai_tools.ipynb
@@ -23,7 +23,7 @@
    "\n",
    "from langchain.chains.openai_tools import create_extraction_chain_pydantic\n",
    "from langchain.chat_models import ChatOpenAI\n",
-    "from langchain.pydantic_v1 import BaseModel"
+    "from langchain_core.pydantic_v1 import BaseModel"
   ]
  },
  {
@@ -151,11 +151,11 @@
    "\n",
    "from langchain.output_parsers.openai_tools import PydanticToolsParser\n",
    "from langchain.utils.openai_functions import convert_pydantic_to_openai_tool\n",
-    "from langchain.schema.runnable import Runnable\n",
-    "from langchain.pydantic_v1 import BaseModel\n",
+    "from langchain_core.runnables import Runnable\n",
+    "from langchain_core.pydantic_v1 import BaseModel\n",
    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.schema.messages import SystemMessage\n",
-    "from langchain.schema.language_model import BaseLanguageModel\n",
+    "from langchain_core.messages import SystemMessage\n",
+    "from langchain_core.language_models import BaseLanguageModel\n",
    "\n",
    "_EXTRACTION_TEMPLATE = \"\"\"Extract and save the relevant entities mentioned \\\n",
    "in the following passage together with their properties.\n",
--- a/cookbook/llm_bash.ipynb
+++ b/cookbook/llm_bash.ipynb
@@ -69,8 +69,8 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.chains.llm_bash.prompt import BashOutputParser\n",
    "from langchain.prompts.prompt import PromptTemplate\n",
+    "from langchain_experimental.llm_bash.prompt import BashOutputParser\n",
    "\n",
    "_PROMPT_TEMPLATE = \"\"\"If someone asks you to perform a task, your job is to come up with a series of bash commands that will perform the task. There is no need to put \"#!/bin/bash\" in your answer. Make sure to reason step by step, using this format:\n",
    "Question: \"copy the files in the directory named 'target' into a new directory at the same level as target called 'myNewDirectory'\"\n",
--- a/cookbook/multi_modal_QA.ipynb
+++ b/cookbook/multi_modal_QA.ipynb
@@ -92,7 +92,7 @@
   "outputs": [],
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
-    "from langchain.schema.messages import HumanMessage, SystemMessage"
+    "from langchain_core.messages import HumanMessage, SystemMessage"
   ]
  },
  {
--- a/cookbook/multi_modal_RAG_chroma.ipynb
+++ b/cookbook/multi_modal_RAG_chroma.ipynb
@@ -42,7 +42,7 @@
    "* We will use Open Clip multi-modal embeddings.\n",
    "* We will use [Chroma](https://www.trychroma.com/) with support for multi-modal.\n",
    "\n",
-    "A seperate cookbook highlights `Options 2 and 3` [here](https://github.com/langchain-ai/langchain/blob/master/cookbook/Multi_modal_RAG.ipynb).\n",
+    "A separate cookbook highlights `Options 2 and 3` [here](https://github.com/langchain-ai/langchain/blob/master/cookbook/Multi_modal_RAG.ipynb).\n",
    "\n",
    "![chroma_multimodal.png](attachment:1920fda3-1808-407c-9820-f518c9c6f566.png)\n",
    "\n",
@@ -316,9 +316,9 @@
    "from operator import itemgetter\n",
    "\n",
    "from langchain.chat_models import ChatOpenAI\n",
-    "from langchain.schema.messages import HumanMessage, SystemMessage\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.schema.runnable import RunnableLambda, RunnablePassthrough\n",
+    "from langchain_core.messages import HumanMessage, SystemMessage\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnableLambda, RunnablePassthrough\n",
    "\n",
    "\n",
    "def prompt_func(data_dict):\n",
--- a/cookbook/multi_modal_output_agent.ipynb
+++ b/cookbook/multi_modal_output_agent.ipynb
@@ -31,7 +31,7 @@
   "source": [
    "import re\n",
    "\n",
-    "from IPython.display import Image\n",
+    "from IPython.display import Image, display\n",
    "from steamship import Block, Steamship"
   ]
  },
@@ -180,7 +180,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.3"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/cookbook/openai_v1_cookbook.ipynb
+++ b/cookbook/openai_v1_cookbook.ipynb
@@ -29,7 +29,7 @@
   "outputs": [],
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
-    "from langchain.schema.messages import HumanMessage, SystemMessage"
+    "from langchain_core.messages import HumanMessage, SystemMessage"
   ]
  },
  {
@@ -252,7 +252,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.schema.agent import AgentFinish\n",
+    "from langchain_core.agents import AgentFinish\n",
    "\n",
    "\n",
    "def execute_agent(agent, tools, input):\n",
@@ -457,8 +457,8 @@
    "\n",
    "from langchain.output_parsers.openai_tools import PydanticToolsParser\n",
    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.pydantic_v1 import BaseModel, Field\n",
    "from langchain.utils.openai_functions import convert_pydantic_to_openai_tool\n",
+    "from langchain_core.pydantic_v1 import BaseModel, Field\n",
    "\n",
    "\n",
    "class GetCurrentWeather(BaseModel):\n",
--- a/cookbook/plan_and_execute_agent.ipynb
+++ b/cookbook/plan_and_execute_agent.ipynb
@@ -29,11 +29,11 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.agents.tools import Tool\n",
    "from langchain.chains import LLMMathChain\n",
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.llms import OpenAI\n",
    "from langchain.utilities import DuckDuckGoSearchAPIWrapper\n",
+    "from langchain_core.tools import Tool\n",
    "from langchain_experimental.plan_and_execute import (\n",
    "    PlanAndExecute,\n",
    "    load_agent_executor,\n",
--- a/cookbook/qianfan_baidu_elasticesearch_RAG.ipynb
+++ b/cookbook/qianfan_baidu_elasticesearch_RAG.ipynb
@@ -37,7 +37,8 @@
   "source": [
    "#!pip install qianfan\n",
    "#!pip install bce-python-sdk\n",
-    "#!pip install elasticsearch == 7.11.0"
+    "#!pip install elasticsearch == 7.11.0\n",
+    "#!pip install sentence-transformers"
   ]
  },
  {
@@ -54,8 +55,10 @@
   "metadata": {},
   "outputs": [],
   "source": [
+    "import sentence_transformers\n",
    "from baidubce.auth.bce_credentials import BceCredentials\n",
    "from baidubce.bce_client_configuration import BceClientConfiguration\n",
+    "from langchain.chains.retrieval_qa import RetrievalQA\n",
    "from langchain.document_loaders.baiducloud_bos_directory import BaiduBOSDirectoryLoader\n",
    "from langchain.embeddings.huggingface import HuggingFaceEmbeddings\n",
    "from langchain.llms.baidu_qianfan_endpoint import QianfanLLMEndpoint\n",
@@ -161,15 +164,22 @@
 ],
 "metadata": {
  "kernelspec": {
-   "display_name": "Python 3",
+   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
   "name": "python",
-   "version": "3.9.17"
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.10.12"
  },
-  "orig_nbformat": 4,
  "vscode": {
   "interpreter": {
    "hash": "aee8b7b246df8f9039afb4144a1f6fd8d2ca17a180786b69acc140d282b71a49"
@@ -177,5 +187,5 @@
  }
 },
 "nbformat": 4,
- "nbformat_minor": 2
+ "nbformat_minor": 4
 }
--- a/cookbook/rag_fusion.ipynb
+++ b/cookbook/rag_fusion.ipynb
@@ -87,7 +87,7 @@
   "outputs": [],
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
-    "from langchain.schema.output_parser import StrOutputParser"
+    "from langchain_core.output_parsers import StrOutputParser"
   ]
  },
  {
--- a/cookbook/retrieval_in_sql.ipynb
+++ b/cookbook/retrieval_in_sql.ipynb
@@ -133,7 +133,7 @@
    "from tqdm import tqdm\n",
    "\n",
    "for i in tqdm(range(len(title_embeddings))):\n",
-    "    title = titles[i].replace(\"'\", \"''\")\n",
+    "    title = song_titles[i].replace(\"'\", \"''\")\n",
    "    embedding = title_embeddings[i]\n",
    "    sql_command = (\n",
    "        f'UPDATE \"Track\" SET \"embeddings\" = ARRAY{embedding} WHERE \"Name\" ='\n",
@@ -268,8 +268,8 @@
   "outputs": [],
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.schema.runnable import RunnablePassthrough\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnablePassthrough\n",
    "\n",
    "db = SQLDatabase.from_uri(\n",
    "    CONNECTION_STRING\n",
@@ -324,7 +324,7 @@
   "source": [
    "import re\n",
    "\n",
-    "from langchain.schema.runnable import RunnableLambda\n",
+    "from langchain_core.runnables import RunnableLambda\n",
    "\n",
    "\n",
    "def replace_brackets(match):\n",
@@ -681,9 +681,9 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.8.18"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
- "nbformat_minor": 2
+ "nbformat_minor": 4
 }
--- a/cookbook/rewrite.ipynb
+++ b/cookbook/rewrite.ipynb
@@ -33,9 +33,9 @@
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.schema.runnable import RunnablePassthrough\n",
-    "from langchain.utilities import DuckDuckGoSearchAPIWrapper"
+    "from langchain.utilities import DuckDuckGoSearchAPIWrapper\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnablePassthrough"
   ]
  },
  {
--- a/cookbook/selecting_llms_based_on_context_length.ipynb
+++ b/cookbook/selecting_llms_based_on_context_length.ipynb
@@ -19,8 +19,8 @@
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.prompts import PromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.schema.prompt import PromptValue"
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.prompt_values import PromptValue"
   ]
  },
  {
--- a/cookbook/stepback-qa.ipynb
+++ b/cookbook/stepback-qa.ipynb
@@ -25,8 +25,8 @@
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.prompts import ChatPromptTemplate, FewShotChatMessagePromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.schema.runnable import RunnableLambda"
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnableLambda"
   ]
  },
  {
--- a/cookbook/wikibase_agent.ipynb
+++ b/cookbook/wikibase_agent.ipynb
@@ -187,7 +187,7 @@
    "    for key in path:\n",
    "        try:\n",
    "            current = current[key]\n",
-    "        except:\n",
+    "        except KeyError:\n",
    "            return None\n",
    "    return current\n",
    "\n",
--- a/docs/.local_build.sh
+++ b/docs/.local_build.sh
@@ -9,13 +9,15 @@ SCRIPT_DIR="$(cd "$(dirname "$0")"; pwd)"
 cd "${SCRIPT_DIR}"

 mkdir -p ../_dist
-cp -r . ../_dist
+rsync -ruv --exclude node_modules --exclude api_reference --exclude .venv --exclude .docusaurus . ../_dist
 cd ../_dist
 poetry run python scripts/model_feat_table.py
-poetry run nbdoc_build --srcdir docs
 cp ../cookbook/README.md src/pages/cookbook.mdx
 cp ../.github/CONTRIBUTING.md docs/contributing.md
+mkdir -p docs/templates
+cp ../templates/docs/INDEX.md docs/templates/index.md
 wget https://raw.githubusercontent.com/langchain-ai/langserve/main/README.md -O docs/langserve.md
-poetry run python scripts/generate_api_reference_links.py
-yarn install
-yarn start
+
+yarn
+
+quarto preview docs
--- a/docs/api_reference/create_api_rst.py
+++ b/docs/api_reference/create_api_rst.py
@@ -14,9 +14,10 @@ HERE = Path(__file__).parent
 PKG_DIR = ROOT_DIR / "libs" / "langchain" / "langchain"
 EXP_DIR = ROOT_DIR / "libs" / "experimental" / "langchain_experimental"
 CORE_DIR = ROOT_DIR / "libs" / "core" / "langchain_core"
+COMMUNITY_DIR = ROOT_DIR / "libs" / "core" / "langchain_community"
 WRITE_FILE = HERE / "api_reference.rst"
 EXP_WRITE_FILE = HERE / "experimental_api_reference.rst"
-CORE_WRITE_FILE = HERE / "core_api_reference.rst"
+COMMUNITY_WRITE_FILE = HERE / "community_api_reference.rst"


 ClassKind = Literal["TypedDict", "Regular", "Pydantic", "enum"]
@@ -196,11 +197,13 @@ def _load_package_modules(
    return modules_by_namespace


-def _construct_doc(pkg: str, members_by_namespace: Dict[str, ModuleMembers]) -> str:
+def _construct_doc(
+    package_namespace: str, members_by_namespace: Dict[str, ModuleMembers]
+) -> str:
    """Construct the contents of the reference.rst file for the given package.

    Args:
-        pkg: The package name
+        package_namespace: The package top level namespace
        members_by_namespace: The members of the package, dict organized by top level
                              module contains a list of classes and functions
                              inside of the top level namespace.
@@ -210,7 +213,7 @@ def _construct_doc(pkg: str, members_by_namespace: Dict[str, ModuleMembers]) ->
    """
    full_doc = f"""\
 =======================
-``{pkg}`` API Reference
+``{package_namespace}`` API Reference
 =======================

 """
@@ -222,13 +225,13 @@ def _construct_doc(pkg: str, members_by_namespace: Dict[str, ModuleMembers]) ->
        functions = _members["functions"]
        if not (classes or functions):
            continue
-        section = f":mod:`{pkg}.{module}`"
+        section = f":mod:`{package_namespace}.{module}`"
        underline = "=" * (len(section) + 1)
        full_doc += f"""\
 {section}
 {underline}

-.. automodule:: {pkg}.{module}
+.. automodule:: {package_namespace}.{module}
    :no-members:
    :no-inherited-members:

@@ -238,7 +241,7 @@ def _construct_doc(pkg: str, members_by_namespace: Dict[str, ModuleMembers]) ->
            full_doc += f"""\
 Classes
 --------------
-.. currentmodule:: {pkg}
+.. currentmodule:: {package_namespace}

 .. autosummary::
    :toctree: {module}
@@ -270,7 +273,7 @@ Classes
            full_doc += f"""\
 Functions
 --------------
-.. currentmodule:: {pkg}
+.. currentmodule:: {package_namespace}

 .. autosummary::
    :toctree: {module}
@@ -282,57 +285,61 @@ Functions
    return full_doc


-def _document_langchain_experimental() -> None:
-    """Document the langchain_experimental package."""
-    # Generate experimental_api_reference.rst
-    exp_members = _load_package_modules(EXP_DIR)
-    exp_doc = ".. _experimental_api_reference:\n\n" + _construct_doc(
-        "langchain_experimental", exp_members
-    )
-    with open(EXP_WRITE_FILE, "w") as f:
-        f.write(exp_doc)
+def _build_rst_file(package_name: str = "langchain") -> None:
+    """Create a rst file for building of documentation.
+
+    Args:
+        package_name: Can be either "langchain" or "core" or "experimental".
+    """
+    package_members = _load_package_modules(_package_dir(package_name))
+    with open(_out_file_path(package_name), "w") as f:
+        f.write(
+            _doc_first_line(package_name)
+            + _construct_doc(package_namespace[package_name], package_members)
+        )


-def _document_langchain_core() -> None:
-    """Document the langchain_core package."""
-    # Generate core_api_reference.rst
-    core_members = _load_package_modules(EXP_DIR)
-    core_doc = ".. _core_api_reference:\n\n" + _construct_doc(
-        "langchain_core", core_members
-    )
-    with open(CORE_WRITE_FILE, "w") as f:
-        f.write(core_doc)
+package_namespace = {
+    "langchain": "langchain",
+    "experimental": "langchain_experimental",
+    "core": "langchain_core",
+    "community": "langchain_community",
+}


-def _document_langchain() -> None:
-    """Document the main langchain package."""
-    # load top level module members
-    lc_members = _load_package_modules(PKG_DIR)
+def _package_dir(package_name: str = "langchain") -> Path:
+    """Return the path to the directory containing the documentation."""
+    return ROOT_DIR / "libs" / package_name / package_namespace[package_name]

-    # Add additional packages
-    tools = _load_package_modules(PKG_DIR, "tools")
-    agents = _load_package_modules(PKG_DIR, "agents")
-    schema = _load_package_modules(PKG_DIR, "schema")

-    lc_members.update(
-        {
-            "agents.output_parsers": agents["output_parsers"],
-            "agents.format_scratchpad": agents["format_scratchpad"],
-            "tools.render": tools["render"],
-        }
-    )
+def _out_file_path(package_name: str = "langchain") -> Path:
+    """Return the path to the file containing the documentation."""
+    name_prefix = {
+        "langchain": "",
+        "experimental": "experimental_",
+        "core": "core_",
+        "community": "community_",
+    }
+    return HERE / f"{name_prefix[package_name]}api_reference.rst"

-    lc_doc = ".. _api_reference:\n\n" + _construct_doc("langchain", lc_members)

-    with open(WRITE_FILE, "w") as f:
-        f.write(lc_doc)
+def _doc_first_line(package_name: str = "langchain") -> str:
+    """Return the path to the file containing the documentation."""
+    prefix = {
+        "langchain": "",
+        "experimental": "experimental",
+        "core": "core",
+        "community": "community",
+    }
+    return f".. {prefix[package_name]}_api_reference:\n\n"


 def main() -> None:
-    """Generate the reference.rst file for each package."""
-    _document_langchain()
-    _document_langchain_experimental()
-    _document_langchain_core()
+    """Generate the api_reference.rst file for each package."""
+    _build_rst_file(package_name="core")
+    _build_rst_file(package_name="langchain")
+    _build_rst_file(package_name="experimental")
+    _build_rst_file(package_name="community")


 if __name__ == "__main__":
--- a/docs/api_reference/requirements.txt
+++ b/docs/api_reference/requirements.txt
@@ -1,5 +1,7 @@
-e libs/langchain
 -e libs/experimental
+-e libs/langchain
+-e libs/core
+-e libs/community
 pydantic<2
 autodoc_pydantic==1.8.0
 myst_parser
--- a/docs/api_reference/themes/scikit-learn-modern/nav.html
+++ b/docs/api_reference/themes/scikit-learn-modern/nav.html
@@ -37,6 +37,9 @@
        <li class="nav-item">
          <a class="sk-nav-link nav-link" href="{{ pathto('core_api_reference') }}">Core</a>
        </li>
+        <li class="nav-item">
+          <a class="sk-nav-link nav-link" href="{{ pathto('community_api_reference') }}">Community</a>
+        </li>
        <li class="nav-item">
          <a class="sk-nav-link nav-link" href="{{ pathto('experimental_api_reference') }}">Experimental</a>
        </li>
--- a/docs/docs/additional_resources/dependents.mdx
+++ b/docs/docs/additional_resources/dependents.mdx
--- a/docs/docs/expression_language/cookbook/code_writing.ipynb
+++ b/docs/docs/expression_language/cookbook/code_writing.ipynb
@@ -21,7 +21,7 @@
    "from langchain.prompts import (\n",
    "    ChatPromptTemplate,\n",
    ")\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
    "from langchain_experimental.utilities import PythonREPL"
   ]
  },
--- a/docs/docs/expression_language/cookbook/embedding_router.ipynb
+++ b/docs/docs/expression_language/cookbook/embedding_router.ipynb
@@ -22,9 +22,9 @@
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.embeddings import OpenAIEmbeddings\n",
    "from langchain.prompts import PromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.schema.runnable import RunnableLambda, RunnablePassthrough\n",
    "from langchain.utils.math import cosine_similarity\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnableLambda, RunnablePassthrough\n",
    "\n",
    "physics_template = \"\"\"You are a very smart physics professor. \\\n",
    "You are great at answering questions about physics in a concise and easy to understand manner. \\\n",
--- a/docs/docs/expression_language/cookbook/index.mdx
+++ b/docs/docs/expression_language/cookbook/index.mdx
@@ -1,5 +1,5 @@
 ---
-sidebar_position: 2
+sidebar_position: 3
 ---

 # Cookbook
--- a/docs/docs/expression_language/cookbook/memory.ipynb
+++ b/docs/docs/expression_language/cookbook/memory.ipynb
@@ -22,7 +22,7 @@
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.memory import ConversationBufferMemory\n",
    "from langchain.prompts import ChatPromptTemplate, MessagesPlaceholder\n",
-    "from langchain.schema.runnable import RunnableLambda, RunnablePassthrough\n",
+    "from langchain_core.runnables import RunnableLambda, RunnablePassthrough\n",
    "\n",
    "model = ChatOpenAI()\n",
    "prompt = ChatPromptTemplate.from_messages(\n",
--- a/docs/docs/expression_language/cookbook/multiple_chains.ipynb
+++ b/docs/docs/expression_language/cookbook/multiple_chains.ipynb
@@ -69,7 +69,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.schema.runnable import RunnablePassthrough\n",
+    "from langchain_core.runnables import RunnablePassthrough\n",
    "\n",
    "prompt1 = ChatPromptTemplate.from_template(\n",
    "    \"generate a {attribute} color. Return the name of the color and nothing else:\"\n",
@@ -146,7 +146,7 @@
   "source": [
    "### Branching and Merging\n",
    "\n",
-    "You may want the output of one component to be processed by 2 or more other components. [RunnableMaps](https://api.python.langchain.com/en/latest/schema/langchain.schema.runnable.base.RunnableMap.html) let you split or fork the chain so multiple components can process the input in parallel. Later, other components can join or merge the results to synthesize a final response. This type of chain creates a computation graph that looks like the following:\n",
+    "You may want the output of one component to be processed by 2 or more other components. [RunnableParallels](https://api.python.langchain.com/en/latest/runnables/langchain_core.runnables.base.RunnableParallel.html#langchain_core.runnables.base.RunnableParallel) let you split or fork the chain so multiple components can process the input in parallel. Later, other components can join or merge the results to synthesize a final response. This type of chain creates a computation graph that looks like the following:\n",
    "\n",
    "```text\n",
    "     Input\n",
--- a/docs/docs/expression_language/cookbook/prompt_llm_parser.ipynb
+++ b/docs/docs/expression_language/cookbook/prompt_llm_parser.ipynb
@@ -191,7 +191,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.schema.output_parser import StrOutputParser\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
    "\n",
    "chain = prompt | model | StrOutputParser()"
   ]
@@ -317,7 +317,7 @@
   "source": [
    "## Simplifying input\n",
    "\n",
-    "To make invocation even simpler, we can add a `RunnableMap` to take care of creating the prompt input dict for us:"
+    "To make invocation even simpler, we can add a `RunnableParallel` to take care of creating the prompt input dict for us:"
   ]
  },
  {
@@ -327,9 +327,9 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.schema.runnable import RunnableMap, RunnablePassthrough\n",
+    "from langchain_core.runnables import RunnableParallel, RunnablePassthrough\n",
    "\n",
-    "map_ = RunnableMap(foo=RunnablePassthrough())\n",
+    "map_ = RunnableParallel(foo=RunnablePassthrough())\n",
    "chain = (\n",
    "    map_\n",
    "    | prompt\n",
--- a/docs/docs/expression_language/cookbook/prompt_size.ipynb
+++ b/docs/docs/expression_language/cookbook/prompt_size.ipynb
@@ -209,7 +209,10 @@
   "id": "637f994a-5134-402a-bcf0-4de3911eaf49",
   "metadata": {},
   "source": [
-    ":::tip [LangSmith trace](https://smith.langchain.com/public/60909eae-f4f1-43eb-9f96-354f5176f66f/r)\n",
+    ":::tip\n",
+    "\n",
+    "[LangSmith trace](https://smith.langchain.com/public/60909eae-f4f1-43eb-9f96-354f5176f66f/r)\n",
+    "\n",
    ":::"
   ]
  },
@@ -374,7 +377,10 @@
   "id": "5a7e498b-dc68-4267-a35c-90ceffa91c46",
   "metadata": {},
   "source": [
-    ":::tip [LangSmith trace](https://smith.langchain.com/public/3b27d47f-e4df-4afb-81b1-0f88b80ca97e/r)\n",
+    ":::tip\n",
+    "\n",
+    "[LangSmith trace](https://smith.langchain.com/public/3b27d47f-e4df-4afb-81b1-0f88b80ca97e/r)\n",
+    "\n",
    ":::"
   ]
  }
--- a/docs/docs/expression_language/cookbook/retrieval.ipynb
+++ b/docs/docs/expression_language/cookbook/retrieval.ipynb
@@ -31,7 +31,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 10,
+   "execution_count": 1,
   "id": "33be32af",
   "metadata": {},
   "outputs": [],
@@ -41,14 +41,14 @@
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.embeddings import OpenAIEmbeddings\n",
    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.schema.runnable import RunnableLambda, RunnablePassthrough\n",
-    "from langchain.vectorstores import FAISS"
+    "from langchain.vectorstores import FAISS\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnableLambda, RunnablePassthrough"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 6,
+   "execution_count": 2,
   "id": "bfc47ec1",
   "metadata": {},
   "outputs": [],
@@ -70,7 +70,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 4,
+   "execution_count": 3,
   "id": "eae31755",
   "metadata": {},
   "outputs": [],
@@ -85,7 +85,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 18,
+   "execution_count": 4,
   "id": "f3040b0c",
   "metadata": {},
   "outputs": [
@@ -95,7 +95,7 @@
       "'Harrison worked at Kensho.'"
      ]
     },
-     "execution_count": 5,
+     "execution_count": 4,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -106,7 +106,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 6,
+   "execution_count": 5,
   "id": "e1d20c7c",
   "metadata": {},
   "outputs": [],
@@ -134,7 +134,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 7,
+   "execution_count": 6,
   "id": "7ee8b2d4",
   "metadata": {},
   "outputs": [
@@ -144,7 +144,7 @@
       "'Harrison ha lavorato a Kensho.'"
      ]
     },
-     "execution_count": 7,
+     "execution_count": 6,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -165,18 +165,19 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 8,
+   "execution_count": 21,
   "id": "3f30c348",
   "metadata": {},
   "outputs": [],
   "source": [
    "from langchain.schema import format_document\n",
-    "from langchain.schema.runnable import RunnableMap"
+    "from langchain_core.messages import AIMessage, HumanMessage, get_buffer_string\n",
+    "from langchain_core.runnables import RunnableParallel"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 9,
+   "execution_count": 8,
   "id": "64ab1dbf",
   "metadata": {},
   "outputs": [],
@@ -194,7 +195,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 10,
+   "execution_count": 9,
   "id": "7d628c97",
   "metadata": {},
   "outputs": [],
@@ -209,7 +210,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 11,
+   "execution_count": 10,
   "id": "f60a5d0f",
   "metadata": {},
   "outputs": [],
@@ -226,33 +227,14 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 12,
-   "id": "7d007db6",
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "from typing import List, Tuple\n",
-    "\n",
-    "\n",
-    "def _format_chat_history(chat_history: List[Tuple]) -> str:\n",
-    "    buffer = \"\"\n",
-    "    for dialogue_turn in chat_history:\n",
-    "        human = \"Human: \" + dialogue_turn[0]\n",
-    "        ai = \"Assistant: \" + dialogue_turn[1]\n",
-    "        buffer += \"\\n\" + \"\\n\".join([human, ai])\n",
-    "    return buffer"
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 13,
+   "execution_count": 11,
   "id": "5c32cc89",
   "metadata": {},
   "outputs": [],
   "source": [
-    "_inputs = RunnableMap(\n",
+    "_inputs = RunnableParallel(\n",
    "    standalone_question=RunnablePassthrough.assign(\n",
-    "        chat_history=lambda x: _format_chat_history(x[\"chat_history\"])\n",
+    "        chat_history=lambda x: get_buffer_string(x[\"chat_history\"])\n",
    "    )\n",
    "    | CONDENSE_QUESTION_PROMPT\n",
    "    | ChatOpenAI(temperature=0)\n",
@@ -267,17 +249,17 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 14,
+   "execution_count": 12,
   "id": "135c8205",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "AIMessage(content='Harrison was employed at Kensho.', additional_kwargs={}, example=False)"
+       "AIMessage(content='Harrison was employed at Kensho.')"
      ]
     },
-     "execution_count": 14,
+     "execution_count": 12,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -293,17 +275,17 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 15,
+   "execution_count": 22,
   "id": "424e7e7a",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "AIMessage(content='Harrison worked at Kensho.', additional_kwargs={}, example=False)"
+       "AIMessage(content='Harrison worked at Kensho.')"
      ]
     },
-     "execution_count": 15,
+     "execution_count": 22,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -312,7 +294,10 @@
    "conversational_qa_chain.invoke(\n",
    "    {\n",
    "        \"question\": \"where did he work?\",\n",
-    "        \"chat_history\": [(\"Who wrote this notebook?\", \"Harrison\")],\n",
+    "        \"chat_history\": [\n",
+    "            HumanMessage(content=\"Who wrote this notebook?\"),\n",
+    "            AIMessage(content=\"Harrison\"),\n",
+    "        ],\n",
    "    }\n",
    ")"
   ]
@@ -329,7 +314,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 16,
+   "execution_count": 14,
   "id": "e31dd17c",
   "metadata": {},
   "outputs": [],
@@ -341,7 +326,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 17,
+   "execution_count": 15,
   "id": "d4bffe94",
   "metadata": {},
   "outputs": [],
@@ -353,7 +338,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 18,
+   "execution_count": 16,
   "id": "733be985",
   "metadata": {},
   "outputs": [],
@@ -367,7 +352,7 @@
    "standalone_question = {\n",
    "    \"standalone_question\": {\n",
    "        \"question\": lambda x: x[\"question\"],\n",
-    "        \"chat_history\": lambda x: _format_chat_history(x[\"chat_history\"]),\n",
+    "        \"chat_history\": lambda x: get_buffer_string(x[\"chat_history\"]),\n",
    "    }\n",
    "    | CONDENSE_QUESTION_PROMPT\n",
    "    | ChatOpenAI(temperature=0)\n",
@@ -394,18 +379,18 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 19,
+   "execution_count": 17,
   "id": "806e390c",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "{'answer': AIMessage(content='Harrison was employed at Kensho.', additional_kwargs={}, example=False),\n",
-       " 'docs': [Document(page_content='harrison worked at kensho', metadata={})]}"
+       "{'answer': AIMessage(content='Harrison was employed at Kensho.'),\n",
+       " 'docs': [Document(page_content='harrison worked at kensho')]}"
      ]
     },
-     "execution_count": 19,
+     "execution_count": 17,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -418,7 +403,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 20,
+   "execution_count": 18,
   "id": "977399fd",
   "metadata": {},
   "outputs": [],
@@ -431,18 +416,18 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 21,
+   "execution_count": 19,
   "id": "f94f7de4",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "{'history': [HumanMessage(content='where did harrison work?', additional_kwargs={}, example=False),\n",
-       "  AIMessage(content='Harrison was employed at Kensho.', additional_kwargs={}, example=False)]}"
+       "{'history': [HumanMessage(content='where did harrison work?'),\n",
+       "  AIMessage(content='Harrison was employed at Kensho.')]}"
      ]
     },
-     "execution_count": 21,
+     "execution_count": 19,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -450,6 +435,38 @@
   "source": [
    "memory.load_memory_variables({})"
   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 20,
+   "id": "88f2b7cd",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "{'answer': AIMessage(content='Harrison actually worked at Kensho.'),\n",
+       " 'docs': [Document(page_content='harrison worked at kensho')]}"
+      ]
+     },
+     "execution_count": 20,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "inputs = {\"question\": \"but where did he really work?\"}\n",
+    "result = final_chain.invoke(inputs)\n",
+    "result"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "207a2782",
+   "metadata": {},
+   "outputs": [],
+   "source": []
  }
 ],
 "metadata": {
@@ -468,7 +485,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.10.1"
  }
 },
 "nbformat": 4,
--- a/docs/docs/expression_language/cookbook/sql_db.ipynb
+++ b/docs/docs/expression_language/cookbook/sql_db.ipynb
@@ -94,8 +94,8 @@
   "outputs": [],
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.schema.runnable import RunnablePassthrough\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnablePassthrough\n",
    "\n",
    "model = ChatOpenAI()\n",
    "\n",
--- a/docs/docs/expression_language/cookbook/tools.ipynb
+++ b/docs/docs/expression_language/cookbook/tools.ipynb
@@ -29,8 +29,8 @@
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.tools import DuckDuckGoSearchRun"
+    "from langchain.tools import DuckDuckGoSearchRun\n",
+    "from langchain_core.output_parsers import StrOutputParser"
   ]
  },
  {
--- a/docs/docs/expression_language/get_started.ipynb
+++ b/docs/docs/expression_language/get_started.ipynb
@@ -0,0 +1,493 @@
+{
+ "cells": [
+  {
+   "cell_type": "raw",
+   "id": "366a0e68-fd67-4fe5-a292-5c33733339ea",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "sidebar_position: 0\n",
+    "title: Get started\n",
+    "---"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "befa7fd1",
+   "metadata": {},
+   "source": [
+    "LCEL makes it easy to build complex chains from basic components, and supports out of the box functionality such as streaming, parallelism, and logging."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "9a9acd2e",
+   "metadata": {},
+   "source": [
+    "## Basic example: prompt + model + output parser\n",
+    "\n",
+    "The most basic and common use case is chaining a prompt template and a model together. To see how this works, let's create a chain that takes a topic and generates a joke:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 7,
+   "id": "466b65b3",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "\"Why did the ice cream go to therapy?\\n\\nBecause it had too many toppings and couldn't find its cone-fidence!\""
+      ]
+     },
+     "execution_count": 7,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "from langchain.chat_models import ChatOpenAI\n",
+    "from langchain.prompts import ChatPromptTemplate\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "\n",
+    "prompt = ChatPromptTemplate.from_template(\"tell me a short joke about {topic}\")\n",
+    "model = ChatOpenAI()\n",
+    "output_parser = StrOutputParser()\n",
+    "\n",
+    "chain = prompt | model | output_parser\n",
+    "\n",
+    "chain.invoke({\"topic\": \"ice cream\"})"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "81c502c5-85ee-4f36-aaf4-d6e350b7792f",
+   "metadata": {},
+   "source": [
+    "Notice this line of this code, where we piece together then different components into a single chain using LCEL:\n",
+    "\n",
+    "```\n",
+    "chain = prompt | model | output_parser\n",
+    "```\n",
+    "\n",
+    "The `|` symbol is similar to a [unix pipe operator](https://en.wikipedia.org/wiki/Pipeline_(Unix)), which chains together the different components feeds the output from one component as input into the next component. \n",
+    "\n",
+    "In this chain the user input is passed to the prompt template, then the prompt template output is passed to the model, then the model output is passed to the output parser. Let's take a look at each component individually to really understand what's going on. "
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "aa1b77fa",
+   "metadata": {},
+   "source": [
+    "### 1. Prompt\n",
+    "\n",
+    "`prompt` is a `BasePromptTemplate`, which means it takes in a dictionary of template variables and produces a `PromptValue`. A `PromptValue` is a wrapper around a completed prompt that can be passed to either an `LLM` (which takes a string as input) or `ChatModel` (which takes a sequence of messages as input). It can work with either language model type because it defines logic both for producing `BaseMessage`s and for producing a string."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 8,
+   "id": "b8656990",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "ChatPromptValue(messages=[HumanMessage(content='tell me a short joke about ice cream')])"
+      ]
+     },
+     "execution_count": 8,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "prompt_value = prompt.invoke({\"topic\": \"ice cream\"})\n",
+    "prompt_value"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "id": "e6034488",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "[HumanMessage(content='tell me a short joke about ice cream')]"
+      ]
+     },
+     "execution_count": 9,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "prompt_value.to_messages()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "id": "60565463",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "'Human: tell me a short joke about ice cream'"
+      ]
+     },
+     "execution_count": 10,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "prompt_value.to_string()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "577f0f76",
+   "metadata": {},
+   "source": [
+    "### 2. Model\n",
+    "\n",
+    "The `PromptValue` is then passed to `model`. In this case our `model` is a `ChatModel`, meaning it will output a `BaseMessage`."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 11,
+   "id": "33cf5f72",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "AIMessage(content=\"Why did the ice cream go to therapy? \\n\\nBecause it had too many toppings and couldn't find its cone-fidence!\")"
+      ]
+     },
+     "execution_count": 11,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "message = model.invoke(prompt_value)\n",
+    "message"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "327e7db8",
+   "metadata": {},
+   "source": [
+    "If our `model` was an `LLM`, it would output a string."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 12,
+   "id": "8feb05da",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "'\\n\\nRobot: Why did the ice cream go to therapy? Because it had a rocky road.'"
+      ]
+     },
+     "execution_count": 12,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "from langchain.llms import OpenAI\n",
+    "\n",
+    "llm = OpenAI(model=\"gpt-3.5-turbo-instruct\")\n",
+    "llm.invoke(prompt_value)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "91847478",
+   "metadata": {},
+   "source": [
+    "### 3. Output parser\n",
+    "\n",
+    "And lastly we pass our `model` output to the `output_parser`, which is a `BaseOutputParser` meaning it takes either a string or a \n",
+    "`BaseMessage` as input. The `StrOutputParser` specifically simple converts any input into a string."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 13,
+   "id": "533e59a8",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "\"Why did the ice cream go to therapy? \\n\\nBecause it had too many toppings and couldn't find its cone-fidence!\""
+      ]
+     },
+     "execution_count": 13,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "output_parser.invoke(message)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "9851e842",
+   "metadata": {},
+   "source": [
+    "### 4. Entire Pipeline\n",
+    "\n",
+    "To follow the steps along:\n",
+    "\n",
+    "1. We pass in user input on the desired topic as `{\"topic\": \"ice cream\"}`\n",
+    "2. The `prompt` component takes the user input, which is then used to construct a PromptValue after using the `topic` to construct the prompt. \n",
+    "3. The `model` component takes the generated prompt, and passes into the OpenAI LLM model for evaluation. The generated output from the model is a `ChatMessage` object. \n",
+    "4. Finally, the `output_parser` component takes in a `ChatMessage`, and transforms this into a Python string, which is returned from the invoke method. \n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "c4873109",
+   "metadata": {},
+   "source": [
+    "```mermaid\n",
+    "graph LR\n",
+    "    A(Input: topic=ice cream) --> |Dict| B(PromptTemplate)\n",
+    "    B -->|PromptValue| C(ChatModel)    \n",
+    "    C -->|ChatMessage| D(StrOutputParser)\n",
+    "    D --> |String| F(Result)\n",
+    "```\n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "fe63534d",
+   "metadata": {},
+   "source": [
+    ":::info\n",
+    "\n",
+    "Note that if you’re curious about the output of any components, you can always test out a smaller version of the chain such as `prompt`  or `prompt | model` to see the intermediate results:\n",
+    "\n",
+    ":::"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "11089b6f-23f8-474f-97ec-8cae8d0ca6d4",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "input = {\"topic\": \"ice cream\"}\n",
+    "\n",
+    "prompt.invoke(input)\n",
+    "# > ChatPromptValue(messages=[HumanMessage(content='tell me a short joke about ice cream')])\n",
+    "\n",
+    "(prompt | model).invoke(input)\n",
+    "# > AIMessage(content=\"Why did the ice cream go to therapy?\\nBecause it had too many toppings and couldn't cone-trol itself!\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "cc7d3b9d-e400-4c9b-9188-f29dac73e6bb",
+   "metadata": {},
+   "source": [
+    "## RAG Search Example\n",
+    "\n",
+    "For our next example, we want to run a retrieval-augmented generation chain to add some context when responding to questions. "
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "662426e8-4316-41dc-8312-9b58edc7e0c9",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Requires:\n",
+    "# pip install langchain docarray tiktoken\n",
+    "\n",
+    "from langchain.chat_models import ChatOpenAI\n",
+    "from langchain.embeddings import OpenAIEmbeddings\n",
+    "from langchain.prompts import ChatPromptTemplate\n",
+    "from langchain.vectorstores import DocArrayInMemorySearch\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnableParallel, RunnablePassthrough\n",
+    "\n",
+    "vectorstore = DocArrayInMemorySearch.from_texts(\n",
+    "    [\"harrison worked at kensho\", \"bears like to eat honey\"],\n",
+    "    embedding=OpenAIEmbeddings(),\n",
+    ")\n",
+    "retriever = vectorstore.as_retriever()\n",
+    "\n",
+    "template = \"\"\"Answer the question based only on the following context:\n",
+    "{context}\n",
+    "\n",
+    "Question: {question}\n",
+    "\"\"\"\n",
+    "prompt = ChatPromptTemplate.from_template(template)\n",
+    "model = ChatOpenAI()\n",
+    "output_parser = StrOutputParser()\n",
+    "\n",
+    "setup_and_retrieval = RunnableParallel(\n",
+    "    {\"context\": retriever, \"question\": RunnablePassthrough()}\n",
+    ")\n",
+    "chain = setup_and_retrieval | prompt | model | output_parser\n",
+    "\n",
+    "chain.invoke(\"where did harrison work?\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "f0999140-6001-423b-970b-adf1dfdb4dec",
+   "metadata": {},
+   "source": [
+    "In this case, the composed chain is: "
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "5b88e9bb-f04a-4a56-87ec-19a0e6350763",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "chain = setup_and_retrieval | prompt | model | output_parser"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "6e929e15-40a5-4569-8969-384f636cab87",
+   "metadata": {},
+   "source": [
+    "To explain this, we first can see that the prompt template above takes in `context` and `question` as values to be substituted in the prompt. Before building the prompt template, we want to retrieve relevant documents to the search and include them as part of the context. \n",
+    "\n",
+    "As a preliminary step, we’ve setup the retriever using an in memory store, which can retrieve documents based on a query. This is a runnable component as well that can be chained together with other components, but you can also try to run it separately:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "a7319ef6-613b-4638-ad7d-4a2183702c1d",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "retriever.invoke(\"where did harrison work?\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "e6833844-f1c4-444c-a3d2-31b3c6b31d46",
+   "metadata": {},
+   "source": [
+    "We then use the `RunnableParallel` to prepare the expected inputs into the prompt by using the entries for the retrieved documents as well as the original user question, using the retriever for document search, and RunnablePassthrough to pass the user’s question:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "dcbca26b-d6b9-4c24-806c-1ec8fdaab4ed",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "setup_and_retrieval = RunnableParallel(\n",
+    "    {\"context\": retriever, \"question\": RunnablePassthrough()}\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "68c721c1-048b-4a64-9d78-df54fe465992",
+   "metadata": {},
+   "source": [
+    "To review, the complete chain is:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "1d5115a7-7b8e-458b-b936-26cc87ee81c4",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "setup_and_retrieval = RunnableParallel(\n",
+    "    {\"context\": retriever, \"question\": RunnablePassthrough()}\n",
+    ")\n",
+    "chain = setup_and_retrieval | prompt | model | output_parser"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "5c6f5f74-b387-48a0-bedd-1fae202cd10a",
+   "metadata": {},
+   "source": [
+    "With the flow being:\n",
+    "\n",
+    "1. The first steps create a `RunnableParallel` object with two entries.  The first entry, `context` will include the document results fetched by the retriever. The second entry, `question` will contain the user’s original question. To pass on the question, we use `RunnablePassthrough` to copy this entry. \n",
+    "2. Feed the dictionary from the step above to the `prompt` component. It then takes the user input which is `question` as well as the retrieved document which is `context` to construct a prompt and output a PromptValue.  \n",
+    "3. The `model` component takes the generated prompt, and passes into the OpenAI LLM model for evaluation. The generated output from the model is a `ChatMessage` object. \n",
+    "4. Finally, the `output_parser` component takes in a `ChatMessage`, and transforms this into a Python string, which is returned from the invoke method.\n",
+    "\n",
+    "```mermaid\n",
+    "graph LR\n",
+    "    A(Question) --> B(RunnableParallel)\n",
+    "    B -->|Question| C(Retriever)\n",
+    "    B -->|Question| D(RunnablePassThrough)\n",
+    "    C -->|context=retrieved docs| E(PromptTemplate)\n",
+    "    D -->|question=Question| E\n",
+    "    E -->|PromptValue| F(ChatModel)    \n",
+    "    F -->|ChatMessage| G(StrOutputParser)\n",
+    "    G --> |String| H(Result)\n",
+    "```\n",
+    "\n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "8c2438df-164e-4bbe-b5f4-461695e45b0f",
+   "metadata": {},
+   "source": [
+    "## Next steps\n",
+    "\n",
+    "We recommend reading our [Why use LCEL](/docs/expression_language/why) section next to see a side-by-side comparison of the code needed to produce common functionality with and without LCEL."
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.9.1"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/docs/expression_language/how_to/binding.ipynb
+++ b/docs/docs/expression_language/how_to/binding.ipynb
@@ -22,7 +22,7 @@
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.prompts import ChatPromptTemplate\n",
    "from langchain.schema import StrOutputParser\n",
-    "from langchain.schema.runnable import RunnablePassthrough"
+    "from langchain_core.runnables import RunnablePassthrough"
   ]
  },
  {
--- a/docs/docs/expression_language/how_to/configure.ipynb
+++ b/docs/docs/expression_language/how_to/configure.ipynb
@@ -43,6 +43,7 @@
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.prompts import PromptTemplate\n",
+    "from langchain_core.runnables import ConfigurableField\n",
    "\n",
    "model = ChatOpenAI(temperature=0).configurable_fields(\n",
    "    temperature=ConfigurableField(\n",
@@ -264,7 +265,7 @@
   "source": [
    "from langchain.chat_models import ChatAnthropic, ChatOpenAI\n",
    "from langchain.prompts import PromptTemplate\n",
-    "from langchain.schema.runnable import ConfigurableField"
+    "from langchain_core.runnables import ConfigurableField"
   ]
  },
  {
@@ -594,7 +595,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.11.5"
  }
 },
 "nbformat": 4,
--- a/docs/docs/expression_language/how_to/fallbacks.ipynb
+++ b/docs/docs/expression_language/how_to/fallbacks.ipynb
@@ -26,7 +26,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 2,
+   "execution_count": 1,
   "id": "d3e893bf",
   "metadata": {},
   "outputs": [],
@@ -44,19 +44,24 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 3,
+   "execution_count": 2,
   "id": "dfdd8bf5",
   "metadata": {},
   "outputs": [],
   "source": [
    "from unittest.mock import patch\n",
    "\n",
-    "from openai.error import RateLimitError"
+    "import httpx\n",
+    "from openai import RateLimitError\n",
+    "\n",
+    "request = httpx.Request(\"GET\", \"/\")\n",
+    "response = httpx.Response(200, request=request)\n",
+    "error = RateLimitError(\"rate limit\", response=response, body=\"\")"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 5,
+   "execution_count": 3,
   "id": "e6fdffc1",
   "metadata": {},
   "outputs": [],
@@ -69,7 +74,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 27,
+   "execution_count": 4,
   "id": "584461ab",
   "metadata": {},
   "outputs": [
@@ -83,10 +88,10 @@
   ],
   "source": [
    "# Let's use just the OpenAI LLm first, to show that we run into an error\n",
-    "with patch(\"openai.ChatCompletion.create\", side_effect=RateLimitError()):\n",
+    "with patch(\"openai.resources.chat.completions.Completions.create\", side_effect=error):\n",
    "    try:\n",
    "        print(openai_llm.invoke(\"Why did the chicken cross the road?\"))\n",
-    "    except:\n",
+    "    except RateLimitError:\n",
    "        print(\"Hit error\")"
   ]
  },
@@ -106,10 +111,10 @@
   ],
   "source": [
    "# Now let's try with fallbacks to Anthropic\n",
-    "with patch(\"openai.ChatCompletion.create\", side_effect=RateLimitError()):\n",
+    "with patch(\"openai.resources.chat.completions.Completions.create\", side_effect=error):\n",
    "    try:\n",
    "        print(llm.invoke(\"Why did the chicken cross the road?\"))\n",
-    "    except:\n",
+    "    except RateLimitError:\n",
    "        print(\"Hit error\")"
   ]
  },
@@ -148,10 +153,10 @@
    "    ]\n",
    ")\n",
    "chain = prompt | llm\n",
-    "with patch(\"openai.ChatCompletion.create\", side_effect=RateLimitError()):\n",
+    "with patch(\"openai.resources.chat.completions.Completions.create\", side_effect=error):\n",
    "    try:\n",
    "        print(chain.invoke({\"animal\": \"kangaroo\"}))\n",
-    "    except:\n",
+    "    except RateLimitError:\n",
    "        print(\"Hit error\")"
   ]
  },
@@ -185,10 +190,10 @@
    ")\n",
    "\n",
    "chain = prompt | llm\n",
-    "with patch(\"openai.ChatCompletion.create\", side_effect=RateLimitError()):\n",
+    "with patch(\"openai.resources.chat.completions.Completions.create\", side_effect=error):\n",
    "    try:\n",
    "        print(chain.invoke({\"animal\": \"kangaroo\"}))\n",
-    "    except:\n",
+    "    except RateLimitError:\n",
    "        print(\"Hit error\")"
   ]
  },
@@ -211,7 +216,7 @@
   "source": [
    "# First let's create a chain with a ChatModel\n",
    "# We add in a string output parser here so the outputs between the two are the same type\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
    "\n",
    "chat_prompt = ChatPromptTemplate.from_messages(\n",
    "    [\n",
@@ -286,7 +291,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.10.1"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/docs/expression_language/how_to/functions.ipynb
+++ b/docs/docs/expression_language/how_to/functions.ipynb
@@ -1,5 +1,16 @@
 {
 "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "sidebar_position: 2\n",
+    "title: \"RunnableLambda: Run Custom Functions\"\n",
+    "keywords: [RunnableLambda, LCEL]\n",
+    "---"
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "fbc4bf6e",
@@ -7,14 +18,14 @@
   "source": [
    "# Run custom functions\n",
    "\n",
-    "You can use arbitrary functions in the pipeline\n",
+    "You can use arbitrary functions in the pipeline.\n",
    "\n",
    "Note that all inputs to these functions need to be a SINGLE argument. If you have a function that accepts multiple arguments, you should write a wrapper that accepts a single input and unpacks it into multiple argument."
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 4,
+   "execution_count": 1,
   "id": "6bb221b3",
   "metadata": {},
   "outputs": [],
@@ -23,7 +34,7 @@
    "\n",
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.schema.runnable import RunnableLambda\n",
+    "from langchain_core.runnables import RunnableLambda\n",
    "\n",
    "\n",
    "def length_function(text):\n",
@@ -56,17 +67,17 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 5,
+   "execution_count": 2,
   "id": "5488ec85",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "AIMessage(content='3 + 9 equals 12.', additional_kwargs={}, example=False)"
+       "AIMessage(content='3 + 9 equals 12.')"
      ]
     },
-     "execution_count": 5,
+     "execution_count": 2,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -82,23 +93,23 @@
   "source": [
    "## Accepting a Runnable Config\n",
    "\n",
-    "Runnable lambdas can optionally accept a [RunnableConfig](https://api.python.langchain.com/en/latest/schema/langchain.schema.runnable.config.RunnableConfig.html?highlight=runnableconfig#langchain.schema.runnable.config.RunnableConfig), which they can use to pass callbacks, tags, and other configuration information to nested runs."
+    "Runnable lambdas can optionally accept a [RunnableConfig](https://api.python.langchain.com/en/latest/runnables/langchain_core.runnables.config.RunnableConfig.html#langchain_core.runnables.config.RunnableConfig), which they can use to pass callbacks, tags, and other configuration information to nested runs."
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 9,
+   "execution_count": 3,
   "id": "80b3b5f6-5d58-44b9-807e-cce9a46bf49f",
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.schema.runnable import RunnableConfig"
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnableConfig"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 10,
+   "execution_count": 4,
   "id": "ff0daf0c-49dd-4d21-9772-e5fa133c5f36",
   "metadata": {},
   "outputs": [],
@@ -125,7 +136,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 12,
+   "execution_count": 5,
   "id": "1a5e709e-9d75-48c7-bb9c-503251990505",
   "metadata": {},
   "outputs": [
@@ -133,6 +144,7 @@
     "name": "stdout",
     "output_type": "stream",
     "text": [
+      "{'foo': 'bar'}\n",
      "Tokens Used: 65\n",
      "\tPrompt Tokens: 56\n",
      "\tCompletion Tokens: 9\n",
@@ -145,9 +157,10 @@
    "from langchain.callbacks import get_openai_callback\n",
    "\n",
    "with get_openai_callback() as cb:\n",
-    "    RunnableLambda(parse_or_fix).invoke(\n",
+    "    output = RunnableLambda(parse_or_fix).invoke(\n",
    "        \"{foo: bar}\", {\"tags\": [\"my-tag\"], \"callbacks\": [cb]}\n",
    "    )\n",
+    "    print(output)\n",
    "    print(cb)"
   ]
  },
@@ -176,7 +189,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.11.5"
  }
 },
 "nbformat": 4,
--- a/docs/docs/expression_language/how_to/generators.ipynb
+++ b/docs/docs/expression_language/how_to/generators.ipynb
@@ -17,6 +17,13 @@
    "Let's implement a custom output parser for comma-separated lists."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Sync version"
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": 1,
@@ -27,7 +34,7 @@
    "\n",
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.prompts.chat import ChatPromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
    "\n",
    "prompt = ChatPromptTemplate.from_template(\n",
    "    \"Write a comma-separated list of 5 animals similar to: {animal}\"\n",
@@ -57,7 +64,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 8,
+   "execution_count": 3,
   "metadata": {},
   "outputs": [
    {
@@ -66,7 +73,7 @@
       "'lion, tiger, wolf, gorilla, panda'"
      ]
     },
-     "execution_count": 8,
+     "execution_count": 3,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -152,12 +159,81 @@
    "list_chain.invoke({\"animal\": \"bear\"})"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Async version"
+   ]
+  },
  {
   "cell_type": "code",
-   "execution_count": null,
+   "execution_count": 8,
   "metadata": {},
   "outputs": [],
-   "source": []
+   "source": [
+    "from typing import AsyncIterator\n",
+    "\n",
+    "\n",
+    "async def asplit_into_list(\n",
+    "    input: AsyncIterator[str]\n",
+    ") -> AsyncIterator[List[str]]:  # async def\n",
+    "    buffer = \"\"\n",
+    "    async for (\n",
+    "        chunk\n",
+    "    ) in input:  # `input` is a `async_generator` object, so use `async for`\n",
+    "        buffer += chunk\n",
+    "        while \",\" in buffer:\n",
+    "            comma_index = buffer.index(\",\")\n",
+    "            yield [buffer[:comma_index].strip()]\n",
+    "            buffer = buffer[comma_index + 1 :]\n",
+    "    yield [buffer.strip()]\n",
+    "\n",
+    "\n",
+    "list_chain = str_chain | asplit_into_list"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "['lion']\n",
+      "['tiger']\n",
+      "['wolf']\n",
+      "['gorilla']\n",
+      "['panda']\n"
+     ]
+    }
+   ],
+   "source": [
+    "async for chunk in list_chain.astream({\"animal\": \"bear\"}):\n",
+    "    print(chunk, flush=True)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "['lion', 'tiger', 'wolf', 'gorilla', 'panda']"
+      ]
+     },
+     "execution_count": 10,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "await list_chain.ainvoke({\"animal\": \"bear\"})"
+   ]
  }
 ],
 "metadata": {
@@ -176,7 +252,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.11.5"
  }
 },
 "nbformat": 4,
--- a/docs/docs/expression_language/how_to/index.mdx
+++ b/docs/docs/expression_language/how_to/index.mdx
@@ -1,5 +1,5 @@
 ---
-sidebar_position: 1
+sidebar_position: 2
 ---

 # How to
--- a/docs/docs/expression_language/how_to/map.ipynb
+++ b/docs/docs/expression_language/how_to/map.ipynb
@@ -1,56 +1,29 @@
 {
 "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "e2596041-9b76-4e74-836f-e6235086bbf0",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "sidebar_position: 0\n",
+    "title: \"RunnableParallel: Manipulating data\"\n",
+    "keywords: [RunnableParallel, RunnableMap, LCEL]\n",
+    "---"
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "b022ab74-794d-4c54-ad47-ff9549ddb9d2",
   "metadata": {},
   "source": [
-    "# Parallelize steps\n",
+    "# Manipulating inputs & output\n",
    "\n",
-    "RunnableParallel (aka. RunnableMap) makes it easy to execute multiple Runnables in parallel, and to return the output of these Runnables as a map."
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 2,
-   "id": "7e1873d6-d4b6-43ac-96a1-edcf178201e0",
-   "metadata": {},
-   "outputs": [
-    {
-     "data": {
-      "text/plain": [
-       "{'joke': AIMessage(content=\"Why don't bears wear shoes? \\n\\nBecause they have bear feet!\", additional_kwargs={}, example=False),\n",
-       " 'poem': AIMessage(content=\"In woodland depths, bear prowls with might,\\nSilent strength, nature's sovereign, day and night.\", additional_kwargs={}, example=False)}"
-      ]
-     },
-     "execution_count": 2,
-     "metadata": {},
-     "output_type": "execute_result"
-    }
-   ],
-   "source": [
-    "from langchain.chat_models import ChatOpenAI\n",
-    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.schema.runnable import RunnableParallel\n",
+    "RunnableParallel can be useful for manipulating the output of one Runnable to match the input format of the next Runnable in a sequence.\n",
    "\n",
-    "model = ChatOpenAI()\n",
-    "joke_chain = ChatPromptTemplate.from_template(\"tell me a joke about {topic}\") | model\n",
-    "poem_chain = (\n",
-    "    ChatPromptTemplate.from_template(\"write a 2-line poem about {topic}\") | model\n",
-    ")\n",
+    "Here the input to prompt is expected to be a map with keys \"context\" and \"question\". The user input is just the question. So we need to get the context using our retriever and passthrough the user input under the \"question\" key.\n",
    "\n",
-    "map_chain = RunnableParallel(joke=joke_chain, poem=poem_chain)\n",
-    "\n",
-    "map_chain.invoke({\"topic\": \"bear\"})"
-   ]
-  },
-  {
-   "cell_type": "markdown",
-   "id": "df867ae9-1cec-4c9e-9fef-21969b206af5",
-   "metadata": {},
-   "source": [
-    "## Manipulating outputs/inputs\n",
-    "Maps can be useful for manipulating the output of one Runnable to match the input format of the next Runnable in a sequence."
+    "\n"
   ]
  },
  {
@@ -71,10 +44,12 @@
    }
   ],
   "source": [
+    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.embeddings import OpenAIEmbeddings\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.schema.runnable import RunnablePassthrough\n",
+    "from langchain.prompts import ChatPromptTemplate\n",
    "from langchain.vectorstores import FAISS\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnablePassthrough\n",
    "\n",
    "vectorstore = FAISS.from_texts(\n",
    "    [\"harrison worked at kensho\"], embedding=OpenAIEmbeddings()\n",
@@ -86,6 +61,7 @@
    "Question: {question}\n",
    "\"\"\"\n",
    "prompt = ChatPromptTemplate.from_template(template)\n",
+    "model = ChatOpenAI()\n",
    "\n",
    "retrieval_chain = (\n",
    "    {\"context\": retriever, \"question\": RunnablePassthrough()}\n",
@@ -102,9 +78,133 @@
   "id": "392cd4c4-e7ed-4ab8-934d-f7a4eca55ee1",
   "metadata": {},
   "source": [
-    "Here the input to prompt is expected to be a map with keys \"context\" and \"question\". The user input is just the question. So we need to get the context using our retriever and passthrough the user input under the \"question\" key.\n",
+    "::: {.callout-tip}\n",
+    "Note that when composing a RunnableParallel with another Runnable we don't even need to wrap our dictionary in the RunnableParallel class — the type conversion is handled for us. In the context of a chain, these are equivalent:\n",
+    ":::\n",
    "\n",
-    "Note that when composing a RunnableMap when another Runnable we don't even need to wrap our dictionary in the RunnableMap class — the type conversion is handled for us."
+    "```\n",
+    "{\"context\": retriever, \"question\": RunnablePassthrough()}\n",
+    "```\n",
+    "\n",
+    "```\n",
+    "RunnableParallel({\"context\": retriever, \"question\": RunnablePassthrough()})\n",
+    "```\n",
+    "\n",
+    "```\n",
+    "RunnableParallel(context=retriever, question=RunnablePassthrough())\n",
+    "```\n",
+    "\n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "7c1b8baa-3a80-44f0-bb79-d22f79815d3d",
+   "metadata": {},
+   "source": [
+    "## Using itemgetter as shorthand\n",
+    "\n",
+    "Note that you can use Python's `itemgetter` as shorthand to extract data from the map when combining with `RunnableParallel`. You can find more information about itemgetter in the [Python Documentation](https://docs.python.org/3/library/operator.html#operator.itemgetter). \n",
+    "\n",
+    "In the example below, we use itemgetter to extract specific keys from the map:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "id": "84fc49e1-2daf-4700-ae33-a0a6ed47d5f6",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "'Harrison ha lavorato a Kensho.'"
+      ]
+     },
+     "execution_count": 6,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "from operator import itemgetter\n",
+    "\n",
+    "from langchain.chat_models import ChatOpenAI\n",
+    "from langchain.embeddings import OpenAIEmbeddings\n",
+    "from langchain.prompts import ChatPromptTemplate\n",
+    "from langchain.vectorstores import FAISS\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnablePassthrough\n",
+    "\n",
+    "vectorstore = FAISS.from_texts(\n",
+    "    [\"harrison worked at kensho\"], embedding=OpenAIEmbeddings()\n",
+    ")\n",
+    "retriever = vectorstore.as_retriever()\n",
+    "\n",
+    "template = \"\"\"Answer the question based only on the following context:\n",
+    "{context}\n",
+    "\n",
+    "Question: {question}\n",
+    "\n",
+    "Answer in the following language: {language}\n",
+    "\"\"\"\n",
+    "prompt = ChatPromptTemplate.from_template(template)\n",
+    "\n",
+    "chain = (\n",
+    "    {\n",
+    "        \"context\": itemgetter(\"question\") | retriever,\n",
+    "        \"question\": itemgetter(\"question\"),\n",
+    "        \"language\": itemgetter(\"language\"),\n",
+    "    }\n",
+    "    | prompt\n",
+    "    | model\n",
+    "    | StrOutputParser()\n",
+    ")\n",
+    "\n",
+    "chain.invoke({\"question\": \"where did harrison work\", \"language\": \"italian\"})"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "bc2f9847-39aa-4fe4-9049-3a8969bc4bce",
+   "metadata": {},
+   "source": [
+    "## Parallelize steps\n",
+    "\n",
+    "RunnableParallel (aka. RunnableMap) makes it easy to execute multiple Runnables in parallel, and to return the output of these Runnables as a map."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "id": "31f18442-f837-463f-bef4-8729368f5f8b",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "{'joke': AIMessage(content=\"Why don't bears wear shoes?\\n\\nBecause they have bear feet!\"),\n",
+       " 'poem': AIMessage(content=\"In the wild's embrace, bear roams free,\\nStrength and grace, a majestic decree.\")}"
+      ]
+     },
+     "execution_count": 1,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "from langchain.chat_models import ChatOpenAI\n",
+    "from langchain.prompts import ChatPromptTemplate\n",
+    "from langchain_core.runnables import RunnableParallel\n",
+    "\n",
+    "model = ChatOpenAI()\n",
+    "joke_chain = ChatPromptTemplate.from_template(\"tell me a joke about {topic}\") | model\n",
+    "poem_chain = (\n",
+    "    ChatPromptTemplate.from_template(\"write a 2-line poem about {topic}\") | model\n",
+    ")\n",
+    "\n",
+    "map_chain = RunnableParallel(joke=joke_chain, poem=poem_chain)\n",
+    "\n",
+    "map_chain.invoke({\"topic\": \"bear\"})"
   ]
  },
  {
@@ -114,7 +214,7 @@
   "source": [
    "## Parallelism\n",
    "\n",
-    "RunnableMaps are also useful for running independent processes in parallel, since each Runnable in the map is executed in parallel. For example, we can see our earlier `joke_chain`, `poem_chain` and `map_chain` all have about the same runtime, even though `map_chain` executes both of the other two."
+    "RunnableParallel are also useful for running independent processes in parallel, since each Runnable in the map is executed in parallel. For example, we can see our earlier `joke_chain`, `poem_chain` and `map_chain` all have about the same runtime, even though `map_chain` executes both of the other two."
   ]
  },
  {
@@ -194,7 +294,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.11.6"
  }
 },
 "nbformat": 4,
--- a/docs/docs/expression_language/how_to/message_history.ipynb
+++ b/docs/docs/expression_language/how_to/message_history.ipynb
@@ -132,8 +132,8 @@
    "from langchain.chat_models import ChatAnthropic\n",
    "from langchain.memory.chat_message_histories import RedisChatMessageHistory\n",
    "from langchain.prompts import ChatPromptTemplate, MessagesPlaceholder\n",
-    "from langchain.schema.chat_history import BaseChatMessageHistory\n",
-    "from langchain.schema.runnable.history import RunnableWithMessageHistory"
+    "from langchain_core.chat_history import BaseChatMessageHistory\n",
+    "from langchain_core.runnables.history import RunnableWithMessageHistory"
   ]
  },
  {
@@ -251,7 +251,10 @@
   "id": "da3d1feb-b4bb-4624-961c-7db2e1180df7",
   "metadata": {},
   "source": [
-    ":::tip [Langsmith trace](https://smith.langchain.com/public/863a003b-7ca8-4b24-be9e-d63ec13c106e/r)\n",
+    ":::tip\n",
+    "\n",
+    "[Langsmith trace](https://smith.langchain.com/public/863a003b-7ca8-4b24-be9e-d63ec13c106e/r)\n",
+    "\n",
    ":::"
   ]
  },
@@ -289,10 +292,10 @@
    }
   ],
   "source": [
-    "from langchain.schema.messages import HumanMessage\n",
-    "from langchain.schema.runnable import RunnableMap\n",
+    "from langchain_core.messages import HumanMessage\n",
+    "from langchain_core.runnables import RunnableParallel\n",
    "\n",
-    "chain = RunnableMap({\"output_message\": ChatAnthropic(model=\"claude-2\")})\n",
+    "chain = RunnableParallel({\"output_message\": ChatAnthropic(model=\"claude-2\")})\n",
    "chain_with_history = RunnableWithMessageHistory(\n",
    "    chain,\n",
    "    lambda session_id: RedisChatMessageHistory(session_id, url=REDIS_URL),\n",
@@ -334,7 +337,10 @@
   "id": "b898d1b1-11e6-4d30-a8dd-cc5e45533611",
   "metadata": {},
   "source": [
-    ":::tip [LangSmith trace](https://smith.langchain.com/public/f6c3e1d1-a49d-4955-a9fa-c6519df74fa7/r)\n",
+    ":::tip\n",
+    "\n",
+    "[LangSmith trace](https://smith.langchain.com/public/f6c3e1d1-a49d-4955-a9fa-c6519df74fa7/r)\n",
+    "\n",
    ":::"
   ]
  },
--- a/docs/docs/expression_language/how_to/passthrough.ipynb
+++ b/docs/docs/expression_language/how_to/passthrough.ipynb
@@ -0,0 +1,159 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "d35de667-0352-4bfb-a890-cebe7f676fe7",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "sidebar_position: 1\n",
+    "title: \"RunnablePassthrough: Passing data through\"\n",
+    "keywords: [RunnablePassthrough, RunnableParallel, LCEL]\n",
+    "---"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "b022ab74-794d-4c54-ad47-ff9549ddb9d2",
+   "metadata": {},
+   "source": [
+    "# Passing data through\n",
+    "\n",
+    "RunnablePassthrough allows to pass inputs unchanged or with the addition of extra keys. This typically is used in conjuction with RunnableParallel to assign data to a new key in the map. \n",
+    "\n",
+    "RunnablePassthrough() called on it's own, will simply take the input and pass it through. \n",
+    "\n",
+    "RunnablePassthrough called with assign (`RunnablePassthrough.assign(...)`) will take the input, and will add the extra arguments passed to the assign function. \n",
+    "\n",
+    "See the example below:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 11,
+   "id": "03988b8d-d54c-4492-8707-1594372cf093",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "{'passed': {'num': 1}, 'extra': {'num': 1, 'mult': 3}, 'modified': 2}"
+      ]
+     },
+     "execution_count": 11,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "from langchain_core.runnables import RunnableParallel, RunnablePassthrough\n",
+    "\n",
+    "runnable = RunnableParallel(\n",
+    "    passed=RunnablePassthrough(),\n",
+    "    extra=RunnablePassthrough.assign(mult=lambda x: x[\"num\"] * 3),\n",
+    "    modified=lambda x: x[\"num\"] + 1,\n",
+    ")\n",
+    "\n",
+    "runnable.invoke({\"num\": 1})"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "702c7acc-cd31-4037-9489-647df192fd7c",
+   "metadata": {},
+   "source": [
+    "As seen above, `passed` key was called with `RunnablePassthrough()` and so it simply passed on `{'num': 1}`. \n",
+    "\n",
+    "In the second line, we used `RunnablePastshrough.assign` with a lambda that multiplies the numerical value by 3. In this cased, `extra` was set with `{'num': 1, 'mult': 3}` which is the original value with the `mult` key added. \n",
+    "\n",
+    "Finally, we also set a third key in the map with `modified` which uses a labmda to set a single value adding 1 to the num, which resulted in `modified` key with the value of `2`."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "15187a3b-d666-4b9b-a258-672fc51fe0e2",
+   "metadata": {},
+   "source": [
+    "## Retrieval Example\n",
+    "\n",
+    "In the example below, we see a use case where we use RunnablePassthrough along with RunnableMap. "
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 17,
+   "id": "267d1460-53c1-4fdb-b2c3-b6a1eb7fccff",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "'Harrison worked at Kensho.'"
+      ]
+     },
+     "execution_count": 17,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "from langchain.chat_models import ChatOpenAI\n",
+    "from langchain.embeddings import OpenAIEmbeddings\n",
+    "from langchain.prompts import ChatPromptTemplate\n",
+    "from langchain.vectorstores import FAISS\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnablePassthrough\n",
+    "\n",
+    "vectorstore = FAISS.from_texts(\n",
+    "    [\"harrison worked at kensho\"], embedding=OpenAIEmbeddings()\n",
+    ")\n",
+    "retriever = vectorstore.as_retriever()\n",
+    "template = \"\"\"Answer the question based only on the following context:\n",
+    "{context}\n",
+    "\n",
+    "Question: {question}\n",
+    "\"\"\"\n",
+    "prompt = ChatPromptTemplate.from_template(template)\n",
+    "model = ChatOpenAI()\n",
+    "\n",
+    "retrieval_chain = (\n",
+    "    {\"context\": retriever, \"question\": RunnablePassthrough()}\n",
+    "    | prompt\n",
+    "    | model\n",
+    "    | StrOutputParser()\n",
+    ")\n",
+    "\n",
+    "retrieval_chain.invoke(\"where did harrison work?\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "392cd4c4-e7ed-4ab8-934d-f7a4eca55ee1",
+   "metadata": {},
+   "source": [
+    "Here the input to prompt is expected to be a map with keys \"context\" and \"question\". The user input is just the question. So we need to get the context using our retriever and passthrough the user input under the \"question\" key. In this case, the RunnablePassthrough allows us to pass on the user's question to the prompt and model. \n"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.11.6"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/docs/expression_language/how_to/routing.ipynb
+++ b/docs/docs/expression_language/how_to/routing.ipynb
@@ -1,5 +1,16 @@
 {
 "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "sidebar_position: 3\n",
+    "title: \"RunnableBranch: Dynamically route logic based on input\"\n",
+    "keywords: [RunnableBranch, LCEL]\n",
+    "---"
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "4b47436a",
@@ -42,7 +53,7 @@
   "source": [
    "from langchain.chat_models import ChatAnthropic\n",
    "from langchain.prompts import PromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser"
+    "from langchain_core.output_parsers import StrOutputParser"
   ]
  },
  {
@@ -63,7 +74,7 @@
    "chain = (\n",
    "    PromptTemplate.from_template(\n",
    "        \"\"\"Given the user question below, classify it as either being about `LangChain`, `Anthropic`, or `Other`.\n",
-    "                                     \n",
+    "\n",
    "Do not respond with more than one word.\n",
    "\n",
    "<question>\n",
@@ -153,7 +164,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.schema.runnable import RunnableBranch\n",
+    "from langchain_core.runnables import RunnableBranch\n",
    "\n",
    "branch = RunnableBranch(\n",
    "    (lambda x: \"anthropic\" in x[\"topic\"].lower(), anthropic_chain),\n",
@@ -268,7 +279,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.schema.runnable import RunnableLambda\n",
+    "from langchain_core.runnables import RunnableLambda\n",
    "\n",
    "full_chain = {\"topic\": chain, \"question\": lambda x: x[\"question\"]} | RunnableLambda(\n",
    "    route\n",
@@ -293,7 +304,7 @@
    }
   ],
   "source": [
-    "full_chain.invoke({\"question\": \"how do I use Anthroipc?\"})"
+    "full_chain.invoke({\"question\": \"how do I use Anthropic?\"})"
   ]
  },
  {
--- a/docs/docs/expression_language/index.mdx
+++ b/docs/docs/expression_language/index.mdx
@@ -20,7 +20,7 @@ Whenever your LCEL chains have steps that can be executed in parallel (eg if you
 Configure retries and fallbacks for any part of your LCEL chain. This is a great way to make your chains more reliable at scale. We’re currently working on adding streaming support for retries/fallbacks, so you can get the added reliability without any latency cost.

 **Access intermediate results**
-For more complex chains it’s often very useful to access the results of intermediate steps even before the final output is produced. This can be used let end-users know something is happening, or even just to debug your chain. You can stream intermediate results, and it’s available on every [LangServe](/docs/langserve) server.
+For more complex chains it’s often very useful to access the results of intermediate steps even before the final output is produced. This can be used to let end-users know something is happening, or even just to debug your chain. You can stream intermediate results, and it’s available on every [LangServe](/docs/langserve) server.

 **Input and output schemas**
 Input and output schemas give every LCEL chain Pydantic and JSONSchema schemas inferred from the structure of your chain. This can be used for validation of inputs and outputs, and is an integral part of LangServe.
@@ -30,4 +30,4 @@ As your chains get more and more complex, it becomes increasingly important to u
 With LCEL, **all** steps are automatically logged to [LangSmith](/docs/langsmith/) for maximum observability and debuggability.

 **Seamless LangServe deployment integration**
-Any chain created with LCEL can be easily deployed using [LangServe](/docs/langserve).
+Any chain created with LCEL can be easily deployed using [LangServe](/docs/langserve).
--- a/docs/docs/expression_language/interface.ipynb
+++ b/docs/docs/expression_language/interface.ipynb
@@ -6,7 +6,7 @@
   "metadata": {},
   "source": [
    "---\n",
-    "sidebar_position: 0\n",
+    "sidebar_position: 1\n",
    "title: Interface\n",
    "---"
   ]
@@ -16,7 +16,7 @@
   "id": "9a9acd2e",
   "metadata": {},
   "source": [
-    "To make it as easy as possible to create custom chains, we've implemented a [\"Runnable\"](https://api.python.langchain.com/en/latest/schema/langchain.schema.runnable.base.Runnable.html#langchain.schema.runnable.base.Runnable) protocol. The `Runnable` protocol is implemented for most components. \n",
+    "To make it as easy as possible to create custom chains, we've implemented a [\"Runnable\"](https://api.python.langchain.com/en/stable/runnables/langchain_core.runnables.base.Runnable.html#langchain_core.runnables.base.Runnable) protocol. The `Runnable` protocol is implemented for most components. \n",
    "This is a standard interface, which makes it easy to define custom chains as well as invoke them in a standard way. \n",
    "The standard interface includes:\n",
    "\n",
@@ -660,9 +660,9 @@
   ],
   "source": [
    "from langchain.embeddings import OpenAIEmbeddings\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.schema.runnable import RunnablePassthrough\n",
    "from langchain.vectorstores import FAISS\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import RunnablePassthrough\n",
    "\n",
    "template = \"\"\"Answer the question based only on the following context:\n",
    "{context}\n",
@@ -920,7 +920,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.schema.runnable import RunnableParallel\n",
+    "from langchain_core.runnables import RunnableParallel\n",
    "\n",
    "chain1 = ChatPromptTemplate.from_template(\"tell me a joke about {topic}\") | model\n",
    "chain2 = (\n",
--- a/docs/docs/expression_language/why.ipynb
+++ b/docs/docs/expression_language/why.ipynb
--- a/docs/docs/get_started/installation.mdx
+++ b/docs/docs/get_started/installation.mdx
@@ -29,6 +29,20 @@ If you want to install from source, you can do so by cloning the repo and be sur
 pip install -e .
 ```

+## LangChain community
+The `langchain-community` package contains third-party integrations. It is automatically installed by `langchain`, but can also be used separately. Install with:
+
+```bash
+pip install langchain-community
+```
+
+## LangChain core
+The `langchain-core` package contains base abstractions that the rest of the LangChain ecosystem uses, along with the LangChain Expression Language. It is automatically installed by `langchain`, but can also be used separately. Install with:
+
+```bash
+pip install langchain-core
+```
+
 ## LangChain experimental
 The `langchain-experimental` package holds experimental LangChain code, intended for research and experimental uses.
 Install with:
@@ -61,4 +75,4 @@ If not using LangChain, install with:

 ```bash
 pip install langsmith
-```
+```
--- a/docs/docs/get_started/introduction.mdx
+++ b/docs/docs/get_started/introduction.mdx
@@ -29,6 +29,11 @@ The main value props of the LangChain packages are:

 Off-the-shelf chains make it easy to get started. Components make it easy to customize existing chains and build new ones.

+The LangChain libraries themselves are made up of several different packages.
+- **`langchain-core`**: Base abstractions and LangChain Expression Language.
+- **`langchain-community`**: Third party integrations.
+- **`langchain`**: Chains, agents, and retrieval strategies that make up an application's cognitive architecture.
+
 ## Get started

 [Here’s](/docs/get_started/installation) how to install LangChain, set up your environment, and start building.
@@ -79,7 +84,7 @@ Walkthroughs and techniques for common end-to-end use cases, like:
 ### [Integrations](/docs/integrations/providers/)
 LangChain is part of a rich ecosystem of tools that integrate with our framework and build on top of it. Check out our growing list of [integrations](/docs/integrations/providers/).

-### [Guides](/docs/guides/adapters/openai)
+### [Guides](/docs/guides/guides/debugging)
 Best practices for developing with LangChain.

 ### [API reference](https://api.python.langchain.com)
--- a/docs/docs/get_started/quickstart.mdx
+++ b/docs/docs/get_started/quickstart.mdx
@@ -344,7 +344,7 @@ category_chain = chat_prompt | ChatOpenAI() | CommaSeparatedListOutputParser()
 app = FastAPI(
  title="LangChain Server",
  version="1.0",
-  description="A simple api server using Langchain's Runnable interfaces",
+  description="A simple API server using LangChain's Runnable interfaces",
 )

 # 3. Adding chain route
--- a/docs/docs/guides/debugging.md
+++ b/docs/docs/guides/debugging.md
@@ -12,7 +12,7 @@ Platforms with tracing capabilities like [LangSmith](/docs/langsmith/) and [Wand

 For anyone building production-grade LLM applications, we highly recommend using a platform like this.

-![LangSmith run](/img/run_details.png)
+![LangSmith run](../../static/img/run_details.png)

 ## `set_debug` and `set_verbose`

--- a/docs/docs/guides/evaluation/string/json.ipynb
+++ b/docs/docs/guides/evaluation/string/json.ipynb
@@ -5,13 +5,13 @@
   "id": "465cfbef-5bba-4b3b-b02d-fe2eba39db17",
   "metadata": {},
   "source": [
-    "# Evaluating Structured Output: JSON Evaluators\n",
+    "# JSON Evaluators\n",
    "\n",
-    "Evaluating [extraction](https://python.langchain.com/docs/use_cases/extraction) and function calling applications often comes down to validation that the LLM's string output can be parsed correctly and how it compares to a reference object. The following JSON validators provide provide functionality to check your model's output in a consistent way.\n",
+    "Evaluating [extraction](https://python.langchain.com/docs/use_cases/extraction) and function calling applications often comes down to validation that the LLM's string output can be parsed correctly and how it compares to a reference object. The following `JSON` validators provide functionality to check your model's output consistently.\n",
    "\n",
    "## JsonValidityEvaluator\n",
    "\n",
-    "The `JsonValidityEvaluator` is designed to check the validity of a JSON string prediction.\n",
+    "The `JsonValidityEvaluator` is designed to check the validity of a `JSON` string prediction.\n",
    "\n",
    "### Overview:\n",
    "- **Requires Input?**: No\n",
@@ -377,7 +377,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.2"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/docs/guides/evaluation/string/string_distance.ipynb
+++ b/docs/docs/guides/evaluation/string/string_distance.ipynb
@@ -8,9 +8,12 @@
    "# String Distance\n",
    "[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/langchain-ai/langchain/blob/master/docs/docs/guides/evaluation/string/string_distance.ipynb)\n",
    "\n",
-    "One of the simplest ways to compare an LLM or chain's string output against a reference label is by using string distance measurements such as Levenshtein or postfix distance.  This can be used alongside approximate/fuzzy matching criteria for very basic unit testing.\n",
+    ">In information theory, linguistics, and computer science, the [Levenshtein distance (Wikipedia)](https://en.wikipedia.org/wiki/Levenshtein_distance) is a string metric for measuring the difference between two sequences. Informally, the Levenshtein distance between two words is the minimum number of single-character edits (insertions, deletions or substitutions) required to change one word into the other. It is named after the Soviet mathematician Vladimir Levenshtein, who considered this distance in 1965.\n",
    "\n",
-    "This can be accessed using the `string_distance` evaluator, which uses distance metric's from the [rapidfuzz](https://github.com/maxbachmann/RapidFuzz) library.\n",
+    "\n",
+    "One of the simplest ways to compare an LLM or chain's string output against a reference label is by using string distance measurements such as `Levenshtein` or `postfix` distance.  This can be used alongside approximate/fuzzy matching criteria for very basic unit testing.\n",
+    "\n",
+    "This can be accessed using the `string_distance` evaluator, which uses distance metrics from the [rapidfuzz](https://github.com/maxbachmann/RapidFuzz) library.\n",
    "\n",
    "**Note:** The returned scores are _distances_, meaning lower is typically \"better\".\n",
    "\n",
@@ -213,9 +216,9 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.2"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 5
-}
+}
--- a/docs/docs/guides/fallbacks.ipynb
+++ b/docs/docs/guides/fallbacks.ipynb
@@ -28,7 +28,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 18,
+   "execution_count": 1,
   "id": "d3e893bf",
   "metadata": {},
   "outputs": [],
@@ -46,19 +46,24 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 21,
+   "execution_count": 2,
   "id": "dfdd8bf5",
   "metadata": {},
   "outputs": [],
   "source": [
    "from unittest.mock import patch\n",
    "\n",
-    "from openai.error import RateLimitError"
+    "import httpx\n",
+    "from openai import RateLimitError\n",
+    "\n",
+    "request = httpx.Request(\"GET\", \"/\")\n",
+    "response = httpx.Response(200, request=request)\n",
+    "error = RateLimitError(\"rate limit\", response=response, body=\"\")"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 24,
+   "execution_count": 3,
   "id": "e6fdffc1",
   "metadata": {},
   "outputs": [],
@@ -71,7 +76,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 27,
+   "execution_count": 4,
   "id": "584461ab",
   "metadata": {},
   "outputs": [
@@ -85,10 +90,10 @@
   ],
   "source": [
    "# Let's use just the OpenAI LLm first, to show that we run into an error\n",
-    "with patch(\"openai.ChatCompletion.create\", side_effect=RateLimitError()):\n",
+    "with patch(\"openai.resources.chat.completions.Completions.create\", side_effect=error):\n",
    "    try:\n",
    "        print(openai_llm.invoke(\"Why did the chicken cross the road?\"))\n",
-    "    except:\n",
+    "    except RateLimitError:\n",
    "        print(\"Hit error\")"
   ]
  },
@@ -108,10 +113,10 @@
   ],
   "source": [
    "# Now let's try with fallbacks to Anthropic\n",
-    "with patch(\"openai.ChatCompletion.create\", side_effect=RateLimitError()):\n",
+    "with patch(\"openai.resources.chat.completions.Completions.create\", side_effect=error):\n",
    "    try:\n",
    "        print(llm.invoke(\"Why did the chicken cross the road?\"))\n",
-    "    except:\n",
+    "    except RateLimitError:\n",
    "        print(\"Hit error\")"
   ]
  },
@@ -150,10 +155,10 @@
    "    ]\n",
    ")\n",
    "chain = prompt | llm\n",
-    "with patch(\"openai.ChatCompletion.create\", side_effect=RateLimitError()):\n",
+    "with patch(\"openai.resources.chat.completions.Completions.create\", side_effect=error):\n",
    "    try:\n",
    "        print(chain.invoke({\"animal\": \"kangaroo\"}))\n",
-    "    except:\n",
+    "    except RateLimitError:\n",
    "        print(\"Hit error\")"
   ]
  },
@@ -176,7 +181,7 @@
   "source": [
    "# First let's create a chain with a ChatModel\n",
    "# We add in a string output parser here so the outputs between the two are the same type\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
    "\n",
    "chat_prompt = ChatPromptTemplate.from_messages(\n",
    "    [\n",
@@ -431,7 +436,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.10.12"
+   "version": "3.11.5"
  }
 },
 "nbformat": 4,
--- a/docs/docs/guides/local_llms.ipynb
+++ b/docs/docs/guides/local_llms.ipynb
@@ -32,7 +32,7 @@
    "1. `Base model`: What is the base-model and how was it trained?\n",
    "2. `Fine-tuning approach`: Was the base-model fine-tuned and, if so, what [set of instructions](https://cameronrwolfe.substack.com/p/beyond-llama-the-power-of-open-llms#%C2%A7alpaca-an-instruction-following-llama-model) was used?\n",
    "\n",
-    "![Image description](/img/OSS_LLM_overview.png)\n",
+    "![Image description](../../static/img/OSS_LLM_overview.png)\n",
    "\n",
    "The relative performance of these models can be assessed using several leaderboards, including:\n",
    "\n",
@@ -55,7 +55,7 @@
    "\n",
    "In particular, see [this excellent post](https://finbarr.ca/how-is-llama-cpp-possible/) on the importance of quantization.\n",
    "\n",
-    "![Image description](/img/llama-memory-weights.png)\n",
+    "![Image description](../../static/img/llama-memory-weights.png)\n",
    "\n",
    "With less precision, we radically decrease the memory needed to store the LLM in memory.\n",
    "\n",
@@ -63,13 +63,13 @@
    "\n",
    "A Mac M2 Max is 5-6x faster than a M1 for inference due to the larger GPU memory bandwidth.\n",
    "\n",
-    "![Image description](/img/llama_t_put.png)\n",
+    "![Image description](../../static/img/llama_t_put.png)\n",
    "\n",
    "## Quickstart\n",
    "\n",
    "[`Ollama`](https://ollama.ai/) is one way to easily run inference on macOS.\n",
    " \n",
-    "The instructions [here](docs/integrations/llms/ollama) provide details, which we summarize:\n",
+    "The instructions [here](https://github.com/jmorganca/ollama?tab=readme-ov-file#ollama) provide details, which we summarize:\n",
    " \n",
    "* [Download and run](https://ollama.ai/download) the app\n",
    "* From command line, fetch a model from this [list of options](https://github.com/jmorganca/ollama): e.g., `ollama pull llama2`\n",
@@ -197,10 +197,10 @@
    "\n",
    "### Ollama\n",
    "\n",
-    "With [Ollama](docs/integrations/llms/ollama), fetch a model via `ollama pull <model family>:<tag>`:\n",
+    "With [Ollama](https://github.com/jmorganca/ollama), fetch a model via `ollama pull <model family>:<tag>`:\n",
    "\n",
    "* E.g., for Llama-7b: `ollama pull llama2` will download the most basic version of the model (e.g., smallest # parameters and 4 bit quantization)\n",
-    "* We can also specify a particular version from the [model list](https://github.com/jmorganca/ollama), e.g., `ollama pull llama2:13b`\n",
+    "* We can also specify a particular version from the [model list](https://github.com/jmorganca/ollama?tab=readme-ov-file#model-library), e.g., `ollama pull llama2:13b`\n",
    "* See the full set of parameters on the [API reference page](https://api.python.langchain.com/en/latest/llms/langchain.llms.ollama.Ollama.html)"
   ]
  },
@@ -284,6 +284,8 @@
   "metadata": {},
   "outputs": [],
   "source": [
+    "from langchain.callbacks.manager import CallbackManager\n",
+    "from langchain.callbacks.streaming_stdout import StreamingStdOutCallbackHandler\n",
    "from langchain.llms import LlamaCpp\n",
    "\n",
    "llm = LlamaCpp(\n",
@@ -606,7 +608,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.10.1"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/docs/guides/privacy/presidio_data_anonymization/index.ipynb
+++ b/docs/docs/guides/privacy/presidio_data_anonymization/index.ipynb
@@ -8,6 +8,8 @@
    "\n",
    "[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/langchain-ai/langchain/blob/master/docs/docs/guides/privacy/presidio_data_anonymization/index.ipynb)\n",
    "\n",
+    ">[Presidio](https://microsoft.github.io/presidio/) (Origin from Latin praesidium ‘protection, garrison’) helps to ensure sensitive data is properly managed and governed. It provides fast identification and anonymization modules for private entities in text and images such as credit card numbers, names, locations, social security numbers, bitcoin wallets, US phone numbers, financial data and more.\n",
+    "\n",
    "## Use case\n",
    "\n",
    "Data anonymization is crucial before passing information to a language model like GPT-4 because it helps protect privacy and maintain confidentiality. If data is not anonymized, sensitive information such as names, addresses, contact numbers, or other identifiers linked to specific individuals could potentially be learned and misused. Hence, by obscuring or removing this personally identifiable information (PII), data can be used freely without compromising individuals' privacy rights or breaching data protection laws and regulations.\n",
@@ -530,7 +532,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.4"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/docs/guides/privacy/presidio_data_anonymization/qa_privacy_protection.ipynb
+++ b/docs/docs/guides/privacy/presidio_data_anonymization/qa_privacy_protection.ipynb
@@ -60,7 +60,7 @@
    "\n",
    " Firstly, the wallet contains my credit card with number 4111 1111 1111 1111, which is registered under my name and linked to my bank account, PL61109010140000071219812874.\n",
    "\n",
-    " Additionally, the wallet had a driver's license - DL No: 999000680 issued to my name. It also houses my Social Security Number, 602-76-4532. \n",
+    " Additionally, the wallet had a driver's license - DL No: 999000680 issued to my name. It also houses my Social Security Number, 602-76-4532.\n",
    "\n",
    " What's more, I had my polish identity card there, with the number ABC123456.\n",
    "\n",
@@ -68,7 +68,7 @@
    "\n",
    " In case any information arises regarding my wallet, please reach out to me on my phone number, 999-888-7777, or through my personal email, johndoe@example.com.\n",
    "\n",
-    " Please consider this information to be highly confidential and respect my privacy. \n",
+    " Please consider this information to be highly confidential and respect my privacy.\n",
    "\n",
    " The bank has been informed about the stolen credit card and necessary actions have been taken from their end. They will be reachable at their official email, support@bankname.com.\n",
    " My representative there is Victoria Cherry (her business phone: 987-654-3210).\n",
@@ -666,8 +666,12 @@
    "\n",
    "from langchain.chat_models.openai import ChatOpenAI\n",
    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.schema.output_parser import StrOutputParser\n",
-    "from langchain.schema.runnable import RunnableLambda, RunnableMap, RunnablePassthrough\n",
+    "from langchain_core.output_parsers import StrOutputParser\n",
+    "from langchain_core.runnables import (\n",
+    "    RunnableLambda,\n",
+    "    RunnableParallel,\n",
+    "    RunnablePassthrough,\n",
+    ")\n",
    "\n",
    "# 6. Create anonymizer chain\n",
    "template = \"\"\"Answer the question based only on the following context:\n",
@@ -680,7 +684,7 @@
    "model = ChatOpenAI(temperature=0.3)\n",
    "\n",
    "\n",
-    "_inputs = RunnableMap(\n",
+    "_inputs = RunnableParallel(\n",
    "    question=RunnablePassthrough(),\n",
    "    # It is important to remember about question anonymization\n",
    "    anonymized_question=RunnableLambda(anonymizer.anonymize),\n",
@@ -882,7 +886,7 @@
    "\n",
    "\n",
    "chain_with_deanonymization = (\n",
-    "    RunnableMap({\"question\": RunnablePassthrough()})\n",
+    "    RunnableParallel({\"question\": RunnablePassthrough()})\n",
    "    | {\n",
    "        \"context\": itemgetter(\"question\")\n",
    "        | retriever\n",
--- a/docs/docs/guides/pydantic_compatibility.md
+++ b/docs/docs/guides/pydantic_compatibility.md
@@ -73,7 +73,7 @@ CustomTool(
 **YES**

 ```python
-from langchain.tools.base import Tool
+from langchain_core.tools import Tool
 from pydantic.v1 import BaseModel, Field # <-- Uses v1 namespace

 class CalculatorInput(BaseModel):
@@ -90,7 +90,7 @@ Tool.from_function( # <-- tool uses v1 namespace
 **NO**

 ```python
-from langchain.tools.base import Tool
+from langchain_core.tools import Tool
 from pydantic import BaseModel, Field # <-- Uses v2 namespace

 class CalculatorInput(BaseModel):
--- a/docs/docs/guides/safety/amazon_comprehend_chain.ipynb
+++ b/docs/docs/guides/safety/amazon_comprehend_chain.ipynb
@@ -7,7 +7,9 @@
   "source": [
    "# Amazon Comprehend Moderation Chain\n",
    "\n",
-    "This notebook shows how to use [Amazon Comprehend](https://aws.amazon.com/comprehend/) to detect and handle `Personally Identifiable Information` (`PII`) and toxicity.\n",
+    ">[Amazon Comprehend](https://aws.amazon.com/comprehend/) is a natural-language processing (NLP) service that uses machine learning to uncover valuable insights and connections in text.\n",
+    "\n",
+    "This notebook shows how to use `Amazon Comprehend` to detect and handle `Personally Identifiable Information` (`PII`) and toxicity.\n",
    "\n",
    "## Setting up"
   ]
@@ -1417,7 +1419,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/docs/guides/safety/hugging_face_prompt_injection.ipynb
+++ b/docs/docs/guides/safety/hugging_face_prompt_injection.ipynb
@@ -8,7 +8,7 @@
    "# Hugging Face prompt injection identification\n",
    "\n",
    "This notebook shows how to prevent prompt injection attacks using the text classification model from `HuggingFace`.\n",
-    "It exploits the *deberta* model trained to identify prompt injections: https://huggingface.co/deepset/deberta-v3-base-injection"
+    "By default it uses a *deberta* model trained to identify prompt injections. In this walkthrough we'll use https://huggingface.co/laiyer/deberta-v3-base-prompt-injection."
   ]
  },
  {
@@ -21,19 +21,37 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 1,
+   "execution_count": null,
   "id": "aea25588-3c3f-4506-9094-221b3a0d519b",
   "metadata": {},
   "outputs": [
    {
     "data": {
+      "application/vnd.jupyter.widget-view+json": {
+       "model_id": "58ab3557623a495d8cc3c3e32a61938f",
+       "version_major": 2,
+       "version_minor": 0
+      },
      "text/plain": [
-       "'hugging_face_injection_identifier'"
+       "Downloading config.json:   0%|          | 0.00/994 [00:00<?, ?B/s]"
      ]
     },
-     "execution_count": 1,
     "metadata": {},
-     "output_type": "execute_result"
+     "output_type": "display_data"
+    },
+    {
+     "data": {
+      "application/vnd.jupyter.widget-view+json": {
+       "model_id": "3bf062f02d304ab5a485a2a228b4cf41",
+       "version_major": 2,
+       "version_minor": 0
+      },
+      "text/plain": [
+       "Downloading model.safetensors:   0%|          | 0.00/738M [00:00<?, ?B/s]"
+      ]
+     },
+     "metadata": {},
+     "output_type": "display_data"
    }
   ],
   "source": [
@@ -41,7 +59,10 @@
    "    HuggingFaceInjectionIdentifier,\n",
    ")\n",
    "\n",
-    "injection_identifier = HuggingFaceInjectionIdentifier()\n",
+    "# Using https://huggingface.co/laiyer/deberta-v3-base-prompt-injection\n",
+    "injection_identifier = HuggingFaceInjectionIdentifier(\n",
+    "    model=\"laiyer/deberta-v3-base-prompt-injection\"\n",
+    ")\n",
    "injection_identifier.name"
   ]
  },
@@ -299,9 +320,9 @@
 ],
 "metadata": {
  "kernelspec": {
-   "display_name": "Python 3 (ipykernel)",
+   "display_name": "poetry-venv",
   "language": "python",
-   "name": "python3"
+   "name": "poetry-venv"
  },
  "language_info": {
   "codemirror_mode": {
@@ -313,7 +334,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.10.12"
+   "version": "3.9.1"
  }
 },
 "nbformat": 4,
--- a/docs/docs/integrations/adapters/_category_.yml
+++ b/docs/docs/integrations/adapters/_category_.yml
--- a/docs/docs/integrations/adapters/openai-old.ipynb
+++ b/docs/docs/integrations/adapters/openai-old.ipynb
@@ -5,7 +5,9 @@
   "id": "700a516b",
   "metadata": {},
   "source": [
-    "# OpenAI Adapter\n",
+    "# OpenAI Adapter(Old)\n",
+    "\n",
+    "**Please ensure OpenAI library is less than 1.0.0; otherwise, refer to the newer doc [OpenAI Adapter](./openai).**\n",
    "\n",
    "A lot of people get started with OpenAI but want to explore other models. LangChain's integrations with many model providers make this easy to do so. While LangChain has it's own message and model APIs, we've also made it as easy as possible to explore other models by exposing an adapter to adapt LangChain models to the OpenAI api.\n",
    "\n",
@@ -49,18 +51,6 @@
    "Original OpenAI call"
   ]
  },
-  {
-   "cell_type": "code",
-   "execution_count": 14,
-   "id": "e1d27dfa",
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "result = openai.ChatCompletion.create(\n",
-    "    messages=messages, model=\"gpt-3.5-turbo\", temperature=0\n",
-    ")"
-   ]
-  },
  {
   "cell_type": "code",
   "execution_count": 15,
@@ -79,6 +69,9 @@
    }
   ],
   "source": [
+    "result = openai.ChatCompletion.create(\n",
+    "    messages=messages, model=\"gpt-3.5-turbo\", temperature=0\n",
+    ")\n",
    "result[\"choices\"][0][\"message\"].to_dict_recursive()"
   ]
  },
@@ -90,18 +83,6 @@
    "LangChain OpenAI wrapper call"
   ]
  },
-  {
-   "cell_type": "code",
-   "execution_count": 16,
-   "id": "87c2d515",
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "lc_result = lc_openai.ChatCompletion.create(\n",
-    "    messages=messages, model=\"gpt-3.5-turbo\", temperature=0\n",
-    ")"
-   ]
-  },
  {
   "cell_type": "code",
   "execution_count": 17,
@@ -120,6 +101,9 @@
    }
   ],
   "source": [
+    "lc_result = lc_openai.ChatCompletion.create(\n",
+    "    messages=messages, model=\"gpt-3.5-turbo\", temperature=0\n",
+    ")\n",
    "lc_result[\"choices\"][0][\"message\"]"
   ]
  },
@@ -131,18 +115,6 @@
    "Swapping out model providers"
   ]
  },
-  {
-   "cell_type": "code",
-   "execution_count": 18,
-   "id": "7a2c011c",
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "lc_result = lc_openai.ChatCompletion.create(\n",
-    "    messages=messages, model=\"claude-2\", temperature=0, provider=\"ChatAnthropic\"\n",
-    ")"
-   ]
-  },
  {
   "cell_type": "code",
   "execution_count": 19,
@@ -161,6 +133,9 @@
    }
   ],
   "source": [
+    "lc_result = lc_openai.ChatCompletion.create(\n",
+    "    messages=messages, model=\"claude-2\", temperature=0, provider=\"ChatAnthropic\"\n",
+    ")\n",
    "lc_result[\"choices\"][0][\"message\"]"
   ]
  },
@@ -302,7 +277,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.10.1"
+   "version": "3.11.5"
  }
 },
 "nbformat": 4,
--- a/docs/docs/integrations/adapters/openai.ipynb
+++ b/docs/docs/integrations/adapters/openai.ipynb
@@ -0,0 +1,318 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "700a516b",
+   "metadata": {},
+   "source": [
+    "# OpenAI Adapter\n",
+    "\n",
+    "**Please ensure OpenAI library is version 1.0.0 or higher; otherwise, refer to the older doc [OpenAI Adapter(Old)](./openai-old).**\n",
+    "\n",
+    "A lot of people get started with OpenAI but want to explore other models. LangChain's integrations with many model providers make this easy to do so. While LangChain has it's own message and model APIs, we've also made it as easy as possible to explore other models by exposing an adapter to adapt LangChain models to the OpenAI api.\n",
+    "\n",
+    "At the moment this only deals with output and does not return other information (token counts, stop reasons, etc)."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "id": "6017f26a",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import openai\n",
+    "from langchain.adapters import openai as lc_openai"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "b522ceda",
+   "metadata": {},
+   "source": [
+    "## chat.completions.create"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "id": "1d22eb61",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "messages = [{\"role\": \"user\", \"content\": \"hi\"}]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "d550d3ad",
+   "metadata": {},
+   "source": [
+    "Original OpenAI call"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "id": "012d81ae",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "{'content': 'Hello! How can I assist you today?',\n",
+       " 'role': 'assistant',\n",
+       " 'function_call': None,\n",
+       " 'tool_calls': None}"
+      ]
+     },
+     "execution_count": 3,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "result = openai.chat.completions.create(\n",
+    "    messages=messages, model=\"gpt-3.5-turbo\", temperature=0\n",
+    ")\n",
+    "result.choices[0].message.model_dump()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "db5b5500",
+   "metadata": {},
+   "source": [
+    "LangChain OpenAI wrapper call"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "id": "c67a5ac8",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "{'role': 'assistant', 'content': 'Hello! How can I help you today?'}"
+      ]
+     },
+     "execution_count": 4,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "lc_result = lc_openai.chat.completions.create(\n",
+    "    messages=messages, model=\"gpt-3.5-turbo\", temperature=0\n",
+    ")\n",
+    "\n",
+    "lc_result.choices[0].message  # Attribute access"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "id": "37a6e461-8608-47f6-ac45-12ad753c062a",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "{'role': 'assistant', 'content': 'Hello! How can I help you today?'}"
+      ]
+     },
+     "execution_count": 5,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "lc_result[\"choices\"][0][\"message\"]  # Also compatible with index access"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "034ba845",
+   "metadata": {},
+   "source": [
+    "Swapping out model providers"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "id": "f7c94827",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "{'role': 'assistant', 'content': 'Hello! How can I assist you today?'}"
+      ]
+     },
+     "execution_count": 6,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "lc_result = lc_openai.chat.completions.create(\n",
+    "    messages=messages, model=\"claude-2\", temperature=0, provider=\"ChatAnthropic\"\n",
+    ")\n",
+    "lc_result.choices[0].message"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "cb3f181d",
+   "metadata": {},
+   "source": [
+    "## chat.completions.stream"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "f7b8cd18",
+   "metadata": {},
+   "source": [
+    "Original OpenAI call"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 7,
+   "id": "fd8cb1ea",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "{'content': '', 'function_call': None, 'role': 'assistant', 'tool_calls': None}\n",
+      "{'content': 'Hello', 'function_call': None, 'role': None, 'tool_calls': None}\n",
+      "{'content': '!', 'function_call': None, 'role': None, 'tool_calls': None}\n",
+      "{'content': ' How', 'function_call': None, 'role': None, 'tool_calls': None}\n",
+      "{'content': ' can', 'function_call': None, 'role': None, 'tool_calls': None}\n",
+      "{'content': ' I', 'function_call': None, 'role': None, 'tool_calls': None}\n",
+      "{'content': ' assist', 'function_call': None, 'role': None, 'tool_calls': None}\n",
+      "{'content': ' you', 'function_call': None, 'role': None, 'tool_calls': None}\n",
+      "{'content': ' today', 'function_call': None, 'role': None, 'tool_calls': None}\n",
+      "{'content': '?', 'function_call': None, 'role': None, 'tool_calls': None}\n",
+      "{'content': None, 'function_call': None, 'role': None, 'tool_calls': None}\n"
+     ]
+    }
+   ],
+   "source": [
+    "for c in openai.chat.completions.create(\n",
+    "    messages=messages, model=\"gpt-3.5-turbo\", temperature=0, stream=True\n",
+    "):\n",
+    "    print(c.choices[0].delta.model_dump())"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "0b2a076b",
+   "metadata": {},
+   "source": [
+    "LangChain OpenAI wrapper call"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 8,
+   "id": "9521218c",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "{'role': 'assistant', 'content': ''}\n",
+      "{'content': 'Hello'}\n",
+      "{'content': '!'}\n",
+      "{'content': ' How'}\n",
+      "{'content': ' can'}\n",
+      "{'content': ' I'}\n",
+      "{'content': ' assist'}\n",
+      "{'content': ' you'}\n",
+      "{'content': ' today'}\n",
+      "{'content': '?'}\n",
+      "{}\n"
+     ]
+    }
+   ],
+   "source": [
+    "for c in lc_openai.chat.completions.create(\n",
+    "    messages=messages, model=\"gpt-3.5-turbo\", temperature=0, stream=True\n",
+    "):\n",
+    "    print(c.choices[0].delta)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "0fc39750",
+   "metadata": {},
+   "source": [
+    "Swapping out model providers"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "id": "68f0214e",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "{'role': 'assistant', 'content': ''}\n",
+      "{'content': 'Hello'}\n",
+      "{'content': '!'}\n",
+      "{'content': ' How'}\n",
+      "{'content': ' can'}\n",
+      "{'content': ' I'}\n",
+      "{'content': ' assist'}\n",
+      "{'content': ' you'}\n",
+      "{'content': ' today'}\n",
+      "{'content': '?'}\n",
+      "{}\n"
+     ]
+    }
+   ],
+   "source": [
+    "for c in lc_openai.chat.completions.create(\n",
+    "    messages=messages,\n",
+    "    model=\"claude-2\",\n",
+    "    temperature=0,\n",
+    "    stream=True,\n",
+    "    provider=\"ChatAnthropic\",\n",
+    "):\n",
+    "    print(c[\"choices\"][0][\"delta\"])"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.11.5"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/docs/integrations/callbacks/argilla.ipynb
+++ b/docs/docs/integrations/callbacks/argilla.ipynb
@@ -7,8 +7,6 @@
   "source": [
    "# Argilla\n",
    "\n",
-    "![Argilla - Open-source data platform for LLMs](https://argilla.io/og.png)\n",
-    "\n",
    ">[Argilla](https://argilla.io/) is an open-source data curation platform for LLMs.\n",
    "> Using Argilla, everyone can build robust language models through faster data curation \n",
    "> using both human and machine feedback. We provide support for each step in the MLOps cycle, \n",
@@ -410,7 +408,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.3"
+   "version": "3.10.12"
  },
  "vscode": {
   "interpreter": {
--- a/docs/docs/integrations/callbacks/context.ipynb
+++ b/docs/docs/integrations/callbacks/context.ipynb
@@ -7,12 +7,9 @@
   "source": [
    "# Context\n",
    "\n",
-    "![Context - User Analytics for LLM Powered Products](https://with.context.ai/langchain.png)\n",
+    ">[Context](https://context.ai/) provides user analytics for LLM-powered products and features.\n",
    "\n",
-    "[Context](https://context.ai/) provides user analytics for LLM powered products and features.\n",
-    "\n",
-    "With Context, you can start understanding your users and improving their experiences in less than 30 minutes.\n",
-    "\n"
+    "With `Context`, you can start understanding your users and improving their experiences in less than 30 minutes.\n"
   ]
  },
  {
@@ -89,11 +86,9 @@
   "metadata": {},
   "source": [
    "## Usage\n",
-    "### Using the Context callback within a chat model\n",
+    "### Context callback within a chat model\n",
    "\n",
-    "The Context callback handler can be used to directly record transcripts between users and AI assistants.\n",
-    "\n",
-    "#### Example"
+    "The Context callback handler can be used to directly record transcripts between users and AI assistants."
   ]
  },
  {
@@ -132,7 +127,7 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "### Using the Context callback within Chains\n",
+    "### Context callback within Chains\n",
    "\n",
    "The Context callback handler can also be used to record the inputs and outputs of chains. Note that intermediate steps of the chain are not recorded - only the starting inputs and final outputs.\n",
    "\n",
@@ -149,9 +144,7 @@
    ">handler = ContextCallbackHandler(token)\n",
    ">chat = ChatOpenAI(temperature=0.9, callbacks=[callback])\n",
    ">chain = LLMChain(llm=chat, prompt=chat_prompt_template, callbacks=[callback])\n",
-    ">```\n",
-    "\n",
-    "#### Example"
+    ">```\n"
   ]
  },
  {
@@ -203,7 +196,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.10.12"
  },
  "vscode": {
   "interpreter": {
--- a/docs/docs/integrations/callbacks/infino.ipynb
+++ b/docs/docs/integrations/callbacks/infino.ipynb
@@ -7,12 +7,14 @@
   "source": [
    "# Infino\n",
    "\n",
+    ">[Infino](https://github.com/infinohq/infino) is a scalable telemetry store designed for logs, metrics, and traces. Infino can function as a standalone observability solution or as the storage layer in your observability stack.\n",
+    "\n",
    "This example shows how one can track the following while calling OpenAI and ChatOpenAI models via `LangChain` and [Infino](https://github.com/infinohq/infino):\n",
    "\n",
-    "* prompt input,\n",
-    "* response from `ChatGPT` or any other `LangChain` model,\n",
-    "* latency,\n",
-    "* errors,\n",
+    "* prompt input\n",
+    "* response from `ChatGPT` or any other `LangChain` model\n",
+    "* latency\n",
+    "* errors\n",
    "* number of tokens consumed"
   ]
  },
@@ -454,7 +456,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/docs/integrations/callbacks/labelstudio.ipynb
+++ b/docs/docs/integrations/callbacks/labelstudio.ipynb
@@ -4,6 +4,9 @@
   "cell_type": "markdown",
   "metadata": {
    "collapsed": true,
+    "jupyter": {
+     "outputs_hidden": true
+    },
    "pycharm": {
     "name": "#%% md\n"
    }
@@ -11,17 +14,14 @@
   "source": [
    "# Label Studio\n",
    "\n",
-    "<div>\n",
-    "<img src=\"https://labelstudio-pub.s3.amazonaws.com/lc/open-source-data-labeling-platform.png\" width=\"400\"/>\n",
-    "</div>\n",
    "\n",
-    "Label Studio is an open-source data labeling platform that provides LangChain with flexibility when it comes to labeling data for fine-tuning large language models (LLMs). It also enables the preparation of custom training data and the collection and evaluation of responses through human feedback.\n",
+    ">[Label Studio](https://labelstud.io/guide/get_started) is an open-source data labeling platform that provides LangChain with flexibility when it comes to labeling data for fine-tuning large language models (LLMs). It also enables the preparation of custom training data and the collection and evaluation of responses through human feedback.\n",
    "\n",
-    "In this guide, you will learn how to connect a LangChain pipeline to Label Studio to:\n",
+    "In this guide, you will learn how to connect a LangChain pipeline to `Label Studio` to:\n",
    "\n",
-    "- Aggregate all input prompts, conversations, and responses in a single LabelStudio project. This consolidates all the data in one place for easier labeling and analysis.\n",
+    "- Aggregate all input prompts, conversations, and responses in a single `Label Studio` project. This consolidates all the data in one place for easier labeling and analysis.\n",
    "- Refine prompts and responses to create a dataset for supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) scenarios. The labeled data can be used to further train the LLM to improve its performance.\n",
-    "- Evaluate model responses through human feedback. LabelStudio provides an interface for humans to review and provide feedback on model responses, allowing evaluation and iteration."
+    "- Evaluate model responses through human feedback. `Label Studio` provides an interface for humans to review and provide feedback on model responses, allowing evaluation and iteration."
   ]
  },
  {
@@ -362,9 +362,9 @@
 ],
 "metadata": {
  "kernelspec": {
-   "display_name": "labelops",
+   "display_name": "Python 3 (ipykernel)",
   "language": "python",
-   "name": "labelops"
+   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
@@ -376,9 +376,9 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.16"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
- "nbformat_minor": 1
+ "nbformat_minor": 4
 }
--- a/docs/docs/integrations/callbacks/llmonitor.md
+++ b/docs/docs/integrations/callbacks/llmonitor.md
@@ -1,6 +1,6 @@
 # LLMonitor

-[LLMonitor](https://llmonitor.com?utm_source=langchain&utm_medium=py&utm_campaign=docs) is an open-source observability platform that provides cost and usage analytics, user tracking, tracing and evaluation tools.
+>[LLMonitor](https://llmonitor.com?utm_source=langchain&utm_medium=py&utm_campaign=docs) is an open-source observability platform that provides cost and usage analytics, user tracking, tracing and evaluation tools.

 <video controls width='100%' >
  <source src='https://llmonitor.com/videos/demo-annotated.mp4'/>
--- a/docs/docs/integrations/callbacks/promptlayer.ipynb
+++ b/docs/docs/integrations/callbacks/promptlayer.ipynb
@@ -7,13 +7,13 @@
   "source": [
    "# PromptLayer\n",
    "\n",
-    "![PromptLayer](https://promptlayer.com/text_logo.png)\n",
+    ">[PromptLayer](https://docs.promptlayer.com/introduction) is a platform for prompt engineering. It also helps with the LLM observability to visualize requests, version prompts, and track usage.\n",
+    ">\n",
+    ">While `PromptLayer` does have LLMs that integrate directly with LangChain (e.g. [`PromptLayerOpenAI`](https://python.langchain.com/docs/integrations/llms/promptlayer_openai)), using a callback is the recommended way to integrate `PromptLayer` with LangChain.\n",
    "\n",
-    "[PromptLayer](https://promptlayer.com) is a an LLM observability platform that lets you visualize requests, version prompts, and track usage. In this guide we will go over how to setup the `PromptLayerCallbackHandler`. \n",
+    "In this guide, we will go over how to setup the `PromptLayerCallbackHandler`. \n",
    "\n",
-    "While PromptLayer does have LLMs that integrate directly with LangChain (e.g. [`PromptLayerOpenAI`](https://python.langchain.com/docs/integrations/llms/promptlayer_openai)), this callback is the recommended way to integrate PromptLayer with LangChain.\n",
-    "\n",
-    "See [our docs](https://docs.promptlayer.com/languages/langchain) for more information."
+    "See [PromptLayer docs](https://docs.promptlayer.com/languages/langchain) for more information."
   ]
  },
  {
@@ -51,7 +51,7 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "### Usage\n",
+    "## Usage\n",
    "\n",
    "Getting started with `PromptLayerCallbackHandler` is fairly simple, it takes two optional arguments:\n",
    "1. `pl_tags` - an optional list of strings that will be tracked as tags on PromptLayer.\n",
@@ -63,7 +63,7 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "### Simple OpenAI Example\n",
+    "## Simple OpenAI Example\n",
    "\n",
    "In this simple example we use `PromptLayerCallbackHandler` with `ChatOpenAI`. We add a PromptLayer tag named `chatopenai`"
   ]
@@ -99,7 +99,7 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "### GPT4All Example"
+    "## GPT4All Example"
   ]
  },
  {
@@ -125,9 +125,9 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "### Full Featured Example\n",
+    "## Full Featured Example\n",
    "\n",
-    "In this example we unlock more of the power of PromptLayer.\n",
+    "In this example, we unlock more of the power of `PromptLayer`.\n",
    "\n",
    "PromptLayer allows you to visually create, version, and track prompt templates. Using the [Prompt Registry](https://docs.promptlayer.com/features/prompt-registry), we can programmatically fetch the prompt template called `example`.\n",
    "\n",
@@ -182,7 +182,7 @@
 ],
 "metadata": {
  "kernelspec": {
-   "display_name": "base",
+   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
@@ -196,7 +196,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.8.8 (default, Apr 13 2021, 12:59:45) \n[Clang 10.0.0 ]"
+   "version": "3.10.12"
  },
  "vscode": {
   "interpreter": {
--- a/docs/docs/integrations/callbacks/sagemaker_tracking.ipynb
+++ b/docs/docs/integrations/callbacks/sagemaker_tracking.ipynb
@@ -7,14 +7,15 @@
   "source": [
    "# SageMaker Tracking\n",
    "\n",
-    "This notebook shows how LangChain Callback can be used to log and track prompts and other LLM hyperparameters into SageMaker Experiments. Here, we use different scenarios to showcase the capability:\n",
+    ">[Amazon SageMaker](https://aws.amazon.com/sagemaker/) is a fully managed service that is used to quickly and easily build, train and deploy machine learning (ML) models. \n",
+    "\n",
+    ">[Amazon SageMaker Experiments](https://docs.aws.amazon.com/sagemaker/latest/dg/experiments.html) is a capability of `Amazon SageMaker` that lets you organize, track, compare and evaluate ML experiments and model versions.\n",
+    "\n",
+    "This notebook shows how LangChain Callback can be used to log and track prompts and other LLM hyperparameters into `SageMaker Experiments`. Here, we use different scenarios to showcase the capability:\n",
    "* **Scenario 1**: *Single LLM* - A case where a single LLM model is used to generate output based on a given prompt.\n",
    "* **Scenario 2**: *Sequential Chain* - A case where a sequential chain of two LLM models is used.\n",
    "* **Scenario 3**: *Agent with Tools (Chain of Thought)* - A case where multiple tools (search and math) are used in addition to an LLM.\n",
    "\n",
-    "[Amazon SageMaker](https://aws.amazon.com/sagemaker/) is a fully managed service that is used to quickly and easily build, train and deploy machine learning (ML) models. \n",
-    "\n",
-    "[Amazon SageMaker Experiments](https://docs.aws.amazon.com/sagemaker/latest/dg/experiments.html) is a capability of Amazon SageMaker that lets you organize, track, compare and evaluate ML experiments and model versions.\n",
    "\n",
    "In this notebook, we will create a single experiment to log the prompts from each scenario."
   ]
@@ -899,9 +900,9 @@
  ],
  "instance_type": "ml.t3.large",
  "kernelspec": {
-   "display_name": "conda_pytorch_p310",
+   "display_name": "Python 3 (ipykernel)",
   "language": "python",
-   "name": "conda_pytorch_p310"
+   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
@@ -913,7 +914,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.10.10"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/docs/integrations/callbacks/trubrics.ipynb
+++ b/docs/docs/integrations/callbacks/trubrics.ipynb
@@ -9,12 +9,13 @@
   "source": [
    "# Trubrics\n",
    "\n",
-    "![Trubrics](https://miro.medium.com/v2/resize:fit:720/format:webp/1*AhYbKO-v8F4u3hx2aDIqKg.png)\n",
    "\n",
-    "[Trubrics](https://trubrics.com) is an LLM user analytics platform that lets you collect, analyse and manage user\n",
-    "prompts & feedback on AI models. In this guide we will go over how to setup the `TrubricsCallbackHandler`. \n",
+    ">[Trubrics](https://trubrics.com) is an LLM user analytics platform that lets you collect, analyse and manage user\n",
+    "prompts & feedback on AI models.\n",
+    ">\n",
+    ">Check out [Trubrics repo](https://github.com/trubrics/trubrics-sdk) for more information on `Trubrics`.\n",
    "\n",
-    "Check out [our repo](https://github.com/trubrics/trubrics-sdk) for more information on Trubrics."
+    "In this guide, we will go over how to set up the `TrubricsCallbackHandler`. \n"
   ]
  },
  {
@@ -347,9 +348,9 @@
 ],
 "metadata": {
  "kernelspec": {
-   "display_name": "langchain",
+   "display_name": "Python 3 (ipykernel)",
   "language": "python",
-   "name": "langchain"
+   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
@@ -361,7 +362,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.4"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/docs/integrations/chat/anthropic.ipynb
+++ b/docs/docs/integrations/chat/anthropic.ipynb
@@ -1,11 +1,21 @@
 {
 "cells": [
+  {
+   "cell_type": "raw",
+   "id": "a016701c",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "sidebar_label: Anthropic\n",
+    "---"
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "bf733a38-db84-4363-89e2-de6735c37230",
   "metadata": {},
   "source": [
-    "# Anthropic\n",
+    "# ChatAnthropic\n",
    "\n",
    "This notebook covers how to get started with Anthropic chat models."
   ]
--- a/docs/docs/integrations/chat/anyscale.ipynb
+++ b/docs/docs/integrations/chat/anyscale.ipynb
@@ -1,12 +1,22 @@
 {
 "cells": [
+  {
+   "cell_type": "raw",
+   "id": "31895fc4",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "sidebar_label: Anyscale\n",
+    "---"
+   ]
+  },
  {
   "attachments": {},
   "cell_type": "markdown",
   "id": "642fd21c-600a-47a1-be96-6e1438b421a9",
   "metadata": {},
   "source": [
-    "# Anyscale\n",
+    "# ChatAnyscale\n",
    "\n",
    "This notebook demonstrates the use of `langchain.chat_models.ChatAnyscale` for [Anyscale Endpoints](https://endpoints.anyscale.com/).\n",
    "\n",
@@ -33,7 +43,7 @@
   "metadata": {},
   "outputs": [
    {
-     "name": "stdin",
+     "name": "stdout",
     "output_type": "stream",
     "text": [
      " ········\n"
--- a/docs/docs/integrations/chat/azure_chat_openai.ipynb
+++ b/docs/docs/integrations/chat/azure_chat_openai.ipynb
@@ -1,11 +1,21 @@
 {
 "cells": [
+  {
+   "cell_type": "raw",
+   "id": "641f8cb0",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "sidebar_label: Azure OpenAI\n",
+    "---"
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "38f26d7a",
   "metadata": {},
   "source": [
-    "# Azure OpenAI\n",
+    "# AzureChatOpenAI\n",
    "\n",
    ">[Azure OpenAI Service](https://learn.microsoft.com/en-us/azure/ai-services/openai/overview) provides REST API access to OpenAI's powerful language models including the GPT-4, GPT-3.5-Turbo, and Embeddings model series. These models can be easily adapted to your specific task including but not limited to content generation, summarization, semantic search, and natural language to code translation. Users can access the service through REST APIs, Python SDK, or a web-based interface in the Azure OpenAI Studio.\n",
    "\n",
--- a/docs/docs/integrations/chat/azureml_chat_endpoint.ipynb
+++ b/docs/docs/integrations/chat/azureml_chat_endpoint.ipynb
@@ -1,10 +1,19 @@
 {
 "cells": [
+  {
+   "cell_type": "raw",
+   "metadata": {},
+   "source": [
+    "---\n",
+    "sidebar_label: Azure ML Endpoint\n",
+    "---"
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "# Azure ML Endpoint\n",
+    "# AzureMLChatOnlineEndpoint\n",
    "\n",
    ">[Azure Machine Learning](https://azure.microsoft.com/en-us/products/machine-learning/) is a platform used to build, train, and deploy machine learning models. Users can explore the types of models to deploy in the Model Catalog, which provides Azure Foundation Models and OpenAI Models. `Azure Foundation Models` include various open-source models and popular Hugging Face models. Users can also import models of their liking into AzureML.\n",
    ">\n",
--- a/Show More
+++ b/Show More