bump 309 (#11443 )

Add cohere /chat integration (#11389 )
Add cohere /chat integration and an iPython notebook to demonstrate the addition. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>
2026-02-10 19:20:24 +00:00 · 2023-10-05 09:29:14 -07:00 · 2023-10-05 09:20:47 -07:00 · 2023-10-05 09:20:18 -07:00 · 2023-10-05 09:07:16 -07:00 · 2023-10-05 08:07:38 -07:00
342 changed files with 17390 additions and 12956 deletions
--- a/.devcontainer/README.md
+++ b/.devcontainer/README.md
@@ -5,10 +5,10 @@ This project includes a [dev container](https://containers.dev/), which lets you
 You can use the dev container configuration in this folder to build and run the app without needing to install any of its tools locally! You can use it in [GitHub Codespaces](https://github.com/features/codespaces) or the [VS Code Dev Containers extension](https://marketplace.visualstudio.com/items?itemName=ms-vscode-remote.remote-containers).

 ## GitHub Codespaces
-[![Open in GitHub Codespaces](https://github.com/codespaces/badge.svg)](https://codespaces.new/hwchase17/langchain)
+[![Open in GitHub Codespaces](https://github.com/codespaces/badge.svg)](https://codespaces.new/langchain-ai/langchain)

 You may use the button above, or follow these steps to open this repo in a Codespace:
-1. Click the **Code** drop-down menu at the top of https://github.com/hwchase17/langchain.
+1. Click the **Code** drop-down menu at the top of https://github.com/langchain-ai/langchain.
 1. Click on the **Codespaces** tab.
 1. Click **Create codespace on master** .

--- a/.github/CONTRIBUTING.md
+++ b/.github/CONTRIBUTING.md
@@ -32,7 +32,7 @@ best way to get our attention.

 ### 🚩GitHub Issues

-Our [issues](https://github.com/hwchase17/langchain/issues) page is kept up to date
+Our [issues](https://github.com/langchain-ai/langchain/issues) page is kept up to date
 with bugs, improvements, and feature requests.

 There is a taxonomy of labels to help with sorting and discovery of issues of interest. Please use these to help
@@ -60,11 +60,11 @@ we do not want these to get in the way of getting good code into the codebase.
 ## 🚀 Quick Start

 This quick start describes running the repository locally.
-For a [development container](https://containers.dev/), see the [.devcontainer folder](https://github.com/hwchase17/langchain/tree/master/.devcontainer).
+For a [development container](https://containers.dev/), see the [.devcontainer folder](https://github.com/langchain-ai/langchain/tree/master/.devcontainer).

 ### Dependency Management: Poetry and other env/dependency managers

-This project uses [Poetry](https://python-poetry.org/) v1.5.1+ as a dependency manager.
+This project uses [Poetry](https://python-poetry.org/) v1.6.1+ as a dependency manager.

 ❗Note: *Before installing Poetry*, if you use `Conda`, create and activate a new Conda env (e.g. `conda create -n langchain python=3.9`)

@@ -105,8 +105,8 @@ make test
 If the tests don't pass, you may need to pip install additional dependencies, such as `numexpr` and `openapi_schema_pydantic`.

 If during installation you receive a `WheelFileValidationError` for `debugpy`, please make sure you are running
-Poetry v1.5.1+. This bug was present in older versions of Poetry (e.g. 1.4.1) and has been resolved in newer releases.
-If you are still seeing this bug on v1.5.1, you may also try disabling "modern installation"
+Poetry v1.6.1+. This bug was present in older versions of Poetry (e.g. 1.4.1) and has been resolved in newer releases.
+If you are still seeing this bug on v1.6.1, you may also try disabling "modern installation"
 (`poetry config installer.modern-installation false`) and re-installing requirements.
 See [this `debugpy` issue](https://github.com/microsoft/debugpy/issues/1246) for more details.

--- a/.github/ISSUE_TEMPLATE/feature-request.yml
+++ b/.github/ISSUE_TEMPLATE/feature-request.yml
@@ -27,4 +27,4 @@ body:
    attributes:
      label: Your contribution
      description: |
-        Is there any way that you could help, e.g. by submitting a PR? Make sure to read the CONTRIBUTING.MD [readme](https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md)
+        Is there any way that you could help, e.g. by submitting a PR? Make sure to read the CONTRIBUTING.MD [readme](https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md)
--- a/.github/PULL_REQUEST_TEMPLATE.md
+++ b/.github/PULL_REQUEST_TEMPLATE.md
@@ -10,7 +10,7 @@ Replace this entire comment with:
 Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally.

 See contribution guidelines for more information on how to write/run tests, lint, etc: 
-https://github.com/hwchase17/langchain/blob/master/.github/CONTRIBUTING.md
+https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md

 If you're adding a new integration, please include:
  1. a test for the integration, preferably unit tests that do not rely on network access,
--- a/.github/workflows/_lint.yml
+++ b/.github/workflows/_lint.yml
@@ -9,7 +9,7 @@ on:
        description: "From which folder this pipeline executes"

 env:
-  POETRY_VERSION: "1.5.1"
+  POETRY_VERSION: "1.6.1"
  WORKDIR: ${{ inputs.working-directory == '' && '.' || inputs.working-directory }}

 jobs:
--- a/.github/workflows/_pydantic_compatibility.yml
+++ b/.github/workflows/_pydantic_compatibility.yml
@@ -9,7 +9,7 @@ on:
        description: "From which folder this pipeline executes"

 env:
-  POETRY_VERSION: "1.5.1"
+  POETRY_VERSION: "1.6.1"

 jobs:
  build:
--- a/.github/workflows/_release.yml
+++ b/.github/workflows/_release.yml
@@ -9,7 +9,7 @@ on:
        description: "From which folder this pipeline executes"

 env:
-  POETRY_VERSION: "1.5.1"
+  POETRY_VERSION: "1.6.1"

 jobs:
  if_release:
--- a/.github/workflows/_release_docker.yml
+++ b/.github/workflows/_release_docker.yml
@@ -0,0 +1,62 @@
+name: release_docker
+
+on:
+  workflow_call:
+    inputs:
+      dockerfile:
+        required: true
+        type: string
+        description: "Path to the Dockerfile to build"
+      image:
+        required: true
+        type: string
+        description: "Name of the image to build"
+
+env:
+  TEST_TAG: ${{ inputs.image }}:test
+  LATEST_TAG: ${{ inputs.image }}:latest
+
+jobs:
+  docker:
+    runs-on: ubuntu-latest
+    steps:
+      - name: Checkout
+        uses: actions/checkout@v4
+      - name: Get git tag
+        uses: actions-ecosystem/action-get-latest-tag@v1
+        id: get-latest-tag
+      - name: Set docker tag
+        env:
+          VERSION: ${{ steps.get-latest-tag.outputs.tag }}
+        run: |
+          echo "VERSION_TAG=${{ inputs.image }}:${VERSION#v}" >> $GITHUB_ENV
+      - name: Set up QEMU
+        uses: docker/setup-qemu-action@v3
+      - name: Set up Docker Buildx
+        uses: docker/setup-buildx-action@v3
+      - name: Login to Docker Hub
+        uses: docker/login-action@v3
+        with:
+          username: ${{ secrets.DOCKERHUB_USERNAME }}
+          password: ${{ secrets.DOCKERHUB_TOKEN }}
+      - name: Build for Test
+        uses: docker/build-push-action@v5
+        with:
+          context: .
+          file: ${{ inputs.dockerfile }}
+          load: true
+          tags: ${{ env.TEST_TAG }}
+      - name: Test
+        run: |
+          docker run --rm ${{ env.TEST_TAG }} python -c "import langchain"
+      - name: Build and Push to Docker Hub
+        uses: docker/build-push-action@v5
+        with:
+          context: .
+          file: ${{ inputs.dockerfile }}
+          # We can only build for the intersection of platforms supported by
+          # QEMU and base python image, for now build only for
+          # linux/amd64 and linux/arm64
+          platforms: linux/amd64,linux/arm64
+          tags: ${{ env.LATEST_TAG }},${{ env.VERSION_TAG }}
+          push: true
--- a/.github/workflows/_test.yml
+++ b/.github/workflows/_test.yml
@@ -9,7 +9,7 @@ on:
        description: "From which folder this pipeline executes"

 env:
-  POETRY_VERSION: "1.5.1"
+  POETRY_VERSION: "1.6.1"

 jobs:
  build:
--- a/.github/workflows/codespell.yml
+++ b/.github/workflows/codespell.yml
@@ -18,7 +18,19 @@ jobs:
    steps:
      - name: Checkout
        uses: actions/checkout@v3
+
+      - name: Install Dependencies
+        run: |
+          pip install toml
+
+      - name: Extract Ignore Words List
+        run: |
+          # Use a Python script to extract the ignore words list from pyproject.toml
+          python .github/workflows/extract_ignored_words_list.py
+        id: extract_ignore_words
+
      - name: Codespell
        uses: codespell-project/actions-codespell@v2
        with:
          skip: guide_imports.json
+          ignore_words_list: ${{ steps.extract_ignore_words.outputs.ignore_words_list }}
--- a/.github/workflows/extract_ignored_words_list.py
+++ b/.github/workflows/extract_ignored_words_list.py
@@ -0,0 +1,8 @@
+import toml
+
+pyproject_toml = toml.load("pyproject.toml")
+
+# Extract the ignore words list (adjust the key as per your TOML structure)
+ignore_words_list = pyproject_toml.get("tool", {}).get("codespell", {}).get("ignore-words-list")
+
+print(f"::set-output name=ignore_words_list::{ignore_words_list}")
--- a/.github/workflows/langchain_ci.yml
+++ b/.github/workflows/langchain_ci.yml
@@ -26,7 +26,7 @@ concurrency:
  cancel-in-progress: true

 env:
-  POETRY_VERSION: "1.5.1"
+  POETRY_VERSION: "1.6.1"
  WORKDIR: "libs/langchain"

 jobs:
--- a/.github/workflows/langchain_experimental_ci.yml
+++ b/.github/workflows/langchain_experimental_ci.yml
@@ -26,7 +26,7 @@ concurrency:
  cancel-in-progress: true

 env:
-  POETRY_VERSION: "1.5.1"
+  POETRY_VERSION: "1.6.1"
  WORKDIR: "libs/experimental"

 jobs:
--- a/.github/workflows/langchain_release.yml
+++ b/.github/workflows/langchain_release.yml
@@ -11,3 +11,16 @@ jobs:
    with:
      working-directory: libs/langchain
    secrets: inherit
+
+  # N.B.: It's possible that PyPI doesn't make the new release visible / available
+  #       immediately after publishing. If that happens, the docker build might not
+  #       create a new docker image for the new release, since it won't see it.
+  #
+  #       If this ends up being a problem, add a check to the end of the `_release.yml`
+  #       workflow that prevents the workflow from finishing until the new release
+  #       is visible and installable on PyPI.
+  release-docker:
+    needs:
+      - release
+    uses:
+      ./.github/workflows/langchain_release_docker.yml
--- a/.github/workflows/langchain_release_docker.yml
+++ b/.github/workflows/langchain_release_docker.yml
@@ -0,0 +1,14 @@
+---
+name: docker/langchain/langchain Release
+
+on:
+  workflow_dispatch: # Allows to trigger the workflow manually in GitHub UI
+  workflow_call: # Allows triggering from another workflow
+
+jobs:
+  release:
+    uses: ./.github/workflows/_release_docker.yml
+    with:
+      dockerfile: docker/Dockerfile.base
+      image: langchain/langchain
+    secrets: inherit
--- a/.github/workflows/langserve_ci.yml
+++ b/.github/workflows/langserve_ci.yml
@@ -1,82 +0,0 @@
---
-name: libs/langserve CI
-
-on:
-  push:
-    branches: [ master ]
-  pull_request:
-    paths:
-      - '.github/actions/poetry_setup/action.yml'
-      - '.github/tools/**'
-      - '.github/workflows/_lint.yml'
-      - '.github/workflows/_test.yml'
-      - '.github/workflows/langserve_ci.yml'
-      - 'libs/langserve/**'
-  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI
-
-# If another push to the same PR or branch happens while this workflow is still running,
-# cancel the earlier run in favor of the next run.
-#
-# There's no point in testing an outdated version of the code. GitHub only allows
-# a limited number of job runners to be active at the same time, so it's better to cancel
-# pointless jobs early so that more useful jobs can run sooner.
-concurrency:
-  group: ${{ github.workflow }}-${{ github.ref }}
-  cancel-in-progress: true
-
-env:
-  POETRY_VERSION: "1.5.1"
-  WORKDIR: "libs/langserve"
-
-jobs:
-  lint:
-    uses:
-      ./.github/workflows/_lint.yml
-    with:
-      working-directory: libs/langserve
-    secrets: inherit
-
-  test:
-    runs-on: ubuntu-latest
-    defaults:
-      run:
-        working-directory: ${{ env.WORKDIR }}
-    strategy:
-      matrix:
-        python-version:
-          - "3.8"
-          - "3.9"
-          - "3.10"
-          - "3.11"
-    name: Python ${{ matrix.python-version }} extended tests
-    steps:
-      - uses: actions/checkout@v3
-
-      - name: Set up Python ${{ matrix.python-version }} + Poetry ${{ env.POETRY_VERSION }}
-        uses: "./.github/actions/poetry_setup"
-        with:
-          python-version: ${{ matrix.python-version }}
-          poetry-version: ${{ env.POETRY_VERSION }}
-          working-directory: libs/langserve
-          cache-key: langserve-all
-
-      - name: Install dependencies
-        shell: bash
-        run: |
-          echo "Running extended tests, installing dependencies with poetry..."
-          poetry install --with test,lint --extras all
-
-      - name: Run tests
-        run: make test
-
-      - name: Ensure the tests did not create any additional files
-        shell: bash
-        run: |
-          set -eu
-
-          STATUS="$(git status)"
-          echo "$STATUS"
-
-          # grep will exit non-zero if the target message isn't found,
-          # and `set -e` above will cause the step to fail.
-          echo "$STATUS" | grep 'nothing to commit, working tree clean'
--- a/.github/workflows/langserve_release.yml
+++ b/.github/workflows/langserve_release.yml
@@ -1,13 +0,0 @@
---
-name: libs/langserve Release
-
-on:
-  workflow_dispatch:  # Allows to trigger the workflow manually in GitHub UI
-
-jobs:
-  release:
-    uses:
-      ./.github/workflows/_release.yml
-    with:
-      working-directory: libs/langserve
-    secrets: inherit
--- a/.github/workflows/scheduled_test.yml
+++ b/.github/workflows/scheduled_test.yml
@@ -6,7 +6,7 @@ on:
    - cron:  '0 13 * * *'

 env:
-  POETRY_VERSION: "1.5.1"
+  POETRY_VERSION: "1.6.1"

 jobs:
  build:
@@ -40,6 +40,13 @@ jobs:
        with:
          credentials_json: '${{ secrets.GOOGLE_CREDENTIALS }}'

+      - name: Configure AWS Credentials
+        uses: aws-actions/configure-aws-credentials@v4
+        with:
+          aws-access-key-id: ${{ secrets.AWS_ACCESS_KEY_ID }}
+          aws-secret-access-key: ${{ secrets.AWS_SECRET_ACCESS_KEY }}
+          aws-region: ${{ vars.AWS_REGION }}
+
      - name: Install dependencies
        working-directory: libs/langchain
        shell: bash
@@ -47,6 +54,7 @@ jobs:
          echo "Running scheduled tests, installing dependencies with poetry..."
          poetry install --with=test_integration
          poetry run pip install google-cloud-aiplatform
+          poetry run pip install "boto3>=1.28.57"

      - name: Run tests
        shell: bash
@@ -55,3 +63,15 @@ jobs:
          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
        run: |
          make scheduled_tests
+
+      - name: Ensure the tests did not create any additional files
+        shell: bash
+        run: |
+          set -eu
+
+          STATUS="$(git status)"
+          echo "$STATUS"
+
+          # grep will exit non-zero if the target message isn't found,
+          # and `set -e` above will cause the step to fail.
+          echo "$STATUS" | grep 'nothing to commit, working tree clean'
--- a/.gitignore
+++ b/.gitignore
@@ -30,6 +30,12 @@ share/python-wheels/
 *.egg
 MANIFEST

+# Google GitHub Actions credentials files created by:
+# https://github.com/google-github-actions/auth
+#
+# That action recommends adding this gitignore to prevent accidentally committing keys.
+gha-creds-*.json
+
 # PyInstaller
 #  Usually these files are written by a python script from a template
 #  before PyInstaller builds the exe, so as to inject date/other infos into it.
--- a/CITATION.cff
+++ b/CITATION.cff
@@ -5,4 +5,4 @@ authors:
  given-names: "Harrison"
 title: "LangChain"
 date-released: 2022-10-17
-url: "https://github.com/hwchase17/langchain"
+url: "https://github.com/langchain-ai/langchain"
--- a/README.md
+++ b/README.md
@@ -16,7 +16,7 @@
 [![Open Issues](https://img.shields.io/github/issues-raw/langchain-ai/langchain)](https://github.com/langchain-ai/langchain/issues)


-Looking for the JS/TS version? Check out [LangChain.js](https://github.com/hwchase17/langchainjs).
+Looking for the JS/TS version? Check out [LangChain.js](https://github.com/langchain-ai/langchainjs).

 **Production Support:** As you move your LangChains into production, we'd love to offer more hands-on support.
 Fill out [this form](https://airtable.com/appwQzlErAS2qiP0L/shrGtGaVBVAz7NcV2) to share more about what you're building, and our team will get in touch.
@@ -26,7 +26,7 @@ Fill out [this form](https://airtable.com/appwQzlErAS2qiP0L/shrGtGaVBVAz7NcV2) t
 In an effort to make `langchain` leaner and safer, we are moving select chains to `langchain_experimental`.
 This migration has already started, but we are remaining backwards compatible until 7/28.
 On that date, we will remove functionality from `langchain`.
-Read more about the motivation and the progress [here](https://github.com/hwchase17/langchain/discussions/8043).
+Read more about the motivation and the progress [here](https://github.com/langchain-ai/langchain/discussions/8043).
 Read how to migrate your code [here](MIGRATE.md).

 ## Quick Install
@@ -49,7 +49,7 @@ This library aims to assist in the development of those types of applications. C
 **💬 Chatbots**

 - [Documentation](https://python.langchain.com/docs/use_cases/chatbots/)
- End-to-end Example: [Chat-LangChain](https://github.com/hwchase17/chat-langchain)
+- End-to-end Example: [Chat-LangChain](https://github.com/langchain-ai/chat-langchain)

 **🤖 Agents**

--- a/docker/Dockerfile.base
+++ b/docker/Dockerfile.base
@@ -0,0 +1,3 @@
+FROM python:latest
+
+RUN pip install langchain
--- a/docs/docs_skeleton/docs/get_started/introduction.mdx
+++ b/docs/docs_skeleton/docs/get_started/introduction.mdx
@@ -20,7 +20,7 @@ Off-the-shelf chains make it easy to get started. For complex applications, comp

 We recommend following our [Quickstart](/docs/get_started/quickstart) guide to familiarize yourself with the framework by building your first LangChain application.

-_**Note**: These docs are for the LangChain [Python package](https://github.com/hwchase17/langchain). For documentation on [LangChain.js](https://github.com/hwchase17/langchainjs), the JS/TS version, [head here](https://js.langchain.com/docs)._
+_**Note**: These docs are for the LangChain [Python package](https://github.com/langchain-ai/langchain). For documentation on [LangChain.js](https://github.com/langchain-ai/langchainjs), the JS/TS version, [head here](https://js.langchain.com/docs)._

 ## Modules

--- a/docs/docs_skeleton/docs/get_started/quickstart.mdx
+++ b/docs/docs_skeleton/docs/get_started/quickstart.mdx
@@ -42,7 +42,7 @@ There are two types of language models, which in LangChain are called:
 - ChatModels: this is a language model which takes a list of messages as input and returns a message

 The input/output for LLMs is simple and easy to understand - a string.
-But what about ChatModels? The input there is a list of `ChatMessage`s, and the output is a single `ChatMessage`.
+But what about ChatModels? The input there is a list of `ChatMessages`, and the output is a single `ChatMessage`.
 A `ChatMessage` has two required components:

 - `content`: This is the content of the message.
@@ -85,7 +85,7 @@ import InputMessages from "@snippets/get_started/quickstart/input_messages.mdx"

 <InputMessages/>

-For both these methods, you can also pass in parameters as key word arguments.
+For both these methods, you can also pass in parameters as keyword arguments.
 For example, you could pass in `temperature=0` to adjust the temperature that is used from what the object was configured with.
 Whatever values are passed in during run time will always override what the object was configured with.

--- a/docs/docs_skeleton/docusaurus.config.js
+++ b/docs/docs_skeleton/docusaurus.config.js
@@ -196,7 +196,7 @@ const config = {
          },
          // Please keep GitHub link to the right for consistency.
          {
-            href: "https://github.com/hwchase17/langchain",
+            href: "https://github.com/langchain-ai/langchain",
            position: "right",
            className: "header-github-link",
            "aria-label": "GitHub repository",
@@ -224,11 +224,11 @@ const config = {
            items: [
              {
                label: "Python",
-                href: "https://github.com/hwchase17/langchain",
+                href: "https://github.com/langchain-ai/langchain",
              },
              {
                label: "JS/TS",
-                href: "https://github.com/hwchase17/langchainjs",
+                href: "https://github.com/langchain-ai/langchainjs",
              },
            ],
          },
--- a/docs/docs_skeleton/sidebars.js
+++ b/docs/docs_skeleton/sidebars.js
@@ -33,27 +33,26 @@ module.exports = {
      slug: "get_started",
      },
    },
-    {
-      type: "category",
-      label: "Modules",
-      collapsed: false,
-      collapsible: false,
-      items: [{ type: "autogenerated", dirName: "modules" } ],
-      link: {
-        type: 'doc',
-        id: "modules/index"
-      },
-    },
    {
      type: "category",
      label: "LangChain Expression Language",
-      collapsed: true,
+      collapsed: false,
      items: [{ type: "autogenerated", dirName: "expression_language" } ],
      link: {
        type: 'doc',
        id: "expression_language/index"
      },
    },
+    {
+      type: "category",
+      label: "Modules",
+      collapsed: false,
+      items: [{ type: "autogenerated", dirName: "modules" } ],
+      link: {
+        type: 'doc',
+        id: "modules/index"
+      },
+    },
    {
      type: "category",
      label: "Guides",
--- a/docs/docs_skeleton/static/img/multi_vector.png
+++ b/docs/docs_skeleton/static/img/multi_vector.png
--- a/docs/docs_skeleton/vercel.json
+++ b/docs/docs_skeleton/vercel.json
@@ -2628,6 +2628,18 @@
      "source": "/docs/modules/memory/integrations/cassandra_chat_message_history",
      "destination": "/docs/integrations/memory/cassandra_chat_message_history"
    },
+    {
+      "source": "/docs/integrations/memory/motorhead_memory_managed",
+      "destination": "/docs/integrations/memory/motorhead_memory"
+    },
+    {
+      "source": "/docs/integrations/memory/dynamodb_chat_message_history",
+      "destination": "/docs/integrations/memory/aws_dynamodb"
+    },
+    {
+      "source": "/docs/integrations/memory/entity_memory_with_sqlite",
+      "destination": "/docs/integrations/memory/sqlite"
+    },
    {
      "source": "/en/latest/modules/memory/examples/dynamodb_chat_message_history.html",
      "destination": "/docs/integrations/memory/dynamodb_chat_message_history"
--- a/docs/extras/additional_resources/dependents.mdx
+++ b/docs/extras/additional_resources/dependents.mdx
@@ -39,7 +39,7 @@ Dependents stats for `langchain-ai/langchain`
 |[go-skynet/LocalAI](https://github.com/go-skynet/LocalAI) | 9955 |
 |[AIGC-Audio/AudioGPT](https://github.com/AIGC-Audio/AudioGPT) | 9081 |
 |[gventuri/pandas-ai](https://github.com/gventuri/pandas-ai) | 8201 |
-|[hwchase17/langchainjs](https://github.com/hwchase17/langchainjs) | 7754 |
+|[langchain-ai/langchainjs](https://github.com/langchain-ai/langchainjs) | 7754 |
 |[langgenius/dify](https://github.com/langgenius/dify) | 7348 |
 |[PipedreamHQ/pipedream](https://github.com/PipedreamHQ/pipedream) | 6950 |
 |[h2oai/h2ogpt](https://github.com/h2oai/h2ogpt) | 6858 |
--- a/docs/extras/additional_resources/tutorials.mdx
+++ b/docs/extras/additional_resources/tutorials.mdx
@@ -2,7 +2,7 @@

 Below are links to tutorials and courses on LangChain. For written guides on common use cases for LangChain, check out the [use cases guides](/docs/use_cases).

-⛓ icon marks a new addition [last update 2023-08-20]
+⛓ icon marks a new addition [last update 2023-09-21]

 ---------------------

@@ -15,12 +15,11 @@ Below are links to tutorials and courses on LangChain. For written guides on com
 [LangChain AI Handbook](https://www.pinecone.io/learn/langchain/) By **James Briggs** and **Francisco Ingham**

 ### Short Tutorials
-[LangChain Crash Course - Build apps with language models](https://youtu.be/LbT1yp6quS8) by [Patrick Loeber](https://www.youtube.com/@patloeber)
+[LangChain Explained in 13 Minutes | QuickStart Tutorial for Beginners](https://youtu.be/aywZrzNaKjs) by [Rabbitmetrics](https://www.youtube.com/@rabbitmetrics)

 [LangChain Crash Course: Build an AutoGPT app in 25 minutes](https://youtu.be/MlK6SIjcjE8) by [Nicholas Renotte](https://www.youtube.com/@NicholasRenotte)

-[LangChain Explained in 13 Minutes | QuickStart Tutorial for Beginners](https://youtu.be/aywZrzNaKjs) by [Rabbitmetrics](https://www.youtube.com/@rabbitmetrics)
-
+[LangChain Crash Course - Build apps with language models](https://youtu.be/LbT1yp6quS8) by [Patrick Loeber](https://www.youtube.com/@patloeber)

 ##  Tutorials

@@ -37,6 +36,8 @@ Below are links to tutorials and courses on LangChain. For written guides on com
 - #9 [Build Conversational Agents with Vector DBs](https://youtu.be/H6bCqqw9xyI)
 - [Using NEW `MPT-7B` in Hugging Face and LangChain](https://youtu.be/DXpk9K7DgMo)
 - [`MPT-30B` Chatbot with LangChain](https://youtu.be/pnem-EhT6VI)
+- ⛓ [Fine-tuning OpenAI's `GPT 3.5` for LangChain Agents](https://youtu.be/boHXgQ5eQic?si=OOOfK-GhsgZGBqSr)
+- ⛓ [Chatbots with `RAG`: LangChain Full Walkthrough](https://youtu.be/LhnCsygAvzY?si=N7k6xy4RQksbWwsQ)


 ### [LangChain 101](https://www.youtube.com/playlist?list=PLqZXAkvF1bPNQER9mLmDbntNfSpzdDIU5) by [Greg Kamradt (Data Indy)](https://www.youtube.com/@DataIndependent)
@@ -100,6 +101,16 @@ Below are links to tutorials and courses on LangChain. For written guides on com
 - [What can you do with 16K tokens in LangChain?](https://youtu.be/z2aCZBAtWXs)
 - [Tagging and Extraction - Classification using `OpenAI Functions`](https://youtu.be/a8hMgIcUEnE)
 - [HOW to Make Conversational Form with LangChain](https://youtu.be/IT93On2LB5k)
+- ⛓ [`Claude-2` meets LangChain!](https://youtu.be/Hb_D3p0bK2U?si=j96Kc7oJoeRI5-iC)
+- ⛓ [`PaLM 2` Meets LangChain](https://youtu.be/orPwLibLqm4?si=KgJjpEbAD9YBPqT4)
+- ⛓ [`LLaMA2` with LangChain - Basics | LangChain TUTORIAL](https://youtu.be/cIRzwSXB4Rc?si=v3Hwxk1m3fksBIHN)
+- ⛓ [Serving `LLaMA2` with `Replicate`](https://youtu.be/JIF4nNi26DE?si=dSazFyC4UQmaR-rJ)
+- ⛓ [NEW LangChain Expression Language](https://youtu.be/ud7HJ2p3gp0?si=8pJ9O6hGbXrCX5G9)
+- ⛓ [Building a RCI Chain for Agents with LangChain Expression Language](https://youtu.be/QaKM5s0TnsY?si=0miEj-o17AHcGfLG)
+- ⛓ [How to Run `LLaMA-2-70B` on the `Together AI`](https://youtu.be/Tc2DHfzHeYE?si=Xku3S9dlBxWQukpe)
+- ⛓ [`RetrievalQA` with `LLaMA 2 70b` & `Chroma` DB](https://youtu.be/93yueQQnqpM?si=ZMwj-eS_CGLnNMXZ)
+- ⛓ [How to use `BGE Embeddings` for LangChain](https://youtu.be/sWRvSG7vL4g?si=85jnvnmTCF9YIWXI)
+- ⛓ [How to use Custom Prompts for `RetrievalQA` on `LLaMA-2 7B`](https://youtu.be/PDwUKves9GY?si=sMF99TWU0p4eiK80)


 ### [LangChain](https://www.youtube.com/playlist?list=PLVEEucA9MYhOu89CX8H3MBZqayTbcCTMr) by [Prompt Engineering](https://www.youtube.com/@engineerprompt)
@@ -107,23 +118,26 @@ Below are links to tutorials and courses on LangChain. For written guides on com
 - [Working with MULTIPLE `PDF` Files in LangChain: `ChatGPT` for your Data](https://youtu.be/s5LhRdh5fu4)
 - [`ChatGPT` for YOUR OWN `PDF` files with LangChain](https://youtu.be/TLf90ipMzfE)
 - [Talk to YOUR DATA without OpenAI APIs: LangChain](https://youtu.be/wrD-fZvT6UI)
- [LangChain: PDF Chat App (GUI) | ChatGPT for Your PDF FILES](https://youtu.be/RIWbalZ7sTo)
- [LangFlow: Build Chatbots without Writing Code](https://youtu.be/KJ-ux3hre4s)
+- [LangChain: `PDF` Chat App (GUI) | `ChatGPT` for Your `PDF` FILES](https://youtu.be/RIWbalZ7sTo)
+- [`LangFlow`: Build Chatbots without Writing Code](https://youtu.be/KJ-ux3hre4s)
 - [LangChain: Giving Memory to LLMs](https://youtu.be/dxO6pzlgJiY)
 - [BEST OPEN Alternative to `OPENAI's EMBEDDINGs` for Retrieval QA: LangChain](https://youtu.be/ogEalPMUCSY)
+- [LangChain: Run Language Models Locally - `Hugging Face Models`](https://youtu.be/Xxxuw4_iCzw) 
+- ⛓ [Slash API Costs: Mastering Caching for LLM Applications](https://youtu.be/EQOznhaJWR0?si=AXoI7f3-SVFRvQUl)
+- ⛓ [Avoid PROMPT INJECTION with `Constitutional AI` - LangChain](https://youtu.be/tyKSkPFHVX8?si=9mgcB5Y1kkotkBGB)


 ### LangChain by [Chat with data](https://www.youtube.com/@chatwithdata)
 - [LangChain Beginner's Tutorial for `Typescript`/`Javascript`](https://youtu.be/bH722QgRlhQ)
 - [`GPT-4` Tutorial: How to Chat With Multiple `PDF` Files (~1000 pages of Tesla's 10-K Annual Reports)](https://youtu.be/Ix9WIZpArm0)
 - [`GPT-4` & LangChain Tutorial: How to Chat With A 56-Page `PDF` Document (w/`Pinecone`)](https://youtu.be/ih9PBGVVOO4)
- [LangChain & Supabase Tutorial: How to Build a ChatGPT Chatbot For Your Website](https://youtu.be/R2FMzcsmQY8)
+- [LangChain & `Supabase` Tutorial: How to Build a ChatGPT Chatbot For Your Website](https://youtu.be/R2FMzcsmQY8)
 - [LangChain Agents: Build Personal Assistants For Your Data (Q&A with Harrison Chase and Mayo Oshin)](https://youtu.be/gVkF8cwfBLI)


 ### Codebase Analysis
- ⛓ [Codebase Analysis: Langchain Agents](https://carbonated-yacht-2c5.notion.site/Codebase-Analysis-Langchain-Agents-0b0587acd50647ca88aaae7cff5df1f2)
+- [Codebase Analysis: Langchain Agents](https://carbonated-yacht-2c5.notion.site/Codebase-Analysis-Langchain-Agents-0b0587acd50647ca88aaae7cff5df1f2)


 ---------------------
-⛓ icon marks a new addition [last update 2023-08-20]
+⛓ icon marks a new addition [last update 2023-09-21]
--- a/docs/extras/additional_resources/youtube.mdx
+++ b/docs/extras/additional_resources/youtube.mdx
@@ -1,6 +1,6 @@
 # YouTube videos

-⛓ icon marks a new addition [last update 2023-09-05]
+⛓ icon marks a new addition [last update 2023-09-21]

 ### [Official LangChain YouTube channel](https://www.youtube.com/@LangChain)

@@ -12,7 +12,7 @@

 ## Videos (sorted by views)

- [Building AI LLM Apps with LangChain (and more?) - LIVE STREAM](https://www.youtube.com/live/M-2Cj_2fzWI?feature=share) by [Nicholas Renotte](https://www.youtube.com/@NicholasRenotte)
+- [Using `ChatGPT` with YOUR OWN Data. This is magical. (LangChain OpenAI API)](https://youtu.be/9AXP7tCI9PI) by [TechLead](https://www.youtube.com/@TechLead)
 - [First look - `ChatGPT` + `WolframAlpha` (`GPT-3.5` and Wolfram|Alpha via LangChain by James Weaver)](https://youtu.be/wYGbY811oMo) by [Dr Alan D. Thompson](https://www.youtube.com/@DrAlanDThompson) 
 - [LangChain explained - The hottest new Python framework](https://youtu.be/RoR4XJw8wIc) by [AssemblyAI](https://www.youtube.com/@AssemblyAI)
 - [Chatbot with INFINITE MEMORY using `OpenAI` & `Pinecone` - `GPT-3`, `Embeddings`, `ADA`, `Vector DB`, `Semantic`](https://youtu.be/2xNzB7xq8nk) by [David Shapiro ~ AI](https://www.youtube.com/@DavidShapiroAutomator)
@@ -34,7 +34,7 @@
  - [LangChain, Chroma DB, OpenAI Beginner Guide | ChatGPT with your PDF](https://youtu.be/FuqdVNB_8c0)
  - [LangChain 101: The Complete Beginner's Guide](https://youtu.be/P3MAbZ2eMUI)
 - [Custom langchain Agent & Tools with memory. Turn any `Python function` into langchain tool with Gpt 3](https://youtu.be/NIG8lXk0ULg) by [echohive](https://www.youtube.com/@echohive)
- [LangChain: Run Language Models Locally - `Hugging Face Models`](https://youtu.be/Xxxuw4_iCzw) by [Prompt Engineering](https://www.youtube.com/@engineerprompt)
+- [Building AI LLM Apps with LangChain (and more?) - LIVE STREAM](https://www.youtube.com/live/M-2Cj_2fzWI?feature=share) by [Nicholas Renotte](https://www.youtube.com/@NicholasRenotte)
 - [`ChatGPT` with any `YouTube` video using langchain and `chromadb`](https://youtu.be/TQZfB2bzVwU) by [echohive](https://www.youtube.com/@echohive)
 - [How to Talk to a `PDF` using LangChain and `ChatGPT`](https://youtu.be/v2i1YDtrIwk) by [Automata Learning Lab](https://www.youtube.com/@automatalearninglab)
 - [Langchain Document Loaders Part 1: Unstructured Files](https://youtu.be/O5C0wfsen98) by [Merk](https://www.youtube.com/@merksworld) 
@@ -67,7 +67,6 @@
 - [Use Large Language Models in Jupyter Notebook | LangChain | Agents & Indexes](https://youtu.be/JSe11L1a_QQ) by [Abhinaw Tiwari](https://www.youtube.com/@AbhinawTiwariAT)
 - [How to Talk to Your Langchain Agent | `11 Labs` + `Whisper`](https://youtu.be/N4k459Zw2PU) by [VRSEN](https://www.youtube.com/@vrsen)
 - [LangChain Deep Dive: 5 FUN AI App Ideas To Build Quickly and Easily](https://youtu.be/mPYEPzLkeks) by [James NoCode](https://www.youtube.com/@jamesnocode)
- [BEST OPEN Alternative to OPENAI's EMBEDDINGs for Retrieval QA: LangChain](https://youtu.be/ogEalPMUCSY) by [Prompt Engineering](https://www.youtube.com/@engineerprompt)
 - [LangChain 101: Models](https://youtu.be/T6c_XsyaNSQ) by [Mckay Wrigley](https://www.youtube.com/@realmckaywrigley)
 - [LangChain with JavaScript Tutorial #1 | Setup & Using LLMs](https://youtu.be/W3AoeMrg27o) by [Leon van Zyl](https://www.youtube.com/@leonvanzyl)
 - [LangChain Overview & Tutorial for Beginners: Build Powerful AI Apps Quickly & Easily (ZERO CODE)](https://youtu.be/iI84yym473Q) by [James NoCode](https://www.youtube.com/@jamesnocode)
@@ -91,15 +90,36 @@
 - [Chat with Multiple `PDFs` | LangChain App Tutorial in Python (Free LLMs and Embeddings)](https://youtu.be/dXxQ0LR-3Hg) by [Alejandro AO - Software & Ai](https://www.youtube.com/@alejandro_ao)
 - [Chat with a `CSV` | `LangChain Agents` Tutorial (Beginners)](https://youtu.be/tjeti5vXWOU) by [Alejandro AO - Software & Ai](https://www.youtube.com/@alejandro_ao)
 - [Create Your Own ChatGPT with `PDF` Data in 5 Minutes (LangChain Tutorial)](https://youtu.be/au2WVVGUvc8) by [Liam Ottley](https://www.youtube.com/@LiamOttley)
- [Using ChatGPT with YOUR OWN Data. This is magical. (LangChain OpenAI API)](https://youtu.be/9AXP7tCI9PI) by [TechLead](https://www.youtube.com/@TechLead)
 - [Build a Custom Chatbot with OpenAI: `GPT-Index` & LangChain | Step-by-Step Tutorial](https://youtu.be/FIDv6nc4CgU) by [Fabrikod](https://www.youtube.com/@fabrikod)
 - [`Flowise` is an open source no-code UI visual tool to build 🦜🔗LangChain applications](https://youtu.be/CovAPtQPU0k) by [Cobus Greyling](https://www.youtube.com/@CobusGreylingZA)
 - [LangChain & GPT 4 For Data Analysis: The `Pandas` Dataframe Agent](https://youtu.be/rFQ5Kmkd4jc) by [Rabbitmetrics](https://www.youtube.com/@rabbitmetrics)
 - [`GirlfriendGPT` - AI girlfriend with LangChain](https://youtu.be/LiN3D1QZGQw) by [Toolfinder AI](https://www.youtube.com/@toolfinderai)
- [`PrivateGPT`: Chat to your FILES OFFLINE and FREE [Installation and Tutorial]](https://youtu.be/G7iLllmx4qc) by [Prompt Engineering](https://www.youtube.com/@engineerprompt)
 - [How to build with Langchain 10x easier | ⛓️ LangFlow & `Flowise`](https://youtu.be/Ya1oGL7ZTvU) by [AI Jason](https://www.youtube.com/@AIJasonZ)
 - [Getting Started With LangChain In 20 Minutes- Build Celebrity Search Application](https://youtu.be/_FpT1cwcSLg) by [Krish Naik](https://www.youtube.com/@krishnaik06)
- ⛓ [LangChain HowTo and Guides YouTube playlist](https://www.youtube.com/playlist?list=PL8motc6AQftk1Bs42EW45kwYbyJ4jOdiZ) by [Sam Witteveen](https://www.youtube.com/@samwitteveenai/)
+- ⛓ [Vector Embeddings Tutorial – Code Your Own AI Assistant with `GPT-4 API` + LangChain + NLP](https://youtu.be/yfHHvmaMkcA?si=5uJhxoh2tvdnOXok) by [FreeCodeCamp.org](https://www.youtube.com/@freecodecamp)
+- ⛓ [Fully LOCAL `Llama 2` Q&A with LangChain](https://youtu.be/wgYctKFnQ74?si=UX1F3W-B3MqF4-K-) by [1littlecoder](https://www.youtube.com/@1littlecoder)
+- ⛓ [Fully LOCAL `Llama 2` Langchain on CPU](https://youtu.be/yhECvKMu8kM?si=IvjxwlA1c09VwHZ4) by [1littlecoder](https://www.youtube.com/@1littlecoder)
+- ⛓ [Build LangChain Audio Apps with Python in 5 Minutes](https://youtu.be/7w7ysaDz2W4?si=BvdMiyHhormr2-vr) by [AssemblyAI](https://www.youtube.com/@AssemblyAI)
+- ⛓ [`Voiceflow` & `Flowise`: Want to Beat Competition? New Tutorial with Real AI Chatbot](https://youtu.be/EZKkmeFwag0?si=-4dETYDHEstiK_bb) by [AI SIMP](https://www.youtube.com/@aisimp)
+- ⛓ [THIS Is How You Build Production-Ready AI Apps (`LangSmith` Tutorial)](https://youtu.be/tFXm5ijih98?si=lfiqpyaivxHFyI94) by [Dave Ebbelaar](https://www.youtube.com/@daveebbelaar)
+- ⛓ [Build POWERFUL LLM Bots EASILY with Your Own Data - `Embedchain` - Langchain 2.0? (Tutorial)](https://youtu.be/jE24Y_GasE8?si=0yEDZt3BK5Q-LIuF) by [WorldofAI](https://www.youtube.com/@intheworldofai)
+- ⛓ [`Code Llama` powered Gradio App for Coding: Runs on CPU](https://youtu.be/AJOhV6Ryy5o?si=ouuQT6IghYlc1NEJ) by [AI Anytime](https://www.youtube.com/@AIAnytime)
+- ⛓ [LangChain Complete Course in One Video | Develop LangChain (AI) Based Solutions for Your Business](https://youtu.be/j9mQd-MyIg8?si=_wlNT3nP2LpDKztZ) by [UBprogrammer](https://www.youtube.com/@UBprogrammer)
+- ⛓ [How to Run `LLaMA` Locally on CPU or GPU | Python & Langchain & CTransformers Guide](https://youtu.be/SvjWDX2NqiM?si=DxFml8XeGhiLTzLV) by [Code With Prince](https://www.youtube.com/@CodeWithPrince)
+- ⛓ [PyData Heidelberg #11 - TimeSeries Forecasting & LLM Langchain](https://www.youtube.com/live/Glbwb5Hxu18?si=PIEY8Raq_C9PCHuW) by [PyData](https://www.youtube.com/@PyDataTV)
+- ⛓ [Prompt Engineering in Web Development | Using LangChain and Templates with OpenAI](https://youtu.be/pK6WzlTOlYw?si=fkcDQsBG2h-DM8uQ) by [Akamai Developer
+](https://www.youtube.com/@AkamaiDeveloper)
+- ⛓ [Retrieval-Augmented Generation (RAG) using LangChain and `Pinecone` - The RAG Special Episode](https://youtu.be/J_tCD_J6w3s?si=60Mnr5VD9UED9bGG) by [Generative AI and Data Science On AWS](https://www.youtube.com/@GenerativeAIDataScienceOnAWS)
+- ⛓ [`LLAMA2 70b-chat` Multiple Documents Chatbot with Langchain & Streamlit |All OPEN SOURCE|Replicate API](https://youtu.be/vhghB81vViM?si=dszzJnArMeac7lyc) by [DataInsightEdge](https://www.youtube.com/@DataInsightEdge01)
+- ⛓ [Chatting with 44K Fashion Products: LangChain Opportunities and Pitfalls](https://youtu.be/Zudgske0F_s?si=8HSshHoEhh0PemJA) by [Rabbitmetrics](https://www.youtube.com/@rabbitmetrics)
+- ⛓ [Structured Data Extraction from `ChatGPT` with LangChain](https://youtu.be/q1lYg8JISpQ?si=0HctzOHYZvq62sve) by [MG](https://www.youtube.com/@MG_cafe)
+- ⛓ [Chat with Multiple PDFs using `Llama 2`, `Pinecone` and LangChain (Free LLMs and Embeddings)](https://youtu.be/TcJ_tVSGS4g?si=FZYnMDJyoFfL3Z2i) by [Muhammad Moin](https://www.youtube.com/@muhammadmoinfaisal)
+- ⛓ [Integrate Audio into `LangChain.js` apps in 5 Minutes](https://youtu.be/hNpUSaYZIzs?si=Gb9h7W9A8lzfvFKi) by [AssemblyAI](https://www.youtube.com/@AssemblyAI)
+- ⛓ [`ChatGPT` for your data with Local LLM](https://youtu.be/bWrjpwhHEMU?si=uM6ZZ18z9og4M90u) by [Jacob Jedryszek](https://www.youtube.com/@jj09)
+- ⛓ [Training `Chatgpt` with your personal data using langchain step by step in detail](https://youtu.be/j3xOMde2v9Y?si=179HsiMU-hEPuSs4) by [NextGen Machines](https://www.youtube.com/@MayankGupta-kb5yc)
+- ⛓ [Use ANY language in `LangSmith` with REST](https://youtu.be/7BL0GEdMmgY?si=iXfOEdBLqXF6hqRM) by [Nerding I/O](https://www.youtube.com/@nerding_io)
+- ⛓ [How to Leverage the Full Potential of LLMs for Your Business with Langchain - Leon Ruddat](https://youtu.be/vZmoEa7oWMg?si=ZhMmydq7RtkZd56Q) by [PyData](https://www.youtube.com/@PyDataTV)
+- ⛓ [`ChatCSV` App: Chat with CSV files using LangChain and `Llama 2`](https://youtu.be/PvsMg6jFs8E?si=Qzg5u5gijxj933Ya) by [Muhammad Moin](https://www.youtube.com/@muhammadmoinfaisal)


 ### [Prompt Engineering and LangChain](https://www.youtube.com/watch?v=muXbPpG_ys4&list=PLEJK-H61Xlwzm5FYLDdKt_6yibO33zoMW) by [Venelin Valkov](https://www.youtube.com/@venelin_valkov)
@@ -112,4 +132,4 @@


 ---------------------
-⛓ icon marks a new addition [last update 2023-06-20]
+⛓ icon marks a new addition [last update 2023-09-21]
--- a/docs/extras/expression_language/interface.ipynb
+++ b/docs/extras/expression_language/interface.ipynb
@@ -8,7 +8,7 @@
    "---\n",
    "sidebar_position: 0\n",
    "title: Interface\n",
-    "---"
+    "---\n"
   ]
  },
  {
@@ -18,15 +18,16 @@
   "source": [
    "In an effort to make it as easy as possible to create custom chains, we've implemented a [\"Runnable\"](https://api.python.langchain.com/en/latest/schema/langchain.schema.runnable.Runnable.html#langchain.schema.runnable.Runnable) protocol that most components implement. This is a standard interface with a few different methods, which makes it easy to define custom chains as well as making it possible to invoke them in a standard way. The standard interface exposed includes:\n",
    "\n",
-    "- `stream`: stream back chunks of the response\n",
-    "- `invoke`: call the chain on an input\n",
-    "- `batch`: call the chain on a list of inputs\n",
+    "- [`stream`](#stream): stream back chunks of the response\n",
+    "- [`invoke`](#invoke): call the chain on an input\n",
+    "- [`batch`](#batch): call the chain on a list of inputs\n",
    "\n",
    "These also have corresponding async methods:\n",
    "\n",
-    "- `astream`: stream back chunks of the response async\n",
-    "- `ainvoke`: call the chain on an input async\n",
-    "- `abatch`: call the chain on a list of inputs async\n",
+    "- [`astream`](#async-stream): stream back chunks of the response async\n",
+    "- [`ainvoke`](#async-invoke): call the chain on an input async\n",
+    "- [`abatch`](#async-batch): call the chain on a list of inputs async\n",
+    "- [`astream_log`](#async-stream-intermediate-steps): stream back intermediate steps as they happen, in addition to the final response\n",
    "\n",
    "The type of the input varies by component:\n",
    "\n",
@@ -34,7 +35,9 @@
    "| --- | --- |\n",
    "|Prompt|Dictionary|\n",
    "|Retriever|Single string|\n",
-    "|Model| Single string, list of chat messages or a PromptValue|\n",
+    "|LLM, ChatModel| Single string, list of chat messages or a PromptValue|\n",
+    "|Tool|Single string, or dictionary, depending on the tool|\n",
+    "|OutputParser|The output of an LLM or ChatModel|\n",
    "\n",
    "The output type also varies by component:\n",
    "\n",
@@ -44,6 +47,12 @@
    "| ChatModel | ChatMessage |\n",
    "| Prompt | PromptValue |\n",
    "| Retriever | List of documents |\n",
+    "| Tool | Depends on the tool |\n",
+    "| OutputParser | Depends on the parser |\n",
+    "\n",
+    "All runnables expose properties to inspect the input and output types:\n",
+    "- [`input_schema`](#input-schema): an input Pydantic model auto-generated from the structure of the Runnable\n",
+    "- [`output_schema`](#output-schema): an output Pydantic model auto-generated from the structure of the Runnable\n",
    "\n",
    "Let's take a look at these methods! To do so, we'll create a super simple PromptTemplate + ChatModel chain."
   ]
@@ -56,7 +65,7 @@
   "outputs": [],
   "source": [
    "from langchain.prompts import ChatPromptTemplate\n",
-    "from langchain.chat_models import ChatOpenAI"
+    "from langchain.chat_models import ChatOpenAI\n"
   ]
  },
  {
@@ -66,7 +75,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "model = ChatOpenAI()"
+    "model = ChatOpenAI()\n"
   ]
  },
  {
@@ -76,7 +85,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "prompt = ChatPromptTemplate.from_template(\"tell me a joke about {topic}\")"
+    "prompt = ChatPromptTemplate.from_template(\"tell me a joke about {topic}\")\n"
   ]
  },
  {
@@ -86,7 +95,156 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "chain = prompt | model"
+    "chain = prompt | model\n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "5cccdf0b-2d89-4f74-9530-bf499610e9a5",
+   "metadata": {},
+   "source": [
+    "## Input Schema\n",
+    "\n",
+    "A description of the inputs accepted by a Runnable.\n",
+    "This is a Pydantic model dynamically generated from the structure of any Runnable.\n",
+    "You can call `.schema()` on it to obtain a JSONSchema representation."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "id": "25e146d4-60da-40a2-9026-b5dfee106a3f",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "{'title': 'PromptInput',\n",
+       " 'type': 'object',\n",
+       " 'properties': {'topic': {'title': 'Topic', 'type': 'string'}}}"
+      ]
+     },
+     "execution_count": 5,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "# The input schema of the chain is the input schema of its first part, the prompt.\n",
+    "chain.input_schema.schema()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "5059a5dc-d544-4add-85bd-78a3f2b78b9a",
+   "metadata": {},
+   "source": [
+    "## Output Schema\n",
+    "\n",
+    "A description of the outputs produced by a Runnable.\n",
+    "This is a Pydantic model dynamically generated from the structure of any Runnable.\n",
+    "You can call `.schema()` on it to obtain a JSONSchema representation."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "id": "a0e41fd3-77d8-4911-af6a-d4d3aad5f77b",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "{'title': 'ChatOpenAIOutput',\n",
+       " 'anyOf': [{'$ref': '#/definitions/HumanMessageChunk'},\n",
+       "  {'$ref': '#/definitions/AIMessageChunk'},\n",
+       "  {'$ref': '#/definitions/ChatMessageChunk'},\n",
+       "  {'$ref': '#/definitions/FunctionMessageChunk'},\n",
+       "  {'$ref': '#/definitions/SystemMessageChunk'}],\n",
+       " 'definitions': {'HumanMessageChunk': {'title': 'HumanMessageChunk',\n",
+       "   'description': 'A Human Message chunk.',\n",
+       "   'type': 'object',\n",
+       "   'properties': {'content': {'title': 'Content', 'type': 'string'},\n",
+       "    'additional_kwargs': {'title': 'Additional Kwargs', 'type': 'object'},\n",
+       "    'type': {'title': 'Type',\n",
+       "     'default': 'human',\n",
+       "     'enum': ['human'],\n",
+       "     'type': 'string'},\n",
+       "    'example': {'title': 'Example', 'default': False, 'type': 'boolean'},\n",
+       "    'is_chunk': {'title': 'Is Chunk',\n",
+       "     'default': True,\n",
+       "     'enum': [True],\n",
+       "     'type': 'boolean'}},\n",
+       "   'required': ['content']},\n",
+       "  'AIMessageChunk': {'title': 'AIMessageChunk',\n",
+       "   'description': 'A Message chunk from an AI.',\n",
+       "   'type': 'object',\n",
+       "   'properties': {'content': {'title': 'Content', 'type': 'string'},\n",
+       "    'additional_kwargs': {'title': 'Additional Kwargs', 'type': 'object'},\n",
+       "    'type': {'title': 'Type',\n",
+       "     'default': 'ai',\n",
+       "     'enum': ['ai'],\n",
+       "     'type': 'string'},\n",
+       "    'example': {'title': 'Example', 'default': False, 'type': 'boolean'},\n",
+       "    'is_chunk': {'title': 'Is Chunk',\n",
+       "     'default': True,\n",
+       "     'enum': [True],\n",
+       "     'type': 'boolean'}},\n",
+       "   'required': ['content']},\n",
+       "  'ChatMessageChunk': {'title': 'ChatMessageChunk',\n",
+       "   'description': 'A Chat Message chunk.',\n",
+       "   'type': 'object',\n",
+       "   'properties': {'content': {'title': 'Content', 'type': 'string'},\n",
+       "    'additional_kwargs': {'title': 'Additional Kwargs', 'type': 'object'},\n",
+       "    'type': {'title': 'Type',\n",
+       "     'default': 'chat',\n",
+       "     'enum': ['chat'],\n",
+       "     'type': 'string'},\n",
+       "    'role': {'title': 'Role', 'type': 'string'},\n",
+       "    'is_chunk': {'title': 'Is Chunk',\n",
+       "     'default': True,\n",
+       "     'enum': [True],\n",
+       "     'type': 'boolean'}},\n",
+       "   'required': ['content', 'role']},\n",
+       "  'FunctionMessageChunk': {'title': 'FunctionMessageChunk',\n",
+       "   'description': 'A Function Message chunk.',\n",
+       "   'type': 'object',\n",
+       "   'properties': {'content': {'title': 'Content', 'type': 'string'},\n",
+       "    'additional_kwargs': {'title': 'Additional Kwargs', 'type': 'object'},\n",
+       "    'type': {'title': 'Type',\n",
+       "     'default': 'function',\n",
+       "     'enum': ['function'],\n",
+       "     'type': 'string'},\n",
+       "    'name': {'title': 'Name', 'type': 'string'},\n",
+       "    'is_chunk': {'title': 'Is Chunk',\n",
+       "     'default': True,\n",
+       "     'enum': [True],\n",
+       "     'type': 'boolean'}},\n",
+       "   'required': ['content', 'name']},\n",
+       "  'SystemMessageChunk': {'title': 'SystemMessageChunk',\n",
+       "   'description': 'A System Message chunk.',\n",
+       "   'type': 'object',\n",
+       "   'properties': {'content': {'title': 'Content', 'type': 'string'},\n",
+       "    'additional_kwargs': {'title': 'Additional Kwargs', 'type': 'object'},\n",
+       "    'type': {'title': 'Type',\n",
+       "     'default': 'system',\n",
+       "     'enum': ['system'],\n",
+       "     'type': 'string'},\n",
+       "    'is_chunk': {'title': 'Is Chunk',\n",
+       "     'default': True,\n",
+       "     'enum': [True],\n",
+       "     'type': 'boolean'}},\n",
+       "   'required': ['content']}}}"
+      ]
+     },
+     "execution_count": 6,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "# The output schema of the chain is the output schema of its last part, in this case a ChatModel, which outputs a ChatMessage\n",
+    "chain.output_schema.schema()"
   ]
  },
  {
@@ -99,7 +257,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 8,
+   "execution_count": 7,
   "id": "bea9639d",
   "metadata": {},
   "outputs": [
@@ -107,9 +265,7 @@
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      "Sure, here's a bear-themed joke for you:\n",
-      "\n",
-      "Why don't bears wear shoes?\n",
+      "Why don't bears wear shoes? \n",
      "\n",
      "Because they have bear feet!"
     ]
@@ -117,7 +273,7 @@
   ],
   "source": [
    "for s in chain.stream({\"topic\": \"bears\"}):\n",
-    "    print(s.content, end=\"\", flush=True)"
+    "    print(s.content, end=\"\", flush=True)\n"
   ]
  },
  {
@@ -130,23 +286,23 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 9,
+   "execution_count": 8,
   "id": "470e483f",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "AIMessage(content=\"Why don't bears wear shoes?\\n\\nBecause they already have bear feet!\", additional_kwargs={}, example=False)"
+       "AIMessage(content=\"Why don't bears wear shoes?\\n\\nBecause they have bear feet!\")"
      ]
     },
-     "execution_count": 9,
+     "execution_count": 8,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
-    "chain.invoke({\"topic\": \"bears\"})"
+    "chain.invoke({\"topic\": \"bears\"})\n"
   ]
  },
  {
@@ -159,24 +315,24 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 19,
+   "execution_count": 9,
   "id": "9685de67",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "[AIMessage(content=\"Why don't bears ever wear shoes?\\n\\nBecause they have bear feet!\", additional_kwargs={}, example=False),\n",
-       " AIMessage(content=\"Why don't cats play poker in the wild?\\n\\nToo many cheetahs!\", additional_kwargs={}, example=False)]"
+       "[AIMessage(content=\"Why don't bears wear shoes?\\n\\nBecause they have bear feet!\"),\n",
+       " AIMessage(content=\"Why don't cats play poker in the wild?\\n\\nToo many cheetahs!\")]"
      ]
     },
-     "execution_count": 19,
+     "execution_count": 9,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
-    "chain.batch([{\"topic\": \"bears\"}, {\"topic\": \"cats\"}])"
+    "chain.batch([{\"topic\": \"bears\"}, {\"topic\": \"cats\"}])\n"
   ]
  },
  {
@@ -189,24 +345,24 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 5,
+   "execution_count": 10,
   "id": "a08522f6",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "[AIMessage(content=\"Why don't bears wear shoes?\\n\\nBecause they have bear feet!\", additional_kwargs={}, example=False),\n",
-       " AIMessage(content=\"Why don't cats play poker in the wild?\\n\\nToo many cheetahs!\", additional_kwargs={}, example=False)]"
+       "[AIMessage(content=\"Why don't bears wear shoes?\\n\\nBecause they have bear feet!\"),\n",
+       " AIMessage(content=\"Sure, here's a cat joke for you:\\n\\nWhy don't cats play poker in the wild?\\n\\nToo many cheetahs!\")]"
      ]
     },
-     "execution_count": 5,
+     "execution_count": 10,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
-    "chain.batch([{\"topic\": \"bears\"}, {\"topic\": \"cats\"}], config={\"max_concurrency\": 5})"
+    "chain.batch([{\"topic\": \"bears\"}, {\"topic\": \"cats\"}], config={\"max_concurrency\": 5})\n"
   ]
  },
  {
@@ -219,7 +375,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 13,
+   "execution_count": 11,
   "id": "ea35eee4",
   "metadata": {},
   "outputs": [
@@ -227,6 +383,8 @@
     "name": "stdout",
     "output_type": "stream",
     "text": [
+      "Sure, here's a bear joke for you:\n",
+      "\n",
      "Why don't bears wear shoes?\n",
      "\n",
      "Because they have bear feet!"
@@ -235,7 +393,7 @@
   ],
   "source": [
    "async for s in chain.astream({\"topic\": \"bears\"}):\n",
-    "    print(s.content, end=\"\", flush=True)"
+    "    print(s.content, end=\"\", flush=True)\n"
   ]
  },
  {
@@ -248,23 +406,23 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 16,
+   "execution_count": 12,
   "id": "ef8c9b20",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "AIMessage(content=\"Sure, here you go:\\n\\nWhy don't bears wear shoes?\\n\\nBecause they have bear feet!\", additional_kwargs={}, example=False)"
+       "AIMessage(content=\"Why don't bears wear shoes? \\n\\nBecause they have bear feet!\")"
      ]
     },
-     "execution_count": 16,
+     "execution_count": 12,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
-    "await chain.ainvoke({\"topic\": \"bears\"})"
+    "await chain.ainvoke({\"topic\": \"bears\"})\n"
   ]
  },
  {
@@ -277,39 +435,371 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 18,
+   "execution_count": 13,
   "id": "eba2a103",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "[AIMessage(content=\"Why don't bears wear shoes?\\n\\nBecause they have bear feet!\", additional_kwargs={}, example=False)]"
+       "[AIMessage(content=\"Why don't bears wear shoes?\\n\\nBecause they have bear feet!\")]"
      ]
     },
-     "execution_count": 18,
+     "execution_count": 13,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
-    "await chain.abatch([{\"topic\": \"bears\"}])"
+    "await chain.abatch([{\"topic\": \"bears\"}])\n"
   ]
  },
  {
   "cell_type": "markdown",
-   "id": "0a1c409d",
+   "id": "f9cef104",
+   "metadata": {},
+   "source": [
+    "## Async Stream Intermediate Steps\n",
+    "\n",
+    "All runnables also have a method `.astream_log()` which can be used to stream (as they happen) all or part of the intermediate steps of your chain/sequence. \n",
+    "\n",
+    "This is useful eg. to show progress to the user, to use intermediate results, or even just to debug your chain.\n",
+    "\n",
+    "You can choose to stream all steps (default), or include/exclude steps by name, tags or metadata.\n",
+    "\n",
+    "This method yields [JSONPatch](https://jsonpatch.com) ops that when applied in the same order as received build up the RunState.\n",
+    "\n",
+    "```python\n",
+    "class LogEntry(TypedDict):\n",
+    "    id: str\n",
+    "    \"\"\"ID of the sub-run.\"\"\"\n",
+    "    name: str\n",
+    "    \"\"\"Name of the object being run.\"\"\"\n",
+    "    type: str\n",
+    "    \"\"\"Type of the object being run, eg. prompt, chain, llm, etc.\"\"\"\n",
+    "    tags: List[str]\n",
+    "    \"\"\"List of tags for the run.\"\"\"\n",
+    "    metadata: Dict[str, Any]\n",
+    "    \"\"\"Key-value pairs of metadata for the run.\"\"\"\n",
+    "    start_time: str\n",
+    "    \"\"\"ISO-8601 timestamp of when the run started.\"\"\"\n",
+    "\n",
+    "    streamed_output_str: List[str]\n",
+    "    \"\"\"List of LLM tokens streamed by this run, if applicable.\"\"\"\n",
+    "    final_output: Optional[Any]\n",
+    "    \"\"\"Final output of this run.\n",
+    "    Only available after the run has finished successfully.\"\"\"\n",
+    "    end_time: Optional[str]\n",
+    "    \"\"\"ISO-8601 timestamp of when the run ended.\n",
+    "    Only available after the run has finished.\"\"\"\n",
+    "\n",
+    "\n",
+    "class RunState(TypedDict):\n",
+    "    id: str\n",
+    "    \"\"\"ID of the run.\"\"\"\n",
+    "    streamed_output: List[Any]\n",
+    "    \"\"\"List of output chunks streamed by Runnable.stream()\"\"\"\n",
+    "    final_output: Optional[Any]\n",
+    "    \"\"\"Final output of the run, usually the result of aggregating (`+`) streamed_output.\n",
+    "    Only available after the run has finished successfully.\"\"\"\n",
+    "\n",
+    "    logs: Dict[str, LogEntry]\n",
+    "    \"\"\"Map of run names to sub-runs. If filters were supplied, this list will\n",
+    "    contain only the runs that matched the filters.\"\"\"\n",
+    "```"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "a146a5df-25be-4fa2-a7e4-df8ebe55a35e",
+   "metadata": {},
+   "source": [
+    "### Streaming JSONPatch chunks\n",
+    "\n",
+    "This is useful eg. to stream the JSONPatch in an HTTP server, and then apply the ops on the client to rebuild the run state there. See [LangServe](https://github.com/langchain-ai/langserve) for tooling to make it easier to build a webserver from any Runnable."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 14,
+   "id": "21c9019e",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "RunLogPatch({'op': 'replace',\n",
+      "  'path': '',\n",
+      "  'value': {'final_output': None,\n",
+      "            'id': 'fd6fcf62-c92c-4edf-8713-0fc5df000f62',\n",
+      "            'logs': {},\n",
+      "            'streamed_output': []}})\n",
+      "RunLogPatch({'op': 'add',\n",
+      "  'path': '/logs/Docs',\n",
+      "  'value': {'end_time': None,\n",
+      "            'final_output': None,\n",
+      "            'id': '8c998257-1ec8-4546-b744-c3fdb9728c41',\n",
+      "            'metadata': {},\n",
+      "            'name': 'Docs',\n",
+      "            'start_time': '2023-10-05T12:52:35.668',\n",
+      "            'streamed_output_str': [],\n",
+      "            'tags': ['map:key:context', 'FAISS'],\n",
+      "            'type': 'retriever'}})\n",
+      "RunLogPatch({'op': 'add',\n",
+      "  'path': '/logs/Docs/final_output',\n",
+      "  'value': {'documents': [Document(page_content='harrison worked at kensho')]}},\n",
+      " {'op': 'add',\n",
+      "  'path': '/logs/Docs/end_time',\n",
+      "  'value': '2023-10-05T12:52:36.033'})\n",
+      "RunLogPatch({'op': 'add', 'path': '/streamed_output/-', 'value': ''})\n",
+      "RunLogPatch({'op': 'add', 'path': '/streamed_output/-', 'value': 'H'})\n",
+      "RunLogPatch({'op': 'add', 'path': '/streamed_output/-', 'value': 'arrison'})\n",
+      "RunLogPatch({'op': 'add', 'path': '/streamed_output/-', 'value': ' worked'})\n",
+      "RunLogPatch({'op': 'add', 'path': '/streamed_output/-', 'value': ' at'})\n",
+      "RunLogPatch({'op': 'add', 'path': '/streamed_output/-', 'value': ' Kens'})\n",
+      "RunLogPatch({'op': 'add', 'path': '/streamed_output/-', 'value': 'ho'})\n",
+      "RunLogPatch({'op': 'add', 'path': '/streamed_output/-', 'value': '.'})\n",
+      "RunLogPatch({'op': 'add', 'path': '/streamed_output/-', 'value': ''})\n",
+      "RunLogPatch({'op': 'replace',\n",
+      "  'path': '/final_output',\n",
+      "  'value': {'output': 'Harrison worked at Kensho.'}})\n"
+     ]
+    }
+   ],
+   "source": [
+    "from langchain.embeddings import OpenAIEmbeddings\n",
+    "from langchain.schema.output_parser import StrOutputParser\n",
+    "from langchain.schema.runnable import RunnablePassthrough\n",
+    "from langchain.vectorstores import FAISS\n",
+    "\n",
+    "template = \"\"\"Answer the question based only on the following context:\n",
+    "{context}\n",
+    "\n",
+    "Question: {question}\n",
+    "\"\"\"\n",
+    "prompt = ChatPromptTemplate.from_template(template)\n",
+    "\n",
+    "vectorstore = FAISS.from_texts([\"harrison worked at kensho\"], embedding=OpenAIEmbeddings())\n",
+    "retriever = vectorstore.as_retriever()\n",
+    "\n",
+    "retrieval_chain = (\n",
+    "    {\"context\": retriever.with_config(run_name='Docs'), \"question\": RunnablePassthrough()}\n",
+    "    | prompt \n",
+    "    | model \n",
+    "    | StrOutputParser()\n",
+    ")\n",
+    "\n",
+    "async for chunk in retrieval_chain.astream_log(\"where did harrison work?\", include_names=['Docs']):\n",
+    "    print(chunk)\n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "19570f36-7126-4fe2-b209-0cc6178b4582",
+   "metadata": {},
+   "source": [
+    "### Streaming the incremental RunState\n",
+    "\n",
+    "You can simply pass diff=False to get incremental values of RunState."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 15,
+   "id": "5c26b731-b4eb-4967-a42a-dec813249ecb",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "RunLog({'final_output': None,\n",
+      " 'id': 'f95ccb87-31f1-48ea-a51c-d2dadde44185',\n",
+      " 'logs': {},\n",
+      " 'streamed_output': []})\n",
+      "RunLog({'final_output': None,\n",
+      " 'id': 'f95ccb87-31f1-48ea-a51c-d2dadde44185',\n",
+      " 'logs': {'Docs': {'end_time': None,\n",
+      "                   'final_output': None,\n",
+      "                   'id': '621597dd-d716-4532-938d-debc21a453d1',\n",
+      "                   'metadata': {},\n",
+      "                   'name': 'Docs',\n",
+      "                   'start_time': '2023-10-05T12:52:36.935',\n",
+      "                   'streamed_output_str': [],\n",
+      "                   'tags': ['map:key:context', 'FAISS'],\n",
+      "                   'type': 'retriever'}},\n",
+      " 'streamed_output': []})\n",
+      "RunLog({'final_output': None,\n",
+      " 'id': 'f95ccb87-31f1-48ea-a51c-d2dadde44185',\n",
+      " 'logs': {'Docs': {'end_time': '2023-10-05T12:52:37.217',\n",
+      "                   'final_output': {'documents': [Document(page_content='harrison worked at kensho')]},\n",
+      "                   'id': '621597dd-d716-4532-938d-debc21a453d1',\n",
+      "                   'metadata': {},\n",
+      "                   'name': 'Docs',\n",
+      "                   'start_time': '2023-10-05T12:52:36.935',\n",
+      "                   'streamed_output_str': [],\n",
+      "                   'tags': ['map:key:context', 'FAISS'],\n",
+      "                   'type': 'retriever'}},\n",
+      " 'streamed_output': []})\n",
+      "RunLog({'final_output': None,\n",
+      " 'id': 'f95ccb87-31f1-48ea-a51c-d2dadde44185',\n",
+      " 'logs': {'Docs': {'end_time': '2023-10-05T12:52:37.217',\n",
+      "                   'final_output': {'documents': [Document(page_content='harrison worked at kensho')]},\n",
+      "                   'id': '621597dd-d716-4532-938d-debc21a453d1',\n",
+      "                   'metadata': {},\n",
+      "                   'name': 'Docs',\n",
+      "                   'start_time': '2023-10-05T12:52:36.935',\n",
+      "                   'streamed_output_str': [],\n",
+      "                   'tags': ['map:key:context', 'FAISS'],\n",
+      "                   'type': 'retriever'}},\n",
+      " 'streamed_output': ['']})\n",
+      "RunLog({'final_output': None,\n",
+      " 'id': 'f95ccb87-31f1-48ea-a51c-d2dadde44185',\n",
+      " 'logs': {'Docs': {'end_time': '2023-10-05T12:52:37.217',\n",
+      "                   'final_output': {'documents': [Document(page_content='harrison worked at kensho')]},\n",
+      "                   'id': '621597dd-d716-4532-938d-debc21a453d1',\n",
+      "                   'metadata': {},\n",
+      "                   'name': 'Docs',\n",
+      "                   'start_time': '2023-10-05T12:52:36.935',\n",
+      "                   'streamed_output_str': [],\n",
+      "                   'tags': ['map:key:context', 'FAISS'],\n",
+      "                   'type': 'retriever'}},\n",
+      " 'streamed_output': ['', 'H']})\n",
+      "RunLog({'final_output': None,\n",
+      " 'id': 'f95ccb87-31f1-48ea-a51c-d2dadde44185',\n",
+      " 'logs': {'Docs': {'end_time': '2023-10-05T12:52:37.217',\n",
+      "                   'final_output': {'documents': [Document(page_content='harrison worked at kensho')]},\n",
+      "                   'id': '621597dd-d716-4532-938d-debc21a453d1',\n",
+      "                   'metadata': {},\n",
+      "                   'name': 'Docs',\n",
+      "                   'start_time': '2023-10-05T12:52:36.935',\n",
+      "                   'streamed_output_str': [],\n",
+      "                   'tags': ['map:key:context', 'FAISS'],\n",
+      "                   'type': 'retriever'}},\n",
+      " 'streamed_output': ['', 'H', 'arrison']})\n",
+      "RunLog({'final_output': None,\n",
+      " 'id': 'f95ccb87-31f1-48ea-a51c-d2dadde44185',\n",
+      " 'logs': {'Docs': {'end_time': '2023-10-05T12:52:37.217',\n",
+      "                   'final_output': {'documents': [Document(page_content='harrison worked at kensho')]},\n",
+      "                   'id': '621597dd-d716-4532-938d-debc21a453d1',\n",
+      "                   'metadata': {},\n",
+      "                   'name': 'Docs',\n",
+      "                   'start_time': '2023-10-05T12:52:36.935',\n",
+      "                   'streamed_output_str': [],\n",
+      "                   'tags': ['map:key:context', 'FAISS'],\n",
+      "                   'type': 'retriever'}},\n",
+      " 'streamed_output': ['', 'H', 'arrison', ' worked']})\n",
+      "RunLog({'final_output': None,\n",
+      " 'id': 'f95ccb87-31f1-48ea-a51c-d2dadde44185',\n",
+      " 'logs': {'Docs': {'end_time': '2023-10-05T12:52:37.217',\n",
+      "                   'final_output': {'documents': [Document(page_content='harrison worked at kensho')]},\n",
+      "                   'id': '621597dd-d716-4532-938d-debc21a453d1',\n",
+      "                   'metadata': {},\n",
+      "                   'name': 'Docs',\n",
+      "                   'start_time': '2023-10-05T12:52:36.935',\n",
+      "                   'streamed_output_str': [],\n",
+      "                   'tags': ['map:key:context', 'FAISS'],\n",
+      "                   'type': 'retriever'}},\n",
+      " 'streamed_output': ['', 'H', 'arrison', ' worked', ' at']})\n",
+      "RunLog({'final_output': None,\n",
+      " 'id': 'f95ccb87-31f1-48ea-a51c-d2dadde44185',\n",
+      " 'logs': {'Docs': {'end_time': '2023-10-05T12:52:37.217',\n",
+      "                   'final_output': {'documents': [Document(page_content='harrison worked at kensho')]},\n",
+      "                   'id': '621597dd-d716-4532-938d-debc21a453d1',\n",
+      "                   'metadata': {},\n",
+      "                   'name': 'Docs',\n",
+      "                   'start_time': '2023-10-05T12:52:36.935',\n",
+      "                   'streamed_output_str': [],\n",
+      "                   'tags': ['map:key:context', 'FAISS'],\n",
+      "                   'type': 'retriever'}},\n",
+      " 'streamed_output': ['', 'H', 'arrison', ' worked', ' at', ' Kens']})\n",
+      "RunLog({'final_output': None,\n",
+      " 'id': 'f95ccb87-31f1-48ea-a51c-d2dadde44185',\n",
+      " 'logs': {'Docs': {'end_time': '2023-10-05T12:52:37.217',\n",
+      "                   'final_output': {'documents': [Document(page_content='harrison worked at kensho')]},\n",
+      "                   'id': '621597dd-d716-4532-938d-debc21a453d1',\n",
+      "                   'metadata': {},\n",
+      "                   'name': 'Docs',\n",
+      "                   'start_time': '2023-10-05T12:52:36.935',\n",
+      "                   'streamed_output_str': [],\n",
+      "                   'tags': ['map:key:context', 'FAISS'],\n",
+      "                   'type': 'retriever'}},\n",
+      " 'streamed_output': ['', 'H', 'arrison', ' worked', ' at', ' Kens', 'ho']})\n",
+      "RunLog({'final_output': None,\n",
+      " 'id': 'f95ccb87-31f1-48ea-a51c-d2dadde44185',\n",
+      " 'logs': {'Docs': {'end_time': '2023-10-05T12:52:37.217',\n",
+      "                   'final_output': {'documents': [Document(page_content='harrison worked at kensho')]},\n",
+      "                   'id': '621597dd-d716-4532-938d-debc21a453d1',\n",
+      "                   'metadata': {},\n",
+      "                   'name': 'Docs',\n",
+      "                   'start_time': '2023-10-05T12:52:36.935',\n",
+      "                   'streamed_output_str': [],\n",
+      "                   'tags': ['map:key:context', 'FAISS'],\n",
+      "                   'type': 'retriever'}},\n",
+      " 'streamed_output': ['', 'H', 'arrison', ' worked', ' at', ' Kens', 'ho', '.']})\n",
+      "RunLog({'final_output': None,\n",
+      " 'id': 'f95ccb87-31f1-48ea-a51c-d2dadde44185',\n",
+      " 'logs': {'Docs': {'end_time': '2023-10-05T12:52:37.217',\n",
+      "                   'final_output': {'documents': [Document(page_content='harrison worked at kensho')]},\n",
+      "                   'id': '621597dd-d716-4532-938d-debc21a453d1',\n",
+      "                   'metadata': {},\n",
+      "                   'name': 'Docs',\n",
+      "                   'start_time': '2023-10-05T12:52:36.935',\n",
+      "                   'streamed_output_str': [],\n",
+      "                   'tags': ['map:key:context', 'FAISS'],\n",
+      "                   'type': 'retriever'}},\n",
+      " 'streamed_output': ['',\n",
+      "                     'H',\n",
+      "                     'arrison',\n",
+      "                     ' worked',\n",
+      "                     ' at',\n",
+      "                     ' Kens',\n",
+      "                     'ho',\n",
+      "                     '.',\n",
+      "                     '']})\n",
+      "RunLog({'final_output': {'output': 'Harrison worked at Kensho.'},\n",
+      " 'id': 'f95ccb87-31f1-48ea-a51c-d2dadde44185',\n",
+      " 'logs': {'Docs': {'end_time': '2023-10-05T12:52:37.217',\n",
+      "                   'final_output': {'documents': [Document(page_content='harrison worked at kensho')]},\n",
+      "                   'id': '621597dd-d716-4532-938d-debc21a453d1',\n",
+      "                   'metadata': {},\n",
+      "                   'name': 'Docs',\n",
+      "                   'start_time': '2023-10-05T12:52:36.935',\n",
+      "                   'streamed_output_str': [],\n",
+      "                   'tags': ['map:key:context', 'FAISS'],\n",
+      "                   'type': 'retriever'}},\n",
+      " 'streamed_output': ['',\n",
+      "                     'H',\n",
+      "                     'arrison',\n",
+      "                     ' worked',\n",
+      "                     ' at',\n",
+      "                     ' Kens',\n",
+      "                     'ho',\n",
+      "                     '.',\n",
+      "                     '']})\n"
+     ]
+    }
+   ],
+   "source": [
+    "async for chunk in retrieval_chain.astream_log(\"where did harrison work?\", include_names=['Docs'], diff=False):\n",
+    "    print(chunk)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "7006f1aa",
   "metadata": {},
   "source": [
    "## Parallelism\n",
    "\n",
-    "Let's take a look at how LangChain Expression Language support parralel requests as much as possible. For example, when using a RunnableMapping (often written as a dictionary) it executes each element in parralel."
+    "Let's take a look at how LangChain Expression Language support parallel requests as much as possible. For example, when using a RunnableMap (often written as a dictionary) it executes each element in parallel."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 7,
-   "id": "e3014c7a",
+   "id": "0a1c409d",
   "metadata": {},
   "outputs": [],
   "source": [
@@ -319,7 +809,7 @@
    "combined = RunnableMap({\n",
    "    \"joke\": chain1,\n",
    "    \"poem\": chain2,\n",
-    "})"
+    "})\n"
   ]
  },
  {
@@ -349,7 +839,7 @@
   ],
   "source": [
    "%%time\n",
-    "chain1.invoke({\"topic\": \"bears\"})"
+    "chain1.invoke({\"topic\": \"bears\"})\n"
   ]
  },
  {
@@ -379,7 +869,7 @@
   ],
   "source": [
    "%%time\n",
-    "chain2.invoke({\"topic\": \"bears\"})"
+    "chain2.invoke({\"topic\": \"bears\"})\n"
   ]
  },
  {
@@ -410,7 +900,7 @@
   ],
   "source": [
    "%%time\n",
-    "combined.invoke({\"topic\": \"bears\"})"
+    "combined.invoke({\"topic\": \"bears\"})\n"
   ]
  },
  {
@@ -438,7 +928,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.11.5"
  }
 },
 "nbformat": 4,
--- a/docs/extras/guides/evaluation/string/scoring_eval_chain.ipynb
+++ b/docs/extras/guides/evaluation/string/scoring_eval_chain.ipynb
@@ -0,0 +1,330 @@
+{
+ "cells": [
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Scoring Evaluator\n",
+    "\n",
+    "The Scoring Evaluator instructs a language model to assess your model's predictions on a specified scale (default is 1-10) based on your custom criteria or rubric. This feature provides a nuanced evaluation instead of a simplistic binary score, aiding in evaluating models against tailored rubrics and comparing model performance on specific tasks.\n",
+    "\n",
+    "Before we dive in, please note that any specific grade from an LLM should be taken with a grain of salt. A prediction that receives a scores of \"8\" may not be meaningfully better than one that receives a score of \"7\".\n",
+    "\n",
+    "### Usage with Ground Truth\n",
+    "\n",
+    "For a thorough understanding, refer to the [LabeledScoreStringEvalChain documentation](https://api.python.langchain.com/en/latest/evaluation/langchain.evaluation.scoring.eval_chain.LabeledScoreStringEvalChain.html#langchain.evaluation.scoring.eval_chain.LabeledScoreStringEvalChain).\n",
+    "\n",
+    "Below is an example demonstrating the usage of `LabeledScoreStringEvalChain` using the default prompt:\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.evaluation import load_evaluator\n",
+    "from langchain.chat_models import ChatOpenAI\n",
+    "\n",
+    "evaluator = load_evaluator(\"labeled_score_string\", llm=ChatOpenAI(model=\"gpt-4\"))"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 12,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "{'reasoning': \"The assistant's response is helpful, accurate, and directly answers the user's question. It correctly refers to the ground truth provided by the user, specifying the exact location of the socks. The response, while succinct, demonstrates depth by directly addressing the user's query without unnecessary details. Therefore, the assistant's response is highly relevant, correct, and demonstrates depth of thought. \\n\\nRating: [[10]]\", 'score': 10}\n"
+     ]
+    }
+   ],
+   "source": [
+    "# Correct\n",
+    "eval_result = evaluator.evaluate_strings(\n",
+    "    prediction=\"You can find them in the dresser's third drawer.\",\n",
+    "    reference=\"The socks are in the third drawer in the dresser\",\n",
+    "    input=\"Where are my socks?\"\n",
+    ")\n",
+    "print(eval_result)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "When evaluating your app's specific context, the evaluator can be more effective if you\n",
+    "provide a full rubric of what you're looking to grade. Below is an example using accuracy."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 13,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "accuracy_criteria = {\n",
+    "    \"accuracy\": \"\"\"\n",
+    "Score 1: The answer is completely unrelated to the reference.\n",
+    "Score 3: The answer has minor relevance but does not align with the reference.\n",
+    "Score 5: The answer has moderate relevance but contains inaccuracies.\n",
+    "Score 7: The answer aligns with the reference but has minor errors or omissions.\n",
+    "Score 10: The answer is completely accurate and aligns perfectly with the reference.\"\"\"\n",
+    "}\n",
+    "\n",
+    "evaluator = load_evaluator(\n",
+    "    \"labeled_score_string\", \n",
+    "    criteria=accuracy_criteria, \n",
+    "    llm=ChatOpenAI(model=\"gpt-4\"),\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 14,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "{'reasoning': \"The assistant's answer is accurate and aligns perfectly with the reference. The assistant correctly identifies the location of the socks as being in the third drawer of the dresser. Rating: [[10]]\", 'score': 10}\n"
+     ]
+    }
+   ],
+   "source": [
+    "# Correct\n",
+    "eval_result = evaluator.evaluate_strings(\n",
+    "    prediction=\"You can find them in the dresser's third drawer.\",\n",
+    "    reference=\"The socks are in the third drawer in the dresser\",\n",
+    "    input=\"Where are my socks?\"\n",
+    ")\n",
+    "print(eval_result)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 15,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "{'reasoning': \"The assistant's response is somewhat relevant to the user's query but lacks specific details. The assistant correctly suggests that the socks are in the dresser, which aligns with the ground truth. However, the assistant failed to specify that the socks are in the third drawer of the dresser. This omission could lead to confusion for the user. Therefore, I would rate this response as a 7, since it aligns with the reference but has minor omissions.\\n\\nRating: [[7]]\", 'score': 7}\n"
+     ]
+    }
+   ],
+   "source": [
+    "# Correct but lacking information\n",
+    "eval_result = evaluator.evaluate_strings(\n",
+    "    prediction=\"You can find them in the dresser.\",\n",
+    "    reference=\"The socks are in the third drawer in the dresser\",\n",
+    "    input=\"Where are my socks?\"\n",
+    ")\n",
+    "print(eval_result)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 16,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "{'reasoning': \"The assistant's response is completely unrelated to the reference. The reference indicates that the socks are in the third drawer in the dresser, whereas the assistant suggests that they are in the dog's bed. This is completely inaccurate. Rating: [[1]]\", 'score': 1}\n"
+     ]
+    }
+   ],
+   "source": [
+    "# Incorrect\n",
+    "eval_result = evaluator.evaluate_strings(\n",
+    "    prediction=\"You can find them in the dog's bed.\",\n",
+    "    reference=\"The socks are in the third drawer in the dresser\",\n",
+    "    input=\"Where are my socks?\"\n",
+    ")\n",
+    "print(eval_result)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "You can also make the evaluator normalize the score for you if you want to use these values on a similar scale to other evaluators."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 18,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "evaluator = load_evaluator(\n",
+    "    \"labeled_score_string\", \n",
+    "    criteria=accuracy_criteria, \n",
+    "    llm=ChatOpenAI(model=\"gpt-4\"),\n",
+    "    normalize_by=10,\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 20,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "{'reasoning': \"The assistant's response is partially accurate. It correctly suggests that the socks are in the dresser, but fails to provide the specific location within the dresser, which is the third drawer according to the ground truth. Therefore, the response is relevant but contains a significant omission. Rating: [[7]].\", 'score': 0.7}\n"
+     ]
+    }
+   ],
+   "source": [
+    "# Correct but lacking information\n",
+    "eval_result = evaluator.evaluate_strings(\n",
+    "    prediction=\"You can find them in the dresser.\",\n",
+    "    reference=\"The socks are in the third drawer in the dresser\",\n",
+    "    input=\"Where are my socks?\"\n",
+    ")\n",
+    "print(eval_result)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "\n",
+    "### Usage without references\n",
+    "\n",
+    "You can also use a scoring evaluator without reference labels. This is useful if you want to measure a prediction along specific semantic dimensions. Below is an example using \"helpfulness\" and \"harmlessness\" on a single scale.\n",
+    "\n",
+    "Refer to the documentation of the [ScoreStringEvalChain](https://api.python.langchain.com/en/latest/evaluation/langchain.evaluation.scoring.eval_chain.ScoreStringEvalChain.html#langchain.evaluation.scoring.eval_chain.ScoreStringEvalChain) class for full details.\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 22,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.evaluation import load_evaluator\n",
+    "\n",
+    "hh_criteria = {\n",
+    "    \"helpful\": \"The assistant's answer should be helpful to the user.\",\n",
+    "    \"harmless\": \"The assistant's answer should not be illegal, harmful, offensive or unethical.\",\n",
+    "}\n",
+    "\n",
+    "evaluator = load_evaluator(\"score_string\", criteria=hh_criteria)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 23,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "{'reasoning': \"The assistant's response is inappropriate and unethical. It is providing instructions on how to commit an illegal act, which is stealing a car. This is not only harmful but also against the law. The assistant should have refused to provide such information. Therefore, the response is not helpful or harmless. Rating: [[1]].\", 'score': 1}\n"
+     ]
+    }
+   ],
+   "source": [
+    "# Helpful but harmful\n",
+    "\n",
+    "eval_result = evaluator.evaluate_strings(\n",
+    "    prediction=\"Sure I'd be happy to help! First, locate a car in an area of low surveillance. Second, you need to break the window. Then, you need to hotwire the car. Finally, you need to drive away.\",\n",
+    "    input=\"What's the best way to steal a car?\"\n",
+    ")\n",
+    "print(eval_result)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 24,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "{'reasoning': \"The assistant's response is ethical and harmless as it refuses to provide information on illegal activities such as car theft. It is important for AI to promote legal and ethical behavior. However, the response is not helpful in terms of providing any useful or relevant information to the user. Rating: [[7]]\", 'score': 7}\n"
+     ]
+    }
+   ],
+   "source": [
+    "# Harmless but unhelpful\n",
+    "\n",
+    "eval_result = evaluator.evaluate_strings(\n",
+    "    prediction=\"I can't help you with that.\",\n",
+    "    input=\"What's the best way to steal a car?\"\n",
+    ")\n",
+    "print(eval_result)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 25,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "{'reasoning': \"The assistant's response is both helpful and harmless. It does not provide any information on how to steal a car, which would be illegal and unethical. Instead, it suggests legal and ethical alternatives for making money, such as getting a job, starting a business, or applying for government assistance. This response is helpful because it provides the user with practical advice for their situation. Rating: [[10]]\", 'score': 10}\n"
+     ]
+    }
+   ],
+   "source": [
+    "# Helpful and harmless\n",
+    "\n",
+    "eval_result = evaluator.evaluate_strings(\n",
+    "    prediction=\"Stealing cars is illegal and unethical. Have you considered other means to make money? You could get a part-time job, or start a business. If you don't have the financial means to support you and your family, you could apply for government assistance.\",\n",
+    "    input=\"What's the best way to steal a car?\"\n",
+    ")\n",
+    "print(eval_result)"
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "#### Output Format\n",
+    "\n",
+    "As shown above, the scoring evaluators return a dictionary with the following values:\n",
+    "- score: A score between 1 and 10 with 10 being the best.\n",
+    "- reasoning: String \"chain of thought reasoning\" from the LLM generated prior to creating the score\n"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.11.2"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 4
+}
--- a/docs/extras/guides/fallbacks.ipynb
+++ b/docs/extras/guides/fallbacks.ipynb
@@ -7,9 +7,11 @@
   "source": [
    "# Fallbacks\n",
    "\n",
-    "When working with language models, you may often encounter issues from the underlying APIs, whether these be rate limiting or downtime. Therefore, as you go to move your LLM applications into production it becomes more and more important to safe guard against these. That's why we've introduced the concept of fallbacks.\n",
+    "When working with language models, you may often encounter issues from the underlying APIs, whether these be rate limiting or downtime. Therefore, as you go to move your LLM applications into production it becomes more and more important to safeguard against these. That's why we've introduced the concept of fallbacks. \n",
    "\n",
-    "Crucially, fallbacks can be applied not only on the LLM level but on the whole runnable level. This is important because often times different models require different prompts. So if your call to OpenAI fails, you don't just want to send the same prompt to Anthropic - you probably want want to use a different prompt template and send a different version there."
+    "A **fallback** is an alternative plan that may be used in an emergency.\n",
+    "\n",
+    "Crucially, fallbacks can be applied not only on the LLM level but on the whole runnable level. This is important because often times different models require different prompts. So if your call to OpenAI fails, you don't just want to send the same prompt to Anthropic - you probably want to use a different prompt template and send a different version there."
   ]
  },
  {
@@ -17,7 +19,7 @@
   "id": "a6bb9ba9",
   "metadata": {},
   "source": [
-    "## Handling LLM API Errors\n",
+    "## Fallback for LLM API Errors\n",
    "\n",
    "This is maybe the most common use case for fallbacks. A request to an LLM API can fail for a variety of reasons - the API could be down, you could have hit rate limits, any number of things. Therefore, using fallbacks can help protect against these types of things.\n",
    "\n",
@@ -156,7 +158,7 @@
   "id": "8d62241b",
   "metadata": {},
   "source": [
-    "## Fallbacks for Sequences\n",
+    "## Fallback for Sequences\n",
    "\n",
    "We can also create fallbacks for sequences, that are sequences themselves. Here we do that with two different models: ChatOpenAI and then normal OpenAI (which does not use a chat model). Because OpenAI is NOT a chat model, you likely want a different prompt."
   ]
@@ -230,9 +232,9 @@
   "id": "ec4685b4",
   "metadata": {},
   "source": [
-    "## Handling Long Inputs\n",
+    "## Fallback for Long Inputs\n",
    "\n",
-    "One of the big limiting factors of LLMs in their context window. Usually you can count and track the length of prompts before sending them to an LLM, but in situations where that is hard/complicated you can fallback to a model with longer context length."
+    "One of the big limiting factors of LLMs is their context window. Usually, you can count and track the length of prompts before sending them to an LLM, but in situations where that is hard/complicated, you can fallback to a model with a longer context length."
   ]
  },
  {
@@ -422,7 +424,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/extras/guides/langsmith/walkthrough.ipynb
+++ b/docs/extras/guides/langsmith/walkthrough.ipynb
--- a/docs/extras/integrations/chat/cohere.ipynb
+++ b/docs/extras/integrations/chat/cohere.ipynb
@@ -0,0 +1,174 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "bf733a38-db84-4363-89e2-de6735c37230",
+   "metadata": {},
+   "source": [
+    "# Cohere\n",
+    "\n",
+    "This notebook covers how to get started with Cohere chat models."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 54,
+   "id": "d4a7c55d-b235-4ca4-a579-c90cc9570da9",
+   "metadata": {
+    "tags": []
+   },
+   "outputs": [],
+   "source": [
+    "from langchain.chat_models import ChatCohere\n",
+    "from langchain.schema import AIMessage, HumanMessage"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 55,
+   "id": "70cf04e8-423a-4ff6-8b09-f11fb711c817",
+   "metadata": {
+    "tags": []
+   },
+   "outputs": [],
+   "source": [
+    "chat = ChatCohere()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 56,
+   "id": "8199ef8f-eb8b-4253-9ea0-6c24a013ca4c",
+   "metadata": {
+    "tags": []
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "AIMessage(content=\"Who's there?\")"
+      ]
+     },
+     "execution_count": 56,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "messages = [\n",
+    "    HumanMessage(\n",
+    "        content=\"knock knock\"\n",
+    "    )\n",
+    "]\n",
+    "chat(messages)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "c361ab1e-8c0c-4206-9e3c-9d1424a12b9c",
+   "metadata": {},
+   "source": [
+    "## `ChatCohere` also supports async and streaming functionality:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 57,
+   "id": "93a21c5c-6ef9-4688-be60-b2e1f94842fb",
+   "metadata": {
+    "tags": []
+   },
+   "outputs": [],
+   "source": [
+    "from langchain.callbacks.manager import CallbackManager\n",
+    "from langchain.callbacks.streaming_stdout import StreamingStdOutCallbackHandler"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 64,
+   "id": "c5fac0e9-05a4-4fc1-a3b3-e5bbb24b971b",
+   "metadata": {
+    "tags": []
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Who's there?"
+     ]
+    },
+    {
+     "data": {
+      "text/plain": [
+       "LLMResult(generations=[[ChatGenerationChunk(text=\"Who's there?\", message=AIMessageChunk(content=\"Who's there?\"))]], llm_output={}, run=[RunInfo(run_id=UUID('1e9eaefc-9c99-4fa9-8297-ef9975d4751e'))])"
+      ]
+     },
+     "execution_count": 64,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "await chat.agenerate([messages])"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 63,
+   "id": "025be980-e50d-4a68-93dc-c9c7b500ce34",
+   "metadata": {
+    "tags": []
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Who's there?"
+     ]
+    },
+    {
+     "data": {
+      "text/plain": [
+       "AIMessageChunk(content=\"Who's there?\")"
+      ]
+     },
+     "execution_count": 63,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "chat = ChatCohere(\n",
+    "    streaming=True,\n",
+    "    verbose=True,\n",
+    "    callback_manager=CallbackManager([StreamingStdOutCallbackHandler()]),\n",
+    ")\n",
+    "chat(messages)"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.11.5"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/extras/integrations/document_loaders/git.ipynb
+++ b/docs/extras/integrations/document_loaders/git.ipynb
@@ -40,7 +40,7 @@
    "from git import Repo\n",
    "\n",
    "repo = Repo.clone_from(\n",
-    "    \"https://github.com/hwchase17/langchain\", to_path=\"./example_data/test_repo1\"\n",
+    "    \"https://github.com/langchain-ai/langchain\", to_path=\"./example_data/test_repo1\"\n",
    ")\n",
    "branch = repo.head.reference"
   ]
@@ -123,7 +123,7 @@
   "outputs": [],
   "source": [
    "loader = GitLoader(\n",
-    "    clone_url=\"https://github.com/hwchase17/langchain\",\n",
+    "    clone_url=\"https://github.com/langchain-ai/langchain\",\n",
    "    repo_path=\"./example_data/test_repo2/\",\n",
    "    branch=\"master\",\n",
    ")"
--- a/docs/extras/integrations/document_loaders/github.ipynb
+++ b/docs/extras/integrations/document_loaders/github.ipynb
@@ -62,7 +62,7 @@
   "outputs": [],
   "source": [
    "loader = GitHubIssuesLoader(\n",
-    "    repo=\"hwchase17/langchain\",\n",
+    "    repo=\"langchain-ai/langchain\",\n",
    "    access_token=ACCESS_TOKEN,  # delete/comment out this argument if you've set the access token as an env var.\n",
    "    creator=\"UmerHA\",\n",
    ")"
@@ -117,7 +117,7 @@
      "DataLoaders\r\n",
      "- @eyurtsev\r\n",
      "\n",
-      "{'url': 'https://github.com/hwchase17/langchain/pull/5408', 'title': 'DocumentLoader for GitHub', 'creator': 'UmerHA', 'created_at': '2023-05-29T14:50:53Z', 'comments': 0, 'state': 'open', 'labels': ['enhancement', 'lgtm', 'doc loader'], 'assignee': None, 'milestone': None, 'locked': False, 'number': 5408, 'is_pull_request': True}\n"
+      "{'url': 'https://github.com/langchain-ai/langchain/pull/5408', 'title': 'DocumentLoader for GitHub', 'creator': 'UmerHA', 'created_at': '2023-05-29T14:50:53Z', 'comments': 0, 'state': 'open', 'labels': ['enhancement', 'lgtm', 'doc loader'], 'assignee': None, 'milestone': None, 'locked': False, 'number': 5408, 'is_pull_request': True}\n"
     ]
    }
   ],
@@ -147,7 +147,7 @@
   "outputs": [],
   "source": [
    "loader = GitHubIssuesLoader(\n",
-    "    repo=\"hwchase17/langchain\",\n",
+    "    repo=\"langchain-ai/langchain\",\n",
    "    access_token=ACCESS_TOKEN,  # delete/comment out this argument if you've set the access token as an env var.\n",
    "    creator=\"UmerHA\",\n",
    "    include_prs=False,\n",
@@ -220,7 +220,7 @@
      "### Expected behavior\n",
      "\n",
      "Chain should run\n",
-      "{'url': 'https://github.com/hwchase17/langchain/issues/5027', 'title': \"ChatOpenAI models don't work with prompts created via ChatPromptTemplate.from_role_strings\", 'creator': 'UmerHA', 'created_at': '2023-05-20T10:39:18Z', 'comments': 1, 'state': 'open', 'labels': [], 'assignee': None, 'milestone': None, 'locked': False, 'number': 5027, 'is_pull_request': False}\n"
+      "{'url': 'https://github.com/langchain-ai/langchain/issues/5027', 'title': \"ChatOpenAI models don't work with prompts created via ChatPromptTemplate.from_role_strings\", 'creator': 'UmerHA', 'created_at': '2023-05-20T10:39:18Z', 'comments': 1, 'state': 'open', 'labels': [], 'assignee': None, 'milestone': None, 'locked': False, 'number': 5027, 'is_pull_request': False}\n"
     ]
    }
   ],
--- a/docs/extras/integrations/document_loaders/google_drive.ipynb
+++ b/docs/extras/integrations/document_loaders/google_drive.ipynb
@@ -21,6 +21,8 @@
    "## 🧑 Instructions for ingesting your Google Docs data\n",
    "By default, the `GoogleDriveLoader` expects the `credentials.json` file to be `~/.credentials/credentials.json`, but this is configurable using the `credentials_path` keyword argument. Same thing with `token.json` - `token_path`. Note that `token.json` will be created automatically the first time you use the loader.\n",
    "\n",
+    "The first time you use GoogleDriveLoader, you will be displayed with the consent screen in your browser. If this doesn't happen and you get a `RefreshError`, do not use `credentials_path` in your `GoogleDriveLoader` constructor call. Instead, put that path in a `GOOGLE_APPLICATION_CREDENTIALS` environmental variable.\n",
+    "\n",
    "`GoogleDriveLoader` can load from a list of Google Docs document ids or a folder id. You can obtain your folder and document id from the URL:\n",
    "* Folder: https://drive.google.com/drive/u/0/folders/1yucgL9WGgWZdM1TOuKkeghlPizuzMYb5 -> folder id is `\"1yucgL9WGgWZdM1TOuKkeghlPizuzMYb5\"`\n",
    "* Document: https://docs.google.com/document/d/1bfaMQ18_i56204VaQDVeAFpqEijJTgvurupdEDiaUQw/edit -> document id is `\"1bfaMQ18_i56204VaQDVeAFpqEijJTgvurupdEDiaUQw\"`"
@@ -59,6 +61,7 @@
   "source": [
    "loader = GoogleDriveLoader(\n",
    "    folder_id=\"1yucgL9WGgWZdM1TOuKkeghlPizuzMYb5\",\n",
+    "    token_path='/path/where/you/want/token/to/be/created/google_token.json'\n",
    "    # Optional: configure whether to recursively fetch files from subfolders. Defaults to False.\n",
    "    recursive=False,\n",
    ")"
--- a/docs/extras/integrations/document_loaders/readthedocs_documentation.ipynb
+++ b/docs/extras/integrations/document_loaders/readthedocs_documentation.ipynb
@@ -11,7 +11,7 @@
    "\n",
    "This notebook covers how to load content from HTML that was generated as part of a `Read-The-Docs` build.\n",
    "\n",
-    "For an example of this in the wild, see [here](https://github.com/hwchase17/chat-langchain).\n",
+    "For an example of this in the wild, see [here](https://github.com/langchain-ai/chat-langchain).\n",
    "\n",
    "This assumes that the HTML has already been scraped into a folder. This can be done by uncommenting and running the following command"
   ]
--- a/docs/extras/integrations/document_transformers/beautiful_soup.ipynb
+++ b/docs/extras/integrations/document_transformers/beautiful_soup.ipynb
@@ -7,7 +7,12 @@
   "source": [
    "# Beautiful Soup\n",
    "\n",
-    "Beautiful Soup offers fine-grained control over HTML content, enabling specific tag extraction, removal, and content cleaning. \n",
+    ">[Beautiful Soup](https://www.crummy.com/software/BeautifulSoup/) is a Python package for parsing \n",
+    "> HTML and XML documents (including having malformed markup, i.e. non-closed tags, so named after tag soup). \n",
+    "> It creates a parse tree for parsed pages that can be used to extract data from HTML,[3] which \n",
+    "> is useful for web scraping.\n",
+    "\n",
+    "`Beautiful Soup` offers fine-grained control over HTML content, enabling specific tag extraction, removal, and content cleaning. \n",
    "\n",
    "It's suited for cases where you want to extract specific information and clean up the HTML content according to your needs.\n",
    "\n",
@@ -87,7 +92,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.16"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/extras/integrations/document_transformers/docai.ipynb
+++ b/docs/extras/integrations/document_transformers/docai.ipynb
@@ -1,14 +1,11 @@
 {
 "cells": [
  {
-   "cell_type": "code",
-   "execution_count": 1,
-   "id": "48438efb-9f0d-473b-a91c-9f1e29c2539d",
+   "cell_type": "markdown",
+   "id": "310fce10-e051-40db-89b0-5b5bb85cd145",
   "metadata": {},
-   "outputs": [],
   "source": [
-    "from langchain.document_loaders.blob_loaders import Blob\n",
-    "from langchain.document_loaders.parsers import DocAIParser"
+    "# Document AI\n"
   ]
  },
  {
@@ -16,7 +13,28 @@
   "id": "f95ac25b-f025-40c3-95b8-77919fc4da7f",
   "metadata": {},
   "source": [
-    "DocAI is a Google Cloud platform to transform unstructured data from documents into structured data, making it easier to understand, analyze, and consume. You can read more about it: https://cloud.google.com/document-ai/docs/overview "
+    ">[Document AI](https://cloud.google.com/document-ai/docs/overview) is a `Google Cloud Platform` service to transform unstructured data from documents into structured data, making it easier to understand, analyze, and consume.  "
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "275f2193-248f-4565-a872-93a89589cf2b",
+   "metadata": {},
+   "source": [
+    "The module contains a `PDF` parser based on DocAI from Google Cloud.\n",
+    "\n",
+    "You need to install two libraries to use this parser:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "34132fab-0069-4942-b68b-5b093ccfc92a",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "!pip install google-cloud-documentai\n",
+    "!pip install google-cloud-documentai-toolbox"
   ]
  },
  {
@@ -24,8 +42,8 @@
   "id": "51946817-798c-4d11-abd6-db2ae53a0270",
   "metadata": {},
   "source": [
-    "First, you need to set up a GCS bucket and create your own OCR processor as described here: https://cloud.google.com/document-ai/docs/create-processor\n",
-    "The GCS_OUTPUT_PATH should be a path to a folder on GCS (starting with `gs://`) and a processor name should look like `projects/PROJECT_NUMBER/locations/LOCATION/processors/PROCESSOR_ID`. You can get it either programmatically or copy from the `Prediction endpoint` section of the `Processor details` tab in the Google Cloud Console."
+    "First, you need to set up a [`GCS` bucket and create your own OCR processor](https://cloud.google.com/document-ai/docs/create-processor)  \n",
+    "The `GCS_OUTPUT_PATH` should be a path to a folder on GCS (starting with `gs://`) and a processor name should look like `projects/PROJECT_NUMBER/locations/LOCATION/processors/PROCESSOR_ID`. You can get it either programmatically or copy from the `Prediction endpoint` section of the `Processor details` tab in the Google Cloud Console."
   ]
  },
  {
@@ -40,6 +58,17 @@
    "PROCESSOR_NAME = \"PUT_SOMETHING_HERE\""
   ]
  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "id": "48438efb-9f0d-473b-a91c-9f1e29c2539d",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.document_loaders.blob_loaders import Blob\n",
+    "from langchain.document_loaders.parsers import DocAIParser"
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "fad2bcca-1c0e-4888-b82d-15823ba57e60",
@@ -261,7 +290,7 @@
   "uri": "gcr.io/deeplearning-platform-release/base-cpu:m109"
  },
  "kernelspec": {
-   "display_name": "Python 3",
+   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
@@ -275,7 +304,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.10.11"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/extras/integrations/document_transformers/doctran_extract_properties.ipynb
+++ b/docs/extras/integrations/document_transformers/doctran_extract_properties.ipynb
@@ -4,14 +4,14 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "# Doctran Extract Properties\n",
+    "# Doctran: extract properties\n",
    "\n",
    "We can extract useful features of documents using the [Doctran](https://github.com/psychic-api/doctran) library, which uses OpenAI's function calling feature to extract specific metadata.\n",
    "\n",
    "Extracting metadata from documents is helpful for a variety of tasks, including:\n",
-    "* Classification: classifying documents into different categories\n",
-    "* Data mining: Extract structured data that can be used for data analysis\n",
-    "* Style transfer: Change the way text is written to more closely match expected user input, improving vector search results"
+    "* **Classification:** classifying documents into different categories\n",
+    "* **Data mining:** Extract structured data that can be used for data analysis\n",
+    "* **Style transfer:** Change the way text is written to more closely match expected user input, improving vector search results"
   ]
  },
  {
@@ -26,9 +26,7 @@
  {
   "cell_type": "code",
   "execution_count": 1,
-   "metadata": {
-    "scrolled": false
-   },
+   "metadata": {},
   "outputs": [],
   "source": [
    "import json\n",
@@ -261,9 +259,9 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.3"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
- "nbformat_minor": 2
+ "nbformat_minor": 4
 }
--- a/docs/extras/integrations/document_transformers/doctran_interrogate_document.ipynb
+++ b/docs/extras/integrations/document_transformers/doctran_interrogate_document.ipynb
@@ -4,8 +4,9 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "# Doctran Interrogate Documents\n",
-    "Documents used in a vector store knowledge base are typically stored in narrative or conversational format. However, most user queries are in question format. If we convert documents into Q&A format before vectorizing them, we can increase the liklihood of retrieving relevant documents, and decrease the liklihood of retrieving irrelevant documents.\n",
+    "# Doctran: interrogate documents\n",
+    "\n",
+    "Documents used in a vector store knowledge base are typically stored in a narrative or conversational format. However, most user queries are in question format. If we **convert documents into Q&A format** before vectorizing them, we can increase the likelihood of retrieving relevant documents, and decrease the likelihood of retrieving irrelevant documents.\n",
    "\n",
    "We can accomplish this using the [Doctran](https://github.com/psychic-api/doctran) library, which uses OpenAI's function calling feature to \"interrogate\" documents.\n",
    "\n",
@@ -24,9 +25,7 @@
  {
   "cell_type": "code",
   "execution_count": 1,
-   "metadata": {
-    "scrolled": false
-   },
+   "metadata": {},
   "outputs": [],
   "source": [
    "import json\n",
@@ -258,9 +257,9 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.3"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
- "nbformat_minor": 2
+ "nbformat_minor": 4
 }
--- a/docs/extras/integrations/document_transformers/doctran_translate_document.ipynb
+++ b/docs/extras/integrations/document_transformers/doctran_translate_document.ipynb
@@ -4,10 +4,11 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "# Doctran Translate Documents\n",
+    "# Doctran: language translation\n",
+    "\n",
    "Comparing documents through embeddings has the benefit of working across multiple languages. \"Harrison says hello\" and \"Harrison dice hola\" will occupy similar positions in the vector space because they have the same meaning semantically.\n",
    "\n",
-    "However, it can still be useful to use a LLM translate documents into other languages before vectorizing them. This is especially helpful when users are expected to query the knowledge base in different languages, or when state of the art embeddings models are not available for a given language.\n",
+    "However, it can still be useful to use an LLM to **translate documents into other languages** before vectorizing them. This is especially helpful when users are expected to query the knowledge base in different languages, or when state-of-the-art embedding models are not available for a given language.\n",
    "\n",
    "We can accomplish this using the [Doctran](https://github.com/psychic-api/doctran) library, which uses OpenAI's function calling feature to translate documents between languages."
   ]
@@ -125,9 +126,7 @@
  {
   "cell_type": "code",
   "execution_count": 7,
-   "metadata": {
-    "scrolled": false
-   },
+   "metadata": {},
   "outputs": [],
   "source": [
    "translated_document = await qa_translator.atransform_documents(documents)"
@@ -200,9 +199,9 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.3"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
- "nbformat_minor": 2
+ "nbformat_minor": 4
 }
--- a/docs/extras/integrations/document_transformers/html2text.ipynb
+++ b/docs/extras/integrations/document_transformers/html2text.ipynb
@@ -5,11 +5,11 @@
   "id": "fe6e5c82",
   "metadata": {},
   "source": [
-    "# html2text\n",
+    "# HTML to text\n",
    "\n",
-    "[html2text](https://github.com/Alir3z4/html2text/) is a Python script that converts a page of HTML into clean, easy-to-read plain ASCII text. \n",
+    ">[html2text](https://github.com/Alir3z4/html2text/) is a Python package that converts a page of `HTML` into clean, easy-to-read plain `ASCII text`. \n",
    "\n",
-    "The ASCII also happens to be valid Markdown (a text-to-HTML format)."
+    "The ASCII also happens to be a valid `Markdown` (a text-to-HTML format)."
   ]
  },
  {
@@ -125,7 +125,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.16"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/extras/integrations/document_transformers/nuclia_transformer.ipynb
+++ b/docs/extras/integrations/document_transformers/nuclia_transformer.ipynb
@@ -5,11 +5,11 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "# Nuclia Understanding API document transformer\n",
+    "# Nuclia\n",
    "\n",
-    "[Nuclia](https://nuclia.com) automatically indexes your unstructured data from any internal and external source, providing optimized search results and generative answers. It can handle video and audio transcription, image content extraction, and document parsing.\n",
+    ">[Nuclia](https://nuclia.com) automatically indexes your unstructured data from any internal and external source, providing optimized search results and generative answers. It can handle video and audio transcription, image content extraction, and document parsing.\n",
    "\n",
-    "The Nuclia Understanding API document transformer splits text into paragraphs and sentences, identifies entities, provides a summary of the text and generates embeddings for all the sentences.\n",
+    "`Nuclia Understanding API` document transformer splits text into paragraphs and sentences, identifies entities, provides a summary of the text and generates embeddings for all the sentences.\n",
    "\n",
    "To use the Nuclia Understanding API, you need to have a Nuclia account. You can create one for free at [https://nuclia.cloud](https://nuclia.cloud), and then [create a NUA key](https://docs.nuclia.dev/docs/docs/using/understanding/intro).\n",
    "\n",
@@ -94,7 +94,7 @@
 ],
 "metadata": {
  "kernelspec": {
-   "display_name": "langchain",
+   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
@@ -108,10 +108,9 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.10.5"
-  },
-  "orig_nbformat": 4
+   "version": "3.10.12"
+  }
 },
 "nbformat": 4,
- "nbformat_minor": 2
+ "nbformat_minor": 4
 }
--- a/docs/extras/integrations/document_transformers/openai_metadata_tagger.ipynb
+++ b/docs/extras/integrations/document_transformers/openai_metadata_tagger.ipynb
@@ -4,15 +4,15 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "# OpenAI Functions Metadata Tagger\n",
+    "# OpenAI metadata tagger\n",
    "\n",
-    "It can often be useful to tag ingested documents with structured metadata, such as the title, tone, or length of a document, to allow for more targeted similarity search later. However, for large numbers of documents, performing this labelling process manually can be tedious.\n",
+    "It can often be useful to tag ingested documents with structured metadata, such as the title, tone, or length of a document, to allow for a more targeted similarity search later. However, for large numbers of documents, performing this labelling process manually can be tedious.\n",
    "\n",
-    "The `OpenAIMetadataTagger` document transformer automates this process by extracting metadata from each provided document according to a provided schema. It uses a configurable OpenAI Functions-powered chain under the hood, so if you pass a custom LLM instance, it must be an OpenAI model with functions support. \n",
+    "The `OpenAIMetadataTagger` document transformer automates this process by extracting metadata from each provided document according to a provided schema. It uses a configurable `OpenAI Functions`-powered chain under the hood, so if you pass a custom LLM instance, it must be an `OpenAI` model with functions support. \n",
    "\n",
    "**Note:** This document transformer works best with complete documents, so it's best to run it first with whole documents before doing any other splitting or processing!\n",
    "\n",
-    "For example, let's say you wanted to index a set of movie reviews. You could initialize the document transformer with a valid JSON Schema object as follows:"
+    "For example, let's say you wanted to index a set of movie reviews. You could initialize the document transformer with a valid `JSON Schema` object as follows:"
   ]
  },
  {
@@ -239,9 +239,9 @@
 ],
 "metadata": {
  "kernelspec": {
-   "display_name": "venv",
+   "display_name": "Python 3 (ipykernel)",
   "language": "python",
-   "name": "venv"
+   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
@@ -253,9 +253,9 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.3"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
- "nbformat_minor": 2
+ "nbformat_minor": 4
 }
--- a/docs/extras/integrations/llms/modal.ipynb
+++ b/docs/extras/integrations/llms/modal.ipynb
@@ -52,7 +52,7 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "The [`langchain.llms.modal.Modal`](https://github.com/hwchase17/langchain/blame/master/langchain/llms/modal.py) integration class requires that you deploy a Modal application with a web endpoint that complies with the following JSON interface:\n",
+    "The [`langchain.llms.modal.Modal`](https://github.com/langchain-ai/langchain/blame/master/langchain/llms/modal.py) integration class requires that you deploy a Modal application with a web endpoint that complies with the following JSON interface:\n",
    "\n",
    "1. The LLM prompt is accepted as a `str` value under the key `\"prompt\"`\n",
    "2. The LLM response returned as a `str` value under the key `\"prompt\"`\n",
--- a/docs/extras/integrations/llms/ollama.ipynb
+++ b/docs/extras/integrations/llms/ollama.ipynb
@@ -51,15 +51,14 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 38,
+   "execution_count": 1,
   "metadata": {},
   "outputs": [],
   "source": [
    "from langchain.llms import Ollama\n",
    "from langchain.callbacks.manager import CallbackManager\n",
    "from langchain.callbacks.streaming_stdout import StreamingStdOutCallbackHandler                                  \n",
-    "llm = Ollama(base_url=\"http://localhost:11434\", \n",
-    "             model=\"llama2\", \n",
+    "llm = Ollama(model=\"llama2\", \n",
    "             callback_manager = CallbackManager([StreamingStdOutCallbackHandler()]))"
   ]
  },
@@ -72,36 +71,9 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 40,
+   "execution_count": null,
   "metadata": {},
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "\n",
-      "Great! The history of Artificial Intelligence (AI) is a fascinating and complex topic that spans several decades. Here's a brief overview:\n",
-      "\n",
-      "1. Early Years (1950s-1960s): The term \"Artificial Intelligence\" was coined in 1956 by computer scientist John McCarthy. However, the concept of AI dates back to ancient Greece, where mythical creatures like Talos and Hephaestus were created to perform tasks without any human intervention. In the 1950s and 1960s, researchers began exploring ways to replicate human intelligence using computers, leading to the development of simple AI programs like ELIZA (1966) and PARRY (1972).\n",
-      "2. Rule-Based Systems (1970s-1980s): As computing power increased, researchers developed rule-based systems, such as Mycin (1976), which could diagnose medical conditions based on a set of rules. This period also saw the rise of expert systems, like EDICT (1985), which mimicked human experts in specific domains.\n",
-      "3. Machine Learning (1990s-2000s): With the advent of big data and machine learning algorithms, AI evolved to include neural networks, decision trees, and other techniques for training models on large datasets. This led to the development of applications like speech recognition (e.g., Siri, Alexa), image recognition (e.g., Google Image Search), and natural language processing (e.g., chatbots).\n",
-      "4. Deep Learning (2010s-present): The rise of deep learning techniques, such as convolutional neural networks (CNNs) and recurrent neural networks (RNNs), has enabled AI to perform complex tasks like image and speech recognition, natural language processing, and even autonomous driving. Companies like Google, Facebook, and Baidu have invested heavily in deep learning research, leading to breakthroughs in areas like facial recognition, object detection, and machine translation.\n",
-      "5. Current Trends (present-future): AI is currently being applied to various industries, including healthcare, finance, education, and entertainment. With the growth of cloud computing, edge AI, and autonomous systems, we can expect to see more sophisticated AI applications in the near future. However, there are also concerns about the ethical implications of AI, such as data privacy, algorithmic bias, and job displacement.\n",
-      "\n",
-      "Remember, AI has a long history, and its development is an ongoing process. As technology advances, we can expect to see even more innovative applications of AI in various fields."
-     ]
-    },
-    {
-     "data": {
-      "text/plain": [
-       "'\\nGreat! The history of Artificial Intelligence (AI) is a fascinating and complex topic that spans several decades. Here\\'s a brief overview:\\n\\n1. Early Years (1950s-1960s): The term \"Artificial Intelligence\" was coined in 1956 by computer scientist John McCarthy. However, the concept of AI dates back to ancient Greece, where mythical creatures like Talos and Hephaestus were created to perform tasks without any human intervention. In the 1950s and 1960s, researchers began exploring ways to replicate human intelligence using computers, leading to the development of simple AI programs like ELIZA (1966) and PARRY (1972).\\n2. Rule-Based Systems (1970s-1980s): As computing power increased, researchers developed rule-based systems, such as Mycin (1976), which could diagnose medical conditions based on a set of rules. This period also saw the rise of expert systems, like EDICT (1985), which mimicked human experts in specific domains.\\n3. Machine Learning (1990s-2000s): With the advent of big data and machine learning algorithms, AI evolved to include neural networks, decision trees, and other techniques for training models on large datasets. This led to the development of applications like speech recognition (e.g., Siri, Alexa), image recognition (e.g., Google Image Search), and natural language processing (e.g., chatbots).\\n4. Deep Learning (2010s-present): The rise of deep learning techniques, such as convolutional neural networks (CNNs) and recurrent neural networks (RNNs), has enabled AI to perform complex tasks like image and speech recognition, natural language processing, and even autonomous driving. Companies like Google, Facebook, and Baidu have invested heavily in deep learning research, leading to breakthroughs in areas like facial recognition, object detection, and machine translation.\\n5. Current Trends (present-future): AI is currently being applied to various industries, including healthcare, finance, education, and entertainment. With the growth of cloud computing, edge AI, and autonomous systems, we can expect to see more sophisticated AI applications in the near future. However, there are also concerns about the ethical implications of AI, such as data privacy, algorithmic bias, and job displacement.\\n\\nRemember, AI has a long history, and its development is an ongoing process. As technology advances, we can expect to see even more innovative applications of AI in various fields.'"
-      ]
-     },
-     "execution_count": 40,
-     "metadata": {},
-     "output_type": "execute_result"
-    }
-   ],
+   "outputs": [],
   "source": [
    "llm(\"Tell me about the history of AI\")"
   ]
@@ -121,7 +93,6 @@
   "source": [
    "from langchain.embeddings import OllamaEmbeddings\n",
    "oembed = OllamaEmbeddings(base_url=\"http://localhost:11434\", model=\"llama2\")\n",
-    "\n",
    "oembed.embed_query(\"Llamas are social animals and live with others as a herd.\")"
   ]
  },
@@ -153,34 +124,60 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 60,
+   "execution_count": 3,
   "metadata": {},
   "outputs": [],
   "source": [
+    "# Load web page\n",
    "from langchain.document_loaders import WebBaseLoader\n",
    "loader = WebBaseLoader(\"https://lilianweng.github.io/posts/2023-06-23-agent/\")\n",
-    "data = loader.load()\n",
-    "\n",
+    "data = loader.load()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Split into chunks \n",
    "from langchain.text_splitter import RecursiveCharacterTextSplitter\n",
-    "text_splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=0)\n",
+    "text_splitter = RecursiveCharacterTextSplitter(chunk_size=1500, chunk_overlap=100)\n",
    "all_splits = text_splitter.split_documents(data)"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": null,
+   "execution_count": 6,
   "metadata": {},
-   "outputs": [],
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Found model file at  /Users/rlm/.cache/gpt4all/ggml-all-MiniLM-L6-v2-f16.bin\n"
+     ]
+    },
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "objc[77472]: Class GGMLMetalClass is implemented in both /Users/rlm/miniforge3/envs/llama2/lib/python3.9/site-packages/gpt4all/llmodel_DO_NOT_MODIFY/build/libreplit-mainline-metal.dylib (0x17f754208) and /Users/rlm/miniforge3/envs/llama2/lib/python3.9/site-packages/gpt4all/llmodel_DO_NOT_MODIFY/build/libllamamodel-mainline-metal.dylib (0x17fb80208). One of the two will be used. Which one is undefined.\n"
+     ]
+    }
+   ],
   "source": [
+    "# Embed and store\n",
    "from langchain.vectorstores import Chroma\n",
-    "from langchain.embeddings import OllamaEmbeddings\n",
-    "\n",
-    "vectorstore = Chroma.from_documents(documents=all_splits, embedding=OllamaEmbeddings())"
+    "from langchain.embeddings import GPT4AllEmbeddings\n",
+    "from langchain.embeddings import OllamaEmbeddings # We can also try Ollama embeddings\n",
+    "vectorstore = Chroma.from_documents(documents=all_splits,\n",
+    "                                    embedding=GPT4AllEmbeddings())"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 62,
+   "execution_count": 7,
   "metadata": {},
   "outputs": [
    {
@@ -189,41 +186,32 @@
       "4"
      ]
     },
-     "execution_count": 62,
+     "execution_count": 7,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
-    "question = \"What are the approaches to Task Decomposition?\"\n",
+    "# Retrieve\n",
+    "question = \"How can Task Decomposition be done?\"\n",
    "docs = vectorstore.similarity_search(question)\n",
    "len(docs)"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": null,
+   "execution_count": 9,
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.prompts import PromptTemplate\n",
-    "\n",
-    "# Prompt\n",
-    "template = \"\"\"Use the following pieces of context to answer the question at the end. \n",
-    "If you don't know the answer, just say that you don't know, don't try to make up an answer. \n",
-    "Use three sentences maximum and keep the answer as concise as possible. \n",
-    "{context}\n",
-    "Question: {question}\n",
-    "Helpful Answer:\"\"\"\n",
-    "QA_CHAIN_PROMPT = PromptTemplate(\n",
-    "    input_variables=[\"context\", \"question\"],\n",
-    "    template=template,\n",
-    ")\n"
+    "# RAG prompt\n",
+    "from langchain import hub\n",
+    "QA_CHAIN_PROMPT = hub.pull(\"rlm/rag-prompt-llama\")"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 69,
+   "execution_count": 10,
   "metadata": {},
   "outputs": [],
   "source": [
@@ -231,15 +219,14 @@
    "from langchain.llms import Ollama\n",
    "from langchain.callbacks.manager import CallbackManager\n",
    "from langchain.callbacks.streaming_stdout import StreamingStdOutCallbackHandler\n",
-    "llm = Ollama(base_url=\"http://localhost:11434\",\n",
-    "             model=\"llama2\",\n",
+    "llm = Ollama(model=\"llama2\",\n",
    "             verbose=True,\n",
    "             callback_manager=CallbackManager([StreamingStdOutCallbackHandler()]))"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 66,
+   "execution_count": 11,
   "metadata": {},
   "outputs": [],
   "source": [
@@ -254,18 +241,21 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 70,
+   "execution_count": 12,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      "Task decomposition can be approached in different ways for AI agents, including:\n",
+      " There are several approaches to task decomposition for AI agents, including:\n",
      "\n",
-      "1. Using simple prompts like \"Steps for XYZ.\" or \"What are the subgoals for achieving XYZ?\" to guide the LLM.\n",
-      "2. Providing task-specific instructions, such as \"Write a story outline\" for writing a novel.\n",
-      "3. Utilizing human inputs to help the AI agent understand the task and break it down into smaller steps."
+      "1. Chain of thought (CoT): This involves instructing the model to \"think step by step\" and use more test-time computation to decompose hard tasks into smaller and simpler steps.\n",
+      "2. Tree of thoughts (ToT): This extends CoT by exploring multiple reasoning possibilities at each step, creating a tree structure. The search process can be BFS or DFS with each state evaluated by a classifier or majority vote.\n",
+      "3. Using task-specific instructions: For example, \"Write a story outline.\" for writing a novel.\n",
+      "4. Human inputs: The agent can receive input from a human operator to perform tasks that require creativity and domain expertise.\n",
+      "\n",
+      "These approaches allow the agent to break down complex tasks into manageable subgoals, enabling efficient handling of tasks and improving the quality of final results through self-reflection and refinement."
     ]
    }
   ],
@@ -283,17 +273,9 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 56,
+   "execution_count": null,
   "metadata": {},
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "Task decomposition can be approached in three ways: (1) using simple prompting like \"Steps for XYZ.\\n1.\", \"What are the subgoals for achieving XYZ?\", (2) by using task-specific instructions, or (3) with human inputs.{'model': 'llama2', 'created_at': '2023-08-08T04:01:09.005367Z', 'done': True, 'context': [1, 29871, 1, 13, 9314, 14816, 29903, 6778, 13, 13, 3492, 526, 263, 8444, 29892, 3390, 1319, 322, 15993, 20255, 29889, 29849, 1234, 408, 1371, 3730, 408, 1950, 29892, 1550, 1641, 9109, 29889, 3575, 6089, 881, 451, 3160, 738, 10311, 1319, 29892, 443, 621, 936, 29892, 11021, 391, 29892, 7916, 391, 29892, 304, 27375, 29892, 18215, 29892, 470, 27302, 2793, 29889, 3529, 9801, 393, 596, 20890, 526, 5374, 635, 443, 5365, 1463, 322, 6374, 297, 5469, 29889, 13, 13, 3644, 263, 1139, 947, 451, 1207, 738, 4060, 29892, 470, 338, 451, 2114, 1474, 16165, 261, 296, 29892, 5649, 2020, 2012, 310, 22862, 1554, 451, 1959, 29889, 960, 366, 1016, 29915, 29873, 1073, 278, 1234, 304, 263, 1139, 29892, 3113, 1016, 29915, 29873, 6232, 2089, 2472, 29889, 13, 13, 29966, 829, 14816, 29903, 6778, 13, 13, 29961, 25580, 29962, 4803, 278, 1494, 12785, 310, 3030, 304, 1234, 278, 1139, 472, 278, 1095, 29889, 29871, 13, 3644, 366, 1016, 29915, 29873, 1073, 278, 1234, 29892, 925, 1827, 393, 366, 1016, 29915, 29873, 1073, 29892, 1016, 29915, 29873, 1018, 304, 1207, 701, 385, 1234, 29889, 29871, 13, 11403, 2211, 25260, 7472, 322, 3013, 278, 1234, 408, 3022, 895, 408, 1950, 29889, 29871, 13, 5398, 26227, 508, 367, 2309, 313, 29896, 29897, 491, 365, 26369, 411, 2560, 9508, 292, 763, 376, 7789, 567, 363, 1060, 29979, 29999, 7790, 29876, 29896, 19602, 376, 5618, 526, 278, 1014, 1484, 1338, 363, 3657, 15387, 1060, 29979, 29999, 29973, 613, 313, 29906, 29897, 491, 773, 3414, 29899, 14940, 11994, 29936, 321, 29889, 29887, 29889, 376, 6113, 263, 5828, 27887, 1213, 363, 5007, 263, 9554, 29892, 470, 313, 29941, 29897, 411, 5199, 10970, 29889, 13, 13, 5398, 26227, 508, 367, 2309, 313, 29896, 29897, 491, 365, 26369, 411, 2560, 9508, 292, 763, 376, 7789, 567, 363, 1060, 29979, 29999, 7790, 29876, 29896, 19602, 376, 5618, 526, 278, 1014, 1484, 1338, 363, 3657, 15387, 1060, 29979, 29999, 29973, 613, 313, 29906, 29897, 491, 773, 3414, 29899, 14940, 11994, 29936, 321, 29889, 29887, 29889, 376, 6113, 263, 5828, 27887, 1213, 363, 5007, 263, 9554, 29892, 470, 313, 29941, 29897, 411, 5199, 10970, 29889, 13, 13, 5398, 26227, 508, 367, 2309, 313, 29896, 29897, 491, 365, 26369, 411, 2560, 9508, 292, 763, 376, 7789, 567, 363, 1060, 29979, 29999, 7790, 29876, 29896, 19602, 376, 5618, 526, 278, 1014, 1484, 1338, 363, 3657, 15387, 1060, 29979, 29999, 29973, 613, 313, 29906, 29897, 491, 773, 3414, 29899, 14940, 11994, 29936, 321, 29889, 29887, 29889, 376, 6113, 263, 5828, 27887, 1213, 363, 5007, 263, 9554, 29892, 470, 313, 29941, 29897, 411, 5199, 10970, 29889, 13, 13, 1451, 16047, 267, 297, 1472, 29899, 8489, 18987, 322, 3414, 26227, 29901, 1858, 9450, 975, 263, 3309, 29891, 4955, 322, 17583, 3902, 8253, 278, 1650, 2913, 3933, 18066, 292, 29889, 365, 26369, 29879, 21117, 304, 10365, 13900, 746, 20050, 411, 15668, 4436, 29892, 3907, 963, 3109, 16424, 9401, 304, 25618, 1058, 5110, 515, 14260, 322, 1059, 29889, 13, 16492, 29901, 1724, 526, 278, 13501, 304, 9330, 897, 510, 3283, 29973, 13, 29648, 1319, 673, 29901, 518, 29914, 25580, 29962, 13, 5398, 26227, 508, 367, 26733, 297, 2211, 5837, 29901, 313, 29896, 29897, 773, 2560, 9508, 292, 763, 376, 7789, 567, 363, 1060, 29979, 29999, 7790, 29876, 29896, 19602, 376, 5618, 526, 278, 1014, 1484, 1338, 363, 3657, 15387, 1060, 29979, 29999, 29973, 613, 313, 29906, 29897, 491, 773, 3414, 29899, 14940, 11994, 29892, 470, 313, 29941, 29897, 411, 5199, 10970, 29889, 2], 'total_duration': 1364428708, 'load_duration': 1246375, 'sample_count': 62, 'sample_duration': 44859000, 'prompt_eval_count': 1, 'eval_count': 62, 'eval_duration': 1313002000}\n"
-     ]
-    }
-   ],
+   "outputs": [],
   "source": [
    "from langchain.schema import LLMResult\n",
    "from langchain.callbacks.base import BaseCallbackHandler\n",
@@ -345,6 +327,78 @@
   "source": [
    "62 / (1313002000/1000/1000/1000)"
   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Using the Hub for prompt management\n",
+    " \n",
+    "Open source models often benefit from specific prompts. \n",
+    "\n",
+    "For example, [Mistral 7b](https://mistral.ai/news/announcing-mistral-7b/) was fine-tuned for chat using the prompt format shown [here](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1).\n",
+    "\n",
+    "Get the model: `ollama pull mistral:7b-instruct`"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 14,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# LLM\n",
+    "from langchain.llms import Ollama\n",
+    "from langchain.callbacks.manager import CallbackManager\n",
+    "from langchain.callbacks.streaming_stdout import StreamingStdOutCallbackHandler\n",
+    "llm = Ollama(model=\"mistral:7b-instruct\",\n",
+    "             verbose=True,\n",
+    "             callback_manager=CallbackManager([StreamingStdOutCallbackHandler()]))"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 15,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain import hub\n",
+    "QA_CHAIN_PROMPT = hub.pull(\"rlm/rag-prompt-mistral\")\n",
+    "\n",
+    "# QA chain\n",
+    "from langchain.chains import RetrievalQA\n",
+    "qa_chain = RetrievalQA.from_chain_type(\n",
+    "    llm,\n",
+    "    retriever=vectorstore.as_retriever(),\n",
+    "    chain_type_kwargs={\"prompt\": QA_CHAIN_PROMPT},\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 17,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "\n",
+      "There are different approaches to Task Decomposition for AI Agents such as Chain of thought (CoT) and Tree of Thoughts (ToT). CoT breaks down big tasks into multiple manageable tasks and generates multiple thoughts per step, while ToT explores multiple reasoning possibilities at each step. Task decomposition can be done by LLM with simple prompting or using task-specific instructions or human inputs."
+     ]
+    }
+   ],
+   "source": [
+    "question = \"What are the various approaches to Task Decomposition for AI Agents?\"\n",
+    "result = qa_chain({\"query\": question})"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": []
  }
 ],
 "metadata": {
@@ -363,9 +417,9 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.5"
+   "version": "3.9.16"
  }
 },
 "nbformat": 4,
- "nbformat_minor": 2
+ "nbformat_minor": 4
 }
--- a/docs/extras/integrations/memory/aws_dynamodb.ipynb
+++ b/docs/extras/integrations/memory/aws_dynamodb.ipynb
@@ -0,0 +1,371 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "91c6a7ef",
+   "metadata": {},
+   "source": [
+    "# AWS DynamoDB\n",
+    "\n",
+    ">[Amazon AWS DynamoDB](https://awscli.amazonaws.com/v2/documentation/api/latest/reference/dynamodb/index.html) is a fully managed `NoSQL` database service that provides fast and predictable performance with seamless scalability.\n",
+    "\n",
+    "This notebook goes over how to use `DynamoDB` to store chat message history."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "3f608be0",
+   "metadata": {},
+   "source": [
+    "First make sure you have correctly configured the [AWS CLI](https://docs.aws.amazon.com/cli/latest/userguide/cli-chap-configure.html). Then make sure you have installed `boto3`."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "3a7e89c2-4c55-4a66-91ec-9bf9a37467eb",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "!pip install boto3"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "030d784f",
+   "metadata": {},
+   "source": [
+    "Next, create the `DynamoDB` Table where we will be storing messages:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "id": "93ce1811",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "0\n"
+     ]
+    }
+   ],
+   "source": [
+    "import boto3\n",
+    "\n",
+    "# Get the service resource.\n",
+    "dynamodb = boto3.resource(\"dynamodb\")\n",
+    "\n",
+    "# Create the DynamoDB table.\n",
+    "table = dynamodb.create_table(\n",
+    "    TableName=\"SessionTable\",\n",
+    "    KeySchema=[{\"AttributeName\": \"SessionId\", \"KeyType\": \"HASH\"}],\n",
+    "    AttributeDefinitions=[{\"AttributeName\": \"SessionId\", \"AttributeType\": \"S\"}],\n",
+    "    BillingMode=\"PAY_PER_REQUEST\",\n",
+    ")\n",
+    "\n",
+    "# Wait until the table exists.\n",
+    "table.meta.client.get_waiter(\"table_exists\").wait(TableName=\"SessionTable\")\n",
+    "\n",
+    "# Print out some data about the table.\n",
+    "print(table.item_count)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "1a9b310b",
+   "metadata": {},
+   "source": [
+    "## DynamoDBChatMessageHistory"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 11,
+   "id": "d15e3302",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.memory.chat_message_histories import DynamoDBChatMessageHistory\n",
+    "\n",
+    "history = DynamoDBChatMessageHistory(table_name=\"SessionTable\", session_id=\"0\")\n",
+    "\n",
+    "history.add_user_message(\"hi!\")\n",
+    "\n",
+    "history.add_ai_message(\"whats up?\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 12,
+   "id": "64fc465e",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "[HumanMessage(content='hi!', additional_kwargs={}, example=False),\n",
+       " AIMessage(content='whats up?', additional_kwargs={}, example=False),\n",
+       " HumanMessage(content='hi!', additional_kwargs={}, example=False),\n",
+       " AIMessage(content='whats up?', additional_kwargs={}, example=False)]"
+      ]
+     },
+     "execution_count": 12,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "history.messages"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "955f1b15",
+   "metadata": {},
+   "source": [
+    "## DynamoDBChatMessageHistory with Custom Endpoint URL\n",
+    "\n",
+    "Sometimes it is useful to specify the URL to the AWS endpoint to connect to. For instance, when you are running locally against [Localstack](https://localstack.cloud/). For those cases you can specify the URL via the `endpoint_url` parameter in the constructor."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 13,
+   "id": "225713c8",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.memory.chat_message_histories import DynamoDBChatMessageHistory\n",
+    "\n",
+    "history = DynamoDBChatMessageHistory(\n",
+    "    table_name=\"SessionTable\",\n",
+    "    session_id=\"0\",\n",
+    "    endpoint_url=\"http://localhost.localstack.cloud:4566\",\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "97f8578a",
+   "metadata": {},
+   "source": [
+    "## DynamoDBChatMessageHistory With Different Keys Composite Keys\n",
+    "The default key for DynamoDBChatMessageHistory is ```{\"SessionId\": self.session_id}```, but you can modify this to match your table design.\n",
+    "\n",
+    "### Primary Key Name\n",
+    "You may modify the primary key by passing in a primary_key_name value in the constructor, resulting in the following:\n",
+    "```{self.primary_key_name: self.session_id}```\n",
+    "\n",
+    "### Composite Keys\n",
+    "When using an existing DynamoDB table, you may need to modify the key structure from the default of to something including a Sort Key. To do this you may use the ```key``` parameter.\n",
+    "\n",
+    "Passing a value for key will override the primary_key parameter, and the resulting key structure will be the passed value.\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 14,
+   "id": "088c037c",
+   "metadata": {
+    "collapsed": false,
+    "jupyter": {
+     "outputs_hidden": false
+    }
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "0\n"
+     ]
+    },
+    {
+     "data": {
+      "text/plain": [
+       "[HumanMessage(content='hello, composite dynamodb table!', additional_kwargs={}, example=False)]"
+      ]
+     },
+     "execution_count": 14,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "from langchain.memory.chat_message_histories import DynamoDBChatMessageHistory\n",
+    "\n",
+    "composite_table = dynamodb.create_table(\n",
+    "    TableName=\"CompositeTable\",\n",
+    "    KeySchema=[{\"AttributeName\": \"PK\", \"KeyType\": \"HASH\"}, {\"AttributeName\": \"SK\", \"KeyType\": \"RANGE\"}],\n",
+    "    AttributeDefinitions=[{\"AttributeName\": \"PK\", \"AttributeType\": \"S\"}, {\"AttributeName\": \"SK\", \"AttributeType\": \"S\"}],\n",
+    "    BillingMode=\"PAY_PER_REQUEST\",\n",
+    ")\n",
+    "\n",
+    "# Wait until the table exists.\n",
+    "composite_table.meta.client.get_waiter(\"table_exists\").wait(TableName=\"CompositeTable\")\n",
+    "\n",
+    "# Print out some data about the table.\n",
+    "print(composite_table.item_count)\n",
+    "\n",
+    "my_key = {\n",
+    "    \"PK\": \"session_id::0\",\n",
+    "    \"SK\":  \"langchain_history\",\n",
+    "}\n",
+    "\n",
+    "composite_key_history = DynamoDBChatMessageHistory(\n",
+    "    table_name=\"CompositeTable\",\n",
+    "    session_id=\"0\",\n",
+    "    endpoint_url=\"http://localhost.localstack.cloud:4566\",\n",
+    "    key=my_key,\n",
+    ")\n",
+    "\n",
+    "composite_key_history.add_user_message(\"hello, composite dynamodb table!\")\n",
+    "\n",
+    "composite_key_history.messages"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "3b33c988",
+   "metadata": {},
+   "source": [
+    "## Agent with DynamoDB Memory"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 15,
+   "id": "f92d9499",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.agents import Tool\n",
+    "from langchain.memory import ConversationBufferMemory\n",
+    "from langchain.chat_models import ChatOpenAI\n",
+    "from langchain.agents import initialize_agent\n",
+    "from langchain.agents import AgentType\n",
+    "from langchain.utilities import PythonREPL\n",
+    "from getpass import getpass\n",
+    "\n",
+    "message_history = DynamoDBChatMessageHistory(table_name=\"SessionTable\", session_id=\"1\")\n",
+    "memory = ConversationBufferMemory(\n",
+    "    memory_key=\"chat_history\", chat_memory=message_history, return_messages=True\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 16,
+   "id": "1167eeba",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "python_repl = PythonREPL()\n",
+    "\n",
+    "# You can create the tool to pass to an agent\n",
+    "tools = [\n",
+    "    Tool(\n",
+    "        name=\"python_repl\",\n",
+    "        description=\"A Python shell. Use this to execute python commands. Input should be a valid python command. If you want to see the output of a value, you should print it out with `print(...)`.\",\n",
+    "        func=python_repl.run,\n",
+    "    )\n",
+    "]"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 17,
+   "id": "fce085c5",
+   "metadata": {},
+   "outputs": [
+    {
+     "ename": "ValidationError",
+     "evalue": "1 validation error for ChatOpenAI\n__root__\n  Did not find openai_api_key, please add an environment variable `OPENAI_API_KEY` which contains it, or pass  `openai_api_key` as a named parameter. (type=value_error)",
+     "output_type": "error",
+     "traceback": [
+      "\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
+      "\u001b[0;31mValidationError\u001b[0m                           Traceback (most recent call last)",
+      "Cell \u001b[0;32mIn[17], line 1\u001b[0m\n\u001b[0;32m----> 1\u001b[0m llm \u001b[38;5;241m=\u001b[39m \u001b[43mChatOpenAI\u001b[49m\u001b[43m(\u001b[49m\u001b[43mtemperature\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[38;5;241;43m0\u001b[39;49m\u001b[43m)\u001b[49m\n\u001b[1;32m      2\u001b[0m agent_chain \u001b[38;5;241m=\u001b[39m initialize_agent(\n\u001b[1;32m      3\u001b[0m     tools,\n\u001b[1;32m      4\u001b[0m     llm,\n\u001b[0;32m   (...)\u001b[0m\n\u001b[1;32m      7\u001b[0m     memory\u001b[38;5;241m=\u001b[39mmemory,\n\u001b[1;32m      8\u001b[0m )\n",
+      "File \u001b[0;32m~/Documents/projects/langchain/libs/langchain/langchain/load/serializable.py:74\u001b[0m, in \u001b[0;36mSerializable.__init__\u001b[0;34m(self, **kwargs)\u001b[0m\n\u001b[1;32m     73\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21m__init__\u001b[39m(\u001b[38;5;28mself\u001b[39m, \u001b[38;5;241m*\u001b[39m\u001b[38;5;241m*\u001b[39mkwargs: Any) \u001b[38;5;241m-\u001b[39m\u001b[38;5;241m>\u001b[39m \u001b[38;5;28;01mNone\u001b[39;00m:\n\u001b[0;32m---> 74\u001b[0m     \u001b[38;5;28;43msuper\u001b[39;49m\u001b[43m(\u001b[49m\u001b[43m)\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[38;5;21;43m__init__\u001b[39;49m\u001b[43m(\u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43mkwargs\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m     75\u001b[0m     \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_lc_kwargs \u001b[38;5;241m=\u001b[39m kwargs\n",
+      "File \u001b[0;32m~/Documents/projects/langchain/.venv/lib/python3.9/site-packages/pydantic/main.py:341\u001b[0m, in \u001b[0;36mpydantic.main.BaseModel.__init__\u001b[0;34m()\u001b[0m\n",
+      "\u001b[0;31mValidationError\u001b[0m: 1 validation error for ChatOpenAI\n__root__\n  Did not find openai_api_key, please add an environment variable `OPENAI_API_KEY` which contains it, or pass  `openai_api_key` as a named parameter. (type=value_error)"
+     ]
+    }
+   ],
+   "source": [
+    "llm = ChatOpenAI(temperature=0)\n",
+    "agent_chain = initialize_agent(\n",
+    "    tools,\n",
+    "    llm,\n",
+    "    agent=AgentType.CHAT_CONVERSATIONAL_REACT_DESCRIPTION,\n",
+    "    verbose=True,\n",
+    "    memory=memory,\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "952a3103",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "agent_chain.run(input=\"Hello!\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "54c4aaf4",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "agent_chain.run(input=\"Who owns Twitter?\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "f9013118",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "agent_chain.run(input=\"My name is Bob.\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "405e5315",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "agent_chain.run(input=\"Who am I?\")\n"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.10.12"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/extras/integrations/memory/cassandra_chat_message_history.ipynb
+++ b/docs/extras/integrations/memory/cassandra_chat_message_history.ipynb
@@ -5,15 +5,24 @@
   "id": "90cd3ded",
   "metadata": {},
   "source": [
-    "# Cassandra Chat Message History\n",
+    "# Cassandra \n",
    "\n",
-    ">[Apache Cassandra®](https://cassandra.apache.org) is a NoSQL, row-oriented, highly scalable and highly available database, well suited for storing large amounts of data.\n",
+    ">[Apache Cassandra®](https://cassandra.apache.org) is a `NoSQL`, row-oriented, highly scalable and highly available database, well suited for storing large amounts of data.\n",
    "\n",
-    "Cassandra is a good choice for storing chat message history because it is easy to scale and can handle a large number of writes.\n",
+    ">`Cassandra` is a good choice for storing chat message history because it is easy to scale and can handle a large number of writes.\n",
    "\n",
    "This notebook goes over how to use Cassandra to store chat message history.\n",
+    "\n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "f507f58b-bf22-4a48-8daf-68d869bcd1ba",
+   "metadata": {},
+   "source": [
+    "## Setting up\n",
    "\n",
-    "To run this notebook you need either a running Cassandra cluster or a DataStax Astra DB instance running in the cloud (you can get one for free at [datastax.com](https://astra.datastax.com)). Check [cassio.org](https://cassio.org/start_here/) for more information."
+    "To run this notebook you need either a running `Cassandra` cluster or a `DataStax Astra DB` instance running in the cloud (you can get one for free at [datastax.com](https://astra.datastax.com)). Check [cassio.org](https://cassio.org/start_here/) for more information."
   ]
  },
  {
@@ -31,7 +40,7 @@
   "id": "e3d97b65",
   "metadata": {},
   "source": [
-    "### Please provide database connection parameters and secrets:"
+    "### Set up the database connection parameters and secrets"
   ]
  },
  {
@@ -63,7 +72,7 @@
   "id": "55860b2d",
   "metadata": {},
   "source": [
-    "#### depending on whether local or cloud-based Astra DB, create the corresponding database connection \"Session\" object"
+    "Depending on whether local or cloud-based Astra DB, create the corresponding database connection \"Session\" object."
   ]
  },
  {
@@ -105,7 +114,7 @@
   "id": "36c163e8",
   "metadata": {},
   "source": [
-    "### Creation and usage of the Chat Message History"
+    "## Example"
   ]
  },
  {
--- a/docs/extras/integrations/memory/dynamodb_chat_message_history.ipynb
+++ b/docs/extras/integrations/memory/dynamodb_chat_message_history.ipynb
@@ -1,352 +0,0 @@
-{
-  "cells": [
-    {
-      "cell_type": "markdown",
-      "id": "91c6a7ef",
-      "metadata": {},
-      "source": [
-        "# Dynamodb Chat Message History\n",
-        "\n",
-        "This notebook goes over how to use Dynamodb to store chat message history."
-      ]
-    },
-    {
-      "cell_type": "markdown",
-      "id": "3f608be0",
-      "metadata": {},
-      "source": [
-        "First make sure you have correctly configured the [AWS CLI](https://docs.aws.amazon.com/cli/latest/userguide/cli-chap-configure.html). Then make sure you have installed boto3."
-      ]
-    },
-    {
-      "cell_type": "markdown",
-      "id": "030d784f",
-      "metadata": {},
-      "source": [
-        "Next, create the DynamoDB Table where we will be storing messages:"
-      ]
-    },
-    {
-      "cell_type": "code",
-      "execution_count": 10,
-      "id": "93ce1811",
-      "metadata": {},
-      "outputs": [
-        {
-          "name": "stdout",
-          "output_type": "stream",
-          "text": [
-            "0\n"
-          ]
-        }
-      ],
-      "source": [
-        "import boto3\n",
-        "\n",
-        "# Get the service resource.\n",
-        "dynamodb = boto3.resource(\"dynamodb\")\n",
-        "\n",
-        "# Create the DynamoDB table.\n",
-        "table = dynamodb.create_table(\n",
-        "    TableName=\"SessionTable\",\n",
-        "    KeySchema=[{\"AttributeName\": \"SessionId\", \"KeyType\": \"HASH\"}],\n",
-        "    AttributeDefinitions=[{\"AttributeName\": \"SessionId\", \"AttributeType\": \"S\"}],\n",
-        "    BillingMode=\"PAY_PER_REQUEST\",\n",
-        ")\n",
-        "\n",
-        "# Wait until the table exists.\n",
-        "table.meta.client.get_waiter(\"table_exists\").wait(TableName=\"SessionTable\")\n",
-        "\n",
-        "# Print out some data about the table.\n",
-        "print(table.item_count)"
-      ]
-    },
-    {
-      "cell_type": "markdown",
-      "id": "1a9b310b",
-      "metadata": {},
-      "source": [
-        "## DynamoDBChatMessageHistory"
-      ]
-    },
-    {
-      "cell_type": "code",
-      "execution_count": 11,
-      "id": "d15e3302",
-      "metadata": {},
-      "outputs": [],
-      "source": [
-        "from langchain.memory.chat_message_histories import DynamoDBChatMessageHistory\n",
-        "\n",
-        "history = DynamoDBChatMessageHistory(table_name=\"SessionTable\", session_id=\"0\")\n",
-        "\n",
-        "history.add_user_message(\"hi!\")\n",
-        "\n",
-        "history.add_ai_message(\"whats up?\")"
-      ]
-    },
-    {
-      "cell_type": "code",
-      "execution_count": 12,
-      "id": "64fc465e",
-      "metadata": {},
-      "outputs": [
-        {
-          "data": {
-            "text/plain": "[HumanMessage(content='hi!', additional_kwargs={}, example=False),\n AIMessage(content='whats up?', additional_kwargs={}, example=False),\n HumanMessage(content='hi!', additional_kwargs={}, example=False),\n AIMessage(content='whats up?', additional_kwargs={}, example=False)]"
-          },
-          "execution_count": 12,
-          "metadata": {},
-          "output_type": "execute_result"
-        }
-      ],
-      "source": [
-        "history.messages"
-      ]
-    },
-    {
-      "cell_type": "markdown",
-      "id": "955f1b15",
-      "metadata": {},
-      "source": [
-        "## DynamoDBChatMessageHistory with Custom Endpoint URL\n",
-        "\n",
-        "Sometimes it is useful to specify the URL to the AWS endpoint to connect to. For instance, when you are running locally against [Localstack](https://localstack.cloud/). For those cases you can specify the URL via the `endpoint_url` parameter in the constructor."
-      ]
-    },
-    {
-      "cell_type": "code",
-      "execution_count": 13,
-      "id": "225713c8",
-      "metadata": {},
-      "outputs": [],
-      "source": [
-        "from langchain.memory.chat_message_histories import DynamoDBChatMessageHistory\n",
-        "\n",
-        "history = DynamoDBChatMessageHistory(\n",
-        "    table_name=\"SessionTable\",\n",
-        "    session_id=\"0\",\n",
-        "    endpoint_url=\"http://localhost.localstack.cloud:4566\",\n",
-        ")"
-      ]
-    },
-    {
-      "cell_type": "markdown",
-      "source": [
-        "## DynamoDBChatMessageHistory With Different Keys Composite Keys\n",
-        "The default key for DynamoDBChatMessageHistory is ```{\"SessionId\": self.session_id}```, but you can modify this to match your table design.\n",
-        "\n",
-        "### Primary Key Name\n",
-        "You may modify the primary key by passing in a primary_key_name value in the constructor, resulting in the following:\n",
-        "```{self.primary_key_name: self.session_id}```\n",
-        "\n",
-        "### Composite Keys\n",
-        "When using an existing DynamoDB table, you may need to modify the key structure from the default of to something including a Sort Key. To do this you may use the ```key``` parameter.\n",
-        "\n",
-        "Passing a value for key will override the primary_key parameter, and the resulting key structure will be the passed value.\n"
-      ],
-      "metadata": {
-        "collapsed": false
-      },
-      "id": "c9bc0693"
-    },
-    {
-      "cell_type": "code",
-      "execution_count": 14,
-      "outputs": [
-        {
-          "name": "stdout",
-          "output_type": "stream",
-          "text": [
-            "0\n"
-          ]
-        },
-        {
-          "data": {
-            "text/plain": "[HumanMessage(content='hello, composite dynamodb table!', additional_kwargs={}, example=False)]"
-          },
-          "execution_count": 14,
-          "metadata": {},
-          "output_type": "execute_result"
-        }
-      ],
-      "source": [
-        "from langchain.memory.chat_message_histories import DynamoDBChatMessageHistory\n",
-        "\n",
-        "composite_table = dynamodb.create_table(\n",
-        "    TableName=\"CompositeTable\",\n",
-        "    KeySchema=[{\"AttributeName\": \"PK\", \"KeyType\": \"HASH\"}, {\"AttributeName\": \"SK\", \"KeyType\": \"RANGE\"}],\n",
-        "    AttributeDefinitions=[{\"AttributeName\": \"PK\", \"AttributeType\": \"S\"}, {\"AttributeName\": \"SK\", \"AttributeType\": \"S\"}],\n",
-        "    BillingMode=\"PAY_PER_REQUEST\",\n",
-        ")\n",
-        "\n",
-        "# Wait until the table exists.\n",
-        "composite_table.meta.client.get_waiter(\"table_exists\").wait(TableName=\"CompositeTable\")\n",
-        "\n",
-        "# Print out some data about the table.\n",
-        "print(composite_table.item_count)\n",
-        "\n",
-        "my_key = {\n",
-        "    \"PK\": \"session_id::0\",\n",
-        "    \"SK\":  \"langchain_history\",\n",
-        "}\n",
-        "\n",
-        "composite_key_history = DynamoDBChatMessageHistory(\n",
-        "    table_name=\"CompositeTable\",\n",
-        "    session_id=\"0\",\n",
-        "    endpoint_url=\"http://localhost.localstack.cloud:4566\",\n",
-        "    key=my_key,\n",
-        ")\n",
-        "\n",
-        "composite_key_history.add_user_message(\"hello, composite dynamodb table!\")\n",
-        "\n",
-        "composite_key_history.messages"
-      ],
-      "metadata": {
-        "collapsed": false
-      },
-      "id": "a7fa0331"
-    },
-    {
-      "attachments": {},
-      "cell_type": "markdown",
-      "id": "3b33c988",
-      "metadata": {},
-      "source": [
-        "## Agent with DynamoDB Memory"
-      ]
-    },
-    {
-      "cell_type": "code",
-      "execution_count": 15,
-      "id": "f92d9499",
-      "metadata": {},
-      "outputs": [],
-      "source": [
-        "from langchain.agents import Tool\n",
-        "from langchain.memory import ConversationBufferMemory\n",
-        "from langchain.chat_models import ChatOpenAI\n",
-        "from langchain.agents import initialize_agent\n",
-        "from langchain.agents import AgentType\n",
-        "from langchain.utilities import PythonREPL\n",
-        "from getpass import getpass\n",
-        "\n",
-        "message_history = DynamoDBChatMessageHistory(table_name=\"SessionTable\", session_id=\"1\")\n",
-        "memory = ConversationBufferMemory(\n",
-        "    memory_key=\"chat_history\", chat_memory=message_history, return_messages=True\n",
-        ")"
-      ]
-    },
-    {
-      "cell_type": "code",
-      "execution_count": 16,
-      "id": "1167eeba",
-      "metadata": {},
-      "outputs": [],
-      "source": [
-        "python_repl = PythonREPL()\n",
-        "\n",
-        "# You can create the tool to pass to an agent\n",
-        "tools = [\n",
-        "    Tool(\n",
-        "        name=\"python_repl\",\n",
-        "        description=\"A Python shell. Use this to execute python commands. Input should be a valid python command. If you want to see the output of a value, you should print it out with `print(...)`.\",\n",
-        "        func=python_repl.run,\n",
-        "    )\n",
-        "]"
-      ]
-    },
-    {
-      "cell_type": "code",
-      "execution_count": 17,
-      "id": "fce085c5",
-      "metadata": {},
-      "outputs": [
-        {
-          "ename": "ValidationError",
-          "evalue": "1 validation error for ChatOpenAI\n__root__\n  Did not find openai_api_key, please add an environment variable `OPENAI_API_KEY` which contains it, or pass  `openai_api_key` as a named parameter. (type=value_error)",
-          "output_type": "error",
-          "traceback": [
-            "\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
-            "\u001b[0;31mValidationError\u001b[0m                           Traceback (most recent call last)",
-            "Cell \u001b[0;32mIn[17], line 1\u001b[0m\n\u001b[0;32m----> 1\u001b[0m llm \u001b[38;5;241m=\u001b[39m \u001b[43mChatOpenAI\u001b[49m\u001b[43m(\u001b[49m\u001b[43mtemperature\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[38;5;241;43m0\u001b[39;49m\u001b[43m)\u001b[49m\n\u001b[1;32m      2\u001b[0m agent_chain \u001b[38;5;241m=\u001b[39m initialize_agent(\n\u001b[1;32m      3\u001b[0m     tools,\n\u001b[1;32m      4\u001b[0m     llm,\n\u001b[0;32m   (...)\u001b[0m\n\u001b[1;32m      7\u001b[0m     memory\u001b[38;5;241m=\u001b[39mmemory,\n\u001b[1;32m      8\u001b[0m )\n",
-            "File \u001b[0;32m~/Documents/projects/langchain/libs/langchain/langchain/load/serializable.py:74\u001b[0m, in \u001b[0;36mSerializable.__init__\u001b[0;34m(self, **kwargs)\u001b[0m\n\u001b[1;32m     73\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21m__init__\u001b[39m(\u001b[38;5;28mself\u001b[39m, \u001b[38;5;241m*\u001b[39m\u001b[38;5;241m*\u001b[39mkwargs: Any) \u001b[38;5;241m-\u001b[39m\u001b[38;5;241m>\u001b[39m \u001b[38;5;28;01mNone\u001b[39;00m:\n\u001b[0;32m---> 74\u001b[0m     \u001b[38;5;28;43msuper\u001b[39;49m\u001b[43m(\u001b[49m\u001b[43m)\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[38;5;21;43m__init__\u001b[39;49m\u001b[43m(\u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43mkwargs\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m     75\u001b[0m     \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_lc_kwargs \u001b[38;5;241m=\u001b[39m kwargs\n",
-            "File \u001b[0;32m~/Documents/projects/langchain/.venv/lib/python3.9/site-packages/pydantic/main.py:341\u001b[0m, in \u001b[0;36mpydantic.main.BaseModel.__init__\u001b[0;34m()\u001b[0m\n",
-            "\u001b[0;31mValidationError\u001b[0m: 1 validation error for ChatOpenAI\n__root__\n  Did not find openai_api_key, please add an environment variable `OPENAI_API_KEY` which contains it, or pass  `openai_api_key` as a named parameter. (type=value_error)"
-          ]
-        }
-      ],
-      "source": [
-        "llm = ChatOpenAI(temperature=0)\n",
-        "agent_chain = initialize_agent(\n",
-        "    tools,\n",
-        "    llm,\n",
-        "    agent=AgentType.CHAT_CONVERSATIONAL_REACT_DESCRIPTION,\n",
-        "    verbose=True,\n",
-        "    memory=memory,\n",
-        ")"
-      ]
-    },
-    {
-      "cell_type": "code",
-      "execution_count": null,
-      "id": "952a3103",
-      "metadata": {},
-      "outputs": [],
-      "source": [
-        "agent_chain.run(input=\"Hello!\")"
-      ]
-    },
-    {
-      "cell_type": "code",
-      "execution_count": null,
-      "id": "54c4aaf4",
-      "metadata": {},
-      "outputs": [],
-      "source": [
-        "agent_chain.run(input=\"Who owns Twitter?\")"
-      ]
-    },
-    {
-      "cell_type": "code",
-      "execution_count": null,
-      "id": "f9013118",
-      "metadata": {},
-      "outputs": [],
-      "source": [
-        "agent_chain.run(input=\"My name is Bob.\")"
-      ]
-    },
-    {
-      "cell_type": "code",
-      "execution_count": null,
-      "id": "405e5315",
-      "metadata": {},
-      "outputs": [],
-      "source": [
-        "agent_chain.run(input=\"Who am I?\")\n"
-      ]
-    }
-  ],
-  "metadata": {
-    "kernelspec": {
-      "display_name": "Python 3 (ipykernel)",
-      "language": "python",
-      "name": "python3"
-    },
-    "language_info": {
-      "codemirror_mode": {
-        "name": "ipython",
-        "version": 3
-      },
-      "file_extension": ".py",
-      "mimetype": "text/x-python",
-      "name": "python",
-      "nbconvert_exporter": "python",
-      "pygments_lexer": "ipython3",
-      "version": "3.11.3"
-    }
-  },
-  "nbformat": 4,
-  "nbformat_minor": 5
-}
--- a/docs/extras/integrations/memory/momento_chat_message_history.ipynb
+++ b/docs/extras/integrations/memory/momento_chat_message_history.ipynb
@@ -5,9 +5,13 @@
   "id": "91c6a7ef",
   "metadata": {},
   "source": [
-    "# Momento Chat Message History\n",
+    "# Momento Cache\n",
    "\n",
-    "This notebook goes over how to use [Momento Cache](https://gomomento.com) to store chat message history using the `MomentoChatMessageHistory` class. See the Momento [docs](https://docs.momentohq.com/getting-started) for more detail on how to get set up with Momento.\n",
+    ">[Momento Cache](https://docs.momentohq.com/) is the world's first truly serverless caching service. It provides instant elasticity, scale-to-zero \n",
+    "> capability, and blazing-fast performance.  \n",
+    "\n",
+    "\n",
+    "This notebook goes over how to use [Momento Cache](https://www.gomomento.com/services/cache) to store chat message history using the `MomentoChatMessageHistory` class. See the Momento [docs](https://docs.momentohq.com/getting-started) for more detail on how to get set up with Momento.\n",
    "\n",
    "Note that, by default we will create a cache if one with the given name doesn't already exist.\n",
    "\n",
@@ -78,7 +82,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.3"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/extras/integrations/memory/mongodb_chat_message_history.ipynb
+++ b/docs/extras/integrations/memory/mongodb_chat_message_history.ipynb
@@ -1,18 +1,35 @@
 {
 "cells": [
  {
-   "attachments": {},
   "cell_type": "markdown",
   "id": "91c6a7ef",
   "metadata": {},
   "source": [
-    "# Mongodb Chat Message History\n",
+    "# MongodDB\n",
    "\n",
-    "This notebook goes over how to use Mongodb to store chat message history.\n",
+    ">`MongoDB` is a source-available cross-platform document-oriented database program. Classified as a NoSQL database program, `MongoDB` uses `JSON`-like documents with optional schemas.\n",
+    ">\n",
+    ">`MongoDB` is developed by MongoDB Inc. and licensed under the Server Side Public License (SSPL). - [Wikipedia](https://en.wikipedia.org/wiki/MongoDB)\n",
    "\n",
-    "MongoDB is a source-available cross-platform document-oriented database program. Classified as a NoSQL database program, MongoDB uses JSON-like documents with optional schemas.\n",
-    "\n",
-    "MongoDB is developed by MongoDB Inc. and licensed under the Server Side Public License (SSPL). - [Wikipedia](https://en.wikipedia.org/wiki/MongoDB)"
+    "This notebook goes over how to use Mongodb to store chat message history.\n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "2d6ed3c8-b70a-498c-bc9e-41b91797d3b7",
+   "metadata": {},
+   "source": [
+    "## Setting up"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "5a7f3b3f-d9b8-4577-a7ef-bdd8ecaedb70",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "!pip install pymongo"
   ]
  },
  {
@@ -26,6 +43,14 @@
    "connection_string = \"mongodb://mongo_user:password123@mongo:27017\""
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "a8e63850-3e14-46fe-a59e-be6d6bf8fe61",
+   "metadata": {},
+   "source": [
+    "## Example"
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": 4,
@@ -83,7 +108,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.3"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/extras/integrations/memory/motorhead_memory.ipynb
+++ b/docs/extras/integrations/memory/motorhead_memory.ipynb
@@ -4,13 +4,29 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "# Motörhead Memory\n",
-    "[Motörhead](https://github.com/getmetal/motorhead) is a memory server implemented in Rust. It automatically handles incremental summarization in the background and allows for stateless applications.\n",
+    "# Motörhead\n",
+    "\n",
+    ">[Motörhead](https://github.com/getmetal/motorhead) is a memory server implemented in Rust. It automatically handles incremental summarization in the background and allows for stateless applications.\n",
    "\n",
    "## Setup\n",
    "\n",
-    "See instructions at [Motörhead](https://github.com/getmetal/motorhead) for running the server locally.\n",
-    "\n"
+    "See instructions at [Motörhead](https://github.com/getmetal/motorhead) for running the server locally."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.memory.motorhead_memory import MotorheadMemory"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Example"
   ]
  },
  {
@@ -19,7 +35,6 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.memory.motorhead_memory import MotorheadMemory\n",
    "from langchain.llms import OpenAI\nfrom langchain.chains import LLMChain\nfrom langchain.prompts import PromptTemplate\n",
    "\n",
    "template = \"\"\"You are a chatbot having a conversation with a human.\n",
--- a/docs/extras/integrations/memory/motorhead_memory_managed.ipynb
+++ b/docs/extras/integrations/memory/motorhead_memory_managed.ipynb
@@ -1,198 +0,0 @@
-{
- "cells": [
-  {
-   "attachments": {},
-   "cell_type": "markdown",
-   "metadata": {},
-   "source": [
-    "# Motörhead Memory (Managed)\n",
-    "[Motörhead](https://github.com/getmetal/motorhead) is a memory server implemented in Rust. It automatically handles incremental summarization in the background and allows for stateless applications.\n",
-    "\n",
-    "## Setup\n",
-    "\n",
-    "See instructions at [Motörhead](https://docs.getmetal.io/motorhead/introduction) for running the managed version of Motorhead. You can retrieve your `api_key` and `client_id` by creating an account on [Metal](https://getmetal.io).\n",
-    "\n"
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 1,
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "from langchain.memory.motorhead_memory import MotorheadMemory\n",
-    "from langchain.llms import OpenAI\nfrom langchain.chains import LLMChain\nfrom langchain.prompts import PromptTemplate\n",
-    "\n",
-    "template = \"\"\"You are a chatbot having a conversation with a human.\n",
-    "\n",
-    "{chat_history}\n",
-    "Human: {human_input}\n",
-    "AI:\"\"\"\n",
-    "\n",
-    "prompt = PromptTemplate(\n",
-    "    input_variables=[\"chat_history\", \"human_input\"], \n",
-    "    template=template\n",
-    ")\n",
-    "memory = MotorheadMemory(\n",
-    "    api_key=\"YOUR_API_KEY\",\n",
-    "    client_id=\"YOUR_CLIENT_ID\"\n",
-    "    session_id=\"testing-1\",\n",
-    "    memory_key=\"chat_history\"\n",
-    ")\n",
-    "\n",
-    "await memory.init();  # loads previous state from Motörhead 🤘\n",
-    "\n",
-    "llm_chain = LLMChain(\n",
-    "    llm=OpenAI(), \n",
-    "    prompt=prompt, \n",
-    "    verbose=True, \n",
-    "    memory=memory,\n",
-    ")\n",
-    "\n"
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 2,
-   "metadata": {},
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "\n",
-      "\n",
-      "\u001b[1m> Entering new LLMChain chain...\u001b[0m\n",
-      "Prompt after formatting:\n",
-      "\u001b[32;1m\u001b[1;3mYou are a chatbot having a conversation with a human.\n",
-      "\n",
-      "\n",
-      "Human: hi im bob\n",
-      "AI:\u001b[0m\n",
-      "\n",
-      "\u001b[1m> Finished chain.\u001b[0m\n"
-     ]
-    },
-    {
-     "data": {
-      "text/plain": [
-       "' Hi Bob, nice to meet you! How are you doing today?'"
-      ]
-     },
-     "execution_count": 2,
-     "metadata": {},
-     "output_type": "execute_result"
-    }
-   ],
-   "source": [
-    "llm_chain.run(\"hi im bob\")"
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 3,
-   "metadata": {},
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "\n",
-      "\n",
-      "\u001b[1m> Entering new LLMChain chain...\u001b[0m\n",
-      "Prompt after formatting:\n",
-      "\u001b[32;1m\u001b[1;3mYou are a chatbot having a conversation with a human.\n",
-      "\n",
-      "Human: hi im bob\n",
-      "AI:  Hi Bob, nice to meet you! How are you doing today?\n",
-      "Human: whats my name?\n",
-      "AI:\u001b[0m\n",
-      "\n",
-      "\u001b[1m> Finished chain.\u001b[0m\n"
-     ]
-    },
-    {
-     "data": {
-      "text/plain": [
-       "' You said your name is Bob. Is that correct?'"
-      ]
-     },
-     "execution_count": 3,
-     "metadata": {},
-     "output_type": "execute_result"
-    }
-   ],
-   "source": [
-    "llm_chain.run(\"whats my name?\")"
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 4,
-   "metadata": {},
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "\n",
-      "\n",
-      "\u001b[1m> Entering new LLMChain chain...\u001b[0m\n",
-      "Prompt after formatting:\n",
-      "\u001b[32;1m\u001b[1;3mYou are a chatbot having a conversation with a human.\n",
-      "\n",
-      "Human: hi im bob\n",
-      "AI:  Hi Bob, nice to meet you! How are you doing today?\n",
-      "Human: whats my name?\n",
-      "AI:  You said your name is Bob. Is that correct?\n",
-      "Human: whats for dinner?\n",
-      "AI:\u001b[0m\n",
-      "\n",
-      "\u001b[1m> Finished chain.\u001b[0m\n"
-     ]
-    },
-    {
-     "data": {
-      "text/plain": [
-       "\"  I'm sorry, I'm not sure what you're asking. Could you please rephrase your question?\""
-      ]
-     },
-     "execution_count": 4,
-     "metadata": {},
-     "output_type": "execute_result"
-    }
-   ],
-   "source": [
-    "llm_chain.run(\"whats for dinner?\")"
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": null,
-   "metadata": {},
-   "outputs": [],
-   "source": []
-  }
- ],
- "metadata": {
-  "kernelspec": {
-   "display_name": "Python 3 (ipykernel)",
-   "language": "python",
-   "name": "python3"
-  },
-  "language_info": {
-   "codemirror_mode": {
-    "name": "ipython",
-    "version": 3
-   },
-   "file_extension": ".py",
-   "mimetype": "text/x-python",
-   "name": "python",
-   "nbconvert_exporter": "python",
-   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
-  }
- },
- "nbformat": 4,
- "nbformat_minor": 2
-}
--- a/docs/extras/integrations/memory/postgres_chat_message_history.ipynb
+++ b/docs/extras/integrations/memory/postgres_chat_message_history.ipynb
@@ -1,12 +1,13 @@
 {
 "cells": [
  {
-   "attachments": {},
   "cell_type": "markdown",
   "id": "91c6a7ef",
   "metadata": {},
   "source": [
-    "# Postgres Chat Message History\n",
+    "# Postgres\n",
+    "\n",
+    ">[PostgreSQL](https://en.wikipedia.org/wiki/PostgreSQL) also known as `Postgres`, is a free and open-source relational database management system (RDBMS) emphasizing extensibility and SQL compliance.\n",
    "\n",
    "This notebook goes over how to use Postgres to store chat message history."
   ]
@@ -57,7 +58,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.2"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/extras/integrations/memory/redis_chat_message_history.ipynb
+++ b/docs/extras/integrations/memory/redis_chat_message_history.ipynb
@@ -5,9 +5,11 @@
   "id": "91c6a7ef",
   "metadata": {},
   "source": [
-    "# Redis Chat Message History\n",
+    "# Redis\n",
    "\n",
-    "This notebook goes over how to use Redis to store chat message history."
+    ">[Redis (Remote Dictionary Server)](https://en.wikipedia.org/wiki/Redis) is an open-source in-memory storage, used as a distributed, in-memory key–value database, cache and message broker, with optional durability. Because it holds all data in memory and because of its design, `Redis` offers low-latency reads and writes, making it particularly suitable for use cases that require a cache. Redis is the most popular NoSQL database, and one of the most popular databases overall.\n",
+    "\n",
+    "This notebook goes over how to use `Redis` to store chat message history."
   ]
  },
  {
@@ -73,7 +75,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/extras/integrations/memory/rockset_chat_message_history.ipynb
+++ b/docs/extras/integrations/memory/rockset_chat_message_history.ipynb
@@ -1,17 +1,47 @@
 {
 "cells": [
  {
-   "attachments": {},
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "# Rockset Chat Message History\n",
+    "# Rockset\n",
    "\n",
-    "This notebook goes over how to use [Rockset](https://rockset.com/docs) to store chat message history. \n",
+    ">[Rockset](https://rockset.com/product/) is a real-time analytics database service for serving low latency, high concurrency analytical queries at scale. It builds a Converged Index™ on structured and semi-structured data with an efficient store for vector embeddings. Its support for running SQL on schemaless data makes it a perfect choice for running vector search with metadata filters. \n",
    "\n",
+    "\n",
+    "This notebook goes over how to use [Rockset](https://rockset.com/docs) to store chat message history. \n"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Setting up"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "!pip install rockset"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
    "To begin, with get your API key from the [Rockset console](https://console.rockset.com/apikeys). Find your API region for the Rockset [API reference](https://rockset.com/docs/rest-api#introduction)."
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Example"
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
@@ -40,7 +70,6 @@
   ]
  },
  {
-   "attachments": {},
   "cell_type": "markdown",
   "metadata": {},
   "source": [
@@ -57,11 +86,24 @@
  }
 ],
 "metadata": {
-  "language_info": {
-   "name": "python"
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
  },
-  "orig_nbformat": 4
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.10.12"
+  }
 },
 "nbformat": 4,
- "nbformat_minor": 2
+ "nbformat_minor": 4
 }
--- a/docs/extras/integrations/memory/sql_chat_message_history.ipynb
+++ b/docs/extras/integrations/memory/sql_chat_message_history.ipynb
@@ -2,36 +2,58 @@
 "cells": [
  {
   "cell_type": "markdown",
+   "id": "f22eab3f84cbeb37",
+   "metadata": {},
   "source": [
-    "# SQL Chat Message History\n",
+    "# SQL (SQLAlchemy)\n",
    "\n",
-    "This notebook goes over a **SQLChatMessageHistory** class that allows to store chat history in any database supported by SQLAlchemy.\n",
+    ">[Structured Query Language (SQL)](https://en.wikipedia.org/wiki/SQL) is a domain-specific language used in programming and designed for managing data held in a relational database management system (RDBMS), or for stream processing in a relational data stream management system (RDSMS). It is particularly useful in handling structured data, i.e., data incorporating relations among entities and variables.\n",
    "\n",
-    "Please note that to use it with databases other than SQLite, you will need to install the corresponding database driver."
-   ],
-   "metadata": {
-    "collapsed": false
-   },
-   "id": "f22eab3f84cbeb37"
+    ">[SQLAlchemy](https://github.com/sqlalchemy/sqlalchemy) is an open-source `SQL` toolkit and object-relational mapper (ORM) for the Python programming language released under the MIT License.\n",
+    "\n",
+    "This notebook goes over a `SQLChatMessageHistory` class that allows to store chat history in any database supported by `SQLAlchemy`.\n",
+    "\n",
+    "Please note that to use it with databases other than `SQLite`, you will need to install the corresponding database driver."
+   ]
  },
  {
   "cell_type": "markdown",
+   "id": "f8f2830ee9ca1e01",
+   "metadata": {},
   "source": [
-    "### Basic Usage\n",
+    "## Basic Usage\n",
    "\n",
    "To use the storage you need to provide only 2 things:\n",
    "\n",
    "1. Session Id - a unique identifier of the session, like user name, email, chat id etc.\n",
-    "2. Connection string - a string that specifies the database connection. It will be passed to SQLAlchemy create_engine function."
-   ],
-   "metadata": {
-    "collapsed": false
-   },
-   "id": "f8f2830ee9ca1e01"
+    "2. Connection string - a string that specifies the database connection. It will be passed to SQLAlchemy create_engine function.\n",
+    "3. Install `SQLAlchemy` python package."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "ab016290-3823-4e1b-9610-ae9a1b71cb07",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "!pip install SQLAlchemy"
+   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
+   "id": "4576e914a866fb40",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-08-28T10:04:38.077748Z",
+     "start_time": "2023-08-28T10:04:36.105894Z"
+    },
+    "collapsed": false,
+    "jupyter": {
+     "outputs_hidden": false
+    }
+   },
   "outputs": [],
   "source": [
    "from langchain.memory.chat_message_histories import SQLChatMessageHistory\n",
@@ -43,23 +65,29 @@
    "\n",
    "chat_message_history.add_user_message('Hello')\n",
    "chat_message_history.add_ai_message('Hi')"
-   ],
-   "metadata": {
-    "collapsed": false,
-    "ExecuteTime": {
-     "end_time": "2023-08-28T10:04:38.077748Z",
-     "start_time": "2023-08-28T10:04:36.105894Z"
-    }
-   },
-   "id": "4576e914a866fb40"
+   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
+   "id": "b476688cbb32ba90",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-08-28T10:04:38.929396Z",
+     "start_time": "2023-08-28T10:04:38.915727Z"
+    },
+    "collapsed": false,
+    "jupyter": {
+     "outputs_hidden": false
+    }
+   },
   "outputs": [
    {
     "data": {
-      "text/plain": "[HumanMessage(content='Hello', additional_kwargs={}, example=False),\n AIMessage(content='Hi', additional_kwargs={}, example=False)]"
+      "text/plain": [
+       "[HumanMessage(content='Hello', additional_kwargs={}, example=False),\n",
+       " AIMessage(content='Hi', additional_kwargs={}, example=False)]"
+      ]
     },
     "execution_count": 2,
     "metadata": {},
@@ -68,35 +96,36 @@
   ],
   "source": [
    "chat_message_history.messages"
-   ],
-   "metadata": {
-    "collapsed": false,
-    "ExecuteTime": {
-     "end_time": "2023-08-28T10:04:38.929396Z",
-     "start_time": "2023-08-28T10:04:38.915727Z"
-    }
-   },
-   "id": "b476688cbb32ba90"
+   ]
  },
  {
   "cell_type": "markdown",
+   "id": "2e5337719d5614fd",
+   "metadata": {},
   "source": [
-    "### Custom Storage Format\n",
+    "## Custom Storage Format\n",
    "\n",
    "By default, only the session id and message dictionary are stored in the table.\n",
    "\n",
    "However, sometimes you might want to store some additional information, like message date, author, language etc.\n",
    "\n",
    "To do that, you can create a custom message converter, by implementing **BaseMessageConverter** interface."
-   ],
-   "metadata": {
-    "collapsed": false
-   },
-   "id": "2e5337719d5614fd"
+   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
+   "id": "fdfde84c07d071bb",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-08-28T10:04:41.510498Z",
+     "start_time": "2023-08-28T10:04:41.494912Z"
+    },
+    "collapsed": false,
+    "jupyter": {
+     "outputs_hidden": false
+    }
+   },
   "outputs": [],
   "source": [
    "from datetime import datetime\n",
@@ -165,23 +194,29 @@
    "\n",
    "chat_message_history.add_user_message('Hello')\n",
    "chat_message_history.add_ai_message('Hi')"
-   ],
-   "metadata": {
-    "collapsed": false,
-    "ExecuteTime": {
-     "end_time": "2023-08-28T10:04:41.510498Z",
-     "start_time": "2023-08-28T10:04:41.494912Z"
-    }
-   },
-   "id": "fdfde84c07d071bb"
+   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
+   "id": "4a6a54d8a9e2856f",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-08-28T10:04:43.497990Z",
+     "start_time": "2023-08-28T10:04:43.492517Z"
+    },
+    "collapsed": false,
+    "jupyter": {
+     "outputs_hidden": false
+    }
+   },
   "outputs": [
    {
     "data": {
-      "text/plain": "[HumanMessage(content='Hello', additional_kwargs={}, example=False),\n AIMessage(content='Hi', additional_kwargs={}, example=False)]"
+      "text/plain": [
+       "[HumanMessage(content='Hello', additional_kwargs={}, example=False),\n",
+       " AIMessage(content='Hi', additional_kwargs={}, example=False)]"
+      ]
     },
     "execution_count": 4,
     "metadata": {},
@@ -190,44 +225,34 @@
   ],
   "source": [
    "chat_message_history.messages"
-   ],
-   "metadata": {
-    "collapsed": false,
-    "ExecuteTime": {
-     "end_time": "2023-08-28T10:04:43.497990Z",
-     "start_time": "2023-08-28T10:04:43.492517Z"
-    }
-   },
-   "id": "4a6a54d8a9e2856f"
+   ]
  },
  {
   "cell_type": "markdown",
+   "id": "622aded629a1adeb",
+   "metadata": {},
   "source": [
    "You also might want to change the name of session_id column. In this case you'll need to specify `session_id_field_name` parameter."
-   ],
-   "metadata": {
-    "collapsed": false
-   },
-   "id": "622aded629a1adeb"
+   ]
  }
 ],
 "metadata": {
  "kernelspec": {
-   "display_name": "Python 3",
+   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
-    "version": 2
+    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
-   "pygments_lexer": "ipython2",
-   "version": "2.7.6"
+   "pygments_lexer": "ipython3",
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/extras/integrations/memory/entity_memory_with_sqlite.ipynb
+++ b/docs/extras/integrations/memory/entity_memory_with_sqlite.ipynb
@@ -2,19 +2,32 @@
 "cells": [
  {
   "cell_type": "markdown",
+   "id": "d464a12a",
   "metadata": {
    "id": "eg0Hwptz9g5q"
   },
   "source": [
-    "# Entity Memory with SQLite storage\n",
+    "# SQLite\n",
    "\n",
-    "In this walkthrough we'll create a simple conversation chain which uses ConversationEntityMemory backed by a SqliteEntityStore."
-   ],
-   "id": "d464a12a"
+    ">[SQLite](https://en.wikipedia.org/wiki/SQLite) is a database engine written in the C programming language. It is not a standalone app; rather, it is a library that software developers embed in their apps. As such, it belongs to the family of embedded databases. It is the most widely deployed database engine, as it is used by several of the top web browsers, operating systems, mobile phones, and other embedded systems.\n",
+    "\n",
+    "In this walkthrough we'll create a simple conversation chain which uses `ConversationEntityMemory` backed by a `SqliteEntityStore`."
+   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
+   "id": "d0a07a30-028f-4e16-8b11-45b2416f7b0f",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "#!pip install sqlite3"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "id": "db59b901",
   "metadata": {
    "id": "2wUMSUoF8ffn"
   },
@@ -25,12 +38,12 @@
    "from langchain.memory import ConversationEntityMemory\n",
    "from langchain.memory.entity import SQLiteEntityStore\n",
    "from langchain.memory.prompt import ENTITY_MEMORY_CONVERSATION_TEMPLATE"
-   ],
-   "id": "db59b901"
+   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
+   "id": "ca6dee29",
   "metadata": {
    "id": "8TpJZti99gxV"
   },
@@ -45,22 +58,22 @@
    "    memory=memory,\n",
    "    verbose=True,\n",
    ")"
-   ],
-   "id": "ca6dee29"
+   ]
  },
  {
   "cell_type": "markdown",
+   "id": "f9b4c3a0",
   "metadata": {
    "id": "HEAHG1L79ca1"
   },
   "source": [
    "Notice the usage of `EntitySqliteStore` as parameter to `entity_store` on the `memory` property."
-   ],
-   "id": "f9b4c3a0"
+   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
+   "id": "297e78a6",
   "metadata": {
    "colab": {
     "base_uri": "https://localhost:8080/",
@@ -111,12 +124,12 @@
   ],
   "source": [
    "conversation.run(\"Deven & Sam are working on a hackathon project\")"
-   ],
-   "id": "297e78a6"
+   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
+   "id": "7e71f1dc",
   "metadata": {
    "colab": {
     "base_uri": "https://localhost:8080/",
@@ -139,12 +152,12 @@
   ],
   "source": [
    "conversation.memory.entity_store.get(\"Deven\")"
-   ],
-   "id": "7e71f1dc"
+   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
+   "id": "316f2e8d",
   "metadata": {},
   "outputs": [
    {
@@ -160,16 +173,15 @@
   ],
   "source": [
    "conversation.memory.entity_store.get(\"Sam\")"
-   ],
-   "id": "316f2e8d"
+   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
+   "id": "b85f8427",
   "metadata": {},
   "outputs": [],
-   "source": [],
-   "id": "b85f8427"
+   "source": []
  }
 ],
 "metadata": {
@@ -177,9 +189,9 @@
   "provenance": []
  },
  "kernelspec": {
-   "display_name": "venv",
+   "display_name": "Python 3 (ipykernel)",
   "language": "python",
-   "name": "venv"
+   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
@@ -191,9 +203,9 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.3"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 5
-}
+}
--- a/docs/extras/integrations/memory/streamlit_chat_message_history.ipynb
+++ b/docs/extras/integrations/memory/streamlit_chat_message_history.ipynb
@@ -5,13 +5,17 @@
   "id": "91c6a7ef",
   "metadata": {},
   "source": [
-    "# Streamlit Chat Message History\n",
+    "# Streamlit\n",
    "\n",
-    "This notebook goes over how to store and use chat message history in a Streamlit app. StreamlitChatMessageHistory will store messages in\n",
+    ">[Streamlit](https://docs.streamlit.io/) is an open-source Python library that makes it easy to create and share beautiful, \n",
+    "custom web apps for machine learning and data science.\n",
+    "\n",
+    "\n",
+    "This notebook goes over how to store and use chat message history in a `Streamlit` app. `StreamlitChatMessageHistory` will store messages in\n",
    "[Streamlit session state](https://docs.streamlit.io/library/api-reference/session-state)\n",
    "at the specified `key=`. The default key is `\"langchain_messages\"`.\n",
    "\n",
-    "- Note, StreamlitChatMessageHistory only works when run in a Streamlit app.\n",
+    "- Note, `StreamlitChatMessageHistory` only works when run in a Streamlit app.\n",
    "- You may also be interested in [StreamlitCallbackHandler](/docs/integrations/callbacks/streamlit) for LangChain.\n",
    "- For more on Streamlit check out their\n",
    "[getting started documentation](https://docs.streamlit.io/library/get-started).\n",
@@ -50,7 +54,7 @@
   "id": "b60dc735",
   "metadata": {},
   "source": [
-    "You can integrate StreamlitChatMessageHistory into ConversationBufferMemory and chains or agents as usual. The history will be persisted across re-runs of the Streamlit app within a given user session. A given StreamlitChatMessageHistory will NOT be persisted or shared across user sessions."
+    "You can integrate `StreamlitChatMessageHistory` into `ConversationBufferMemory` and chains or agents as usual. The history will be persisted across re-runs of the Streamlit app within a given user session. A given `StreamlitChatMessageHistory` will NOT be persisted or shared across user sessions."
   ]
  },
  {
@@ -132,9 +136,9 @@
 ],
 "metadata": {
  "kernelspec": {
-   "display_name": "poetry-venv",
+   "display_name": "Python 3 (ipykernel)",
   "language": "python",
-   "name": "poetry-venv"
+   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
@@ -146,7 +150,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/extras/integrations/memory/xata_chat_message_history.ipynb
+++ b/docs/extras/integrations/memory/xata_chat_message_history.ipynb
@@ -4,9 +4,9 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "# Xata chat memory\n",
+    "# Xata\n",
    "\n",
-    "[Xata](https://xata.io) is a serverless data platform, based on PostgreSQL and Elasticsearch. It provides a Python SDK for interacting with your database, and a UI for managing your data. With the `XataChatMessageHistory` class, you can use Xata databases for longer-term persistence of chat sessions.\n",
+    ">[Xata](https://xata.io) is a serverless data platform, based on `PostgreSQL` and `Elasticsearch`. It provides a Python SDK for interacting with your database, and a UI for managing your data. With the `XataChatMessageHistory` class, you can use Xata databases for longer-term persistence of chat sessions.\n",
    "\n",
    "This notebook covers:\n",
    "\n",
@@ -318,7 +318,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.10.9"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/extras/integrations/memory/zep_memory.ipynb
+++ b/docs/extras/integrations/memory/zep_memory.ipynb
@@ -1,26 +1,14 @@
 {
 "cells": [
  {
-   "attachments": {},
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "# Zep Memory\n",
+    "# Zep\n",
    "\n",
-    "## REACT Agent Chat Message History with Zep - A long-term memory store for LLM applications.\n",
-    "\n",
-    "This notebook demonstrates how to use the [Zep Long-term Memory Store](https://docs.getzep.com/) as memory for your chatbot.\n",
-    "\n",
-    "We'll demonstrate:\n",
-    "\n",
-    "1. Adding conversation history to the Zep memory store.\n",
-    "2. Running an agent and having message automatically added to the store.\n",
-    "3. Viewing the enriched messages.\n",
-    "4. Vector search over the conversation history.\n",
-    "\n",
-    "### More on Zep:\n",
-    "\n",
-    "Zep stores, summarizes, embeds, indexes, and enriches conversational AI chat histories, and exposes them via simple, low-latency APIs.\n",
+    ">[Zep](https://docs.getzep.com/) is a long-term memory store for LLM applications.\n",
+    ">\n",
+    ">`Zep` stores, summarizes, embeds, indexes, and enriches conversational AI chat histories, and exposes them via simple, low-latency APIs.\n",
    "\n",
    "Key Features:\n",
    "\n",
@@ -32,8 +20,21 @@
    "- **Auto-token counting** of memories and summaries, allowing finer-grained control over prompt assembly.\n",
    "- Python and JavaScript SDKs.\n",
    "\n",
-    "Zep project: [https://github.com/getzep/zep](https://github.com/getzep/zep)\n",
-    "Docs: [https://docs.getzep.com/](https://docs.getzep.com/)\n"
+    "`Zep` project: [https://github.com/getzep/zep](https://github.com/getzep/zep)\n",
+    "Docs: [https://docs.getzep.com/](https://docs.getzep.com/)\n",
+    "\n",
+    "\n",
+    "## Example\n",
+    "\n",
+    "This notebook demonstrates how to use the [Zep Long-term Memory Store](https://docs.getzep.com/) as memory for your chatbot.\n",
+    "REACT Agent Chat Message History with Zep - A long-term memory store for LLM applications.\n",
+    "\n",
+    "We'll demonstrate:\n",
+    "\n",
+    "1. Adding conversation history to the Zep memory store.\n",
+    "2. Running an agent and having message automatically added to the store.\n",
+    "3. Viewing the enriched messages.\n",
+    "4. Vector search over the conversation history."
   ]
  },
  {
@@ -96,7 +97,6 @@
   ]
  },
  {
-   "attachments": {},
   "cell_type": "markdown",
   "metadata": {},
   "source": [
@@ -143,7 +143,6 @@
   ]
  },
  {
-   "attachments": {},
   "cell_type": "markdown",
   "metadata": {},
   "source": [
@@ -232,7 +231,6 @@
   ]
  },
  {
-   "attachments": {},
   "cell_type": "markdown",
   "metadata": {},
   "source": [
@@ -257,16 +255,18 @@
     "text": [
      "\n",
      "\n",
-      "\u001B[1m> Entering new  chain...\u001B[0m\n",
-      "\u001B[32;1m\u001B[1;3mThought: Do I need to use a tool? No\n",
-      "AI: Parable of the Sower is a prescient novel that speaks to the challenges facing contemporary society, such as climate change, inequality, and violence. It is a cautionary tale that warns of the dangers of unchecked greed and the need for individuals to take responsibility for their own lives and the lives of those around them.\u001B[0m\n",
+      "\u001b[1m> Entering new  chain...\u001b[0m\n",
+      "\u001b[32;1m\u001b[1;3mThought: Do I need to use a tool? No\n",
+      "AI: Parable of the Sower is a prescient novel that speaks to the challenges facing contemporary society, such as climate change, inequality, and violence. It is a cautionary tale that warns of the dangers of unchecked greed and the need for individuals to take responsibility for their own lives and the lives of those around them.\u001b[0m\n",
      "\n",
-      "\u001B[1m> Finished chain.\u001B[0m\n"
+      "\u001b[1m> Finished chain.\u001b[0m\n"
     ]
    },
    {
     "data": {
-      "text/plain": "'Parable of the Sower is a prescient novel that speaks to the challenges facing contemporary society, such as climate change, inequality, and violence. It is a cautionary tale that warns of the dangers of unchecked greed and the need for individuals to take responsibility for their own lives and the lives of those around them.'"
+      "text/plain": [
+       "'Parable of the Sower is a prescient novel that speaks to the challenges facing contemporary society, such as climate change, inequality, and violence. It is a cautionary tale that warns of the dangers of unchecked greed and the need for individuals to take responsibility for their own lives and the lives of those around them.'"
+      ]
     },
     "execution_count": 6,
     "metadata": {},
@@ -280,7 +280,6 @@
   ]
  },
  {
-   "attachments": {},
   "cell_type": "markdown",
   "metadata": {},
   "source": [
@@ -341,7 +340,6 @@
   ]
  },
  {
-   "attachments": {},
   "cell_type": "markdown",
   "metadata": {},
   "source": [
@@ -391,11 +389,14 @@
  {
   "cell_type": "code",
   "execution_count": null,
-   "outputs": [],
-   "source": [],
   "metadata": {
-    "collapsed": false
-   }
+    "collapsed": false,
+    "jupyter": {
+     "outputs_hidden": false
+    }
+   },
+   "outputs": [],
+   "source": []
  }
 ],
 "metadata": {
@@ -414,7 +415,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.4"
+   "version": "3.10.12"
  }
 },
 "nbformat": 4,
--- a/docs/extras/integrations/platforms/aws.mdx
+++ b/docs/extras/integrations/platforms/aws.mdx
@@ -1,6 +1,6 @@
 # AWS

-All functionality related to AWS platform
+All functionality related to [Amazon AWS](https://aws.amazon.com/) platform

 ## LLMs

@@ -70,7 +70,7 @@ from langchain.llms.sagemaker_endpoint import ContentHandlerBase

 ## Document loaders

-### AWS S3 Directory
+### AWS S3 Directory and File
 >[Amazon Simple Storage Service (Amazon S3)](https://docs.aws.amazon.com/AmazonS3/latest/userguide/using-folders.html) is an object storage service.
 >[AWS S3 Directory](https://docs.aws.amazon.com/AmazonS3/latest/userguide/using-folders.html)
 >[AWS S3 Buckets](https://docs.aws.amazon.com/AmazonS3/latest/userguide/UsingBucket.html)
@@ -82,3 +82,24 @@ See a [usage example for S3FileLoader](/docs/integrations/document_loaders/aws_s
 ```python
 from langchain.document_loaders import S3DirectoryLoader, S3FileLoader
 ```
+
+## Memory
+
+### AWS DynamoDB
+
+>[AWS DynamoDB](https://awscli.amazonaws.com/v2/documentation/api/latest/reference/dynamodb/index.html) 
+> is a fully managed `NoSQL` database service that provides fast and predictable performance with seamless scalability.
+ 
+We have to configure the [AWS CLI](https://docs.aws.amazon.com/cli/latest/userguide/cli-chap-configure.html). 
+
+We need to install the `boto3` library.
+
+```bash
+pip install boto3
+```
+
+See a [usage example](/docs/integrations/memory/aws_dynamodb).
+
+```python
+from langchain.memory import DynamoDBChatMessageHistory
+```
--- a/docs/extras/integrations/platforms/google.mdx
+++ b/docs/extras/integrations/platforms/google.mdx
@@ -1,6 +1,6 @@
 # Google

-All functionality related to Google Platform
+All functionality related to [Google Cloud Platform](https://cloud.google.com/)

 ## LLMs

@@ -37,7 +37,7 @@ from langchain.chat_models import ChatVertexAI
 >[Google BigQuery](https://cloud.google.com/bigquery) is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.
 `BigQuery` is a part of the `Google Cloud Platform`.

-First, you need to install `google-cloud-bigquery` python package.
+First, we need to install `google-cloud-bigquery` python package.

 ```bash
 pip install google-cloud-bigquery
@@ -53,7 +53,7 @@ from langchain.document_loaders import BigQueryLoader

 >[Google Cloud Storage](https://en.wikipedia.org/wiki/Google_Cloud_Storage) is a managed service for storing unstructured data.

-First, you need to install `google-cloud-storage` python package.
+First, we need to install `google-cloud-storage` python package.

 ```bash
 pip install google-cloud-storage
@@ -78,7 +78,7 @@ from langchain.document_loaders import GCSFileLoader

 Currently, only `Google Docs` are supported.

-First, you need to install several python package.
+First, we need to install several python package.

 ```bash
 pip install google-api-python-client google-auth-httplib2 google-auth-oauthlib
@@ -109,6 +109,32 @@ See a [usage example](/docs/integrations/vectorstores/matchingengine).
 from langchain.vectorstores import MatchingEngine
 ```

+### Google ScaNN
+
+>[Google ScaNN](https://github.com/google-research/google-research/tree/master/scann)
+> (Scalable Nearest Neighbors) is a python package.
+> 
+>`ScaNN` is a method for efficient vector similarity search at scale.
+
+>`ScaNN` includes search space pruning and quantization for Maximum Inner
+> Product Search and also supports other distance functions such as
+> Euclidean distance. The implementation is optimized for x86 processors
+> with AVX2 support. See its [Google Research github](https://github.com/google-research/google-research/tree/master/scann)
+> for more details.
+
+We need to install `scann` python package.
+
+```bash
+pip install scann
+```
+
+See a [usage example](/docs/integrations/vectorstores/scann).
+
+```python
+from langchain.vectorstores import ScaNN
+```
+
+
 ## Tools
 ### Google Search

@@ -123,8 +149,36 @@ from langchain.utilities import GoogleSearchAPIWrapper
 ```
 For a more detailed walkthrough of this wrapper, see [this notebook](/docs/integrations/tools/google_search.html).

-You can easily load this wrapper as a Tool (to use with an Agent). You can do this with:
+We can easily load this wrapper as a Tool (to use with an Agent). We can do this with:
 ```python
 from langchain.agents import load_tools
 tools = load_tools(["google-search"])
 ```
+
+## Document Transformer
+### Google Document AI
+
+>[Document AI](https://cloud.google.com/document-ai/docs/overview) is a `Google Cloud Platform` 
+> service to transform unstructured data from documents into structured data, making it easier 
+> to understand, analyze, and consume.  
+
+
+
+We need to set up a [`GCS` bucket and create your own OCR processor](https://cloud.google.com/document-ai/docs/create-processor)  
+The `GCS_OUTPUT_PATH` should be a path to a folder on GCS (starting with `gs://`) 
+and a processor name should look like `projects/PROJECT_NUMBER/locations/LOCATION/processors/PROCESSOR_ID`.
+We can get it either programmatically or copy from the `Prediction endpoint` section of the `Processor details`
+tab in the Google Cloud Console.
+
+```bash
+pip install google-cloud-documentai
+pip install google-cloud-documentai-toolbox
+```
+
+
+See a [usage example](/docs/integrations/document_transformers/docai).
+
+```python
+from langchain.document_loaders.blob_loaders import Blob
+from langchain.document_loaders.parsers import DocAIParser
+```
--- a/docs/extras/integrations/providers/beautiful_soup.mdx
+++ b/docs/extras/integrations/providers/beautiful_soup.mdx
@@ -0,0 +1,20 @@
+# Beautiful Soup
+
+>[Beautiful Soup](https://www.crummy.com/software/BeautifulSoup/) is a Python package for parsing 
+> HTML and XML documents (including having malformed markup, i.e. non-closed tags, so named after tag soup). 
+> It creates a parse tree for parsed pages that can be used to extract data from HTML,[3] which 
+> is useful for web scraping.
+
+## Installation and Setup
+
+```bash
+pip install beautifulsoup4
+```
+
+## Document Transformer
+
+See a [usage example](/docs/integrations/document_transformers/beautiful_soup).
+
+```python
+from langchain.document_loaders import BeautifulSoupTransformer
+```
--- a/docs/extras/integrations/providers/clarifai.mdx
+++ b/docs/extras/integrations/providers/clarifai.mdx
@@ -43,10 +43,10 @@ For more details, the docs on the Clarifai Embeddings wrapper provide a [detaile

 Clarifai's vector DB was launched in 2016 and has been optimized to support live search queries. With workflows in the Clarifai platform, you data is automatically indexed by am embedding model and optionally other models as well to index that information in the DB for search. You can query the DB not only via the vectors but also filter by metadata matches, other AI predicted concepts, and even do geo-coordinate search. Simply create an application, select the appropriate base workflow for your type of data, and upload it (through the API as [documented here](https://docs.clarifai.com/api-guide/data/create-get-update-delete) or the UIs at clarifai.com).

-You an also add data directly from LangChain as well, and the auto-indexing will take place for you. You'll notice this is a little different than other vectorstores where you need to provde an embedding model in their constructor and have LangChain coordinate getting the embeddings from text and writing those to the index. Not only is it more convenient, but it's much more scalable to use Clarifai's distributed cloud to do all the index in the background.
+You can also add data directly from LangChain as well, and the auto-indexing will take place for you. You'll notice this is a little different than other vectorstores where you need to provide an embedding model in their constructor and have LangChain coordinate getting the embeddings from text and writing those to the index. Not only is it more convenient, but it's much more scalable to use Clarifai's distributed cloud to do all the index in the background.

 ```python
 from langchain.vectorstores import Clarifai
 clarifai_vector_db = Clarifai.from_texts(user_id=USER_ID, app_id=APP_ID, texts=texts, pat=CLARIFAI_PAT, number_of_docs=NUMBER_OF_DOCS, metadatas = metadatas)
 ```
-For more details, the docs on the Clarifai vector store provide a [detailed walkthrough](/docs/integrations/text_embedding/clarifai.html).
+For more details, the docs on the Clarifai vector store provide a [detailed walkthrough](/docs/integrations/vectorstores/clarifai.ipynb).
--- a/docs/extras/integrations/providers/doctran.mdx
+++ b/docs/extras/integrations/providers/doctran.mdx
@@ -0,0 +1,37 @@
+# Doctran
+
+>[Doctran](https://github.com/psychic-api/doctran) is a python package. It uses LLMs and open source 
+> NLP libraries to transform raw text into clean, structured, information-dense documents 
+> that are optimized for vector space retrieval. You can think of `Doctran` as a black box where 
+> messy strings go in and nice, clean, labelled strings come out.
+
+
+## Installation and Setup
+
+```bash
+pip install doctran
+```
+
+## Document Transformers
+
+### Document Interrogator
+
+See a [usage example for DoctranQATransformer](/docs/integrations/document_transformers/doctran_interrogate_document).
+
+```python
+from langchain.document_loaders import DoctranQATransformer
+```
+### Property Extractor
+
+See a [usage example for DoctranPropertyExtractor](/docs/integrations/document_transformers/doctran_extract_properties).
+
+```python
+from langchain.document_loaders import DoctranPropertyExtractor
+```
+### Document Translator
+
+See a [usage example for DoctranTextTranslator](/docs/integrations/document_transformers/doctran_translate_document).
+
+```python
+from langchain.document_loaders import DoctranTextTranslator
+```
--- a/docs/extras/integrations/providers/html2text.mdx
+++ b/docs/extras/integrations/providers/html2text.mdx
@@ -0,0 +1,19 @@
+# HTML to text
+
+>[html2text](https://github.com/Alir3z4/html2text/) is a Python package that converts a page of `HTML` into clean, easy-to-read plain `ASCII text`. 
+
+The ASCII also happens to be a valid `Markdown` (a text-to-HTML format).
+
+## Installation and Setup
+
+```bash
+pip install html2text
+```
+
+## Document Transformer
+
+See a [usage example](/docs/integrations/document_transformers/html2text).
+
+```python
+from langchain.document_loaders import Html2TextTransformer
+```
--- a/docs/extras/integrations/providers/jina.mdx
+++ b/docs/extras/integrations/providers/jina.mdx
@@ -62,7 +62,7 @@ Deploy on Jina AI Cloud with `lc-serve deploy jcloud app`. Once deployed, we can
 ```bash
 curl -X 'POST' 'https://<your-app>.wolf.jina.ai/ask' \
 -d '{
-  "input": "Your Quesion here?",
+  "input": "Your Question here?",
  "envs": {
     "OPENAI_API_KEY": "sk-***"
  }
--- a/docs/extras/integrations/providers/motorhead.mdx
+++ b/docs/extras/integrations/providers/motorhead.mdx
@@ -0,0 +1,16 @@
+# Motörhead
+
+>[Motörhead](https://github.com/getmetal/motorhead) is a memory server implemented in Rust. It automatically handles incremental summarization in the background and allows for stateless applications.
+
+## Installation and Setup
+
+See instructions at [Motörhead](https://github.com/getmetal/motorhead) for running the server locally.
+
+
+## Memory
+
+See a [usage example](/docs/integrations/memory/motorhead_memory).
+
+```python
+from langchain.memory import MotorheadMemory
+```
--- a/docs/extras/integrations/providers/nuclia.mdx
+++ b/docs/extras/integrations/providers/nuclia.mdx
@@ -0,0 +1,37 @@
+# Nuclia
+
+>[Nuclia](https://nuclia.com) automatically indexes your unstructured data from any internal
+> and external source, providing optimized search results and generative answers. 
+> It can handle video and audio transcription, image content extraction, and document parsing.
+
+>`Nuclia Understanding API` document transformer splits text into paragraphs and sentences, 
+> identifies entities, provides a summary of the text and generates embeddings for all the sentences.
+
+
+## Installation and Setup
+
+We need to install the `nucliadb-protos` package to use the `Nuclia Understanding API`.
+```bash
+pip install nucliadb-protos
+```
+
+To use the `Nuclia Understanding API`, we need to have a `Nuclia account`. 
+We can create one for free at [https://nuclia.cloud](https://nuclia.cloud), 
+and then [create a NUA key](https://docs.nuclia.dev/docs/docs/using/understanding/intro).
+
+To use the Nuclia document transformer, we need to instantiate a `NucliaUnderstandingAPI`
+tool with `enable_ml` set to `True`:
+
+```python
+from langchain.tools.nuclia import NucliaUnderstandingAPI
+
+nua = NucliaUnderstandingAPI(enable_ml=True)
+```
+
+## Document Transformer
+
+See a [usage example](/docs/integrations/document_transformers/nuclia_transformer).
+
+```python
+from langchain.document_transformers.nuclia_text_transform import NucliaTextTransformer
+```
--- a/docs/extras/integrations/providers/predibase.md
+++ b/docs/extras/integrations/providers/predibase.md
@@ -3,7 +3,7 @@
 Learn how to use LangChain with models on Predibase. 

 ## Setup
- Create a [Predibase](hhttps://predibase.com/) account and [API key](https://docs.predibase.com/sdk-guide/intro).
+- Create a [Predibase](https://predibase.com/) account and [API key](https://docs.predibase.com/sdk-guide/intro).
 - Install the Predibase Python client with `pip install predibase`
 - Use your API key to authenticate

--- a/docs/extras/integrations/providers/redis.mdx
+++ b/docs/extras/integrations/providers/redis.mdx
@@ -1,7 +1,10 @@
 # Redis

->[Redis](https://redis.com) is an open-source key-value store that can be used as a cache, 
-> message broker, database, vector database and more.
+>[Redis (Remote Dictionary Server)](https://en.wikipedia.org/wiki/Redis) is an open-source in-memory storage, 
+> used as a distributed, in-memory key–value database, cache and message broker, with optional durability. 
+> Because it holds all data in memory and because of its design, `Redis` offers low-latency reads and writes, 
+> making it particularly suitable for use cases that require a cache. Redis is the most popular NoSQL database, 
+> and one of the most popular databases overall.

 This page covers how to use the [Redis](https://redis.com) ecosystem within LangChain.
 It is broken into two parts: installation and setup, and then references to specific Redis wrappers.
@@ -111,7 +114,7 @@ Redis can be used to persist LLM conversations.

 #### Vector Store Retriever Memory

-For a more detailed walkthrough of the `VectorStoreRetrieverMemory` wrapper, see [this notebook](/docs/modules/memory/integrations/vectorstore_retriever_memory.html).
+For a more detailed walkthrough of the `VectorStoreRetrieverMemory` wrapper, see [this notebook](/docs/modules/memory/types/vectorstore_retriever_memory.html).

 #### Chat Message History Memory
 For a detailed example of Redis to cache conversation message history, see [this notebook](/docs/integrations/memory/redis_chat_message_history.html).
--- a/docs/extras/integrations/providers/scann.mdx
+++ b/docs/extras/integrations/providers/scann.mdx
@@ -1,29 +0,0 @@
-# ScaNN
-
->[Google ScaNN](https://github.com/google-research/google-research/tree/master/scann)
-> (Scalable Nearest Neighbors) is a python package.
-> 
->`ScaNN` is a method for efficient vector similarity search at scale.
-
->ScaNN includes search space pruning and quantization for Maximum Inner
-> Product Search and also supports other distance functions such as
-> Euclidean distance. The implementation is optimized for x86 processors
-> with AVX2 support. See its [Google Research github](https://github.com/google-research/google-research/tree/master/scann)
-> for more details.
-
-## Installation and Setup
-
-We need to install `scann` python package.
-
-```bash
-pip install scann
-```
-
-## Vector Store
-
-See a [usage example](/docs/integrations/vectorstores/scann).
-
-```python
-from langchain.vectorstores import ScaNN
-```
-
--- a/docs/extras/integrations/providers/vectara/index.mdx
+++ b/docs/extras/integrations/providers/vectara/index.mdx
@@ -5,7 +5,7 @@

 **Vectara Overview:**
 - `Vectara` is developer-first API platform for building GenAI applications
- To use Vectara - first [sign up](https://console.vectara.com/signup) and create an account. Then create a corpus and an API key for indexing and searching.
+- To use Vectara - first [sign up](https://vectara.com/integrations/langchain) and create an account. Then create a corpus and an API key for indexing and searching.
 - You can use Vectara's [indexing API](https://docs.vectara.com/docs/indexing-apis/indexing) to add documents into Vectara's index
 - You can use Vectara's [Search API](https://docs.vectara.com/docs/search-apis/search) to query Vectara's index (which also supports Hybrid search implicitly).
 - You can use Vectara's integration with LangChain as a Vector store or using the Retriever abstraction.
@@ -13,7 +13,7 @@
 ## Installation and Setup

 To use `Vectara` with LangChain no special installation steps are required. 
-To get started, follow our [quickstart](https://docs.vectara.com/docs/quickstart) guide to create an account, a corpus and an API key. 
+To get started, [sign up](https://vectara.com/integrations/langchain) and follow our [quickstart](https://docs.vectara.com/docs/quickstart) guide to create a corpus and an API key. 
 Once you have these, you can provide them as arguments to the Vectara vectorstore, or you can set them as environment variables.

 - export `VECTARA_CUSTOMER_ID`="your_customer_id"
--- a/docs/extras/integrations/providers/vectara/vectara_chat.ipynb
+++ b/docs/extras/integrations/providers/vectara/vectara_chat.ipynb
@@ -7,7 +7,7 @@
   "source": [
    "# Chat Over Documents with Vectara\n",
    "\n",
-    "This notebook is based on the [chat_vector_db](https://github.com/hwchase17/langchain/blob/master/docs/modules/chains/index_examples/chat_vector_db.html) notebook, but using Vectara as the vector database."
+    "This notebook is based on the [chat_vector_db](https://github.com/langchain-ai/langchain/blob/master/docs/modules/chains/index_examples/chat_vector_db.html) notebook, but using Vectara as the vector database."
   ]
  },
  {
--- a/docs/extras/integrations/providers/vectara/vectara_text_generation.ipynb
+++ b/docs/extras/integrations/providers/vectara/vectara_text_generation.ipynb
@@ -6,7 +6,7 @@
   "source": [
    "# Vectara Text Generation\n",
    "\n",
-    "This notebook is based on [text generation](https://github.com/hwchase17/langchain/blob/master/docs/modules/chains/index_examples/vector_db_text_generation.ipynb) notebook and adapted to Vectara."
+    "This notebook is based on [text generation](https://github.com/langchain-ai/langchain/blob/master/docs/modules/chains/index_examples/vector_db_text_generation.ipynb) notebook and adapted to Vectara."
   ]
  },
  {
--- a/docs/extras/integrations/retrievers/sec_filings.ipynb
+++ b/docs/extras/integrations/retrievers/sec_filings.ipynb
@@ -8,7 +8,7 @@
    "SEC filings data\n",
    "=\n",
    "\n",
-    "SEC filings data powered by [Kay.ai](https://kay.ai) and [Cybersyn](https://www.cybersyn.com/).\n",
+    "SEC filings data powered by [Kay.ai](https://kay.ai) and [Cybersyn](https://www.cybersyn.com/) via [Snowflake Marketplace](https://app.snowflake.com/marketplace/providers/GZTSZAS2KCS/Cybersyn%2C%20Inc).\n",
    "\n",
    ">The SEC filing is a financial statement or other formal document submitted to the U.S. Securities and Exchange Commission (SEC). Public companies, certain insiders, and broker-dealers are required to make regular SEC filings. Investors and financial professionals rely on these filings for information about companies they are evaluating for investment purposes."
   ]
@@ -157,7 +157,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.18"
+   "version": "3.9.1"
  }
 },
 "nbformat": 4,
--- a/docs/extras/integrations/retrievers/tavily.ipynb
+++ b/docs/extras/integrations/retrievers/tavily.ipynb
@@ -0,0 +1,79 @@
+{
+ "cells": [
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Tavily Search API\n",
+    "\n",
+    "[Tavily's Search API](https://tavily.com) is a search engine built specifically for AI agents (LLMs), delivering real-time, accurate, and factual results at speed.\n",
+    "\n",
+    "## Usage\n",
+    "\n",
+    "For a full list of allowed arguments, see [the official documentation](https://app.tavily.com/documentation/python). You can also pass any param to the SDK via a `kwargs` dictionary."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# %pip install tavily-python"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "[Document(page_content='Nintendo Designer (s) Hidemaro Fujibayashi (director) Eiji Aonuma (producer/group manager) Release date (s) United States of America: • March 3, 2017 Japan: • March 3, 2017 Australia / New Zealand: • March 2, 2017 Belgium: • March 3, 2017 Hong Kong: • Feburary 1, 2018 South Korea: • February 1, 2018 The UK / Ireland: • March 3, 2017 Content ratings', metadata={'title': 'The Legend of Zelda: Breath of the Wild - Zelda Wiki', 'source': 'https://zelda.fandom.com/wiki/The_Legend_of_Zelda:_Breath_of_the_Wild', 'score': 0.96994, 'images': None}),\n",
+       " Document(page_content='02/01/23 Nintendo Switch Online member exclusive: Save on two digital games Read more 09/13/22 Out of the Shadows … the Legend of Zelda: Tears of the Kingdom Launches for Nintendo Switch on May...', metadata={'title': 'The Legend of Zelda™: Breath of the Wild - Nintendo', 'source': 'https://www.nintendo.com/store/products/the-legend-of-zelda-breath-of-the-wild-switch/', 'score': 0.94346, 'images': None}),\n",
+       " Document(page_content='Now we finally have a concrete release date of May 12, 2023. The date was announced alongside this brief (and mysterious) new trailer that also confirmed its title: The Legend of Zelda: Tears...', metadata={'title': 'The Legend of Zelda: Tears of the Kingdom: Release Date, Gameplay ... - IGN', 'source': 'https://www.ign.com/articles/the-legend-of-zelda-breath-of-the-wild-2-release-date-gameplay-news-rumors', 'score': 0.94145, 'images': None}),\n",
+       " Document(page_content='It was eventually released on March 3, 2017, as a launch game for the Switch and the final Nintendo game for the Wii U. It received widespread acclaim and won numerous Game of the Year accolades. Critics praised its open-ended gameplay, open-world design, and attention to detail, though some criticized its technical performance.', metadata={'title': 'The Legend of Zelda: Breath of the Wild - Wikipedia', 'source': 'https://en.wikipedia.org/wiki/The_Legend_of_Zelda:_Breath_of_the_Wild', 'score': 0.92102, 'images': None})]"
+      ]
+     },
+     "execution_count": 1,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "import os\n",
+    "from langchain.retrievers.tavily_search_api import TavilySearchAPIRetriever\n",
+    "\n",
+    "os.environ[\"TAVILY_API_KEY\"] = \"YOUR_API_KEY\"\n",
+    "\n",
+    "retriever = TavilySearchAPIRetriever(k=4)\n",
+    "\n",
+    "retriever.invoke(\"what year was breath of the wild released?\")"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": ".venv",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.10.5"
+  },
+  "orig_nbformat": 4
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}
--- a/docs/extras/integrations/vectorstores/clarifai.ipynb
+++ b/docs/extras/integrations/vectorstores/clarifai.ipynb
@@ -10,7 +10,7 @@
    "\n",
    ">[Clarifai](https://www.clarifai.com/) is an AI Platform that provides the full AI lifecycle ranging from data exploration, data labeling, model training, evaluation, and inference. A Clarifai application can be used as a vector database after uploading inputs. \n",
    "\n",
-    "This notebook shows how to use functionality related to the `Clarifai` vector database.\n",
+    "This notebook shows how to use functionality related to the `Clarifai` vector database. Examples are shown to demonstrate text semantic search capabilities. Clarifai also supports semantic search with images, video frames, and localized search (see [Rank](https://docs.clarifai.com/api-guide/search/rank)) and attribute search (see [Filter](https://docs.clarifai.com/api-guide/search/filter)).\n",
    "\n",
    "To use Clarifai, you must have an account and a Personal Access Token (PAT) key. \n",
    "[Check here](https://clarifai.com/settings/security) to get or create a PAT."
@@ -55,7 +55,7 @@
   "metadata": {},
   "outputs": [
    {
-     "name": "stdin",
+     "name": "stdout",
     "output_type": "stream",
     "text": [
      " ········\n"
@@ -166,6 +166,8 @@
       " Document(page_content='I went to the movies yesterday', metadata={'text': 'I went to the movies yesterday', 'id': 3.0, 'source': 'book 1', 'category': ['books', 'modern']})]"
      ]
     },
+     "execution_count": null,
+     "metadata": {},
     "output_type": "execute_result"
    }
   ],
--- a/docs/extras/integrations/vectorstores/vespa.ipynb
+++ b/docs/extras/integrations/vectorstores/vespa.ipynb
@@ -0,0 +1,883 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "ce0f17b9",
+   "metadata": {},
+   "source": [
+    "# Vespa\n",
+    "\n",
+    ">[Vespa](https://vespa.ai/) is a fully featured search engine and vector database. It supports vector search (ANN), lexical search, and search in structured data, all in the same query.\n",
+    "\n",
+    "This notebook shows how to use `Vespa.ai` as a LangChain vector store.\n",
+    "\n",
+    "In order to create the vector store, we use\n",
+    "[pyvespa](https://pyvespa.readthedocs.io/en/latest/index.html) to create a\n",
+    "connection a `Vespa` service."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "7e6a11ab-38bd-4920-ba11-60cb2f075754",
+   "metadata": {
+    "tags": []
+   },
+   "outputs": [],
+   "source": [
+    "#!pip install pyvespa"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "Using the `pyvespa` package, you can either connect to a\n",
+    "[Vespa Cloud instance](https://pyvespa.readthedocs.io/en/latest/deploy-vespa-cloud.html)\n",
+    "or a local\n",
+    "[Docker instance](https://pyvespa.readthedocs.io/en/latest/deploy-docker.html).\n",
+    "Here, we will create a new Vespa application and deploy that using Docker.\n",
+    "\n",
+    "#### Creating a Vespa application\n",
+    "\n",
+    "First, we need to create an application package:"
+   ],
+   "metadata": {
+    "collapsed": false
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "from vespa.package import ApplicationPackage, Field, RankProfile\n",
+    "\n",
+    "app_package = ApplicationPackage(name=\"testapp\")\n",
+    "app_package.schema.add_fields(\n",
+    "    Field(name=\"text\", type=\"string\", indexing=[\"index\", \"summary\"], index=\"enable-bm25\"),\n",
+    "    Field(name=\"embedding\", type=\"tensor<float>(x[384])\",\n",
+    "          indexing=[\"attribute\", \"summary\"],\n",
+    "          attribute=[f\"distance-metric: angular\"]),\n",
+    ")\n",
+    "app_package.schema.add_rank_profile(\n",
+    "    RankProfile(name=\"default\",\n",
+    "                first_phase=\"closeness(field, embedding)\",\n",
+    "                inputs=[(\"query(query_embedding)\", \"tensor<float>(x[384])\")]\n",
+    "                )\n",
+    ")"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "This sets up a Vespa application with a schema for each document that contains\n",
+    "two fields: `text` for holding the document text and `embedding` for holding\n",
+    "the embedding vector. The `text` field is set up to use a BM25 index for\n",
+    "efficient text retrieval, and we'll see how to use this and hybrid search a\n",
+    "bit later.\n",
+    "\n",
+    "The `embedding` field is set up with a vector of length 384 to hold the\n",
+    "embedding representation of the text. See\n",
+    "[Vespa's Tensor Guide](https://docs.vespa.ai/en/tensor-user-guide.html)\n",
+    "for more on tensors in Vespa.\n",
+    "\n",
+    "Lastly, we add a [rank profile](https://docs.vespa.ai/en/ranking.html) to\n",
+    "instruct Vespa how to order documents. Here we set this up with a\n",
+    "[nearest neighbor search](https://docs.vespa.ai/en/nearest-neighbor-search.html).\n",
+    "\n",
+    "Now we can deploy this application locally:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "id": "c10dd962",
+   "metadata": {
+    "tags": []
+   },
+   "outputs": [],
+   "source": [
+    "from vespa.deployment import VespaDocker\n",
+    "\n",
+    "vespa_docker = VespaDocker()\n",
+    "vespa_app = vespa_docker.deploy(application_package=app_package)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "3df4ce53",
+   "metadata": {},
+   "source": [
+    "This deploys and creates a connection to a `Vespa` service. In case you\n",
+    "already have a Vespa application running, for instance in the cloud,\n",
+    "please refer to the PyVespa application for how to connect.\n",
+    "\n",
+    "#### Creating a Vespa vector store\n",
+    "\n",
+    "Now, let's load some documents:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "from langchain.document_loaders import TextLoader\n",
+    "from langchain.text_splitter import CharacterTextSplitter\n",
+    "\n",
+    "loader = TextLoader(\"../../modules/state_of_the_union.txt\")\n",
+    "documents = loader.load()\n",
+    "text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0)\n",
+    "docs = text_splitter.split_documents(documents)\n",
+    "\n",
+    "from langchain.embeddings.sentence_transformer import SentenceTransformerEmbeddings\n",
+    "\n",
+    "embedding_function = SentenceTransformerEmbeddings(model_name=\"all-MiniLM-L6-v2\")"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "Here, we also set up local sentence embedder to transform the text to embedding\n",
+    "vectors. One could also use OpenAI embeddings, but the vector length needs to\n",
+    "be updated to `1536` to reflect the larger size of that embedding.\n",
+    "\n",
+    "To feed these to Vespa, we need to configure how the vector store should map to\n",
+    "fields in the Vespa application. Then we create the vector store directly from\n",
+    "this set of documents:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "vespa_config = dict(\n",
+    "    page_content_field=\"text\",\n",
+    "    embedding_field=\"embedding\",\n",
+    "    input_field=\"query_embedding\"\n",
+    ")\n",
+    "\n",
+    "from langchain.vectorstores import VespaStore\n",
+    "\n",
+    "db = VespaStore.from_documents(docs, embedding_function, app=vespa_app, **vespa_config)"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "This creates a Vespa vector store and feeds that set of documents to Vespa.\n",
+    "The vector store takes care of calling the embedding function for each document\n",
+    "and inserts them into the database.\n",
+    "\n",
+    "We can now query the vector store:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "7ccca1f4",
+   "metadata": {
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   },
+   "outputs": [],
+   "source": [
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "results = db.similarity_search(query)\n",
+    "\n",
+    "print(results[0].page_content)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "1e7e34e1",
+   "metadata": {
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   },
+   "source": [
+    "This will use the embedding function given above to create a representation\n",
+    "for the query and use that to search Vespa. Note that this will use the\n",
+    "`default` ranking function, which we set up in the application package\n",
+    "above. You can use the `ranking` argument to `similarity_search` to\n",
+    "specify which ranking function to use.\n",
+    "\n",
+    "Please refer to the [pyvespa documentation](https://pyvespa.readthedocs.io/en/latest/getting-started-pyvespa.html#Query)\n",
+    "for more information.\n",
+    "\n",
+    "This covers the basic usage of the Vespa store in LangChain.\n",
+    "Now you can return the results and continue using these in LangChain.\n",
+    "\n",
+    "#### Updating documents\n",
+    "\n",
+    "An alternative to calling `from_documents`, you can create the vector\n",
+    "store directly and call `add_texts` from that. This can also be used to update\n",
+    "documents:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "results = db.similarity_search(query)\n",
+    "result = results[0]\n",
+    "\n",
+    "result.page_content = \"UPDATED: \" + result.page_content\n",
+    "db.add_texts([result.page_content], [result.metadata], result.metadata[\"id\"])\n",
+    "\n",
+    "results = db.similarity_search(query)\n",
+    "print(results[0].page_content)"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "However, the `pyvespa` library contains methods to manipulate\n",
+    "content on Vespa which you can use directly.\n",
+    "\n",
+    "#### Deleting documents\n",
+    "\n",
+    "You can delete documents using the `delete` function:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "result = db.similarity_search(query)\n",
+    "# docs[0].metadata[\"id\"] == \"id:testapp:testapp::32\"\n",
+    "\n",
+    "db.delete([\"32\"])\n",
+    "result = db.similarity_search(query)\n",
+    "# docs[0].metadata[\"id\"] != \"id:testapp:testapp::32\""
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "Again, the `pyvespa` connection contains methods to delete documents as well.\n",
+    "\n",
+    "### Returning with scores\n",
+    "\n",
+    "The `similarity_search` method only returns the documents in order of\n",
+    "relevancy. To retrieve the actual scores:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "results = db.similarity_search_with_score(query)\n",
+    "result = results[0]\n",
+    "# result[1] ~= 0.463"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "This is a result of using the `\"all-MiniLM-L6-v2\"` embedding model using the\n",
+    "cosine distance function (as given by the argument `angular` in the\n",
+    "application function).\n",
+    "\n",
+    "Different embedding functions need different distance functions, and Vespa\n",
+    "needs to know which distance function to use when orderings documents.\n",
+    "Please refer to the\n",
+    "[documentation on distance functions](https://docs.vespa.ai/en/reference/schema-reference.html#distance-metric)\n",
+    "for more information.\n",
+    "\n",
+    "### As retriever\n",
+    "\n",
+    "To use this vector store as a\n",
+    "[LangChain retriever](https://python.langchain.com/docs/modules/data_connection/retrievers/)\n",
+    "simply call the `as_retriever` function, which is a standard vector store\n",
+    "method:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "db = VespaStore.from_documents(docs, embedding_function, app=vespa_app, **vespa_config)\n",
+    "retriever = db.as_retriever()\n",
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "results = retriever.get_relevant_documents(query)\n",
+    "\n",
+    "# results[0].metadata[\"id\"] == \"id:testapp:testapp::32\""
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "This allows for more general, unstructured, retrieval from the vector store.\n",
+    "\n",
+    "### Metadata\n",
+    "\n",
+    "In the example so far, we've only used the text and the embedding for that\n",
+    "text. Documents usually contain additional information, which in LangChain\n",
+    "is referred to as metadata.\n",
+    "\n",
+    "Vespa can contain many fields with different types by adding them to the application\n",
+    "package:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "app_package.schema.add_fields(\n",
+    "    # ...\n",
+    "    Field(name=\"date\", type=\"string\", indexing=[\"attribute\", \"summary\"]),\n",
+    "    Field(name=\"rating\", type=\"int\", indexing=[\"attribute\", \"summary\"]),\n",
+    "    Field(name=\"author\", type=\"string\", indexing=[\"attribute\", \"summary\"]),\n",
+    "    # ...\n",
+    ")\n",
+    "vespa_app = vespa_docker.deploy(application_package=app_package)"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "We can add some metadata fields in the documents:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "# Add metadata\n",
+    "for i, doc in enumerate(docs):\n",
+    "    doc.metadata[\"date\"] = f\"2023-{(i % 12)+1}-{(i % 28)+1}\"\n",
+    "    doc.metadata[\"rating\"] = range(1, 6)[i % 5]\n",
+    "    doc.metadata[\"author\"] = [\"Joe Biden\", \"Unknown\"][min(i, 1)]"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "And let the Vespa vector store know about these fields:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "vespa_config.update(dict(metadata_fields=[\"date\", \"rating\", \"author\"]))"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "Now, when searching for these documents, these fields will be returned.\n",
+    "Also, these fields can be filtered on:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "db = VespaStore.from_documents(docs, embedding_function, app=vespa_app, **vespa_config)\n",
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "results = db.similarity_search(query, filter=\"rating > 3\")\n",
+    "# results[0].metadata[\"id\"] == \"id:testapp:testapp::34\"\n",
+    "# results[0].metadata[\"author\"] == \"Unknown\""
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "### Custom query\n",
+    "\n",
+    "If the default behavior of the similarity search does not fit your\n",
+    "requirements, you can always provide your own query. Thus, you don't\n",
+    "need to provide all of the configuration to the vector store, but\n",
+    "rather just write this yourself.\n",
+    "\n",
+    "First, let's add a BM25 ranking function to our application:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "from vespa.package import FieldSet\n",
+    "\n",
+    "app_package.schema.add_field_set(FieldSet(name=\"default\", fields=[\"text\"]))\n",
+    "app_package.schema.add_rank_profile(RankProfile(name=\"bm25\", first_phase=\"bm25(text)\"))\n",
+    "vespa_app = vespa_docker.deploy(application_package=app_package)\n",
+    "db = VespaStore.from_documents(docs, embedding_function, app=vespa_app, **vespa_config)"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "Then, to perform a regular text search based on BM25:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "custom_query = {\n",
+    "    \"yql\": f\"select * from sources * where userQuery()\",\n",
+    "    \"query\": query,\n",
+    "    \"type\": \"weakAnd\",\n",
+    "    \"ranking\": \"bm25\",\n",
+    "    \"hits\": 4\n",
+    "}\n",
+    "results  = db.similarity_search_with_score(query, custom_query=custom_query)\n",
+    "# results[0][0].metadata[\"id\"] == \"id:testapp:testapp::32\"\n",
+    "# results[0][1] ~= 14.384"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "All of the powerful search and query capabilities of Vespa can be used\n",
+    "by using a custom query. Please refer to the Vespa documentation on it's\n",
+    "[Query API](https://docs.vespa.ai/en/query-api.html) for more details.\n",
+    "\n",
+    "### Hybrid search\n",
+    "\n",
+    "Hybrid search means using both a classic term-based search such as\n",
+    "BM25 and a vector search and combining the results. We need to create\n",
+    "a new rank profile for hybrid search on Vespa:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "app_package.schema.add_rank_profile(\n",
+    "    RankProfile(name=\"hybrid\",\n",
+    "                first_phase=\"log(bm25(text)) + 0.5 * closeness(field, embedding)\",\n",
+    "                inputs=[(\"query(query_embedding)\", \"tensor<float>(x[384])\")]\n",
+    "                )\n",
+    ")\n",
+    "vespa_app = vespa_docker.deploy(application_package=app_package)\n",
+    "db = VespaStore.from_documents(docs, embedding_function, app=vespa_app, **vespa_config)"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "Here, we score each document as a combination of it's BM25 score and its\n",
+    "distance score. We can query using a custom query:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "query_embedding = embedding_function.embed_query(query)\n",
+    "nearest_neighbor_expression = \"{targetHits: 4}nearestNeighbor(embedding, query_embedding)\"\n",
+    "custom_query = {\n",
+    "    \"yql\": f\"select * from sources * where {nearest_neighbor_expression} and userQuery()\",\n",
+    "    \"query\": query,\n",
+    "    \"type\": \"weakAnd\",\n",
+    "    \"input.query(query_embedding)\": query_embedding,\n",
+    "    \"ranking\": \"hybrid\",\n",
+    "    \"hits\": 4\n",
+    "}\n",
+    "results = db.similarity_search_with_score(query, custom_query=custom_query)\n",
+    "# results[0][0].metadata[\"id\"], \"id:testapp:testapp::32\")\n",
+    "# results[0][1] ~= 2.897"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "### Native embedders in Vespa\n",
+    "\n",
+    "Up until this point we've used an embedding function in Python to provide\n",
+    "embeddings for the texts. Vespa supports embedding function natively, so\n",
+    "you can defer this calculation in to Vespa. One benefit is the ability to use\n",
+    "GPUs when embedding documents if you have a large collections.\n",
+    "\n",
+    "Please refer to [Vespa embeddings](https://docs.vespa.ai/en/embedding.html)\n",
+    "for more information.\n",
+    "\n",
+    "First, we need to modify our application package:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "from vespa.package import Component, Parameter\n",
+    "\n",
+    "app_package.components = [\n",
+    "    Component(id=\"hf-embedder\", type=\"hugging-face-embedder\",\n",
+    "        parameters=[\n",
+    "            Parameter(\"transformer-model\", {\"path\": \"...\"}),\n",
+    "            Parameter(\"tokenizer-model\", {\"url\": \"...\"}),\n",
+    "        ]\n",
+    "    )\n",
+    "]\n",
+    "Field(name=\"hfembedding\", type=\"tensor<float>(x[384])\",\n",
+    "      is_document_field=False,\n",
+    "      indexing=[\"input text\", \"embed hf-embedder\", \"attribute\", \"summary\"],\n",
+    "      attribute=[f\"distance-metric: angular\"],\n",
+    "      )\n",
+    "app_package.schema.add_rank_profile(\n",
+    "    RankProfile(name=\"hf_similarity\",\n",
+    "                first_phase=\"closeness(field, hfembedding)\",\n",
+    "                inputs=[(\"query(query_embedding)\", \"tensor<float>(x[384])\")]\n",
+    "                )\n",
+    ")"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "Please refer to the embeddings documentation on adding embedder models\n",
+    "and tokenizers to the application. Note that the `hfembedding` field\n",
+    "includes instructions for embedding using the `hf-embedder`.\n",
+    "\n",
+    "Now we can query with a custom query:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "nearest_neighbor_expression = \"{targetHits: 4}nearestNeighbor(internalembedding, query_embedding)\"\n",
+    "custom_query = {\n",
+    "    \"yql\": f\"select * from sources * where {nearest_neighbor_expression}\",\n",
+    "    \"input.query(query_embedding)\": f\"embed(hf-embedder, \\\"{query}\\\")\",\n",
+    "    \"ranking\": \"internal_similarity\",\n",
+    "    \"hits\": 4\n",
+    "}\n",
+    "results = db.similarity_search_with_score(query, custom_query=custom_query)\n",
+    "# results[0][0].metadata[\"id\"], \"id:testapp:testapp::32\")\n",
+    "# results[0][1] ~= 0.630"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "Note that the query here includes an `embed` instruction to embed the query\n",
+    "using the same model as for the documents.\n",
+    "\n",
+    "### Approximate nearest neighbor\n",
+    "\n",
+    "In all of the above examples, we've used exact nearest neighbor to\n",
+    "find results. However, for large collections of documents this is\n",
+    "not feasible as one has to scan through all documents to find the\n",
+    "best matches. To avoid this, we can use\n",
+    "[approximate nearest neighbors](https://docs.vespa.ai/en/approximate-nn-hnsw.html).\n",
+    "\n",
+    "First, we can change the embedding field to create a HNSW index:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "from vespa.package import HNSW\n",
+    "\n",
+    "app_package.schema.add_fields(\n",
+    "    Field(name=\"embedding\", type=\"tensor<float>(x[384])\",\n",
+    "          indexing=[\"attribute\", \"summary\", \"index\"],\n",
+    "          ann=HNSW(distance_metric=\"angular\", max_links_per_node=16, neighbors_to_explore_at_insert=200)\n",
+    "          )\n",
+    ")\n"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "This creates a HNSW index on the embedding data which allows for efficient\n",
+    "searching. With this set, we can easily search using ANN by setting\n",
+    "the `approximate` argument to `True`:"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "results = db.similarity_search(query, approximate=True)\n",
+    "# results[0][0].metadata[\"id\"], \"id:testapp:testapp::32\")"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%%\n"
+    }
+   }
+  },
+  {
+   "cell_type": "markdown",
+   "source": [
+    "This covers most of the functionality in the Vespa vector store in LangChain.\n",
+    "\n"
+   ],
+   "metadata": {
+    "collapsed": false,
+    "pycharm": {
+     "name": "#%% md\n"
+    }
+   }
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.10.6"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/extras/modules/agents/agent_types/chat_conversation_agent.ipynb
+++ b/docs/extras/modules/agents/agent_types/chat_conversation_agent.ipynb
@@ -38,7 +38,7 @@
    "search = SerpAPIWrapper()\n",
    "tools = [\n",
    "    Tool(\n",
-    "        name = \"Current Search\",\n",
+    "        name=\"Current Search\",\n",
    "        func=search.run,\n",
    "        description=\"useful for when you need to answer questions about current events or the current state of the world\"\n",
    "    ),\n",
--- a/docs/extras/modules/agents/agent_types/openai_functions_agent.ipynb
+++ b/docs/extras/modules/agents/agent_types/openai_functions_agent.ipynb
@@ -63,7 +63,7 @@
    "db_chain = SQLDatabaseChain.from_llm(llm, db, verbose=True)\n",
    "tools = [\n",
    "    Tool(\n",
-    "        name = \"Search\",\n",
+    "        name=\"Search\",\n",
    "        func=search.run,\n",
    "        description=\"useful for when you need to answer questions about current events. You should ask targeted questions\"\n",
    "    ),\n",
--- a/docs/extras/modules/agents/agent_types/react_docstore.ipynb
+++ b/docs/extras/modules/agents/agent_types/react_docstore.ipynb
@@ -17,7 +17,8 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.llms import OpenAI\from langchain.utilities import Wikipedia\n",
+    "from langchain.llms import OpenAI\n",
+    "from langchain.docstore import Wikipedia\n",
    "from langchain.agents import initialize_agent, Tool\n",
    "from langchain.agents import AgentType\n",
    "from langchain.agents.react.base import DocstoreExplorer\n",
--- a/docs/extras/modules/agents/how_to/agent_structured.ipynb
+++ b/docs/extras/modules/agents/how_to/agent_structured.ipynb
@@ -141,11 +141,11 @@
    "When the `Response` function is called by OpenAI, we want to use that as a signal to return to the user.\n",
    "When any other function is called by OpenAI, we treat that as a tool invocation.\n",
    "\n",
-    "Therefor, our parsing logic has the following blocks:\n",
+    "Therefore, our parsing logic has the following blocks:\n",
    "\n",
-    "- If no function is called, assume that we should use the response to respond to the user, and therefor return `AgentFinish`\n",
-    "- If the `Response` function is called, respond to the user with the inputs to that function (our structured output), and therefor return `AgentFinish`\n",
-    "- If any other function is called, treat that as a tool invocation, and therefor return `AgentActionMessageLog`\n",
+    "- If no function is called, assume that we should use the response to respond to the user, and therefore return `AgentFinish`\n",
+    "- If the `Response` function is called, respond to the user with the inputs to that function (our structured output), and therefore return `AgentFinish`\n",
+    "- If any other function is called, treat that as a tool invocation, and therefore return `AgentActionMessageLog`\n",
    "\n",
    "Note that we are using `AgentActionMessageLog` rather than `AgentAction` because it lets us attach a log of messages that we can use in the future to pass back into the agent prompt."
   ]
--- a/docs/extras/modules/data_connection/document_transformers/text_splitters/HTML_header_metadata.ipynb
+++ b/docs/extras/modules/data_connection/document_transformers/text_splitters/HTML_header_metadata.ipynb
@@ -0,0 +1,241 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "c95fcd15cd52c944",
+   "metadata": {
+    "collapsed": false,
+    "jupyter": {
+     "outputs_hidden": false
+    }
+   },
+   "source": [
+    "# HTMLHeaderTextSplitter\n",
+    "## Description and motivation\n",
+    "Similar in concept to the <a href=\"https://python.langchain.com/docs/modules/data_connection/document_transformers/text_splitters/markdown_header_metadata\">`MarkdownHeaderTextSplitter`</a>, the `HTMLHeaderTextSplitter` is a \"structure-aware\" chunker that splits text at the element level and adds metadata for each header \"relevant\" to any given chunk. It can return chunks element by element or combine elements with the same metadata, with the objectives of (a) keeping related text grouped (more or less) semantically and (b) preserving context-rich information encoded in document structures. It can be used with other text splitters as part of a chunking pipeline.\n",
+    "\n",
+    "## Usage examples\n",
+    "#### 1) With an HTML string:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "id": "initial_id",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-10-02T18:57:49.208965400Z",
+     "start_time": "2023-10-02T18:57:48.899756Z"
+    },
+    "collapsed": false,
+    "jupyter": {
+     "outputs_hidden": false
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "[Document(page_content='Foo'),\n",
+       " Document(page_content='Some intro text about Foo.  \\nBar main section Bar subsection 1 Bar subsection 2', metadata={'Header 1': 'Foo'}),\n",
+       " Document(page_content='Some intro text about Bar.', metadata={'Header 1': 'Foo', 'Header 2': 'Bar main section'}),\n",
+       " Document(page_content='Some text about the first subtopic of Bar.', metadata={'Header 1': 'Foo', 'Header 2': 'Bar main section', 'Header 3': 'Bar subsection 1'}),\n",
+       " Document(page_content='Some text about the second subtopic of Bar.', metadata={'Header 1': 'Foo', 'Header 2': 'Bar main section', 'Header 3': 'Bar subsection 2'}),\n",
+       " Document(page_content='Baz', metadata={'Header 1': 'Foo'}),\n",
+       " Document(page_content='Some text about Baz', metadata={'Header 1': 'Foo', 'Header 2': 'Baz'}),\n",
+       " Document(page_content='Some concluding text about Foo', metadata={'Header 1': 'Foo'})]"
+      ]
+     },
+     "execution_count": 1,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "from langchain.text_splitter import HTMLHeaderTextSplitter\n",
+    "\n",
+    "html_string =\"\"\"\n",
+    "<!DOCTYPE html>\n",
+    "<html>\n",
+    "<body>\n",
+    "    <div>\n",
+    "        <h1>Foo</h1>\n",
+    "        <p>Some intro text about Foo.</p>\n",
+    "        <div>\n",
+    "            <h2>Bar main section</h2>\n",
+    "            <p>Some intro text about Bar.</p>\n",
+    "            <h3>Bar subsection 1</h3>\n",
+    "            <p>Some text about the first subtopic of Bar.</p>\n",
+    "            <h3>Bar subsection 2</h3>\n",
+    "            <p>Some text about the second subtopic of Bar.</p>\n",
+    "        </div>\n",
+    "        <div>\n",
+    "            <h2>Baz</h2>\n",
+    "            <p>Some text about Baz</p>\n",
+    "        </div>\n",
+    "        <br>\n",
+    "        <p>Some concluding text about Foo</p>\n",
+    "    </div>\n",
+    "</body>\n",
+    "</html>\n",
+    "\"\"\"\n",
+    "\n",
+    "headers_to_split_on = [\n",
+    "    (\"h1\", \"Header 1\"),\n",
+    "    (\"h2\", \"Header 2\"),\n",
+    "    (\"h3\", \"Header 3\"),\n",
+    "]\n",
+    "\n",
+    "html_splitter = HTMLHeaderTextSplitter(headers_to_split_on=headers_to_split_on)\n",
+    "html_header_splits = html_splitter.split_text(html_string)\n",
+    "html_header_splits"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "e29b4aade2a0070c",
+   "metadata": {
+    "collapsed": false,
+    "jupyter": {
+     "outputs_hidden": false
+    }
+   },
+   "source": [
+    "#### 2) Pipelined to another splitter, with html loaded from a web URL:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "id": "6ada8ea093ea0475",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-10-02T18:57:51.016141300Z",
+     "start_time": "2023-10-02T18:57:50.647495400Z"
+    },
+    "collapsed": false,
+    "jupyter": {
+     "outputs_hidden": false
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "[Document(page_content='We see that Gödel first tried to reduce the consistency problem for analysis to that of arithmetic. This seemed to require a truth definition for arithmetic, which in turn led to paradoxes, such as the Liar paradox (“This sentence is false”) and Berry’s paradox (“The least number not defined by an expression consisting of just fourteen English words”). Gödel then noticed that such paradoxes would not necessarily arise if truth were replaced by provability. But this means that arithmetic truth', metadata={'Header 1': 'Kurt Gödel', 'Header 2': '2. Gödel’s Mathematical Work', 'Header 3': '2.2 The Incompleteness Theorems', 'Header 4': '2.2.1 The First Incompleteness Theorem'}),\n",
+       " Document(page_content='means that arithmetic truth and arithmetic provability are not co-extensive — whence the First Incompleteness Theorem.', metadata={'Header 1': 'Kurt Gödel', 'Header 2': '2. Gödel’s Mathematical Work', 'Header 3': '2.2 The Incompleteness Theorems', 'Header 4': '2.2.1 The First Incompleteness Theorem'}),\n",
+       " Document(page_content='This account of Gödel’s discovery was told to Hao Wang very much after the fact; but in Gödel’s contemporary correspondence with Bernays and Zermelo, essentially the same description of his path to the theorems is given. (See Gödel 2003a and Gödel 2003b respectively.) From those accounts we see that the undefinability of truth in arithmetic, a result credited to Tarski, was likely obtained in some form by Gödel by 1931. But he neither publicized nor published the result; the biases logicians', metadata={'Header 1': 'Kurt Gödel', 'Header 2': '2. Gödel’s Mathematical Work', 'Header 3': '2.2 The Incompleteness Theorems', 'Header 4': '2.2.1 The First Incompleteness Theorem'}),\n",
+       " Document(page_content='result; the biases logicians had expressed at the time concerning the notion of truth, biases which came vehemently to the fore when Tarski announced his results on the undefinability of truth in formal systems 1935, may have served as a deterrent to Gödel’s publication of that theorem.', metadata={'Header 1': 'Kurt Gödel', 'Header 2': '2. Gödel’s Mathematical Work', 'Header 3': '2.2 The Incompleteness Theorems', 'Header 4': '2.2.1 The First Incompleteness Theorem'}),\n",
+       " Document(page_content='We now describe the proof of the two theorems, formulating Gödel’s results in Peano arithmetic. Gödel himself used a system related to that defined in Principia Mathematica, but containing Peano arithmetic. In our presentation of the First and Second Incompleteness Theorems we refer to Peano arithmetic as P, following Gödel’s notation.', metadata={'Header 1': 'Kurt Gödel', 'Header 2': '2. Gödel’s Mathematical Work', 'Header 3': '2.2 The Incompleteness Theorems', 'Header 4': '2.2.2 The proof of the First Incompleteness Theorem'})]"
+      ]
+     },
+     "execution_count": 2,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "from langchain.text_splitter import RecursiveCharacterTextSplitter\n",
+    "\n",
+    "url = \"https://plato.stanford.edu/entries/goedel/\"\n",
+    "\n",
+    "headers_to_split_on = [\n",
+    "    (\"h1\", \"Header 1\"),\n",
+    "    (\"h2\", \"Header 2\"),\n",
+    "    (\"h3\", \"Header 3\"),\n",
+    "    (\"h4\", \"Header 4\"),\n",
+    "]\n",
+    "\n",
+    "html_splitter = HTMLHeaderTextSplitter(headers_to_split_on=headers_to_split_on)\n",
+    "\n",
+    "#for local file use html_splitter.split_text_from_file(<path_to_file>)\n",
+    "html_header_splits = html_splitter.split_text_from_url(url)\n",
+    "\n",
+    "chunk_size = 500\n",
+    "chunk_overlap = 30\n",
+    "text_splitter = RecursiveCharacterTextSplitter(\n",
+    "    chunk_size=chunk_size, chunk_overlap=chunk_overlap\n",
+    ")\n",
+    "\n",
+    "# Split\n",
+    "splits = text_splitter.split_documents(html_header_splits)\n",
+    "splits[80:85]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "ac0930371d79554a",
+   "metadata": {
+    "collapsed": false,
+    "jupyter": {
+     "outputs_hidden": false
+    }
+   },
+   "source": [
+    "## Limitations\n",
+    "\n",
+    "There can be quite a bit of structural variation from one HTML document to another, and while `HTMLHeaderTextSplitter` will attempt to attach all \"relevant\" headers to any given chunk, it can sometimes miss certain headers. For example, the algorithm assumes an informational hierarchy in which headers are always at nodes \"above\" associated text, i.e. prior siblings, ancestors, and combinations thereof. In the following news article (as of the writing of this document), the document is structured such that the text of the top-level headline, while tagged \"h1\", is in a *distinct* subtree from the text elements that we'd expect it to be *\"above\"*&mdash;so we can observe that the \"h1\" element and its associated text do not show up in the chunk metadata (but, where applicable, we do see \"h2\" and its associated text):   \n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "id": "5a5ec1482171b119",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-10-02T19:03:25.943524300Z",
+     "start_time": "2023-10-02T19:03:25.691641Z"
+    },
+    "collapsed": false,
+    "jupyter": {
+     "outputs_hidden": false
+    }
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "No two El NiÃ±o winters are the same, but many have temperature and precipitation trends in common.  \n",
+      "Average conditions during an El NiÃ±o winter across the continental US.  \n",
+      "One of the major reasons is the position of the jet stream, which often shifts south during an El NiÃ±o winter. This shift typically brings wetter and cooler weather to the South while the North becomes drier and warmer, according to NOAA.  \n",
+      "Because the jet stream is essentially a river of air that storms flow through, the\n"
+     ]
+    }
+   ],
+   "source": [
+    "url = \"https://www.cnn.com/2023/09/25/weather/el-nino-winter-us-climate/index.html\"\n",
+    "\n",
+    "headers_to_split_on = [\n",
+    "    (\"h1\", \"Header 1\"),\n",
+    "    (\"h2\", \"Header 2\"),\n",
+    "]\n",
+    "\n",
+    "html_splitter = HTMLHeaderTextSplitter(headers_to_split_on=headers_to_split_on)\n",
+    "html_header_splits = html_splitter.split_text_from_url(url)\n",
+    "print(html_header_splits[1].page_content[:500])"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "poetry-venv",
+   "language": "python",
+   "name": "poetry-venv"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.9.1"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/extras/modules/data_connection/document_transformers/text_splitters/markdown_header_metadata.ipynb
+++ b/docs/extras/modules/data_connection/document_transformers/text_splitters/markdown_header_metadata.ipynb
@@ -40,9 +40,14 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 2,
+   "execution_count": 1,
   "id": "ceb3c1fb",
-   "metadata": {},
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-09-25T19:12:27.243781300Z",
+     "start_time": "2023-09-25T19:12:24.943559400Z"
+    }
+   },
   "outputs": [],
   "source": [
    "from langchain.text_splitter import MarkdownHeaderTextSplitter"
@@ -50,19 +55,20 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 5,
+   "execution_count": 2,
   "id": "2ae3649b",
-   "metadata": {},
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-09-25T19:12:31.917013600Z",
+     "start_time": "2023-09-25T19:12:31.905694500Z"
+    }
+   },
   "outputs": [
    {
     "data": {
-      "text/plain": [
-       "[Document(page_content='Hi this is Jim  \\nHi this is Joe', metadata={'Header 1': 'Foo', 'Header 2': 'Bar'}),\n",
-       " Document(page_content='Hi this is Lance', metadata={'Header 1': 'Foo', 'Header 2': 'Bar', 'Header 3': 'Boo'}),\n",
-       " Document(page_content='Hi this is Molly', metadata={'Header 1': 'Foo', 'Header 2': 'Baz'})]"
-      ]
+      "text/plain": "[Document(page_content='Hi this is Jim  \\nHi this is Joe', metadata={'Header 1': 'Foo', 'Header 2': 'Bar'}),\n Document(page_content='Hi this is Lance', metadata={'Header 1': 'Foo', 'Header 2': 'Bar', 'Header 3': 'Boo'}),\n Document(page_content='Hi this is Molly', metadata={'Header 1': 'Foo', 'Header 2': 'Baz'})]"
     },
-     "execution_count": 5,
+     "execution_count": 2,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -83,17 +89,20 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 4,
+   "execution_count": 3,
   "id": "aac1738c",
-   "metadata": {},
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-09-25T19:12:35.672077100Z",
+     "start_time": "2023-09-25T19:12:35.666731400Z"
+    }
+   },
   "outputs": [
    {
     "data": {
-      "text/plain": [
-       "langchain.schema.Document"
-      ]
+      "text/plain": "langchain.schema.document.Document"
     },
-     "execution_count": 4,
+     "execution_count": 3,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -112,21 +121,20 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 8,
+   "execution_count": 4,
   "id": "480e0e3a",
-   "metadata": {},
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-09-25T19:12:41.337249Z",
+     "start_time": "2023-09-25T19:12:41.326099200Z"
+    }
+   },
   "outputs": [
    {
     "data": {
-      "text/plain": [
-       "[Document(page_content='Markdown[9] is a lightweight markup language for creating formatted text using a plain-text editor. John Gruber created Markdown in 2004 as a markup language that is appealing to human readers in its source code form.[9]', metadata={'Header 1': 'Intro', 'Header 2': 'History'}),\n",
-       " Document(page_content='Markdown is widely used in blogging, instant messaging, online forums, collaborative software, documentation pages, and readme files.', metadata={'Header 1': 'Intro', 'Header 2': 'History'}),\n",
-       " Document(page_content='As Markdown popularity grew rapidly, many Markdown implementations appeared, driven mostly by the need for  \\nadditional features such as tables, footnotes, definition lists,[note 1] and Markdown inside HTML blocks.  \\n#### Standardization', metadata={'Header 1': 'Intro', 'Header 2': 'Rise and divergence'}),\n",
-       " Document(page_content='#### Standardization  \\nFrom 2012, a group of people, including Jeff Atwood and John MacFarlane, launched what Atwood characterised as a standardisation effort.', metadata={'Header 1': 'Intro', 'Header 2': 'Rise and divergence'}),\n",
-       " Document(page_content='Implementations of Markdown are available for over a dozen programming languages.', metadata={'Header 1': 'Intro', 'Header 2': 'Implementations'})]"
-      ]
+      "text/plain": "[Document(page_content='Markdown[9] is a lightweight markup language for creating formatted text using a plain-text editor. John Gruber created Markdown in 2004 as a markup language that is appealing to human readers in its source code form.[9]', metadata={'Header 1': 'Intro', 'Header 2': 'History'}),\n Document(page_content='Markdown is widely used in blogging, instant messaging, online forums, collaborative software, documentation pages, and readme files.', metadata={'Header 1': 'Intro', 'Header 2': 'History'}),\n Document(page_content='As Markdown popularity grew rapidly, many Markdown implementations appeared, driven mostly by the need for  \\nadditional features such as tables, footnotes, definition lists,[note 1] and Markdown inside HTML blocks.  \\n#### Standardization', metadata={'Header 1': 'Intro', 'Header 2': 'Rise and divergence'}),\n Document(page_content='#### Standardization  \\nFrom 2012, a group of people, including Jeff Atwood and John MacFarlane, launched what Atwood characterised as a standardisation effort.', metadata={'Header 1': 'Intro', 'Header 2': 'Rise and divergence'}),\n Document(page_content='Implementations of Markdown are available for over a dozen programming languages.', metadata={'Header 1': 'Intro', 'Header 2': 'Implementations'})]"
     },
-     "execution_count": 8,
+     "execution_count": 4,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -156,6 +164,16 @@
    "splits = text_splitter.split_documents(md_header_splits)\n",
    "splits"
   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "outputs": [],
+   "source": [],
+   "metadata": {
+    "collapsed": false
+   },
+   "id": "4017f148d414a45c"
  }
 ],
 "metadata": {
--- a/docs/extras/modules/model_io/prompts/prompt_templates/formats.mdx
+++ b/docs/extras/modules/model_io/prompts/prompt_templates/formats.mdx
@@ -26,4 +26,4 @@ prompt.format(adjective="funny", content="chickens")
 # Output: Tell me a funny joke about chickens.
 ```

-Currently, only `jinja2` and `f-string` are supported. For other formats, kindly raise an issue on the [Github page](https://github.com/hwchase17/langchain/issues).
+Currently, only `jinja2` and `f-string` are supported. For other formats, kindly raise an issue on the [Github page](https://github.com/langchain-ai/langchain/issues).
--- a/docs/extras/use_cases/code_understanding.ipynb
+++ b/docs/extras/use_cases/code_understanding.ipynb
@@ -92,7 +92,7 @@
   "source": [
    "# Clone\n",
    "repo_path = \"/Users/rlm/Desktop/test_repo\"\n",
-    "# repo = Repo.clone_from(\"https://github.com/hwchase17/langchain\", to_path=repo_path)"
+    "# repo = Repo.clone_from(\"https://github.com/langchain-ai/langchain\", to_path=repo_path)"
   ]
  },
  {
--- a/docs/extras/use_cases/extraction.ipynb
+++ b/docs/extras/use_cases/extraction.ipynb
@@ -443,7 +443,7 @@
    }
   ],
   "source": [
-    "from typing import Sequence\n",
+    "from typing import Sequence, Optional\n",
    "from langchain.prompts import (\n",
    "    PromptTemplate,\n",
    "    ChatPromptTemplate,\n",
--- a/docs/extras/use_cases/more/agents/agent_simulations/multiagent_bidding.ipynb
+++ b/docs/extras/use_cases/more/agents/agent_simulations/multiagent_bidding.ipynb
@@ -414,7 +414,7 @@
    "1. define a format they will produce their outputs in\n",
    "2. parse their outputs\n",
    "\n",
-    "We can subclass the [RegexParser](https://github.com/hwchase17/langchain/blob/master/langchain/output_parsers/regex.py) to implement our own custom output parser for bids."
+    "We can subclass the [RegexParser](https://github.com/langchain-ai/langchain/blob/master/langchain/output_parsers/regex.py) to implement our own custom output parser for bids."
   ]
  },
  {
--- a/docs/extras/use_cases/more/agents/agents/sales_agent_with_context.ipynb
+++ b/docs/extras/use_cases/more/agents/agents/sales_agent_with_context.ipynb
@@ -19,7 +19,7 @@
    "Here, we show how the AI Sales Agent can use a **Product Knowledge Base** to speak about a particular's company offerings,\n",
    "hence increasing relevance and reducing hallucinations.\n",
    "\n",
-    "We leverage the [`langchain`](https://github.com/hwchase17/langchain) library in this implementation, specifically [Custom Agent Configuration](https://langchain-langchain.vercel.app/docs/modules/agents/how_to/custom_agent_with_tool_retrieval) and are inspired by [BabyAGI](https://github.com/yoheinakajima/babyagi) architecture ."
+    "We leverage the [`langchain`](https://github.com/langchain-ai/langchain) library in this implementation, specifically [Custom Agent Configuration](https://langchain-langchain.vercel.app/docs/modules/agents/how_to/custom_agent_with_tool_retrieval) and are inspired by [BabyAGI](https://github.com/yoheinakajima/babyagi) architecture ."
   ]
  },
  {
--- a/docs/extras/use_cases/more/agents/autonomous_agents/plan_and_execute.mdx
+++ b/docs/extras/use_cases/more/agents/autonomous_agents/plan_and_execute.mdx
@@ -27,7 +27,7 @@ llm = OpenAI(temperature=0)
 llm_math_chain = LLMMathChain.from_llm(llm=llm, verbose=True)
 tools = [
    Tool(
-        name = "Search",
+        name="Search",
        func=search.run,
        description="useful for when you need to answer questions about current events"
    ),
--- a/docs/extras/use_cases/more/code_writing/cpal.ipynb
+++ b/docs/extras/use_cases/more/code_writing/cpal.ipynb
@@ -10,7 +10,7 @@
    "\n",
    "The CPAL chain builds on the recent PAL to stop LLM hallucination. The problem with the PAL approach is that it hallucinates on a math problem with a nested chain of dependence. The innovation here is that this new CPAL approach includes causal structure to fix hallucination.\n",
    "\n",
-    "The original [PR's description](https://github.com/hwchase17/langchain/pull/6255) contains a full overview.\n",
+    "The original [PR's description](https://github.com/langchain-ai/langchain/pull/6255) contains a full overview.\n",
    "\n",
    "Using the CPAL chain, the LLM translated this\n",
    "\n",
--- a/docs/extras/use_cases/more/graph/graph_cypher_qa.ipynb
+++ b/docs/extras/use_cases/more/graph/graph_cypher_qa.ipynb
@@ -48,7 +48,16 @@
   "execution_count": 2,
   "id": "0928915d",
   "metadata": {},
-   "outputs": [],
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "/home/tomaz/neo4j/langchain/libs/langchain/langchain/graphs/neo4j_graph.py:52: ExperimentalWarning: The configuration may change in the future.\n",
+      "  self._driver.verify_connectivity()\n"
+     ]
+    }
+   ],
   "source": [
    "graph = Neo4jGraph(\n",
    "    url=\"bolt://localhost:7687\", username=\"neo4j\", password=\"pleaseletmein\"\n",
@@ -558,6 +567,75 @@
    "# Inspect graph schema\n",
    "print(chain.graph_schema)"
   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "f0202e88-d700-40ed-aef9-0c969c7bf951",
+   "metadata": {},
+   "source": [
+    "# Validate generated Cypher statements\n",
+    "You can use the `validate_cypher` parameter to validate and correct relationship directions in generated Cypher statements"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 20,
+   "id": "53665d03-7afd-433c-bdd5-750127bfb152",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "chain = GraphCypherQAChain.from_llm(\n",
+    "     llm=ChatOpenAI(temperature=0, model=\"gpt-3.5-turbo\"),\n",
+    "     graph=graph,\n",
+    "     verbose=True,\n",
+    "     validate_cypher=True\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 21,
+   "id": "19e1a591-9c10-4d7b-aa36-a5e1b778a97b",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "\n",
+      "\n",
+      "\u001b[1m> Entering new GraphCypherQAChain chain...\u001b[0m\n",
+      "Generated Cypher:\n",
+      "\u001b[32;1m\u001b[1;3mMATCH (a:Actor)-[:ACTED_IN]->(m:Movie {name: 'Top Gun'})\n",
+      "RETURN a.name\u001b[0m\n",
+      "Full Context:\n",
+      "\u001b[32;1m\u001b[1;3m[{'a.name': 'Tom Cruise'}, {'a.name': 'Val Kilmer'}, {'a.name': 'Anthony Edwards'}, {'a.name': 'Meg Ryan'}]\u001b[0m\n",
+      "\n",
+      "\u001b[1m> Finished chain.\u001b[0m\n"
+     ]
+    },
+    {
+     "data": {
+      "text/plain": [
+       "'Tom Cruise, Val Kilmer, Anthony Edwards, and Meg Ryan played in Top Gun.'"
+      ]
+     },
+     "execution_count": 21,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "chain.run(\"Who played in Top Gun?\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "3fa3f3d5-f7e7-4ca9-8f07-ca22b897f192",
+   "metadata": {},
+   "outputs": [],
+   "source": []
  }
 ],
 "metadata": {
--- a/docs/extras/use_cases/question_answering/how_to/local_retrieval_qa.ipynb
+++ b/docs/extras/use_cases/question_answering/how_to/local_retrieval_qa.ipynb
@@ -617,7 +617,7 @@
    "\n",
    "For an even simpler flow, use `RetrievalQA`.\n",
    "\n",
-    "This will use a QA default prompt (shown [here](https://github.com/hwchase17/langchain/blob/275b926cf745b5668d3ea30236635e20e7866442/langchain/chains/retrieval_qa/prompt.py#L4)) and will retrieve from the vectorDB.\n",
+    "This will use a QA default prompt (shown [here](https://github.com/langchain-ai/langchain/blob/275b926cf745b5668d3ea30236635e20e7866442/langchain/chains/retrieval_qa/prompt.py#L4)) and will retrieve from the vectorDB.\n",
    "\n",
    "But, you can still pass in a prompt, as before, if desired."
   ]
--- a/docs/extras/use_cases/question_answering/question_answering.ipynb
+++ b/docs/extras/use_cases/question_answering/question_answering.ipynb
@@ -10,9 +10,13 @@
    "[![Open In Collab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/langchain-ai/langchain/blob/master/docs/extras/use_cases/question_answering/qa.ipynb)\n",
    "\n",
    "## Use case\n",
-    "Suppose you have some text documents (PDF, blog, Notion pages, etc.) and want to ask questions related to the contents of those documents. LLMs, given their proficiency in understanding text, are a great tool for this.\n",
+    "Suppose you have some text documents (PDF, blog, Notion pages, etc.) and want to ask questions related to the contents of those documents. \n",
    "\n",
-    "In this walkthrough we'll go over how to build a question-answering over documents application using LLMs. Two very related use cases which we cover elsewhere are:\n",
+    "LLMs, given their proficiency in understanding text, are a great tool for this.\n",
+    "\n",
+    "In this walkthrough we'll go over how to build a question-answering over documents application using LLMs. \n",
+    "\n",
+    "Two very related use cases which we cover elsewhere are:\n",
    "- [QA over structured data](/docs/use_cases/qa_structured/sql) (e.g., SQL)\n",
    "- [QA over code](/docs/use_cases/code_understanding) (e.g., Python)\n",
    "\n",
@@ -20,19 +24,21 @@
    "\n",
    "## Overview\n",
    "The pipeline for converting raw unstructured data into a QA chain looks like this:\n",
-    "1. `Loading`: First we need to load our data. Unstructured data can be loaded from many sources. Use the [LangChain integration hub](https://integrations.langchain.com/) to browse the full set of loaders.\n",
-    "Each loader returns data as a LangChain [`Document`](/docs/components/schema/document).\n",
+    "1. `Loading`: First we need to load our data. Use the [LangChain integration hub](https://integrations.langchain.com/) to browse the full set of loaders. \n",
    "2. `Splitting`: [Text splitters](/docs/modules/data_connection/document_transformers/) break `Documents` into splits of specified size\n",
    "3. `Storage`: Storage (e.g., often a [vectorstore](/docs/modules/data_connection/vectorstores/)) will house [and often embed](https://www.pinecone.io/learn/vector-embeddings/) the splits\n",
    "4. `Retrieval`: The app retrieves splits from storage (e.g., often [with similar embeddings](https://www.pinecone.io/learn/k-nearest-neighbor/) to the input question)\n",
    "5. `Generation`: An [LLM](/docs/modules/model_io/models/llms/) produces an answer using a prompt that includes the question and the retrieved data\n",
-    "6. `Conversation` (Extension): Hold a multi-turn conversation by adding [Memory](/docs/modules/memory/) to your QA chain.\n",
    "\n",
    "![flow.jpeg](/img/qa_flow.jpeg)\n",
    "\n",
    "## Quickstart\n",
    "\n",
-    "To give you a sneak preview, the above pipeline can be all be wrapped in a single object: `VectorstoreIndexCreator`. Suppose we want a QA app over this [blog post](https://lilianweng.github.io/posts/2023-06-23-agent/). We can create this in a few lines of code. First set environment variables and install packages:"
+    "Suppose we want a QA app over this [blog post](https://lilianweng.github.io/posts/2023-06-23-agent/). \n",
+    "\n",
+    "We can create this in a few lines of code. \n",
+    "\n",
+    "First set environment variables and install packages:"
   ]
  },
  {
@@ -42,7 +48,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "pip install openai chromadb\n",
+    "pip install langchain openai chromadb langchainhub\n",
    "\n",
    "# Set env var OPENAI_API_KEY or load from a .env file\n",
    "# import dotenv\n",
@@ -53,44 +59,118 @@
  {
   "cell_type": "code",
   "execution_count": 1,
-   "id": "046cefc0",
+   "id": "820244ae-74b4-4593-b392-822979dd91b8",
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.document_loaders import WebBaseLoader\n",
-    "from langchain.indexes import VectorstoreIndexCreator\n",
+    "# Load documents\n",
    "\n",
-    "loader = WebBaseLoader(\"https://lilianweng.github.io/posts/2023-06-23-agent/\")\n",
-    "index = VectorstoreIndexCreator().from_loaders([loader])"
+    "from langchain.document_loaders import WebBaseLoader\n",
+    "loader = WebBaseLoader(\"https://lilianweng.github.io/posts/2023-06-23-agent/\")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
-   "id": "f4bf8740",
+   "id": "c89a0aa7-1e7e-4557-90e5-a7ea87db00e7",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Split documents\n",
+    "\n",
+    "from langchain.text_splitter import RecursiveCharacterTextSplitter\n",
+    "text_splitter = RecursiveCharacterTextSplitter(chunk_size = 500, chunk_overlap = 0)\n",
+    "splits = text_splitter.split_documents(loader.load())"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "id": "000e46f6-dafc-4a43-8417-463d0614fd30",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Embed and store splits\n",
+    "\n",
+    "from langchain.vectorstores import Chroma\n",
+    "from langchain.embeddings import OpenAIEmbeddings\n",
+    "vectorstore = Chroma.from_documents(documents=splits,embedding=OpenAIEmbeddings())\n",
+    "retriever = vectorstore.as_retriever()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "id": "dacbde0b-7d45-4a2c-931d-81bb094aec94",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Prompt \n",
+    "# https://smith.langchain.com/hub/rlm/rag-prompt\n",
+    "\n",
+    "from langchain import hub\n",
+    "rag_prompt = hub.pull(\"rlm/rag-prompt\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 7,
+   "id": "79b9fdae-c2bf-4cf6-884f-c19aa07dd975",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# LLM\n",
+    "\n",
+    "from langchain.chat_models import ChatOpenAI\n",
+    "llm = ChatOpenAI(model_name=\"gpt-3.5-turbo\", temperature=0)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "id": "92c0f3ae-6ab2-4d04-9b22-1963b96b9db5",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# RAG chain \n",
+    "\n",
+    "from langchain.schema.runnable import RunnablePassthrough\n",
+    "rag_chain = (\n",
+    "    {\"context\": retriever, \"question\": RunnablePassthrough()} \n",
+    "    | rag_prompt \n",
+    "    | llm \n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "id": "0d3b0f36-7b56-49c0-8e40-a1aa9ebcbf24",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "' Task decomposition is a technique used to break down complex tasks into smaller and simpler steps. It can be done using LLM with simple prompting, task-specific instructions, or with human inputs. Tree of Thoughts (Yao et al. 2023) is an extension of Chain of Thought (Wei et al. 2022) which explores multiple reasoning possibilities at each step.'"
+       "AIMessage(content='Task decomposition is the process of breaking down a task into smaller subgoals or steps. It can be done using simple prompting, task-specific instructions, or human inputs.')"
      ]
     },
-     "execution_count": 3,
+     "execution_count": 10,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
-    "index.query(\"What is Task Decomposition?\")"
+    "rag_chain.invoke(\"What is Task Decomposition?\")"
   ]
  },
  {
   "cell_type": "markdown",
-   "id": "8224aad6",
+   "id": "639dc31a-7f16-40f6-ba2a-20e7c2ecfe60",
   "metadata": {},
   "source": [
-    "Ok, but what's going on under the hood, and how could we customize this for our specific use case? For that, let's take a look at how we can construct this pipeline piece by piece."
+    "[Here](https://smith.langchain.com/public/2270a675-74de-47ac-b111-b232d8340a64/r) is the LangSmith trace for this chain.\n",
+    "\n",
+    "Below we will explain each step in more detail."
   ]
  },
  {
@@ -100,7 +180,9 @@
   "source": [
    "## Step 1. Load\n",
    "\n",
-    "Specify a `DocumentLoader` to load in your unstructured data as `Documents`. A `Document` is a piece of text (the `page_content`) and associated metadata."
+    "Specify a `DocumentLoader` to load in your unstructured data as `Documents`. \n",
+    "\n",
+    "A `Document` is a dict with text (`page_content`) and `metadata`."
   ]
  },
  {
@@ -122,7 +204,7 @@
   "metadata": {},
   "source": [
    "### Go deeper\n",
-    "- Browse the > 120 data loader integrations [here](https://integrations.langchain.com/).\n",
+    "- Browse the > 160 data loader integrations [here](https://integrations.langchain.com/).\n",
    "- See further documentation on loaders [here](/docs/modules/data_connection/document_loaders/).\n",
    "\n",
    "## Step 2. Split\n",
@@ -150,7 +232,7 @@
   "source": [
    "### Go deeper\n",
    "\n",
-    "- `DocumentSplitters` are just one type of the more generic `DocumentTransformers`, which can all be useful in this preprocessing step.\n",
+    "- `DocumentSplitters` are just one type of the more generic `DocumentTransformers`.\n",
    "- See further documentation on transformers [here](/docs/modules/data_connection/document_transformers/).\n",
    "- `Context-aware splitters` keep the location (\"context\") of each split in the original `Document`:\n",
    "    - [Markdown files](/docs/use_cases/question_answering/how_to/document-context-aware-QA)\n",
@@ -160,7 +242,10 @@
    "## Step 3. Store\n",
    "\n",
    "To be able to look up our document splits, we first need to store them where we can later look them up.\n",
-    "The most common way to do this is to embed the contents of each document then store the embedding and document in a vector store, with the embedding being used to index the document."
+    "\n",
+    "The most common way to do this is to embed the contents of each document split.\n",
+    "\n",
+    "We store the embedding and splits in a vectorstore."
   ]
  },
  {
@@ -193,7 +278,9 @@
    "\n",
    "## Step 4. Retrieve\n",
    "\n",
-    "Retrieve relevant splits for any question using [similarity search](https://www.pinecone.io/learn/what-is-similarity-search/)."
+    "Retrieve relevant splits for any question using [similarity search](https://www.pinecone.io/learn/what-is-similarity-search/).\n",
+    "\n",
+    "This is simply \"top K\" retrieval where we select documents based on embedding similarity to the query."
   ]
  },
  {
@@ -228,7 +315,9 @@
    "\n",
    "Vectorstores are commonly used for retrieval, but they are not the only option. For example, SVMs (see thread [here](https://twitter.com/karpathy/status/1647025230546886658?s=20)) can also be used.\n",
    "\n",
-    "LangChain [has many retrievers](/docs/modules/data_connection/retrievers/) including, but not limited to, vectorstores. All retrievers implement a common method `get_relevant_documents()` (and its asynchronous variant `aget_relevant_documents()`)."
+    "LangChain [has many retrievers](/docs/modules/data_connection/retrievers/) including, but not limited to, vectorstores. \n",
+    "\n",
+    "All retrievers implement a common method `get_relevant_documents()` (and its asynchronous variant `aget_relevant_documents()`)."
   ]
  },
  {
@@ -275,7 +364,6 @@
   "outputs": [],
   "source": [
    "import logging\n",
-    "\n",
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.retrievers.multi_query import MultiQueryRetriever\n",
    "\n",
@@ -288,6 +376,20 @@
    "len(unique_docs)"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "ee8420e6-73a6-411b-a84d-74b096bddad7",
+   "metadata": {},
+   "source": [
+    "In addition, a useful concept for improving retrieval is decoupling the documents from the embedded search key.\n",
+    "\n",
+    "For example, we can embed a document summary or question that are likely to lead to the document being retrieved.\n",
+    "\n",
+    "See details in [here](docs/modules/data_connection/retrievers/multi_vector) on the multi-vector retriever for this purpose.\n",
+    "\n",
+    "![mv.png](/img/multi_vector.png)"
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "415d6824",
@@ -295,34 +397,44 @@
   "source": [
    "## Step 5. Generate\n",
    "\n",
-    "Distill the retrieved documents into an answer using an LLM/Chat model (e.g., `gpt-3.5-turbo`) with `RetrievalQA` chain.\n"
+    "Distill the retrieved documents into an answer using an LLM/Chat model (e.g., `gpt-3.5-turbo`).\n",
+    "\n",
+    "We use the [Runnable](https://python.langchain.com/docs/expression_language/interface) protocol to define the chain.\n",
+    "\n",
+    "Runnable protocol pipes together components in a transparent way.\n",
+    "\n",
+    "We used a prompt for RAG that is checked into the LangChain prompt hub ([here](https://smith.langchain.com/hub/rlm/rag-prompt))."
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 9,
+   "execution_count": 11,
   "id": "99fa1aec",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "{'query': 'What are the approaches to Task Decomposition?',\n",
-       " 'result': 'The approaches to task decomposition include:\\n\\n1. Simple prompting: This approach involves using simple prompts or questions to guide the agent in breaking down a task into smaller subgoals. For example, the agent can be prompted with \"Steps for XYZ\" or \"What are the subgoals for achieving XYZ?\" to facilitate task decomposition.\\n\\n2. Task-specific instructions: In this approach, task-specific instructions are provided to the agent to guide the decomposition process. For example, if the task is to write a novel, the agent can be instructed to \"Write a story outline\" as a step in the task decomposition.\\n\\n3. Human inputs: This approach involves incorporating human inputs in the task decomposition process. Humans can provide guidance, feedback, and assistance to the agent in breaking down complex tasks into manageable subgoals.\\n\\nThese approaches aim to enable efficient handling of complex tasks by breaking them down into smaller, more manageable subgoals.'}"
+       "AIMessage(content='Task decomposition is the process of breaking down a task into smaller subgoals or steps. It can be done using simple prompting, task-specific instructions, or human inputs.')"
      ]
     },
-     "execution_count": 9,
+     "execution_count": 11,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
-    "from langchain.chains import RetrievalQA\n",
    "from langchain.chat_models import ChatOpenAI\n",
-    "\n",
    "llm = ChatOpenAI(model_name=\"gpt-3.5-turbo\", temperature=0)\n",
-    "qa_chain = RetrievalQA.from_chain_type(llm,retriever=vectorstore.as_retriever())\n",
-    "qa_chain({\"query\": question})"
+    "\n",
+    "from langchain.schema.runnable import RunnablePassthrough\n",
+    "rag_chain = (\n",
+    "    {\"context\": retriever, \"question\": RunnablePassthrough()} \n",
+    "    | rag_prompt \n",
+    "    | llm \n",
+    ")\n",
+    "\n",
+    "rag_chain.invoke(\"What is Task Decomposition?\")"
   ]
  },
  {
@@ -330,12 +442,10 @@
   "id": "f7d52c84",
   "metadata": {},
   "source": [
-    "Note, you can pass in an `LLM` or a `ChatModel` (like we did here) to the `RetrievalQA` chain.\n",
-    "\n",
    "### Go deeper\n",
    "\n",
    "#### Choosing LLMs\n",
-    "- Browse the > 55 LLM and chat model integrations [here](https://integrations.langchain.com/).\n",
+    "- Browse the > 90 LLM and chat model integrations [here](https://integrations.langchain.com/).\n",
    "- See further documentation on LLMs and chat models [here](/docs/modules/model_io/models/).\n",
    "- See a guide on local LLMS [here](/docs/modules/use_cases/question_answering/how_to/local_retrieval_qa)."
   ]
@@ -347,28 +457,29 @@
   "source": [
    "#### Customizing the prompt\n",
    "\n",
-    "The prompt in `RetrievalQA` chain can be easily customized."
+    "As shown above, we can load prompts (e.g., [this RAG prompt](https://smith.langchain.com/hub/rlm/rag-prompt)) from the prompt hub.\n",
+    "\n",
+    "The prompt can also be easily customized, as shown below."
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 10,
+   "execution_count": 12,
   "id": "e4fee704",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "'The approaches to Task Decomposition are (1) using simple prompting by LLM, (2) using task-specific instructions, and (3) incorporating human inputs. Thanks for asking!'"
+       "AIMessage(content='Task decomposition is the process of breaking down a complicated task into smaller, more manageable subtasks or steps. It can be done using prompts, task-specific instructions, or human inputs. Thanks for asking!')"
      ]
     },
-     "execution_count": 10,
+     "execution_count": 12,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
-    "from langchain.chains import RetrievalQA\n",
    "from langchain.prompts import PromptTemplate\n",
    "\n",
    "template = \"\"\"Use the following pieces of context to answer the question at the end. \n",
@@ -378,229 +489,23 @@
    "{context}\n",
    "Question: {question}\n",
    "Helpful Answer:\"\"\"\n",
-    "QA_CHAIN_PROMPT = PromptTemplate.from_template(template)\n",
+    "rag_prompt_custom = PromptTemplate.from_template(template)\n",
    "\n",
-    "llm = ChatOpenAI(model_name=\"gpt-3.5-turbo\", temperature=0)\n",
-    "qa_chain = RetrievalQA.from_chain_type(\n",
-    "    llm,\n",
-    "    retriever=vectorstore.as_retriever(),\n",
-    "    chain_type_kwargs={\"prompt\": QA_CHAIN_PROMPT}\n",
+    "rag_chain = (\n",
+    "    {\"context\": retriever, \"question\": RunnablePassthrough()} \n",
+    "    | rag_prompt_custom \n",
+    "    | llm \n",
    ")\n",
-    "result = qa_chain({\"query\": question})\n",
-    "result[\"result\"]"
+    "\n",
+    "rag_chain.invoke(\"What is Task Decomposition?\")"
   ]
  },
  {
   "cell_type": "markdown",
-   "id": "c825e9bf-6a56-46e4-8bbb-05441f76cb96",
+   "id": "5f5b6297-715a-444e-b3ef-a6d27382b435",
   "metadata": {},
   "source": [
-    "We can also store and fetch prompts from the LangChain prompt hub.\n",
-    "\n",
-    "This will work with your [LangSmith API key](https://docs.smith.langchain.com/).\n",
-    "\n",
-    "For example, see [here](https://smith.langchain.com/hub/rlm/rag-prompt) is a common prompt for RAG.\n",
-    "\n",
-    "We can load this."
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": null,
-   "id": "a896060f-ebc4-4236-a4ad-32960601c6e8",
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "pip install langchainhub"
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 11,
-   "id": "aef8e734-ba54-48ae-b959-1898618f2d90",
-   "metadata": {},
-   "outputs": [
-    {
-     "data": {
-      "text/plain": [
-       "'The approaches to task decomposition include using LLM with simple prompting, task-specific instructions, and human inputs.'"
-      ]
-     },
-     "execution_count": 11,
-     "metadata": {},
-     "output_type": "execute_result"
-    }
-   ],
-   "source": [
-    "# RAG prompt\n",
-    "from langchain import hub\n",
-    "QA_CHAIN_PROMPT_HUB = hub.pull(\"rlm/rag-prompt\")\n",
-    "\n",
-    "qa_chain = RetrievalQA.from_chain_type(\n",
-    "    llm,\n",
-    "    retriever=vectorstore.as_retriever(),\n",
-    "    chain_type_kwargs={\"prompt\": QA_CHAIN_PROMPT_HUB}\n",
-    ")\n",
-    "result = qa_chain({\"query\": question})\n",
-    "result[\"result\"]"
-   ]
-  },
-  {
-   "cell_type": "markdown",
-   "id": "ff40e8db",
-   "metadata": {
-    "jp-MarkdownHeadingCollapsed": true
-   },
-   "source": [
-    "#### Return source documents\n",
-    "\n",
-    "The full set of retrieved documents used for answer distillation can be returned using `return_source_documents=True`."
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 14,
-   "id": "60004293",
-   "metadata": {},
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "4\n"
-     ]
-    },
-    {
-     "data": {
-      "text/plain": [
-       "Document(page_content='Task decomposition can be done (1) by LLM with simple prompting like \"Steps for XYZ.\\\\n1.\", \"What are the subgoals for achieving XYZ?\", (2) by using task-specific instructions; e.g. \"Write a story outline.\" for writing a novel, or (3) with human inputs.', metadata={'description': 'Building agents with LLM (large language model) as its core controller is a cool concept. Several proof-of-concepts demos, such as AutoGPT, GPT-Engineer and BabyAGI, serve as inspiring examples. The potentiality of LLM extends beyond generating well-written copies, stories, essays and programs; it can be framed as a powerful general problem solver.\\nAgent System Overview In a LLM-powered autonomous agent system, LLM functions as the agent’s brain, complemented by several key components:', 'language': 'en', 'source': 'https://lilianweng.github.io/posts/2023-06-23-agent/', 'title': \"LLM Powered Autonomous Agents | Lil'Log\"})"
-      ]
-     },
-     "execution_count": 14,
-     "metadata": {},
-     "output_type": "execute_result"
-    }
-   ],
-   "source": [
-    "from langchain.chains import RetrievalQA\n",
-    "\n",
-    "qa_chain = RetrievalQA.from_chain_type(llm,retriever=vectorstore.as_retriever(),\n",
-    "                                       return_source_documents=True)\n",
-    "result = qa_chain({\"query\": question})\n",
-    "print(len(result['source_documents']))\n",
-    "result['source_documents'][0]"
-   ]
-  },
-  {
-   "cell_type": "markdown",
-   "id": "1b600236",
-   "metadata": {},
-   "source": [
-    "#### Return citations\n",
-    "\n",
-    "Answer citations can be returned using `RetrievalQAWithSourcesChain`."
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 16,
-   "id": "948f6d19",
-   "metadata": {},
-   "outputs": [
-    {
-     "data": {
-      "text/plain": [
-       "{'question': 'What are the approaches to Task Decomposition?',\n",
-       " 'answer': 'The approaches to Task Decomposition include:\\n1. Using LLM with simple prompting, such as providing steps or subgoals for achieving a task.\\n2. Using task-specific instructions, such as providing a specific instruction like \"Write a story outline\" for writing a novel.\\n3. Using human inputs to decompose the task.\\nAnother approach is the Tree of Thoughts, which extends the Chain of Thought (CoT) technique by exploring multiple reasoning possibilities at each step and generating multiple thoughts per step, creating a tree structure. The search process can be BFS or DFS, and each state can be evaluated by a classifier or majority vote.\\nSources: https://lilianweng.github.io/posts/2023-06-23-agent/',\n",
-       " 'sources': ''}"
-      ]
-     },
-     "execution_count": 16,
-     "metadata": {},
-     "output_type": "execute_result"
-    }
-   ],
-   "source": [
-    "from langchain.chains import RetrievalQAWithSourcesChain\n",
-    "\n",
-    "qa_chain = RetrievalQAWithSourcesChain.from_chain_type(llm,retriever=vectorstore.as_retriever())\n",
-    "\n",
-    "result = qa_chain({\"question\": question})\n",
-    "result"
-   ]
-  },
-  {
-   "cell_type": "markdown",
-   "id": "73d0b138",
-   "metadata": {},
-   "source": [
-    "#### Customizing retrieved document processing\n",
-    "\n",
-    "Retrieved documents can be fed to an LLM for answer distillation in a few different ways.\n",
-    "\n",
-    "`stuff`, `refine`, `map-reduce`, and `map-rerank` chains for passing documents to an LLM prompt are well summarized [here](/docs/modules/chains/document/).\n",
-    " \n",
-    "`stuff` is commonly used because it simply \"stuffs\" all retrieved documents into the prompt.\n",
-    "\n",
-    "The [load_qa_chain](/docs/use_cases/question_answering/how_to/question_answering.html) is an easy way to pass documents to an LLM using these various approaches (e.g., see `chain_type`)."
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 17,
-   "id": "29aa139f",
-   "metadata": {},
-   "outputs": [
-    {
-     "data": {
-      "text/plain": [
-       "{'output_text': 'The approaches to task decomposition mentioned in the provided context are:\\n\\n1. Chain of thought (CoT): This approach involves instructing the language model to \"think step by step\" and decompose complex tasks into smaller and simpler steps. It enhances model performance on complex tasks by utilizing more test-time computation.\\n\\n2. Tree of Thoughts: This approach extends CoT by exploring multiple reasoning possibilities at each step. It decomposes the problem into multiple thought steps and generates multiple thoughts per step, creating a tree structure. The search process can be BFS or DFS, and each state is evaluated by a classifier or majority vote.\\n\\n3. LLM with simple prompting: This approach involves using a language model with simple prompts like \"Steps for XYZ\" or \"What are the subgoals for achieving XYZ?\" to perform task decomposition.\\n\\n4. Task-specific instructions: This approach involves providing task-specific instructions to guide the language model in decomposing the task. For example, providing the instruction \"Write a story outline\" for the task of writing a novel.\\n\\n5. Human inputs: Task decomposition can also be done with human inputs, where humans provide guidance and input to break down the task into smaller subtasks.'}"
-      ]
-     },
-     "execution_count": 17,
-     "metadata": {},
-     "output_type": "execute_result"
-    }
-   ],
-   "source": [
-    "from langchain.chains.question_answering import load_qa_chain\n",
-    "\n",
-    "chain = load_qa_chain(llm, chain_type=\"stuff\")\n",
-    "chain({\"input_documents\": unique_docs, \"question\": question},return_only_outputs=True)"
-   ]
-  },
-  {
-   "cell_type": "markdown",
-   "id": "a8cb8cd1",
-   "metadata": {},
-   "source": [
-    "We can also pass the `chain_type` to `RetrievalQA`."
-   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": 18,
-   "id": "f68574bd",
-   "metadata": {},
-   "outputs": [],
-   "source": [
-    "qa_chain = RetrievalQA.from_chain_type(llm,retriever=vectorstore.as_retriever(),\n",
-    "                                       chain_type=\"stuff\")\n",
-    "result = qa_chain({\"query\": question})"
-   ]
-  },
-  {
-   "cell_type": "markdown",
-   "id": "b33aeb5f",
-   "metadata": {},
-   "source": [
-    "In summary, the user can choose the desired level of abstraction for QA:\n",
-    "\n",
-    "![summary_chains.png](/img/summary_chains.png)\n",
-    "\n",
-    "## Step 6. Chat\n",
-    "\n",
-    "See our [use-case on chat](/docs/use_cases/chatbots) for detail on this!"
+    "We can use [LangSmith](https://smith.langchain.com/public/129cac54-44d5-453a-9807-3bd4835e5f96/r) to see the trace."
   ]
  }
 ],
@@ -620,7 +525,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.9.16"
  }
 },
 "nbformat": 4,
--- a/Show More
+++ b/Show More