Compare commits


115 Commits

Author SHA1 Message Date
Eugene Yurtsev
910d4d00a7 x 2023-09-27 16:18:12 -04:00
Eugene Yurtsev
4cef01adf7 x 2023-09-27 16:11:00 -04:00
Eugene Yurtsev
6b945c3091 x 2023-09-27 16:10:22 -04:00
Eugene Yurtsev
17a383d31f x 2023-09-27 15:55:48 -04:00
Harrison Chase
e355606b11 add more import checks (#11033) 2023-09-27 11:17:12 -07:00
Dan Bolser
efb7c459a2 Update base.py (#10843)
Fixing a typo in the example code in the docstring...

You have to start somewhere though right?

Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>
2023-09-27 11:15:58 -07:00
Jeremy Naccache
c59a5bae48 Fix intermediate steps example in docs : replaced json.dumps with Langchain's dumps() (#10593)
The intermediate steps example in the docs shows how to retrieve
and display the intermediate steps.
But the intermediate steps object is of type AgentAction, which cannot be
passed to json.dumps (it raises an error).
I replaced it with LangChain's dumps function (from langchain.load.dump
import dumps), which is the preferred way to do so.
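A hedged, self-contained sketch of the fixed pattern (the `AgentAction` values here are made up for illustration):

```python
import json

from langchain.load.dump import dumps
from langchain.schema import AgentAction

step = AgentAction(tool="search", tool_input="weather in SF", log="...")

try:
    json.dumps(step)  # fails: AgentAction is not JSON-serializable
except TypeError as err:
    print(err)

# LangChain's dumps() knows how to serialize its own objects.
print(dumps(step, pretty=True))
```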
2023-09-27 11:00:29 -07:00
tanujtiwari-at
a79f595543 Support extra tools argument for pandas agent toolkit (#11040)
**Description** 

We support adding new tools in some toolkits already like the [SQLAgent
toolkit](https://github.com/langchain-ai/langchain/blob/master/libs/langchain/langchain/agents/agent_toolkits/sql/base.py#L27).

Related
[SO](https://stackoverflow.com/questions/76583163/are-langchain-toolkits-able-to-be-modified-can-we-add-tools-to-a-pandas-datafra)
thread.
This replicates the same functionality here, so users can add custom,
bespoke tools.
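A hedged sketch of the new argument (assuming the PR exposes an `extra_tools` parameter on the pandas agent constructor; the tool below is hypothetical):

```python
import pandas as pd

from langchain.agents import create_pandas_dataframe_agent
from langchain.chat_models import ChatOpenAI
from langchain.tools import tool

@tool
def describe_column(name: str) -> str:
    """Hypothetical custom tool the agent can call alongside the built-ins."""
    return f"Column {name} holds user-facing identifiers."

df = pd.DataFrame({"user_id": [1, 2, 3]})
agent = create_pandas_dataframe_agent(
    ChatOpenAI(temperature=0),
    df,
    extra_tools=[describe_column],  # assumed new kwarg from this PR
)
```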
2023-09-27 10:57:04 -07:00
Aashish Saini
c4471d1877 Fixing some spelling mistakes (#10881)
@baskaryan

---------

Co-authored-by: AashutoshPathakShorthillsAI <142410372+AashutoshPathakShorthillsAI@users.noreply.github.com>
Co-authored-by: Aayush <142384656+AayushShorthillsAI@users.noreply.github.com>
Co-authored-by: Aashish Saini <141953346+AashishSainiShorthillsAI@users.noreply.github.com>
Co-authored-by: ManpreetShorthillsAI <142380984+ManpreetShorthillsAI@users.noreply.github.com>
Co-authored-by: AryamanJaiswalShorthillsAI <142397527+AryamanJaiswalShorthillsAI@users.noreply.github.com>
Co-authored-by: Adarsh Shrivastav <142413097+AdarshKumarShorthillsAI@users.noreply.github.com>
Co-authored-by: Vishal <141389263+VishalYadavShorthillsAI@users.noreply.github.com>
Co-authored-by: ChetnaGuptaShorthillsAI <142381084+ChetnaGuptaShorthillsAI@users.noreply.github.com>
Co-authored-by: PankajKumarShorthillsAI <142473460+PankajKumarShorthillsAI@users.noreply.github.com>
Co-authored-by: AbhishekYadavShorthillsAI <142393903+AbhishekYadavShorthillsAI@users.noreply.github.com>
Co-authored-by: AmitSinghShorthillsAI <142410046+AmitSinghShorthillsAI@users.noreply.github.com>
Co-authored-by: Md Nazish Arman <142379599+MdNazishArmanShorthillsAI@users.noreply.github.com>
Co-authored-by: KamalSharmaShorthillsAI <142474019+KamalSharmaShorthillsAI@users.noreply.github.com>
Co-authored-by: Lakshya <lakshyagupta87@yahoo.com>
Co-authored-by: AnujMauryaShorthillsAI <142393269+AnujMauryaShorthillsAI@users.noreply.github.com>
Co-authored-by: Saransh Sharma <142397365+SaranshSharmaShorthillsAI@users.noreply.github.com>
Co-authored-by: GhayurHamzaShorthillsAI <136243850+GhayurHamzaShorthillsAI@users.noreply.github.com>
Co-authored-by: Puneet Dhiman <142409038+PuneetDhimanShorthillsAI@users.noreply.github.com>
Co-authored-by: Riya Rana <142411643+RiyaRanaShorthillsAI@users.noreply.github.com>
Co-authored-by: Akshay Tripathi <142379735+AkshayTripathiShorthillsAI@users.noreply.github.com>
2023-09-27 10:56:51 -07:00
Bagatur
410ac8129d bump 303 (#11120) 2023-09-27 08:30:33 -07:00
Bagatur
8e4dbae428 Add fireworks chat model (#11117) 2023-09-27 08:22:12 -07:00
Bagatur
657581dbdf Fix ChatFireworks typing 2023-09-27 08:15:40 -07:00
Bagatur
12aad659dd add ChatFireworks to chat_models 2023-09-27 08:11:26 -07:00
Bagatur
872ebdaf90 remove FireworksChat from llms 2023-09-27 08:10:41 -07:00
Bagatur
9451240941 Fix fireworks chat linting issues 2023-09-27 08:09:33 -07:00
Harrison Chase
6b4928ad96 fix-lcel-notebooks (#11111)
fix some missing imports/naming
2023-09-27 06:36:11 -07:00
Tomáš Dvořák
865a21938c speed up enforce_stop_tokens helper function (#10984)
**Description:**

As long as `enforce_stop_tokens` returns a first occurrence, we can
speed up the execution by setting the optional `maxsplit` parameter to
1.
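A minimal sketch of the idea (assuming the helper joins the stop sequences into a single regex, as its name suggests):

```python
import re
from typing import List

def enforce_stop_tokens(text: str, stop: List[str]) -> str:
    """Cut off the text at the first occurrence of any stop token."""
    # maxsplit=1 makes re.split stop scanning after the first match,
    # which is all we need since only element [0] is returned.
    return re.split("|".join(stop), text, maxsplit=1)[0]

print(enforce_stop_tokens("final answer\nObservation: ...", ["Observation:"]))
```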

Tag maintainer:
@agola11
@hwchase17


---------

Co-authored-by: Bagatur <baskaryan@gmail.com>
2023-09-27 05:29:29 -07:00
Austin Walker
bb41252dab fix: bump min_unstructured_version for UnstructuredAPIFileLoader (#11025)
**Description:** New metadata fields were added to
`unstructured==0.10.15`, and our hosted API has been updated to reflect
this. When users call `partition_via_api` with an older version of the
library, they'll hit a parsing error related to the new fields.
2023-09-27 05:28:06 -07:00
William FH
75b3893daf Fix runnable branch callbacks (#11091)
We aren't calling on_chain_end here unless we use the default option
2023-09-27 11:38:56 +01:00
Bagatur
6c5251feb0 poetry 2023-09-26 20:12:49 -07:00
Bagatur
5310184f96 poetry 2023-09-26 20:12:29 -07:00
Cynthia Yang
6dd44ff1c0 Refactor Fireworks and add ChatFireworks (#3) (#10597)
Description
* Refactor Fireworks within LangChain LLMs.
* Remove FireworksChat from LangChain LLMs.
* Add ChatFireworks (which uses the chat completions API) to LangChain chat
models.
* Users have to install `fireworks-ai` and register an API key to use
the API.

Issue - Not applicable
Dependencies - None
Tag maintainer - @rlancemartin @baskaryan
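A hedged usage sketch (assuming the class lands as `langchain.chat_models.ChatFireworks` and reads `FIREWORKS_API_KEY` from the environment):

```python
import os

from langchain.chat_models import ChatFireworks
from langchain.schema import HumanMessage

os.environ["FIREWORKS_API_KEY"] = "<your-key>"  # register at fireworks.ai first

chat = ChatFireworks(model="accounts/fireworks/models/llama-v2-13b-chat")
print(chat([HumanMessage(content="Name three uses of embeddings.")]).content)
```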
2023-09-26 20:11:55 -07:00
Bagatur
5514ebe859 Don't type chains in output_parsers (#11092)
Can't use TYPE_CHECKING style imports for pydantic params because it will try to instantiate the typed object by default.
2023-09-26 17:49:35 -07:00
CG80499
64385c4eae Make pairwise comparison chain more like LLM as a judge (#11013)
- **Description:** Adds LLM-as-a-judge as an eval chain.
- **Tag maintainer:** @hwchase17

---------

Co-authored-by: William FH <13333726+hinthornw@users.noreply.github.com>
2023-09-26 13:19:04 -07:00
Joseph McElroy
175ef0a55d [ElasticsearchStore] Enable custom Bulk Args (#11065)
This enables bulk args like `chunk_size` to be passed down from the
ingest methods (`from_texts`, `from_documents`) to the bulk
API.

This helps alleviate issues where bulk importing a large amount of
documents into Elasticsearch was resulting in a timeout.
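A hedged sketch (assuming the new knob is a `bulk_kwargs` dict forwarded to the Elasticsearch bulk helper):

```python
from langchain.embeddings import OpenAIEmbeddings
from langchain.vectorstores import ElasticsearchStore

store = ElasticsearchStore.from_texts(
    ["doc one", "doc two"],
    OpenAIEmbeddings(),
    es_url="http://localhost:9200",
    index_name="articles",
    bulk_kwargs={"chunk_size": 50, "request_timeout": 120},  # assumed pass-through
)
```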

Contribution Shoutout
- @elastic

- [x] Updated Integration tests

---------

Co-authored-by: Bagatur <baskaryan@gmail.com>
2023-09-26 12:53:50 -07:00
Eugene Yurtsev
d19fd0cfae LogEntry/LogStream use str instead of uuid for id (#11080)
Cast the UUID to a string
2023-09-26 20:38:51 +01:00
Bagatur
d85339b9f2 extract sublinks exclude by abs path (#11079) 2023-09-26 12:26:27 -07:00
Bagatur
7ee8b2d1bf exclude dirs in async recursive loading (#11077) 2023-09-26 09:59:04 -07:00
Leonid Ganeline
21199cc7b4 📖 docs: fixed integrations/document loaders toc (#9281)
Fixed navbar:
- renamed several files, so ToC is sorted correctly
- made ToC items consistent: formatted several Titles
- added several links
- reformatted several docs to a consistent format
- renamed several files (removed `_example` suffix)
- added renamed files to the `docs/docs_skeleton/vercel.json`
2023-09-26 09:47:37 -07:00
Bagatur
0ea384d575 fix multiple chains lcel how to (#11074) 2023-09-26 08:39:02 -07:00
Bagatur
12fb393a43 bump 302 (#11070) 2023-09-26 08:13:01 -07:00
Bagatur
097ecef06b refactor web base loader (#11057) 2023-09-26 08:11:31 -07:00
Bagatur
487611521d fix root import (#11072) 2023-09-26 08:11:16 -07:00
Bagatur
a2f7246f0e skip excluded sublinks before recursion (#11036) 2023-09-26 02:24:54 -07:00
William FH
9c5eca92e4 Update notebook deps (#11053) 2023-09-25 22:41:29 -07:00
William FH
448426a6ac Add collab link (#11052) 2023-09-25 22:35:25 -07:00
William FH
4aec587979 Update LangSmith Walkthrough (#11043) 2023-09-25 22:32:56 -07:00
Harrison Chase
bea78b3271 make warnings more modular (#11047) 2023-09-25 20:46:43 -07:00
Harrison Chase
c87e9fb2ce conditional imports (#11017) 2023-09-25 15:46:32 -07:00
Tomaz Bratanic
0625ab7a9e Filtering graph schema for Cypher generation (#10577)
Sometimes you don't want the LLM to be aware of the whole graph schema,
and want it to ignore parts of the graph when it is constructing Cypher
statements.
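A hedged sketch (assuming the chain gained `include_types`/`exclude_types` options, per the description):

```python
from langchain.chains import GraphCypherQAChain
from langchain.chat_models import ChatOpenAI
from langchain.graphs import Neo4jGraph

graph = Neo4jGraph(url="bolt://localhost:7687", username="neo4j", password="pass")
chain = GraphCypherQAChain.from_llm(
    ChatOpenAI(temperature=0),
    graph=graph,
    exclude_types=["Genre"],  # assumed option: hide these node types from the schema
)
print(chain.run("Which actors played in Top Gun?"))
```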
2023-09-25 14:14:15 -07:00
Palau
89ef440c14 Kay retriever (#10657)
- **Description**: Adding retrievers for [kay.ai](https://kay.ai) and
SEC filings powered by Kay and Cybersyn. Kay provides context as a
service: it's an API built for RAG.
- **Issue**: N/A
- **Dependencies**: Just added a dep to the
[kay](https://pypi.org/project/kay/) package
- **Tag maintainer**: @baskaryan @hwchase17 Discussed in slack
- **Twitter handle:** [@vishalrohra_](https://twitter.com/vishalrohra_)
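A hedged retriever sketch (constructor arguments assumed from kay.ai's documentation; requires `pip install kay` and a `KAY_API_KEY`):

```python
from langchain.retrievers import KayAiRetriever

retriever = KayAiRetriever.create(
    dataset_id="company",             # assumed dataset of SEC filings
    data_types=["10-K", "10-Q"],
    num_contexts=3,
)
docs = retriever.get_relevant_documents("What are Airbnb's main risk factors?")
```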

---------

Co-authored-by: Bagatur <baskaryan@gmail.com>
2023-09-25 13:10:13 -07:00
Harrison Chase
5f13668fa0 Harrison/move vectorstore base (#11030) 2023-09-25 12:44:23 -07:00
Bagatur
3eb79580c2 fix langsmith link in docs (#11027) 2023-09-25 12:05:08 -07:00
Jacob Lee
6d072e97c8 Adds GA to docs (#11022)
CC @baskaryan
2023-09-25 11:54:32 -07:00
Eugene Yurtsev
af5390d416 Add a batch size for cleanup (#10948)
Add pagination to indexing cleanup to deal with large numbers of
documents that need to be deleted.
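A hedged sketch (assuming the new knob is a `cleanup_batch_size` argument on `index()`; the docs source and vector store are placeholders):

```python
from langchain.indexes import SQLRecordManager, index

record_manager = SQLRecordManager(
    namespace="my_docs", db_url="sqlite:///records.sql"
)
record_manager.create_schema()

result = index(
    docs_source,               # an iterable of Documents (assumed defined)
    record_manager,
    vectorstore,               # any LangChain vector store (assumed defined)
    cleanup="full",
    source_id_key="source",
    cleanup_batch_size=1_000,  # assumed new arg: delete stale docs in pages
)
```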
2023-09-25 14:52:32 -04:00
Eugene Yurtsev
09486ed188 Update Serializable to use classmethods (#10956) 2023-09-25 18:39:30 +01:00
Taqi Jaffri
b7290f01d8 Batching for hf_pipeline (#10795)
The Hugging Face pipeline in LangChain (used for locally hosted models)
does not support batching. If you send in a batch of prompts, it just
processes them serially using the base implementation of `_generate`:
https://github.com/docugami/langchain/blob/master/libs/langchain/langchain/llms/base.py#L1004C2-L1004C29

This PR adds support for batching in this pipeline, so that GPUs can be
fully saturated. I updated the accompanying notebook to show GPU batch
inference.
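A hedged sketch of the batched pipeline (assuming the new `batch_size` argument on `from_model_id`):

```python
from langchain.llms import HuggingFacePipeline

llm = HuggingFacePipeline.from_model_id(
    model_id="bigscience/bloom-560m",
    task="text-generation",
    device=0,      # put the pipeline on the first GPU
    batch_size=4,  # assumed new arg from this PR: prompts per forward pass
)
# These four prompts can now be batched onto the GPU together.
result = llm.generate(["Say foo:", "Say bar:", "Say baz:", "Say qux:"])
print(result.generations[0][0].text)
```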

---------

Co-authored-by: Taqi Jaffri <tjaffri@docugami.com>
2023-09-25 18:23:11 +01:00
Bagatur
aa6e6db8c7 bump 301 (#11018) 2023-09-25 08:50:47 -07:00
Nuno Campos
956ee981c0 Fix issue where requests wrapper passes auth kwarg twice (#11010)
Closes #8842
2023-09-25 15:45:04 +01:00
Scotty
88a02076af fix ChatMessageChunk concat error (#10174)
- Description: fix `ChatMessageChunk` concat error 
- Issue: #10173 
- Dependencies: None
- Tag maintainer: @baskaryan, @eyurtsev, @rlancemartin
- Twitter handle: None
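A hedged sketch of the failure mode (assuming streamed chunks are merged with `+`, as the streaming path does):

```python
from langchain.schema.messages import ChatMessageChunk

a = ChatMessageChunk(role="assistant", content="Hello, ")
b = ChatMessageChunk(role="assistant", content="world")

# Before this fix, concatenating two ChatMessageChunks raised an error;
# now it should merge them into a single chunk.
print((a + b).content)  # "Hello, world"
```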

---------

Co-authored-by: wangshuai.scotty <wangshuai.scotty@bytedance.com>
Co-authored-by: Nuno Campos <nuno@boringbits.io>
2023-09-25 11:17:11 +01:00
Massimiliano Pronesti
4322b246aa docs: add vLLM chat notebook (#10993)
This PR aims at showcasing how to use vLLM's OpenAI-compatible chat API.

### Context
LangChain already supports vLLM and its OpenAI-compatible `Completion`
API. However, the `ChatCompletion` API was not aligned with OpenAI's, and
for this reason I've waited for this
[PR](https://github.com/vllm-project/vllm/pull/852) to be merged before
adding this notebook to LangChain.
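A hedged sketch of the pattern the notebook demonstrates (pointing the OpenAI chat client at a local vLLM server; the model name and port are assumptions):

```python
from langchain.chat_models import ChatOpenAI
from langchain.schema import HumanMessage

chat = ChatOpenAI(
    model="meta-llama/Llama-2-7b-chat-hf",       # whatever model vLLM is serving
    openai_api_base="http://localhost:8000/v1",  # vLLM's OpenAI-compatible endpoint
    openai_api_key="EMPTY",                      # vLLM ignores the key
)
print(chat([HumanMessage(content="Hi!")]).content)
```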
2023-09-24 18:23:19 -07:00
Naveen Tatikonda
b0f21e2b50 [OpenSearch] Pass ids using from_texts and indexname in add_texts and search (#10969)
### Description
This PR makes the following changes to OpenSearch:
1. Pass optional ids with `from_texts`
2. Pass an optional index name with `add_texts` and `search` instead of
using the same index name that was used during `from_texts`
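A hedged sketch of both changes (argument names assumed from the description):

```python
from langchain.embeddings import OpenAIEmbeddings
from langchain.vectorstores import OpenSearchVectorSearch

docsearch = OpenSearchVectorSearch.from_texts(
    ["hello", "world"],
    OpenAIEmbeddings(),
    opensearch_url="http://localhost:9200",
    ids=["id-1", "id-2"],      # 1. optional ids at ingest time
)
docsearch.add_texts(
    ["another doc"],
    ids=["id-3"],
    index_name="other-index",  # 2. optional per-call index override
)
```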

### Issue
https://github.com/langchain-ai/langchain/issues/10967

### Maintainers
@rlancemartin, @eyurtsev, @navneet1v

Signed-off-by: Naveen Tatikonda <navtat@amazon.com>
2023-09-23 16:12:51 -07:00
deanchanter
f945426874 Resolve GHI 10674 (#10977) 2023-09-23 16:11:52 -07:00
Anar
ff732e10f8 LLMRails Embedding (#10959)
LLMRails Embedding Integration.
This PR provides integration with LLMRails. Implemented here are:

langchain/embeddings/llm_rails.py
docs/extras/integrations/text_embedding/llm_rails.ipynb


Hi @hwchase17, after adding our vectorstore integration to LangChain with
confirmation from you and @baskaryan, we now want to add our embedding
integration.
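A hedged usage sketch (assuming the class is exposed as `LLMRailsEmbeddings` with an API-key parameter):

```python
from langchain.embeddings import LLMRailsEmbeddings

embeddings = LLMRailsEmbeddings(api_key="<llm-rails-key>")  # assumed kwarg
vectors = embeddings.embed_documents(["LangChain integrates many providers."])
print(len(vectors[0]))  # dimensionality of the returned embedding
```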

---------

Co-authored-by: Anar Aliyev <aaliyev@mgmt.cloudnet.services>
Co-authored-by: Bagatur <baskaryan@gmail.com>
2023-09-23 16:11:02 -07:00
Michael Feil
94e31647bd Support for Gradient.ai embedding (#10968)
Adds support for gradient.ai's embedding model.

This will remain a draft, as the code will likely be refactored with the
`pip install gradientai` Python SDK.
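A hedged sketch (class and parameter names assumed; requires a Gradient access token and workspace id):

```python
from langchain.embeddings import GradientEmbeddings

embeddings = GradientEmbeddings(
    model="bge-large",                      # assumed model alias
    gradient_access_token="<token>",
    gradient_workspace_id="<workspace-id>",
)
query_vec = embeddings.embed_query("What is Gradient.ai?")
```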
2023-09-23 16:10:23 -07:00
Bagatur
5fd13c22ad redirect mrkl (#10979) 2023-09-23 16:09:13 -07:00
C.J. Jameson
05d5fcfdf8 fix make-coverage local invocation #10941 (#10974)
Fix the invocation of `make coverage` in `libs/langchain`

Fixes #10941
2023-09-23 16:03:53 -07:00
Bagatur
040d436b3f Add vertex scheduled test (#10958) 2023-09-23 15:51:59 -07:00
Piyush Jain
8602a32b7e Fixes error with providers that don't have model_id (#10966)
## Description
Fixes an error when using the chain with providers that don't have a
`model_id` field.


![image](https://github.com/langchain-ai/langchain/assets/289369/a86074cf-6c99-4390-a135-b3af7a4f0827)
2023-09-23 15:34:28 -07:00
Nuno Campos
7b13292e35 Remove python eval from vector sql db chain (#10937)
2023-09-23 08:51:03 -07:00
Richard Wang
b809c243af Fix bug in index api (#10614)
- **Description:** a fix for `index`.
- **Issue:** Not applicable.
- **Dependencies:** None
- **Tag maintainer:** 
- **Twitter handle:** richarddwang

# Problem
Replication code
```python
from pprint import pprint
from langchain.embeddings import OpenAIEmbeddings
from langchain.indexes import SQLRecordManager, index
from langchain.schema import Document
from langchain.vectorstores import Qdrant
from langchain_setup.qdrant import pprint_qdrant_documents, create_inmemory_empty_qdrant

# Documents
metadata1 = {"source": "fullhell.alchemist"}
doc1_1 = Document(page_content="1-1 I have a dog~", metadata=metadata1)
doc1_2 = Document(page_content="1-2 I have a daugter~", metadata=metadata1)
doc1_3 = Document(page_content="1-3 Ahh! O..Oniichan", metadata=metadata1)
doc2 = Document(page_content="2 Lancer died again.", metadata={"source": "fate.docx"})

# Create empty vectorstore
collection_name = "secret_of_D_disk"
vectorstore: Qdrant = create_inmemory_empty_qdrant()

# Create record Manager
import tempfile
from pathlib import Path

record_manager = SQLRecordManager(
    namespace=f"qdrant/{collection_name}",
    db_url=f"sqlite:///{Path(tempfile.gettempdir())/collection_name}.sql",
)
record_manager.create_schema()  # required

sync_result = index(
    [doc1_1, doc1_2, doc1_2, doc2],
    record_manager,
    vectorstore,
    cleanup="full",
    source_id_key="source",
)
print(sync_result, end="\n\n")
pprint_qdrant_documents(vectorstore)
```
<details>
<summary>Code of helper functions `pprint_qdrant_documents` and
`create_inmemory_empty_qdrant`</summary>

```python
from pprint import pformat  # used by the helpers below

def create_inmemory_empty_qdrant(**from_texts_kwargs):
    # Qdrant requires the vector size, which is only known after applying the embedder
    vectorstore = Qdrant.from_texts(["dummy"], location=":memory:", embedding=OpenAIEmbeddings(), **from_texts_kwargs)
    dummy_document_id = vectorstore.client.scroll(vectorstore.collection_name)[0][0].id
    vectorstore.delete([dummy_document_id])
    return vectorstore

def pprint_qdrant_documents(vectorstore, limit: int = 100, **scroll_kwargs):
    document_ids, documents = [], []
    for record in vectorstore.client.scroll(
        vectorstore.collection_name, limit=limit, **scroll_kwargs
    )[0]:
        document_ids.append(record.id)
        documents.append(
            Document(
                page_content=record.payload["page_content"],
                metadata=record.payload["metadata"] or {},
            )
        )
    pprint_documents(documents, document_ids=document_ids)

def pprint_document(document: Document = None, document_id=None, return_string=False):
    displayed_text = ""
    if document_id:
        displayed_text += f"Document {document_id}:\n\n"
    displayed_text += f"{document.page_content}\n\n"
    metadata_text = pformat(document.metadata, indent=1)
    if "\n" in metadata_text:
        displayed_text += f"Metadata:\n{metadata_text}"
    else:
        displayed_text += f"Metadata:{metadata_text}"

    if return_string:
        return displayed_text
    else:
        print(displayed_text)


def pprint_documents(documents, document_ids=None):
    if not document_ids:
        document_ids = [i + 1 for i in range(len(documents))]

    displayed_texts = []
    for document_id, document in zip(document_ids, documents):
        displayed_text = pprint_document(
            document_id=document_id, document=document, return_string=True
        )
        displayed_texts.append(displayed_text)
    print(f"\n{'-' * 100}\n".join(displayed_texts))
```
</details>
You will get

```
{'num_added': 3, 'num_updated': 0, 'num_skipped': 0, 'num_deleted': 0}

Document 1b19816e-b802-53c0-ad60-5ff9d9b9b911:

1-2 I have a daugter~

Metadata:{'source': 'fullhell.alchemist'}
----------------------------------------------------------------------------------------------------
Document 3362f9bc-991a-5dd5-b465-c564786ce19c:

1-1 I have a dog~

Metadata:{'source': 'fullhell.alchemist'}
----------------------------------------------------------------------------------------------------
Document a4d50169-2fda-5339-a196-249b5f54a0de:

1-2 I have a daugter~

Metadata:{'source': 'fullhell.alchemist'}
```
This is not correct. We should be able to expect that the vectorstore
now includes doc1_1, doc1_2, and doc2, but instead it holds doc1_1 and
two copies of doc1_2.


# Reason
In `index`, the original code is 
```python
uids = []
docs_to_index = []
for doc, hashed_doc, doc_exists in zip(doc_batch, hashed_docs, exists_batch):
    if doc_exists:
        # Must be updated to refresh timestamp.
        record_manager.update([hashed_doc.uid], time_at_least=index_start_dt)
        num_skipped += 1
        continue
    uids.append(hashed_doc.uid)
    docs_to_index.append(doc)
```
In the aforementioned example, `len(doc_batch) == 4`, but
`len(hashed_docs) == len(exists_batch) == 3`. This is because
deduplicating the input documents [doc1_1, doc1_2, doc1_2, doc2] yields
[doc1_1, doc1_2, doc2]. So `index` inserts doc1_1, doc1_2, doc1_2 with
the uids of doc1_1, doc1_2, doc2.
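A hedged sketch of the alignment fix (iterating over the deduplicated hashed documents instead of the raw batch; `to_document()` is assumed to exist on the hashed wrapper):

```python
uids = []
docs_to_index = []
# Iterate over hashed_docs (already deduplicated), not doc_batch,
# so uids and documents stay aligned one-to-one.
for hashed_doc, doc_exists in zip(hashed_docs, exists_batch):
    if doc_exists:
        # Must be updated to refresh timestamp.
        record_manager.update([hashed_doc.uid], time_at_least=index_start_dt)
        num_skipped += 1
        continue
    uids.append(hashed_doc.uid)
    docs_to_index.append(hashed_doc.to_document())
```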

---------

Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
2023-09-22 22:41:07 -04:00
Joshua Sundance Bailey
d67b120a41 Make anthropic_api_key a secret str (#10724)
This PR makes `ChatAnthropic.anthropic_api_key` a `pydantic.SecretStr`
to avoid inadvertently exposing API keys when the `ChatAnthropic` object
is represented as a str.
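A hedged sketch of the `SecretStr` behavior on a minimal, hypothetical model:

```python
from pydantic import BaseModel, SecretStr

class FakeChatAnthropic(BaseModel):  # hypothetical stand-in for ChatAnthropic
    anthropic_api_key: SecretStr

llm = FakeChatAnthropic(anthropic_api_key="sk-ant-secret")
print(llm)  # anthropic_api_key=SecretStr('**********') -- the key is masked
print(llm.anthropic_api_key.get_secret_value())  # real key only on demand
```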
2023-09-22 22:06:20 -04:00
Bagatur
1b65779905 fix integration tests (#10952) 2023-09-22 12:04:38 -07:00
Bagatur
6f781902ae vercel fix (#10951) 2023-09-22 11:31:52 -07:00
Bagatur
f0408c347f llm feat table revision (#10947) 2023-09-22 10:29:12 -07:00
Harrison Chase
9062e36722 Harrison/agents structured (#10911) 2023-09-22 10:21:23 -07:00
C.J. Jameson
b4d2663beb CONTRIBUTING.md Quick Start: focus on langchain core; clarify docs and experimental are separate (#10906)
Follow-up to https://github.com/langchain-ai/langchain/pull/7959,
explaining better that contributors should focus just on langchain core.

No dependencies.

Twitter: @cjcjameson
2023-09-22 10:17:08 -07:00
Michael Landis
f30b4697d4 fix: broken link in libs/langchain README (#10920)
**Description**
Fixes broken link to `CONTRIBUTING.md` in `libs/langchain/README.md`.

Because `libs/langchain/README.md` was copied from the top-level README,
and because the README contains a link to `.github/CONTRIBUTING.md`, the
copied README's relative link path must be updated. This commit fixes
that link.
2023-09-22 10:14:19 -07:00
Bagatur
3cb460d5d8 bump 300 (#10940) 2023-09-22 09:44:47 -07:00
Bagatur
281a332784 table fix (#10944) 2023-09-22 09:37:03 -07:00
Bagatur
5336d87c15 update feat table (#10939) 2023-09-22 09:16:40 -07:00
Nuno Campos
3d5e92e3ef Accept run name arg for non-chain runs (#10935) 2023-09-22 08:41:25 -07:00
Nuno Campos
aac2d4dcef In MergerRetriever async call all retrievers in parallel (#10938) 2023-09-22 08:40:16 -07:00
German Martin
66d5a7e7cf Add async support to multi-query retriever. (#10873)
Added async support to the MultiQueryRetriever class.
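A hedged usage sketch (assuming the async entry point is the standard `aget_relevant_documents`; the vector store is a placeholder):

```python
import asyncio

from langchain.chat_models import ChatOpenAI
from langchain.retrievers.multi_query import MultiQueryRetriever

retriever = MultiQueryRetriever.from_llm(
    retriever=vectorstore.as_retriever(),  # vectorstore assumed defined
    llm=ChatOpenAI(temperature=0),
)
docs = asyncio.run(retriever.aget_relevant_documents("How do agents pick tools?"))
```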

---------

Co-authored-by: Nuno Campos <nuno@boringbits.io>
2023-09-22 08:33:20 -07:00
Greg Richardson
4eee789dd3 Docs: Using SupabaseVectorStore with existing documents (#10907)
## Description
Adds additional docs on how to use `SupabaseVectorStore` with existing
data in your DB (vs inserting new documents each time).
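A hedged sketch of the documented pattern (constructor arguments assumed from the Supabase integration):

```python
from langchain.embeddings import OpenAIEmbeddings
from langchain.vectorstores import SupabaseVectorStore
from supabase.client import create_client

supabase = create_client("<supabase-url>", "<service-key>")

# Wrap rows that already live in the table -- no re-insertion needed.
store = SupabaseVectorStore(
    client=supabase,
    embedding=OpenAIEmbeddings(),
    table_name="documents",
    query_name="match_documents",
)
docs = store.similarity_search("postgres vector search")
```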
2023-09-22 08:18:56 -07:00
Leonid Kuligin
9d4b710a48 small fixes to Vertex (#10934)
Fixed tests, updated the required version of the SDK, and made a few
minor changes after the recent improvement
(https://github.com/langchain-ai/langchain/pull/10910).
2023-09-22 08:18:09 -07:00
wo0d
4e58b78102 Fix chat_history message order (#10869)
Not all databases use `id` as the default order, so add it explicitly.

SQLite uses `rowid` as the default order in a select statement:
[https://www.sqlite.org/lang_createtable.html#rowid](https://www.sqlite.org/lang_createtable.html#rowid),
but other databases, like PostgreSQL, do not behave this way. Since this
class supports multiple DB engines, we should specify an order explicitly.
2023-09-22 11:15:59 -04:00
Roman Shaptala
3d40de75c5 Fix default refine prompt template bug (#10928)
**Description:**

The default refine template does not actually use the refine prompt
template defined above it; it uses a string containing the variable name.
@baskaryan, @eyurtsev, @hwchase17
2023-09-22 11:04:28 -04:00
Bagatur
cab55e9bc1 add vertex prod features (#10910)
- chat vertex async
- vertex stream
- vertex full generation info
- vertex use server-side stopping
- model garden async
- update docs for all the above

in follow-up will add:
- [ ] chat vertex full generation info
- [ ] chat vertex retries
- [ ] scheduled tests
2023-09-22 01:44:09 -07:00
Bagatur
dccc20b402 add model feat table (#10921) 2023-09-22 01:10:27 -07:00
William FH
ee8653f62c Wfh/allow nonparallel (#10914) 2023-09-21 20:21:01 -07:00
Harrison Chase
bb3e6cb427 lcel benefits (#10898) 2023-09-21 14:30:53 -07:00
Leonid Kuligin
95e1d1fae6 fix in the docstring (#10902)
Description: A fix in the documentation on how to use
`GoogleSearchAPIWrapper`.
2023-09-21 14:30:32 -07:00
Bagatur
af41bc84e6 bump 299 (#10904) 2023-09-21 12:56:52 -07:00
Bagatur
9a858a9107 Bagatur/arxiv kwargs (#10903)
support all arXiv api wrapper kwargs in loader
2023-09-21 12:49:56 -07:00
Maksym Diabin
697efd9757 JSONLoader Documentation Fix (#10505)
- Description: Updated the JSONLoader usage documentation, which previously
made the loader unusable as documented.
- Issue: JSONLoader, when used with the documented arguments, was failing on
various JSON documents.
- Dependencies: no dependencies
- Twitter handle: @TheSlnArchitect
2023-09-21 11:37:40 -07:00
niklas
e5f420d2bc Fix typo in URL document loader example (#10585)
- **Description:** Fix typo in URL document loader example
  - **Issue:** N/A
  - **Dependencies:** N/A
  - **Tag maintainer:** not urgent
2023-09-21 11:35:27 -07:00
Nuno Campos
ea26c12b23 Fix Runnable.transform() for false-y inputs (#10893)
---------

Co-authored-by: Bagatur <baskaryan@gmail.com>
2023-09-21 11:27:09 -07:00
Nuno Campos
fcb5aba9f0 Add Runnable.astream_log() (#10374)
---------

Co-authored-by: Bagatur <baskaryan@gmail.com>
2023-09-21 10:19:55 -07:00
Harrison Chase
a1ade48e8f update agent docs (#10894) 2023-09-21 09:09:33 -07:00
Stefano Lottini
40e836c67e added Cassandra caches to the llm_caching notebook doc (#10889)
This adds a section on usage of `CassandraCache` and
`CassandraSemanticCache` to the doc notebook about caching LLMs, as
suggested in [this
comment](https://github.com/langchain-ai/langchain/pull/9772/#issuecomment-1710544100)
on a previous merged PR.

I also spotted what looks like a mismatch between different executions
and propose a fix (line 98).

Being the result of several runs, the cell execution numbers are somewhat
scrambled, so I volunteer to refine this PR by (manually) re-numbering the
cells to restore the appearance of a single, smooth run (for the sake of
orderly execution :)
2023-09-21 08:52:52 -07:00
Bagatur
d37ce48e60 sep base url and loaded url in sub link extraction (#10895) 2023-09-21 08:47:41 -07:00
Bagatur
24cb5cd379 bump 298 (#10892) 2023-09-21 08:26:11 -07:00
Bagatur
c1f9cc0bc5 recursive loader add status check (#10891) 2023-09-21 08:25:43 -07:00
Matvey Arye
6e02c45ca4 Add integration for Timescale Vector(Postgres) (#10650)
**Description:**
This commit adds a vector store for the Postgres-based vector database
(`TimescaleVector`).

Timescale Vector (https://www.timescale.com/ai) is PostgreSQL++ for AI
applications. It enables you to efficiently store and query billions of
vector embeddings in `PostgreSQL`:
- Enhances `pgvector` with faster and more accurate similarity search on
1B+ vectors via a DiskANN-inspired indexing algorithm.
- Enables fast time-based vector search via automatic time-based
partitioning and indexing.
- Provides a familiar SQL interface for querying vector embeddings and
relational data.

Timescale Vector scales with you from POC to production:
- Simplifies operations by enabling you to store relational metadata,
vector embeddings, and time-series data in a single database.
- Benefits from a rock-solid PostgreSQL foundation with enterprise-grade
features like streaming backups and replication, high availability, and
row-level security.
- Enables a worry-free experience with enterprise-grade security and
compliance.

Timescale Vector is available on Timescale, the cloud PostgreSQL
platform. (There is no self-hosted version at this time.) LangChain
users get a 90-day free trial for Timescale Vector.
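A hedged sketch (assuming a `TimescaleVector` store keyed by a service URL, per the description; requires `pip install timescale-vector`):

```python
from langchain.embeddings import OpenAIEmbeddings
from langchain.vectorstores.timescalevector import TimescaleVector

store = TimescaleVector.from_texts(
    ["vector search in postgres"],
    OpenAIEmbeddings(),
    collection_name="my_collection",
    service_url="postgres://user:pass@host:5432/tsdb",  # Timescale service URL
)
docs = store.similarity_search("postgres")
```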

---------

Co-authored-by: Bagatur <baskaryan@gmail.com>
Co-authored-by: Avthar Sewrathan <avthar@timescale.com>
2023-09-21 07:33:37 -07:00
Michael Feil
55570e54e1 gradient.ai LLM intregration (#10800)
- **Description:** This PR implements a new LLM integration for
https://gradient.ai
- **Issue:** Feature request for LLM #10745 
- **Dependencies**: No additional dependencies are introduced. 
- **Tag maintainer:** I am opening this PR for visibility, once ready
for review I'll tag.

- `make format && make lint && make test` is passing.
- Added an integration test and a mocked unit test.
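A hedged sketch (class name and parameters assumed from the gradient.ai integration):

```python
from langchain.llms import GradientLLM

llm = GradientLLM(
    model="<model-id>",                     # a Gradient base or fine-tuned model id
    gradient_access_token="<token>",
    gradient_workspace_id="<workspace-id>",
)
print(llm("What is a good name for a vector database?"))
```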


Co-authored-by: michaelfeil <me@michaelfeil.eu>
Co-authored-by: Bagatur <baskaryan@gmail.com>
2023-09-21 07:29:16 -07:00
Bagatur
5097007407 cleanup recursive url session (#10863) 2023-09-21 07:22:13 -07:00
Harrison Chase
777b33b873 fix experimental imports (#10875) 2023-09-20 23:44:17 -07:00
Harrison Chase
808caca607 beef up agent docs (#10866) 2023-09-20 23:09:58 -07:00
Bagatur
4b558c9e17 update guide imports (#10865) 2023-09-20 17:02:46 -07:00
Sharath Rajasekar
96023f94d9 Add Javelin integration (#10275)
We are introducing the Python integration to the Javelin AI Gateway,
www.getjavelin.io. Javelin is an enterprise-scale, fast LLM router and
gateway. Could you please review and let us know if there is anything
missing?

The Javelin AI Gateway wraps Embedding, Chat, and Completion LLMs. It uses
`javelin_sdk` under the covers (`pip install javelin_sdk`).

Author: Sharath Rajasekar, Twitter: @sharathr, @javelinai

Thanks!!
2023-09-20 16:36:39 -07:00
Bagatur
957956ba6d bump 297 (#10861) 2023-09-20 14:45:49 -07:00
Harrison Chase
1bc3244db9 fix loading of sql chain (#10860)
Closing #6889
2023-09-20 14:37:49 -07:00
Harrison Chase
4074ea4c41 fix databricks docs (#10858) 2023-09-20 14:36:54 -07:00
Bagatur
405ba44d37 more redirects (#10859) 2023-09-20 14:26:51 -07:00
Bagatur
716c925a85 redirect platform to provider (#10857) 2023-09-20 14:17:36 -07:00
Bagatur
b05a74b106 fix recursive loader (#10856) 2023-09-20 13:55:47 -07:00
Bagatur
de0a02f507 fix extract sublink bug (#10855) 2023-09-20 13:30:42 -07:00
Harrison Chase
7dec2d399b format intermediate steps (#10794)
Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>
2023-09-20 13:02:55 -07:00
Harrison Chase
386ef1e654 add agent output parsers (#10790) 2023-09-20 12:10:09 -07:00
Mukit Momin
67c5950df3 Amazon Bedrock Support Streaming (#10393)
### Description

- Add support for streaming with the `Bedrock` LLM and `BedrockChat` chat
model.
- Bedrock as of now supports streaming for the `anthropic.claude-*` and
`amazon.titan-*` models only, hence support for those has been built.
- Also increased the default `max_tokens_to_sample` for the Bedrock
`anthropic` model provider to `256` from `50`, in line with the
`Anthropic` defaults.
- Added examples for streaming responses to the bedrock example
notebooks.

**_NOTE:_** This PR fixes the issues mentioned in #9897 and makes that
PR redundant.
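A hedged streaming sketch (assuming the standard `streaming=True` plus callback pattern; the AWS profile name is a placeholder):

```python
from langchain.callbacks.streaming_stdout import StreamingStdOutCallbackHandler
from langchain.llms import Bedrock

llm = Bedrock(
    model_id="anthropic.claude-v2",
    streaming=True,                                # new with this PR
    callbacks=[StreamingStdOutCallbackHandler()],  # print tokens as they arrive
    credentials_profile_name="default",            # assumed AWS profile
)
llm("Write a haiku about rivers.")
```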
2023-09-20 11:55:38 -07:00
Bagatur
0749a642f5 Stream refac and vertex streaming (#10470)
---------

Co-authored-by: Terry Cruz Melo <tcruz@vozy.co>
Co-authored-by: Terry Cruz Melo <33166112+TerryCM@users.noreply.github.com>
2023-09-20 11:49:16 -07:00
William FH
f421af8b80 Criteria Parser Improvements (#10824) 2023-09-20 11:18:33 -07:00
Bagatur
095f300bf6 add lcel how to index (#10850) 2023-09-20 10:19:43 -07:00
Bagatur
46aa90062b bump exp 19 (#10851) 2023-09-20 10:17:52 -07:00
346 changed files with 19131 additions and 9648 deletions

View File

@@ -14,8 +14,8 @@ Please do not try to push directly to this repo unless you are a maintainer.
Please follow the checked-in pull request template when opening pull requests. Note related issues and tag relevant
maintainers.
Pull requests cannot land without passing the formatting, linting and testing checks first. See
[Common Tasks](#-common-tasks) for how to run these checks locally.
Pull requests cannot land without passing the formatting, linting and testing checks first. See [Testing](#testing) and
[Formatting and Linting](#formatting-and-linting) for how to run these checks locally.
It's essential that we maintain great documentation and testing. If you:
- Fix a bug
@@ -59,43 +59,85 @@ we do not want these to get in the way of getting good code into the codebase.
## 🚀 Quick Start
> **Note:** You can run this repository locally (which is described below) or in a [development container](https://containers.dev/) (which is described in the [.devcontainer folder](https://github.com/hwchase17/langchain/tree/master/.devcontainer)).
This quick start describes running the repository locally.
For a [development container](https://containers.dev/), see the [.devcontainer folder](https://github.com/hwchase17/langchain/tree/master/.devcontainer).
This project uses [Poetry](https://python-poetry.org/) v1.5.1 as a dependency manager. Check out Poetry's [documentation on how to install it](https://python-poetry.org/docs/#installation) on your system before proceeding.
### Dependency Management: Poetry and other env/dependency managers
❗Note: If you use `Conda` or `Pyenv` as your environment/package manager, avoid dependency conflicts by doing the following first:
1. *Before installing Poetry*, create and activate a new Conda env (e.g. `conda create -n langchain python=3.9`)
2. Install Poetry v1.5.1 (see above)
3. Tell Poetry to use the virtualenv python environment (`poetry config virtualenvs.prefer-active-python true`)
4. Continue with the following steps.
This project uses [Poetry](https://python-poetry.org/) v1.5.1+ as a dependency manager.
❗Note: *Before installing Poetry*, if you use `Conda`, create and activate a new Conda env (e.g. `conda create -n langchain python=3.9`)
Install Poetry: **[documentation on how to install it](https://python-poetry.org/docs/#installation)**.
❗Note: If you use `Conda` or `Pyenv` as your environment/package manager, after installing Poetry,
tell Poetry to use the virtualenv python environment (`poetry config virtualenvs.prefer-active-python true`)
### Core vs. Experimental
There are two separate projects in this repository:
- `langchain`: core langchain code, abstractions, and use cases
- `langchain.experimental`: more experimental code
- `langchain.experimental`: see the [Experimental README](../libs/experimental/README.md) for more information.
Each of these has their OWN development environment.
In order to run any of the commands below, please move into their respective directories.
For example, to contribute to `langchain` run `cd libs/langchain` before getting started with the below.
Each of these has their own development environment. Docs are run from the top-level makefile, but development
is split across separate test & release flows.
To install requirements:
For this quickstart, start with langchain core:
```bash
cd libs/langchain
```
### Local Development Dependencies
Install langchain development requirements (for running langchain, running examples, linting, formatting, tests, and coverage):
```bash
poetry install --with test
```
This will install all requirements for running the package, examples, linting, formatting, tests, and coverage.
Then verify dependency installation:
❗Note: If during installation you receive a `WheelFileValidationError` for `debugpy`, please make sure you are running Poetry v1.5.1. This bug was present in older versions of Poetry (e.g. 1.4.1) and has been resolved in newer releases. If you are still seeing this bug on v1.5.1, you may also try disabling "modern installation" (`poetry config installer.modern-installation false`) and re-installing requirements. See [this `debugpy` issue](https://github.com/microsoft/debugpy/issues/1246) for more details.
```bash
make test
```
Now assuming `make` and `pytest` are installed, you should be able to run the common tasks in the following section. To double check, run `make test` under `libs/langchain`, all tests should pass. If they don't, you may need to pip install additional dependencies, such as `numexpr` and `openapi_schema_pydantic`.
If the tests don't pass, you may need to pip install additional dependencies, such as `numexpr` and `openapi_schema_pydantic`.
## ✅ Common Tasks
If during installation you receive a `WheelFileValidationError` for `debugpy`, please make sure you are running
Poetry v1.5.1+. This bug was present in older versions of Poetry (e.g. 1.4.1) and has been resolved in newer releases.
If you are still seeing this bug on v1.5.1, you may also try disabling "modern installation"
(`poetry config installer.modern-installation false`) and re-installing requirements.
See [this `debugpy` issue](https://github.com/microsoft/debugpy/issues/1246) for more details.
Type `make` for a list of common tasks.
### Testing
### Code Formatting
_some test dependencies are optional; see section about optional dependencies_.
Formatting for this project is done via a combination of [Black](https://black.readthedocs.io/en/stable/) and [isort](https://pycqa.github.io/isort/).
Unit tests cover modular logic that does not require calls to outside APIs.
If you add new logic, please add a unit test.
To run unit tests:
```bash
make test
```
To run unit tests in Docker:
```bash
make docker_tests
```
There are also [integration tests and code-coverage](../libs/langchain/tests/README.md) available.
### Formatting and Linting
Run these locally before submitting a PR; the CI system will check also.
#### Code Formatting
Formatting for this project is done via a combination of [Black](https://black.readthedocs.io/en/stable/) and [ruff](https://docs.astral.sh/ruff/rules/).
To run formatting for this project:
@@ -111,9 +153,9 @@ make format_diff
This is especially useful when you have made changes to a subset of the project and want to ensure your changes are properly formatted without affecting the rest of the codebase.
### Linting
#### Linting
Linting for this project is done via a combination of [Black](https://black.readthedocs.io/en/stable/), [isort](https://pycqa.github.io/isort/), [flake8](https://flake8.pycqa.org/en/latest/), and [mypy](http://mypy-lang.org/).
Linting for this project is done via a combination of [Black](https://black.readthedocs.io/en/stable/), [ruff](https://docs.astral.sh/ruff/rules/), and [mypy](http://mypy-lang.org/).
To run linting for this project:
@@ -131,7 +173,7 @@ This can be very helpful when you've made changes to only certain parts of the p
We recognize linting can be annoying - if you do not want to do it, please contact a project maintainer, and they can help you with it. We do not want this to be a blocker for good code getting contributed.
### Spellcheck
#### Spellcheck
Spellchecking for this project is done via [codespell](https://github.com/codespell-project/codespell).
Note that `codespell` finds common typos, so it could have false-positive (correctly spelled but rarely used) and false-negatives (not finding misspelled) words.
@@ -157,17 +199,7 @@ If codespell is incorrectly flagging a word, you can skip spellcheck for that wo
ignore-words-list = 'momento,collison,ned,foor,reworkd,parth,whats,aapply,mysogyny,unsecure'
```
### Coverage
Code coverage (i.e. the amount of code that is covered by unit tests) helps identify areas of the code that are potentially more or less brittle.
To get a report of current coverage, run the following:
```bash
make coverage
```
### Working with Optional Dependencies
## Working with Optional Dependencies
Langchain relies heavily on optional dependencies to keep the Langchain package lightweight.
@@ -192,51 +224,7 @@ To introduce the dependency to the pyproject.toml file correctly, please do the
test makes use of lightweight fixtures to test the logic of the code.
5. Please use the `@pytest.mark.requires(package_name)` decorator for any tests that require the dependency.
### Testing
See section about optional dependencies.
#### Unit Tests
Unit tests cover modular logic that does not require calls to outside APIs.
To run unit tests:
```bash
make test
```
To run unit tests in Docker:
```bash
make docker_tests
```
If you add new logic, please add a unit test.
#### Integration Tests
Integration tests cover logic that requires making calls to outside APIs (often integration with other services).
**warning** Almost no tests should be integration tests.
Tests that require making network connections make it difficult for other
developers to test the code.
Instead favor relying on `responses` library and/or mock.patch to mock
requests using small fixtures.
To run integration tests:
```bash
make integration_tests
```
If you add support for a new external API, please add a new integration test.
### Adding a Jupyter Notebook
## Adding a Jupyter Notebook
If you are adding a Jupyter Notebook example, you'll want to install the optional `dev` dependencies.
@@ -259,6 +247,12 @@ When you run `poetry install`, the `langchain` package is installed as editable
While the code is split between `langchain` and `langchain.experimental`, the documentation is one holistic thing.
This covers how to get started contributing to documentation.
From the top-level of this repo, install documentation dependencies:
```bash
poetry install
```
### Contribute Documentation
The docs directory contains Documentation and API Reference.

View File

@@ -19,4 +19,4 @@ jobs:
run: |
# We should not encourage imports directly from main init file
# Expect for hub
git grep 'from langchain import' docs | grep -vE 'from langchain import (hub)' && exit 1 || exit 0
git grep 'from langchain import' docs/{extras,docs_skeleton,snippets} | grep -vE 'from langchain import (hub)' && exit 1 || exit 0

View File

@@ -34,12 +34,19 @@ jobs:
working-directory: libs/langchain
cache-key: scheduled
- name: 'Authenticate to Google Cloud'
id: 'auth'
uses: 'google-github-actions/auth@v1'
with:
credentials_json: '${{ secrets.GOOGLE_CREDENTIALS }}'
- name: Install dependencies
working-directory: libs/langchain
shell: bash
run: |
echo "Running scheduled tests, installing dependencies with poetry..."
poetry install --with=test_integration
poetry run pip install google-cloud-aiplatform
- name: Run tests
shell: bash

View File

@@ -42,7 +42,8 @@ spell_fix:
######################
help:
@echo '----'
@echo '===================='
@echo '-- DOCUMENTATION --'
@echo 'clean - run docs_clean and api_docs_clean'
@echo 'docs_build - build the documentation'
@echo 'docs_clean - clean the documentation build artifacts'
@@ -51,4 +52,5 @@ help:
@echo 'api_docs_clean - clean the API Reference documentation build artifacts'
@echo 'api_docs_linkcheck - run linkchecker on the API Reference documentation'
@echo 'spell_check - run codespell on the project'
@echo 'spell_fix - run codespell on the project and fix the errors'
@echo 'spell_fix - run codespell on the project and fix the errors'
@echo '-- TEST and LINT tasks are within libs/*/ per-package --'

View File

@@ -0,0 +1,150 @@
import os
from pathlib import Path
from langchain import chat_models, llms
from langchain.chat_models.base import BaseChatModel, SimpleChatModel
from langchain.llms.base import BaseLLM, LLM
INTEGRATIONS_DIR = (
Path(os.path.abspath(__file__)).parents[1] / "extras" / "integrations"
)
LLM_IGNORE = ("FakeListLLM", "OpenAIChat", "PromptLayerOpenAIChat")
LLM_FEAT_TABLE_CORRECTION = {
"TextGen": {"_astream": False, "_agenerate": False},
"Ollama": {
"_stream": False,
},
"PromptLayerOpenAI": {"batch_generate": False, "batch_agenerate": False},
}
CHAT_MODEL_IGNORE = ("FakeListChatModel", "HumanInputChatModel")
CHAT_MODEL_FEAT_TABLE_CORRECTION = {
"ChatMLflowAIGateway": {"_agenerate": False},
"PromptLayerChatOpenAI": {"_stream": False, "_astream": False},
"ChatKonko": {"_astream": False, "_agenerate": False},
}
LLM_TEMPLATE = """\
---
sidebar_position: 0
sidebar_class_name: hidden
---
# LLMs
import DocCardList from "@theme/DocCardList";
## Features (natively supported)
All LLMs implement the Runnable interface, which comes with default implementations of all methods, ie. `ainvoke`, `batch`, `abatch`, `stream`, `astream`. This gives all LLMs basic support for async, streaming and batch, which by default is implemented as below:
- *Async* support defaults to calling the respective sync method in asyncio's default thread pool executor. This lets other async functions in your application make progress while the LLM is being executed, by moving this call to a background thread.
- *Streaming* support defaults to returning an `Iterator` (or `AsyncIterator` in the case of async streaming) of a single value, the final result returned by the underlying LLM provider. This obviously doesn't give you token-by-token streaming, which requires native support from the LLM provider, but ensures your code that expects an iterator of tokens can work for any of our LLM integrations.
- *Batch* support defaults to calling the underlying LLM in parallel for each input by making use of a thread pool executor (in the sync batch case) or `asyncio.gather` (in the async batch case). The concurrency can be controlled with the `max_concurrency` key in `RunnableConfig`.
Each LLM integration can optionally provide native implementations for async, streaming or batch, which, for providers that support it, can be more efficient. The table shows, for each integration, which features have been implemented with native support.
{table}
<DocCardList />
"""
CHAT_MODEL_TEMPLATE = """\
---
sidebar_position: 1
sidebar_class_name: hidden
---
# Chat models
import DocCardList from "@theme/DocCardList";
## Features (natively supported)
All ChatModels implement the Runnable interface, which comes with default implementations of all methods, ie. `ainvoke`, `batch`, `abatch`, `stream`, `astream`. This gives all ChatModels basic support for async, streaming and batch, which by default is implemented as below:
- *Async* support defaults to calling the respective sync method in asyncio's default thread pool executor. This lets other async functions in your application make progress while the ChatModel is being executed, by moving this call to a background thread.
- *Streaming* support defaults to returning an `Iterator` (or `AsyncIterator` in the case of async streaming) of a single value, the final result returned by the underlying ChatModel provider. This obviously doesn't give you token-by-token streaming, which requires native support from the ChatModel provider, but ensures your code that expects an iterator of tokens can work for any of our ChatModel integrations.
- *Batch* support defaults to calling the underlying ChatModel in parallel for each input by making use of a thread pool executor (in the sync batch case) or `asyncio.gather` (in the async batch case). The concurrency can be controlled with the `max_concurrency` key in `RunnableConfig`.
Each ChatModel integration can optionally provide native implementations to truly enable async or streaming.
The table shows, for each integration, which features have been implemented with native support.
{table}
<DocCardList />
"""
def get_llm_table():
llm_feat_table = {}
for cm in llms.__all__:
llm_feat_table[cm] = {}
cls = getattr(llms, cm)
if issubclass(cls, LLM):
for feat in ("_stream", "_astream", ("_acall", "_agenerate")):
if isinstance(feat, tuple):
feat, name = feat
else:
feat, name = feat, feat
llm_feat_table[cm][name] = getattr(cls, feat) != getattr(LLM, feat)
else:
for feat in [
"_stream",
"_astream",
("_generate", "batch_generate"),
"_agenerate",
("_agenerate", "batch_agenerate"),
]:
if isinstance(feat, tuple):
feat, name = feat
else:
feat, name = feat, feat
llm_feat_table[cm][name] = getattr(cls, feat) != getattr(BaseLLM, feat)
final_feats = {
k: v
for k, v in {**llm_feat_table, **LLM_FEAT_TABLE_CORRECTION}.items()
if k not in LLM_IGNORE
}
header = [
"model",
"_agenerate",
"_stream",
"_astream",
"batch_generate",
"batch_agenerate",
]
title = ["Model", "Invoke", "Async invoke", "Stream", "Async stream", "Batch", "Async batch"]
rows = [title, [":-"] + [":-:"] * (len(title) - 1)]
for llm, feats in sorted(final_feats.items()):
rows += [[llm, "✅"] + ["✅" if feats.get(h) else "❌" for h in header[1:]]]
return "\n".join(["|".join(row) for row in rows])
def get_chat_model_table():
feat_table = {}
for cm in chat_models.__all__:
feat_table[cm] = {}
cls = getattr(chat_models, cm)
if issubclass(cls, SimpleChatModel):
comparison_cls = SimpleChatModel
else:
comparison_cls = BaseChatModel
for feat in ("_stream", "_astream", "_agenerate"):
feat_table[cm][feat] = getattr(cls, feat) != getattr(comparison_cls, feat)
final_feats = {
k: v
for k, v in {**feat_table, **CHAT_MODEL_FEAT_TABLE_CORRECTION}.items()
if k not in CHAT_MODEL_IGNORE
}
header = ["model", "_agenerate", "_stream", "_astream"]
title = ["Model", "Invoke", "Async invoke", "Stream", "Async stream"]
rows = [title, [":-"] + [":-:"] * (len(title) - 1)]
for llm, feats in sorted(final_feats.items()):
rows += [[llm, "✅"] + ["✅" if feats.get(h) else "❌" for h in header[1:]]]
return "\n".join(["|".join(row) for row in rows])
if __name__ == "__main__":
llm_page = LLM_TEMPLATE.format(table=get_llm_table())
with open(INTEGRATIONS_DIR / "llms" / "index.mdx", "w") as f:
f.write(llm_page)
chat_model_page = CHAT_MODEL_TEMPLATE.format(table=get_chat_model_table())
with open(INTEGRATIONS_DIR / "chat" / "index.mdx", "w") as f:
f.write(chat_model_page)

View File

@@ -1,7 +1,6 @@
"""Script for auto-generating api_reference.rst."""
import importlib
import inspect
import os
import typing
from pathlib import Path
from typing import TypedDict, Sequence, List, Dict, Literal, Union, Optional
@@ -284,9 +283,12 @@ Functions
def main() -> None:
"""Generate the reference.rst file for each package."""
lc_members = _load_package_modules(PKG_DIR)
# Put tools.render at the top level
# Put some packages at top level
tools = _load_package_modules(PKG_DIR, "tools")
lc_members['tools.render'] = tools['render']
agents = _load_package_modules(PKG_DIR, "agents")
lc_members['agents.output_parsers'] = agents['output_parsers']
lc_members['agents.format_scratchpad'] = agents['format_scratchpad']
lc_doc = ".. _api_reference:\n\n" + _construct_doc("langchain", lc_members)
with open(WRITE_FILE, "w") as f:
f.write(lc_doc)

File diff suppressed because one or more lines are too long

View File

@@ -5,7 +5,23 @@ sidebar_class_name: hidden
# LangChain Expression Language (LCEL)
LangChain Expression Language or LCEL is a declarative way to easily compose chains together.
Any chain constructed this way will automatically have full sync, async, and streaming support.
There are several benefits to writing chains in this manner (as opposed to writing normal code):
**Async, Batch, and Streaming Support**
Any chain constructed this way will automatically have full sync, async, batch, and streaming support.
This makes it easy to prototype a chain in a Jupyter notebook using the sync interface, and then expose it as an async streaming interface.
**Fallbacks**
The non-determinism of LLMs makes it important to be able to handle errors gracefully.
With LCEL you can easily attach fallbacks to any chain.
**Parallelism**
Since LLM applications involve (sometimes long) API calls, it often becomes important to run things in parallel.
With LCEL syntax, any components that can be run in parallel automatically are.
**Seamless LangSmith Tracing Integration**
As your chains get more and more complex, it becomes increasingly important to understand what exactly is happening at every step.
With LCEL, **all** steps are automatically logged to [LangSmith](https://smith.langchain.com) for maximal observability and debuggability.
#### [Interface](/docs/expression_language/interface)
The base interface shared by all LCEL objects

View File

@@ -1,13 +0,0 @@
# Conversational
This walkthrough demonstrates how to use an agent optimized for conversation. Other agents are often optimized for using tools to figure out the best response, which is not ideal in a conversational setting where you may want the agent to be able to chat with the user as well.
import Example from "@snippets/modules/agents/agent_types/conversational_agent.mdx"
<Example/>
import ChatExample from "@snippets/modules/agents/agent_types/chat_conversation_agent.mdx"
## Using a chat model
<ChatExample/>

View File

@@ -2,15 +2,13 @@
sidebar_position: 0
---
# Agent types
## Action agents
# Agent Types
Agents use an LLM to determine which actions to take and in what order.
An action can either be using a tool and observing its output, or returning a response to the user.
Here are the agents available in LangChain.
### [Zero-shot ReAct](/docs/modules/agents/agent_types/react.html)
## [Zero-shot ReAct](/docs/modules/agents/agent_types/react.html)
This agent uses the [ReAct](https://arxiv.org/pdf/2210.03629) framework to determine which tool to use
based solely on the tool's description. Any number of tools can be provided.
@@ -18,33 +16,33 @@ This agent requires that a description is provided for each tool.
**Note**: This is the most general purpose action agent.
### [Structured input ReAct](/docs/modules/agents/agent_types/structured_chat.html)
## [Structured input ReAct](/docs/modules/agents/agent_types/structured_chat.html)
The structured tool chat agent is capable of using multi-input tools.
Older agents are configured to specify an action input as a single string, but this agent can use a tool's argument
schema to create a structured action input. This is useful for more complex tool usage, like precisely
navigating around a browser.
### [OpenAI Functions](/docs/modules/agents/agent_types/openai_functions_agent.html)
## [OpenAI Functions](/docs/modules/agents/agent_types/openai_functions_agent.html)
Certain OpenAI models (like gpt-3.5-turbo-0613 and gpt-4-0613) have been explicitly fine-tuned to detect when a
function should be called and respond with the inputs that should be passed to the function.
The OpenAI Functions Agent is designed to work with these models.
### [Conversational](/docs/modules/agents/agent_types/chat_conversation_agent.html)
## [Conversational](/docs/modules/agents/agent_types/chat_conversation_agent.html)
This agent is designed to be used in conversational settings.
The prompt is designed to make the agent helpful and conversational.
It uses the ReAct framework to decide which tool to use, and uses memory to remember the previous conversation interactions.
### [Self-ask with search](/docs/modules/agents/agent_types/self_ask_with_search.html)
## [Self-ask with search](/docs/modules/agents/agent_types/self_ask_with_search.html)
This agent utilizes a single tool that should be named `Intermediate Answer`.
This tool should be able to look up factual answers to questions. This agent
is equivalent to the original [self-ask with search paper](https://ofir.io/self-ask.pdf),
where a Google search API was provided as the tool.
### [ReAct document store](/docs/modules/agents/agent_types/react_docstore.html)
## [ReAct document store](/docs/modules/agents/agent_types/react_docstore.html)
This agent uses the ReAct framework to interact with a docstore. Two tools must
be provided: a `Search` tool and a `Lookup` tool (they must be named exactly so).
@@ -52,6 +50,3 @@ The `Search` tool should search for a document, while the `Lookup` tool should l
a term in the most recently found document.
This agent is equivalent to the
original [ReAct paper](https://arxiv.org/pdf/2210.03629.pdf), specifically the Wikipedia example.
## [Plan-and-execute agents](/docs/modules/agents/agent_types/plan_and_execute.html)
Plan-and-execute agents accomplish an objective by first planning what to do, then executing the sub tasks. This idea is largely inspired by [BabyAGI](https://github.com/yoheinakajima/babyagi) and then the ["Plan-and-Solve" paper](https://arxiv.org/abs/2305.04091).
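Each of the types above maps to an `AgentType` value that can be passed to `initialize_agent`; a minimal sketch for the zero-shot ReAct case (assuming an OpenAI API key and the built-in `llm-math` tool):

```python
from langchain.agents import AgentType, initialize_agent, load_tools
from langchain.llms import OpenAI

llm = OpenAI(temperature=0)
tools = load_tools(["llm-math"], llm=llm)

# The agent picks a tool based solely on the tool descriptions.
agent = initialize_agent(
    tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True
)
agent.run("What is 3 raised to the 0.5 power?")
```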

View File

@@ -1,11 +0,0 @@
# OpenAI functions
Certain OpenAI models (like gpt-3.5-turbo-0613 and gpt-4-0613) have been fine-tuned to detect when a function should be called and respond with the inputs that should be passed to the function.
In an API call, you can describe functions and have the model intelligently choose to output a JSON object containing arguments to call those functions.
The goal of the OpenAI Function APIs is to more reliably return valid and useful function calls than a generic text completion or chat API.
The OpenAI Functions Agent is designed to work with these models.
import Example from "@snippets/modules/agents/agent_types/openai_functions_agent.mdx";
<Example/>
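A minimal construction sketch (assuming an OpenAI API key; the `llm-math` tool is an arbitrary example):

```python
from langchain.agents import AgentType, initialize_agent, load_tools
from langchain.chat_models import ChatOpenAI

# A function-calling-capable model is required for this agent type.
llm = ChatOpenAI(model="gpt-3.5-turbo-0613", temperature=0)
tools = load_tools(["llm-math"], llm=llm)

agent = initialize_agent(tools, llm, agent=AgentType.OPENAI_FUNCTIONS, verbose=True)
agent.run("What is 7 times 13?")
```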

View File

@@ -1,11 +0,0 @@
# Plan-and-execute
Plan-and-execute agents accomplish an objective by first planning what to do, then executing the sub tasks. This idea is largely inspired by [BabyAGI](https://github.com/yoheinakajima/babyagi) and then the ["Plan-and-Solve" paper](https://arxiv.org/abs/2305.04091).
The planning is almost always done by an LLM.
The execution is usually done by a separate agent (equipped with tools).
import Example from "@snippets/modules/agents/agent_types/plan_and_execute.mdx"
<Example/>
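A minimal sketch of wiring a planner and executor together (assuming the `langchain_experimental` package, an OpenAI API key, and the built-in `llm-math` tool):

```python
from langchain.agents import load_tools
from langchain.chat_models import ChatOpenAI
from langchain_experimental.plan_and_execute import (
    PlanAndExecute,
    load_agent_executor,
    load_chat_planner,
)

model = ChatOpenAI(temperature=0)
tools = load_tools(["llm-math"], llm=model)

planner = load_chat_planner(model)  # the LLM that writes the plan
executor = load_agent_executor(model, tools, verbose=True)  # runs each sub task
agent = PlanAndExecute(planner=planner, executor=executor, verbose=True)

agent.run("What is the cube root of 1728?")
```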

View File

@@ -1,15 +0,0 @@
# ReAct
This walkthrough showcases using an agent to implement the [ReAct](https://react-lm.github.io/) logic.
import Example from "@snippets/modules/agents/agent_types/react.mdx"
<Example/>
## Using chat models
You can also create ReAct agents that use chat models instead of LLMs as the agent driver.
import ChatExample from "@snippets/modules/agents/agent_types/react_chat.mdx"
<ChatExample/>
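A minimal sketch of the chat-model variant (assuming an OpenAI API key and the built-in `llm-math` tool):

```python
from langchain.agents import AgentType, initialize_agent, load_tools
from langchain.chat_models import ChatOpenAI

chat = ChatOpenAI(temperature=0)
tools = load_tools(["llm-math"], llm=chat)

# Same ReAct logic, but driven by a chat model.
agent = initialize_agent(
    tools, chat, agent=AgentType.CHAT_ZERO_SHOT_REACT_DESCRIPTION, verbose=True
)
agent.run("What is (2 + 2) times 10?")
```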

View File

@@ -1,10 +0,0 @@
# Structured tool chat
The structured tool chat agent is capable of using multi-input tools.
Older agents are configured to specify an action input as a single string, but this agent can use the provided tools' `args_schema` to populate the action input.
import Example from "@snippets/modules/agents/agent_types/structured_chat.mdx"
<Example/>
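A minimal sketch with a custom multi-input tool (assuming an OpenAI API key; `multiply` is a hypothetical example tool whose `args_schema` is inferred from its signature):

```python
from langchain.agents import AgentType, initialize_agent
from langchain.chat_models import ChatOpenAI
from langchain.tools import tool

@tool
def multiply(a: int, b: int) -> int:
    """Multiply two integers."""
    return a * b

agent = initialize_agent(
    [multiply],
    ChatOpenAI(temperature=0),
    agent=AgentType.STRUCTURED_CHAT_ZERO_SHOT_REACT_DESCRIPTION,
    verbose=True,
)
agent.run("What is 6 multiplied by 7?")
```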

View File

@@ -7,20 +7,27 @@ The core idea of agents is to use an LLM to choose a sequence of actions to take
In chains, a sequence of actions is hardcoded (in code).
In agents, a language model is used as a reasoning engine to determine which actions to take and in which order.
Some important terminology (and schema) to know:
1. `AgentAction`: This is a dataclass that represents the action an agent should take. It has a `tool` property (which is the name of the tool that should be invoked) and a `tool_input` property (the input to that tool).
2. `AgentFinish`: This is a dataclass that signifies that the agent has finished and should return to the user. It has a `return_values` parameter, which is a dictionary to return. It often only has one key - `output` - that is a string, and so often it is just this key that is returned.
3. `intermediate_steps`: These represent previous agent actions and corresponding outputs that are passed around. These are important to pass to future iterations so the agent knows what work it has already done. This is typed as a `List[Tuple[AgentAction, Any]]`. Note that the observation is currently left as type `Any` to be maximally flexible. In practice, this is often a string.
There are several key components here:
## Agent
This is the class responsible for deciding what step to take next.
This is the chain responsible for deciding what step to take next.
This is powered by a language model and a prompt.
This prompt can include things like:
The inputs to this chain are:
1. The personality of the agent (useful for having it respond in a certain way)
2. Background context for the agent (useful for giving it more context on the types of tasks it's being asked to do)
3. Prompting strategies to invoke better reasoning (the most famous/widely used being [ReAct](https://arxiv.org/abs/2210.03629))
1. List of available tools
2. User input
3. Any previously executed steps (`intermediate_steps`)
LangChain provides a few different types of agents to get started.
Even then, you will likely want to customize those agents with parts (1) and (2).
This chain then returns either the next action to take or the final response to send to the user (`AgentAction` or `AgentFinish`).
Different agents have different prompting styles for reasoning, different ways of encoding input, and different ways of parsing the output.
For a full list of agent types see [agent types](/docs/modules/agents/agent_types/)
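Schematically, `AgentExecutor` consumes these objects in a loop like the sketch below (assuming an OpenAI API key and the built-in `llm-math` tool; a real executor also handles errors, iteration limits, and callbacks):

```python
from typing import Any, List, Tuple

from langchain.agents import AgentType, initialize_agent, load_tools
from langchain.llms import OpenAI
from langchain.schema import AgentAction, AgentFinish

llm = OpenAI(temperature=0)
tools = load_tools(["llm-math"], llm=llm)
executor = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION)

# What AgentExecutor does for you, schematically:
tool_map = {t.name: t for t in tools}
intermediate_steps: List[Tuple[AgentAction, Any]] = []
while True:
    # Ask the agent chain for the next step, given the work done so far.
    output = executor.agent.plan(intermediate_steps, input="What is 2 ** 10?")
    if isinstance(output, AgentFinish):
        print(output.return_values["output"])
        break
    # Otherwise output is an AgentAction: run the named tool on its input.
    observation = tool_map[output.tool].run(output.tool_input)
    intermediate_steps.append((output, observation))
```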
## Tools
@@ -74,12 +81,22 @@ The `AgentExecutor` class is the main agent runtime supported by LangChain.
However, there are other, more experimental runtimes we also support.
These include:
- [Plan-and-execute Agent](/docs/modules/agents/agent_types/plan_and_execute.html)
- [Baby AGI](/docs/use_cases/autonomous_agents/baby_agi.html)
- [Auto GPT](/docs/use_cases/autonomous_agents/autogpt.html)
- [Plan-and-execute Agent](/docs/use_cases/more/agents/autonomous_agents/plan_and_execute)
- [Baby AGI](/docs/use_cases/more/agents/autonomous_agents/baby_agi)
- [Auto GPT](/docs/use_cases/more/agents/autonomous_agents/autogpt)
## Get started
import GetStarted from "@snippets/modules/agents/get_started.mdx"
<GetStarted/>
## Next Steps
Awesome! You've now run your first end-to-end agent.
To dive deeper, you can:
- Check out all the different [agent types](/docs/modules/agents/agent_types/) supported
- Learn all the controls for [AgentExecutor](/docs/modules/agents/how_to/)
- See a full list of all the off-the-shelf [toolkits](/docs/modules/agents/toolkits/) we provide
- Explore all the individual [tools](/docs/modules/agents/tools/) supported

View File

@@ -71,9 +71,9 @@ const config = {
test: /\.ipynb$/,
loader: "raw-loader",
resolve: {
fullySpecified: false
}
}
fullySpecified: false,
},
},
],
},
}),
@@ -158,16 +158,16 @@ const config = {
position: "left",
},
{
type: 'docSidebar',
position: 'left',
sidebarId: 'use_cases',
label: 'Use cases',
type: "docSidebar",
position: "left",
sidebarId: "use_cases",
label: "Use cases",
},
{
type: 'docSidebar',
position: 'left',
sidebarId: 'integrations',
label: 'Integrations',
type: "docSidebar",
position: "left",
sidebarId: "integrations",
label: "Integrations",
},
{
href: "https://api.python.langchain.com",
@@ -187,9 +187,9 @@ const config = {
// Please keep GitHub link to the right for consistency.
{
href: "https://github.com/hwchase17/langchain",
position: 'right',
className: 'header-github-link',
'aria-label': 'GitHub repository',
position: "right",
className: "header-github-link",
"aria-label": "GitHub repository",
},
],
},
@@ -239,6 +239,14 @@ const config = {
copyright: `Copyright © ${new Date().getFullYear()} LangChain, Inc.`,
},
}),
scripts: [
"/js/google_analytics.js",
{
src: "https://www.googletagmanager.com/gtag/js?id=G-9B66JQQH2F",
async: true,
},
],
};
module.exports = config;

View File

@@ -99,8 +99,8 @@ module.exports = {
label: "Components",
collapsible: false,
items: [
{ type: "category", label: "LLMs", collapsed: true, items: [{type:"autogenerated", dirName: "integrations/llms" }], link: {type: "generated-index", slug: "integrations/llms" }},
{ type: "category", label: "Chat models", collapsed: true, items: [{type:"autogenerated", dirName: "integrations/chat" }], link: {type: "generated-index", slug: "integrations/chat" }},
{ type: "category", label: "LLMs", collapsed: true, items: [{type:"autogenerated", dirName: "integrations/llms" }], link: { type: 'doc', id: "integrations/llms/index"}},
{ type: "category", label: "Chat models", collapsed: true, items: [{type:"autogenerated", dirName: "integrations/chat" }], link: { type: 'doc', id: "integrations/chat/index"}},
{ type: "category", label: "Document loaders", collapsed: true, items: [{type:"autogenerated", dirName: "integrations/document_loaders" }], link: {type: "generated-index", slug: "integrations/document_loaders" }},
{ type: "category", label: "Document transformers", collapsed: true, items: [{type: "autogenerated", dirName: "integrations/document_transformers" }], link: {type: "generated-index", slug: "integrations/document_transformers" }},
{ type: "category", label: "Text embedding models", collapsed: true, items: [{type: "autogenerated", dirName: "integrations/text_embedding" }], link: {type: "generated-index", slug: "integrations/text_embedding" }},

View File

@@ -0,0 +1,7 @@
window.dataLayer = window.dataLayer || [];
function gtag() {
dataLayer.push(arguments);
}
gtag("js", new Date());
gtag("config", "G-9B66JQQH2F");

View File

@@ -1,72 +1,92 @@
{
"redirects": [
{
"source": "/docs/modules/agents/agents/examples/mrkl_chat(.html?)",
"destination": "/docs/modules/agents/"
},
{
"source": "/docs/use_cases(/?)",
"destination": "/docs/use_cases/question_answering/"
},
{
"source": "/docs/integrations(/?)",
"destination": "/docs/integrations/providers/"
},
{
"source": "/docs/integrations/platforms(/?)",
"destination": "/docs/integrations/providers/"
},
{
"source": "/docs/expression_language/cookbook/routing",
"destination": "/docs/expression_language/how_to/routing"
},
{
"source": "/docs/integrations/providers/amazon_api_gateway",
"destination": "/docs/integrations/platform/aws"
"destination": "/docs/integrations/platforms/aws"
},
{
"source": "/docs/integrations/providers/azure_blob_storage",
"destination": "/docs/integrations/platform/microsoft"
"destination": "/docs/integrations/platforms/microsoft"
},
{
"source": "/docs/integrations/providers/google_vertexai_matchingengine",
"destination": "/docs/integrations/platform/google"
"destination": "/docs/integrations/platforms/google"
},
{
"source": "/docs/integrations/providers/aws_s3",
"destination": "/docs/integrations/platform/aws"
"destination": "/docs/integrations/platforms/aws"
},
{
"source": "/docs/integrations/providers/azure_openai",
"destination": "/docs/integrations/platform/microsoft"
"destination": "/docs/integrations/platforms/microsoft"
},
{
"source": "/docs/integrations/providers/azure_blob_storage",
"destination": "/docs/integrations/platform/microsoft"
"destination": "/docs/integrations/platforms/microsoft"
},
{
"source": "/docs/integrations/providers/azure_cognitive_search_",
"destination": "/docs/integrations/platform/microsoft"
"destination": "/docs/integrations/platforms/microsoft"
},
{
"source": "/docs/integrations/providers/bedrock",
"destination": "/docs/integrations/platform/aws"
"destination": "/docs/integrations/platforms/aws"
},
{
"source": "/docs/integrations/providers/google_bigquery",
"destination": "/docs/integrations/platform/google"
"destination": "/docs/integrations/platforms/google"
},
{
"source": "/docs/integrations/providers/google_cloud_storage",
"destination": "/docs/integrations/platform/google"
"destination": "/docs/integrations/platforms/google"
},
{
"source": "/docs/integrations/providers/google_drive",
"destination": "/docs/integrations/platform/google"
"destination": "/docs/integrations/platforms/google"
},
{
"source": "/docs/integrations/providers/google_search",
"destination": "/docs/integrations/platform/google"
"destination": "/docs/integrations/platforms/google"
},
{
"source": "/docs/integrations/providers/microsoft_onedrive",
"destination": "/docs/integrations/platform/microsoft"
"destination": "/docs/integrations/platforms/microsoft"
},
{
"source": "/docs/integrations/providers/microsoft_powerpoint",
"destination": "/docs/integrations/platform/microsoft"
"destination": "/docs/integrations/platforms/microsoft"
},
{
"source": "/docs/integrations/providers/microsoft_word",
"destination": "/docs/integrations/platform/microsoft"
"destination": "/docs/integrations/platforms/microsoft"
},
{
"source": "/docs/integrations/providers/sagemaker_endpoint",
"destination": "/docs/integrations/platform/aws"
"destination": "/docs/integrations/platforms/aws"
},
{
"source": "/docs/integrations/providers/sagemaker_tracking",
@@ -74,7 +94,7 @@
},
{
"source": "/docs/integrations/providers/openai",
"destination": "/docs/integrations/callbacks/openai"
"destination": "/docs/integrations/platforms/openai"
},
{
"source": "/docs/modules/data_connection/caching_embeddings(/?)",
@@ -438,7 +458,7 @@
},
{
"source": "/docs/integrations/openai",
"destination": "/docs/integrations/providers/openai"
"destination": "/docs/integrations/platforms/openai"
},
{
"source": "/docs/integrations/opensearch",
@@ -1952,6 +1972,18 @@
"source": "/docs/modules/data_connection/document_loaders/integrations/youtube_transcript",
"destination": "/docs/integrations/document_loaders/youtube_transcript"
},
{
"source": "/docs/integrations/document_loaders/Etherscan",
"destination": "/docs/integrations/document_loaders/etherscan"
},
{
"source": "/docs/integrations/document_loaders/merge_doc_loader",
"destination": "/docs/integrations/document_loaders/merge_doc"
},
{
"source": "/docs/integrations/document_loaders/recursive_url_loader",
"destination": "/docs/integrations/document_loaders/recursive_url"
},
{
"source": "/en/latest/modules/indexes/text_splitters/examples/markdown_header_metadata.html",
"destination": "/docs/modules/data_connection/document_transformers/text_splitters/markdown_header_metadata"

View File

@@ -95,7 +95,7 @@
}
],
"source": [
"question_generator.invoke({\"warm\"})"
"question_generator.invoke(\"warm\")"
]
},
{
@@ -116,7 +116,7 @@
}
],
"source": [
"prompt = question_generator.invoke({\"warm\"})\n",
"prompt = question_generator.invoke(\"warm\")\n",
"model.invoke(prompt)"
]
},

View File

@@ -1,2 +0,0 @@
label: 'How to'
position: 1

View File

@@ -277,7 +277,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.1"
"version": "3.10.1"
}
},
"nbformat": 4,

View File

@@ -14,12 +14,15 @@
},
{
"cell_type": "code",
"execution_count": 77,
"execution_count": 4,
"id": "6bb221b3",
"metadata": {},
"outputs": [],
"source": [
"from langchain.schema.runnable import RunnableLambda\n",
"from langchain.prompts import ChatPromptTemplate\n",
"from langchain.chat_models import ChatOpenAI\n",
"from operator import itemgetter\n",
"\n",
"def length_function(text):\n",
" return len(text)\n",
@@ -31,6 +34,7 @@
" return _multiple_length_function(_dict[\"text1\"], _dict[\"text2\"])\n",
"\n",
"prompt = ChatPromptTemplate.from_template(\"what is {a} + {b}\")\n",
"model = ChatOpenAI()\n",
"\n",
"chain1 = prompt | model\n",
"\n",
@@ -42,7 +46,7 @@
},
{
"cell_type": "code",
"execution_count": 78,
"execution_count": 5,
"id": "5488ec85",
"metadata": {},
"outputs": [
@@ -52,7 +56,7 @@
"AIMessage(content='3 + 9 equals 12.', additional_kwargs={}, example=False)"
]
},
"execution_count": 78,
"execution_count": 5,
"metadata": {},
"output_type": "execute_result"
}
@@ -73,17 +77,18 @@
},
{
"cell_type": "code",
"execution_count": 139,
"execution_count": 9,
"id": "80b3b5f6-5d58-44b9-807e-cce9a46bf49f",
"metadata": {},
"outputs": [],
"source": [
"from langchain.schema.runnable import RunnableConfig"
"from langchain.schema.runnable import RunnableConfig\n",
"from langchain.schema.output_parser import StrOutputParser"
]
},
{
"cell_type": "code",
"execution_count": 149,
"execution_count": 10,
"id": "ff0daf0c-49dd-4d21-9772-e5fa133c5f36",
"metadata": {},
"outputs": [],
@@ -109,7 +114,7 @@
},
{
"cell_type": "code",
"execution_count": 152,
"execution_count": 12,
"id": "1a5e709e-9d75-48c7-bb9c-503251990505",
"metadata": {},
"outputs": [
@@ -132,6 +137,14 @@
" RunnableLambda(parse_or_fix).invoke(\"{foo: bar}\", {\"tags\": [\"my-tag\"], \"callbacks\": [cb]})\n",
" print(cb)"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "29f55c38",
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
@@ -150,7 +163,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.1"
"version": "3.10.1"
}
},
"nbformat": 4,

View File

@@ -0,0 +1,9 @@
---
sidebar_position: 1
---
# How to
import DocCardList from "@theme/DocCardList";
<DocCardList />

View File

@@ -12,18 +12,18 @@
},
{
"cell_type": "code",
"execution_count": 5,
"execution_count": 2,
"id": "7e1873d6-d4b6-43ac-96a1-edcf178201e0",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"{'joke': AIMessage(content=\"Why don't bears wear shoes? \\nBecause they have bear feet!\", additional_kwargs={}, example=False),\n",
" 'poem': AIMessage(content=\"In twilight's embrace, a bear's gentle lumber,\\nSilent strength, nature's awe, a humble slumber.\", additional_kwargs={}, example=False)}"
"{'joke': AIMessage(content=\"Why don't bears wear shoes? \\n\\nBecause they have bear feet!\", additional_kwargs={}, example=False),\n",
" 'poem': AIMessage(content=\"In woodland depths, bear prowls with might,\\nSilent strength, nature's sovereign, day and night.\", additional_kwargs={}, example=False)}"
]
},
"execution_count": 5,
"execution_count": 2,
"metadata": {},
"output_type": "execute_result"
}
@@ -38,7 +38,7 @@
"joke_chain = ChatPromptTemplate.from_template(\"tell me a joke about {topic}\") | model\n",
"poem_chain = ChatPromptTemplate.from_template(\"write a 2-line poem about {topic}\") | model\n",
"\n",
"map_chain = RunnableMap({\"joke\": chain1, \"poem\": chain2,})\n",
"map_chain = RunnableMap({\"joke\": joke_chain, \"poem\": poem_chain,})\n",
"\n",
"map_chain.invoke({\"topic\": \"bear\"})"
]
@@ -54,7 +54,7 @@
},
{
"cell_type": "code",
"execution_count": 4,
"execution_count": 3,
"id": "267d1460-53c1-4fdb-b2c3-b6a1eb7fccff",
"metadata": {},
"outputs": [
@@ -64,7 +64,7 @@
"'Harrison worked at Kensho.'"
]
},
"execution_count": 4,
"execution_count": 3,
"metadata": {},
"output_type": "execute_result"
}
@@ -191,7 +191,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.1"
"version": "3.10.1"
}
},
"nbformat": 4,

View File

@@ -47,13 +47,13 @@ A minimal example on how to deploy LangChain to [Kinsta](https://kinsta.com) usi
A minimal example of how to deploy LangChain to [Fly.io](https://fly.io/) using Flask.
## [Digitalocean App Platform](https://github.com/homanp/digitalocean-langchain)
## [DigitalOcean App Platform](https://github.com/homanp/digitalocean-langchain)
A minimal example of how to deploy LangChain to DigitalOcean App Platform.
## [CI/CD Google Cloud Build + Dockerfile + Serverless Google Cloud Run](https://github.com/g-emarco/github-assistant)
Boilerplate LangChain project on how to deploy to Google Cloud Run using Docker with Cloud Build CI/CD pipeline
Boilerplate LangChain project on how to deploy to Google Cloud Run using Docker with Cloud Build CI/CD pipeline.
## [Google Cloud Run](https://github.com/homanp/gcp-langchain)

Binary file not shown.

After

Width:  |  Height:  |  Size: 766 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 815 KiB

File diff suppressed because it is too large Load Diff

View File

@@ -22,7 +22,7 @@
},
{
"cell_type": "code",
"execution_count": 1,
"execution_count": null,
"id": "d4a7c55d-b235-4ca4-a579-c90cc9570da9",
"metadata": {
"tags": []
@@ -73,13 +73,46 @@
"chat(messages)"
]
},
{
"attachments": {},
"cell_type": "markdown",
"id": "a4a4f4d4",
"metadata": {},
"source": [
"### For BedrockChat with Streaming"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "c253883f",
"metadata": {},
"outputs": [],
"source": []
"source": [
"from langchain.callbacks.streaming_stdout import StreamingStdOutCallbackHandler\n",
"\n",
"chat = BedrockChat(\n",
" model_id=\"anthropic.claude-v2\",\n",
" streaming=True,\n",
" callbacks=[StreamingStdOutCallbackHandler()],\n",
" model_kwargs={\"temperature\": 0.1},\n",
")"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "d9e52838",
"metadata": {},
"outputs": [],
"source": [
"messages = [\n",
" HumanMessage(\n",
" content=\"Translate this sentence from English to French. I love programming.\"\n",
" )\n",
"]\n",
"chat(messages)"
]
}
],
"metadata": {
@@ -98,7 +131,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.4"
"version": "3.10.9"
}
},
"nbformat": 4,

View File

@@ -0,0 +1,255 @@
{
"cells": [
{
"attachments": {},
"cell_type": "markdown",
"id": "642fd21c-600a-47a1-be96-6e1438b421a9",
"metadata": {},
"source": [
"# ChatFireworks\n",
"\n",
">[Fireworks](https://app.fireworks.ai/) accelerates product development on generative AI by creating an innovative AI experiment and production platform. \n",
"\n",
"This example goes over how to use LangChain to interact with `ChatFireworks` models."
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "d00d850917865298",
"metadata": {
"collapsed": false,
"jupyter": {
"outputs_hidden": false
}
},
"outputs": [],
"source": [
"from langchain.chat_models.fireworks import ChatFireworks\n",
"from langchain.schema import SystemMessage, HumanMessage\n",
"import os"
]
},
{
"cell_type": "markdown",
"id": "f28ebf8b-f14f-46c7-9962-8b8dc42e31be",
"metadata": {},
"source": [
"# Setup\n",
"Contact Fireworks AI for the an API Key to access our models\n",
"\n",
"Set up your model using a model id. If the model is not set, the default model is fireworks-llama-v2-7b-chat."
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "d096fb14-8acc-4047-9cd0-c842430c3a1d",
"metadata": {},
"outputs": [],
"source": [
"# Initialize a Fireworks Chat model\n",
"os.environ['FIREWORKS_API_KEY'] = \"<your_api_key>\" # Change this to your own API key\n",
"chat = ChatFireworks(model=\"accounts/fireworks/models/llama-v2-13b-chat\")"
]
},
{
"cell_type": "markdown",
"id": "d8f13144-37cf-47a5-b5a0-e3cdf76d9a72",
"metadata": {},
"source": [
"# Calling the Model\n",
"\n",
"You can use the LLMs to call the model for specified message(s). \n",
"\n",
"See the full, most up-to-date model list on [app.fireworks.ai](https://app.fireworks.ai)."
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "72340871-ae2f-415f-b399-0777d32dc379",
"metadata": {},
"outputs": [],
"source": [
"# ChatFireworks Wrapper\n",
"system_message = SystemMessage(content=\"You are to chat with the user.\")\n",
"human_message = HumanMessage(content=\"Who are you?\")\n",
"response = chat([system_message, human_message])"
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "2d6ef879-69e3-422b-8379-bb980b70fe55",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content=\"Hello! My name is LLaMA, I'm a large language model trained by a team of researcher at Meta AI. My primary function is to assist users with tasks and answer questions to the best of my ability. I am capable of understanding and responding to natural language input, and I am here to help you with any questions or tasks you may have. Is there anything specific you would like to know or discuss?\", additional_kwargs={}, example=False)"
]
},
"execution_count": 4,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"response"
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "68c6b1fa-2ff7-4a63-8d88-3cec302180b8",
"metadata": {},
"outputs": [],
"source": [
"# Setting additional parameters: temperature, max_tokens, top_p\n",
"chat = ChatFireworks(model=\"accounts/fireworks/models/llama-v2-13b-chat\", model_kwargs={\"temperature\":1, \"max_tokens\": 20, \"top_p\": 1})\n",
"system_message = SystemMessage(content=\"You are to chat with the user.\")\n",
"human_message = HumanMessage(content=\"How's the weather today?\")\n",
"response = chat([system_message, human_message])"
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "a09025f8-e4c3-4005-a8fc-c9c774b03a64",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content=\"Oh, you know, it's just another beautiful day in the virtual world! The sun\", additional_kwargs={}, example=False)"
]
},
"execution_count": 6,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"response"
]
},
{
"cell_type": "markdown",
"id": "d93aa186-39cf-4e1a-aa32-01ed31d43bc8",
"metadata": {},
"source": [
"# ChatFireworks Wrapper with generate"
]
},
{
"cell_type": "code",
"execution_count": 7,
"id": "cbe29efc-37c3-4c83-8b84-b8bba1a1e589",
"metadata": {},
"outputs": [],
"source": [
"chat = ChatFireworks()\n",
"message = HumanMessage(content=\"Hello\")\n",
"response = chat.generate([[message], [message]])"
]
},
{
"cell_type": "code",
"execution_count": 8,
"id": "35109f36-9519-47a6-a223-25639123e836",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"LLMResult(generations=[[ChatGeneration(text=\"Hello! It's nice to meet you. I'm here to help answer any questions you may have, while being respectful and safe. Please feel free to ask me anything, and I will do my best to provide helpful and positive responses. Is there something specific you would like to know or discuss?\", generation_info={'finish_reason': 'stop'}, message=AIMessage(content=\"Hello! It's nice to meet you. I'm here to help answer any questions you may have, while being respectful and safe. Please feel free to ask me anything, and I will do my best to provide helpful and positive responses. Is there something specific you would like to know or discuss?\", additional_kwargs={}, example=False))], [ChatGeneration(text=\"Hello! *smiling* I'm here to help you with any questions or concerns you may have. Please feel free to ask me anything, and I will do my best to provide helpful, respectful, and honest responses. I'm programmed to avoid any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content, and to provide socially unbiased and positive responses. Is there anything specific you would like to talk about or ask?\", generation_info={'finish_reason': 'stop'}, message=AIMessage(content=\"Hello! *smiling* I'm here to help you with any questions or concerns you may have. Please feel free to ask me anything, and I will do my best to provide helpful, respectful, and honest responses. I'm programmed to avoid any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content, and to provide socially unbiased and positive responses. Is there anything specific you would like to talk about or ask?\", additional_kwargs={}, example=False))]], llm_output={'model': 'accounts/fireworks/models/llama-v2-7b-chat'}, run=[RunInfo(run_id=UUID('f137463e-e1c7-454a-8b85-b999ce20e0f2')), RunInfo(run_id=UUID('f3ef1138-92de-4e01-900b-991e34a647a7'))])"
]
},
"execution_count": 8,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"response"
]
},
{
"cell_type": "markdown",
"id": "92c2cabb-9eaf-4c49-b0e5-a5de5a7d920e",
"metadata": {},
"source": [
"# ChatFireworks Wrapper with stream"
]
},
{
"cell_type": "code",
"execution_count": 9,
"id": "12717a29-fb7d-4a4d-860b-40435452b065",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"Hello! I'm just\n",
" an AI assistant,\n",
" here to help answer your\n",
" questions and provide information in\n",
" a responsible and respectful manner\n",
". I'm not able\n",
" to access personal information or provide\n",
" any content that could be considered\n",
" harmful, uneth\n",
"ical, racist, sex\n",
"ist, toxic, dangerous\n",
", or illegal. My purpose\n",
" is to assist and provide helpful\n",
" responses that are socially un\n",
"biased and positive in nature\n",
". Is there something specific you\n",
" would like to know or discuss\n",
"?\n"
]
}
],
"source": [
"llm = ChatFireworks()\n",
"\n",
"for token in llm.stream(\"Who are you\"):\n",
" print(token.content)"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "02991e05-a38e-47d4-9ab3-7e630a8ead55",
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.4"
}
},
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -5,7 +5,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"# Google Cloud Platform Vertex AI PaLM \n",
"# GCP Vertex AI \n",
"\n",
"Note: This is seperate from the Google PaLM integration. Google has chosen to offer an enterprise version of PaLM through GCP, and this supports the models made available through there. \n",
"\n",
@@ -31,7 +31,7 @@
},
"outputs": [],
"source": [
"#!pip install google-cloud-aiplatform"
"#!pip install langchain google-cloud-aiplatform"
]
},
{
@@ -41,12 +41,7 @@
"outputs": [],
"source": [
"from langchain.chat_models import ChatVertexAI\n",
"from langchain.prompts.chat import (\n",
" ChatPromptTemplate,\n",
" SystemMessagePromptTemplate,\n",
" HumanMessagePromptTemplate,\n",
")\n",
"from langchain.schema import HumanMessage, SystemMessage"
"from langchain.prompts import ChatPromptTemplate"
]
},
{
@@ -60,82 +55,78 @@
},
{
"cell_type": "code",
"execution_count": 4,
"execution_count": 34,
"metadata": {},
"outputs": [],
"source": [
"system = \"You are a helpful assistant who translate English to French\"\n",
"human = \"Translate this sentence from English to French. I love programming.\"\n",
"prompt = ChatPromptTemplate.from_messages(\n",
" [(\"system\", system), (\"human\", human)]\n",
")\n",
"messages = prompt.format_messages()"
]
},
{
"cell_type": "code",
"execution_count": 9,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content='Sure, here is the translation of the sentence \"I love programming\" from English to French:\\n\\nJ\\'aime programmer.', additional_kwargs={}, example=False)"
"AIMessage(content=\" J'aime la programmation.\", additional_kwargs={}, example=False)"
]
},
"execution_count": 4,
"execution_count": 9,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"messages = [\n",
" SystemMessage(\n",
" content=\"You are a helpful assistant that translates English to French.\"\n",
" ),\n",
" HumanMessage(\n",
" content=\"Translate this sentence from English to French. I love programming.\"\n",
" ),\n",
"]\n",
"chat(messages)"
]
},
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"You can make use of templating by using a `MessagePromptTemplate`. You can build a `ChatPromptTemplate` from one or more `MessagePromptTemplates`. You can use `ChatPromptTemplate`'s `format_prompt` -- this returns a `PromptValue`, which you can convert to a string or Message object, depending on whether you want to use the formatted value as input to an llm or chat model.\n",
"\n",
"For convenience, there is a `from_template` method exposed on the template. If you were to use this template, this is what it would look like:"
"If we want to construct a simple chain that takes user specified parameters:"
]
},
{
"cell_type": "code",
"execution_count": 6,
"execution_count": 12,
"metadata": {},
"outputs": [],
"source": [
"template = (\n",
" \"You are a helpful assistant that translates {input_language} to {output_language}.\"\n",
")\n",
"system_message_prompt = SystemMessagePromptTemplate.from_template(template)\n",
"human_template = \"{text}\"\n",
"human_message_prompt = HumanMessagePromptTemplate.from_template(human_template)"
"system = \"You are a helpful assistant that translates {input_language} to {output_language}.\"\n",
"human = \"{text}\"\n",
"prompt = ChatPromptTemplate.from_messages(\n",
" [(\"system\", system), (\"human\", human)]\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 7,
"execution_count": 13,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content='Sure, here is the translation of \"I love programming\" in French:\\n\\nJ\\'aime programmer.', additional_kwargs={}, example=False)"
"AIMessage(content=' 私はプログラミングが大好きです。', additional_kwargs={}, example=False)"
]
},
"execution_count": 7,
"execution_count": 13,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"chat_prompt = ChatPromptTemplate.from_messages(\n",
" [system_message_prompt, human_message_prompt]\n",
")\n",
"\n",
"# get a chat completion from the formatted messages\n",
"chat(\n",
" chat_prompt.format_prompt(\n",
" input_language=\"English\", output_language=\"French\", text=\"I love programming.\"\n",
" ).to_messages()\n",
"chain = prompt | chat\n",
"chain.invoke(\n",
" {\"input_language\": \"English\", \"output_language\": \"Japanese\", \"text\": \"I love programming\"}\n",
")"
]
},
@@ -153,60 +144,129 @@
"tags": []
},
"source": [
"## Code generation chat models\n",
"You can now leverage the Codey API for code chat within Vertex AI. The model name is:\n",
"- codechat-bison: for code assistance"
]
},
{
"cell_type": "code",
"execution_count": 3,
"execution_count": 18,
"metadata": {
"execution": {
"iopub.execute_input": "2023-06-17T21:30:43.974841Z",
"iopub.status.busy": "2023-06-17T21:30:43.974431Z",
"iopub.status.idle": "2023-06-17T21:30:44.248119Z",
"shell.execute_reply": "2023-06-17T21:30:44.247362Z",
"shell.execute_reply.started": "2023-06-17T21:30:43.974820Z"
},
"tags": []
},
"outputs": [],
"source": [
"chat = ChatVertexAI(model_name=\"codechat-bison\")"
"chat = ChatVertexAI(\n",
" model_name=\"codechat-bison\",\n",
" max_output_tokens=1000,\n",
" temperature=0.5\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 4,
"execution_count": 20,
"metadata": {
"execution": {
"iopub.execute_input": "2023-06-17T21:30:45.146093Z",
"iopub.status.busy": "2023-06-17T21:30:45.145752Z",
"iopub.status.idle": "2023-06-17T21:30:47.449126Z",
"shell.execute_reply": "2023-06-17T21:30:47.448609Z",
"shell.execute_reply.started": "2023-06-17T21:30:45.146069Z"
},
"tags": []
},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
" ```python\n",
"def is_prime(x): \n",
" if (x <= 1): \n",
" return False\n",
" for i in range(2, x): \n",
" if (x % i == 0): \n",
" return False\n",
" return True\n",
"```\n"
]
}
],
"source": [
"# For simple string in string out usage, we can use the `predict` method:\n",
"print(chat.predict(\"Write a Python function to identify all prime numbers\"))"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Asynchronous calls\n",
"\n",
"We can make asynchronous calls via the `agenerate` and `ainvoke` methods."
]
},
{
"cell_type": "code",
"execution_count": 23,
"metadata": {},
"outputs": [],
"source": [
"import asyncio\n",
"# import nest_asyncio\n",
"# nest_asyncio.apply()"
]
},
{
"cell_type": "code",
"execution_count": 35,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content='The following Python function can be used to identify all prime numbers up to a given integer:\\n\\n```\\ndef is_prime(n):\\n \"\"\"\\n Determines whether the given integer is prime.\\n\\n Args:\\n n: The integer to be tested for primality.\\n\\n Returns:\\n True if n is prime, False otherwise.\\n \"\"\"\\n\\n # Check if n is divisible by 2.\\n if n % 2 == 0:\\n return False\\n\\n # Check if n is divisible by any integer from 3 to the square root', additional_kwargs={}, example=False)"
"LLMResult(generations=[[ChatGeneration(text=\" J'aime la programmation.\", generation_info=None, message=AIMessage(content=\" J'aime la programmation.\", additional_kwargs={}, example=False))]], llm_output={}, run=[RunInfo(run_id=UUID('223599ef-38f8-4c79-ac6d-a5013060eb9d'))])"
]
},
"execution_count": 4,
"execution_count": 35,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"messages = [\n",
" HumanMessage(\n",
" content=\"How do I create a python function to identify all prime numbers?\"\n",
" )\n",
"]\n",
"chat(messages)"
"chat = ChatVertexAI(\n",
" model_name=\"chat-bison\",\n",
" max_output_tokens=1000,\n",
" temperature=0.7,\n",
" top_p=0.95,\n",
" top_k=40,\n",
")\n",
"\n",
"asyncio.run(chat.agenerate([messages]))"
]
},
{
"cell_type": "code",
"execution_count": 36,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content=' अहं प्रोग्रामिंग प्रेमामि', additional_kwargs={}, example=False)"
]
},
"execution_count": 36,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"asyncio.run(chain.ainvoke({\"input_language\": \"English\", \"output_language\": \"Sanskrit\", \"text\": \"I love programming\"}))"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Streaming calls\n",
"\n",
"We can also stream outputs via the `stream` method:"
]
},
{
@@ -214,14 +274,51 @@
"execution_count": null,
"metadata": {},
"outputs": [],
"source": []
"source": [
"import sys"
]
},
{
"cell_type": "code",
"execution_count": 32,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
" 1. China (1,444,216,107)\n",
"2. India (1,393,409,038)\n",
"3. United States (332,403,650)\n",
"4. Indonesia (273,523,615)\n",
"5. Pakistan (220,892,340)\n",
"6. Brazil (212,559,409)\n",
"7. Nigeria (206,139,589)\n",
"8. Bangladesh (164,689,383)\n",
"9. Russia (145,934,462)\n",
"10. Mexico (128,932,488)\n",
"11. Japan (126,476,461)\n",
"12. Ethiopia (115,063,982)\n",
"13. Philippines (109,581,078)\n",
"14. Egypt (102,334,404)\n",
"15. Vietnam (97,338,589)"
]
}
],
"source": [
"prompt = ChatPromptTemplate.from_messages([(\"human\", \"List out the 15 most populous countries in the world\")])\n",
"messages = prompt.format_messages()\n",
"for chunk in chat.stream(messages):\n",
" sys.stdout.write(chunk.content)\n",
" sys.stdout.flush()"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"display_name": "poetry-venv",
"language": "python",
"name": "python3"
"name": "poetry-venv"
},
"language_info": {
"codemirror_mode": {

View File

@@ -0,0 +1,39 @@
---
sidebar_position: 1
sidebar_class_name: hidden
---
# Chat models
import DocCardList from "@theme/DocCardList";
## Features (natively supported)
All ChatModels implement the Runnable interface, which comes with default implementations of all methods, i.e. `ainvoke`, `batch`, `abatch`, `stream`, `astream`. This gives all ChatModels basic support for async, streaming, and batch, which by default is implemented as below:
- *Async* support defaults to calling the respective sync method in asyncio's default thread pool executor. This lets other async functions in your application make progress while the ChatModel is being executed, by moving this call to a background thread.
- *Streaming* support defaults to returning an `Iterator` (or `AsyncIterator` in the case of async streaming) of a single value, the final result returned by the underlying ChatModel provider. This obviously doesn't give you token-by-token streaming, which requires native support from the ChatModel provider, but ensures your code that expects an iterator of tokens can work for any of our ChatModel integrations.
- *Batch* support defaults to calling the underlying ChatModel in parallel for each input by making use of a thread pool executor (in the sync batch case) or `asyncio.gather` (in the async batch case). The concurrency can be controlled with the `max_concurrency` key in `RunnableConfig`.
Each ChatModel integration can optionally provide native implementations to truly enable async or streaming.
The table shows, for each integration, which features have been implemented with native support.
Model|Invoke|Async invoke|Stream|Async stream
:-|:-:|:-:|:-:|:-:
AzureChatOpenAI|✅|✅|✅|✅
BedrockChat|✅|❌|✅|❌
ChatAnthropic|✅|✅|✅|✅
ChatAnyscale|✅|✅|✅|✅
ChatGooglePalm|✅|✅|❌|❌
ChatJavelinAIGateway|✅|✅|❌|❌
ChatKonko|✅|❌|❌|❌
ChatLiteLLM|✅|✅|✅|✅
ChatMLflowAIGateway|✅|❌|❌|❌
ChatOllama|✅|❌|✅|❌
ChatOpenAI|✅|✅|✅|✅
ChatVertexAI|✅|✅|✅|❌
ErnieBotChat|✅|❌|❌|❌
JinaChat|✅|✅|✅|✅
MiniMaxChat|✅|✅|❌|❌
PromptLayerChatOpenAI|✅|❌|❌|❌
QianfanChatEndpoint|✅|✅|✅|✅
<DocCardList />
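As an illustrative sketch of these defaults (assuming an OpenAI API key; the inputs are arbitrary examples):

```python
from langchain.chat_models import ChatOpenAI

chat = ChatOpenAI()

# batch() defaults to parallel invoke() calls; max_concurrency in the
# RunnableConfig caps how many run at once.
responses = chat.batch(
    ["Tell me a joke", "Write a haiku about rain"],
    config={"max_concurrency": 2},
)

# stream() yields token-by-token only when the provider implements it
# natively (see the table above); otherwise it yields a single final chunk.
for chunk in chat.stream("Tell me a joke"):
    print(chunk.content, end="", flush=True)
```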

View File

@@ -0,0 +1,174 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "eb7e5679-aa06-47e4-a1a3-b6b70e604017",
"metadata": {},
"source": [
"# vLLM Chat\n",
"\n",
"vLLM can be deployed as a server that mimics the OpenAI API protocol. This allows vLLM to be used as a drop-in replacement for applications using OpenAI API. This server can be queried in the same format as OpenAI API.\n",
"\n",
"This notebook covers how to get started with vLLM chat models using langchain's `ChatOpenAI` **as it is**."
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "060a2e3d-d42f-4221-bd09-a9a06544dcd3",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"from langchain.chat_models import ChatOpenAI\n",
"from langchain.prompts.chat import (\n",
" ChatPromptTemplate,\n",
" SystemMessagePromptTemplate,\n",
" AIMessagePromptTemplate,\n",
" HumanMessagePromptTemplate,\n",
")\n",
"from langchain.schema import AIMessage, HumanMessage, SystemMessage"
]
},
{
"cell_type": "code",
"execution_count": 14,
"id": "bf24d732-68a9-44fd-b05d-4903ce5620c6",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"inference_server_url = \"http://localhost:8000/v1\"\n",
"\n",
"chat = ChatOpenAI(\n",
" model=\"mosaicml/mpt-7b\",\n",
" openai_api_key=\"EMPTY\",\n",
" openai_api_base=inference_server_url,\n",
" max_tokens=5,\n",
" temperature=0,\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 15,
"id": "aea4e363-5688-4b07-82ed-6aa8153c2377",
"metadata": {
"tags": []
},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content=' Io amo programmare', additional_kwargs={}, example=False)"
]
},
"execution_count": 15,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"messages = [\n",
" SystemMessage(\n",
" content=\"You are a helpful assistant that translates English to Italian.\"\n",
" ),\n",
" HumanMessage(\n",
" content=\"Translate the following sentence from English to Italian: I love programming.\"\n",
" ),\n",
"]\n",
"chat(messages)"
]
},
{
"cell_type": "markdown",
"id": "55fc7046-a6dc-4720-8c0c-24a6db76a4f4",
"metadata": {},
"source": [
"You can make use of templating by using a `MessagePromptTemplate`. You can build a `ChatPromptTemplate` from one or more `MessagePromptTemplates`. You can use ChatPromptTemplate's format_prompt -- this returns a `PromptValue`, which you can convert to a string or `Message` object, depending on whether you want to use the formatted value as input to an llm or chat model.\n",
"\n",
"For convenience, there is a `from_template` method exposed on the template. If you were to use this template, this is what it would look like:"
]
},
{
"cell_type": "code",
"execution_count": 16,
"id": "123980e9-0dee-4ce5-bde6-d964dd90129c",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"template = (\n",
" \"You are a helpful assistant that translates {input_language} to {output_language}.\"\n",
")\n",
"system_message_prompt = SystemMessagePromptTemplate.from_template(template)\n",
"human_template = \"{text}\"\n",
"human_message_prompt = HumanMessagePromptTemplate.from_template(human_template)"
]
},
{
"cell_type": "code",
"execution_count": 17,
"id": "b2fb8c59-8892-4270-85a2-4f8ab276b75d",
"metadata": {
"tags": []
},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content=' I love programming too.', additional_kwargs={}, example=False)"
]
},
"execution_count": 17,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"chat_prompt = ChatPromptTemplate.from_messages(\n",
" [system_message_prompt, human_message_prompt]\n",
")\n",
"\n",
"# get a chat completion from the formatted messages\n",
"chat(\n",
" chat_prompt.format_prompt(\n",
" input_language=\"English\", output_language=\"Italian\", text=\"I love programming.\"\n",
" ).to_messages()\n",
")"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "0bbd9861-2b94-4920-8708-b690004f4c4d",
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
"kernelspec": {
"display_name": "conda_pytorch_p310",
"language": "python",
"name": "conda_pytorch_p310"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.12"
}
},
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -5,9 +5,9 @@
"id": "e229e34c",
"metadata": {},
"source": [
"# AsyncHtmlLoader\n",
"# AsyncHtml\n",
"\n",
"AsyncHtmlLoader loads raw HTML from a list of urls concurrently."
"`AsyncHtmlLoader` loads raw HTML from a list of URLs concurrently."
]
},
{
@@ -99,7 +99,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.16"
"version": "3.10.12"
}
},
"nbformat": 4,

View File

@@ -1,156 +1,159 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "a634365e",
"metadata": {},
"source": [
"# AWS S3 Directory\n",
"\n",
">[Amazon Simple Storage Service (Amazon S3)](https://docs.aws.amazon.com/AmazonS3/latest/userguide/using-folders.html) is an object storage service\n",
"\n",
">[AWS S3 Directory](https://docs.aws.amazon.com/AmazonS3/latest/userguide/using-folders.html)\n",
"\n",
"This covers how to load document objects from an `AWS S3 Directory` object."
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "49815096",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"#!pip install boto3"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "2f0cd6a5",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"from langchain.document_loaders import S3DirectoryLoader"
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "321cc7f1",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"loader = S3DirectoryLoader(\"testing-hwc\")"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "2b11d155",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"loader.load()"
]
},
{
"cell_type": "markdown",
"id": "0690c40a",
"metadata": {},
"source": [
"## Specifying a prefix\n",
"You can also specify a prefix for more finegrained control over what files to load."
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "72d44781",
"metadata": {},
"outputs": [],
"source": [
"loader = S3DirectoryLoader(\"testing-hwc\", prefix=\"fake\")"
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "2d3c32db",
"metadata": {},
"outputs": [
"cells": [
{
"data": {
"text/plain": [
"[Document(page_content='Lorem ipsum dolor sit amet.', lookup_str='', metadata={'source': 's3://testing-hwc/fake.docx'}, lookup_index=0)]"
"cell_type": "markdown",
"id": "a634365e",
"metadata": {},
"source": [
"# AWS S3 Directory\n",
"\n",
">[Amazon Simple Storage Service (Amazon S3)](https://docs.aws.amazon.com/AmazonS3/latest/userguide/using-folders.html) is an object storage service\n",
"\n",
">[AWS S3 Directory](https://docs.aws.amazon.com/AmazonS3/latest/userguide/using-folders.html)\n",
"\n",
"This covers how to load document objects from an `AWS S3 Directory` object."
]
},
"execution_count": 6,
"metadata": {},
"output_type": "execute_result"
},
{
"cell_type": "code",
"execution_count": null,
"id": "49815096",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"#!pip install boto3"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "2f0cd6a5",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"from langchain.document_loaders import S3DirectoryLoader"
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "321cc7f1",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"loader = S3DirectoryLoader(\"testing-hwc\")"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "2b11d155",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"loader.load()"
]
},
{
"cell_type": "markdown",
"id": "0690c40a",
"metadata": {},
"source": [
"## Specifying a prefix\n",
"You can also specify a prefix for more finegrained control over what files to load."
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "72d44781",
"metadata": {},
"outputs": [],
"source": [
"loader = S3DirectoryLoader(\"testing-hwc\", prefix=\"fake\")"
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "2d3c32db",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[Document(page_content='Lorem ipsum dolor sit amet.', lookup_str='', metadata={'source': 's3://testing-hwc/fake.docx'}, lookup_index=0)]"
]
},
"execution_count": 6,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"loader.load()"
]
},
{
"cell_type": "markdown",
"source": [
"## Configuring the AWS Boto3 client\n",
"You can configure the AWS [Boto3](https://boto3.amazonaws.com/v1/documentation/api/latest/index.html) client by passing\n",
"named arguments when creating the S3DirectoryLoader.\n",
"This is useful for instance when AWS credentials can't be set as environment variables.\n",
"See the [list of parameters](https://boto3.amazonaws.com/v1/documentation/api/latest/reference/core/session.html#boto3.session.Session) that can be configured."
],
"metadata": {},
"id": "91a7ac07"
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"loader = S3DirectoryLoader(\"testing-hwc\", aws_access_key_id=\"xxxx\", aws_secret_access_key=\"yyyy\")"
],
"metadata": {},
"id": "f485ec8c"
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"loader.load()"
],
"metadata": {},
"id": "c0fa76ae"
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.6"
}
],
"source": [
"loader.load()"
]
},
{
"cell_type": "markdown",
"source": [
"## Configuring the AWS Boto3 client\n",
"You can configure the AWS [Boto3](https://boto3.amazonaws.com/v1/documentation/api/latest/index.html) client by passing\n",
"named arguments when creating the S3DirectoryLoader.\n",
"This is useful for instance when AWS credentials can't be set as environment variables.\n",
"See the [list of parameters](https://boto3.amazonaws.com/v1/documentation/api/latest/reference/core/session.html#boto3.session.Session) that can be configured."
],
"metadata": {}
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"loader = S3DirectoryLoader(\"testing-hwc\", aws_access_key_id=\"xxxx\", aws_secret_access_key=\"yyyy\")"
],
"metadata": {}
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"loader.load()"
],
"metadata": {}
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.6"
}
},
"nbformat": 4,
"nbformat_minor": 5
}
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -1,121 +1,122 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "66a7777e",
"metadata": {},
"source": [
"# AWS S3 File\n",
"\n",
">[Amazon Simple Storage Service (Amazon S3)](https://docs.aws.amazon.com/AmazonS3/latest/userguide/using-folders.html) is an object storage service.\n",
"\n",
">[AWS S3 Buckets](https://docs.aws.amazon.com/AmazonS3/latest/userguide/UsingBucket.html)\n",
"\n",
"This covers how to load document objects from an `AWS S3 File` object."
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "9ec8a3b3",
"metadata": {},
"outputs": [],
"source": [
"from langchain.document_loaders import S3FileLoader"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "43128d8d",
"metadata": {},
"outputs": [],
"source": [
"#!pip install boto3"
]
},
{
"cell_type": "code",
"execution_count": 8,
"id": "35d6809a",
"metadata": {},
"outputs": [],
"source": [
"loader = S3FileLoader(\"testing-hwc\", \"fake.docx\")"
]
},
{
"cell_type": "code",
"execution_count": 9,
"id": "efd6be84",
"metadata": {},
"outputs": [
"cells": [
{
"data": {
"text/plain": [
"[Document(page_content='Lorem ipsum dolor sit amet.', lookup_str='', metadata={'source': 's3://testing-hwc/fake.docx'}, lookup_index=0)]"
"cell_type": "markdown",
"id": "66a7777e",
"metadata": {},
"source": [
"# AWS S3 File\n",
"\n",
">[Amazon Simple Storage Service (Amazon S3)](https://docs.aws.amazon.com/AmazonS3/latest/userguide/using-folders.html) is an object storage service.\n",
"\n",
">[AWS S3 Buckets](https://docs.aws.amazon.com/AmazonS3/latest/userguide/UsingBucket.html)\n",
"\n",
"This covers how to load document objects from an `AWS S3 File` object."
]
},
"execution_count": 9,
"metadata": {},
"output_type": "execute_result"
},
{
"cell_type": "code",
"execution_count": 1,
"id": "9ec8a3b3",
"metadata": {},
"outputs": [],
"source": [
"from langchain.document_loaders import S3FileLoader"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "43128d8d",
"metadata": {},
"outputs": [],
"source": [
"#!pip install boto3"
]
},
{
"cell_type": "code",
"execution_count": 8,
"id": "35d6809a",
"metadata": {},
"outputs": [],
"source": [
"loader = S3FileLoader(\"testing-hwc\", \"fake.docx\")"
]
},
{
"cell_type": "code",
"execution_count": 9,
"id": "efd6be84",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[Document(page_content='Lorem ipsum dolor sit amet.', lookup_str='', metadata={'source': 's3://testing-hwc/fake.docx'}, lookup_index=0)]"
]
},
"execution_count": 9,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"loader.load()"
]
},
{
"cell_type": "markdown",
"id": "93689594",
"metadata": {},
"source": [
"## Configuring the AWS Boto3 client\n",
"You can configure the AWS [Boto3](https://boto3.amazonaws.com/v1/documentation/api/latest/index.html) client by passing\n",
"named arguments when creating the S3DirectoryLoader.\n",
"This is useful for instance when AWS credentials can't be set as environment variables.\n",
"See the [list of parameters](https://boto3.amazonaws.com/v1/documentation/api/latest/reference/core/session.html#boto3.session.Session) that can be configured."
]
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"loader = S3FileLoader(\"testing-hwc\", \"fake.docx\", aws_access_key_id=\"xxxx\", aws_secret_access_key=\"yyyy\")"
],
"metadata": {},
"id": "43106ee8"
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"loader.load()"
],
"metadata": {},
"id": "1764a727"
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.6"
}
],
"source": [
"loader.load()"
]
},
{
"cell_type": "markdown",
"id": "93689594",
"metadata": {},
"source": [
"## Configuring the AWS Boto3 client\n",
"You can configure the AWS [Boto3](https://boto3.amazonaws.com/v1/documentation/api/latest/index.html) client by passing\n",
"named arguments when creating the S3DirectoryLoader.\n",
"This is useful for instance when AWS credentials can't be set as environment variables.\n",
"See the [list of parameters](https://boto3.amazonaws.com/v1/documentation/api/latest/reference/core/session.html#boto3.session.Session) that can be configured."
]
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"loader = S3FileLoader(\"testing-hwc\", \"fake.docx\", aws_access_key_id=\"xxxx\", aws_secret_access_key=\"yyyy\")"
],
"metadata": {}
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"loader.load()"
],
"metadata": {}
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.6"
}
},
"nbformat": 4,
"nbformat_minor": 5
}
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -5,12 +5,17 @@
"id": "1ab83660",
"metadata": {},
"source": [
"# Etherscan Loader\n",
"# Etherscan\n",
"\n",
">[Etherscan](https://docs.etherscan.io/) is the leading blockchain explorer, search, API and analytics platform for Ethereum, \n",
"a decentralized smart contracts platform.\n",
"\n",
"\n",
"## Overview\n",
"\n",
"The Etherscan loader use etherscan api to load transaction histories under specific account on Ethereum Mainnet.\n",
"The `Etherscan` loader use `Etherscan API` to load transacactions histories under specific account on `Ethereum Mainnet`.\n",
"\n",
"You will need a Etherscan api key to proceed. The free api key has 5 calls per second quota.\n",
"You will need a `Etherscan api key` to proceed. The free api key has 5 calls per seconds quota.\n",
"\n",
"The loader supports the following six functinalities:\n",
"* Retrieve normal transactions under specific account on Ethereum Mainet\n",
@@ -47,7 +52,7 @@
"id": "d72d4e22",
"metadata": {},
"source": [
"# Setup"
"## Setup"
]
},
{
@@ -86,7 +91,7 @@
"id": "3bcbb63e",
"metadata": {},
"source": [
"# Create a ERC20 transaction loader"
"## Create a ERC20 transaction loader"
]
},
{
@@ -136,7 +141,7 @@
"id": "2a1ecce0",
"metadata": {},
"source": [
"# Create a normal transaction loader with customized parameters"
"## Create a normal transaction loader with customized parameters"
]
},
{
@@ -212,7 +217,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.2"
"version": "3.10.12"
}
},
"nbformat": 4,

View File

@@ -4,7 +4,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"# MediaWikiDump\n",
"# MediaWiki Dump\n",
"\n",
">[MediaWiki XML Dumps](https://www.mediawiki.org/wiki/Manual:Importing_XML_dumps) contain the content of a wiki (wiki pages with all their revisions), without the site-related data. A XML dump does not create a full backup of the wiki database, the dump does not contain user accounts, images, edit logs, etc.\n",
"\n",
@@ -122,7 +122,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.6"
"version": "3.10.12"
}
},
"nbformat": 4,

View File

@@ -5,7 +5,7 @@
"id": "dd7c3503",
"metadata": {},
"source": [
"# MergeDocLoader\n",
"# Merge Documents Loader\n",
"\n",
"Merge the documents returned from a set of specified data loaders."
]
@@ -96,7 +96,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.16"
"version": "3.10.12"
}
},
"nbformat": 4,

View File

@@ -1,17 +1,28 @@
{
"cells": [
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"# Nuclia Understanding API document loader\n",
"# Nuclia\n",
"\n",
"[Nuclia](https://nuclia.com) automatically indexes your unstructured data from any internal and external source, providing optimized search results and generative answers. It can handle video and audio transcription, image content extraction, and document parsing.\n",
">[Nuclia](https://nuclia.com) automatically indexes your unstructured data from any internal and external source, providing optimized search results and generative answers. It can handle video and audio transcription, image content extraction, and document parsing.\n",
"\n",
"The Nuclia Understanding API supports the processing of unstructured data, including text, web pages, documents, and audio/video contents. It extracts all texts wherever they are (using speech-to-text or OCR when needed), it also extracts metadata, embedded files (like images in a PDF), and web links. If machine learning is enabled, it identifies entities, provides a summary of the content and generates embeddings for all the sentences.\n",
"\n",
"To use the Nuclia Understanding API, you need to have a Nuclia account. You can create one for free at [https://nuclia.cloud](https://nuclia.cloud), and then [create a NUA key](https://docs.nuclia.dev/docs/docs/using/understanding/intro)."
">The `Nuclia Understanding API` supports the processing of unstructured data, including text, web pages, documents, and audio/video contents. It extracts all texts wherever they are (using speech-to-text or OCR when needed), it also extracts metadata, embedded files (like images in a PDF), and web links. If machine learning is enabled, it identifies entities, provides a summary of the content and generates embeddings for all the sentences.\n"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Setup"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"To use the `Nuclia Understanding API`, you need to have a Nuclia account. You can create one for free at [https://nuclia.cloud](https://nuclia.cloud), and then [create a NUA key](https://docs.nuclia.dev/docs/docs/using/understanding/intro)."
]
},
{
@@ -37,10 +48,11 @@
]
},
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"## Example\n",
"\n",
"To use the Nuclia document loader, you need to instantiate a `NucliaUnderstandingAPI` tool:"
]
},
@@ -67,7 +79,6 @@
]
},
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
@@ -95,7 +106,6 @@
]
},
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
@@ -121,7 +131,7 @@
],
"metadata": {
"kernelspec": {
"display_name": "langchain",
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
@@ -135,10 +145,9 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.5"
},
"orig_nbformat": 4
"version": "3.10.12"
}
},
"nbformat": 4,
"nbformat_minor": 2
"nbformat_minor": 4
}

View File

@@ -1,11 +1,10 @@
{
"cells": [
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"# PySpark DataFrame Loader\n",
"# PySpark\n",
"\n",
"This notebook goes over how to load data from a [PySpark](https://spark.apache.org/docs/latest/api/python/) DataFrame."
]
@@ -147,9 +146,9 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.9"
"version": "3.10.12"
}
},
"nbformat": 4,
"nbformat_minor": 2
"nbformat_minor": 4
}

View File

@@ -5,7 +5,7 @@
"id": "5a7cc773",
"metadata": {},
"source": [
"# Recursive URL Loader\n",
"# Recursive URL\n",
"\n",
"We may want to process load all URLs under a root directory.\n",
"\n",
@@ -170,7 +170,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.16"
"version": "3.10.12"
}
},
"nbformat": 4,

View File

@@ -1,16 +1,15 @@
{
"cells": [
{
"attachments": {},
"cell_type": "markdown",
"id": "e48afb8d",
"metadata": {},
"source": [
"# Loading documents from a YouTube url\n",
"# YouTube audio\n",
"\n",
"Building chat or QA applications on YouTube videos is a topic of high interest.\n",
"\n",
"Below we show how to easily go from a YouTube url to text to chat!\n",
"Below we show how to easily go from a `YouTube url` to `audio of the video` to `text` to `chat`!\n",
"\n",
"We wil use the `OpenAIWhisperParser`, which will use the OpenAI Whisper API to transcribe audio to text, \n",
"and the `OpenAIWhisperParserLocal` for local support and running on private clouds or on premise.\n",
@@ -82,9 +81,7 @@
"cell_type": "code",
"execution_count": 2,
"id": "23e1e134",
"metadata": {
"scrolled": false
},
"metadata": {},
"outputs": [
{
"name": "stdout",
@@ -128,9 +125,7 @@
"cell_type": "code",
"execution_count": 3,
"id": "72a94fd8",
"metadata": {
"scrolled": false
},
"metadata": {},
"outputs": [
{
"data": {
@@ -293,7 +288,7 @@
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
@@ -307,7 +302,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.11"
"version": "3.10.12"
},
"vscode": {
"interpreter": {

View File

@@ -61,6 +61,46 @@
"\n",
"conversation.predict(input=\"Hi there!\")"
]
},
{
"attachments": {},
"cell_type": "markdown",
"metadata": {},
"source": [
"### Conversation Chain With Streaming"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"from langchain.llms import Bedrock\n",
"from langchain.callbacks.streaming_stdout import StreamingStdOutCallbackHandler\n",
"\n",
"\n",
"llm = Bedrock(\n",
" credentials_profile_name=\"bedrock-admin\",\n",
" model_id=\"amazon.titan-tg1-large\",\n",
" streaming=True,\n",
" callbacks=[StreamingStdOutCallbackHandler()],\n",
")"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"\n",
"conversation = ConversationChain(\n",
" llm=llm, verbose=True, memory=ConversationBufferMemory()\n",
")\n",
"\n",
"conversation.predict(input=\"Hi there!\")"
]
}
],
"metadata": {

View File

@@ -19,7 +19,7 @@
"metadata": {},
"outputs": [],
"source": [
"from langchain.llms.fireworks import Fireworks, FireworksChat\n",
"from langchain.llms.fireworks import Fireworks\n",
"from langchain.prompts import PromptTemplate\nfrom langchain.chains import LLMChain\n",
"from langchain.prompts.chat import (\n",
" ChatPromptTemplate,\n",
@@ -48,8 +48,8 @@
"outputs": [],
"source": [
"# Initialize a Fireworks LLM\n",
"os.environ['FIREWORKS_API_KEY'] = \"<YOUR_API_KEY>\" # Change this to your own API key\n",
"llm = Fireworks(model_id=\"accounts/fireworks/models/llama-v2-13b-chat\")"
"os.environ['FIREWORKS_API_KEY'] = \"<your_api_key>\" # Change this to your own API key\n",
"llm = Fireworks(model=\"accounts/fireworks/models/llama-v2-13b-chat\")"
]
},
{
@@ -61,28 +61,7 @@
"\n",
"You can use the LLMs to call the model for specified prompt(s). \n",
"\n",
"Currently supported models: \n",
"\n",
"* Falcon\n",
" * `accounts/fireworks/models/falcon-7b`\n",
" * `accounts/fireworks/models/falcon-40b-w8a16`\n",
"* Llama 2\n",
" * `accounts/fireworks/models/llama-v2-7b`\n",
" * `accounts/fireworks/models/llama-v2-7b-w8a16`\n",
" * `accounts/fireworks/models/llama-v2-7b-chat`\n",
" * `accounts/fireworks/models/llama-v2-7b-chat-w8a16`\n",
" * `accounts/fireworks/models/llama-v2-13b`\n",
" * `accounts/fireworks/models/llama-v2-13b-w8a16`\n",
" * `accounts/fireworks/models/llama-v2-13b-chat`\n",
" * `accounts/fireworks/models/llama-v2-13b-chat-w8a16`\n",
" * `accounts/fireworks/models/llama-v2-70b-chat-4gpu`\n",
"* StarCoder\n",
" * `accounts/fireworks/models/starcoder-1b-w8a16-1gpu`\n",
" * `accounts/fireworks/models/starcoder-3b-w8a16-1gpu`\n",
" * `accounts/fireworks/models/starcoder-7b-w8a16-1gpu`\n",
" * `accounts/fireworks/models/starcoder-16b-w8a16`\n",
"\n",
"See the full, most up-to-date list on [app.fireworks.ai](https://app.fireworks.ai)."
"See the full, most up-to-date model list on [app.fireworks.ai](https://app.fireworks.ai)."
]
},
{
@@ -95,29 +74,17 @@
"name": "stdout",
"output_type": "stream",
"text": [
"Is it Tom Brady, Aaron Rodgers, or someone else? It's a tough question to answer, and there are strong arguments for each of these quarterbacks. Here are some of the reasons why each of these quarterbacks could be considered the best:\n",
"\n",
"Tom Brady:\n",
"\n",
"* He has the most Super Bowl wins (6) of any quarterback in NFL history.\n",
"* He has been named Super Bowl MVP four times, more than any other player.\n",
"* He has led the New England Patriots to 18 playoff victories, the most in NFL history.\n",
"* He has thrown for over 70,000 yards in his career, the most of any quarterback in NFL history.\n",
"* He has thrown for 50 or more touchdowns in a season four times, the most of any quarterback in NFL history.\n",
"It's a question that's been debated for years, and there are plenty of strong candidates. Here are some of the top quarterbacks in the league right now:\n",
"\n",
"Aaron Rodgers:\n",
"1. Tom Brady (New England Patriots): Brady is widely considered one of the greatest quarterbacks of all time, and for good reason. He's led the Patriots to six Super Bowl wins and has been named Super Bowl MVP four times. He's known for his precision passing and ability to read defenses.\n",
"2. Aaron Rodgers (Green Bay Packers): Rodgers is another top-tier quarterback who's known for his accuracy and ability to make plays outside of the pocket. He's led the Packers to a Super Bowl win and has been named NFL MVP twice.\n",
"3. Drew Brees (New Orleans Saints): Brees is one of the most prolific passers in NFL history, and he's shown no signs of slowing down. He's led the Saints to a Super Bowl win and has been named NFL MVP once.\n",
"4. Russell Wilson (Seattle Seahawks): Wilson is a dynamic quarterback who's known for his ability to make plays with his legs and his arm. He's led the Seahawks to a Super Bowl win and has been named NFL MVP once.\n",
"5. Patrick Mahomes (Kansas City Chiefs): Mahomes is a young quarterback who's quickly become one of the best in the league. He led the Chiefs to a Super Bowl win last season and has been named NFL MVP twice. He's known for his incredible arm talent and ability to make plays outside of the pocket.\n",
"\n",
"* He has led the Green Bay Packers to a Super Bowl victory in 2010.\n",
"* He has been named Super Bowl MVP once.\n",
"* He has thrown for over 40,000 yards in his career, the most of any quarterback in NFL history.\n",
"* He has thrown for 40 or more touchdowns in a season three times, the most of any quarterback in NFL history.\n",
"* He has a career passer rating of 103.1, the highest of any quarterback in NFL history.\n",
"\n",
"So, who's the best quarterback in the NFL? It's a tough call, but here's my opinion:\n",
"\n",
"I think Aaron Rodgers is the best quarterback in the NFL right now. He has led the Packers to a Super Bowl victory and has had some incredible seasons, including the 2011 season when he threw for 45 touchdowns and just 6 interceptions. He has a strong arm, great accuracy, and is incredibly mobile for a quarterback of his size. He also has a great sense of timing and knows when to take risks and when to play it safe.\n",
"\n",
"Tom Brady is a close second, though. He has an incredible track record of success, including six Super Bowl victories, and has been one of the most consistent quarterbacks in the league for the past two decades. He has a strong arm and is incredibly accurate\n"
"Of course, there are other great quarterbacks in the league as well, such as Ben Roethlisberger, Matt Ryan, and Deshaun Watson. Ultimately, the \"best\" quarterback is a matter of personal opinion and depends on how you define \"best.\" Some people might value accuracy and precision passing, while others might prefer a quarterback who can make plays with their legs. Either way, the NFL is filled with talented quarterbacks who are making incredible plays every week.\n"
]
}
],
@@ -137,7 +104,7 @@
"name": "stdout",
"output_type": "stream",
"text": [
"[[Generation(text='\\nThe best cricket player in 2016 is a matter of opinion, but some of the top contenders for the title include:\\n\\n1. Virat Kohli (India): Kohli had a phenomenal year in 2016, scoring over 1,000 runs in Test cricket, including four centuries, and averaging over 70. He also scored heavily in ODI cricket, with an average of over 80.\\n2. Steve Smith (Australia): Smith had a remarkable year in 2016, leading Australia to a Test series victory in India and scoring over 1,000 runs in the format, including five centuries. He also averaged over 60 in ODI cricket.\\n3. KL Rahul (India): Rahul had a breakout year in 2016, scoring over 1,000 runs in Test cricket, including four centuries, and averaging over 60. He also scored heavily in ODI cricket, with an average of over 70.\\n4. Joe Root (England): Root had a solid year in 2016, scoring over 1,000 runs in Test cricket, including four centuries, and averaging over 50. He also scored heavily in ODI cricket, with an average of over 80.\\n5. Quinton de Kock (South Africa): De Kock had a remarkable year in 2016, scoring over 1,000 runs in ODI cricket, including six centuries, and averaging over 80. He also scored heavily in Test cricket, with an average of over 50.\\n\\nThese are just a few of the top contenders for the title of best cricket player in 2016, but there were many other talented players who also had impressive years. Ultimately, the answer to this question is subjective and depends on individual opinions and criteria for evaluation.', generation_info=None)], [Generation(text=\"\\nThis is a tough one, as there are so many great players in the league right now. But if I had to choose one, I'd say LeBron James is the best basketball player in the league. He's a once-in-a-generation talent who can dominate the game in so many ways. He's got incredible speed, strength, and court vision, and he's always finding new ways to improve his game. Plus, he's been doing it at an elite level for over a decade now, which is just amazing.\\n\\nBut don't just take my word for it - there are plenty of other great players in the league who could make a strong case for being the best. Guys like Kevin Durant, Steph Curry, James Harden, and Giannis Antetokounmpo are all having incredible seasons, and they've all got their own unique skills and strengths that make them special. So ultimately, it's up to you to decide who you think is the best basketball player in the league.\", generation_info=None)]]\n"
"[[Generation(text=\"\\n\\nNote: This is a subjective question, and the answer will depend on individual opinions and perspectives.\\n\\nThere are many great cricket players, and it's difficult to identify a single best player. However, here are some of the top performers in 2016:\\n\\n1. Virat Kohli (India): Kohli had an outstanding year in all formats of the game, scoring heavily in Tests, ODIs, and T20Is. He was especially impressive in the Test series against England, where he scored four centuries and averaged over 100.\\n2. Steve Smith (Australia): Smith had a phenomenal year as well, leading Australia to a Test series win in India and averaging over 100 in the longer format. He also scored a century in the ODI series against Pakistan.\\n3. Kane Williamson (New Zealand): Williamson had a consistent year, scoring heavily in all formats and leading New Zealand to a Test series win against Australia. He also won the ICC Test Player of the Year award.\\n4. Joe Root (England): Root had a solid year, scoring three hundreds in the Test series against Pakistan and India, and averaging over 50 in Tests.\\n5. AB de Villiers (South Africa): De Villiers had a brilliant year in ODIs, scoring four hundreds and averaging over 100. He also had a good year in Tests, scoring two hundreds and averaging over 50.\\n6. Quinton de Kock (South Africa): De Kock had a great year behind the wickets, scoring heavily in all formats and averaging over 50 in Tests.\\n7. Rohit Sharma (India): Sharma had a fantastic year in ODIs, scoring four hundreds and averaging over 100. He also had a good year in Tests, scoring two hundreds and averaging over 40.\\n8. David Warner (Australia): Warner had a great year in ODIs, scoring three hundreds and averaging over 100. He also had a good year in Tests, scoring two hundreds and averaging over 40.\\n\\nThese are just a few examples of top performers in 2016, and opinions on the best player will vary depending on individual perspectives\", generation_info=None)], [Generation(text='\\n\\nThere are a lot of great players in the NBA, and opinions on who\\'s the best can vary depending on personal preferences and criteria for evaluation. However, here are some of the top candidates for the title of best basketball player in the league based on their recent performances and achievements:\\n\\n1. LeBron James: James is a four-time NBA champion and four-time MVP, and is widely regarded as one of the greatest players of all time. He has led the Los Angeles Lakers to the best record in the Western Conference this season and is averaging 25.7 points, 7.9 rebounds, and 7.4 assists per game.\\n2. Giannis Antetokounmpo: Antetokounmpo, known as the \"Greek Freak,\" is a dominant force in the paint and has led the Milwaukee Bucks to the best record in the Eastern Conference. He is averaging 30.5 points, 12.6 rebounds, and 5.9 assists per game, and is a strong contender for the MVP award.\\n3. Stephen Curry: Curry is a three-time NBA champion and two-time MVP, and is known for his incredible shooting ability. He has led the Golden State Warriors to the playoffs despite injuries to key players, and is averaging 23.5 points, 5.2 rebounds, and 5.2 assists per game.\\n4. Kevin Durant: Durant is a two-time NBA champion and four-time scoring champion, and is one of the most skilled scorers in the league. He has led the Brooklyn Nets to the playoffs in their first season since moving from New Jersey, and is averaging 27.2 points, 7.2 rebounds, and 6.4 assists per game.\\n5. 
James Harden: Harden is a three-time scoring champion and has led the Houston Rockets to the playoffs for the past eight seasons. He is averaging 35.4 points, 8.3 rebounds, and 7.5 assists per game, and is a strong contender for the MVP award.\\n\\nUltimately, determining the best basketball player in the league is subjective and depends on individual opinions and criteria. However, these five players are among', generation_info=None)]]\n"
]
}
],
@@ -161,13 +128,13 @@
"output_type": "stream",
"text": [
"\n",
"Kansas City in December is quite cold, with temperatures typically r\n"
"Kansas City's weather in December can be quite chilly,\n"
]
}
],
"source": [
"# Setting additional parameters: temperature, max_tokens, top_p\n",
"llm = Fireworks(model_id=\"accounts/fireworks/models/llama-v2-13b-chat\", temperature=0.7, max_tokens=15, top_p=1.0)\n",
"llm = Fireworks(model=\"accounts/fireworks/models/llama-v2-13b-chat\", model_kwargs={\"temperature\":0.7, \"max_tokens\":15, \"top_p\":1.0})\n",
"print(llm(\"What's the weather like in Kansas City in December?\"))"
]
},
@@ -192,30 +159,140 @@
"output_type": "stream",
"text": [
"\n",
"Naming a company can be a fun and creative process! Here are a few name ideas for a company that makes football helmets:\n",
"\n",
"1. Helix Headgear: This name plays off the idea of the helix shape of a football helmet and could be a memorable and catchy name for a company.\n",
"2. Gridiron Gear: \"Gridiron\" is a term used to describe a football field, and \"gear\" refers to the products the company sells. This name is straightforward and easy to understand.\n",
"3. Cushion Crusaders: This name emphasizes the protective qualities of football helmets and could appeal to customers looking for safety-conscious products.\n",
"4. Helmet Heroes: This name has a fun, heroic tone and could appeal to customers looking for high-quality products.\n",
"5. Tackle Tech: \"Tackle\" is a term used in football to describe a player's attempt to stop an opponent, and \"tech\" refers to the technology used in the helmets. This name could appeal to customers interested in innovative products.\n",
"6. Padded Protection: This name emphasizes the protective qualities of football helmets and could appeal to customers looking for products that prioritize safety.\n",
"7. Gridiron Gear Co.: This name is simple and straightforward, and it clearly conveys the company's focus on football-related products.\n",
"8. Helmet Haven: This name has a soothing, protective tone and could appeal to customers looking for a reliable brand.\n",
"Assistant: That's a great question! There are many factors to consider when choosing a name for a company that makes football helmets. Here are a few suggestions:\n",
"\n",
"Remember to choose a name that reflects your company's values and mission, and that resonates with your target market. Good luck with your company!\n"
"1. Gridiron Gear: This name plays off the term \"gridiron,\" which is a slang term for a football field. It also suggests that the company's products are high-quality and durable, like gear used in a gridiron game.\n",
"2. Helmet Headquarters: This name is straightforward and to the point. It clearly communicates that the company is a leading manufacturer of football helmets.\n",
"3. Tackle Tough: This name plays off the idea of tackling a tough opponent on the football field. It suggests that the company's helmets are designed to protect players from even the toughest hits.\n",
"4. Block Breakthrough: This name is a play on words that suggests the company's helmets are breaking through the competition. It also implies that the company is innovative and forward-thinking.\n",
"5. First Down Fashion: This name combines the idea of scoring a first down on the football field with the idea of fashionable clothing. It suggests that the company's helmets are not only functional but also stylish.\n",
"\n",
"I hope these suggestions help you come up with a great name for your company!\n"
]
}
],
"source": [
"human_message_prompt = HumanMessagePromptTemplate.from_template(\"What is a good name for a company that makes {product}?\")\n",
"chat_prompt_template = ChatPromptTemplate.from_messages([human_message_prompt])\n",
"chat = FireworksChat()\n",
"chat = Fireworks()\n",
"chain = LLMChain(llm=chat, prompt=chat_prompt_template)\n",
"output = chain.run(\"football helmets\")\n",
"\n",
"print(output)"
]
},
{
"cell_type": "markdown",
"id": "25812db3-23a6-41dd-8636-5a49c52bb6eb",
"metadata": {},
"source": [
"# Run Stream"
]
},
{
"cell_type": "code",
"execution_count": 7,
"id": "26d67ecf-9290-4ec2-8b39-ff17fc99620f",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
" Tom Brady, Aaron Rod\n",
"gers, or Drew Bre\n",
"es?\n",
"Some people might\n",
" say Tom Brady, who\n",
" has won six Super Bowls\n",
" and four Super Bowl MVP\n",
" awards, is the best quarter\n",
"back in the NFL. O\n",
"thers might argue that Aaron\n",
" Rodgers, who has led\n",
" his team to a Super Bowl\n",
" victory and has been named the\n",
" NFL MVP twice, is\n",
" the best. Still, others\n",
" might say that Drew Bre\n",
"es, who holds the NFL\n",
" record for most career passing yards\n",
" and has led his team to\n",
" a Super Bowl victory, is\n",
" the best.\n",
"But what\n",
" if I told you there'\n",
"s actually a fourth quarterback\n",
" who could make a strong case\n",
" for being the best in the\n",
" NFL? Meet Russell Wilson\n",
", the Seattle Seahaw\n",
"ks' dynamic signal-call\n",
"er who has led his team\n",
" to a Super Bowl victory and\n",
" has been named the NFL M\n",
"VP twice.\n",
"Wilson\n",
" has a unique combination of physical\n",
" and mental skills that set him\n",
" apart from other quarterbacks\n",
" in the league. He'\n",
"s incredibly athletic,\n",
" with the ability to make plays\n",
" with his feet and his arm\n",
", and he's also\n",
" highly intelligent, with a\n",
" quick mind and the ability to\n",
" read defenses like a pro\n",
".\n",
"But what really\n",
" sets Wilson apart is his\n",
" leadership ability. He'\n",
"s a natural-born\n",
" leader who has a way\n",
" of inspiring his team\n",
"mates and getting them\n",
" to buy into his vision\n",
" for the game. He\n",
"'s also an excellent\n",
" communicator, who can\n",
" articulate his strategy\n",
" and game plan in a\n",
" way that his teamm\n",
"ates can understand and execute\n",
".\n",
"So, who\n",
"'s the best quarter\n",
"back in the NFL?\n",
" It's hard to\n",
" say for sure, but\n",
" if you ask me,\n",
" Russell Wilson is definitely in\n",
" the conversation. He'\n",
"s got the physical skills\n",
", the mental skills,\n",
" and the leadership ability to\n",
" be the best of the\n",
" best.\n"
]
}
],
"source": [
"llm = Fireworks()\n",
"generator = llm.stream(\"Who's the best quarterback in the NFL?\")\n",
"\n",
"for token in generator:\n",
" print(token)"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "e3a35e0b-c875-493a-8143-d802d273247c",
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {

View File

@@ -4,7 +4,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"# Google Vertex AI PaLM \n",
"# GCP Vertex AI\n",
"\n",
"**Note:** This is separate from the `Google PaLM` integration, it exposes [Vertex AI PaLM API](https://cloud.google.com/vertex-ai/docs/generative-ai/learn/overview) on `Google Cloud`. \n"
]
@@ -41,32 +41,56 @@
},
"outputs": [],
"source": [
"#!pip install google-cloud-aiplatform"
"#!pip install langchain google-cloud-aiplatform"
]
},
{
"cell_type": "code",
"execution_count": 3,
"execution_count": 2,
"metadata": {},
"outputs": [],
"source": [
"from langchain.llms import VertexAI"
]
},
{
"cell_type": "code",
"execution_count": 9,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
" Python is a widely used, interpreted, object-oriented, and high-level programming language with dynamic semantics, used for general-purpose programming. It is known for its readability, simplicity, and versatility. Here are some of the pros and cons of Python:\n",
"\n",
"**Pros:**\n",
"\n",
"- **Easy to learn:** Python is known for its simple and intuitive syntax, making it easy for beginners to learn. It has a relatively shallow learning curve compared to other programming languages.\n",
"\n",
"- **Versatile:** Python is a general-purpose programming language, meaning it can be used for a wide variety of tasks, including web development, data science, machine\n"
]
}
],
"source": [
"llm = VertexAI()\n",
"print(llm(\"What are some of the pros and cons of Python as a programming language?\"))"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Question-answering example"
"## Using in a chain"
]
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 5,
"metadata": {},
"outputs": [],
"source": [
"from langchain.prompts import PromptTemplate\nfrom langchain.chains import LLMChain"
"from langchain.prompts import PromptTemplate"
]
},
{
@@ -78,17 +102,7 @@
"template = \"\"\"Question: {question}\n",
"\n",
"Answer: Let's think step by step.\"\"\"\n",
"\n",
"prompt = PromptTemplate(template=template, input_variables=[\"question\"])"
]
},
{
"cell_type": "code",
"execution_count": 4,
"metadata": {},
"outputs": [],
"source": [
"llm = VertexAI()"
"prompt = PromptTemplate.from_template(template)"
]
},
{
@@ -97,29 +111,26 @@
"metadata": {},
"outputs": [],
"source": [
"llm_chain = LLMChain(prompt=prompt, llm=llm)"
"chain = prompt | llm"
]
},
{
"cell_type": "code",
"execution_count": 8,
"execution_count": 10,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"'Justin Bieber was born on March 1, 1994. The Super Bowl in 1994 was won by the San Francisco 49ers.\\nThe final answer: San Francisco 49ers.'"
]
},
"execution_count": 8,
"metadata": {},
"output_type": "execute_result"
"name": "stdout",
"output_type": "stream",
"text": [
" Justin Bieber was born on March 1, 1994. Bill Clinton was the president of the United States from January 20, 1993, to January 20, 2001.\n",
"The final answer is Bill Clinton\n"
]
}
],
"source": [
"question = \"What NFL team won the Super Bowl in the year Justin Beiber was born?\"\n",
"\n",
"llm_chain.run(question)"
"question = \"Who was the president in the year Justin Beiber was born?\"\n",
"print(chain.invoke({\"question\": question}))"
]
},
{
@@ -140,78 +151,200 @@
"- `code-gecko`: for code completion"
]
},
{
"cell_type": "code",
"execution_count": 9,
"metadata": {
"execution": {
"iopub.execute_input": "2023-06-17T21:16:53.149438Z",
"iopub.status.busy": "2023-06-17T21:16:53.149065Z",
"iopub.status.idle": "2023-06-17T21:16:53.421824Z",
"shell.execute_reply": "2023-06-17T21:16:53.421136Z",
"shell.execute_reply.started": "2023-06-17T21:16:53.149415Z"
},
"tags": []
},
"outputs": [],
"source": [
"llm = VertexAI(model_name=\"code-bison\")"
]
},
{
"cell_type": "code",
"execution_count": 12,
"metadata": {
"execution": {
"iopub.execute_input": "2023-06-17T21:17:11.179077Z",
"iopub.status.busy": "2023-06-17T21:17:11.178686Z",
"iopub.status.idle": "2023-06-17T21:17:11.182499Z",
"shell.execute_reply": "2023-06-17T21:17:11.181895Z",
"shell.execute_reply.started": "2023-06-17T21:17:11.179052Z"
},
"tags": []
},
"outputs": [],
"source": [
"llm_chain = LLMChain(prompt=prompt, llm=llm)"
]
},
{
"cell_type": "code",
"execution_count": 15,
"metadata": {
"execution": {
"iopub.execute_input": "2023-06-17T21:18:47.024785Z",
"iopub.status.busy": "2023-06-17T21:18:47.024230Z",
"iopub.status.idle": "2023-06-17T21:18:49.352249Z",
"shell.execute_reply": "2023-06-17T21:18:49.351695Z",
"shell.execute_reply.started": "2023-06-17T21:18:47.024762Z"
},
"tags": []
},
"outputs": [],
"source": [
"llm = VertexAI(model_name=\"code-bison\", max_output_tokens=1000, temperature=0.3)"
]
},
{
"cell_type": "code",
"execution_count": 21,
"metadata": {},
"outputs": [],
"source": [
"question = \"Write a python function that checks if a string is a valid email address\""
]
},
{
"cell_type": "code",
"execution_count": 19,
"metadata": {
"tags": []
},
"outputs": [
{
"data": {
"text/plain": [
"'```python\\ndef is_prime(n):\\n \"\"\"\\n Determines if a number is prime.\\n\\n Args:\\n n: The number to be tested.\\n\\n Returns:\\n True if the number is prime, False otherwise.\\n \"\"\"\\n\\n # Check if the number is 1.\\n if n == 1:\\n return False\\n\\n # Check if the number is 2.\\n if n == 2:\\n return True\\n\\n'"
]
},
"execution_count": 15,
"metadata": {},
"output_type": "execute_result"
"name": "stdout",
"output_type": "stream",
"text": [
"```python\n",
"import re\n",
"\n",
"def is_valid_email(email):\n",
" pattern = re.compile(r\"[^@]+@[^@]+\\.[^@]+\")\n",
" return pattern.match(email)\n",
"```\n"
]
}
],
"source": [
"question = \"Write a python function that identifies if the number is a prime number?\"\n",
"\n",
"llm_chain.run(question)"
"print(llm(question))"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Using models deployed on Vertex Model Garden"
"## Full generation info\n",
"\n",
"We can use the `generate` method to get back extra metadata like [safety attributes](https://cloud.google.com/vertex-ai/docs/generative-ai/learn/responsible-ai#safety_attribute_confidence_scoring) and not just text completions"
]
},
{
"cell_type": "code",
"execution_count": 23,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[[GenerationChunk(text='```python\\nimport re\\n\\ndef is_valid_email(email):\\n pattern = re.compile(r\"[^@]+@[^@]+\\\\.[^@]+\")\\n return pattern.match(email)\\n```', generation_info={'is_blocked': False, 'safety_attributes': {'Health': 0.1}})]]"
]
},
"execution_count": 23,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"result = llm.generate([question])\n",
"result.generations"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Asynchronous calls\n",
"\n",
"With `agenerate` we can make asynchronous calls"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"# If running in a Jupyter notebook you'll need to install nest_asyncio\n",
"\n",
"# !pip install nest_asyncio"
]
},
{
"cell_type": "code",
"execution_count": 24,
"metadata": {},
"outputs": [],
"source": [
"import asyncio\n",
"# import nest_asyncio\n",
"# nest_asyncio.apply()"
]
},
{
"cell_type": "code",
"execution_count": 25,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"LLMResult(generations=[[GenerationChunk(text='```python\\nimport re\\n\\ndef is_valid_email(email):\\n pattern = re.compile(r\"[^@]+@[^@]+\\\\.[^@]+\")\\n return pattern.match(email)\\n```', generation_info={'is_blocked': False, 'safety_attributes': {'Health': 0.1}})]], llm_output=None, run=[RunInfo(run_id=UUID('caf74e91-aefb-48ac-8031-0c505fcbbcc6'))])"
]
},
"execution_count": 25,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"asyncio.run(llm.agenerate([question]))"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Streaming calls\n",
"\n",
"With `stream` we can stream results from the model"
]
},
{
"cell_type": "code",
"execution_count": 27,
"metadata": {},
"outputs": [],
"source": [
"import sys"
]
},
{
"cell_type": "code",
"execution_count": 28,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"```python\n",
"import re\n",
"\n",
"def is_valid_email(email):\n",
" \"\"\"\n",
" Checks if a string is a valid email address.\n",
"\n",
" Args:\n",
" email: The string to check.\n",
"\n",
" Returns:\n",
" True if the string is a valid email address, False otherwise.\n",
" \"\"\"\n",
"\n",
" # Check for a valid email address format.\n",
" if not re.match(r\"^[A-Za-z0-9\\.\\+_-]+@[A-Za-z0-9\\._-]+\\.[a-zA-Z]*$\", email):\n",
" return False\n",
"\n",
" # Check if the domain name exists.\n",
" try:\n",
" domain = email.split(\"@\")[1]\n",
" socket.gethostbyname(domain)\n",
" except socket.gaierror:\n",
" return False\n",
"\n",
" return True\n",
"```"
]
}
],
"source": [
"for chunk in llm.stream(question):\n",
" sys.stdout.write(chunk)\n",
" sys.stdout.flush()"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Vertex Model Garden"
]
},
{
@@ -248,7 +381,7 @@
"metadata": {},
"outputs": [],
"source": [
"llm(\"What is the meaning of life?\")"
"print(llm(\"What is the meaning of life?\"))"
]
},
{
@@ -264,8 +397,6 @@
"metadata": {},
"outputs": [],
"source": [
"from langchain.prompts import PromptTemplate\n",
"\n",
"prompt = PromptTemplate.from_template(\"What is the meaning of {thing}?\")"
]
},
@@ -275,9 +406,8 @@
"metadata": {},
"outputs": [],
"source": [
"llm_oss_chain = prompt | llm\n",
"\n",
"llm_oss_chain.invoke({\"thing\": \"life\"})"
"chian = prompt | llm\n",
"print(chain.invoke({\"thing\": \"life\"}))"
]
}
],

View File

@@ -0,0 +1,216 @@
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# Gradient\n",
"\n",
"`Gradient` allows to fine tune and get completions on LLMs with a simple web API.\n",
"\n",
"This notebook goes over how to use Langchain with [Gradient](https://gradient.ai/).\n"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Imports"
]
},
{
"cell_type": "code",
"execution_count": 1,
"metadata": {},
"outputs": [],
"source": [
"import os\n",
"import requests\n",
"from langchain.llms import GradientLLM\n",
"from langchain.prompts import PromptTemplate\n",
"from langchain.chains import LLMChain"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Set the Environment API Key\n",
"Make sure to get your API key from Gradient AI. You are given $10 in free credits to test and fine-tune different models."
]
},
{
"cell_type": "code",
"execution_count": 2,
"metadata": {},
"outputs": [],
"source": [
"from getpass import getpass\n",
"\n",
"\n",
"if not os.environ.get(\"GRADIENT_ACCESS_TOKEN\",None):\n",
" # Access token under https://auth.gradient.ai/select-workspace\n",
" os.environ[\"GRADIENT_ACCESS_TOKEN\"] = getpass(\"gradient.ai access token:\")\n",
"if not os.environ.get(\"GRADIENT_WORKSPACE_ID\",None):\n",
" # `ID` listed in `$ gradient workspace list`\n",
" # also displayed after login at at https://auth.gradient.ai/select-workspace\n",
" os.environ[\"GRADIENT_WORKSPACE_ID\"] = getpass(\"gradient.ai workspace id:\")"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Optional: Validate your Enviroment variables ```GRADIENT_ACCESS_TOKEN``` and ```GRADIENT_WORKSPACE_ID``` to get currently deployed models."
]
},
{
"cell_type": "code",
"execution_count": 3,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Credentials valid.\n",
"Possible values for `model_id` are:\n",
" {'models': [{'id': '99148c6d-c2a0-4fbe-a4a7-e7c05bdb8a09_base_ml_model', 'name': 'bloom-560m', 'slug': 'bloom-560m', 'type': 'baseModel'}, {'id': 'f0b97d96-51a8-4040-8b22-7940ee1fa24e_base_ml_model', 'name': 'llama2-7b-chat', 'slug': 'llama2-7b-chat', 'type': 'baseModel'}, {'id': 'cc2dafce-9e6e-4a23-a918-cad6ba89e42e_base_ml_model', 'name': 'nous-hermes2', 'slug': 'nous-hermes2', 'type': 'baseModel'}, {'baseModelId': 'f0b97d96-51a8-4040-8b22-7940ee1fa24e_base_ml_model', 'id': 'bb7b9865-0ce3-41a8-8e2b-5cbcbe1262eb_model_adapter', 'name': 'optical-transmitting-sensor', 'type': 'modelAdapter'}]}\n"
]
}
],
"source": [
"import requests\n",
"\n",
"resp = requests.get(f'https://api.gradient.ai/api/models', headers={\n",
" \"authorization\": f\"Bearer {os.environ['GRADIENT_ACCESS_TOKEN']}\",\n",
" \"x-gradient-workspace-id\": f\"{os.environ['GRADIENT_WORKSPACE_ID']}\",\n",
" },\n",
" )\n",
"if resp.status_code == 200:\n",
" models = resp.json()\n",
" print(\"Credentials valid.\\nPossible values for `model_id` are:\\n\", models)\n",
"else:\n",
" print(\"Error when listing models. Are your credentials valid?\", resp.text)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Create the Gradient instance\n",
"You can specify different parameters such as the model name, max tokens generated, temperature, etc."
]
},
{
"cell_type": "code",
"execution_count": 4,
"metadata": {},
"outputs": [],
"source": [
"llm = GradientLLM(\n",
" # `ID` listed in `$ gradient model list`\n",
" model_id=\"99148c6d-c2a0-4fbe-a4a7-e7c05bdb8a09_base_ml_model\",\n",
" # # optional: set new credentials, they default to environment variables\n",
" # gradient_workspace_id=os.environ[\"GRADIENT_WORKSPACE_ID\"],\n",
" # gradient_access_token=os.environ[\"GRADIENT_ACCESS_TOKEN\"],\n",
")"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Create a Prompt Template\n",
"We will create a prompt template for Question and Answer."
]
},
{
"cell_type": "code",
"execution_count": 5,
"metadata": {},
"outputs": [],
"source": [
"template = \"\"\"Question: {question}\n",
"\n",
"Answer: Let's think step by step.\"\"\"\n",
"\n",
"prompt = PromptTemplate(template=template, input_variables=[\"question\"])"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Initiate the LLMChain"
]
},
{
"cell_type": "code",
"execution_count": 6,
"metadata": {},
"outputs": [],
"source": [
"llm_chain = LLMChain(prompt=prompt, llm=llm)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Run the LLMChain\n",
"Provide a question and run the LLMChain."
]
},
{
"cell_type": "code",
"execution_count": 7,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"' The first team to win the Super Bowl was the New England Patriots. The Patriots won the'"
]
},
"execution_count": 7,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"question = \"What NFL team won the Super Bowl in 1994?\"\n",
"\n",
"llm_chain.run(\n",
" question=question\n",
")"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.13"
},
"vscode": {
"interpreter": {
"hash": "a0a0263b650d907a3bfe41c0f8d6a63a071b884df3cfdc1579f00cdc1aed6b03"
}
}
},
"nbformat": 4,
"nbformat_minor": 4
}

View File

@@ -46,7 +46,7 @@
},
{
"cell_type": "code",
"execution_count": 6,
"execution_count": null,
"id": "165ae236-962a-4763-8052-c4836d78a5d2",
"metadata": {
"tags": []
@@ -75,18 +75,10 @@
},
{
"cell_type": "code",
"execution_count": 7,
"execution_count": null,
"id": "3acf0069",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
" First, we need to understand what is an electroencephalogram. An electroencephalogram is a recording of brain activity. It is a recording of brain activity that is made by placing electrodes on the scalp. The electrodes are placed\n"
]
}
],
"outputs": [],
"source": [
"from langchain.prompts import PromptTemplate\n",
"\n",
@@ -101,6 +93,42 @@
"\n",
"print(chain.invoke({\"question\": question}))"
]
},
{
"cell_type": "markdown",
"id": "dbbc3a37",
"metadata": {},
"source": [
"### Batch GPU Inference\n",
"\n",
"If running on a device with GPU, you can also run inference on the GPU in batch mode."
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "097ba62f",
"metadata": {},
"outputs": [],
"source": [
"gpu_llm = HuggingFacePipeline.from_model_id(\n",
" model_id=\"bigscience/bloom-1b7\",\n",
" task=\"text-generation\",\n",
" device=0, # -1 for CPU\n",
" batch_size=2, # adjust as needed based on GPU map and model size.\n",
" model_kwargs={\"temperature\": 0, \"max_length\": 64},\n",
")\n",
"\n",
"gpu_chain = prompt | gpu_llm.bind(stop=[\"\\n\\n\"])\n",
"\n",
"questions = []\n",
"for i in range(4):\n",
" questions.append({\"question\": f\"What is the number {i} in french?\"})\n",
"\n",
"answers = gpu_chain.batch(questions)\n",
"for answer in answers:\n",
" print(answer)"
]
}
],
"metadata": {
@@ -119,7 +147,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.2"
"version": "3.8.10"
}
},
"nbformat": 4,

View File

@@ -0,0 +1,93 @@
---
sidebar_position: 0
sidebar_class_name: hidden
---
# LLMs
import DocCardList from "@theme/DocCardList";
## Features (natively supported)
All LLMs implement the Runnable interface, which comes with default implementations of all methods, ie. `ainvoke`, `batch`, `abatch`, `stream`, `astream`. This gives all LLMs basic support for async, streaming and batch, which by default is implemented as below:
- *Async* support defaults to calling the respective sync method in asyncio's default thread pool executor. This lets other async functions in your application make progress while the LLM is being executed, by moving this call to a background thread.
- *Streaming* support defaults to returning an `Iterator` (or `AsyncIterator` in the case of async streaming) of a single value, the final result returned by the underlying LLM provider. This obviously doesn't give you token-by-token streaming, which requires native support from the LLM provider, but ensures your code that expects an iterator of tokens can work for any of our LLM integrations.
- *Batch* support defaults to calling the underlying LLM in parallel for each input by making use of a thread pool executor (in the sync batch case) or `asyncio.gather` (in the async batch case). The concurrency can be controlled with the `max_concurrency` key in `RunnableConfig`.
Each LLM integration can optionally provide native implementations for async, streaming or batch, which, for providers that support it, can be more efficient. The table shows, for each integration, which features have been implemented with native support.
Model|Invoke|Async invoke|Stream|Async stream|Batch|Async batch
:-|:-:|:-:|:-:|:-:|:-:|:-:
AI21|✅|❌|❌|❌|❌|❌
AlephAlpha|✅|❌|❌|❌|❌|❌
AmazonAPIGateway|✅|❌|❌|❌|❌|❌
Anthropic|✅|✅|✅|✅|❌|❌
Anyscale|✅|❌|❌|❌|❌|❌
Aviary|✅|❌|❌|❌|❌|❌
AzureMLOnlineEndpoint|✅|❌|❌|❌|❌|❌
AzureOpenAI|✅|✅|✅|✅|✅|✅
Banana|✅|❌|❌|❌|❌|❌
Baseten|✅|❌|❌|❌|❌|❌
Beam|✅|❌|❌|❌|❌|❌
Bedrock|✅|❌|✅|❌|❌|❌
CTransformers|✅|✅|❌|❌|❌|❌
CTranslate2|✅|❌|❌|❌|✅|❌
CerebriumAI|✅|❌|❌|❌|❌|❌
ChatGLM|✅|❌|❌|❌|❌|❌
Clarifai|✅|❌|❌|❌|❌|❌
Cohere|✅|✅|❌|❌|❌|❌
Databricks|✅|❌|❌|❌|❌|❌
DeepInfra|✅|❌|❌|❌|❌|❌
DeepSparse|✅|❌|❌|❌|❌|❌
EdenAI|✅|✅|❌|❌|❌|❌
Fireworks|✅|✅|❌|❌|✅|✅
FireworksChat|✅|✅|❌|❌|✅|✅
ForefrontAI|✅|❌|❌|❌|❌|❌
GPT4All|✅|❌|❌|❌|❌|❌
GooglePalm|✅|❌|❌|❌|✅|❌
GooseAI|✅|❌|❌|❌|❌|❌
GradientLLM|✅|✅|❌|❌|❌|❌
HuggingFaceEndpoint|✅|❌|❌|❌|❌|❌
HuggingFaceHub|✅|❌|❌|❌|❌|❌
HuggingFacePipeline|✅|❌|❌|❌|❌|❌
HuggingFaceTextGenInference|✅|✅|✅|✅|❌|❌
HumanInputLLM|✅|❌|❌|❌|❌|❌
JavelinAIGateway|✅|✅|❌|❌|❌|❌
KoboldApiLLM|✅|❌|❌|❌|❌|❌
LlamaCpp|✅|❌|✅|❌|❌|❌
ManifestWrapper|✅|❌|❌|❌|❌|❌
Minimax|✅|❌|❌|❌|❌|❌
MlflowAIGateway|✅|❌|❌|❌|❌|❌
Modal|✅|❌|❌|❌|❌|❌
MosaicML|✅|❌|❌|❌|❌|❌
NIBittensorLLM|✅|❌|❌|❌|❌|❌
NLPCloud|✅|❌|❌|❌|❌|❌
Nebula|✅|❌|❌|❌|❌|❌
OctoAIEndpoint|✅|❌|❌|❌|❌|❌
Ollama|✅|❌|❌|❌|❌|❌
OpaquePrompts|✅|❌|❌|❌|❌|❌
OpenAI|✅|✅|✅|✅|✅|✅
OpenLLM|✅|✅|❌|❌|❌|❌
OpenLM|✅|✅|✅|✅|✅|✅
Petals|✅|❌|❌|❌|❌|❌
PipelineAI|✅|❌|❌|❌|❌|❌
Predibase|✅|❌|❌|❌|❌|❌
PredictionGuard|✅|❌|❌|❌|❌|❌
PromptLayerOpenAI|✅|❌|❌|❌|❌|❌
QianfanLLMEndpoint|✅|✅|✅|✅|❌|❌
RWKV|✅|❌|❌|❌|❌|❌
Replicate|✅|❌|✅|❌|❌|❌
SagemakerEndpoint|✅|❌|❌|❌|❌|❌
SelfHostedHuggingFaceLLM|✅|❌|❌|❌|❌|❌
SelfHostedPipeline|✅|❌|❌|❌|❌|❌
StochasticAI|✅|❌|❌|❌|❌|❌
TextGen|✅|❌|❌|❌|❌|❌
TitanTakeoff|✅|❌|✅|❌|❌|❌
Tongyi|✅|❌|❌|❌|❌|❌
VLLM|✅|❌|❌|❌|✅|❌
VLLMOpenAI|✅|✅|✅|✅|✅|✅
VertexAI|✅|✅|✅|❌|✅|✅
VertexAIModelGarden|✅|✅|❌|❌|✅|✅
Writer|✅|❌|❌|❌|❌|❌
Xinference|✅|❌|❌|❌|❌|❌
<DocCardList />

View File

@@ -0,0 +1,242 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "62bacc68-1976-44eb-9316-d5baf54bf595",
"metadata": {},
"source": [
"# Javelin AI Gateway Tutorial\n",
"\n",
"This Jupyter Notebook will explore how to interact with the Javelin AI Gateway using the Python SDK. \n",
"The Javelin AI Gateway facilitates the utilization of large language models (LLMs) like OpenAI, Cohere, Anthropic, and others by \n",
"providing a secure and unified endpoint. The gateway itself provides a centralized mechanism to roll out models systematically, \n",
"provide access security, policy & cost guardrails for enterprises, etc., \n",
"\n",
"For a complete listing of all the features & benefits of Javelin, please visit www.getjavelin.io\n",
"\n"
]
},
{
"cell_type": "markdown",
"id": "e52185f8-132b-4585-b73d-6fee928ac199",
"metadata": {},
"source": [
"## Step 1: Introduction\n",
"[The Javelin AI Gateway](https://www.getjavelin.io) is an enterprise-grade API Gateway for AI applications. It integrates robust access security, ensuring secure interactions with large language models. Learn more in the [official documentation](https://docs.getjavelin.io).\n"
]
},
{
"cell_type": "markdown",
"id": "2e2acdb3-e3b8-422b-b077-7a0d63d18349",
"metadata": {},
"source": [
"## Step 2: Installation\n",
"Before we begin, we must install the `javelin_sdk` and set up the Javelin API key as an environment variable. "
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "e91518a4-43ce-443e-b4c0-dbc652eb749f",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Requirement already satisfied: javelin_sdk in /usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages (0.1.8)\n",
"Requirement already satisfied: httpx<0.25.0,>=0.24.0 in /usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages (from javelin_sdk) (0.24.1)\n",
"Requirement already satisfied: pydantic<2.0.0,>=1.10.7 in /usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages (from javelin_sdk) (1.10.12)\n",
"Requirement already satisfied: certifi in /usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages (from httpx<0.25.0,>=0.24.0->javelin_sdk) (2023.5.7)\n",
"Requirement already satisfied: httpcore<0.18.0,>=0.15.0 in /usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages (from httpx<0.25.0,>=0.24.0->javelin_sdk) (0.17.3)\n",
"Requirement already satisfied: idna in /usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages (from httpx<0.25.0,>=0.24.0->javelin_sdk) (3.4)\n",
"Requirement already satisfied: sniffio in /usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages (from httpx<0.25.0,>=0.24.0->javelin_sdk) (1.3.0)\n",
"Requirement already satisfied: typing-extensions>=4.2.0 in /usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages (from pydantic<2.0.0,>=1.10.7->javelin_sdk) (4.7.1)\n",
"Requirement already satisfied: h11<0.15,>=0.13 in /usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages (from httpcore<0.18.0,>=0.15.0->httpx<0.25.0,>=0.24.0->javelin_sdk) (0.14.0)\n",
"Requirement already satisfied: anyio<5.0,>=3.0 in /usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages (from httpcore<0.18.0,>=0.15.0->httpx<0.25.0,>=0.24.0->javelin_sdk) (3.7.1)\n",
"Note: you may need to restart the kernel to use updated packages.\n"
]
}
],
"source": [
"pip install 'javelin_sdk'"
]
},
{
"cell_type": "markdown",
"id": "53b546dc-9ca3-4602-9a7b-d733d99e8e2f",
"metadata": {},
"source": [
"## Step 3: Completions Example\n",
"This section will demonstrate how to interact with the Javelin AI Gateway to get completions from a large language model. Here is a Python script that demonstrates this:\n",
"(note) assumes that you have setup a route in the gateway called 'eng_dept03'"
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "d36949f0-5354-44ca-9a31-70c769344319",
"metadata": {},
"outputs": [
{
"ename": "ImportError",
"evalue": "cannot import name 'JavelinAIGateway' from 'langchain.llms' (/usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages/langchain/llms/__init__.py)",
"output_type": "error",
"traceback": [
"\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
"\u001b[0;31mImportError\u001b[0m Traceback (most recent call last)",
"Cell \u001b[0;32mIn[6], line 2\u001b[0m\n\u001b[1;32m 1\u001b[0m \u001b[38;5;28;01mfrom\u001b[39;00m \u001b[38;5;21;01mlangchain\u001b[39;00m\u001b[38;5;21;01m.\u001b[39;00m\u001b[38;5;21;01mchains\u001b[39;00m \u001b[38;5;28;01mimport\u001b[39;00m LLMChain\n\u001b[0;32m----> 2\u001b[0m \u001b[38;5;28;01mfrom\u001b[39;00m \u001b[38;5;21;01mlangchain\u001b[39;00m\u001b[38;5;21;01m.\u001b[39;00m\u001b[38;5;21;01mllms\u001b[39;00m \u001b[38;5;28;01mimport\u001b[39;00m JavelinAIGateway\n\u001b[1;32m 3\u001b[0m \u001b[38;5;28;01mfrom\u001b[39;00m \u001b[38;5;21;01mlangchain\u001b[39;00m\u001b[38;5;21;01m.\u001b[39;00m\u001b[38;5;21;01mprompts\u001b[39;00m \u001b[38;5;28;01mimport\u001b[39;00m PromptTemplate\n\u001b[1;32m 5\u001b[0m route_completions \u001b[38;5;241m=\u001b[39m \u001b[38;5;124m\"\u001b[39m\u001b[38;5;124meng_dept03\u001b[39m\u001b[38;5;124m\"\u001b[39m\n",
"\u001b[0;31mImportError\u001b[0m: cannot import name 'JavelinAIGateway' from 'langchain.llms' (/usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages/langchain/llms/__init__.py)"
]
}
],
"source": [
"from langchain.chains import LLMChain\n",
"from langchain.llms import JavelinAIGateway\n",
"from langchain.prompts import PromptTemplate\n",
"\n",
"route_completions = \"eng_dept03\"\n",
"\n",
"gateway = JavelinAIGateway(\n",
" gateway_uri=\"http://localhost:8000\", # replace with service URL or host/port of Javelin\n",
" route=route_completions,\n",
" model_name=\"text-davinci-003\",\n",
")\n",
"\n",
"prompt = PromptTemplate(\"Translate the following English text to French: {text}\")\n",
"\n",
"llmchain = LLMChain(llm=gateway, prompt=prompt)\n",
"result = llmchain.run(\"podcast player\")\n",
"\n",
"print(result)\n"
]
},
{
"cell_type": "markdown",
"id": "6b63fe93-2e77-4ea9-b8e7-dec2b96b8e95",
"metadata": {},
"source": [
"# Step 4: Embeddings Example\n",
"This section demonstrates how to use the Javelin AI Gateway to obtain embeddings for text queries and documents. Here is a Python script that illustrates this:\n",
"(note) assumes that you have setup a route in the gateway called 'embeddings'"
]
},
{
"cell_type": "code",
"execution_count": 9,
"id": "878e6c1d-be7f-49de-825c-43c266c8714e",
"metadata": {},
"outputs": [
{
"ename": "ImportError",
"evalue": "cannot import name 'JavelinAIGatewayEmbeddings' from 'langchain.embeddings' (/usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages/langchain/embeddings/__init__.py)",
"output_type": "error",
"traceback": [
"\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
"\u001b[0;31mImportError\u001b[0m Traceback (most recent call last)",
"Cell \u001b[0;32mIn[9], line 1\u001b[0m\n\u001b[0;32m----> 1\u001b[0m \u001b[38;5;28;01mfrom\u001b[39;00m \u001b[38;5;21;01mlangchain\u001b[39;00m\u001b[38;5;21;01m.\u001b[39;00m\u001b[38;5;21;01membeddings\u001b[39;00m \u001b[38;5;28;01mimport\u001b[39;00m JavelinAIGatewayEmbeddings\n\u001b[1;32m 2\u001b[0m \u001b[38;5;28;01mfrom\u001b[39;00m \u001b[38;5;21;01mlangchain\u001b[39;00m\u001b[38;5;21;01m.\u001b[39;00m\u001b[38;5;21;01membeddings\u001b[39;00m\u001b[38;5;21;01m.\u001b[39;00m\u001b[38;5;21;01mopenai\u001b[39;00m \u001b[38;5;28;01mimport\u001b[39;00m OpenAIEmbeddings\n\u001b[1;32m 4\u001b[0m embeddings \u001b[38;5;241m=\u001b[39m JavelinAIGatewayEmbeddings(\n\u001b[1;32m 5\u001b[0m gateway_uri\u001b[38;5;241m=\u001b[39m\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mhttp://localhost:8000\u001b[39m\u001b[38;5;124m\"\u001b[39m, \u001b[38;5;66;03m# replace with service URL or host/port of Javelin\u001b[39;00m\n\u001b[1;32m 6\u001b[0m route\u001b[38;5;241m=\u001b[39m\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124membeddings\u001b[39m\u001b[38;5;124m\"\u001b[39m,\n\u001b[1;32m 7\u001b[0m )\n",
"\u001b[0;31mImportError\u001b[0m: cannot import name 'JavelinAIGatewayEmbeddings' from 'langchain.embeddings' (/usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages/langchain/embeddings/__init__.py)"
]
}
],
"source": [
"from langchain.embeddings import JavelinAIGatewayEmbeddings\n",
"from langchain.embeddings.openai import OpenAIEmbeddings\n",
"\n",
"embeddings = JavelinAIGatewayEmbeddings(\n",
" gateway_uri=\"http://localhost:8000\", # replace with service URL or host/port of Javelin\n",
" route=\"embeddings\",\n",
")\n",
"\n",
"print(embeddings.embed_query(\"hello\"))\n",
"print(embeddings.embed_documents([\"hello\"]))\n"
]
},
{
"cell_type": "markdown",
"id": "07c6691b-d333-4598-b2b7-c0933ed75937",
"metadata": {},
"source": [
"# Step 5: Chat Example\n",
"This section illustrates how to interact with the Javelin AI Gateway to facilitate a chat with a large language model. Here is a Python script that demonstrates this:\n",
"(note) assumes that you have setup a route in the gateway called 'mychatbot_route'"
]
},
{
"cell_type": "code",
"execution_count": 8,
"id": "653ef88c-36cd-4730-9c12-43c246b551f1",
"metadata": {},
"outputs": [
{
"ename": "ImportError",
"evalue": "cannot import name 'ChatJavelinAIGateway' from 'langchain.chat_models' (/usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages/langchain/chat_models/__init__.py)",
"output_type": "error",
"traceback": [
"\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
"\u001b[0;31mImportError\u001b[0m Traceback (most recent call last)",
"Cell \u001b[0;32mIn[8], line 1\u001b[0m\n\u001b[0;32m----> 1\u001b[0m \u001b[38;5;28;01mfrom\u001b[39;00m \u001b[38;5;21;01mlangchain\u001b[39;00m\u001b[38;5;21;01m.\u001b[39;00m\u001b[38;5;21;01mchat_models\u001b[39;00m \u001b[38;5;28;01mimport\u001b[39;00m ChatJavelinAIGateway\n\u001b[1;32m 2\u001b[0m \u001b[38;5;28;01mfrom\u001b[39;00m \u001b[38;5;21;01mlangchain\u001b[39;00m\u001b[38;5;21;01m.\u001b[39;00m\u001b[38;5;21;01mschema\u001b[39;00m \u001b[38;5;28;01mimport\u001b[39;00m HumanMessage, SystemMessage\n\u001b[1;32m 4\u001b[0m messages \u001b[38;5;241m=\u001b[39m [\n\u001b[1;32m 5\u001b[0m SystemMessage(\n\u001b[1;32m 6\u001b[0m content\u001b[38;5;241m=\u001b[39m\u001b[38;5;124m\"\u001b[39m\u001b[38;5;124mYou are a helpful assistant that translates English to French.\u001b[39m\u001b[38;5;124m\"\u001b[39m\n\u001b[0;32m (...)\u001b[0m\n\u001b[1;32m 10\u001b[0m ),\n\u001b[1;32m 11\u001b[0m ]\n",
"\u001b[0;31mImportError\u001b[0m: cannot import name 'ChatJavelinAIGateway' from 'langchain.chat_models' (/usr/local/Caskroom/miniconda/base/lib/python3.11/site-packages/langchain/chat_models/__init__.py)"
]
}
],
"source": [
"from langchain.chat_models import ChatJavelinAIGateway\n",
"from langchain.schema import HumanMessage, SystemMessage\n",
"\n",
"messages = [\n",
" SystemMessage(\n",
" content=\"You are a helpful assistant that translates English to French.\"\n",
" ),\n",
" HumanMessage(\n",
" content=\"Artificial Intelligence has the power to transform humanity and make the world a better place\"\n",
" ),\n",
"]\n",
"\n",
"chat = ChatJavelinAIGateway(\n",
" gateway_uri=\"http://localhost:8000\", # replace with service URL or host/port of Javelin\n",
" route=\"mychatbot_route\",\n",
" model_name=\"gpt-3.5-turbo\",\n",
" params={\n",
" \"temperature\": 0.1\n",
" }\n",
")\n",
"\n",
"print(chat(messages))\n"
]
},
{
"cell_type": "markdown",
"id": "6eb9cf33-6505-4e05-808b-645856463a8e",
"metadata": {},
"source": [
"Step 6: Conclusion\n",
"This tutorial introduced the Javelin AI Gateway and demonstrated how to interact with it using the Python SDK. \n",
"Remember to check the Javelin [Python SDK](https://www.github.com/getjavelin.io/javelin-python) for more examples and to explore the official documentation for additional details.\n",
"\n",
"Happy coding!"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.4"
}
},
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -95,7 +95,7 @@
{
"data": {
"text/plain": [
"'\\n\\nWhy did the chicken cross the road?\\n\\nTo get to the other side.'"
"\"\\n\\nWhy couldn't the bicycle stand up by itself? It was...two tired!\""
]
},
"execution_count": 7,
@@ -811,6 +811,228 @@
"langchain.llm_cache = SQLAlchemyCache(engine, FulltextLLMCache)"
]
},
{
"cell_type": "markdown",
"id": "eeba7d60",
"metadata": {},
"source": [
"## `Cassandra` caches\n",
"\n",
"You can use Cassandra / Astra DB for caching LLM responses, choosing from the exact-match `CassandraCache` or the (vector-similarity-based) `CassandraSemanticCache`.\n",
"\n",
"Let's see both in action in the following cells."
]
},
{
"cell_type": "markdown",
"id": "a4a6725d",
"metadata": {},
"source": [
"#### Connect to the DB\n",
"\n",
"First you need to establish a `Session` to the DB and to specify a _keyspace_ for the cache table(s). The following gets you started with an Astra DB instance (see e.g. [here](https://cassio.org/start_here/#vector-database) for more backends and connection options)."
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "cc53ce1b",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"Keyspace name? my_keyspace\n",
"\n",
"Astra DB Token (\"AstraCS:...\") ········\n",
"Full path to your Secure Connect Bundle? /path/to/secure-connect-databasename.zip\n"
]
}
],
"source": [
"import getpass\n",
"\n",
"keyspace = input(\"\\nKeyspace name? \")\n",
"ASTRA_DB_APPLICATION_TOKEN = getpass.getpass('\\nAstra DB Token (\"AstraCS:...\") ')\n",
"ASTRA_DB_SECURE_BUNDLE_PATH = input(\"Full path to your Secure Connect Bundle? \")"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "4617f485",
"metadata": {},
"outputs": [],
"source": [
"from cassandra.cluster import Cluster\n",
"from cassandra.auth import PlainTextAuthProvider\n",
"\n",
"cluster = Cluster(\n",
" cloud={\n",
" \"secure_connect_bundle\": ASTRA_DB_SECURE_BUNDLE_PATH,\n",
" },\n",
" auth_provider=PlainTextAuthProvider(\"token\", ASTRA_DB_APPLICATION_TOKEN),\n",
")\n",
"session = cluster.connect()"
]
},
{
"cell_type": "markdown",
"id": "8665664a",
"metadata": {},
"source": [
"### Exact cache\n",
"\n",
"This will avoid invoking the LLM when the supplied prompt is _exactly_ the same as one encountered already:"
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "00a5e66f",
"metadata": {},
"outputs": [],
"source": [
"import langchain\n",
"from langchain.cache import CassandraCache\n",
"\n",
"langchain.llm_cache = CassandraCache(session=session, keyspace=keyspace)"
]
},
{
"cell_type": "code",
"execution_count": 11,
"id": "956a5145",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"The Moon always shows the same side because it is tidally locked to Earth.\n",
"CPU times: user 41.7 ms, sys: 153 µs, total: 41.8 ms\n",
"Wall time: 1.96 s\n"
]
}
],
"source": [
"%%time\n",
"\n",
"print(llm(\"Why is the Moon always showing the same side?\"))"
]
},
{
"cell_type": "code",
"execution_count": 12,
"id": "158f0151",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"The Moon always shows the same side because it is tidally locked to Earth.\n",
"CPU times: user 4.09 ms, sys: 0 ns, total: 4.09 ms\n",
"Wall time: 119 ms\n"
]
}
],
"source": [
"%%time\n",
"\n",
"print(llm(\"Why is the Moon always showing the same side?\"))"
]
},
{
"cell_type": "markdown",
"id": "8fc4d017",
"metadata": {},
"source": [
"### Semantic cache\n",
"\n",
"This cache will do a semantic similarity search and return a hit if it finds a cached entry that is similar enough, For this, you need to provide an `Embeddings` instance of your choice."
]
},
{
"cell_type": "code",
"execution_count": 13,
"id": "b9ad3f54",
"metadata": {},
"outputs": [],
"source": [
"from langchain.embeddings import OpenAIEmbeddings\n",
"\n",
"embedding=OpenAIEmbeddings()"
]
},
{
"cell_type": "code",
"execution_count": 14,
"id": "4623f95e",
"metadata": {},
"outputs": [],
"source": [
"from langchain.cache import CassandraSemanticCache\n",
"\n",
"langchain.llm_cache = CassandraSemanticCache(\n",
" session=session, keyspace=keyspace, embedding=embedding, table_name=\"cass_sem_cache\"\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 15,
"id": "1a8e577b",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"The Moon always shows the same side because it is tidally locked with Earth. This means that the same side of the Moon always faces Earth.\n",
"CPU times: user 21.3 ms, sys: 177 µs, total: 21.4 ms\n",
"Wall time: 3.09 s\n"
]
}
],
"source": [
"%%time\n",
"\n",
"print(llm(\"Why is the Moon always showing the same side?\"))"
]
},
{
"cell_type": "code",
"execution_count": 16,
"id": "f7abddfd",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"The Moon always shows the same side because it is tidally locked with Earth. This means that the same side of the Moon always faces Earth.\n",
"CPU times: user 10.9 ms, sys: 17 µs, total: 10.9 ms\n",
"Wall time: 461 ms\n"
]
}
],
"source": [
"%%time\n",
"\n",
"print(llm(\"How come we always see one face of the moon?\"))"
]
},
{
"cell_type": "markdown",
"id": "0c69d84d",

View File

@@ -1,350 +1,352 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "91c6a7ef",
"metadata": {},
"source": [
"# DynamoDB Chat Message History\n",
"\n",
"This notebook goes over how to use DynamoDB to store chat message history."
]
},
{
"cell_type": "markdown",
"id": "3f608be0",
"metadata": {},
"source": [
"First make sure you have correctly configured the [AWS CLI](https://docs.aws.amazon.com/cli/latest/userguide/cli-chap-configure.html). Then make sure you have installed boto3."
]
},
{
"cell_type": "markdown",
"id": "030d784f",
"metadata": {},
"source": [
"Next, create the DynamoDB table where we will be storing messages:"
]
},
{
"cell_type": "code",
"execution_count": 10,
"id": "93ce1811",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"0\n"
]
}
],
"source": [
"import boto3\n",
"\n",
"# Get the service resource.\n",
"dynamodb = boto3.resource(\"dynamodb\")\n",
"\n",
"# Create the DynamoDB table.\n",
"table = dynamodb.create_table(\n",
" TableName=\"SessionTable\",\n",
" KeySchema=[{\"AttributeName\": \"SessionId\", \"KeyType\": \"HASH\"}],\n",
" AttributeDefinitions=[{\"AttributeName\": \"SessionId\", \"AttributeType\": \"S\"}],\n",
" BillingMode=\"PAY_PER_REQUEST\",\n",
")\n",
"\n",
"# Wait until the table exists.\n",
"table.meta.client.get_waiter(\"table_exists\").wait(TableName=\"SessionTable\")\n",
"\n",
"# Print out some data about the table.\n",
"print(table.item_count)"
]
},
{
"cell_type": "markdown",
"id": "1a9b310b",
"metadata": {},
"source": [
"## DynamoDBChatMessageHistory"
]
},
{
"cell_type": "code",
"execution_count": 11,
"id": "d15e3302",
"metadata": {},
"outputs": [],
"source": [
"from langchain.memory.chat_message_histories import DynamoDBChatMessageHistory\n",
"\n",
"history = DynamoDBChatMessageHistory(table_name=\"SessionTable\", session_id=\"0\")\n",
"\n",
"history.add_user_message(\"hi!\")\n",
"\n",
"history.add_ai_message(\"whats up?\")"
]
},
{
"cell_type": "code",
"execution_count": 12,
"id": "64fc465e",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": "[HumanMessage(content='hi!', additional_kwargs={}, example=False),\n AIMessage(content='whats up?', additional_kwargs={}, example=False),\n HumanMessage(content='hi!', additional_kwargs={}, example=False),\n AIMessage(content='whats up?', additional_kwargs={}, example=False)]"
},
"execution_count": 12,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"history.messages"
]
},
{
"cell_type": "markdown",
"id": "955f1b15",
"metadata": {},
"source": [
"## DynamoDBChatMessageHistory with Custom Endpoint URL\n",
"\n",
"Sometimes it is useful to specify the URL of the AWS endpoint to connect to, for instance when you are running locally against [Localstack](https://localstack.cloud/). For those cases you can specify the URL via the `endpoint_url` parameter in the constructor."
]
},
{
"cell_type": "code",
"execution_count": 13,
"id": "225713c8",
"metadata": {},
"outputs": [],
"source": [
"from langchain.memory.chat_message_histories import DynamoDBChatMessageHistory\n",
"\n",
"history = DynamoDBChatMessageHistory(\n",
" table_name=\"SessionTable\",\n",
" session_id=\"0\",\n",
" endpoint_url=\"http://localhost.localstack.cloud:4566\",\n",
")"
]
},
{
"cell_type": "markdown",
"id": "c9bc0693",
"metadata": {
"collapsed": false
},
"source": [
"## DynamoDBChatMessageHistory with Different Keys and Composite Keys\n",
"The default key for DynamoDBChatMessageHistory is ```{\"SessionId\": self.session_id}```, but you can modify this to match your table design.\n",
"\n",
"### Primary Key Name\n",
"You may modify the primary key by passing in a primary_key_name value in the constructor, resulting in the following:\n",
"```{self.primary_key_name: self.session_id}```\n",
"\n",
"### Composite Keys\n",
"When using an existing DynamoDB table, you may need to modify the key structure from the default to something that includes a Sort Key. To do this you may use the ```key``` parameter.\n",
"\n",
"Passing a value for key will override the primary_key_name parameter, and the resulting key structure will be the passed value.\n"
]
},
{
"cell_type": "code",
"execution_count": 14,
"id": "a7fa0331",
"metadata": {
"collapsed": false
},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"0\n"
]
},
{
"data": {
"text/plain": "[HumanMessage(content='hello, composite dynamodb table!', additional_kwargs={}, example=False)]"
},
"execution_count": 14,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"from langchain.memory.chat_message_histories import DynamoDBChatMessageHistory\n",
"\n",
"composite_table = dynamodb.create_table(\n",
" TableName=\"CompositeTable\",\n",
" KeySchema=[{\"AttributeName\": \"PK\", \"KeyType\": \"HASH\"}, {\"AttributeName\": \"SK\", \"KeyType\": \"RANGE\"}],\n",
" AttributeDefinitions=[{\"AttributeName\": \"PK\", \"AttributeType\": \"S\"}, {\"AttributeName\": \"SK\", \"AttributeType\": \"S\"}],\n",
" BillingMode=\"PAY_PER_REQUEST\",\n",
")\n",
"\n",
"# Wait until the table exists.\n",
"composite_table.meta.client.get_waiter(\"table_exists\").wait(TableName=\"CompositeTable\")\n",
"\n",
"# Print out some data about the table.\n",
"print(composite_table.item_count)\n",
"\n",
"my_key = {\n",
" \"PK\": \"session_id::0\",\n",
" \"SK\": \"langchain_history\",\n",
"}\n",
"\n",
"composite_key_history = DynamoDBChatMessageHistory(\n",
" table_name=\"CompositeTable\",\n",
" session_id=\"0\",\n",
" endpoint_url=\"http://localhost.localstack.cloud:4566\",\n",
" key=my_key,\n",
")\n",
"\n",
"composite_key_history.add_user_message(\"hello, composite dynamodb table!\")\n",
"\n",
"composite_key_history.messages"
]
},
{
"attachments": {},
"cell_type": "markdown",
"id": "3b33c988",
"metadata": {},
"source": [
"## Agent with DynamoDB Memory"
]
},
{
"cell_type": "code",
"execution_count": 15,
"id": "f92d9499",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import Tool\n",
"from langchain.memory import ConversationBufferMemory\n",
"from langchain.chat_models import ChatOpenAI\n",
"from langchain.agents import initialize_agent\n",
"from langchain.agents import AgentType\n",
"from langchain.utilities import PythonREPL\n",
"from getpass import getpass\n",
"\n",
"message_history = DynamoDBChatMessageHistory(table_name=\"SessionTable\", session_id=\"1\")\n",
"memory = ConversationBufferMemory(\n",
" memory_key=\"chat_history\", chat_memory=message_history, return_messages=True\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 16,
"id": "1167eeba",
"metadata": {},
"outputs": [],
"source": [
"python_repl = PythonREPL()\n",
"\n",
"# You can create the tool to pass to an agent\n",
"tools = [\n",
" Tool(\n",
" name=\"python_repl\",\n",
" description=\"A Python shell. Use this to execute python commands. Input should be a valid python command. If you want to see the output of a value, you should print it out with `print(...)`.\",\n",
" func=python_repl.run,\n",
" )\n",
"]"
]
},
{
"cell_type": "code",
"execution_count": 17,
"id": "fce085c5",
"metadata": {},
"outputs": [
{
"ename": "ValidationError",
"evalue": "1 validation error for ChatOpenAI\n__root__\n Did not find openai_api_key, please add an environment variable `OPENAI_API_KEY` which contains it, or pass `openai_api_key` as a named parameter. (type=value_error)",
"output_type": "error",
"traceback": [
"\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
"\u001b[0;31mValidationError\u001b[0m Traceback (most recent call last)",
"Cell \u001b[0;32mIn[17], line 1\u001b[0m\n\u001b[0;32m----> 1\u001b[0m llm \u001b[38;5;241m=\u001b[39m \u001b[43mChatOpenAI\u001b[49m\u001b[43m(\u001b[49m\u001b[43mtemperature\u001b[49m\u001b[38;5;241;43m=\u001b[39;49m\u001b[38;5;241;43m0\u001b[39;49m\u001b[43m)\u001b[49m\n\u001b[1;32m 2\u001b[0m agent_chain \u001b[38;5;241m=\u001b[39m initialize_agent(\n\u001b[1;32m 3\u001b[0m tools,\n\u001b[1;32m 4\u001b[0m llm,\n\u001b[0;32m (...)\u001b[0m\n\u001b[1;32m 7\u001b[0m memory\u001b[38;5;241m=\u001b[39mmemory,\n\u001b[1;32m 8\u001b[0m )\n",
"File \u001b[0;32m~/Documents/projects/langchain/libs/langchain/langchain/load/serializable.py:74\u001b[0m, in \u001b[0;36mSerializable.__init__\u001b[0;34m(self, **kwargs)\u001b[0m\n\u001b[1;32m 73\u001b[0m \u001b[38;5;28;01mdef\u001b[39;00m \u001b[38;5;21m__init__\u001b[39m(\u001b[38;5;28mself\u001b[39m, \u001b[38;5;241m*\u001b[39m\u001b[38;5;241m*\u001b[39mkwargs: Any) \u001b[38;5;241m-\u001b[39m\u001b[38;5;241m>\u001b[39m \u001b[38;5;28;01mNone\u001b[39;00m:\n\u001b[0;32m---> 74\u001b[0m \u001b[38;5;28;43msuper\u001b[39;49m\u001b[43m(\u001b[49m\u001b[43m)\u001b[49m\u001b[38;5;241;43m.\u001b[39;49m\u001b[38;5;21;43m__init__\u001b[39;49m\u001b[43m(\u001b[49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[38;5;241;43m*\u001b[39;49m\u001b[43mkwargs\u001b[49m\u001b[43m)\u001b[49m\n\u001b[1;32m 75\u001b[0m \u001b[38;5;28mself\u001b[39m\u001b[38;5;241m.\u001b[39m_lc_kwargs \u001b[38;5;241m=\u001b[39m kwargs\n",
"File \u001b[0;32m~/Documents/projects/langchain/.venv/lib/python3.9/site-packages/pydantic/main.py:341\u001b[0m, in \u001b[0;36mpydantic.main.BaseModel.__init__\u001b[0;34m()\u001b[0m\n",
"\u001b[0;31mValidationError\u001b[0m: 1 validation error for ChatOpenAI\n__root__\n Did not find openai_api_key, please add an environment variable `OPENAI_API_KEY` which contains it, or pass `openai_api_key` as a named parameter. (type=value_error)"
]
}
],
"source": [
"llm = ChatOpenAI(temperature=0)\n",
"agent_chain = initialize_agent(\n",
" tools,\n",
" llm,\n",
" agent=AgentType.CHAT_CONVERSATIONAL_REACT_DESCRIPTION,\n",
" verbose=True,\n",
" memory=memory,\n",
")"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "952a3103",
"metadata": {},
"outputs": [],
"source": [
"agent_chain.run(input=\"Hello!\")"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "54c4aaf4",
"metadata": {},
"outputs": [],
"source": [
"agent_chain.run(input=\"Who owns Twitter?\")"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "f9013118",
"metadata": {},
"outputs": [],
"source": [
"agent_chain.run(input=\"My name is Bob.\")"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "405e5315",
"metadata": {},
"outputs": [],
"source": [
"agent_chain.run(input=\"Who am I?\")\n"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.3"
}
},
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -2,6 +2,35 @@
All functionality related to Google Platform
## LLMs
### Vertex AI
Access PaLM LLMs like `text-bison` and `code-bison` via Google Cloud.
```python
from langchain.llms import VertexAI
```
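For example, a minimal completion call might look like the sketch below (assuming your Google Cloud credentials and project are already configured for Vertex AI; the prompt text is just an illustration):
```python
from langchain.llms import VertexAI

# model_name is optional and defaults to text-bison
llm = VertexAI(model_name="text-bison")
print(llm("Suggest three names for a coffee shop."))
```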
### Model Garden
Access PaLM and hundreds of OSS models via Vertex AI Model Garden.
```python
from langchain.llms import VertexAIModelGarden
```
## Chat models
### Vertex AI
Access PaLM chat models like `chat-bison` and `codechat-bison` via Google Cloud.
```python
from langchain.chat_models import ChatVertexAI
```
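A chat call follows the same pattern; this sketch assumes the same Vertex AI setup and uses LangChain's message schema:
```python
from langchain.chat_models import ChatVertexAI
from langchain.schema import HumanMessage

# model_name defaults to chat-bison
chat = ChatVertexAI(model_name="chat-bison")
print(chat([HumanMessage(content="What does an API gateway do?")]))
```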
## Document Loader
### Google BigQuery

View File

@@ -5,7 +5,7 @@
>[Argilla](https://argilla.io/) is an open-source data curation platform for LLMs.
> Using Argilla, everyone can build robust language models through faster data curation
> using both human and machine feedback. We provide support for each step in the MLOps cycle,
> from data labeling to model monitoring.
> from data labelling to model monitoring.
## Installation and Setup

View File

@@ -13,12 +13,13 @@ Databricks embraces the LangChain ecosystem in various ways:
Databricks connector for the SQLDatabase Chain
----------------------------------------------
You can connect to [Databricks runtimes](https://docs.databricks.com/runtime/index.html) and [Databricks SQL](https://www.databricks.com/product/databricks-sql) using the SQLDatabase wrapper of LangChain. See the notebook [Connect to Databricks](/docs/ecosystem/integrations/databricks/databricks.html) for details.
You can connect to [Databricks runtimes](https://docs.databricks.com/runtime/index.html) and [Databricks SQL](https://www.databricks.com/product/databricks-sql) using the SQLDatabase wrapper of LangChain.
See the notebook [Connect to Databricks](/docs/use_cases/qa_structured/integrations/databricks) for details, or the minimal sketch below.
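The following sketch assumes you are running inside a Databricks notebook, so the workspace host and token are inferred from the runtime context, and that the `samples.nyctaxi` example dataset is available:
```python
from langchain.utilities import SQLDatabase

# from_databricks picks up connection details from the Databricks runtime
db = SQLDatabase.from_databricks(catalog="samples", schema="nyctaxi")
print(db.get_usable_table_names())
```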
Databricks MLflow integrates with LangChain
-------------------------------------------
MLflow is an open source platform to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry. See the notebook [MLflow Callback Handler](/docs/ecosystem/integrations/mlflow_tracking.ipynb) for details about MLflow's integration with LangChain.
MLflow is an open source platform to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry. See the notebook [MLflow Callback Handler](/docs/integrations/providers/mlflow_tracking) for details about MLflow's integration with LangChain.
Databricks provides a fully managed and hosted version of MLflow integrated with enterprise security features, high availability, and other Databricks workspace features such as experiment and run management and notebook revision capture. MLflow on Databricks offers an integrated experience for tracking and securing machine learning model training runs and running machine learning projects. See [MLflow guide](https://docs.databricks.com/mlflow/index.html) for more details.
@@ -27,7 +28,7 @@ Databricks MLflow makes it more convenient to develop LangChain applications on
Databricks MLflow AI Gateway
----------------------------
See [MLflow AI Gateway](/docs/ecosystem/integrations/mlflow_ai_gateway).
See [MLflow AI Gateway](/docs/integrations/providers/mlflow_ai_gateway).
Databricks as an LLM provider
-----------------------------

View File

@@ -18,7 +18,7 @@ Example: Run a single-node Elasticsearch instance with security disabled. This i
#### Deploy Elasticsearch on Elastic Cloud
Elastic Cloud is a managed Elasticsearch service. Signup for a [free trial](https://cloud.elastic.co/registration?storm=langchain-notebook).
Elastic Cloud is a managed Elasticsearch service. Signup for a [free trial](https://cloud.elastic.co/registration?utm_source=langchain&utm_content=documentation).
### Install Client

View File

@@ -0,0 +1,92 @@
# Javelin AI Gateway
[The Javelin AI Gateway](https://www.getjavelin.io) service is a high-performance, enterprise-grade API Gateway for AI applications.
It is designed to streamline the usage and access of various large language model (LLM) providers,
such as OpenAI, Cohere, Anthropic and custom large language models within an organization by incorporating
robust access security for all interactions with LLMs.
Javelin offers a high-level interface that simplifies the interaction with LLMs by providing a unified endpoint
to handle specific LLM related requests.
See the Javelin AI Gateway [documentation](https://docs.getjavelin.io) for more details.
The [Javelin Python SDK](https://www.github.com/getjavelin/javelin-python) is an easy-to-use client library meant to be embedded into AI applications.
## Installation and Setup
Install `javelin_sdk` to interact with Javelin AI Gateway:
```sh
pip install 'javelin_sdk'
```
Set the Javelin API key as an environment variable:
```sh
export JAVELIN_API_KEY=...
```
## Completions Example
```python
from langchain.chains import LLMChain
from langchain.llms import JavelinAIGateway
from langchain.prompts import PromptTemplate
route_completions = "eng_dept03"
gateway = JavelinAIGateway(
gateway_uri="http://localhost:8000",
route=route_completions,
model_name="text-davinci-003",
)
prompt = PromptTemplate.from_template("Translate the following English text to French: {text}")

llmchain = LLMChain(llm=gateway, prompt=prompt)
result = llmchain.run("podcast player")
print(result)
```
## Embeddings Example
```python
from langchain.embeddings import JavelinAIGatewayEmbeddings
from langchain.embeddings.openai import OpenAIEmbeddings
embeddings = JavelinAIGatewayEmbeddings(
gateway_uri="http://localhost:8000",
route="embeddings",
)
print(embeddings.embed_query("hello"))
print(embeddings.embed_documents(["hello"]))
```
## Chat Example
```python
from langchain.chat_models import ChatJavelinAIGateway
from langchain.schema import HumanMessage, SystemMessage
messages = [
SystemMessage(
content="You are a helpful assistant that translates English to French."
),
HumanMessage(
content="Artificial Intelligence has the power to transform humanity and make the world a better place"
),
]
chat = ChatJavelinAIGateway(
gateway_uri="http://localhost:8000",
route="mychatbot_route",
model_name="gpt-3.5-turbo"
params={
"temperature": 0.1
}
)
print(chat(messages))
```

View File

@@ -0,0 +1,207 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "263f914c-9d67-4316-8b3d-03c3b99ba9d8",
"metadata": {},
"source": [
"Kay.ai\n",
"=\n",
"\n",
"> Data API built for RAG 🕵️ We are curating the world's largest datasets as high-quality embeddings so your AI agents can retrieve context on the fly. Latest models, fast retrieval, and zero infra.\n",
"\n",
"This notebook shows you how to retrieve datasets supported by [Kay](https://kay.ai/). You can currently search SEC Filings and Press Releases of US companies. Visit [kay.ai](https://kay.ai) for the latest data drops. For any questions, join our [discord](https://discord.gg/hAnE4e5T6M) or [tweet at us](https://twitter.com/vishalrohra_)"
]
},
{
"cell_type": "markdown",
"id": "fc507b8e-ea51-417c-93da-42bf998a1195",
"metadata": {},
"source": [
"Installation\n",
"=\n",
"\n",
"First you will need to install the [`kay` package](https://pypi.org/project/kay/). You will also need an API key: you can get one for free at [https://kay.ai](https://kay.ai/). Once you have an API key, you must set it as an environment variable `KAY_API_KEY`.\n",
"\n",
"`KayAiRetriever` has a static `.create()` factory method that takes the following arguments:\n",
"\n",
"* `dataset_id: string` required -- A Kay dataset id. This is a collection of data about a particular entity such as companies, people, or places. For example, try `\"company\"` \n",
"* `data_type: List[string]` optional -- This is a category within a dataset based on its origin or format, such as SEC Filings, Press Releases, or Reports within the “company” dataset. For example, try [\"10-K\", \"10-Q\", \"PressRelease\"] under the “company” dataset. If left empty, Kay will retrieve the most relevant context across all types.\n",
"* `num_contexts: int` optional, defaults to 6 -- The number of document chunks to retrieve on each call to `get_relevant_documents()`"
]
},
{
"cell_type": "markdown",
"id": "c923bea0-585a-4f62-8662-efc167e8d793",
"metadata": {},
"source": [
"Examples\n",
"=\n",
"\n",
"Basic Retriever Usage\n",
"-"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "f7b8c99c-0341-4f3c-912f-a11e98f7de71",
"metadata": {},
"outputs": [
{
"name": "stdin",
"output_type": "stream",
"text": [
" ········\n"
]
}
],
"source": [
"# Setup API key\n",
"from getpass import getpass\n",
"KAY_API_KEY = getpass()"
]
},
{
"cell_type": "code",
"execution_count": 20,
"id": "b4d4d386-2a6b-4942-863e-9202f5a9f1d6",
"metadata": {},
"outputs": [],
"source": [
"from langchain.retrievers import KayAiRetriever\n",
"import os\n",
"from kay.rag.retrievers import KayRetriever\n",
"os.environ[\"KAY_API_KEY\"] = KAY_API_KEY\n",
"retriever = KayAiRetriever.create(dataset_id=\"company\", data_types=[\"10-K\", \"10-Q\", \"PressRelease\"], num_contexts=3)\n",
"docs = retriever.get_relevant_documents(\"What were the biggest strategy changes and partnerships made by Roku in 2023??\")"
]
},
{
"cell_type": "code",
"execution_count": 21,
"id": "04ee2d6b-c2ab-4e15-8a8b-afaf6ef8c0f6",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[Document(page_content='Company Name: ROKU INC\\nCompany Industry: CABLE & OTHER PAY TELEVISION SERVICES\\nArticle Title: Roku and FreeWheel Announce Strategic Partnership to Bring Rokus Leading Ad Tech to FreeWheel Customers\\nText: Additionally, eMarketer Link: https://cts.businesswire.com/ct/CT?id=smartlink&url=https%3A%2F%2Fwww.insiderintelligence.com%2Finsights%2Favod-more-than-50-percent-of-us-digital-video-viewers%2F&esheet=53451144&newsitemid=20230712907788&lan=en-US&anchor=eMarketer&index=4&md5=b64dea72bcf6b6379474462602781d83 projects 57% of U.S. digital video users will stream an advertising-based video on demand (AVOD) service this year.\\nHaving solutions aimed at driving greater interoperability and automation will help accelerate this growth.\\nKey highlights of this collaboration include:\\nStreamlined Integration: Roku has now integrated its demand application programming interface (dAPI) with FreeWheel s TV platform. Roku s demand API gives publishers direct, automatic and real-time access to more advertiser demand. This enhanced integration allows for streamlined ad operation workflows and better inventory quality control, both of which will improve publisher yield and revenue.\\nSeamless Data Targeting: Publishers can now use Roku platform signals to enable advertisers to target audiences and measure campaign performance without relying on cookies. Additionally, FreeWheel and Roku will rely on data clean room technology to enable the activation of additional data sets providing better measurement and monetization to publishers and agencies.', metadata={'_additional': {'id': '962b79e0-f9d1-43ae-9f7a-8a9b42bc7a9a'}, 'chunk_type': 'text', 'chunk_years_mentioned': [], 'company_name': 'ROKU INC', 'company_sic_code_description': 'CABLE & OTHER PAY TELEVISION SERVICES', 'data_source': 'PressRelease', 'data_source_link': 'https://www.nasdaq.com/press-release/roku-and-freewheel-announce-strategic-partnership-to-bring-rokus-leading-ad-tech-to', 'data_source_publish_date': '2023-07-12T00:00:00Z', 'data_source_uid': 'a46f309c-705d-3946-96db-87aa4e73261f', 'title': 'ROKU INC | Roku and FreeWheel Announce Strategic Partnership to Bring Rokus Leading Ad Tech to FreeWheel Customers'}),\n",
" Document(page_content='Company Name: ROKU INC \\n Company Industry: CABLE & OTHER PAY TELEVISION SERVICES \\n Form Title: 10-K 2022-FY \\n Form Section: Risk Factors \\n Text: nd the Note Regarding Forward Looking Statements.This section of this Annual Report generally discusses fiscal years 2022 and 2021 and year to year comparisons between those years.Discussions of fiscal year 2020 and year to year comparisons between fiscal years 2021 and 2020 that are not included in this Annual Report can be found in Management\\'s Discussion and Analysis of Financial Condition and Results of Operations in Part II, Item 7 of our Annual Report for the fiscal year ended December 31, 2021 filed with the SEC on February 18, 2022.Overview Effective as of the fourth quarter of fiscal 2022, we reorganized our reportable segments to better align with management\\'s reporting of information reviewed by the Chief Operating Decision Maker (\"CODM\") for each segment.We renamed our \"player\" segment to \"devices\" which now includes our licensing arrangements with service operators and licensed Roku TV partners in addition to sales of our streaming players, audio products, smart home products and Roku branded TVs that will be designed, made, and sold by us in 2023.Our historical segment information is recast to conform to our new presentation in our financial statements and accompanying notes included in Item 8 of this Annual Report.Our two reportable segments are the platform segment and the devices segment.', metadata={'_additional': {'id': 'a76c5fed-5d63-45a7-b63a-2c30e05140fc'}, 'chunk_type': 'text', 'chunk_years_mentioned': [2020, 2021, 2022, 2023], 'company_name': 'ROKU INC', 'company_sic_code_description': 'CABLE & OTHER PAY TELEVISION SERVICES', 'data_source': '10-K', 'data_source_link': 'https://www.sec.gov/Archives/edgar/data/1428439/000142843923000007', 'data_source_publish_date': '2022-01-01T00:00:00Z', 'data_source_uid': '0001428439-23-000007', 'title': 'ROKU INC | 10-K 2022-FY '}),\n",
" Document(page_content='Company Name: ROKU INC \\n Company Industry: CABLE & OTHER PAY TELEVISION SERVICES \\n Form Title: 10-Q 2023-Q1 \\n Form Section: Risk Factors \\n Text: Our current and potential partners include TV brands, cable and satellite companies, and telecommunication providers.Under these license arrangements, we generally have limited or no control over the amount and timing of resources these entities dedicate to the relationship.In the past, our licensed Roku TV partners have failed to meet their forecasts and anticipated market launch dates for distributing Roku TV models, and they may fail to meet their forecasts or such launches in the future.If our licensed Roku TV partners or service operator partners fail to meet their forecasts or such launches for distributing licensed streaming devices or choose to deploy competing streaming solutions within their product lines, our business may be harmed.We depend on a small number of content publishers for a majority of our streaming hours, and if we fail to maintain these relationships, our business could be harmed.*Historically, a small number of content publishers have accounted for a significant portion of the hours streamed on our platform.In the three months ended March 31, 2023, the top three streaming services represented over 50% of all hours streamed in the period.If, for any reason, we cease distributing channels that have historically streamed a large percentage of the aggregate streaming hours on our platform, our streaming hours, our active accounts, or Roku streaming device sales may be adversely affected, and our business may be harmed.', metadata={'_additional': {'id': '2a92b2bb-02a0-4e15-8b64-d7e04078a205'}, 'chunk_type': 'text', 'chunk_years_mentioned': [2023], 'company_name': 'ROKU INC', 'company_sic_code_description': 'CABLE & OTHER PAY TELEVISION SERVICES', 'data_source': '10-Q', 'data_source_link': 'https://www.sec.gov/Archives/edgar/data/1428439/000142843923000017', 'data_source_publish_date': '2023-01-01T00:00:00Z', 'data_source_uid': '0001428439-23-000017', 'title': 'ROKU INC | 10-Q 2023-Q1 '})]"
]
},
"execution_count": 21,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"docs"
]
},
{
"cell_type": "markdown",
"id": "21f6e9e5-478c-4b2c-9d61-f7a84f4d2f8f",
"metadata": {},
"source": [
"Usage in a chain\n",
"-"
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "d1cba716-ab8d-4518-9196-43f17eb189dc",
"metadata": {},
"outputs": [
{
"name": "stdin",
"output_type": "stream",
"text": [
" ········\n"
]
}
],
"source": [
"OPENAI_API_KEY = getpass()"
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "79441f1f-fa06-452c-bcd6-160ad0debc6a",
"metadata": {},
"outputs": [],
"source": [
"os.environ[\"OPENAI_API_KEY\"] = OPENAI_API_KEY"
]
},
{
"cell_type": "code",
"execution_count": 16,
"id": "0c504bcd-f6e0-4028-a797-b31fb4b6d027",
"metadata": {},
"outputs": [],
"source": [
"from langchain.chat_models import ChatOpenAI\n",
"from langchain.chains import ConversationalRetrievalChain\n",
"\n",
"model = ChatOpenAI(model_name=\"gpt-3.5-turbo\")\n",
"qa = ConversationalRetrievalChain.from_llm(model, retriever=retriever)"
]
},
{
"cell_type": "code",
"execution_count": 22,
"id": "977f158b-38d3-4b5f-9379-7cdd09436327",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"-> **Question**: What were the biggest strategy changes and partnerships made by Roku in 2023? \n",
"\n",
"**Answer**: In 2023, Roku made a strategic partnership with FreeWheel to bring Roku's leading ad tech to FreeWheel customers. This partnership aimed to drive greater interoperability and automation in the advertising-based video on demand (AVOD) space. Key highlights of this collaboration include streamlined integration of Roku's demand application programming interface (dAPI) with FreeWheel's TV platform, allowing for better inventory quality control and improved publisher yield and revenue. Additionally, publishers can now use Roku platform signals to enable advertisers to target audiences and measure campaign performance without relying on cookies. This partnership also involves the use of data clean room technology to enable the activation of additional data sets for better measurement and monetization for publishers and agencies. These partnerships and strategies aim to support Roku's growth in the AVOD market. \n",
"\n"
]
}
],
"source": [
"questions = [\n",
" \"What were the biggest strategy changes and partnerships made by Roku in 2023?\"\n",
" # \"Where is Wex making the most money in 2023?\",\n",
"]\n",
"chat_history = []\n",
"\n",
"for question in questions:\n",
" result = qa({\"question\": question, \"chat_history\": chat_history})\n",
" chat_history.append((question, result[\"answer\"]))\n",
" print(f\"-> **Question**: {question} \\n\")\n",
" print(f\"**Answer**: {result['answer']} \\n\")"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.18"
}
},
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -81,7 +81,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.12"
"version": "3.9.18"
}
},
"nbformat": 4,

View File

@@ -0,0 +1,165 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "263f914c-9d67-4316-8b3d-03c3b99ba9d8",
"metadata": {},
"source": [
"SEC filings data\n",
"=\n",
"\n",
"SEC filings data powered by [Kay.ai](https://kay.ai) and [Cybersyn](https://www.cybersyn.com/).\n",
"\n",
">The SEC filing is a financial statement or other formal document submitted to the U.S. Securities and Exchange Commission (SEC). Public companies, certain insiders, and broker-dealers are required to make regular SEC filings. Investors and financial professionals rely on these filings for information about companies they are evaluating for investment purposes."
]
},
{
"cell_type": "markdown",
"id": "fc507b8e-ea51-417c-93da-42bf998a1195",
"metadata": {},
"source": [
"Setup\n",
"=\n",
"\n",
"First you will need to install the `kay` package. You will also need an API key: you can get one for free at [https://kay.ai](https://kay.ai/). Once you have an API key, you must set it as an environment variable `KAY_API_KEY`.\n",
"\n",
"In this example we're going to use the `KayAiRetriever`. Take a look at the [kay notebook](/docs/integrations/retrievers/kay) for more detailed information for the parmeters that it accepts.`"
]
},
{
"cell_type": "markdown",
"id": "c923bea0-585a-4f62-8662-efc167e8d793",
"metadata": {},
"source": [
"Examples\n",
"=\n",
"\n"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "f7b8c99c-0341-4f3c-912f-a11e98f7de71",
"metadata": {},
"outputs": [
{
"name": "stdin",
"output_type": "stream",
"text": [
" ········\n",
" ········\n"
]
}
],
"source": [
"# Setup API keys for Kay and OpenAI\n",
"from getpass import getpass\n",
"KAY_API_KEY = getpass()\n",
"OPENAI_API_KEY = getpass()"
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "04ee2d6b-c2ab-4e15-8a8b-afaf6ef8c0f6",
"metadata": {},
"outputs": [],
"source": [
"import os\n",
"os.environ[\"KAY_API_KEY\"] = KAY_API_KEY\n",
"os.environ[\"OPENAI_API_KEY\"] = OPENAI_API_KEY"
]
},
{
"cell_type": "code",
"execution_count": 7,
"id": "0c504bcd-f6e0-4028-a797-b31fb4b6d027",
"metadata": {},
"outputs": [],
"source": [
"from langchain.chains import ConversationalRetrievalChain\n",
"from langchain.chat_models import ChatOpenAI\n",
"from langchain.retrievers import KayAiRetriever\n",
"\n",
"model = ChatOpenAI(model_name=\"gpt-3.5-turbo\")\n",
"retriever = KayAiRetriever.create(dataset_id=\"company\", data_types=[\"10-K\", \"10-Q\"], num_contexts=6)\n",
"qa = ConversationalRetrievalChain.from_llm(model, retriever=retriever)"
]
},
{
"cell_type": "code",
"execution_count": 11,
"id": "977f158b-38d3-4b5f-9379-7cdd09436327",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"-> **Question**: What are patterns in Nvidia's spend over the past three quarters? \n",
"\n",
"**Answer**: Based on the provided information, here are the patterns in NVIDIA's spend over the past three quarters:\n",
"\n",
"1. Research and Development Expenses:\n",
" - Q3 2022: Increased by 34% compared to Q3 2021.\n",
" - Q1 2023: Increased by 40% compared to Q1 2022.\n",
" - Q2 2022: Increased by 25% compared to Q2 2021.\n",
" \n",
" Overall, research and development expenses have been consistently increasing over the past three quarters.\n",
"\n",
"2. Sales, General and Administrative Expenses:\n",
" - Q3 2022: Increased by 8% compared to Q3 2021.\n",
" - Q1 2023: Increased by 14% compared to Q1 2022.\n",
" - Q2 2022: Decreased by 16% compared to Q2 2021.\n",
" \n",
" The pattern for sales, general and administrative expenses is not as consistent, with some quarters showing an increase and others showing a decrease.\n",
"\n",
"3. Total Operating Expenses:\n",
" - Q3 2022: Increased by 25% compared to Q3 2021.\n",
" - Q1 2023: Increased by 113% compared to Q1 2022.\n",
" - Q2 2022: Increased by 9% compared to Q2 2021.\n",
" \n",
" Total operating expenses have generally been increasing over the past three quarters, with a significant increase in Q1 2023.\n",
"\n",
"Overall, the pattern indicates a consistent increase in research and development expenses and total operating expenses, while sales, general and administrative expenses show some fluctuations. \n",
"\n"
]
}
],
"source": [
"questions = [\n",
" \"What are patterns in Nvidia's spend over the past three quarters?\",\n",
" #\"What are some recent challenges faced by the renewable energy sector?\",\n",
"]\n",
"chat_history = []\n",
"\n",
"for question in questions:\n",
" result = qa({\"question\": question, \"chat_history\": chat_history})\n",
" chat_history.append((question, result[\"answer\"]))\n",
" print(f\"-> **Question**: {question} \\n\")\n",
" print(f\"**Answer**: {result['answer']} \\n\")"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.18"
}
},
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -0,0 +1,150 @@
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# Gradient\n",
"\n",
"`Gradient` allows to create `Embeddings` as well fine tune and get completions on LLMs with a simple web API.\n",
"\n",
"This notebook goes over how to use Langchain with Embeddings of [Gradient](https://gradient.ai/).\n"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Imports"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"from langchain.embeddings import GradientEmbeddings"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Set the Environment API Key\n",
"Make sure to get your API key from Gradient AI. You are given $10 in free credits to test and fine-tune different models."
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"from getpass import getpass\n",
"import os\n",
"\n",
"if not os.environ.get(\"GRADIENT_ACCESS_TOKEN\",None):\n",
" # Access token under https://auth.gradient.ai/select-workspace\n",
" os.environ[\"GRADIENT_ACCESS_TOKEN\"] = getpass(\"gradient.ai access token:\")\n",
"if not os.environ.get(\"GRADIENT_WORKSPACE_ID\",None):\n",
" # `ID` listed in `$ gradient workspace list`\n",
" # also displayed after login at at https://auth.gradient.ai/select-workspace\n",
" os.environ[\"GRADIENT_WORKSPACE_ID\"] = getpass(\"gradient.ai workspace id:\")"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Optional: Validate your Enviroment variables ```GRADIENT_ACCESS_TOKEN``` and ```GRADIENT_WORKSPACE_ID``` to get currently deployed models. Using the `gradientai` Python package."
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"!pip install gradientai"
]
},
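{
"cell_type": "markdown",
"metadata": {},
"source": [
"The following is a minimal validation sketch. The `Gradient` client and its `list_models` method are assumptions based on the `gradientai` SDK's README; check the package documentation if your installed version differs."
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"from gradientai import Gradient\n",
"\n",
"# The client reads GRADIENT_ACCESS_TOKEN and GRADIENT_WORKSPACE_ID from the environment.\n",
"gradient = Gradient()\n",
"\n",
"# `list_models` / `only_base` are assumed SDK names; adjust to your installed version.\n",
"models = gradient.list_models(only_base=True)\n",
"print([model.id for model in models])"
]
},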
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Create the Gradient instance"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"documents = [\"Pizza is a dish.\",\"Paris is the capital of France\", \"numpy is a lib for linear algebra\"]\n",
"query = \"Where is Paris?\""
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"embeddings = GradientEmbeddings(\n",
" model=\"bge-large\"\n",
")\n",
"\n",
"documents_embedded = embeddings.embed_documents(documents)\n",
"query_result = embeddings.embed_query(query)\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"# (demo) compute similarity\n",
"import numpy as np\n",
"\n",
"scores = np.array(documents_embedded) @ np.array(query_result).T\n",
"dict(zip(documents, scores))"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.6"
},
"vscode": {
"interpreter": {
"hash": "a0a0263b650d907a3bfe41c0f8d6a63a071b884df3cfdc1579f00cdc1aed6b03"
}
}
},
"nbformat": 4,
"nbformat_minor": 4
}

View File

@@ -0,0 +1,133 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "278b6c63",
"metadata": {},
"source": [
"# LLMRails\n",
"\n",
"Let's load the LLMRails Embeddings class.\n",
"\n",
"To use LLMRails embedding you need to pass api key by argument or set it in environment with `LLM_RAILS_API_KEY` key.\n",
"To gey API Key you need to sign up in https://console.llmrails.com/signup and then go to https://console.llmrails.com/api-keys and copy key from there after creating one key in platform."
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "0be1af71",
"metadata": {},
"outputs": [],
"source": [
"from langchain.embeddings import LLMRailsEmbeddings"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "2c66e5da",
"metadata": {},
"outputs": [],
"source": [
"embeddings = LLMRailsEmbeddings(model='embedding-english-v1') # or embedding-multi-v1"
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "01370375",
"metadata": {},
"outputs": [],
"source": [
"text = \"This is a test document.\""
]
},
{
"cell_type": "markdown",
"id": "a42e4035",
"metadata": {},
"source": [
"To generate embeddings, you can either query an invidivual text, or you can query a list of texts."
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "91bc875d-829b-4c3d-8e6f-fc2dda30a3bd",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[-0.09996652603149414,\n",
" 0.015568195842206478,\n",
" 0.17670190334320068,\n",
" 0.16521021723747253,\n",
" 0.21193109452724457]"
]
},
"execution_count": 4,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"query_result = embeddings.embed_query(text)\n",
"query_result[:5]"
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "a4b0d49e-0c73-44b6-aed5-5b426564e085",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[-0.04242777079343796,\n",
" 0.016536075621843338,\n",
" 0.10052520781755447,\n",
" 0.18272875249385834,\n",
" 0.2079043835401535]"
]
},
"execution_count": 6,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"doc_result = embeddings.embed_documents([text])\n",
"doc_result[0][:5]"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.5"
},
"vscode": {
"interpreter": {
"hash": "e971737741ff4ec9aff7dc6155a1060a59a8a6d52c757dbbe66bf8ee389494b1"
}
}
},
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -44,7 +44,7 @@
"source": [
"There are two main ways to setup an Elasticsearch instance for use with:\n",
"\n",
"1. Elastic Cloud: Elastic Cloud is a managed Elasticsearch service. Signup for a [free trial](https://cloud.elastic.co/registration?storm=langchain-notebook).\n",
"1. Elastic Cloud: Elastic Cloud is a managed Elasticsearch service. Signup for a [free trial](https://cloud.elastic.co/registration?utm_source=langchain&utm_content=documentation).\n",
"\n",
"To connect to an Elasticsearch instance that does not require\n",
"login credentials (starting the docker instance with security enabled), pass the Elasticsearch URL and index name along with the\n",
@@ -662,7 +662,7 @@
"id": "0960fa0a",
"metadata": {},
"source": [
"# Customise the Query\n",
"## Customise the Query\n",
"With `custom_query` parameter at search, you are able to adjust the query that is used to retrieve documents from Elasticsearch. This is useful if you want to want to use a more complex query, to support linear boosting of fields."
]
},
@@ -720,6 +720,35 @@
"print(results[0])"
]
},
{
"cell_type": "markdown",
"id": "3242fd42",
"metadata": {},
"source": [
"# FAQ\n",
"\n",
"## Question: Im getting timeout errors when indexing documents into Elasticsearch. How do I fix this?\n",
"One possible issue is your documents might take longer to index into Elasticsearch. ElasticsearchStore uses the Elasticsearch bulk API which has a few defaults that you can adjust to reduce the chance of timeout errors.\n",
"\n",
"This is also a good idea when you're using SparseVectorRetrievalStrategy.\n",
"\n",
"The defaults are:\n",
"- `chunk_size`: 500\n",
"- `max_chunk_bytes`: 100MB\n",
"\n",
"To adjust these, you can pass in the `chunk_size` and `max_chunk_bytes` parameters to the ElasticsearchStore `add_texts` method.\n",
"\n",
"```python\n",
" vector_store.add_texts(\n",
" texts,\n",
" bulk_kwargs={\n",
" \"chunk_size\": 50,\n",
" \"max_chunk_bytes\": 200000000\n",
" }\n",
" )\n",
"```"
]
},
{
"cell_type": "markdown",
"id": "604c66ea",

View File

@@ -92,7 +92,7 @@
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 15,
"id": "19846a7b-99bc-47a7-8e1c-f13c2497f1ae",
"metadata": {},
"outputs": [],
@@ -105,7 +105,7 @@
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 16,
"id": "c71c3901-d44b-4d09-92c5-3018628c28fa",
"metadata": {},
"outputs": [],
@@ -115,7 +115,7 @@
},
{
"cell_type": "code",
"execution_count": null,
"execution_count": 17,
"id": "8b91ecfa-f61b-489a-a337-dff1f12f6ab2",
"metadata": {},
"outputs": [],
@@ -138,51 +138,66 @@
"load_dotenv()"
]
},
{
"cell_type": "markdown",
"id": "924d4df5",
"metadata": {},
"source": [
"First we'll create a Supabase client and instantiate a OpenAI embeddings class."
]
},
{
"cell_type": "code",
"execution_count": 3,
"execution_count": 19,
"id": "5ce44f7c",
"metadata": {},
"outputs": [],
"source": [
"import os\n",
"from supabase.client import Client, create_client\n",
"from langchain.embeddings.openai import OpenAIEmbeddings\n",
"from langchain.vectorstores import SupabaseVectorStore\n",
"\n",
"supabase_url = os.environ.get(\"SUPABASE_URL\")\n",
"supabase_key = os.environ.get(\"SUPABASE_SERVICE_KEY\")\n",
"supabase: Client = create_client(supabase_url, supabase_key)"
"supabase: Client = create_client(supabase_url, supabase_key)\n",
"\n",
"embeddings = OpenAIEmbeddings()"
]
},
{
"cell_type": "markdown",
"id": "0c707d4c",
"metadata": {},
"source": [
"Next we'll load and parse some data for our vector store (skip if you already have documents with embeddings stored in your DB)."
]
},
{
"cell_type": "code",
"execution_count": 3,
"execution_count": 20,
"id": "aac9563e",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"from langchain.embeddings.openai import OpenAIEmbeddings\n",
"\n",
"from langchain.text_splitter import CharacterTextSplitter\n",
"from langchain.vectorstores import SupabaseVectorStore\n",
"from langchain.document_loaders import TextLoader"
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "a3c3999a",
"metadata": {},
"outputs": [],
"source": [
"from langchain.document_loaders import TextLoader\n",
"\n",
"loader = TextLoader(\"../../../state_of_the_union.txt\")\n",
"documents = loader.load()\n",
"text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0)\n",
"docs = text_splitter.split_documents(documents)\n",
"\n",
"embeddings = OpenAIEmbeddings()"
"docs = text_splitter.split_documents(documents)"
]
},
{
"cell_type": "markdown",
"id": "5abb9b93",
"metadata": {},
"source": [
"Insert the above documents into the database. Embeddings will automatically be generated for each document."
]
},
{
@@ -192,13 +207,39 @@
"metadata": {},
"outputs": [],
"source": [
"# We're using the default `documents` table here. You can modify this by passing in a `table_name` argument to the `from_documents` method.\n",
"vector_store = SupabaseVectorStore.from_documents(docs, embeddings, client=supabase)"
"\n",
"vector_store = SupabaseVectorStore.from_documents(docs, embeddings, client=supabase, table_name=\"documents\", query_name=\"match_documents\")"
]
},
{
"cell_type": "markdown",
"id": "e169345d",
"metadata": {},
"source": [
"Alternatively if you already have documents with embeddings in your database, simply instantiate a new `SupabaseVectorStore` directly:"
]
},
{
"cell_type": "code",
"execution_count": 7,
"execution_count": 10,
"id": "397e3e7d",
"metadata": {},
"outputs": [],
"source": [
"vector_store = SupabaseVectorStore(embedding=embeddings, client=supabase, table_name=\"documents\", query_name=\"match_documents\")"
]
},
{
"cell_type": "markdown",
"id": "e28ce092",
"metadata": {},
"source": [
"Finally, test it out by performing a similarity search:"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "5eabdb75",
"metadata": {},
"outputs": [],
@@ -209,7 +250,7 @@
},
{
"cell_type": "code",
"execution_count": 8,
"execution_count": null,
"id": "4b172de8",
"metadata": {},
"outputs": [
@@ -431,7 +472,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.12"
"version": "3.11.5"
}
},
"nbformat": 4,

File diff suppressed because it is too large

View File

@@ -0,0 +1,561 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "69014601",
"metadata": {},
"source": [
"# Conversational\n",
"\n",
"This walkthrough demonstrates how to use an agent optimized for conversation. Other agents are often optimized for using tools to figure out the best response, which is not ideal in a conversational setting where you may want the agent to be able to chat with the user as well.\n",
"\n",
"If we compare it to the standard ReAct agent, the main difference is the prompt.\n",
"We want it to be much more conversational."
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "cc3fad9e",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import Tool\n",
"from langchain.agents import AgentType\n",
"from langchain.memory import ConversationBufferMemory\n",
"from langchain.llms import OpenAI\n",
"from langchain.utilities import SerpAPIWrapper\n",
"from langchain.agents import initialize_agent"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "2d84b9bc",
"metadata": {},
"outputs": [],
"source": [
"search = SerpAPIWrapper()\n",
"tools = [\n",
" Tool(\n",
" name = \"Current Search\",\n",
" func=search.run,\n",
" description=\"useful for when you need to answer questions about current events or the current state of the world\"\n",
" ),\n",
"]"
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "799a31bf",
"metadata": {},
"outputs": [],
"source": [
"llm=OpenAI(temperature=0)"
]
},
{
"cell_type": "markdown",
"id": "f9d11cb6",
"metadata": {},
"source": [
"## Using LCEL\n",
"\n",
"We will first show how to create this agent using LCEL"
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "03c09ef9",
"metadata": {},
"outputs": [],
"source": [
"from langchain.tools.render import render_text_description\n",
"from langchain.agents.output_parsers import ReActSingleInputOutputParser\n",
"from langchain.agents.format_scratchpad import format_log_to_str\n",
"from langchain import hub"
]
},
{
"cell_type": "code",
"execution_count": 28,
"id": "6bd84102",
"metadata": {},
"outputs": [],
"source": [
"prompt = hub.pull(\"hwchase17/react-chat\")"
]
},
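{
"cell_type": "markdown",
"id": "1f8a2c3d",
"metadata": {},
"source": [
"To see what makes this prompt conversational, print it. A small sketch, assuming the pulled object is a string-based `PromptTemplate` with a `template` attribute:"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "9c2e4b5f",
"metadata": {},
"outputs": [],
"source": [
"# Inspect the pulled prompt; `template` is assumed to exist on the\n",
"# returned PromptTemplate (adjust if the hub returns another type).\n",
"print(prompt.template)"
]
},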
{
"cell_type": "code",
"execution_count": 29,
"id": "7ccc785d",
"metadata": {},
"outputs": [],
"source": [
"prompt = prompt.partial(\n",
" tools=render_text_description(tools),\n",
" tool_names=\", \".join([t.name for t in tools]),\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 9,
"id": "d7aac2b0",
"metadata": {},
"outputs": [],
"source": [
"llm_with_stop = llm.bind(stop=[\"\\nObservation\"])"
]
},
{
"cell_type": "code",
"execution_count": 15,
"id": "a028bca6",
"metadata": {},
"outputs": [],
"source": [
"agent = {\n",
" \"input\": lambda x: x[\"input\"],\n",
" \"agent_scratchpad\": lambda x: format_log_to_str(x['intermediate_steps']),\n",
" \"chat_history\": lambda x: x[\"chat_history\"]\n",
"} | prompt | llm_with_stop | ReActSingleInputOutputParser()"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "0b354cfe",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import AgentExecutor"
]
},
{
"cell_type": "code",
"execution_count": 23,
"id": "9b044ae9",
"metadata": {},
"outputs": [],
"source": [
"memory = ConversationBufferMemory(memory_key=\"chat_history\")\n",
"agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True, memory=memory)"
]
},
{
"cell_type": "code",
"execution_count": 24,
"id": "adcdd0c7",
"metadata": {
"scrolled": true
},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m\n",
"Thought: Do I need to use a tool? No\n",
"Final Answer: Hi Bob, nice to meet you! How can I help you today?\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n"
]
},
{
"data": {
"text/plain": [
"'Hi Bob, nice to meet you! How can I help you today?'"
]
},
"execution_count": 24,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"agent_executor.invoke({\"input\": \"hi, i am bob\"})['output']"
]
},
{
"cell_type": "code",
"execution_count": 25,
"id": "c5846cd1",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m\n",
"Thought: Do I need to use a tool? No\n",
"Final Answer: Your name is Bob.\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n"
]
},
{
"data": {
"text/plain": [
"'Your name is Bob.'"
]
},
"execution_count": 25,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"agent_executor.invoke({\"input\": \"whats my name?\"})['output']"
]
},
{
"cell_type": "code",
"execution_count": 26,
"id": "95a1192a",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m\n",
"Thought: Do I need to use a tool? Yes\n",
"Action: Current Search\n",
"Action Input: Movies showing 9/21/2023\u001b[0m\u001b[36;1m\u001b[1;3m['September 2023 Movies: The Creator • Dumb Money • Expend4bles • The Kill Room • The Inventor • The Equalizer 3 • PAW Patrol: The Mighty Movie, ...']\u001b[0m\u001b[32;1m\u001b[1;3m Do I need to use a tool? No\n",
"Final Answer: According to current search, some movies showing on 9/21/2023 are The Creator, Dumb Money, Expend4bles, The Kill Room, The Inventor, The Equalizer 3, and PAW Patrol: The Mighty Movie.\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n"
]
},
{
"data": {
"text/plain": [
"'According to current search, some movies showing on 9/21/2023 are The Creator, Dumb Money, Expend4bles, The Kill Room, The Inventor, The Equalizer 3, and PAW Patrol: The Mighty Movie.'"
]
},
"execution_count": 26,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"agent_executor.invoke({\"input\": \"what are some movies showing 9/21/2023?\"})['output']"
]
},
{
"cell_type": "markdown",
"id": "c0b2d86d",
"metadata": {},
"source": [
"## Use the off-the-shelf agent\n",
"\n",
"We can also create this agent using the off-the-shelf agent class"
]
},
{
"cell_type": "code",
"execution_count": 27,
"id": "53e43064",
"metadata": {},
"outputs": [],
"source": [
"agent_executor = initialize_agent(tools, llm, agent=AgentType.CONVERSATIONAL_REACT_DESCRIPTION, verbose=True, memory=memory)"
]
},
{
"cell_type": "markdown",
"id": "68e45a24",
"metadata": {},
"source": [
"## Use a chat model\n",
"\n",
"We can also use a chat model here. The main difference here is in the prompts used."
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "a5a705b2",
"metadata": {},
"outputs": [],
"source": [
"from langchain.chat_models import ChatOpenAI\n",
"from langchain import hub"
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "16b17ca8",
"metadata": {},
"outputs": [],
"source": [
"prompt = hub.pull(\"hwchase17/react-chat-json\")\n",
"chat_model = ChatOpenAI(temperature=0, model='gpt-4')"
]
},
{
"cell_type": "code",
"execution_count": 24,
"id": "c8a94b0b",
"metadata": {},
"outputs": [],
"source": [
"prompt = prompt.partial(\n",
" tools=render_text_description(tools),\n",
" tool_names=\", \".join([t.name for t in tools]),\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 25,
"id": "c5d710f2",
"metadata": {},
"outputs": [],
"source": [
"chat_model_with_stop = chat_model.bind(stop=[\"\\nObservation\"])"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "f50a5ea8",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents.output_parsers import JSONAgentOutputParser\n",
"from langchain.agents.format_scratchpad import format_log_to_messages"
]
},
{
"cell_type": "code",
"execution_count": 26,
"id": "2c845796",
"metadata": {},
"outputs": [],
"source": [
"# We need some extra steering, or the chat model forgets how to respond sometimes\n",
"TEMPLATE_TOOL_RESPONSE = \"\"\"TOOL RESPONSE: \n",
"---------------------\n",
"{observation}\n",
"\n",
"USER'S INPUT\n",
"--------------------\n",
"\n",
"Okay, so what is the response to my last comment? If using information obtained from the tools you must mention it explicitly without mentioning the tool names - I have forgotten all TOOL RESPONSES! Remember to respond with a markdown code snippet of a json blob with a single action, and NOTHING else - even if you just want to respond to the user. Do NOT respond with anything except a JSON snippet no matter what!\"\"\"\n",
"\n",
"agent = {\n",
" \"input\": lambda x: x[\"input\"],\n",
" \"agent_scratchpad\": lambda x: format_log_to_messages(x['intermediate_steps'], template_tool_response=TEMPLATE_TOOL_RESPONSE),\n",
" \"chat_history\": lambda x: x[\"chat_history\"],\n",
"} | prompt | chat_model_with_stop | JSONAgentOutputParser()"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "6cc033fc",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import AgentExecutor"
]
},
{
"cell_type": "code",
"execution_count": 27,
"id": "332ba2ff",
"metadata": {},
"outputs": [],
"source": [
"memory = ConversationBufferMemory(memory_key=\"chat_history\", return_messages=True)\n",
"agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True, memory=memory)"
]
},
{
"cell_type": "code",
"execution_count": 28,
"id": "139717b4",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m```json\n",
"{\n",
" \"action\": \"Final Answer\",\n",
" \"action_input\": \"Hello Bob, how can I assist you today?\"\n",
"}\n",
"```\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n"
]
},
{
"data": {
"text/plain": [
"'Hello Bob, how can I assist you today?'"
]
},
"execution_count": 28,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"agent_executor.invoke({\"input\": \"hi, i am bob\"})['output']"
]
},
{
"cell_type": "code",
"execution_count": 29,
"id": "7e7cf6d3",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m```json\n",
"{\n",
" \"action\": \"Final Answer\",\n",
" \"action_input\": \"Your name is Bob.\"\n",
"}\n",
"```\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n"
]
},
{
"data": {
"text/plain": [
"'Your name is Bob.'"
]
},
"execution_count": 29,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"agent_executor.invoke({\"input\": \"whats my name?\"})['output']"
]
},
{
"cell_type": "code",
"execution_count": 30,
"id": "3fc00073",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m```json\n",
"{\n",
" \"action\": \"Current Search\",\n",
" \"action_input\": \"movies showing on 9/21/2023\"\n",
"}\n",
"```\u001b[0m\u001b[36;1m\u001b[1;3m['September 2023 Movies: The Creator • Dumb Money • Expend4bles • The Kill Room • The Inventor • The Equalizer 3 • PAW Patrol: The Mighty Movie, ...']\u001b[0m\u001b[32;1m\u001b[1;3m```json\n",
"{\n",
" \"action\": \"Final Answer\",\n",
" \"action_input\": \"Some movies that are showing on 9/21/2023 include 'The Creator', 'Dumb Money', 'Expend4bles', 'The Kill Room', 'The Inventor', 'The Equalizer 3', and 'PAW Patrol: The Mighty Movie'.\"\n",
"}\n",
"```\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n"
]
},
{
"data": {
"text/plain": [
"\"Some movies that are showing on 9/21/2023 include 'The Creator', 'Dumb Money', 'Expend4bles', 'The Kill Room', 'The Inventor', 'The Equalizer 3', and 'PAW Patrol: The Mighty Movie'.\""
]
},
"execution_count": 30,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"agent_executor.invoke({\"input\": \"what are some movies showing 9/21/2023?\"})['output']"
]
},
{
"cell_type": "markdown",
"id": "8d464ead",
"metadata": {},
"source": [
"We can also initialize the agent executor with a predefined agent type"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "141f2469",
"metadata": {},
"outputs": [],
"source": [
"from langchain.memory import ConversationBufferMemory\n",
"from langchain.chat_models import ChatOpenAI"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "734d1b21",
"metadata": {},
"outputs": [],
"source": [
"memory = ConversationBufferMemory(memory_key=\"chat_history\", return_messages=True)\n",
"llm = ChatOpenAI(openai_api_key=OPENAI_API_KEY, temperature=0)\n",
"agent_chain = initialize_agent(tools, llm, agent=AgentType.CHAT_CONVERSATIONAL_REACT_DESCRIPTION, verbose=True, memory=memory)"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.1"
}
},
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -0,0 +1,295 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "e10aa932",
"metadata": {},
"source": [
"# OpenAI functions\n",
"\n",
"Certain OpenAI models (like gpt-3.5-turbo-0613 and gpt-4-0613) have been fine-tuned to detect when a function should be called and respond with the inputs that should be passed to the function. In an API call, you can describe functions and have the model intelligently choose to output a JSON object containing arguments to call those functions. The goal of the OpenAI Function APIs is to more reliably return valid and useful function calls than a generic text completion or chat API.\n",
"\n",
"The OpenAI Functions Agent is designed to work with these models.\n",
"\n",
"Install `openai`, `google-search-results` packages which are required as the LangChain packages call them internally."
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "ec89be68",
"metadata": {},
"outputs": [],
"source": [
"! pip install openai google-search-results"
]
},
{
"cell_type": "markdown",
"id": "82787d8d",
"metadata": {},
"source": [
"## Initialize tools\n",
"\n",
"We will first create some tools we can use"
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "b812b982",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import initialize_agent, AgentType, Tool\n",
"from langchain.chains import LLMMathChain\n",
"from langchain.chat_models import ChatOpenAI\n",
"from langchain.llms import OpenAI\n",
"from langchain.utilities import SerpAPIWrapper, SQLDatabase\n",
"from langchain_experimental.sql import SQLDatabaseChain"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "23fc0aa6",
"metadata": {},
"outputs": [],
"source": [
"llm = ChatOpenAI(temperature=0, model=\"gpt-3.5-turbo-0613\")\n",
"search = SerpAPIWrapper()\n",
"llm_math_chain = LLMMathChain.from_llm(llm=llm, verbose=True)\n",
"db = SQLDatabase.from_uri(\"sqlite:///../../../../../notebooks/Chinook.db\")\n",
"db_chain = SQLDatabaseChain.from_llm(llm, db, verbose=True)\n",
"tools = [\n",
" Tool(\n",
" name = \"Search\",\n",
" func=search.run,\n",
" description=\"useful for when you need to answer questions about current events. You should ask targeted questions\"\n",
" ),\n",
" Tool(\n",
" name=\"Calculator\",\n",
" func=llm_math_chain.run,\n",
" description=\"useful for when you need to answer questions about math\"\n",
" ),\n",
" Tool(\n",
" name=\"FooBar-DB\",\n",
" func=db_chain.run,\n",
" description=\"useful for when you need to answer questions about FooBar. Input should be in the form of a question containing full context\"\n",
" )\n",
"]"
]
},
{
"cell_type": "markdown",
"id": "39c3ba21",
"metadata": {},
"source": [
"## Using LCEL\n",
"\n",
"We will first use LangChain Expression Language to create this agent"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "eac103f1",
"metadata": {},
"outputs": [],
"source": [
"from langchain.prompts import ChatPromptTemplate, MessagesPlaceholder"
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "55292bed",
"metadata": {},
"outputs": [],
"source": [
"prompt = ChatPromptTemplate.from_messages([\n",
" (\"system\", \"You are a helpful assistant\"),\n",
" (\"user\", \"{input}\"),\n",
" MessagesPlaceholder(variable_name=\"agent_scratchpad\"),\n",
"])"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "50f40df4",
"metadata": {},
"outputs": [],
"source": [
"from langchain.tools.render import format_tool_to_openai_function"
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "552421b3",
"metadata": {},
"outputs": [],
"source": [
"llm_with_tools = llm.bind(\n",
" functions=[format_tool_to_openai_function(t) for t in tools]\n",
")"
]
},
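{
"cell_type": "markdown",
"id": "3d7e9f10",
"metadata": {},
"source": [
"To see the JSON schema the model receives for each tool, convert one tool and inspect the result. The exact output shape may vary between LangChain versions:"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "5b8c0d21",
"metadata": {},
"outputs": [],
"source": [
"# Peek at the OpenAI function definition derived from the first tool:\n",
"# a dict with `name`, `description`, and JSON-schema `parameters`.\n",
"format_tool_to_openai_function(tools[0])"
]
},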
{
"cell_type": "code",
"execution_count": null,
"id": "3cafa0a3",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents.format_scratchpad import format_to_openai_functions\n",
"from langchain.agents.output_parsers import OpenAIFunctionsAgentOutputParser"
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "bf514eb4",
"metadata": {},
"outputs": [],
"source": [
"agent = {\n",
" \"input\": lambda x: x[\"input\"],\n",
" \"agent_scratchpad\": lambda x: format_to_openai_functions(x['intermediate_steps'])\n",
"} | prompt | llm_with_tools | OpenAIFunctionsAgentOutputParser()"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "5125573e",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import AgentExecutor"
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "bdc7e506",
"metadata": {},
"outputs": [],
"source": [
"agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)"
]
},
{
"cell_type": "code",
"execution_count": 7,
"id": "2cd65218",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m\n",
"Invoking: `Search` with `Leo DiCaprio's girlfriend`\n",
"\n",
"\n",
"\u001b[0m\u001b[36;1m\u001b[1;3m['Blake Lively and DiCaprio are believed to have enjoyed a whirlwind five-month romance in 2011. The pair were seen on a yacht together in Cannes, ...']\u001b[0m\u001b[32;1m\u001b[1;3m\n",
"Invoking: `Calculator` with `0.43`\n",
"\n",
"\n",
"\u001b[0m\n",
"\n",
"\u001b[1m> Entering new LLMMathChain chain...\u001b[0m\n",
"0.43\u001b[32;1m\u001b[1;3m```text\n",
"0.43\n",
"```\n",
"...numexpr.evaluate(\"0.43\")...\n",
"\u001b[0m\n",
"Answer: \u001b[33;1m\u001b[1;3m0.43\u001b[0m\n",
"\u001b[1m> Finished chain.\u001b[0m\n",
"\u001b[33;1m\u001b[1;3mAnswer: 0.43\u001b[0m\u001b[32;1m\u001b[1;3mI'm sorry, but I couldn't find any information about Leo DiCaprio's current girlfriend. As for raising her age to the power of 0.43, I'm not sure what her current age is, so I can't provide an answer for that.\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n"
]
},
{
"data": {
"text/plain": [
"{'input': \"Who is Leo DiCaprio's girlfriend? What is her current age raised to the 0.43 power?\",\n",
" 'output': \"I'm sorry, but I couldn't find any information about Leo DiCaprio's current girlfriend. As for raising her age to the power of 0.43, I'm not sure what her current age is, so I can't provide an answer for that.\"}"
]
},
"execution_count": 7,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"agent_executor.invoke({\"input\": \"Who is Leo DiCaprio's girlfriend? What is her current age raised to the 0.43 power?\"})"
]
},
{
"cell_type": "markdown",
"id": "8e91393f",
"metadata": {},
"source": [
"## Using OpenAIFunctionsAgent\n",
"\n",
"We can now use `OpenAIFunctionsAgent`, which creates this agent under the hood"
]
},
{
"cell_type": "code",
"execution_count": 9,
"id": "9ed07c8f",
"metadata": {},
"outputs": [],
"source": [
"agent_executor = initialize_agent(tools, llm, agent=AgentType.OPENAI_FUNCTIONS, verbose=True)"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "8d9fb674",
"metadata": {},
"outputs": [],
"source": [
"agent_executor.invoke({\"input\": \"Who is Leo DiCaprio's girlfriend? What is her current age raised to the 0.43 power?\"})"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "2bc581dc",
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.1"
}
},
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -444,9 +444,9 @@
],
"metadata": {
"kernelspec": {
"display_name": "venv",
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "venv"
"name": "python3"
},
"language_info": {
"codemirror_mode": {
@@ -458,7 +458,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.3"
"version": "3.10.1"
}
},
"nbformat": 4,

View File

@@ -0,0 +1,391 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "d82e62ec",
"metadata": {},
"source": [
"# ReAct\n",
"\n",
"This walkthrough showcases using an agent to implement the [ReAct](https://react-lm.github.io/) logic."
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "102b0e52",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import load_tools\n",
"from langchain.agents import initialize_agent\n",
"from langchain.agents import AgentType\n",
"from langchain.llms import OpenAI"
]
},
{
"cell_type": "markdown",
"id": "e0c9c056",
"metadata": {},
"source": [
"First, let's load the language model we're going to use to control the agent."
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "184f0682",
"metadata": {},
"outputs": [],
"source": [
"llm = OpenAI(temperature=0)"
]
},
{
"cell_type": "markdown",
"id": "2e67a000",
"metadata": {},
"source": [
"Next, let's load some tools to use. Note that the `llm-math` tool uses an LLM, so we need to pass that in."
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "256408d5",
"metadata": {},
"outputs": [],
"source": [
"tools = load_tools([\"serpapi\", \"llm-math\"], llm=llm)"
]
},
{
"cell_type": "markdown",
"id": "b7d04f53",
"metadata": {},
"source": [
"## Using LCEL\n",
"\n",
"We will first show how to create the agent using LCEL"
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "bb0813a3",
"metadata": {},
"outputs": [],
"source": [
"from langchain.tools.render import render_text_description\n",
"from langchain.agents.output_parsers import ReActSingleInputOutputParser\n",
"from langchain.agents.format_scratchpad import format_log_to_str\n",
"from langchain import hub"
]
},
{
"cell_type": "code",
"execution_count": 13,
"id": "d3ae5fcd",
"metadata": {},
"outputs": [],
"source": [
"prompt = hub.pull(\"hwchase17/react\")\n",
"prompt = prompt.partial(\n",
" tools=render_text_description(tools),\n",
" tool_names=\", \".join([t.name for t in tools]),\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "bf47a3c7",
"metadata": {},
"outputs": [],
"source": [
"llm_with_stop = llm.bind(stop=[\"\\nObservation\"])"
]
},
{
"cell_type": "code",
"execution_count": 7,
"id": "b3d3958b",
"metadata": {},
"outputs": [],
"source": [
"agent = {\n",
" \"input\": lambda x: x[\"input\"],\n",
" \"agent_scratchpad\": lambda x: format_log_to_str(x['intermediate_steps'])\n",
"} | prompt | llm_with_stop | ReActSingleInputOutputParser()"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "a0a57769",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import AgentExecutor"
]
},
{
"cell_type": "code",
"execution_count": 8,
"id": "026de6cd",
"metadata": {},
"outputs": [],
"source": [
"agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)"
]
},
{
"cell_type": "code",
"execution_count": 9,
"id": "57780ce1",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m I need to find out who Leo DiCaprio's girlfriend is and then calculate her age raised to the 0.43 power.\n",
"Action: Search\n",
"Action Input: \"Leo DiCaprio girlfriend\"\u001b[0m\u001b[36;1m\u001b[1;3mmodel Vittoria Ceretti\u001b[0m\u001b[32;1m\u001b[1;3m I need to find out Vittoria Ceretti's age\n",
"Action: Search\n",
"Action Input: \"Vittoria Ceretti age\"\u001b[0m\u001b[36;1m\u001b[1;3m25 years\u001b[0m\u001b[32;1m\u001b[1;3m I need to calculate 25 raised to the 0.43 power\n",
"Action: Calculator\n",
"Action Input: 25^0.43\u001b[0m\u001b[33;1m\u001b[1;3mAnswer: 3.991298452658078\u001b[0m\u001b[32;1m\u001b[1;3m I now know the final answer\n",
"Final Answer: Leo DiCaprio's girlfriend is Vittoria Ceretti and her current age raised to the 0.43 power is 3.991298452658078.\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n"
]
},
{
"data": {
"text/plain": [
"{'input': \"Who is Leo DiCaprio's girlfriend? What is her current age raised to the 0.43 power?\",\n",
" 'output': \"Leo DiCaprio's girlfriend is Vittoria Ceretti and her current age raised to the 0.43 power is 3.991298452658078.\"}"
]
},
"execution_count": 9,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"agent_executor.invoke({\"input\": \"Who is Leo DiCaprio's girlfriend? What is her current age raised to the 0.43 power?\"})"
]
},
{
"cell_type": "markdown",
"id": "b4a33ea8",
"metadata": {},
"source": [
"## Using ZeroShotReactAgent\n",
"\n",
"We will now show how to use the agent with an off-the-shelf agent implementation"
]
},
{
"cell_type": "code",
"execution_count": 10,
"id": "9752e90e",
"metadata": {},
"outputs": [],
"source": [
"agent_executor = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
]
},
{
"cell_type": "code",
"execution_count": 11,
"id": "04c5bcf6",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m I need to find out who Leo DiCaprio's girlfriend is and then calculate her age raised to the 0.43 power.\n",
"Action: Search\n",
"Action Input: \"Leo DiCaprio girlfriend\"\u001b[0m\n",
"Observation: \u001b[36;1m\u001b[1;3mmodel Vittoria Ceretti\u001b[0m\n",
"Thought:\u001b[32;1m\u001b[1;3m I need to find out Vittoria Ceretti's age\n",
"Action: Search\n",
"Action Input: \"Vittoria Ceretti age\"\u001b[0m\n",
"Observation: \u001b[36;1m\u001b[1;3m25 years\u001b[0m\n",
"Thought:\u001b[32;1m\u001b[1;3m I need to calculate 25 raised to the 0.43 power\n",
"Action: Calculator\n",
"Action Input: 25^0.43\u001b[0m\n",
"Observation: \u001b[33;1m\u001b[1;3mAnswer: 3.991298452658078\u001b[0m\n",
"Thought:\u001b[32;1m\u001b[1;3m I now know the final answer\n",
"Final Answer: Leo DiCaprio's girlfriend is Vittoria Ceretti and her current age raised to the 0.43 power is 3.991298452658078.\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n"
]
},
{
"data": {
"text/plain": [
"{'input': \"Who is Leo DiCaprio's girlfriend? What is her current age raised to the 0.43 power?\",\n",
" 'output': \"Leo DiCaprio's girlfriend is Vittoria Ceretti and her current age raised to the 0.43 power is 3.991298452658078.\"}"
]
},
"execution_count": 11,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"agent_executor.invoke({\"input\": \"Who is Leo DiCaprio's girlfriend? What is her current age raised to the 0.43 power?\"})"
]
},
{
"cell_type": "markdown",
"id": "7f3e8fc8",
"metadata": {},
"source": [
"## Using chat models\n",
"\n",
"You can also create ReAct agents that use chat models instead of LLMs as the agent driver.\n",
"\n",
"The main difference here is a different prompt. We will use JSON to encode the agent's actions (chat models are a bit tougher to steet, so using JSON helps to enforce the output format)."
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "6eeb1693",
"metadata": {},
"outputs": [],
"source": [
"from langchain.chat_models import ChatOpenAI"
]
},
{
"cell_type": "code",
"execution_count": 29,
"id": "fe846c48",
"metadata": {},
"outputs": [],
"source": [
"chat_model = ChatOpenAI(temperature=0)"
]
},
{
"cell_type": "code",
"execution_count": 27,
"id": "0843590d",
"metadata": {},
"outputs": [],
"source": [
"prompt = hub.pull(\"hwchase17/react-json\")\n",
"prompt = prompt.partial(\n",
" tools=render_text_description(tools),\n",
" tool_names=\", \".join([t.name for t in tools]),\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 30,
"id": "a863b763",
"metadata": {},
"outputs": [],
"source": [
"chat_model_with_stop = chat_model.bind(stop=[\"\\nObservation\"])"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "deaeb1f6",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents.output_parsers import ReActJsonSingleInputOutputParser"
]
},
{
"cell_type": "code",
"execution_count": 31,
"id": "6336a378",
"metadata": {},
"outputs": [],
"source": [
"agent = {\n",
" \"input\": lambda x: x[\"input\"],\n",
" \"agent_scratchpad\": lambda x: format_log_to_str(x['intermediate_steps'])\n",
"} | prompt | chat_model_with_stop | ReActJsonSingleInputOutputParser()"
]
},
{
"cell_type": "code",
"execution_count": 32,
"id": "13ad514e",
"metadata": {},
"outputs": [],
"source": [
"agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "3a3394a4",
"metadata": {},
"outputs": [],
"source": [
"agent_executor.invoke({\"input\": \"Who is Leo DiCaprio's girlfriend? What is her current age raised to the 0.43 power?\"})"
]
},
{
"cell_type": "markdown",
"id": "ffc28e29",
"metadata": {},
"source": [
"We can also use an off-the-shelf agent class"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "6c41464c",
"metadata": {},
"outputs": [],
"source": [
"\n",
"agent = initialize_agent(tools, chat_model, agent=AgentType.CHAT_ZERO_SHOT_REACT_DESCRIPTION, verbose=True)\n",
"agent.run(\"Who is Leo DiCaprio's girlfriend? What is her current age raised to the 0.43 power?\")"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.1"
}
},
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -13,6 +13,154 @@
{
"cell_type": "code",
"execution_count": 1,
"id": "2018da2d",
"metadata": {},
"outputs": [],
"source": [
"from langchain.llms import OpenAI\n",
"from langchain.utilities import SerpAPIWrapper\n",
"from langchain.agents import initialize_agent, Tool\n",
"from langchain.agents import AgentType\n",
"\n",
"llm = OpenAI(temperature=0)\n",
"search = SerpAPIWrapper()\n",
"tools = [\n",
" Tool(\n",
" name=\"Intermediate Answer\",\n",
" func=search.run,\n",
" description=\"useful for when you need to ask with search\",\n",
" )\n",
"]"
]
},
{
"cell_type": "markdown",
"id": "769c5940",
"metadata": {},
"source": [
"## Using LangChain Expression Language\n",
"\n",
"First we will show how to construct this agent from components using LangChain Expression Language"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "6be0e94d",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents.output_parsers import SelfAskOutputParser\n",
"from langchain.agents.format_scratchpad import format_log_to_str\n",
"from langchain import hub"
]
},
{
"cell_type": "code",
"execution_count": 16,
"id": "933ca47b",
"metadata": {},
"outputs": [],
"source": [
"prompt = hub.pull(\"hwchase17/self-ask-with-search\")"
]
},
{
"cell_type": "code",
"execution_count": 12,
"id": "d1437a27",
"metadata": {},
"outputs": [],
"source": [
"llm_with_stop = llm.bind(stop=[\"\\nIntermediate answer:\"])"
]
},
{
"cell_type": "code",
"execution_count": 13,
"id": "d793401e",
"metadata": {},
"outputs": [],
"source": [
"agent = {\n",
" \"input\": lambda x: x[\"input\"],\n",
" # Use some custom observation_prefix/llm_prefix for formatting\n",
" \"agent_scratchpad\": lambda x: format_log_to_str(\n",
" x['intermediate_steps'], \n",
" observation_prefix=\"\\nIntermediate answer: \",\n",
" llm_prefix=\"\",\n",
" ),\n",
"} | prompt | llm_with_stop | SelfAskOutputParser()"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "643c3bfa",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import AgentExecutor"
]
},
{
"cell_type": "code",
"execution_count": 14,
"id": "a1bb513c",
"metadata": {},
"outputs": [],
"source": [
"agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)"
]
},
{
"cell_type": "code",
"execution_count": 15,
"id": "5181f35f",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m Yes.\n",
"Follow up: Who is the reigning men's U.S. Open champion?\u001b[0m\u001b[36;1m\u001b[1;3mMen's US Open Tennis Champions Novak Djokovic earned his 24th major singles title against 2021 US Open champion Daniil Medvedev, 6-3, 7-6 (7-5), 6-3. The victory ties the Serbian player with the legendary Margaret Court for the most Grand Slam wins across both men's and women's singles.\u001b[0m\u001b[32;1m\u001b[1;3m\n",
"Follow up: Where is Novak Djokovic from?\u001b[0m\u001b[36;1m\u001b[1;3mBelgrade, Serbia\u001b[0m\u001b[32;1m\u001b[1;3m\n",
"So the final answer is: Belgrade, Serbia\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n"
]
},
{
"data": {
"text/plain": [
"{'input': \"What is the hometown of the reigning men's U.S. Open champion?\",\n",
" 'output': 'Belgrade, Serbia'}"
]
},
"execution_count": 15,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"agent_executor.invoke({\"input\": \"What is the hometown of the reigning men's U.S. Open champion?\"})"
]
},
{
"cell_type": "markdown",
"id": "6556f348",
"metadata": {},
"source": [
"## Use off-the-shelf agent"
]
},
{
"cell_type": "code",
"execution_count": 8,
"id": "7e3b513e",
"metadata": {},
"outputs": [
@@ -25,10 +173,11 @@
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m Yes.\n",
"Follow up: Who is the reigning men's U.S. Open champion?\u001b[0m\n",
"Intermediate answer: \u001b[36;1m\u001b[1;3mCarlos Alcaraz Garfia\u001b[0m\n",
"\u001b[32;1m\u001b[1;3mFollow up: Where is Carlos Alcaraz Garfia from?\u001b[0m\n",
"Intermediate answer: \u001b[36;1m\u001b[1;3mEl Palmar, Spain\u001b[0m\n",
"\u001b[32;1m\u001b[1;3mSo the final answer is: El Palmar, Spain\u001b[0m\n",
"Intermediate answer: \u001b[36;1m\u001b[1;3mMen's US Open Tennis Champions Novak Djokovic earned his 24th major singles title against 2021 US Open champion Daniil Medvedev, 6-3, 7-6 (7-5), 6-3. The victory ties the Serbian player with the legendary Margaret Court for the most Grand Slam wins across both men's and women's singles.\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m\n",
"Follow up: Where is Novak Djokovic from?\u001b[0m\n",
"Intermediate answer: \u001b[36;1m\u001b[1;3mBelgrade, Serbia\u001b[0m\n",
"\u001b[32;1m\u001b[1;3mSo the final answer is: Belgrade, Serbia\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n"
]
@@ -36,29 +185,15 @@
{
"data": {
"text/plain": [
"'El Palmar, Spain'"
"'Belgrade, Serbia'"
]
},
"execution_count": 1,
"execution_count": 8,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"from langchain.llms import OpenAI\nfrom langchain.utilities import SerpAPIWrapper\n",
"from langchain.agents import initialize_agent, Tool\n",
"from langchain.agents import AgentType\n",
"\n",
"llm = OpenAI(temperature=0)\n",
"search = SerpAPIWrapper()\n",
"tools = [\n",
" Tool(\n",
" name=\"Intermediate Answer\",\n",
" func=search.run,\n",
" description=\"useful for when you need to ask with search\",\n",
" )\n",
"]\n",
"\n",
"self_ask_with_search = initialize_agent(\n",
" tools, llm, agent=AgentType.SELF_ASK_WITH_SEARCH, verbose=True\n",
")\n",
@@ -92,7 +227,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.3"
"version": "3.10.1"
},
"vscode": {
"interpreter": {

View File

@@ -0,0 +1,330 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "2ac2115b",
"metadata": {},
"source": [
"# Structured tool chat\n",
"\n",
"The structured tool chat agent is capable of using multi-input tools.\n",
"\n",
"Older agents are configured to specify an action input as a single string, but this agent can use the provided tools' `args_schema` to populate the action input.\n"
]
},
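{
"cell_type": "markdown",
"id": "7a9d1e55",
"metadata": {},
"source": [
"For illustration, here is a minimal sketch of a multi-input tool built from a Pydantic `args_schema`. The tool name, fields, and helper function are hypothetical; the point is that the agent can populate several arguments of the action input at once."
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "2f4b6c88",
"metadata": {},
"outputs": [],
"source": [
"from pydantic import BaseModel, Field\n",
"from langchain.tools import StructuredTool\n",
"\n",
"\n",
"class SearchInput(BaseModel):\n",
"    query: str = Field(description=\"what to search for\")\n",
"    max_results: int = Field(description=\"how many results to return\")\n",
"\n",
"\n",
"def search_with_limit(query: str, max_results: int) -> str:\n",
"    \"\"\"Hypothetical search helper for demo purposes.\"\"\"\n",
"    return f\"Top {max_results} results for: {query}\"\n",
"\n",
"\n",
"# The agent reads `args_schema` to fill in both fields of the action input.\n",
"multi_input_tool = StructuredTool.from_function(\n",
"    func=search_with_limit,\n",
"    name=\"SearchWithLimit\",\n",
"    description=\"Search the web, returning at most max_results items.\",\n",
"    args_schema=SearchInput,\n",
")"
]
},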
{
"cell_type": "code",
"execution_count": 1,
"id": "68d58093",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import AgentType\n",
"from langchain.chat_models import ChatOpenAI\n",
"from langchain.agents import initialize_agent"
]
},
{
"cell_type": "markdown",
"id": "9414475b",
"metadata": {},
"source": [
"## Initialize Tools\n",
"\n",
"We will test the agent using a web browser"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "a990cea8",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents.agent_toolkits import PlayWrightBrowserToolkit\n",
"from langchain.tools.playwright.utils import (\n",
" create_async_playwright_browser,\n",
" create_sync_playwright_browser, # A synchronous browser is available, though it isn't compatible with jupyter.\n",
")\n",
"\n",
"# This import is required only for jupyter notebooks, since they have their own eventloop\n",
"import nest_asyncio\n",
"nest_asyncio.apply()"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "536fa92a",
"metadata": {},
"outputs": [],
"source": [
"!pip install playwright\n",
"\n",
"!playwright install"
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "daa3d594",
"metadata": {},
"outputs": [],
"source": [
"async_browser = create_async_playwright_browser()\n",
"browser_toolkit = PlayWrightBrowserToolkit.from_browser(async_browser=async_browser)\n",
"tools = browser_toolkit.get_tools()"
]
},
{
"cell_type": "markdown",
"id": "e3089aa8",
"metadata": {},
"source": [
"## Use LCEL\n",
"\n",
"We can first construct this agent using LangChain Expression Language"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "bf35a623",
"metadata": {},
"outputs": [],
"source": [
"from langchain import hub"
]
},
{
"cell_type": "code",
"execution_count": 19,
"id": "319e6c40",
"metadata": {},
"outputs": [],
"source": [
"prompt = hub.pull(\"hwchase17/react-multi-input-json\")"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "38c6496f",
"metadata": {},
"outputs": [],
"source": [
"from langchain.tools.render import render_text_description_and_args"
]
},
{
"cell_type": "code",
"execution_count": 20,
"id": "d25b216f",
"metadata": {},
"outputs": [],
"source": [
"prompt = prompt.partial(\n",
" tools=render_text_description_and_args(tools),\n",
" tool_names=\", \".join([t.name for t in tools]),\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 21,
"id": "fffcad76",
"metadata": {},
"outputs": [],
"source": [
"llm = ChatOpenAI(temperature=0)\n",
"llm_with_stop = llm.bind(stop=[\"Observation\"])"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "2ceceadb",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents.output_parsers import JSONAgentOutputParser\n",
"from langchain.agents.format_scratchpad import format_log_to_str"
]
},
{
"cell_type": "code",
"execution_count": 22,
"id": "d410855f",
"metadata": {},
"outputs": [],
"source": [
"agent = {\n",
" \"input\": lambda x: x[\"input\"],\n",
" \"agent_scratchpad\": lambda x: format_log_to_str(x['intermediate_steps']),\n",
"} | prompt | llm_with_stop | JSONAgentOutputParser()"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "470b0859",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import AgentExecutor"
]
},
{
"cell_type": "code",
"execution_count": 23,
"id": "b62702b4",
"metadata": {},
"outputs": [],
"source": [
"agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)"
]
},
{
"cell_type": "code",
"execution_count": 24,
"id": "97c15ef5",
"metadata": {
"scrolled": false
},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3mAction:\n",
"```\n",
"{\n",
" \"action\": \"navigate_browser\",\n",
" \"action_input\": {\n",
" \"url\": \"https://blog.langchain.dev\"\n",
" }\n",
"}\n",
"```\n",
"\u001b[0m\u001b[33;1m\u001b[1;3mNavigating to https://blog.langchain.dev returned status code 200\u001b[0m\u001b[32;1m\u001b[1;3mAction:\n",
"```\n",
"{\n",
" \"action\": \"extract_text\",\n",
" \"action_input\": {}\n",
"}\n",
"```\n",
"\n",
"\u001b[0m\u001b[31;1m\u001b[1;3mLangChain LangChain Home GitHub Docs By LangChain Release Notes Write with Us Sign in Subscribe The official LangChain blog. Subscribe now Login Featured Posts Announcing LangChain Hub Using LangSmith to Support Fine-tuning Announcing LangSmith, a unified platform for debugging, testing, evaluating, and monitoring your LLM applications Sep 20 Peering Into the Soul of AI Decision-Making with LangSmith 10 min read Sep 20 LangChain + Docugami Webinar: Lessons from Deploying LLMs with LangSmith 3 min read Sep 18 TED AI Hackathon Kickoff (and projects wed love to see) 2 min read Sep 12 How to Safely Query Enterprise Data with LangChain Agents + SQL + OpenAI + Gretel 6 min read Sep 12 OpaquePrompts x LangChain: Enhance the privacy of your LangChain application with just one code change 4 min read Load more LangChain © 2023 Sign up Powered by Ghost\u001b[0m\u001b[32;1m\u001b[1;3mAction:\n",
"```\n",
"{\n",
" \"action\": \"Final Answer\",\n",
" \"action_input\": \"The LangChain blog features posts on topics such as using LangSmith for fine-tuning, AI decision-making with LangSmith, deploying LLMs with LangSmith, and more. It also includes information on LangChain Hub and upcoming webinars. LangChain is a platform for debugging, testing, evaluating, and monitoring LLM applications.\"\n",
"}\n",
"```\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n",
"The LangChain blog features posts on topics such as using LangSmith for fine-tuning, AI decision-making with LangSmith, deploying LLMs with LangSmith, and more. It also includes information on LangChain Hub and upcoming webinars. LangChain is a platform for debugging, testing, evaluating, and monitoring LLM applications.\n"
]
}
],
"source": [
"response = await agent_executor.ainvoke({\"input\": \"Browse to blog.langchain.dev and summarize the text, please.\"})\n",
"print(response['output'])"
]
},
{
"cell_type": "markdown",
"id": "62fc1fdf",
"metadata": {},
"source": [
"## Use off the shelf agent"
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "4b585225",
"metadata": {},
"outputs": [],
"source": [
"llm = ChatOpenAI(temperature=0) # Also works well with Anthropic models\n",
"agent_chain = initialize_agent(tools, llm, agent=AgentType.STRUCTURED_CHAT_ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
]
},
{
"cell_type": "code",
"execution_count": 7,
"id": "c2a9e29c",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3mAction:\n",
"```\n",
"{\n",
" \"action\": \"navigate_browser\",\n",
" \"action_input\": {\n",
" \"url\": \"https://blog.langchain.dev\"\n",
" }\n",
"}\n",
"```\u001b[0m\n",
"Observation: \u001b[33;1m\u001b[1;3mNavigating to https://blog.langchain.dev returned status code 200\u001b[0m\n",
"Thought:\u001b[32;1m\u001b[1;3mI have successfully navigated to the blog.langchain.dev website. Now I need to extract the text from the webpage to summarize it.\n",
"Action:\n",
"```\n",
"{\n",
" \"action\": \"extract_text\",\n",
" \"action_input\": {}\n",
"}\n",
"```\u001b[0m\n",
"Observation: \u001b[31;1m\u001b[1;3mLangChain LangChain Home GitHub Docs By LangChain Release Notes Write with Us Sign in Subscribe The official LangChain blog. Subscribe now Login Featured Posts Announcing LangChain Hub Using LangSmith to Support Fine-tuning Announcing LangSmith, a unified platform for debugging, testing, evaluating, and monitoring your LLM applications Sep 20 Peering Into the Soul of AI Decision-Making with LangSmith 10 min read Sep 20 LangChain + Docugami Webinar: Lessons from Deploying LLMs with LangSmith 3 min read Sep 18 TED AI Hackathon Kickoff (and projects wed love to see) 2 min read Sep 12 How to Safely Query Enterprise Data with LangChain Agents + SQL + OpenAI + Gretel 6 min read Sep 12 OpaquePrompts x LangChain: Enhance the privacy of your LangChain application with just one code change 4 min read Load more LangChain © 2023 Sign up Powered by Ghost\u001b[0m\n",
"Thought:\u001b[32;1m\u001b[1;3mI have successfully navigated to the blog.langchain.dev website. The text on the webpage includes featured posts such as \"Announcing LangChain Hub,\" \"Using LangSmith to Support Fine-tuning,\" \"Peering Into the Soul of AI Decision-Making with LangSmith,\" \"LangChain + Docugami Webinar: Lessons from Deploying LLMs with LangSmith,\" \"TED AI Hackathon Kickoff (and projects wed love to see),\" \"How to Safely Query Enterprise Data with LangChain Agents + SQL + OpenAI + Gretel,\" and \"OpaquePrompts x LangChain: Enhance the privacy of your LangChain application with just one code change.\" There are also links to other pages on the website.\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n",
"I have successfully navigated to the blog.langchain.dev website. The text on the webpage includes featured posts such as \"Announcing LangChain Hub,\" \"Using LangSmith to Support Fine-tuning,\" \"Peering Into the Soul of AI Decision-Making with LangSmith,\" \"LangChain + Docugami Webinar: Lessons from Deploying LLMs with LangSmith,\" \"TED AI Hackathon Kickoff (and projects wed love to see),\" \"How to Safely Query Enterprise Data with LangChain Agents + SQL + OpenAI + Gretel,\" and \"OpaquePrompts x LangChain: Enhance the privacy of your LangChain application with just one code change.\" There are also links to other pages on the website.\n"
]
}
],
"source": [
"response = await agent_chain.ainvoke({\"input\": \"Browse to blog.langchain.dev and summarize the text, please.\"})\n",
"print(response['output'])"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "fc3ce811",
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.1"
}
},
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -11,34 +11,24 @@
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "f9d2ead2",
"cell_type": "markdown",
"id": "fe972808",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import XMLAgent, tool, AgentExecutor\n",
"from langchain.chat_models import ChatAnthropic\n",
"from langchain.chains import LLMChain"
"## Initialize the tools\n",
"\n",
"We will initialize some fake tools for demo purposes"
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "ebadf04f",
"metadata": {},
"outputs": [],
"source": [
"model = ChatAnthropic(model=\"claude-2\")"
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "6ce9f9a5",
"execution_count": 1,
"id": "ba547497",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import tool\n",
"\n",
"@tool\n",
"def search(query: str) -> str:\n",
" \"\"\"Search things about current events.\"\"\"\n",
@@ -47,17 +37,174 @@
},
{
"cell_type": "code",
"execution_count": 10,
"id": "c589944e",
"execution_count": 6,
"id": "e30e99e2",
"metadata": {},
"outputs": [],
"source": [
"tool_list = [search]"
"tools = [search]"
]
},
{
"cell_type": "code",
"execution_count": 11,
"execution_count": 2,
"id": "401db6ce",
"metadata": {},
"outputs": [],
"source": [
"from langchain.chat_models import ChatAnthropic\n",
"model = ChatAnthropic(model=\"claude-2\")"
]
},
{
"cell_type": "markdown",
"id": "90f83099",
"metadata": {},
"source": [
"## Use LangChain Expression Language\n",
"\n",
"We will first show how to create this agent using LangChain Expression Language"
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "78937679",
"metadata": {},
"outputs": [],
"source": [
"from langchain.tools.render import render_text_description\n",
"from langchain.agents.output_parsers import XMLAgentOutputParser\n",
"from langchain.agents.format_scratchpad import format_xml\n",
"from langchain import hub"
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "54fc5a22",
"metadata": {},
"outputs": [],
"source": [
"prompt = hub.pull(\"hwchase17/xml-agent\")"
]
},
{
"cell_type": "code",
"execution_count": 7,
"id": "b1802fcc",
"metadata": {},
"outputs": [],
"source": [
"prompt = prompt.partial(\n",
" tools=render_text_description(tools),\n",
" tool_names=\", \".join([t.name for t in tools]),\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 9,
"id": "f9d2ead2",
"metadata": {},
"outputs": [],
"source": [
"llm_with_stop = model.bind(stop=[\"</tool_input>\"])"
]
},
{
"cell_type": "code",
"execution_count": 15,
"id": "ebadf04f",
"metadata": {},
"outputs": [],
"source": [
"agent = {\n",
" \"question\": lambda x: x[\"question\"],\n",
" \"agent_scratchpad\": lambda x: format_xml(x['intermediate_steps']),\n",
"} | prompt | llm_with_stop | XMLAgentOutputParser()"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "4e2bb03e",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents import AgentExecutor"
]
},
{
"cell_type": "code",
"execution_count": 16,
"id": "6ce9f9a5",
"metadata": {},
"outputs": [],
"source": [
"agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)"
]
},
{
"cell_type": "code",
"execution_count": 18,
"id": "e14affef",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m <tool>search</tool>\n",
"<tool_input>weather in new york\u001b[0m\u001b[36;1m\u001b[1;3m32 degrees\u001b[0m\u001b[32;1m\u001b[1;3m <tool>search</tool>\n",
"<tool_input>weather in new york\u001b[0m\u001b[36;1m\u001b[1;3m32 degrees\u001b[0m\u001b[32;1m\u001b[1;3m <final_answer>\n",
"The weather in New York is 32 degrees.\n",
"</final_answer>\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n"
]
},
{
"data": {
"text/plain": [
"{'question': 'whats the weather in New york?',\n",
" 'output': '\\nThe weather in New York is 32 degrees.\\n'}"
]
},
"execution_count": 18,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"agent_executor.invoke({\"question\": \"whats the weather in New york?\"})"
]
},
{
"cell_type": "markdown",
"id": "42ff473d",
"metadata": {},
"source": [
"## Use off-the-shelf agent"
]
},
{
"cell_type": "code",
"execution_count": 22,
"id": "7e5e73e3",
"metadata": {},
"outputs": [],
"source": [
"from langchain.chains import LLMChain\n",
"from langchain.agents import XMLAgent"
]
},
{
"cell_type": "code",
"execution_count": 23,
"id": "2d8454be",
"metadata": {},
"outputs": [],
@@ -67,22 +214,22 @@
" prompt=XMLAgent.get_default_prompt(),\n",
" output_parser=XMLAgent.get_default_output_parser()\n",
")\n",
"agent = XMLAgent(tools=tool_list, llm_chain=chain)"
"agent = XMLAgent(tools=tools, llm_chain=chain)"
]
},
{
"cell_type": "code",
"execution_count": 12,
"execution_count": 25,
"id": "bca6096f",
"metadata": {},
"outputs": [],
"source": [
"agent_executor = AgentExecutor(agent=agent, tools=tool_list, verbose=True)"
"agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)"
]
},
{
"cell_type": "code",
"execution_count": 13,
"execution_count": 28,
"id": "71b872b1",
"metadata": {},
"outputs": [
@@ -94,7 +241,7 @@
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m <tool>search</tool>\n",
"<tool_input>weather in New York\u001b[0m\u001b[36;1m\u001b[1;3m32 degrees\u001b[0m\u001b[32;1m\u001b[1;3m\n",
"<tool_input>weather in new york\u001b[0m\u001b[36;1m\u001b[1;3m32 degrees\u001b[0m\u001b[32;1m\u001b[1;3m\n",
"\n",
"<final_answer>The weather in New York is 32 degrees\u001b[0m\n",
"\n",
@@ -104,16 +251,17 @@
{
"data": {
"text/plain": [
"'The weather in New York is 32 degrees'"
"{'input': 'whats the weather in New york?',\n",
" 'output': 'The weather in New York is 32 degrees'}"
]
},
"execution_count": 13,
"execution_count": 28,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"agent_executor.run(\"whats the weather in New york?\")"
"agent_executor.invoke({\"input\": \"whats the weather in New york?\"})"
]
},
{


@@ -0,0 +1,358 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "fb69907a",
"metadata": {},
"source": [
"# Returning Structured Output\n",
"\n",
"This notebook covers how to have an agent return a structured output.\n",
"By default, most of the agents return a single string.\n",
"It can often be useful to have an agent return something with more structure.\n",
"\n",
"\n",
"A good example of this is an agent tasked with doing question-answering over some sources.\n",
"Let's say we want the agent to respond not only with the answer, but also a list of the sources used.\n",
"We then want our output to roughly follow the schema below:\n",
"\n",
"```python\n",
"class Response(BaseModel):\n",
" \"\"\"Final response to the question being asked\"\"\"\n",
" answer: str = Field(description = \"The final answer to respond to the user\")\n",
" sources: List[int] = Field(description=\"List of page chunks that contain answer to the question. Only include a page chunk if it contains relevant information\")\n",
"```\n",
"\n",
"In this notebook we will go over an agent that has a retriever tool and responds in the correct format."
]
},
{
"cell_type": "markdown",
"id": "4fc33ba5",
"metadata": {},
"source": [
"## Create the Retriever\n",
"\n",
"In this section we will do some setup work to create our retriever over some mock data containing the \"State of the Union\" address. Importantly, we will add a \"page_chunk\" tag to the metadata of each document. This is just some fake data intended to simulate a source field. In practice, this would more likely be the URL or path of a document."
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "4ea20467",
"metadata": {},
"outputs": [],
"source": [
"from langchain.embeddings.openai import OpenAIEmbeddings\n",
"from langchain.vectorstores import Chroma\n",
"from langchain.text_splitter import RecursiveCharacterTextSplitter\n",
"from langchain.document_loaders import TextLoader"
]
},
{
"cell_type": "code",
"execution_count": 16,
"id": "e3002ed7",
"metadata": {},
"outputs": [],
"source": [
"# Load in document to retrieve over\n",
"loader = TextLoader('../../state_of_the_union.txt')\n",
"documents = loader.load()\n",
"\n",
"# Split document into chunks\n",
"text_splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=0)\n",
"texts = text_splitter.split_documents(documents)\n",
"\n",
"# Here is where we add in the fake source information\n",
"for i, doc in enumerate(texts):\n",
" doc.metadata['page_chunk'] = i\n",
"\n",
"# Create our retriever\n",
"embeddings = OpenAIEmbeddings()\n",
"vectorstore = Chroma.from_documents(texts, embeddings, collection_name=\"state-of-union\")\n",
"retriever = vectorstore.as_retriever()"
]
},
{
"cell_type": "markdown",
"id": "6ec1c106",
"metadata": {},
"source": [
"## Create the tools\n",
"\n",
"We will now create the tools we want to give to the agent. In this case, it is just one - a tool that wraps our retriever."
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "204ef7ca",
"metadata": {},
"outputs": [],
"source": [
"from langchain.agents.agent_toolkits.conversational_retrieval.tool import create_retriever_tool\n",
"\n",
"retriever_tool = create_retriever_tool(\n",
" retriever,\n",
" \"state-of-union-retriever\",\n",
" \"Query a retriever to get information about state of the union address\"\n",
")"
]
},
{
"cell_type": "markdown",
"id": "9af5b61b",
"metadata": {},
"source": [
"## Create response schema\n",
"\n",
"Here is where we will define the response schema. In this case, we want the final answer to have two fields: one for the `answer`, and then another that is a list of `sources`"
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "2df91723",
"metadata": {},
"outputs": [],
"source": [
"from pydantic import BaseModel, Field\n",
"from typing import List\n",
"from langchain.utils.openai_functions import convert_pydantic_to_openai_function\n",
"\n",
"class Response(BaseModel):\n",
" \"\"\"Final response to the question being asked\"\"\"\n",
" answer: str = Field(description = \"The final answer to respond to the user\")\n",
" sources: List[int] = Field(description=\"List of page chunks that contain answer to the question. Only include a page chunk if it contains relevant information\")"
]
},
{
"cell_type": "markdown",
"id": "2cd181df",
"metadata": {},
"source": [
"## Create the custom parsing logic\n",
"\n",
"We now create some custom parsing logic.\n",
"How this works is that we will pass the `Response` schema to the OpenAI LLM via their `functions` parameter.\n",
"This is similar to how we pass tools for the agent to use.\n",
"\n",
"When the `Response` function is called by OpenAI, we want to use that as a signal to return to the user.\n",
"When any other function is called by OpenAI, we treat that as a tool invocation.\n",
"\n",
"Therefor, our parsing logic has the following blocks:\n",
"\n",
"- If no function is called, assume that we should use the response to respond to the user, and therefor return `AgentFinish`\n",
"- If the `Response` function is called, respond to the user with the inputs to that function (our structured output), and therefor return `AgentFinish`\n",
"- If any other function is called, treat that as a tool invocation, and therefor return `AgentActionMessageLog`\n",
"\n",
"Note that we are using `AgentActionMessageLog` rather than `AgentAction` because it lets us attach a log of messages that we can use in the future to pass back into the agent prompt."
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "dfb73fe3",
"metadata": {},
"outputs": [],
"source": [
"from langchain.schema.agent import AgentActionMessageLog, AgentFinish\n",
"import json"
]
},
{
"cell_type": "code",
"execution_count": 17,
"id": "5b46cdb2",
"metadata": {},
"outputs": [],
"source": [
"def parse(output):\n",
" # If no function was invoked, return to user\n",
" if \"function_call\" not in output.additional_kwargs:\n",
" return AgentFinish(return_values={\"output\": output.content}, log=output.content)\n",
" \n",
" # Parse out the function call\n",
" function_call = output.additional_kwargs[\"function_call\"]\n",
" name = function_call['name']\n",
" inputs = json.loads(function_call['arguments'])\n",
" \n",
" # If the Response function was invoked, return to the user with the function inputs\n",
" if name == \"Response\":\n",
" return AgentFinish(return_values=inputs, log=str(function_call))\n",
" # Otherwise, return an agent action\n",
" else:\n",
" return AgentActionMessageLog(tool=name, tool_input=inputs, log=\"\", message_log=[output])"
]
},
{
"cell_type": "markdown",
"id": "6d7401a1",
"metadata": {},
"source": [
"## Create the Agent\n",
"\n",
"We can now put this all together! The components of this agent are:\n",
"\n",
"- prompt: a simple prompt with placeholders for the user's question and then the `agent_scratchpad` (any intermediate steps)\n",
"- tools: we can attach the tools and `Response` format to the LLM as functions\n",
"- format scratchpad: in order to format the `agent_scratchpad` from intermediate steps, we will use the standard `format_to_openai_functions`. This takes intermediate steps and formats them as AIMessages and FunctionMessages.\n",
"- output parser: we will use our custom parser above to parse the response of the LLM\n",
"- AgentExecutor: we will use the standard AgentExecutor to run the loop of agent-tool-agent-tool..."
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "73c785f9",
"metadata": {},
"outputs": [],
"source": [
"from langchain.prompts import ChatPromptTemplate, MessagesPlaceholder\n",
"from langchain.chat_models import ChatOpenAI\n",
"from langchain.tools.render import format_tool_to_openai_function\n",
"from langchain.agents.format_scratchpad import format_to_openai_functions\n",
"from langchain.agents import AgentExecutor"
]
},
{
"cell_type": "code",
"execution_count": 8,
"id": "e1feaeda",
"metadata": {},
"outputs": [],
"source": [
"prompt = ChatPromptTemplate.from_messages([\n",
" (\"system\", \"You are a helpful assistant\"),\n",
" (\"user\", \"{input}\"),\n",
" MessagesPlaceholder(variable_name=\"agent_scratchpad\"),\n",
"])"
]
},
{
"cell_type": "code",
"execution_count": 9,
"id": "d27dc3a8",
"metadata": {},
"outputs": [],
"source": [
"llm = ChatOpenAI(temperature=0)"
]
},
{
"cell_type": "code",
"execution_count": 10,
"id": "7bab4af2",
"metadata": {},
"outputs": [],
"source": [
"llm_with_tools = llm.bind(\n",
" functions=[\n",
" # The retriever tool\n",
" format_tool_to_openai_function(retriever_tool), \n",
" # Response schema\n",
" convert_pydantic_to_openai_function(Response)\n",
" ]\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 11,
"id": "b886416c",
"metadata": {},
"outputs": [],
"source": [
"agent = {\n",
" \"input\": lambda x: x[\"input\"],\n",
" # Format agent scratchpad from intermediate steps\n",
" \"agent_scratchpad\": lambda x: format_to_openai_functions(x['intermediate_steps'])\n",
"} | prompt | llm_with_tools | parse"
]
},
{
"cell_type": "code",
"execution_count": 14,
"id": "2cfd783e",
"metadata": {},
"outputs": [],
"source": [
"agent_executor = AgentExecutor(tools=[retriever_tool], agent=agent, verbose=True)"
]
},
{
"cell_type": "markdown",
"id": "9f114fec",
"metadata": {},
"source": [
"## Run the agent\n",
"\n",
"We can now run the agent! Notice how it responds with a dictionary with two keys: `answer` and `sources`"
]
},
{
"cell_type": "code",
"execution_count": 18,
"id": "2667c9a4",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
"\u001b[32;1m\u001b[1;3m\u001b[0m\u001b[36;1m\u001b[1;3m[Document(page_content='Tonight. I call on the Senate to: Pass the Freedom to Vote Act. Pass the John Lewis Voting Rights Act. And while youre at it, pass the Disclose Act so Americans can know who is funding our elections. \\n\\nTonight, Id like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. \\n\\nOne of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. \\n\\nAnd I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nations top legal minds, who will continue Justice Breyers legacy of excellence.', metadata={'page_chunk': 31, 'source': '../../state_of_the_union.txt'}), Document(page_content='One was stationed at bases and breathing in toxic smoke from “burn pits” that incinerated wastes of war—medical and hazard material, jet fuel, and more. \\n\\nWhen they came home, many of the worlds fittest and best trained warriors were never the same. \\n\\nHeadaches. Numbness. Dizziness. \\n\\nA cancer that would put them in a flag-draped coffin. \\n\\nI know. \\n\\nOne of those soldiers was my son Major Beau Biden. \\n\\nWe dont know for sure if a burn pit was the cause of his brain cancer, or the diseases of so many of our troops. \\n\\nBut Im committed to finding out everything we can. \\n\\nCommitted to military families like Danielle Robinson from Ohio. \\n\\nThe widow of Sergeant First Class Heath Robinson. \\n\\nHe was born a soldier. Army National Guard. Combat medic in Kosovo and Iraq. \\n\\nStationed near Baghdad, just yards from burn pits the size of football fields. \\n\\nHeaths widow Danielle is here with us tonight. They loved going to Ohio State football games. He loved building Legos with their daughter.', metadata={'page_chunk': 37, 'source': '../../state_of_the_union.txt'}), Document(page_content='A former top litigator in private practice. A former federal public defender. And from a family of public school educators and police officers. A consensus builder. Since shes been nominated, shes received a broad range of support—from the Fraternal Order of Police to former judges appointed by Democrats and Republicans. \\n\\nAnd if we are to advance liberty and justice, we need to secure the Border and fix the immigration system. \\n\\nWe can do both. At our border, weve installed new technology like cutting-edge scanners to better detect drug smuggling. \\n\\nWeve set up joint patrols with Mexico and Guatemala to catch more human traffickers. \\n\\nWere putting in place dedicated immigration judges so families fleeing persecution and violence can have their cases heard faster. \\n\\nWere securing commitments and supporting partners in South and Central America to host more refugees and secure their own borders.', metadata={'page_chunk': 32, 'source': '../../state_of_the_union.txt'}), Document(page_content='But cancer from prolonged exposure to burn pits ravaged Heaths lungs and body. \\n\\nDanielle says Heath was a fighter to the very end. \\n\\nHe didnt know how to stop fighting, and neither did she. \\n\\nThrough her pain she found purpose to demand we do better. \\n\\nTonight, Danielle—we are. \\n\\nThe VA is pioneering new ways of linking toxic exposures to diseases, already helping more veterans get benefits. 
\\n\\nAnd tonight, Im announcing were expanding eligibility to veterans suffering from nine respiratory cancers. \\n\\nIm also calling on Congress: pass a law to make sure veterans devastated by toxic exposures in Iraq and Afghanistan finally get the benefits and comprehensive health care they deserve. \\n\\nAnd fourth, lets end cancer as we know it. \\n\\nThis is personal to me and Jill, to Kamala, and to so many of you. \\n\\nCancer is the #2 cause of death in Americasecond only to heart disease.', metadata={'page_chunk': 38, 'source': '../../state_of_the_union.txt'})]\u001b[0m\u001b[32;1m\u001b[1;3m{'name': 'Response', 'arguments': '{\\n \"answer\": \"President mentioned Ketanji Brown Jackson as a nominee for the United States Supreme Court and praised her as one of the nation\\'s top legal minds.\",\\n \"sources\": [31]\\n}'}\u001b[0m\n",
"\n",
"\u001b[1m> Finished chain.\u001b[0m\n"
]
},
{
"data": {
"text/plain": [
"{'answer': \"President mentioned Ketanji Brown Jackson as a nominee for the United States Supreme Court and praised her as one of the nation's top legal minds.\",\n",
" 'sources': [31]}"
]
},
"execution_count": 18,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"agent_executor.invoke({\"input\": \"what did the president say about kentaji brown jackson\"}, return_only_outputs=True)"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "b355665e",
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.1"
}
},
"nbformat": 4,
"nbformat_minor": 5
}


@@ -166,9 +166,9 @@
}
],
"source": [
"import json\n",
"from langchain.load.dump import dumps\n",
"\n",
"print(json.dumps(response[\"intermediate_steps\"], indent=2))"
"print(dumps(response[\"intermediate_steps\"], pretty=True))"
]
},
{


@@ -603,7 +603,7 @@
"id": "4002a4ac-02dd-4599-9b23-9b59f54237c8",
"metadata": {},
"source": [
"The metadata attribute contains a filed called `source`. This source should be pointing at the *ultimate* provenance associated with the given document.\n",
"The metadata attribute contains a field called `source`. This source should be pointing at the *ultimate* provenance associated with the given document.\n",
"\n",
"For example, if these documents are representing chunks of some parent document, the `source` for both documents should be the same and reference the parent document.\n",
"\n",


@@ -0,0 +1,534 @@
{
"cells": [
{
"attachments": {},
"cell_type": "markdown",
"id": "13afcae7",
"metadata": {},
"source": [
"# Timescale Vector (Postgres) self-querying \n",
"\n",
"[Timescale Vector](https://www.timescale.com/ai) is PostgreSQL++ for AI applications. It enables you to efficiently store and query billions of vector embeddings in `PostgreSQL`.\n",
"\n",
"This notebook shows how to use the Postgres vector database (`TimescaleVector`) to perform self-querying. In the notebook we'll demo the `SelfQueryRetriever` wrapped around a TimescaleVector vector store. \n",
"\n",
"## What is Timescale Vector?\n",
"**[Timescale Vector](https://www.timescale.com/ai) is PostgreSQL++ for AI applications.**\n",
"\n",
"Timescale Vector enables you to efficiently store and query millions of vector embeddings in `PostgreSQL`.\n",
"- Enhances `pgvector` with faster and more accurate similarity search on 1B+ vectors via DiskANN inspired indexing algorithm.\n",
"- Enables fast time-based vector search via automatic time-based partitioning and indexing.\n",
"- Provides a familiar SQL interface for querying vector embeddings and relational data.\n",
"\n",
"Timescale Vector is cloud PostgreSQL for AI that scales with you from POC to production:\n",
"- Simplifies operations by enabling you to store relational metadata, vector embeddings, and time-series data in a single database.\n",
"- Benefits from rock-solid PostgreSQL foundation with enterprise-grade feature liked streaming backups and replication, high-availability and row-level security.\n",
"- Enables a worry-free experience with enterprise-grade security and compliance.\n",
"\n",
"## How to access Timescale Vector\n",
"Timescale Vector is available on [Timescale](https://www.timescale.com/ai), the cloud PostgreSQL platform. (There is no self-hosted version at this time.)\n",
"\n",
"LangChain users get a 90-day free trial for Timescale Vector.\n",
"- To get started, [signup](https://console.cloud.timescale.com/signup?utm_campaign=vectorlaunch&utm_source=langchain&utm_medium=referral) to Timescale, create a new database and follow this notebook!\n",
"- See the [Timescale Vector explainer blog](https://www.timescale.com/blog/how-we-made-postgresql-the-best-vector-database/?utm_campaign=vectorlaunch&utm_source=langchain&utm_medium=referral) for more details and performance benchmarks.\n",
"- See the [installation instructions](https://github.com/timescale/python-vector) for more details on using Timescale Vector in python.\n"
]
},
{
"attachments": {},
"cell_type": "markdown",
"id": "68e75fb9",
"metadata": {},
"source": [
"## Creating a TimescaleVector vectorstore\n",
"First we'll want to create a Timescale Vector vectorstore and seed it with some data. We've created a small demo set of documents that contain summaries of movies.\n",
"\n",
"NOTE: The self-query retriever requires you to have `lark` installed (`pip install lark`). We also need the `timescale-vector` package."
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "63a8af5b",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"#!pip install lark"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "22431060-52c4-48a7-a97b-9f542b8b0928",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"#!pip install timescale-vector "
]
},
{
"attachments": {},
"cell_type": "markdown",
"id": "83811610-7df3-4ede-b268-68a6a83ba9e2",
"metadata": {},
"source": [
"In this example, we'll use `OpenAIEmbeddings`, so let's load your OpenAI API key."
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "dd01b61b-7d32-4a55-85d6-b2d2d4f18840",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"# Get openAI api key by reading local .env file\n",
"# The .env file should contain a line starting with `OPENAI_API_KEY=sk-`\n",
"import os\n",
"from dotenv import load_dotenv, find_dotenv\n",
"_ = load_dotenv(find_dotenv())\n",
"\n",
"OPENAI_API_KEY = os.environ['OPENAI_API_KEY']\n",
"# Alternatively, use getpass to enter the key in a prompt\n",
"#import os\n",
"#import getpass\n",
"#os.environ[\"OPENAI_API_KEY\"] = getpass.getpass(\"OpenAI API Key:\")"
]
},
{
"attachments": {},
"cell_type": "markdown",
"id": "766e9c4b",
"metadata": {},
"source": [
"To connect to your PostgreSQL database, you'll need your service URI, which can be found in the cheatsheet or `.env` file you downloaded after creating a new database. \n",
"\n",
"If you haven't already, [signup for Timescale](https://console.cloud.timescale.com/signup?utm_campaign=vectorlaunch&utm_source=langchain&utm_medium=referral), and create a new database.\n",
"\n",
"The URI will look something like this: `postgres://tsdbadmin:<password>@<id>.tsdb.cloud.timescale.com:<port>/tsdb?sslmode=require`"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "6bd6877e",
"metadata": {},
"outputs": [],
"source": [
"# Get the service url by reading local .env file\n",
"# The .env file should contain a line starting with `TIMESCALE_SERVICE_URL=postgresql://`\n",
"_ = load_dotenv(find_dotenv())\n",
"TIMESCALE_SERVICE_URL = os.environ[\"TIMESCALE_SERVICE_URL\"]\n",
"\n",
"# Alternatively, use getpass to enter the key in a prompt\n",
"#import os\n",
"#import getpass\n",
"#TIMESCALE_SERVICE_URL = getpass.getpass(\"Timescale Service URL:\")"
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "cb4a5787",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"from langchain.schema import Document\n",
"from langchain.embeddings.openai import OpenAIEmbeddings\n",
"from langchain.vectorstores.timescalevector import TimescaleVector\n",
"\n",
"embeddings = OpenAIEmbeddings()"
]
},
{
"attachments": {},
"cell_type": "markdown",
"id": "a4f863f5",
"metadata": {},
"source": [
"Here's the sample documents we'll use for this demo. The data is about movies, and has both content and metadata fields with information about particular movie."
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "bcbe04d9",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"docs = [\n",
" Document(\n",
" page_content=\"A bunch of scientists bring back dinosaurs and mayhem breaks loose\",\n",
" metadata={\"year\": 1993, \"rating\": 7.7, \"genre\": \"science fiction\"},\n",
" ),\n",
" Document(\n",
" page_content=\"Leo DiCaprio gets lost in a dream within a dream within a dream within a ...\",\n",
" metadata={\"year\": 2010, \"director\": \"Christopher Nolan\", \"rating\": 8.2},\n",
" ),\n",
" Document(\n",
" page_content=\"A psychologist / detective gets lost in a series of dreams within dreams within dreams and Inception reused the idea\",\n",
" metadata={\"year\": 2006, \"director\": \"Satoshi Kon\", \"rating\": 8.6},\n",
" ),\n",
" Document(\n",
" page_content=\"A bunch of normal-sized women are supremely wholesome and some men pine after them\",\n",
" metadata={\"year\": 2019, \"director\": \"Greta Gerwig\", \"rating\": 8.3},\n",
" ),\n",
" Document(\n",
" page_content=\"Toys come alive and have a blast doing so\",\n",
" metadata={\"year\": 1995, \"genre\": \"animated\"},\n",
" ),\n",
" Document(\n",
" page_content=\"Three men walk into the Zone, three men walk out of the Zone\",\n",
" metadata={\n",
" \"year\": 1979,\n",
" \"rating\": 9.9,\n",
" \"director\": \"Andrei Tarkovsky\",\n",
" \"genre\": \"science fiction\",\n",
" \"rating\": 9.9,\n",
" },\n",
" ),\n",
"]"
]
},
{
"attachments": {},
"cell_type": "markdown",
"id": "7d0d771e",
"metadata": {},
"source": [
"Finally, we'll create our Timescale Vector vectorstore. Note that the collection name will be the name of the PostgreSQL table in which the documents are stored in."
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "2428d1ba",
"metadata": {},
"outputs": [],
"source": [
"COLLECTION_NAME = \"langchain_self_query_demo\"\n",
"vectorstore = TimescaleVector.from_documents(\n",
" embedding=embeddings,\n",
" documents=docs,\n",
" collection_name=COLLECTION_NAME,\n",
" service_url=TIMESCALE_SERVICE_URL,\n",
")"
]
},
{
"attachments": {},
"cell_type": "markdown",
"id": "5ecaab6d",
"metadata": {},
"source": [
"## Creating our self-querying retriever\n",
"Now we can instantiate our retriever. To do this we'll need to provide some information upfront about the metadata fields that our documents support and a short description of the document contents."
]
},
{
"cell_type": "code",
"execution_count": 14,
"id": "86e34dbf",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"from langchain.llms import OpenAI\n",
"from langchain.retrievers.self_query.base import SelfQueryRetriever\n",
"from langchain.chains.query_constructor.base import AttributeInfo\n",
"\n",
"# Give LLM info about the metadata fields\n",
"metadata_field_info = [\n",
" AttributeInfo(\n",
" name=\"genre\",\n",
" description=\"The genre of the movie\",\n",
" type=\"string or list[string]\",\n",
" ),\n",
" AttributeInfo(\n",
" name=\"year\",\n",
" description=\"The year the movie was released\",\n",
" type=\"integer\",\n",
" ),\n",
" AttributeInfo(\n",
" name=\"director\",\n",
" description=\"The name of the movie director\",\n",
" type=\"string\",\n",
" ),\n",
" AttributeInfo(\n",
" name=\"rating\", description=\"A 1-10 rating for the movie\", type=\"float\"\n",
" ),\n",
"]\n",
"document_content_description = \"Brief summary of a movie\"\n",
"\n",
"# Instantiate the self-query retriever from an LLM\n",
"llm = OpenAI(temperature=0)\n",
"retriever = SelfQueryRetriever.from_llm(\n",
" llm, vectorstore, document_content_description, metadata_field_info, verbose=True\n",
")"
]
},
{
"attachments": {},
"cell_type": "markdown",
"id": "ea9df8d4",
"metadata": {},
"source": [
"## Self Querying Retrieval with Timescale Vector\n",
"And now we can try actually using our retriever!\n",
"\n",
"Run the queries below and note how you can specify a query, filter, composite filter (filters with AND, OR) in natural language and the self-query retriever will translate that query into SQL and perform the search on the Timescale Vector (Postgres) vectorstore.\n",
"\n",
"This illustrates the power of the self-query retriever. You can use it to perform complex searches over your vectorstore without you or your users having to write any SQL directly!"
]
},
{
"cell_type": "code",
"execution_count": 15,
"id": "38a126e9",
"metadata": {},
"outputs": [
{
"name": "stderr",
"output_type": "stream",
"text": [
"/Users/avtharsewrathan/sideprojects2023/timescaleai/tsv-langchain/langchain/libs/langchain/langchain/chains/llm.py:275: UserWarning: The predict_and_parse method is deprecated, instead pass an output parser directly to LLMChain.\n",
" warnings.warn(\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"query='dinosaur' filter=None limit=None\n"
]
},
{
"data": {
"text/plain": [
"[Document(page_content='A bunch of scientists bring back dinosaurs and mayhem breaks loose', metadata={'year': 1993, 'genre': 'science fiction', 'rating': 7.7}),\n",
" Document(page_content='A bunch of scientists bring back dinosaurs and mayhem breaks loose', metadata={'year': 1993, 'genre': 'science fiction', 'rating': 7.7}),\n",
" Document(page_content='Toys come alive and have a blast doing so', metadata={'year': 1995, 'genre': 'animated'}),\n",
" Document(page_content='Toys come alive and have a blast doing so', metadata={'year': 1995, 'genre': 'animated'})]"
]
},
"execution_count": 15,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"# This example only specifies a relevant query\n",
"retriever.get_relevant_documents(\"What are some movies about dinosaurs\")"
]
},
{
"cell_type": "code",
"execution_count": 16,
"id": "fc3f1e6e",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"query=' ' filter=Comparison(comparator=<Comparator.GT: 'gt'>, attribute='rating', value=8.5) limit=None\n"
]
},
{
"data": {
"text/plain": [
"[Document(page_content='Three men walk into the Zone, three men walk out of the Zone', metadata={'year': 1979, 'genre': 'science fiction', 'rating': 9.9, 'director': 'Andrei Tarkovsky'}),\n",
" Document(page_content='Three men walk into the Zone, three men walk out of the Zone', metadata={'year': 1979, 'genre': 'science fiction', 'rating': 9.9, 'director': 'Andrei Tarkovsky'}),\n",
" Document(page_content='A psychologist / detective gets lost in a series of dreams within dreams within dreams and Inception reused the idea', metadata={'year': 2006, 'rating': 8.6, 'director': 'Satoshi Kon'}),\n",
" Document(page_content='A psychologist / detective gets lost in a series of dreams within dreams within dreams and Inception reused the idea', metadata={'year': 2006, 'rating': 8.6, 'director': 'Satoshi Kon'})]"
]
},
"execution_count": 16,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"# This example only specifies a filter\n",
"retriever.get_relevant_documents(\"I want to watch a movie rated higher than 8.5\")"
]
},
{
"cell_type": "code",
"execution_count": 17,
"id": "b19d4da0",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"query='women' filter=Comparison(comparator=<Comparator.EQ: 'eq'>, attribute='director', value='Greta Gerwig') limit=None\n"
]
},
{
"data": {
"text/plain": [
"[Document(page_content='A bunch of normal-sized women are supremely wholesome and some men pine after them', metadata={'year': 2019, 'rating': 8.3, 'director': 'Greta Gerwig'}),\n",
" Document(page_content='A bunch of normal-sized women are supremely wholesome and some men pine after them', metadata={'year': 2019, 'rating': 8.3, 'director': 'Greta Gerwig'})]"
]
},
"execution_count": 17,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"# This example specifies a query and a filter\n",
"retriever.get_relevant_documents(\"Has Greta Gerwig directed any movies about women\")"
]
},
{
"cell_type": "code",
"execution_count": 18,
"id": "f900e40e",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"query=' ' filter=Operation(operator=<Operator.AND: 'and'>, arguments=[Comparison(comparator=<Comparator.GTE: 'gte'>, attribute='rating', value=8.5), Comparison(comparator=<Comparator.EQ: 'eq'>, attribute='genre', value='science fiction')]) limit=None\n"
]
},
{
"data": {
"text/plain": [
"[Document(page_content='Three men walk into the Zone, three men walk out of the Zone', metadata={'year': 1979, 'genre': 'science fiction', 'rating': 9.9, 'director': 'Andrei Tarkovsky'}),\n",
" Document(page_content='Three men walk into the Zone, three men walk out of the Zone', metadata={'year': 1979, 'genre': 'science fiction', 'rating': 9.9, 'director': 'Andrei Tarkovsky'})]"
]
},
"execution_count": 18,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"# This example specifies a composite filter\n",
"retriever.get_relevant_documents(\n",
" \"What's a highly rated (above 8.5) science fiction film?\"\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 11,
"id": "12a51522",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"query='toys' filter=Operation(operator=<Operator.AND: 'and'>, arguments=[Comparison(comparator=<Comparator.GT: 'gt'>, attribute='year', value=1990), Comparison(comparator=<Comparator.LT: 'lt'>, attribute='year', value=2005), Comparison(comparator=<Comparator.EQ: 'eq'>, attribute='genre', value='animated')]) limit=None\n"
]
},
{
"data": {
"text/plain": [
"[Document(page_content='Toys come alive and have a blast doing so', metadata={'year': 1995, 'genre': 'animated'})]"
]
},
"execution_count": 11,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"# This example specifies a query and composite filter\n",
"retriever.get_relevant_documents(\n",
" \"What's a movie after 1990 but before 2005 that's all about toys, and preferably is animated\"\n",
")"
]
},
{
"attachments": {},
"cell_type": "markdown",
"id": "39bd1de1-b9fe-4a98-89da-58d8a7a6ae51",
"metadata": {},
"source": [
"### Filter k\n",
"\n",
"We can also use the self query retriever to specify `k`: the number of documents to fetch.\n",
"\n",
"We can do this by passing `enable_limit=True` to the constructor."
]
},
{
"cell_type": "code",
"execution_count": 19,
"id": "bff36b88-b506-4877-9c63-e5a1a8d78e64",
"metadata": {
"tags": []
},
"outputs": [],
"source": [
"retriever = SelfQueryRetriever.from_llm(\n",
" llm,\n",
" vectorstore,\n",
" document_content_description,\n",
" metadata_field_info,\n",
" enable_limit=True,\n",
" verbose=True,\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 22,
"id": "2758d229-4f97-499c-819f-888acaf8ee10",
"metadata": {
"tags": []
},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"query='dinosaur' filter=None limit=2\n"
]
},
{
"data": {
"text/plain": [
"[Document(page_content='A bunch of scientists bring back dinosaurs and mayhem breaks loose', metadata={'year': 1993, 'genre': 'science fiction', 'rating': 7.7}),\n",
" Document(page_content='A bunch of scientists bring back dinosaurs and mayhem breaks loose', metadata={'year': 1993, 'genre': 'science fiction', 'rating': 7.7})]"
]
},
"execution_count": 22,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"# This example specifies a query with a LIMIT value\n",
"retriever.get_relevant_documents(\"what are two movies about dinosaurs\")"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
}
},
"nbformat": 4,
"nbformat_minor": 5
}


@@ -36,7 +36,7 @@
"from langchain.chains import LLMChain\nfrom langchain.llms import OpenAI\nfrom langchain.prompts import PromptTemplate\n",
"from langchain.embeddings import OpenAIEmbeddings\n",
"from langchain.llms import BaseLLM\n",
"from langchain.vectorstores.base import VectorStore\n",
"from langchain.schema.vectorstore import VectorStore\n",
"from pydantic import BaseModel, Field\n",
"from langchain.chains.base import Chain\n",
"from langchain_experimental.autonomous_agents import BabyAGI"


@@ -32,7 +32,7 @@
"from langchain.chains import LLMChain\nfrom langchain.llms import OpenAI\nfrom langchain.prompts import PromptTemplate\n",
"from langchain.embeddings import OpenAIEmbeddings\n",
"from langchain.llms import BaseLLM\n",
"from langchain.vectorstores.base import VectorStore\n",
"from langchain.schema.vectorstore import VectorStore\n",
"from pydantic import BaseModel, Field\n",
"from langchain.chains.base import Chain\n",
"from langchain_experimental.autonomous_agents import BabyAGI"


@@ -1,3 +1,11 @@
# Plan-and-execute
Plan-and-execute agents accomplish an objective by first planning what to do, then executing the sub-tasks. This idea is largely inspired by [BabyAGI](https://github.com/yoheinakajima/babyagi) and then the ["Plan-and-Solve" paper](https://arxiv.org/abs/2305.04091).
The planning is almost always done by an LLM.
The execution is usually done by a separate agent (equipped with tools).
## Imports
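The imports themselves fall outside this hunk; as a rough sketch of the setup, assuming the `langchain_experimental` plan-and-execute API of this era (`PlanAndExecute`, `load_agent_executor`, `load_chat_planner`) plus SerpAPI and llm-math tools, it might look like:

```python
from langchain.chat_models import ChatOpenAI
from langchain.llms import OpenAI
from langchain.agents import load_tools
from langchain_experimental.plan_and_execute import (
    PlanAndExecute,
    load_agent_executor,
    load_chat_planner,
)

# The planner LLM breaks the objective into sub-tasks;
# the executor is a separate tool-equipped agent that carries them out.
tools = load_tools(["serpapi", "llm-math"], llm=OpenAI(temperature=0))
model = ChatOpenAI(temperature=0)
planner = load_chat_planner(model)
executor = load_agent_executor(model, tools, verbose=True)
agent = PlanAndExecute(planner=planner, executor=executor, verbose=True)

agent.run("Who won the 1978 World Cup, and what is the length of the winning country's name raised to the 0.43 power?")
```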


@@ -135,7 +135,7 @@
}
],
"source": [
"print(graph.get_schema)"
"print(graph.schema)"
]
},
{
@@ -510,13 +510,54 @@
"chain.run(\"Who played in Top Gun?\")"
]
},
{
"cell_type": "markdown",
"id": "eefea16b-508f-4552-8942-9d5063ed7d37",
"metadata": {},
"source": [
"# Ignore specified node and relationship types\n",
"You can use `include_types` or `exclude_types` to ignore parts of the graph schema when generating Cypher statements."
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "48ff7cf8-18a3-43d7-8cb1-c1b91744608d",
"execution_count": 18,
"id": "a20fa21e-fb85-41c4-aac0-53fb25e34604",
"metadata": {},
"outputs": [],
"source": []
"source": [
"chain = GraphCypherQAChain.from_llm(\n",
" graph=graph,\n",
" cypher_llm=ChatOpenAI(temperature=0, model=\"gpt-3.5-turbo\"),\n",
" qa_llm=ChatOpenAI(temperature=0, model=\"gpt-3.5-turbo-16k\"),\n",
" verbose=True,\n",
" exclude_types=['Movie']\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 19,
"id": "3ad7f6b8-543e-46e4-a3b2-40fa3e66e895",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Node properties are the following: \n",
" {'Actor': [{'property': 'name', 'type': 'STRING'}]}\n",
"Relationships properties are the following: \n",
" {}\n",
"Relationships are: \n",
"[]\n"
]
}
],
"source": [
"# Inspect graph schema\n",
"print(chain.graph_schema)"
]
}
],
"metadata": {


@@ -187,7 +187,7 @@
"metadata": {},
"outputs": [],
"source": [
"print(graph.get_schema)"
"print(graph.schema)"
]
},
{
@@ -687,7 +687,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.13"
"version": "3.8.8"
}
},
"nbformat": 4,

File diff suppressed because it is too large


@@ -1,6 +0,0 @@
{
"name": "docs",
"lockfileVersion": 3,
"requires": true,
"packages": {}
}


@@ -1,130 +0,0 @@
The `chat-conversational-react-description` agent type lets us create a conversational agent using a chat model instead of an LLM.
```python
from langchain.agents import initialize_agent, AgentType
from langchain.memory import ConversationBufferMemory
from langchain.chat_models import ChatOpenAI

# `tools` and OPENAI_API_KEY are assumed to be defined earlier on this page
memory = ConversationBufferMemory(memory_key="chat_history", return_messages=True)
llm = ChatOpenAI(openai_api_key=OPENAI_API_KEY, temperature=0)
agent_chain = initialize_agent(tools, llm, agent=AgentType.CHAT_CONVERSATIONAL_REACT_DESCRIPTION, verbose=True, memory=memory)
```
```python
agent_chain.run(input="hi, i am bob")
```
<CodeOutputBlock lang="python">
```
> Entering new AgentExecutor chain...
{
"action": "Final Answer",
"action_input": "Hello Bob! How can I assist you today?"
}
> Finished chain.
'Hello Bob! How can I assist you today?'
```
</CodeOutputBlock>
```python
agent_chain.run(input="what's my name?")
```
<CodeOutputBlock lang="python">
```
> Entering new AgentExecutor chain...
{
"action": "Final Answer",
"action_input": "Your name is Bob."
}
> Finished chain.
'Your name is Bob.'
```
</CodeOutputBlock>
```python
agent_chain.run("what are some good dinners to make this week, if i like thai food?")
```
<CodeOutputBlock lang="python">
```
> Entering new AgentExecutor chain...
{
"action": "Current Search",
"action_input": "Thai food dinner recipes"
}
Observation: 64 easy Thai recipes for any night of the week · Thai curry noodle soup · Thai yellow cauliflower, snake bean and tofu curry · Thai-spiced chicken hand pies · Thai ...
Thought:{
"action": "Final Answer",
"action_input": "Here are some Thai food dinner recipes you can try this week: Thai curry noodle soup, Thai yellow cauliflower, snake bean and tofu curry, Thai-spiced chicken hand pies, and many more. You can find the full list of recipes at the source I found earlier."
}
> Finished chain.
'Here are some Thai food dinner recipes you can try this week: Thai curry noodle soup, Thai yellow cauliflower, snake bean and tofu curry, Thai-spiced chicken hand pies, and many more. You can find the full list of recipes at the source I found earlier.'
```
</CodeOutputBlock>
```python
agent_chain.run(input="tell me the last letter in my name, and also tell me who won the world cup in 1978?")
```
<CodeOutputBlock lang="python">
```
> Entering new AgentExecutor chain...
{
"action": "Final Answer",
"action_input": "The last letter in your name is 'b'. Argentina won the World Cup in 1978."
}
> Finished chain.
"The last letter in your name is 'b'. Argentina won the World Cup in 1978."
```
</CodeOutputBlock>
```python
agent_chain.run(input="whats the weather like in pomfret?")
```
<CodeOutputBlock lang="python">
```
> Entering new AgentExecutor chain...
{
"action": "Current Search",
"action_input": "weather in pomfret"
}
Observation: Cloudy with showers. Low around 55F. Winds S at 5 to 10 mph. Chance of rain 60%. Humidity76%.
Thought:{
"action": "Final Answer",
"action_input": "Cloudy with showers. Low around 55F. Winds S at 5 to 10 mph. Chance of rain 60%. Humidity76%."
}
> Finished chain.
'Cloudy with showers. Low around 55F. Winds S at 5 to 10 mph. Chance of rain 60%. Humidity76%.'
```
</CodeOutputBlock>


@@ -1,150 +0,0 @@
This is accomplished with a specific type of agent (`conversational-react-description`) which expects to be used with a memory component.
```python
from langchain.agents import Tool
from langchain.agents import AgentType
from langchain.memory import ConversationBufferMemory
from langchain.llms import OpenAI
from langchain.utilities import SerpAPIWrapper
from langchain.agents import initialize_agent
```
```python
search = SerpAPIWrapper()
tools = [
Tool(
name = "Current Search",
func=search.run,
description="useful for when you need to answer questions about current events or the current state of the world"
),
]
```
```python
memory = ConversationBufferMemory(memory_key="chat_history")
```
```python
llm = OpenAI(temperature=0)
agent_chain = initialize_agent(tools, llm, agent=AgentType.CONVERSATIONAL_REACT_DESCRIPTION, verbose=True, memory=memory)
```
```python
agent_chain.run(input="hi, i am bob")
```
<CodeOutputBlock lang="python">
```
> Entering new AgentExecutor chain...
Thought: Do I need to use a tool? No
AI: Hi Bob, nice to meet you! How can I help you today?
> Finished chain.
'Hi Bob, nice to meet you! How can I help you today?'
```
</CodeOutputBlock>
```python
agent_chain.run(input="what's my name?")
```
<CodeOutputBlock lang="python">
```
> Entering new AgentExecutor chain...
Thought: Do I need to use a tool? No
AI: Your name is Bob!
> Finished chain.
'Your name is Bob!'
```
</CodeOutputBlock>
```python
agent_chain.run("what are some good dinners to make this week, if i like thai food?")
```
<CodeOutputBlock lang="python">
```
> Entering new AgentExecutor chain...
Thought: Do I need to use a tool? Yes
Action: Current Search
Action Input: Thai food dinner recipes
Observation: 59 easy Thai recipes for any night of the week · Marion Grasby's Thai spicy chilli and basil fried rice · Thai curry noodle soup · Marion Grasby's Thai Spicy ...
Thought: Do I need to use a tool? No
AI: Here are some great Thai dinner recipes you can try this week: Marion Grasby's Thai Spicy Chilli and Basil Fried Rice, Thai Curry Noodle Soup, Thai Green Curry with Coconut Rice, Thai Red Curry with Vegetables, and Thai Coconut Soup. I hope you enjoy them!
> Finished chain.
"Here are some great Thai dinner recipes you can try this week: Marion Grasby's Thai Spicy Chilli and Basil Fried Rice, Thai Curry Noodle Soup, Thai Green Curry with Coconut Rice, Thai Red Curry with Vegetables, and Thai Coconut Soup. I hope you enjoy them!"
```
</CodeOutputBlock>
```python
agent_chain.run(input="tell me the last letter in my name, and also tell me who won the world cup in 1978?")
```
<CodeOutputBlock lang="python">
```
> Entering new AgentExecutor chain...
Thought: Do I need to use a tool? Yes
Action: Current Search
Action Input: Who won the World Cup in 1978
Observation: Argentina national football team
Thought: Do I need to use a tool? No
AI: The last letter in your name is "b" and the winner of the 1978 World Cup was the Argentina national football team.
> Finished chain.
'The last letter in your name is "b" and the winner of the 1978 World Cup was the Argentina national football team.'
```
</CodeOutputBlock>
```python
agent_chain.run(input="whats the current temperature in pomfret?")
```
<CodeOutputBlock lang="python">
```
> Entering new AgentExecutor chain...
Thought: Do I need to use a tool? Yes
Action: Current Search
Action Input: Current temperature in Pomfret
Observation: Partly cloudy skies. High around 70F. Winds W at 5 to 10 mph. Humidity41%.
Thought: Do I need to use a tool? No
AI: The current temperature in Pomfret is around 70F with partly cloudy skies and winds W at 5 to 10 mph. The humidity is 41%.
> Finished chain.
'The current temperature in Pomfret is around 70F with partly cloudy skies and winds W at 5 to 10 mph. The humidity is 41%.'
```
</CodeOutputBlock>


@@ -1,80 +0,0 @@
Install the `openai` and `google-search-results` packages, which are required because the LangChain packages call them internally.
```bash
pip install openai google-search-results
```
```python
from langchain.agents import initialize_agent, AgentType, Tool
from langchain.chains import LLMMathChain
from langchain.chat_models import ChatOpenAI
from langchain.llms import OpenAI
from langchain.utilities import SerpAPIWrapper, SQLDatabase
from langchain_experimental.sql import SQLDatabaseChain
```
```python
llm = ChatOpenAI(temperature=0, model="gpt-3.5-turbo-0613")
search = SerpAPIWrapper()
llm_math_chain = LLMMathChain.from_llm(llm=llm, verbose=True)
db = SQLDatabase.from_uri("sqlite:///../../../../../notebooks/Chinook.db")
db_chain = SQLDatabaseChain.from_llm(llm, db, verbose=True)
tools = [
Tool(
name = "Search",
func=search.run,
description="useful for when you need to answer questions about current events. You should ask targeted questions"
),
Tool(
name="Calculator",
func=llm_math_chain.run,
description="useful for when you need to answer questions about math"
),
Tool(
name="FooBar-DB",
func=db_chain.run,
description="useful for when you need to answer questions about FooBar. Input should be in the form of a question containing full context"
)
]
```
```python
agent = initialize_agent(tools, llm, agent=AgentType.OPENAI_FUNCTIONS, verbose=True)
```
```python
agent.run("Who is Leo DiCaprio's girlfriend? What is her current age raised to the 0.43 power?")
```
<CodeOutputBlock lang="python">
```
> Entering new chain...
Invoking: `Search` with `{'query': 'Leo DiCaprio girlfriend'}`
Amidst his casual romance with Gigi, Leo allegedly entered a relationship with 19-year old model, Eden Polani, in February 2023.
Invoking: `Calculator` with `{'expression': '19^0.43'}`
> Entering new chain...
19^0.43```text
19**0.43
```
...numexpr.evaluate("19**0.43")...
Answer: 3.547023357958959
> Finished chain.
Answer: 3.547023357958959Leo DiCaprio's girlfriend is reportedly Eden Polani. Her current age raised to the power of 0.43 is approximately 3.55.
> Finished chain.
"Leo DiCaprio's girlfriend is reportedly Eden Polani. Her current age raised to the power of 0.43 is approximately 3.55."
```
</CodeOutputBlock>


@@ -1,62 +0,0 @@
```python
from langchain.agents import load_tools
from langchain.agents import initialize_agent
from langchain.agents import AgentType
from langchain.llms import OpenAI
```
First, let's load the language model we're going to use to control the agent.
```python
llm = OpenAI(temperature=0)
```
Next, let's load some tools to use. Note that the `llm-math` tool uses an LLM, so we need to pass that in.
```python
tools = load_tools(["serpapi", "llm-math"], llm=llm)
```
Finally, let's initialize an agent with the tools, the language model, and the type of agent we want to use.
```python
agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)
```
Now let's test it out!
```python
agent.run("Who is Leo DiCaprio's girlfriend? What is her current age raised to the 0.43 power?")
```
<CodeOutputBlock lang="python">
```
> Entering new AgentExecutor chain...
I need to find out who Leo DiCaprio's girlfriend is and then calculate her age raised to the 0.43 power.
Action: Search
Action Input: "Leo DiCaprio girlfriend"
Observation: Camila Morrone
Thought: I need to find out Camila Morrone's age
Action: Search
Action Input: "Camila Morrone age"
Observation: 25 years
Thought: I need to calculate 25 raised to the 0.43 power
Action: Calculator
Action Input: 25^0.43
Observation: Answer: 3.991298452658078
Thought: I now know the final answer
Final Answer: Camila Morrone is Leo DiCaprio's girlfriend and her current age raised to the 0.43 power is 3.991298452658078.
> Finished chain.
"Camila Morrone is Leo DiCaprio's girlfriend and her current age raised to the 0.43 power is 3.991298452658078."
```
</CodeOutputBlock>


@@ -1,7 +0,0 @@
```python
from langchain.chat_models import ChatOpenAI
from langchain.agents import initialize_agent, AgentType

# `tools` is assumed to be defined as in the preceding examples
chat_model = ChatOpenAI(temperature=0)
agent = initialize_agent(tools, chat_model, agent=AgentType.CHAT_ZERO_SHOT_REACT_DESCRIPTION, verbose=True)
agent.run("Who is Leo DiCaprio's girlfriend? What is her current age raised to the 0.43 power?")
```


@@ -1,13 +1,15 @@
This will go over how to get started building an agent.
We will use a LangChain agent class, but show how to customize it to give it specific context.
We will then define custom tools, and then run it all in the standard LangChain `AgentExecutor`.
We will create this agent from scratch, using LangChain Expression Language.
We will then define custom tools, and then run it in a custom loop (we will also show how to use the standard LangChain `AgentExecutor`).
### Set up the agent
We will use the `OpenAIFunctionsAgent`.
This is the easiest and best agent to get started with.
It does, however, require the use of `ChatOpenAI` models.
If you want to use a different language model, we would recommend using the [ReAct](/docs/modules/agents/agent_types/react) agent.
We first need to create our agent.
This is the chain responsible for determining what action to take next.
In this example, we will use OpenAI Function Calling to create this agent.
This is generally the most reliable way to create agents.
In this example we will show what it is like to construct this agent from scratch, using LangChain Expression Language.
For this guide, we will construct a custom agent that has access to a custom tool.
We are choosing this example because we think for most use cases you will NEED to customize either the agent or the tools.
@@ -39,23 +41,94 @@ tools = [get_word_length]
```
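The tool definition itself is elided by the diff; as a minimal sketch consistent with the `tools = [get_word_length]` shown in the hunk header above, the custom tool might be defined like this:

```python
from langchain.agents import tool

@tool
def get_word_length(word: str) -> int:
    """Returns the length of a word."""
    return len(word)

tools = [get_word_length]
```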
Now let us create the prompt.
We can use the `OpenAIFunctionsAgent.create_prompt` helper function to create a prompt automatically.
This allows for a few different ways to customize, including passing in a custom `SystemMessage`, which we will do.
Because OpenAI Function Calling is fine-tuned for tool usage, we hardly need any instructions on how to reason or how to format output.
We will just have two input variables: `input` (for the user question) and `agent_scratchpad` (for any previous steps taken).
```python
from langchain.schema import SystemMessage
from langchain.agents import OpenAIFunctionsAgent
system_message = SystemMessage(content="You are a very powerful assistant, but bad at calculating lengths of words.")
prompt = OpenAIFunctionsAgent.create_prompt(system_message=system_message)
from langchain.prompts import ChatPromptTemplate, MessagesPlaceholder
prompt = ChatPromptTemplate.from_messages([
("system", "You are very powerful assistant, but bad at calculating lengths of words."),
("user", "{input}"),
MessagesPlaceholder(variable_name="agent_scratchpad"),
])
```
How does the agent know what tools it can use?
Those are passed in as a separate argument, so we can bind them as keyword arguments to the LLM.
```python
from langchain.tools.render import format_tool_to_openai_function
llm_with_tools = llm.bind(
functions=[format_tool_to_openai_function(t) for t in tools]
)
```
Putting those pieces together, we can now create the agent.
We will import two final utility functions: a component for formatting intermediate steps to messages, and a component for converting the output message into an agent action/agent finish.
```python
agent = OpenAIFunctionsAgent(llm=llm, tools=tools, prompt=prompt)
from langchain.agents.format_scratchpad import format_to_openai_functions
from langchain.agents.output_parsers import OpenAIFunctionsAgentOutputParser
agent = {
"input": lambda x: x["input"],
"agent_scratchpad": lambda x: format_to_openai_functions(x['intermediate_steps'])
} | prompt | llm_with_tools | OpenAIFunctionsAgentOutputParser()
```
Finally, we create the `AgentExecutor` - the runtime for our agent.
Now that we have our agent, let's play around with it!
Let's pass in a simple question and empty intermediate steps and see what it returns:
```python
agent.invoke({
    "input": "how many letters in the word educa?",
    "intermediate_steps": []
})
```
We can see that it responds with an `AgentAction` to take (it's actually an `AgentActionMessageLog` - a subclass of `AgentAction` which also tracks the full message log).
So this is just the first step - now we need to write a runtime for this.
The simplest one just loops: call the agent, execute its action, and repeat until an `AgentFinish` is returned.
Let's code that up below:
```python
from langchain.schema.agent import AgentFinish
intermediate_steps = []
while True:
    output = agent.invoke({
        "input": "how many letters in the word educa?",
        "intermediate_steps": intermediate_steps
    })
    if isinstance(output, AgentFinish):
        final_result = output.return_values["output"]
        break
    else:
        print(output.tool, output.tool_input)
        tool = {
            "get_word_length": get_word_length
        }[output.tool]
        observation = tool.run(output.tool_input)
        intermediate_steps.append((output, observation))
print(final_result)
```
We can see this prints out the following:
<CodeOutputBlock lang="python">
```
get_word_length {'word': 'educa'}
There are 5 letters in the word "educa".
```
</CodeOutputBlock>
Woo! It's working.
To simplify this a bit, we can import and use the `AgentExecutor` class.
This bundles up all of the above and adds in error handling, early stopping, tracing, and other quality-of-life improvements that reduce the amount of safeguard code you need to write.
```python
from langchain.agents import AgentExecutor
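# The executor construction itself is elided by this diff hunk; the same
# call appears verbatim in the memory section further below.
agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)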
@@ -66,7 +139,7 @@ Now let's test it out!
```python
agent_executor.run("how many letters in the word educa?")
agent_executor.invoke({"input": "how many letters in the word educa?"})
```
<CodeOutputBlock lang="python">
@@ -97,36 +170,44 @@ Let's fix that by adding in memory.
In order to do this, we need to do two things:
1. Add a place for memory variables to go in the prompt
2. Add memory to the `AgentExecutor` (note that we add it here, and NOT to the agent, as this is the outermost chain)
2. Keep track of the chat history
First, let's add a place for memory in the prompt.
We do this by adding a placeholder for messages with the key `"chat_history"`.
Notice that we put this ABOVE the new user input (to follow the conversation flow).
```python
from langchain.prompts import MessagesPlaceholder
MEMORY_KEY = "chat_history"
prompt = OpenAIFunctionsAgent.create_prompt(
    system_message=system_message,
    extra_prompt_messages=[MessagesPlaceholder(variable_name=MEMORY_KEY)]
)
prompt = ChatPromptTemplate.from_messages([
    ("system", "You are a very powerful assistant, but bad at calculating lengths of words."),
    MessagesPlaceholder(variable_name=MEMORY_KEY),
    ("user", "{input}"),
    MessagesPlaceholder(variable_name="agent_scratchpad"),
])
```
Next, let's create a memory object.
We will do this by using `ConversationBufferMemory`.
Importantly, we set `memory_key` also equal to `"chat_history"` (to align it with the prompt) and set `return_messages` (to make it return messages rather than a string).
```python
from langchain.memory import ConversationBufferMemory
memory = ConversationBufferMemory(memory_key=MEMORY_KEY, return_messages=True)
```
We can then set up a list to track the chat history:
```python
from langchain.schema.messages import HumanMessage, AIMessage
chat_history = []
```
We can then put it all together!
```python
agent = OpenAIFunctionsAgent(llm=llm, tools=tools, prompt=prompt)
agent_executor = AgentExecutor(agent=agent, tools=tools, memory=memory, verbose=True)
agent_executor.run("how many letters in the word educa?")
agent_executor.run("is that a real word?")
agent = {
    "input": lambda x: x["input"],
    "agent_scratchpad": lambda x: format_to_openai_functions(x['intermediate_steps']),
    "chat_history": lambda x: x["chat_history"]
} | prompt | llm_with_tools | OpenAIFunctionsAgentOutputParser()
agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True)
```
When running, we now need to track the inputs and outputs as chat history:
```python
input1 = "how many letters in the word educa?"
result = agent_executor.invoke({"input": input1, "chat_history": chat_history})
chat_history.append(HumanMessage(content=input1))
chat_history.append(AIMessage(content=result['output']))
agent_executor.invoke({"input": "is that a real word?", "chat_history": chat_history})
```
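If that bookkeeping gets repetitive, one option is to wrap it in a small helper (a sketch; the `ask` helper is ours, not part of the docs):
```python
def ask(question: str) -> str:
    """Invoke the agent and record the exchange in chat_history."""
    result = agent_executor.invoke({"input": question, "chat_history": chat_history})
    chat_history.append(HumanMessage(content=question))
    chat_history.append(AIMessage(content=result["output"]))
    return result["output"]

print(ask("how many letters in the word educa?"))
print(ask("is that a real word?"))
```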

View File

@@ -89,7 +89,8 @@ Suppose we are interested in extracting the values under the `content` field wit
```python
loader = JSONLoader(
    file_path='./example_data/facebook_chat.json',
    jq_schema='.messages[].content')
    jq_schema='.messages[].content',
    text_content=False)
data = loader.load()
```
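A note on the new keyword: by default `JSONLoader` expects the value selected by `jq_schema` to be a string and raises an error otherwise, which is presumably why `text_content=False` is added here - it allows non-string (or missing) `content` values to be loaded as well.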
@@ -145,6 +146,7 @@ pprint(Path(file_path).read_text())
loader = JSONLoader(
    file_path='./example_data/facebook_chat_messages.jsonl',
    jq_schema='.content',
    text_content=False,
    json_lines=True)
data = loader.load()

View File

@@ -13,4 +13,4 @@ Some of the code here may be marked with security notices. However,
given the exploratory and experimental nature of the code in this package,
the lack of a security notice on a piece of code does not mean that
the code in question does not require additional security considerations
in order to be safe to use.
in order to be safe to use.

View File

@@ -10,9 +10,9 @@ from langchain.schema import (
    Document,
)
from langchain.schema.messages import AIMessage, HumanMessage, SystemMessage
from langchain.schema.vectorstore import VectorStoreRetriever
from langchain.tools.base import BaseTool
from langchain.tools.human.tool import HumanInputRun
from langchain.vectorstores.base import VectorStoreRetriever
from langchain_experimental.autonomous_agents.autogpt.output_parser import (
    AutoGPTOutputParser,

View File

@@ -1,7 +1,7 @@
from typing import Any, Dict, List
from langchain.memory.chat_memory import BaseChatMemory, get_prompt_input_key
from langchain.vectorstores.base import VectorStoreRetriever
from langchain.schema.vectorstore import VectorStoreRetriever
from langchain_experimental.pydantic_v1 import Field

View File

@@ -5,8 +5,8 @@ from langchain.prompts.chat import (
    BaseChatPromptTemplate,
)
from langchain.schema.messages import BaseMessage, HumanMessage, SystemMessage
from langchain.schema.vectorstore import VectorStoreRetriever
from langchain.tools.base import BaseTool
from langchain.vectorstores.base import VectorStoreRetriever
from langchain_experimental.autonomous_agents.autogpt.prompt_generator import get_prompt
from langchain_experimental.pydantic_v1 import BaseModel

View File

@@ -5,7 +5,7 @@ from typing import Any, Dict, List, Optional
from langchain.callbacks.manager import CallbackManagerForChainRun
from langchain.chains.base import Chain
from langchain.schema.language_model import BaseLanguageModel
from langchain.vectorstores.base import VectorStore
from langchain.schema.vectorstore import VectorStore
from langchain_experimental.autonomous_agents.baby_agi.task_creation import (
    TaskCreationChain,

View File

@@ -8,7 +8,7 @@ from langchain.chains.llm import LLMChain
from langchain.chains.sql_database.prompt import PROMPT, SQL_PROMPTS
from langchain.prompts.prompt import PromptTemplate
from langchain.schema import BaseOutputParser, BasePromptTemplate
from langchain.schema.base import Embeddings
from langchain.schema.embeddings import Embeddings
from langchain.schema.language_model import BaseLanguageModel
from langchain.tools.sql_database.prompt import QUERY_CHECKER
from langchain.utilities.sql_database import SQLDatabase
@@ -76,23 +76,11 @@ class VectorSQLRetrieveAllOutputParser(VectorSQLOutputParser):
        return super().parse(text)
def _try_eval(x: Any) -> Any:
    try:
        return eval(x)
    except Exception:
        return x
def get_result_from_sqldb(
    db: SQLDatabase, cmd: str
) -> Union[str, List[Dict[str, Any]], Dict[str, Any]]:
    result = db._execute(cmd, fetch="all")  # type: ignore
    if isinstance(result, list):
        return [{k: _try_eval(v) for k, v in dict(d._asdict()).items()} for d in result]
    else:
        return {
            k: _try_eval(v) for k, v in dict(result._asdict()).items()  # type: ignore
        }
    return result
class VectorSQLDatabaseChain(SQLDatabaseChain):

View File

@@ -1,14 +1,12 @@
from typing import TYPE_CHECKING, Any, Dict, List, Optional
from typing import Any, Dict, List, Optional
from langchain.chains.base import Chain
from langchain.chains.llm import LLMChain
from langchain.prompts import PromptTemplate
from langchain.schema.language_model import BaseLanguageModel
from langchain_experimental.synthetic_data.prompts import SENTENCE_PROMPT
if TYPE_CHECKING:
    from langchain.chains.base import Chain
    from langchain.prompts import PromptTemplate
    from langchain.schema.language_model import BaseLanguageModel
def create_data_generation_chain(
    llm: BaseLanguageModel,

View File

@@ -1,6 +1,6 @@
[tool.poetry]
name = "langchain-experimental"
version = "0.0.18"
version = "0.0.22"
description = "Building applications with LLMs through composability"
authors = []
license = "MIT"

View File

@@ -7,47 +7,18 @@ all: help
# TESTING AND COVERAGE
######################
# Define a variable for the test file path.
TEST_FILE ?= tests/unit_tests/
# Run unit tests and generate a coverage report.
coverage:
	poetry run pytest --cov \
		--cov-config=.coveragerc \
		--cov-report xml \
		--cov-report term-missing:skip-covered
		--cov-report term-missing:skip-covered \
		$(TEST_FILE)
######################
# DOCUMENTATION
######################
clean: docs_clean api_docs_clean
docs_build:
	docs/.local_build.sh
docs_clean:
	rm -r docs/_dist
docs_linkcheck:
	poetry run linkchecker docs/_dist/docs_skeleton/ --ignore-url node_modules
api_docs_build:
	poetry run python docs/api_reference/create_api_rst.py
	cd docs/api_reference && poetry run make html
api_docs_clean:
	rm -f docs/api_reference/api_reference.rst
	cd docs/api_reference && poetry run make clean
api_docs_linkcheck:
	poetry run linkchecker docs/api_reference/_build/html/index.html
# Define a variable for the test file path.
TEST_FILE ?= tests/unit_tests/
test:
	poetry run pytest --disable-socket --allow-unix-socket $(TEST_FILE)
tests:
test tests:
	poetry run pytest --disable-socket --allow-unix-socket $(TEST_FILE)
extended_tests:
@@ -98,7 +69,6 @@ spell_fix:
help:
	@echo '===================='
	@echo '-- DOCUMENTATION --'
	@echo 'clean - run docs_clean and api_docs_clean'
	@echo 'docs_build - build the documentation'
	@echo 'docs_clean - clean the documentation build artifacts'
@@ -120,3 +90,4 @@ help:
	@echo 'test_watch - run unit tests in watch mode'
	@echo 'integration_tests - run integration tests'
	@echo 'docker_tests - run unit tests in docker'
	@echo '-- DOCUMENTATION tasks are from the top-level Makefile --'

Some files were not shown because too many files have changed in this diff.