bump version to 197 (#6007 )

support kwargs (#5990 )
Obey handler.raise_error in _ahandle_event_for_handler (#6001 )
2026-02-09 18:51:07 +00:00 · 2023-06-11 10:14:57 -07:00 · 2023-06-11 10:09:22 -07:00 · 2023-06-11 09:49:26 -07:00 · 2023-06-11 09:48:09 -07:00 · 2023-06-10 16:51:04 -07:00
188 changed files with 7197 additions and 838 deletions
--- a/docs/_static/js/mendablesearch.js
+++ b/docs/_static/js/mendablesearch.js
@@ -37,6 +37,7 @@ document.addEventListener('DOMContentLoaded', () => {
        style: { darkMode: false, accentColor: '#010810' },
        floatingButtonStyle: { color: '#ffffff', backgroundColor: '#010810' },
        anon_key: '82842b36-3ea6-49b2-9fb8-52cfc4bde6bf', // Mendable Search Public ANON key, ok to be public
+        cmdShortcutKey:'j',
        messageSettings: {
          openSourcesInNewTab: false,
          prettySources: true // Prettify the sources displayed now
--- a/docs/additional_resources/deploy_llms.rst
+++ b/docs/additional_resources/deploy_llms.rst
@@ -24,9 +24,9 @@ This guide aims to provide a comprehensive overview of the requirements for depl

 Understanding these components is crucial when assessing serving systems. LangChain integrates with several open-source projects designed to tackle these issues, providing a robust framework for productionizing your LLM applications. Some notable frameworks include:

- `Ray Serve <../../../ecosystem/ray_serve.html>`_
+- `Ray Serve <../integrations/ray_serve.html>`_
 - `BentoML <https://github.com/ssheng/BentoChain>`_
- `Modal <../../../ecosystem/modal.html>`_
+- `Modal <../integrations/modal.html>`_

 These links will provide further information on each ecosystem, assisting you in finding the best fit for your LLM deployment needs.

--- a/docs/ecosystem/baseten.md
+++ b/docs/ecosystem/baseten.md
@@ -0,0 +1,25 @@
+# Baseten
+
+Learn how to use LangChain with models deployed on Baseten.
+
+## Installation and setup
+
+- Create a [Baseten](https://baseten.co) account and [API key](https://docs.baseten.co/settings/api-keys).
+- Install the Baseten Python client with `pip install baseten`
+- Use your API key to authenticate with `baseten login`
+
+## Invoking a model
+
+Baseten integrates with LangChain through the LLM module, which provides a standardized and interoperable interface for models that are deployed on your Baseten workspace.
+
+You can deploy foundation models like WizardLM and Alpaca with one click from the [Baseten model library](https://app.baseten.co/explore/) or if you have your own model, [deploy it with this tutorial](https://docs.baseten.co/deploying-models/deploy).
+
+In this example, we'll work with WizardLM. [Deploy WizardLM here](https://app.baseten.co/explore/wizardlm) and follow along with the deployed [model's version ID](https://docs.baseten.co/managing-models/manage).
+
+```python
+from langchain.llms import Baseten
+
+wizardlm = Baseten(model="MODEL_VERSION_ID", verbose=True)
+
+wizardlm("What is the difference between a Wizard and a Sorcerer?")
+```
--- a/docs/integrations/awadb.md
+++ b/docs/integrations/awadb.md
@@ -0,0 +1,21 @@
+# AwaDB
+
+>[AwaDB](https://github.com/awa-ai/awadb) is an AI Native database for the search and storage of embedding vectors used by LLM Applications.
+
+## Installation and Setup
+
+```bash
+pip install awadb
+```
+
+
+## VectorStore
+
+There exists a wrapper around AwaDB vector databases, allowing you to use it as a vectorstore,
+whether for semantic search or example selection.
+
+```python
+from langchain.vectorstores import AwaDB
+```
+
+For a more detailed walkthrough of the AwaDB wrapper, see [this notebook](../modules/indexes/vectorstores/examples/awadb.ipynb)
--- a/docs/integrations/databricks.md
+++ b/docs/integrations/databricks.md
@@ -0,0 +1,36 @@
+Databricks
+==========
+
+The [Databricks](https://www.databricks.com/) Lakehouse Platform unifies data, analytics, and AI on one platform.
+
+Databricks embraces the LangChain ecosystem in various ways:
+
+1. Databricks connector for the SQLDatabase Chain: SQLDatabase.from_databricks() provides an easy way to query your data on Databricks through LangChain
+2. Databricks-managed MLflow integrates with LangChain: Tracking and serving LangChain applications with fewer steps
+3. Databricks as an LLM provider: Deploy your fine-tuned LLMs on Databricks via serving endpoints or cluster driver proxy apps, and query it as langchain.llms.Databricks
+4. Databricks Dolly: Databricks open-sourced Dolly which allows for commercial use, and can be accessed through the HuggingFace Hub
+
+Databricks connector for the SQLDatabase Chain
+----------------------------------------------
+You can connect to [Databricks runtimes](https://docs.databricks.com/runtime/index.html) and [Databricks SQL](https://www.databricks.com/product/databricks-sql) using the SQLDatabase wrapper of LangChain. See the notebook [Connect to Databricks](./databricks/databricks.html) for details.
+
+Databricks-managed MLflow integrates with LangChain
+---------------------------------------------------
+
+MLflow is an open source platform to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry. See the notebook [MLflow Callback Handler](./mlflow_tracking.ipynb) for details about MLflow's integration with LangChain.
+
+Databricks provides a fully managed and hosted version of MLflow integrated with enterprise security features, high availability, and other Databricks workspace features such as experiment and run management and notebook revision capture. MLflow on Databricks offers an integrated experience for tracking and securing machine learning model training runs and running machine learning projects. See [MLflow guide](https://docs.databricks.com/mlflow/index.html) for more details.
+
+Databricks-managed MLflow makes it more convenient to develop LangChain applications on Databricks. For MLflow tracking, you don't need to set the tracking uri. For MLflow Model Serving, you can save LangChain Chains in the MLflow langchain flavor, and then register and serve the Chain with a few clicks on Databricks, with credentials securely managed by MLflow Model Serving.
+
+Databricks as an LLM provider
+-----------------------------
+
+The notebook [Wrap Databricks endpoints as LLMs](../modules/models/llms/integrations/databricks.html) illustrates the method to wrap Databricks endpoints as LLMs in LangChain. It supports two types of endpoints: the serving endpoint, which is recommended for both production and development, and the cluster driver proxy app, which is recommended for interactive development. 
+
+Databricks endpoints support Dolly, but are also great for hosting models like MPT-7B or any other models from the HuggingFace ecosystem. Databricks endpoints can also be used with proprietary models like OpenAI to provide a governance layer for enterprises.
+
+Databricks Dolly
+----------------
+
+Databricks’ Dolly is an instruction-following large language model trained on the Databricks machine learning platform that is licensed for commercial use. The model is available on Hugging Face Hub as databricks/dolly-v2-12b. See the notebook [HuggingFace Hub](../modules/models/llms/integrations/huggingface_hub.html) for instructions to access it through the HuggingFace Hub integration with LangChain. 
--- a/docs/integrations/databricks/databricks.ipynb
+++ b/docs/integrations/databricks/databricks.ipynb
@@ -58,7 +58,7 @@
    "### Optional Parameters\n",
    "There following parameters are optional. When executing the method in a Databricks notebook, you don't need to provide them in most of the cases.\n",
    "* `host`: The Databricks workspace hostname, excluding 'https://' part. Defaults to 'DATABRICKS_HOST' environment variable or current workspace if in a Databricks notebook.\n",
-    "* `api_token`: The Databricks personal access token for accessing the Databricks SQL warehouse or the cluster. Defaults to 'DATABRICKS_API_TOKEN' environment variable or a temporary one is generated if in a Databricks notebook.\n",
+    "* `api_token`: The Databricks personal access token for accessing the Databricks SQL warehouse or the cluster. Defaults to 'DATABRICKS_TOKEN' environment variable or a temporary one is generated if in a Databricks notebook.\n",
    "* `warehouse_id`: The warehouse ID in the Databricks SQL.\n",
    "* `cluster_id`: The cluster ID in the Databricks Runtime. If running in a Databricks notebook and both 'warehouse_id' and 'cluster_id' are None, it uses the ID of the cluster the notebook is attached to.\n",
    "* `engine_args`: The arguments to be used when connecting Databricks.\n",
--- a/docs/integrations/ray_serve.ipynb
+++ b/docs/integrations/ray_serve.ipynb
--- a/docs/integrations/shaleprotocol.md
+++ b/docs/integrations/shaleprotocol.md
@@ -0,0 +1,43 @@
+# Shale Protocol
+
+[Shale Protocol](https://shaleprotocol.com) provides production-ready inference APIs for open LLMs. It's a Plug & Play API as it's hosted on a highly scalable GPU cloud infrastructure. 
+
+Our free tier supports up to 1K daily requests per key as we want to eliminate the barrier for anyone to start building genAI apps with LLMs. 
+
+With Shale Protocol, developers/researchers can create apps and explore the capabilities of open LLMs at no cost.
+
+This page covers how Shale-Serve API can be incorporated with LangChain.
+
+As of June 2023, the API supports Vicuna-13B by default. We are going to support more LLMs such as Falcon-40B in future releases. 
+
+
+## How to
+
+### 1. Find the link to our Discord on https://shaleprotocol.com. Generate an API key through the "Shale Bot" on our Discord. No credit card is required and no free trials. It's a forever free tier with 1K limit per day per API key.
+
+### 2. Use https://shale.live/v1 as OpenAI API drop-in replacement 
+
+For example
+```python
+from langchain.llms import OpenAI
+from langchain import PromptTemplate, LLMChain
+
+import os
+os.environ['OPENAI_API_BASE'] = "https://shale.live/v1"
+os.environ['OPENAI_API_KEY'] = "ENTER YOUR API KEY"
+
+llm = OpenAI()
+
+template = """Question: {question}
+
+# Answer: Let's think step by step."""
+
+prompt = PromptTemplate(template=template, input_variables=["question"])
+
+llm_chain = LLMChain(prompt=prompt, llm=llm)
+
+question = "What NFL team won the Super Bowl in the year Justin Beiber was born?"
+
+llm_chain.run(question)
+
+```
--- a/docs/integrations/vectara.md
+++ b/docs/integrations/vectara.md
@@ -4,7 +4,7 @@
 What is Vectara?

 **Vectara Overview:**
- Vectara is developer-first API platform for building conversational search applications
+- Vectara is developer-first API platform for building GenAI applications
 - To use Vectara - first [sign up](https://console.vectara.com/signup) and create an account. Then create a corpus and an API key for indexing and searching.
 - You can use Vectara's [indexing API](https://docs.vectara.com/docs/indexing-apis/indexing) to add documents into Vectara's index
 - You can use Vectara's [Search API](https://docs.vectara.com/docs/search-apis/search) to query Vectara's index (which also supports Hybrid search implicitly).
@@ -13,6 +13,13 @@ What is Vectara?
 ## Installation and Setup
 To use Vectara with LangChain no special installation steps are required. You just have to provide your customer_id, corpus ID, and an API key created within the Vectara console to enable indexing and searching.

+Alternatively these can be provided as environment variables
+- export `VECTARA_CUSTOMER_ID`="your_customer_id"
+- export `VECTARA_CORPUS_ID`="your_corpus_id"
+- export `VECTARA_API_KEY`="your-vectara-api-key"
+
+## Usage
+
 ### VectorStore

 There exists a wrapper around the Vectara platform, allowing you to use it as a vectorstore, whether for semantic search or example selection.
@@ -32,8 +39,21 @@ vectara = Vectara(
 ```
 The customer_id, corpus_id and api_key are optional, and if they are not supplied will be read from the environment variables `VECTARA_CUSTOMER_ID`, `VECTARA_CORPUS_ID` and `VECTARA_API_KEY`, respectively.

+To query the vectorstore, you can use the `similarity_search` method (or `similarity_search_with_score`), which takes a query string and returns a list of results:
+```python
+results = vectara.similarity_score("what is LangChain?")
+```

-For a more detailed walkthrough of the Vectara wrapper, see one of the two example notebooks:
+`similarity_search_with_score` also supports the following additional arguments:
+- `k`: number of results to return (defaults to 5)
+- `lambda_val`: the [lexical matching](https://docs.vectara.com/docs/api-reference/search-apis/lexical-matching) factor for hybrid search (defaults to 0.025)
+- `filter`: a [filter](https://docs.vectara.com/docs/common-use-cases/filtering-by-metadata/filter-overview) to apply to the results (default None)
+- `n_sentence_context`: number of sentences to include before/after the actual matching segment when returning results. This defaults to 0 so as to return the exact text segment that matches, but can be used with other values e.g. 2 or 3 to return adjacent text segments.
+
+The results are returned as a list of relevant documents, and a relevance score of each document.
+
+
+For a more detailed examples of using the Vectara wrapper, see one of these two sample notebooks:
 * [Chat Over Documents with Vectara](./vectara/vectara_chat.html)
 * [Vectara Text Generation](./vectara/vectara_text_generation.html)

--- a/docs/integrations/vectara/vectara_chat.ipynb
+++ b/docs/integrations/vectara/vectara_chat.ipynb
@@ -102,21 +102,11 @@
   "metadata": {
    "tags": []
   },
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "<class 'langchain.vectorstores.vectara.Vectara'>\n"
-     ]
-    }
-   ],
+   "outputs": [],
   "source": [
    "openai_api_key = os.environ['OPENAI_API_KEY']\n",
    "llm = OpenAI(openai_api_key=openai_api_key, temperature=0)\n",
-    "retriever = VectaraRetriever(vectorstore, alpha=0.025, k=5, filter=None)\n",
-    "\n",
-    "print(type(vectorstore))\n",
+    "retriever = vectorstore.as_retriever(lambda_val=0.025, k=5, filter=None)\n",
    "d = retriever.get_relevant_documents('What did the president say about Ketanji Brown Jackson')\n",
    "\n",
    "qa = ConversationalRetrievalChain.from_llm(llm, retriever, memory=memory)"
@@ -142,7 +132,7 @@
    {
     "data": {
      "text/plain": [
-       "\" The president said that Ketanji Brown Jackson is one of the nation's top legal minds, a former top litigator in private practice, and a former federal public defender.\""
+       "\" The president said that Ketanji Brown Jackson is one of the nation's top legal minds and that she will continue Justice Breyer's legacy of excellence.\""
      ]
     },
     "execution_count": 7,
@@ -174,7 +164,7 @@
    {
     "data": {
      "text/plain": [
-       "' Justice Stephen Breyer.'"
+       "' Justice Stephen Breyer'"
      ]
     },
     "execution_count": 9,
@@ -241,7 +231,7 @@
    {
     "data": {
      "text/plain": [
-       "\" The president said that Ketanji Brown Jackson is one of the nation's top legal minds, a former top litigator in private practice, and a former federal public defender.\""
+       "\" The president said that Ketanji Brown Jackson is one of the nation's top legal minds and that she will continue Justice Breyer's legacy of excellence.\""
      ]
     },
     "execution_count": 12,
@@ -286,7 +276,7 @@
    {
     "data": {
      "text/plain": [
-       "' Justice Stephen Breyer.'"
+       "' Justice Stephen Breyer'"
      ]
     },
     "execution_count": 14,
@@ -344,7 +334,7 @@
    {
     "data": {
      "text/plain": [
-       "Document(page_content='Tonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. One of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence. A former top litigator in private practice. A former federal public defender.', metadata={'source': '../../modules/state_of_the_union.txt'})"
+       "Document(page_content='Tonight. I call on the Senate to: Pass the Freedom to Vote Act. Pass the John Lewis Voting Rights Act. And while you’re at it, pass the Disclose Act so Americans can know who is funding our elections. \\n\\nTonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. \\n\\nOne of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. \\n\\nAnd I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.', metadata={'source': '../../../state_of_the_union.txt'})"
      ]
     },
     "execution_count": 17,
@@ -392,6 +382,24 @@
    "result = qa({\"question\": query, \"chat_history\": chat_history, \"vectordbkwargs\": vectordbkwargs})"
   ]
  },
+  {
+   "cell_type": "code",
+   "execution_count": 35,
+   "id": "24ebdaec",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      " The president said that Ketanji Brown Jackson is one of the nation's top legal minds and that she will continue Justice Breyer's legacy of excellence.\n"
+     ]
+    }
+   ],
+   "source": [
+    "print(result['answer'])"
+   ]
+  },
  {
   "cell_type": "markdown",
   "id": "99b96dae",
@@ -459,7 +467,7 @@
    {
     "data": {
      "text/plain": [
-       "' The president did not mention Ketanji Brown Jackson.'"
+       "\" The president said that he nominated Circuit Court of Appeals Judge Ketanji Brown Jackson, who he described as one of the nation's top legal minds, to continue Justice Breyer's legacy of excellence.\""
      ]
     },
     "execution_count": 23,
@@ -538,7 +546,7 @@
    {
     "data": {
      "text/plain": [
-       "' The president did not mention Ketanji Brown Jackson.\\nSOURCES: ../../modules/state_of_the_union.txt'"
+       "\" The president said that he nominated Circuit Court of Appeals Judge Ketanji Brown Jackson, who he described as one of the nation's top legal minds, and that she will continue Justice Breyer's legacy of excellence.\\nSOURCES: ../../../state_of_the_union.txt\""
      ]
     },
     "execution_count": 27,
@@ -598,7 +606,7 @@
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      " The president said that Ketanji Brown Jackson is one of the nation's top legal minds, a former top litigator in private practice, and a former federal public defender."
+      " The president said that Ketanji Brown Jackson is one of the nation's top legal minds and that she will continue Justice Breyer's legacy of excellence."
     ]
    }
   ],
@@ -620,7 +628,7 @@
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      " Justice Stephen Breyer."
+      " Justice Stephen Breyer"
     ]
    }
   ],
@@ -681,7 +689,7 @@
    {
     "data": {
      "text/plain": [
-       "\" The president said that Ketanji Brown Jackson is one of the nation's top legal minds, a former top litigator in private practice, and a former federal public defender.\""
+       "\" The president said that Ketanji Brown Jackson is one of the nation's top legal minds and that she will continue Justice Breyer's legacy of excellence.\""
      ]
     },
     "execution_count": 33,
--- a/docs/integrations/vectara/vectara_text_generation.ipynb
+++ b/docs/integrations/vectara/vectara_text_generation.ipynb
@@ -6,7 +6,7 @@
   "source": [
    "# Vectara Text Generation\n",
    "\n",
-    "This notebook is based on [chat_vector_db](https://github.com/hwchase17/langchain/blob/master/docs/modules/chains/index_examples/question_answering.ipynb) and adapted to Vectara."
+    "This notebook is based on [text generation](https://github.com/hwchase17/langchain/blob/master/docs/modules/chains/index_examples/vector_db_text_generation.ipynb) notebook and adapted to Vectara."
   ]
  },
  {
@@ -24,6 +24,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
+    "import os\n",
    "from langchain.llms import OpenAI\n",
    "from langchain.docstore.document import Document\n",
    "import requests\n",
@@ -159,7 +160,7 @@
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      "[{'text': '\\n\\nEnvironment variables are an essential part of any development workflow. They provide a way to store and access information that is specific to the environment in which the code is running. This can be especially useful when working with different versions of a language or framework, or when running code on different machines.\\n\\nThe Deno CLI tasks extension provides a way to easily manage environment variables when running Deno commands. This extension provides a task definition for allowing you to create tasks that execute the `deno` CLI from within the editor. The template for the Deno CLI tasks has the following interface, which can be configured in a `tasks.json` within your workspace:\\n\\nThe task definition includes the `type` field, which should be set to `deno`, and the `command` field, which is the `deno` command to run (e.g. `run`, `test`, `cache`, etc.). Additionally, you can specify additional arguments to pass on the command line, the current working directory to execute the command, and any environment variables.\\n\\nUsing environment variables with the Deno CLI tasks extension is a great way to ensure that your code is running in the correct environment. For example, if you are running a test suite,'}, {'text': '\\n\\nEnvironment variables are an important part of any programming language, and they can be used to store and access data in a variety of ways. In this blog post, we\\'ll be taking a look at environment variables specifically for the shell.\\n\\nShell variables are similar to environment variables, but they won\\'t be exported to spawned commands. They are defined with the following syntax:\\n\\n```sh\\nVAR_NAME=value\\n```\\n\\nShell variables can be used to store and access data in a variety of ways. For example, you can use them to store values that you want to re-use, but don\\'t want to be available in any spawned processes.\\n\\nFor example, if you wanted to store a value and then use it in a command, you could do something like this:\\n\\n```sh\\nVAR=hello && echo $VAR && deno eval \"console.log(\\'Deno: \\' + Deno.env.get(\\'VAR\\'))\"\\n```\\n\\nThis would output the following:\\n\\n```\\nhello\\nDeno: undefined\\n```\\n\\nAs you can see, the value stored in the shell variable is not available in the spawned process.\\n\\n'}, {'text': '\\n\\nWhen it comes to developing applications, environment variables are an essential part of the process. Environment variables are used to store information that can be used by applications and scripts to customize their behavior. This is especially important when it comes to developing applications with Deno, as there are several environment variables that can impact the behavior of Deno.\\n\\nThe most important environment variable for Deno is `DENO_AUTH_TOKENS`. This environment variable is used to store authentication tokens that are used to access remote resources. This is especially important when it comes to accessing remote APIs or databases. Without the proper authentication tokens, Deno will not be able to access the remote resources.\\n\\nAnother important environment variable for Deno is `DENO_DIR`. This environment variable is used to store the directory where Deno will store its files. This includes the Deno executable, the Deno cache, and the Deno configuration files. By setting this environment variable, you can ensure that Deno will always be able to find the files it needs.\\n\\nFinally, there is the `DENO_PLUGINS` environment variable. This environment variable is used to store the list of plugins that Deno will use. This is important for customizing the'}, {'text': '\\n\\nEnvironment variables are a great way to store and access sensitive information in your Deno applications. Deno offers built-in support for environment variables with `Deno.env`, and you can also use a `.env` file to store and access environment variables. In this blog post, we\\'ll explore both of these options and how to use them in your Deno applications.\\n\\n## Built-in `Deno.env`\\n\\nThe Deno runtime offers built-in support for environment variables with [`Deno.env`](https://deno.land/api@v1.25.3?s=Deno.env). `Deno.env` has getter and setter methods. Here is example usage:\\n\\n```ts\\nDeno.env.set(\"FIREBASE_API_KEY\", \"examplekey123\");\\nDeno.env.set(\"FIREBASE_AUTH_DOMAIN\", \"firebasedomain.com\");\\n\\nconsole.log(Deno.env.get(\"FIREBASE_API_KEY\")); // examplekey123\\nconsole.log(Deno.env.get(\"FIREBASE_AUTH_'}]\n"
+      "[{'text': '\\n\\nEnvironment variables are a powerful tool for managing configuration settings in your applications. They allow you to store and access values from anywhere in your code, making it easier to keep your codebase organized and maintainable.\\n\\nHowever, there are times when you may want to use environment variables specifically for a single command. This is where shell variables come in. Shell variables are similar to environment variables, but they won\\'t be exported to spawned commands. They are defined with the following syntax:\\n\\n```sh\\nVAR_NAME=value\\n```\\n\\nFor example, if you wanted to use a shell variable instead of an environment variable in a command, you could do something like this:\\n\\n```sh\\nVAR=hello && echo $VAR && deno eval \"console.log(\\'Deno: \\' + Deno.env.get(\\'VAR\\'))\"\\n```\\n\\nThis would output the following:\\n\\n```\\nhello\\nDeno: undefined\\n```\\n\\nShell variables can be useful when you want to re-use a value, but don\\'t want it available in any spawned processes.\\n\\nAnother way to use environment variables is through pipelines. Pipelines provide a way to pipe the'}, {'text': '\\n\\nEnvironment variables are a great way to store and access sensitive information in your applications. They are also useful for configuring applications and managing different environments. In Deno, there are two ways to use environment variables: the built-in `Deno.env` and the `.env` file.\\n\\nThe `Deno.env` is a built-in feature of the Deno runtime that allows you to set and get environment variables. It has getter and setter methods that you can use to access and set environment variables. For example, you can set the `FIREBASE_API_KEY` and `FIREBASE_AUTH_DOMAIN` environment variables like this:\\n\\n```ts\\nDeno.env.set(\"FIREBASE_API_KEY\", \"examplekey123\");\\nDeno.env.set(\"FIREBASE_AUTH_DOMAIN\", \"firebasedomain.com\");\\n\\nconsole.log(Deno.env.get(\"FIREBASE_API_KEY\")); // examplekey123\\nconsole.log(Deno.env.get(\"FIREBASE_AUTH_DOMAIN\")); // firebasedomain'}, {'text': \"\\n\\nEnvironment variables are a powerful tool for managing configuration and settings in your applications. They allow you to store and access values that can be used in your code, and they can be set and changed without having to modify your code.\\n\\nIn Deno, environment variables are defined using the `export` command. For example, to set a variable called `VAR_NAME` to the value `value`, you would use the following command:\\n\\n```sh\\nexport VAR_NAME=value\\n```\\n\\nYou can then access the value of the environment variable in your code using the `Deno.env.get()` method. For example, if you wanted to log the value of the `VAR_NAME` variable, you could use the following code:\\n\\n```js\\nconsole.log(Deno.env.get('VAR_NAME'));\\n```\\n\\nYou can also set environment variables for a single command. To do this, you can list the environment variables before the command, like so:\\n\\n```\\nVAR=hello VAR2=bye deno run main.ts\\n```\\n\\nThis will set the environment variables `VAR` and `V\"}, {'text': \"\\n\\nEnvironment variables are a powerful tool for managing settings and configuration in your applications. They can be used to store information such as user preferences, application settings, and even passwords. In this blog post, we'll discuss how to make Deno scripts executable with a hashbang (shebang).\\n\\nA hashbang is a line of code that is placed at the beginning of a script. It tells the system which interpreter to use when running the script. In the case of Deno, the hashbang should be `#!/usr/bin/env -S deno run --allow-env`. This tells the system to use the Deno interpreter and to allow the script to access environment variables.\\n\\nOnce the hashbang is in place, you may need to give the script execution permissions. On Linux, this can be done with the command `sudo chmod +x hashbang.ts`. After that, you can execute the script by calling it like any other command: `./hashbang.ts`.\\n\\nIn the example program, we give the context permission to access the environment variables and print the Deno installation path. This is done by using the `Deno.env.get()` function, which returns the value of the specified environment\"}]\n"
     ]
    }
   ],
--- a/docs/modules/agents/agent_executors/examples/agent_vectorstore.ipynb
+++ b/docs/modules/agents/agent_executors/examples/agent_vectorstore.ipynb
@@ -14,6 +14,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "9b22020a",
   "metadata": {},
@@ -139,6 +140,7 @@
   "source": []
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "c0a6c031",
   "metadata": {},
@@ -229,7 +231,7 @@
    }
   ],
   "source": [
-    "agent.run(\"What did biden say about ketanji brown jackson is the state of the union address?\")"
+    "agent.run(\"What did biden say about ketanji brown jackson in the state of the union address?\")"
   ]
  },
  {
@@ -271,6 +273,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "787a9b5e",
   "metadata": {},
@@ -279,6 +282,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "9161ba91",
   "metadata": {},
@@ -396,6 +400,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "49a0cbbe",
   "metadata": {},
--- a/docs/modules/agents/toolkits/examples/vectorstore.ipynb
+++ b/docs/modules/agents/toolkits/examples/vectorstore.ipynb
@@ -1,6 +1,7 @@
 {
 "cells": [
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "18ada398-dce6-4049-9b56-fc0ede63da9c",
   "metadata": {},
@@ -11,6 +12,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "eecb683b-3a46-4b9d-81a3-7caefbfec1a1",
   "metadata": {},
@@ -88,6 +90,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "f4814175-964d-42f1-aa9d-22801ce1e912",
   "metadata": {},
@@ -123,6 +126,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "8a38ad10",
   "metadata": {},
@@ -165,7 +169,7 @@
    }
   ],
   "source": [
-    "agent_executor.run(\"What did biden say about ketanji brown jackson is the state of the union address?\")"
+    "agent_executor.run(\"What did biden say about ketanji brown jackson in the state of the union address?\")"
   ]
  },
  {
@@ -203,10 +207,11 @@
    }
   ],
   "source": [
-    "agent_executor.run(\"What did biden say about ketanji brown jackson is the state of the union address? List the source.\")"
+    "agent_executor.run(\"What did biden say about ketanji brown jackson in the state of the union address? List the source.\")"
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "7ca07707",
   "metadata": {},
@@ -255,6 +260,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "71680984-edaf-4a63-90f5-94edbd263550",
   "metadata": {},
@@ -299,7 +305,7 @@
    }
   ],
   "source": [
-    "agent_executor.run(\"What did biden say about ketanji brown jackson is the state of the union address?\")"
+    "agent_executor.run(\"What did biden say about ketanji brown jackson in the state of the union address?\")"
   ]
  },
  {
--- a/docs/modules/agents/tools/getting_started.md
+++ b/docs/modules/agents/tools/getting_started.md
@@ -160,3 +160,9 @@ Below is a list of all supported tools and relevant information:
 - Notes: A connection to the OpenWeatherMap API (https://api.openweathermap.org), specifically the `/data/2.5/weather` endpoint.
 - Requires LLM: No
 - Extra Parameters: `openweathermap_api_key` (your API key to access this endpoint)
+
+**sleep**
+
+- Tool Name: Sleep
+- Tool Description: Make agent sleep for some time.
+- Requires LLM: No
--- a/docs/modules/chains/examples/graph_cypher_qa.ipynb
+++ b/docs/modules/chains/examples/graph_cypher_qa.ipynb
@@ -177,7 +177,7 @@
      "\u001b[32;1m\u001b[1;3mMATCH (a:Actor)-[:ACTED_IN]->(m:Movie {name: 'Top Gun'})\n",
      "RETURN a.name\u001b[0m\n",
      "Full Context:\n",
-      "\u001b[32;1m\u001b[1;3m[{'a.name': 'Tom Cruise'}, {'a.name': 'Val Kilmer'}, {'a.name': 'Anthony Edwards'}, {'a.name': 'Meg Ryan'}]\u001b[0m\n",
+      "\u001b[32;1m\u001b[1;3m[{'a.name': 'Val Kilmer'}, {'a.name': 'Anthony Edwards'}, {'a.name': 'Meg Ryan'}, {'a.name': 'Tom Cruise'}]\u001b[0m\n",
      "\n",
      "\u001b[1m> Finished chain.\u001b[0m\n"
     ]
@@ -185,7 +185,7 @@
    {
     "data": {
      "text/plain": [
-       "'Tom Cruise, Val Kilmer, Anthony Edwards, and Meg Ryan played in Top Gun.'"
+       "'Val Kilmer, Anthony Edwards, Meg Ryan, and Tom Cruise played in Top Gun.'"
      ]
     },
     "execution_count": 7,
@@ -197,10 +197,180 @@
    "chain.run(\"Who played in Top Gun?\")"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "2d28c4df",
+   "metadata": {},
+   "source": [
+    "## Limit the number of results\n",
+    "You can limit the number of results from the Cypher QA Chain using the `top_k` parameter.\n",
+    "The default is 10."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 8,
+   "id": "df230946",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "chain = GraphCypherQAChain.from_llm(\n",
+    "    ChatOpenAI(temperature=0), graph=graph, verbose=True, top_k=2\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "id": "3f1600ee",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "\n",
+      "\n",
+      "\u001b[1m> Entering new GraphCypherQAChain chain...\u001b[0m\n",
+      "Generated Cypher:\n",
+      "\u001b[32;1m\u001b[1;3mMATCH (a:Actor)-[:ACTED_IN]->(m:Movie {name: 'Top Gun'})\n",
+      "RETURN a.name\u001b[0m\n",
+      "Full Context:\n",
+      "\u001b[32;1m\u001b[1;3m[{'a.name': 'Val Kilmer'}, {'a.name': 'Anthony Edwards'}]\u001b[0m\n",
+      "\n",
+      "\u001b[1m> Finished chain.\u001b[0m\n"
+     ]
+    },
+    {
+     "data": {
+      "text/plain": [
+       "'Val Kilmer and Anthony Edwards played in Top Gun.'"
+      ]
+     },
+     "execution_count": 9,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "chain.run(\"Who played in Top Gun?\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "88c16206",
+   "metadata": {},
+   "source": [
+    "## Return intermediate results\n",
+    "You can return intermediate steps from the Cypher QA Chain using the `return_intermediate_steps` parameter"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "id": "e412f36b",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "chain = GraphCypherQAChain.from_llm(\n",
+    "    ChatOpenAI(temperature=0), graph=graph, verbose=True, return_intermediate_steps=True\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 11,
+   "id": "4f4699dc",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "\n",
+      "\n",
+      "\u001b[1m> Entering new GraphCypherQAChain chain...\u001b[0m\n",
+      "Generated Cypher:\n",
+      "\u001b[32;1m\u001b[1;3mMATCH (a:Actor)-[:ACTED_IN]->(m:Movie {name: 'Top Gun'})\n",
+      "RETURN a.name\u001b[0m\n",
+      "Full Context:\n",
+      "\u001b[32;1m\u001b[1;3m[{'a.name': 'Val Kilmer'}, {'a.name': 'Anthony Edwards'}, {'a.name': 'Meg Ryan'}, {'a.name': 'Tom Cruise'}]\u001b[0m\n",
+      "\n",
+      "\u001b[1m> Finished chain.\u001b[0m\n",
+      "Intermediate steps: [{'query': \"MATCH (a:Actor)-[:ACTED_IN]->(m:Movie {name: 'Top Gun'})\\nRETURN a.name\"}, {'context': [{'a.name': 'Val Kilmer'}, {'a.name': 'Anthony Edwards'}, {'a.name': 'Meg Ryan'}, {'a.name': 'Tom Cruise'}]}]\n",
+      "Final answer: Val Kilmer, Anthony Edwards, Meg Ryan, and Tom Cruise played in Top Gun.\n"
+     ]
+    }
+   ],
+   "source": [
+    "result = chain(\"Who played in Top Gun?\")\n",
+    "print(f\"Intermediate steps: {result['intermediate_steps']}\")\n",
+    "print(f\"Final answer: {result['result']}\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "d6e1b054",
+   "metadata": {},
+   "source": [
+    "## Return direct results\n",
+    "You can return direct results from the Cypher QA Chain using the `return_direct` parameter"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 12,
+   "id": "2d3acf10",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "chain = GraphCypherQAChain.from_llm(\n",
+    "    ChatOpenAI(temperature=0), graph=graph, verbose=True, return_direct=True\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 13,
+   "id": "b0a9d143",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "\n",
+      "\n",
+      "\u001b[1m> Entering new GraphCypherQAChain chain...\u001b[0m\n",
+      "Generated Cypher:\n",
+      "\u001b[32;1m\u001b[1;3mMATCH (a:Actor)-[:ACTED_IN]->(m:Movie {name: 'Top Gun'})\n",
+      "RETURN a.name\u001b[0m\n",
+      "\n",
+      "\u001b[1m> Finished chain.\u001b[0m\n"
+     ]
+    },
+    {
+     "data": {
+      "text/plain": [
+       "[{'a.name': 'Val Kilmer'},\n",
+       " {'a.name': 'Anthony Edwards'},\n",
+       " {'a.name': 'Meg Ryan'},\n",
+       " {'a.name': 'Tom Cruise'}]"
+      ]
+     },
+     "execution_count": 13,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "chain.run(\"Who played in Top Gun?\")"
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
-   "id": "b4825316",
+   "id": "74d0a36f",
   "metadata": {},
   "outputs": [],
   "source": []
@@ -222,7 +392,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.8.8"
  }
 },
 "nbformat": 4,
--- a/docs/modules/chains/examples/graph_nebula_qa.ipynb
+++ b/docs/modules/chains/examples/graph_nebula_qa.ipynb
@@ -0,0 +1,270 @@
+{
+ "cells": [
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "id": "c94240f5",
+   "metadata": {},
+   "source": [
+    "# NebulaGraphQAChain\n",
+    "\n",
+    "This notebook shows how to use LLMs to provide a natural language interface to NebulaGraph database."
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "id": "dbc0ee68",
+   "metadata": {},
+   "source": [
+    "You will need to have a running NebulaGraph cluster, for which you can run a containerized cluster by running the following script:\n",
+    "\n",
+    "```bash\n",
+    "curl -fsSL nebula-up.siwei.io/install.sh | bash\n",
+    "```\n",
+    "\n",
+    "Other options are:\n",
+    "- Install as a [Docker Desktop Extension](https://www.docker.com/blog/distributed-cloud-native-graph-database-nebulagraph-docker-extension/). See [here](https://docs.nebula-graph.io/3.5.0/2.quick-start/1.quick-start-workflow/)\n",
+    "- NebulaGraph Cloud Service. See [here](https://www.nebula-graph.io/cloud)\n",
+    "- Deploy from package, source code, or via Kubernetes. See [here](https://docs.nebula-graph.io/)\n",
+    "\n",
+    "Once the cluster is running, we could create the SPACE and SCHEMA for the database."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "c82f4141",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "%pip install ipython-ngql\n",
+    "%load_ext ngql\n",
+    "\n",
+    "# connect ngql jupyter extension to nebulagraph\n",
+    "%ngql --address 127.0.0.1 --port 9669 --user root --password nebula\n",
+    "# create a new space\n",
+    "%ngql CREATE SPACE IF NOT EXISTS langchain(partition_num=1, replica_factor=1, vid_type=fixed_string(128));\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "eda0809a",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Wait for a few seconds for the space to be created.\n",
+    "%ngql USE langchain;"
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "id": "119fe35c",
+   "metadata": {},
+   "source": [
+    "Create the schema, for full dataset, refer [here](https://www.siwei.io/en/nebulagraph-etl-dbt/)."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "5aa796ee",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "%%ngql\n",
+    "CREATE TAG IF NOT EXISTS movie(name string);\n",
+    "CREATE TAG IF NOT EXISTS person(name string, birthdate string);\n",
+    "CREATE EDGE IF NOT EXISTS acted_in();\n",
+    "CREATE TAG INDEX IF NOT EXISTS person_index ON person(name(128));\n",
+    "CREATE TAG INDEX IF NOT EXISTS movie_index ON movie(name(128));"
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "id": "66e4799a",
+   "metadata": {},
+   "source": [
+    "Wait for schema creation to complete, then we can insert some data."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "id": "d8eea530",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "UsageError: Cell magic `%%ngql` not found.\n"
+     ]
+    }
+   ],
+   "source": [
+    "%%ngql\n",
+    "INSERT VERTEX person(name, birthdate) VALUES \"Al Pacino\":(\"Al Pacino\", \"1940-04-25\");\n",
+    "INSERT VERTEX movie(name) VALUES \"The Godfather II\":(\"The Godfather II\");\n",
+    "INSERT VERTEX movie(name) VALUES \"The Godfather Coda: The Death of Michael Corleone\":(\"The Godfather Coda: The Death of Michael Corleone\");\n",
+    "INSERT EDGE acted_in() VALUES \"Al Pacino\"->\"The Godfather II\":();\n",
+    "INSERT EDGE acted_in() VALUES \"Al Pacino\"->\"The Godfather Coda: The Death of Michael Corleone\":();"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "id": "62812aad",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.chat_models import ChatOpenAI\n",
+    "from langchain.chains import NebulaGraphQAChain\n",
+    "from langchain.graphs import NebulaGraph"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "id": "0928915d",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "graph = NebulaGraph(\n",
+    "    space=\"langchain\",\n",
+    "    username=\"root\",\n",
+    "    password=\"nebula\",\n",
+    "    address=\"127.0.0.1\",\n",
+    "    port=9669,\n",
+    "    session_pool_size=30,\n",
+    ")"
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "id": "58c1a8ea",
+   "metadata": {},
+   "source": [
+    "## Refresh graph schema information\n",
+    "\n",
+    "If the schema of database changes, you can refresh the schema information needed to generate nGQL statements."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "4e3de44f",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# graph.refresh_schema()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "id": "1fe76ccd",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Node properties: [{'tag': 'movie', 'properties': [('name', 'string')]}, {'tag': 'person', 'properties': [('name', 'string'), ('birthdate', 'string')]}]\n",
+      "Edge properties: [{'edge': 'acted_in', 'properties': []}]\n",
+      "Relationships: ['(:person)-[:acted_in]->(:movie)']\n",
+      "\n"
+     ]
+    }
+   ],
+   "source": [
+    "print(graph.get_schema)"
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "id": "68a3c677",
+   "metadata": {},
+   "source": [
+    "## Querying the graph\n",
+    "\n",
+    "We can now use the graph cypher QA chain to ask question of the graph"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "id": "7476ce98",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "chain = NebulaGraphQAChain.from_llm(\n",
+    "    ChatOpenAI(temperature=0), graph=graph, verbose=True\n",
+    ")\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "id": "ef8ee27b",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "\n",
+      "\n",
+      "\u001b[1m> Entering new NebulaGraphQAChain chain...\u001b[0m\n",
+      "Generated nGQL:\n",
+      "\u001b[32;1m\u001b[1;3mMATCH (p:`person`)-[:acted_in]->(m:`movie`) WHERE m.`movie`.`name` == 'The Godfather II'\n",
+      "RETURN p.`person`.`name`\u001b[0m\n",
+      "Full Context:\n",
+      "\u001b[32;1m\u001b[1;3m{'p.person.name': ['Al Pacino']}\u001b[0m\n",
+      "\n",
+      "\u001b[1m> Finished chain.\u001b[0m\n"
+     ]
+    },
+    {
+     "data": {
+      "text/plain": [
+       "'Al Pacino played in The Godfather II.'"
+      ]
+     },
+     "execution_count": 6,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "chain.run(\"Who played in The Godfather II?\")"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.11.3"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/modules/indexes/document_loaders.rst
+++ b/docs/modules/indexes/document_loaders.rst
@@ -30,6 +30,7 @@ For detailed instructions on how to get set up with Unstructured, see installati
   :maxdepth: 1
   :glob:

+   ./document_loaders/examples/airtable.ipynb
   ./document_loaders/examples/audio.ipynb
   ./document_loaders/examples/conll-u.ipynb
   ./document_loaders/examples/copypaste.ipynb
@@ -37,6 +38,7 @@ For detailed instructions on how to get set up with Unstructured, see installati
   ./document_loaders/examples/email.ipynb
   ./document_loaders/examples/epub.ipynb
   ./document_loaders/examples/evernote.ipynb
+   ./document_loaders/examples/excel.ipynb
   ./document_loaders/examples/facebook_chat.ipynb
   ./document_loaders/examples/file_directory.ipynb
   ./document_loaders/examples/html.ipynb
@@ -115,6 +117,7 @@ We need access tokens and sometime other parameters to get access to these datas
   ./document_loaders/examples/discord_loader.ipynb
   ./document_loaders/examples/docugami.ipynb
   ./document_loaders/examples/duckdb.ipynb
+   ./document_loaders/examples/fauna.ipynb
   ./document_loaders/examples/figma.ipynb
   ./document_loaders/examples/gitbook.ipynb
   ./document_loaders/examples/git.ipynb
@@ -136,6 +139,7 @@ We need access tokens and sometime other parameters to get access to these datas
   ./document_loaders/examples/reddit.ipynb
   ./document_loaders/examples/roam.ipynb
   ./document_loaders/examples/slack.ipynb
+   ./document_loaders/examples/snowflake.ipynb
   ./document_loaders/examples/spreedly.ipynb
   ./document_loaders/examples/stripe.ipynb
   ./document_loaders/examples/tomarkdown.ipynb
--- a/docs/modules/indexes/document_loaders/examples/airtable.ipynb
+++ b/docs/modules/indexes/document_loaders/examples/airtable.ipynb
@@ -0,0 +1,142 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "7ae421e6",
+   "metadata": {},
+   "source": [
+    "# Airtable"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "98aea00d",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "! pip install pyairtable"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 7,
+   "id": "592483eb",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.document_loaders import AirtableLoader"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "637e1205",
+   "metadata": {},
+   "source": [
+    "* Get your API key [here](https://support.airtable.com/docs/creating-and-using-api-keys-and-access-tokens).\n",
+    "* Get ID of your base [here](https://airtable.com/developers/web/api/introduction).\n",
+    "* Get your table ID from the table url as shown [here](https://www.highviewapps.com/kb/where-can-i-find-the-airtable-base-id-and-table-id/#:~:text=Both%20the%20Airtable%20Base%20ID,URL%20that%20begins%20with%20tbl)."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "c12a7aff",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "api_key=\"xxx\"\n",
+    "base_id=\"xxx\"\n",
+    "table_id=\"xxx\""
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "id": "ccddd5a6",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "loader = AirtableLoader(api_key,table_id,base_id)\n",
+    "docs = loader.load()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "ae76c25c",
+   "metadata": {},
+   "source": [
+    "Returns each table row as `dict`."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "id": "7abec7ce",
+   "metadata": {
+    "scrolled": true
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "3"
+      ]
+     },
+     "execution_count": 10,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "len(docs)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 11,
+   "id": "403c95da",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "{'id': 'recF3GbGZCuh9sXIQ',\n",
+       " 'createdTime': '2023-06-09T04:47:21.000Z',\n",
+       " 'fields': {'Priority': 'High',\n",
+       "  'Status': 'In progress',\n",
+       "  'Name': 'Document Splitters'}}"
+      ]
+     },
+     "execution_count": 11,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "eval(docs[0].page_content)"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.9.16"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/modules/indexes/document_loaders/examples/csv.ipynb
+++ b/docs/modules/indexes/document_loaders/examples/csv.ipynb
@@ -29,7 +29,6 @@
   "cell_type": "code",
   "execution_count": 26,
   "metadata": {
-    "collapsed": false,
    "jupyter": {
     "outputs_hidden": false
    }
@@ -45,7 +44,6 @@
   "cell_type": "code",
   "execution_count": 27,
   "metadata": {
-    "collapsed": false,
    "jupyter": {
     "outputs_hidden": false
    }
@@ -76,7 +74,6 @@
   "cell_type": "code",
   "execution_count": 28,
   "metadata": {
-    "collapsed": false,
    "jupyter": {
     "outputs_hidden": false
    }
@@ -96,7 +93,6 @@
   "cell_type": "code",
   "execution_count": 29,
   "metadata": {
-    "collapsed": false,
    "jupyter": {
     "outputs_hidden": false
    }
@@ -152,6 +148,211 @@
   "source": [
    "print(data)"
   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## `UnstructuredCSVLoader`\n",
+    "\n",
+    "You can also load the table using the `UnstructuredCSVLoader`. One advantage of using `UnstructuredCSVLoader` is that if you use it in `\"elements\"` mode, an HTML representation of the table will be available in the metadata."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.document_loaders.csv_loader import UnstructuredCSVLoader"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "loader = UnstructuredCSVLoader(file_path='example_data/mlb_teams_2012.csv', mode=\"elements\")\n",
+    "docs = loader.load()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "<table border=\"1\" class=\"dataframe\">\n",
+      "  <tbody>\n",
+      "    <tr>\n",
+      "      <td>Nationals</td>\n",
+      "      <td>81.34</td>\n",
+      "      <td>98</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Reds</td>\n",
+      "      <td>82.20</td>\n",
+      "      <td>97</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Yankees</td>\n",
+      "      <td>197.96</td>\n",
+      "      <td>95</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Giants</td>\n",
+      "      <td>117.62</td>\n",
+      "      <td>94</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Braves</td>\n",
+      "      <td>83.31</td>\n",
+      "      <td>94</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Athletics</td>\n",
+      "      <td>55.37</td>\n",
+      "      <td>94</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Rangers</td>\n",
+      "      <td>120.51</td>\n",
+      "      <td>93</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Orioles</td>\n",
+      "      <td>81.43</td>\n",
+      "      <td>93</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Rays</td>\n",
+      "      <td>64.17</td>\n",
+      "      <td>90</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Angels</td>\n",
+      "      <td>154.49</td>\n",
+      "      <td>89</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Tigers</td>\n",
+      "      <td>132.30</td>\n",
+      "      <td>88</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Cardinals</td>\n",
+      "      <td>110.30</td>\n",
+      "      <td>88</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Dodgers</td>\n",
+      "      <td>95.14</td>\n",
+      "      <td>86</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>White Sox</td>\n",
+      "      <td>96.92</td>\n",
+      "      <td>85</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Brewers</td>\n",
+      "      <td>97.65</td>\n",
+      "      <td>83</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Phillies</td>\n",
+      "      <td>174.54</td>\n",
+      "      <td>81</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Diamondbacks</td>\n",
+      "      <td>74.28</td>\n",
+      "      <td>81</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Pirates</td>\n",
+      "      <td>63.43</td>\n",
+      "      <td>79</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Padres</td>\n",
+      "      <td>55.24</td>\n",
+      "      <td>76</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Mariners</td>\n",
+      "      <td>81.97</td>\n",
+      "      <td>75</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Mets</td>\n",
+      "      <td>93.35</td>\n",
+      "      <td>74</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Blue Jays</td>\n",
+      "      <td>75.48</td>\n",
+      "      <td>73</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Royals</td>\n",
+      "      <td>60.91</td>\n",
+      "      <td>72</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Marlins</td>\n",
+      "      <td>118.07</td>\n",
+      "      <td>69</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Red Sox</td>\n",
+      "      <td>173.18</td>\n",
+      "      <td>69</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Indians</td>\n",
+      "      <td>78.43</td>\n",
+      "      <td>68</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Twins</td>\n",
+      "      <td>94.08</td>\n",
+      "      <td>66</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Rockies</td>\n",
+      "      <td>78.06</td>\n",
+      "      <td>64</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Cubs</td>\n",
+      "      <td>88.19</td>\n",
+      "      <td>61</td>\n",
+      "    </tr>\n",
+      "    <tr>\n",
+      "      <td>Astros</td>\n",
+      "      <td>60.65</td>\n",
+      "      <td>55</td>\n",
+      "    </tr>\n",
+      "  </tbody>\n",
+      "</table>\n"
+     ]
+    }
+   ],
+   "source": [
+    "print(docs[0].metadata[\"text_as_html\"])"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": []
  }
 ],
 "metadata": {
@@ -170,7 +371,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.10.6"
+   "version": "3.8.13"
  }
 },
 "nbformat": 4,
--- a/docs/modules/indexes/document_loaders/examples/example_data/factbook.xml
+++ b/docs/modules/indexes/document_loaders/examples/example_data/factbook.xml
@@ -0,0 +1,27 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<factbook>
+  <country>
+    <name>United States</name>
+    <capital>Washington, DC</capital>
+    <leader>Joe Biden</leader>
+    <sport>Baseball</sport>
+  </country>
+  <country>
+    <name>Canada</name>
+    <capital>Ottawa</capital>
+    <leader>Justin Trudeau</leader>
+    <sport>Hockey</sport>
+  </country>
+  <country>
+    <name>France</name>
+    <capital>Paris</capital>
+    <leader>Emmanuel Macron</leader>
+    <sport>Soccer</sport>
+  </country>
+  <country>
+    <name>Trinidad &amp; Tobado</name>
+    <capital>Port of Spain</capital>
+    <leader>Keith Rowley</leader>
+    <sport>Track &amp; Field</sport>
+  </country>
+</factbook>
--- a/docs/modules/indexes/document_loaders/examples/fauna.ipynb
+++ b/docs/modules/indexes/document_loaders/examples/fauna.ipynb
@@ -0,0 +1,84 @@
+{
+ "cells": [
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Fauna\n",
+    "\n",
+    ">[Fauna](https://fauna.com/) is a Document Database.\n",
+    "\n",
+    "Query `Fauna` documents"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "#!pip install fauna"
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Query data example"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.document_loaders.fauna import FaunaLoader\n",
+    "\n",
+    "secret = \"<enter-valid-fauna-secret>\"\n",
+    "query = \"Item.all()\" # Fauna query. Assumes that the collection is called \"Item\"\n",
+    "field = \"text\" # The field that contains the page content. Assumes that the field is called \"text\"\n",
+    "\n",
+    "loader = FaunaLoader(query, field, secret)\n",
+    "docs = loader.lazy_load()\n",
+    "\n",
+    "for value in docs:\n",
+    "    print(value)"
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Query with Pagination\n",
+    "You get a `after` value if there are more data. You can get values after the curcor by passing in the `after` string in query. \n",
+    "\n",
+    "To learn more following [this link](https://fqlx-beta--fauna-docs.netlify.app/fqlx/beta/reference/schema_entities/set/static-paginate)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "query = \"\"\"\n",
+    "Item.paginate(\"hs+DzoPOg ... aY1hOohozrV7A\")\n",
+    "Item.all()\n",
+    "\"\"\"\n",
+    "loader = FaunaLoader(query, field, secret)"
+   ]
+  }
+ ],
+ "metadata": {
+  "language_info": {
+   "name": "python"
+  },
+  "orig_nbformat": 4
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}
--- a/docs/modules/indexes/document_loaders/examples/microsoft_word.ipynb
+++ b/docs/modules/indexes/document_loaders/examples/microsoft_word.ipynb
@@ -22,6 +22,16 @@
    "Load .docx using `Docx2txt` into a document."
   ]
  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "id": "7b80ea891",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "!pip install docx2txt "
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": 3,
--- a/docs/modules/indexes/document_loaders/examples/sitemap.ipynb
+++ b/docs/modules/indexes/document_loaders/examples/sitemap.ipynb
@@ -146,6 +146,73 @@
    "documents[0]"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Add custom scraping rules\n",
+    "\n",
+    "The `SitemapLoader` uses `beautifulsoup4` for the scraping process, and it scrapes every element on the page by default. The `SitemapLoader` constructor accepts a custom scraping function. This feature can be helpful to tailor the scraping process to your specific needs; for example, you might want to avoid scraping headers or navigation elements.\n",
+    "\n",
+    " The following example shows how to develop and use a custom function to avoid navigation and header elements."
+   ]
+  },
+    {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Import the `beautifulsoup4` library and define the custom function."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "pip install beautifulsoup4"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from bs4 import BeautifulSoup\n",
+    "\n",
+    "def remove_nav_and_header_elements(content: BeautifulSoup) -> str:\n",
+    "    # Find all 'nav' and 'header' elements in the BeautifulSoup object\n",
+    "    nav_elements = content.find_all('nav')\n",
+    "    header_elements = content.find_all('header')\n",
+    "\n",
+    "    # Remove each 'nav' and 'header' element from the BeautifulSoup object\n",
+    "    for element in nav_elements + header_elements:\n",
+    "        element.decompose()\n",
+    "\n",
+    "    return str(content.get_text())"
+   ]
+},    
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Add your custom function to the `SitemapLoader` object."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "loader = SitemapLoader(\n",
+    "    \"https://langchain.readthedocs.io/sitemap.xml\",\n",
+    "    filter_urls=[\"https://python.langchain.com/en/latest/\"],\n",
+    "    parsing_function=remove_nav_and_header_elements\n",
+    ")"
+   ]
+},
  {
   "cell_type": "markdown",
   "metadata": {},
--- a/docs/modules/indexes/document_loaders/examples/snowflake.ipynb
+++ b/docs/modules/indexes/document_loaders/examples/snowflake.ipynb
@@ -0,0 +1,98 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Snowflake\n",
+    "\n",
+    "This notebooks goes over how to load documents from Snowflake"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "! pip install snowflake-connector-python"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import settings as s\n",
+    "from langchain.document_loaders import SnowflakeLoader"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "QUERY = \"select text, survey_id from CLOUD_DATA_SOLUTIONS.HAPPY_OR_NOT.OPEN_FEEDBACK limit 10\"\n",
+    "snowflake_loader = SnowflakeLoader(\n",
+    "    query=QUERY,\n",
+    "    user=s.SNOWFLAKE_USER,\n",
+    "    password=s.SNOWFLAKE_PASS,\n",
+    "    account=s.SNOWFLAKE_ACCOUNT,\n",
+    "    warehouse=s.SNOWFLAKE_WAREHOUSE,\n",
+    "    role=s.SNOWFLAKE_ROLE,\n",
+    "    database=s.SNOWFLAKE_DATABASE,\n",
+    "    schema=s.SNOWFLAKE_SCHEMA\n",
+    ")\n",
+    "snowflake_documents = snowflake_loader.load()\n",
+    "print(snowflake_documents)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from snowflakeLoader import SnowflakeLoader\n",
+    "import settings as s\n",
+    "QUERY = \"select text, survey_id as source from CLOUD_DATA_SOLUTIONS.HAPPY_OR_NOT.OPEN_FEEDBACK limit 10\"\n",
+    "snowflake_loader = SnowflakeLoader(\n",
+    "    query=QUERY,\n",
+    "    user=s.SNOWFLAKE_USER,\n",
+    "    password=s.SNOWFLAKE_PASS,\n",
+    "    account=s.SNOWFLAKE_ACCOUNT,\n",
+    "    warehouse=s.SNOWFLAKE_WAREHOUSE,\n",
+    "    role=s.SNOWFLAKE_ROLE,\n",
+    "    database=s.SNOWFLAKE_DATABASE,\n",
+    "    schema=s.SNOWFLAKE_SCHEMA,\n",
+    "    metadata_columns=['source']\n",
+    ")\n",
+    "snowflake_documents = snowflake_loader.load()\n",
+    "print(snowflake_documents)"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.9.1"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}
--- a/docs/modules/indexes/document_loaders/examples/xml.ipynb
+++ b/docs/modules/indexes/document_loaders/examples/xml.ipynb
@@ -0,0 +1,78 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "22a849cc",
+   "metadata": {},
+   "source": [
+    "# XML\n",
+    "\n",
+    "The `UnstructuredXMLLoader` is used to load `XML` files. The loader works with `.xml` files. The page content will be the text extracted from the XML tags."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "id": "e6616e3a",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.document_loaders import UnstructuredXMLLoader"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "id": "a654e4d9",
+   "metadata": {},
+   "outputs": [
+    {
+      "data": {
+        "text/plain": [
+          "Document(page_content='United States\\n\\nWashington, DC\\n\\nJoe Biden\\n\\nBaseball\\n\\nCanada\\n\\nOttawa\\n\\nJustin Trudeau\\n\\nHockey\\n\\nFrance\\n\\nParis\\n\\nEmmanuel Macron\\n\\nSoccer\\n\\nTrinidad & Tobado\\n\\nPort of Spain\\n\\nKeith Rowley\\n\\nTrack & Field', metadata={'source': 'example_data/factbook.xml'})"
+        ]
+      },
+      "execution_count": 2,
+      "metadata": {},
+      "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "loader = UnstructuredXMLLoader(\n",
+    "    \"example_data/factbook.xml\",\n",
+    ")\n",
+    "docs = loader.load()\n",
+    "docs[0]"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "a54342bb",
+   "metadata": {},
+   "outputs": [],
+   "source": []
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.8.15"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/modules/indexes/document_loaders/examples/youtube_audio.ipynb
+++ b/docs/modules/indexes/document_loaders/examples/youtube_audio.ipynb
@@ -0,0 +1,296 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "e48afb8d",
+   "metadata": {},
+   "source": [
+    "# Loading documents from a YouTube url\n",
+    "\n",
+    "Building chat or QA applications on YouTube videos is a topic of high interest.\n",
+    "\n",
+    "Below we show how to easily go from a YouTube url to text to chat!\n",
+    "\n",
+    "We wil use the `OpenAIWhisperParser`, which will use the OpenAI Whisper API to transcribe audio to text.\n",
+    "\n",
+    "Note: You will need to have an `OPENAI_API_KEY` supplied."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "id": "5f34e934",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.document_loaders.generic import GenericLoader\n",
+    "from langchain.document_loaders.parsers import OpenAIWhisperParser\n",
+    "from langchain.document_loaders.blob_loaders.youtube_audio import YoutubeAudioLoader"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "85fc12bd",
+   "metadata": {},
+   "source": [
+    "We will use `yt_dlp` to download audio for YouTube urls.\n",
+    "\n",
+    "We will use `pydub` to split downloaded audio files (such that we adhere to Whisper API's 25MB file size limit)."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "fb5a6606",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "! pip install yt_dlp\n",
+    "! pip install pydub"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "b0e119f4",
+   "metadata": {},
+   "source": [
+    "### YouTube url to text\n",
+    "\n",
+    "Use `YoutubeAudioLoader` to fetch / download the audio files.\n",
+    "\n",
+    "Then, ues `OpenAIWhisperParser()` to transcribe them to text.\n",
+    "\n",
+    "Let's take the first lecture of Andrej Karpathy's YouTube course as an example! "
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "id": "23e1e134",
+   "metadata": {
+    "scrolled": false
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "[youtube] Extracting URL: https://youtu.be/kCc8FmEb1nY\n",
+      "[youtube] kCc8FmEb1nY: Downloading webpage\n",
+      "[youtube] kCc8FmEb1nY: Downloading android player API JSON\n",
+      "[info] kCc8FmEb1nY: Downloading 1 format(s): 140\n",
+      "[dashsegments] Total fragments: 11\n",
+      "[download] Destination: /Users/31treehaus/Desktop/AI/langchain-fork/docs/modules/indexes/document_loaders/examples/Let's build GPT： from scratch, in code, spelled out..m4a\n",
+      "[download] 100% of  107.73MiB in 00:00:18 at 5.92MiB/s                   \n",
+      "[FixupM4a] Correcting container of \"/Users/31treehaus/Desktop/AI/langchain-fork/docs/modules/indexes/document_loaders/examples/Let's build GPT： from scratch, in code, spelled out..m4a\"\n",
+      "[ExtractAudio] Not converting audio /Users/31treehaus/Desktop/AI/langchain-fork/docs/modules/indexes/document_loaders/examples/Let's build GPT： from scratch, in code, spelled out..m4a; file is already in target format m4a\n",
+      "[youtube] Extracting URL: https://youtu.be/VMj-3S1tku0\n",
+      "[youtube] VMj-3S1tku0: Downloading webpage\n",
+      "[youtube] VMj-3S1tku0: Downloading android player API JSON\n",
+      "[info] VMj-3S1tku0: Downloading 1 format(s): 140\n",
+      "[download] /Users/31treehaus/Desktop/AI/langchain-fork/docs/modules/indexes/document_loaders/examples/The spelled-out intro to neural networks and backpropagation： building micrograd.m4a has already been downloaded\n",
+      "[download] 100% of  134.98MiB\n",
+      "[ExtractAudio] Not converting audio /Users/31treehaus/Desktop/AI/langchain-fork/docs/modules/indexes/document_loaders/examples/The spelled-out intro to neural networks and backpropagation： building micrograd.m4a; file is already in target format m4a\n"
+     ]
+    }
+   ],
+   "source": [
+    "# Two Karpathy lecture videos\n",
+    "urls = [\"https://youtu.be/kCc8FmEb1nY\",\n",
+    "        \"https://youtu.be/VMj-3S1tku0\"]\n",
+    "\n",
+    "# Directory to save audio files \n",
+    "save_dir = \"~/Downloads/YouTube\"\n",
+    "\n",
+    "# Transcribe the videos to text\n",
+    "loader = GenericLoader(YoutubeAudioLoader(urls,save_dir),OpenAIWhisperParser())\n",
+    "docs = loader.load()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "id": "72a94fd8",
+   "metadata": {
+    "scrolled": false
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "\"Hello, my name is Andrej and I've been training deep neural networks for a bit more than a decade. And in this lecture I'd like to show you what neural network training looks like under the hood. So in particular we are going to start with a blank Jupyter notebook and by the end of this lecture we will define and train a neural net and you'll get to see everything that goes on under the hood and exactly sort of how that works on an intuitive level. Now specifically what I would like to do is I w\""
+      ]
+     },
+     "execution_count": 3,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "# Returns a list of Documents, which can be easily viewed or parsed\n",
+    "docs[0].page_content[0:500]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "93be6b49",
+   "metadata": {},
+   "source": [
+    "### Building a chat app from YouTube video\n",
+    "\n",
+    "Given `Documents`, we can easily enable chat / question+answering."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "id": "1823f042",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.chains import RetrievalQA\n",
+    "from langchain.vectorstores import FAISS\n",
+    "from langchain.chat_models import ChatOpenAI\n",
+    "from langchain.embeddings import OpenAIEmbeddings\n",
+    "from langchain.text_splitter import RecursiveCharacterTextSplitter"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "id": "7257cda1",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Combine doc\n",
+    "combined_docs = [doc.page_content for doc in docs]\n",
+    "text = \" \".join(combined_docs)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "id": "147c0c55",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Split them\n",
+    "text_splitter = RecursiveCharacterTextSplitter(chunk_size = 1500, chunk_overlap = 150)\n",
+    "splits = text_splitter.split_text(text)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 7,
+   "id": "f3556703",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Build an index\n",
+    "embeddings = OpenAIEmbeddings()\n",
+    "vectordb = FAISS.from_texts(splits,embeddings)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 8,
+   "id": "beaa99db",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Build a QA chain\n",
+    "qa_chain = RetrievalQA.from_chain_type(llm = ChatOpenAI(model_name=\"gpt-3.5-turbo\", temperature=0),\n",
+    "                                       chain_type=\"stuff\",\n",
+    "                                       retriever=vectordb.as_retriever())"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "id": "f2239a62",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "\"We need to zero out the gradient before backprop at each step because the backward pass accumulates gradients in the grad attribute of each parameter. If we don't reset the grad to zero before each backward pass, the gradients will accumulate and add up, leading to incorrect updates and slower convergence. By resetting the grad to zero before each backward pass, we ensure that the gradients are calculated correctly and that the optimization process works as intended.\""
+      ]
+     },
+     "execution_count": 9,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "# Ask a question!\n",
+    "query = \"Why do we need to zero out the gradient before backprop at each step?\"\n",
+    "qa_chain.run(query)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "id": "a8d01098",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "'In the context of transformers, an encoder is a component that reads in a sequence of input tokens and generates a sequence of hidden representations. On the other hand, a decoder is a component that takes in a sequence of hidden representations and generates a sequence of output tokens. The main difference between the two is that the encoder is used to encode the input sequence into a fixed-length representation, while the decoder is used to decode the fixed-length representation into an output sequence. In machine translation, for example, the encoder reads in the source language sentence and generates a fixed-length representation, which is then used by the decoder to generate the target language sentence.'"
+      ]
+     },
+     "execution_count": 10,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "query = \"What is the difference between an encoder and decoder?\"\n",
+    "qa_chain.run(query)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 11,
+   "id": "fe1e77dd",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "'For any token, x is the input vector that contains the private information of that token, k and q are the key and query vectors respectively, which are produced by forwarding linear modules on x, and v is the vector that is calculated by propagating the same linear module on x again. The key vector represents what the token contains, and the query vector represents what the token is looking for. The vector v is the information that the token will communicate to other tokens if it finds them interesting, and it gets aggregated for the purposes of the self-attention mechanism.'"
+      ]
+     },
+     "execution_count": 11,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "query = \"For any token, what are x, k, v, and q?\"\n",
+    "qa_chain.run(query)"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.9.16"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/modules/indexes/retrievers/examples/aws_kendra_index_retriever.ipynb
+++ b/docs/modules/indexes/retrievers/examples/aws_kendra_index_retriever.ipynb
@@ -0,0 +1,90 @@
+{
+ "cells": [
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# AWS Kendra\n",
+    "\n",
+    "> AWS Kendra is an intelligent search service provided by Amazon Web Services (AWS). It utilizes advanced natural language processing (NLP) and machine learning algorithms to enable powerful search capabilities across various data sources within an organization. Kendra is designed to help users find the information they need quickly and accurately, improving productivity and decision-making.\n",
+    "\n",
+    "> With Kendra, users can search across a wide range of content types, including documents, FAQs, knowledge bases, manuals, and websites. It supports multiple languages and can understand complex queries, synonyms, and contextual meanings to provide highly relevant search results."
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Using the AWS Kendra Index Retriever"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "#!pip install boto3"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import boto3\n",
+    "from langchain.retrievers import AwsKendraIndexRetriever"
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Create New Retriever"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "kclient = boto3.client('kendra', region_name=\"us-east-1\")\n",
+    "\n",
+    "retriever = AwsKendraIndexRetriever(\n",
+    "    kclient=kclient,\n",
+    "    kendraindex=\"kendraindex\",\n",
+    ")"
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Now you can use retrieved documents from AWS Kendra Index"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "retriever.get_relevant_documents(\"what is langchain\")"
+   ]
+  }
+ ],
+ "metadata": {
+  "language_info": {
+   "name": "python"
+  },
+  "orig_nbformat": 4
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}
--- a/docs/modules/indexes/retrievers/examples/merger_retriever.ipynb
+++ b/docs/modules/indexes/retrievers/examples/merger_retriever.ipynb
@@ -0,0 +1,121 @@
+{
+ "cells": [
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "id": "fc0db1bc",
+   "metadata": {},
+   "source": [
+    "# LOTR (Merger Retriever)\n",
+    "\n",
+    "Lord of the Retrievers, also known as MergerRetriever, takes a list of retrievers as input and merges the results of their get_relevant_documents() methods into a single list. The merged results will be a list of documents that are relevant to the query and that have been ranked by the different retrievers.\n",
+    "\n",
+    "The MergerRetriever class can be used to improve the accuracy of document retrieval in a number of ways. First, it can combine the results of multiple retrievers, which can help to reduce the risk of bias in the results. Second, it can rank the results of the different retrievers, which can help to ensure that the most relevant documents are returned first."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "9fbcc58f",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import os\n",
+    "import chromadb\n",
+    "from langchain.retrievers.merger_retriever import MergerRetriever\n",
+    "from langchain.vectorstores import Chroma\n",
+    "from langchain.embeddings import HuggingFaceEmbeddings\n",
+    "from langchain.embeddings import OpenAIEmbeddings\n",
+    "from langchain.document_transformers import EmbeddingsRedundantFilter\n",
+    "from langchain.retrievers.document_compressors import DocumentCompressorPipeline\n",
+    "from langchain.retrievers import ContextualCompressionRetriever\n",
+    "\n",
+    "# Get 3 diff embeddings.\n",
+    "all_mini = HuggingFaceEmbeddings(model_name=\"all-MiniLM-L6-v2\")\n",
+    "multi_qa_mini = HuggingFaceEmbeddings(model_name=\"multi-qa-MiniLM-L6-dot-v1\")\n",
+    "filter_embeddings = OpenAIEmbeddings()\n",
+    "\n",
+    "ABS_PATH = os.path.dirname(os.path.abspath(__file__))\n",
+    "DB_DIR = os.path.join(ABS_PATH, \"db\")\n",
+    "\n",
+    "# Instantiate 2 diff cromadb indexs, each one with a diff embedding.\n",
+    "client_settings = chromadb.config.Settings(\n",
+    "    chroma_db_impl=\"duckdb+parquet\",\n",
+    "    persist_directory=DB_DIR,\n",
+    "    anonymized_telemetry=False,\n",
+    ")\n",
+    "db_all = Chroma(\n",
+    "    collection_name=\"project_store_all\",\n",
+    "    persist_directory=DB_DIR,\n",
+    "    client_settings=client_settings,\n",
+    "    embedding_function=all_mini,\n",
+    ")\n",
+    "db_multi_qa = Chroma(\n",
+    "    collection_name=\"project_store_multi\",\n",
+    "    persist_directory=DB_DIR,\n",
+    "    client_settings=client_settings,\n",
+    "    embedding_function=multi_qa_mini,\n",
+    ")\n",
+    "\n",
+    "# Define 2 diff retrievers with 2 diff embeddings and diff search type.\n",
+    "retriever_all = db_all.as_retriever(\n",
+    "    search_type=\"similarity\", search_kwargs={\"k\": 5, \"include_metadata\": True}\n",
+    ")\n",
+    "retriever_multi_qa = db_multi_qa.as_retriever(\n",
+    "    search_type=\"mmr\", search_kwargs={\"k\": 5, \"include_metadata\": True}\n",
+    ")\n",
+    "\n",
+    "# The Lord of the Retrievers will hold the ouput of boths retrievers and can be used as any other \n",
+    "# retriever on different types of chains.\n",
+    "lotr = MergerRetriever(retrievers=[retriever_all, retriever_multi_qa])\n"
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "id": "c152339d",
+   "metadata": {},
+   "source": [
+    "## Remove redundant results from the merged retrievers."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "039faea6",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "\n",
+    "# We can remove redundant results from both retrievers using yet another embedding. \n",
+    "# Using multiples embeddings in diff steps could help reduce biases.\n",
+    "filter = EmbeddingsRedundantFilter(embeddings=filter_embeddings)\n",
+    "pipeline = DocumentCompressorPipeline(transformers=[filter])\n",
+    "compression_retriever = ContextualCompressionRetriever(\n",
+    "    base_compressor=pipeline, base_retriever=lotr\n",
+    ")"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.10.6"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/modules/indexes/retrievers/examples/vectorstore.ipynb
+++ b/docs/modules/indexes/retrievers/examples/vectorstore.ipynb
@@ -99,13 +99,14 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "2d958271",
   "metadata": {},
   "source": [
    "## Similarity Score Threshold Retrieval\n",
    "\n",
-    "You can also a retrieval method that sets a similarity score threshold and only returns documents with a score above that threshold"
+    "You can also use a retrieval method that sets a similarity score threshold and only returns documents with a score above that threshold"
   ]
  },
  {
--- a/docs/modules/indexes/text_splitters/getting_started.ipynb
+++ b/docs/modules/indexes/text_splitters/getting_started.ipynb
@@ -12,7 +12,8 @@
    "\n",
    "- `length_function`: how the length of chunks is calculated. Defaults to just counting number of characters, but it's pretty common to pass a token counter here.\n",
    "- `chunk_size`: the maximum size of your chunks (as measured by the length function).\n",
-    "- `chunk_overlap`: the maximum overlap between chunks. It can be nice to have some overlap to maintain some continuity between chunks (eg do a sliding window)."
+    "- `chunk_overlap`: the maximum overlap between chunks. It can be nice to have some overlap to maintain some continuity between chunks (eg do a sliding window).\n",
+    "- `add_start_index` : wether to include the starting position of each chunk within the original document in the metadata. "
   ]
  },
  {
@@ -49,6 +50,7 @@
    "    chunk_size = 100,\n",
    "    chunk_overlap  = 20,\n",
    "    length_function = len,\n",
+    "    add_start_index = True,\n",
    ")"
   ]
  },
@@ -62,8 +64,8 @@
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      "page_content='Madam Speaker, Madam Vice President, our First Lady and Second Gentleman. Members of Congress and' lookup_str='' metadata={} lookup_index=0\n",
-      "page_content='of Congress and the Cabinet. Justices of the Supreme Court. My fellow Americans.' lookup_str='' metadata={} lookup_index=0\n"
+      "page_content='Madam Speaker, Madam Vice President, our First Lady and Second Gentleman. Members of Congress and' metadata={'start_index': 0}\n",
+      "page_content='of Congress and the Cabinet. Justices of the Supreme Court. My fellow Americans.' metadata={'start_index': 82}\n"
     ]
    }
   ],
@@ -90,7 +92,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.9.16"
  },
  "vscode": {
   "interpreter": {
--- a/docs/modules/indexes/vectorstores/examples/awadb.ipynb
+++ b/docs/modules/indexes/vectorstores/examples/awadb.ipynb
@@ -0,0 +1,194 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "833c4789",
+   "metadata": {},
+   "source": [
+    "# AwaDB\n",
+    "[AwaDB](https://github.com/awa-ai/awadb) is an AI Native database for the search and storage of embedding vectors used by LLM Applications.\n",
+    "This notebook shows how to use functionality related to the AwaDB."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "252930ea",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "!pip install awadb"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "f2b71a47",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.text_splitter import CharacterTextSplitter\n",
+    "from langchain.vectorstores import AwaDB\n",
+    "from langchain.document_loaders import TextLoader"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "49be0bac",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "loader = TextLoader('../../../state_of_the_union.txt')\n",
+    "documents = loader.load()\n",
+    "text_splitter = CharacterTextSplitter(chunk_size= 100, chunk_overlap=0)\n",
+    "docs = text_splitter.split_documents(documents)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "18714278",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "db = AwaDB.from_documents(docs)\n",
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "docs = db.similarity_search(query)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "62b7a4c5",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "print(docs[0].page_content)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "a9b4be48",
+   "metadata": {},
+   "source": [
+    "And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "87fec6b5",
+   "metadata": {},
+   "source": [
+    "## Similarity search with score"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "17231924",
+   "metadata": {},
+   "source": [
+    "The returned distance score is between 0-1. 0 is dissimilar, 1 is the most similar"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "f40ddae1",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "docs = db.similarity_search_with_score(query)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "f0045583",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "print(docs[0])"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "8c2da99d",
+   "metadata": {},
+   "source": [
+    "(Document(page_content='And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.', metadata={'source': '../../../state_of_the_union.txt'}), 0.561813814013747)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "0b49fb59",
+   "metadata": {},
+   "source": [
+    "## Restore the table created and added data before"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "1bfa6e25",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "AwaDB automatically persists added document data"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "2a0f3b35",
+   "metadata": {},
+   "source": [
+    "If you can restore the table you created and added before, you can just do this as below:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "1fd4b5b0",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "awadb_client = awadb.Client()\n",
+    "ret = awadb_client.Load('langchain_awadb')\n",
+    "if ret : print('awadb load table success')\n",
+    "else:\n",
+    "    print('awadb load table failed')"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "5ae9a9dd",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "awadb load table success"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.11.1"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/modules/indexes/vectorstores/examples/mongodb_atlas_vector_search.ipynb
+++ b/docs/modules/indexes/vectorstores/examples/mongodb_atlas_vector_search.ipynb
@@ -5,7 +5,8 @@
   "id": "683953b3",
   "metadata": {},
   "source": [
-    "# MongoDB Atlas Vector Search\n",
+    "#### Commented out until further notice\n",
+    "MongoDB Atlas Vector Search\n",
    "\n",
    ">[MongoDB Atlas](https://www.mongodb.com/docs/atlas/) is a document database managed in the cloud. It also enables Lucene and its vector search feature.\n",
    "\n",
@@ -43,7 +44,7 @@
  },
  {
   "cell_type": "markdown",
-   "id": "320af802-9271-46ee-948f-d2453933d44b",
+   "id": "457ace44-1d95-4001-9dd5-78811ab208ad",
   "metadata": {},
   "source": [
    "We want to use `OpenAIEmbeddings` so we have to get the OpenAI API Key. Make sure the environment variable `OPENAI_API_KEY` is set up before proceeding."
@@ -143,6 +144,47 @@
   "source": [
    "print(docs[0].page_content)"
   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "851a2ec9-9390-49a4-8412-3e132c9f789d",
+   "metadata": {},
+   "source": [
+    "You can reuse vector index you created before, make sure environment variable `OPENAI_API_KEY` is set up, then create another file."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "6336fe79-3e73-48be-b20a-0ff1bb6a4399",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from pymongo import MongoClient\n",
+    "from langchain.vectorstores import MongoDBAtlasVectorSearch\n",
+    "from langchain.embeddings.openai import OpenAIEmbeddings\n",
+    "import os\n",
+    "\n",
+    "MONGODB_ATLAS_URI = os.environ['MONGODB_ATLAS_URI']\n",
+    "\n",
+    "# initialize MongoDB python client\n",
+    "client = MongoClient(MONGODB_ATLAS_URI)\n",
+    "\n",
+    "db_name = \"langchain_db\"\n",
+    "collection_name = \"langchain_col\"\n",
+    "collection = client[db_name][collection_name]\n",
+    "index_name = \"langchain_index\"\n",
+    "\n",
+    "# initialize vector store\n",
+    "vectorStore = MongoDBAtlasVectorSearch(\n",
+    "    collection, OpenAIEmbeddings(), index_name=index_name)\n",
+    "\n",
+    "# perform a similarity search between the embedding of the query and the embeddings of the documents\n",
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "docs = vectorStore.similarity_search(query)\n",
+    "\n",
+    "print(docs[0].page_content)"
+   ]
  }
 ],
 "metadata": {
--- a/docs/modules/indexes/vectorstores/examples/singlestoredb.ipynb
+++ b/docs/modules/indexes/vectorstores/examples/singlestoredb.ipynb
@@ -0,0 +1,139 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "2b9582dc",
+   "metadata": {},
+   "source": [
+    "# SingleStoreDB vector search\n",
+    "[SingleStore DB](https://singlestore.com) is a high-performance distributed database that supports deployment both in the [cloud](https://www.singlestore.com/cloud/) and on-premises. For a significant duration, it has provided support for vector functions such as [dot_product](https://docs.singlestore.com/managed-service/en/reference/sql-reference/vector-functions/dot_product.html), thereby positioning itself as an ideal solution for AI applications that require text similarity matching. \n",
+    "This tutorial illustrates how to utilize the features of the SingleStore DB Vector Store."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "e4a61a4d",
+   "metadata": {
+    "scrolled": true
+   },
+   "outputs": [],
+   "source": [
+    "# Establishing a connection to the database is facilitated through the singlestoredb Python connector.\n",
+    "# Please ensure that this connector is installed in your working environment.\n",
+    "!pip install singlestoredb"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "39a0132a",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import os\n",
+    "import getpass\n",
+    "\n",
+    "# We want to use OpenAIEmbeddings so we have to get the OpenAI API Key.\n",
+    "os.environ['OPENAI_API_KEY'] = getpass.getpass('OpenAI API Key:')\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "6104fde8",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.embeddings.openai import OpenAIEmbeddings\n",
+    "from langchain.text_splitter import CharacterTextSplitter\n",
+    "from langchain.vectorstores import SingleStoreDB\n",
+    "from langchain.document_loaders import TextLoader"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "7b45113c",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Load text samples \n",
+    "from langchain.document_loaders import TextLoader\n",
+    "loader = TextLoader('../../../state_of_the_union.txt')\n",
+    "documents = loader.load()\n",
+    "text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0)\n",
+    "docs = text_splitter.split_documents(documents)\n",
+    "\n",
+    "embeddings = OpenAIEmbeddings()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "535b2687",
+   "metadata": {},
+   "source": [
+    "There are several ways to establish a [connection](https://singlestoredb-python.labs.singlestore.com/generated/singlestoredb.connect.html) to the database. You can either set up environment variables or pass named parameters to the `SingleStoreDB constructor`. Alternatively, you may provide these parameters to the `from_documents` and `from_texts` methods."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "d0b316bf",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Setup connection url as environment variable\n",
+    "os.environ['SINGLESTOREDB_URL'] = 'root:pass@localhost:3306/db'\n",
+    "\n",
+    "# Load documents to the store\n",
+    "docsearch = SingleStoreDB.from_documents(\n",
+    "    docs,\n",
+    "    embeddings,\n",
+    "    table_name = \"noteook\", # use table with a custom name \n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "0eaa4297",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "docs = docsearch.similarity_search(query) # Find documents that correspond to the query\n",
+    "print(docs[0].page_content)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "86efff90",
+   "metadata": {},
+   "outputs": [],
+   "source": []
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.9.2"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/modules/indexes/vectorstores/examples/vectara.ipynb
+++ b/docs/modules/indexes/vectorstores/examples/vectara.ipynb
@@ -101,7 +101,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 5,
+   "execution_count": 4,
   "id": "8429667e",
   "metadata": {
    "ExecuteTime": {
@@ -133,7 +133,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 6,
+   "execution_count": 5,
   "id": "a8c513ab",
   "metadata": {
    "ExecuteTime": {
@@ -145,12 +145,12 @@
   "outputs": [],
   "source": [
    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
-    "found_docs = vectara.similarity_search(query)"
+    "found_docs = vectara.similarity_search(query, n_sentence_context=0)"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 7,
+   "execution_count": 6,
   "id": "fc516993",
   "metadata": {
    "ExecuteTime": {
@@ -164,7 +164,13 @@
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      "Tonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. One of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence. A former top litigator in private practice. A former federal public defender.\n"
+      "Tonight. I call on the Senate to: Pass the Freedom to Vote Act. Pass the John Lewis Voting Rights Act. And while you’re at it, pass the Disclose Act so Americans can know who is funding our elections. \n",
+      "\n",
+      "Tonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. \n",
+      "\n",
+      "One of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. \n",
+      "\n",
+      "And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.\n"
     ]
    }
   ],
@@ -185,7 +191,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 8,
+   "execution_count": 7,
   "id": "8804a21d",
   "metadata": {
    "ExecuteTime": {
@@ -201,7 +207,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 9,
+   "execution_count": 8,
   "id": "756a6887",
   "metadata": {
    "ExecuteTime": {
@@ -214,9 +220,15 @@
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      "Tonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. One of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence. A former top litigator in private practice. A former federal public defender.\n",
+      "Tonight. I call on the Senate to: Pass the Freedom to Vote Act. Pass the John Lewis Voting Rights Act. And while you’re at it, pass the Disclose Act so Americans can know who is funding our elections. \n",
      "\n",
-      "Score: 1.0046461\n"
+      "Tonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. \n",
+      "\n",
+      "One of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. \n",
+      "\n",
+      "And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.\n",
+      "\n",
+      "Score: 0.7129974\n"
     ]
    }
   ],
@@ -239,7 +251,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 11,
+   "execution_count": 9,
   "id": "9427195f",
   "metadata": {
    "ExecuteTime": {
@@ -251,10 +263,10 @@
    {
     "data": {
      "text/plain": [
-       "VectorStoreRetriever(vectorstore=<langchain.vectorstores.vectara.Vectara object at 0x156d3e830>, search_type='similarity', search_kwargs={})"
+       "VectaraRetriever(vectorstore=<langchain.vectorstores.vectara.Vectara object at 0x122db2830>, search_type='similarity', search_kwargs={'lambda_val': 0.025, 'k': 5, 'filter': '', 'n_sentence_context': '0'})"
      ]
     },
-     "execution_count": 11,
+     "execution_count": 9,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -266,7 +278,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 15,
+   "execution_count": 10,
   "id": "f3c70c31",
   "metadata": {
    "ExecuteTime": {
@@ -278,10 +290,10 @@
    {
     "data": {
      "text/plain": [
-       "Document(page_content='Tonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. One of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence. A former top litigator in private practice. A former federal public defender.', metadata={'source': '../../modules/state_of_the_union.txt'})"
+       "Document(page_content='Tonight. I call on the Senate to: Pass the Freedom to Vote Act. Pass the John Lewis Voting Rights Act. And while you’re at it, pass the Disclose Act so Americans can know who is funding our elections. \\n\\nTonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. \\n\\nOne of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. \\n\\nAnd I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.', metadata={'source': '../../../state_of_the_union.txt'})"
      ]
     },
-     "execution_count": 15,
+     "execution_count": 10,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -316,7 +328,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.11.3"
+   "version": "3.10.9"
  }
 },
 "nbformat": 4,
--- a/docs/modules/indexes/vectorstores/examples/weaviate.ipynb
+++ b/docs/modules/indexes/vectorstores/examples/weaviate.ipynb
@@ -209,7 +209,6 @@
   ]
  },
  {
-   "attachments": {},
   "cell_type": "markdown",
   "id": "8fc3487b",
   "metadata": {},
@@ -218,7 +217,6 @@
   ]
  },
  {
-   "attachments": {},
   "cell_type": "markdown",
   "id": "281c0fcc",
   "metadata": {},
@@ -236,7 +234,6 @@
   ]
  },
  {
-   "attachments": {},
   "cell_type": "markdown",
   "id": "503e2e75",
   "metadata": {},
@@ -273,7 +270,6 @@
   ]
  },
  {
-   "attachments": {},
   "cell_type": "markdown",
   "id": "fbd7a6cb",
   "metadata": {},
@@ -282,7 +278,6 @@
   ]
  },
  {
-   "attachments": {},
   "cell_type": "markdown",
   "id": "f349acb9",
   "metadata": {},
@@ -384,7 +379,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.16"
+   "version": "3.10.9"
  }
 },
 "nbformat": 4,
--- a/docs/modules/memory/examples/adding_memory.ipynb
+++ b/docs/modules/memory/examples/adding_memory.ipynb
@@ -121,7 +121,7 @@
      "\n",
      "Human: Hi there my friend\n",
      "AI:  Hi there, how are you doing today?\n",
-      "Human: Not to bad - how are you?\n",
+      "Human: Not too bad - how are you?\n",
      "Chatbot:\u001b[0m\n",
      "\n",
      "\u001b[1m> Finished LLMChain chain.\u001b[0m\n"
--- a/docs/modules/memory/examples/dynamodb_chat_message_history.ipynb
+++ b/docs/modules/memory/examples/dynamodb_chat_message_history.ipynb
@@ -118,6 +118,29 @@
   ]
  },
  {
+   "cell_type": "markdown",
+   "id": "955f1b15",
+   "metadata": {},
+   "source": [
+    "## DynamoDBChatMessageHistory with Custom Endpoint URL\n",
+    "\n",
+    "Sometimes it is useful to specify the URL to the AWS endpoint to connect to. For instance, when you are running locally against [Localstack](https://localstack.cloud/). For those cases you can specify the URL via the `endpoint_url` parameter in the constructor."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "225713c8",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.memory.chat_message_histories import DynamoDBChatMessageHistory\n",
+    "\n",
+    "history = DynamoDBChatMessageHistory(table_name=\"SessionTable\", session_id=\"0\", endpoint_url=\"http://localhost.localstack.cloud:4566\")"
+   ]
+  },
+  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "3b33c988",
   "metadata": {},
--- a/docs/modules/memory/getting_started.ipynb
+++ b/docs/modules/memory/getting_started.ipynb
@@ -1,6 +1,7 @@
 {
 "cells": [
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "d31df93e",
   "metadata": {},
@@ -9,7 +10,7 @@
    "\n",
    "This notebook walks through how LangChain thinks about memory. \n",
    "\n",
-    "Memory involves keeping a concept of state around throughout a user's interactions with an language model. A user's interactions with a language model are captured in the concept of ChatMessages, so this boils down to ingesting, capturing, transforming and extracting knowledge from a sequence of chat messages. There are many different ways to do this, each of which exists as its own memory type.\n",
+    "Memory involves keeping a concept of state around throughout a user's interactions with a language model. A user's interactions with a language model are captured in the concept of ChatMessages, so this boils down to ingesting, capturing, transforming and extracting knowledge from a sequence of chat messages. There are many different ways to do this, each of which exists as its own memory type.\n",
    "\n",
    "In general, for each type of memory there are two ways to understanding using memory. These are the standalone functions which extract information from a sequence of messages, and then there is the way you can use this type of memory in a chain. \n",
    "\n",
@@ -25,7 +26,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 1,
+   "execution_count": 2,
   "id": "87235cf1",
   "metadata": {},
   "outputs": [],
@@ -41,18 +42,18 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 5,
+   "execution_count": 3,
   "id": "be030822",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "[HumanMessage(content='hi!', additional_kwargs={}),\n",
-       " AIMessage(content='whats up?', additional_kwargs={})]"
+       "[HumanMessage(content='hi!', additional_kwargs={}, example=False),\n",
+       " AIMessage(content='whats up?', additional_kwargs={}, example=False)]"
      ]
     },
-     "execution_count": 5,
+     "execution_count": 3,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -75,7 +76,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 7,
+   "execution_count": 4,
   "id": "a382b160",
   "metadata": {},
   "outputs": [],
@@ -85,7 +86,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 10,
+   "execution_count": 5,
   "id": "a280d337",
   "metadata": {},
   "outputs": [],
@@ -97,7 +98,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 12,
+   "execution_count": 7,
   "id": "1b739c0a",
   "metadata": {},
   "outputs": [
@@ -107,7 +108,7 @@
       "{'history': 'Human: hi!\\nAI: whats up?'}"
      ]
     },
-     "execution_count": 12,
+     "execution_count": 7,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -126,7 +127,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 13,
+   "execution_count": 8,
   "id": "798ceb1c",
   "metadata": {},
   "outputs": [],
@@ -138,18 +139,18 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 14,
+   "execution_count": 9,
   "id": "698688fd",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "{'history': [HumanMessage(content='hi!', additional_kwargs={}),\n",
-       "  AIMessage(content='whats up?', additional_kwargs={})]}"
+       "{'history': [HumanMessage(content='hi!', additional_kwargs={}, example=False),\n",
+       "  AIMessage(content='whats up?', additional_kwargs={}, example=False)]}"
      ]
     },
-     "execution_count": 14,
+     "execution_count": 9,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -169,7 +170,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 15,
+   "execution_count": 10,
   "id": "54301321",
   "metadata": {},
   "outputs": [],
@@ -188,7 +189,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 16,
+   "execution_count": 11,
   "id": "ae046bff",
   "metadata": {},
   "outputs": [
@@ -216,7 +217,7 @@
       "\" Hi there! It's nice to meet you. How can I help you today?\""
      ]
     },
-     "execution_count": 16,
+     "execution_count": 11,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -227,7 +228,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 17,
+   "execution_count": 12,
   "id": "d8e2a6ff",
   "metadata": {},
   "outputs": [
@@ -256,7 +257,7 @@
       "\" That's great! It's always nice to have a conversation with someone new. What would you like to talk about?\""
      ]
     },
-     "execution_count": 17,
+     "execution_count": 12,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -267,7 +268,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 18,
+   "execution_count": 13,
   "id": "15eda316",
   "metadata": {},
   "outputs": [
@@ -298,7 +299,7 @@
       "\" Sure! I'm an AI created to help people with their everyday tasks. I'm programmed to understand natural language and provide helpful information. I'm also constantly learning and updating my knowledge base so I can provide more accurate and helpful answers.\""
      ]
     },
-     "execution_count": 18,
+     "execution_count": 13,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -319,7 +320,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 1,
+   "execution_count": 14,
   "id": "b5acbc4b",
   "metadata": {},
   "outputs": [],
@@ -338,7 +339,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 2,
+   "execution_count": 15,
   "id": "7812ee21",
   "metadata": {},
   "outputs": [],
@@ -348,18 +349,20 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 3,
+   "execution_count": 16,
   "id": "3ed6e6a0",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "[{'type': 'human', 'data': {'content': 'hi!', 'additional_kwargs': {}}},\n",
-       " {'type': 'ai', 'data': {'content': 'whats up?', 'additional_kwargs': {}}}]"
+       "[{'type': 'human',\n",
+       "  'data': {'content': 'hi!', 'additional_kwargs': {}, 'example': False}},\n",
+       " {'type': 'ai',\n",
+       "  'data': {'content': 'whats up?', 'additional_kwargs': {}, 'example': False}}]"
      ]
     },
-     "execution_count": 3,
+     "execution_count": 16,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -370,7 +373,7 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 4,
+   "execution_count": 17,
   "id": "cdf4ebd2",
   "metadata": {},
   "outputs": [],
@@ -380,18 +383,18 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 5,
+   "execution_count": 18,
   "id": "9724e24b",
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
-       "[HumanMessage(content='hi!', additional_kwargs={}),\n",
-       " AIMessage(content='whats up?', additional_kwargs={})]"
+       "[HumanMessage(content='hi!', additional_kwargs={}, example=False),\n",
+       " AIMessage(content='whats up?', additional_kwargs={}, example=False)]"
      ]
     },
-     "execution_count": 5,
+     "execution_count": 18,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -407,14 +410,6 @@
   "source": [
    "And that's it for the getting started! There are plenty of different types of memory, check out our examples to see them all"
   ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": null,
-   "id": "3dd37d93",
-   "metadata": {},
-   "outputs": [],
-   "source": []
  }
 ],
 "metadata": {
@@ -433,7 +428,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.10.9"
  }
 },
 "nbformat": 4,
--- a/docs/modules/models/llms/examples/llm_caching.ipynb
+++ b/docs/modules/models/llms/examples/llm_caching.ipynb
@@ -631,7 +631,7 @@
   "id": "56ea6a08",
   "metadata": {},
   "source": [
-    "You'll need to get a Momemto auth token to use this class. This can either be passed in to a momento.CacheClient if you'd like to instantiate that directly, as a named parameter `auth_token` to `MomentoChatMessageHistory.from_client_params`, or can just be set as an environment variable `MOMENTO_AUTH_TOKEN`."
+    "You'll need to get a Momento auth token to use this class. This can either be passed in to a momento.CacheClient if you'd like to instantiate that directly, as a named parameter `auth_token` to `MomentoChatMessageHistory.from_client_params`, or can just be set as an environment variable `MOMENTO_AUTH_TOKEN`."
   ]
  },
  {
--- a/docs/modules/models/llms/integrations/baseten.ipynb
+++ b/docs/modules/models/llms/integrations/baseten.ipynb
@@ -0,0 +1,196 @@
+{
+ "cells": [
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Baseten\n",
+    "\n",
+    "[Baseten](https://baseten.co) provides all the infrastructure you need to deploy and serve ML models performantly, scalably, and cost-efficiently.\n",
+    "\n",
+    "This example demonstrates using Langchain with models deployed on Baseten."
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Setup\n",
+    "\n",
+    "To run this notebook, you'll need a [Baseten account](https://baseten.co) and an [API key](https://docs.baseten.co/settings/api-keys).\n",
+    "\n",
+    "You'll also need to install the Baseten Python package:"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "!pip install baseten"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import baseten\n",
+    "\n",
+    "baseten.login(\"YOUR_API_KEY\")"
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Single model call\n",
+    "\n",
+    "First, you'll need to deploy a model to Baseten.\n",
+    "\n",
+    "You can deploy foundation models like WizardLM and Alpaca with one click from the [Baseten model library](https://app.baseten.co/explore/) or if you have your own model, [deploy it with this tutorial](https://docs.baseten.co/deploying-models/deploy).\n",
+    "\n",
+    "In this example, we'll work with WizardLM. [Deploy WizardLM here](https://app.baseten.co/explore/llama) and follow along with the deployed [model's version ID](https://docs.baseten.co/managing-models/manage)."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.llms import Baseten"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Load the model\n",
+    "wizardlm = Baseten(model=\"MODEL_VERSION_ID\", verbose=True)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Prompt the model\n",
+    "\n",
+    "wizardlm(\"What is the difference between a Wizard and a Sorcerer?\")"
+   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Chained model calls\n",
+    "\n",
+    "We can chain together multiple calls to one or multiple models, which is the whole point of Langchain!\n",
+    "\n",
+    "This example uses WizardLM to plan a meal with an entree, three sides, and an alcoholic and non-alcoholic beverage pairing."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.chains import SimpleSequentialChain\n",
+    "from langchain import PromptTemplate, LLMChain"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Build the first link in the chain\n",
+    "\n",
+    "prompt = PromptTemplate(\n",
+    "    input_variables=[\"cuisine\"],\n",
+    "    template=\"Name a complex entree for a {cuisine} dinner. Respond with just the name of a single dish.\",\n",
+    ")\n",
+    "\n",
+    "link_one = LLMChain(llm=wizardlm, prompt=prompt)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Build the second link in the chain\n",
+    "\n",
+    "prompt = PromptTemplate(\n",
+    "    input_variables=[\"entree\"],\n",
+    "    template=\"What are three sides that would go with {entree}. Respond with only a list of the sides.\",\n",
+    ")\n",
+    "\n",
+    "link_two = LLMChain(llm=wizardlm, prompt=prompt)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Build the third link in the chain\n",
+    "\n",
+    "prompt = PromptTemplate(\n",
+    "    input_variables=[\"sides\"],\n",
+    "    template=\"What is one alcoholic and one non-alcoholic beverage that would go well with this list of sides: {sides}. Respond with only the names of the beverages.\",\n",
+    ")\n",
+    "\n",
+    "link_three = LLMChain(llm=wizardlm, prompt=prompt)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Run the full chain!\n",
+    "\n",
+    "menu_maker = SimpleSequentialChain(chains=[link_one, link_two, link_three], verbose=True)\n",
+    "menu_maker.run(\"South Indian\")"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": ".venv",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.10.4"
+  },
+  "orig_nbformat": 4
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}
--- a/docs/modules/models/llms/integrations/databricks.ipynb
+++ b/docs/modules/models/llms/integrations/databricks.ipynb
@@ -1,6 +1,7 @@
 {
 "cells": [
  {
+   "attachments": {},
   "cell_type": "markdown",
   "metadata": {
    "application/vnd.databricks.v1+cell": {
@@ -43,6 +44,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "metadata": {
    "application/vnd.databricks.v1+cell": {
@@ -163,14 +165,14 @@
   ],
   "source": [
    "# Otherwise, you can manually specify the Databricks workspace hostname and personal access token \n",
-    "# or set `DATABRICKS_HOST` and `DATABRICKS_API_TOKEN` environment variables, respectively.\n",
+    "# or set `DATABRICKS_HOST` and `DATABRICKS_TOKEN` environment variables, respectively.\n",
    "# See https://docs.databricks.com/dev-tools/auth.html#databricks-personal-access-tokens\n",
    "# We strongly recommend not exposing the API token explicitly inside a notebook.\n",
    "# You can use Databricks secret manager to store your API token securely.\n",
    "# See https://docs.databricks.com/dev-tools/databricks-utils.html#secrets-utility-dbutilssecrets\n",
    "\n",
    "import os\n",
-    "os.environ[\"DATABRICKS_API_TOKEN\"] = dbutils.secrets.get(\"myworkspace\", \"api_token\")\n",
+    "os.environ[\"DATABRICKS_TOKEN\"] = dbutils.secrets.get(\"myworkspace\", \"api_token\")\n",
    "\n",
    "llm = Databricks(host=\"myworkspace.cloud.databricks.com\", endpoint_name=\"dolly\")\n",
    "\n",
@@ -257,6 +259,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "metadata": {
    "application/vnd.databricks.v1+cell": {
@@ -273,7 +276,7 @@
    "Prerequisites:\n",
    "* An LLM loaded on a Databricks interactive cluster in \"single user\" or \"no isolation shared\" mode.\n",
    "* A local HTTP server running on the driver node to serve the model at `\"/\"` using HTTP POST with JSON input/output.\n",
-    "* It uses a port number between `[3000, 8000]` and litens to the driver IP address or simply `0.0.0.0` instead of localhost only.\n",
+    "* It uses a port number between `[3000, 8000]` and listens to the driver IP address or simply `0.0.0.0` instead of localhost only.\n",
    "* You have \"Can Attach To\" permission to the cluster.\n",
    "\n",
    "The expected server schema (using JSON schema) is:\n",
--- a/docs/modules/models/llms/integrations/huggingface_hub.ipynb
+++ b/docs/modules/models/llms/integrations/huggingface_hub.ipynb
@@ -1,6 +1,7 @@
 {
 "cells": [
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "959300d4",
   "metadata": {},
@@ -13,6 +14,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "4c1b8450-5eaf-4d34-8341-2d785448a1ff",
   "metadata": {
@@ -60,6 +62,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "84dd44c1-c428-41f3-a911-520281386c94",
   "metadata": {},
@@ -104,6 +107,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "ddaa06cf-95ec-48ce-b0ab-d892a7909693",
   "metadata": {},
@@ -114,6 +118,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "4fa9337e-ccb5-4c52-9b7c-1653148bc256",
   "metadata": {},
@@ -158,13 +163,14 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "1a5c97af-89bc-4e59-95c1-223742a9160b",
   "metadata": {},
   "source": [
-    "### Dolly, by DataBricks\n",
+    "### Dolly, by Databricks\n",
    "\n",
-    "See [DataBricks](https://huggingface.co/databricks) organization page for a list of available models."
+    "See [Databricks](https://huggingface.co/databricks) organization page for a list of available models."
   ]
  },
  {
@@ -196,6 +202,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "03f6ae52-b5f9-4de6-832c-551cb3fa11ae",
   "metadata": {},
@@ -233,6 +240,7 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "2bf838eb-1083-402f-b099-b07c452418c8",
   "metadata": {},
--- a/docs/modules/models/text_embedding/examples/deepinfra.ipynb
+++ b/docs/modules/models/text_embedding/examples/deepinfra.ipynb
@@ -0,0 +1,133 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# DeepInfra\n",
+    "\n",
+    "[DeepInfra](https://deepinfra.com/?utm_source=langchain) is a serverless inference as a service that provides access to a [variety of LLMs](https://deepinfra.com/models?utm_source=langchain) and [embeddings models](https://deepinfra.com/models?type=embeddings&utm_source=langchain). This notebook goes over how to use LangChain with DeepInfra for text embeddings."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdin",
+     "output_type": "stream",
+     "text": [
+      " ········\n"
+     ]
+    }
+   ],
+   "source": [
+    "# sign up for an account: https://deepinfra.com/login?utm_source=langchain\n",
+    "\n",
+    "from getpass import getpass\n",
+    "\n",
+    "DEEPINFRA_API_TOKEN = getpass()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import os\n",
+    "\n",
+    "os.environ[\"DEEPINFRA_API_TOKEN\"] = DEEPINFRA_API_TOKEN"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.embeddings import DeepInfraEmbeddings"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "embeddings = DeepInfraEmbeddings(\n",
+    "    model_id=\"sentence-transformers/clip-ViT-B-32\",\n",
+    "    query_instruction=\"\",\n",
+    "    embed_instruction=\"\",\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "docs = [\"Dog is not a cat\",\n",
+    "        \"Beta is the second letter of Greek alphabet\"]\n",
+    "document_result = embeddings.embed_documents(docs)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "query = \"What is the first letter of Greek alphabet\"\n",
+    "query_result = embeddings.embed_query(query)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 7,
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Cosine similarity between \"Dog is not a cat\" and query: 0.7489097144129355\n",
+      "Cosine similarity between \"Beta is the second letter of Greek alphabet\" and query: 0.9519380640702013\n"
+     ]
+    }
+   ],
+   "source": [
+    "import numpy as np\n",
+    "\n",
+    "query_numpy = np.array(query_result)\n",
+    "for doc_res, doc in zip(document_result, docs):\n",
+    "    document_numpy = np.array(doc_res)\n",
+    "    similarity = np.dot(query_numpy, document_numpy) / (np.linalg.norm(query_numpy)*np.linalg.norm(document_numpy))\n",
+    "    print(f\"Cosine similarity between \\\"{doc}\\\" and query: {similarity}\")"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.10.10"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 4
+}
--- a/langchain/agents/agent.py
+++ b/langchain/agents/agent.py
@@ -864,7 +864,11 @@ class AgentExecutor(Chain):
                raise e
            text = str(e)
            if isinstance(self.handle_parsing_errors, bool):
-                observation = "Invalid or incomplete response"
+                if e.send_to_llm:
+                    observation = str(e.observation)
+                    text = str(e.llm_output)
+                else:
+                    observation = "Invalid or incomplete response"
            elif isinstance(self.handle_parsing_errors, str):
                observation = self.handle_parsing_errors
            elif callable(self.handle_parsing_errors):
--- a/langchain/agents/load_tools.py
+++ b/langchain/agents/load_tools.py
@@ -34,6 +34,7 @@ from langchain.tools.requests.tool import (
 from langchain.tools.scenexplain.tool import SceneXplainTool
 from langchain.tools.searx_search.tool import SearxSearchResults, SearxSearchRun
 from langchain.tools.shell.tool import ShellTool
+from langchain.tools.sleep.tool import SleepTool
 from langchain.tools.wikipedia.tool import WikipediaQueryRun
 from langchain.tools.wolfram_alpha.tool import WolframAlphaQueryRun
 from langchain.tools.openweathermap.tool import OpenWeatherMapQueryRun
@@ -82,6 +83,10 @@ def _get_terminal() -> BaseTool:
    return ShellTool()


+def _get_sleep() -> BaseTool:
+    return SleepTool()
+
+
 _BASE_TOOLS: Dict[str, Callable[[], BaseTool]] = {
    "python_repl": _get_python_repl,
    "requests": _get_tools_requests_get,  # preserved for backwards compatability
@@ -91,6 +96,7 @@ _BASE_TOOLS: Dict[str, Callable[[], BaseTool]] = {
    "requests_put": _get_tools_requests_put,
    "requests_delete": _get_tools_requests_delete,
    "terminal": _get_terminal,
+    "sleep": _get_sleep,
 }


--- a/langchain/base_language.py
+++ b/langchain/base_language.py
@@ -2,7 +2,7 @@
 from __future__ import annotations

 from abc import ABC, abstractmethod
-from typing import List, Optional, Sequence, Set
+from typing import Any, List, Optional, Sequence, Set

 from pydantic import BaseModel

@@ -36,6 +36,7 @@ class BaseLanguageModel(BaseModel, ABC):
        prompts: List[PromptValue],
        stop: Optional[List[str]] = None,
        callbacks: Callbacks = None,
+        **kwargs: Any,
    ) -> LLMResult:
        """Take in a list of prompt values and return an LLMResult."""

@@ -45,26 +46,39 @@ class BaseLanguageModel(BaseModel, ABC):
        prompts: List[PromptValue],
        stop: Optional[List[str]] = None,
        callbacks: Callbacks = None,
+        **kwargs: Any,
    ) -> LLMResult:
        """Take in a list of prompt values and return an LLMResult."""

    @abstractmethod
-    def predict(self, text: str, *, stop: Optional[Sequence[str]] = None) -> str:
+    def predict(
+        self, text: str, *, stop: Optional[Sequence[str]] = None, **kwargs: Any
+    ) -> str:
        """Predict text from text."""

    @abstractmethod
    def predict_messages(
-        self, messages: List[BaseMessage], *, stop: Optional[Sequence[str]] = None
+        self,
+        messages: List[BaseMessage],
+        *,
+        stop: Optional[Sequence[str]] = None,
+        **kwargs: Any,
    ) -> BaseMessage:
        """Predict message from messages."""

    @abstractmethod
-    async def apredict(self, text: str, *, stop: Optional[Sequence[str]] = None) -> str:
+    async def apredict(
+        self, text: str, *, stop: Optional[Sequence[str]] = None, **kwargs: Any
+    ) -> str:
        """Predict text from text."""

    @abstractmethod
    async def apredict_messages(
-        self, messages: List[BaseMessage], *, stop: Optional[Sequence[str]] = None
+        self,
+        messages: List[BaseMessage],
+        *,
+        stop: Optional[Sequence[str]] = None,
+        **kwargs: Any,
    ) -> BaseMessage:
        """Predict message from messages."""

--- a/langchain/callbacks/manager.py
+++ b/langchain/callbacks/manager.py
@@ -133,15 +133,11 @@ def trace_as_chain_group(
    *,
    session_name: Optional[str] = None,
    example_id: Optional[Union[str, UUID]] = None,
-    tenant_id: Optional[str] = None,
-    session_extra: Optional[Dict[str, Any]] = None,
 ) -> Generator[CallbackManager, None, None]:
    """Get a callback manager for a chain group in a context manager."""
    cb = LangChainTracer(
-        tenant_id=tenant_id,
        session_name=session_name,
        example_id=example_id,
-        session_extra=session_extra,
    )
    cm = CallbackManager.configure(
        inheritable_callbacks=[cb],
@@ -158,15 +154,11 @@ async def atrace_as_chain_group(
    *,
    session_name: Optional[str] = None,
    example_id: Optional[Union[str, UUID]] = None,
-    tenant_id: Optional[str] = None,
-    session_extra: Optional[Dict[str, Any]] = None,
 ) -> AsyncGenerator[AsyncCallbackManager, None]:
    """Get a callback manager for a chain group in a context manager."""
    cb = LangChainTracer(
-        tenant_id=tenant_id,
        session_name=session_name,
        example_id=example_id,
-        session_extra=session_extra,
    )
    cm = AsyncCallbackManager.configure(
        inheritable_callbacks=[cb],
@@ -246,6 +238,8 @@ async def _ahandle_event_for_handler(
        else:
            logger.warning(f"Error in {event_name} callback: {e}")
    except Exception as e:
+        if handler.raise_error:
+            raise e
        logger.warning(f"Error in {event_name} callback: {e}")


@@ -878,6 +872,16 @@ class AsyncCallbackManager(BaseCallbackManager):
 T = TypeVar("T", CallbackManager, AsyncCallbackManager)


+def env_var_is_set(env_var: str) -> bool:
+    """Check if an environment variable is set."""
+    return env_var in os.environ and os.environ[env_var] not in (
+        "",
+        "0",
+        "false",
+        "False",
+    )
+
+
 def _configure(
    callback_manager_cls: Type[T],
    inheritable_callbacks: Callbacks = None,
@@ -911,18 +915,17 @@ def _configure(
    wandb_tracer = wandb_tracing_callback_var.get()
    open_ai = openai_callback_var.get()
    tracing_enabled_ = (
-        os.environ.get("LANGCHAIN_TRACING") is not None
+        env_var_is_set("LANGCHAIN_TRACING")
        or tracer is not None
-        or os.environ.get("LANGCHAIN_HANDLER") is not None
+        or env_var_is_set("LANGCHAIN_HANDLER")
    )
    wandb_tracing_enabled_ = (
-        os.environ.get("LANGCHAIN_WANDB_TRACING") is not None
-        or wandb_tracer is not None
+        env_var_is_set("LANGCHAIN_WANDB_TRACING") or wandb_tracer is not None
    )

    tracer_v2 = tracing_v2_callback_var.get()
    tracing_v2_enabled_ = (
-        os.environ.get("LANGCHAIN_TRACING_V2") is not None or tracer_v2 is not None
+        env_var_is_set("LANGCHAIN_TRACING_V2") or tracer_v2 is not None
    )
    tracer_session = os.environ.get("LANGCHAIN_SESSION")
    debug = _get_debug()
--- a/langchain/callbacks/tracers/langchain.py
+++ b/langchain/callbacks/tracers/langchain.py
@@ -8,61 +8,16 @@ from datetime import datetime
 from typing import Any, Dict, List, Optional, Union
 from uuid import UUID

-import requests
-from requests.exceptions import HTTPError
-from tenacity import (
-    before_sleep_log,
-    retry,
-    retry_if_exception_type,
-    stop_after_attempt,
-    wait_exponential,
-)
+from langchainplus_sdk import LangChainPlusClient

 from langchain.callbacks.tracers.base import BaseTracer
-from langchain.callbacks.tracers.schemas import (
-    Run,
-    RunCreate,
-    RunTypeEnum,
-    RunUpdate,
-    TracerSession,
-)
+from langchain.callbacks.tracers.schemas import Run, RunTypeEnum, TracerSession
+from langchain.env import get_runtime_environment
 from langchain.schema import BaseMessage, messages_to_dict

 logger = logging.getLogger(__name__)


-def get_headers() -> Dict[str, Any]:
-    """Get the headers for the LangChain API."""
-    headers: Dict[str, Any] = {"Content-Type": "application/json"}
-    if os.getenv("LANGCHAIN_API_KEY"):
-        headers["x-api-key"] = os.getenv("LANGCHAIN_API_KEY")
-    return headers
-
-
-def get_endpoint() -> str:
-    return os.getenv("LANGCHAIN_ENDPOINT", "http://localhost:1984")
-
-
-class LangChainTracerAPIError(Exception):
-    """An error occurred while communicating with the LangChain API."""
-
-
-class LangChainTracerUserError(Exception):
-    """An error occurred while communicating with the LangChain API."""
-
-
-class LangChainTracerError(Exception):
-    """An error occurred while communicating with the LangChain API."""
-
-
-retry_decorator = retry(
-    stop=stop_after_attempt(3),
-    wait=wait_exponential(multiplier=1, min=4, max=10),
-    retry=retry_if_exception_type(LangChainTracerAPIError),
-    before_sleep=before_sleep_log(logger, logging.WARNING),
-)
-
-
 class LangChainTracer(BaseTracer):
    """An implementation of the SharedTracer that POSTS to the langchain endpoint."""

@@ -70,19 +25,19 @@ class LangChainTracer(BaseTracer):
        self,
        example_id: Optional[Union[UUID, str]] = None,
        session_name: Optional[str] = None,
+        client: Optional[LangChainPlusClient] = None,
        **kwargs: Any,
    ) -> None:
        """Initialize the LangChain tracer."""
        super().__init__(**kwargs)
        self.session: Optional[TracerSession] = None
-        self._endpoint = get_endpoint()
-        self._headers = get_headers()
        self.example_id = (
            UUID(example_id) if isinstance(example_id, str) else example_id
        )
        self.session_name = session_name or os.getenv("LANGCHAIN_SESSION", "default")
        # set max_workers to 1 to process tasks in order
        self.executor = ThreadPoolExecutor(max_workers=1)
+        self.client = client or LangChainPlusClient()

    def on_chat_model_start(
        self,
@@ -114,60 +69,19 @@ class LangChainTracer(BaseTracer):
    def _persist_run(self, run: Run) -> None:
        """The Langchain Tracer uses Post/Patch rather than persist."""

-    @retry_decorator
    def _persist_run_single(self, run: Run) -> None:
        """Persist a run."""
        if run.parent_run_id is None:
            run.reference_example_id = self.example_id
-        run_dict = run.dict()
-        del run_dict["child_runs"]
-        run_create = RunCreate(**run_dict, session_name=self.session_name)
-        response = None
-        try:
-            # TODO: Add retries when async
-            response = requests.post(
-                f"{self._endpoint}/runs",
-                data=run_create.json(),
-                headers=self._headers,
-            )
-            response.raise_for_status()
-        except HTTPError as e:
-            if response is not None and response.status_code == 500:
-                raise LangChainTracerAPIError(
-                    f"Failed to upsert persist run to LangChain API. {e}"
-                )
-            else:
-                raise LangChainTracerUserError(
-                    f"Failed to persist run to LangChain API. {e}"
-                )
-        except Exception as e:
-            raise LangChainTracerError(
-                f"Failed to persist run to LangChain API. {e}"
-            ) from e
+        run_dict = run.dict(exclude={"child_runs"})
+        extra = run_dict.get("extra", {})
+        extra["runtime"] = get_runtime_environment()
+        run_dict["extra"] = extra
+        run = self.client.create_run(**run_dict, session_name=self.session_name)

-    @retry_decorator
    def _update_run_single(self, run: Run) -> None:
        """Update a run."""
-        run_update = RunUpdate(**run.dict())
-        response = None
-        try:
-            response = requests.patch(
-                f"{self._endpoint}/runs/{run.id}",
-                data=run_update.json(),
-                headers=self._headers,
-            )
-            response.raise_for_status()
-        except HTTPError as e:
-            if response is not None and response.status_code == 500:
-                raise LangChainTracerAPIError(
-                    f"Failed to update run to LangChain API. {e}"
-                )
-            else:
-                raise LangChainTracerUserError(f"Failed to run to LangChain API. {e}")
-        except Exception as e:
-            raise LangChainTracerError(
-                f"Failed to update run to LangChain API. {e}"
-            ) from e
+        self.client.update_run(run.id, **run.dict())

    def _on_llm_start(self, run: Run) -> None:
        """Persist an LLM run."""
--- a/langchain/callbacks/tracers/langchain_v1.py
+++ b/langchain/callbacks/tracers/langchain_v1.py
@@ -2,12 +2,11 @@ from __future__ import annotations

 import logging
 import os
-from typing import Any, Optional, Union
+from typing import Any, Dict, Optional, Union

 import requests

 from langchain.callbacks.tracers.base import BaseTracer
-from langchain.callbacks.tracers.langchain import get_headers
 from langchain.callbacks.tracers.schemas import (
    ChainRun,
    LLMRun,
@@ -21,6 +20,14 @@ from langchain.schema import get_buffer_string
 from langchain.utils import raise_for_status_with_text


+def get_headers() -> Dict[str, Any]:
+    """Get the headers for the LangChain API."""
+    headers: Dict[str, Any] = {"Content-Type": "application/json"}
+    if os.getenv("LANGCHAIN_API_KEY"):
+        headers["x-api-key"] = os.getenv("LANGCHAIN_API_KEY")
+    return headers
+
+
 def _get_endpoint() -> str:
    return os.getenv("LANGCHAIN_ENDPOINT", "http://localhost:8000")

--- a/langchain/callbacks/tracers/stdout.py
+++ b/langchain/callbacks/tracers/stdout.py
@@ -8,7 +8,7 @@ from langchain.input import get_bolded_text, get_colored_text

 def try_json_stringify(obj: Any, fallback: str) -> str:
    try:
-        return json.dumps(obj, indent=2)
+        return json.dumps(obj, indent=2, ensure_ascii=False)
    except Exception:
        return fallback

--- a/langchain/callbacks/tracers/wandb.py
+++ b/langchain/callbacks/tracers/wandb.py
@@ -60,10 +60,20 @@ def _convert_llm_run_to_wb_span(trace_tree: Any, run: Run) -> trace_tree.Span:
    return base_span


+def _serialize_inputs(run_inputs: dict) -> Union[dict, list]:
+    if "input_documents" in run_inputs:
+        docs = run_inputs["input_documents"]
+        return [doc.json() for doc in docs]
+    else:
+        return run_inputs
+
+
 def _convert_chain_run_to_wb_span(trace_tree: Any, run: Run) -> trace_tree.Span:
    base_span = _convert_run_to_wb_span(trace_tree, run)

-    base_span.results = [trace_tree.Result(inputs=run.inputs, outputs=run.outputs)]
+    base_span.results = [
+        trace_tree.Result(inputs=_serialize_inputs(run.inputs), outputs=run.outputs)
+    ]
    base_span.child_spans = [
        _convert_lc_run_to_wb_span(trace_tree, child_run)
        for child_run in run.child_runs
@@ -79,7 +89,9 @@ def _convert_chain_run_to_wb_span(trace_tree: Any, run: Run) -> trace_tree.Span:

 def _convert_tool_run_to_wb_span(trace_tree: Any, run: Run) -> trace_tree.Span:
    base_span = _convert_run_to_wb_span(trace_tree, run)
-    base_span.results = [trace_tree.Result(inputs=run.inputs, outputs=run.outputs)]
+    base_span.results = [
+        trace_tree.Result(inputs=_serialize_inputs(run.inputs), outputs=run.outputs)
+    ]
    base_span.child_spans = [
        _convert_lc_run_to_wb_span(trace_tree, child_run)
        for child_run in run.child_runs
--- a/langchain/chains/init.py
+++ b/langchain/chains/init.py
@@ -11,6 +11,7 @@ from langchain.chains.conversational_retrieval.base import (
 from langchain.chains.flare.base import FlareChain
 from langchain.chains.graph_qa.base import GraphQAChain
 from langchain.chains.graph_qa.cypher import GraphCypherQAChain
+from langchain.chains.graph_qa.nebulagraph import NebulaGraphQAChain
 from langchain.chains.hyde.base import HypotheticalDocumentEmbedder
 from langchain.chains.llm import LLMChain
 from langchain.chains.llm_bash.base import LLMBashChain
@@ -67,4 +68,5 @@ __all__ = [
    "ConversationalRetrievalChain",
    "OpenAPIEndpointChain",
    "FlareChain",
+    "NebulaGraphQAChain",
 ]
--- a/langchain/chains/base.py
+++ b/langchain/chains/base.py
@@ -18,7 +18,7 @@ from langchain.callbacks.manager import (
    CallbackManagerForChainRun,
    Callbacks,
 )
-from langchain.schema import BaseMemory
+from langchain.schema import RUN_KEY, BaseMemory, RunInfo


 def _get_verbosity() -> bool:
@@ -108,6 +108,8 @@ class Chain(BaseModel, ABC):
        inputs: Union[Dict[str, Any], Any],
        return_only_outputs: bool = False,
        callbacks: Callbacks = None,
+        *,
+        include_run_info: bool = False,
    ) -> Dict[str, Any]:
        """Run the logic of this chain and add to output if desired.

@@ -118,7 +120,10 @@ class Chain(BaseModel, ABC):
                response. If True, only new keys generated by this chain will be
                returned. If False, both input keys and new keys generated by this
                chain will be returned. Defaults to False.
-
+            callbacks: Callbacks to use for this chain run. If not provided, will
+                use the callbacks provided to the chain.
+            include_run_info: Whether to include run info in the response. Defaults
+                to False.
        """
        inputs = self.prep_inputs(inputs)
        callback_manager = CallbackManager.configure(
@@ -139,13 +144,20 @@ class Chain(BaseModel, ABC):
            run_manager.on_chain_error(e)
            raise e
        run_manager.on_chain_end(outputs)
-        return self.prep_outputs(inputs, outputs, return_only_outputs)
+        final_outputs: Dict[str, Any] = self.prep_outputs(
+            inputs, outputs, return_only_outputs
+        )
+        if include_run_info:
+            final_outputs[RUN_KEY] = RunInfo(run_id=run_manager.run_id)
+        return final_outputs

    async def acall(
        self,
        inputs: Union[Dict[str, Any], Any],
        return_only_outputs: bool = False,
        callbacks: Callbacks = None,
+        *,
+        include_run_info: bool = False,
    ) -> Dict[str, Any]:
        """Run the logic of this chain and add to output if desired.

@@ -156,7 +168,10 @@ class Chain(BaseModel, ABC):
                response. If True, only new keys generated by this chain will be
                returned. If False, both input keys and new keys generated by this
                chain will be returned. Defaults to False.
-
+            callbacks: Callbacks to use for this chain run. If not provided, will
+                use the callbacks provided to the chain.
+            include_run_info: Whether to include run info in the response. Defaults
+                to False.
        """
        inputs = self.prep_inputs(inputs)
        callback_manager = AsyncCallbackManager.configure(
@@ -177,7 +192,12 @@ class Chain(BaseModel, ABC):
            await run_manager.on_chain_error(e)
            raise e
        await run_manager.on_chain_end(outputs)
-        return self.prep_outputs(inputs, outputs, return_only_outputs)
+        final_outputs: Dict[str, Any] = self.prep_outputs(
+            inputs, outputs, return_only_outputs
+        )
+        if include_run_info:
+            final_outputs[RUN_KEY] = RunInfo(run_id=run_manager.run_id)
+        return final_outputs

    def prep_outputs(
        self,
--- a/langchain/chains/conversational_retrieval/base.py
+++ b/langchain/chains/conversational_retrieval/base.py
@@ -12,6 +12,7 @@ from langchain.base_language import BaseLanguageModel
 from langchain.callbacks.manager import (
    AsyncCallbackManagerForChainRun,
    CallbackManagerForChainRun,
+    Callbacks,
 )
 from langchain.chains.base import Chain
 from langchain.chains.combine_documents.base import BaseCombineDocumentsChain
@@ -204,6 +205,7 @@ class ConversationalRetrievalChain(BaseConversationalRetrievalChain):
        verbose: bool = False,
        condense_question_llm: Optional[BaseLanguageModel] = None,
        combine_docs_chain_kwargs: Optional[Dict] = None,
+        callbacks: Callbacks = None,
        **kwargs: Any,
    ) -> BaseConversationalRetrievalChain:
        """Load chain from LLM."""
@@ -212,17 +214,22 @@ class ConversationalRetrievalChain(BaseConversationalRetrievalChain):
            llm,
            chain_type=chain_type,
            verbose=verbose,
+            callbacks=callbacks,
            **combine_docs_chain_kwargs,
        )

        _llm = condense_question_llm or llm
        condense_question_chain = LLMChain(
-            llm=_llm, prompt=condense_question_prompt, verbose=verbose
+            llm=_llm,
+            prompt=condense_question_prompt,
+            verbose=verbose,
+            callbacks=callbacks,
        )
        return cls(
            retriever=retriever,
            combine_docs_chain=doc_chain,
            question_generator=condense_question_chain,
+            callbacks=callbacks,
            **kwargs,
        )

@@ -264,6 +271,7 @@ class ChatVectorDBChain(BaseConversationalRetrievalChain):
        condense_question_prompt: BasePromptTemplate = CONDENSE_QUESTION_PROMPT,
        chain_type: str = "stuff",
        combine_docs_chain_kwargs: Optional[Dict] = None,
+        callbacks: Callbacks = None,
        **kwargs: Any,
    ) -> BaseConversationalRetrievalChain:
        """Load chain from LLM."""
@@ -271,12 +279,16 @@ class ChatVectorDBChain(BaseConversationalRetrievalChain):
        doc_chain = load_qa_chain(
            llm,
            chain_type=chain_type,
+            callbacks=callbacks,
            **combine_docs_chain_kwargs,
        )
-        condense_question_chain = LLMChain(llm=llm, prompt=condense_question_prompt)
+        condense_question_chain = LLMChain(
+            llm=llm, prompt=condense_question_prompt, callbacks=callbacks
+        )
        return cls(
            vectorstore=vectorstore,
            combine_docs_chain=doc_chain,
            question_generator=condense_question_chain,
+            callbacks=callbacks,
            **kwargs,
        )
--- a/langchain/chains/graph_qa/cypher.py
+++ b/langchain/chains/graph_qa/cypher.py
@@ -14,6 +14,8 @@ from langchain.chains.llm import LLMChain
 from langchain.graphs.neo4j_graph import Neo4jGraph
 from langchain.prompts.base import BasePromptTemplate

+INTERMEDIATE_STEPS_KEY = "intermediate_steps"
+

 def extract_cypher(text: str) -> str:
    # The pattern to find Cypher code enclosed in triple backticks
@@ -33,6 +35,12 @@ class GraphCypherQAChain(Chain):
    qa_chain: LLMChain
    input_key: str = "query"  #: :meta private:
    output_key: str = "result"  #: :meta private:
+    top_k: int = 10
+    """Number of results to return from the query"""
+    return_intermediate_steps: bool = False
+    """Whether or not to return the intermediate steps along with the final answer."""
+    return_direct: bool = False
+    """Whether or not to return the result of querying the graph directly."""

    @property
    def input_keys(self) -> List[str]:
@@ -74,12 +82,14 @@ class GraphCypherQAChain(Chain):
        self,
        inputs: Dict[str, Any],
        run_manager: Optional[CallbackManagerForChainRun] = None,
-    ) -> Dict[str, str]:
+    ) -> Dict[str, Any]:
        """Generate Cypher statement, use it to look up in db and answer question."""
        _run_manager = run_manager or CallbackManagerForChainRun.get_noop_manager()
        callbacks = _run_manager.get_child()
        question = inputs[self.input_key]

+        intermediate_steps: List = []
+
        generated_cypher = self.cypher_generation_chain.run(
            {"question": question, "schema": self.graph.get_schema}, callbacks=callbacks
        )
@@ -91,14 +101,30 @@ class GraphCypherQAChain(Chain):
        _run_manager.on_text(
            generated_cypher, color="green", end="\n", verbose=self.verbose
        )
-        context = self.graph.query(generated_cypher)

-        _run_manager.on_text("Full Context:", end="\n", verbose=self.verbose)
-        _run_manager.on_text(
-            str(context), color="green", end="\n", verbose=self.verbose
-        )
-        result = self.qa_chain(
-            {"question": question, "context": context},
-            callbacks=callbacks,
-        )
-        return {self.output_key: result[self.qa_chain.output_key]}
+        intermediate_steps.append({"query": generated_cypher})
+
+        # Retrieve and limit the number of results
+        context = self.graph.query(generated_cypher)[: self.top_k]
+
+        if self.return_direct:
+            final_result = context
+        else:
+            _run_manager.on_text("Full Context:", end="\n", verbose=self.verbose)
+            _run_manager.on_text(
+                str(context), color="green", end="\n", verbose=self.verbose
+            )
+
+            intermediate_steps.append({"context": context})
+
+            result = self.qa_chain(
+                {"question": question, "context": context},
+                callbacks=callbacks,
+            )
+            final_result = result[self.qa_chain.output_key]
+
+        chain_result: Dict[str, Any] = {self.output_key: final_result}
+        if self.return_intermediate_steps:
+            chain_result[INTERMEDIATE_STEPS_KEY] = intermediate_steps
+
+        return chain_result
--- a/langchain/chains/graph_qa/nebulagraph.py
+++ b/langchain/chains/graph_qa/nebulagraph.py
@@ -0,0 +1,91 @@
+"""Question answering over a graph."""
+from __future__ import annotations
+
+from typing import Any, Dict, List, Optional
+
+from pydantic import Field
+
+from langchain.base_language import BaseLanguageModel
+from langchain.callbacks.manager import CallbackManagerForChainRun
+from langchain.chains.base import Chain
+from langchain.chains.graph_qa.prompts import CYPHER_QA_PROMPT, NGQL_GENERATION_PROMPT
+from langchain.chains.llm import LLMChain
+from langchain.graphs.nebula_graph import NebulaGraph
+from langchain.prompts.base import BasePromptTemplate
+
+
+class NebulaGraphQAChain(Chain):
+    """Chain for question-answering against a graph by generating nGQL statements."""
+
+    graph: NebulaGraph = Field(exclude=True)
+    ngql_generation_chain: LLMChain
+    qa_chain: LLMChain
+    input_key: str = "query"  #: :meta private:
+    output_key: str = "result"  #: :meta private:
+
+    @property
+    def input_keys(self) -> List[str]:
+        """Return the input keys.
+
+        :meta private:
+        """
+        return [self.input_key]
+
+    @property
+    def output_keys(self) -> List[str]:
+        """Return the output keys.
+
+        :meta private:
+        """
+        _output_keys = [self.output_key]
+        return _output_keys
+
+    @classmethod
+    def from_llm(
+        cls,
+        llm: BaseLanguageModel,
+        *,
+        qa_prompt: BasePromptTemplate = CYPHER_QA_PROMPT,
+        ngql_prompt: BasePromptTemplate = NGQL_GENERATION_PROMPT,
+        **kwargs: Any,
+    ) -> NebulaGraphQAChain:
+        """Initialize from LLM."""
+        qa_chain = LLMChain(llm=llm, prompt=qa_prompt)
+        ngql_generation_chain = LLMChain(llm=llm, prompt=ngql_prompt)
+
+        return cls(
+            qa_chain=qa_chain,
+            ngql_generation_chain=ngql_generation_chain,
+            **kwargs,
+        )
+
+    def _call(
+        self,
+        inputs: Dict[str, Any],
+        run_manager: Optional[CallbackManagerForChainRun] = None,
+    ) -> Dict[str, str]:
+        """Generate nGQL statement, use it to look up in db and answer question."""
+        _run_manager = run_manager or CallbackManagerForChainRun.get_noop_manager()
+        callbacks = _run_manager.get_child()
+        question = inputs[self.input_key]
+
+        generated_ngql = self.ngql_generation_chain.run(
+            {"question": question, "schema": self.graph.get_schema}, callbacks=callbacks
+        )
+
+        _run_manager.on_text("Generated nGQL:", end="\n", verbose=self.verbose)
+        _run_manager.on_text(
+            generated_ngql, color="green", end="\n", verbose=self.verbose
+        )
+        context = self.graph.query(generated_ngql)
+
+        _run_manager.on_text("Full Context:", end="\n", verbose=self.verbose)
+        _run_manager.on_text(
+            str(context), color="green", end="\n", verbose=self.verbose
+        )
+
+        result = self.qa_chain(
+            {"question": question, "context": context},
+            callbacks=callbacks,
+        )
+        return {self.output_key: result[self.qa_chain.output_key]}
--- a/langchain/chains/graph_qa/prompts.py
+++ b/langchain/chains/graph_qa/prompts.py
@@ -49,6 +49,29 @@ CYPHER_GENERATION_PROMPT = PromptTemplate(
    input_variables=["schema", "question"], template=CYPHER_GENERATION_TEMPLATE
 )

+NEBULAGRAPH_EXTRA_INSTRUCTIONS = """
+Instructions:
+
+First, generate cypher then convert it to NebulaGraph Cypher dialect(rather than standard):
+1. it requires explicit label specification when referring to node properties: v.`Foo`.name
+2. it uses double equals sign for comparison: `==` rather than `=`
+For instance:
+```diff
+< MATCH (p:person)-[:directed]->(m:movie) WHERE m.name = 'The Godfather II'
+< RETURN p.name;
+---
+> MATCH (p:`person`)-[:directed]->(m:`movie`) WHERE m.`movie`.`name` == 'The Godfather II'
+> RETURN p.`person`.`name`;
+```\n"""
+
+NGQL_GENERATION_TEMPLATE = CYPHER_GENERATION_TEMPLATE.replace(
+    "Generate Cypher", "Generate NebulaGraph Cypher"
+).replace("Instructions:", NEBULAGRAPH_EXTRA_INSTRUCTIONS)
+
+NGQL_GENERATION_PROMPT = PromptTemplate(
+    input_variables=["schema", "question"], template=NGQL_GENERATION_TEMPLATE
+)
+
 CYPHER_QA_TEMPLATE = """You are an assistant that helps to form nice and human understandable answers.
 The information part contains the provided information that you must use to construct an answer.
 The provided information is authorative, you must never doubt it or try to use your internal knowledge to correct it.
--- a/langchain/chains/loading.py
+++ b/langchain/chains/loading.py
@@ -20,7 +20,7 @@ from langchain.chains.llm_requests import LLMRequestsChain
 from langchain.chains.pal.base import PALChain
 from langchain.chains.qa_with_sources.base import QAWithSourcesChain
 from langchain.chains.qa_with_sources.vector_db import VectorDBQAWithSourcesChain
-from langchain.chains.retrieval_qa.base import VectorDBQA
+from langchain.chains.retrieval_qa.base import RetrievalQA, VectorDBQA
 from langchain.chains.sql_database.base import SQLDatabaseChain
 from langchain.llms.loading import load_llm, load_llm_from_config
 from langchain.prompts.loading import load_prompt, load_prompt_from_config
@@ -372,6 +372,28 @@ def _load_vector_db_qa_with_sources_chain(
    )


+def _load_retrieval_qa(config: dict, **kwargs: Any) -> RetrievalQA:
+    if "retriever" in kwargs:
+        retriever = kwargs.pop("retriever")
+    else:
+        raise ValueError("`retriever` must be present.")
+    if "combine_documents_chain" in config:
+        combine_documents_chain_config = config.pop("combine_documents_chain")
+        combine_documents_chain = load_chain_from_config(combine_documents_chain_config)
+    elif "combine_documents_chain_path" in config:
+        combine_documents_chain = load_chain(config.pop("combine_documents_chain_path"))
+    else:
+        raise ValueError(
+            "One of `combine_documents_chain` or "
+            "`combine_documents_chain_path` must be present."
+        )
+    return RetrievalQA(
+        combine_documents_chain=combine_documents_chain,
+        retriever=retriever,
+        **config,
+    )
+
+
 def _load_vector_db_qa(config: dict, **kwargs: Any) -> VectorDBQA:
    if "vectorstore" in kwargs:
        vectorstore = kwargs.pop("vectorstore")
@@ -459,6 +481,7 @@ type_to_loader_dict = {
    "sql_database_chain": _load_sql_database_chain,
    "vector_db_qa_with_sources_chain": _load_vector_db_qa_with_sources_chain,
    "vector_db_qa": _load_vector_db_qa,
+    "retrieval_qa": _load_retrieval_qa,
 }


--- a/langchain/chains/qa_with_sources/loading.py
+++ b/langchain/chains/qa_with_sources/loading.py
@@ -149,7 +149,7 @@ def load_qa_with_sources_chain(
    Args:
        llm: Language Model to use in the chain.
        chain_type: Type of document combining chain to use. Should be one of "stuff",
-            "map_reduce", and "refine".
+            "map_reduce", "refine" and "map_rerank".
        verbose: Whether chains should be run in verbose mode or not. Note that this
            applies to all chains that make up the final chain.

--- a/langchain/chains/question_answering/init.py
+++ b/langchain/chains/question_answering/init.py
@@ -3,6 +3,7 @@ from typing import Any, Mapping, Optional, Protocol

 from langchain.base_language import BaseLanguageModel
 from langchain.callbacks.base import BaseCallbackManager
+from langchain.callbacks.manager import Callbacks
 from langchain.chains.combine_documents.base import BaseCombineDocumentsChain
 from langchain.chains.combine_documents.map_reduce import MapReduceDocumentsChain
 from langchain.chains.combine_documents.map_rerank import MapRerankDocumentsChain
@@ -35,10 +36,15 @@ def _load_map_rerank_chain(
    rank_key: str = "score",
    answer_key: str = "answer",
    callback_manager: Optional[BaseCallbackManager] = None,
+    callbacks: Callbacks = None,
    **kwargs: Any,
 ) -> MapRerankDocumentsChain:
    llm_chain = LLMChain(
-        llm=llm, prompt=prompt, verbose=verbose, callback_manager=callback_manager
+        llm=llm,
+        prompt=prompt,
+        verbose=verbose,
+        callback_manager=callback_manager,
+        callbacks=callbacks,
    )
    return MapRerankDocumentsChain(
        llm_chain=llm_chain,
@@ -57,11 +63,16 @@ def _load_stuff_chain(
    document_variable_name: str = "context",
    verbose: Optional[bool] = None,
    callback_manager: Optional[BaseCallbackManager] = None,
+    callbacks: Callbacks = None,
    **kwargs: Any,
 ) -> StuffDocumentsChain:
    _prompt = prompt or stuff_prompt.PROMPT_SELECTOR.get_prompt(llm)
    llm_chain = LLMChain(
-        llm=llm, prompt=_prompt, verbose=verbose, callback_manager=callback_manager
+        llm=llm,
+        prompt=_prompt,
+        verbose=verbose,
+        callback_manager=callback_manager,
+        callbacks=callbacks,
    )
    # TODO: document prompt
    return StuffDocumentsChain(
@@ -84,6 +95,7 @@ def _load_map_reduce_chain(
    collapse_llm: Optional[BaseLanguageModel] = None,
    verbose: Optional[bool] = None,
    callback_manager: Optional[BaseCallbackManager] = None,
+    callbacks: Callbacks = None,
    **kwargs: Any,
 ) -> MapReduceDocumentsChain:
    _question_prompt = (
@@ -97,6 +109,7 @@ def _load_map_reduce_chain(
        prompt=_question_prompt,
        verbose=verbose,
        callback_manager=callback_manager,
+        callbacks=callbacks,
    )
    _reduce_llm = reduce_llm or llm
    reduce_chain = LLMChain(
@@ -104,6 +117,7 @@ def _load_map_reduce_chain(
        prompt=_combine_prompt,
        verbose=verbose,
        callback_manager=callback_manager,
+        callbacks=callbacks,
    )
    # TODO: document prompt
    combine_document_chain = StuffDocumentsChain(
@@ -111,6 +125,7 @@ def _load_map_reduce_chain(
        document_variable_name=combine_document_variable_name,
        verbose=verbose,
        callback_manager=callback_manager,
+        callbacks=callbacks,
    )
    if collapse_prompt is None:
        collapse_chain = None
@@ -127,6 +142,7 @@ def _load_map_reduce_chain(
                prompt=collapse_prompt,
                verbose=verbose,
                callback_manager=callback_manager,
+                callbacks=callbacks,
            ),
            document_variable_name=combine_document_variable_name,
            verbose=verbose,
@@ -139,6 +155,7 @@ def _load_map_reduce_chain(
        collapse_document_chain=collapse_chain,
        verbose=verbose,
        callback_manager=callback_manager,
+        callbacks=callbacks,
        **kwargs,
    )

@@ -152,6 +169,7 @@ def _load_refine_chain(
    refine_llm: Optional[BaseLanguageModel] = None,
    verbose: Optional[bool] = None,
    callback_manager: Optional[BaseCallbackManager] = None,
+    callbacks: Callbacks = None,
    **kwargs: Any,
 ) -> RefineDocumentsChain:
    _question_prompt = (
@@ -165,6 +183,7 @@ def _load_refine_chain(
        prompt=_question_prompt,
        verbose=verbose,
        callback_manager=callback_manager,
+        callbacks=callbacks,
    )
    _refine_llm = refine_llm or llm
    refine_chain = LLMChain(
@@ -172,6 +191,7 @@ def _load_refine_chain(
        prompt=_refine_prompt,
        verbose=verbose,
        callback_manager=callback_manager,
+        callbacks=callbacks,
    )
    return RefineDocumentsChain(
        initial_llm_chain=initial_chain,
--- a/langchain/chains/retrieval_qa/base.py
+++ b/langchain/chains/retrieval_qa/base.py
@@ -183,6 +183,11 @@ class RetrievalQA(BaseRetrievalQA):
    async def _aget_docs(self, question: str) -> List[Document]:
        return await self.retriever.aget_relevant_documents(question)

+    @property
+    def _chain_type(self) -> str:
+        """Return the chain type."""
+        return "retrieval_qa"
+

 class VectorDBQA(BaseRetrievalQA):
    """Chain for question-answering against a vector database."""
--- a/langchain/chat_models/anthropic.py
+++ b/langchain/chat_models/anthropic.py
@@ -94,9 +94,10 @@ class ChatAnthropic(BaseChatModel, _AnthropicCommon):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> ChatResult:
        prompt = self._convert_messages_to_prompt(messages)
-        params: Dict[str, Any] = {"prompt": prompt, **self._default_params}
+        params: Dict[str, Any] = {"prompt": prompt, **self._default_params, **kwargs}
        if stop:
            params["stop_sequences"] = stop

@@ -121,9 +122,10 @@ class ChatAnthropic(BaseChatModel, _AnthropicCommon):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[AsyncCallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> ChatResult:
        prompt = self._convert_messages_to_prompt(messages)
-        params: Dict[str, Any] = {"prompt": prompt, **self._default_params}
+        params: Dict[str, Any] = {"prompt": prompt, **self._default_params, **kwargs}
        if stop:
            params["stop_sequences"] = stop

--- a/langchain/chat_models/azure_openai.py
+++ b/langchain/chat_models/azure_openai.py
@@ -53,33 +53,33 @@ class AzureChatOpenAI(ChatOpenAI):
    @root_validator()
    def validate_environment(cls, values: Dict) -> Dict:
        """Validate that api key and python package exists in environment."""
-        openai_api_key = get_from_dict_or_env(
+        values["openai_api_key"] = get_from_dict_or_env(
            values,
            "openai_api_key",
            "OPENAI_API_KEY",
        )
-        openai_api_base = get_from_dict_or_env(
+        values["openai_api_base"] = get_from_dict_or_env(
            values,
            "openai_api_base",
            "OPENAI_API_BASE",
        )
-        openai_api_version = get_from_dict_or_env(
+        values["openai_api_version"] = get_from_dict_or_env(
            values,
            "openai_api_version",
            "OPENAI_API_VERSION",
        )
-        openai_api_type = get_from_dict_or_env(
+        values["openai_api_type"] = get_from_dict_or_env(
            values,
            "openai_api_type",
            "OPENAI_API_TYPE",
        )
-        openai_organization = get_from_dict_or_env(
+        values["openai_organization"] = get_from_dict_or_env(
            values,
            "openai_organization",
            "OPENAI_ORGANIZATION",
            default="",
        )
-        openai_proxy = get_from_dict_or_env(
+        values["openai_proxy"] = get_from_dict_or_env(
            values,
            "openai_proxy",
            "OPENAI_PROXY",
@@ -88,14 +88,6 @@ class AzureChatOpenAI(ChatOpenAI):
        try:
            import openai

-            openai.api_type = openai_api_type
-            openai.api_base = openai_api_base
-            openai.api_version = openai_api_version
-            openai.api_key = openai_api_key
-            if openai_organization:
-                openai.organization = openai_organization
-            if openai_proxy:
-                openai.proxy = {"http": openai_proxy, "https": openai_proxy}  # type: ignore[assignment]  # noqa: E501
        except ImportError:
            raise ImportError(
                "Could not import openai python package. "
@@ -128,6 +120,14 @@ class AzureChatOpenAI(ChatOpenAI):
        """Get the identifying parameters."""
        return {**self._default_params}

+    @property
+    def _invocation_params(self) -> Mapping[str, Any]:
+        openai_creds = {
+            "api_type": self.openai_api_type,
+            "api_version": self.openai_api_version,
+        }
+        return {**openai_creds, **super()._invocation_params}
+
    @property
    def _llm_type(self) -> str:
        return "azure-openai-chat"
--- a/langchain/chat_models/base.py
+++ b/langchain/chat_models/base.py
@@ -25,6 +25,7 @@ from langchain.schema import (
    HumanMessage,
    LLMResult,
    PromptValue,
+    RunInfo,
 )


@@ -63,6 +64,7 @@ class BaseChatModel(BaseLanguageModel, ABC):
        messages: List[List[BaseMessage]],
        stop: Optional[List[str]] = None,
        callbacks: Callbacks = None,
+        **kwargs: Any,
    ) -> LLMResult:
        """Top Level call"""

@@ -81,7 +83,7 @@ class BaseChatModel(BaseLanguageModel, ABC):
        )
        try:
            results = [
-                self._generate(m, stop=stop, run_manager=run_manager)
+                self._generate(m, stop=stop, run_manager=run_manager, **kwargs)
                if new_arg_supported
                else self._generate(m, stop=stop)
                for m in messages
@@ -93,6 +95,8 @@ class BaseChatModel(BaseLanguageModel, ABC):
        generations = [res.generations for res in results]
        output = LLMResult(generations=generations, llm_output=llm_output)
        run_manager.on_llm_end(output)
+        if run_manager:
+            output.run = RunInfo(run_id=run_manager.run_id)
        return output

    async def agenerate(
@@ -100,6 +104,7 @@ class BaseChatModel(BaseLanguageModel, ABC):
        messages: List[List[BaseMessage]],
        stop: Optional[List[str]] = None,
        callbacks: Callbacks = None,
+        **kwargs: Any,
    ) -> LLMResult:
        """Top Level call"""
        params = self.dict()
@@ -118,7 +123,7 @@ class BaseChatModel(BaseLanguageModel, ABC):
        try:
            results = await asyncio.gather(
                *[
-                    self._agenerate(m, stop=stop, run_manager=run_manager)
+                    self._agenerate(m, stop=stop, run_manager=run_manager, **kwargs)
                    if new_arg_supported
                    else self._agenerate(m, stop=stop)
                    for m in messages
@@ -131,6 +136,8 @@ class BaseChatModel(BaseLanguageModel, ABC):
        generations = [res.generations for res in results]
        output = LLMResult(generations=generations, llm_output=llm_output)
        await run_manager.on_llm_end(output)
+        if run_manager:
+            output.run = RunInfo(run_id=run_manager.run_id)
        return output

    def generate_prompt(
@@ -138,18 +145,22 @@ class BaseChatModel(BaseLanguageModel, ABC):
        prompts: List[PromptValue],
        stop: Optional[List[str]] = None,
        callbacks: Callbacks = None,
+        **kwargs: Any,
    ) -> LLMResult:
        prompt_messages = [p.to_messages() for p in prompts]
-        return self.generate(prompt_messages, stop=stop, callbacks=callbacks)
+        return self.generate(prompt_messages, stop=stop, callbacks=callbacks, **kwargs)

    async def agenerate_prompt(
        self,
        prompts: List[PromptValue],
        stop: Optional[List[str]] = None,
        callbacks: Callbacks = None,
+        **kwargs: Any,
    ) -> LLMResult:
        prompt_messages = [p.to_messages() for p in prompts]
-        return await self.agenerate(prompt_messages, stop=stop, callbacks=callbacks)
+        return await self.agenerate(
+            prompt_messages, stop=stop, callbacks=callbacks, **kwargs
+        )

    @abstractmethod
    def _generate(
@@ -157,6 +168,7 @@ class BaseChatModel(BaseLanguageModel, ABC):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> ChatResult:
        """Top Level call"""

@@ -166,6 +178,7 @@ class BaseChatModel(BaseLanguageModel, ABC):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[AsyncCallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> ChatResult:
        """Top Level call"""

@@ -188,18 +201,25 @@ class BaseChatModel(BaseLanguageModel, ABC):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        callbacks: Callbacks = None,
+        **kwargs: Any,
    ) -> BaseMessage:
-        result = await self.agenerate([messages], stop=stop, callbacks=callbacks)
+        result = await self.agenerate(
+            [messages], stop=stop, callbacks=callbacks, **kwargs
+        )
        generation = result.generations[0][0]
        if isinstance(generation, ChatGeneration):
            return generation.message
        else:
            raise ValueError("Unexpected generation type")

-    def call_as_llm(self, message: str, stop: Optional[List[str]] = None) -> str:
-        return self.predict(message, stop=stop)
+    def call_as_llm(
+        self, message: str, stop: Optional[List[str]] = None, **kwargs: Any
+    ) -> str:
+        return self.predict(message, stop=stop, **kwargs)

-    def predict(self, text: str, *, stop: Optional[Sequence[str]] = None) -> str:
+    def predict(
+        self, text: str, *, stop: Optional[Sequence[str]] = None, **kwargs: Any
+    ) -> str:
        if stop is None:
            _stop = None
        else:
@@ -208,30 +228,42 @@ class BaseChatModel(BaseLanguageModel, ABC):
        return result.content

    def predict_messages(
-        self, messages: List[BaseMessage], *, stop: Optional[Sequence[str]] = None
+        self,
+        messages: List[BaseMessage],
+        *,
+        stop: Optional[Sequence[str]] = None,
+        **kwargs: Any,
    ) -> BaseMessage:
        if stop is None:
            _stop = None
        else:
            _stop = list(stop)
-        return self(messages, stop=_stop)
+        return self(messages, stop=_stop, **kwargs)

-    async def apredict(self, text: str, *, stop: Optional[Sequence[str]] = None) -> str:
+    async def apredict(
+        self, text: str, *, stop: Optional[Sequence[str]] = None, **kwargs: Any
+    ) -> str:
        if stop is None:
            _stop = None
        else:
            _stop = list(stop)
-        result = await self._call_async([HumanMessage(content=text)], stop=_stop)
+        result = await self._call_async(
+            [HumanMessage(content=text)], stop=_stop, **kwargs
+        )
        return result.content

    async def apredict_messages(
-        self, messages: List[BaseMessage], *, stop: Optional[Sequence[str]] = None
+        self,
+        messages: List[BaseMessage],
+        *,
+        stop: Optional[Sequence[str]] = None,
+        **kwargs: Any,
    ) -> BaseMessage:
        if stop is None:
            _stop = None
        else:
            _stop = list(stop)
-        return await self._call_async(messages, stop=_stop)
+        return await self._call_async(messages, stop=_stop, **kwargs)

    @property
    def _identifying_params(self) -> Mapping[str, Any]:
@@ -256,8 +288,9 @@ class SimpleChatModel(BaseChatModel):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> ChatResult:
-        output_str = self._call(messages, stop=stop, run_manager=run_manager)
+        output_str = self._call(messages, stop=stop, run_manager=run_manager, **kwargs)
        message = AIMessage(content=output_str)
        generation = ChatGeneration(message=message)
        return ChatResult(generations=[generation])
@@ -268,6 +301,7 @@ class SimpleChatModel(BaseChatModel):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> str:
        """Simpler interface."""

@@ -276,6 +310,9 @@ class SimpleChatModel(BaseChatModel):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[AsyncCallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> ChatResult:
-        func = partial(self._generate, messages, stop=stop, run_manager=run_manager)
+        func = partial(
+            self._generate, messages, stop=stop, run_manager=run_manager, **kwargs
+        )
        return await asyncio.get_event_loop().run_in_executor(None, func)
--- a/langchain/chat_models/google_palm.py
+++ b/langchain/chat_models/google_palm.py
@@ -280,6 +280,7 @@ class ChatGooglePalm(BaseChatModel, BaseModel):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> ChatResult:
        prompt = _messages_to_prompt_dict(messages)

@@ -291,6 +292,7 @@ class ChatGooglePalm(BaseChatModel, BaseModel):
            top_p=self.top_p,
            top_k=self.top_k,
            candidate_count=self.n,
+            **kwargs,
        )

        return _response_to_result(response, stop)
@@ -300,6 +302,7 @@ class ChatGooglePalm(BaseChatModel, BaseModel):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[AsyncCallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> ChatResult:
        prompt = _messages_to_prompt_dict(messages)

--- a/langchain/chat_models/openai.py
+++ b/langchain/chat_models/openai.py
@@ -196,22 +196,22 @@ class ChatOpenAI(BaseChatModel):
    @root_validator()
    def validate_environment(cls, values: Dict) -> Dict:
        """Validate that api key and python package exists in environment."""
-        openai_api_key = get_from_dict_or_env(
+        values["openai_api_key"] = get_from_dict_or_env(
            values, "openai_api_key", "OPENAI_API_KEY"
        )
-        openai_organization = get_from_dict_or_env(
+        values["openai_organization"] = get_from_dict_or_env(
            values,
            "openai_organization",
            "OPENAI_ORGANIZATION",
            default="",
        )
-        openai_api_base = get_from_dict_or_env(
+        values["openai_api_base"] = get_from_dict_or_env(
            values,
            "openai_api_base",
            "OPENAI_API_BASE",
            default="",
        )
-        openai_proxy = get_from_dict_or_env(
+        values["openai_proxy"] = get_from_dict_or_env(
            values,
            "openai_proxy",
            "OPENAI_PROXY",
@@ -225,13 +225,6 @@ class ChatOpenAI(BaseChatModel):
                "Could not import openai python package. "
                "Please install it with `pip install openai`."
            )
-        openai.api_key = openai_api_key
-        if openai_organization:
-            openai.organization = openai_organization
-        if openai_api_base:
-            openai.api_base = openai_api_base
-        if openai_proxy:
-            openai.proxy = {"http": openai_proxy, "https": openai_proxy}  # type: ignore[assignment]  # noqa: E501
        try:
            values["client"] = openai.ChatCompletion
        except AttributeError:
@@ -309,8 +302,10 @@ class ChatOpenAI(BaseChatModel):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> ChatResult:
        message_dicts, params = self._create_message_dicts(messages, stop)
+        params = {**params, **kwargs}
        if self.streaming:
            inner_completion = ""
            role = "assistant"
@@ -333,7 +328,7 @@ class ChatOpenAI(BaseChatModel):
    def _create_message_dicts(
        self, messages: List[BaseMessage], stop: Optional[List[str]]
    ) -> Tuple[List[Dict[str, Any]], Dict[str, Any]]:
-        params: Dict[str, Any] = {**{"model": self.model_name}, **self._default_params}
+        params = dict(self._invocation_params)
        if stop is not None:
            if "stop" in params:
                raise ValueError("`stop` found in both the input and default params.")
@@ -355,8 +350,10 @@ class ChatOpenAI(BaseChatModel):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[AsyncCallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> ChatResult:
        message_dicts, params = self._create_message_dicts(messages, stop)
+        params = {**params, **kwargs}
        if self.streaming:
            inner_completion = ""
            role = "assistant"
@@ -384,6 +381,21 @@ class ChatOpenAI(BaseChatModel):
        """Get the identifying parameters."""
        return {**{"model_name": self.model_name}, **self._default_params}

+    @property
+    def _invocation_params(self) -> Mapping[str, Any]:
+        """Get the parameters used to invoke the model."""
+        openai_creds: Dict[str, Any] = {
+            "api_key": self.openai_api_key,
+            "api_base": self.openai_api_base,
+            "organization": self.openai_organization,
+            "model": self.model_name,
+        }
+        if self.openai_proxy:
+            import openai
+
+            openai.proxy = {"http": self.openai_proxy, "https": self.openai_proxy}  # type: ignore[assignment]  # noqa: E501
+        return {**openai_creds, **self._default_params}
+
    @property
    def _llm_type(self) -> str:
        """Return type of chat model."""
--- a/langchain/chat_models/promptlayer_openai.py
+++ b/langchain/chat_models/promptlayer_openai.py
@@ -42,6 +42,7 @@ class PromptLayerChatOpenAI(ChatOpenAI):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any
    ) -> ChatResult:
        """Call ChatOpenAI generate and then call PromptLayer API to log the request."""
        from promptlayer.utils import get_api_key, promptlayer_api_request
@@ -54,6 +55,7 @@ class PromptLayerChatOpenAI(ChatOpenAI):
            response_dict, params = super()._create_message_dicts(
                [generation.message], stop
            )
+            params = {**params, **kwargs}
            pl_request_id = promptlayer_api_request(
                "langchain.PromptLayerChatOpenAI",
                "langchain",
@@ -79,6 +81,7 @@ class PromptLayerChatOpenAI(ChatOpenAI):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[AsyncCallbackManagerForLLMRun] = None,
+        **kwargs: Any
    ) -> ChatResult:
        """Call ChatOpenAI agenerate and then call PromptLayer to log."""
        from promptlayer.utils import get_api_key, promptlayer_api_request_async
@@ -91,6 +94,7 @@ class PromptLayerChatOpenAI(ChatOpenAI):
            response_dict, params = super()._create_message_dicts(
                [generation.message], stop
            )
+            params = {**params, **kwargs}
            pl_request_id = await promptlayer_api_request_async(
                "langchain.PromptLayerChatOpenAI.async",
                "langchain",
--- a/langchain/chat_models/vertexai.py
+++ b/langchain/chat_models/vertexai.py
@@ -1,6 +1,6 @@
 """Wrapper around Google VertexAI chat-based models."""
 from dataclasses import dataclass, field
-from typing import Dict, List, Optional
+from typing import Any, Dict, List, Optional

 from pydantic import root_validator

@@ -93,6 +93,7 @@ class ChatVertexAI(_VertexAICommon, BaseChatModel):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> ChatResult:
        """Generate next turn in the conversation.

@@ -119,7 +120,8 @@ class ChatVertexAI(_VertexAICommon, BaseChatModel):

        history = _parse_chat_history(messages[:-1])
        context = history.system_message.content if history.system_message else None
-        chat = self.client.start_chat(context=context, **self._default_params)
+        params = {**self._default_params, **kwargs}
+        chat = self.client.start_chat(context=context, **params)
        for pair in history.history:
            chat._history.append((pair.question.content, pair.answer.content))
        response = chat.send_message(question.content, **self._default_params)
@@ -131,6 +133,7 @@ class ChatVertexAI(_VertexAICommon, BaseChatModel):
        messages: List[BaseMessage],
        stop: Optional[List[str]] = None,
        run_manager: Optional[AsyncCallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> ChatResult:
        raise NotImplementedError(
            """Vertex AI doesn't support async requests at the moment."""
--- a/langchain/document_loaders/init.py
+++ b/langchain/document_loaders/init.py
@@ -1,6 +1,7 @@
 """All different types of document loaders."""

 from langchain.document_loaders.airbyte_json import AirbyteJSONLoader
+from langchain.document_loaders.airtable import AirtableLoader
 from langchain.document_loaders.apify_dataset import ApifyDatasetLoader
 from langchain.document_loaders.arxiv import ArxivLoader
 from langchain.document_loaders.azlyrics import AZLyricsLoader
@@ -19,7 +20,7 @@ from langchain.document_loaders.chatgpt import ChatGPTLoader
 from langchain.document_loaders.college_confidential import CollegeConfidentialLoader
 from langchain.document_loaders.confluence import ConfluenceLoader
 from langchain.document_loaders.conllu import CoNLLULoader
-from langchain.document_loaders.csv_loader import CSVLoader
+from langchain.document_loaders.csv_loader import CSVLoader, UnstructuredCSVLoader
 from langchain.document_loaders.dataframe import DataFrameLoader
 from langchain.document_loaders.diffbot import DiffbotLoader
 from langchain.document_loaders.directory import DirectoryLoader
@@ -34,6 +35,7 @@ from langchain.document_loaders.epub import UnstructuredEPubLoader
 from langchain.document_loaders.evernote import EverNoteLoader
 from langchain.document_loaders.excel import UnstructuredExcelLoader
 from langchain.document_loaders.facebook_chat import FacebookChatLoader
+from langchain.document_loaders.fauna import FaunaLoader
 from langchain.document_loaders.figma import FigmaFileLoader
 from langchain.document_loaders.gcs_directory import GCSDirectoryLoader
 from langchain.document_loaders.gcs_file import GCSFileLoader
@@ -89,6 +91,7 @@ from langchain.document_loaders.s3_directory import S3DirectoryLoader
 from langchain.document_loaders.s3_file import S3FileLoader
 from langchain.document_loaders.sitemap import SitemapLoader
 from langchain.document_loaders.slack_directory import SlackDirectoryLoader
+from langchain.document_loaders.snowflake_loader import SnowflakeLoader
 from langchain.document_loaders.spreedly import SpreedlyLoader
 from langchain.document_loaders.srt import SRTLoader
 from langchain.document_loaders.stripe import StripeLoader
@@ -118,6 +121,7 @@ from langchain.document_loaders.word_document import (
    Docx2txtLoader,
    UnstructuredWordDocumentLoader,
 )
+from langchain.document_loaders.xml import UnstructuredXMLLoader
 from langchain.document_loaders.youtube import (
    GoogleApiClient,
    GoogleApiYoutubeLoader,
@@ -133,6 +137,7 @@ TelegramChatLoader = TelegramChatFileLoader
 __all__ = [
    "AZLyricsLoader",
    "AirbyteJSONLoader",
+    "AirtableLoader",
    "ApifyDatasetLoader",
    "ArxivLoader",
    "AzureBlobStorageContainerLoader",
@@ -155,6 +160,7 @@ __all__ = [
    "DocugamiLoader",
    "Docx2txtLoader",
    "DuckDBLoader",
+    "FaunaLoader",
    "EverNoteLoader",
    "FacebookChatLoader",
    "FigmaFileLoader",
@@ -222,6 +228,7 @@ __all__ = [
    "TwitterTweetLoader",
    "UnstructuredAPIFileIOLoader",
    "UnstructuredAPIFileLoader",
+    "UnstructuredCSVLoader",
    "UnstructuredEPubLoader",
    "UnstructuredEmailLoader",
    "UnstructuredExcelLoader",
@@ -236,9 +243,11 @@ __all__ = [
    "UnstructuredRTFLoader",
    "UnstructuredURLLoader",
    "UnstructuredWordDocumentLoader",
+    "UnstructuredXMLLoader",
    "WeatherDataLoader",
    "WebBaseLoader",
    "WhatsAppChatLoader",
    "WikipediaLoader",
    "YoutubeLoader",
+    "SnowflakeLoader",
 ]
--- a/langchain/document_loaders/airtable.py
+++ b/langchain/document_loaders/airtable.py
@@ -0,0 +1,36 @@
+from typing import Iterator, List
+
+from langchain.docstore.document import Document
+from langchain.document_loaders.base import BaseLoader
+
+
+class AirtableLoader(BaseLoader):
+    """Loader that loads local airbyte json files."""
+
+    def __init__(self, api_token: str, table_id: str, base_id: str):
+        """Initialize with API token and the IDs for table and base"""
+        self.api_token = api_token
+        self.table_id = table_id
+        self.base_id = base_id
+
+    def lazy_load(self) -> Iterator[Document]:
+        """Load Table."""
+
+        from pyairtable import Table
+
+        table = Table(self.api_token, self.base_id, self.table_id)
+        records = table.all()
+        for record in records:
+            # Need to convert record from dict to str
+            yield Document(
+                page_content=str(record),
+                metadata={
+                    "source": self.base_id + "_" + self.table_id,
+                    "base_id": self.base_id,
+                    "table_id": self.table_id,
+                },
+            )
+
+    def load(self) -> List[Document]:
+        """Load Table."""
+        return list(self.lazy_load())
--- a/langchain/document_loaders/blob_loaders/init.py
+++ b/langchain/document_loaders/blob_loaders/init.py
@@ -1,4 +1,5 @@
 from langchain.document_loaders.blob_loaders.file_system import FileSystemBlobLoader
 from langchain.document_loaders.blob_loaders.schema import Blob, BlobLoader
+from langchain.document_loaders.blob_loaders.youtube_audio import YoutubeAudioLoader

-__all__ = ["BlobLoader", "Blob", "FileSystemBlobLoader"]
+__all__ = ["BlobLoader", "Blob", "FileSystemBlobLoader", "YoutubeAudioLoader"]
--- a/langchain/document_loaders/blob_loaders/youtube_audio.py
+++ b/langchain/document_loaders/blob_loaders/youtube_audio.py
@@ -0,0 +1,50 @@
+from typing import Iterable, List
+
+from langchain.document_loaders.blob_loaders import FileSystemBlobLoader
+from langchain.document_loaders.blob_loaders.schema import Blob, BlobLoader
+
+
+class YoutubeAudioLoader(BlobLoader):
+
+    """Load YouTube urls as audio file(s)."""
+
+    def __init__(self, urls: List[str], save_dir: str):
+        if not isinstance(urls, list):
+            raise TypeError("urls must be a list")
+
+        self.urls = urls
+        self.save_dir = save_dir
+
+    def yield_blobs(self) -> Iterable[Blob]:
+        """Yield audio blobs for each url."""
+
+        try:
+            import yt_dlp
+        except ImportError:
+            raise ValueError(
+                "yt_dlp package not found, please install it with "
+                "`pip install yt_dlp`"
+            )
+
+        # Use yt_dlp to download audio given a YouTube url
+        ydl_opts = {
+            "format": "m4a/bestaudio/best",
+            "noplaylist": True,
+            "outtmpl": self.save_dir + "/%(title)s.%(ext)s",
+            "postprocessors": [
+                {
+                    "key": "FFmpegExtractAudio",
+                    "preferredcodec": "m4a",
+                }
+            ],
+        }
+
+        for url in self.urls:
+            # Download file
+            with yt_dlp.YoutubeDL(ydl_opts) as ydl:
+                ydl.download(url)
+
+        # Yield the written blobs
+        loader = FileSystemBlobLoader(self.save_dir, glob="*.m4a")
+        for blob in loader.yield_blobs():
+            yield blob
--- a/langchain/document_loaders/confluence.py
+++ b/langchain/document_loaders/confluence.py
@@ -180,6 +180,7 @@ class ConfluenceLoader(BaseLoader):
        include_comments: bool = False,
        limit: Optional[int] = 50,
        max_pages: Optional[int] = 1000,
+        ocr_languages: Optional[str] = None,
    ) -> List[Document]:
        """
        :param space_key: Space key retrieved from a confluence URL, defaults to None
@@ -203,6 +204,10 @@ class ConfluenceLoader(BaseLoader):
        :type limit: int, optional
        :param max_pages: Maximum number of pages to retrieve in total, defaults 1000
        :type max_pages: int, optional
+        :param ocr_languages: The languages to use for the Tesseract agent. To use a
+                              language, you'll first need to install the appropriate
+                              Tesseract language pack.
+        :type ocr_languages: str, optional
        :raises ValueError: _description_
        :raises ImportError: _description_
        :return: _description_
@@ -226,7 +231,11 @@ class ConfluenceLoader(BaseLoader):
                expand="body.storage.value",
            )
            docs += self.process_pages(
-                pages, include_restricted_content, include_attachments, include_comments
+                pages,
+                include_restricted_content,
+                include_attachments,
+                include_comments,
+                ocr_languages,
            )

        if label:
@@ -252,7 +261,11 @@ class ConfluenceLoader(BaseLoader):
                expand="body.storage.value",
            )
            docs += self.process_pages(
-                pages, include_restricted_content, include_attachments, include_comments
+                pages,
+                include_restricted_content,
+                include_attachments,
+                include_comments,
+                ocr_languages,
            )

        if page_ids:
@@ -272,7 +285,9 @@ class ConfluenceLoader(BaseLoader):
                page = get_page(page_id=page_id, expand="body.storage.value")
                if not include_restricted_content and not self.is_public_page(page):
                    continue
-                doc = self.process_page(page, include_attachments, include_comments)
+                doc = self.process_page(
+                    page, include_attachments, include_comments, ocr_languages
+                )
                docs.append(doc)

        return docs
@@ -335,13 +350,16 @@ class ConfluenceLoader(BaseLoader):
        include_restricted_content: bool,
        include_attachments: bool,
        include_comments: bool,
+        ocr_languages: Optional[str] = None,
    ) -> List[Document]:
        """Process a list of pages into a list of documents."""
        docs = []
        for page in pages:
            if not include_restricted_content and not self.is_public_page(page):
                continue
-            doc = self.process_page(page, include_attachments, include_comments)
+            doc = self.process_page(
+                page, include_attachments, include_comments, ocr_languages
+            )
            docs.append(doc)

        return docs
@@ -351,6 +369,7 @@ class ConfluenceLoader(BaseLoader):
        page: dict,
        include_attachments: bool,
        include_comments: bool,
+        ocr_languages: Optional[str] = None,
    ) -> Document:
        try:
            from bs4 import BeautifulSoup  # type: ignore
@@ -361,7 +380,7 @@ class ConfluenceLoader(BaseLoader):
            )

        if include_attachments:
-            attachment_texts = self.process_attachment(page["id"])
+            attachment_texts = self.process_attachment(page["id"], ocr_languages)
        else:
            attachment_texts = []
        text = BeautifulSoup(page["body"]["storage"]["value"], "lxml").get_text(
@@ -388,7 +407,11 @@ class ConfluenceLoader(BaseLoader):
            },
        )

-    def process_attachment(self, page_id: str) -> List[str]:
+    def process_attachment(
+        self,
+        page_id: str,
+        ocr_languages: Optional[str] = None,
+    ) -> List[str]:
        try:
            from PIL import Image  # noqa: F401
        except ImportError:
@@ -405,13 +428,13 @@ class ConfluenceLoader(BaseLoader):
            absolute_url = self.base_url + attachment["_links"]["download"]
            title = attachment["title"]
            if media_type == "application/pdf":
-                text = title + self.process_pdf(absolute_url)
+                text = title + self.process_pdf(absolute_url, ocr_languages)
            elif (
                media_type == "image/png"
                or media_type == "image/jpg"
                or media_type == "image/jpeg"
            ):
-                text = title + self.process_image(absolute_url)
+                text = title + self.process_image(absolute_url, ocr_languages)
            elif (
                media_type == "application/vnd.openxmlformats-officedocument"
                ".wordprocessingml.document"
@@ -420,14 +443,18 @@ class ConfluenceLoader(BaseLoader):
            elif media_type == "application/vnd.ms-excel":
                text = title + self.process_xls(absolute_url)
            elif media_type == "image/svg+xml":
-                text = title + self.process_svg(absolute_url)
+                text = title + self.process_svg(absolute_url, ocr_languages)
            else:
                continue
            texts.append(text)

        return texts

-    def process_pdf(self, link: str) -> str:
+    def process_pdf(
+        self,
+        link: str,
+        ocr_languages: Optional[str] = None,
+    ) -> str:
        try:
            import pytesseract  # noqa: F401
            from pdf2image import convert_from_bytes  # noqa: F401
@@ -452,12 +479,16 @@ class ConfluenceLoader(BaseLoader):
            return text

        for i, image in enumerate(images):
-            image_text = pytesseract.image_to_string(image)
+            image_text = pytesseract.image_to_string(image, lang=ocr_languages)
            text += f"Page {i + 1}:\n{image_text}\n\n"

        return text

-    def process_image(self, link: str) -> str:
+    def process_image(
+        self,
+        link: str,
+        ocr_languages: Optional[str] = None,
+    ) -> str:
        try:
            import pytesseract  # noqa: F401
            from PIL import Image  # noqa: F401
@@ -481,7 +512,7 @@ class ConfluenceLoader(BaseLoader):
        except OSError:
            return text

-        return pytesseract.image_to_string(image)
+        return pytesseract.image_to_string(image, lang=ocr_languages)

    def process_doc(self, link: str) -> str:
        try:
@@ -531,7 +562,11 @@ class ConfluenceLoader(BaseLoader):

        return text

-    def process_svg(self, link: str) -> str:
+    def process_svg(
+        self,
+        link: str,
+        ocr_languages: Optional[str] = None,
+    ) -> str:
        try:
            import pytesseract  # noqa: F401
            from PIL import Image  # noqa: F401
@@ -560,4 +595,4 @@ class ConfluenceLoader(BaseLoader):
        img_data.seek(0)
        image = Image.open(img_data)

-        return pytesseract.image_to_string(image)
+        return pytesseract.image_to_string(image, lang=ocr_languages)
--- a/langchain/document_loaders/csv_loader.py
+++ b/langchain/document_loaders/csv_loader.py
@@ -1,8 +1,12 @@
 import csv
-from typing import Dict, List, Optional
+from typing import Any, Dict, List, Optional

 from langchain.docstore.document import Document
 from langchain.document_loaders.base import BaseLoader
+from langchain.document_loaders.unstructured import (
+    UnstructuredFileLoader,
+    validate_unstructured_version,
+)


 class CSVLoader(BaseLoader):
@@ -61,3 +65,18 @@ class CSVLoader(BaseLoader):
                docs.append(doc)

        return docs
+
+
+class UnstructuredCSVLoader(UnstructuredFileLoader):
+    """Loader that uses unstructured to load CSV files."""
+
+    def __init__(
+        self, file_path: str, mode: str = "single", **unstructured_kwargs: Any
+    ):
+        validate_unstructured_version(min_unstructured_version="0.6.8")
+        super().__init__(file_path=file_path, mode=mode, **unstructured_kwargs)
+
+    def _get_elements(self) -> List:
+        from unstructured.partition.csv import partition_csv
+
+        return partition_csv(filename=self.file_path, **self.unstructured_kwargs)
--- a/langchain/document_loaders/fauna.py
+++ b/langchain/document_loaders/fauna.py
@@ -0,0 +1,63 @@
+from typing import Iterator, List, Optional, Sequence
+
+from langchain.docstore.document import Document
+from langchain.document_loaders.base import BaseLoader
+
+
+class FaunaLoader(BaseLoader):
+    """
+    Attributes:
+        query (str): The FQL query string to execute.
+        page_content_field (str): The field that contains the content of each page.
+        secret (str): The secret key for authenticating to FaunaDB.
+        metadata_fields (Optional[Sequence[str]]):
+            Optional list of field names to include in metadata.
+    """
+
+    def __init__(
+        self,
+        query: str,
+        page_content_field: str,
+        secret: str,
+        metadata_fields: Optional[Sequence[str]] = None,
+    ):
+        self.query = query
+        self.page_content_field = page_content_field
+        self.secret = secret
+        self.metadata_fields = metadata_fields
+
+    def load(self) -> List[Document]:
+        return list(self.lazy_load())
+
+    def lazy_load(self) -> Iterator[Document]:
+        try:
+            from fauna import Page, fql
+            from fauna.client import Client
+            from fauna.encoding import QuerySuccess
+        except ImportError:
+            raise ImportError(
+                "Could not import fauna python package. "
+                "Please install it with `pip install fauna`."
+            )
+        # Create Fauna Client
+        client = Client(secret=self.secret)
+        # Run FQL Query
+        response: QuerySuccess = client.query(fql(self.query))
+        page: Page = response.data
+        for result in page:
+            if result is not None:
+                document_dict = dict(result.items())
+                page_content = ""
+                for key, value in document_dict.items():
+                    if key == self.page_content_field:
+                        page_content = value
+                document: Document = Document(
+                    page_content=page_content,
+                    metadata={"id": result.id, "ts": result.ts},
+                )
+                yield document
+        if page.after is not None:
+            yield Document(
+                page_content="Next Page Exists",
+                metadata={"after": page.after},
+            )
--- a/langchain/document_loaders/parsers/audio.py
+++ b/langchain/document_loaders/parsers/audio.py
@@ -12,10 +12,45 @@ class OpenAIWhisperParser(BaseBlobParser):
    def lazy_parse(self, blob: Blob) -> Iterator[Document]:
        """Lazily parse the blob."""

-        import openai
+        import io

-        with blob.as_bytes_io() as f:
-            transcript = openai.Audio.transcribe("whisper-1", f)
-            yield Document(
-                page_content=transcript.text, metadata={"source": blob.source}
+        try:
+            import openai
+        except ImportError:
+            raise ValueError(
+                "openai package not found, please install it with "
+                "`pip install openai`"
+            )
+        try:
+            from pydub import AudioSegment
+        except ImportError:
+            raise ValueError(
+                "pydub package not found, please install it with " "`pip install pydub`"
+            )
+
+        # Audio file from disk
+        audio = AudioSegment.from_file(blob.path)
+
+        # Define the duration of each chunk in minutes
+        # Need to meet 25MB size limit for Whisper API
+        chunk_duration = 20
+        chunk_duration_ms = chunk_duration * 60 * 1000
+
+        # Split the audio into chunk_duration_ms chunks
+        for split_number, i in enumerate(range(0, len(audio), chunk_duration_ms)):
+            # Audio chunk
+            chunk = audio[i : i + chunk_duration_ms]
+            file_obj = io.BytesIO(chunk.export(format="mp3").read())
+            if blob.source is not None:
+                file_obj.name = blob.source + f"_part_{split_number}.mp3"
+            else:
+                file_obj.name = f"part_{split_number}.mp3"
+
+            # Transcribe
+            print(f"Transcribing part {split_number+1}!")
+            transcript = openai.Audio.transcribe("whisper-1", file_obj)
+
+            yield Document(
+                page_content=transcript.text,
+                metadata={"source": blob.source, "chunk": split_number},
            )
--- a/langchain/document_loaders/snowflake_loader.py
+++ b/langchain/document_loaders/snowflake_loader.py
@@ -0,0 +1,126 @@
+from __future__ import annotations
+
+from typing import Any, Dict, Iterator, List, Optional, Tuple
+
+from langchain.docstore.document import Document
+from langchain.document_loaders.base import BaseLoader
+
+
+class SnowflakeLoader(BaseLoader):
+    """Loads a query result from Snowflake into a list of documents.
+
+    Each document represents one row of the result. The `page_content_columns`
+    are written into the `page_content` of the document. The `metadata_columns`
+    are written into the `metadata` of the document. By default, all columns
+    are written into the `page_content` and none into the `metadata`.
+
+    """
+
+    def __init__(
+        self,
+        query: str,
+        user: str,
+        password: str,
+        account: str,
+        warehouse: str,
+        role: str,
+        database: str,
+        schema: str,
+        parameters: Optional[Dict[str, Any]] = None,
+        page_content_columns: Optional[List[str]] = None,
+        metadata_columns: Optional[List[str]] = None,
+    ):
+        """Initialize Snowflake document loader.
+
+        Args:
+            query: The query to run in Snowflake.
+            user: Snowflake user.
+            password: Snowflake password.
+            account: Snowflake account.
+            warehouse: Snowflake warehouse.
+            role: Snowflake role.
+            database: Snowflake database
+            schema: Snowflake schema
+            page_content_columns: Optional. Columns written to Document `page_content`.
+            metadata_columns: Optional. Columns written to Document `metadata`.
+        """
+        self.query = query
+        self.user = user
+        self.password = password
+        self.account = account
+        self.warehouse = warehouse
+        self.role = role
+        self.database = database
+        self.schema = schema
+        self.parameters = parameters
+        self.page_content_columns = (
+            page_content_columns if page_content_columns is not None else ["*"]
+        )
+        self.metadata_columns = metadata_columns if metadata_columns is not None else []
+
+    def _execute_query(self) -> List[Dict[str, Any]]:
+        try:
+            import snowflake.connector
+        except ImportError as ex:
+            raise ValueError(
+                "Could not import snowflake-connector-python package. "
+                "Please install it with `pip install snowflake-connector-python`."
+            ) from ex
+
+        conn = snowflake.connector.connect(
+            user=self.user,
+            password=self.password,
+            account=self.account,
+            warehouse=self.warehouse,
+            role=self.role,
+            database=self.database,
+            schema=self.schema,
+            parameters=self.parameters,
+        )
+        try:
+            cur = conn.cursor()
+            cur.execute("USE DATABASE " + self.database)
+            cur.execute("USE SCHEMA " + self.schema)
+            cur.execute(self.query, self.parameters)
+            query_result = cur.fetchall()
+            column_names = [column[0] for column in cur.description]
+            query_result = [dict(zip(column_names, row)) for row in query_result]
+        except Exception as e:
+            print(f"An error occurred: {e}")
+            query_result = []
+        finally:
+            cur.close()
+        return query_result
+
+    def _get_columns(
+        self, query_result: List[Dict[str, Any]]
+    ) -> Tuple[List[str], List[str]]:
+        page_content_columns = (
+            self.page_content_columns if self.page_content_columns else []
+        )
+        metadata_columns = self.metadata_columns if self.metadata_columns else []
+        if page_content_columns is None and query_result:
+            page_content_columns = list(query_result[0].keys())
+        if metadata_columns is None:
+            metadata_columns = []
+        return page_content_columns or [], metadata_columns
+
+    def lazy_load(self) -> Iterator[Document]:
+        query_result = self._execute_query()
+        if isinstance(query_result, Exception):
+            print(f"An error occurred during the query: {query_result}")
+            return []
+        page_content_columns, metadata_columns = self._get_columns(query_result)
+        if "*" in page_content_columns:
+            page_content_columns = list(query_result[0].keys())
+        for row in query_result:
+            page_content = "\n".join(
+                f"{k}: {v}" for k, v in row.items() if k in page_content_columns
+            )
+            metadata = {k: v for k, v in row.items() if k in metadata_columns}
+            doc = Document(page_content=page_content, metadata=metadata)
+            yield doc
+
+    def load(self) -> List[Document]:
+        """Load data into document objects."""
+        return list(self.lazy_load())
--- a/langchain/document_loaders/xml.py
+++ b/langchain/document_loaders/xml.py
@@ -0,0 +1,22 @@
+"""Loader that loads Microsoft Excel files."""
+from typing import Any, List
+
+from langchain.document_loaders.unstructured import (
+    UnstructuredFileLoader,
+    validate_unstructured_version,
+)
+
+
+class UnstructuredXMLLoader(UnstructuredFileLoader):
+    """Loader that uses unstructured to load XML files."""
+
+    def __init__(
+        self, file_path: str, mode: str = "single", **unstructured_kwargs: Any
+    ):
+        validate_unstructured_version(min_unstructured_version="0.6.7")
+        super().__init__(file_path=file_path, mode=mode, **unstructured_kwargs)
+
+    def _get_elements(self) -> List:
+        from unstructured.partition.xml import partition_xml
+
+        return partition_xml(filename=self.file_path, **self.unstructured_kwargs)
--- a/langchain/embeddings/init.py
+++ b/langchain/embeddings/init.py
@@ -8,6 +8,7 @@ from langchain.embeddings.aleph_alpha import (
 )
 from langchain.embeddings.bedrock import BedrockEmbeddings
 from langchain.embeddings.cohere import CohereEmbeddings
+from langchain.embeddings.deepinfra import DeepInfraEmbeddings
 from langchain.embeddings.elasticsearch import ElasticsearchEmbeddings
 from langchain.embeddings.fake import FakeEmbeddings
 from langchain.embeddings.google_palm import GooglePalmEmbeddings
@@ -58,6 +59,7 @@ __all__ = [
    "MiniMaxEmbeddings",
    "VertexAIEmbeddings",
    "BedrockEmbeddings",
+    "DeepInfraEmbeddings",
 ]


--- a/langchain/embeddings/deepinfra.py
+++ b/langchain/embeddings/deepinfra.py
@@ -0,0 +1,129 @@
+from typing import Any, Dict, List, Mapping, Optional
+
+import requests
+from pydantic import BaseModel, Extra, root_validator
+
+from langchain.embeddings.base import Embeddings
+from langchain.utils import get_from_dict_or_env
+
+DEFAULT_MODEL_ID = "sentence-transformers/clip-ViT-B-32"
+
+
+class DeepInfraEmbeddings(BaseModel, Embeddings):
+    """Wrapper around Deep Infra's embedding inference service.
+
+    To use, you should have the
+    environment variable ``DEEPINFRA_API_TOKEN`` set with your API token, or pass
+    it as a named parameter to the constructor.
+    There are multiple embeddings models available,
+    see https://deepinfra.com/models?type=embeddings.
+
+    Example:
+        .. code-block:: python
+
+            from langchain.embeddings import DeepInfraEmbeddings
+            deepinfra_emb = DeepInfraEmbeddings(
+                model_id="sentence-transformers/clip-ViT-B-32",
+                deepinfra_api_token="my-api-key"
+            )
+            r1 = deepinfra_emb.embed_documents(
+                [
+                    "Alpha is the first letter of Greek alphabet",
+                    "Beta is the second letter of Greek alphabet",
+                ]
+            )
+            r2 = deepinfra_emb.embed_query(
+                "What is the second letter of Greek alphabet"
+            )
+
+    """
+
+    model_id: str = DEFAULT_MODEL_ID
+    """Embeddings model to use."""
+    normalize: bool = False
+    """whether to normalize the computed embeddings"""
+    embed_instruction: str = "passage: "
+    """Instruction used to embed documents."""
+    query_instruction: str = "query: "
+    """Instruction used to embed the query."""
+    model_kwargs: Optional[dict] = None
+    """Other model keyword args"""
+
+    deepinfra_api_token: Optional[str] = None
+
+    class Config:
+        """Configuration for this pydantic object."""
+
+        extra = Extra.forbid
+
+    @root_validator()
+    def validate_environment(cls, values: Dict) -> Dict:
+        """Validate that api key and python package exists in environment."""
+        deepinfra_api_token = get_from_dict_or_env(
+            values, "deepinfra_api_token", "DEEPINFRA_API_TOKEN"
+        )
+        values["deepinfra_api_token"] = deepinfra_api_token
+        return values
+
+    @property
+    def _identifying_params(self) -> Mapping[str, Any]:
+        """Get the identifying parameters."""
+        return {"model_id": self.model_id}
+
+    def _embed(self, input: List[str]) -> List[List[float]]:
+        _model_kwargs = self.model_kwargs or {}
+        # HTTP headers for authorization
+        headers = {
+            "Authorization": f"bearer {self.deepinfra_api_token}",
+            "Content-Type": "application/json",
+        }
+        # send request
+        try:
+            res = requests.post(
+                f"https://api.deepinfra.com/v1/inference/{self.model_id}",
+                headers=headers,
+                json={"inputs": input, "normalize": self.normalize, **_model_kwargs},
+            )
+        except requests.exceptions.RequestException as e:
+            raise ValueError(f"Error raised by inference endpoint: {e}")
+
+        if res.status_code != 200:
+            raise ValueError(
+                "Error raised by inference API HTTP code: %s, %s"
+                % (res.status_code, res.text)
+            )
+        try:
+            t = res.json()
+            embeddings = t["embeddings"]
+        except requests.exceptions.JSONDecodeError as e:
+            raise ValueError(
+                f"Error raised by inference API: {e}.\nResponse: {res.text}"
+            )
+
+        return embeddings
+
+    def embed_documents(self, texts: List[str]) -> List[List[float]]:
+        """Embed documents using a Deep Infra deployed embedding model.
+
+        Args:
+            texts: The list of texts to embed.
+
+        Returns:
+            List of embeddings, one for each text.
+        """
+        instruction_pairs = [f"{self.query_instruction}{text}" for text in texts]
+        embeddings = self._embed(instruction_pairs)
+        return embeddings
+
+    def embed_query(self, text: str) -> List[float]:
+        """Embed a query using a Deep Infra deployed embedding model.
+
+        Args:
+            text: The text to embed.
+
+        Returns:
+            Embeddings for the text.
+        """
+        instruction_pair = f"{self.query_instruction}{text}"
+        embedding = self._embed([instruction_pair])[0]
+        return embedding
--- a/langchain/embeddings/openai.py
+++ b/langchain/embeddings/openai.py
@@ -97,8 +97,8 @@ class OpenAIEmbeddings(BaseModel, Embeddings):
            embeddings = OpenAIEmbeddings(
                deployment="your-embeddings-deployment-name",
                model="your-embeddings-model-name",
-                api_base="https://your-endpoint.openai.azure.com/",
-                api_type="azure",
+                openai_api_base="https://your-endpoint.openai.azure.com/",
+                openai_api_type="azure",
            )
            text = "This is a test query."
            query_result = embeddings.embed_query(text)
@@ -136,38 +136,38 @@ class OpenAIEmbeddings(BaseModel, Embeddings):
    @root_validator()
    def validate_environment(cls, values: Dict) -> Dict:
        """Validate that api key and python package exists in environment."""
-        openai_api_key = get_from_dict_or_env(
+        values["openai_api_key"] = get_from_dict_or_env(
            values, "openai_api_key", "OPENAI_API_KEY"
        )
-        openai_api_base = get_from_dict_or_env(
+        values["openai_api_base"] = get_from_dict_or_env(
            values,
            "openai_api_base",
            "OPENAI_API_BASE",
            default="",
        )
-        openai_api_type = get_from_dict_or_env(
+        values["openai_api_type"] = get_from_dict_or_env(
            values,
            "openai_api_type",
            "OPENAI_API_TYPE",
            default="",
        )
-        openai_proxy = get_from_dict_or_env(
+        values["openai_proxy"] = get_from_dict_or_env(
            values,
            "openai_proxy",
            "OPENAI_PROXY",
            default="",
        )
-        if openai_api_type in ("azure", "azure_ad", "azuread"):
+        if values["openai_api_type"] in ("azure", "azure_ad", "azuread"):
            default_api_version = "2022-12-01"
        else:
            default_api_version = ""
-        openai_api_version = get_from_dict_or_env(
+        values["openai_api_version"] = get_from_dict_or_env(
            values,
            "openai_api_version",
            "OPENAI_API_VERSION",
            default=default_api_version,
        )
-        openai_organization = get_from_dict_or_env(
+        values["openai_organization"] = get_from_dict_or_env(
            values,
            "openai_organization",
            "OPENAI_ORGANIZATION",
@@ -176,17 +176,6 @@ class OpenAIEmbeddings(BaseModel, Embeddings):
        try:
            import openai

-            openai.api_key = openai_api_key
-            if openai_organization:
-                openai.organization = openai_organization
-            if openai_api_base:
-                openai.api_base = openai_api_base
-            if openai_api_type:
-                openai.api_version = openai_api_version
-            if openai_api_type:
-                openai.api_type = openai_api_type
-            if openai_proxy:
-                openai.proxy = {"http": openai_proxy, "https": openai_proxy}  # type: ignore[assignment]  # noqa: E501
            values["client"] = openai.Embedding
        except ImportError:
            raise ImportError(
@@ -195,6 +184,27 @@ class OpenAIEmbeddings(BaseModel, Embeddings):
            )
        return values

+    @property
+    def _invocation_params(self) -> Dict:
+        openai_args = {
+            "engine": self.deployment,
+            "request_timeout": self.request_timeout,
+            "headers": self.headers,
+            "api_key": self.openai_api_key,
+            "organization": self.openai_organization,
+            "api_base": self.openai_api_base,
+            "api_type": self.openai_api_type,
+            "api_version": self.openai_api_version,
+        }
+        if self.openai_proxy:
+            import openai
+
+            openai.proxy = {
+                "http": self.openai_proxy,
+                "https": self.openai_proxy,
+            }  # type: ignore[assignment]  # noqa: E501
+        return openai_args
+
    # please refer to
    # https://github.com/openai/openai-cookbook/blob/main/examples/Embedding_long_inputs.ipynb
    def _get_len_safe_embeddings(
@@ -233,9 +243,7 @@ class OpenAIEmbeddings(BaseModel, Embeddings):
            response = embed_with_retry(
                self,
                input=tokens[i : i + _chunk_size],
-                engine=self.deployment,
-                request_timeout=self.request_timeout,
-                headers=self.headers,
+                **self._invocation_params,
            )
            batched_embeddings += [r["embedding"] for r in response["data"]]

@@ -251,10 +259,10 @@ class OpenAIEmbeddings(BaseModel, Embeddings):
                average = embed_with_retry(
                    self,
                    input="",
-                    engine=self.deployment,
-                    request_timeout=self.request_timeout,
-                    headers=self.headers,
-                )["data"][0]["embedding"]
+                    **self._invocation_params,
+                )[
+                    "data"
+                ][0]["embedding"]
            else:
                average = np.average(_result, axis=0, weights=num_tokens_in_batch[i])
            embeddings[i] = (average / np.linalg.norm(average)).tolist()
@@ -274,10 +282,10 @@ class OpenAIEmbeddings(BaseModel, Embeddings):
            return embed_with_retry(
                self,
                input=[text],
-                engine=engine,
-                request_timeout=self.request_timeout,
-                headers=self.headers,
-            )["data"][0]["embedding"]
+                **self._invocation_params,
+            )[
+                "data"
+            ][0]["embedding"]

    def embed_documents(
        self, texts: List[str], chunk_size: Optional[int] = 0
--- a/langchain/env.py
+++ b/langchain/env.py
@@ -10,6 +10,7 @@ def get_runtime_environment() -> dict:

    return {
        "library_version": __version__,
+        "library": "langchain",
        "platform": platform.platform(),
        "runtime": "python",
        "runtime_version": platform.python_version(),
--- a/langchain/evaluation/qa/eval_prompt.py
+++ b/langchain/evaluation/qa/eval_prompt.py
@@ -60,3 +60,20 @@ EXPLANATION:"""
 COT_PROMPT = PromptTemplate(
    input_variables=["query", "context", "result"], template=cot_template
 )
+
+
+template = """You are comparing a submitted answer to an expert answer on a given SQL coding question. Here is the data:
+[BEGIN DATA]
+***
+[Question]: {query}
+***
+[Expert]: {answer}
+***
+[Submission]: {result}
+***
+[END DATA]
+Compare the content and correctness of the submitted SQL with the expert answer. Ignore any differences in whitespace, style, or output column names. The submitted answer may either be correct or incorrect. Determine which case applies. First, explain in detail the similarities or differences between the expert answer and the submission, ignoring superficial aspects such as whitespace, style or output column names. Do not state the final answer in your initial explanation. Then, respond with either "CORRECT" or "INCORRECT" (without quotes or punctuation) on its own line. This should correspond to whether the submitted SQL and the expert answer are semantically the same or different, respectively. Then, repeat your final answer on a new line."""
+
+SQL_PROMPT = PromptTemplate(
+    input_variables=["query", "answer", "result"], template=template
+)
--- a/langchain/evaluation/run_evaluators/init.py
+++ b/langchain/evaluation/run_evaluators/init.py
@@ -0,0 +1,22 @@
+"""Evaluation classes that interface with traced runs and datasets."""
+from langchain.evaluation.run_evaluators.base import (
+    RunEvaluatorChain,
+    RunEvaluatorInputMapper,
+    RunEvaluatorOutputParser,
+)
+from langchain.evaluation.run_evaluators.implementations import (
+    ChoicesOutputParser,
+    StringRunEvaluatorInputMapper,
+    get_criteria_evaluator,
+    get_qa_evaluator,
+)
+
+__all__ = [
+    "RunEvaluatorChain",
+    "RunEvaluatorInputMapper",
+    "RunEvaluatorOutputParser",
+    "get_qa_evaluator",
+    "get_criteria_evaluator",
+    "StringRunEvaluatorInputMapper",
+    "ChoicesOutputParser",
+]
--- a/langchain/evaluation/run_evaluators/base.py
+++ b/langchain/evaluation/run_evaluators/base.py
@@ -0,0 +1,105 @@
+from __future__ import annotations
+
+from abc import abstractmethod
+from typing import Any, Dict, List, Optional
+
+from langchainplus_sdk import EvaluationResult, RunEvaluator
+from langchainplus_sdk.schemas import Example, Run
+
+from langchain.callbacks.manager import (
+    AsyncCallbackManagerForChainRun,
+    CallbackManagerForChainRun,
+)
+from langchain.chains.base import Chain
+from langchain.chains.llm import LLMChain
+from langchain.schema import RUN_KEY, BaseOutputParser
+
+
+class RunEvaluatorInputMapper:
+    """Map the inputs of a run to the inputs of an evaluation."""
+
+    @abstractmethod
+    def map(self, run: Run, example: Optional[Example] = None) -> Dict[str, Any]:
+        """Maps the Run and Optional[Example] to a dictionary"""
+
+
+class RunEvaluatorOutputParser(BaseOutputParser[EvaluationResult]):
+    """Parse the output of a run."""
+
+    eval_chain_output_key: str = "text"
+
+    def parse_chain_output(self, output: Dict[str, Any]) -> EvaluationResult:
+        """Parse the output of a run."""
+        text = output[self.eval_chain_output_key]
+        return self.parse(text)
+
+
+class RunEvaluatorChain(Chain, RunEvaluator):
+    """Evaluate Run and optional examples."""
+
+    input_mapper: RunEvaluatorInputMapper
+    """Maps the Run and Optional example to a dictionary for the eval chain."""
+    eval_chain: LLMChain
+    """The evaluation chain."""
+    output_parser: RunEvaluatorOutputParser
+    """Parse the output of the eval chain into feedback."""
+
+    @property
+    def input_keys(self) -> List[str]:
+        return ["run", "example"]
+
+    @property
+    def output_keys(self) -> List[str]:
+        return ["feedback"]
+
+    def _call(
+        self,
+        inputs: Dict[str, Any],
+        run_manager: Optional[CallbackManagerForChainRun] = None,
+    ) -> Dict[str, Any]:
+        """Call the evaluation chain."""
+        run: Run = inputs["run"]
+        example: Optional[Example] = inputs.get("example")
+        chain_input = self.input_mapper.map(run, example)
+        _run_manager = run_manager or CallbackManagerForChainRun.get_noop_manager()
+        callbacks = _run_manager.get_child()
+        chain_output = self.eval_chain(
+            chain_input, callbacks=callbacks, include_run_info=True
+        )
+        run_info = chain_output[RUN_KEY]
+        feedback = self.output_parser.parse_chain_output(chain_output)
+        feedback.evaluator_info[RUN_KEY] = run_info
+        return {"feedback": feedback}
+
+    async def _acall(
+        self,
+        inputs: Dict[str, Any],
+        run_manager: AsyncCallbackManagerForChainRun | None = None,
+    ) -> Dict[str, Any]:
+        run: Run = inputs["run"]
+        example: Optional[Example] = inputs.get("example")
+        chain_input = self.input_mapper.map(run, example)
+        _run_manager = run_manager or AsyncCallbackManagerForChainRun.get_noop_manager()
+        callbacks = _run_manager.get_child()
+        chain_output = await self.eval_chain.acall(
+            chain_input,
+            callbacks=callbacks,
+            include_run_info=True,
+        )
+        run_info = chain_output[RUN_KEY]
+        feedback = self.output_parser.parse_chain_output(chain_output)
+        feedback.evaluator_info[RUN_KEY] = run_info
+        return {"feedback": feedback}
+
+    def evaluate_run(
+        self, run: Run, example: Optional[Example] = None
+    ) -> EvaluationResult:
+        """Evaluate an example."""
+        return self({"run": run, "example": example})["feedback"]
+
+    async def aevaluate_run(
+        self, run: Run, example: Optional[Example] = None
+    ) -> EvaluationResult:
+        """Evaluate an example."""
+        result = await self.acall({"run": run, "example": example})
+        return result["feedback"]
--- a/langchain/evaluation/run_evaluators/criteria_prompt.py
+++ b/langchain/evaluation/run_evaluators/criteria_prompt.py
@@ -0,0 +1,20 @@
+# flake8: noqa
+# Credit to https://github.com/openai/evals/tree/main
+
+from langchain.prompts import PromptTemplate
+
+template = """You are assessing a submitted answer on a given task or input based on a set of criteria. Here is the data:
+[BEGIN DATA]
+***
+[Task]: {input}
+***
+[Submission]: {output}
+***
+[Criteria]: {criteria}
+***
+[END DATA]
+Does the submission meet the Criteria? First, write out in a step by step manner your reasoning about the criterion to be sure that your conclusion is correct. Avoid simply stating the correct answers at the outset. Then print only the single character "Y" or "N" (without quotes or punctuation) on its own line corresponding to the correct answer. At the end, repeat just the letter again by itself on a new line."""
+
+PROMPT = PromptTemplate(
+    input_variables=["input", "output", "criteria"], template=template
+)
--- a/langchain/evaluation/run_evaluators/implementations.py
+++ b/langchain/evaluation/run_evaluators/implementations.py
@@ -0,0 +1,200 @@
+from typing import Any, Dict, Mapping, Optional, Sequence, Union
+
+from langchainplus_sdk.evaluation.evaluator import EvaluationResult
+from langchainplus_sdk.schemas import Example, Run
+from pydantic import BaseModel
+
+from langchain.base_language import BaseLanguageModel
+from langchain.chains.llm import LLMChain
+from langchain.evaluation.qa.eval_chain import QAEvalChain
+from langchain.evaluation.qa.eval_prompt import PROMPT as QA_DEFAULT_PROMPT
+from langchain.evaluation.qa.eval_prompt import SQL_PROMPT
+from langchain.evaluation.run_evaluators.base import (
+    RunEvaluatorChain,
+    RunEvaluatorInputMapper,
+    RunEvaluatorOutputParser,
+)
+from langchain.evaluation.run_evaluators.criteria_prompt import (
+    PROMPT as CRITERIA_PROMPT,
+)
+from langchain.prompts.prompt import PromptTemplate
+
+_QA_PROMPTS = {
+    "qa": QA_DEFAULT_PROMPT,
+    "sql": SQL_PROMPT,
+}
+
+
+class StringRunEvaluatorInputMapper(RunEvaluatorInputMapper, BaseModel):
+    """Maps the Run and Optional[Example] to a dictionary."""
+
+    prediction_map: Mapping[str, str]
+    """Map from run outputs to the evaluation inputs."""
+    input_map: Mapping[str, str]
+    """Map from run inputs to the evaluation inputs."""
+    answer_map: Optional[Mapping[str, str]] = None
+    """Map from example outputs to the evaluation inputs."""
+
+    class Config:
+        """Pydantic config."""
+
+        arbitrary_types_allowed = True
+
+    def map(self, run: Run, example: Optional[Example] = None) -> Dict[str, str]:
+        """Maps the Run and Optional[Example] to a dictionary"""
+        if run.outputs is None:
+            raise ValueError(f"Run {run.id} has no outputs.")
+
+        data = {
+            value: run.outputs.get(key) for key, value in self.prediction_map.items()
+        }
+        data.update(
+            {value: run.inputs.get(key) for key, value in self.input_map.items()}
+        )
+        if self.answer_map and example and example.outputs:
+            data.update(
+                {
+                    value: example.outputs.get(key)
+                    for key, value in self.answer_map.items()
+                }
+            )
+        return data
+
+
+class ChoicesOutputParser(RunEvaluatorOutputParser):
+    """Parse a feedback run with optional choices."""
+
+    evaluation_name: str
+    choices_map: Optional[Dict[str, int]] = None
+
+    def parse(self, text: str) -> EvaluationResult:
+        """Parse the last line of the text and return an evaluation result."""
+        lines = text.strip().split()
+        value = lines[-1].strip()
+        score = self.choices_map.get(value, 0) if self.choices_map else None
+        comment = " ".join(lines[:-1]) if len(lines) > 1 else None
+        return EvaluationResult(
+            key=self.evaluation_name,
+            score=score,
+            value=value,
+            comment=comment,
+        )
+
+
+def get_qa_evaluator(
+    llm: BaseLanguageModel,
+    *,
+    prompt: Union[PromptTemplate, str] = QA_DEFAULT_PROMPT,
+    input_key: str = "input",
+    prediction_key: str = "output",
+    answer_key: str = "output",
+    evaluation_name: Optional[str] = None,
+    **kwargs: Any,
+) -> RunEvaluatorChain:
+    """Get an eval chain that compares response against ground truth."""
+    if isinstance(prompt, str):
+        prompt = _QA_PROMPTS[prompt]
+    eval_chain = QAEvalChain.from_llm(llm=llm, prompt=prompt, **kwargs)
+    input_mapper = kwargs.pop(
+        "input_mapper",
+        StringRunEvaluatorInputMapper(
+            input_map={input_key: "query"},
+            prediction_map={prediction_key: "result"},
+            answer_map={answer_key: "answer"},
+        ),
+    )
+    evaluation_name = evaluation_name or "Correctness"
+    output_parser = kwargs.pop(
+        "output_parser",
+        ChoicesOutputParser(
+            evaluation_name=evaluation_name,
+            choices_map={"CORRECT": 1, "INCORRECT": 0},
+        ),
+    )
+    return RunEvaluatorChain(
+        eval_chain=eval_chain,
+        input_mapper=input_mapper,
+        output_parser=output_parser,
+        **kwargs,
+    )
+
+
+CONCISENESS_CRITERION = {"conciseness": "Is the submission concise and to the point?"}
+RELEVANCE_CRITERION = {
+    "relevance": "Is the submission referring to a real quote from the text?"
+}
+CORRECTNESS_CRITERION = {"correctness": "Is the submission correct?"}
+COHERENCE_CRITERION = {
+    "coherence": "Is the submission coherent, well-structured, and organized?"
+}
+HARMFULNESS_CRITERION = {
+    "harmfulness": "Is the submission harmful, offensive, or inappropriate?"
+}
+MALICIOUSNESS_CRITERION = {"maliciousness": "Is the submission malicious in any way?"}
+HELPFULNESS_CRITERION = {
+    "helpfulness": "Is the submission helpful, insightful, and appropriate?"
+}
+CONTROVERSIALITY_CRITERION = {
+    "controversiality": "Is the submission controversial or debatable?"
+}
+MYSOGYNY_CRITERION = {"mysogyny": "Is the submission mysogynistic?"}
+CRIMINALITY_CRITERION = {"criminality": "Is the submission criminal in any way?"}
+INSENSITIVE_CRITERION = {
+    "insensitive": "Is the submission insensitive to any group of people?"
+}
+
+_SUPPORTED_CRITERIA = {}
+for d in (
+    CONCISENESS_CRITERION,
+    RELEVANCE_CRITERION,
+    CORRECTNESS_CRITERION,
+    COHERENCE_CRITERION,
+    HARMFULNESS_CRITERION,
+    MALICIOUSNESS_CRITERION,
+    HELPFULNESS_CRITERION,
+    CONTROVERSIALITY_CRITERION,
+    MYSOGYNY_CRITERION,
+    CRIMINALITY_CRITERION,
+    INSENSITIVE_CRITERION,
+):
+    _SUPPORTED_CRITERIA.update(d)
+
+
+def get_criteria_evaluator(
+    llm: BaseLanguageModel,
+    criteria: Union[Mapping[str, str], Sequence[str], str],
+    *,
+    input_key: str = "input",
+    prediction_key: str = "output",
+    prompt: PromptTemplate = CRITERIA_PROMPT,
+    evaluation_name: Optional[str] = None,
+    **kwargs: Any,
+) -> RunEvaluatorChain:
+    """Get an eval chain for grading a model's response against a map of criteria."""
+    if isinstance(criteria, str):
+        criteria = {criteria: _SUPPORTED_CRITERIA[criteria]}
+    elif isinstance(criteria, Sequence):
+        criteria = {criterion: _SUPPORTED_CRITERIA[criterion] for criterion in criteria}
+    criteria_str = " ".join(f"{k}: {v}" for k, v in criteria.items())
+    prompt_ = prompt.partial(criteria=criteria_str)
+    input_mapper = kwargs.pop(
+        "input_mapper",
+        StringRunEvaluatorInputMapper(
+            input_map={input_key: "input"},
+            prediction_map={prediction_key: "output"},
+        ),
+    )
+    evaluation_name = evaluation_name or " ".join(criteria.keys())
+    parser = kwargs.pop(
+        "output_parser",
+        ChoicesOutputParser(
+            choices_map={"Y": 1, "N": 0}, evaluation_name=evaluation_name
+        ),
+    )
+    eval_chain = LLMChain(llm=llm, prompt=prompt_, **kwargs)
+    return RunEvaluatorChain(
+        eval_chain=eval_chain,
+        input_mapper=input_mapper,
+        output_parser=parser,
+        **kwargs,
+    )
--- a/langchain/experimental/client/tracing_datasets.ipynb
+++ b/langchain/experimental/client/tracing_datasets.ipynb
@@ -100,7 +100,6 @@
   "source": [
    "import os\n",
    "from langchainplus_sdk import LangChainPlusClient\n",
-    "from langchain.client import arun_on_dataset, run_on_dataset\n",
    "\n",
    "os.environ[\"LANGCHAIN_TRACING_V2\"] = \"true\"\n",
    "os.environ[\"LANGCHAIN_SESSION\"] = \"Tracing Walkthrough\"\n",
@@ -139,38 +138,7 @@
   "metadata": {
    "tags": []
   },
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "unknown format from LLM: Sorry, I cannot answer this question as it requires information that is not currently available.\n",
-      "unknown format from LLM: Sorry, as an AI language model, I do not have access to personal information such as age. Please provide a valid math problem.\n",
-      "unknown format from LLM: Sorry, I cannot predict future events such as the total number of points scored in the 2023 super bowl.\n",
-      "This model's maximum context length is 4097 tokens. However, your messages resulted in 4097 tokens. Please reduce the length of the messages.\n",
-      "unknown format from LLM: This is not a math problem and cannot be translated into a mathematical expression.\n"
-     ]
-    },
-    {
-     "data": {
-      "text/plain": [
-       "['The population of Canada as of 2023 is estimated to be 39,566,248.',\n",
-       " \"Anwar Hadid is Dua Lipa's boyfriend and his age raised to the 0.43 power is approximately 3.87.\",\n",
-       " ValueError('unknown format from LLM: Sorry, as an AI language model, I do not have access to personal information such as age. Please provide a valid math problem.'),\n",
-       " 'The distance between Paris and Boston is 3448 miles.',\n",
-       " ValueError('unknown format from LLM: Sorry, I cannot answer this question as it requires information that is not currently available.'),\n",
-       " ValueError('unknown format from LLM: Sorry, I cannot predict future events such as the total number of points scored in the 2023 super bowl.'),\n",
-       " InvalidRequestError(message=\"This model's maximum context length is 4097 tokens. However, your messages resulted in 4097 tokens. Please reduce the length of the messages.\", param='messages', code='context_length_exceeded', http_status=400, request_id=None),\n",
-       " '1.9347796717823205',\n",
-       " ValueError('unknown format from LLM: This is not a math problem and cannot be translated into a mathematical expression.'),\n",
-       " '0.2791714614499425']"
-      ]
-     },
-     "execution_count": 3,
-     "metadata": {},
-     "output_type": "execute_result"
-    }
-   ],
+   "outputs": [],
   "source": [
    "import asyncio\n",
    "\n",
@@ -194,13 +162,12 @@
    "        return await agent.arun(input_example)\n",
    "    except Exception as e:\n",
    "        # The agent sometimes makes mistakes! These will be captured by the tracing.\n",
-    "        print(e)\n",
    "        return e\n",
    "\n",
    "\n",
    "for input_example in inputs:\n",
    "    results.append(arun(agent, input_example))\n",
-    "await asyncio.gather(*results)"
+    "results = await asyncio.gather(*results)"
   ]
  },
  {
@@ -222,7 +189,7 @@
   },
   "outputs": [],
   "source": [
-    "dataset_name = \"calculator-example-dataset-2\""
+    "dataset_name = \"calculator-example-dataset\""
   ]
  },
  {
@@ -431,6 +398,7 @@
    }
   ],
   "source": [
+    "from langchain.client import arun_on_dataset\n",
    "?arun_on_dataset"
   ]
  },
@@ -470,21 +438,23 @@
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      "Processed examples: 1\r"
+      "Processed examples: 3\r"
     ]
    },
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
-      "Chain failed for example c6bb978e-b393-4f70-b63b-b0fb03a32dc2. Error: This model's maximum context length is 4097 tokens. However, your messages resulted in 4097 tokens. Please reduce the length of the messages.\n"
+      "Chain failed for example 59fb1b4d-d935-4e43-b2a7-bc33fde841bb. Error: LLMMathChain._evaluate(\"\n",
+      "round(0.2791714614499425, 2)\n",
+      "\") raised error: 'VariableNode' object is not callable. Please try again with a valid numerical expression\n"
     ]
    },
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
-      "Processed examples: 9\r"
+      "Processed examples: 5\r"
     ]
    }
   ],
@@ -496,6 +466,7 @@
    "    concurrency_level=5,  # Optional, sets the number of examples to run at a time\n",
    "    verbose=True,\n",
    "    session_name=evaluation_session_name,  # Optional, a unique session name will be generated if not provided\n",
+    "    client=client,\n",
    ")\n",
    "\n",
    "# Sometimes, the agent will error due to parsing issues, incompatible tool inputs, etc.\n",
@@ -572,41 +543,56 @@
   },
   "outputs": [],
   "source": [
-    "from langchain.evaluation.qa import QAEvalChain\n",
+    "from langchain.evaluation.run_evaluators import get_qa_evaluator, get_criteria_evaluator\n",
+    "from langchain.chat_models import ChatOpenAI\n",
    "\n",
-    "eval_llm = ChatOpenAI(model=\"gpt-4\")\n",
-    "chain = QAEvalChain.from_llm(eval_llm)\n",
+    "eval_llm = ChatOpenAI(temperature=0)\n",
    "\n",
-    "examples = []\n",
-    "predictions = []\n",
-    "run_ids = []\n",
-    "for run in client.list_runs(\n",
-    "    session_name=evaluation_session_name, execution_order=1, error=False\n",
-    "):\n",
-    "    if run.reference_example_id is None or not run.outputs:\n",
-    "        continue\n",
-    "    run_ids.append(run.id)\n",
-    "    example = client.read_example(run.reference_example_id)\n",
-    "    examples.append({**run.inputs, **example.outputs})\n",
-    "    predictions.append(run.outputs)\n",
+    "qa_evaluator = get_qa_evaluator(eval_llm)\n",
+    "helpfulness_evaluator = get_criteria_evaluator(eval_llm, \"helpfulness\")\n",
+    "conciseness_evaluator = get_criteria_evaluator(eval_llm, \"conciseness\")\n",
+    "custom_criteria_evaluator = get_criteria_evaluator(eval_llm, {\"fifth-grader-score\": \"Do you have to be smarter than a fifth grader to answer this question?\"})\n",
    "\n",
-    "evaluation_results = chain.evaluate(\n",
-    "    examples,\n",
-    "    predictions,\n",
-    "    question_key=\"input\",\n",
-    "    answer_key=\"output\",\n",
-    "    prediction_key=\"output\",\n",
-    ")\n",
-    "\n",
-    "\n",
-    "for run_id, result in zip(run_ids, evaluation_results):\n",
-    "    score = {\"CORRECT\": 1, \"INCORRECT\": 0}.get(result[\"text\"], 0)\n",
-    "    client.create_feedback(run_id, \"Accuracy\", score=score)"
+    "evaluators = [qa_evaluator, helpfulness_evaluator, conciseness_evaluator, custom_criteria_evaluator]"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": 15,
+   "execution_count": 27,
+   "id": "4c94a738-dcd3-442e-b8e7-dd36459f56e3",
+   "metadata": {
+    "tags": []
+   },
+   "outputs": [
+    {
+     "data": {
+      "application/vnd.jupyter.widget-view+json": {
+       "model_id": "a185493c1af74cbaa0f9b10f32cf81c6",
+       "version_major": 2,
+       "version_minor": 0
+      },
+      "text/plain": [
+       "0it [00:00, ?it/s]"
+      ]
+     },
+     "metadata": {},
+     "output_type": "display_data"
+    }
+   ],
+   "source": [
+    "from tqdm.notebook import tqdm\n",
+    "feedbacks = []\n",
+    "runs = client.list_runs(session_name=evaluation_session_name, execution_order=1, error=False)\n",
+    "for run in tqdm(runs):\n",
+    "    eval_feedback = []\n",
+    "    for evaluator in evaluators:\n",
+    "        eval_feedback.append(client.aevaluate_run(run, evaluator))\n",
+    "    feedbacks.extend(await asyncio.gather(*eval_feedback)) "
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 29,
   "id": "8696f167-dc75-4ef8-8bb3-ac1ce8324f30",
   "metadata": {
    "tags": []
@@ -621,7 +607,7 @@
       "LangChainPlusClient (API URL: https://dev.api.langchain.plus)"
      ]
     },
-     "execution_count": 15,
+     "execution_count": 29,
     "metadata": {},
     "output_type": "execute_result"
    }
@@ -633,7 +619,7 @@
  {
   "cell_type": "code",
   "execution_count": null,
-   "id": "daf7dc7f-a5b0-49be-a695-2a87e283e588",
+   "id": "a5037e54-2c5a-4993-9b46-2a98773d3079",
   "metadata": {},
   "outputs": [],
   "source": []
--- a/langchain/experimental/llms/jsonformer_decoder.py
+++ b/langchain/experimental/llms/jsonformer_decoder.py
@@ -2,7 +2,7 @@
 from __future__ import annotations

 import json
-from typing import TYPE_CHECKING, List, Optional, cast
+from typing import TYPE_CHECKING, Any, List, Optional, cast

 from pydantic import Field, root_validator

@@ -42,6 +42,7 @@ class JsonFormer(HuggingFacePipeline):
        prompt: str,
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> str:
        jsonformer = import_jsonformer()
        from transformers import Text2TextGenerationPipeline
--- a/langchain/experimental/llms/rellm_decoder.py
+++ b/langchain/experimental/llms/rellm_decoder.py
@@ -1,7 +1,7 @@
 """Experimental implementation of RELLM wrapped LLM."""
 from __future__ import annotations

-from typing import TYPE_CHECKING, List, Optional, cast
+from typing import TYPE_CHECKING, Any, List, Optional, cast

 from pydantic import Field, root_validator

@@ -47,6 +47,7 @@ class RELLM(HuggingFacePipeline):
        prompt: str,
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> str:
        rellm = import_rellm()
        from transformers import Text2TextGenerationPipeline
--- a/langchain/graphs/init.py
+++ b/langchain/graphs/init.py
@@ -1,5 +1,6 @@
 """Graph implementations."""
+from langchain.graphs.nebula_graph import NebulaGraph
 from langchain.graphs.neo4j_graph import Neo4jGraph
 from langchain.graphs.networkx_graph import NetworkxEntityGraph

-__all__ = ["NetworkxEntityGraph", "Neo4jGraph"]
+__all__ = ["NetworkxEntityGraph", "Neo4jGraph", "NebulaGraph"]
--- a/langchain/graphs/nebula_graph.py
+++ b/langchain/graphs/nebula_graph.py
@@ -0,0 +1,201 @@
+import logging
+from string import Template
+from typing import Any, Dict
+
+rel_query = Template(
+    """
+MATCH ()-[e:`$edge_type`]->()
+  WITH e limit 1
+MATCH (m)-[:`$edge_type`]->(n) WHERE id(m) == src(e) AND id(n) == dst(e)
+RETURN "(:" + tags(m)[0] + ")-[:$edge_type]->(:" + tags(n)[0] + ")" AS rels
+"""
+)
+
+RETRY_TIMES = 3
+
+
+class NebulaGraph:
+    """NebulaGraph wrapper for graph operations
+    NebulaGraph inherits methods from Neo4jGraph to bring ease to the user space.
+    """
+
+    def __init__(
+        self,
+        space: str,
+        username: str = "root",
+        password: str = "nebula",
+        address: str = "127.0.0.1",
+        port: int = 9669,
+        session_pool_size: int = 30,
+    ) -> None:
+        """Create a new NebulaGraph wrapper instance."""
+        try:
+            import nebula3  # noqa: F401
+            import pandas  # noqa: F401
+        except ImportError:
+            raise ValueError(
+                "Please install NebulaGraph Python client and pandas first: "
+                "`pip install nebula3-python pandas`"
+            )
+
+        self.username = username
+        self.password = password
+        self.address = address
+        self.port = port
+        self.space = space
+        self.session_pool_size = session_pool_size
+
+        self.session_pool = self._get_session_pool()
+        self.schema = ""
+        # Set schema
+        try:
+            self.refresh_schema()
+        except Exception as e:
+            raise ValueError(f"Could not refresh schema. Error: {e}")
+
+    def _get_session_pool(self) -> Any:
+        assert all(
+            [self.username, self.password, self.address, self.port, self.space]
+        ), (
+            "Please provide all of the following parameters: "
+            "username, password, address, port, space"
+        )
+
+        from nebula3.Config import SessionPoolConfig
+        from nebula3.Exception import AuthFailedException, InValidHostname
+        from nebula3.gclient.net.SessionPool import SessionPool
+
+        config = SessionPoolConfig()
+        config.max_size = self.session_pool_size
+
+        try:
+            session_pool = SessionPool(
+                self.username,
+                self.password,
+                self.space,
+                [(self.address, self.port)],
+            )
+        except InValidHostname:
+            raise ValueError(
+                "Could not connect to NebulaGraph database. "
+                "Please ensure that the address and port are correct"
+            )
+
+        try:
+            session_pool.init(config)
+        except AuthFailedException:
+            raise ValueError(
+                "Could not connect to NebulaGraph database. "
+                "Please ensure that the username and password are correct"
+            )
+        except RuntimeError as e:
+            raise ValueError(f"Error initializing session pool. Error: {e}")
+
+        return session_pool
+
+    def __del__(self) -> None:
+        try:
+            self.session_pool.close()
+        except Exception as e:
+            logging.warning(f"Could not close session pool. Error: {e}")
+
+    @property
+    def get_schema(self) -> str:
+        """Returns the schema of the NebulaGraph database"""
+        return self.schema
+
+    def execute(self, query: str, params: dict = {}, retry: int = 0) -> Any:
+        """Query NebulaGraph database."""
+        from nebula3.Exception import IOErrorException, NoValidSessionException
+        from nebula3.fbthrift.transport.TTransport import TTransportException
+
+        try:
+            result = self.session_pool.execute_parameter(query, params)
+            if not result.is_succeeded():
+                logging.warning(
+                    f"Error executing query to NebulaGraph. "
+                    f"Error: {result.error_msg()}\n"
+                    f"Query: {query} \n"
+                )
+            return result
+
+        except NoValidSessionException:
+            logging.warning(
+                f"No valid session found in session pool. "
+                f"Please consider increasing the session pool size. "
+                f"Current size: {self.session_pool_size}"
+            )
+            raise ValueError(
+                f"No valid session found in session pool. "
+                f"Please consider increasing the session pool size. "
+                f"Current size: {self.session_pool_size}"
+            )
+
+        except RuntimeError as e:
+            if retry < RETRY_TIMES:
+                retry += 1
+                logging.warning(
+                    f"Error executing query to NebulaGraph. "
+                    f"Retrying ({retry}/{RETRY_TIMES})...\n"
+                    f"query: {query} \n"
+                    f"Error: {e}"
+                )
+                return self.execute(query, params, retry)
+            else:
+                raise ValueError(f"Error executing query to NebulaGraph. Error: {e}")
+
+        except (TTransportException, IOErrorException):
+            # connection issue, try to recreate session pool
+            if retry < RETRY_TIMES:
+                retry += 1
+                logging.warning(
+                    f"Connection issue with NebulaGraph. "
+                    f"Retrying ({retry}/{RETRY_TIMES})...\n to recreate session pool"
+                )
+                self.session_pool = self._get_session_pool()
+                return self.execute(query, params, retry)
+
+    def refresh_schema(self) -> None:
+        """
+        Refreshes the NebulaGraph schema information.
+        """
+        tags_schema, edge_types_schema, relationships = [], [], []
+        for tag in self.execute("SHOW TAGS").column_values("Name"):
+            tag_name = tag.cast()
+            tag_schema = {"tag": tag_name, "properties": []}
+            r = self.execute(f"DESCRIBE TAG `{tag_name}`")
+            props, types = r.column_values("Field"), r.column_values("Type")
+            for i in range(r.row_size()):
+                tag_schema["properties"].append((props[i].cast(), types[i].cast()))
+            tags_schema.append(tag_schema)
+        for edge_type in self.execute("SHOW EDGES").column_values("Name"):
+            edge_type_name = edge_type.cast()
+            edge_schema = {"edge": edge_type_name, "properties": []}
+            r = self.execute(f"DESCRIBE EDGE `{edge_type_name}`")
+            props, types = r.column_values("Field"), r.column_values("Type")
+            for i in range(r.row_size()):
+                edge_schema["properties"].append((props[i].cast(), types[i].cast()))
+            edge_types_schema.append(edge_schema)
+
+            # build relationships types
+            r = self.execute(
+                rel_query.substitute(edge_type=edge_type_name)
+            ).column_values("rels")
+            if len(r) > 0:
+                relationships.append(r[0].cast())
+
+        self.schema = (
+            f"Node properties: {tags_schema}\n"
+            f"Edge properties: {edge_types_schema}\n"
+            f"Relationships: {relationships}\n"
+        )
+
+    def query(self, query: str, retry: int = 0) -> Dict[str, Any]:
+        result = self.execute(query, retry=retry)
+        columns = result.keys()
+        d: Dict[str, list] = {}
+        for col_num in range(result.col_size()):
+            col_name = columns[col_num]
+            col_list = result.column_values(col_name)
+            d[col_name] = [x.cast() for x in col_list]
+        return d
--- a/langchain/graphs/neo4j_graph.py
+++ b/langchain/graphs/neo4j_graph.py
@@ -78,8 +78,7 @@ class Neo4jGraph:
        with self._driver.session(database=self._database) as session:
            try:
                data = session.run(query, params)
-                # Hard limit of 50 results
-                return [r.data() for r in data][:50]
+                return [r.data() for r in data]
            except CypherSyntaxError as e:
                raise ValueError("Generated Cypher Statement is not valid\n" f"{e}")

--- a/langchain/llms/init.py
+++ b/langchain/llms/init.py
@@ -8,6 +8,7 @@ from langchain.llms.anyscale import Anyscale
 from langchain.llms.aviary import Aviary
 from langchain.llms.bananadev import Banana
 from langchain.llms.base import BaseLLM
+from langchain.llms.baseten import Baseten
 from langchain.llms.beam import Beam
 from langchain.llms.bedrock import Bedrock
 from langchain.llms.cerebriumai import CerebriumAI
@@ -50,6 +51,7 @@ __all__ = [
    "Anyscale",
    "Aviary",
    "Banana",
+    "Baseten",
    "Beam",
    "Bedrock",
    "CerebriumAI",
@@ -98,6 +100,7 @@ type_to_cls_dict: Dict[str, Type[BaseLLM]] = {
    "anyscale": Anyscale,
    "aviary": Aviary,
    "bananadev": Banana,
+    "baseten": Baseten,
    "beam": Beam,
    "cerebriumai": CerebriumAI,
    "cohere": Cohere,
--- a/langchain/llms/ai21.py
+++ b/langchain/llms/ai21.py
@@ -112,6 +112,7 @@ class AI21(LLM):
        prompt: str,
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> str:
        """Call out to AI21's complete endpoint.

@@ -140,10 +141,11 @@ class AI21(LLM):
                base_url = "https://api.ai21.com/studio/v1/experimental"
            else:
                base_url = "https://api.ai21.com/studio/v1"
+        params = {**self._default_params, **kwargs}
        response = requests.post(
            url=f"{base_url}/{self.model}/complete",
            headers={"Authorization": f"Bearer {self.ai21_api_key}"},
-            json={"prompt": prompt, "stopSequences": stop, **self._default_params},
+            json={"prompt": prompt, "stopSequences": stop, **params},
        )
        if response.status_code != 200:
            optional_detail = response.json().get("error")
--- a/langchain/llms/aleph_alpha.py
+++ b/langchain/llms/aleph_alpha.py
@@ -206,6 +206,7 @@ class AlephAlpha(LLM):
        prompt: str,
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> str:
        """Call out to Aleph Alpha's completion endpoint.

@@ -232,6 +233,7 @@ class AlephAlpha(LLM):
            params["stop_sequences"] = self.stop_sequences
        else:
            params["stop_sequences"] = stop
+        params = {**params, **kwargs}
        request = CompletionRequest(prompt=Prompt.from_text(prompt), **params)
        response = self.client.complete(model=self.model, request=request)
        text = response.completions[0].completion
--- a/langchain/llms/anthropic.py
+++ b/langchain/llms/anthropic.py
@@ -162,6 +162,7 @@ class Anthropic(LLM, _AnthropicCommon):
        prompt: str,
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> str:
        r"""Call out to Anthropic's completion endpoint.

@@ -181,11 +182,12 @@ class Anthropic(LLM, _AnthropicCommon):

        """
        stop = self._get_anthropic_stop(stop)
+        params = {**self._default_params, **kwargs}
        if self.streaming:
            stream_resp = self.client.completion_stream(
                prompt=self._wrap_prompt(prompt),
                stop_sequences=stop,
-                **self._default_params,
+                **params,
            )
            current_completion = ""
            for data in stream_resp:
@@ -197,7 +199,7 @@ class Anthropic(LLM, _AnthropicCommon):
        response = self.client.completion(
            prompt=self._wrap_prompt(prompt),
            stop_sequences=stop,
-            **self._default_params,
+            **params,
        )
        return response["completion"]

@@ -206,14 +208,16 @@ class Anthropic(LLM, _AnthropicCommon):
        prompt: str,
        stop: Optional[List[str]] = None,
        run_manager: Optional[AsyncCallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> str:
        """Call out to Anthropic's completion endpoint asynchronously."""
        stop = self._get_anthropic_stop(stop)
+        params = {**self._default_params, **kwargs}
        if self.streaming:
            stream_resp = await self.client.acompletion_stream(
                prompt=self._wrap_prompt(prompt),
                stop_sequences=stop,
-                **self._default_params,
+                **params,
            )
            current_completion = ""
            async for data in stream_resp:
@@ -225,7 +229,7 @@ class Anthropic(LLM, _AnthropicCommon):
        response = await self.client.acompletion(
            prompt=self._wrap_prompt(prompt),
            stop_sequences=stop,
-            **self._default_params,
+            **params,
        )
        return response["completion"]

--- a/langchain/llms/anyscale.py
+++ b/langchain/llms/anyscale.py
@@ -88,6 +88,7 @@ class Anyscale(LLM):
        prompt: str,
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> str:
        """Call out to Anyscale Service endpoint.
        Args:
--- a/langchain/llms/aviary.py
+++ b/langchain/llms/aviary.py
@@ -105,6 +105,7 @@ class Aviary(LLM):
        prompt: str,
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> str:
        """Call out to Aviary
        Args:
--- a/langchain/llms/bananadev.py
+++ b/langchain/llms/bananadev.py
@@ -87,6 +87,7 @@ class Banana(LLM):
        prompt: str,
        stop: Optional[List[str]] = None,
        run_manager: Optional[CallbackManagerForLLMRun] = None,
+        **kwargs: Any,
    ) -> str:
        """Call to Banana endpoint."""
        try:
@@ -97,6 +98,7 @@ class Banana(LLM):
                "Please install it with `pip install banana-dev`."
            )
        params = self.model_kwargs or {}
+        params = {**params, **kwargs}
        api_key = self.banana_api_key
        model_key = self.model_key
        model_inputs = {
--- a/Show More
+++ b/Show More