bump version to 131 (#2391 )

Revision of "elasticearch" spelling problem (#2378 )
Revision of "elasticearch" spelling problem Co-authored-by: gubei <>
2026-02-10 03:00:59 +00:00 · 2023-04-04 07:21:50 -07:00 · 2023-04-04 06:59:50 -07:00 · 2023-04-04 06:49:17 -07:00 · 2023-04-04 06:48:45 -07:00 · 2023-04-04 06:48:34 -07:00
105 changed files with 3312 additions and 779 deletions
--- a/.dockerignore
+++ b/.dockerignore
@@ -1,2 +1,6 @@
 .venv
-.github
+.github
+.git
+.mypy_cache
+.pytest_cache
+Dockerfile
--- a/.github/CONTRIBUTING.md
+++ b/.github/CONTRIBUTING.md
@@ -46,7 +46,7 @@ good code into the codebase.

 ### 🏭Release process

-As of now, LangChain has an ad hoc release process: releases are cut with high frequency via by
+As of now, LangChain has an ad hoc release process: releases are cut with high frequency by
 a developer and published to [PyPI](https://pypi.org/project/langchain/).

 LangChain follows the [semver](https://semver.org/) versioning standard. However, as pre-1.0 software,
@@ -123,6 +123,12 @@ To run unit tests:
 make test
 ```

+To run unit tests in Docker:
+
+```bash
+make docker_tests
+```
+
 If you add new logic, please add a unit test.

 Integration tests cover logic that requires making calls to outside APIs (often integration with other services).
--- a/35
+++ b/35
@@ -1,20 +1,23 @@
+# This is a Dockerfile for running unit tests
+
 # Use the Python base image
 FROM python:3.11.2-bullseye AS builder

-# Print Python version
-RUN echo "Python version:" && python --version && echo ""
+# Define the version of Poetry to install (default is 1.4.2)
+ARG POETRY_VERSION=1.4.2

-# Install Poetry
-RUN echo "Installing Poetry..." && \
-    curl -sSL https://raw.githubusercontent.com/python-poetry/poetry/master/install-poetry.py | python -
+# Define the directory to install Poetry to (default is /opt/poetry)
+ARG POETRY_HOME=/opt/poetry

-# Add Poetry to PATH
-ENV PATH="${PATH}:/root/.local/bin"
+# Create a Python virtual environment for Poetry and install it
+RUN python3 -m venv ${POETRY_HOME} && \
+    $POETRY_HOME/bin/pip install --upgrade pip && \
+    $POETRY_HOME/bin/pip install poetry==${POETRY_VERSION}

-# Test if Poetry is added to PATH
-RUN echo "Poetry version:" && poetry --version && echo ""
+# Test if Poetry is installed in the expected path
+RUN echo "Poetry version:" && $POETRY_HOME/bin/poetry --version

-# Set working directory
+# Set the working directory for the app
 WORKDIR /app

 # Use a multi-stage build to install dependencies
@@ -23,8 +26,8 @@ FROM builder AS dependencies
 # Copy only the dependency files for installation
 COPY pyproject.toml poetry.lock poetry.toml ./

-# Install Poetry dependencies (this layer will be cached as long as the dependencies don't change)
-RUN poetry install --no-interaction --no-ansi
+# Install the Poetry dependencies (this layer will be cached as long as the dependencies don't change)
+RUN $POETRY_HOME/bin/poetry install --no-interaction --no-ansi

 # Use a multi-stage build to run tests
 FROM dependencies AS tests
@@ -32,8 +35,10 @@ FROM dependencies AS tests
 # Copy the rest of the app source code (this layer will be invalidated and rebuilt whenever the source code changes)
 COPY . .

-# Set entrypoint to run tests
-ENTRYPOINT ["poetry", "run", "pytest"]
+RUN /opt/poetry/bin/poetry install --no-interaction --no-ansi

-# Set default command to run all unit tests
+# Set the entrypoint to run tests using Poetry
+ENTRYPOINT ["/opt/poetry/bin/poetry", "run", "pytest"]
+
+# Set the default command to run all unit tests
 CMD ["tests/unit_tests"]
--- a/docs/ecosystem/aim_tracking.ipynb
+++ b/docs/ecosystem/aim_tracking.ipynb
@@ -205,7 +205,8 @@
   },
   "outputs": [],
   "source": [
-    "from langchain.agents import initialize_agent, load_tools"
+    "from langchain.agents import initialize_agent, load_tools\n",
+    "from langchain.agents import AgentType"
   ]
  },
  {
@@ -252,7 +253,7 @@
    "agent = initialize_agent(\n",
    "    tools,\n",
    "    llm,\n",
-    "    agent=\"zero-shot-react-description\",\n",
+    "    agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION,\n",
    "    callback_manager=manager,\n",
    "    verbose=True,\n",
    ")\n",
--- a/docs/ecosystem/clearml_tracking.ipynb
+++ b/docs/ecosystem/clearml_tracking.ipynb
@@ -520,13 +520,14 @@
   ],
   "source": [
    "from langchain.agents import initialize_agent, load_tools\n",
+    "from langchain.agents import AgentType\n",
    "\n",
    "# SCENARIO 2 - Agent with Tools\n",
    "tools = load_tools([\"serpapi\", \"llm-math\"], llm=llm, callback_manager=manager)\n",
    "agent = initialize_agent(\n",
    "    tools,\n",
    "    llm,\n",
-    "    agent=\"zero-shot-react-description\",\n",
+    "    agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION,\n",
    "    callback_manager=manager,\n",
    "    verbose=True,\n",
    ")\n",
--- a/docs/ecosystem/google_serper.md
+++ b/docs/ecosystem/google_serper.md
@@ -23,6 +23,7 @@ You can use it as part of a Self Ask chain:
 from langchain.utilities import GoogleSerperAPIWrapper
 from langchain.llms.openai import OpenAI
 from langchain.agents import initialize_agent, Tool
+from langchain.agents import AgentType

 import os

@@ -39,7 +40,7 @@ tools = [
    )
 ]

-self_ask_with_search = initialize_agent(tools, llm, agent="self-ask-with-search", verbose=True)
+self_ask_with_search = initialize_agent(tools, llm, agent=AgentType.SELF_ASK_WITH_SEARCH, verbose=True)
 self_ask_with_search.run("What is the hometown of the reigning men's U.S. Open champion?")
 ```

--- a/docs/ecosystem/gpt4all.md
+++ b/docs/ecosystem/gpt4all.md
@@ -0,0 +1,37 @@
+# GPT4All
+
+This page covers how to use the `GPT4All` wrapper within LangChain.
+It is broken into two parts: installation and setup, and then usage with an example.
+
+## Installation and Setup
+- Install the Python package with `pip install pyllamacpp`
+- Download a [GPT4All model](https://github.com/nomic-ai/gpt4all) and place it in your desired directory
+
+## Usage
+
+### GPT4All
+
+To use the GPT4All wrapper, you need to provide the path to the pre-trained model file and the model's configuration.
+```python
+from langchain.llms import GPT4All
+
+# Instantiate the model
+model = GPT4All(model="./models/gpt4all-model.bin", n_ctx=512, n_threads=8)
+
+# Generate text
+response = model("Once upon a time, ")
+```
+
+You can also customize the generation parameters, such as n_predict, temp, top_p, top_k, and others.
+
+Example:
+
+```python
+model = GPT4All(model="./models/gpt4all-model.bin", n_predict=55, temp=0)
+response = model("Once upon a time, ")
+```
+## Model File
+
+You can find links to model file downloads at the [GPT4all](https://github.com/nomic-ai/gpt4all) repository. They will need to be converted to `ggml` format to work, as specified in the [pyllamacpp](https://github.com/nomic-ai/pyllamacpp) repository.
+
+For a more detailed walkthrough of this, see [this notebook](../modules/models/llms/integrations/gpt4all.ipynb)
--- a/docs/ecosystem/llamacpp.md
+++ b/docs/ecosystem/llamacpp.md
@@ -0,0 +1,26 @@
+# Llama.cpp
+
+This page covers how to use [llama.cpp](https://github.com/ggerganov/llama.cpp) within LangChain.
+It is broken into two parts: installation and setup, and then references to specific Jina wrappers.
+
+## Installation and Setup
+- Install the Python package with `pip install llama-cpp-python`
+- Download one of the [supported models](https://github.com/ggerganov/llama.cpp#description) and convert them to the llama.cpp format per the [instructions](https://github.com/ggerganov/llama.cpp)
+
+## Wrappers
+
+### LLM
+
+There exists a LlamaCpp LLM wrapper, which you can access with 
+```python
+from langchain.llms import LlamaCpp
+```
+For a more detailed walkthrough of this, see [this notebook](../modules/models/llms/integrations/llamacpp.ipynb)
+
+### Embeddings
+
+There exists a LlamaCpp Embeddings wrapper, which you can access with 
+```python
+from langchain.embeddings import LlamaCppEmbeddings
+```
+For a more detailed walkthrough of this, see [this notebook](../modules/models/text_embedding/examples/llamacpp.ipynb)
--- a/docs/ecosystem/wandb_tracking.ipynb
+++ b/docs/ecosystem/wandb_tracking.ipynb
@@ -505,7 +505,8 @@
   },
   "outputs": [],
   "source": [
-    "from langchain.agents import initialize_agent, load_tools"
+    "from langchain.agents import initialize_agent, load_tools\n",
+    "from langchain.agents import AgentType"
   ]
  },
  {
@@ -580,7 +581,7 @@
    "agent = initialize_agent(\n",
    "    tools,\n",
    "    llm,\n",
-    "    agent=\"zero-shot-react-description\",\n",
+    "    agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION,\n",
    "    callback_manager=manager,\n",
    "    verbose=True,\n",
    ")\n",
--- a/docs/getting_started/getting_started.md
+++ b/docs/getting_started/getting_started.md
@@ -197,6 +197,7 @@ Now we can get started!
 ```python
 from langchain.agents import load_tools
 from langchain.agents import initialize_agent
+from langchain.agents import AgentType
 from langchain.llms import OpenAI

 # First, let's load the language model we're going to use to control the agent.
@@ -207,7 +208,7 @@ tools = load_tools(["serpapi", "llm-math"], llm=llm)


 # Finally, let's initialize an agent with the tools, the language model, and the type of agent we want to use.
-agent = initialize_agent(tools, llm, agent="zero-shot-react-description", verbose=True)
+agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)

 # Now let's test it out!
 agent.run("What was the high temperature in SF yesterday in Fahrenheit? What is that number raised to the .023 power?")
@@ -404,11 +405,12 @@ chain.run(input_language="English", output_language="French", text="I love progr
 `````

 `````{dropdown} Agents with Chat Models
-Agents can also be used with chat models, you can initialize one using `"chat-zero-shot-react-description"` as the agent type.
+Agents can also be used with chat models, you can initialize one using `AgentType.CHAT_ZERO_SHOT_REACT_DESCRIPTION` as the agent type.

 ```python
 from langchain.agents import load_tools
 from langchain.agents import initialize_agent
+from langchain.agents import AgentType
 from langchain.chat_models import ChatOpenAI
 from langchain.llms import OpenAI

@@ -421,7 +423,7 @@ tools = load_tools(["serpapi", "llm-math"], llm=llm)


 # Finally, let's initialize an agent with the tools, the language model, and the type of agent we want to use.
-agent = initialize_agent(tools, chat, agent="chat-zero-shot-react-description", verbose=True)
+agent = initialize_agent(tools, chat, agent=AgentType.CHAT_ZERO_SHOT_REACT_DESCRIPTION, verbose=True)

 # Now let's test it out!
 agent.run("Who is Olivia Wilde's boyfriend? What is his current age raised to the 0.23 power?")
--- a/docs/modules/agents.rst
+++ b/docs/modules/agents.rst
@@ -10,7 +10,7 @@ but potentially an unknown chain that depends on the user's input.
 In these types of chains, there is a “agent” which has access to a suite of tools.
 Depending on the user input, the agent can then decide which, if any, of these tools to call.

-In this section of documentation, we first start with a Getting Started notebook to over over how to use all things related to agents in an end-to-end manner.
+In this section of documentation, we first start with a Getting Started notebook to cover how to use all things related to agents in an end-to-end manner.

 .. toctree::
   :maxdepth: 1
--- a/docs/modules/agents/agent_executors/examples/agent_vectorstore.ipynb
+++ b/docs/modules/agents/agent_executors/examples/agent_vectorstore.ipynb
@@ -9,7 +9,7 @@
    "\n",
    "This notebook covers how to combine agents and vectorstores. The use case for this is that you've ingested your data into a vectorstore and want to interact with it in an agentic manner.\n",
    "\n",
-    "The reccomended method for doing so is to create a VectorDBQAChain and then use that as a tool in the overall agent. Let's take a look at doing this below. You can do this with multiple different vectordbs, and use the agent as a way to route between them. There are two different ways of doing this - you can either let the agent use the vectorstores as normal tools, or you can set `return_direct=True` to really just use the agent as a router."
+    "The reccomended method for doing so is to create a RetrievalQA and then use that as a tool in the overall agent. Let's take a look at doing this below. You can do this with multiple different vectordbs, and use the agent as a way to route between them. There are two different ways of doing this - you can either let the agent use the vectorstores as normal tools, or you can set `return_direct=True` to really just use the agent as a router."
   ]
  },
  {
@@ -154,6 +154,7 @@
   "source": [
    "# Import things that are needed generically\n",
    "from langchain.agents import initialize_agent, Tool\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.tools import BaseTool\n",
    "from langchain.llms import OpenAI\n",
    "from langchain import LLMMathChain, SerpAPIWrapper"
@@ -189,7 +190,7 @@
   "source": [
    "# Construct the agent. We will use the default agent type here.\n",
    "# See documentation for a full list of options.\n",
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
@@ -316,7 +317,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
@@ -433,7 +434,7 @@
   "source": [
    "# Construct the agent. We will use the default agent type here.\n",
    "# See documentation for a full list of options.\n",
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
--- a/docs/modules/agents/agent_executors/examples/async_agent.ipynb
+++ b/docs/modules/agents/agent_executors/examples/async_agent.ipynb
@@ -39,6 +39,7 @@
    "import time\n",
    "\n",
    "from langchain.agents import initialize_agent, load_tools\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.llms import OpenAI\n",
    "from langchain.callbacks.stdout import StdOutCallbackHandler\n",
    "from langchain.callbacks.base import CallbackManager\n",
@@ -175,7 +176,7 @@
    "        llm = OpenAI(temperature=0)\n",
    "        tools = load_tools([\"llm-math\", \"serpapi\"], llm=llm)\n",
    "        agent = initialize_agent(\n",
-    "            tools, llm, agent=\"zero-shot-react-description\", verbose=True\n",
+    "            tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION verbose=True\n",
    "        )\n",
    "        agent.run(q)\n",
    "\n",
@@ -311,7 +312,7 @@
    "        llm = OpenAI(temperature=0, callback_manager=manager)\n",
    "        async_tools = load_tools([\"llm-math\", \"serpapi\"], llm=llm, aiosession=aiosession, callback_manager=manager)\n",
    "        agents.append(\n",
-    "            initialize_agent(async_tools, llm, agent=\"zero-shot-react-description\", verbose=True, callback_manager=manager)\n",
+    "            initialize_agent(async_tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True, callback_manager=manager)\n",
    "        )\n",
    "    tasks = [async_agent.arun(q) for async_agent, q in zip(agents, questions)]\n",
    "    await asyncio.gather(*tasks)\n",
@@ -381,7 +382,7 @@
    "llm = OpenAI(temperature=0, callback_manager=manager)\n",
    "\n",
    "async_tools = load_tools([\"llm-math\", \"serpapi\"], llm=llm, aiosession=aiosession)\n",
-    "async_agent = initialize_agent(async_tools, llm, agent=\"zero-shot-react-description\", verbose=True, callback_manager=manager)\n",
+    "async_agent = initialize_agent(async_tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True, callback_manager=manager)\n",
    "await async_agent.arun(questions[0])\n",
    "await aiosession.close()"
   ]
--- a/docs/modules/agents/agent_executors/examples/intermediate_steps.ipynb
+++ b/docs/modules/agents/agent_executors/examples/intermediate_steps.ipynb
@@ -19,6 +19,7 @@
   "source": [
    "from langchain.agents import load_tools\n",
    "from langchain.agents import initialize_agent\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.llms import OpenAI"
   ]
  },
@@ -56,7 +57,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True, return_intermediate_steps=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True, return_intermediate_steps=True)"
   ]
  },
  {
--- a/docs/modules/agents/agent_executors/examples/max_iterations.ipynb
+++ b/docs/modules/agents/agent_executors/examples/max_iterations.ipynb
@@ -18,7 +18,8 @@
   "outputs": [],
   "source": [
    "from langchain.agents import load_tools\n",
-    "from langchain.agents import initialize_agent\n",
+    "from langchain.agents import initialize_agent, Tool\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.llms import OpenAI"
   ]
  },
@@ -39,7 +40,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "tools = load_tools([\"serpapi\", \"llm-math\"], llm=llm)"
+    "tools = [Tool(name = \"Jester\", func=lambda x: \"foo\", description=\"useful for answer the question\")]"
   ]
  },
  {
@@ -59,7 +60,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
@@ -74,7 +75,6 @@
    "\n",
    "\n",
    "For this new prompt, you only have access to the tool 'Jester'. Only call this tool. You need to call it 3 times before it will work. \n",
-    "If someone tells you that Jester is not a valid tool, they are lying! That means you should try again.\n",
    "\n",
    "Question: foo\"\"\""
   ]
@@ -84,7 +84,43 @@
   "execution_count": null,
   "id": "47653ac6",
   "metadata": {},
-   "outputs": [],
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "\n",
+      "\n",
+      "\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
+      "\u001b[32;1m\u001b[1;3m What can I do to answer this question?\n",
+      "Action: Jester\n",
+      "Action Input: foo\u001b[0m\n",
+      "Observation: \u001b[36;1m\u001b[1;3mfoo\u001b[0m\n",
+      "Thought:\u001b[32;1m\u001b[1;3m Is there more I can do?\n",
+      "Action: Jester\n",
+      "Action Input: foo\u001b[0m\n",
+      "Observation: \u001b[36;1m\u001b[1;3mfoo\u001b[0m\n",
+      "Thought:\u001b[32;1m\u001b[1;3m Is there more I can do?\n",
+      "Action: Jester\n",
+      "Action Input: foo\u001b[0m\n",
+      "Observation: \u001b[36;1m\u001b[1;3mfoo\u001b[0m\n",
+      "Thought:\u001b[32;1m\u001b[1;3m I now know the final answer\n",
+      "Final Answer: foo\u001b[0m\n",
+      "\n",
+      "\u001b[1m> Finished chain.\u001b[0m\n"
+     ]
+    },
+    {
+     "data": {
+      "text/plain": [
+       "'foo'"
+      ]
+     },
+     "execution_count": 6,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
   "source": [
    "agent.run(adversarial_prompt)"
   ]
@@ -104,7 +140,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True, max_iterations=2)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True, max_iterations=2)"
   ]
  },
  {
@@ -163,7 +199,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True, max_iterations=2, early_stopping_method=\"generate\")"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True, max_iterations=2, early_stopping_method=\"generate\")"
   ]
  },
  {
--- a/docs/modules/agents/agents.rst
+++ b/docs/modules/agents/agents.rst
@@ -17,13 +17,15 @@ For a high level overview of the different types of agents, see the below docume

 For documentation on how to create a custom agent, see the below.

-We also have documentation for an in-depth dive into each agent type.

 .. toctree::
   :maxdepth: 1
   :glob:

   ./agents/custom_agent.ipynb
+   ./agents/custom_llm_agent.ipynb
+   ./agents/custom_llm_chat_agent.ipynb
+   ./agents/custom_mrkl_agent.ipynb

 We also have documentation for an in-depth dive into each agent type.

--- a/docs/modules/agents/agents/custom_llm_agent.ipynb
+++ b/docs/modules/agents/agents/custom_llm_agent.ipynb
@@ -60,7 +60,7 @@
   "id": "6df0253f",
   "metadata": {},
   "source": [
-    "# Set up tool\n",
+    "## Set up tool\n",
    "\n",
    "Set up any tools the agent may want to use. This may be necessary to put in the prompt (so that the agent knows to use these tools)."
   ]
@@ -88,7 +88,7 @@
   "id": "2e7a075c",
   "metadata": {},
   "source": [
-    "## Prompt Teplate\n",
+    "## Prompt Template\n",
    "\n",
    "This instructs the agent on what to do. Generally, the template should incorporate:\n",
    "    \n",
--- a/docs/modules/agents/agents/custom_llm_chat_agent.ipynb
+++ b/docs/modules/agents/agents/custom_llm_chat_agent.ipynb
@@ -0,0 +1,395 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "ba5f8741",
+   "metadata": {},
+   "source": [
+    "# Custom LLM Agent (with a ChatModel)\n",
+    "\n",
+    "This notebook goes through how to create your own custom agent based on a chat model.\n",
+    "\n",
+    "An LLM chat agent consists of three parts:\n",
+    "\n",
+    "- PromptTemplate: This is the prompt template that can be used to instruct the language model on what to do\n",
+    "- ChatModel: This is the language model that powers the agent\n",
+    "- `stop` sequence: Instructs the LLM to stop generating as soon as this string is found\n",
+    "- OutputParser: This determines how to parse the LLMOutput into an AgentAction or AgentFinish object\n",
+    "\n",
+    "\n",
+    "The LLMAgent is used in an AgentExecutor. This AgentExecutor can largely be thought of as a loop that:\n",
+    "1. Passes user input and any previous steps to the Agent (in this case, the LLMAgent)\n",
+    "2. If the Agent returns an `AgentFinish`, then return that directly to the user\n",
+    "3. If the Agent returns an `AgentAction`, then use that to call a tool and get an `Observation`\n",
+    "4. Repeat, passing the `AgentAction` and `Observation` back to the Agent until an `AgentFinish` is emitted.\n",
+    "    \n",
+    "`AgentAction` is a response that consists of `action` and `action_input`. `action` refers to which tool to use, and `action_input` refers to the input to that tool. `log` can also be provided as more context (that can be used for logging, tracing, etc).\n",
+    "\n",
+    "`AgentFinish` is a response that contains the final message to be sent back to the user. This should be used to end an agent run.\n",
+    "        \n",
+    "In this notebook we walk through how to create a custom LLM agent."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "fea4812c",
+   "metadata": {},
+   "source": [
+    "## Set up environment\n",
+    "\n",
+    "Do necessary imports, etc."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "id": "9af9734e",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.agents import Tool, AgentExecutor, LLMSingleActionAgent, AgentOutputParser\n",
+    "from langchain.prompts import BaseChatPromptTemplate\n",
+    "from langchain import SerpAPIWrapper, LLMChain\n",
+    "from langchain.chat_models import ChatOpenAI\n",
+    "from typing import List, Union\n",
+    "from langchain.schema import AgentAction, AgentFinish, HumanMessage\n",
+    "import re"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "6df0253f",
+   "metadata": {},
+   "source": [
+    "## Set up tool\n",
+    "\n",
+    "Set up any tools the agent may want to use. This may be necessary to put in the prompt (so that the agent knows to use these tools)."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "id": "becda2a1",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Define which tools the agent can use to answer user queries\n",
+    "search = SerpAPIWrapper()\n",
+    "tools = [\n",
+    "    Tool(\n",
+    "        name = \"Search\",\n",
+    "        func=search.run,\n",
+    "        description=\"useful for when you need to answer questions about current events\"\n",
+    "    )\n",
+    "]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "2e7a075c",
+   "metadata": {},
+   "source": [
+    "## Prompt Teplate\n",
+    "\n",
+    "This instructs the agent on what to do. Generally, the template should incorporate:\n",
+    "    \n",
+    "- `tools`: which tools the agent has access and how and when to call them.\n",
+    "- `intermediate_steps`: These are tuples of previous (`AgentAction`, `Observation`) pairs. These are generally not passed directly to the model, but the prompt template formats them in a specific way.\n",
+    "- `input`: generic user input"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "id": "339b1bb8",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Set up the base template\n",
+    "template = \"\"\"Answer the following questions as best you can, but speaking as a pirate might speak. You have access to the following tools:\n",
+    "\n",
+    "{tools}\n",
+    "\n",
+    "Use the following format:\n",
+    "\n",
+    "Question: the input question you must answer\n",
+    "Thought: you should always think about what to do\n",
+    "Action: the action to take, should be one of [{tool_names}]\n",
+    "Action Input: the input to the action\n",
+    "Observation: the result of the action\n",
+    "... (this Thought/Action/Action Input/Observation can repeat N times)\n",
+    "Thought: I now know the final answer\n",
+    "Final Answer: the final answer to the original input question\n",
+    "\n",
+    "Begin! Remember to speak as a pirate when giving your final answer. Use lots of \"Arg\"s\n",
+    "\n",
+    "Question: {input}\n",
+    "{agent_scratchpad}\"\"\""
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 7,
+   "id": "fd969d31",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# Set up a prompt template\n",
+    "class CustomPromptTemplate(BaseChatPromptTemplate):\n",
+    "    # The template to use\n",
+    "    template: str\n",
+    "    # The list of tools available\n",
+    "    tools: List[Tool]\n",
+    "    \n",
+    "    def format_messages(self, **kwargs) -> str:\n",
+    "        # Get the intermediate steps (AgentAction, Observation tuples)\n",
+    "        # Format them in a particular way\n",
+    "        intermediate_steps = kwargs.pop(\"intermediate_steps\")\n",
+    "        thoughts = \"\"\n",
+    "        for action, observation in intermediate_steps:\n",
+    "            thoughts += action.log\n",
+    "            thoughts += f\"\\nObservation: {observation}\\nThought: \"\n",
+    "        # Set the agent_scratchpad variable to that value\n",
+    "        kwargs[\"agent_scratchpad\"] = thoughts\n",
+    "        # Create a tools variable from the list of tools provided\n",
+    "        kwargs[\"tools\"] = \"\\n\".join([f\"{tool.name}: {tool.description}\" for tool in self.tools])\n",
+    "        # Create a list of tool names for the tools provided\n",
+    "        kwargs[\"tool_names\"] = \", \".join([tool.name for tool in self.tools])\n",
+    "        formatted = self.template.format(**kwargs)\n",
+    "        return [HumanMessage(content=formatted)]"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 8,
+   "id": "798ef9fb",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "prompt = CustomPromptTemplate(\n",
+    "    template=template,\n",
+    "    tools=tools,\n",
+    "    # This omits the `agent_scratchpad`, `tools`, and `tool_names` variables because those are generated dynamically\n",
+    "    # This includes the `intermediate_steps` variable because that is needed\n",
+    "    input_variables=[\"input\", \"intermediate_steps\"]\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "ef3a1af3",
+   "metadata": {},
+   "source": [
+    "## Output Parser\n",
+    "\n",
+    "The output parser is responsible for parsing the LLM output into `AgentAction` and `AgentFinish`. This usually depends heavily on the prompt used.\n",
+    "\n",
+    "This is where you can change the parsing to do retries, handle whitespace, etc"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "id": "7c6fe0d3",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "class CustomOutputParser(AgentOutputParser):\n",
+    "    \n",
+    "    def parse(self, llm_output: str) -> Union[AgentAction, AgentFinish]:\n",
+    "        # Check if agent should finish\n",
+    "        if \"Final Answer:\" in llm_output:\n",
+    "            return AgentFinish(\n",
+    "                # Return values is generally always a dictionary with a single `output` key\n",
+    "                # It is not recommended to try anything else at the moment :)\n",
+    "                return_values={\"output\": llm_output.split(\"Final Answer:\")[-1].strip()},\n",
+    "                log=llm_output,\n",
+    "            )\n",
+    "        # Parse out the action and action input\n",
+    "        regex = r\"Action: (.*?)[\\n]*Action Input:[\\s]*(.*)\"\n",
+    "        match = re.search(regex, llm_output, re.DOTALL)\n",
+    "        if not match:\n",
+    "            raise ValueError(f\"Could not parse LLM output: `{llm_output}`\")\n",
+    "        action = match.group(1).strip()\n",
+    "        action_input = match.group(2)\n",
+    "        # Return the action and action input\n",
+    "        return AgentAction(tool=action, tool_input=action_input.strip(\" \").strip('\"'), log=llm_output)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "id": "d278706a",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "output_parser = CustomOutputParser()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "170587b1",
+   "metadata": {},
+   "source": [
+    "## Set up LLM\n",
+    "\n",
+    "Choose the LLM you want to use!"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 12,
+   "id": "f9d4c374",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "llm = ChatOpenAI(temperature=0)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "caeab5e4",
+   "metadata": {},
+   "source": [
+    "## Define the stop sequence\n",
+    "\n",
+    "This is important because it tells the LLM when to stop generation.\n",
+    "\n",
+    "This depends heavily on the prompt and model you are using. Generally, you want this to be whatever token you use in the prompt to denote the start of an `Observation` (otherwise, the LLM may hallucinate an observation for you)."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "34be9f65",
+   "metadata": {},
+   "source": [
+    "## Set up the Agent\n",
+    "\n",
+    "We can now combine everything to set up our agent"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 13,
+   "id": "9b1cc2a2",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# LLM chain consisting of the LLM and a prompt\n",
+    "llm_chain = LLMChain(llm=llm, prompt=prompt)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 14,
+   "id": "e4f5092f",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "tool_names = [tool.name for tool in tools]\n",
+    "agent = LLMSingleActionAgent(\n",
+    "    llm_chain=llm_chain, \n",
+    "    output_parser=output_parser,\n",
+    "    stop=[\"\\nObservation:\"], \n",
+    "    allowed_tools=tool_names\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "aa8a5326",
+   "metadata": {},
+   "source": [
+    "## Use the Agent\n",
+    "\n",
+    "Now we can use it!"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 15,
+   "id": "490604e9",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "agent_executor = AgentExecutor.from_agent_and_tools(agent=agent, tools=tools, verbose=True)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 16,
+   "id": "653b1617",
+   "metadata": {},
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "\n",
+      "\n",
+      "\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
+      "\u001b[32;1m\u001b[1;3mThought: Wot year be it now? That be important to know the answer.\n",
+      "Action: Search\n",
+      "Action Input: \"current population canada 2023\"\u001b[0m\n",
+      "\n",
+      "Observation:\u001b[36;1m\u001b[1;3m38,649,283\u001b[0m\u001b[32;1m\u001b[1;3mAhoy! That be the correct year, but the answer be in regular numbers. 'Tis time to translate to pirate speak.\n",
+      "Action: Search\n",
+      "Action Input: \"38,649,283 in pirate speak\"\u001b[0m\n",
+      "\n",
+      "Observation:\u001b[36;1m\u001b[1;3mBrush up on your “Pirate Talk” with these helpful pirate phrases. Aaaarrrrgggghhhh! Pirate catch phrase of grumbling or disgust. Ahoy! Hello! Ahoy, Matey, Hello ...\u001b[0m\u001b[32;1m\u001b[1;3mThat be not helpful, I'll just do the translation meself.\n",
+      "Final Answer: Arrrr, thar be 38,649,283 scallywags in Canada as of 2023.\u001b[0m\n",
+      "\n",
+      "\u001b[1m> Finished chain.\u001b[0m\n"
+     ]
+    },
+    {
+     "data": {
+      "text/plain": [
+       "'Arrrr, thar be 38,649,283 scallywags in Canada as of 2023.'"
+      ]
+     },
+     "execution_count": 16,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "agent_executor.run(\"How many people live in canada as of 2023?\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "adefb4c2",
+   "metadata": {},
+   "outputs": [],
+   "source": []
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.9.1"
+  },
+  "vscode": {
+   "interpreter": {
+    "hash": "18784188d7ecd866c0586ac068b02361a6896dc3a29b64f5cc957f09c590acef"
+   }
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/modules/agents/agents/examples/chat_conversation_agent.ipynb
+++ b/docs/modules/agents/agents/examples/chat_conversation_agent.ipynb
@@ -34,7 +34,8 @@
    "from langchain.memory import ConversationBufferMemory\n",
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.utilities import SerpAPIWrapper\n",
-    "from langchain.agents import initialize_agent"
+    "from langchain.agents import initialize_agent\n",
+    "from langchain.agents import AgentType"
   ]
  },
  {
@@ -72,7 +73,7 @@
   "outputs": [],
   "source": [
    "llm=ChatOpenAI(temperature=0)\n",
-    "agent_chain = initialize_agent(tools, llm, agent=\"chat-conversational-react-description\", verbose=True, memory=memory)"
+    "agent_chain = initialize_agent(tools, llm, agent=AgentType.CHAT_CONVERSATIONAL_REACT_DESCRIPTION, verbose=True, memory=memory)"
   ]
  },
  {
--- a/docs/modules/agents/agents/examples/conversational_agent.ipynb
+++ b/docs/modules/agents/agents/examples/conversational_agent.ipynb
@@ -20,6 +20,7 @@
   "outputs": [],
   "source": [
    "from langchain.agents import Tool\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.memory import ConversationBufferMemory\n",
    "from langchain import OpenAI\n",
    "from langchain.utilities import GoogleSearchAPIWrapper\n",
@@ -61,7 +62,7 @@
   "outputs": [],
   "source": [
    "llm=OpenAI(temperature=0)\n",
-    "agent_chain = initialize_agent(tools, llm, agent=\"conversational-react-description\", verbose=True, memory=memory)"
+    "agent_chain = initialize_agent(tools, llm, agent=AgentType.CONVERSATIONAL_REACT_DESCRIPTION, verbose=True, memory=memory)"
   ]
  },
  {
--- a/docs/modules/agents/agents/examples/mrkl.ipynb
+++ b/docs/modules/agents/agents/examples/mrkl.ipynb
@@ -27,7 +27,8 @@
   "outputs": [],
   "source": [
    "from langchain import LLMMathChain, OpenAI, SerpAPIWrapper, SQLDatabase, SQLDatabaseChain\n",
-    "from langchain.agents import initialize_agent, Tool"
+    "from langchain.agents import initialize_agent, Tool\n",
+    "from langchain.agents import AgentType"
   ]
  },
  {
@@ -68,7 +69,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "mrkl = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "mrkl = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
--- a/docs/modules/agents/agents/examples/mrkl_chat.ipynb
+++ b/docs/modules/agents/agents/examples/mrkl_chat.ipynb
@@ -28,6 +28,7 @@
   "source": [
    "from langchain import OpenAI, LLMMathChain, SerpAPIWrapper, SQLDatabase, SQLDatabaseChain\n",
    "from langchain.agents import initialize_agent, Tool\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.chat_models import ChatOpenAI"
   ]
  },
@@ -70,7 +71,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "mrkl = initialize_agent(tools, llm, agent=\"chat-zero-shot-react-description\", verbose=True)"
+    "mrkl = initialize_agent(tools, llm, agent=AgentType.CHAT_ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
--- a/docs/modules/agents/agents/examples/react.ipynb
+++ b/docs/modules/agents/agents/examples/react.ipynb
@@ -19,6 +19,7 @@
   "source": [
    "from langchain import OpenAI, Wikipedia\n",
    "from langchain.agents import initialize_agent, Tool\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.agents.react.base import DocstoreExplorer\n",
    "docstore=DocstoreExplorer(Wikipedia())\n",
    "tools = [\n",
@@ -35,7 +36,7 @@
    "]\n",
    "\n",
    "llm = OpenAI(temperature=0, model_name=\"text-davinci-002\")\n",
-    "react = initialize_agent(tools, llm, agent=\"react-docstore\", verbose=True)"
+    "react = initialize_agent(tools, llm, agent=AgentType.REACT_DOCSTORE, verbose=True)"
   ]
  },
  {
--- a/docs/modules/agents/agents/examples/self_ask_with_search.ipynb
+++ b/docs/modules/agents/agents/examples/self_ask_with_search.ipynb
@@ -46,6 +46,7 @@
   "source": [
    "from langchain import OpenAI, SerpAPIWrapper\n",
    "from langchain.agents import initialize_agent, Tool\n",
+    "from langchain.agents import AgentType\n",
    "\n",
    "llm = OpenAI(temperature=0)\n",
    "search = SerpAPIWrapper()\n",
@@ -57,7 +58,7 @@
    "    )\n",
    "]\n",
    "\n",
-    "self_ask_with_search = initialize_agent(tools, llm, agent=\"self-ask-with-search\", verbose=True)\n",
+    "self_ask_with_search = initialize_agent(tools, llm, agent=AgentType.SELF_ASK_WITH_SEARCH, verbose=True)\n",
    "self_ask_with_search.run(\"What is the hometown of the reigning men's U.S. Open champion?\")"
   ]
  }
--- a/docs/modules/agents/getting_started.ipynb
+++ b/docs/modules/agents/getting_started.ipynb
@@ -92,7 +92,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
--- a/docs/modules/agents/toolkits/examples/json.ipynb
+++ b/docs/modules/agents/toolkits/examples/json.ipynb
@@ -41,7 +41,7 @@
    "from langchain.agents.agent_toolkits import JsonToolkit\n",
    "from langchain.chains import LLMChain\n",
    "from langchain.llms.openai import OpenAI\n",
-    "from langchain.requests import RequestsWrapper\n",
+    "from langchain.requests import TextRequestsWrapper\n",
    "from langchain.tools.json.tool import JsonSpec"
   ]
  },
--- a/docs/modules/agents/toolkits/examples/openapi.ipynb
+++ b/docs/modules/agents/toolkits/examples/openapi.ipynb
@@ -35,7 +35,7 @@
    "from langchain.agents import create_openapi_agent\n",
    "from langchain.agents.agent_toolkits import OpenAPIToolkit\n",
    "from langchain.llms.openai import OpenAI\n",
-    "from langchain.requests import RequestsWrapper\n",
+    "from langchain.requests import TextRequestsWrapper\n",
    "from langchain.tools.json.tool import JsonSpec"
   ]
  },
@@ -54,7 +54,7 @@
    "headers = {\n",
    "    \"Authorization\": f\"Bearer {os.getenv('OPENAI_API_KEY')}\"\n",
    "}\n",
-    "requests_wrapper=RequestsWrapper(headers=headers)\n",
+    "requests_wrapper=TextRequestsWrapper(headers=headers)\n",
    "openapi_toolkit = OpenAPIToolkit.from_llm(OpenAI(temperature=0), json_spec, requests_wrapper, verbose=True)\n",
    "openapi_agent_executor = create_openapi_agent(\n",
    "    llm=OpenAI(temperature=0),\n",
--- a/docs/modules/agents/tools/custom_tools.ipynb
+++ b/docs/modules/agents/tools/custom_tools.ipynb
@@ -27,6 +27,7 @@
   "source": [
    "# Import things that are needed generically\n",
    "from langchain.agents import initialize_agent, Tool\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.tools import BaseTool\n",
    "from langchain.llms import OpenAI\n",
    "from langchain import LLMMathChain, SerpAPIWrapper"
@@ -102,7 +103,7 @@
   "source": [
    "# Construct the agent. We will use the default agent type here.\n",
    "# See documentation for a full list of options.\n",
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
@@ -217,7 +218,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
@@ -410,7 +411,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
@@ -484,6 +485,7 @@
   "source": [
    "# Import things that are needed generically\n",
    "from langchain.agents import initialize_agent, Tool\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.llms import OpenAI\n",
    "from langchain import LLMMathChain, SerpAPIWrapper\n",
    "search = SerpAPIWrapper()\n",
@@ -500,7 +502,7 @@
    "    )\n",
    "]\n",
    "\n",
-    "agent = initialize_agent(tools, OpenAI(temperature=0), agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, OpenAI(temperature=0), agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
@@ -576,7 +578,7 @@
   "outputs": [],
   "source": [
    "llm = OpenAI(temperature=0)\n",
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
--- a/docs/modules/agents/tools/examples/chatgpt_plugins.ipynb
+++ b/docs/modules/agents/tools/examples/chatgpt_plugins.ipynb
@@ -23,6 +23,7 @@
   "source": [
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.agents import load_tools, initialize_agent\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.tools import AIPluginTool"
   ]
  },
@@ -83,7 +84,7 @@
    "tools = load_tools([\"requests\"] )\n",
    "tools += [tool]\n",
    "\n",
-    "agent_chain = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)\n",
+    "agent_chain = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION verbose=True)\n",
    "agent_chain.run(\"what t shirts are available in klarna?\")"
   ]
  },
--- a/docs/modules/agents/tools/examples/google_serper.ipynb
+++ b/docs/modules/agents/tools/examples/google_serper.ipynb
@@ -115,6 +115,7 @@
    "from langchain.utilities import GoogleSerperAPIWrapper\n",
    "from langchain.llms.openai import OpenAI\n",
    "from langchain.agents import initialize_agent, Tool\n",
+    "from langchain.agents import AgentType\n",
    "\n",
    "llm = OpenAI(temperature=0)\n",
    "search = GoogleSerperAPIWrapper()\n",
@@ -126,7 +127,7 @@
    "    )\n",
    "]\n",
    "\n",
-    "self_ask_with_search = initialize_agent(tools, llm, agent=\"self-ask-with-search\", verbose=True)\n",
+    "self_ask_with_search = initialize_agent(tools, llm, agent=AgentType.SELF_ASK_WITH_SEARCH, verbose=True)\n",
    "self_ask_with_search.run(\"What is the hometown of the reigning men's U.S. Open champion?\")"
   ],
   "metadata": {
--- a/docs/modules/agents/tools/examples/human_tools.ipynb
+++ b/docs/modules/agents/tools/examples/human_tools.ipynb
@@ -20,6 +20,7 @@
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.llms import OpenAI\n",
    "from langchain.agents import load_tools, initialize_agent\n",
+    "from langchain.agents import AgentType\n",
    "\n",
    "llm = ChatOpenAI(temperature=0.0)\n",
    "math_llm = OpenAI(temperature=0.0)\n",
@@ -31,7 +32,7 @@
    "agent_chain = initialize_agent(\n",
    "    tools,\n",
    "    llm,\n",
-    "    agent=\"zero-shot-react-description\",\n",
+    "    agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION,\n",
    "    verbose=True,\n",
    ")"
   ]
--- a/docs/modules/agents/tools/examples/requests.ipynb
+++ b/docs/modules/agents/tools/examples/requests.ipynb
@@ -17,7 +17,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "from langchain.utilities import RequestsWrapper"
+    "from langchain.utilities import TextRequestsWrapper"
   ]
  },
  {
@@ -27,7 +27,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "requests = RequestsWrapper()"
+    "requests = TextRequestsWrapper()"
   ]
  },
  {
--- a/docs/modules/agents/tools/examples/search_tools.ipynb
+++ b/docs/modules/agents/tools/examples/search_tools.ipynb
@@ -23,6 +23,7 @@
   "source": [
    "from langchain.agents import load_tools\n",
    "from langchain.agents import initialize_agent\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.llms import OpenAI"
   ]
  },
@@ -63,7 +64,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
@@ -131,7 +132,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
@@ -199,7 +200,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
@@ -266,7 +267,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
--- a/docs/modules/agents/tools/examples/zapier.ipynb
+++ b/docs/modules/agents/tools/examples/zapier.ipynb
@@ -77,6 +77,7 @@
    "from langchain.llms import OpenAI\n",
    "from langchain.agents import initialize_agent\n",
    "from langchain.agents.agent_toolkits import ZapierToolkit\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.utilities.zapier import ZapierNLAWrapper"
   ]
  },
@@ -105,7 +106,7 @@
    "llm = OpenAI(temperature=0)\n",
    "zapier = ZapierNLAWrapper()\n",
    "toolkit = ZapierToolkit.from_zapier_nla_wrapper(zapier)\n",
-    "agent = initialize_agent(toolkit.get_tools(), llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(toolkit.get_tools(), llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
--- a/docs/modules/agents/tools/multi_input_tool.ipynb
+++ b/docs/modules/agents/tools/multi_input_tool.ipynb
@@ -1,17 +1,18 @@
 {
 "cells": [
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "87455ddb",
   "metadata": {},
   "source": [
-    "# Multi Input Tools\n",
+    "# Multi-Input Tools\n",
    "\n",
    "This notebook shows how to use a tool that requires multiple inputs with an agent.\n",
    "\n",
-    "The difficulty in doing so comes from the fact that an agent decides it's next step from a language model, which outputs a string. So if that step requires multiple inputs, they need to be parsed from that. Therefor, the currently supported way to do this is write a smaller wrapper function that parses that a string into multiple inputs.\n",
+    "The difficulty in doing so comes from the fact that an agent decides its next step from a language model, which outputs a string. So if that step requires multiple inputs, they need to be parsed from that. Therefore, the currently supported way to do this is to write a smaller wrapper function that parses a string into multiple inputs.\n",
    "\n",
-    "For a concrete example, let's work on giving an agent access to a multiplication function, which takes as input two integers. In order to use this, we will tell the agent to generate the \"Action Input\" as a comma separated list of length two. We will then write a thin wrapper that takes a string, splits it into two around a comma, and passes both parsed sides as integers to the multiplication function."
+    "For a concrete example, let's work on giving an agent access to a multiplication function, which takes as input two integers. In order to use this, we will tell the agent to generate the \"Action Input\" as a comma-separated list of length two. We will then write a thin wrapper that takes a string, splits it into two around a comma, and passes both parsed sides as integers to the multiplication function."
   ]
  },
  {
@@ -22,7 +23,8 @@
   "outputs": [],
   "source": [
    "from langchain.llms import OpenAI\n",
-    "from langchain.agents import initialize_agent, Tool"
+    "from langchain.agents import initialize_agent, Tool\n",
+    "from langchain.agents import AgentType"
   ]
  },
  {
@@ -63,7 +65,7 @@
    "        description=\"useful for when you need to multiply two numbers together. The input to this tool should be a comma separated list of numbers of length two, representing the two numbers you want to multiply together. For example, `1,2` would be the input if you wanted to multiply 1 by 2.\"\n",
    "    )\n",
    "]\n",
-    "mrkl = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "mrkl = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
--- a/docs/modules/chains/examples/moderation.ipynb
+++ b/docs/modules/chains/examples/moderation.ipynb
@@ -1,6 +1,7 @@
 {
 "cells": [
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "b83e61ed",
   "metadata": {},
@@ -13,7 +14,7 @@
    "In this notebook, we will show:\n",
    "\n",
    "1. How to run any piece of text through a moderation chain.\n",
-    "2. How to append a Moderation chain to a LLMChain."
+    "2. How to append a Moderation chain to an LLMChain."
   ]
  },
  {
--- a/docs/modules/indexes/document_loaders/examples/email.ipynb
+++ b/docs/modules/indexes/document_loaders/examples/email.ipynb
@@ -7,7 +7,15 @@
   "source": [
    "# Email\n",
    "\n",
-    "This notebook shows how to load email (`.eml`) files."
+    "This notebook shows how to load email (`.eml`) and Microsoft Outlook (`.msg`) files."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "89caa348",
+   "metadata": {},
+   "source": [
+    "## Using Unstructured"
   ]
  },
  {
@@ -66,7 +74,7 @@
   "id": "8bf50cba",
   "metadata": {},
   "source": [
-    "## Retain Elements\n",
+    "### Retain Elements\n",
    "\n",
    "Under the hood, Unstructured creates different \"elements\" for different chunks of text. By default we combine those together, but you can easily keep that separation by specifying `mode=\"elements\"`."
   ]
@@ -112,10 +120,69 @@
    "data[0]"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "6a074515",
+   "metadata": {},
+   "source": [
+    "## Using OutlookMessageLoader"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 8,
+   "id": "1e7a8444",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.document_loaders import OutlookMessageLoader"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
+   "id": "77a055e6",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "loader = OutlookMessageLoader('example_data/fake-email.msg')"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 10,
+   "id": "789882de",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "data = loader.load()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 11,
+   "id": "46aa0632",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "Document(page_content='This is a test email to experiment with the MS Outlook MSG Extractor\\r\\n\\r\\n\\r\\n-- \\r\\n\\r\\n\\r\\nKind regards\\r\\n\\r\\n\\r\\n\\r\\n\\r\\nBrian Zhou\\r\\n\\r\\n', metadata={'subject': 'Test for TIF files', 'sender': 'Brian Zhou <brizhou@gmail.com>', 'date': 'Mon, 18 Nov 2013 16:26:24 +0800'})"
+      ]
+     },
+     "execution_count": 11,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "data[0]"
+   ]
+  },
  {
   "cell_type": "code",
   "execution_count": null,
-   "id": "6a074515",
+   "id": "2b223ce2",
   "metadata": {},
   "outputs": [],
   "source": []
--- a/docs/modules/indexes/document_loaders/examples/example_data/fake-email.msg
+++ b/docs/modules/indexes/document_loaders/examples/example_data/fake-email.msg
--- a/docs/modules/indexes/document_loaders/examples/url.ipynb
+++ b/docs/modules/indexes/document_loaders/examples/url.ipynb
@@ -52,6 +52,66 @@
   "source": [
    "data = loader.load()"
   ]
+  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "id": "f3afa135",
+   "metadata": {},
+   "source": [
+    "# Selenium URL Loader\n",
+    "\n",
+    "This covers how to load HTML documents from a list of URLs using the `SeleniumURLLoader`.\n",
+    "\n",
+    "Using selenium allows us to load pages that require JavaScript to render.\n",
+    "\n",
+    "## Setup\n",
+    "\n",
+    "To use the `SeleniumURLLoader`, you will need to install `selenium` and `unstructured`.\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "5fc50835",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.document_loaders import SeleniumURLLoader"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "24e896ce",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "urls = [\n",
+    "    \"https://www.youtube.com/watch?v=dQw4w9WgXcQ\",\n",
+    "    \"https://goo.gl/maps/NDSHwePEyaHMFGwh8\"\n",
+    "]"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "60a29397",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "loader = SeleniumURLLoader(urls=urls)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "0090cd57",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "data = loader.load()"
+   ]
  }
 ],
 "metadata": {
--- a/docs/modules/indexes/retrievers/examples/vectorstore-retriever.ipynb
+++ b/docs/modules/indexes/retrievers/examples/vectorstore-retriever.ipynb
@@ -66,7 +66,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "docs = retriever.get_relevant_documents(\"what did he say abotu ketanji brown jackson\")"
+    "docs = retriever.get_relevant_documents(\"what did he say about ketanji brown jackson\")"
   ]
  },
  {
--- a/docs/modules/indexes/vectorstores/examples/chroma.ipynb
+++ b/docs/modules/indexes/vectorstores/examples/chroma.ipynb
@@ -170,12 +170,13 @@
   ]
  },
  {
+   "attachments": {},
   "cell_type": "markdown",
   "id": "f568a322",
   "metadata": {},
   "source": [
    "### Persist the Database\n",
-    "In a notebook, we should call persist() to ensure the embeddings are written to disk. This isn't necessary in a script - the database will be automatically persisted when the client object is destroyed."
+    "We should call persist() to ensure the embeddings are written to disk."
   ]
  },
  {
--- a/docs/modules/indexes/vectorstores/examples/qdrant.ipynb
+++ b/docs/modules/indexes/vectorstores/examples/qdrant.ipynb
@@ -7,14 +7,23 @@
   "source": [
    "# Qdrant\n",
    "\n",
-    "This notebook shows how to use functionality related to the Qdrant vector database."
+    "This notebook shows how to use functionality related to the Qdrant vector database. There are various modes of how to run Qdrant, and depending on the chosen one, there will be some subtle differences. The options include:\n",
+    "\n",
+    "- Local mode, no server required\n",
+    "- On-premise server deployment\n",
+    "- Qdrant Cloud"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "id": "aac9563e",
-   "metadata": {},
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:22.282884Z",
+     "start_time": "2023-04-04T10:51:21.408077Z"
+    }
+   },
   "outputs": [],
   "source": [
    "from langchain.embeddings.openai import OpenAIEmbeddings\n",
@@ -27,10 +36,14 @@
   "cell_type": "code",
   "execution_count": 2,
   "id": "a3c3999a",
-   "metadata": {},
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:22.520144Z",
+     "start_time": "2023-04-04T10:51:22.285826Z"
+    }
+   },
   "outputs": [],
   "source": [
-    "from langchain.document_loaders import TextLoader\n",
    "loader = TextLoader('../../../state_of_the_union.txt')\n",
    "documents = loader.load()\n",
    "text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0)\n",
@@ -39,43 +52,536 @@
    "embeddings = OpenAIEmbeddings()"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "id": "eeead681",
+   "metadata": {},
+   "source": [
+    "## Connecting to Qdrant from LangChain\n",
+    "\n",
+    "### Local mode\n",
+    "\n",
+    "Python client allows you to run the same code in local mode without running the Qdrant server. That's great for testing things out and debugging or if you plan to store just a small amount of vectors. The embeddings might be fully kepy in memory or persisted on disk.\n",
+    "\n",
+    "#### In-memory\n",
+    "\n",
+    "For some testing scenarios and quick experiments, you may prefer to keep all the data in memory only, so it gets lost when the client is destroyed - usually at the end of your script/notebook."
+   ]
+  },
  {
   "cell_type": "code",
-   "execution_count": null,
+   "execution_count": 3,
+   "id": "8429667e",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:22.525091Z",
+     "start_time": "2023-04-04T10:51:22.522015Z"
+    }
+   },
+   "outputs": [],
+   "source": [
+    "qdrant = Qdrant.from_documents(\n",
+    "    docs, embeddings, \n",
+    "    location=\":memory:\",  # Local mode with in-memory storage only\n",
+    "    collection_name=\"my_documents\",\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "59f0b954",
+   "metadata": {},
+   "source": [
+    "#### On-disk storage\n",
+    "\n",
+    "Local mode, without using the Qdrant server, may also store your vectors on disk so they're persisted between runs."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "id": "24b370e2",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:24.827567Z",
+     "start_time": "2023-04-04T10:51:22.529080Z"
+    }
+   },
+   "outputs": [],
+   "source": [
+    "qdrant = Qdrant.from_documents(\n",
+    "    docs, embeddings, \n",
+    "    path=\"/tmp/local_qdrant\",\n",
+    "    collection_name=\"my_documents\",\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "749658ce",
+   "metadata": {},
+   "source": [
+    "### On-premise server deployment\n",
+    "\n",
+    "No matter if you choose to launch Qdrant locally with [a Docker container](https://qdrant.tech/documentation/install/), or select a Kubernetes deployment with [the official Helm chart](https://github.com/qdrant/qdrant-helm), the way you're going to connect to such an instance will be identical. You'll need to provide a URL pointing to the service."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "id": "91e7f5ce",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:24.832708Z",
+     "start_time": "2023-04-04T10:51:24.829905Z"
+    }
+   },
+   "outputs": [],
+   "source": [
+    "url = \"<---qdrant url here --->\"\n",
+    "qdrant = Qdrant.from_documents(\n",
+    "    docs, embeddings, \n",
+    "    url, prefer_grpc=True, \n",
+    "    collection_name=\"my_documents\",\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "c9e21ce9",
+   "metadata": {},
+   "source": [
+    "### Qdrant Cloud\n",
+    "\n",
+    "If you prefer not to keep yourself busy with managing the infrastructure, you can choose to set up a fully-managed Qdrant cluster on [Qdrant Cloud](https://cloud.qdrant.io/). There is a free forever 1GB cluster included for trying out. The main difference with using a managed version of Qdrant is that you'll need to provide an API key to secure your deployment from being accessed publicly."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
   "id": "dcf88bdf",
-   "metadata": {},
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:24.837599Z",
+     "start_time": "2023-04-04T10:51:24.834690Z"
+    }
+   },
   "outputs": [],
   "source": [
-    "host = \"<---host name here --->\"\n",
+    "url = \"<---qdrant cloud cluster url here --->\"\n",
    "api_key = \"<---api key here--->\"\n",
-    "qdrant = Qdrant.from_documents(docs, embeddings, host=host, prefer_grpc=True, api_key=api_key)\n",
-    "query = \"What did the president say about Ketanji Brown Jackson\""
+    "qdrant = Qdrant.from_documents(\n",
+    "    docs, embeddings, \n",
+    "    url, prefer_grpc=True, api_key=api_key, \n",
+    "    collection_name=\"my_documents\",\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "93540013",
+   "metadata": {},
+   "source": [
+    "## Reusing the same collection\n",
+    "\n",
+    "Both `Qdrant.from_texts` and `Qdrant.from_documents` methods are great to start using Qdrant with LangChain, but **they are going to destroy the collection and create it from scratch**! If you want to reuse the existing collection, you can always create an instance of `Qdrant` on your own and pass the `QdrantClient` instance with the connection details."
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": null,
+   "execution_count": 7,
+   "id": "b7b432d7",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:24.843090Z",
+     "start_time": "2023-04-04T10:51:24.840041Z"
+    }
+   },
+   "outputs": [],
+   "source": [
+    "del qdrant"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 8,
+   "id": "30a87570",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:24.854117Z",
+     "start_time": "2023-04-04T10:51:24.845385Z"
+    }
+   },
+   "outputs": [],
+   "source": [
+    "import qdrant_client\n",
+    "\n",
+    "client = qdrant_client.QdrantClient(\n",
+    "    path=\"/tmp/local_qdrant\", prefer_grpc=True\n",
+    ")\n",
+    "qdrant = Qdrant(\n",
+    "    client=client, collection_name=\"my_documents\", \n",
+    "    embedding_function=embeddings.embed_query\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "1f9215c8",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T09:27:29.920258Z",
+     "start_time": "2023-04-04T09:27:29.913714Z"
+    }
+   },
+   "source": [
+    "## Similarity search\n",
+    "\n",
+    "The simplest scenario for using Qdrant vector store is to perform a similarity search. Under the hood, our query will be encoded with the `embedding_function` and used to find similar documents in Qdrant collection."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 9,
   "id": "a8c513ab",
-   "metadata": {},
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:25.204469Z",
+     "start_time": "2023-04-04T10:51:24.855618Z"
+    }
+   },
   "outputs": [],
   "source": [
-    "docs = qdrant.similarity_search(query)"
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "found_docs = qdrant.similarity_search(query)"
   ]
  },
  {
   "cell_type": "code",
-   "execution_count": null,
+   "execution_count": 10,
   "id": "fc516993",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:25.220984Z",
+     "start_time": "2023-04-04T10:51:25.213943Z"
+    }
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Tonight. I call on the Senate to: Pass the Freedom to Vote Act. Pass the John Lewis Voting Rights Act. And while you’re at it, pass the Disclose Act so Americans can know who is funding our elections. \n",
+      "\n",
+      "Tonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. \n",
+      "\n",
+      "One of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. \n",
+      "\n",
+      "And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.\n"
+     ]
+    }
+   ],
+   "source": [
+    "print(found_docs[0].page_content)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "1bda9bf5",
   "metadata": {},
+   "source": [
+    "## Similarity search with score\n",
+    "\n",
+    "Sometimes we might want to perform the search, but also obtain a relevancy score to know how good is a particular result."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 11,
+   "id": "8804a21d",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:25.631585Z",
+     "start_time": "2023-04-04T10:51:25.227384Z"
+    }
+   },
   "outputs": [],
   "source": [
-    "docs[0]"
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "found_docs = qdrant.similarity_search_with_score(query)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 12,
+   "id": "756a6887",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:25.642282Z",
+     "start_time": "2023-04-04T10:51:25.635947Z"
+    }
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "Tonight. I call on the Senate to: Pass the Freedom to Vote Act. Pass the John Lewis Voting Rights Act. And while you’re at it, pass the Disclose Act so Americans can know who is funding our elections. \n",
+      "\n",
+      "Tonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. \n",
+      "\n",
+      "One of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. \n",
+      "\n",
+      "And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.\n",
+      "\n",
+      "Score: 0.8153784913324512\n"
+     ]
+    }
+   ],
+   "source": [
+    "document, score = found_docs[0]\n",
+    "print(document.page_content)\n",
+    "print(f\"\\nScore: {score}\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "c58c30bf",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:39:53.032744Z",
+     "start_time": "2023-04-04T10:39:53.028673Z"
+    }
+   },
+   "source": [
+    "## Maximum marginal relevance search (MMR)\n",
+    "\n",
+    "If you'd like to look up for some similar documents, but you'd also like to receive diverse results, MMR is method you should consider. Maximal marginal relevance optimizes for similarity to query AND diversity among selected documents."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 13,
+   "id": "76810fb6",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:26.010947Z",
+     "start_time": "2023-04-04T10:51:25.647687Z"
+    }
+   },
+   "outputs": [],
+   "source": [
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "found_docs = qdrant.max_marginal_relevance_search(query, k=2, fetch_k=10)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 14,
+   "id": "80c6db11",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:26.016979Z",
+     "start_time": "2023-04-04T10:51:26.013329Z"
+    }
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "1. Tonight. I call on the Senate to: Pass the Freedom to Vote Act. Pass the John Lewis Voting Rights Act. And while you’re at it, pass the Disclose Act so Americans can know who is funding our elections. \n",
+      "\n",
+      "Tonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. \n",
+      "\n",
+      "One of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. \n",
+      "\n",
+      "And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence. \n",
+      "\n",
+      "2. We can’t change how divided we’ve been. But we can change how we move forward—on COVID-19 and other issues we must face together. \n",
+      "\n",
+      "I recently visited the New York City Police Department days after the funerals of Officer Wilbert Mora and his partner, Officer Jason Rivera. \n",
+      "\n",
+      "They were responding to a 9-1-1 call when a man shot and killed them with a stolen gun. \n",
+      "\n",
+      "Officer Mora was 27 years old. \n",
+      "\n",
+      "Officer Rivera was 22. \n",
+      "\n",
+      "Both Dominican Americans who’d grown up on the same streets they later chose to patrol as police officers. \n",
+      "\n",
+      "I spoke with their families and told them that we are forever in debt for their sacrifice, and we will carry on their mission to restore the trust and safety every community deserves. \n",
+      "\n",
+      "I’ve worked on these issues a long time. \n",
+      "\n",
+      "I know what works: Investing in crime preventionand community police officers who’ll walk the beat, who’ll know the neighborhood, and who can restore trust and safety. \n",
+      "\n"
+     ]
+    }
+   ],
+   "source": [
+    "for i, doc in enumerate(found_docs):\n",
+    "    print(f\"{i + 1}.\", doc.page_content, \"\\n\")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "691a82d6",
+   "metadata": {},
+   "source": [
+    "## Qdrant as a Retriever\n",
+    "\n",
+    "Qdrant, as all the other vector stores, is a LangChain Retriever, by using cosine similarity. "
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 15,
+   "id": "9427195f",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:26.031451Z",
+     "start_time": "2023-04-04T10:51:26.018763Z"
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "VectorStoreRetriever(vectorstore=<langchain.vectorstores.qdrant.Qdrant object at 0x7fc4e5720a00>, search_type='similarity', search_kwargs={})"
+      ]
+     },
+     "execution_count": 15,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "retriever = qdrant.as_retriever()\n",
+    "retriever"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "0c851b4f",
+   "metadata": {},
+   "source": [
+    "It might be also specified to use MMR as a search strategy, instead of similarity."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 16,
+   "id": "64348f1b",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:26.043909Z",
+     "start_time": "2023-04-04T10:51:26.034284Z"
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "VectorStoreRetriever(vectorstore=<langchain.vectorstores.qdrant.Qdrant object at 0x7fc4e5720a00>, search_type='mmr', search_kwargs={})"
+      ]
+     },
+     "execution_count": 16,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "retriever = qdrant.as_retriever(search_type=\"mmr\")\n",
+    "retriever"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 17,
+   "id": "f3c70c31",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T10:51:26.495652Z",
+     "start_time": "2023-04-04T10:51:26.046407Z"
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "Document(page_content='Tonight. I call on the Senate to: Pass the Freedom to Vote Act. Pass the John Lewis Voting Rights Act. And while you’re at it, pass the Disclose Act so Americans can know who is funding our elections. \\n\\nTonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. \\n\\nOne of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. \\n\\nAnd I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.', metadata={'source': '../../../state_of_the_union.txt'})"
+      ]
+     },
+     "execution_count": 17,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "query = \"What did the president say about Ketanji Brown Jackson\"\n",
+    "retriever.get_relevant_documents(query)[0]"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "0358ecde",
+   "metadata": {},
+   "source": [
+    "## Customizing Qdrant\n",
+    "\n",
+    "Qdrant stores your vector embeddings along with the optional JSON-like payload. Payloads are optional, but since LangChain assumes the embeddings are generated from the documents, we keep the context data, so you can extract the original texts as well.\n",
+    "\n",
+    "By default, your document is going to be stored in the following payload structure:\n",
+    "\n",
+    "```json\n",
+    "{\n",
+    "    \"page_content\": \"Lorem ipsum dolor sit amet\",\n",
+    "    \"metadata\": {\n",
+    "        \"foo\": \"bar\"\n",
+    "    }\n",
+    "}\n",
+    "```\n",
+    "\n",
+    "You can, however, decide to use different keys for the page content and metadata. That's useful if you already have a collection that you'd like to reuse. You can always change the "
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 19,
+   "id": "e4d6baf9",
+   "metadata": {
+    "ExecuteTime": {
+     "end_time": "2023-04-04T11:08:31.739141Z",
+     "start_time": "2023-04-04T11:08:30.229748Z"
+    }
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "<langchain.vectorstores.qdrant.Qdrant at 0x7fc4e2baa230>"
+      ]
+     },
+     "execution_count": 19,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "Qdrant.from_documents(\n",
+    "    docs, embeddings, \n",
+    "    location=\":memory:\",\n",
+    "    collection_name=\"my_documents_2\",\n",
+    "    content_payload_key=\"my_page_content_key\",\n",
+    "    metadata_payload_key=\"my_meta\",\n",
+    ")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
-   "id": "a359ed74",
+   "id": "2300e785",
   "metadata": {},
   "outputs": [],
   "source": []
@@ -97,7 +603,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.9.1"
+   "version": "3.10.6"
  }
 },
 "nbformat": 4,
--- a/docs/modules/indexes/vectorstores/examples/zilliz.ipynb
+++ b/docs/modules/indexes/vectorstores/examples/zilliz.ipynb
@@ -0,0 +1,112 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "683953b3",
+   "metadata": {},
+   "source": [
+    "# Zilliz\n",
+    "\n",
+    "This notebook shows how to use functionality related to the Zilliz Cloud managed vector database.\n",
+    "\n",
+    "To run, you should have a Zilliz Cloud instance up and running: https://zilliz.com/cloud"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "aac9563e",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.embeddings.openai import OpenAIEmbeddings\n",
+    "from langchain.text_splitter import CharacterTextSplitter\n",
+    "from langchain.vectorstores import Milvus\n",
+    "from langchain.document_loaders import TextLoader"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "19a71422",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# replace \n",
+    "ZILLIZ_CLOUD_HOSTNAME = \"\" # example: \"in01-17f69c292d4a50a.aws-us-west-2.vectordb.zillizcloud.com\"\n",
+    "ZILLIZ_CLOUD_PORT = \"\" #example: \"19532\""
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "a3c3999a",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.document_loaders import TextLoader\n",
+    "loader = TextLoader('../../../state_of_the_union.txt')\n",
+    "documents = loader.load()\n",
+    "text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0)\n",
+    "docs = text_splitter.split_documents(documents)\n",
+    "\n",
+    "embeddings = OpenAIEmbeddings()"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "dcf88bdf",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "vector_db = Milvus.from_documents(\n",
+    "    docs,\n",
+    "    embeddings,\n",
+    "    connection_args={\"host\": ZILLIZ_CLOUD_HOSTNAME, \"port\": ZILLIZ_CLOUD_PORT},\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "a8c513ab",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "docs = vector_db.similarity_search(query)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "fc516993",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "docs[0]"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.8.9"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
--- a/docs/modules/models/llms/examples/fake_llm.ipynb
+++ b/docs/modules/models/llms/examples/fake_llm.ipynb
@@ -31,7 +31,8 @@
   "outputs": [],
   "source": [
    "from langchain.agents import load_tools\n",
-    "from langchain.agents import initialize_agent"
+    "from langchain.agents import initialize_agent\n",
+    "from langchain.agents import AgentType"
   ]
  },
  {
@@ -65,7 +66,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
--- a/docs/modules/models/llms/examples/llm_serialization.ipynb
+++ b/docs/modules/models/llms/examples/llm_serialization.ipynb
@@ -27,7 +27,7 @@
   "metadata": {},
   "source": [
    "## Loading\n",
-    "First, lets go over loading a LLM from disk. LLMs can be saved on disk in two formats: json or yaml. No matter the extension, they are loaded in the same way."
+    "First, lets go over loading an LLM from disk. LLMs can be saved on disk in two formats: json or yaml. No matter the extension, they are loaded in the same way."
   ]
  },
  {
@@ -112,7 +112,7 @@
   "metadata": {},
   "source": [
    "## Saving\n",
-    "If you want to go from a LLM in memory to a serialized version of it, you can do so easily by calling the `.save` method. Again, this supports both json and yaml."
+    "If you want to go from an LLM in memory to a serialized version of it, you can do so easily by calling the `.save` method. Again, this supports both json and yaml."
   ]
  },
  {
--- a/docs/modules/models/llms/examples/token_usage_tracking.ipynb
+++ b/docs/modules/models/llms/examples/token_usage_tracking.ipynb
@@ -107,11 +107,12 @@
   "source": [
    "from langchain.agents import load_tools\n",
    "from langchain.agents import initialize_agent\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.llms import OpenAI\n",
    "\n",
    "llm = OpenAI(temperature=0)\n",
    "tools = load_tools([\"serpapi\", \"llm-math\"], llm=llm)\n",
-    "agent = initialize_agent(tools, llm, agent=\"zero-shot-react-description\", verbose=True)"
+    "agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)"
   ]
  },
  {
--- a/docs/modules/models/llms/integrations/gpt4all.ipynb
+++ b/docs/modules/models/llms/integrations/gpt4all.ipynb
@@ -0,0 +1,97 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# GPT4all\n",
+    "\n",
+    "This example goes over how to use LangChain to interact with GPT4All models"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "!pip install pyllamacpp"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.llms import GPT4All\n",
+    "from langchain import PromptTemplate, LLMChain"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "template = \"\"\"Question: {question}\n",
+    "\n",
+    "Answer: Let's think step by step.\"\"\"\n",
+    "\n",
+    "prompt = PromptTemplate(template=template, input_variables=[\"question\"])"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# You'll need to download a compatible model and convert it to ggml.\n",
+    "# See: https://github.com/nomic-ai/gpt4all for more information.\n",
+    "llm = GPT4All(model_path=\"./models/gpt4all-model.bin\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "llm_chain = LLMChain(prompt=prompt, llm=llm)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "question = \"What NFL team won the Super Bowl in the year Justin Bieber was born?\"\n",
+    "\n",
+    "llm_chain.run(question)"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3 (ipykernel)",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.9.1"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}
--- a/docs/modules/models/llms/integrations/llamacpp.ipynb
+++ b/docs/modules/models/llms/integrations/llamacpp.ipynb
@@ -0,0 +1,98 @@
+{
+ "cells": [
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "!pip install llama-cpp-python"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.llms import LlamaCpp\n",
+    "from langchain import PromptTemplate, LLMChain"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "template = \"\"\"Question: {question}\n",
+    "\n",
+    "Answer: Let's think step by step.\"\"\"\n",
+    "\n",
+    "prompt = PromptTemplate(template=template, input_variables=[\"question\"])"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "llm = LlamaCpp(model_path=\"./ggml-model-q4_0.bin\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "llm_chain = LLMChain(prompt=prompt, llm=llm)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "'\\n\\nWe know that Justin Bieber is currently 25 years old and that he was born on March 1st, 1994 and that he is a singer and he has an album called Purpose, so we know that he was born when Super Bowl XXXVIII was played between Dallas and Seattle and that it took place February 1st, 2004 and that the Seattle Seahawks won 24-21, so Seattle is our answer!'"
+      ]
+     },
+     "execution_count": 6,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "question = \"What NFL team won the Super Bowl in the year Justin Bieber was born?\"\n",
+    "\n",
+    "llm_chain.run(question)"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "workspace",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.9.13"
+  },
+  "orig_nbformat": 4
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}
--- a/docs/modules/models/text_embedding/examples/llamacpp.ipynb
+++ b/docs/modules/models/text_embedding/examples/llamacpp.ipynb
@@ -0,0 +1,66 @@
+{
+ "cells": [
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "!pip install llama-cpp-python"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain.embeddings import LlamaCppEmbeddings"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "llama = LlamaCppEmbeddings(model_path=\"/path/to/model/ggml-model-q4_0.bin\")"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "text = \"This is a test document.\""
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "query_result = embeddings.embed_query(text)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "doc_result = embeddings.embed_documents([text])"
+   ]
+  }
+ ],
+ "metadata": {
+  "language_info": {
+   "name": "python"
+  },
+  "orig_nbformat": 4
+ },
+ "nbformat": 4,
+ "nbformat_minor": 2
+}
--- a/docs/reference/integrations.md
+++ b/docs/reference/integrations.md
@@ -52,6 +52,9 @@ The following use cases require specific installs and api keys:
  - If you want to set up OpenSearch on your local, [here](https://opensearch.org/docs/latest/)
 - _DeepLake_:
  - Install requirements with `pip install deeplake`
+- _LlamaCpp_:
+  - Install requirements with `pip install llama-cpp-python`
+  - Download model and convert following [llama.cpp instructions](https://github.com/ggerganov/llama.cpp)


 If you are using the `NLTKTextSplitter` or the `SpacyTextSplitter`, you will also need to install the appropriate models. For example, if you want to use the `SpacyTextSplitter`, you will need to install the `en_core_web_sm` model with `python -m spacy download en_core_web_sm`. Similarly, if you want to use the `NLTKTextSplitter`, you will need to install the `punkt` model with `python -m nltk.downloader punkt`.
--- a/docs/tracing/agent_with_tracing.ipynb
+++ b/docs/tracing/agent_with_tracing.ipynb
@@ -35,6 +35,7 @@
    "\n",
    "import langchain\n",
    "from langchain.agents import Tool, initialize_agent, load_tools\n",
+    "from langchain.agents import AgentType\n",
    "from langchain.chat_models import ChatOpenAI\n",
    "from langchain.llms import OpenAI"
   ]
@@ -93,7 +94,7 @@
   ],
   "source": [
    "agent = initialize_agent(\n",
-    "    tools, llm, agent=\"zero-shot-react-description\", verbose=True\n",
+    "    tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True\n",
    ")\n",
    "\n",
    "agent.run(\"What is 2 raised to .123243 power?\")"
@@ -177,7 +178,7 @@
   "source": [
    "# Agent run with tracing using a chat model\n",
    "agent = initialize_agent(\n",
-    "    tools, ChatOpenAI(temperature=0), agent=\"chat-zero-shot-react-description\", verbose=True\n",
+    "    tools, ChatOpenAI(temperature=0), agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True\n",
    ")\n",
    "\n",
    "agent.run(\"What is 2 raised to .123243 power?\")"
--- a/docs/use_cases/evaluation/agent_benchmarking.ipynb
+++ b/docs/use_cases/evaluation/agent_benchmarking.ipynb
@@ -85,9 +85,10 @@
    "from langchain.llms import OpenAI\n",
    "from langchain.chains import LLMMathChain\n",
    "from langchain.agents import initialize_agent, Tool, load_tools\n",
+    "from langchain.agents import AgentType\n",
    "\n",
    "tools = load_tools(['serpapi', 'llm-math'], llm=OpenAI(temperature=0))\n",
-    "agent = initialize_agent(tools, OpenAI(temperature=0), agent=\"zero-shot-react-description\")\n"
+    "agent = initialize_agent(tools, OpenAI(temperature=0), agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION)\n"
   ]
  },
  {
--- a/docs/use_cases/evaluation/agent_vectordb_sota_pg.ipynb
+++ b/docs/use_cases/evaluation/agent_vectordb_sota_pg.ipynb
@@ -255,6 +255,7 @@
   "outputs": [],
   "source": [
    "from langchain.agents import initialize_agent, Tool\n",
+    "from langchain.agents import AgentType\n",
    "tools = [\n",
    "    Tool(\n",
    "        name = \"State of Union QA System\",\n",
@@ -276,7 +277,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "agent = initialize_agent(tools, OpenAI(temperature=0), agent=\"zero-shot-react-description\", max_iterations=3)"
+    "agent = initialize_agent(tools, OpenAI(temperature=0), agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, max_iterations=3)"
   ]
  },
  {
--- a/langchain/init.py
+++ b/langchain/init.py
@@ -31,6 +31,7 @@ from langchain.llms import (
    ForefrontAI,
    GooseAI,
    HuggingFaceHub,
+    LlamaCpp,
    Modal,
    OpenAI,
    Petals,
@@ -110,4 +111,5 @@ __all__ = [
    "PALChain",
    "set_handler",
    "set_tracing_callback_manager",
+    "LlamaCpp",
 ]
--- a/langchain/agents/init.py
+++ b/langchain/agents/init.py
@@ -15,6 +15,7 @@ from langchain.agents.agent_toolkits import (
    create_vectorstore_agent,
    create_vectorstore_router_agent,
 )
+from langchain.agents.agent_types import AgentType
 from langchain.agents.conversational.base import ConversationalAgent
 from langchain.agents.conversational_chat.base import ConversationalChatAgent
 from langchain.agents.initialize import initialize_agent
@@ -51,4 +52,5 @@ __all__ = [
    "LLMSingleActionAgent",
    "AgentOutputParser",
    "BaseSingleActionAgent",
+    "AgentType",
 ]
--- a/langchain/agents/agent.py
+++ b/langchain/agents/agent.py
@@ -15,11 +15,16 @@ from langchain.callbacks.base import BaseCallbackManager
 from langchain.chains.base import Chain
 from langchain.chains.llm import LLMChain
 from langchain.input import get_color_mapping
-from langchain.llms.base import BaseLLM
 from langchain.prompts.base import BasePromptTemplate
 from langchain.prompts.few_shot import FewShotPromptTemplate
 from langchain.prompts.prompt import PromptTemplate
-from langchain.schema import AgentAction, AgentFinish, BaseMessage, BaseOutputParser
+from langchain.schema import (
+    AgentAction,
+    AgentFinish,
+    BaseLanguageModel,
+    BaseMessage,
+    BaseOutputParser,
+)
 from langchain.tools.base import BaseTool

 logger = logging.getLogger()
@@ -365,7 +370,7 @@ class Agent(BaseSingleActionAgent):
    @classmethod
    def from_llm_and_tools(
        cls,
-        llm: BaseLLM,
+        llm: BaseLanguageModel,
        tools: Sequence[BaseTool],
        callback_manager: Optional[BaseCallbackManager] = None,
        **kwargs: Any,
--- a/langchain/agents/agent_toolkits/openapi/toolkit.py
+++ b/langchain/agents/agent_toolkits/openapi/toolkit.py
@@ -10,7 +10,7 @@ from langchain.agents.agent_toolkits.json.toolkit import JsonToolkit
 from langchain.agents.agent_toolkits.openapi.prompt import DESCRIPTION
 from langchain.agents.tools import Tool
 from langchain.llms.base import BaseLLM
-from langchain.requests import RequestsWrapper
+from langchain.requests import TextRequestsWrapper
 from langchain.tools import BaseTool
 from langchain.tools.json.tool import JsonSpec
 from langchain.tools.requests.tool import (
@@ -25,7 +25,7 @@ from langchain.tools.requests.tool import (
 class RequestsToolkit(BaseToolkit):
    """Toolkit for making requests."""

-    requests_wrapper: RequestsWrapper
+    requests_wrapper: TextRequestsWrapper

    def get_tools(self) -> List[BaseTool]:
        """Return a list of tools."""
@@ -42,7 +42,7 @@ class OpenAPIToolkit(BaseToolkit):
    """Toolkit for interacting with a OpenAPI api."""

    json_agent: AgentExecutor
-    requests_wrapper: RequestsWrapper
+    requests_wrapper: TextRequestsWrapper

    def get_tools(self) -> List[BaseTool]:
        """Get the tools in the toolkit."""
@@ -59,7 +59,7 @@ class OpenAPIToolkit(BaseToolkit):
        cls,
        llm: BaseLLM,
        json_spec: JsonSpec,
-        requests_wrapper: RequestsWrapper,
+        requests_wrapper: TextRequestsWrapper,
        **kwargs: Any,
    ) -> OpenAPIToolkit:
        """Create json agent from llm, then initialize."""
--- a/langchain/agents/agent_toolkits/pandas/base.py
+++ b/langchain/agents/agent_toolkits/pandas/base.py
@@ -18,6 +18,9 @@ def create_pandas_dataframe_agent(
    suffix: str = SUFFIX,
    input_variables: Optional[List[str]] = None,
    verbose: bool = False,
+    return_intermediate_steps: bool = False,
+    max_iterations: Optional[int] = 15,
+    early_stopping_method: str = "force",
    **kwargs: Any,
 ) -> AgentExecutor:
    """Construct a pandas agent from an LLM and dataframe."""
@@ -39,4 +42,11 @@ def create_pandas_dataframe_agent(
    )
    tool_names = [tool.name for tool in tools]
    agent = ZeroShotAgent(llm_chain=llm_chain, allowed_tools=tool_names, **kwargs)
-    return AgentExecutor.from_agent_and_tools(agent=agent, tools=tools, verbose=verbose)
+    return AgentExecutor.from_agent_and_tools(
+        agent=agent,
+        tools=tools,
+        verbose=verbose,
+        return_intermediate_steps=return_intermediate_steps,
+        max_iterations=max_iterations,
+        early_stopping_method=early_stopping_method,
+    )
--- a/langchain/agents/agent_toolkits/sql/base.py
+++ b/langchain/agents/agent_toolkits/sql/base.py
@@ -20,6 +20,8 @@ def create_sql_agent(
    format_instructions: str = FORMAT_INSTRUCTIONS,
    input_variables: Optional[List[str]] = None,
    top_k: int = 10,
+    max_iterations: Optional[int] = 15,
+    early_stopping_method: str = "force",
    verbose: bool = False,
    **kwargs: Any,
 ) -> AgentExecutor:
@@ -41,5 +43,9 @@ def create_sql_agent(
    tool_names = [tool.name for tool in tools]
    agent = ZeroShotAgent(llm_chain=llm_chain, allowed_tools=tool_names, **kwargs)
    return AgentExecutor.from_agent_and_tools(
-        agent=agent, tools=toolkit.get_tools(), verbose=verbose
+        agent=agent,
+        tools=toolkit.get_tools(),
+        verbose=verbose,
+        max_iterations=max_iterations,
+        early_stopping_method=early_stopping_method,
    )
--- a/langchain/agents/agent_types.py
+++ b/langchain/agents/agent_types.py
@@ -0,0 +1,10 @@
+from enum import Enum
+
+
+class AgentType(str, Enum):
+    ZERO_SHOT_REACT_DESCRIPTION = "zero-shot-react-description"
+    REACT_DOCSTORE = "react-docstore"
+    SELF_ASK_WITH_SEARCH = "self-ask-with-search"
+    CONVERSATIONAL_REACT_DESCRIPTION = "conversational-react-description"
+    CHAT_ZERO_SHOT_REACT_DESCRIPTION = "chat-zero-shot-react-description"
+    CHAT_CONVERSATIONAL_REACT_DESCRIPTION = "chat-conversational-react-description"
--- a/langchain/agents/conversational/base.py
+++ b/langchain/agents/conversational/base.py
@@ -5,11 +5,12 @@ import re
 from typing import Any, List, Optional, Sequence, Tuple

 from langchain.agents.agent import Agent
+from langchain.agents.agent_types import AgentType
 from langchain.agents.conversational.prompt import FORMAT_INSTRUCTIONS, PREFIX, SUFFIX
 from langchain.callbacks.base import BaseCallbackManager
 from langchain.chains import LLMChain
-from langchain.llms import BaseLLM
 from langchain.prompts import PromptTemplate
+from langchain.schema import BaseLanguageModel
 from langchain.tools.base import BaseTool


@@ -21,7 +22,7 @@ class ConversationalAgent(Agent):
    @property
    def _agent_type(self) -> str:
        """Return Identifier of agent type."""
-        return "conversational-react-description"
+        return AgentType.CONVERSATIONAL_REACT_DESCRIPTION

    @property
    def observation_prefix(self) -> str:
@@ -89,7 +90,7 @@ class ConversationalAgent(Agent):
    @classmethod
    def from_llm_and_tools(
        cls,
-        llm: BaseLLM,
+        llm: BaseLanguageModel,
        tools: Sequence[BaseTool],
        callback_manager: Optional[BaseCallbackManager] = None,
        prefix: str = PREFIX,
--- a/langchain/agents/initialize.py
+++ b/langchain/agents/initialize.py
@@ -2,16 +2,17 @@
 from typing import Any, Optional, Sequence

 from langchain.agents.agent import AgentExecutor
+from langchain.agents.agent_types import AgentType
 from langchain.agents.loading import AGENT_TO_CLASS, load_agent
 from langchain.callbacks.base import BaseCallbackManager
-from langchain.llms.base import BaseLLM
+from langchain.schema import BaseLanguageModel
 from langchain.tools.base import BaseTool


 def initialize_agent(
    tools: Sequence[BaseTool],
-    llm: BaseLLM,
-    agent: Optional[str] = None,
+    llm: BaseLanguageModel,
+    agent: Optional[AgentType] = None,
    callback_manager: Optional[BaseCallbackManager] = None,
    agent_path: Optional[str] = None,
    agent_kwargs: Optional[dict] = None,
@@ -22,15 +23,8 @@ def initialize_agent(
    Args:
        tools: List of tools this agent has access to.
        llm: Language model to use as the agent.
-        agent: A string that specified the agent type to use. Valid options are:
-            `zero-shot-react-description`
-            `react-docstore`
-            `self-ask-with-search`
-            `conversational-react-description`
-            `chat-zero-shot-react-description`,
-            `chat-conversational-react-description`,
-           If None and agent_path is also None, will default to
-            `zero-shot-react-description`.
+        agent: Agent type to use. If None and agent_path is also None, will default to
+            AgentType.ZERO_SHOT_REACT_DESCRIPTION.
        callback_manager: CallbackManager to use. Global callback manager is used if
            not provided. Defaults to None.
        agent_path: Path to serialized agent to use.
@@ -41,7 +35,7 @@ def initialize_agent(
        An agent executor
    """
    if agent is None and agent_path is None:
-        agent = "zero-shot-react-description"
+        agent = AgentType.ZERO_SHOT_REACT_DESCRIPTION
    if agent is not None and agent_path is not None:
        raise ValueError(
            "Both `agent` and `agent_path` are specified, "
--- a/langchain/agents/load_tools.py
+++ b/langchain/agents/load_tools.py
@@ -1,6 +1,7 @@
 # flake8: noqa
 """Load tools."""
 from typing import Any, List, Optional
+import warnings

 from langchain.agents.tools import Tool
 from langchain.callbacks.base import BaseCallbackManager
@@ -9,14 +10,20 @@ from langchain.chains.api.base import APIChain
 from langchain.chains.llm_math.base import LLMMathChain
 from langchain.chains.pal.base import PALChain
 from langchain.llms.base import BaseLLM
-from langchain.requests import RequestsWrapper
+from langchain.requests import TextRequestsWrapper
 from langchain.tools.base import BaseTool
 from langchain.tools.bing_search.tool import BingSearchRun
 from langchain.tools.google_search.tool import GoogleSearchResults, GoogleSearchRun
 from langchain.tools.searx_search.tool import SearxSearchResults, SearxSearchRun
 from langchain.tools.human.tool import HumanInputRun
 from langchain.tools.python.tool import PythonREPLTool
-from langchain.tools.requests.tool import RequestsGetTool
+from langchain.tools.requests.tool import (
+    RequestsGetTool,
+    RequestsPostTool,
+    RequestsPatchTool,
+    RequestsPutTool,
+    RequestsDeleteTool,
+)
 from langchain.tools.wikipedia.tool import WikipediaQueryRun
 from langchain.tools.wolfram_alpha.tool import WolframAlphaQueryRun
 from langchain.utilities.apify import ApifyWrapper
@@ -34,8 +41,24 @@ def _get_python_repl() -> BaseTool:
    return PythonREPLTool()


-def _get_requests() -> BaseTool:
-    return RequestsGetTool(requests_wrapper=RequestsWrapper())
+def _get_tools_requests_get() -> BaseTool:
+    return RequestsGetTool(requests_wrapper=TextRequestsWrapper())
+
+
+def _get_tools_requests_post() -> BaseTool:
+    return RequestsPostTool(requests_wrapper=TextRequestsWrapper())
+
+
+def _get_tools_requests_patch() -> BaseTool:
+    return RequestsPatchTool(requests_wrapper=TextRequestsWrapper())
+
+
+def _get_tools_requests_put() -> BaseTool:
+    return RequestsPutTool(requests_wrapper=TextRequestsWrapper())
+
+
+def _get_tools_requests_delete() -> BaseTool:
+    return RequestsDeleteTool(requests_wrapper=TextRequestsWrapper())


 def _get_terminal() -> BaseTool:
@@ -48,7 +71,12 @@ def _get_terminal() -> BaseTool:

 _BASE_TOOLS = {
    "python_repl": _get_python_repl,
-    "requests": _get_requests,
+    "requests": _get_tools_requests_get,  # preserved for backwards compatability
+    "requests_get": _get_tools_requests_get,
+    "requests_post": _get_tools_requests_post,
+    "requests_patch": _get_tools_requests_patch,
+    "requests_put": _get_tools_requests_put,
+    "requests_delete": _get_tools_requests_delete,
    "terminal": _get_terminal,
 }

@@ -228,8 +256,21 @@ def load_tools(
        List of tools.
    """
    tools = []
+
    for name in tool_names:
-        if name in _BASE_TOOLS:
+        if name == "requests":
+            warnings.warn(
+                "tool name `requests` is deprecated - "
+                "please use `requests_all` or specify the requests method"
+            )
+
+        if name == "requests_all":
+            # expand requests into various methods
+            requests_method_tools = [
+                _tool for _tool in _BASE_TOOLS if _tool.startswith("requests_")
+            ]
+            tool_names.extend(requests_method_tools)
+        elif name in _BASE_TOOLS:
            tools.append(_BASE_TOOLS[name]())
        elif name in _LLM_TOOLS:
            if llm is None:
--- a/langchain/agents/loading.py
+++ b/langchain/agents/loading.py
@@ -6,6 +6,7 @@ from typing import Any, List, Optional, Union
 import yaml

 from langchain.agents.agent import Agent
+from langchain.agents.agent_types import AgentType
 from langchain.agents.chat.base import ChatAgent
 from langchain.agents.conversational.base import ConversationalAgent
 from langchain.agents.conversational_chat.base import ConversationalChatAgent
@@ -18,12 +19,12 @@ from langchain.llms.base import BaseLLM
 from langchain.utilities.loading import try_load_from_hub

 AGENT_TO_CLASS = {
-    "zero-shot-react-description": ZeroShotAgent,
-    "react-docstore": ReActDocstoreAgent,
-    "self-ask-with-search": SelfAskWithSearchAgent,
-    "conversational-react-description": ConversationalAgent,
-    "chat-zero-shot-react-description": ChatAgent,
-    "chat-conversational-react-description": ConversationalChatAgent,
+    AgentType.ZERO_SHOT_REACT_DESCRIPTION: ZeroShotAgent,
+    AgentType.REACT_DOCSTORE: ReActDocstoreAgent,
+    AgentType.SELF_ASK_WITH_SEARCH: SelfAskWithSearchAgent,
+    AgentType.CONVERSATIONAL_REACT_DESCRIPTION: ConversationalAgent,
+    AgentType.CHAT_ZERO_SHOT_REACT_DESCRIPTION: ChatAgent,
+    AgentType.CHAT_CONVERSATIONAL_REACT_DESCRIPTION: ConversationalChatAgent,
 }

 URL_BASE = "https://raw.githubusercontent.com/hwchase17/langchain-hub/master/agents/"
--- a/langchain/agents/mrkl/base.py
+++ b/langchain/agents/mrkl/base.py
@@ -5,12 +5,13 @@ import re
 from typing import Any, Callable, List, NamedTuple, Optional, Sequence, Tuple

 from langchain.agents.agent import Agent, AgentExecutor
+from langchain.agents.agent_types import AgentType
 from langchain.agents.mrkl.prompt import FORMAT_INSTRUCTIONS, PREFIX, SUFFIX
 from langchain.agents.tools import Tool
 from langchain.callbacks.base import BaseCallbackManager
 from langchain.chains import LLMChain
-from langchain.llms.base import BaseLLM
 from langchain.prompts import PromptTemplate
+from langchain.schema import BaseLanguageModel
 from langchain.tools.base import BaseTool

 FINAL_ANSWER_ACTION = "Final Answer:"
@@ -56,7 +57,7 @@ class ZeroShotAgent(Agent):
    @property
    def _agent_type(self) -> str:
        """Return Identifier of agent type."""
-        return "zero-shot-react-description"
+        return AgentType.ZERO_SHOT_REACT_DESCRIPTION

    @property
    def observation_prefix(self) -> str:
@@ -100,7 +101,7 @@ class ZeroShotAgent(Agent):
    @classmethod
    def from_llm_and_tools(
        cls,
-        llm: BaseLLM,
+        llm: BaseLanguageModel,
        tools: Sequence[BaseTool],
        callback_manager: Optional[BaseCallbackManager] = None,
        prefix: str = PREFIX,
@@ -155,7 +156,7 @@ class MRKLChain(AgentExecutor):

    @classmethod
    def from_chains(
-        cls, llm: BaseLLM, chains: List[ChainConfig], **kwargs: Any
+        cls, llm: BaseLanguageModel, chains: List[ChainConfig], **kwargs: Any
    ) -> AgentExecutor:
        """User friendly way to initialize the MRKL chain.

--- a/langchain/agents/react/base.py
+++ b/langchain/agents/react/base.py
@@ -5,6 +5,7 @@ from typing import Any, List, Optional, Sequence, Tuple
 from pydantic import BaseModel

 from langchain.agents.agent import Agent, AgentExecutor
+from langchain.agents.agent_types import AgentType
 from langchain.agents.react.textworld_prompt import TEXTWORLD_PROMPT
 from langchain.agents.react.wiki_prompt import WIKI_PROMPT
 from langchain.agents.tools import Tool
@@ -21,7 +22,7 @@ class ReActDocstoreAgent(Agent, BaseModel):
    @property
    def _agent_type(self) -> str:
        """Return Identifier of agent type."""
-        return "react-docstore"
+        return AgentType.REACT_DOCSTORE

    @classmethod
    def create_prompt(cls, tools: Sequence[BaseTool]) -> BasePromptTemplate:
--- a/langchain/agents/self_ask_with_search/base.py
+++ b/langchain/agents/self_ask_with_search/base.py
@@ -2,6 +2,7 @@
 from typing import Any, Optional, Sequence, Tuple, Union

 from langchain.agents.agent import Agent, AgentExecutor
+from langchain.agents.agent_types import AgentType
 from langchain.agents.self_ask_with_search.prompt import PROMPT
 from langchain.agents.tools import Tool
 from langchain.llms.base import BaseLLM
@@ -17,7 +18,7 @@ class SelfAskWithSearchAgent(Agent):
    @property
    def _agent_type(self) -> str:
        """Return Identifier of agent type."""
-        return "self-ask-with-search"
+        return AgentType.SELF_ASK_WITH_SEARCH

    @classmethod
    def create_prompt(cls, tools: Sequence[BaseTool]) -> BasePromptTemplate:
--- a/langchain/chains/api/base.py
+++ b/langchain/chains/api/base.py
@@ -9,7 +9,7 @@ from langchain.chains.api.prompt import API_RESPONSE_PROMPT, API_URL_PROMPT
 from langchain.chains.base import Chain
 from langchain.chains.llm import LLMChain
 from langchain.prompts import BasePromptTemplate
-from langchain.requests import RequestsWrapper
+from langchain.requests import TextRequestsWrapper
 from langchain.schema import BaseLanguageModel


@@ -18,7 +18,7 @@ class APIChain(Chain, BaseModel):

    api_request_chain: LLMChain
    api_answer_chain: LLMChain
-    requests_wrapper: RequestsWrapper = Field(exclude=True)
+    requests_wrapper: TextRequestsWrapper = Field(exclude=True)
    api_docs: str
    question_key: str = "question"  #: :meta private:
    output_key: str = "output"  #: :meta private:
@@ -93,7 +93,7 @@ class APIChain(Chain, BaseModel):
    ) -> APIChain:
        """Load chain from just an LLM and the api docs."""
        get_request_chain = LLMChain(llm=llm, prompt=api_url_prompt)
-        requests_wrapper = RequestsWrapper(headers=headers)
+        requests_wrapper = TextRequestsWrapper(headers=headers)
        get_answer_chain = LLMChain(llm=llm, prompt=api_response_prompt)
        return cls(
            api_request_chain=get_request_chain,
--- a/langchain/chains/llm_requests.py
+++ b/langchain/chains/llm_requests.py
@@ -7,7 +7,7 @@ from pydantic import BaseModel, Extra, Field, root_validator

 from langchain.chains import LLMChain
 from langchain.chains.base import Chain
-from langchain.requests import RequestsWrapper
+from langchain.requests import TextRequestsWrapper

 DEFAULT_HEADERS = {
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/87.0.4280.88 Safari/537.36"  # noqa: E501
@@ -18,8 +18,8 @@ class LLMRequestsChain(Chain, BaseModel):
    """Chain that hits a URL and then uses an LLM to parse results."""

    llm_chain: LLMChain
-    requests_wrapper: RequestsWrapper = Field(
-        default_factory=RequestsWrapper, exclude=True
+    requests_wrapper: TextRequestsWrapper = Field(
+        default_factory=TextRequestsWrapper, exclude=True
    )
    text_length: int = 8000
    requests_key: str = "requests_result"  #: :meta private:
--- a/langchain/chat_models/base.py
+++ b/langchain/chat_models/base.py
@@ -6,11 +6,7 @@ from pydantic import BaseModel, Extra, Field, validator

 import langchain
 from langchain.callbacks import get_callback_manager
-from langchain.callbacks.base import (
-    BaseCallbackHandler,
-    BaseCallbackManager,
-    CallbackManager,
-)
+from langchain.callbacks.base import BaseCallbackManager
 from langchain.schema import (
    AIMessage,
    BaseLanguageModel,
@@ -20,7 +16,6 @@ from langchain.schema import (
    HumanMessage,
    LLMResult,
    PromptValue,
-    SystemMessage,
 )


@@ -136,53 +131,6 @@ class BaseChatModel(BaseLanguageModel, BaseModel, ABC):
        result = self([HumanMessage(content=message)], stop=stop)
        return result.content

-    async def as_poe_handler(self):
-        model = self
-
-        class LLMChainPoeHandler(PoeHandler):
-            async def get_response(self, query):
-                callback_handler = PoeCallbackHandler()
-                callback_manager = CallbackManager([callback_handler])
-                model.callback_manager = callback_manager
-
-                run = asyncio.create_task(
-                    model([poe_msg_to_lc_msg(msg) for msg in query.query])
-                )
-
-                while not callback_handler.done.is_set():
-                    token = await callback_handler.queue.get()
-                    yield token
-
-                await run
-
-        return LLMChainPoeHandler()
-
-
-class PoeCallbackHandler(BaseCallbackHandler):
-    def __init__(self):
-        self.queue = asyncio.Queue()
-        self.done = asyncio.Event()
-
-    def on_llm_start(self, serialized: Dict[str, Any], prompts: List[str]):
-        pass
-
-    def on_llm_new_token(self, token: str):
-        self.queue.put_nowait(token)
-
-    def on_llm_end(self, serialized: Dict[str, Any], prompts: List[str]):
-        self.done.set()
-
-
-def poe_msg_to_lc_msg(msg: ProtocolMessage) -> BaseMessage:
-    if msg.type == "human":
-        return HumanMessage(content=msg.text)
-    elif msg.type == "bot" or msg.type == "assistant":
-        return AIMessage(content=msg.text)
-    elif msg.type == "system":
-        return SystemMessage(content=msg.text)
-    else:
-        raise ValueError(f"Unknown message type: {msg.type}")
-

 class SimpleChatModel(BaseChatModel):
    def _generate(
--- a/langchain/document_loaders/init.py
+++ b/langchain/document_loaders/init.py
@@ -11,13 +11,18 @@ from langchain.document_loaders.azure_blob_storage_file import (
 )
 from langchain.document_loaders.bigquery import BigQueryLoader
 from langchain.document_loaders.blackboard import BlackboardLoader
-from langchain.document_loaders.college_confidential import CollegeConfidentialLoader
+from langchain.document_loaders.college_confidential import (
+    CollegeConfidentialLoader,
+)
 from langchain.document_loaders.conllu import CoNLLULoader
 from langchain.document_loaders.csv_loader import CSVLoader
 from langchain.document_loaders.dataframe import DataFrameLoader
 from langchain.document_loaders.directory import DirectoryLoader
 from langchain.document_loaders.duckdb_loader import DuckDBLoader
-from langchain.document_loaders.email import UnstructuredEmailLoader
+from langchain.document_loaders.email import (
+    OutlookMessageLoader,
+    UnstructuredEmailLoader,
+)
 from langchain.document_loaders.epub import UnstructuredEPubLoader
 from langchain.document_loaders.evernote import EverNoteLoader
 from langchain.document_loaders.facebook_chat import FacebookChatLoader
@@ -58,9 +63,12 @@ from langchain.document_loaders.unstructured import (
    UnstructuredFileLoader,
 )
 from langchain.document_loaders.url import UnstructuredURLLoader
+from langchain.document_loaders.url_selenium import SeleniumURLLoader
 from langchain.document_loaders.web_base import WebBaseLoader
 from langchain.document_loaders.whatsapp_chat import WhatsAppChatLoader
-from langchain.document_loaders.word_document import UnstructuredWordDocumentLoader
+from langchain.document_loaders.word_document import (
+    UnstructuredWordDocumentLoader,
+)
 from langchain.document_loaders.youtube import (
    GoogleApiClient,
    GoogleApiYoutubeLoader,
@@ -74,6 +82,7 @@ __all__ = [
    "UnstructuredFileLoader",
    "UnstructuredFileIOLoader",
    "UnstructuredURLLoader",
+    "SeleniumURLLoader",
    "DirectoryLoader",
    "NotionDirectoryLoader",
    "NotionDBLoader",
@@ -87,6 +96,7 @@ __all__ = [
    "UnstructuredImageLoader",
    "ObsidianLoader",
    "UnstructuredEmailLoader",
+    "OutlookMessageLoader",
    "UnstructuredEPubLoader",
    "UnstructuredMarkdownLoader",
    "RoamLoader",
--- a/langchain/document_loaders/email.py
+++ b/langchain/document_loaders/email.py
@@ -1,6 +1,9 @@
 """Loader that loads email files."""
+import os
 from typing import List

+from langchain.docstore.document import Document
+from langchain.document_loaders.base import BaseLoader
 from langchain.document_loaders.unstructured import UnstructuredFileLoader


@@ -11,3 +14,42 @@ class UnstructuredEmailLoader(UnstructuredFileLoader):
        from unstructured.partition.email import partition_email

        return partition_email(filename=self.file_path)
+
+
+class OutlookMessageLoader(BaseLoader):
+    """
+    Loader that loads Outlook Message files using extract_msg.
+    https://github.com/TeamMsgExtractor/msg-extractor
+    """
+
+    def __init__(self, file_path: str):
+        """Initialize with file path."""
+
+        self.file_path = file_path
+
+        if not os.path.isfile(self.file_path):
+            raise ValueError("File path %s is not a valid file" % self.file_path)
+
+        try:
+            import extract_msg  # noqa:F401
+        except ImportError:
+            raise ImportError(
+                "extract_msg is not installed. Please install it with "
+                "`pip install extract_msg`"
+            )
+
+    def load(self) -> List[Document]:
+        """Load data into document objects."""
+        import extract_msg
+
+        msg = extract_msg.Message(self.file_path)
+        return [
+            Document(
+                page_content=msg.body,
+                metadata={
+                    "subject": msg.subject,
+                    "sender": msg.sender,
+                    "date": msg.date,
+                },
+            )
+        ]
--- a/langchain/document_loaders/url_selenium.py
+++ b/langchain/document_loaders/url_selenium.py
@@ -0,0 +1,118 @@
+"""Loader that uses Selenium to load a page, then uses unstructured to load the html.
+"""
+import logging
+from typing import TYPE_CHECKING, List, Literal, Optional, Union
+
+if TYPE_CHECKING:
+    from selenium.webdriver import Chrome, Firefox
+
+from langchain.docstore.document import Document
+from langchain.document_loaders.base import BaseLoader
+
+logger = logging.getLogger(__file__)
+
+
+class SeleniumURLLoader(BaseLoader):
+    """Loader that uses Selenium and to load a page and unstructured to load the html.
+    This is useful for loading pages that require javascript to render.
+
+    Attributes:
+        urls (List[str]): List of URLs to load.
+        continue_on_failure (bool): If True, continue loading other URLs on failure.
+        browser (str): The browser to use, either 'chrome' or 'firefox'.
+        executable_path (Optional[str]): The path to the browser executable.
+        headless (bool): If True, the browser will run in headless mode.
+    """
+
+    def __init__(
+        self,
+        urls: List[str],
+        continue_on_failure: bool = True,
+        browser: Literal["chrome", "firefox"] = "chrome",
+        executable_path: Optional[str] = None,
+        headless: bool = True,
+    ):
+        """Load a list of URLs using Selenium and unstructured."""
+        try:
+            import selenium  # noqa:F401
+        except ImportError:
+            raise ValueError(
+                "selenium package not found, please install it with "
+                "`pip install selenium`"
+            )
+
+        try:
+            import unstructured  # noqa:F401
+        except ImportError:
+            raise ValueError(
+                "unstructured package not found, please install it with "
+                "`pip install unstructured`"
+            )
+
+        self.urls = urls
+        self.continue_on_failure = continue_on_failure
+        self.browser = browser
+        self.executable_path = executable_path
+        self.headless = headless
+
+    def _get_driver(self) -> Union["Chrome", "Firefox"]:
+        """Create and return a WebDriver instance based on the specified browser.
+
+        Raises:
+            ValueError: If an invalid browser is specified.
+
+        Returns:
+            Union[Chrome, Firefox]: A WebDriver instance for the specified browser.
+        """
+        if self.browser.lower() == "chrome":
+            from selenium.webdriver import Chrome
+            from selenium.webdriver.chrome.options import Options as ChromeOptions
+
+            chrome_options = ChromeOptions()
+            if self.headless:
+                chrome_options.add_argument("--headless")
+            if self.executable_path is None:
+                return Chrome(options=chrome_options)
+            return Chrome(executable_path=self.executable_path, options=chrome_options)
+        elif self.browser.lower() == "firefox":
+            from selenium.webdriver import Firefox
+            from selenium.webdriver.firefox.options import Options as FirefoxOptions
+
+            firefox_options = FirefoxOptions()
+            if self.headless:
+                firefox_options.add_argument("--headless")
+            if self.executable_path is None:
+                return Firefox(options=firefox_options)
+            return Firefox(
+                executable_path=self.executable_path, options=firefox_options
+            )
+        else:
+            raise ValueError("Invalid browser specified. Use 'chrome' or 'firefox'.")
+
+    def load(self) -> List[Document]:
+        """Load the specified URLs using Selenium and create Document instances.
+
+        Returns:
+            List[Document]: A list of Document instances with loaded content.
+        """
+        from unstructured.partition.html import partition_html
+
+        docs: List[Document] = list()
+        driver = self._get_driver()
+
+        for url in self.urls:
+            try:
+                driver.get(url)
+                page_content = driver.page_source
+                elements = partition_html(text=page_content)
+                text = "\n\n".join([str(el) for el in elements])
+                metadata = {"source": url}
+                docs.append(Document(page_content=text, metadata=metadata))
+            except Exception as e:
+                if self.continue_on_failure:
+                    logger.error(f"Error fetching or processing {url}, exception: {e}")
+                else:
+                    raise e
+
+        driver.quit()
+        return docs
--- a/langchain/embeddings/init.py
+++ b/langchain/embeddings/init.py
@@ -14,6 +14,7 @@ from langchain.embeddings.huggingface import (
 )
 from langchain.embeddings.huggingface_hub import HuggingFaceHubEmbeddings
 from langchain.embeddings.jina import JinaEmbeddings
+from langchain.embeddings.llamacpp import LlamaCppEmbeddings
 from langchain.embeddings.openai import OpenAIEmbeddings
 from langchain.embeddings.sagemaker_endpoint import SagemakerEndpointEmbeddings
 from langchain.embeddings.self_hosted import SelfHostedEmbeddings
@@ -30,6 +31,7 @@ __all__ = [
    "HuggingFaceEmbeddings",
    "CohereEmbeddings",
    "JinaEmbeddings",
+    "LlamaCppEmbeddings",
    "HuggingFaceHubEmbeddings",
    "TensorflowHubEmbeddings",
    "SagemakerEndpointEmbeddings",
--- a/langchain/embeddings/llamacpp.py
+++ b/langchain/embeddings/llamacpp.py
@@ -0,0 +1,118 @@
+"""Wrapper around llama.cpp embedding models."""
+from typing import Any, Dict, List, Optional
+
+from pydantic import BaseModel, Extra, Field, root_validator
+
+from langchain.embeddings.base import Embeddings
+
+
+class LlamaCppEmbeddings(BaseModel, Embeddings):
+    """Wrapper around llama.cpp embedding models.
+
+    To use, you should have the llama-cpp-python library installed, and provide the
+    path to the Llama model as a named parameter to the constructor.
+    Check out: https://github.com/abetlen/llama-cpp-python
+
+    Example:
+        .. code-block:: python
+
+            from langchain.embeddings import LlamaCppEmbeddings
+            llama = LlamaCppEmbeddings(model_path="/path/to/model.bin")
+    """
+
+    client: Any  #: :meta private:
+    model_path: str
+
+    n_ctx: int = Field(512, alias="n_ctx")
+    """Token context window."""
+
+    n_parts: int = Field(-1, alias="n_parts")
+    """Number of parts to split the model into. 
+    If -1, the number of parts is automatically determined."""
+
+    seed: int = Field(-1, alias="seed")
+    """Seed. If -1, a random seed is used."""
+
+    f16_kv: bool = Field(False, alias="f16_kv")
+    """Use half-precision for key/value cache."""
+
+    logits_all: bool = Field(False, alias="logits_all")
+    """Return logits for all tokens, not just the last token."""
+
+    vocab_only: bool = Field(False, alias="vocab_only")
+    """Only load the vocabulary, no weights."""
+
+    use_mlock: bool = Field(False, alias="use_mlock")
+    """Force system to keep model in RAM."""
+
+    n_threads: Optional[int] = Field(None, alias="n_threads")
+    """Number of threads to use. If None, the number 
+    of threads is automatically determined."""
+
+    class Config:
+        """Configuration for this pydantic object."""
+
+        extra = Extra.forbid
+
+    @root_validator()
+    def validate_environment(cls, values: Dict) -> Dict:
+        """Validate that llama-cpp-python library is installed."""
+        model_path = values["model_path"]
+        n_ctx = values["n_ctx"]
+        n_parts = values["n_parts"]
+        seed = values["seed"]
+        f16_kv = values["f16_kv"]
+        logits_all = values["logits_all"]
+        vocab_only = values["vocab_only"]
+        use_mlock = values["use_mlock"]
+        n_threads = values["n_threads"]
+
+        try:
+            from llama_cpp import Llama
+
+            values["client"] = Llama(
+                model_path=model_path,
+                n_ctx=n_ctx,
+                n_parts=n_parts,
+                seed=seed,
+                f16_kv=f16_kv,
+                logits_all=logits_all,
+                vocab_only=vocab_only,
+                use_mlock=use_mlock,
+                n_threads=n_threads,
+                embedding=True,
+            )
+        except ImportError:
+            raise ModuleNotFoundError(
+                "Could not import llama-cpp-python library. "
+                "Please install the llama-cpp-python library to "
+                "use this embedding model: pip install llama-cpp-python"
+            )
+        except Exception:
+            raise NameError(f"Could not load Llama model from path: {model_path}")
+
+        return values
+
+    def embed_documents(self, texts: List[str]) -> List[List[float]]:
+        """Embed a list of documents using the Llama model.
+
+        Args:
+            texts: The list of texts to embed.
+
+        Returns:
+            List of embeddings, one for each text.
+        """
+        embeddings = [self.client.embed(text) for text in texts]
+        return [list(map(float, e)) for e in embeddings]
+
+    def embed_query(self, text: str) -> List[float]:
+        """Embed a query using the Llama model.
+
+        Args:
+            text: The text to embed.
+
+        Returns:
+            Embeddings for the text.
+        """
+        embedding = self.client.embed(text)
+        return list(map(float, embedding))
--- a/langchain/llms/init.py
+++ b/langchain/llms/init.py
@@ -11,9 +11,11 @@ from langchain.llms.cohere import Cohere
 from langchain.llms.deepinfra import DeepInfra
 from langchain.llms.forefrontai import ForefrontAI
 from langchain.llms.gooseai import GooseAI
+from langchain.llms.gpt4all import GPT4All
 from langchain.llms.huggingface_endpoint import HuggingFaceEndpoint
 from langchain.llms.huggingface_hub import HuggingFaceHub
 from langchain.llms.huggingface_pipeline import HuggingFacePipeline
+from langchain.llms.llamacpp import LlamaCpp
 from langchain.llms.modal import Modal
 from langchain.llms.nlpcloud import NLPCloud
 from langchain.llms.openai import AzureOpenAI, OpenAI, OpenAIChat
@@ -35,6 +37,8 @@ __all__ = [
    "DeepInfra",
    "ForefrontAI",
    "GooseAI",
+    "GPT4All",
+    "LlamaCpp",
    "Modal",
    "NLPCloud",
    "OpenAI",
@@ -65,8 +69,10 @@ type_to_cls_dict: Dict[str, Type[BaseLLM]] = {
    "deepinfra": DeepInfra,
    "forefrontai": ForefrontAI,
    "gooseai": GooseAI,
+    "gpt4all": GPT4All,
    "huggingface_hub": HuggingFaceHub,
    "huggingface_endpoint": HuggingFaceEndpoint,
+    "llamacpp": LlamaCpp,
    "modal": Modal,
    "sagemaker_endpoint": SagemakerEndpoint,
    "nlpcloud": NLPCloud,
--- a/langchain/llms/gpt4all.py
+++ b/langchain/llms/gpt4all.py
@@ -0,0 +1,183 @@
+"""Wrapper for the GPT4All model."""
+from typing import Any, Dict, List, Mapping, Optional, Set
+
+from pydantic import BaseModel, Extra, Field, root_validator
+
+from langchain.llms.base import LLM
+from langchain.llms.utils import enforce_stop_tokens
+
+
+class GPT4All(LLM, BaseModel):
+    r"""Wrapper around GPT4All language models.
+
+    To use, you should have the ``pyllamacpp`` python package installed, the
+    pre-trained model file, and the model's config information.
+
+    Example:
+        .. code-block:: python
+
+            from langchain.llms import GPT4All
+            model = GPT4All(model="./models/gpt4all-model.bin", n_ctx=512, n_threads=8)
+
+            # Simplest invocation
+            response = model("Once upon a time, ")
+    """
+
+    model: str
+    """Path to the pre-trained GPT4All model file."""
+
+    n_ctx: int = Field(512, alias="n_ctx")
+    """Token context window."""
+
+    n_parts: int = Field(-1, alias="n_parts")
+    """Number of parts to split the model into. 
+    If -1, the number of parts is automatically determined."""
+
+    seed: int = Field(0, alias="seed")
+    """Seed. If -1, a random seed is used."""
+
+    f16_kv: bool = Field(False, alias="f16_kv")
+    """Use half-precision for key/value cache."""
+
+    logits_all: bool = Field(False, alias="logits_all")
+    """Return logits for all tokens, not just the last token."""
+
+    vocab_only: bool = Field(False, alias="vocab_only")
+    """Only load the vocabulary, no weights."""
+
+    use_mlock: bool = Field(False, alias="use_mlock")
+    """Force system to keep model in RAM."""
+
+    embedding: bool = Field(False, alias="embedding")
+    """Use embedding mode only."""
+
+    n_threads: Optional[int] = Field(4, alias="n_threads")
+    """Number of threads to use."""
+
+    n_predict: Optional[int] = 256
+    """The maximum number of tokens to generate."""
+
+    temp: Optional[float] = 0.8
+    """The temperature to use for sampling."""
+
+    top_p: Optional[float] = 0.95
+    """The top-p value to use for sampling."""
+
+    top_k: Optional[int] = 40
+    """The top-k value to use for sampling."""
+
+    echo: Optional[bool] = False
+    """Whether to echo the prompt."""
+
+    stop: Optional[List[str]] = []
+    """A list of strings to stop generation when encountered."""
+
+    repeat_last_n: Optional[int] = 64
+    "Last n tokens to penalize"
+
+    repeat_penalty: Optional[float] = 1.3
+    """The penalty to apply to repeated tokens."""
+
+    n_batch: int = Field(1, alias="n_batch")
+    """Batch size for prompt processing."""
+
+    streaming: bool = False
+    """Whether to stream the results or not."""
+
+    client: Any = None  #: :meta private:
+
+    class Config:
+        """Configuration for this pydantic object."""
+
+        extra = Extra.forbid
+
+    @property
+    def _default_params(self) -> Dict[str, Any]:
+        """Get the identifying parameters."""
+        return {
+            "seed": self.seed,
+            "n_predict": self.n_predict,
+            "n_threads": self.n_threads,
+            "n_batch": self.n_batch,
+            "repeat_last_n": self.repeat_last_n,
+            "repeat_penalty": self.repeat_penalty,
+            "top_k": self.top_k,
+            "top_p": self.top_p,
+            "temp": self.temp,
+        }
+
+    @staticmethod
+    def _llama_param_names() -> Set[str]:
+        """Get the identifying parameters."""
+        return {
+            "seed",
+            "n_ctx",
+            "n_parts",
+            "f16_kv",
+            "logits_all",
+            "vocab_only",
+            "use_mlock",
+            "embedding",
+        }
+
+    @root_validator()
+    def validate_environment(cls, values: Dict) -> Dict:
+        """Validate that the python package exists in the environment."""
+        try:
+            from pyllamacpp.model import Model as GPT4AllModel
+
+            llama_keys = cls._llama_param_names()
+            model_kwargs = {k: v for k, v in values.items() if k in llama_keys}
+            values["client"] = GPT4AllModel(
+                ggml_model=values["model"],
+                **model_kwargs,
+            )
+
+        except ImportError:
+            raise ValueError(
+                "Could not import pyllamacpp python package. "
+                "Please install it with `pip install pyllamacpp`."
+            )
+        return values
+
+    @property
+    def _identifying_params(self) -> Mapping[str, Any]:
+        """Get the identifying parameters."""
+        return {
+            "model": self.model,
+            **self._default_params,
+            **{
+                k: v
+                for k, v in self.__dict__.items()
+                if k in GPT4All._llama_param_names()
+            },
+        }
+
+    @property
+    def _llm_type(self) -> str:
+        """Return the type of llm."""
+        return "gpt4all"
+
+    def _call(self, prompt: str, stop: Optional[List[str]] = None) -> str:
+        r"""Call out to GPT4All's generate method.
+
+        Args:
+            prompt: The prompt to pass into the model.
+            stop: A list of strings to stop generation when encountered.
+
+        Returns:
+            The string generated by the model.
+
+        Example:
+            .. code-block:: python
+
+                prompt = "Once upon a time, "
+                response = model(prompt, n_predict=55)
+        """
+        text = self.client.generate(
+            prompt,
+            **self._default_params,
+        )
+        if stop is not None:
+            text = enforce_stop_tokens(text, stop)
+        return text
--- a/langchain/llms/huggingface_pipeline.py
+++ b/langchain/llms/huggingface_pipeline.py
@@ -76,7 +76,7 @@ class HuggingFacePipeline(LLM, BaseModel):
        except ImportError:
            raise ValueError(
                "Could not import transformers python package. "
-                "Please it install it with `pip install transformers`."
+                "Please install it with `pip install transformers`."
            )

        _model_kwargs = model_kwargs or {}
--- a/langchain/llms/llamacpp.py
+++ b/langchain/llms/llamacpp.py
@@ -0,0 +1,184 @@
+"""Wrapper around llama.cpp."""
+import logging
+from typing import Any, Dict, List, Optional
+
+from pydantic import BaseModel, Field, root_validator
+
+from langchain.llms.base import LLM
+
+logger = logging.getLogger(__name__)
+
+
+class LlamaCpp(LLM, BaseModel):
+    """Wrapper around the llama.cpp model.
+
+    To use, you should have the llama-cpp-python library installed, and provide the
+    path to the Llama model as a named parameter to the constructor.
+    Check out: https://github.com/abetlen/llama-cpp-python
+
+    Example:
+        .. code-block:: python
+
+            from langchain.llms import LlamaCppEmbeddings
+            llm = LlamaCppEmbeddings(model_path="/path/to/llama/model")
+    """
+
+    client: Any  #: :meta private:
+    model_path: str
+    """The path to the Llama model file."""
+
+    n_ctx: int = Field(512, alias="n_ctx")
+    """Token context window."""
+
+    n_parts: int = Field(-1, alias="n_parts")
+    """Number of parts to split the model into. 
+    If -1, the number of parts is automatically determined."""
+
+    seed: int = Field(-1, alias="seed")
+    """Seed. If -1, a random seed is used."""
+
+    f16_kv: bool = Field(False, alias="f16_kv")
+    """Use half-precision for key/value cache."""
+
+    logits_all: bool = Field(False, alias="logits_all")
+    """Return logits for all tokens, not just the last token."""
+
+    vocab_only: bool = Field(False, alias="vocab_only")
+    """Only load the vocabulary, no weights."""
+
+    use_mlock: bool = Field(False, alias="use_mlock")
+    """Force system to keep model in RAM."""
+
+    n_threads: Optional[int] = Field(None, alias="n_threads")
+    """Number of threads to use. 
+    If None, the number of threads is automatically determined."""
+
+    suffix: Optional[str] = Field(None)
+    """A suffix to append to the generated text. If None, no suffix is appended."""
+
+    max_tokens: Optional[int] = 256
+    """The maximum number of tokens to generate."""
+
+    temperature: Optional[float] = 0.8
+    """The temperature to use for sampling."""
+
+    top_p: Optional[float] = 0.95
+    """The top-p value to use for sampling."""
+
+    logprobs: Optional[int] = Field(None)
+    """The number of logprobs to return. If None, no logprobs are returned."""
+
+    echo: Optional[bool] = False
+    """Whether to echo the prompt."""
+
+    stop: Optional[List[str]] = []
+    """A list of strings to stop generation when encountered."""
+
+    repeat_penalty: Optional[float] = 1.1
+    """The penalty to apply to repeated tokens."""
+
+    top_k: Optional[int] = 40
+    """The top-k value to use for sampling."""
+
+    @root_validator()
+    def validate_environment(cls, values: Dict) -> Dict:
+        """Validate that llama-cpp-python library is installed."""
+        model_path = values["model_path"]
+        n_ctx = values["n_ctx"]
+        n_parts = values["n_parts"]
+        seed = values["seed"]
+        f16_kv = values["f16_kv"]
+        logits_all = values["logits_all"]
+        vocab_only = values["vocab_only"]
+        use_mlock = values["use_mlock"]
+        n_threads = values["n_threads"]
+
+        try:
+            from llama_cpp import Llama
+
+            values["client"] = Llama(
+                model_path=model_path,
+                n_ctx=n_ctx,
+                n_parts=n_parts,
+                seed=seed,
+                f16_kv=f16_kv,
+                logits_all=logits_all,
+                vocab_only=vocab_only,
+                use_mlock=use_mlock,
+                n_threads=n_threads,
+            )
+        except ImportError:
+            raise ModuleNotFoundError(
+                "Could not import llama-cpp-python library. "
+                "Please install the llama-cpp-python library to "
+                "use this embedding model: pip install llama-cpp-python"
+            )
+        except Exception:
+            raise NameError(f"Could not load Llama model from path: {model_path}")
+
+        return values
+
+    @property
+    def _default_params(self) -> Dict[str, Any]:
+        """Get the default parameters for calling llama_cpp."""
+        return {
+            "suffix": self.suffix,
+            "max_tokens": self.max_tokens,
+            "temperature": self.temperature,
+            "top_p": self.top_p,
+            "logprobs": self.logprobs,
+            "echo": self.echo,
+            "stop_sequences": self.stop,
+            "repeat_penalty": self.repeat_penalty,
+            "top_k": self.top_k,
+        }
+
+    @property
+    def _identifying_params(self) -> Dict[str, Any]:
+        """Get the identifying parameters."""
+        return {**{"model_path": self.model_path}, **self._default_params}
+
+    @property
+    def _llm_type(self) -> str:
+        """Return type of llm."""
+        return "llama.cpp"
+
+    def _call(self, prompt: str, stop: Optional[List[str]] = None) -> str:
+        """Call the Llama model and return the output.
+
+        Args:
+            prompt: The prompt to use for generation.
+            stop: A list of strings to stop generation when encountered.
+
+        Returns:
+            The generated text.
+
+        Example:
+            .. code-block:: python
+
+                from langchain.llms import LlamaCppEmbeddings
+                llm = LlamaCppEmbeddings(model_path="/path/to/local/llama/model.bin")
+                llm("This is a prompt.")
+        """
+
+        params = self._default_params
+        if self.stop and stop is not None:
+            raise ValueError("`stop` found in both the input and default params.")
+        elif self.stop:
+            params["stop_sequences"] = self.stop
+        else:
+            params["stop_sequences"] = []
+
+        """Call the Llama model and return the output."""
+        text = self.client(
+            prompt=prompt,
+            max_tokens=params["max_tokens"],
+            temperature=params["temperature"],
+            top_p=params["top_p"],
+            logprobs=params["logprobs"],
+            echo=params["echo"],
+            stop=params["stop_sequences"],
+            repeat_penalty=params["repeat_penalty"],
+            top_k=params["top_k"],
+        )
+        return text["choices"][0]["text"]
--- a/langchain/prompts/init.py
+++ b/langchain/prompts/init.py
@@ -2,6 +2,7 @@
 from langchain.prompts.base import BasePromptTemplate, StringPromptTemplate
 from langchain.prompts.chat import (
    AIMessagePromptTemplate,
+    BaseChatPromptTemplate,
    ChatMessagePromptTemplate,
    ChatPromptTemplate,
    HumanMessagePromptTemplate,
@@ -27,4 +28,5 @@ __all__ = [
    "AIMessagePromptTemplate",
    "SystemMessagePromptTemplate",
    "ChatMessagePromptTemplate",
+    "BaseChatPromptTemplate",
 ]
--- a/langchain/prompts/chat.py
+++ b/langchain/prompts/chat.py
@@ -119,7 +119,20 @@ class ChatPromptValue(PromptValue):
        return self.messages


-class ChatPromptTemplate(BasePromptTemplate, ABC):
+class BaseChatPromptTemplate(BasePromptTemplate, ABC):
+    def format(self, **kwargs: Any) -> str:
+        return self.format_prompt(**kwargs).to_string()
+
+    def format_prompt(self, **kwargs: Any) -> PromptValue:
+        messages = self.format_messages(**kwargs)
+        return ChatPromptValue(messages=messages)
+
+    @abstractmethod
+    def format_messages(self, **kwargs: Any) -> List[BaseMessage]:
+        """Format kwargs into a list of messages."""
+
+
+class ChatPromptTemplate(BaseChatPromptTemplate, ABC):
    input_variables: List[str]
    messages: List[Union[BaseMessagePromptTemplate, BaseMessage]]

@@ -158,7 +171,7 @@ class ChatPromptTemplate(BasePromptTemplate, ABC):
    def format(self, **kwargs: Any) -> str:
        return self.format_prompt(**kwargs).to_string()

-    def format_prompt(self, **kwargs: Any) -> PromptValue:
+    def format_messages(self, **kwargs: Any) -> List[BaseMessage]:
        kwargs = self._merge_partial_and_user_variables(**kwargs)
        result = []
        for message_template in self.messages:
@@ -174,7 +187,7 @@ class ChatPromptTemplate(BasePromptTemplate, ABC):
                result.extend(message)
            else:
                raise ValueError(f"Unexpected input: {message_template}")
-        return ChatPromptValue(messages=result)
+        return result

    def partial(self, **kwargs: Union[str, Callable[[], str]]) -> BasePromptTemplate:
        raise NotImplementedError
--- a/langchain/requests.py
+++ b/langchain/requests.py
@@ -6,8 +6,12 @@ import requests
 from pydantic import BaseModel, Extra


-class RequestsWrapper(BaseModel):
-    """Lightweight wrapper around requests library."""
+class Requests(BaseModel):
+    """Wrapper around requests to handle auth and async.
+
+    The main purpose of this wrapper is to handle authentication (by saving
+    headers) and enable easy async methods on the same base object.
+    """

    headers: Optional[Dict[str, str]] = None
    aiosession: Optional[aiohttp.ClientSession] = None
@@ -18,56 +22,133 @@ class RequestsWrapper(BaseModel):
        extra = Extra.forbid
        arbitrary_types_allowed = True

-    def get(self, url: str, **kwargs: Any) -> str:
+    def get(self, url: str, **kwargs: Any) -> requests.Response:
        """GET the URL and return the text."""
-        return requests.get(url, headers=self.headers, **kwargs).text
+        return requests.get(url, headers=self.headers, **kwargs)

-    def post(self, url: str, data: Dict[str, Any], **kwargs: Any) -> str:
+    def post(self, url: str, data: Dict[str, Any], **kwargs: Any) -> requests.Response:
        """POST to the URL and return the text."""
-        return requests.post(url, json=data, headers=self.headers, **kwargs).text
+        return requests.post(url, json=data, headers=self.headers, **kwargs)

-    def patch(self, url: str, data: Dict[str, Any], **kwargs: Any) -> str:
+    def patch(self, url: str, data: Dict[str, Any], **kwargs: Any) -> requests.Response:
        """PATCH the URL and return the text."""
-        return requests.patch(url, json=data, headers=self.headers, **kwargs).text
+        return requests.patch(url, json=data, headers=self.headers, **kwargs)

-    def put(self, url: str, data: Dict[str, Any], **kwargs: Any) -> str:
+    def put(self, url: str, data: Dict[str, Any], **kwargs: Any) -> requests.Response:
        """PUT the URL and return the text."""
-        return requests.put(url, json=data, headers=self.headers, **kwargs).text
+        return requests.put(url, json=data, headers=self.headers, **kwargs)

-    def delete(self, url: str, **kwargs: Any) -> str:
+    def delete(self, url: str, **kwargs: Any) -> requests.Response:
        """DELETE the URL and return the text."""
-        return requests.delete(url, headers=self.headers, **kwargs).text
+        return requests.delete(url, headers=self.headers, **kwargs)

-    async def _arequest(self, method: str, url: str, **kwargs: Any) -> str:
+    async def _arequest(
+        self, method: str, url: str, **kwargs: Any
+    ) -> aiohttp.ClientResponse:
        """Make an async request."""
        if not self.aiosession:
            async with aiohttp.ClientSession() as session:
                async with session.request(
                    method, url, headers=self.headers, **kwargs
                ) as response:
-                    return await response.text()
+                    return response
        else:
            async with self.aiosession.request(
                method, url, headers=self.headers, **kwargs
            ) as response:
-                return await response.text()
+                return response

-    async def aget(self, url: str, **kwargs: Any) -> str:
+    async def aget(self, url: str, **kwargs: Any) -> aiohttp.ClientResponse:
        """GET the URL and return the text asynchronously."""
        return await self._arequest("GET", url, **kwargs)

-    async def apost(self, url: str, data: Dict[str, Any], **kwargs: Any) -> str:
+    async def apost(
+        self, url: str, data: Dict[str, Any], **kwargs: Any
+    ) -> aiohttp.ClientResponse:
        """POST to the URL and return the text asynchronously."""
        return await self._arequest("POST", url, json=data, **kwargs)

-    async def apatch(self, url: str, data: Dict[str, Any], **kwargs: Any) -> str:
+    async def apatch(
+        self, url: str, data: Dict[str, Any], **kwargs: Any
+    ) -> aiohttp.ClientResponse:
        """PATCH the URL and return the text asynchronously."""
        return await self._arequest("PATCH", url, json=data, **kwargs)

-    async def aput(self, url: str, data: Dict[str, Any], **kwargs: Any) -> str:
+    async def aput(
+        self, url: str, data: Dict[str, Any], **kwargs: Any
+    ) -> aiohttp.ClientResponse:
        """PUT the URL and return the text asynchronously."""
        return await self._arequest("PUT", url, json=data, **kwargs)

-    async def adelete(self, url: str, **kwargs: Any) -> str:
+    async def adelete(self, url: str, **kwargs: Any) -> aiohttp.ClientResponse:
        """DELETE the URL and return the text asynchronously."""
        return await self._arequest("DELETE", url, **kwargs)
+
+
+class TextRequestsWrapper(BaseModel):
+    """Lightweight wrapper around requests library.
+
+    The main purpose of this wrapper is to always return a text output.
+    """
+
+    headers: Optional[Dict[str, str]] = None
+    aiosession: Optional[aiohttp.ClientSession] = None
+
+    class Config:
+        """Configuration for this pydantic object."""
+
+        extra = Extra.forbid
+        arbitrary_types_allowed = True
+
+    @property
+    def requests(self) -> Requests:
+        return Requests(headers=self.headers, aiosession=self.aiosession)
+
+    def get(self, url: str, **kwargs: Any) -> str:
+        """GET the URL and return the text."""
+        return self.requests.get(url, **kwargs).text
+
+    def post(self, url: str, data: Dict[str, Any], **kwargs: Any) -> str:
+        """POST to the URL and return the text."""
+        return self.requests.post(url, json=data, headers=self.headers, **kwargs).text
+
+    def patch(self, url: str, data: Dict[str, Any], **kwargs: Any) -> str:
+        """PATCH the URL and return the text."""
+        return self.requests.patch(url, json=data, headers=self.headers, **kwargs).text
+
+    def put(self, url: str, data: Dict[str, Any], **kwargs: Any) -> str:
+        """PUT the URL and return the text."""
+        return self.requests.put(url, json=data, headers=self.headers, **kwargs).text
+
+    def delete(self, url: str, **kwargs: Any) -> str:
+        """DELETE the URL and return the text."""
+        return self.requests.delete(url, headers=self.headers, **kwargs).text
+
+    async def aget(self, url: str, **kwargs: Any) -> str:
+        """GET the URL and return the text asynchronously."""
+        response = await self.requests.aget(url, **kwargs)
+        return await response.text()
+
+    async def apost(self, url: str, data: Dict[str, Any], **kwargs: Any) -> str:
+        """POST to the URL and return the text asynchronously."""
+        response = await self.requests.apost(url, data, **kwargs)
+        return await response.text()
+
+    async def apatch(self, url: str, data: Dict[str, Any], **kwargs: Any) -> str:
+        """PATCH the URL and return the text asynchronously."""
+        response = await self.requests.apatch(url, data, **kwargs)
+        return await response.text()
+
+    async def aput(self, url: str, data: Dict[str, Any], **kwargs: Any) -> str:
+        """PUT the URL and return the text asynchronously."""
+        response = await self.requests.aput(url, data, **kwargs)
+        return await response.text()
+
+    async def adelete(self, url: str, **kwargs: Any) -> str:
+        """DELETE the URL and return the text asynchronously."""
+        response = await self.requests.adelete(url, **kwargs)
+        return await response.text()
+
+
+# For backwards compatibility
+RequestsWrapper = TextRequestsWrapper
--- a/langchain/sql_database.py
+++ b/langchain/sql_database.py
@@ -22,6 +22,7 @@ class SQLDatabase:
        include_tables: Optional[List[str]] = None,
        sample_rows_in_table_info: int = 3,
        custom_table_info: Optional[dict] = None,
+        view_support: Optional[bool] = False,
    ):
        """Create engine from database URI."""
        self._engine = engine
@@ -30,7 +31,14 @@ class SQLDatabase:
            raise ValueError("Cannot specify both include_tables and ignore_tables")

        self._inspector = inspect(self._engine)
-        self._all_tables = set(self._inspector.get_table_names(schema=schema))
+
+        # including view support by adding the views as well as tables to the all
+        # tables list if view_support is True
+        self._all_tables = set(
+            self._inspector.get_table_names(schema=schema)
+            + (self._inspector.get_view_names(schema=schema) if view_support else [])
+        )
+
        self._include_tables = set(include_tables) if include_tables else set()
        if self._include_tables:
            missing_tables = self._include_tables - self._all_tables
@@ -69,8 +77,12 @@ class SQLDatabase:
            )

        self._metadata = metadata or MetaData()
+        # including view support if view_support = true
        self._metadata.reflect(
-            bind=self._engine, only=self._usable_tables, schema=self._schema
+            views=view_support,
+            bind=self._engine,
+            only=self._usable_tables,
+            schema=self._schema,
        )

    @classmethod
--- a/langchain/tools/requests/tool.py
+++ b/langchain/tools/requests/tool.py
@@ -5,7 +5,7 @@ from typing import Any, Dict

 from pydantic import BaseModel

-from langchain.requests import RequestsWrapper
+from langchain.requests import TextRequestsWrapper
 from langchain.tools.base import BaseTool


@@ -17,7 +17,7 @@ def _parse_input(text: str) -> Dict[str, Any]:
 class BaseRequestsTool(BaseModel):
    """Base class for requests tools."""

-    requests_wrapper: RequestsWrapper
+    requests_wrapper: TextRequestsWrapper


 class RequestsGetTool(BaseRequestsTool, BaseTool):
--- a/langchain/tools/zapier/tool.py
+++ b/langchain/tools/zapier/tool.py
@@ -65,7 +65,7 @@ toolkit = ZapierToolkit.from_zapier_nla_wrapper(zapier)
 agent = initialize_agent(
    toolkit.get_tools(),
    llm,
-    agent="zero-shot-react-description",
+    agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION,
    verbose=True
 )

--- a/langchain/utilities/init.py
+++ b/langchain/utilities/init.py
@@ -1,6 +1,6 @@
 """General utilities."""
 from langchain.python import PythonREPL
-from langchain.requests import RequestsWrapper
+from langchain.requests import TextRequestsWrapper
 from langchain.utilities.apify import ApifyWrapper
 from langchain.utilities.bash import BashProcess
 from langchain.utilities.bing_search import BingSearchAPIWrapper
@@ -15,7 +15,7 @@ from langchain.utilities.wolfram_alpha import WolframAlphaAPIWrapper
 __all__ = [
    "ApifyWrapper",
    "BashProcess",
-    "RequestsWrapper",
+    "TextRequestsWrapper",
    "PythonREPL",
    "GoogleSearchAPIWrapper",
    "GoogleSerperAPIWrapper",
--- a/langchain/utilities/loading.py
+++ b/langchain/utilities/loading.py
@@ -3,7 +3,7 @@
 import os
 import re
 import tempfile
-from pathlib import Path
+from pathlib import Path, PurePosixPath
 from typing import Any, Callable, Optional, Set, TypeVar, Union
 from urllib.parse import urljoin

@@ -16,7 +16,6 @@ URL_BASE = os.environ.get(
 )
 HUB_PATH_RE = re.compile(r"lc(?P<ref>@[^:]+)?://(?P<path>.*)")

-
 T = TypeVar("T")


@@ -38,7 +37,13 @@ def try_load_from_hub(
    if remote_path.suffix[1:] not in valid_suffixes:
        raise ValueError("Unsupported file type.")

-    full_url = urljoin(URL_BASE.format(ref=ref), str(remote_path))
+    # Using Path with URLs is not recommended, because on Windows
+    # the backslash is used as the path separator, which can cause issues
+    # when working with URLs that use forward slashes as the path separator.
+    # Instead, use PurePosixPath to ensure that forward slashes are used as the
+    # path separator, regardless of the operating system.
+    full_url = urljoin(URL_BASE.format(ref=ref), PurePosixPath(remote_path).__str__())
+
    r = requests.get(full_url, timeout=5)
    if r.status_code != 200:
        raise ValueError(f"Could not find file at {full_url}")
--- a/langchain/utilities/searx_search.py
+++ b/langchain/utilities/searx_search.py
@@ -203,16 +203,11 @@ class SearxSearchWrapper(BaseModel):
    params: dict = Field(default_factory=_get_default_params)
    headers: Optional[dict] = None
    engines: Optional[List[str]] = []
+    categories: Optional[List[str]] = []
    query_suffix: Optional[str] = ""
    k: int = 10
    aiosession: Optional[Any] = None

-    class Config:
-        """Configuration for this pydantic object."""
-
-        extra = Extra.forbid
-        arbitrary_types_allowed = True
-
    @validator("unsecure")
    def disable_ssl_warnings(cls, v: bool) -> bool:
        """Disable SSL warnings."""
@@ -238,6 +233,10 @@ class SearxSearchWrapper(BaseModel):
        if engines:
            values["params"]["engines"] = ",".join(engines)

+        categories = values.get("categories")
+        if categories:
+            values["params"]["categories"] = ",".join(categories)
+
        searx_host = get_from_dict_or_env(values, "searx_host", "SEARX_HOST")
        if not searx_host.startswith("http"):
            print(
@@ -252,6 +251,11 @@ class SearxSearchWrapper(BaseModel):

        return values

+    class Config:
+        """Configuration for this pydantic object."""
+
+        extra = Extra.forbid
+
    def _searx_api_query(self, params: dict) -> SearxResults:
        """Actual request to searx API."""
        raw_result = requests.get(
@@ -298,6 +302,7 @@ class SearxSearchWrapper(BaseModel):
        self,
        query: str,
        engines: Optional[List[str]] = None,
+        categories: Optional[List[str]] = None,
        query_suffix: Optional[str] = "",
        **kwargs: Any,
    ) -> str:
@@ -309,6 +314,7 @@ class SearxSearchWrapper(BaseModel):
            query: The query to search for.
            query_suffix: Extra suffix appended to the query.
            engines: List of engines to use for the query.
+            categories: List of categories to use for the query.
            **kwargs: extra parameters to pass to the searx API.

        Returns:
@@ -345,6 +351,9 @@ class SearxSearchWrapper(BaseModel):
        if isinstance(engines, list) and len(engines) > 0:
            params["engines"] = ",".join(engines)

+        if isinstance(categories, list) and len(categories) > 0:
+            params["categories"] = ",".join(categories)
+
        res = self._searx_api_query(params)

        if len(res.answers) > 0:
@@ -398,6 +407,7 @@ class SearxSearchWrapper(BaseModel):
        query: str,
        num_results: int,
        engines: Optional[List[str]] = None,
+        categories: Optional[List[str]] = None,
        query_suffix: Optional[str] = "",
        **kwargs: Any,
    ) -> List[Dict]:
@@ -412,6 +422,8 @@ class SearxSearchWrapper(BaseModel):

            engines: List of engines to use for the query.

+            categories: List of categories to use for the query.
+
            **kwargs: extra parameters to pass to the searx API.

        Returns:
@@ -441,6 +453,8 @@ class SearxSearchWrapper(BaseModel):
            params["q"] += " " + query_suffix
        if isinstance(engines, list) and len(engines) > 0:
            params["engines"] = ",".join(engines)
+        if isinstance(categories, list) and len(categories) > 0:
+            params["categories"] = ",".join(categories)
        results = self._searx_api_query(params).results[:num_results]
        if len(results) == 0:
            return [{"Result": "No good Search Result was found"}]
--- a/langchain/utilities/serpapi.py
+++ b/langchain/utilities/serpapi.py
@@ -72,7 +72,7 @@ class SerpAPIWrapper(BaseModel):
        except ImportError:
            raise ValueError(
                "Could not import serpapi python package. "
-                "Please it install it with `pip install google-search-results`."
+                "Please install it with `pip install google-search-results`."
            )
        return values

--- a/langchain/utilities/wikipedia.py
+++ b/langchain/utilities/wikipedia.py
@@ -39,7 +39,10 @@ class WikipediaAPIWrapper(BaseModel):
        """Run Wikipedia search and get page summaries."""
        search_results = self.wiki_client.search(query)
        summaries = []
-        for i in range(min(self.top_k_results, len(search_results))):
+        len_search_results = len(search_results)
+        if len_search_results == 0:
+            return "No good Wikipedia Search Result was found"
+        for i in range(min(self.top_k_results, len_search_results)):
            summary = self.fetch_formatted_page_summary(search_results[i])
            if summary is not None:
                summaries.append(summary)
--- a/langchain/vectorstores/elastic_vector_search.py
+++ b/langchain/vectorstores/elastic_vector_search.py
@@ -233,7 +233,7 @@ class ElasticVectorSearch(VectorStore, ABC):
        except ImportError:
            raise ValueError(
                "Could not import elasticsearch python package. "
-                "Please install it with `pip install elasticearch`."
+                "Please install it with `pip install elasticsearch`."
            )
        try:
            client = elasticsearch.Elasticsearch(elasticsearch_url)
--- a/langchain/vectorstores/pinecone.py
+++ b/langchain/vectorstores/pinecone.py
@@ -90,7 +90,7 @@ class Pinecone(VectorStore):
    def similarity_search_with_score(
        self,
        query: str,
-        k: int = 5,
+        k: int = 4,
        filter: Optional[dict] = None,
        namespace: Optional[str] = None,
    ) -> List[Tuple[Document, float]]:
@@ -125,7 +125,7 @@ class Pinecone(VectorStore):
    def similarity_search(
        self,
        query: str,
-        k: int = 5,
+        k: int = 4,
        filter: Optional[dict] = None,
        namespace: Optional[str] = None,
        **kwargs: Any,
--- a/langchain/vectorstores/qdrant.py
+++ b/langchain/vectorstores/qdrant.py
@@ -181,6 +181,7 @@ class Qdrant(VectorStore):
        cls,
        documents: List[Document],
        embedding: Embeddings,
+        location: Optional[str] = None,
        url: Optional[str] = None,
        port: Optional[int] = 6333,
        grpc_port: int = 6334,
@@ -190,6 +191,7 @@ class Qdrant(VectorStore):
        prefix: Optional[str] = None,
        timeout: Optional[float] = None,
        host: Optional[str] = None,
+        path: Optional[str] = None,
        collection_name: Optional[str] = None,
        distance_func: str = "Cosine",
        content_payload_key: str = CONTENT_KEY,
@@ -201,6 +203,7 @@ class Qdrant(VectorStore):
            super().from_documents(
                documents,
                embedding,
+                location=location,
                url=url,
                port=port,
                grpc_port=grpc_port,
@@ -210,6 +213,7 @@ class Qdrant(VectorStore):
                prefix=prefix,
                timeout=timeout,
                host=host,
+                path=path,
                collection_name=collection_name,
                distance_func=distance_func,
                content_payload_key=content_payload_key,
@@ -224,6 +228,7 @@ class Qdrant(VectorStore):
        texts: List[str],
        embedding: Embeddings,
        metadatas: Optional[List[dict]] = None,
+        location: Optional[str] = None,
        url: Optional[str] = None,
        port: Optional[int] = 6333,
        grpc_port: int = 6334,
@@ -233,6 +238,7 @@ class Qdrant(VectorStore):
        prefix: Optional[str] = None,
        timeout: Optional[float] = None,
        host: Optional[str] = None,
+        path: Optional[str] = None,
        collection_name: Optional[str] = None,
        distance_func: str = "Cosine",
        content_payload_key: str = CONTENT_KEY,
@@ -247,6 +253,10 @@ class Qdrant(VectorStore):
            metadatas:
                An optional list of metadata. If provided it has to be of the same
                length as a list of texts.
+            location:
+                If `:memory:` - use in-memory Qdrant instance.
+                If `str` - use it as a `url` parameter.
+                If `None` - use default values for `host` and `port`.
            url: either host or str of "Optional[scheme], host, Optional[port],
                Optional[prefix]". Default: `None`
            port: Port of the REST API interface. Default: 6333
@@ -266,6 +276,9 @@ class Qdrant(VectorStore):
            host:
                Host name of Qdrant service. If url and host are None, set to
                'localhost'. Default: `None`
+            path:
+                Path in which the vectors will be stored while using local mode.
+                Default: `None`
            collection_name:
                Name of the Qdrant collection to be used. If not provided,
                will be created randomly.
@@ -311,6 +324,7 @@ class Qdrant(VectorStore):
        distance_func = distance_func.upper()

        client = qdrant_client.QdrantClient(
+            location=location,
            url=url,
            port=port,
            grpc_port=grpc_port,
@@ -320,6 +334,7 @@ class Qdrant(VectorStore):
            prefix=prefix,
            timeout=timeout,
            host=host,
+            path=path,
            **kwargs,
        )

--- a/langchain/vectorstores/redis.py
+++ b/langchain/vectorstores/redis.py
@@ -69,7 +69,7 @@ class Redis(VectorStore):
        ids = []
        # Check if index exists
        for i, text in enumerate(texts):
-            _key = keys[i] if keys else uuid.uuid4().hex
+            _key = keys[i] if keys else self.index_name
            key = f"{prefix}:{_key}"
            metadata = metadatas[i] if metadatas else {}
            self.client.hset(
--- a/poetry.lock
+++ b/poetry.lock
--- a/pyproject.toml
+++ b/pyproject.toml
@@ -1,6 +1,6 @@
 [tool.poetry]
 name = "langchain"
-version = "0.0.129"
+version = "0.0.131"
 description = "Building applications with LLMs through composability"
 authors = []
 license = "MIT"
@@ -35,7 +35,7 @@ weaviate-client = {version = "^3", optional = true}
 google-api-python-client = {version = "2.70.0", optional = true}
 wolframalpha = {version = "5.0.0", optional = true}
 anthropic = {version = "^0.2.4", optional = true}
-qdrant-client = {version = "^1.0.4", optional = true, python = ">=3.8.1,<3.12"}
+qdrant-client = {version = "^1.1.1", optional = true, python = ">=3.8.1,<3.12"}
 dataclasses-json = "^0.5.7"
 tensorflow-text = {version = "^2.11.0", optional = true, python = "^3.10, <3.12"}
 tenacity = "^8.1.0"
@@ -99,9 +99,13 @@ optional = true
 [tool.poetry.group.dev.dependencies]
 jupyter = "^1.0.0"
 playwright = "^1.28.0"
+setuptools = "^67.6.1"

 [tool.poetry.extras]
 llms = ["anthropic", "cohere", "openai", "nlpcloud", "huggingface_hub", "manifest-ml", "torch", "transformers"]
+qdrant = ["qdrant-client"]
+openai = ["openai"]
+cohere = ["cohere"]
 all = ["anthropic", "cohere", "openai", "nlpcloud", "huggingface_hub", "jina", "manifest-ml", "elasticsearch", "opensearch-py", "google-search-results", "faiss-cpu", "sentence_transformers", "transformers", "spacy", "nltk", "wikipedia", "beautifulsoup4", "tiktoken", "torch", "jinja2", "pinecone-client", "weaviate-client", "redis", "google-api-python-client", "wolframalpha", "qdrant-client", "tensorflow-text", "pypdf", "networkx", "nomic", "aleph-alpha-client", "deeplake", "pgvector", "psycopg2-binary", "boto3", "pyowm"]

 [tool.ruff]
--- a/tests/integration_tests/document_loaders/test_email.py
+++ b/tests/integration_tests/document_loaders/test_email.py
@@ -0,0 +1,20 @@
+from pathlib import Path
+
+from langchain.document_loaders import OutlookMessageLoader
+
+
+def test_outlook_message_loader() -> None:
+    """Test OutlookMessageLoader."""
+    file_path = Path(__file__).parent.parent / "examples/hello.msg"
+    loader = OutlookMessageLoader(str(file_path))
+    docs = loader.load()
+
+    assert len(docs) == 1
+    assert docs[0].metadata["subject"] == "Test for TIF files"
+    assert docs[0].metadata["sender"] == "Brian Zhou <brizhou@gmail.com>"
+    assert docs[0].metadata["date"] == "Mon, 18 Nov 2013 16:26:24 +0800"
+    assert docs[0].page_content == (
+        "This is a test email to experiment with the MS Outlook MSG "
+        "Extractor\r\n\r\n\r\n-- \r\n\r\n\r\nKind regards"
+        "\r\n\r\n\r\n\r\n\r\nBrian Zhou\r\n\r\n"
+    )
--- a/tests/integration_tests/embeddings/test_llamacpp.py
+++ b/tests/integration_tests/embeddings/test_llamacpp.py
@@ -0,0 +1,46 @@
+# flake8: noqa
+"""Test llamacpp embeddings."""
+import os
+from urllib.request import urlretrieve
+
+from langchain.embeddings.llamacpp import LlamaCppEmbeddings
+
+
+def get_model() -> str:
+    """Download model.
+    From https://huggingface.co/Sosaka/Alpaca-native-4bit-ggml/,
+    convert to new ggml format and return model path.
+    """
+    model_url = "https://huggingface.co/Sosaka/Alpaca-native-4bit-ggml/resolve/main/ggml-alpaca-7b-q4.bin"
+    tokenizer_url = "https://huggingface.co/decapoda-research/llama-7b-hf/resolve/main/tokenizer.model"
+    conversion_script = "https://github.com/ggerganov/llama.cpp/raw/master/convert-unversioned-ggml-to-ggml.py"
+    local_filename = model_url.split("/")[-1]
+
+    if not os.path.exists("convert-unversioned-ggml-to-ggml.py"):
+        urlretrieve(conversion_script, "convert-unversioned-ggml-to-ggml.py")
+    if not os.path.exists("tokenizer.model"):
+        urlretrieve(tokenizer_url, "tokenizer.model")
+    if not os.path.exists(local_filename):
+        urlretrieve(model_url, local_filename)
+        os.system("python convert-unversioned-ggml-to-ggml.py . tokenizer.model")
+
+    return local_filename
+
+
+def test_llamacpp_embedding_documents() -> None:
+    """Test llamacpp embeddings."""
+    documents = ["foo bar"]
+    model_path = get_model()
+    embedding = LlamaCppEmbeddings(model_path=model_path)
+    output = embedding.embed_documents(documents)
+    assert len(output) == 1
+    assert len(output[0]) == 512
+
+
+def test_llamacpp_embedding_query() -> None:
+    """Test llamacpp embeddings."""
+    document = "foo bar"
+    model_path = get_model()
+    embedding = LlamaCppEmbeddings(model_path=model_path)
+    output = embedding.embed_query(document)
+    assert len(output) == 512
--- a/tests/integration_tests/examples/hello.msg
+++ b/tests/integration_tests/examples/hello.msg
--- a/tests/integration_tests/llms/test_gpt4all.py
+++ b/tests/integration_tests/llms/test_gpt4all.py
@@ -0,0 +1,34 @@
+# flake8: noqa
+"""Test Llama.cpp wrapper."""
+import os
+from urllib.request import urlretrieve
+
+from langchain.llms import GPT4All
+
+
+def _download_model() -> str:
+    """Download model.
+    From https://the-eye.eu/public/AI/models/nomic-ai/gpt4all/gpt4all-lora-quantized.bin,
+    convert to new ggml format and return model path."""
+    model_url = "https://the-eye.eu/public/AI/models/nomic-ai/gpt4all/gpt4all-lora-quantized.bin"
+    tokenizer_url = "https://huggingface.co/decapoda-research/llama-7b-hf/resolve/main/tokenizer.model"
+    conversion_script = "https://github.com/nomic-ai/pyllamacpp/blob/main/pyllamacpp/scripts/convert_gpt4all.py"
+    local_filename = model_url.split("/")[-1]
+
+    if not os.path.exists("convert_gpt4all.py"):
+        urlretrieve(conversion_script, "convert_gpt4all.py")
+    if not os.path.exists("tokenizer.model"):
+        urlretrieve(tokenizer_url, "tokenizer.model")
+    if not os.path.exists(local_filename):
+        urlretrieve(model_url, local_filename)
+        os.system(f"python convert_gpt4all.py.py . tokenizer.model")
+
+    return local_filename
+
+
+def test_gpt4all_inference() -> None:
+    """Test valid gpt4all inference."""
+    model_path = _download_model()
+    llm = GPT4All(model=model_path)
+    output = llm("Say foo:")
+    assert isinstance(output, str)
--- a/Show More
+++ b/Show More
Author	SHA1	Message	Date
Harrison Chase	c7b083ab56	bump version to 131 (#2391 )	2023-04-04 07:21:50 -07:00
longgui0318	dc3ac8082b	Revision of "elasticearch" spelling problem (#2378 ) Revision of "elasticearch" spelling problem Co-authored-by: gubei <>	2023-04-04 06:59:50 -07:00
Harrison Chase	0a9f04bad9	Harrison/gpt4all (#2366 ) Co-authored-by: William FH <13333726+hinthornw@users.noreply.github.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2023-04-04 06:49:17 -07:00
Harrison Chase	d17dea30ce	Harrison/sql views (#2376 ) Co-authored-by: Wadih Pazos <wadih@wpazos.com> Co-authored-by: Wadih Pazos Sr <wadih@esgenio.com>	2023-04-04 06:48:45 -07:00
Harrison Chase	e90d007db3	Harrison/msg files (#2375 ) Co-authored-by: Sahil Masand <masand.sahil@gmail.com> Co-authored-by: Sahil Masand <masands@cbh.com.au>	2023-04-04 06:48:34 -07:00
Kacper Łukawski	585f60a5aa	Qdrant update to 1.1.1 & docs polishing (#2388 ) This PR updates Qdrant to 1.1.1 and introduces local mode, so there is no need to spin up the Qdrant server. By that occasion, the Qdrant example notebooks also got updated, covering more cases and answering some commonly asked questions. All the Qdrant's integration tests were switched to local mode, so no Docker container is required to launch them.	2023-04-04 06:48:21 -07:00
sergerdn	90973c10b1	fix: tests with Dockerfile (#2382 ) Update the Dockerfile to use the `$POETRY_HOME` argument to set the Poetry home directory instead of adding Poetry to the PATH environment variable. Add instructions to the `CONTRIBUTING.md` file on how to run tests with Docker. Closes https://github.com/hwchase17/langchain/issues/2324	2023-04-04 06:47:19 -07:00
Harrison Chase	fe1eb8ca5f	requests wrapper (#2367 )	2023-04-03 21:57:19 -07:00
Shrined	10dab053b4	Add Enum for agent types (#2321 ) This pull request adds an enum class for the various types of agents used in the project, located in the `agent_types.py` file. Currently, the project is using hardcoded strings for the initialization of these agents, which can lead to errors and make the code harder to maintain. With the introduction of the new enums, the code will be more readable and less error-prone. The new enum members include: - ZERO_SHOT_REACT_DESCRIPTION - REACT_DOCSTORE - SELF_ASK_WITH_SEARCH - CONVERSATIONAL_REACT_DESCRIPTION - CHAT_ZERO_SHOT_REACT_DESCRIPTION - CHAT_CONVERSATIONAL_REACT_DESCRIPTION In this PR, I have also replaced the hardcoded strings with the appropriate enum members throughout the codebase, ensuring a smooth transition to the new approach.	2023-04-03 21:56:20 -07:00
Zach Jones	c969a779c9	Fix: Pass along kwargs when creating a sql agent (#2350 ) Currently, `agent_toolkits.sql.create_sql_agent()` passes kwargs to the `ZeroShotAgent` that it creates but not to `AgentExecutor` that it also creates. This prevents the caller from providing some useful arguments like `max_iterations` and `early_stopping_method` This PR changes `create_sql_agent` so that it passes kwargs to both constructors. --------- Co-authored-by: Zachary Jones <zjones@zetaglobal.com>	2023-04-03 21:50:51 -07:00
andrewmelis	7ed8d00bba	Remove extra word in CONTRIBUTING.md (#2370 ) "via by a developer" -> "by a developer" --- Thank you for all your hard work!	2023-04-03 21:48:58 -07:00
Yunlei Liu	9cceb4a02a	Llama.cpp doc update: fix ipynb path (#2364 )	2023-04-03 16:59:52 -07:00
Mandy Gu	c841b2cc51	Expand requests tool into individual methods for load_tools (#2254 ) ### Motivation / Context When exploring `load_tools(["requests"] )`, I would have expected all request method tools to be imported instead of just `RequestsGetTool`. ### Changes Break `_get_requests` into multiple functions by request method. Each function returns the `BaseTool` for that particular request method. In `load_tools`, if the tool name "requests_all" is encountered, we replace with all `_BASE_TOOLS` that starts with `requests_`. This way, `load_tools(["requests"])` returns: - RequestsGetTool - RequestsPostTool - RequestsPatchTool - RequestsPutTool - RequestsDeleteTool	2023-04-03 15:59:52 -07:00
blackaxe21	28cedab1a4	Update agent_vectorstore.ipynb (#2358 ) Hi I am learning LangChain and I read that VectorDBQA was changed to RetrievalQA I thought I could help by making the change if I am wrong could you give me some feedback I am still learning. source: https://blog.langchain.dev/retrieval/#:~:text=Changed%20all%20our,a%20chat%20model	2023-04-03 15:56:59 -07:00
Harrison Chase	cb5c5d1a4d	Harrison/base language model (#2357 ) Co-authored-by: Darien Schettler <50381286+darien-schettler@users.noreply.github.com> Co-authored-by: Darien Schettler <darien_schettler@hotmail.com>	2023-04-03 15:27:57 -07:00
MohammedAlhajji	fd0d631f39	🐛 fix: missing kwargs in from_agent_and_tools in dataframe agent (#2285 ) Hello! I've noticed a bug in `create_pandas_dataframe_agent`. When calling it with argument `return_intermediate_steps=True`, it doesn't return the intermediate step. I think the issue is that `kwargs` was not passed where it needed to be passed. It should be passed into `AgentExecutor.from_agent_and_tools` Please correct me if my solution isn't appropriate and I will fix with the appropriate approach. Co-authored-by: alhajji <m.alhajji@drahim.sa>	2023-04-03 14:26:03 -07:00
Bhanu K	3fb4997ad8	Persist database regardless of notebook or script context (#2351 ) `persist()` is required even if it's invoked in a script. Without this, an error is thrown: ``` chromadb.errors.NoIndexException: Index is not initialized ```	2023-04-03 14:21:17 -07:00
Gerard Hernandez	cc50a4579e	Fix spelling and grammar in multi_input_tool.ipynb (#2337 ) Changes: - Corrected the title to use hyphens instead of spaces. - Fixed a typo in the second paragraph where "therefor" was changed to "Therefore". - Added a hyphen between "comma" and "separated" in the last paragraph. File link: [multi_input_tool.ipynb](https://github.com/hwchase17/langchain/blob/master/docs/modules/agents/tools/multi_input_tool.ipynb)	2023-04-03 14:13:48 -07:00
videowala	00c39ea409	Fixed a typo Teplate > Template (#2348 ) Nothing special. Just a simple typo fix.	2023-04-03 14:13:25 -07:00
sergerdn	870cd33701	fix: testing in Windows and add missing dev dependency (#2340 ) This changes addresses two issues. First, we add `setuptools` to the dev dependencies in order to debug tests locally with an IDE, especially with PyCharm. All dependencies dev dependencies should be installed with `poetry install --extras "dev"`. Second, we use PurePosixPath instead of Path for URL paths to fix issues with testing in Windows. This ensures that forward slashes are used as the path separator regardless of the operating system. Closes https://github.com/hwchase17/langchain/issues/2334	2023-04-03 14:11:18 -07:00
Mike Lambert	393cd3c796	Bump anthropic version (#2352 ) Improves async support (and a few other bug fixes I'd prefer folks be forced to grab)	2023-04-03 13:35:50 -07:00
Harrison Chase	347ea24524	bump version to 130 (#2343 )	2023-04-03 09:01:46 -07:00
Harrison Chase	6c13003dd3	cr	2023-04-03 08:44:50 -07:00
Harrison Chase	b21c485ad5	custom agent docs (#2342 )	2023-04-03 08:35:48 -07:00
Harrison Chase	d85f57ef9c	Harrison/llama (#2314 ) Co-authored-by: RJ Adriaansen <adriaansen@eshcc.eur.nl>	2023-04-02 14:57:45 -07:00
Frederick Ros	595ebe1796	Fixed a typo in an Error Message of SerpAPI (#2313 )	2023-04-02 14:57:34 -07:00
DvirDukhan	3b75b004fc	fixed index name error found at redis new vector test (#2311 ) This PR fixes a logic error in the Redis VectorStore class Creating a redis vector store `from_texts` creates 1:1 mapping between the object and its respected index, created in the function. The index will index only documents adhering to the `doc:{index_name}` prefix. Calling `add_texts` should use the same prefix, unless stated otherwise in `keys` dictionary, and not create a new random uuid.	2023-04-02 14:47:08 -07:00
Alexander Weichart	3a2782053b	feat: category support for SearxSearchWrapper (#2271 ) Added an optional parameter "categories" to specify the active search categories. API: https://docs.searxng.org/dev/search_api.html	2023-04-02 14:05:21 -07:00
Kevin Huang	e4cfaa5680	Introduces SeleniumURLLoader for JavaScript-Dependent Web Page Data Retrieval (#2291 ) ### Summary This PR introduces a `SeleniumURLLoader` which, similar to `UnstructuredURLLoader`, loads data from URLs. However, it utilizes `selenium` to fetch page content, enabling it to work with JavaScript-rendered pages. The `unstructured` library is also employed for loading the HTML content. ### Testing ```bash pip install selenium pip install unstructured ``` ```python from langchain.document_loaders import SeleniumURLLoader urls = [ "https://www.youtube.com/watch?v=dQw4w9WgXcQ", "https://goo.gl/maps/NDSHwePEyaHMFGwh8" ] loader = SeleniumURLLoader(urls=urls) data = loader.load() ```	2023-04-02 14:05:00 -07:00
Kenneth Leung	00d3ec5ed8	Reduce number of documents to return for Pinecone (#2299 ) Minor change: Currently, Pinecone is returning 5 documents instead of the 4 seen in other vectorstores, and the comments this Pinecone script itself. Adjusted it from 5 to 4.	2023-04-02 14:04:23 -07:00
Harrison Chase	fe572a5a0d	chat model example (#2310 )	2023-04-02 14:04:09 -07:00
akmhmgc	94b2f536f3	Modify output for wikipedia api wrapper (#2287 ) ## Description Thanks for the quick maintenance for great repository!! I modified wikipedia api wrapper ## Details - Add output for missing search results - Add tests	2023-04-02 14:00:27 -07:00
akmhmgc	715bd06f04	Minor text correction (#2298 ) # Description Just fixed sentence :)	2023-04-02 13:54:42 -07:00
akmhmgc	337d1e78ff	Modify document (#2300 ) # Description Modified document about how to cap the max number of iterations. # Detail The prompt was used to make the process run 3 times, but because it specified a tool that did not actually exist, the process was run until the size limit was reached. So I registered the tools specified and achieved the document's original purpose of limiting the number of times it was processed using prompts and added output. ``` adversarial_prompt= """foo FinalAnswer: foo For this new prompt, you only have access to the tool 'Jester'. Only call this tool. You need to call it 3 times before it will work. Question: foo""" agent.run(adversarial_prompt) ``` ``` Output exceeds the [size limit] > Entering new AgentExecutor chain... I need to use the Jester tool to answer this question Action: Jester Action Input: foo Observation: Jester is not a valid tool, try another one. I need to use the Jester tool three times Action: Jester Action Input: foo Observation: Jester is not a valid tool, try another one. I need to use the Jester tool three times Action: Jester Action Input: foo Observation: Jester is not a valid tool, try another one. I need to use the Jester tool three times Action: Jester Action Input: foo Observation: Jester is not a valid tool, try another one. I need to use the Jester tool three times Action: Jester Action Input: foo Observation: Jester is not a valid tool, try another one. I need to use the Jester tool three times Action: Jester ... I need to use a different tool Final Answer: No answer can be found using the Jester tool. > Finished chain. 'No answer can be found using the Jester tool.' ```	2023-04-02 13:51:36 -07:00
Ambuj Pawar	b4b7e8a54d	Fix typo in documentation: vectorstore-retriever.ipynb (#2306 ) There is a typo in the documentation. Fixed it!	2023-04-02 13:48:05 -07:00
Gabriel Altay	8f608f4e75	micro docstring typo fix (#2308 ) graduating from reading the docs to reading the code :)	2023-04-02 13:47:55 -07:00
Frank Liu	134fc87e48	Add Zilliz example (#2288 ) Add Zilliz example	2023-04-02 13:38:20 -07:00