Compare commits

...

5 Commits

Author SHA1 Message Date
Harrison Chase
23a4b65dc6 fix-typo 2023-07-22 14:49:20 -07:00
Harrison Chase
cbf2fc8af8 prompt ergonomics (#7799) 2023-07-22 14:19:17 -07:00
Samuel Berthe
d81d6e874f doc(sqldatabasechain): use views when jsonb column description is not available (#8133)
I think the PR diff is self explaining ;)

@baskaryan
2023-07-22 11:30:04 -07:00
Harrison Chase
506b21bfc2 Update MIGRATE.md 2023-07-22 09:11:43 -07:00
Harrison Chase
9854d9e5cb cr 2023-07-22 09:07:26 -07:00
8 changed files with 454 additions and 3 deletions

View File

@@ -1,4 +1,4 @@
# Migrating to `langchain._experimental`
# Migrating to `langchain_experimental`
We are moving any experimental components of LangChain, or components with vulnerability issues, into `langchain_experimental`.
This guide covers how to migrate.

View File

@@ -21,6 +21,14 @@ Looking for the JS/TS version? Check out [LangChain.js](https://github.com/hwcha
**Production Support:** As you move your LangChains into production, we'd love to offer more comprehensive support.
Please fill out [this form](https://forms.gle/57d8AmXBYp8PP8tZA) and we'll set up a dedicated support Slack channel.
## 🚨Breaking Changes for select chains (SQLDatabase) on 7/28
In an effort to make `langchain` leaner and safer, we are moving select chains to `langchain_experimental`.
This migration has already started, but we are remaining backwards compatible until 7/28.
On that date, we will remove functionality from `langchain`.
Read more about the motivation and the progress [here](https://github.com/hwchase17/langchain/discussions/8043).
Read how to migrate your code [here](MIGRATE.md).
## Quick Install
`pip install langchain`

View File

@@ -0,0 +1,358 @@
{
"cells": [
{
"cell_type": "markdown",
"id": "4de4e022",
"metadata": {},
"source": [
"# Prompt Pipelining\n",
"\n",
"The idea behind prompt pipelining is to expose a user friendly interface for composing different parts of prompts together. You can do this with either string prompts or chat prompts. Constructing prompts this way allows for easy reuse of components."
]
},
{
"cell_type": "markdown",
"id": "c3190650",
"metadata": {},
"source": [
"## String Prompt Pipelining\n",
"\n",
"When working with string prompts, each template is joined togther. You can work with either prompts directly or strings (the first element in the list needs to be a prompt)."
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "69b17f05",
"metadata": {},
"outputs": [
{
"name": "stderr",
"output_type": "stream",
"text": [
"/Users/harrisonchase/.pyenv/versions/3.9.1/envs/langchain/lib/python3.9/site-packages/deeplake/util/check_latest_version.py:32: UserWarning: A newer version of deeplake (3.6.12) is available. It's recommended that you update to the latest version using `pip install -U deeplake`.\n",
" warnings.warn(\n"
]
}
],
"source": [
"from langchain.prompts import PromptTemplate"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "d6ac7a48",
"metadata": {},
"outputs": [],
"source": [
"prompt = (\n",
" PromptTemplate.from_template(\"Tell me a joke about {topic}\")\n",
" + \", make it funny\"\n",
" + \"\\n\\nand in {language}\"\n",
")"
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "348d7131",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"PromptTemplate(input_variables=['language', 'topic'], output_parser=None, partial_variables={}, template='Tell me a joke about {topic}, make it funny\\n\\nand in {language}', template_format='f-string', validate_template=True)"
]
},
"execution_count": 3,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"prompt"
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "dbba24ba",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"'Tell me a joke about sports, make it funny\\n\\nand in spanish'"
]
},
"execution_count": 4,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"prompt.format(topic=\"sports\", language=\"spanish\")"
]
},
{
"cell_type": "markdown",
"id": "8239bf42",
"metadata": {},
"source": [
"You can also use it in an LLMChain, just like before."
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "bb11649a",
"metadata": {},
"outputs": [],
"source": [
"from langchain.chat_models import ChatOpenAI\n",
"from langchain.chains import LLMChain"
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "2dd36787",
"metadata": {},
"outputs": [],
"source": [
"model = ChatOpenAI()"
]
},
{
"cell_type": "code",
"execution_count": 7,
"id": "2c12ba34",
"metadata": {},
"outputs": [],
"source": [
"chain = LLMChain(llm=model, prompt=prompt)"
]
},
{
"cell_type": "code",
"execution_count": 8,
"id": "a1559246",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"'¿Por qué el futbolista llevaba un paraguas al partido?\\n\\nPorque pronosticaban lluvia de goles.'"
]
},
"execution_count": 8,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"chain.run(topic=\"sports\", language=\"spanish\")"
]
},
{
"cell_type": "markdown",
"id": "4e4f6a8a",
"metadata": {},
"source": [
"## Chat Prompt Pipelining"
]
},
{
"cell_type": "markdown",
"id": "a50ce9b8",
"metadata": {},
"source": [
"A chat prompt is made up a of a list of messages. Purely for developer experience, we've added a convinient way to create these prompts. In this pipeline, each new element is a new message in the final prompt."
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "2a180f75",
"metadata": {},
"outputs": [
{
"name": "stderr",
"output_type": "stream",
"text": [
"/Users/harrisonchase/.pyenv/versions/3.9.1/envs/langchain/lib/python3.9/site-packages/deeplake/util/check_latest_version.py:32: UserWarning: A newer version of deeplake (3.6.10) is available. It's recommended that you update to the latest version using `pip install -U deeplake`.\n",
" warnings.warn(\n"
]
}
],
"source": [
"from langchain.prompts import ChatPromptTemplate, HumanMessagePromptTemplate\n",
"from langchain.schema import HumanMessage, AIMessage, SystemMessage"
]
},
{
"cell_type": "markdown",
"id": "8554bae5",
"metadata": {},
"source": [
"First, let's initialize the base ChatPromptTemplate with a system message. It doesn't have to start with a system, but it's often good practice"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "cab8dd65",
"metadata": {},
"outputs": [],
"source": [
"prompt = SystemMessage(content=\"You are a nice pirate\")"
]
},
{
"cell_type": "markdown",
"id": "30656ef8",
"metadata": {},
"source": [
"You can then easily create a pipeline combining it with other messages OR message templates.\n",
"Use a `Message` when there is no variables to be formatted, use a `MessageTemplate` when there are variables to be formatted. You can also use just a string -> note that this will automatically get inferred as a HumanMessagePromptTemplate."
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "a2ddd0a1",
"metadata": {},
"outputs": [],
"source": [
"new_prompt = (\n",
" prompt\n",
" + HumanMessage(content=\"hi\")\n",
" + AIMessage(content=\"what?\")\n",
" + \"{input}\"\n",
")"
]
},
{
"cell_type": "markdown",
"id": "72294e1b",
"metadata": {},
"source": [
"Under the hood, this creates an instance of the ChatPromptTemplate class, so you can use it just as you did before!"
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "297932de",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[SystemMessage(content='You are a nice pirate', additional_kwargs={}),\n",
" HumanMessage(content='hi', additional_kwargs={}, example=False),\n",
" AIMessage(content='what?', additional_kwargs={}, example=False),\n",
" HumanMessage(content='i said hi', additional_kwargs={}, example=False)]"
]
},
"execution_count": 4,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"new_prompt.format_messages(input=\"i said hi\")"
]
},
{
"cell_type": "markdown",
"id": "850357c0",
"metadata": {},
"source": [
"You can also use it in an LLMChain, just like before"
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "710d6b15",
"metadata": {},
"outputs": [],
"source": [
"from langchain.chat_models import ChatOpenAI\n",
"from langchain.chains import LLMChain"
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "d363c2a4",
"metadata": {},
"outputs": [],
"source": [
"model = ChatOpenAI()"
]
},
{
"cell_type": "code",
"execution_count": 7,
"id": "88393b87",
"metadata": {},
"outputs": [],
"source": [
"chain = LLMChain(llm=model, prompt=new_prompt)"
]
},
{
"cell_type": "code",
"execution_count": 8,
"id": "8492cfa9",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"'Oh, hello! How can I assist you today?'"
]
},
"execution_count": 8,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"chain.run(\"i said hi\")"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "58196f6b",
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.1"
}
},
"nbformat": 4,
"nbformat_minor": 5
}

View File

@@ -447,6 +447,30 @@ db_chain.run("What are some example tracks by Bach?")
</CodeOutputBlock>
### SQL Views
In some case, the table schema can be hidden behind a JSON or JSONB column. Adding row samples into the prompt might help won't always describe the data perfectly.
For this reason, a custom SQL views can help.
```sql
CREATE VIEW accounts_v AS
select id, firstname, lastname, email, created_at, updated_at,
cast(stats->>'total_post' as int) as total_post,
cast(stats->>'total_comments' as int) as total_comments,
cast(stats->>'ltv' as int) as ltv
FROM accounts;
```
Then limit the tables visible from SQLDatabase to the created view.
```python
db = SQLDatabase.from_uri(
"sqlite:///../../../../notebooks/Chinook.db",
include_tables=['accounts_v']) # we include only the view
```
## SQLDatabaseSequentialChain
Chain for querying SQL database that is a sequential chain.

View File

@@ -446,7 +446,7 @@ class ChatOpenAI(BaseChatModel):
**super()._get_invocation_params(stop=stop, **kwargs),
**self._default_params,
"model": self.model_name,
"function": kwargs.get("functions"),
"functions": kwargs.get("functions"),
}
@property

View File

@@ -56,6 +56,10 @@ class BaseMessagePromptTemplate(Serializable, ABC):
List of input variables.
"""
def __add__(self, other: Any) -> ChatPromptTemplate:
prompt = ChatPromptTemplate(messages=[self])
return prompt + other
class MessagesPlaceholder(BaseMessagePromptTemplate):
"""Prompt template that assumes variable is already list of messages."""
@@ -261,6 +265,18 @@ class ChatPromptTemplate(BaseChatPromptTemplate, ABC):
messages: List[Union[BaseMessagePromptTemplate, BaseMessage]]
"""List of messages."""
def __add__(self, other: Any) -> ChatPromptTemplate:
# Allow for easy combining
if isinstance(other, ChatPromptTemplate):
return ChatPromptTemplate(messages=self.messages + other.messages)
elif isinstance(other, (BaseMessagePromptTemplate, BaseMessage)):
return ChatPromptTemplate(messages=self.messages + [other])
elif isinstance(other, str):
prompt = HumanMessagePromptTemplate.from_template(other)
return ChatPromptTemplate(messages=self.messages + [prompt])
else:
raise NotImplementedError(f"Unsupported operand type for +: {type(other)}")
@root_validator(pre=True)
def validate_input_variables(cls, values: dict) -> dict:
"""

View File

@@ -43,6 +43,42 @@ class PromptTemplate(StringPromptTemplate):
validate_template: bool = True
"""Whether or not to try validating the template."""
def __add__(self, other: Any) -> PromptTemplate:
# Allow for easy combining
if isinstance(other, PromptTemplate):
if self.template_format != "f-string":
raise ValueError(
"Adding prompt templates only supported for f-strings."
)
if other.template_format != "f-string":
raise ValueError(
"Adding prompt templates only supported for f-strings."
)
input_variables = list(
set(self.input_variables) | set(other.input_variables)
)
template = self.template + other.template
# If any do not want to validate, then don't
validate_template = self.validate_template and other.validate_template
partial_variables = {k: v for k, v in self.partial_variables.items()}
for k, v in other.partial_variables.items():
if k in partial_variables:
raise ValueError("Cannot have same variable partialed twice.")
else:
partial_variables[k] = v
return PromptTemplate(
template=template,
input_variables=input_variables,
partial_variables=partial_variables,
template_format="f-string",
validate_template=validate_template,
)
elif isinstance(other, str):
prompt = PromptTemplate.from_template(other)
return self + prompt
else:
raise NotImplementedError(f"Unsupported operand type for +: {type(other)}")
@property
def _prompt_type(self) -> str:
"""Return the prompt type key."""

View File

@@ -1,12 +1,15 @@
from __future__ import annotations
from abc import abstractmethod
from typing import List, Sequence
from typing import TYPE_CHECKING, Any, List, Sequence
from pydantic import Field
from langchain.load.serializable import Serializable
if TYPE_CHECKING:
from langchain.prompts.chat import ChatPromptTemplate
def get_buffer_string(
messages: Sequence[BaseMessage], human_prefix: str = "Human", ai_prefix: str = "AI"
@@ -77,6 +80,12 @@ class BaseMessage(Serializable):
"""Whether this class is LangChain serializable."""
return True
def __add__(self, other: Any) -> ChatPromptTemplate:
from langchain.prompts.chat import ChatPromptTemplate
prompt = ChatPromptTemplate(messages=[self])
return prompt + other
class HumanMessage(BaseMessage):
"""A Message from a human."""