{
"cells": [
{
"cell_type": "raw",
"id": "63ee3f93",
"metadata": {},
"source": [
"---\n",
"sidebar_position: 0\n",
"---"
]
},
{
"cell_type": "markdown",
"id": "9316da0d",
"metadata": {},
"source": [
"# Build a simple LLM application with chat models and prompt templates\n",
"\n",
"In this quickstart we'll show you how to build a simple LLM application with LangChain. This application will translate text from English into another language. It is a relatively simple LLM application: just a single LLM call plus some prompting. Still, it is a great way to get started with LangChain, since a lot of features can be built with just some prompting and an LLM call!\n",
"\n",
"After reading this tutorial, you'll have a high-level overview of:\n",
"\n",
"- Using [language models](/docs/concepts/chat_models)\n",
"\n",
"- Using [prompt templates](/docs/concepts/prompt_templates)\n",
"\n",
"- Debugging and tracing your application using [LangSmith](https://docs.smith.langchain.com/)\n",
"\n",
"Let's dive in!\n",
"\n",
"## Setup\n",
"\n",
"### Jupyter Notebook\n",
"\n",
"This and other tutorials are perhaps most conveniently run in a [Jupyter notebook](https://jupyter.org/). Going through guides in an interactive environment is a great way to better understand them. See [here](https://jupyter.org/install) for installation instructions.\n",
"\n",
"### Installation\n",
"\n",
"To install LangChain, run:\n",
"\n",
"<!-- HIDE_IN_NB\n",
"import Tabs from '@theme/Tabs';\n",
"import TabItem from '@theme/TabItem';\n",
"import CodeBlock from \"@theme/CodeBlock\";\n",
"\n",
"<Tabs>\n",
" <TabItem value=\"pip\" label=\"Pip\" default>\n",
" <CodeBlock language=\"bash\">pip install langchain</CodeBlock>\n",
" </TabItem>\n",
" <TabItem value=\"conda\" label=\"Conda\">\n",
" <CodeBlock language=\"bash\">conda install langchain -c conda-forge</CodeBlock>\n",
" </TabItem>\n",
"</Tabs>\n",
"HIDE_IN_NB -->"
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "86874822",
"metadata": {},
"outputs": [],
"source": [
"# | output: false\n",
"\n",
"# %pip install langchain\n",
"# OR\n",
"# %conda install langchain -c conda-forge"
]
},
{
"cell_type": "markdown",
"id": "a546a5bc",
"metadata": {},
"source": [
"For more details, see our [Installation guide](/docs/how_to/installation).\n",
"\n",
"### LangSmith\n",
"\n",
"Many of the applications you build with LangChain will contain multiple steps with multiple LLM calls.\n",
"As these applications get more and more complex, it becomes crucial to be able to inspect what exactly is going on inside your chain or agent.\n",
"The best way to do this is with [LangSmith](https://smith.langchain.com).\n",
"\n",
"After you sign up at the link above, make sure to set your environment variables to start logging traces:\n",
"\n",
"```shell\n",
"export LANGSMITH_TRACING=\"true\"\n",
"export LANGSMITH_API_KEY=\"...\"\n",
"export LANGSMITH_PROJECT=\"default\" # or any other project name\n",
"```\n",
"\n",
"Or, if in a notebook, you can set them with:\n",
"\n",
"```python\n",
"import getpass\n",
"import os\n",
"\n",
"try:\n",
"    # load environment variables from .env file (requires `python-dotenv`)\n",
"    from dotenv import load_dotenv\n",
"\n",
"    load_dotenv()\n",
"except ImportError:\n",
"    pass\n",
"\n",
"os.environ[\"LANGSMITH_TRACING\"] = \"true\"\n",
"if \"LANGSMITH_API_KEY\" not in os.environ:\n",
"    os.environ[\"LANGSMITH_API_KEY\"] = getpass.getpass(\n",
"        prompt=\"Enter your LangSmith API key (optional): \"\n",
"    )\n",
"if \"LANGSMITH_PROJECT\" not in os.environ:\n",
"    os.environ[\"LANGSMITH_PROJECT\"] = getpass.getpass(\n",
"        prompt='Enter your LangSmith Project Name (default = \"default\"): '\n",
"    )\n",
"    if not os.environ.get(\"LANGSMITH_PROJECT\"):\n",
"        os.environ[\"LANGSMITH_PROJECT\"] = \"default\"\n",
"```"
]
},
{
"cell_type": "markdown",
"id": "e5558ca9",
"metadata": {},
"source": [
"## Using Language Models\n",
"\n",
"First up, let's learn how to use a language model by itself. LangChain supports many different language models that you can use interchangeably. For details on getting started with a specific model, refer to [supported integrations](/docs/integrations/chat/).\n",
"\n",
"<!-- HIDE_IN_NB>\n",
"import ChatModelTabs from \"@theme/ChatModelTabs\";\n",
"\n",
"<ChatModelTabs overrideParams={{openai: {model: \"gpt-4o-mini\"}}} />\n",
"HIDE_IN_NB -->"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "e4b41234",
"metadata": {},
"outputs": [],
"source": [
"# | output: false\n",
"# | echo: false\n",
"\n",
"from langchain_openai import ChatOpenAI\n",
"\n",
"model = ChatOpenAI(model=\"gpt-4o-mini\")"
]
},
{
"cell_type": "markdown",
"id": "ca5642ff",
"metadata": {},
"source": [
"Let's first use the model directly. [ChatModels](/docs/concepts/chat_models) are instances of LangChain [Runnables](/docs/concepts/runnables/), which means they expose a standard interface for interacting with them. To simply call the model, we can pass in a list of [messages](/docs/concepts/messages/) to the `.invoke` method."
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "1b2481f0",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content='Ciao!', additional_kwargs={'refusal': None}, response_metadata={'token_usage': {'completion_tokens': 3, 'prompt_tokens': 20, 'total_tokens': 23, 'completion_tokens_details': {'accepted_prediction_tokens': 0, 'audio_tokens': 0, 'reasoning_tokens': 0, 'rejected_prediction_tokens': 0}, 'prompt_tokens_details': {'audio_tokens': 0, 'cached_tokens': 0}}, 'model_name': 'gpt-4o-mini-2024-07-18', 'system_fingerprint': 'fp_0705bf87c0', 'finish_reason': 'stop', 'logprobs': None}, id='run-32654a56-627c-40e1-a141-ad9350bbfd3e-0', usage_metadata={'input_tokens': 20, 'output_tokens': 3, 'total_tokens': 23, 'input_token_details': {'audio': 0, 'cache_read': 0}, 'output_token_details': {'audio': 0, 'reasoning': 0}})"
]
},
"execution_count": 3,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"from langchain_core.messages import HumanMessage, SystemMessage\n",
"\n",
"messages = [\n",
"    SystemMessage(\"Translate the following from English into Italian\"),\n",
"    HumanMessage(\"hi!\"),\n",
"]\n",
"\n",
"model.invoke(messages)"
]
},
{
"cell_type": "markdown",
"id": "f83373db",
"metadata": {},
"source": [
":::tip\n",
"\n",
"If we've enabled LangSmith, this run is logged there, and we can inspect it in the [LangSmith trace](https://smith.langchain.com/public/88baa0b2-7c1a-4d09-ba30-a47985dde2ea/r). The LangSmith trace reports [token](/docs/concepts/tokens/) usage information, latency, [standard model parameters](/docs/concepts/chat_models/#standard-parameters) (such as temperature), and other information.\n",
"\n",
":::\n",
"\n",
"Note that ChatModels receive [message](/docs/concepts/messages/) objects as input and generate message objects as output. In addition to text content, message objects convey conversational [roles](/docs/concepts/messages/#role) and hold important data, such as [tool calls](/docs/concepts/tool_calling/) and token usage counts.\n",
"\n",
"LangChain also supports chat model inputs via strings or [OpenAI format](/docs/concepts/messages/#openai-format). The following are equivalent:\n",
"\n",
"```python\n",
"model.invoke(\"Hello\")\n",
"\n",
"model.invoke([{\"role\": \"user\", \"content\": \"Hello\"}])\n",
"\n",
"model.invoke([HumanMessage(\"Hello\")])\n",
"```\n",
"\n",
"### Streaming\n",
"\n",
"Because chat models are [Runnables](/docs/concepts/runnables/), they expose a standard interface that includes async and streaming modes of invocation. This allows us to stream individual tokens from a chat model:"
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "0abb0863-bee7-448d-b013-79d8db01e330",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"|C|iao|!||"
]
}
],
"source": [
"for token in model.stream(messages):\n",
"    print(token.content, end=\"|\")"
]
},
{
"cell_type": "markdown",
"id": "a5963141-468c-4570-8f2e-5f7cfb6eb3db",
"metadata": {},
"source": [
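"The async counterparts, `ainvoke` and `astream`, follow the same pattern. The sketch below is a minimal illustration under stated assumptions: it swaps in `FakeListChatModel` from `langchain_core` as an offline stand-in for a real chat model (so no API key is needed), and `main` is a hypothetical wrapper name.\n",
"\n",
"```python\n",
"from langchain_core.language_models.fake_chat_models import FakeListChatModel\n",
"from langchain_core.messages import HumanMessage, SystemMessage\n",
"\n",
"# Assumption: a fake model that replays a canned reply instead of calling an API.\n",
"fake_model = FakeListChatModel(responses=[\"Ciao!\"])\n",
"\n",
"messages = [\n",
"    SystemMessage(\"Translate the following from English into Italian\"),\n",
"    HumanMessage(\"hi!\"),\n",
"]\n",
"\n",
"\n",
"async def main():\n",
"    # Async equivalent of `model.invoke(messages)`.\n",
"    response = await fake_model.ainvoke(messages)\n",
"    # Async equivalent of `model.stream(messages)`: collect each chunk's text.\n",
"    chunks = [chunk.content async for chunk in fake_model.astream(messages)]\n",
"    return response.content, \"\".join(chunks)\n",
"```\n",
"\n",
"In a notebook, run `await main()`; in a script, `import asyncio` and use `asyncio.run(main())`.\n",
"\n",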
"You can find more details on streaming chat model outputs in [this guide](/docs/how_to/chat_streaming/)."
]
},
{
"cell_type": "markdown",
"id": "1ab8da31",
"metadata": {},
"source": [
"## Prompt Templates\n",
"\n",
"Right now we are passing a list of messages directly into the language model. Where does this list of messages come from? Usually, it is constructed from a combination of user input and application logic. This application logic usually takes the raw user input and transforms it into a list of messages ready to pass to the language model. Common transformations include adding a system message or formatting a template with the user input.\n",
"\n",
"[Prompt templates](/docs/concepts/prompt_templates/) are a concept in LangChain designed to assist with this transformation. They take in raw user input and return data (a prompt) that is ready to pass into a language model. \n",
"\n",
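"As a rough mental model, a chat prompt template applies string formatting to each message. The sketch below uses plain Python only; `render_messages` is a hypothetical helper for illustration, not a LangChain API, and it returns `(role, text)` tuples rather than message objects.\n",
"\n",
"```python\n",
"# Hypothetical stand-in for a chat prompt template: (role, template) pairs.\n",
"templates = [\n",
"    (\"system\", \"Translate the following from English into {language}\"),\n",
"    (\"user\", \"{text}\"),\n",
"]\n",
"\n",
"\n",
"def render_messages(templates, values):\n",
"    # Fill every template from the same dict of input values.\n",
"    return [(role, template.format(**values)) for role, template in templates]\n",
"\n",
"\n",
"rendered = render_messages(templates, {\"language\": \"Italian\", \"text\": \"hi!\"})\n",
"print(rendered)\n",
"```\n",
"\n",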
"Let's create a prompt template here. It will take in two user variables:\n",
"\n",
"- `language`: The language to translate text into\n",
"- `text`: The text to translate"
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "3e73cc20",
"metadata": {},
"outputs": [],
"source": [
"from langchain_core.prompts import ChatPromptTemplate\n",
"\n",
"system_template = \"Translate the following from English into {language}\"\n",
"\n",
"prompt_template = ChatPromptTemplate.from_messages(\n",
"    [(\"system\", system_template), (\"user\", \"{text}\")]\n",
")"
]
},
{
"cell_type": "markdown",
"id": "7e876c2a",
"metadata": {},
"source": [
"Note that `ChatPromptTemplate` supports multiple [message roles](/docs/concepts/messages/#role) in a single template. We format the `language` parameter into the system message, and the user `text` into a user message."
]
},
{
"cell_type": "markdown",
"id": "d9711ba6",
"metadata": {},
"source": [
"The input to this prompt template is a dictionary. We can invoke the prompt template on its own to see what it does:"
]
},
{
"cell_type": "code",
"execution_count": 6,
"id": "f781b3cb",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"ChatPromptValue(messages=[SystemMessage(content='Translate the following from English into Italian', additional_kwargs={}, response_metadata={}), HumanMessage(content='hi!', additional_kwargs={}, response_metadata={})])"
]
},
"execution_count": 6,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"prompt = prompt_template.invoke({\"language\": \"Italian\", \"text\": \"hi!\"})\n",
"\n",
"prompt"
]
},
{
"cell_type": "markdown",
"id": "1a49ba9e",
"metadata": {},
"source": [
"We can see that it returns a `ChatPromptValue` that consists of two messages. To access the messages directly, call `to_messages()`:"
]
},
{
"cell_type": "code",
"execution_count": 7,
"id": "2159b619",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[SystemMessage(content='Translate the following from English into Italian', additional_kwargs={}, response_metadata={}),\n",
" HumanMessage(content='hi!', additional_kwargs={}, response_metadata={})]"
]
},
"execution_count": 7,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"prompt.to_messages()"
]
},
{
"cell_type": "markdown",
"id": "47e70ee6-f0e0-4ae0-a290-002799ebf828",
"metadata": {},
"source": [
"Finally, we can invoke the chat model on the formatted prompt:"
]
},
{
"cell_type": "code",
"execution_count": 8,
"id": "3a509d8c-e122-4641-b9ee-91bc23aa155a",
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Ciao!\n"
]
}
],
"source": [
"response = model.invoke(prompt)\n",
"print(response.content)"
]
},
{
"cell_type": "markdown",
"id": "d7f0bf25-6efb-4853-9a8f-242f2855c84a",
"metadata": {},
"source": [
":::tip\n",
"Message `content` can contain both text and [content blocks](/docs/concepts/messages/#aimessage) with additional structure. See [this guide](/docs/how_to/output_parser_string/) for more information.\n",
":::\n",
"\n",
"If we take a look at the [LangSmith trace](https://smith.langchain.com/public/3ccc2d5e-2869-467b-95d6-33a577df99a2/r), we can see exactly what prompt the chat model receives, along with [token](/docs/concepts/tokens/) usage information, latency, [standard model parameters](/docs/concepts/chat_models/#standard-parameters) (such as temperature), and other information."
]
},
{
"cell_type": "markdown",
"id": "befdb168",
"metadata": {},
"source": [
"## Conclusion\n",
"\n",
"That's it! In this tutorial you've learned how to create your first simple LLM application. You've learned how to work with language models, how to create a prompt template, and how to use LangSmith to get observability into the applications you create.\n",
"\n",
"This just scratches the surface of what you will want to learn to become a proficient AI Engineer. Luckily, we've got a lot of other resources!\n",
"\n",
"For further reading on the core concepts of LangChain, we've got detailed [Conceptual Guides](/docs/concepts).\n",
"\n",
"If you have more specific questions on these concepts, check out the following sections of the how-to guides:\n",
"\n",
"- [Chat models](/docs/how_to/#chat-models)\n",
"- [Prompt templates](/docs/how_to/#prompt-templates)\n",
"\n",
"And the LangSmith docs:\n",
"\n",
"- [LangSmith](https://docs.smith.langchain.com)"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.4"
}
},
"nbformat": 4,
"nbformat_minor": 5
}