# Returning Structured Output

This notebook covers how to have an agent return a structured output.
By default, most agents return a single string.
It is often useful to have an agent return something with more structure.


A good example of this is an agent tasked with doing question-answering over some sources.
Let's say we want the agent to respond not only with the answer, but also a list of the sources used.
We then want our output to roughly follow the schema below:

```python
class Response(BaseModel):
    """Final response to the question being asked"""
    answer: str = Field(description="The final answer to respond to the user")
    sources: List[int] = Field(description="List of page chunks that contain answer to the question. Only include a page chunk if it contains relevant information")
```

In this notebook we will go over an agent that has a retriever tool and responds in the correct format.

## Create the Retriever

In this section we will do some setup work to create our retriever over some mock data containing the "State of the Union" address. Importantly, we will add a "page_chunk" tag to the metadata of each document. This is just some fake data intended to simulate a source field. In practice, this would more likely be the URL or path of a document.

```python
from langchain.embeddings.openai import OpenAIEmbeddings
from langchain.vectorstores import Chroma
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.document_loaders import TextLoader
```

```python
# Load in document to retrieve over
loader = TextLoader('../../state_of_the_union.txt')
documents = loader.load()

# Split document into chunks
text_splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=0)
texts = text_splitter.split_documents(documents)

# Here is where we add in the fake source information
for i, doc in enumerate(texts):
    doc.metadata['page_chunk'] = i

# Create our retriever
embeddings = OpenAIEmbeddings()
vectorstore = Chroma.from_documents(texts, embeddings, collection_name="state-of-union")
retriever = vectorstore.as_retriever()
```

## Create the tools

We will now create the tools we want to give to the agent. In this case there is just one: a tool that wraps our retriever.

```python
from langchain.agents.agent_toolkits.conversational_retrieval.tool import create_retriever_tool

retriever_tool = create_retriever_tool(
    retriever,
    "state-of-union-retriever",
    "Query a retriever to get information about state of the union address"
)
```

## Create response schema

Here is where we define the response schema. In this case, we want the final answer to have two fields: one for the `answer`, and another that is a list of `sources`.

```python
from pydantic import BaseModel, Field
from typing import List
from langchain.utils.openai_functions import convert_pydantic_to_openai_function

class Response(BaseModel):
    """Final response to the question being asked"""
    answer: str = Field(description="The final answer to respond to the user")
    sources: List[int] = Field(description="List of page chunks that contain answer to the question. Only include a page chunk if it contains relevant information")
```
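
For orientation, a function definition passed to OpenAI consists of a name, a description, and a JSON Schema for its parameters. The dictionary below is a hand-written approximation of what the `Response` schema could look like in that form; the exact output of `convert_pydantic_to_openai_function` may differ between versions. `matches_response_schema` is a hypothetical helper, not part of LangChain, that checks an arguments string against the two fields using only the standard library.

```python
import json

# Hand-written approximation of an OpenAI function definition for `Response`.
# Assumption: the real dict comes from convert_pydantic_to_openai_function
# and may contain extra or differently named keys.
response_function = {
    "name": "Response",
    "description": "Final response to the question being asked",
    "parameters": {
        "type": "object",
        "properties": {
            "answer": {
                "type": "string",
                "description": "The final answer to respond to the user",
            },
            "sources": {
                "type": "array",
                "items": {"type": "integer"},
                "description": "List of page chunks that contain answer to the question.",
            },
        },
        "required": ["answer", "sources"],
    },
}


def matches_response_schema(arguments: str) -> bool:
    """Hypothetical helper: check a JSON arguments string against both fields."""
    try:
        data = json.loads(arguments)
    except json.JSONDecodeError:
        return False
    if not isinstance(data, dict):
        return False
    answer_ok = isinstance(data.get("answer"), str)
    sources = data.get("sources")
    # bool is a subclass of int in Python, so exclude it explicitly
    sources_ok = isinstance(sources, list) and all(
        isinstance(s, int) and not isinstance(s, bool) for s in sources
    )
    return answer_ok and sources_ok


print(matches_response_schema('{"answer": "Yes", "sources": [31]}'))    # True
print(matches_response_schema('{"answer": "Yes", "sources": ["31"]}'))  # False
```

In the notebook itself, constructing `Response(**inputs)` would perform this validation through pydantic; the manual check here only keeps the sketch dependency-free.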

## Create the custom parsing logic

We now create some custom parsing logic.
We pass the `Response` schema to the OpenAI LLM via its `functions` parameter, in the same way that we pass the tools the agent can use.

When the `Response` function is called by OpenAI, we want to use that as a signal to return to the user.
When any other function is called by OpenAI, we treat that as a tool invocation.

Therefore, our parsing logic has the following blocks:

- If no function is called, assume that we should use the response to respond to the user, and therefore return `AgentFinish`
- If the `Response` function is called, respond to the user with the inputs to that function (our structured output), and therefore return `AgentFinish`
- If any other function is called, treat that as a tool invocation, and therefore return `AgentActionMessageLog`

Note that we are using `AgentActionMessageLog` rather than `AgentAction` because it lets us attach a log of messages that we can use in the future to pass back into the agent prompt.

```python
from langchain.schema.agent import AgentActionMessageLog, AgentFinish
import json
```

```python
def parse(output):
    # If no function was invoked, return to user
    if "function_call" not in output.additional_kwargs:
        return AgentFinish(return_values={"output": output.content}, log=output.content)

    # Parse out the function call
    function_call = output.additional_kwargs["function_call"]
    name = function_call['name']
    inputs = json.loads(function_call['arguments'])

    # If the Response function was invoked, return to the user with the function inputs
    if name == "Response":
        return AgentFinish(return_values=inputs, log=str(function_call))
    # Otherwise, return an agent action
    else:
        return AgentActionMessageLog(tool=name, tool_input=inputs, log="", message_log=[output])
```
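
To see the three branches in isolation, the sketch below mirrors the control flow of `parse` without calling OpenAI or importing LangChain. `FakeMessage` and `classify` are hypothetical stand-ins: the first imitates the two attributes that `parse` reads from the model output, and the second returns plain tuples where `parse` returns `AgentFinish` or `AgentActionMessageLog`. The `query` argument name in the tool call is also an assumption.

```python
import json
from dataclasses import dataclass, field


@dataclass
class FakeMessage:
    """Hypothetical stand-in for the chat model's output message."""
    content: str = ""
    additional_kwargs: dict = field(default_factory=dict)


def classify(output):
    """Mirror the three branches of `parse`, returning plain tuples."""
    # Branch 1: no function call, so the text itself is the final answer
    if "function_call" not in output.additional_kwargs:
        return ("finish", {"output": output.content})

    function_call = output.additional_kwargs["function_call"]
    name = function_call["name"]
    inputs = json.loads(function_call["arguments"])

    # Branch 2: the Response function carries the structured final answer
    if name == "Response":
        return ("finish", inputs)
    # Branch 3: any other function is a tool invocation
    return ("tool", name, inputs)


plain = FakeMessage(content="Hello")
structured = FakeMessage(additional_kwargs={"function_call": {
    "name": "Response",
    "arguments": '{"answer": "A", "sources": [1]}',
}})
tool_call = FakeMessage(additional_kwargs={"function_call": {
    "name": "state-of-union-retriever",
    "arguments": '{"query": "Supreme Court"}',  # argument name is an assumption
}})

print(classify(plain))       # ('finish', {'output': 'Hello'})
print(classify(structured))  # ('finish', {'answer': 'A', 'sources': [1]})
print(classify(tool_call))   # ('tool', 'state-of-union-retriever', {'query': 'Supreme Court'})
```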

## Create the Agent

We can now put this all together! The components of this agent are:

- prompt: a simple prompt with placeholders for the user's question and then the `agent_scratchpad` (any intermediate steps)
- tools: we can attach the tools and `Response` format to the LLM as functions
- format scratchpad: in order to format the `agent_scratchpad` from intermediate steps, we will use the standard `format_to_openai_functions`. This takes intermediate steps and formats them as AIMessages and FunctionMessages.
- output parser: we will use our custom parser above to parse the response of the LLM
- AgentExecutor: we will use the standard AgentExecutor to run the loop of agent-tool-agent-tool...

```python
from langchain.prompts import ChatPromptTemplate, MessagesPlaceholder
from langchain.chat_models import ChatOpenAI
from langchain.tools.render import format_tool_to_openai_function
from langchain.agents.format_scratchpad import format_to_openai_functions
from langchain.agents import AgentExecutor
```

```python
prompt = ChatPromptTemplate.from_messages([
    ("system", "You are a helpful assistant"),
    ("user", "{input}"),
    MessagesPlaceholder(variable_name="agent_scratchpad"),
])
```

```python
llm = ChatOpenAI(temperature=0)
```

```python
llm_with_tools = llm.bind(
    functions=[
        # The retriever tool
        format_tool_to_openai_function(retriever_tool),
        # Response schema
        convert_pydantic_to_openai_function(Response)
    ]
)
```

```python
agent = {
    "input": lambda x: x["input"],
    # Format agent scratchpad from intermediate steps
    "agent_scratchpad": lambda x: format_to_openai_functions(x['intermediate_steps'])
} | prompt | llm_with_tools | parse
```

```python
agent_executor = AgentExecutor(tools=[retriever_tool], agent=agent, verbose=True)
```
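
The agent-tool-agent-tool loop can be pictured with a much simplified sketch. This is not `AgentExecutor`'s actual implementation: `run_loop`, `scripted_agent`, and `lookup` are hypothetical names, decisions are plain dictionaries rather than `AgentFinish` or `AgentActionMessageLog` objects, and raising an error once the iteration limit is reached is a choice made only for this sketch.

```python
def run_loop(agent_step, tools, user_input, max_iterations=5):
    """Simplified agent-tool loop (illustrative only)."""
    intermediate_steps = []
    for _ in range(max_iterations):
        # Ask the agent what to do, given the question and the steps so far
        decision = agent_step(
            {"input": user_input, "intermediate_steps": intermediate_steps}
        )
        if decision["type"] == "finish":
            return decision["return_values"]
        # Otherwise run the requested tool and record (action, observation)
        observation = tools[decision["tool"]](decision["tool_input"])
        intermediate_steps.append((decision, observation))
    raise RuntimeError("agent did not finish within max_iterations")


def scripted_agent(inputs):
    """Hypothetical agent: call the tool once, then finish with structured output."""
    if not inputs["intermediate_steps"]:
        return {"type": "tool", "tool": "lookup", "tool_input": inputs["input"]}
    _, observation = inputs["intermediate_steps"][-1]
    return {
        "type": "finish",
        "return_values": {
            "answer": observation["text"],
            "sources": [observation["page_chunk"]],
        },
    }


def lookup(query):
    """Hypothetical tool returning a stub passage with a page_chunk id."""
    return {"text": f"stub passage for: {query}", "page_chunk": 7}


result = run_loop(scripted_agent, {"lookup": lookup}, "example question")
print(result)  # {'answer': 'stub passage for: example question', 'sources': [7]}
```

The accumulated `intermediate_steps` list plays the role of the data that `format_to_openai_functions` turns into the `agent_scratchpad` messages on each pass.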

## Run the agent

We can now run the agent! Notice how it responds with a dictionary with two keys: `answer` and `sources`.

```python
agent_executor.invoke({"input": "what did the president say about kentaji brown jackson"}, return_only_outputs=True)
```



```
> Entering new AgentExecutor chain...
[Document(page_content='Tonight. I call on the Senate to: Pass the Freedom to Vote Act. Pass the John Lewis Voting Rights Act. And while you’re at it, pass the Disclose Act so Americans can know who is funding our elections. \n\nTonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. \n\nOne of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. \n\nAnd I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.', metadata={'page_chunk': 31, 'source': '../../state_of_the_union.txt'}), Document(page_content='One was

{'answer': "President mentioned Ketanji Brown Jackson as a nominee for the United States Supreme Court and praised her as one of the nation's top legal minds.",
 'sources': [31]}
```
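
Because `sources` holds the `page_chunk` ids added during setup, the cited chunks can be looked up again after the run. The sketch below uses a hypothetical `resolve_sources` helper and a stub `chunks` list of dictionaries in place of the real `texts` list; the stub texts are placeholders, not content from the address.

```python
def resolve_sources(result, chunks):
    """Return (page_chunk, text) pairs for each cited source, skipping unknown ids."""
    by_id = {chunk["page_chunk"]: chunk["text"] for chunk in chunks}
    return [(i, by_id[i]) for i in result["sources"] if i in by_id]


# Stub data standing in for the split documents (placeholder texts)
chunks = [
    {"page_chunk": 30, "text": "stub text of chunk 30"},
    {"page_chunk": 31, "text": "stub text of chunk 31"},
]
result = {"answer": "stub answer", "sources": [31]}

cited = resolve_sources(result, chunks)
print(cited)  # [(31, 'stub text of chunk 31')]
```

With the real `texts` list from this notebook, the same lookup would read `doc.metadata["page_chunk"]` and `doc.page_content` instead of dictionary keys.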