langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-09-27 22:37:46 +00:00


Dmitrii Rashchenko a43df006de Support of openai reasoning summary streaming (#30909)

**langchain_openai: Support of reasoning summary streaming**

**Description:**
The OpenAI API now supports streaming reasoning summaries for reasoning
models (o1, o3, o3-mini, o4-mini). More information:
https://platform.openai.com/docs/guides/reasoning#reasoning-summaries

Streaming is supported only in the Responses API (not the Chat Completions
API), so create the LangChain OpenAI model as follows to enable reasoning
summary streaming:

```
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(
    model="o4-mini",  # o1, o3, and o3-mini also support reasoning summary streaming
    use_responses_api=True,  # reasoning streaming works only with the Responses API
    model_kwargs={
        "reasoning": {
            "effort": "high",  # "low" and "medium" are also supported
            "summary": "auto",  # some models support "concise", some "detailed"; "auto" always works
        }
    },
)
```

Now, if you stream events from llm:

```
async for event in llm.astream_events(prompt, version="v2"):
    print(event)
```

or

```
for chunk in llm.stream(prompt):
    print(chunk)
```

OpenAI API will send you new types of events:
`response.reasoning_summary_text.added`
`response.reasoning_summary_text.delta`
`response.reasoning_summary_text.done`

Because these events are new, they were previously ignored. This change adds
support for them in `_convert_responses_chunk_to_generation_chunk`, so that
reasoning chunks, or the full reasoning summary, are added to the chunk's
`additional_kwargs`.
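
To make the mapping described above concrete, here is a minimal sketch of how a raw streaming event could be turned into `additional_kwargs` entries. It is not the code of `_convert_responses_chunk_to_generation_chunk`: the helper name `reasoning_kwargs_from_event` is hypothetical, events are modelled as plain dicts, and the `delta` and `text` field names are assumptions about the event payloads.

```python
from typing import Any

# Event type names as listed in this description.
# The "delta" and "text" payload field names below are assumptions.
DELTA_EVENT = "response.reasoning_summary_text.delta"
DONE_EVENT = "response.reasoning_summary_text.done"


def reasoning_kwargs_from_event(event: dict[str, Any]) -> dict[str, Any]:
    """Hypothetical helper: map one raw event to additional_kwargs entries."""
    event_type = event.get("type")
    if event_type == DELTA_EVENT:
        # Incremental piece of the reasoning summary.
        return {"reasoning_summary_chunk": event.get("delta", "")}
    if event_type == DONE_EVENT:
        # Complete summary text for one reasoning step.
        return {"reasoning_summary": event.get("text", "")}
    # Any other event contributes nothing reasoning-related.
    return {}
```

Keeping the mapping as a pure function of one event makes it easy to check without calling the API.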

Example of how this reasoning summary may be printed:

```
from langchain_core.messages import AIMessageChunk

# Run inside an async function (or a notebook cell that supports top-level await).
async for event in llm.astream_events(prompt, version="v2"):
    if event["event"] == "on_chat_model_stream":
        chunk: AIMessageChunk = event["data"]["chunk"]
        if "reasoning_summary_chunk" in chunk.additional_kwargs:
            # Incremental piece of the reasoning summary
            print(chunk.additional_kwargs["reasoning_summary_chunk"], end="")
        elif "reasoning_summary" in chunk.additional_kwargs:
            # Complete summary for one reasoning step
            print("\n\nFull reasoning step summary:", chunk.additional_kwargs["reasoning_summary"])
        elif chunk.content and chunk.content[0]["type"] == "text":
            # Regular answer text
            print(chunk.content[0]["text"], end="")
```

or

```
for chunk in llm.stream(prompt):
    if "reasoning_summary_chunk" in chunk.additional_kwargs:
        print(chunk.additional_kwargs["reasoning_summary_chunk"], end="")
    elif "reasoning_summary" in chunk.additional_kwargs:
        print("\n\nFull reasoning step summary:", chunk.additional_kwargs["reasoning_summary"])
    elif chunk.content and chunk.content[0]["type"] == "text":
        print(chunk.content[0]["text"], end="")
```
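
If you want to collect the streamed pieces rather than print them, the same checks can feed two buffers. The sketch below is only an illustration: `FakeChunk` is a hypothetical stand-in for `AIMessageChunk` so the example runs without an API key, and `split_stream` is a hypothetical helper name. It assumes the `reasoning_summary_chunk` key and the list-of-blocks `content` shape used in the examples above.

```python
from dataclasses import dataclass, field
from typing import Any, Iterable


@dataclass
class FakeChunk:
    """Hypothetical stand-in for AIMessageChunk (no API call needed)."""
    content: list = field(default_factory=list)
    additional_kwargs: dict = field(default_factory=dict)


def split_stream(chunks: Iterable[Any]) -> tuple[str, str]:
    """Accumulate (reasoning_summary, answer_text) over a stream of chunks."""
    reasoning_parts: list[str] = []
    answer_parts: list[str] = []
    for chunk in chunks:
        kwargs = chunk.additional_kwargs
        if "reasoning_summary_chunk" in kwargs:
            # Incremental reasoning summary text.
            reasoning_parts.append(kwargs["reasoning_summary_chunk"])
        elif chunk.content and chunk.content[0].get("type") == "text":
            # Regular answer text.
            answer_parts.append(chunk.content[0]["text"])
        # Chunks carrying only the full "reasoning_summary" are skipped,
        # since the deltas already cover the same text.
    return "".join(reasoning_parts), "".join(answer_parts)


demo = [
    FakeChunk(additional_kwargs={"reasoning_summary_chunk": "Compare "}),
    FakeChunk(additional_kwargs={"reasoning_summary_chunk": "options."}),
    FakeChunk(additional_kwargs={"reasoning_summary": "Compare options."}),
    FakeChunk(content=[{"type": "text", "text": "Pick B."}]),
]
reasoning, answer = split_stream(demo)
print(reasoning)
print(answer)
```

Under the same assumptions, passing `llm.stream(prompt)` in place of `demo` should yield the summary and the answer as two separate strings.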

---------

Co-authored-by: Chester Curme <chester.curme@gmail.com>

2025-04-22 14:51:13 +00:00

| Name | Last commit | Date |
| --- | --- | --- |
| .devcontainer | … | |
| .github | infra: add langchain-google-genai to monorepo test deps and update notebook cassettes (#30925) | 2025-04-18 11:16:12 -04:00 |
| cookbook | cookbook: Fix docs typos. (#30763) | 2025-04-10 09:13:24 -04:00 |
| docs | Community: Valyu Integration docs (#30926) | 2025-04-21 17:43:00 -04:00 |
| libs | Support of openai reasoning summary streaming (#30909) | 2025-04-22 14:51:13 +00:00 |
| scripts | … | |
| .gitattributes | … | |
| .gitignore | [performance]: Adding benchmarks for common langchain-core imports (#30747) | 2025-04-09 13:00:15 -04:00 |
| .pre-commit-config.yaml | docs: fix builds (#29890) | 2025-02-19 13:35:59 -05:00 |
| .readthedocs.yaml | docs(readthedocs): streamline config (#30307) | 2025-03-18 11:47:45 -04:00 |
| CITATION.cff | … | |
| LICENSE | … | |
| Makefile | langchain: clean pyproject ruff section (#30070) | 2025-03-09 15:06:02 -04:00 |
| MIGRATE.md | Proofreading and Editing Report for Migration Guide (#28084) | 2024-11-13 11:03:09 -05:00 |
| poetry.toml | … | |
| pyproject.toml | infra: add langchain-google-genai to monorepo test deps and update notebook cassettes (#30925) | 2025-04-18 11:16:12 -04:00 |
| README.md | [performance]: Adding benchmarks for common langchain-core imports (#30747) | 2025-04-09 13:00:15 -04:00 |
| SECURITY.md | docs: single security doc (#28515) | 2024-12-04 18:15:34 +00:00 |
| uv.lock | infra: add langchain-google-genai to monorepo test deps and update notebook cassettes (#30925) | 2025-04-18 11:16:12 -04:00 |
| yarn.lock | box: add langchain box package and DocumentLoader (#25506) | 2024-08-21 02:23:43 +00:00 |

README.md

Note

Looking for the JS/TS library? Check out LangChain.js.

LangChain is a framework for building LLM-powered applications. It helps you chain together interoperable components and third-party integrations to simplify AI application development — all while future-proofing decisions as the underlying technology evolves.

pip install -U langchain

To learn more about LangChain, check out the docs. If you’re looking for more advanced customization or agent orchestration, check out LangGraph, our framework for building controllable agent workflows.

Why use LangChain?

LangChain helps developers build applications powered by LLMs through a standard interface for models, embeddings, vector stores, and more.

Use LangChain for:

- Real-time data augmentation. Easily connect LLMs to diverse data sources and external / internal systems, drawing from LangChain’s vast library of integrations with model providers, tools, vector stores, retrievers, and more.
- Model interoperability. Swap models in and out as your engineering team experiments to find the best choice for your application’s needs. As the industry frontier evolves, adapt quickly: LangChain’s abstractions keep you moving without losing momentum.

LangChain’s ecosystem

While the LangChain framework can be used standalone, it also integrates seamlessly with any LangChain product, giving developers a full suite of tools when building LLM applications.

To improve your LLM application development, pair LangChain with:

- LangSmith: Helpful for agent evals and observability. Debug poor-performing LLM app runs, evaluate agent trajectories, gain visibility in production, and improve performance over time.
- LangGraph: Build agents that can reliably handle complex tasks with LangGraph, our low-level agent orchestration framework. LangGraph offers customizable architecture, long-term memory, and human-in-the-loop workflows, and is trusted in production by companies like LinkedIn, Uber, Klarna, and GitLab.
- LangGraph Platform: Deploy and scale agents effortlessly with a purpose-built deployment platform for long-running, stateful workflows. Discover, reuse, configure, and share agents across teams, and iterate quickly with visual prototyping in LangGraph Studio.

Additional resources

- Tutorials: Simple walkthroughs with guided examples on getting started with LangChain.
- How-to Guides: Quick, actionable code snippets for topics such as tool calling, RAG use cases, and more.
- Conceptual Guides: Explanations of key concepts behind the LangChain framework.
- API Reference: Detailed reference on navigating base packages and integrations for LangChain.

Description

⚡ Building applications with LLMs through composability ⚡

