langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-16 16:11:02 +00:00

Author	SHA1	Message	Date
Andrew Jaeger	0189c50570	openai[fix]: Correctly set usage metadata for OpenAI Responses API (#31756 )	2025-06-27 15:35:14 +00:00
ccurme	88d5f3edcc	openai[patch]: allow specification of output format for Responses API (#31686 )	2025-06-26 13:41:43 -04:00
ccurme	6409498f6c	openai[patch]: route to Responses API if relevant attributes are set (#31645 ) Following https://github.com/langchain-ai/langchain/pull/30329.	2025-06-17 16:04:38 -04:00
ccurme	c1c3e13a54	openai[patch]: add Responses API attributes to BaseChatOpenAI (#30329 ) `reasoning`, `include`, `store`, `truncation`. Previously these had to be added through `model_kwargs`.	2025-06-17 14:45:50 -04:00
ccurme	b610859633	openai[patch]: support Responses streaming in AzureChatOpenAI (#31641 ) Resolves https://github.com/langchain-ai/langchain/issues/31303, https://github.com/langchain-ai/langchain/issues/31624	2025-06-17 14:41:09 -04:00
ccurme	b9357d456e	openai[patch]: refactor handling of Responses API (#31587 )	2025-06-16 14:01:39 -04:00
ccurme	ece9e31a7a	openai[patch]: VCR some tests (#31524 )	2025-06-06 23:00:57 +00:00
Eugene Yurtsev	17f34baa88	openai[minor]: add image generation to responses api (#31424 ) Does not support partial images during generation at the moment. Before doing that I'd like to figure out how to specify the aggregation logic without requiring changes in core. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-06-02 10:03:54 -04:00
ccurme	c8951ca124	infra: drop azure from streaming benchmarks (#31421 ) Covered by BaseChatOpenAI	2025-05-29 15:06:12 -04:00
ccurme	49eeb0f3c3	standard-tests: add benchmarks (#31302 ) Co-authored-by: Sydney Runkle <sydneymarierunkle@gmail.com>	2025-05-29 15:21:37 +00:00
ccurme	ab8b4003be	openai[patch]: add test case for code interpreter (#31383 )	2025-05-27 19:11:31 +00:00
ccurme	851fd438cf	openai[patch]: relax Azure llm streaming callback test (#31319 ) Effectively reverts https://github.com/langchain-ai/langchain/pull/29302, but check that counts are "less than" instead of equal to an expected count.	2025-05-22 16:14:53 +00:00
ccurme	053a1246da	openai[patch]: support built-in code interpreter and remote MCP tools (#31304 )	2025-05-22 11:47:57 -04:00
ccurme	beacedd6b3	openai[patch]: update tests for strict schemas (#31306 ) Following recent [changes](https://platform.openai.com/docs/changelog).	2025-05-21 22:06:17 +00:00
ccurme	dcb5aba999	openai[patch]: reduce tested constraints on strict schema adherence for Responses API (#31290 ) Scheduled testing started failing today because the Responses API stopped raising `BadRequestError` for a schema that was previously invalid when `strict=True`. Although docs still say that [some type-specific keywords are not yet supported](https://platform.openai.com/docs/guides/structured-outputs#some-type-specific-keywords-are-not-yet-supported) (including `minimum` and `maximum` for numbers), the below appears to run and correctly respect the constraints: ```python import json import openai maximums = list(range(1, 11)) arg_values = [] for maximum in maximums: tool = { "type": "function", "name": "magic_function", "description": "Applies a magic function to an input.", "parameters": { "properties": { "input": {"maximum": maximum, "minimum": 0, "type": "integer"} }, "required": ["input"], "type": "object", "additionalProperties": False }, "strict": True } client = openai.OpenAI() response = client.responses.create( model="gpt-4.1", input=[{"role": "user", "content": "What is the value of magic_function(3)? Use the tool."}], tools=[tool], ) function_call = next(item for item in response.output if item.type == "function_call") args = json.loads(function_call.arguments) arg_values.append(args["input"]) print(maximums) print(arg_values) # [1, 2, 3, 4, 5, 6, 7, 8, 9, 10] # [1, 2, 3, 3, 3, 3, 3, 3, 3, 3] ``` Until yesterday this raised BadRequestError. The same is not true of Chat Completions, which appears to still raise BadRequestError ```python tool = { "type": "function", "function": { "name": "magic_function", "description": "Applies a magic function to an input.", "parameters": { "properties": { "input": {"maximum": 5, "minimum": 0, "type": "integer"} }, "required": ["input"], "type": "object", "additionalProperties": False }, "strict": True } } response = client.chat.completions.create( model="gpt-4.1", messages=[{"role": "user", "content": "What is the value of magic_function(3)? Use the tool."}], tools=[tool], ) response # raises BadRequestError ``` Here we update tests accordingly.	2025-05-20 14:50:31 +00:00
ccurme	26ad239669	core, openai[patch]: prefer provider-assigned IDs when aggregating message chunks (#31080 ) When aggregating AIMessageChunks in a stream, core prefers the leftmost non-null ID. This is problematic because: - Core assigns IDs when they are null to `f"run-{run_manager.run_id}"` - The desired meaningful ID might not be available until midway through the stream, as is the case for the OpenAI Responses API. For the OpenAI Responses API, we assign message IDs to the top-level `AIMessage.id`. This works in `.(a)invoke`, but during `.(a)stream` the IDs get overwritten by the defaults assigned in langchain-core. These IDs [must](https://community.openai.com/t/how-to-solve-badrequesterror-400-item-rs-of-type-reasoning-was-provided-without-its-required-following-item-error-in-responses-api/1151686/9) be available on the AIMessage object to support passing reasoning items back to the API (e.g., if not using OpenAI's `previous_response_id` feature). We could add them elsewhere, but seeing as we've already made the decision to store them in `.id` during `.(a)invoke`, addressing the issue in core lets us fix the problem with no interface changes.	2025-05-02 11:18:18 -04:00
ccurme	c51eadd54f	openai[patch]: propagate service_tier to response metadata (#31089 )	2025-05-01 13:50:48 -04:00
ccurme	629b7a5a43	openai[patch]: add explicit attribute for service tier (#31005 )	2025-04-25 18:38:23 +00:00
ccurme	a7903280dd	openai[patch]: delete redundant tests (#31004 ) These are covered by standard tests.	2025-04-24 17:56:32 +00:00
ccurme	faef3e5d50	core, standard-tests: support PDF and audio input in Chat Completions format (#30979 ) Chat models currently implement support for: - images in OpenAI Chat Completions format - other multimodal types (e.g., PDF and audio) in a cross-provider [standard format](https://python.langchain.com/docs/how_to/multimodal_inputs/) Here we update core to extend support to PDF and audio input in Chat Completions format. If an OAI-format PDF or audio content block is passed into any chat model, it will be transformed to the LangChain standard format. We assume that any chat model supporting OAI-format PDF or audio has implemented support for the standard format.	2025-04-23 18:32:51 +00:00
ccurme	a7c1bccd6a	openai[patch]: remove xfails from image token counting tests (#30963 ) These appear to be passing again.	2025-04-22 15:55:33 +00:00
Dmitrii Rashchenko	a43df006de	Support of openai reasoning summary streaming (#30909 ) langchain_openai: Support of reasoning summary streaming Description: OpenAI API now supports streaming reasoning summaries for reasoning models (o1, o3, o3-mini, o4-mini). More info about it: https://platform.openai.com/docs/guides/reasoning#reasoning-summaries It is supported only in Responses API (not Completion API), so you need to create LangChain Open AI model as follows to support reasoning summaries streaming: ``` llm = ChatOpenAI( model="o4-mini", # also o1, o3, o3-mini support reasoning streaming use_responses_api=True, # reasoning streaming works only with responses api, not completion api model_kwargs={ "reasoning": { "effort": "high", # also "low" and "medium" supported "summary": "auto" # some models support "concise" summary, some "detailed", but auto will always work } } ) ``` Now, if you stream events from llm: ``` async for event in llm.astream_events(prompt, version="v2"): print(event) ``` or ``` for chunk in llm.stream(prompt): print (chunk) ``` OpenAI API will send you new types of events: `response.reasoning_summary_text.added` `response.reasoning_summary_text.delta` `response.reasoning_summary_text.done` These events are new, so they were ignored. So I have added support of these events in function `_convert_responses_chunk_to_generation_chunk`, so reasoning chunks or full reasoning added to the chunk additional_kwargs. Example of how this reasoning summary may be printed: ``` async for event in llm.astream_events(prompt, version="v2"): if event["event"] == "on_chat_model_stream": chunk: AIMessageChunk = event["data"]["chunk"] if "reasoning_summary_chunk" in chunk.additional_kwargs: print(chunk.additional_kwargs["reasoning_summary_chunk"], end="") elif "reasoning_summary" in chunk.additional_kwargs: print("\n\nFull reasoning step summary:", chunk.additional_kwargs["reasoning_summary"]) elif chunk.content and chunk.content[0]["type"] == "text": print(chunk.content[0]["text"], end="") ``` or ``` for chunk in llm.stream(prompt): if "reasoning_summary_chunk" in chunk.additional_kwargs: print(chunk.additional_kwargs["reasoning_summary_chunk"], end="") elif "reasoning_summary" in chunk.additional_kwargs: print("\n\nFull reasoning step summary:", chunk.additional_kwargs["reasoning_summary"]) elif chunk.content and chunk.content[0]["type"] == "text": print(chunk.content[0]["text"], end="") ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-22 14:51:13 +00:00
ccurme	add6a78f98	standard-tests, openai[patch]: add support standard audio inputs (#30904 )	2025-04-17 10:30:57 -04:00
ccurme	86d51f6be6	multiple: permit optional fields on multimodal content blocks (#30887 ) Instead of stuffing provider-specific fields in `metadata`, they can go directly on the content block.	2025-04-17 12:48:46 +00:00
ccurme	9cfe6bcacd	multiple: multi-modal content blocks (#30746 ) Introduces standard content block format for images, audio, and files. ## Examples Image from url: ``` { "type": "image", "source_type": "url", "url": "https://path.to.image.png", } ``` Image, in-line data: ``` { "type": "image", "source_type": "base64", "data": "<base64 string>", "mime_type": "image/png", } ``` PDF, in-line data: ``` { "type": "file", "source_type": "base64", "data": "<base64 string>", "mime_type": "application/pdf", } ``` File from ID: ``` { "type": "file", "source_type": "id", "id": "file-abc123", } ``` Plain-text file: ``` { "type": "file", "source_type": "text", "text": "foo bar", } ```	2025-04-15 09:48:06 -04:00
Sydney Runkle	8c6734325b	partners[lint]: run `pyupgrade` to get code in line with 3.9 standards (#30781 ) Using `pyupgrade` to get all `partners` code up to 3.9 standards (mostly, fixing old `typing` imports).	2025-04-11 07:18:44 -04:00
ccurme	59d508a2ee	openai[patch]: make computer test more reliable (#30672 )	2025-04-04 13:53:59 +00:00
Bagatur	111dd90a46	openai[patch]: support structured output and tools (#30581 ) Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-02 09:14:02 -04:00
ccurme	8119a7bc5c	openai[patch]: support streaming token counts in AzureChatOpenAI (#30494 ) When OpenAI originally released `stream_options` to enable token usage during streaming, it was not supported in AzureOpenAI. It is now supported. Like the [OpenAI SDK](`f66d2e6fdc/src/openai/resources/completions.py (L68)`), ChatOpenAI does not return usage metadata during streaming by default (which adds an extra chunk to the stream). The OpenAI SDK requires users to pass `stream_options={"include_usage": True}`. ChatOpenAI implements a convenience argument `stream_usage: Optional[bool]`, and an attribute `stream_usage: bool = False`. Here we extend this to AzureChatOpenAI by moving the `stream_usage` attribute and `stream_usage` kwarg (on `_(a)stream`) from ChatOpenAI to BaseChatOpenAI. --- Additional consideration: we must be sensitive to the number of users using BaseChatOpenAI to interact with other APIs that do not support the `stream_options` parameter. Suppose OpenAI in the future updates the default behavior to stream token usage. Currently, BaseChatOpenAI only passes `stream_options` if `stream_usage` is True, so there would be no way to disable this new default behavior. To address this, we could update the `stream_usage` attribute to `Optional[bool] = None`, but this is technically a breaking change (as currently values of False are not passed to the client). IMO: if / when this change happens, we could accompany it with this update in a minor bump. --- Related previous PRs: - https://github.com/langchain-ai/langchain/pull/22628 - https://github.com/langchain-ai/langchain/pull/22854 - https://github.com/langchain-ai/langchain/pull/23552 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-03-26 15:16:37 -04:00
ccurme	422ba4cde5	infra: handle flaky tests (#30501 )	2025-03-26 13:28:56 -04:00
ccurme	50ec4a1a4f	openai[patch]: attempt to make test less flaky (#30463 )	2025-03-24 17:36:36 +00:00
ccurme	ed5e589191	openai[patch]: support multi-turn computer use (#30410 ) Here we accept ToolMessages of the form ```python ToolMessage( content=<representation of screenshot> (see below), tool_call_id="abc123", additional_kwargs={"type": "computer_call_output"}, ) ``` and translate them to `computer_call_output` items for the Responses API. We also propagate `reasoning_content` items from AIMessages. ## Example ### Load screenshots ```python import base64 def load_png_as_base64(file_path): with open(file_path, "rb") as image_file: encoded_string = base64.b64encode(image_file.read()) return encoded_string.decode('utf-8') screenshot_1_base64 = load_png_as_base64("/path/to/screenshot/of/application.png") screenshot_2_base64 = load_png_as_base64("/path/to/screenshot/of/desktop.png") ``` ### Initial message and response ```python from langchain_core.messages import HumanMessage, ToolMessage from langchain_openai import ChatOpenAI llm = ChatOpenAI( model="computer-use-preview", model_kwargs={"truncation": "auto"}, ) tool = { "type": "computer_use_preview", "display_width": 1024, "display_height": 768, "environment": "browser" } llm_with_tools = llm.bind_tools([tool]) input_message = HumanMessage( content=[ { "type": "text", "text": ( "Click the red X to close and reveal my Desktop. " "Proceed, no confirmation needed." ) }, { "type": "input_image", "image_url": f"data:image/png;base64,{screenshot_1_base64}", } ] ) response = llm_with_tools.invoke( [input_message], reasoning={ "generate_summary": "concise", }, ) response.additional_kwargs["tool_outputs"] ``` ### Construct ToolMessage ```python tool_call_id = response.additional_kwargs["tool_outputs"][0]["call_id"] tool_message = ToolMessage( content=[ { "type": "input_image", "image_url": f"data:image/png;base64,{screenshot_2_base64}" } ], # content=f"data:image/png;base64,{screenshot_2_base64}", # <-- also acceptable tool_call_id=tool_call_id, additional_kwargs={"type": "computer_call_output"}, ) ``` ### Invoke again ```python messages = [ input_message, response, tool_message, ] response_2 = llm_with_tools.invoke( messages, reasoning={ "generate_summary": "concise", }, ) ```	2025-03-24 15:25:36 +00:00
ccurme	eb9b992aa6	openai[patch]: support additional Responses API features (#30322 ) - Include response headers - Max tokens - Reasoning effort - Fix bug with structured output / strict - Fix bug with simultaneous tool calling + structured output	2025-03-17 12:02:21 -04:00
ccurme	c74e7b997d	openai[patch]: support structured output via Responses API (#30265 ) Also runs all standard tests using Responses API.	2025-03-14 15:14:23 -04:00
ccurme	cd1ea8e94d	openai[patch]: support Responses API (#30231 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2025-03-12 12:25:46 -04:00
ccurme	52b0570bec	core, openai, standard-tests: improve OpenAI compatibility with Anthropic content blocks (#30128 ) - Support thinking blocks in core's `convert_to_openai_messages` (pass through instead of error) - Ignore thinking blocks in ChatOpenAI (instead of error) - Support Anthropic-style image blocks in ChatOpenAI --- Standard integration tests include a `supports_anthropic_inputs` property which is currently enabled only for tests on `ChatAnthropic`. This test enforces compatibility with message histories of the form: ``` - system message - human message - AI message with tool calls specified only through `tool_use` content blocks - human message containing `tool_result` and an additional `text` block ``` It additionally checks support for Anthropic-style image inputs if `supports_image_inputs` is enabled. Here we change this test, such that if you enable `supports_anthropic_inputs`: - You support AI messages with text and `tool_use` content blocks - You support Anthropic-style image inputs (if `supports_image_inputs` is enabled) - You support thinking content blocks. That is, we add a test case for thinking content blocks, but we also remove the requirement of handling tool results within HumanMessages (motivated by existing agent abstractions, which should all return ToolMessage). We move that requirement to a ChatAnthropic-specific test.	2025-03-06 09:53:14 -05:00
ccurme	65a6dce428	openai[patch]: enable streaming for o1 (#29823 ) Verified streaming works for the `o1-2024-12-17` snapshot as well.	2025-02-15 12:42:05 -05:00
Marc Ammann	5690575f13	openai: Removed tool_calls from completion chunk after other chunks have already been sent. (#29649 ) - Description: Before sending a completion chunk at the end of an OpenAI stream, removing the tool_calls as those have already been sent as chunks. - Issue: - - Dependencies: - - Twitter handle: - @ccurme as mentioned in another PR --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-07 10:15:52 -05:00
ccurme	29a0c38cc3	openai[patch]: add test for message.name (#29651 )	2025-02-06 16:49:28 -05:00
Bagatur	317fb86fd9	openai[patch]: fix int test (#29395 )	2025-01-23 21:23:01 +00:00
Bagatur	8d566a8fe7	openai[patch]: detect old models in with_structured_output (#29392 ) Co-authored-by: ccurme <chester.curme@gmail.com>	2025-01-23 20:47:32 +00:00
ccurme	c20f7418c7	openai[patch]: fix Azure LLM test (#29302 ) The tokens I get are: ``` ['', '\n\n', 'The', ' sun', ' was', ' setting', ' over', ' the', ' horizon', ',', ' casting', ''] ``` so possibly an extra empty token is included in the output. lmk @efriis if we should look into this further.	2025-01-19 17:25:42 +00:00
ccurme	c616b445f2	anthropic[patch]: support `parallel_tool_calls` (#29257 ) Need to: - Update docs - Decide if this is an explicit kwarg of bind_tools - Decide if this should be in standard test with flag for supporting	2025-01-17 19:41:41 +00:00
Erick Friis	bbc3e3b2cf	openai: disable streaming for o1 by default (#29147 ) Currently 400s https://community.openai.com/t/streaming-support-for-o1-o1-2024-12-17-resulting-in-400-unsupported-value/1085043 o1-mini and o1-preview stream fine	2025-01-11 02:24:11 +00:00
ccurme	6e63ccba84	openai[minor]: release 0.3 (#29100 ) ## Goal Solve the following problems with `langchain-openai`: - Structured output with `o1` [breaks out of the box](https://langchain.slack.com/archives/C050X0VTN56/p1735232400232099). - `with_structured_output` by default does not use OpenAI’s [structured output feature](https://platform.openai.com/docs/guides/structured-outputs). - We override API defaults for temperature and other parameters. ## Breaking changes: - Default method for structured output is changing to OpenAI’s dedicated [structured output feature](https://platform.openai.com/docs/guides/structured-outputs). For schemas specified via TypedDict or JSON schema, strict schema validation is disabled by default but can be enabled by specifying `strict=True`. - To recover previous default, pass `method="function_calling"` into `with_structured_output`. - Models that don’t support `method="json_schema"` (e.g., `gpt-4` and `gpt-3.5-turbo`, currently the default model for ChatOpenAI) will raise an error unless `method` is explicitly specified. - To recover previous default, pass `method="function_calling"` into `with_structured_output`. - Schemas specified via Pydantic `BaseModel` that have fields with non-null defaults or metadata (like min/max constraints) will raise an error. - To recover previous default, pass `method="function_calling"` into `with_structured_output`. - `strict` now defaults to False for `method="json_schema"` when schemas are specified via TypedDict or JSON schema. - To recover previous behavior, use `with_structured_output(schema, strict=True)` - Schemas specified via Pydantic V1 will raise a warning (and use `method="function_calling"`) unless `method` is explicitly specified. - To remove the warning, pass `method="function_calling"` into `with_structured_output`. - Streaming with default structured output method / Pydantic schema no longer generates intermediate streamed chunks. - To recover previous behavior, pass `method="function_calling"` into `with_structured_output`. - We no longer override default temperature (was 0.7 in LangChain, now will follow OpenAI, currently 1.0). - To recover previous behavior, initialize `ChatOpenAI` or `AzureChatOpenAI` with `temperature=0.7`. - Note: conceptually there is a difference between forcing a tool call and forcing a response format. Tool calls may have more concise arguments vs. generating content adhering to a schema. Prompts may need to be adjusted to recover desired behavior. --------- Co-authored-by: Jacob Lee <jacoblee93@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2025-01-10 10:50:32 -05:00
ccurme	815bfa1913	openai[patch]: support streaming with json_schema response format (#29044 ) - Stream JSON string content. Final chunk includes parsed representation (following OpenAI [docs](https://platform.openai.com/docs/guides/structured-outputs#streaming)). - Mildly (?) breaking change: if you were using streaming with `response_format` before, usage metadata will disappear unless you set `stream_usage=True`. ## Response format Before: ![Screenshot 2025-01-06 at 11 59 01 AM](https://github.com/user-attachments/assets/e54753f7-47d5-421d-b8f3-172f32b3364d) After: ![Screenshot 2025-01-06 at 11 58 13 AM](https://github.com/user-attachments/assets/34882c6c-2284-45b4-92f7-5b5b69896903) ## with_structured_output For pydantic output, behavior of `with_structured_output` is unchanged (except for warning disappearing), because we pluck the parsed representation straight from OpenAI, and OpenAI doesn't return it until the stream is completed. Open to alternatives (e.g., parsing from content or intermediate dict chunks generated by OpenAI). Before: ![Screenshot 2025-01-06 at 12 38 11 PM](https://github.com/user-attachments/assets/913d320d-f49e-4cbb-a800-b394ae817fd1) After: ![Screenshot 2025-01-06 at 12 38 58 PM](https://github.com/user-attachments/assets/f7a45dd6-d886-48a6-8d76-d0e21ca767c6)	2025-01-09 10:32:30 -05:00
Bagatur	c3ccd93c12	patch openai json mode test (#28831 )	2024-12-19 21:43:32 +00:00
Bagatur	ce6748dbfe	xfail openai image token count test (#28828 )	2024-12-19 21:23:30 +00:00
Bagatur	1378ddfa5f	openai[patch]: type reasoning_effort (#28825 )	2024-12-19 19:36:49 +00:00
Bagatur	4a531437bb	core[patch], openai[patch]: Handle OpenAI developer msg (#28794 ) - Convert developer openai messages to SystemMessage - store additional_kwargs={"__openai_role__": "developer"} so that the correct role can be reconstructed if needed - update ChatOpenAI to read in openai_role --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-12-18 21:54:07 +00:00

1 2 3

104 Commits