Some providers include (legacy) function calls in `additional_kwargs` in
addition to tool calls. We currently unpack both function calls and tool
calls if present, but OpenAI raises a 400 error in this case.
This can come up if providers are mixed in a tool-calling loop. Example:
```python
from langchain.chat_models import init_chat_model
from langchain_core.messages import HumanMessage
from langchain_core.tools import tool


@tool
def get_weather(location: str) -> str:
    """Get weather at a location."""
    return "It's sunny."


gemini = init_chat_model("google_genai:gemini-2.0-flash-001").bind_tools([get_weather])
openai = init_chat_model("openai:gpt-4.1-mini").bind_tools([get_weather])

input_message = HumanMessage("What's the weather in Boston?")
tool_call_message = gemini.invoke([input_message])
assert len(tool_call_message.tool_calls) == 1

tool_call = tool_call_message.tool_calls[0]
tool_message = get_weather.invoke(tool_call)

response = openai.invoke(  # currently raises 400 / BadRequestError
    [input_message, tool_call_message, tool_message]
)
```
Here we ignore function calls if tool calls are present.
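A minimal sketch of the guard, illustrative only and not the actual conversion code in `langchain-openai`:
```python
import json

from langchain_core.messages import AIMessage


def convert_ai_message(message: AIMessage) -> dict:
    """Build an OpenAI-format assistant message, dropping the legacy
    function_call when structured tool calls are present."""
    payload: dict = {"role": "assistant", "content": message.content or None}
    if message.tool_calls:
        payload["tool_calls"] = [
            {
                "id": tc["id"],
                "type": "function",
                "function": {"name": tc["name"], "arguments": json.dumps(tc["args"])},
            }
            for tc in message.tool_calls
        ]
    elif "function_call" in message.additional_kwargs:
        # Only fall back to the legacy function_call when no tool calls exist.
        payload["function_call"] = message.additional_kwargs["function_call"]
    return payload
```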
partners: (langchain-openai) total_tokens should not add 'Nonetype' t…
# PR Description
## Description
Fixed an issue in `langchain-openai` where `total_tokens` was
incorrectly adding `None` to an integer, causing a TypeError. The fix
ensures proper type checking before adding token counts.
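A minimal sketch of the kind of guard involved; the names are illustrative, not the actual `langchain-openai` internals:
```python
from typing import Optional


def add_token_count(total_tokens: int, new_tokens: Optional[int]) -> int:
    """Accumulate token counts, treating a missing count as zero."""
    if new_tokens is None:
        return total_tokens
    return total_tokens + new_tokens


assert add_token_count(10, None) == 10  # previously raised TypeError
assert add_token_count(10, 5) == 15
```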
## Issue
Fixes the TypeError traceback shown in the image where `'NoneType'`
cannot be added to an integer.
## Dependencies
None
## Twitter handle
None

Co-authored-by: qiulijie <qiulijie@yuaiweiwu.com>
**What does this PR do?**
This PR replaces deprecated usages of `.dict()` with `.model_dump()` to
ensure compatibility with Pydantic v2 and prepare for v3, addressing the
`PydanticDeprecatedSince20` deprecation warning as required in [Issue
#31103](https://github.com/langchain-ai/langchain/issues/31103).
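For reference, a small standalone example of the replacement (the model here is illustrative, not one from the codebase):
```python
from pydantic import BaseModel


class Message(BaseModel):
    role: str
    content: str


msg = Message(role="user", content="hi")

# Deprecated in Pydantic v2 (emits PydanticDeprecatedSince20):
# data = msg.dict()

# Pydantic v2 replacement used throughout this PR:
data = msg.model_dump()
print(data)  # {'role': 'user', 'content': 'hi'}
```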
**Changes made:**
* Replaced `.dict()` with `.model_dump()` in multiple locations
* Ensured consistency with Pydantic v2 migration guidelines
* Verified compatibility across affected modules
**Notes**
* This is a code maintenance and compatibility update
* Tested locally with Pydantic v2.11
* No functional logic changes; only internal method replacements to
prevent deprecation issues
When aggregating AIMessageChunks in a stream, core prefers the leftmost
non-null ID. This is problematic because:
- Core assigns a default of `f"run-{run_manager.run_id}"` when the ID is null
- The desired meaningful ID might not be available until midway through
the stream, as is the case for the OpenAI Responses API.
For the OpenAI Responses API, we assign message IDs to the top-level
`AIMessage.id`. This works in `.(a)invoke`, but during `.(a)stream` the
IDs get overwritten by the defaults assigned in langchain-core. These
IDs
[must](https://community.openai.com/t/how-to-solve-badrequesterror-400-item-rs-of-type-reasoning-was-provided-without-its-required-following-item-error-in-responses-api/1151686/9)
be available on the AIMessage object to support passing reasoning items
back to the API (e.g., if not using OpenAI's `previous_response_id`
feature). We could add them elsewhere, but seeing as we've already made
the decision to store them in `.id` during `.(a)invoke`, addressing the
issue in core lets us fix the problem with no interface changes.
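A minimal sketch of the revised preference when merging chunk IDs (a simplified helper, not the actual langchain-core implementation):
```python
from typing import Optional


def preferred_id(left: Optional[str], right: Optional[str]) -> Optional[str]:
    """Prefer a provider-assigned ID over the default 'run-...' fallback."""
    candidates = [i for i in (left, right) if i]
    for candidate in candidates:
        if not candidate.startswith("run-"):
            # Meaningful provider ID (e.g., a Responses API message ID).
            return candidate
    return candidates[0] if candidates else None
```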
Follow up to https://github.com/langchain-ai/langsmith-sdk/pull/1696,
I've bumped the `langsmith` version where applicable in `uv.lock`.
There are type-checking problems here because deps have been updated in
`pyproject.toml` but `uv lock` hasn't been run. We should enforce that in
the future, along with the other dependabot todos :).
The `_chunk_size` is not changed by the `self._tokenize` method, so I think
this is duplicate code.
Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>
Following https://github.com/langchain-ai/langchain/pull/30909: need to
retain "empty" reasoning output when streaming, e.g.,
```python
{'id': 'rs_...', 'summary': [], 'type': 'reasoning'}
```
Tested by existing integration tests, which are currently failing.
Chat models currently implement support for:
- images in OpenAI Chat Completions format
- other multimodal types (e.g., PDF and audio) in a cross-provider
[standard
format](https://python.langchain.com/docs/how_to/multimodal_inputs/)
Here we update core to extend support to PDF and audio input in Chat
Completions format. **If an OAI-format PDF or audio content block is
passed into any chat model, it will be transformed to the LangChain
standard format**. We assume that any chat model supporting OAI-format
PDF or audio has implemented support for the standard format.
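For illustration, a message carrying an OpenAI Chat Completions-style PDF block; the model name and file contents are placeholders, and the block shape follows OpenAI's `file` content type:
```python
from langchain.chat_models import init_chat_model

llm = init_chat_model("openai:gpt-4.1-mini")

message = {
    "role": "user",
    "content": [
        {"type": "text", "text": "Summarize this document."},
        {
            "type": "file",
            "file": {
                "filename": "report.pdf",
                "file_data": "data:application/pdf;base64,<base64-encoded PDF>",
            },
        },
    ],
}

# Core converts the OAI-format block to the LangChain standard format before
# it reaches chat models that implement the standard format.
response = llm.invoke([message])
```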
**langchain_openai: Support of reasoning summary streaming**
**Description:**
OpenAI API now supports streaming reasoning summaries for reasoning
models (o1, o3, o3-mini, o4-mini). More info about it:
https://platform.openai.com/docs/guides/reasoning#reasoning-summaries
It is supported only in the Responses API (not the Completions API), so you
need to create the LangChain OpenAI model as follows to support streaming of
reasoning summaries:
```python
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(
    model="o4-mini",  # o1, o3, and o3-mini also support reasoning streaming
    use_responses_api=True,  # reasoning streaming works only with the Responses API, not the Completions API
    model_kwargs={
        "reasoning": {
            "effort": "high",  # "low" and "medium" are also supported
            "summary": "auto",  # some models support "concise", some "detailed"; "auto" always works
        }
    },
)
```
Now, if you stream events from llm:
```python
async for event in llm.astream_events(prompt, version="v2"):
    print(event)
```
or
```python
for chunk in llm.stream(prompt):
    print(chunk)
```
The OpenAI API will send new event types:
- `response.reasoning_summary_text.added`
- `response.reasoning_summary_text.delta`
- `response.reasoning_summary_text.done`

These events are new, so they were previously ignored. I have added support
for them in `_convert_responses_chunk_to_generation_chunk`, so that reasoning
summary chunks and the full reasoning summary are added to the chunk's
`additional_kwargs`.
Example of how this reasoning summary may be printed:
```python
async for event in llm.astream_events(prompt, version="v2"):
    if event["event"] == "on_chat_model_stream":
        chunk: AIMessageChunk = event["data"]["chunk"]
        if "reasoning_summary_chunk" in chunk.additional_kwargs:
            print(chunk.additional_kwargs["reasoning_summary_chunk"], end="")
        elif "reasoning_summary" in chunk.additional_kwargs:
            print("\n\nFull reasoning step summary:", chunk.additional_kwargs["reasoning_summary"])
        elif chunk.content and chunk.content[0]["type"] == "text":
            print(chunk.content[0]["text"], end="")
```
or
```python
for chunk in llm.stream(prompt):
    if "reasoning_summary_chunk" in chunk.additional_kwargs:
        print(chunk.additional_kwargs["reasoning_summary_chunk"], end="")
    elif "reasoning_summary" in chunk.additional_kwargs:
        print("\n\nFull reasoning step summary:", chunk.additional_kwargs["reasoning_summary"])
    elif chunk.content and chunk.content[0]["type"] == "text":
        print(chunk.content[0]["text"], end="")
```
---------
Co-authored-by: Chester Curme <chester.curme@gmail.com>
When calling `embed_documents` and providing a `chunk_size` argument,
that argument is ignored when `OpenAIEmbeddings` is instantiated with
its default configuration (where `check_embedding_ctx_length=True`).
`_get_len_safe_embeddings` specifies a `chunk_size` parameter but it's
not being passed through in `embed_documents`, which is its only caller.
This appears to be an oversight, especially given that the
`_get_len_safe_embeddings` docstring states it should respect "the set
embedding context length and chunk size."
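A simplified sketch of the intended call path (not the actual `OpenAIEmbeddings` source): an explicitly provided `chunk_size` should flow through to `_get_len_safe_embeddings` rather than being dropped.
```python
from typing import Optional


class EmbeddingsSketch:
    """Simplified stand-in for OpenAIEmbeddings to show the parameter flow."""

    def __init__(self, chunk_size: int = 1000) -> None:
        self.chunk_size = chunk_size

    def embed_documents(
        self, texts: list[str], chunk_size: Optional[int] = None
    ) -> list[list[float]]:
        # Pass the caller's chunk_size through instead of ignoring it.
        return self._get_len_safe_embeddings(
            texts, chunk_size=chunk_size or self.chunk_size
        )

    def _get_len_safe_embeddings(
        self, texts: list[str], chunk_size: int
    ) -> list[list[float]]:
        # Batch the embedding requests using the effective chunk size.
        batches = [texts[i : i + chunk_size] for i in range(0, len(texts), chunk_size)]
        return [[0.0] for batch in batches for _ in batch]  # placeholder vectors
```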
Developers typically expect method parameters to take effect (and take
precedence) when explicitly provided, especially when the object is
instantiated with defaults. I was confused as to why my API calls were being
rejected regardless of the chunk size I provided.
This bug also exists in the langchain_community package. I can add that to
this PR if requested; otherwise, I will create a new one once this passes.
Generally, this PR is CI performance focused + aims to clean up some
dependencies at the same time.
1. Unpins upper bounds for `numpy` in all `pyproject.toml` files where
`numpy` is specified
2. Requires `numpy >= 2.1.0` for Python 3.13 and `numpy > 1.26.0` for
Python 3.12, plus a `numpy` min version bump for `chroma`
3. Speeds up CI by minutes - linting on Python 3.13, installing `numpy <
2.1.0` was taking [~3
minutes](https://github.com/langchain-ai/langchain/actions/runs/14316342925/job/40123305868?pr=30713),
now the entire env setup takes a few seconds
4. Deleted the `numpy` test dependency from partner packages where it was not
used, specifically `huggingface`, `voyageai`, `xai`, and `nomic`.
It's a bit unfortunate that `langchain-community` depends on `numpy`; we
might want to try to fix that in the future...
Closes https://github.com/langchain-ai/langchain/issues/26026
Fixes https://github.com/langchain-ai/langchain/issues/30555
Related to https://github.com/langchain-ai/langchain/issues/30344

https://github.com/langchain-ai/langchain/pull/30542 introduced an
erroneous test for token counts for o-series models. tiktoken==0.8 does
not support o-series models in
`tiktoken.encoding_for_model(model_name)`, and this is the version of
tiktoken we had in the lock file. So we would default to `cl100k_base`
for o-series, which is the wrong encoding model. The test asserted counts
against this wrong encoding (so it passed with tiktoken 0.8).
Here we update tiktoken to 0.9 in the lock file, and fix the expected
counts in the test. Verified that we are pulling
[o200k_base](https://github.com/openai/tiktoken/blob/main/tiktoken/model.py#L8),
as expected.
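For reference, a quick way to check which encoding tiktoken resolves for a given model (assuming tiktoken >= 0.9 is installed):
```python
import tiktoken

# With tiktoken >= 0.9 this resolves o-series models directly; with 0.8 the
# lookup fails and callers fall back to cl100k_base, the wrong encoding.
enc = tiktoken.encoding_for_model("o3-mini")
print(enc.name)  # expected: o200k_base
```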
When OpenAI originally released `stream_options` to enable token usage
during streaming, it was not supported in AzureOpenAI. It is now
supported.
Like the [OpenAI
SDK](f66d2e6fdc/src/openai/resources/completions.py (L68)),
ChatOpenAI does not return usage metadata during streaming by default
(enabling it adds an extra chunk to the stream). The OpenAI SDK requires users
to pass `stream_options={"include_usage": True}`. ChatOpenAI implements
a convenience kwarg `stream_usage: Optional[bool]` and an attribute
`stream_usage: bool = False`.
Here we extend this to AzureChatOpenAI by moving the `stream_usage`
attribute and `stream_usage` kwarg (on `_(a)stream`) from ChatOpenAI to
BaseChatOpenAI.
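Hypothetical usage after this change (the deployment name and API version are placeholders, and Azure credentials are assumed to be set via environment variables):
```python
from langchain_openai import AzureChatOpenAI

llm = AzureChatOpenAI(
    azure_deployment="my-gpt-4o-deployment",
    api_version="2024-08-01-preview",
)

# Opt in to usage metadata during streaming; a final extra chunk carries token counts.
for chunk in llm.stream("Hello!", stream_usage=True):
    if chunk.usage_metadata:
        print(chunk.usage_metadata)
```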
---
Additional consideration: we must be sensitive to the number of users
using BaseChatOpenAI to interact with other APIs that do not support the
`stream_options` parameter.
Suppose OpenAI in the future updates the default behavior to stream
token usage. Currently, BaseChatOpenAI only passes `stream_options` if
`stream_usage` is True, so there would be no way to disable this new
default behavior.
To address this, we could update the `stream_usage` attribute to
`Optional[bool] = None`, but this is technically a breaking change (as
currently values of False are not passed to the client). IMO: if / when
this change happens, we could accompany it with this update in a minor
bump.
---
Related previous PRs:
- https://github.com/langchain-ai/langchain/pull/22628
- https://github.com/langchain-ai/langchain/pull/22854
- https://github.com/langchain-ai/langchain/pull/23552
---------
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
**Description:**
This PR fixes a minor typo in the comments within
`libs/partners/openai/langchain_openai/chat_models/base.py`. The word
"ben" has been corrected to "be" for clarity and professionalism.
**Issue:**
N/A
**Dependencies:**
None