langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 10:17:00 +00:00

Author	SHA1	Message	Date
Lauren Hirata Singh	aac258eaaa	chore(docs): update comment for chatopenai (#37034) Fixes DOC-526	2026-04-27 11:43:57 -04:00
Mason Daugherty	5a37cd5537	fix(openai): add gpt-5.5 pro to Responses API check (#36994)	2026-04-24 14:58:48 -04:00
Asamu David	4000c22376	feat(openai): prevent silent streaming hangs in `ChatOpenAI` (#36949 ) > [!IMPORTANT] > Behavior change on upgrade — minor bump (`1.1.16` → `1.2.0`). > > Streaming calls now raise `StreamChunkTimeoutError` (a `TimeoutError` subclass — existing `except TimeoutError:` / `except asyncio.TimeoutError:` handlers catch it) after 120s of content silence instead of hanging forever. Opt out with `stream_chunk_timeout=None` or `LANGCHAIN_OPENAI_STREAM_CHUNK_TIMEOUT_S=0`. > > Kernel-level TCP keepalive / `TCP_USER_TIMEOUT` are applied via a custom `httpx` transport. `httpx` disables its env-proxy auto-detection (`HTTP_PROXY` / `HTTPS_PROXY` / `ALL_PROXY` / `NO_PROXY` and macOS/Windows system proxy) whenever a transport is supplied, so to avoid silently breaking enterprise proxy users, `ChatOpenAI` now detects the "proxy-env-shadow" shape at construction and skips the custom transport entirely when all of these hold: > > - `http_socket_options` left at default (`None`) > - No `http_client` or `http_async_client` supplied > - No `openai_proxy` supplied > - A proxy env var / system proxy is visible to httpx > > On that shape the instance falls back to pre-PR behavior and env-proxy auto-detection still applies. A one-time `INFO` records the bypass. > > Users who explicitly set `http_socket_options=[...]` alongside an env proxy still get the shadowed behavior with a one-time `WARNING` log — they opted in. Full opt-outs below. --- Streaming chat completions can hang forever when the underlying TCP connection silently dies mid-stream (idle NAT/LB timeouts, sandboxed runtimes killing long-lived connections, peer gone without a FIN or RST). httpx's read timeout doesn't help here because it's reset by any bytes arriving on the socket, including OpenAI's SSE keepalive comments, so a stream that's quiet on content but still producing keepalives looks alive forever. 
This PR adds two knobs to `ChatOpenAI`, both on by default with opt-outs: - `stream_chunk_timeout` (default 120s): wraps the async streaming iterator in `asyncio.wait_for` per chunk. Measures the gap between parsed SSE chunks, so keepalives don't reset it. Fires on genuine content silence and raises `StreamChunkTimeoutError` — a `TimeoutError` subclass carrying `timeout_s`, `model_name`, and `chunks_received` as structured attributes (mirrored in the WARNING log's `extra=`) for alerting without message-regex. Override with the kwarg or `LANGCHAIN_OPENAI_STREAM_CHUNK_TIMEOUT_S`. - `http_socket_options`: applies `SO_KEEPALIVE` + `TCP_KEEPIDLE` / `TCP_KEEPINTVL` / `TCP_KEEPCNT` + `TCP_USER_TIMEOUT` on Linux (macOS equivalents where available). On platforms missing some options, they're dropped silently and the remaining set still does useful work. Pool limits are set explicitly on the custom transport to mirror the `openai` SDK — without that, passing `transport=` to `httpx.AsyncClient` silently shrinks the connection pool. ## Behavior change The default-shape proxy-env bypass (above) covers the common enterprise case. Beyond that: - Connections that would previously have hung forever will now error out via `StreamChunkTimeoutError`. - Users who explicitly opt into `http_socket_options` while also relying on env proxies will see a one-time `WARNING` and lose env-proxy auto-detection — the custom transport shadows it. This is the original shipped behavior, retained for anyone who wants socket tuning on top of an env-proxied setup. Full opt-outs: - `stream_chunk_timeout=None` or `LANGCHAIN_OPENAI_STREAM_CHUNK_TIMEOUT_S=0` - `http_socket_options=()` or `LANGCHAIN_OPENAI_TCP_KEEPALIVE=0` - Supply your own `http_client` and `http_async_client`. `http_socket_options` is applied per side: passing only one still leaves the other side's default builder getting socket options. Supply both (or combine with `http_socket_options=()`) to take full control. 
Unparseable or negative values for the `LANGCHAIN_OPENAI_*` env vars fall back to the default with a `WARNING` log rather than silently being accepted, so a misconfigured environment still boots but the fallback is discoverable. --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2026-04-22 20:28:43 -04:00
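The per-chunk timeout described in the commit above can be illustrated with a minimal sketch. It is not the `langchain-openai` implementation: `ChunkTimeoutError` and `with_chunk_timeout` are hypothetical names, and the only technique taken from the commit text is wrapping each step of an async iterator in `asyncio.wait_for` so that the timer measures the gap between yielded chunks.

```python
import asyncio
from typing import AsyncIterator, TypeVar

T = TypeVar("T")


class ChunkTimeoutError(TimeoutError):
    """Hypothetical stand-in for the StreamChunkTimeoutError named in the commit."""

    def __init__(self, timeout_s: float, chunks_received: int) -> None:
        super().__init__(
            f"no chunk within {timeout_s}s after {chunks_received} chunk(s)"
        )
        self.timeout_s = timeout_s
        self.chunks_received = chunks_received


async def with_chunk_timeout(
    stream: AsyncIterator[T], timeout_s: float
) -> AsyncIterator[T]:
    """Yield items from `stream`, failing if any single item takes too long."""
    iterator = stream.__aiter__()
    received = 0
    while True:
        try:
            # The timer restarts for every chunk, so it bounds the silence
            # between chunks rather than the total duration of the stream.
            chunk = await asyncio.wait_for(iterator.__anext__(), timeout_s)
        except StopAsyncIteration:
            return
        except asyncio.TimeoutError:
            raise ChunkTimeoutError(timeout_s, received) from None
        received += 1
        yield chunk
```

Because the sketch's error subclasses `TimeoutError`, an existing `except TimeoutError:` handler around the `async for` loop would catch it, which matches the compatibility note in the commit message.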
Mason Daugherty	488c6a73bb	fix(openai): tolerate `prompt_cache_retention` drift in streaming (#36925)	2026-04-21 14:54:32 -04:00
ccurme	19b0805bc1	fix(openai): accommodate dict `response` items in streaming (#36899)	2026-04-20 15:44:01 -04:00
Thomas	8fec4e7cee	fix(openai): infer azure chat profiles from model name (#36858)	2026-04-19 11:06:26 -04:00
ccurme	0516156ef9	fix(openai): use SSRF-safe transport for image token counting (#36819)	2026-04-16 09:52:02 -04:00
William FH	885f2c2c2d	fix(openai): handle content blocks without type key in responses api conversion (#36725)	2026-04-14 15:13:40 -04:00
Mason Daugherty	8c15649127	fix(openai,groq,openrouter): use is-not-None checks in usage metadata token extraction (#36500 ) Python's `or` operator treats `0` as falsy, so `token_usage.get("total_tokens") or fallback` silently replaces a provider-reported `total_tokens=0` with the computed sum of input + output tokens. Providers can legitimately report zero tokens (e.g., cached responses, empty completions). The same pattern exists in the dual-key lookups for `input_tokens`/`output_tokens` in Groq and OpenRouter. While current APIs don't return both key formats simultaneously (making the `or`-chain functionally correct today), the semantics are still wrong; `0` should not fall through to a fallback. ## Changes - Replace `x.get(key) or fallback` with explicit `is not None` checks in `_create_usage_metadata` across `langchain-openai`, `langchain-groq`, and `langchain-openrouter` for `input_tokens`, `output_tokens`, and `total_tokens` - Fix a concrete bug in the `total_tokens` path: a provider-reported `0` was silently replaced by the computed sum - Harden dual-key lookups in Groq and OpenRouter to correctly preserve zero values from the preferred key, should both key formats ever coexist - Update OpenAI's single-key extraction for consistency — the old `or 0` pattern happened to produce correct results (`0 or 0 == 0`) but was semantically wrong	2026-04-03 11:46:36 -04:00
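The zero-handling pitfall described in the commit above can be reproduced in a few lines. This is a sketch of the general pattern only; the function names are hypothetical and are not the ones used in `_create_usage_metadata`.

```python
from typing import Mapping, Optional


def total_with_or(usage: Mapping[str, Optional[int]], fallback: int) -> int:
    # Buggy pattern: a reported 0 is falsy, so the fallback replaces it.
    return usage.get("total_tokens") or fallback


def total_with_none_check(usage: Mapping[str, Optional[int]], fallback: int) -> int:
    # Fixed pattern: only a missing or None value falls through to the fallback.
    value = usage.get("total_tokens")
    return value if value is not None else fallback
```

With `{"total_tokens": 0}` and a fallback of `7`, the first function returns `7` and the second returns `0`; both return the fallback when the key is absent.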
Weiguang Li	feb992abfe	fix(openai): let user-provided User-Agent override the Azure default (#35523)	2026-03-28 21:41:19 -04:00
Mason Daugherty	2f64d80cc6	fix(core,model-profiles): add missing `ModelProfile` fields, warn on schema drift (#36129 ) PR #35788 added 7 new fields to the `langchain-profiles` CLI output (`name`, `status`, `release_date`, `last_updated`, `open_weights`, `attachment`, `temperature`) but didn't update `ModelProfile` in `langchain-core`. Partner packages like `langchain-aws` that set `extra="forbid"` on their Pydantic models hit `extra_forbidden` validation errors when Pydantic encountered undeclared TypedDict keys at construction time. This adds the missing fields, makes `ModelProfile` forward-compatible, provides a base-class hook so partners can stop duplicating model-profile validator boilerplate, migrates all in-repo partners to the new hook, and adds runtime + CI-time warnings for schema drift. ## Changes ### `langchain-core` - Add `__pydantic_config__ = ConfigDict(extra="allow")` to `ModelProfile` so unknown profile keys pass Pydantic validation even on models with `extra="forbid"` — forward-compatibility for when the CLI schema evolves ahead of core - Declare the 7 missing fields on `ModelProfile`: `name`, `status`, `release_date`, `last_updated`, `open_weights` (metadata) and `attachment`, `temperature` (capabilities) - Add `_warn_unknown_profile_keys()` in `model_profile.py` — emits a `UserWarning` when a profile dict contains keys not in `ModelProfile`, suggesting a core upgrade. Wrapped in a bare `except` so introspection failures never crash model construction - Add `BaseChatModel._resolve_model_profile()` hook that returns `None` by default. Partners can override this single method instead of redefining the full `_set_model_profile` validator — the base validator calls it automatically - Add `BaseChatModel._check_profile_keys` as a separate `model_validator` that calls `_warn_unknown_profile_keys`. 
Uses a distinct method name so partner overrides of `_set_model_profile` don't inadvertently suppress the check ### `langchain-profiles` CLI - Add `_warn_undeclared_profile_keys()` to the CLI (`cli.py`), called after merging augmentations in `refresh()` — warns at profile-generation time (not just runtime) when emitted keys aren't declared in `ModelProfile`. Gracefully skips if `langchain-core` isn't installed - Add guard test `test_model_data_to_profile_keys_subset_of_model_profile` in model-profiles — feeds a fully-populated model dict to `_model_data_to_profile()` and asserts every emitted key exists in `ModelProfile.__annotations__`. CI fails before any release if someone adds a CLI field without updating the TypedDict ### Partner packages - Migrate all 10 in-repo partners to the `_resolve_model_profile()` hook, replacing duplicated `@model_validator` / `_set_model_profile` overrides: anthropic, deepseek, fireworks, groq, huggingface, mistralai, openai (base + azure), openrouter, perplexity, xai - Anthropic retains custom logic (context-1m beta → `max_input_tokens` override); all others reduce to a one-liner - Add `pr_lint.yml` scope for the new `model-profiles` package	2026-03-23 00:44:27 -04:00
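The schema-drift warning described in the commit above amounts to comparing a profile dict's keys against the annotations of a `TypedDict`. The sketch below shows that idea in isolation: `ExampleProfile` and `warn_unknown_keys` are hypothetical stand-ins, not the `ModelProfile` type or the `_warn_unknown_profile_keys()` helper from `langchain-core`.

```python
import warnings
from typing import Any, Dict, List, TypedDict


class ExampleProfile(TypedDict, total=False):
    """Hypothetical profile schema with two declared keys."""

    name: str
    max_input_tokens: int


def warn_unknown_keys(profile: Dict[str, Any]) -> List[str]:
    """Warn about keys that the schema does not declare and return them."""
    declared = set(ExampleProfile.__annotations__)
    unknown = sorted(set(profile) - declared)
    if unknown:
        warnings.warn(
            f"Unrecognized profile keys: {unknown}; the schema may be outdated.",
            UserWarning,
            stacklevel=2,
        )
    return unknown
```

A warning rather than an exception keeps construction working when the data source evolves ahead of the schema, which is the forward-compatibility goal the commit describes.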
ccurme	900f8a3513	fix(openai): support phase parameter (#36161)	2026-03-22 14:23:24 -04:00
Jackjin	7d05cfb131	fix(openai): preserve namespace field in streaming function_call chunks (#36108)	2026-03-20 12:51:13 -04:00
Giulio Leone	9e4a6013be	fix(openai): add type: message to Responses API input items (#35693)	2026-03-15 12:43:16 -04:00
Matt Van Horn	9521c679db	fix(openai): close PIL Image handles in token counting to prevent fd leak (#35742)	2026-03-11 23:07:45 -04:00
LincolnBurrows2017	f9dbd22fe1	fix(openai): typo (#35763) Fixed typo in comment: "equivelent" -> "equivalent" in libs/partners/openai/langchain_openai/chat_models/base.py Co-authored-by: AI Assistant <assistant@example.com>	2026-03-11 11:46:06 -04:00
Mohammad Mohtashim	3af0bc0141	fix(openai): update responses API model detection for pro and codex models (#35594)	2026-03-09 09:20:20 -04:00
ccurme	fbfe4b812d	feat(openai): support tool search (#35582)	2026-03-08 08:53:13 -04:00
Jason Meng	f698b43b9a	fix(openai): avoid PydanticSerializationUnexpectedValue for structured output (#35543)	2026-03-04 21:46:46 -05:00
Mason Daugherty	e91da86efe	feat(openrouter): add streaming token usage support (#35559 ) Streaming token usage was silently dropped for `ChatOpenRouter`. Both `_stream` and `_astream` skipped any SSE chunk without a `choices` array — which is exactly the shape OpenRouter uses for the final usage-reporting chunk. This meant `usage_metadata` was never populated on streamed responses, causing downstream consumers (like the Deep Agents CLI) to show "unknown" model with 0 tokens. ## Changes - Add `stream_usage: bool = True` field to `ChatOpenRouter`, which passes `stream_options: {"include_usage": True}` to the OpenRouter API when streaming — matching the pattern already established in `langchain-openai`'s `BaseChatOpenAI` - Handle usage-only chunks (no `choices`, just `usage`) in both `_stream` and `_astream` by emitting a `ChatGenerationChunk` with `usage_metadata` via `_create_usage_metadata`, instead of silently `continue`-ing past them	2026-03-04 15:35:30 -05:00
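The usage-only chunk handling described in the commit above can be sketched with plain dicts. The chunk shapes below are assumptions modeled on the description (a final chunk with `usage` and no `choices`), and `collect_stream` is a hypothetical helper rather than code from `ChatOpenRouter`.

```python
from typing import Any, Dict, Iterable, Optional, Tuple


def collect_stream(
    chunks: Iterable[Dict[str, Any]],
) -> Tuple[str, Optional[Dict[str, Any]]]:
    """Join streamed text and keep usage from chunks that carry no choices."""
    text_parts = []
    usage: Optional[Dict[str, Any]] = None
    for chunk in chunks:
        choices = chunk.get("choices") or []
        if not choices:
            # Skipping this chunk outright would drop the usage report,
            # so record it before moving on.
            if chunk.get("usage") is not None:
                usage = chunk["usage"]
            continue
        delta = choices[0].get("delta") or {}
        text_parts.append(delta.get("content") or "")
    return "".join(text_parts), usage
```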
Mattijs Ugen	5c6f8fe0a6	fix(openai): accept valid responses that are falsy at runtime (#35307)	2026-02-18 21:06:43 -05:00
ccurme	8f1bc0d3ae	feat(openai): support automatic server-side compaction (#35212)	2026-02-17 10:48:52 -05:00
ccurme	32c6ab3033	fix(openai): add `model` property (#35284)	2026-02-17 10:46:49 -05:00
Mason Daugherty	df4a29b5d0	docs(openai): more nits (#35277)	2026-02-16 23:10:31 -05:00
weiguang li	fb0233c9b9	docs(openai): clarify reasoning config for openai-compatible endpoints (#35202)	2026-02-15 22:13:24 -05:00
ccurme	8e35924083	fix(openai): sanitize chat completions text content blocks (#35217)	2026-02-15 15:31:02 -05:00
nightcityblade	ecac3d891c	fix(openai): improve error message for null choices in OpenAI-compatible APIs (#35236)	2026-02-15 10:59:04 -05:00
Mason Daugherty	f9fd7be695	feat(openrouter): add `langchain-openrouter` provider package (#35211 ) Add a first-party `langchain-openrouter` partner package (`ChatOpenRouter`) that wraps the official `openrouter` Python SDK, providing native support for OpenRouter-specific features that `ChatOpenAI` intentionally does not handle. Also adds scope-clarifying docstrings to `ChatOpenAI` / `BaseChatOpenAI` warning users away from using `base_url` overrides with third-party providers. --- Closes #31325 Closes #32967 Closes #32977 Closes #32981 Closes #33643 Closes #33757 Closes #34056 Closes #34797 Closes #34962 Supersedes #33902, #34867 (thank you @elonfeng and @okamototk for your initial work on this!) --- Bugs with upstream sdk: - https://github.com/OpenRouterTeam/python-sdk/issues/38 - https://github.com/OpenRouterTeam/python-sdk/issues/51 - https://github.com/OpenRouterTeam/python-sdk/issues/52	2026-02-15 02:09:13 -05:00
ccurme	2b4b1dc29a	fix(openai): sanitize urls when counting tokens in images (#35143)	2026-02-10 15:25:10 -05:00
ccurme	7c41298355	feat(core): add ContextOverflowError, raise in anthropic and openai (#35099)	2026-02-09 15:15:34 -05:00
Guofang.Tang	06a7d079b0	fix(openai): detect codex models for responses api preference (#35058)	2026-02-08 13:15:48 -05:00
OysterMax	92afcaae60	fix(openai): raise proper exception `OpenAIRefusalError` on structured output refusal (#34619)	2026-01-07 14:34:02 -05:00
Sujal M H	7ad1c19d9c	fix: handle empty assistant content in Responses API (#34272) (#34296)	2026-01-07 14:21:55 -05:00
Sujal M H	4be9407b09	fix(openai): filter function_call blocks in token counting (#34396)	2025-12-19 13:53:44 -05:00
ccurme	e0950f29b7	fix(openai): rely on langchain-core for setting chunk_position (#34404)	2025-12-17 12:44:12 -05:00
tom1299	f167c35243	fix(openai): Correct hyperlinks in documentation of function with_structured_output (#34385) Just a small fix of some broken hyperlinks in the documentation of the function `langchain_openai/chat_models/base.py#with_structured_output` and a rephrase of the reference to supported models. Co-authored-by: Thomas Reuhl <thomas.reuhl@telekom.de>	2025-12-16 10:49:57 -05:00
Towseef Altaf	0e5e33ba03	fix(openai): correct image resize aspect ratio caps (#34192)	2025-12-12 14:34:17 -05:00
Jacob Lee	a528ea1796	feat(openai): Use responses API if model is gpt-5.2-pro (#34306)	2025-12-12 10:11:15 -05:00
j3r0lin	5720dea41b	fix(openai): handle missing 'text' key in responses API content blocks (#34198)	2025-12-12 09:39:12 -05:00
Jacob Lee	badc0cf1b6	fix(openai): Allow temperature when reasoning is set to the string 'none' (#34298) Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-12-11 15:57:04 -05:00
Towseef Altaf	d27fb0c432	feat(langchain,openai): add strict flag to ProviderStrategy structured output (#34149)	2025-12-10 15:35:23 -05:00
Mason Daugherty	ff6e3558d7	docs(fireworks,groq,huggingface,mistralai,ollama,openai): x-ref `convert_to_openai_tool` (#34276)	2025-12-09 19:51:04 -05:00
Mason Daugherty	dff229d018	fix(openai): add missing `tools` param to `ChatOpenAI` `with_structured_output` (#34075)	2025-12-08 15:47:31 -05:00
Marlene	ff3353f02f	fix(openai): Fixing error that comes up using the Responses API with built-in tools and custom tools (#34136)	2025-12-08 09:10:44 -05:00
Mason Daugherty	3ace4e3680	docs(core,groq,openai): nits for ref docs (#34243)	2025-12-07 19:45:38 -05:00
Abhinav	2ba3ce81a6	fix(openai): make GPT-5 temperature validation case-insensitive (#34012 ) Fixed a bug where GPT-5 temperature validation was case-sensitive, causing issues when users specified Azure deployment names or model names in uppercase (e.g., `"GPT-5-2025-01-01"`, `"GPT-5-NANO"`). The validation now correctly handles model names regardless of case. Changes made: - Updated `validate_temperature()` method in `BaseChatOpenAI` to perform case-insensitive model name comparisons - Updated `_get_encoding_model()` method to use case-insensitive checks for tiktoken encoder selection - Added comprehensive unit tests to verify case-insensitive behavior with various case combinations Issue: Fixes #34003 Dependencies: None Test Coverage: - All existing tests pass - New test `test_gpt_5_temperature_case_insensitive` covers uppercase, lowercase, and mixed-case model names - Tests verify both non-chat GPT-5 models (temperature removed) and chat models (temperature preserved) - Lint and format checks pass (`make lint`, `make format`) --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-11-23 20:17:03 -05:00
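The case-insensitive comparison described in the commit above can be shown with a small sketch. `is_gpt5_family` is a hypothetical helper, and the prefix test is an assumption about the shape of the check; the actual logic in `validate_temperature()` and `_get_encoding_model()` may differ.

```python
def is_gpt5_family(model_name: str) -> bool:
    """Return True for names that start with "gpt-5", ignoring case."""
    # Normalizing once before comparing makes "GPT-5-NANO" and "gpt-5-nano"
    # behave the same, which is the behavior the fix describes.
    return model_name.lower().startswith("gpt-5")
```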
Mason Daugherty	cbaea351b2	style(core,langchain-classic,openai): fix griffe warnings (#34074)	2025-11-23 01:06:46 -05:00
Mason Daugherty	47b79c30c0	chore(docs): fix a few refs syntax errors (#34044) missing whitespace for some admonitions	2025-11-22 00:58:21 -05:00
ccurme	33e5d01f7c	feat(model-profiles): distribute data across packages (#34024)	2025-11-21 15:47:05 -05:00
Mason Daugherty	52b1516d44	style(langchain): fix some middleware ref syntax (#33988)	2025-11-16 00:33:17 -05:00


295 Commits