langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 10:17:00 +00:00

Author	SHA1	Message	Date
Mason Daugherty	6c091564ac	chore(core,langchain,openai): refresh stale OpenAI model references (#37487 )	2026-05-18 01:06:42 -05:00
ccurme	2259d29231	fix(openai): broaden condition for ContextOverflowError to accommodate other providers (#37457 )	2026-05-15 22:03:28 -04:00
open-swe[bot]	0831e445cf	docs(openai): document `base_url` env var fallback chain (#37436 ) Documents the env vars that influence `base_url` resolution on `ChatOpenAI`, `OpenAIEmbeddings`, and `BaseOpenAI`. The previous docstrings only said "leave blank if not using a proxy or service emulator" and did not explain that two different env vars are consulted by two different layers. Concretely: - `OPENAI_API_BASE` is read explicitly by LangChain at init and passed as `base_url` to the underlying client. - `OPENAI_BASE_URL` is read by the underlying `openai` SDK client itself. LangChain only inspects its presence to decide whether to default-enable `stream_usage` (left off when set, because many non-OpenAI endpoints do not support streaming token usage). Precedence: explicit `base_url=` kwarg → `OPENAI_API_BASE` → `OPENAI_BASE_URL` (via SDK fallback). Docs-only change — no behavior change. > AI-agent involvement: drafted by an AI agent and reviewed before submission. _Opened collaboratively by Mason Daugherty and open-swe._ Co-authored-by: open-swe[bot] <open-swe@users.noreply.github.com> Co-authored-by: Mason Daugherty <61371264+mdrxy@users.noreply.github.com>	2026-05-14 15:35:30 -07:00
langchain-model-profile-bot[bot]	6b4bea7d5d	chore(model-profiles): refresh model profile data (#37074 ) Automated refresh of model profile data for all in-monorepo partner integrations via `langchain-profiles refresh`. 🤖 Generated by the `refresh_model_profiles` workflow. Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>	2026-04-29 10:43:12 -04:00
Lauren Hirata Singh	aac258eaaa	chore(docs): update comment for chatopenai (#37034 ) Fixes DOC-526	2026-04-27 11:43:57 -04:00
langchain-model-profile-bot[bot]	83718b1129	chore(model-profiles): refresh model profile data (#37015 ) Automated refresh of model profile data for all in-monorepo partner integrations via `langchain-profiles refresh`. 🤖 Generated by the `refresh_model_profiles` workflow. Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>	2026-04-27 09:48:09 -04:00
Mason Daugherty	5a37cd5537	fix(openai): add gpt-5.5 pro to Responses API check (#36994 )	2026-04-24 14:58:48 -04:00
langchain-model-profile-bot[bot]	cc2feb1aea	chore(model-profiles): refresh model profile data (#36982 ) Automated refresh of model profile data for all in-monorepo partner integrations via `langchain-profiles refresh`. 🤖 Generated by the `refresh_model_profiles` workflow. Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>	2026-04-24 09:20:07 -04:00
Asamu David	4000c22376	feat(openai): prevent silent streaming hangs in `ChatOpenAI` (#36949 ) > [!IMPORTANT] > Behavior change on upgrade — minor bump (`1.1.16` → `1.2.0`). > > Streaming calls now raise `StreamChunkTimeoutError` (a `TimeoutError` subclass — existing `except TimeoutError:` / `except asyncio.TimeoutError:` handlers catch it) after 120s of content silence instead of hanging forever. Opt out with `stream_chunk_timeout=None` or `LANGCHAIN_OPENAI_STREAM_CHUNK_TIMEOUT_S=0`. > > Kernel-level TCP keepalive / `TCP_USER_TIMEOUT` are applied via a custom `httpx` transport. `httpx` disables its env-proxy auto-detection (`HTTP_PROXY` / `HTTPS_PROXY` / `ALL_PROXY` / `NO_PROXY` and macOS/Windows system proxy) whenever a transport is supplied, so to avoid silently breaking enterprise proxy users, `ChatOpenAI` now detects the "proxy-env-shadow" shape at construction and skips the custom transport entirely when all of these hold: > > - `http_socket_options` left at default (`None`) > - No `http_client` or `http_async_client` supplied > - No `openai_proxy` supplied > - A proxy env var / system proxy is visible to httpx > > On that shape the instance falls back to pre-PR behavior and env-proxy auto-detection still applies. A one-time `INFO` records the bypass. > > Users who explicitly set `http_socket_options=[...]` alongside an env proxy still get the shadowed behavior with a one-time `WARNING` log — they opted in. Full opt-outs below. --- Streaming chat completions can hang forever when the underlying TCP connection silently dies mid-stream (idle NAT/LB timeouts, sandboxed runtimes killing long-lived connections, peer gone without a FIN or RST). httpx's read timeout doesn't help here because it's reset by any bytes arriving on the socket, including OpenAI's SSE keepalive comments, so a stream that's quiet on content but still producing keepalives looks alive forever. This PR adds two knobs to `ChatOpenAI`, both on by default with opt-outs: - `stream_chunk_timeout` (default 120s): wraps the async streaming iterator in `asyncio.wait_for` per chunk. Measures the gap between parsed SSE chunks, so keepalives don't reset it. Fires on genuine content silence and raises `StreamChunkTimeoutError` — a `TimeoutError` subclass carrying `timeout_s`, `model_name`, and `chunks_received` as structured attributes (mirrored in the WARNING log's `extra=`) for alerting without message-regex. Override with the kwarg or `LANGCHAIN_OPENAI_STREAM_CHUNK_TIMEOUT_S`. - `http_socket_options`: applies `SO_KEEPALIVE` + `TCP_KEEPIDLE` / `TCP_KEEPINTVL` / `TCP_KEEPCNT` + `TCP_USER_TIMEOUT` on Linux (macOS equivalents where available). On platforms missing some options, they're dropped silently and the remaining set still does useful work. Pool limits are set explicitly on the custom transport to mirror the `openai` SDK — without that, passing `transport=` to `httpx.AsyncClient` silently shrinks the connection pool. ## Behavior change The default-shape proxy-env bypass (above) covers the common enterprise case. Beyond that: - Connections that would previously have hung forever will now error out via `StreamChunkTimeoutError`. - Users who explicitly opt into `http_socket_options` while also relying on env proxies will see a one-time `WARNING` and lose env-proxy auto-detection — the custom transport shadows it. This is the original shipped behavior, retained for anyone who wants socket tuning on top of an env-proxied setup. Full opt-outs: - `stream_chunk_timeout=None` or `LANGCHAIN_OPENAI_STREAM_CHUNK_TIMEOUT_S=0` - `http_socket_options=()` or `LANGCHAIN_OPENAI_TCP_KEEPALIVE=0` - Supply your own `http_client` and `http_async_client`. `http_socket_options` is applied per side: passing only one still leaves the other side's default builder getting socket options. Supply both (or combine with `http_socket_options=()`) to take full control. Unparseable or negative values for the `LANGCHAIN_OPENAI_*` env vars fall back to the default with a `WARNING` log rather than silently being accepted, so a misconfigured environment still boots but the fallback is discoverable. --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2026-04-22 20:28:43 -04:00
Mason Daugherty	488c6a73bb	fix(openai): tolerate `prompt_cache_retention` drift in streaming (#36925 )	2026-04-21 14:54:32 -04:00
ccurme	19b0805bc1	fix(openai): accommodate dict `response` items in streaming (#36899 )	2026-04-20 15:44:01 -04:00
Thomas	8fec4e7cee	fix(openai): infer azure chat profiles from model name (#36858 )	2026-04-19 11:06:26 -04:00
langchain-model-profile-bot[bot]	02991cb4cf	chore(model-profiles): refresh model profile data (#36864 ) Automated refresh of model profile data for all in-monorepo partner integrations via `langchain-profiles refresh`. 🤖 Generated by the `refresh_model_profiles` workflow. Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>	2026-04-18 15:32:37 -05:00
ccurme	0516156ef9	fix(openai): use SSRF-safe transport for image token counting (#36819 )	2026-04-16 09:52:02 -04:00
William FH	885f2c2c2d	fix(openai): handle content blocks without type key in responses api conversion (#36725 )	2026-04-14 15:13:40 -04:00
langchain-model-profile-bot[bot]	ff35602e68	chore(model-profiles): refresh model profile data (#36539 ) Automated refresh of model profile data for all in-monorepo partner integrations via `langchain-profiles refresh`. 🤖 Generated by the `refresh_model_profiles` workflow. Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>	2026-04-05 12:01:45 -04:00
Mason Daugherty	8c15649127	fix(openai,groq,openrouter): use is-not-None checks in usage metadata token extraction (#36500 ) Python's `or` operator treats `0` as falsy, so `token_usage.get("total_tokens") or fallback` silently replaces a provider-reported `total_tokens=0` with the computed sum of input + output tokens. Providers can legitimately report zero tokens (e.g., cached responses, empty completions). The same pattern exists in the dual-key lookups for `input_tokens`/`output_tokens` in Groq and OpenRouter. While current APIs don't return both key formats simultaneously (making the `or`-chain functionally correct today), the semantics are still wrong; `0` should not fall through to a fallback. ## Changes - Replace `x.get(key) or fallback` with explicit `is not None` checks in `_create_usage_metadata` across `langchain-openai`, `langchain-groq`, and `langchain-openrouter` for `input_tokens`, `output_tokens`, and `total_tokens` - Fix a concrete bug in the `total_tokens` path: a provider-reported `0` was silently replaced by the computed sum - Harden dual-key lookups in Groq and OpenRouter to correctly preserve zero values from the preferred key, should both key formats ever coexist - Update OpenAI's single-key extraction for consistency — the old `or 0` pattern happened to produce correct results (`0 or 0 == 0`) but was semantically wrong	2026-04-03 11:46:36 -04:00
langchain-model-profile-bot[bot]	cd394b70c1	chore(model-profiles): refresh model profile data (#36455 ) Automated refresh of model profile data for all in-monorepo partner integrations via `langchain-profiles refresh`. 🤖 Generated by the `refresh_model_profiles` workflow. Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>	2026-04-02 11:29:00 -04:00
langchain-model-profile-bot[bot]	90d1365bf4	chore(model-profiles): refresh model profile data (#36368 ) Automated refresh of model profile data for all in-monorepo partner integrations via `langchain-profiles refresh`. 🤖 Generated by the `refresh_model_profiles` workflow. Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>	2026-03-30 10:00:17 -04:00
Weiguang Li	feb992abfe	fix(openai): let user-provided User-Agent override the Azure default (#35523 )	2026-03-28 21:41:19 -04:00
Mason Daugherty	2f64d80cc6	fix(core,model-profiles): add missing `ModelProfile` fields, warn on schema drift (#36129 ) PR #35788 added 7 new fields to the `langchain-profiles` CLI output (`name`, `status`, `release_date`, `last_updated`, `open_weights`, `attachment`, `temperature`) but didn't update `ModelProfile` in `langchain-core`. Partner packages like `langchain-aws` that set `extra="forbid"` on their Pydantic models hit `extra_forbidden` validation errors when Pydantic encountered undeclared TypedDict keys at construction time. This adds the missing fields, makes `ModelProfile` forward-compatible, provides a base-class hook so partners can stop duplicating model-profile validator boilerplate, migrates all in-repo partners to the new hook, and adds runtime + CI-time warnings for schema drift. ## Changes ### `langchain-core` - Add `__pydantic_config__ = ConfigDict(extra="allow")` to `ModelProfile` so unknown profile keys pass Pydantic validation even on models with `extra="forbid"` — forward-compatibility for when the CLI schema evolves ahead of core - Declare the 7 missing fields on `ModelProfile`: `name`, `status`, `release_date`, `last_updated`, `open_weights` (metadata) and `attachment`, `temperature` (capabilities) - Add `_warn_unknown_profile_keys()` in `model_profile.py` — emits a `UserWarning` when a profile dict contains keys not in `ModelProfile`, suggesting a core upgrade. Wrapped in a bare `except` so introspection failures never crash model construction - Add `BaseChatModel._resolve_model_profile()` hook that returns `None` by default. Partners can override this single method instead of redefining the full `_set_model_profile` validator — the base validator calls it automatically - Add `BaseChatModel._check_profile_keys` as a separate `model_validator` that calls `_warn_unknown_profile_keys`. Uses a distinct method name so partner overrides of `_set_model_profile` don't inadvertently suppress the check ### `langchain-profiles` CLI - Add `_warn_undeclared_profile_keys()` to the CLI (`cli.py`), called after merging augmentations in `refresh()` — warns at profile-generation time (not just runtime) when emitted keys aren't declared in `ModelProfile`. Gracefully skips if `langchain-core` isn't installed - Add guard test `test_model_data_to_profile_keys_subset_of_model_profile` in model-profiles — feeds a fully-populated model dict to `_model_data_to_profile()` and asserts every emitted key exists in `ModelProfile.__annotations__`. CI fails before any release if someone adds a CLI field without updating the TypedDict ### Partner packages - Migrate all 10 in-repo partners to the `_resolve_model_profile()` hook, replacing duplicated `@model_validator` / `_set_model_profile` overrides: anthropic, deepseek, fireworks, groq, huggingface, mistralai, openai (base + azure), openrouter, perplexity, xai - Anthropic retains custom logic (context-1m beta → `max_input_tokens` override); all others reduce to a one-liner - Add `pr_lint.yml` scope for the new `model-profiles` package	2026-03-23 00:44:27 -04:00
ccurme	900f8a3513	fix(openai): support phase parameter (#36161 )	2026-03-22 14:23:24 -04:00
Jackjin	7d05cfb131	fix(openai): preserve namespace field in streaming function_call chunks (#36108 )	2026-03-20 12:51:13 -04:00
langchain-model-profile-bot[bot]	9a17602633	chore(model-profiles): refresh model profile data (#36039 ) Automated refresh of model profile data for all in-monorepo partner integrations via `langchain-profiles refresh`. 🤖 Generated by the `refresh_model_profiles` workflow. Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>	2026-03-17 17:48:01 -04:00
Giulio Leone	9e4a6013be	fix(openai): add type: message to Responses API input items (#35693 )	2026-03-15 12:43:16 -04:00
Mason Daugherty	5d9568b5f5	feat(model-profiles): new fields + `Makefile` target (#35788 ) Extract additional fields from models.dev into `_model_data_to_profile`: `name`, `status`, `release_date`, `last_updated`, `open_weights`, `attachment`, `temperature` Move the model profile refresh logic from an inline bash script in the GitHub Actions workflow into a `make refresh-profiles` target in `libs/model-profiles/Makefile`. This makes it runnable locally with a single command and keeps the provider map in one place instead of duplicated between CI and developer docs.	2026-03-12 13:56:25 +00:00
Matt Van Horn	9521c679db	fix(openai): close PIL Image handles in token counting to prevent fd leak (#35742 )	2026-03-11 23:07:45 -04:00
LincolnBurrows2017	f9dbd22fe1	fix(openai): typo (#35763 ) Fixed typo in comment: "equivelent" -> "equivalent" in libs/partners/openai/langchain_openai/chat_models/base.py Co-authored-by: AI Assistant <assistant@example.com>	2026-03-11 11:46:06 -04:00
langchain-model-profile-bot[bot]	3e4c0d5949	chore(model-profiles): refresh model profile data (#35754 ) Automated refresh of model profile data for all in-monorepo partner integrations via `langchain-profiles refresh`. 🤖 Generated by the `refresh_model_profiles` workflow. Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>	2026-03-11 09:47:53 -04:00
Mohammad Mohtashim	3af0bc0141	fix(openai): update responses API model detection for pro and codex models (#35594 )	2026-03-09 09:20:20 -04:00
ccurme	fbfe4b812d	feat(openai): support tool search (#35582 )	2026-03-08 08:53:13 -04:00
langchain-model-profile-bot[bot]	3241d6429f	chore(model-profiles): refresh model profile data (#35593 ) Automated refresh of model profile data for all in-monorepo partner integrations via `langchain-profiles refresh`. 🤖 Generated by the `refresh_model_profiles` workflow. Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>	2026-03-06 10:14:14 -05:00
Jason Meng	f698b43b9a	fix(openai): avoid PydanticSerializationUnexpectedValue for structured output (#35543 )	2026-03-04 21:46:46 -05:00
Mason Daugherty	e91da86efe	feat(openrouter): add streaming token usage support (#35559 ) Streaming token usage was silently dropped for `ChatOpenRouter`. Both `_stream` and `_astream` skipped any SSE chunk without a `choices` array — which is exactly the shape OpenRouter uses for the final usage-reporting chunk. This meant `usage_metadata` was never populated on streamed responses, causing downstream consumers (like the Deep Agents CLI) to show "unknown" model with 0 tokens. ## Changes - Add `stream_usage: bool = True` field to `ChatOpenRouter`, which passes `stream_options: {"include_usage": True}` to the OpenRouter API when streaming — matching the pattern already established in `langchain-openai`'s `BaseChatOpenAI` - Handle usage-only chunks (no `choices`, just `usage`) in both `_stream` and `_astream` by emitting a `ChatGenerationChunk` with `usage_metadata` via `_create_usage_metadata`, instead of silently `continue`-ing past them	2026-03-04 15:35:30 -05:00
Mason Daugherty	70192690b1	fix(model-profiles): sort generated profiles by model ID for stable diffs (#35344 ) - Sort model profiles alphabetically by model ID (the top-level `_PROFILES` dictionary keys, e.g. `claude-3-5-haiku-20241022`, `gpt-4o-mini`) before writing `_profiles.py`, so that regenerating profiles only shows actual data changes in diffs — not random reordering from the models.dev API response order - Regenerate all 10 partner profile files with the new sorted ordering	2026-02-19 23:11:22 -05:00
Mattijs Ugen	5c6f8fe0a6	fix(openai): accept valid responses that are falsy at runtime (#35307 )	2026-02-18 21:06:43 -05:00
ccurme	8f1bc0d3ae	feat(openai): support automatic server-side compaction (#35212 )	2026-02-17 10:48:52 -05:00
ccurme	32c6ab3033	fix(openai): add `model` property (#35284 )	2026-02-17 10:46:49 -05:00
Mason Daugherty	df4a29b5d0	docs(openai): more nits (#35277 )	2026-02-16 23:10:31 -05:00
weiguang li	fb0233c9b9	docs(openai): clarify reasoning config for openai-compatible endpoints (#35202 )	2026-02-15 22:13:24 -05:00
Mohammad Mohtashim	0f5a314275	fix(openai): gpt-5.2-pro Model Profile `structured_output` key fixed (#35216 )	2026-02-15 22:00:00 -05:00
Mohammad Mohtashim	99192b01da	chore(openai): extend `model_token_mapping` till `gpt-5.2` for modelname_to_contextsize (#35214 )	2026-02-15 21:55:58 -05:00
yaowubarbara	c1e7cf69fb	fix(openai): enhance error message for non-OpenAI embedding providers (#35252 )	2026-02-15 21:16:45 -05:00
ccurme	8e35924083	fix(openai): sanitize chat completions text content blocks (#35217 )	2026-02-15 15:31:02 -05:00
nightcityblade	ecac3d891c	fix(openai): improve error message for null choices in OpenAI-compatible APIs (#35236 )	2026-02-15 10:59:04 -05:00
Mason Daugherty	f9fd7be695	feat(openrouter): add `langchain-openrouter` provider package (#35211 ) Add a first-party `langchain-openrouter` partner package (`ChatOpenRouter`) that wraps the official `openrouter` Python SDK, providing native support for OpenRouter-specific features that `ChatOpenAI` intentionally does not handle. Also adds scope-clarifying docstrings to `ChatOpenAI` / `BaseChatOpenAI` warning users away from using `base_url` overrides with third-party providers. --- Closes #31325 Closes #32967 Closes #32977 Closes #32981 Closes #33643 Closes #33757 Closes #34056 Closes #34797 Closes #34962 Supersedes #33902, #34867 (thank you @elonfeng and @okamototk for your initial work on this!) --- Bugs with upstream sdk: - https://github.com/OpenRouterTeam/python-sdk/issues/38 - https://github.com/OpenRouterTeam/python-sdk/issues/51 - https://github.com/OpenRouterTeam/python-sdk/issues/52	2026-02-15 02:09:13 -05:00
ccurme	2b4b1dc29a	fix(openai): sanitize urls when counting tokens in images (#35143 )	2026-02-10 15:25:10 -05:00
ccurme	7c41298355	feat(core): add ContextOverflowError, raise in anthropic and openai (#35099 )	2026-02-09 15:15:34 -05:00
Mason Daugherty	4ca586b322	feat(model-profiles): add `text_inputs` and `text_outputs` (#35084 ) - Add `text_inputs` and `text_outputs` fields to `ModelProfile` - Regenerate `_profiles.py` for all providers ## Why models.dev data includes `'text'` as both an input and output modality, but we didn't capture it. models.dev broadly contains models without text input (Whisper/ASR) and without text output (image generators, TTS). Without this, downstream consumers can't filter on model text support (e.g. preventing users from passing text input to an audio-only model). --- We'd need to also run for Google, AWS and cut releases for all to propagate	2026-02-09 14:50:09 -05:00
Guofang.Tang	06a7d079b0	fix(openai): detect codex models for responses api preference (#35058 )	2026-02-08 13:15:48 -05:00

1 2 3 4 5 ...

360 Commits