Ensures proper reStructuredText formatting by adding the required blank
line before closing docstring quotes, which resolves the "Block quote
ends without a blank line; unexpected unindent" warning.
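For illustration, a hypothetical docstring showing the pattern (the actual docstrings touched by the PR differ, but the fix is the same):
```python
# Before: Sphinx emits "Block quote ends without a blank line;
# unexpected unindent." because the closing quotes sit flush
# against the indented block.
def before() -> None:
    """Summary.

    .. code-block:: python

        before()
    """


# After: a blank line precedes the closing quotes.
def after() -> None:
    """Summary.

    .. code-block:: python

        after()

    """
```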
This PR addresses the common issue where users struggle to pass custom
parameters to OpenAI-compatible APIs like LM Studio, vLLM, and others.
The problem occurs when users try to use `model_kwargs` for custom
parameters, which causes API errors.
## Problem
Users attempting to pass custom parameters (like LM Studio's `ttl`
parameter) were getting errors:
```python
# ❌ This approach fails
llm = ChatOpenAI(
    base_url="http://localhost:1234/v1",
    model="mlx-community/QwQ-32B-4bit",
    model_kwargs={"ttl": 5},  # Causes TypeError: unexpected keyword argument 'ttl'
)
```
## Solution
The `extra_body` parameter is the correct way to pass custom parameters
to OpenAI-compatible APIs:
```python
# ✅ This approach works correctly
llm = ChatOpenAI(
    base_url="http://localhost:1234/v1",
    model="mlx-community/QwQ-32B-4bit",
    extra_body={"ttl": 5},  # Custom parameters go in extra_body
)
```
## Changes Made
1. **Enhanced Documentation**: Updated the `extra_body` parameter
docstring with comprehensive examples for LM Studio, vLLM, and other
providers
2. **Added Documentation Section**: Created a new "OpenAI-compatible
APIs" section in the main class docstring with practical examples
3. **Unit Tests**: Added tests to verify `extra_body` functionality
works correctly (a sketch follows this list):
- `test_extra_body_parameter()`: Verifies custom parameters are included
in the request payload
- `test_extra_body_with_model_kwargs()`: Ensures `extra_body` and
`model_kwargs` work together
4. **Clear Guidance**: Documented when to use `extra_body` vs
`model_kwargs`
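A minimal sketch of the first test (a hypothetical re-creation; the merged test's exact assertions may differ, and no network call is made):
```python
from langchain_openai import ChatOpenAI


def test_extra_body_parameter() -> None:
    llm = ChatOpenAI(
        base_url="http://localhost:1234/v1",
        api_key="lm-studio",
        model="mlx-community/QwQ-32B-4bit",
        extra_body={"ttl": 5},
    )
    # The custom parameter should be retained for the HTTP request body
    # rather than passed to create() as a Python keyword argument.
    assert llm.extra_body == {"ttl": 5}
```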
## Examples Added
**LM Studio with TTL (auto-eviction):**
```python
ChatOpenAI(
    base_url="http://localhost:1234/v1",
    api_key="lm-studio",
    model="mlx-community/QwQ-32B-4bit",
    extra_body={"ttl": 300},  # Auto-evict after 5 minutes
)
```
**vLLM with custom sampling:**
```python
ChatOpenAI(
    base_url="http://localhost:8000/v1",
    api_key="EMPTY",
    model="meta-llama/Llama-2-7b-chat-hf",
    extra_body={
        "use_beam_search": True,
        "best_of": 4,
    },
)
```
## Why This Works
- `model_kwargs` parameters are passed directly to the OpenAI client's
`create()` method, causing errors for non-standard parameters
- `extra_body` parameters are included in the HTTP request body, which
is exactly what OpenAI-compatible APIs expect for custom parameters
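Roughly, the difference looks like this at the OpenAI SDK level (an illustrative sketch against a local LM Studio server, not the library's actual internals):
```python
from openai import OpenAI

client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

# ❌ model_kwargs end up as Python keyword arguments:
# client.chat.completions.create(..., ttl=5)  ->  TypeError

# ✅ extra_body is merged into the JSON request body by the OpenAI SDK:
response = client.chat.completions.create(
    model="mlx-community/QwQ-32B-4bit",
    messages=[{"role": "user", "content": "Hello"}],
    extra_body={"ttl": 5},  # sent as {"ttl": 5} in the POST body
)
```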
Fixes #32115.
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>
Co-authored-by: Mason Daugherty <github@mdrxy.com>
Co-authored-by: Mason Daugherty <mason@langchain.dev>
OpenAI changed their API to require the `partial_images` parameter when
using image generation + streaming.
As described in https://github.com/langchain-ai/langchain/pull/31424, we
are ignoring partial images. Here, we accept the `partial_images`
parameter (as required by OpenAI), but emit a warning and continue to
ignore partial images.
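A hedged usage sketch of the resulting behavior (the `image_generation` tool shape follows OpenAI's Responses API; model and prompt are illustrative):
```python
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4.1", use_responses_api=True)

# OpenAI now requires `partial_images` when streaming image generation.
llm_with_tools = llm.bind_tools([{"type": "image_generation", "partial_images": 1}])

for chunk in llm_with_tools.stream("Draw a cartoon cat"):
    ...  # partial image chunks are still ignored; a warning is emitted
```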
Partial images during generation are still not supported. Before adding
that, I'd like to figure out how to specify the aggregation logic
without requiring changes in core.
---------
Co-authored-by: Chester Curme <chester.curme@gmail.com>
Some providers include (legacy) function calls in `additional_kwargs` in
addition to tool calls. We currently unpack both function calls and tool
calls if present, but OpenAI will raise 400 in this case.
This can come up if providers are mixed in a tool-calling loop. Example:
```python
from langchain.chat_models import init_chat_model
from langchain_core.messages import HumanMessage
from langchain_core.tools import tool


@tool
def get_weather(location: str) -> str:
    """Get weather at a location."""
    return "It's sunny."


gemini = init_chat_model("google_genai:gemini-2.0-flash-001").bind_tools([get_weather])
openai = init_chat_model("openai:gpt-4.1-mini").bind_tools([get_weather])

input_message = HumanMessage("What's the weather in Boston?")
tool_call_message = gemini.invoke([input_message])
assert len(tool_call_message.tool_calls) == 1

tool_call = tool_call_message.tool_calls[0]
tool_message = get_weather.invoke(tool_call)

response = openai.invoke(  # currently raises 400 / BadRequestError
    [input_message, tool_call_message, tool_message]
)
```
Here we ignore function calls if tool calls are present.
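A simplified sketch of the fix, assuming a message-conversion helper along these lines (names are illustrative; the real logic lives in langchain-openai's message conversion):
```python
import json

from langchain_core.messages import AIMessage


def _convert_ai_message(message: AIMessage) -> dict:
    # Illustrative only: prefer tool calls and drop any legacy
    # function_call, since sending both makes OpenAI return a 400.
    payload: dict = {"role": "assistant", "content": message.content or None}
    if message.tool_calls:
        payload["tool_calls"] = [
            {
                "type": "function",
                "id": tc["id"],
                "function": {"name": tc["name"], "arguments": json.dumps(tc["args"])},
            }
            for tc in message.tool_calls
        ]
    elif "function_call" in message.additional_kwargs:
        payload["function_call"] = message.additional_kwargs["function_call"]
    return payload
```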
partners: (langchain-openai) total_tokens should not add 'NoneType' t…
# PR Description
## Description
Fixed an issue in `langchain-openai` where `total_tokens` was
incorrectly adding `None` to an integer, causing a TypeError. The fix
ensures proper type checking before adding token counts.
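A minimal sketch of the guard (the actual change sits in langchain-openai's token-usage accumulation; this helper is illustrative):
```python
def _add_token_counts(left: int | None, right: int | None) -> int:
    # Treat a missing count as 0 so we never add None to an int.
    return (left or 0) + (right or 0)


assert _add_token_counts(10, None) == 10
assert _add_token_counts(None, None) == 0
```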
## Issue
Fixes the TypeError traceback shown in the image where `'NoneType'`
cannot be added to an integer.
## Dependencies
None
## Twitter handle
None

Co-authored-by: qiulijie <qiulijie@yuaiweiwu.com>
**What does this PR do?**
This PR replaces deprecated usages of `.dict()` with `.model_dump()` to
ensure compatibility with Pydantic v2 and prepare for v3, addressing the
`PydanticDeprecatedSince20` deprecation warning as required in
[Issue #31103](https://github.com/langchain-ai/langchain/issues/31103).
**Changes made:**
* Replaced `.dict()` with `.model_dump()` in multiple locations (a
before/after sketch follows this list)
* Ensured consistency with Pydantic v2 migration guidelines
* Verified compatibility across affected modules
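A minimal before/after illustration (the `Payload` model is hypothetical, not from the codebase):
```python
from pydantic import BaseModel


class Payload(BaseModel):  # hypothetical model, for illustration only
    name: str


p = Payload(name="example")

data = p.dict()        # deprecated: emits PydanticDeprecatedSince20
data = p.model_dump()  # Pydantic v2 replacement with the same output
```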
**Notes**
* This is a code maintenance and compatibility update
* Tested locally with Pydantic v2.11
* No functional logic changes; only internal method replacements to
prevent deprecation issues
When aggregating AIMessageChunks in a stream, core prefers the leftmost
non-null ID. This is problematic because:
- Core assigns null chunk IDs a default of `f"run-{run_manager.run_id}"`
- The desired meaningful ID might not be available until midway through
the stream, as is the case for the OpenAI Responses API.
For the OpenAI Responses API, we assign message IDs to the top-level
`AIMessage.id`. This works in `.(a)invoke`, but during `.(a)stream` the
IDs get overwritten by the defaults assigned in langchain-core. These
IDs
[must](https://community.openai.com/t/how-to-solve-badrequesterror-400-item-rs-of-type-reasoning-was-provided-without-its-required-following-item-error-in-responses-api/1151686/9)
be available on the AIMessage object to support passing reasoning items
back to the API (e.g., if not using OpenAI's `previous_response_id`
feature). We could add them elsewhere, but seeing as we've already made
the decision to store them in `.id` during `.(a)invoke`, addressing the
issue in core lets us fix the problem with no interface changes.
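A simplified sketch of the core-side change, assuming default IDs are recognizable by the `run-` prefix described above (the real merge logic handles more cases):
```python
def _merge_ids(left: str | None, right: str | None) -> str | None:
    # Illustrative only: prefer a provider-assigned ID (e.g. an OpenAI
    # Responses API message ID) over the default "run-..." placeholder,
    # instead of always keeping the leftmost non-null ID.
    candidates = [i for i in (left, right) if i]
    for candidate in candidates:
        if not candidate.startswith("run-"):
            return candidate
    return candidates[0] if candidates else None
```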