langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-22 02:45:49 +00:00

Author	SHA1	Message	Date
ccurme	e2a0ff07fd	openai[patch]: include 'type' key internally when streaming reasoning blocks (#31661 ) Covered by existing tests. Will make it easier to process streamed reasoning blocks.	2025-06-18 15:01:54 -04:00
ccurme	6409498f6c	openai[patch]: route to Responses API if relevant attributes are set (#31645 ) Following https://github.com/langchain-ai/langchain/pull/30329.	2025-06-17 16:04:38 -04:00
ccurme	3044bd37a9	openai: release 0.3.24 (#31642 )	2025-06-17 15:06:52 -04:00
ccurme	c1c3e13a54	openai[patch]: add Responses API attributes to BaseChatOpenAI (#30329 ) `reasoning`, `include`, `store`, `truncation`. Previously these had to be added through `model_kwargs`.	2025-06-17 14:45:50 -04:00
ccurme	b610859633	openai[patch]: support Responses streaming in AzureChatOpenAI (#31641 ) Resolves https://github.com/langchain-ai/langchain/issues/31303, https://github.com/langchain-ai/langchain/issues/31624	2025-06-17 14:41:09 -04:00
ZhangShenao	0b5c06e89f	[Doc] Improve api doc for perplexity (#31636 ) - add param in api doc - fix word spelling --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-06-17 14:10:43 +00:00
FT	c4c39c1ae6	mistralai[patch]: Fix Typos in Comments and Improve Compatibility Note (#31616 ) Description: This pull request corrects minor spelling mistakes in the comments within the `chat_models.py` file of the MistralAI partner integration. Specifically, it fixes the spelling of "equivalent" and "compatibility" in two separate comments. These changes improve code readability and maintain professional documentation standards. No functional code changes are included.	2025-06-17 09:23:25 -04:00
ccurme	b9357d456e	openai[patch]: refactor handling of Responses API (#31587 )	2025-06-16 14:01:39 -04:00
Peter Schneider	cecfec5efa	huggingface: handle image-text-to-text pipeline task (#31611 ) Description: Allows for HuggingFacePipeline to handle image-text-to-text pipeline	2025-06-14 16:41:11 -04:00
ccurme	5839801897	openai: release 0.3.23 (#31604 )	2025-06-13 14:02:38 +00:00
ccurme	0c10ff6418	openai[patch]: handle annotation change in openai==1.82.0 (#31597 ) https://github.com/openai/openai-python/pull/2372/files#diff-91cfd5576e71b4b72da91e04c3a029bab50a72b5f7a2ac8393fca0a06e865fb3	2025-06-12 23:38:41 -04:00
ccurme	4071670f56	huggingface[patch]: bump transformers (#31559 )	2025-06-10 20:43:33 +00:00
ccurme	40d6d4c738	huggingface[patch]: bump core dep (#31558 )	2025-06-10 20:26:13 +00:00
Mohammad Mohtashim	42eb356a44	[OpenAI]: Encoding Model (#31402 ) - Description: Small fix for getting the encoder when a KeyError is raised, and for using the correct encoder for newer models - Issue: #31390	2025-06-10 16:00:00 -04:00
ccurme	71b0f78952	openai: release 0.3.22 (#31542 )	2025-06-09 15:29:15 -04:00
ccurme	575662d5f1	openai[patch]: accommodate change in image generation API (#31522 ) OpenAI changed their API to require the `partial_images` parameter when using image generation + streaming. As described in https://github.com/langchain-ai/langchain/pull/31424, we are ignoring partial images. Here, we accept the `partial_images` parameter (as required by OpenAI), but emit a warning and continue to ignore partial images.	2025-06-09 14:57:46 -04:00
ccurme	ece9e31a7a	openai[patch]: VCR some tests (#31524 )	2025-06-06 23:00:57 +00:00
Bagatur	5187817006	openai[release]: 0.3.21 (#31519 )	2025-06-06 11:40:09 -04:00
Bagatur	761f8c3231	openai[patch]: pass through with_structured_output kwargs (#31518 ) Support ```python from langchain.chat_models import init_chat_model from pydantic import BaseModel class ResponseSchema(BaseModel): response: str def get_weather(location: str) -> str: """Get weather""" pass llm = init_chat_model("openai:gpt-4o-mini") structured_llm = llm.with_structured_output( ResponseSchema, tools=[get_weather], strict=True, include_raw=True, tool_choice="required", parallel_tool_calls=False, ) structured_llm.invoke("whats up?") ```	2025-06-06 11:17:34 -04:00
Bagatur	0375848f6c	openai[patch]: update with_structured_outputs docstring (#31517 ) Update docstrings	2025-06-06 10:03:47 -04:00
ccurme	a1f068eb85	openai: release 0.3.20 (#31515 )	2025-06-06 13:29:12 +00:00
ccurme	4cc2f6b807	openai[patch]: guard against None text completions in BaseOpenAI (#31514 ) Some chat completions APIs will return null `text` output (even though this is typed as string).	2025-06-06 09:14:37 -04:00
Eugene Yurtsev	73655b0ca8	huggingface: 0.3.0 release (#31503 ) Breaking change to make some dependencies optional: https://github.com/langchain-ai/langchain/pull/31268 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-06-05 20:20:15 +00:00
Bagatur	f7f52cab12	anthropic[patch]: cache tokens nit (#31484 ) if you pass in beta headers directly cache_creation is a dict	2025-06-05 16:15:03 -04:00
ccurme	14c561e15d	infra: relax types-requests version range (#31504 )	2025-06-05 18:57:08 +00:00
ccurme	6d6f305748	openai[patch]: clarify docs on api_version in docstring for AzureChatOpenAI (#31502 )	2025-06-05 16:06:22 +00:00
Simon Stone	815bfa5408	huggingface[major]: Reduce disk footprint by 95% by making large dependencies optional (#31268 ) Description: `langchain_huggingface` has a very large installation size of around 600 MB (on a Mac with Python 3.11). This is due to its dependency on `sentence-transformers`, which in turn depends on `torch`, which is 320 MB all by itself. Similarly, the dependency on `transformers` adds another set of heavy dependencies. With those dependencies removed, the installation of `langchain_huggingface` only takes up ~26 MB. This is only 5% of the full installation! These libraries are not necessary to use `langchain_huggingface`'s API wrapper classes, only for local inferences/embeddings. All import statements for those two libraries already have import guards in place (try/catch with a helpful "please install x" message). This PR therefore moves those two libraries to an optional dependency group `full`. So a `pip install langchain_huggingface` will only install the lightweight version, and a `pip install "langchain_huggingface[full]"` will install all dependencies. I know this may break existing code, because `sentence-transformers` and `transformers` are now no longer installed by default. Given that users will see helpful error messages when that happens, and the major impact of this small change, I hope that you will still consider this PR. Dependencies: No new dependencies, but new optional grouping.	2025-06-05 12:04:19 -04:00
Bagatur	ec8bab83f8	anthropic[fix]: bump langchain-core dep (#31483 )	2025-06-03 10:56:48 -04:00
Bagatur	310e643842	release[anthropic]: 0.3.15 (#31479 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-06-03 10:38:11 -04:00
Eugene Yurtsev	6cb3ea514a	openai: release 0.3.19 (#31466 ) Release 0.3.19	2025-06-02 12:44:49 -04:00
Eugene Yurtsev	17f34baa88	openai[minor]: add image generation to responses api (#31424 ) Does not support partial images during generation at the moment. Before doing that I'd like to figure out how to specify the aggregation logic without requiring changes in core. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-06-02 10:03:54 -04:00
ccurme	d3be4a0c56	infra: remove use of --vcr-record=none (#31452 ) This option is specific to `pytest-vcr`. `pytest-recording` runs in this mode by default.	2025-06-01 10:49:59 -04:00
ccurme	3db1aa0ba6	standard-tests: migrate to pytest-recording (#31425 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-05-31 15:21:15 -04:00
ccurme	5bf89628bf	groq[patch]: update model for integration tests (#31440 ) Llama-3.1 started failing consistently with > groq.BadRequestError: Error code: 400 - {'error': {'message': "Failed to call a function. Please adjust your prompt. See 'failed_generation' for more details.", 'type': 'invalid_request_error', 'code': 'tool_use_failed', 'failed_generation': '<function=brave_search>{"query": "Hello!"}</function>'}}	2025-05-30 17:27:12 +00:00
अंkur गोswami	729526ff7c	huggingface: Undefined model_id fix (#31358 ) Description: This change fixes the undefined model_id issue when instantiating [ChatHuggingFace](https://github.com/langchain-ai/langchain/blob/master/libs/partners/huggingface/langchain_huggingface/chat_models/huggingface.py#L306) Issue: Fixes https://github.com/langchain-ai/langchain/issues/31357 @baskaryan @hwchase17	2025-05-29 15:59:35 -04:00
ccurme	c8951ca124	infra: drop azure from streaming benchmarks (#31421 ) Covered by BaseChatOpenAI	2025-05-29 15:06:12 -04:00
ccurme	afd349cc95	openai: cache httpx client (#31260 ) ![Screenshot 2025-05-16 at 3 49 54 PM](https://github.com/user-attachments/assets/4b377384-a769-4487-b801-bd1aa0ed66c1) Co-authored-by: Sydney Runkle <54324534+sydney-runkle@users.noreply.github.com>	2025-05-29 14:03:06 -04:00
ccurme	49eeb0f3c3	standard-tests: add benchmarks (#31302 ) Co-authored-by: Sydney Runkle <sydneymarierunkle@gmail.com>	2025-05-29 15:21:37 +00:00
ccurme	0e3f35effe	anthropic: store cache ttl details on usage metadata (#31393 )	2025-05-28 13:52:37 -04:00
ccurme	ab8b4003be	openai[patch]: add test case for code interpreter (#31383 )	2025-05-27 19:11:31 +00:00
ccurme	c8a656c05b	docs: update xai docs (#31382 )	2025-05-27 15:09:51 -04:00
ccurme	6ecc85c163	xai: document live search feature (#31381 )	2025-05-27 14:51:19 -04:00
ccurme	5bff018951	xai: release 0.2.4 (#31380 )	2025-05-27 14:33:36 -04:00
ccurme	8b1f54c419	xai: support live search (#31379 ) https://docs.x.ai/docs/guides/live-search	2025-05-27 14:08:59 -04:00
ccurme	443341a20d	anthropic: release 0.3.14 (#31378 )	2025-05-27 17:31:05 +00:00
ccurme	580986b260	anthropic: support for code execution, MCP connector, files API features (#31340 ) Support for the new [batch of beta features](https://www.anthropic.com/news/agent-capabilities-api) released yesterday: - [Code execution](https://docs.anthropic.com/en/docs/agents-and-tools/tool-use/code-execution-tool) - [MCP connector](https://docs.anthropic.com/en/docs/agents-and-tools/mcp-connector) - [Files API](https://docs.anthropic.com/en/docs/build-with-claude/files) Also verified support for [prompt cache TTL](https://docs.anthropic.com/en/docs/build-with-claude/prompt-caching#1-hour-cache-duration-beta).	2025-05-27 12:45:45 -04:00
ccurme	0ce2e69cc1	openai: release 0.3.18 (#31320 )	2025-05-22 12:53:53 -04:00
ccurme	851fd438cf	openai[patch]: relax Azure llm streaming callback test (#31319 ) Effectively reverts https://github.com/langchain-ai/langchain/pull/29302, but check that counts are "less than" instead of equal to an expected count.	2025-05-22 16:14:53 +00:00
ccurme	053a1246da	openai[patch]: support built-in code interpreter and remote MCP tools (#31304 )	2025-05-22 11:47:57 -04:00
ccurme	1b5ffe4107	openai[patch]: run _tokenize in background thread in async embedding invocations (#31312 )	2025-05-22 10:27:33 -04:00
Ishan Goswami	f16456139b	exa docs and python package update (#31307 ) Added support for new Exa API features. Updated Exa docs and python package (langchain-exa). Description Added support for new Exa API features in the langchain-exa package: - Added max_characters option for text content - Added support for summary and custom summary prompts - Added livecrawl option with "always", "fallback", "never" settings - Added "auto" option for search type - Updated documentation and tests Dependencies - No new dependencies required. Using existing features from exa-py. twitter: @theishangoswami --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-05-21 21:33:30 -04:00
ccurme	beacedd6b3	openai[patch]: update tests for strict schemas (#31306 ) Following recent [changes](https://platform.openai.com/docs/changelog).	2025-05-21 22:06:17 +00:00
ccurme	dcb5aba999	openai[patch]: reduce tested constraints on strict schema adherence for Responses API (#31290 ) Scheduled testing started failing today because the Responses API stopped raising `BadRequestError` for a schema that was previously invalid when `strict=True`. Although docs still say that [some type-specific keywords are not yet supported](https://platform.openai.com/docs/guides/structured-outputs#some-type-specific-keywords-are-not-yet-supported) (including `minimum` and `maximum` for numbers), the below appears to run and correctly respect the constraints: ```python import json import openai maximums = list(range(1, 11)) arg_values = [] for maximum in maximums: tool = { "type": "function", "name": "magic_function", "description": "Applies a magic function to an input.", "parameters": { "properties": { "input": {"maximum": maximum, "minimum": 0, "type": "integer"} }, "required": ["input"], "type": "object", "additionalProperties": False }, "strict": True } client = openai.OpenAI() response = client.responses.create( model="gpt-4.1", input=[{"role": "user", "content": "What is the value of magic_function(3)? Use the tool."}], tools=[tool], ) function_call = next(item for item in response.output if item.type == "function_call") args = json.loads(function_call.arguments) arg_values.append(args["input"]) print(maximums) print(arg_values) # [1, 2, 3, 4, 5, 6, 7, 8, 9, 10] # [1, 2, 3, 3, 3, 3, 3, 3, 3, 3] ``` Until yesterday this raised BadRequestError. 
The same is not true of Chat Completions, which appears to still raise BadRequestError ```python tool = { "type": "function", "function": { "name": "magic_function", "description": "Applies a magic function to an input.", "parameters": { "properties": { "input": {"maximum": 5, "minimum": 0, "type": "integer"} }, "required": ["input"], "type": "object", "additionalProperties": False }, "strict": True } } response = client.chat.completions.create( model="gpt-4.1", messages=[{"role": "user", "content": "What is the value of magic_function(3)? Use the tool."}], tools=[tool], ) response # raises BadRequestError ``` Here we update tests accordingly.	2025-05-20 14:50:31 +00:00
ccurme	bf645c83f4	voyageai: remove from monorepo (#31281 ) langchain-voyageai is now maintained at https://github.com/voyage-ai/langchain-voyageai.	2025-05-19 16:33:38 +00:00
ccurme	32fcc97a90	openai[patch]: compat with Bedrock Converse (#31280 ) ChatBedrockConverse passes through reasoning content blocks in [Bedrock Converse format](https://docs.aws.amazon.com/bedrock/latest/APIReference/API_runtime_ContentBlock.html). Similar to how we handle Anthropic thinking blocks, here we ensure these are filtered out of OpenAI request payloads. Resolves https://github.com/langchain-ai/langchain/issues/31279.	2025-05-19 10:35:26 -04:00
mathislindner	e1af509966	anthropic: emit informative error message if there are only system messages in a prompt (#30822 ) PR message: Not sure if I put the check at the right spot, but I thought throwing the error before the loop made sense to me. Description: Checks if there are only system messages using AnthropicChat model and throws an error if it's the case. Check Issue for more details Issue: #30764 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-05-16 20:43:59 +00:00
ccurme	a401d7e52a	ollama: release 0.3.3 (#31253 )	2025-05-15 16:24:04 -04:00
Alexey Bondarenko	9efafe3337	ollama: Add separate kwargs parameter for async client (#31209 ) Description: Add an `async_client_kwargs` field to ollama chat/llm/embeddings adapters that is passed to the async httpx client constructor. Motivation: In my use-case: - chat/embedding model adapters may be created frequently, sometimes to be called just once or to never be called at all - they may be used in both sync and async mode (not known at the moment they are created) So, I want to keep a static transport instance maintaining a connection pool, so model adapters can be created and destroyed freely. But that doesn't work when both sync and async functions are in use, as I can only pass one transport instance for both the sync and async client, while the transport types must be different for them. So I can't make both sync and async calls use a shared transport with the current model adapter interfaces. In this PR I add a separate `async_client_kwargs` that gets passed to the async client constructor, so it will be possible to pass a separate transport instance. For the sake of backwards compatibility, it is merged with `client_kwargs`, so nothing changes when it is not set. I am unable to run the linter right now, but the changes look ok.	2025-05-15 16:10:10 -04:00
ccurme	6bbc12b7f7	chroma: release 0.2.4 (#31252 )	2025-05-15 15:58:29 -04:00
Jai Radhakrishnan	aa4890c136	partners: update deps for langchain-chroma (#31251 ) Updates dependencies to Chroma to integrate the major release of Chroma with improved performance, and to fix issues users have been seeing using the latest chroma docker image with langchain-chroma https://github.com/langchain-ai/langchain/issues/31047#issuecomment-2850790841 Updates chromadb dependency to >=1.0.9 This also removes the dependency of chroma-hnswlib, meaning it can run against python 3.13 runners for tests as well. Tested this by pulling the latest Chroma docker image, running langchain-chroma using client mode ``` httpClient = chromadb.HttpClient(host="localhost", port=8000) vector_store = Chroma( client=httpClient, collection_name="test", embedding_function=embeddings, ) ```	2025-05-15 15:55:15 -04:00
ccurme	8b145d5dc3	openai: release 0.3.17 (#31246 )	2025-05-15 09:18:22 -04:00
ccurme	0b8837a0cc	openai: support runtime kwargs in embeddings (#31195 )	2025-05-14 09:14:40 -04:00
ccurme	868cfc4a8f	openai: ignore function_calls if tool_calls are present (#31198 ) Some providers include (legacy) function calls in `additional_kwargs` in addition to tool calls. We currently unpack both function calls and tool calls if present, but OpenAI will raise 400 in this case. This can come up if providers are mixed in a tool-calling loop. Example: ```python from langchain.chat_models import init_chat_model from langchain_core.messages import HumanMessage from langchain_core.tools import tool @tool def get_weather(location: str) -> str: """Get weather at a location.""" return "It's sunny." gemini = init_chat_model("google_genai:gemini-2.0-flash-001").bind_tools([get_weather]) openai = init_chat_model("openai:gpt-4.1-mini").bind_tools([get_weather]) input_message = HumanMessage("What's the weather in Boston?") tool_call_message = gemini.invoke([input_message]) assert len(tool_call_message.tool_calls) == 1 tool_call = tool_call_message.tool_calls[0] tool_message = get_weather.invoke(tool_call) response = openai.invoke( # currently raises 400 / BadRequestError [input_message, tool_call_message, tool_message] ) ``` Here we ignore function calls if tool calls are present.	2025-05-12 13:50:56 -04:00
ccurme	9aac8923a3	docs: add web search to anthropic docs (#31169 )	2025-05-08 16:20:11 -04:00
ccurme	2d202f9762	anthropic[patch]: split test into two (#31167 )	2025-05-08 09:23:36 -04:00
ccurme	d4555ac924	anthropic: release 0.3.13 (#31162 )	2025-05-08 03:13:15 +00:00
ccurme	e34f9fd6f7	anthropic: update streaming usage metadata (#31158 ) Anthropic updated how they report token counts during streaming today. See changes to `MessageDeltaUsage` in [this commit](`2da00f26c5 (diff-1a396eba0cd9cd8952dcdb58049d3b13f6b7768ead1411888d66e28211f7bfc5)`). It's clean and simple to grab these fields from the final `message_delta` event. However, some of them are typed as Optional, and language [here](`e42451ab3f/src/anthropic/lib/streaming/_messages.py (L462)`) suggests they may not always be present. So here we take the required field from the `message_delta` event as we were doing previously, and ignore the rest.	2025-05-07 23:09:56 -04:00
ccurme	682f338c17	anthropic[patch]: support web search (#31157 )	2025-05-07 18:04:06 -04:00
ccurme	d7e016c5fc	huggingface: release 0.2 (#31153 )	2025-05-07 15:33:07 -04:00
ccurme	4b11cbeb47	huggingface[patch]: update lockfile (#31152 )	2025-05-07 15:17:33 -04:00
ccurme	b5b90b5929	anthropic[patch]: be robust to null fields when translating usage metadata (#31151 )	2025-05-07 18:30:21 +00:00
zhurou603	1df3ee91e7	partners: (langchain-openai) total_tokens should not add 'Nonetype' t… (#31146 ) partners: (langchain-openai) total_tokens should not add 'Nonetype' t… # PR Description ## Description Fixed an issue in `langchain-openai` where `total_tokens` was incorrectly adding `None` to an integer, causing a TypeError. The fix ensures proper type checking before adding token counts. ## Issue Fixes the TypeError traceback shown in the image where `'NoneType'` cannot be added to an integer. ## Dependencies None ## Twitter handle None ![image](https://github.com/user-attachments/assets/9683a795-a003-455a-ada9-fe277245e2b2) Co-authored-by: qiulijie <qiulijie@yuaiweiwu.com>	2025-05-07 11:09:50 -04:00
唐小鸭	50fa524a6d	partners: (langchain-deepseek) fix deepseek-r1 always returns an empty `reasoning_content` when reasoning (#31065 ) ## Description deepseek-r1 always returns an empty string `reasoning_content` in the first chunk when thinking, and sets `reasoning_content` to None when thinking is over, to signal when to switch to normal output. Therefore, whether reasoning is in progress should be determined by checking `reasoning_content` against None. ## Demo deepseek-r1 reasoning output: ``` {'delta': {'content': None, 'function_call': None, 'refusal': None, 'role': 'assistant', 'tool_calls': None, 'reasoning_content': ''}, 'finish_reason': None, 'index': 0, 'logprobs': None} {'delta': {'content': None, 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': '好的'}, 'finish_reason': None, 'index': 0, 'logprobs': None} {'delta': {'content': None, 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': '，'}, 'finish_reason': None, 'index': 0, 'logprobs': None} {'delta': {'content': None, 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': '用户'}, 'finish_reason': None, 'index': 0, 'logprobs': None} ... ``` deepseek-r1 first normal output ``` ... {'delta': {'content': ' main', 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': None}, 'finish_reason': None, 'index': 0, 'logprobs': None} {'delta': {'content': '\n\nimport', 'function_call': None, 'refusal': None, 'role': None, 'tool_calls': None, 'reasoning_content': None}, 'finish_reason': None, 'index': 0, 'logprobs': None} ... ``` --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-05-05 22:31:58 +00:00
Asif Mehmood	00ac49dd3e	Replace deprecated .dict() with .model_dump() for Pydantic v2 compatibility (#31107 ) What does this PR do? This PR replaces deprecated usages of `.dict()` with `.model_dump()` to ensure compatibility with Pydantic v2 and prepare for v3, addressing the deprecation warning `PydanticDeprecatedSince20` as required in [Issue #31103](https://github.com/langchain-ai/langchain/issues/31103). Changes made: * Replaced `.dict()` with `.model_dump()` in multiple locations * Ensured consistency with Pydantic v2 migration guidelines * Verified compatibility across affected modules Notes * This is a code maintenance and compatibility update * Tested locally with Pydantic v2.11 * No functional logic changes; only internal method replacements to prevent deprecation issues	2025-05-03 13:40:54 -04:00
ccurme	77ecf47f6d	openai: release 0.3.16 (#31100 )	2025-05-02 13:14:46 -04:00
ccurme	94139ffcd3	openai[patch]: format system content blocks for Responses API (#31096 ) ```python from langchain_core.messages import HumanMessage, SystemMessage from langchain_openai import ChatOpenAI llm = ChatOpenAI(model="gpt-4.1", use_responses_api=True) messages = [ SystemMessage("test"), # Works HumanMessage("test"), # Works SystemMessage([{"type": "text", "text": "test"}]), # Bug in this case HumanMessage([{"type": "text", "text": "test"}]), # Works SystemMessage([{"type": "input_text", "text": "test"}]) # Works ] llm._get_request_payload(messages) ```	2025-05-02 15:22:30 +00:00
ccurme	26ad239669	core, openai[patch]: prefer provider-assigned IDs when aggregating message chunks (#31080 ) When aggregating AIMessageChunks in a stream, core prefers the leftmost non-null ID. This is problematic because: - Core assigns IDs when they are null to `f"run-{run_manager.run_id}"` - The desired meaningful ID might not be available until midway through the stream, as is the case for the OpenAI Responses API. For the OpenAI Responses API, we assign message IDs to the top-level `AIMessage.id`. This works in `.(a)invoke`, but during `.(a)stream` the IDs get overwritten by the defaults assigned in langchain-core. These IDs [must](https://community.openai.com/t/how-to-solve-badrequesterror-400-item-rs-of-type-reasoning-was-provided-without-its-required-following-item-error-in-responses-api/1151686/9) be available on the AIMessage object to support passing reasoning items back to the API (e.g., if not using OpenAI's `previous_response_id` feature). We could add them elsewhere, but seeing as we've already made the decision to store them in `.id` during `.(a)invoke`, addressing the issue in core lets us fix the problem with no interface changes.	2025-05-02 11:18:18 -04:00
ccurme	c51eadd54f	openai[patch]: propagate service_tier to response metadata (#31089 )	2025-05-01 13:50:48 -04:00
ccurme	6110c3ffc5	openai[patch]: release 0.3.15 (#31087 )	2025-05-01 09:22:30 -04:00
Ben Gladwell	da59eb7eb4	anthropic: Allow kwargs to pass through when counting tokens (#31082 ) - Description: `ChatAnthropic.get_num_tokens_from_messages` does not currently receive `kwargs` and pass those on to `self._client.beta.messages.count_tokens`. This is a problem if you need to pass specific options to `count_tokens`, such as the `thinking` option. This PR fixes that. - Issue: N/A - Dependencies: None - Twitter handle: @bengladwell Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-30 17:56:22 -04:00
Really Him	918c950737	DOCS: `partners/chroma`: Fix documentation around `chroma` query filter syntax (#31058 ) Description: * Starting to put together some PRs to fix the typing around `langchain-chroma` `filter` and `where_document` query filtering, as mentioned: https://github.com/langchain-ai/langchain/issues/30879 https://github.com/langchain-ai/langchain/issues/30507 The typing of `dict[str, str]` is on the one hand too restrictive (marks valid filter expressions as ill-typed) and on the other too permissive (allows illegal filter expressions). That's not what this PR addresses though. This PR just removes from the documentation some examples of filters that are illegal, and also syntactically incorrect: (a) dictionaries with keys like `$contains` but the key is missing quotation marks; (b) dictionaries with multiple entries - this is illegal in Chroma filter syntax and will raise an exception. (`{"foo": "bar", "qux": "baz"}`). Filter dictionaries in Chroma must have one and only one key. Again this is just the documentation issue, which is the lowest hanging fruit. I also think we need to update the types for `filter` and `where_document` to be (at the very least) `dict[str, Any]`, or, since we have access to Chroma's types, they should be `Where` and `WhereDocument` types. This has a wider blast radius though, so I'm starting small. This PR does not fix the issues mentioned above, it's just starting to get the ball rolling, and cleaning up the documentation. --------- Co-authored-by: Really Him <hesereallyhim@proton.me>	2025-04-30 17:51:07 -04:00
ccurme	bdb7c4a8b3	huggingface: fix embeddings return type (#31072 ) Integration tests failing cc @hanouticelina	2025-04-29 18:45:04 +00:00
célina	868f07f8f4	partners: (langchain-huggingface) Chat Models - Integrate Hugging Face Inference Providers and remove deprecated code (#30733 ) Hi there, I'm Célina from 🤗, This PR introduces support for Hugging Face's serverless Inference Providers (documentation [here](https://huggingface.co/docs/inference-providers/index)), allowing users to specify different providers for chat completion and text generation tasks. This PR also removes the usage of `InferenceClient.post()` method in `HuggingFaceEndpoint`, in favor of the task-specific `text_generation` method. `InferenceClient.post()` is deprecated and will be removed in `huggingface_hub v0.31.0`. --- ## Changes made - bumped the minimum required version of the `huggingface-hub` package to ensure compatibility with the latest API usage. - added a `provider` field to `HuggingFaceEndpoint`, enabling users to select the inference provider (e.g., 'cerebras', 'together', 'fireworks-ai'). Defaults to `hf-inference` (HF Inference API). - replaced the deprecated `InferenceClient.post()` call in `HuggingFaceEndpoint` with the task-specific `text_generation` method for future-proofing, `post()` will be removed in huggingface-hub v0.31.0. - updated the `ChatHuggingFace` component: - added async and streaming support. - added support for tool calling. - exposed underlying chat completion parameters for more granular control. - Added integration tests for `ChatHuggingFace` and updated the corresponding unit tests. ✅ All changes are backward compatible. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-29 09:53:14 -04:00
Sydney Runkle	7e926520d5	packaging: remove Python upper bound for langchain and co libs (#31025 ) Follow up to https://github.com/langchain-ai/langsmith-sdk/pull/1696, I've bumped the `langsmith` version where applicable in `uv.lock`. Type checking problems here because deps have been updated in `pyproject.toml` and `uv lock` hasn't been run - we should enforce that in the future - goes with the other dependabot todos :).	2025-04-28 14:44:28 -04:00
Sydney Runkle	d614842d23	ci: temporarily run chroma on 3.12 for CI (#31056 ) Waiting on a fix for https://github.com/chroma-core/chroma/issues/4382	2025-04-28 13:20:37 -04:00
湛露先生	5fb8fd863a	langchain_openai: clean duplicate code for openai embedding. (#30872 ) `_chunk_size` is not changed by the method `self._tokenize`, so I think this is duplicate code. Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>	2025-04-27 15:07:41 -04:00
ccurme	a60fd06784	docs: document OpenAI flex processing (#31023 ) Following https://github.com/langchain-ai/langchain/pull/31005	2025-04-25 15:10:25 -04:00
ccurme	629b7a5a43	openai[patch]: add explicit attribute for service tier (#31005 )	2025-04-25 18:38:23 +00:00
ccurme	a7903280dd	openai[patch]: delete redundant tests (#31004 ) These are covered by standard tests.	2025-04-24 17:56:32 +00:00
ccurme	10a9c24dae	openai: fix streaming reasoning without summaries (#30999 ) Following https://github.com/langchain-ai/langchain/pull/30909: need to retain "empty" reasoning output when streaming, e.g., ```python {'id': 'rs_...', 'summary': [], 'type': 'reasoning'} ``` Tested by existing integration tests, which are currently failing.	2025-04-24 16:01:45 +00:00
ccurme	faef3e5d50	core, standard-tests: support PDF and audio input in Chat Completions format (#30979 ) Chat models currently implement support for: - images in OpenAI Chat Completions format - other multimodal types (e.g., PDF and audio) in a cross-provider [standard format](https://python.langchain.com/docs/how_to/multimodal_inputs/) Here we update core to extend support to PDF and audio input in Chat Completions format. If an OAI-format PDF or audio content block is passed into any chat model, it will be transformed to the LangChain standard format. We assume that any chat model supporting OAI-format PDF or audio has implemented support for the standard format.	2025-04-23 18:32:51 +00:00
ccurme	4bc70766b5	core, openai: support standard multi-modal blocks in convert_to_openai_messages (#30968 )	2025-04-23 11:20:44 -04:00
ccurme	e4877e5ef1	fireworks: release 0.3.0 (#30977 )	2025-04-23 10:08:38 -04:00
ccurme	eedda164c6	fireworks[minor]: remove default model and temperature (#30965 ) `mixtral-8x-7b-instruct` was recently retired from Fireworks Serverless. Here we remove the default model altogether, so that the model must be explicitly specified on init: ```python ChatFireworks(model="accounts/fireworks/models/llama-v3p1-70b-instruct") # for example ``` We also set a null default for `temperature`, which previously defaulted to 0.0. This parameter will no longer be included in request payloads unless it is explicitly provided.	2025-04-22 15:58:58 -04:00
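The effect of a null `temperature` default in #30965 can be illustrated with a small stand-alone sketch. `build_payload` is a hypothetical function, not part of `langchain-fireworks`; it only shows the idea of leaving a parameter out of the request unless the caller set it.

```python
from typing import Any, Optional


def build_payload(model: str, temperature: Optional[float] = None) -> dict[str, Any]:
    """Hypothetical payload builder: include temperature only when it was set explicitly."""
    payload: dict[str, Any] = {"model": model}  # model has no default, so it must be passed
    if temperature is not None:
        payload["temperature"] = temperature
    return payload
```

Checking `is not None` rather than truthiness matters here, since an explicit `temperature=0.0` should still be sent.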
ccurme	a7c1bccd6a	openai[patch]: remove xfails from image token counting tests (#30963 ) These appear to be passing again.	2025-04-22 15:55:33 +00:00
Dmitrii Rashchenko	a43df006de	Support of openai reasoning summary streaming (#30909 ) langchain_openai: Support of reasoning summary streaming

Description: The OpenAI API now supports streaming reasoning summaries for reasoning models (o1, o3, o3-mini, o4-mini). More info about it: https://platform.openai.com/docs/guides/reasoning#reasoning-summaries

It is supported only in the Responses API (not the Chat Completions API), so you need to create the LangChain OpenAI model as follows to stream reasoning summaries:

```
llm = ChatOpenAI(
    model="o4-mini",  # also o1, o3, o3-mini support reasoning streaming
    use_responses_api=True,  # reasoning streaming works only with responses api, not completion api
    model_kwargs={
        "reasoning": {
            "effort": "high",  # also "low" and "medium" supported
            "summary": "auto",  # some models support "concise" summary, some "detailed", but auto will always work
        }
    },
)
```

Now, if you stream events from llm:

```
async for event in llm.astream_events(prompt, version="v2"):
    print(event)
```

or

```
for chunk in llm.stream(prompt):
    print(chunk)
```

the OpenAI API will send new types of events: `response.reasoning_summary_text.added` `response.reasoning_summary_text.delta` `response.reasoning_summary_text.done`

These events are new, so they were previously ignored. I have added support for them in the function `_convert_responses_chunk_to_generation_chunk`, so that reasoning chunks or the full reasoning summary are added to the chunk's `additional_kwargs`.

Example of how this reasoning summary may be printed:

```
async for event in llm.astream_events(prompt, version="v2"):
    if event["event"] == "on_chat_model_stream":
        chunk: AIMessageChunk = event["data"]["chunk"]
        if "reasoning_summary_chunk" in chunk.additional_kwargs:
            print(chunk.additional_kwargs["reasoning_summary_chunk"], end="")
        elif "reasoning_summary" in chunk.additional_kwargs:
            print("\n\nFull reasoning step summary:", chunk.additional_kwargs["reasoning_summary"])
        elif chunk.content and chunk.content[0]["type"] == "text":
            print(chunk.content[0]["text"], end="")
```

or

```
for chunk in llm.stream(prompt):
    if "reasoning_summary_chunk" in chunk.additional_kwargs:
        print(chunk.additional_kwargs["reasoning_summary_chunk"], end="")
    elif "reasoning_summary" in chunk.additional_kwargs:
        print("\n\nFull reasoning step summary:", chunk.additional_kwargs["reasoning_summary"])
    elif chunk.content and chunk.content[0]["type"] == "text":
        print(chunk.content[0]["text"], end="")
```

--------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-22 14:51:13 +0000
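The chunk handling described in #30909 can be tried without an API key using stand-in objects. The sketch below is illustrative only: `FakeChunk` and `split_stream` are hypothetical names, and the `reasoning_summary_chunk` key is taken from this commit message, so later releases may expose reasoning differently.

```python
from dataclasses import dataclass, field
from typing import Any, Iterable


@dataclass
class FakeChunk:
    """Stand-in for a streamed message chunk; not a LangChain class."""

    content: list = field(default_factory=list)
    additional_kwargs: dict = field(default_factory=dict)


def split_stream(chunks: Iterable[Any]) -> tuple[str, str]:
    """Collect streamed reasoning-summary deltas and answer text separately."""
    reasoning, answer = [], []
    for chunk in chunks:
        extra = chunk.additional_kwargs
        if "reasoning_summary_chunk" in extra:
            reasoning.append(extra["reasoning_summary_chunk"])
        elif chunk.content and chunk.content[0].get("type") == "text":
            answer.append(chunk.content[0]["text"])
    return "".join(reasoning), "".join(answer)
```

Keeping the two streams apart makes it straightforward to render the summary in a separate pane or log it without mixing it into the final answer.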
ccurme	920d504e47	fireworks[patch]: update model in LLM integration tests (#30951 ) `mixtral-8x7b-instruct` has been retired.	2025-04-21 17:53:27 +00:00
Ahmed Tammaa	589bc19890	anthropic[patch]: make description optional on AnthropicTool (#30935 ) PR Summary: This change adds a fallback in ChatAnthropic.with_structured_output() to handle Pydantic models that don’t include a docstring. Without it, calling:

```py
from pydantic import BaseModel
from langchain_anthropic import ChatAnthropic

class SampleModel(BaseModel):
    sample_field: str

llm = ChatAnthropic(
    model="claude-3-7-sonnet-latest"
).with_structured_output(SampleModel.model_json_schema())

llm.invoke("test")
```

will raise a

```
KeyError: 'description'
```

because Pydantic omits the description field when no docstring is present. This issue doesn’t occur when using ChatOpenAI or if you add a docstring to the model:

```py
from pydantic import BaseModel
from langchain_openai import ChatOpenAI

class SampleModel(BaseModel):
    """Schema for sample_field output."""
    sample_field: str

llm = ChatOpenAI(
    model="gpt-4o-mini"
).with_structured_output(SampleModel.model_json_schema())

llm.invoke("test")
```

--------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-21 10:44:39 -04:00
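The failure mode in #30935 comes down to indexing a key that a JSON schema may not carry. The following sketch shows one way to tolerate its absence. `to_tool` is a hypothetical converter written for illustration, not the patched `langchain-anthropic` code, and the schema dictionary is hand-written to resemble what a docstring-free Pydantic model produces.

```python
def to_tool(schema: dict) -> dict:
    """Hypothetical converter from a JSON schema to a tool definition."""
    tool = {"name": schema["title"], "input_schema": schema}
    # Only copy the description when the schema actually carries one,
    # instead of indexing schema["description"] unconditionally.
    if "description" in schema:
        tool["description"] = schema["description"]
    return tool


# Hand-written schema resembling a model without a docstring (no "description" key).
no_docstring_schema = {
    "title": "SampleModel",
    "type": "object",
    "properties": {"sample_field": {"type": "string"}},
    "required": ["sample_field"],
}
tool = to_tool(no_docstring_schema)
```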
Aubrey Ford	b344f34635	partners/openai: OpenAIEmbeddings not respecting chunk_size argument (#30757 ) When calling `embed_documents` and providing a `chunk_size` argument, that argument is ignored when `OpenAIEmbeddings` is instantiated with its default configuration (where `check_embedding_ctx_length=True`). `_get_len_safe_embeddings` specifies a `chunk_size` parameter but it's not being passed through in `embed_documents`, which is its only caller. This appears to be an oversight, especially given that the `_get_len_safe_embeddings` docstring states it should respect "the set embedding context length and chunk size." Developers typically expect method parameters to take effect (also, take precedence) when explicitly provided, especially when instantiating using defaults. I was confused as to why my API calls were being rejected regardless of the chunk size I provided. This bug also exists in langchain_community package. I can add that to this PR if requested otherwise I will create a new one once this passes.	2025-04-18 15:27:27 -04:00
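The bug described in #30757 is a dropped argument between a public method and its private helper. A toy reproduction of the fixed behavior is sketched below. `ToyEmbeddings` and `_embed_in_batches` are hypothetical stand-ins, not `OpenAIEmbeddings` internals, and the "embedding" is just a placeholder number.

```python
from typing import Optional


class ToyEmbeddings:
    """Toy stand-in, not OpenAIEmbeddings: records the batch sizes it would send."""

    def __init__(self, chunk_size: int = 1000) -> None:
        self.chunk_size = chunk_size
        self.batches: list[int] = []

    def _embed_in_batches(
        self, texts: list[str], chunk_size: Optional[int] = None
    ) -> list[list[float]]:
        size = chunk_size or self.chunk_size  # explicit argument wins over the default
        out: list[list[float]] = []
        for start in range(0, len(texts), size):
            batch = texts[start : start + size]
            self.batches.append(len(batch))
            out.extend([[float(len(t))] for t in batch])  # placeholder "embedding"
        return out

    def embed_documents(
        self, texts: list[str], chunk_size: Optional[int] = None
    ) -> list[list[float]]:
        # The fix amounts to forwarding chunk_size here instead of dropping it.
        return self._embed_in_batches(texts, chunk_size=chunk_size)
```

With the argument forwarded, a call such as `embed_documents(texts, chunk_size=2)` batches in pairs even though the instance default is 1000.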
Konsti-s	017c8079e1	partners: ChatAnthropic supports urls (#30809 ) Description: partners-anthropic: ChatAnthropic supports b64 and urls in the part[image_url][url] message variable Issue: ChatAnthropic right now only supports b64 encoded images in the part[image_url][url] message variable. This PR enables ChatAnthropic to also accept image urls in said variable and makes it compatible with OpenAI messages to make model switching easier. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-18 15:15:45 -04:00

1337 Commits