langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 10:17:00 +00:00

Author	SHA1	Message	Date
Andras L Ferenczi	b5f49df86a	partner: ChatDeepSeek on openrouter not returning reasoning (#30240 ) Deepseek model does not return reasoning when hosted on openrouter (Issue [30067](https://github.com/langchain-ai/langchain/issues/30067)) the following code did not return reasoning: ```python llm = ChatDeepSeek( model = 'deepseek/deepseek-r1:nitro', api_base="https://openrouter.ai/api/v1", api_key=os.getenv("OPENROUTER_API_KEY")) messages = [ {"role": "system", "content": "You are an assistant."}, {"role": "user", "content": "9.11 and 9.8, which is greater? Explain the reasoning behind this decision."} ] response = llm.invoke(messages, extra_body={"include_reasoning": True}) print(response.content) print(f"REASONING: {response.additional_kwargs.get('reasoning_content', '')}") print(response) ``` The fix is to extract reasoning from response.choices[0].message["model_extra"] and from choices[0].delta["reasoning"]. and place in response additional_kwargs. Change is really just the addition of a couple one-sentence if statements. --------- Co-authored-by: andrasfe <andrasf94@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-21 16:35:37 +00:00
ccurme	e8e3b2bfae	ollama: release 0.3.0 (#30420 )	2025-03-21 15:50:08 +00:00
Bob Merkus	5700646cc5	ollama: add reasoning model support (e.g. deepseek) (#29689 ) # Description This PR adds reasoning model support for `langchain-ollama` by extracting reasoning token blocks, like those used in deepseek. It was inspired by [ollama-deep-researcher](https://github.com/langchain-ai/ollama-deep-researcher), specifically the parsing of [thinking blocks](`6d1aaf2139/src/assistant/graph.py (L91)`): ```python # TODO: This is a hack to remove the <think> tags w/ Deepseek models # It appears very challenging to prompt them out of the responses while "<think>" in running_summary and "</think>" in running_summary: start = running_summary.find("<think>") end = running_summary.find("</think>") + len("</think>") running_summary = running_summary[:start] + running_summary[end:] ``` This notes that it is very hard to remove the reasoning block from prompting, but we actually want the model to reason in order to increase model performance. This implementation extracts the thinking block, so the client can still expect a proper message to be returned by `ChatOllama` (and use the reasoning content separately when desired). This implementation takes the same approach as [ChatDeepseek](`5d581ba22c/libs/partners/deepseek/langchain_deepseek/chat_models.py (L215)`), which adds the reasoning content to chunk.additional_kwargs.reasoning_content; ```python if hasattr(response.choices[0].message, "reasoning_content"): # type: ignore rtn.generations[0].message.additional_kwargs["reasoning_content"] = ( response.choices[0].message.reasoning_content # type: ignore ) ``` This should probably be handled upstream in ollama + ollama-python, but this seems like a reasonably effective solution. This is a standalone example of what is happening; ```python async def deepseek_message_astream( llm: BaseChatModel, messages: list[BaseMessage], config: RunnableConfig \| None = None, , model_target: str = "deepseek-r1", kwargs: Any, ) -> AsyncIterator[BaseMessageChunk]: """Stream responses from Deepseek models, filtering out <think> tags. Args: llm: The language model to stream from messages: The messages to send to the model Yields: Filtered chunks from the model response """ # check if the model is deepseek based if (llm.name and model_target not in llm.name) or (hasattr(llm, "model") and model_target not in llm.model): async for chunk in llm.astream(messages, config=config, kwargs): yield chunk return # Yield with a buffer, upon completing the <think></think> tags, move them to the reasoning content and start over buffer = "" async for chunk in llm.astream(messages, config=config, *kwargs): # start or append if not buffer: buffer = chunk.content else: buffer += chunk.content if hasattr(chunk, "content") else chunk # Process buffer to remove <think> tags if "<think>" in buffer or "</think>" in buffer: if hasattr(chunk, "tool_calls") and chunk.tool_calls: raise NotImplementedError("tool calls during reasoning should be removed?") if "<think>" in chunk.content or "</think>" in chunk.content: continue chunk.additional_kwargs["reasoning_content"] = chunk.content chunk.content = "" # upon block completion, reset the buffer if "<think>" in buffer and "</think>" in buffer: buffer = "" yield chunk ``` # Issue Integrating reasoning models (e.g. deepseek-r1) into existing LangChain based workflows is hard due to the thinking blocks that are included in the message contents. To avoid this, we could match the `ChatOllama` integration with `ChatDeepseek` to return the reasoning content inside `message.additional_arguments.reasoning_content` instead. # Dependenices None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-21 15:44:54 +00:00
ccurme	d8145dda95	xai: release 0.2.2 (#30403 )	2025-03-20 20:25:16 +00:00
ccurme	e194902994	mistral: release 0.2.9 (#30402 )	2025-03-20 20:22:24 +00:00
ccurme	49466ec9ca	groq: release 0.3.1 (#30401 )	2025-03-20 20:19:49 +00:00
ccurme	db1e340387	fireworks: release 0.2.8 (#30400 )	2025-03-20 16:15:51 -04:00
ccurme	de3960d285	multiple: enforce standards on tool_choice (#30372 ) - Test if models support forcing tool calls via `tool_choice`. If they do, they should support - `"any"` to specify any tool - the tool name as a string to force calling a particular tool - Add `tool_choice` to signature of `BaseChatModel.bind_tools` in core - Deprecate `tool_choice_value` in standard tests in favor of a boolean `has_tool_choice` Will follow up with PRs in external repos (tested in AWS and Google already).	2025-03-20 17:48:59 +00:00
ccurme	b86cd8270c	multiple: support `strict` and `method` in with_structured_output (#30385 )	2025-03-20 13:17:07 -04:00
Mohammad Mohtashim	1103bdfaf1	(Ollama) Fix String Value parsing in _parse_arguments_from_tool_call (#30154 ) - Description: Fix String Value parsing in _parse_arguments_from_tool_call - Issue: #30145 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-19 21:47:18 -04:00
ccurme	aae8306d6c	groq: release 0.3.0 (#30374 )	2025-03-19 15:23:30 +00:00
Ashwin	83cfb9691f	Fix typo: change 'ben' to 'be' in comment (#30358 ) Description: This PR fixes a minor typo in the comments within `libs/partners/openai/langchain_openai/chat_models/base.py`. The word "ben" has been corrected to "be" for clarity and professionalism. Issue: N/A Dependencies: None	2025-03-19 10:35:35 -04:00
Lance Martin	46d6bf0330	ollama[minor]: update default method for structured output (#30273 ) From function calling to Ollama's [dedicated structured output feature](https://ollama.com/blog/structured-outputs). --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-18 12:44:22 -04:00
ccurme	b91daf06eb	groq[minor]: remove default model (#30341 ) The default model for `ChatGroq`, `"mixtral-8x7b-32768"`, is being retired on March 20, 2025. Here we remove the default, such that model names must be explicitly specified (being explicit is a good practice here, and avoids the need for breaking changes down the line). This change will be released in a minor version bump to 0.3. This follows https://github.com/langchain-ai/langchain/pull/30161 (released in version 0.2.5), where we began generating warnings to this effect. ![Screenshot 2025-03-18 at 10 33 27 AM](https://github.com/user-attachments/assets/f1e4b302-c62a-43b0-aa86-eaf9271e86cb)	2025-03-18 10:50:34 -04:00
ccurme	5684653775	openai[patch]: release 0.3.9 (#30325 )	2025-03-17 16:08:41 +00:00
ccurme	eb9b992aa6	openai[patch]: support additional Responses API features (#30322 ) - Include response headers - Max tokens - Reasoning effort - Fix bug with structured output / strict - Fix bug with simultaneous tool calling + structured output	2025-03-17 12:02:21 -04:00
ccurme	c74e7b997d	openai[patch]: support structured output via Responses API (#30265 ) Also runs all standard tests using Responses API.	2025-03-14 15:14:23 -04:00
Stavros Kontopoulos	ac22cde130	langchain_ollama: Support keep_alive in embeddings (#30251 ) - Description: Adds support for keep_alive in Ollama Embeddings see https://github.com/ollama/ollama/issues/6401. Builds on top of of https://github.com/langchain-ai/langchain/pull/29296. I have this use case where I want to keep the embeddings model in cpu forever. - Dependencies: no deps are being introduced. - Issue: haven't created an issue yet.	2025-03-14 14:56:50 -04:00
ccurme	d5d0134e7b	anthropic: release 0.3.10 (#30287 )	2025-03-14 16:23:21 +00:00
ccurme	226f29bc96	anthropic: support built-in tools, improve docs (#30274 ) - Support features from recent update: https://www.anthropic.com/news/token-saving-updates (mostly adding support for built-in tools in `bind_tools` - Add documentation around prompt caching, token-efficient tool use, and built-in tools.	2025-03-14 16:18:50 +00:00
ccurme	bbd4b36d76	mistralai[patch]: bump core (#30278 )	2025-03-13 23:04:36 +00:00
ccurme	733abcc884	mistral: release 0.2.8 (#30275 )	2025-03-13 21:54:34 +00:00
ccurme	cd1ea8e94d	openai[patch]: support Responses API (#30231 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2025-03-12 12:25:46 -04:00
ccurme	62c570dd77	standard-tests, openai: bump core (#30202 )	2025-03-10 19:22:24 +00:00
ccurme	f896e701eb	deepseek: install local langchain-tests in test deps (#30198 )	2025-03-10 16:58:17 +00:00
ccurme	b209d46eb3	mistral[patch]: set global ssl context (#30189 )	2025-03-09 21:27:41 +00:00
ccurme	17507c9ba6	groq[patch]: release 0.2.5 (#30168 )	2025-03-07 20:25:51 +00:00
ccurme	74e7772a5f	groq[patch]: warn if model is not specified (#30161 ) Groq is retiring `mixtral-8x7b-32768`, which is currently the default model for ChatGroq, on March 20. Here we emit a warning if the model is not specified explicitly. A version 0.3.0 will be released ahead of March 20 that removes the default altogether.	2025-03-07 15:21:13 -05:00
ccurme	34638ccfae	openai[patch]: release 0.3.8 (#30164 )	2025-03-07 18:26:40 +00:00
ccurme	806211475a	core[patch]: update structured output tracing (#30123 ) - Trace JSON schema in `options` - Rename to `ls_structured_output_format`	2025-03-07 13:05:25 -05:00
ccurme	230876a7c5	anthropic[patch]: add PDF input example to API reference (#30156 )	2025-03-07 14:19:08 +00:00
ccurme	52b0570bec	core, openai, standard-tests: improve OpenAI compatibility with Anthropic content blocks (#30128 ) - Support thinking blocks in core's `convert_to_openai_messages` (pass through instead of error) - Ignore thinking blocks in ChatOpenAI (instead of error) - Support Anthropic-style image blocks in ChatOpenAI --- Standard integration tests include a `supports_anthropic_inputs` property which is currently enabled only for tests on `ChatAnthropic`. This test enforces compatibility with message histories of the form: ``` - system message - human message - AI message with tool calls specified only through `tool_use` content blocks - human message containing `tool_result` and an additional `text` block ``` It additionally checks support for Anthropic-style image inputs if `supports_image_inputs` is enabled. Here we change this test, such that if you enable `supports_anthropic_inputs`: - You support AI messages with text and `tool_use` content blocks - You support Anthropic-style image inputs (if `supports_image_inputs` is enabled) - You support thinking content blocks. That is, we add a test case for thinking content blocks, but we also remove the requirement of handling tool results within HumanMessages (motivated by existing agent abstractions, which should all return ToolMessage). We move that requirement to a ChatAnthropic-specific test.	2025-03-06 09:53:14 -05:00
ccurme	ba5ddb218f	anthropic[patch]: release 0.3.9 (#30103 )	2025-03-04 10:53:55 -05:00
Samuel Dion-Girardeau	ccb64e9f4f	docs: Fix typo in code samples for max_tokens_for_prompt (#30088 ) - Description: Fix typo in code samples for max_tokens_for_prompt. Code blocks had singular "token" but the method has plural "tokens". - Issue: N/A - Dependencies: N/A - Twitter handle: N/A	2025-03-04 09:11:21 -05:00
ccurme	3b066dc005	anthropic[patch]: allow structured output when thinking is enabled (#30047 ) Structured output will currently always raise a BadRequestError when Claude 3.7 Sonnet's `thinking` is enabled, because we rely on forced tool use for structured output and this feature is not supported when `thinking` is enabled. Here we: - Emit a warning if `with_structured_output` is called when `thinking` is enabled. - Raise `OutputParserException` if no tool calls are generated. This is arguably preferable to raising an error in all cases. ```python from langchain_anthropic import ChatAnthropic from pydantic import BaseModel class Person(BaseModel): name: str age: int llm = ChatAnthropic( model="claude-3-7-sonnet-latest", max_tokens=5_000, thinking={"type": "enabled", "budget_tokens": 2_000}, ) structured_llm = llm.with_structured_output(Person) # <-- this generates a warning ``` ```python structured_llm.invoke("Alice is 30.") # <-- works ``` ```python structured_llm.invoke("Hello!") # <-- raises OutputParserException ```	2025-02-28 14:44:11 -05:00
ccurme	f8ed5007ea	anthropic, mistral: return `model_name` in response metadata (#30048 ) Took a "census" of models supported by init_chat_model-- of those that return model names in response metadata, these were the only two that had it keyed under `"model"` instead of `"model_name"`.	2025-02-28 18:56:05 +00:00
ccurme	0dbcc1d099	docs: document anthropic features (#30030 ) Update integrations page with extended thinking feature. Update API reference with extended thinking and citations.	2025-02-27 19:37:04 -05:00
ccurme	6c7c8a164f	openai[patch]: add unit test (#30022 ) Test `max_completion_tokens` is propagated to payload for AzureChatOpenAI.	2025-02-27 11:09:17 -05:00
ccurme	79f5bbfb26	anthropic[patch]: release 0.3.8 (#29973 )	2025-02-24 15:24:35 -05:00
ccurme	ded886f622	anthropic[patch]: support claude 3.7 sonnet (#29971 )	2025-02-24 15:17:47 -05:00
ccurme	b7a1705052	openai[patch]: release 0.3.7 (#29967 )	2025-02-24 11:59:28 -05:00
ccurme	291a232fb8	openai[patch]: set global ssl context (#29932 ) We set ```python global_ssl_context = ssl.create_default_context(cafile=certifi.where()) ``` at the module-level and share it among httpx clients.	2025-02-24 11:25:16 -05:00
ccurme	b1a7f4e106	core, openai[patch]: support serialization of pydantic models in messages (#29940 ) Resolves https://github.com/langchain-ai/langchain/issues/29003, https://github.com/langchain-ai/langchain/issues/27264 Related: https://github.com/langchain-ai/langchain-redis/issues/52 ```python from langchain.chat_models import init_chat_model from langchain.globals import set_llm_cache from langchain_community.cache import SQLiteCache from pydantic import BaseModel cache = SQLiteCache() set_llm_cache(cache) class Temperature(BaseModel): value: int city: str llm = init_chat_model("openai:gpt-4o-mini") structured_llm = llm.with_structured_output(Temperature) ``` ```python # 681 ms response = structured_llm.invoke("What is the average temperature of Rome in May?") ``` ```python # 6.98 ms response = structured_llm.invoke("What is the average temperature of Rome in May?") ```	2025-02-24 09:34:27 -05:00
ccurme	927ec20b69	openai[patch]: update system role to developer for o-series models (#29785 ) Some o-series models will raise a 400 error for `"role": "system"` (`o1-mini` and `o1-preview` will raise, `o1` and `o3-mini` will not). Here we update `ChatOpenAI` to update the role to `"developer"` for all model names matching `^o\d`. We only make this change on the ChatOpenAI class (not BaseChatOpenAI).	2025-02-24 08:59:46 -05:00
Ahmed Tammaa	8b511a3a78	[Exception Handling] DeepSeek JSONDecodeError (#29758 ) For Context please check #29626 The Deepseek is using langchain_openai. The error happens that it show `json decode error`. I added a handler for this to give a more sensible error message which is DeepSeek API returned empty/invalid json. Reproducing the issue is a bit challenging as it is inconsistent, sometimes DeepSeek returns valid data and in other times it returns invalid data which triggers the JSON Decode Error. This PR is an exception handling, but not an ultimate fix for the issue. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-23 15:00:32 -05:00
ccurme	512eb1b764	anthropic[patch]: update models for integration tests (#29938 )	2025-02-23 14:23:48 -05:00
Mohammad Mohtashim	8293142fa0	mistral[patch]: support model_kwargs (#29838 ) - Description: Frequency_penalty added as a client parameter - Issue: #29803 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-20 18:47:39 -05:00
ccurme	d227e4a08e	mistralai[patch]: release 0.2.7 (#29906 )	2025-02-20 17:27:12 +00:00
Hankyeol Kyung	2dd0ce3077	openai: Update reasoning_effort arg documentation (#29897 ) Description: Update docstring for `reasoning_effort` argument to specify that it applies to reasoning models only (e.g., OpenAI o1 and o3-mini), clarifying its supported models. Issue: None Dependencies: None	2025-02-20 09:03:42 -05:00
ccurme	68b13e5172	pinecone: delete from monorepo (#29889 ) This now lives in https://github.com/langchain-ai/langchain-pinecone	2025-02-19 12:55:15 -05:00

1 2 3 4 5 ...

1129 Commits