community[patch]: Added support for Ollama's num_predict option in ChatOllama (#16633)

A simple addition to the default options payload for an Ollama generate call, so that a `max_new_tokens`-style limit can be set via Ollama's `num_predict` option.

Should fix issue: https://github.com/langchain-ai/langchain/issues/14715
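
For context, a minimal usage sketch (assuming the `langchain_community` import path of this era, and that `num_predict` becomes a regular constructor field forwarded into the options payload):

```python
from langchain_community.chat_models import ChatOllama

# num_predict caps how many tokens Ollama will generate, playing the
# role that max_new_tokens plays in other backends. The value 128 is
# arbitrary, chosen just for illustration.
llm = ChatOllama(model="llama2", num_predict=128)

result = llm.invoke("Explain what num_predict controls, briefly.")
print(result.content)
```

Ollama treats `num_predict` as the maximum number of tokens to predict; omitting it falls back to the server's default.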
Micah Parker
2024-01-26 17:00:19 -06:00
committed by GitHub
parent 6a75ef74ca
commit 6543e585a5
2 changed files with 7 additions and 0 deletions

@@ -88,6 +88,7 @@ def test_handle_kwargs_top_level_parameters(monkeypatch: MonkeyPatch) -> None:
     "num_ctx": None,
     "num_gpu": None,
     "num_thread": None,
+    "num_predict": None,
     "repeat_last_n": None,
     "repeat_penalty": None,
     "stop": [],
@@ -133,6 +134,7 @@ def test_handle_kwargs_with_unknown_param(monkeypatch: MonkeyPatch) -> None:
     "num_ctx": None,
     "num_gpu": None,
     "num_thread": None,
+    "num_predict": None,
     "repeat_last_n": None,
     "repeat_penalty": None,
     "stop": [],