llamamodel: use greedy sampling when temp=0 (#2854)

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
2025-09-03 09:34:50 +00:00 · 2024-08-13 17:04:50 -04:00
parent 8ccf1fa2f5
commit 6518b33697
3 changed files with 27 additions and 10 deletions
--- a/gpt4all-bindings/python/CHANGELOG.md
+++ b/gpt4all-bindings/python/CHANGELOG.md
@@ -6,6 +6,9 @@ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/).

 ## [Unreleased]

+### Added
+- Use greedy sampling when temperature is set to zero ([#2854](https://github.com/nomic-ai/gpt4all/pull/2854))
+
 ### Changed
 - Search for pip-installed CUDA 11 as well as CUDA 12 ([#2802](https://github.com/nomic-ai/gpt4all/pull/2802))
 - Stop shipping CUBINs to reduce wheel size ([#2802](https://github.com/nomic-ai/gpt4all/pull/2802))