llama.cpp: gemma: allow offloading the output tensor (#1997)

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
This commit is contained in:
Jared Van Bortel
2024-02-22 14:06:18 -05:00
committed by GitHub
parent c1dcb3f5b8
commit fc6c5ea0c7