feat(llm): autopull ollama models (#2019)

* chore: update ollama (llm)

* feat: allow autopulling ollama models

* fix: mypy

* chore: always install ollama client

* refactor: move ollama connection check and pull method to utils (see the sketch below)

* docs: update ollama config with autopulling info
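A minimal sketch of what such a utils helper might look like, assuming the official `ollama` Python client; the function names and the response shape are illustrative assumptions, not the repository's verbatim code:

```python
from ollama import Client

def check_connection(client: Client) -> bool:
    """Return True if the Ollama server answers; used before any pull."""
    try:
        client.list()  # lightweight call that fails fast when the server is down
        return True
    except Exception:
        return False

def pull_model(client: Client, model_name: str) -> None:
    """Pull model_name only if it is not already available locally."""
    # Assumption: client.list() returns {'models': [{'name': 'mistral:latest', ...}]}
    # as in 2024-era versions of the client; newer versions use typed objects.
    installed = {m["name"].split(":")[0] for m in client.list().get("models", [])}
    if model_name.split(":")[0] not in installed:
        client.pull(model_name)  # blocks until the download completes
```

With autopull enabled, PGPT can call these helpers at startup for the configured LLM and embedding models instead of requiring manual `ollama pull` commands.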
Javier Martinez authored on 2024-07-29 13:25:42 +02:00 · committed by GitHub
parent dabf556dae · commit 20bad17c98
8 changed files with 129 additions and 21 deletions

@@ -130,18 +130,22 @@ Go to [ollama.ai](https://ollama.ai/) and follow the instructions to install Ollama
After the installation, make sure the Ollama desktop app is closed.
Now, start Ollama service (it will start a local inference server, serving both the LLM and the Embeddings):
```bash
ollama serve
```
Install the models to be used; the default settings-ollama.yaml is configured to use `mistral 7b` LLM (~4GB) and `nomic-embed-text` Embeddings (~275MB).
By default, PGPT will automatically pull models as needed. This behavior can be changed by modifying the `ollama.autopull_models` property in your settings, as sketched after the commands below.
In any case, if you want to pull models manually, run the following commands:
```bash
ollama pull mistral
ollama pull nomic-embed-text
```
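For reference, a sketch of how the toggle might appear in settings-ollama.yaml; the surrounding keys are assumptions based on the default settings file, and only `autopull_models` is the property this change adds:

```yaml
ollama:
  llm_model: mistral                 # default LLM (~4GB)
  embedding_model: nomic-embed-text  # default Embeddings (~275MB)
  autopull_models: true              # set to false to require manual `ollama pull`
```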
Once done, on a different terminal, you can install PrivateGPT with the following command:
```bash
poetry install --extras "ui llms-ollama embeddings-ollama vector-stores-qdrant"