fix: add numpy issue to troubleshooting (#2048)

* docs: add numpy issue to troubleshooting

* fix: troubleshooting link

...
Javier Martinez 2024-08-07 12:16:03 +02:00 committed by GitHub
parent b16abbefe4
commit 4ca6d0cb55
2 changed files with 22 additions and 5 deletions


@@ -307,11 +307,12 @@ If you have all required dependencies properly configured running the
 following powershell command should succeed.
 ```powershell
-$env:CMAKE_ARGS='-DLLAMA_CUBLAS=on'; poetry run pip install --force-reinstall --no-cache-dir llama-cpp-python
+$env:CMAKE_ARGS='-DLLAMA_CUBLAS=on'; poetry run pip install --force-reinstall --no-cache-dir llama-cpp-python numpy==1.26.0
 ```
 If your installation was correct, you should see a message similar to the following next
-time you start the server `BLAS = 1`.
+time you start the server `BLAS = 1`. If you run into any issues, please refer to the
+[troubleshooting](/installation/getting-started/troubleshooting#building-llama-cpp-with-nvidia-gpu-support) section.
 ```console
 llama_new_context_with_model: total VRAM used: 4857.93 MB (model: 4095.05 MB, context: 762.87 MB)
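A quick way to verify that the reinstall picked up both packages is to import them from the project environment. This one-liner is a hedged sketch rather than part of the documented steps; it should behave the same from PowerShell or bash:

```bash
# Sanity check (an assumption, not part of the docs above): both imports should
# succeed, and numpy should report the pinned 1.26.0.
poetry run python -c "import numpy, llama_cpp; print(numpy.__version__, llama_cpp.__version__)"
```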
@@ -339,11 +340,12 @@ Some tips:
 After that running the following command in the repository will install llama.cpp with GPU support:
 ```bash
-CMAKE_ARGS='-DLLAMA_CUBLAS=on' poetry run pip install --force-reinstall --no-cache-dir llama-cpp-python
+CMAKE_ARGS='-DLLAMA_CUBLAS=on' poetry run pip install --force-reinstall --no-cache-dir llama-cpp-python numpy==1.26.0
 ```
 If your installation was correct, you should see a message similar to the following next
-time you start the server `BLAS = 1`.
+time you start the server `BLAS = 1`. If you run into any issues, please refer to the
+[troubleshooting](/installation/getting-started/troubleshooting#building-llama-cpp-with-nvidia-gpu-support) section.
 ```
 llama_new_context_with_model: total VRAM used: 4857.93 MB (model: 4095.05 MB, context: 762.87 MB)
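Before rebuilding with `-DLLAMA_CUBLAS=on`, it can also help to confirm that the CUDA toolkit and driver are visible at all. This check is an assumption about a typical setup, not a step required by the docs:

```bash
# Hedged pre-build check: both commands should succeed before rebuilding
# llama-cpp-python with CUBLAS support.
nvcc --version   # compiler from the CUDA toolkit
nvidia-smi       # driver and GPU visibility
```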


@@ -46,4 +46,19 @@ huggingface:
 embedding:
   embed_dim: 384
 ```
 </Callout>
+
+# Building Llama-cpp with NVIDIA GPU support
+
+## Out-of-memory error
+
+If you encounter an out-of-memory error while running `llama-cpp` with CUDA, you can try the following steps to resolve the issue:
+1. **Set the following environment variable:**
+```bash
+export TOKENIZERS_PARALLELISM=true
+```
+2. **Run PrivateGPT:**
+```bash
+poetry run python -m private_gpt
+```
+Thanks to [MarioRossiGithub](https://github.com/MarioRossiGithub) for providing this solution.
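For a single run, the two steps above can be collapsed into one line so the variable applies only to that invocation. This is a sketch of the same steps under a POSIX shell, not an additional requirement:

```bash
# One-liner equivalent of steps 1 and 2 above (assumes bash or zsh).
TOKENIZERS_PARALLELISM=true poetry run python -m private_gpt
```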