ollama - 💡(How to fix) Fix is there something wrong with the models folder setting [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
ollama/ollama#14988Fetched 2026-04-08 01:08:19
View on GitHub
Comments
0
Participants
1
Timeline
1
Reactions
0
Author
Participants
Timeline (top)
labeled ×1
RAW_BUFFERClick to expand / collapse

What is the issue?

I recently found that on my Windows, ollama.exe loads my models into 100% CPU, and ollama serve which is from cli, load my models as 60%/40% CPU/GPU, and another deferent thing is that i 've already pulled 6 models which ollama.exe put them into the folder D:\.ollama\models\ that i set in ollama.exe's setting, but in the situation that ollama serve runs ollama ls, then i see only 2 models listing and i found the 2 models refers to the default model folder at C:\Users\Me\.ollama\models\. i wonder why and i was trying to figure out why the model could loads into parts of GPU and parts of CPU with command line ollama serve and ollama run <model>, but the model loads into fully CPU with ollama.exe which is evry slowly loading and responsing 🤣.

Relevant log output

OS

Windows

GPU

Nvidia

CPU

Intel

Ollama version

0.18.2

extent analysis

Fix Plan

To resolve the issue of ollama.exe loading models into 100% CPU and the discrepancy in model listing between ollama.exe and ollama serve, follow these steps:

  • Configure Ollama to use GPU: Ensure that Ollama is configured to utilize the GPU for model loading and inference. This can be achieved by setting the OLLAMA_DEVICE environment variable to cuda (for Nvidia GPUs) before running ollama.exe or ollama serve.
  • Specify model directory: Use the --models-dir flag with ollama serve to specify the custom model directory (D:\.ollama\models\) to ensure that all models are listed and loaded correctly.
  • Update Ollama configuration: Modify the Ollama configuration file to include the custom model directory and GPU settings. This can be done by adding the following lines to the configuration file:
device: cuda
models_dir: D:\.ollama\models\
  • Verify GPU usage: Use tools like nvidia-smi to monitor GPU usage and ensure that Ollama is utilizing the GPU for model loading and inference.

Example Code

To set the OLLAMA_DEVICE environment variable and specify the custom model directory, use the following commands:

set OLLAMA_DEVICE=cuda
ollama serve --models-dir D:\.ollama\models\

Alternatively, you can modify the Ollama configuration file to include the custom settings:

# ollama.yml
device: cuda
models_dir: D:\.ollama\models\

Then, run ollama serve without specifying the --models-dir flag.

Verification

To verify that the fix worked, monitor the CPU and GPU usage using tools like taskmgr and nvidia-smi. Ensure that Ollama is utilizing the GPU for model loading and inference, and that all models are listed correctly when running ollama ls.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING

ollama - 💡(How to fix) Fix is there something wrong with the models folder setting [1 participants]