ollama - 💡(How to fix) Fix Ollama can't run hf.co/unsloth/Qwen3.5-35B-A3B-GGUF:Q8_0 [1 participants]

GitHub stats
ollama/ollama#14732
Fetched 2026-04-08 00:32:22
Comments: 0
Participants: 1
Timeline: 3
Reactions: 0
Timeline (top): closed ×1, labeled ×1, subscribed ×1

Error Message

Although the Unsloth website (https://unsloth.ai/docs/zh/mo-xing/qwen3.5) says Ollama is supported and provides the command to run the model with Ollama, I still get the error below on a Mac:

Error: 500 Internal Server Error: unable to load model: /Users/elton/.ollama/models/blobs/sha256-3808866c016ab02b4adb26b873f7008a2cdd2c0704a39704050119ab0631db46


What is the issue?

Although the Unsloth website (https://unsloth.ai/docs/zh/mo-xing/qwen3.5) says Ollama is supported and provides the command to run the model with Ollama, I still get the error below on a Mac:

$ ollama run hf.co/unsloth/Qwen3.5-35B-A3B-GGUF:Q8_0
pulling manifest
pulling 3808866c016a: 100% ▕███████████████████████████████▏ 36 GB
pulling a516ab92e824: 100% ▕███████████████████████████████▏ 899 MB
pulling 26f0a43935dc: 100% ▕███████████████████████████████▏ 490 B
verifying sha256 digest
writing manifest
success
Error: 500 Internal Server Error: unable to load model: /Users/elton/.ollama/models/blobs/sha256-3808866c016ab02b4adb26b873f7008a2cdd2c0704a39704050119ab0631db46
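The path at the end of the error identifies the blob that failed to load, which makes it easy to check whether the file is present and roughly the expected size. The following is a minimal sketch, not part of the original report; `blob_path_from_error` is a hypothetical helper name, and the script only inspects the file without modifying it.

```shell
#!/bin/sh
# Hypothetical helper (not an Ollama command): extract the blob path from an
# "unable to load model" error line. Prints nothing if the line does not match.
blob_path_from_error() {
    printf '%s\n' "$1" | sed -n 's/.*unable to load model: *//p'
}

# Error line copied from the report above.
err='Error: 500 Internal Server Error: unable to load model: /Users/elton/.ollama/models/blobs/sha256-3808866c016ab02b4adb26b873f7008a2cdd2c0704a39704050119ab0631db46'

blob=$(blob_path_from_error "$err")
echo "blob: $blob"

# Show the file size when the blob exists on this machine.
if [ -f "$blob" ]; then
    ls -lh "$blob"
else
    echo "blob not found locally"
fi
```

A blob that is present at about 36 GB suggests the failure happens at load time rather than during download.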

Relevant log output

OS

macOS

GPU

Apple

CPU

No response

Ollama version

0.17.7

extent analysis

Fix Plan

The model downloads successfully but fails to load on a Mac running Ollama 0.17.7. The pull output shows that the digest was verified, so a corrupted download is unlikely; more plausible causes are insufficient memory for the 36 GB Q8_0 weights or an Ollama build that cannot load this model. To narrow it down:

  • Confirm the Mac has enough memory for the model
  • Verify the model download and loading process

Here are the concrete steps:

  1. Stop the current Ollama process, then start it again (reopen the Ollama app, or run ollama serve):

pkill -f ollama

  2. Check the available memory: Ollama has no --memory flag, and on macOS it runs natively rather than in a container, so there is no memory allocation to raise. The Q8_0 weights alone are 36 GB, so the Mac needs more unified memory than that. To print the installed memory in bytes:

sysctl -n hw.memsize

  3. Verify the model download and loading process: Pull the model explicitly:

ollama pull hf.co/unsloth/Qwen3.5-35B-A3B-GGUF:Q8_0

Then run it again. ollama run has no --no-pull flag; when the model is already present locally, it is loaded without downloading the layers again:

ollama run hf.co/unsloth/Qwen3.5-35B-A3B-GGUF:Q8_0
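Whether the 36 GB weights can fit in memory at all can be checked before loading. The sketch below compares the model size from the pull output with the machine's physical memory. `fits_in_memory` is a hypothetical helper, and the 4 GB headroom is an assumed margin for illustration, not a figure published by Ollama; actual requirements also depend on context length and other running applications.

```shell
#!/bin/sh
# Hypothetical helper: succeeds when total memory exceeds the model size
# plus a headroom margin. All arguments are in bytes.
fits_in_memory() {
    model_bytes=$1
    mem_bytes=$2
    headroom_bytes=${3:-0}
    [ "$mem_bytes" -gt $((model_bytes + headroom_bytes)) ]
}

GB=1000000000
model_bytes=$((36 * GB))      # approximate size shown in the pull output
headroom_bytes=$((4 * GB))    # assumed margin, not an Ollama figure

# hw.memsize is the macOS sysctl key for physical memory in bytes.
if mem_bytes=$(sysctl -n hw.memsize 2>/dev/null) && [ -n "$mem_bytes" ]; then
    if fits_in_memory "$model_bytes" "$mem_bytes" "$headroom_bytes"; then
        echo "memory looks sufficient for the weights"
    else
        echo "model is likely too large for this machine"
    fi
else
    echo "hw.memsize unavailable (not macOS?)"
fi
```

If the check fails, a smaller quantization of the same model is the usual alternative to try.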

Verification

To verify that the fix worked, check the Ollama server log for error messages; on macOS it is at ~/.ollama/logs/server.log. If the model loads successfully, ollama run opens an interactive prompt instead of returning the 500 error.
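Scanning the server log for recent error lines is usually quicker than reading it in full. This is a rough sketch, assuming the default macOS log location ~/.ollama/logs/server.log; `recent_errors` is a hypothetical helper, and the exact wording of Ollama's log messages is not assumed here, only that relevant lines contain the word "error".

```shell
#!/bin/sh
# Hypothetical helper: print the last N lines of a log file that mention
# "error" (case-insensitive). Returns 1 if the log file does not exist.
recent_errors() {
    log_file=$1
    max_lines=${2:-20}
    [ -f "$log_file" ] || { echo "no log at $log_file" >&2; return 1; }
    grep -i 'error' "$log_file" | tail -n "$max_lines"
}

# Default macOS location of the Ollama server log.
recent_errors "$HOME/.ollama/logs/server.log" 20 || true
```

Running it right after a failed `ollama run` keeps the relevant lines near the end of the output.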

Extra Tips

  • Make sure the Mac has enough free disk space and memory for the model: the Q8_0 download alone is about 36 GB.
  • If the problem persists, update Ollama to the latest version or ask the Ollama community for help.
