ollama: gemma4:31b-coding-mtp-bf16 pull fails with "requires macOS" on Linux NVIDIA DGX Spark

GitHub stats

ollama/ollama#16019 · Fetched 2026-05-07 03:31:36
Comments: 1 · Participants: 2 · Reactions: 1
Timeline events: 3 (subscribed ×2, commented ×1)

Error Message

$ ollama pull gemma4:31b-coding-mtp-bf16
pulling manifest
Error: pull model manifest: 412: this model requires macOS

$ ollama --version
ollama version is 0.23.1


Description:

I am attempting to pull the new gemma4:31b-coding-mtp-bf16 model on a Linux system with NVIDIA hardware (the server log under System Info reports an NVIDIA GB10 GPU using CUDA). Since this is a standard architecture, I expected the model to pull and run via CUDA.

Actual Behavior:

The pull fails immediately with:

$ ollama pull gemma4:31b-coding-mtp-bf16
pulling manifest 
Error: pull model manifest: 412: this model requires macOS

$ ollama --version
ollama version is 0.23.1
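
For anyone scripting around this failure, the error line above can be split into its parts. This is a minimal sketch, assuming the CLI keeps the `Error: <context>: <status>: <message>` shape seen in this report. `parse_pull_error` and `PullError` are hypothetical names for illustration, not part of Ollama. The 412 matches the HTTP status code for Precondition Failed, which suggests the refusal may come from the registry rather than from local GPU detection, though the report does not confirm that.

```python
import re
from typing import NamedTuple, Optional


class PullError(NamedTuple):
    context: str   # e.g. "pull model manifest"
    status: int    # numeric status code, e.g. 412
    message: str   # e.g. "this model requires macOS"


# Assumed shape, inferred from the single error line in this report.
_ERROR_RE = re.compile(
    r"^Error: (?P<context>.+?): (?P<status>\d{3}): (?P<message>.+)$"
)


def parse_pull_error(output: str) -> Optional[PullError]:
    """Return the first matching error line in CLI output, or None."""
    for line in output.splitlines():
        m = _ERROR_RE.match(line.strip())
        if m:
            return PullError(m["context"], int(m["status"]), m["message"])
    return None


# Output copied from the report.
reported = (
    "pulling manifest \n"
    "Error: pull model manifest: 412: this model requires macOS\n"
)
err = parse_pull_error(reported)
print(err.status, err.message)
```

A wrapper script could use the parsed status to tell a platform restriction like this one apart from network or authentication failures.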

System Info

May 06 10:04:33 systemd[1]: Started ollama.service - Ollama Service.
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.107-06:00 level=INFO source=routes.go:1782 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GGML_VK_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLL>
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.107-06:00 level=INFO source=routes.go:1784 msg="Ollama cloud disabled: false"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.120-06:00 level=INFO source=images.go:517 msg="total blobs: 53"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.121-06:00 level=INFO source=images.go:524 msg="total unused blobs removed: 0"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.122-06:00 level=INFO source=routes.go:1847 msg="Listening on [::]:11434 (version 0.23.1)"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.122-06:00 level=INFO source=runner.go:67 msg="discovering available GPUs..."
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.122-06:00 level=INFO source=server.go:433 msg="starting runner" cmd="/usr/local/bin/ollama runner --ollama-engine --port 43199"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.276-06:00 level=INFO source=model_recommendations.go:179 msg="model recommendations cache sleep scheduled" wait=3h54m54.143217423s consecutive_failures=0
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.498-06:00 level=INFO source=server.go:433 msg="starting runner" cmd="/usr/local/bin/ollama runner --ollama-engine --port 42669"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.878-06:00 level=INFO source=server.go:433 msg="starting runner" cmd="/usr/local/bin/ollama runner --ollama-engine --port 35017"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.878-06:00 level=INFO source=server.go:433 msg="starting runner" cmd="/usr/local/bin/ollama runner --ollama-engine --port 45293"
May 06 10:04:34 ollama[505222]: time=2026-05-06T10:04:34.489-06:00 level=INFO source=types.go:42 msg="inference compute" id=GPU-0ab97a9d-1079-edae-f585-30fa6e19c6a2 filter_id="" library=CUDA compute=12.1 name=CUDA0 description="NVIDIA GB10" libdirs=ollama,cuda_v13 driver=13.0 pci_>
May 06 10:04:34 ollama[505222]: time=2026-05-06T10:04:34.489-06:00 level=INFO source=routes.go:1897 msg="vram-based default context" total_vram="119.7 GiB" default_num_ctx=262144
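
The log lines above use a key=value layout, so the detected GPU can be pulled out programmatically when triaging reports like this one. The sketch below is only an illustration, assuming values are either bare or double-quoted; `parse_logfmt` is a hypothetical helper, and the sample line is cut off before the point where journald truncated it.

```python
import shlex


def parse_logfmt(line: str) -> dict:
    """Split a key=value log line into a dict; quoted values keep their spaces."""
    fields = {}
    for token in shlex.split(line):
        key, sep, value = token.partition("=")
        if sep:  # skip tokens without "=", such as a journald prefix
            fields[key] = value
    return fields


# The "inference compute" line from the report, without the truncated tail.
line = (
    'time=2026-05-06T10:04:34.489-06:00 level=INFO source=types.go:42 '
    'msg="inference compute" id=GPU-0ab97a9d-1079-edae-f585-30fa6e19c6a2 '
    'filter_id="" library=CUDA compute=12.1 name=CUDA0 '
    'description="NVIDIA GB10" libdirs=ollama,cuda_v13 driver=13.0'
)
gpu = parse_logfmt(line)
print(gpu["library"], gpu["compute"], gpu["description"])
```

Lines that journald cut off inside a quoted value (such as the "server config" entry) have an unbalanced quote and make `shlex.split` raise `ValueError`, so a real script would need to catch that or read the full lines with `journalctl --no-pager`.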
