ollama - gemma4:31b-coding-mtp-bf16 fails with "requires macOS" error on Linux NVIDIA DGX Spark [1 comment, 2 participants]

ollama · 2026-05-06 16:29:26

GitHub stats

ollama/ollama#16019 • Fetched 2026-05-07 03:31:36

Author

sjgiewont

Participants

rick-github

sjgiewont

Timeline (top)

subscribed ×2, commented ×1

Description:

I am attempting to pull the new gemma4:31b-coding-mtp-bf16 model on a Linux system with NVIDIA hardware. Since this is a standard architecture, it should pull and run via CUDA.

Actual Behavior:

The pull fails immediately with:

$ ollama pull gemma4:31b-coding-mtp-bf16
pulling manifest 
Error: pull model manifest: 412: this model requires macOS

$ ollama --version
ollama version is 0.23.1
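
For anyone triaging the same message, here is a minimal Python sketch that splits the error line into its parts. It assumes the number before the text is an HTTP status code returned while fetching the manifest (412 is "Precondition Failed" in HTTP), which would suggest the refusal happens at the manifest request and not in the local CUDA setup. The regular expression and the `parse_pull_error` helper are hypothetical, modeled only on the single line shown above; they are not part of Ollama.

```python
import re
from http import HTTPStatus

# Assumed shape, taken only from the transcript in this issue:
#   "Error: pull model manifest: <status>: <message>"
PULL_ERROR = re.compile(r"^Error: pull model manifest: (\d{3}): (.+)$")


def parse_pull_error(line):
    """Return (status, phrase, message), or None if the line does not match."""
    match = PULL_ERROR.match(line.strip())
    if match is None:
        return None
    status = int(match.group(1))
    try:
        # Look up the standard HTTP reason phrase for the status code.
        phrase = HTTPStatus(status).phrase
    except ValueError:
        phrase = "unknown"
    return status, phrase, match.group(2)


result = parse_pull_error(
    "Error: pull model manifest: 412: this model requires macOS"
)
print(result)  # (412, 'Precondition Failed', 'this model requires macOS')
```

If that reading is right, the message is a platform restriction attached to this model tag, so changing GPU drivers or environment variables on the Linux host would not be expected to help.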

System Info

May 06 10:04:33 systemd[1]: Started ollama.service - Ollama Service.
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.107-06:00 level=INFO source=routes.go:1782 msg="server config" env="map[CUDA_VISIBLE_DEVICES: GGML_VK_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLL>
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.107-06:00 level=INFO source=routes.go:1784 msg="Ollama cloud disabled: false"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.120-06:00 level=INFO source=images.go:517 msg="total blobs: 53"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.121-06:00 level=INFO source=images.go:524 msg="total unused blobs removed: 0"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.122-06:00 level=INFO source=routes.go:1847 msg="Listening on [::]:11434 (version 0.23.1)"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.122-06:00 level=INFO source=runner.go:67 msg="discovering available GPUs..."
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.122-06:00 level=INFO source=server.go:433 msg="starting runner" cmd="/usr/local/bin/ollama runner --ollama-engine --port 43199"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.276-06:00 level=INFO source=model_recommendations.go:179 msg="model recommendations cache sleep scheduled" wait=3h54m54.143217423s consecutive_failures=0
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.498-06:00 level=INFO source=server.go:433 msg="starting runner" cmd="/usr/local/bin/ollama runner --ollama-engine --port 42669"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.878-06:00 level=INFO source=server.go:433 msg="starting runner" cmd="/usr/local/bin/ollama runner --ollama-engine --port 35017"
May 06 10:04:33 ollama[505222]: time=2026-05-06T10:04:33.878-06:00 level=INFO source=server.go:433 msg="starting runner" cmd="/usr/local/bin/ollama runner --ollama-engine --port 45293"
May 06 10:04:34 ollama[505222]: time=2026-05-06T10:04:34.489-06:00 level=INFO source=types.go:42 msg="inference compute" id=GPU-0ab97a9d-1079-edae-f585-30fa6e19c6a2 filter_id="" library=CUDA compute=12.1 name=CUDA0 description="NVIDIA GB10" libdirs=ollama,cuda_v13 driver=13.0 pci_>
May 06 10:04:34 ollama[505222]: time=2026-05-06T10:04:34.489-06:00 level=INFO source=routes.go:1897 msg="vram-based default context" total_vram="119.7 GiB" default_num_ctx=262144
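
The log above already shows the server detecting the GPU through CUDA, so hardware discovery does not look like the problem. As a rough illustration, the key=value fields of the "inference compute" line can be pulled out with a few lines of Python. The `parse_logfmt` helper is hypothetical, and the sample line is copied from the log with the journald prefix dropped; the trailing `pci_>` is the truncation marker from the original output and is skipped because it has no `=`.

```python
import shlex


def parse_logfmt(line):
    """Split a key=value log line into a dict; tokens without '=' are skipped."""
    fields = {}
    # shlex.split keeps quoted values such as "NVIDIA GB10" together.
    for token in shlex.split(line):
        key, sep, value = token.partition("=")
        if sep:
            fields[key] = value
    return fields


line = (
    'time=2026-05-06T10:04:34.489-06:00 level=INFO source=types.go:42 '
    'msg="inference compute" id=GPU-0ab97a9d-1079-edae-f585-30fa6e19c6a2 '
    'filter_id="" library=CUDA compute=12.1 name=CUDA0 '
    'description="NVIDIA GB10" libdirs=ollama,cuda_v13 driver=13.0 pci_>'
)

gpu = parse_logfmt(line)
print(gpu["library"], gpu["compute"], gpu["description"])  # CUDA 12.1 NVIDIA GB10
```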
