ollama - Can't pull qwen3.6:27b-coding-mxfp8 on macOS [1 participant]

ollama 2026-05-01 17:25:24

GitHub stats

ollama/ollama#15919 • Fetched 2026-05-02 05:27:41

Author

mario-grgic

Error Message

$ ollama pull qwen3.6:27b-coding-mxfp8
pulling manifest
Error: pull model manifest: 412: this model requires macOS

What is the issue?

I am using Ollama v0.22.1 on macOS (M4 Max) and trying to run ollama pull qwen3.6:27b-coding-mxfp8. I get:

$ ollama pull qwen3.6:27b-coding-mxfp8
pulling manifest 
Error: pull model manifest: 412: this model requires macOS
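
To read the error line more precisely, the following minimal sketch splits it into a status code and a message. It assumes the number 412 is an HTTP status code returned while fetching the manifest (the standard reason phrase for 412 is "Precondition Failed"); the helper name parse_pull_error and the regular expression are hypothetical and not part of Ollama.

```python
import re
from http import HTTPStatus

# Hypothetical pattern for the CLI error line quoted in this issue.
ERROR_RE = re.compile(
    r"^Error: pull model manifest: (?P<status>\d{3}): (?P<message>.+)$"
)


def parse_pull_error(line: str):
    """Return (status_code, reason_phrase, message), or None if the line does not match."""
    match = ERROR_RE.match(line.strip())
    if match is None:
        return None
    status = int(match.group("status"))
    try:
        # Assumption: the number is a standard HTTP status code.
        phrase = HTTPStatus(status).phrase
    except ValueError:
        phrase = "Unknown"
    return status, phrase, match.group("message")


if __name__ == "__main__":
    print(parse_pull_error("Error: pull model manifest: 412: this model requires macOS"))
```

Read this way, the failure looks like a precondition that was judged unmet during the manifest request, rather than a download or disk error.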

Relevant log output

time=2026-05-01T13:23:52.035-04:00 level=INFO source=routes.go:1782 msg="server config" env="map[HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:8192 OLLAMA_DEBUG:DEBUG OLLAMA_DEBUG_LOG_REQUESTS:false OLLAMA_EDITOR: OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:/Volumes/DATA/dev/ollama/models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:true OLLAMA_NOPRUNE:false OLLAMA_NO_CLOUD:true OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_REMOTES:[ollama.com] OLLAMA_SCHED_SPREAD:false http_proxy: https_proxy: no_proxy:]"
time=2026-05-01T13:23:52.035-04:00 level=INFO source=routes.go:1784 msg="Ollama cloud disabled: true"
time=2026-05-01T13:23:52.036-04:00 level=INFO source=images.go:517 msg="total blobs: 22"
time=2026-05-01T13:23:52.036-04:00 level=INFO source=images.go:524 msg="total unused blobs removed: 0"
[GIN-debug] [WARNING] Creating an Engine instance with the Logger and Recovery middleware already attached.

[GIN-debug] [WARNING] Running in "debug" mode. Switch to "release" mode in production.
 - using env:	export GIN_MODE=release
 - using code:	gin.SetMode(gin.ReleaseMode)

[GIN-debug] HEAD   /                         --> github.com/ollama/ollama/server.(*Server).GenerateRoutes.func1 (5 handlers)
[GIN-debug] GET    /                         --> github.com/ollama/ollama/server.(*Server).GenerateRoutes.func2 (5 handlers)
[GIN-debug] HEAD   /api/version              --> github.com/ollama/ollama/server.(*Server).GenerateRoutes.func3 (5 handlers)
[GIN-debug] GET    /api/version              --> github.com/ollama/ollama/server.(*Server).GenerateRoutes.func4 (5 handlers)
[GIN-debug] GET    /api/status               --> github.com/ollama/ollama/server.(*Server).StatusHandler-fm (5 handlers)
[GIN-debug] POST   /api/pull                 --> github.com/ollama/ollama/server.(*Server).PullHandler-fm (5 handlers)
[GIN-debug] POST   /api/push                 --> github.com/ollama/ollama/server.(*Server).PushHandler-fm (5 handlers)
[GIN-debug] HEAD   /api/tags                 --> github.com/ollama/ollama/server.(*Server).ListHandler-fm (5 handlers)
[GIN-debug] GET    /api/tags                 --> github.com/ollama/ollama/server.(*Server).ListHandler-fm (5 handlers)
[GIN-debug] POST   /api/show                 --> github.com/ollama/ollama/server.(*Server).ShowHandler-fm (5 handlers)
[GIN-debug] DELETE /api/delete               --> github.com/ollama/ollama/server.(*Server).DeleteHandler-fm (5 handlers)
[GIN-debug] POST   /api/me                   --> github.com/ollama/ollama/server.(*Server).WhoamiHandler-fm (5 handlers)
[GIN-debug] POST   /api/signout              --> github.com/ollama/ollama/server.(*Server).SignoutHandler-fm (5 handlers)
[GIN-debug] DELETE /api/user/keys/:encodedKey --> github.com/ollama/ollama/server.(*Server).SignoutHandler-fm (5 handlers)
[GIN-debug] POST   /api/create               --> github.com/ollama/ollama/server.(*Server).CreateHandler-fm (5 handlers)
[GIN-debug] POST   /api/blobs/:digest        --> github.com/ollama/ollama/server.(*Server).CreateBlobHandler-fm (5 handlers)
[GIN-debug] HEAD   /api/blobs/:digest        --> github.com/ollama/ollama/server.(*Server).HeadBlobHandler-fm (5 handlers)
[GIN-debug] POST   /api/copy                 --> github.com/ollama/ollama/server.(*Server).CopyHandler-fm (5 handlers)
[GIN-debug] POST   /api/experimental/web_search --> github.com/ollama/ollama/server.(*Server).WebSearchExperimentalHandler-fm (5 handlers)
[GIN-debug] POST   /api/experimental/web_fetch --> github.com/ollama/ollama/server.(*Server).WebFetchExperimentalHandler-fm (5 handlers)
[GIN-debug] GET    /api/experimental/model-recommendations --> github.com/ollama/ollama/server.(*Server).ModelRecommendationsExperimentalHandler-fm (5 handlers)
[GIN-debug] GET    /api/ps                   --> github.com/ollama/ollama/server.(*Server).PsHandler-fm (5 handlers)
[GIN-debug] POST   /api/generate             --> github.com/ollama/ollama/server.(*Server).GenerateHandler-fm (5 handlers)
[GIN-debug] POST   /api/chat                 --> github.com/ollama/ollama/server.(*Server).ChatHandler-fm (5 handlers)
[GIN-debug] POST   /api/embed                --> github.com/ollama/ollama/server.(*Server).EmbedHandler-fm (5 handlers)
[GIN-debug] POST   /api/embeddings           --> github.com/ollama/ollama/server.(*Server).EmbeddingsHandler-fm (5 handlers)
[GIN-debug] POST   /v1/chat/completions      --> github.com/ollama/ollama/server.(*Server).ChatHandler-fm (7 handlers)
[GIN-debug] POST   /v1/completions           --> github.com/ollama/ollama/server.(*Server).GenerateHandler-fm (7 handlers)
[GIN-debug] POST   /v1/embeddings            --> github.com/ollama/ollama/server.(*Server).EmbedHandler-fm (7 handlers)
[GIN-debug] GET    /v1/models                --> github.com/ollama/ollama/server.(*Server).ListHandler-fm (6 handlers)
[GIN-debug] GET    /v1/models/:model         --> github.com/ollama/ollama/server.(*Server).ShowHandler-fm (7 handlers)
[GIN-debug] POST   /v1/responses             --> github.com/ollama/ollama/server.(*Server).ChatHandler-fm (7 handlers)
[GIN-debug] POST   /v1/images/generations    --> github.com/ollama/ollama/server.(*Server).GenerateHandler-fm (7 handlers)
[GIN-debug] POST   /v1/images/edits          --> github.com/ollama/ollama/server.(*Server).GenerateHandler-fm (7 handlers)
[GIN-debug] POST   /v1/audio/transcriptions  --> github.com/ollama/ollama/server.(*Server).ChatHandler-fm (6 handlers)
[GIN-debug] POST   /v1/messages              --> github.com/ollama/ollama/server.(*Server).ChatHandler-fm (7 handlers)
time=2026-05-01T13:23:52.037-04:00 level=DEBUG source=model_recommendations.go:59 msg="starting model recommendations cache" default_recommendations=6 refresh_interval=4h0m0s fetch_timeout=3s
time=2026-05-01T13:23:52.037-04:00 level=INFO source=routes.go:1847 msg="Listening on 127.0.0.1:11434 (version v0.22.1 git-8f39fff7)"
time=2026-05-01T13:23:52.037-04:00 level=DEBUG source=model_recommendations.go:246 msg="model recommendations snapshot not found" path=/Users/mariogrgic/.ollama/cache/model-recommendations.json
time=2026-05-01T13:23:52.037-04:00 level=DEBUG source=model_recommendations.go:167 msg="skipping model recommendations refresh because cloud is disabled"
time=2026-05-01T13:23:52.037-04:00 level=INFO source=model_recommendations.go:179 msg="model recommendations cache sleep scheduled" wait=4h33m14.062196545s consecutive_failures=0
time=2026-05-01T13:23:52.037-04:00 level=DEBUG source=sched.go:145 msg="starting llm scheduler"
time=2026-05-01T13:23:52.038-04:00 level=INFO source=runner.go:67 msg="discovering available GPUs..."
time=2026-05-01T13:23:52.039-04:00 level=INFO source=server.go:433 msg="starting runner" cmd="/Volumes/DATA/dev/ollama/ollama.git/ollama runner --ollama-engine --port 51163"
time=2026-05-01T13:23:52.039-04:00 level=DEBUG source=server.go:434 msg=subprocess OLLAMA_MODELS=/Volumes/DATA/dev/ollama/models OLLAMA_NOHISTORY=1 OLLAMA_CONTEXT_LENGTH=8192 OLLAMA_DEBUG=1 PATH=/Library/Frameworks/Python.framework/Versions/3.14/bin:/Users/mariogrgic/.cargo/bin:/System/Cryptexes/App/usr/bin:/usr/bin:/bin:/usr/sbin:/sbin:/usr/local/bin:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/local/bin:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/bin:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/appleinternal/bin:/opt/pkg/env/active/bin:/opt/pmk/env/global/bin:/opt/X11/bin:/Library/Apple/usr/bin:/usr/local/Gambit/bin:/usr/local/pgsql/bin:/usr/local/texlive/2025/bin/universal-darwin:/usr/local/ant/bin:/usr/local/cassandra/bin:/usr/local/confluent/bin:/usr/local/elasticsearch/bin:/usr/local/erlang/bin:/usr/local/ghc/bin:/usr/local/go/bin:/usr/local/gradle/bin:/usr/local/memcached/bin:/usr/local/mongodb/bin:/usr/local/maven/bin:/usr/local/mysql/bin:/usr/local/pdftk/bin:/usr/local/sbt/bin:/usr/local/scala/bin:/usr/local/spark/bin:/usr/local/sbin DYLD_LIBRARY_PATH=/Volumes/DATA/dev/ollama/ollama.git OLLAMA_LIBRARY_PATH=/Volumes/DATA/dev/ollama/ollama.git
time=2026-05-01T13:23:52.086-04:00 level=DEBUG source=runner.go:433 msg="bootstrap discovery took" duration=47.701ms OLLAMA_LIBRARY_PATH=[/Volumes/DATA/dev/ollama/ollama.git] extra_envs=map[]
time=2026-05-01T13:23:52.086-04:00 level=DEBUG source=runner.go:123 msg="evaluating which, if any, devices to filter out" initial_count=1
time=2026-05-01T13:23:52.086-04:00 level=DEBUG source=runner.go:192 msg="adjusting filtering IDs" FilterID=0 new_ID=0
time=2026-05-01T13:23:52.086-04:00 level=DEBUG source=runner.go:40 msg="GPU bootstrap discovery took" duration=49.008ms
time=2026-05-01T13:23:52.086-04:00 level=INFO source=types.go:42 msg="inference compute" id=0 filter_id=0 library=Metal compute=0.0 name=Metal description="Apple M4 Max" libdirs="" driver=0.0 pci_id="" type=discrete total="107.5 GiB" available="107.5 GiB"
time=2026-05-01T13:23:52.086-04:00 level=INFO source=routes.go:1897 msg="vram-based default context" total_vram="107.5 GiB" default_num_ctx=262144
[GIN] 2026/05/01 - 13:23:55 | 200 |      40.792µs |       127.0.0.1 | HEAD     "/"
[GIN] 2026/05/01 - 13:23:55 | 200 |  463.865958ms |       127.0.0.1 | POST     "/api/pull"
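
The time=... lines above are key=value pairs with quoted values, so the relevant fields (version, library, device description) can be pulled out mechanically. This is a minimal sketch using only the standard library; parse_log_line is a hypothetical helper, and it is meant for the time=... lines only, not for the [GIN] route or access lines.

```python
import shlex


def parse_log_line(line: str) -> dict:
    """Parse one key=value log line into a dict; quoted values may contain spaces."""
    fields = {}
    for token in shlex.split(line):
        key, sep, value = token.partition("=")
        if sep:  # ignore tokens that are not key=value pairs
            fields[key] = value
    return fields


# The "inference compute" line copied from the log above.
LINE = (
    'time=2026-05-01T13:23:52.086-04:00 level=INFO source=types.go:42 '
    'msg="inference compute" id=0 filter_id=0 library=Metal compute=0.0 '
    'name=Metal description="Apple M4 Max" libdirs="" driver=0.0 pci_id="" '
    'type=discrete total="107.5 GiB" available="107.5 GiB"'
)

if __name__ == "__main__":
    fields = parse_log_line(LINE)
    print(fields["library"], "|", fields["description"], "|", fields["total"])
```

For this log, the parsed fields show that the server detected the Apple M4 Max through Metal, which is consistent with the reporter's claim of running on macOS.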

OS

macOS 26.4.1

GPU

M4 Max

CPU

M4 Max

Ollama version

0.22.1

extent analysis

TL;DR

The pull fails at the manifest step with status 412 and the message "this model requires macOS", even though the reporter is already on macOS 26.4.1 with an M4 Max. The issue does not establish the cause; it may lie in the specific requirements of qwen3.6:27b-coding-mxfp8 or in the Ollama version.

Guidance

Confirm that qwen3.6:27b-coding-mxfp8 is supported by Ollama 0.22.1 on macOS.
Check whether the model has requirements or dependencies that the current environment does not meet.
Update Ollama if a newer version is available, since that may resolve the compatibility issue.
Pull a different model to see whether the failure is specific to this model.

Notes

The log output does not point to a cause. It shows the server listening on 127.0.0.1:11434 as v0.22.1 (git-8f39fff7), detecting the Apple M4 Max through Metal, and logging the POST /api/pull request with status 200. The error names a platform requirement that the environment already meets, so the failing condition is presumably something other than the operating system itself.

Recommendation

As a first diagnostic step, pull a different model to determine whether the failure is specific to qwen3.6:27b-coding-mxfp8 or a broader compatibility problem with Ollama 0.22.1 on macOS.
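
One way to run that comparison is to send the pull request to the local server directly and compare the raw responses for the failing model and for another one. The sketch below is illustrative only: the POST /api/pull route and the 127.0.0.1:11434 address come from the log, while the JSON fields "model" and "stream" are assumptions about the request body and should be checked against the Ollama API documentation. The helper names are hypothetical, and the model names are supplied on the command line.

```python
import json
import sys
import urllib.error
import urllib.request

BASE_URL = "http://127.0.0.1:11434"  # OLLAMA_HOST as shown in the log


def build_pull_request(model: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a POST /api/pull request for the given model."""
    # Assumption: the body uses the "model" and "stream" fields.
    body = json.dumps({"model": model, "stream": False}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/api/pull",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def try_pull(model: str):
    """Send the pull request and return (http_status, response_text)."""
    request = build_pull_request(model)
    try:
        with urllib.request.urlopen(request) as response:
            return response.status, response.read().decode("utf-8", "replace")
    except urllib.error.HTTPError as err:
        # Error statuses raise HTTPError; keep the body for comparison.
        return err.code, err.read().decode("utf-8", "replace")


if __name__ == "__main__":
    # Pass the failing model and one other model name as arguments.
    for name in sys.argv[1:]:
        print(name, try_pull(name))
```

If the second model pulls cleanly while qwen3.6:27b-coding-mxfp8 still fails, the problem is likely tied to that model's requirements rather than to the installation as a whole.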
