ollama - Can't pull qwen3.6:27b-coding-mxfp8 on macOS

GitHub stats
ollama/ollama#15919 (fetched 2026-05-02 05:27:41)
Comments: 0
Participants: 1
Timeline: 1
Reactions: 0
Timeline (top): labeled ×1

Error Message

$ ollama pull qwen3.6:27b-coding-mxfp8
pulling manifest
Error: pull model manifest: 412: this model requires macOS


What is the issue?

I am using Ollama v0.22.1 on macOS (M4 Max) and trying to run ollama pull qwen3.6:27b-coding-mxfp8. I get:

$ ollama pull qwen3.6:27b-coding-mxfp8
pulling manifest 
Error: pull model manifest: 412: this model requires macOS
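
The error line above packs an operation, a numeric code, and a message into one string. The following is a minimal sketch for splitting it apart when triaging similar reports. It assumes the 412 is the HTTP status returned by the registry (412 is "Precondition Failed" in HTTP), and the "Error: <operation>: <status>: <message>" shape is inferred from this single line, not taken from Ollama's source. The names ERROR_RE and parse_pull_error are hypothetical.

```python
import re
from http import HTTPStatus

# Shape inferred from the one error line quoted in this issue (an assumption):
#   Error: <operation>: <3-digit status>: <message>
ERROR_RE = re.compile(
    r"^Error: (?P<operation>.+?): (?P<status>\d{3}): (?P<message>.+)$"
)


def parse_pull_error(line):
    """Split a CLI error line into its parts, or return None if it does not match."""
    match = ERROR_RE.match(line.strip())
    if match is None:
        return None
    status = int(match.group("status"))
    try:
        # Look up the standard HTTP reason phrase for the code.
        reason = HTTPStatus(status).phrase
    except ValueError:
        reason = "unknown"
    return {
        "operation": match.group("operation"),
        "status": status,
        "reason": reason,
        "message": match.group("message"),
    }


if __name__ == "__main__":
    print(parse_pull_error(
        "Error: pull model manifest: 412: this model requires macOS"
    ))
```

Read this way, the rejection happens at the "pull model manifest" step, and the text after the code is the message that accompanied the 412.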

Relevant log output

time=2026-05-01T13:23:52.035-04:00 level=INFO source=routes.go:1782 msg="server config" env="map[HTTPS_PROXY: HTTP_PROXY: NO_PROXY: OLLAMA_CONTEXT_LENGTH:8192 OLLAMA_DEBUG:DEBUG OLLAMA_DEBUG_LOG_REQUESTS:false OLLAMA_EDITOR: OLLAMA_FLASH_ATTENTION:false OLLAMA_GPU_OVERHEAD:0 OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_KEEP_ALIVE:5m0s OLLAMA_KV_CACHE_TYPE: OLLAMA_LLM_LIBRARY: OLLAMA_LOAD_TIMEOUT:5m0s OLLAMA_MAX_LOADED_MODELS:0 OLLAMA_MAX_QUEUE:512 OLLAMA_MODELS:/Volumes/DATA/dev/ollama/models OLLAMA_MULTIUSER_CACHE:false OLLAMA_NEW_ENGINE:false OLLAMA_NOHISTORY:true OLLAMA_NOPRUNE:false OLLAMA_NO_CLOUD:true OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://* vscode-webview://* vscode-file://*] OLLAMA_REMOTES:[ollama.com] OLLAMA_SCHED_SPREAD:false http_proxy: https_proxy: no_proxy:]"
time=2026-05-01T13:23:52.035-04:00 level=INFO source=routes.go:1784 msg="Ollama cloud disabled: true"
time=2026-05-01T13:23:52.036-04:00 level=INFO source=images.go:517 msg="total blobs: 22"
time=2026-05-01T13:23:52.036-04:00 level=INFO source=images.go:524 msg="total unused blobs removed: 0"
[GIN-debug] [WARNING] Creating an Engine instance with the Logger and Recovery middleware already attached.

[GIN-debug] [WARNING] Running in "debug" mode. Switch to "release" mode in production.
 - using env:	export GIN_MODE=release
 - using code:	gin.SetMode(gin.ReleaseMode)

[GIN-debug] HEAD   /                         --> github.com/ollama/ollama/server.(*Server).GenerateRoutes.func1 (5 handlers)
[GIN-debug] GET    /                         --> github.com/ollama/ollama/server.(*Server).GenerateRoutes.func2 (5 handlers)
[GIN-debug] HEAD   /api/version              --> github.com/ollama/ollama/server.(*Server).GenerateRoutes.func3 (5 handlers)
[GIN-debug] GET    /api/version              --> github.com/ollama/ollama/server.(*Server).GenerateRoutes.func4 (5 handlers)
[GIN-debug] GET    /api/status               --> github.com/ollama/ollama/server.(*Server).StatusHandler-fm (5 handlers)
[GIN-debug] POST   /api/pull                 --> github.com/ollama/ollama/server.(*Server).PullHandler-fm (5 handlers)
[GIN-debug] POST   /api/push                 --> github.com/ollama/ollama/server.(*Server).PushHandler-fm (5 handlers)
[GIN-debug] HEAD   /api/tags                 --> github.com/ollama/ollama/server.(*Server).ListHandler-fm (5 handlers)
[GIN-debug] GET    /api/tags                 --> github.com/ollama/ollama/server.(*Server).ListHandler-fm (5 handlers)
[GIN-debug] POST   /api/show                 --> github.com/ollama/ollama/server.(*Server).ShowHandler-fm (5 handlers)
[GIN-debug] DELETE /api/delete               --> github.com/ollama/ollama/server.(*Server).DeleteHandler-fm (5 handlers)
[GIN-debug] POST   /api/me                   --> github.com/ollama/ollama/server.(*Server).WhoamiHandler-fm (5 handlers)
[GIN-debug] POST   /api/signout              --> github.com/ollama/ollama/server.(*Server).SignoutHandler-fm (5 handlers)
[GIN-debug] DELETE /api/user/keys/:encodedKey --> github.com/ollama/ollama/server.(*Server).SignoutHandler-fm (5 handlers)
[GIN-debug] POST   /api/create               --> github.com/ollama/ollama/server.(*Server).CreateHandler-fm (5 handlers)
[GIN-debug] POST   /api/blobs/:digest        --> github.com/ollama/ollama/server.(*Server).CreateBlobHandler-fm (5 handlers)
[GIN-debug] HEAD   /api/blobs/:digest        --> github.com/ollama/ollama/server.(*Server).HeadBlobHandler-fm (5 handlers)
[GIN-debug] POST   /api/copy                 --> github.com/ollama/ollama/server.(*Server).CopyHandler-fm (5 handlers)
[GIN-debug] POST   /api/experimental/web_search --> github.com/ollama/ollama/server.(*Server).WebSearchExperimentalHandler-fm (5 handlers)
[GIN-debug] POST   /api/experimental/web_fetch --> github.com/ollama/ollama/server.(*Server).WebFetchExperimentalHandler-fm (5 handlers)
[GIN-debug] GET    /api/experimental/model-recommendations --> github.com/ollama/ollama/server.(*Server).ModelRecommendationsExperimentalHandler-fm (5 handlers)
[GIN-debug] GET    /api/ps                   --> github.com/ollama/ollama/server.(*Server).PsHandler-fm (5 handlers)
[GIN-debug] POST   /api/generate             --> github.com/ollama/ollama/server.(*Server).GenerateHandler-fm (5 handlers)
[GIN-debug] POST   /api/chat                 --> github.com/ollama/ollama/server.(*Server).ChatHandler-fm (5 handlers)
[GIN-debug] POST   /api/embed                --> github.com/ollama/ollama/server.(*Server).EmbedHandler-fm (5 handlers)
[GIN-debug] POST   /api/embeddings           --> github.com/ollama/ollama/server.(*Server).EmbeddingsHandler-fm (5 handlers)
[GIN-debug] POST   /v1/chat/completions      --> github.com/ollama/ollama/server.(*Server).ChatHandler-fm (7 handlers)
[GIN-debug] POST   /v1/completions           --> github.com/ollama/ollama/server.(*Server).GenerateHandler-fm (7 handlers)
[GIN-debug] POST   /v1/embeddings            --> github.com/ollama/ollama/server.(*Server).EmbedHandler-fm (7 handlers)
[GIN-debug] GET    /v1/models                --> github.com/ollama/ollama/server.(*Server).ListHandler-fm (6 handlers)
[GIN-debug] GET    /v1/models/:model         --> github.com/ollama/ollama/server.(*Server).ShowHandler-fm (7 handlers)
[GIN-debug] POST   /v1/responses             --> github.com/ollama/ollama/server.(*Server).ChatHandler-fm (7 handlers)
[GIN-debug] POST   /v1/images/generations    --> github.com/ollama/ollama/server.(*Server).GenerateHandler-fm (7 handlers)
[GIN-debug] POST   /v1/images/edits          --> github.com/ollama/ollama/server.(*Server).GenerateHandler-fm (7 handlers)
[GIN-debug] POST   /v1/audio/transcriptions  --> github.com/ollama/ollama/server.(*Server).ChatHandler-fm (6 handlers)
[GIN-debug] POST   /v1/messages              --> github.com/ollama/ollama/server.(*Server).ChatHandler-fm (7 handlers)
time=2026-05-01T13:23:52.037-04:00 level=DEBUG source=model_recommendations.go:59 msg="starting model recommendations cache" default_recommendations=6 refresh_interval=4h0m0s fetch_timeout=3s
time=2026-05-01T13:23:52.037-04:00 level=INFO source=routes.go:1847 msg="Listening on 127.0.0.1:11434 (version v0.22.1 git-8f39fff7)"
time=2026-05-01T13:23:52.037-04:00 level=DEBUG source=model_recommendations.go:246 msg="model recommendations snapshot not found" path=/Users/mariogrgic/.ollama/cache/model-recommendations.json
time=2026-05-01T13:23:52.037-04:00 level=DEBUG source=model_recommendations.go:167 msg="skipping model recommendations refresh because cloud is disabled"
time=2026-05-01T13:23:52.037-04:00 level=INFO source=model_recommendations.go:179 msg="model recommendations cache sleep scheduled" wait=4h33m14.062196545s consecutive_failures=0
time=2026-05-01T13:23:52.037-04:00 level=DEBUG source=sched.go:145 msg="starting llm scheduler"
time=2026-05-01T13:23:52.038-04:00 level=INFO source=runner.go:67 msg="discovering available GPUs..."
time=2026-05-01T13:23:52.039-04:00 level=INFO source=server.go:433 msg="starting runner" cmd="/Volumes/DATA/dev/ollama/ollama.git/ollama runner --ollama-engine --port 51163"
time=2026-05-01T13:23:52.039-04:00 level=DEBUG source=server.go:434 msg=subprocess OLLAMA_MODELS=/Volumes/DATA/dev/ollama/models OLLAMA_NOHISTORY=1 OLLAMA_CONTEXT_LENGTH=8192 OLLAMA_DEBUG=1 PATH=/Library/Frameworks/Python.framework/Versions/3.14/bin:/Users/mariogrgic/.cargo/bin:/System/Cryptexes/App/usr/bin:/usr/bin:/bin:/usr/sbin:/sbin:/usr/local/bin:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/local/bin:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/bin:/var/run/com.apple.security.cryptexd/codex.system/bootstrap/usr/appleinternal/bin:/opt/pkg/env/active/bin:/opt/pmk/env/global/bin:/opt/X11/bin:/Library/Apple/usr/bin:/usr/local/Gambit/bin:/usr/local/pgsql/bin:/usr/local/texlive/2025/bin/universal-darwin:/usr/local/ant/bin:/usr/local/cassandra/bin:/usr/local/confluent/bin:/usr/local/elasticsearch/bin:/usr/local/erlang/bin:/usr/local/ghc/bin:/usr/local/go/bin:/usr/local/gradle/bin:/usr/local/memcached/bin:/usr/local/mongodb/bin:/usr/local/maven/bin:/usr/local/mysql/bin:/usr/local/pdftk/bin:/usr/local/sbt/bin:/usr/local/scala/bin:/usr/local/spark/bin:/usr/local/sbin DYLD_LIBRARY_PATH=/Volumes/DATA/dev/ollama/ollama.git OLLAMA_LIBRARY_PATH=/Volumes/DATA/dev/ollama/ollama.git
time=2026-05-01T13:23:52.086-04:00 level=DEBUG source=runner.go:433 msg="bootstrap discovery took" duration=47.701ms OLLAMA_LIBRARY_PATH=[/Volumes/DATA/dev/ollama/ollama.git] extra_envs=map[]
time=2026-05-01T13:23:52.086-04:00 level=DEBUG source=runner.go:123 msg="evaluating which, if any, devices to filter out" initial_count=1
time=2026-05-01T13:23:52.086-04:00 level=DEBUG source=runner.go:192 msg="adjusting filtering IDs" FilterID=0 new_ID=0
time=2026-05-01T13:23:52.086-04:00 level=DEBUG source=runner.go:40 msg="GPU bootstrap discovery took" duration=49.008ms
time=2026-05-01T13:23:52.086-04:00 level=INFO source=types.go:42 msg="inference compute" id=0 filter_id=0 library=Metal compute=0.0 name=Metal description="Apple M4 Max" libdirs="" driver=0.0 pci_id="" type=discrete total="107.5 GiB" available="107.5 GiB"
time=2026-05-01T13:23:52.086-04:00 level=INFO source=routes.go:1897 msg="vram-based default context" total_vram="107.5 GiB" default_num_ctx=262144
[GIN] 2026/05/01 - 13:23:55 | 200 |      40.792µs |       127.0.0.1 | HEAD     "/"
[GIN] 2026/05/01 - 13:23:55 | 200 |  463.865958ms |       127.0.0.1 | POST     "/api/pull"

OS

macOS 26.4.1

GPU

M4 Max

CPU

M4 Max

Ollama version

0.22.1

extent analysis

TL;DR

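The pull fails with "Error: pull model manifest: 412: this model requires macOS" even though the reporter is already on macOS 26.4.1 (Apple M4 Max, Ollama 0.22.1). The failure occurs at the manifest step, before any model data is downloaded, so the likely causes are a requirement specific to qwen3.6:27b-coding-mxfp8 or a mismatch with this Ollama version or build, not the operating system itself. The issue has no comments yet, so the cause is unconfirmed.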

Guidance

  • Verify that the model qwen3.6:27b-coding-mxfp8 is compatible with the Ollama version 0.22.1 and macOS.
  • Check if there are any specific requirements or dependencies for the model that are not met in the current environment.
  • Consider updating Ollama to a newer version if available, as this might resolve compatibility issues.
  • If the model is supposed to be compatible, try pulling a different model to see if the issue is model-specific.
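
To support the version and environment checks above, here is a rough sketch that pulls the Ollama version and the detected GPU out of a server log like the one in this issue. It assumes the key=value layout and the message texts ("Listening on ... (version ...)" and "inference compute") seen in this particular log; other Ollama versions may log differently. The function names are hypothetical.

```python
import re
import shlex


def parse_logfmt(line):
    """Parse one key=value log line into a dict; quoted values may contain spaces."""
    fields = {}
    for token in shlex.split(line):
        key, sep, value = token.partition("=")
        if sep:
            fields[key] = value
    return fields


def summarize(lines):
    """Collect the version and GPU details from the structured log lines."""
    summary = {}
    for line in lines:
        # Only the structured lines start with time=; skip [GIN] output.
        if not line.startswith("time="):
            continue
        try:
            fields = parse_logfmt(line)
        except ValueError:
            continue  # unbalanced quotes; ignore the line
        msg = fields.get("msg", "")
        version = re.search(r"\(version (\S+)", msg)
        if version:
            summary["version"] = version.group(1)
        if msg == "inference compute":
            summary["library"] = fields.get("library")
            summary["gpu"] = fields.get("description")
            summary["memory"] = fields.get("total")
    return summary
```

Feeding it the log from this issue should give the same facts the reporter listed by hand (version, Metal, Apple M4 Max), which makes it easier to compare reports from different machines.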

Notes

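The log does not show the cause directly. It shows the server starting normally (v0.22.1, Metal, Apple M4 Max) and the POST /api/pull request completing with status 200 after about 464 ms, so the error is reported inside the pull response and is not a server crash. The runner path (/Volumes/DATA/dev/ollama/ollama.git/ollama) suggests the binary was run from a local checkout, which may be worth comparing against an official release build.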

Recommendation

Diagnostic step: pull a different model to determine whether the issue is specific to qwen3.6:27b-coding-mxfp8 or a broader compatibility problem with Ollama 0.22.1 on macOS.
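
The isolation step can be sketched as a small script that pulls the failing model and a control model and compares the outcomes. This is only an illustration: it assumes the ollama CLI is on PATH and exits non-zero when a pull fails, and the control model is whatever model is known to pull on the same machine. The names pull and isolate are hypothetical.

```python
import subprocess


def pull(model, run=subprocess.run):
    """Run `ollama pull <model>`; return (succeeded, combined output)."""
    proc = run(["ollama", "pull", model], capture_output=True, text=True)
    return proc.returncode == 0, (proc.stdout + proc.stderr).strip()


def isolate(target, control, run=subprocess.run):
    """Pull both models and say whether the failure looks model-specific."""
    target_ok, target_out = pull(target, run)
    control_ok, control_out = pull(control, run)
    if target_ok:
        verdict = "not reproduced: the target model pulls fine"
    elif control_ok:
        verdict = "model-specific: only the target model is rejected"
    else:
        verdict = "broader problem: the control model fails too"
    return verdict, target_out, control_out


# Example call (the second argument is a placeholder, not a real tag):
# isolate("qwen3.6:27b-coding-mxfp8", "<a model that pulls on this machine>")
```

The run parameter exists only so the logic can be exercised without a real Ollama install. A "model-specific" verdict points at the model's own requirements, while a "broader problem" verdict points at the installation or build.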
