ollama - 💡 (How to fix) Not working on RX 9060 XT [2 comments, 3 participants]

GitHub stats
ollama/ollama#14765 · Fetched 2026-04-08 00:31:54
Comments: 2 · Participants: 3 · Timeline: 3 (commented ×2, closed ×1) · Reactions: 0

Error Message

Even with OLLAMA_VULKAN=1, the runner falls back to CPU:

level=INFO source=ggml.go:494 msg="offloaded 0/35 layers to GPU"

vulkaninfo correctly identifies the device:

deviceName = AMD Radeon RX 9060 XT (RADV GFX1200)


Describe the bug

Ollama (v0.17.7/v0.17.8) fails to offload layers to the GPU on the new AMD Radeon RX 9060 XT (GFX1200 / Navi 44). The system is running Arch Linux with mesa-git (Mesa 26.1.0-devel).

System Information

  • OS: Arch Linux
  • GPU: AMD Radeon RX 9060 XT (GFX1200 / RADV)
  • Kernel: 7.0 cachyos-rc3
  • Mesa Version: 26.1.0_devel (mesa-git)
  • Ollama Version: 0.17.7

Log Output

Even with OLLAMA_VULKAN=1, the runner falls back to CPU:

level=INFO source=ggml.go:494 msg="offloaded 0/35 layers to GPU"

vulkaninfo correctly identifies the device:

deviceName = AMD Radeon RX 9060 XT (RADV GFX1200)

Additional Context

The GFX1200 architecture seems to be missing from the supported GPU discovery list in the current llama.cpp/ggml integration.
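
To make the architecture name in question concrete, here is a minimal sketch that pulls the architecture token out of a deviceName string shaped like the one vulkaninfo reports above. The function name extract_gfx_arch and the "(RADV GFX...)" pattern are assumptions based only on the single example in this issue; this is not code from Ollama, llama.cpp, or ggml, and other drivers may format the name differently.

```cpp
#include <regex>
#include <string>

// Hypothetical helper (not part of Ollama or ggml): returns the architecture
// token, e.g. "GFX1200", from a RADV-style deviceName such as
// "AMD Radeon RX 9060 XT (RADV GFX1200)". Returns an empty string when the
// name does not follow that assumed pattern.
std::string extract_gfx_arch(const std::string& device_name) {
    // Assumed format: "(RADV <ARCH>)" where <ARCH> starts with "GFX".
    static const std::regex pattern(R"(\(RADV\s+(GFX[0-9A-Za-z]+)\))");
    std::smatch match;
    if (std::regex_search(device_name, match, pattern)) {
        return match[1].str();
    }
    return std::string();
}
```

Calling extract_gfx_arch on the deviceName from this report yields the string that a discovery list would need to recognize, which makes it easier to search the sources for that exact token.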

extent analysis

Fix Plan

To get Ollama to offload layers to the GPU on the AMD Radeon RX 9060 XT, the proposed fix is to add the GFX1200 architecture to the supported GPU discovery list in the llama.cpp/ggml integration. This follows the report's suspicion that GFX1200 is missing from that list; the issue does not confirm it as the root cause.

Steps to Fix

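  • Update the GPU discovery code in the llama.cpp/ggml integration so that the supported GPU list includes the GFX1200 architecture (the issue does not name the exact file or list).
  • Recompile the ggml library, and the Ollama build that embeds it, against the updated sources.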
  • Set the OLLAMA_VULKAN environment variable to 1 to enable Vulkan support.

Example Code Changes

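// Hypothetical sketch: `supported_gpus` is a placeholder name, not a confirmed
// identifier in llama.cpp/ggml. Build with: g++ sketch.cpp -lvulkan
#include <cstdio>
#include <string>
#include <vector>
#include <vulkan/vulkan.h>

int main() {
    // Add the GFX1200 architecture to the (hypothetical) supported GPU list
    std::vector<std::string> supported_gpus;
    supported_gpus.push_back("GFX1200");

    // Describe the application to the Vulkan loader
    VkApplicationInfo app_info = {};
    app_info.sType = VK_STRUCTURE_TYPE_APPLICATION_INFO;
    app_info.pApplicationName = "Ollama";
    app_info.applicationVersion = VK_MAKE_VERSION(1, 0, 0);
    app_info.pEngineName = "Ollama Engine";
    app_info.engineVersion = VK_MAKE_VERSION(1, 0, 0);
    app_info.apiVersion = VK_API_VERSION_1_2;

    VkInstanceCreateInfo create_info = {};
    create_info.sType = VK_STRUCTURE_TYPE_INSTANCE_CREATE_INFO;
    create_info.pApplicationInfo = &app_info;

    // Create the Vulkan instance and check the result instead of ignoring it
    VkInstance instance = VK_NULL_HANDLE;
    if (vkCreateInstance(&create_info, nullptr, &instance) != VK_SUCCESS) {
        std::fprintf(stderr, "vkCreateInstance failed\n");
        return 1;
    }
    vkDestroyInstance(instance, nullptr);
    return 0;
}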

Verification

  • Run vulkaninfo to verify that the device is correctly identified.
  • Set OLLAMA_VULKAN=1 and run Ollama to verify that layers are offloaded to the GPU.
  • Check the log output for level=INFO source=ggml.go:494 msg="offloaded X/35 layers to GPU" where X is greater than 0.
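
The log check in the last step can be automated. The following is a minimal sketch, assuming the message keeps the exact "offloaded X/Y layers to GPU" wording shown in this issue; OffloadStatus, parse_offload_line, and gpu_offload_active are hypothetical names introduced here for illustration, not Ollama APIs.

```cpp
#include <regex>
#include <string>

// Result of scanning one log line for the offload message.
struct OffloadStatus {
    bool found = false;  // true when the line contains the offload message
    int offloaded = 0;   // X in "offloaded X/Y layers to GPU"
    int total = 0;       // Y in "offloaded X/Y layers to GPU"
};

// Hypothetical helper: extracts X and Y from a log line, assuming the
// message format quoted in the issue.
OffloadStatus parse_offload_line(const std::string& line) {
    static const std::regex pattern(R"(offloaded (\d+)/(\d+) layers to GPU)");
    OffloadStatus status;
    std::smatch match;
    if (std::regex_search(line, match, pattern)) {
        status.found = true;
        status.offloaded = std::stoi(match[1].str());
        status.total = std::stoi(match[2].str());
    }
    return status;
}

// True only when the line reports at least one layer on the GPU.
bool gpu_offload_active(const std::string& line) {
    const OffloadStatus status = parse_offload_line(line);
    return status.found && status.offloaded > 0;
}
```

Feeding the line from this report (0/35) to gpu_offload_active returns false, which is the CPU fallback described above; a line with X greater than 0 returns true.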

Extra Tips

  • Ensure that the mesa-git package is up-to-date, as it may contain fixes for the GFX1200 architecture.
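  • If issues persist, enable the Vulkan validation layers to surface driver or API errors, for example by setting VK_INSTANCE_LAYERS=VK_LAYER_KHRONOS_validation (the validation layers must be installed). VK_LAYER_KHRONOS_validation is the name of the layer, not an environment variable.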
