ollama - ✅(Solved) Fix qwen3next: layer 2 missing attn_qkv/attn_gate projections [1 pull requests, 7 comments, 2 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
ollama/ollama#15372Fetched 2026-04-08 03:01:27
View on GitHub
Comments
7
Participants
2
Timeline
8
Reactions
0
Timeline (top)
commented ×7labeled ×1

Error Message

the error states: 500 Internal Server Error: failed to initialize model: qwen3next: layer 2 missing attn_qkv/attn_gate projections

PR fix notes

PR #15133: model: add qwen3-next compatibility for legacy ssm_in projections

Description (problem / solution / changelog)

Fixes #14587

Changed files

  • model/models/qwen3next/deltanet.go (modified, +21/-6)
  • model/models/qwen3next/model.go (modified, +1/-1)
  • model/models/qwen3next/model_validate_test.go (modified, +25/-0)
RAW_BUFFERClick to expand / collapse

What is the issue?

similar to this issue but for layer 2 instead of layer 0: https://github.com/ollama/ollama/pull/15133

the error states: 500 Internal Server Error: failed to initialize model: qwen3next: layer 2 missing attn_qkv/attn_gate projections

I'm trying to use the model qwen3-coder-next

Relevant log output

OS

No response

GPU

No response

CPU

No response

Ollama version

No response

extent analysis

TL;DR

The issue is likely due to missing attn_qkv and attn_gate projections in layer 2 of the qwen3next model, and a potential fix involves ensuring these projections are properly defined or updated.

Guidance

  • Review the model configuration for qwen3next to verify that attn_qkv and attn_gate projections are correctly specified for layer 2.
  • Check the model initialization code to ensure that layer 2 is properly initialized with the required projections.
  • Compare the model configuration with the fixed version mentioned in the related issue (https://github.com/ollama/ollama/pull/15133) to identify potential differences.
  • If using a pre-trained model, consider updating to a version where the issue is resolved or using a different model that does not have this problem.

Notes

The exact solution depends on the specifics of the model and its implementation, which are not fully detailed in the provided information. The suggestions are based on the error message indicating missing projections in layer 2.

Recommendation

Apply workaround: Update the model configuration to include the missing attn_qkv and attn_gate projections for layer 2, or use a different model version where this issue is resolved, as the provided error directly points to a configuration or model initialization problem.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING

ollama - ✅(Solved) Fix qwen3next: layer 2 missing attn_qkv/attn_gate projections [1 pull requests, 7 comments, 2 participants]