ollama - ✅(Solved) Fix qwen3.5:9b sometimes prints out tool call instead of executing it [1 pull requests, 12 comments, 8 participants]

Kangaroux · 2026-03-09T19:27:03Z

[ollama] PR 15022: model/parsers: Close think block if tool block starts in Qwen3.5 - Repository: ollama/ollama - Author: amatas - State: closed | merged: True… # PR #15022: model/parsers: Close think block if tool block starts in Qwen3.5 - Repository: ollama/ollama - Author: amatas - State: closed | merged: True - Link: https://github.com/ollama/ollama/pull/15022 ## Description (problem / solution / changelog) This change fixes https://github.com/ollama/ollama/issues/14745 when the model starts a `tool_call` block without closing the `think` block: ``` Thinking: I need to mock Valkey for tests. Let me check how the app initializes Valkey and update TESTING.md with this information. grep "redacted" | head -20 Find Valkey initialization ``` I saw this behavior while using Opencode + Ollama 0.18.2 + qwen3.5:9b. I have been working for a couple of days with the same setup and this patch without problems. Issues that can be fixed with this PR as well: - https://github.com/ollama/ollama/issues/14745 - https://github.com/ollama/ollama/issues/14601 - Partial https://github.com/ollama/ollama/issues/14493 - https://github.com/ollama/ollama/issues/14492 external: - https://github.com/openclaw/openclaw/issues/45000 Regards, and thank you all for this great tool. ## Changed files - `model/parsers/qwen35.go` (modified, +9/-0) - `model/parsers/qwen35_test.go` (modified, +14/-8) ## Fix / Workaround ## workaround: use ollama 0.17.5 ## workaround: use ollama 0.17.5 ### What is the issue? Using the model from ollama.com and Opencode 1.2.24. This happens fairly often and halts the work the agent was doing. ``` Thinking: I need to mock Valkey for tests. Let me check how the app initializes Valkey and update TESTING.md with this information. grep "redacted" | head -20 Find Valkey initialization ``` ``` $ ollama ls NAME ID SIZE MODIFIED qwen3.5:9b 6488c96fa5fa 6.6 GB 25 hours ago ``` ``` $ ollama show qwen3.5:9b Model architecture qwen35 parameters 9.7B context length 262144 embedding length 4096 quantization Q4_K_M requires 0.17.1 Capabilities completion vision tools thinking Parameters presence_penalty 1.5 temperature 1 top_k 20 top_p 0.95 License Apache License Version 2.0, January 2004 ... ``` ``` $ ollama show --modelfile qwen3.5:9b # Modelfile generated by "ollama show" # To build a new Modelfile based on this, replace FROM with: # FROM qwen3.5:9b FROM /usr/share/ollama/.ollama/models/blobs/sha256-dec52a44569a2a25341c4e4d3fee25846eed4f6f0b936278e3a3c900bb99d37c TEMPLATE {{ .Prompt }} RENDERER qwen3.5 PARSER qwen3.5 PARAMETER temperature 1 PARAMETER top_k 20 PARAMETER top_p 0.95 PARAMETER presence_penalty 1.5 ``` ### Relevant log output ```shell ``` ### OS Linux ### GPU AMD ### CPU AMD ### Ollama version 0.17.7

ollama2026-03-09 19:27:03

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

ollama/ollama#14745•Fetched 2026-04-08 00:32:12

View on GitHub

Comments

Participants

Timeline

Reactions

Author

Participants

Assignees

Timeline (top)

subscribed ×14commented ×12cross-referenced ×3assigned ×1

Fix Action

Fix / Workaround

workaround: use ollama 0.17.5

PR fix notes

PR #15022: model/parsers: Close think block if tool block starts in Qwen3.5

Repository: ollama/ollama
Author: amatas
State: closed | merged: True
Link: https://github.com/ollama/ollama/pull/15022

Description (problem / solution / changelog)

This change fixes https://github.com/ollama/ollama/issues/14745 when the model starts a tool_call block without closing the think block:

Thinking: I need to mock Valkey for tests. Let me check how the app initializes Valkey and update TESTING.md with this information.
<tool_call>
<function=bash>
<parameter=command>
grep "redacted" | head -20
</parameter>
<parameter=description>
Find Valkey initialization
</parameter>
</function>
</tool_call>

I saw this behavior while using Opencode + Ollama 0.18.2 + qwen3.5:9b. I have been working for a couple of days with the same setup and this patch without problems.

Issues that can be fixed with this PR as well:

external:

https://github.com/openclaw/openclaw/issues/45000

Regards, and thank you all for this great tool.

Changed files

model/parsers/qwen35.go (modified, +9/-0)
model/parsers/qwen35_test.go (modified, +14/-8)

Code Example

Thinking: I need to mock Valkey for tests. Let me check how the app initializes Valkey and update TESTING.md with this information.
<tool_call>
<function=bash>
<parameter=command>
grep "redacted" | head -20
</parameter>
<parameter=description>
Find Valkey initialization
</parameter>
</function>
</tool_call>

---

$ ollama ls
NAME          ID              SIZE      MODIFIED     
qwen3.5:9b    6488c96fa5fa    6.6 GB    25 hours ago

---

$ ollama show qwen3.5:9b
  Model
    architecture        qwen35    
    parameters          9.7B      
    context length      262144    
    embedding length    4096      
    quantization        Q4_K_M    
    requires            0.17.1    

  Capabilities
    completion    
    vision        
    tools         
    thinking      

  Parameters
    presence_penalty    1.5     
    temperature         1       
    top_k               20      
    top_p               0.95    

  License
    Apache License               
    Version 2.0, January 2004    
    ...

---

$ ollama show --modelfile qwen3.5:9b
# Modelfile generated by "ollama show"
# To build a new Modelfile based on this, replace FROM with:
# FROM qwen3.5:9b

FROM /usr/share/ollama/.ollama/models/blobs/sha256-dec52a44569a2a25341c4e4d3fee25846eed4f6f0b936278e3a3c900bb99d37c
TEMPLATE {{ .Prompt }}
RENDERER qwen3.5
PARSER qwen3.5
PARAMETER temperature 1
PARAMETER top_k 20
PARAMETER top_p 0.95
PARAMETER presence_penalty 1.5

---

RAW_BUFFERClick to expand / collapse

workaround: use ollama 0.17.5

What is the issue?

Using the model from ollama.com and Opencode 1.2.24. This happens fairly often and halts the work the agent was doing.

Thinking: I need to mock Valkey for tests. Let me check how the app initializes Valkey and update TESTING.md with this information.
<tool_call>
<function=bash>
<parameter=command>
grep "redacted" | head -20
</parameter>
<parameter=description>
Find Valkey initialization
</parameter>
</function>
</tool_call>

$ ollama ls
NAME          ID              SIZE      MODIFIED     
qwen3.5:9b    6488c96fa5fa    6.6 GB    25 hours ago

$ ollama show qwen3.5:9b
  Model
    architecture        qwen35    
    parameters          9.7B      
    context length      262144    
    embedding length    4096      
    quantization        Q4_K_M    
    requires            0.17.1    

  Capabilities
    completion    
    vision        
    tools         
    thinking      

  Parameters
    presence_penalty    1.5     
    temperature         1       
    top_k               20      
    top_p               0.95    

  License
    Apache License               
    Version 2.0, January 2004    
    ...

$ ollama show --modelfile qwen3.5:9b
# Modelfile generated by "ollama show"
# To build a new Modelfile based on this, replace FROM with:
# FROM qwen3.5:9b

FROM /usr/share/ollama/.ollama/models/blobs/sha256-dec52a44569a2a25341c4e4d3fee25846eed4f6f0b936278e3a3c900bb99d37c
TEMPLATE {{ .Prompt }}
RENDERER qwen3.5
PARSER qwen3.5
PARAMETER temperature 1
PARAMETER top_k 20
PARAMETER top_p 0.95
PARAMETER presence_penalty 1.5

Relevant log output

OS

Linux

GPU

AMD

CPU

AMD

Ollama version

0.17.7

extent analysis

Fix Plan

To resolve the issue, we will downgrade the Ollama version to 0.17.5 as suggested in the workaround.

Steps:

Stop any ongoing Ollama processes
Uninstall the current Ollama version (0.17.7)
Install Ollama version 0.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #ssr #installation #tensor shape #autograd error #LLM response #prompt template #agent execution #callback error #memory management

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

ollama - ✅(Solved) Fix qwen3.5:9b sometimes prints out tool call instead of executing it [1 pull requests, 12 comments, 8 participants]

Recommended Tools

GitHub issue graph ai analysis

Fix Action

Fix / Workaround

workaround: use ollama 0.17.5

PR fix notes

PR #15022: model/parsers: Close think block if tool block starts in Qwen3.5

Description (problem / solution / changelog)

Changed files

Code Example

workaround: use ollama 0.17.5

What is the issue?

Relevant log output

OS

GPU

CPU

Ollama version

extent analysis

Fix Plan

Steps:

Still need to ship something?

TRENDING

ollama - ✅(Solved) Fix qwen3.5:9b sometimes prints out tool call instead of executing it [1 pull requests, 12 comments, 8 participants]

Recommended Tools

GitHub issue graph ai analysis

Fix Action

Fix / Workaround

workaround: use ollama 0.17.5

PR fix notes

PR #15022: model/parsers: Close think block if tool block starts in Qwen3.5

Description (problem / solution / changelog)

Changed files

Code Example

workaround: use ollama 0.17.5

What is the issue?

Relevant log output

OS

GPU

CPU

Ollama version

extent analysis

Fix Plan

Steps:

Still need to ship something?

RELATED_DISCOVERY

TRENDING