openclaw - 💡(How to fix) Fix [Bug]: openai-completions adapter hangs indefinitely on reasoning models — 5 models / 4 providers reproduced [2 comments, 2 participants]

EeroVakiparta · 2026-05-05T14:02:59Z

[openclaw] OpenClaw 2026.4.15 openai-completions adapter hangs the main reply lane indefinitely for reasoning models, regardless of provider. active-memory Ant… OpenClaw `2026.4.15` `openai-completions` adapter hangs the **main reply lane indefinitely** for reasoning models, regardless of provider. `active-memory` (Anthropic Haiku) completes cleanly, then no further log output for the affected session — no error, no failover, no timeout. Tested 5 model/provider combinations, all hang. Direct `curl` to the same endpoints from inside the gateway container returns valid JSON in 1-3s. This appears related to (and possibly the same root cause as) #44549, #45681, #71160, #69902, #5980, #46049 — but with broader reproduction scope including Kimi K2.6 direct, MiniMax M2.5 direct via OR, GLM-4.7 via OR. ## Fix / Workaround ## Workaround currently in place ## Summary OpenClaw `2026.4.15` `openai-completions` adapter hangs the **main reply lane indefinitely** for reasoning models, regardless of provider. `active-memory` (Anthropic Haiku) completes cleanly, then no further log output for the affected session — no error, no failover, no timeout. Tested 5 model/provider combinations, all hang. Direct `curl` to the same endpoints from inside the gateway container returns valid JSON in 1-3s. This appears related to (and possibly the same root cause as) #44549, #45681, #71160, #69902, #5980, #46049 — but with broader reproduction scope including Kimi K2.6 direct, MiniMax M2.5 direct via OR, GLM-4.7 via OR. ## Environment - OpenClaw `2026.4.15` - Image `ghcr.io/openclaw/openclaw:latest` (auto-update on) - Linux x86_64 / Docker (Hetzner CX22) - Channel: telegram (`@Oranssi_makkara_bot`) - Plugins: `active-memory` (Anthropic Haiku-pinned), `memory-core`, `telegram` - Workspace bootstrap: ~32k chars (`bootstrapMaxChars: 32000`) ## Reproduction (any of these) Set as `agents.defaults.model.primary`, send a Telegram chat message: | # | Model identifier | Reasoning declared in config | Provider path | Result | |---|---|:---:|---|---| | 1 | `openrouter/minimax/minimax-m2.5` | `true` | OR custom provider, `api: openai-completions`, baseUrl `openrouter.ai/api/v1` | hangs ≥5min | | 2 | `openrouter/minimax/minimax-m2.5` | `true` | OR **native** provider (no custom block, just env + allowlist) | hangs ≥5min | | 3 | `openrouter/z-ai/glm-4.7` | `true` | OR native | hangs ≥2min | | 4 | `openrouter/moonshotai/kimi-k2.6` (in built-in catalog) | `true` | OR native | hangs ≥3min | | 5 | `kimi/kimi-k2.6` | `true` | Kimi direct, custom provider, `api: openai-completions`, baseUrl `api.moonshot.ai/v1` | hangs ≥5min | | 5b | `kimi/kimi-k2.6` | **`false`** | Same custom provider, declared as non-reasoning | **also hangs ≥4min** | | 6 | `openrouter/auto` (non-reasoning) | `false` | OR native | first reply at **17 minutes**, second message also hung | So: - **Reasoning-declared-true models** (5/5 tested) → hang indefinitely, never return - **Reasoning-declared-false but model still reasons internally** (Kimi K2.6 with reasoning:false) → hangs the same way; declaration alone doesn't bypass the parser - **Truly non-reasoning model** (OR auto) → eventually returns but at unusable latency (17min first request, second hangs) ## Symptom in logs Identical pattern across all 5 reproductions: ``` [telegram] sendMessage ok chat=… message=… <- /new ack lands fast [plugins] active-memory: … start [diagnostic] liveness warning: … work=[active=…(processing/model_call,…,age=Xs last=model_call:started)] [plugins] active-memory: … done status=empty elapsedMs=~10000 [agent/embedded] embedded run failover decision: runId=active-memory-… stage=assistant decision=surface_error reason=none from=anthropic/claude-haiku-4-5 [no further log entries for this session] ``` No `[fetch-timeout]`, no `[model-fallback/decision]`, no error surfaced to the user, no `embedded run agent end`. ## Direct upstream verification From **inside the gateway container** (so same network, same env), identical requests to the same endpoints succeed cleanly in 1-3s. Example for Kimi: ```bash docker compose exec openclaw bash -c 'curl -s -X POST https://api.moonshot.ai/v1/chat/completions \ -H "Authorization: Bearer $KIMI_API_KEY" \ -H "Content-Type: application/json" \ -d "{\"model\":\"kimi-k2.6\",\"messages\":[{\"role\":\"user\",\"content\":\"pong\"}],\"max_tokens\":500}"' # Returns ~2.6s, valid JSON, choices[0].message.content="pong", # choices[0].message.reasoning_content=" " ``` For MiniMax M2.5 via OR (also tested raw bytes): ``` HTTP/2 200 content-type: application/json {first 5 lines whitespace/keep-alive padding} {"id":"gen-…","object":"chat.completion","model":"minimax/minimax-m2.5-…", "choices":[{"message":{"content":"Pong! 🎯", "reasoning":" ","reasoning_details":[…]}}]} ``` So: - Network works - Auth works - Models return valid responses - Responses include reasoning content (`reasoning_content` for Kimi, `reasoning`+`reasoning_details` for OR-routed models) ##

openclaw2026-05-05 14:02:59

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

openclaw/openclaw#77870•Fetched 2026-05-06 06:20:01

View on GitHub

Comments

Participants

Timeline

Reactions

Author

EeroVakiparta

Participants

clawsweeper[bot]

EeroVakiparta

Timeline (top)

commented ×2

OpenClaw 2026.4.15 openai-completions adapter hangs the main reply lane indefinitely for reasoning models, regardless of provider. active-memory (Anthropic Haiku) completes cleanly, then no further log output for the affected session — no error, no failover, no timeout. Tested 5 model/provider combinations, all hang. Direct curl to the same endpoints from inside the gateway container returns valid JSON in 1-3s.

This appears related to (and possibly the same root cause as) #44549, #45681, #71160, #69902, #5980, #46049 — but with broader reproduction scope including Kimi K2.6 direct, MiniMax M2.5 direct via OR, GLM-4.7 via OR.

Error Message

Root Cause

Fix Action

Fix / Workaround

Workaround currently in place

Code Example

[telegram] sendMessage ok chat=… message=…    <- /new ack lands fast
[plugins] active-memory: … start
[diagnostic] liveness warning: … work=[active=…(processing/model_call,…,age=Xs last=model_call:started)]
[plugins] active-memory: … done status=empty elapsedMs=~10000
[agent/embedded] embedded run failover decision: runId=active-memory-… stage=assistant decision=surface_error reason=none from=anthropic/claude-haiku-4-5
[no further log entries for this session]

---

docker compose exec openclaw bash -c 'curl -s -X POST https://api.moonshot.ai/v1/chat/completions \
  -H "Authorization: Bearer $KIMI_API_KEY" \
  -H "Content-Type: application/json" \
  -d "{\"model\":\"kimi-k2.6\",\"messages\":[{\"role\":\"user\",\"content\":\"pong\"}],\"max_tokens\":500}"'

# Returns ~2.6s, valid JSON, choices[0].message.content="pong",
# choices[0].message.reasoning_content="<chain of thought>"

---

HTTP/2 200
content-type: application/json
{first 5 lines whitespace/keep-alive padding}
{"id":"gen-…","object":"chat.completion","model":"minimax/minimax-m2.5-…",
 "choices":[{"message":{"content":"Pong! 🎯",
   "reasoning":"<thinking>","reasoning_details":[…]}}]}

RAW_BUFFERClick to expand / collapse

Summary

Environment

OpenClaw 2026.4.15
Image ghcr.io/openclaw/openclaw:latest (auto-update on)
Linux x86_64 / Docker (Hetzner CX22)
Channel: telegram (@Oranssi_makkara_bot)
Plugins: active-memory (Anthropic Haiku-pinned), memory-core, telegram
Workspace bootstrap: ~32k chars (bootstrapMaxChars: 32000)

Reproduction (any of these)

Set as agents.defaults.model.primary, send a Telegram chat message:

#	Model identifier	Reasoning declared in config	Provider path	Result
1	`openrouter/minimax/minimax-m2.5`	`true`	OR custom provider, `api: openai-completions`, baseUrl `openrouter.ai/api/v1`	hangs ≥5min
2	`openrouter/minimax/minimax-m2.5`	`true`	OR native provider (no custom block, just env + allowlist)	hangs ≥5min
3	`openrouter/z-ai/glm-4.7`	`true`	OR native	hangs ≥2min
4	`openrouter/moonshotai/kimi-k2.6` (in built-in catalog)	`true`	OR native	hangs ≥3min
5	`kimi/kimi-k2.6`	`true`	Kimi direct, custom provider, `api: openai-completions`, baseUrl `api.moonshot.ai/v1`	hangs ≥5min
5b	`kimi/kimi-k2.6`	`false`	Same custom provider, declared as non-reasoning	also hangs ≥4min
6	`openrouter/auto` (non-reasoning)	`false`	OR native	first reply at 17 minutes, second message also hung

So:

Reasoning-declared-true models (5/5 tested) → hang indefinitely, never return
Reasoning-declared-false but model still reasons internally (Kimi K2.6 with reasoning:false) → hangs the same way; declaration alone doesn't bypass the parser
Truly non-reasoning model (OR auto) → eventually returns but at unusable latency (17min first request, second hangs)

Symptom in logs

Identical pattern across all 5 reproductions:

[telegram] sendMessage ok chat=… message=…    <- /new ack lands fast
[plugins] active-memory: … start
[diagnostic] liveness warning: … work=[active=…(processing/model_call,…,age=Xs last=model_call:started)]
[plugins] active-memory: … done status=empty elapsedMs=~10000
[agent/embedded] embedded run failover decision: runId=active-memory-… stage=assistant decision=surface_error reason=none from=anthropic/claude-haiku-4-5
[no further log entries for this session]

No [fetch-timeout], no [model-fallback/decision], no error surfaced to the user, no embedded run agent end.

Direct upstream verification

From inside the gateway container (so same network, same env), identical requests to the same endpoints succeed cleanly in 1-3s. Example for Kimi:

docker compose exec openclaw bash -c 'curl -s -X POST https://api.moonshot.ai/v1/chat/completions \
  -H "Authorization: Bearer $KIMI_API_KEY" \
  -H "Content-Type: application/json" \
  -d "{\"model\":\"kimi-k2.6\",\"messages\":[{\"role\":\"user\",\"content\":\"pong\"}],\"max_tokens\":500}"'

# Returns ~2.6s, valid JSON, choices[0].message.content="pong",
# choices[0].message.reasoning_content="<chain of thought>"

For MiniMax M2.5 via OR (also tested raw bytes):

HTTP/2 200
content-type: application/json
{first 5 lines whitespace/keep-alive padding}
{"id":"gen-…","object":"chat.completion","model":"minimax/minimax-m2.5-…",
 "choices":[{"message":{"content":"Pong! 🎯",
   "reasoning":"<thinking>","reasoning_details":[…]}}]}

So:

Network works
Auth works
Models return valid responses
Responses include reasoning content (reasoning_content for Kimi, reasoning+reasoning_details for OR-routed models)

Hypotheses (untested, leaving for the team)

The openai-completions adapter cannot exit its response-read loop when the response includes reasoning fields, regardless of whether reasoning: true|false is declared in our config
Streaming framing mismatch: adapter expects a specific SSE pattern that reasoning models don't follow; non-reasoning models eventually emit it (hence OR auto returning at 17min instead of never)
Per #71160, follow-up turns require sending reasoning_content back, but the adapter doesn't preserve it — causing later validation hangs (we wouldn't be triggering this since we're hitting first-turn hangs)
Per #69902, adapter doesn't forward provider-specific "disable reasoning" parameters (thinking: false for Kimi, think: false for Ollama qwen3) — model always reasons, parser hangs

Workaround currently in place

Reverted primary to anthropic/claude-sonnet-4-6. Anthropic-messages adapter works fine. OpenAI-completions adapter is unusable for any reasoning-enabled model in 2026.4.15.

Why this is high-impact

The openai-completions adapter is the path for nearly every cheap modern alternative to Anthropic — DeepSeek V4 reasoners, GLM-4.x reasoning, Kimi K2.5/K2.6, MiniMax M2.x, Qwen reasoning variants. With reasoning models being the dominant cost-effective frontier tier in mid-2026, this bug effectively blocks OpenClaw users from any modern non-Anthropic primary.

Available for triage

Happy to provide:

Full server-side logs (sanitized, with my chat IDs / API keys redacted)
Config dumps before/after each test
Raw HTTP captures of the upstream responses

Just ping me with what would help.

extent analysis

TL;DR

The openai-completions adapter in OpenClaw 2026.4.15 hangs indefinitely for reasoning models, and a potential workaround is to modify the adapter to handle reasoning fields or streaming framing correctly.

Guidance

Verify the adapter's response handling: Check how the openai-completions adapter processes responses from reasoning models, specifically how it handles reasoning fields and streaming framing.
Test with modified adapter configuration: Attempt to modify the adapter's configuration to disable reasoning or adjust streaming framing to see if it resolves the hanging issue.
Compare with other adapters: Compare the behavior of the openai-completions adapter with other adapters, such as the Anthropic-messages adapter, to identify potential differences in response handling.
Gather detailed logs and captures: Collect full server-side logs, config dumps, and raw HTTP captures of upstream responses to help with further debugging and analysis.

Example

No code snippet is provided as the issue requires further investigation and analysis of the openai-completions adapter's behavior.

Notes

The issue appears to be related to the openai-completions adapter's handling of reasoning models and streaming framing. Further analysis and debugging are required to determine the root cause and develop a fix.

Recommendation

Apply a workaround by modifying the openai-completions adapter to handle reasoning fields and streaming framing correctly, or revert to using the Anthropic-messages adapter until a fix is available.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #authentication issue #prompt issue #agent setup #task chaining

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

openclaw - 💡(How to fix) Fix [Bug]: openai-completions adapter hangs indefinitely on reasoning models — 5 models / 4 providers reproduced [2 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Fix Action

Fix / Workaround

Workaround currently in place

Code Example

Summary

Environment

Reproduction (any of these)

Symptom in logs

Direct upstream verification

Hypotheses (untested, leaving for the team)

Workaround currently in place

Why this is high-impact

Available for triage

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

TRENDING

openclaw - 💡(How to fix) Fix [Bug]: openai-completions adapter hangs indefinitely on reasoning models — 5 models / 4 providers reproduced [2 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Fix Action

Fix / Workaround

Workaround currently in place

Code Example

Summary

Environment

Reproduction (any of these)

Symptom in logs

Direct upstream verification

Hypotheses (untested, leaving for the team)

Workaround currently in place

Why this is high-impact

Available for triage

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING