openclaw - 💡(How to fix) Fix Per-request startup regression in 4.29: ~25–50s overhead before model output (core-plugin-tools 11s, auth 4–12s, model-resolution 8s) [2 comments, 3 participants]

openclaw2026-05-01 11:36:36

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

openclaw/openclaw#75616•Fetched 2026-05-02 05:32:46

View on GitHub

Comments

Participants

Timeline

Reactions

Author

Participants

Timeline (top)

cross-referenced ×3mentioned ×3subscribed ×3commented ×2

v2026.4.29 introduced a per-request startup regression that adds 25–50 seconds of overhead before the model starts streaming. It's reproducible on every embedded-run, not a one-off cold-start cost. Rolling back to v2026.4.23 restores normal latency.

Root Cause

Fix Action

Workaround

npm install -g [email protected] resolves the issue. Config is forward-compatible.

Code Example

phase=attempt-dispatch totalMs=15973 stages=
  workspace:0ms@0ms,
  runtime-plugins:7ms@7ms,
  hooks:1ms@8ms,
  model-resolution:8503ms@8511ms,
  auth:4264ms@12775ms,
  context-engine:2ms@12777ms,
  attempt-dispatch:3196ms@15973ms

phase=stream-ready totalMs=24578 stages=
  workspace-sandbox:250ms@250ms,
  skills:63ms@313ms,
  core-plugin-tools:11004ms@11317ms,
  bootstrap-context:523ms@11840ms,
  bundle-tools:1002ms@12842ms,
  system-prompt:4226ms@17068ms,
  session-resource-loader:2193ms@19261ms,
  agent-session:1ms@19262ms,
  stream-setup:5316ms@24578ms

---

phase=attempt-dispatch totalMs=26504 stages=
  ...
  model-resolution:640ms@642ms,
  auth:11687ms@12329ms,
  context-engine:0ms@12329ms,
  attempt-dispatch:14175ms@26504ms

RAW_BUFFERClick to expand / collapse

Summary

Symptoms after upgrading 4.23 → 4.29

User sends a message. Gateway receives it. ~25–50 seconds pass before the model produces the first byte. Then the model itself runs at normal speed. Subsequent messages do not "warm up" — every single request pays the same overhead.

Embedded-run trace stages on 4.29

Sample request — chat continuation, ~840 char query:

phase=attempt-dispatch totalMs=15973 stages=
  workspace:0ms@0ms,
  runtime-plugins:7ms@7ms,
  hooks:1ms@8ms,
  model-resolution:8503ms@8511ms,
  auth:4264ms@12775ms,
  context-engine:2ms@12777ms,
  attempt-dispatch:3196ms@15973ms

phase=stream-ready totalMs=24578 stages=
  workspace-sandbox:250ms@250ms,
  skills:63ms@313ms,
  core-plugin-tools:11004ms@11317ms,
  bootstrap-context:523ms@11840ms,
  bundle-tools:1002ms@12842ms,
  system-prompt:4226ms@17068ms,
  session-resource-loader:2193ms@19261ms,
  agent-session:1ms@19262ms,
  stream-setup:5316ms@24578ms

Second consecutive request (different runId/session, same conversation, model-resolution now warm):

phase=attempt-dispatch totalMs=26504 stages=
  ...
  model-resolution:640ms@642ms,
  auth:11687ms@12329ms,
  context-engine:0ms@12329ms,
  attempt-dispatch:14175ms@26504ms

The same workload on 4.23 completes end-to-end in well under 10 seconds. (4.23 doesn't emit these trace:embedded-run stage logs, so I can't directly diff numbers, but observed latency is dramatically lower.)

Hot path

core-plugin-tools (11s) is bracketed by prepStages.mark("core-plugin-tools") in dist/selection-*.js, right after createOpenClawCodingTools(...) and applyEmbeddedAttemptToolsAllow(...). Runs on every embedded-run.

auth (4–12s) and model-resolution (8.5s on first hit) also dominate the first phase.

I grepped the dist for EAGER, PRELOAD, cacheTools, etc. — there's no apparent way to amortize this work via config or env.

Reproduction

npm install -g [email protected], restart gateway, send a message — note end-to-end latency.
npm install -g [email protected], restart gateway, send the same message — note ~25–50s overhead before model output.
Check gateway logs for [trace:embedded-run] lines to see the stage breakdown.

Workaround

npm install -g [email protected] resolves the issue. Config is forward-compatible.

Hypothesis

createOpenClawCodingTools() and/or applyEmbeddedAttemptToolsAllow() likely got new sync I/O or schema-resolution work added in 4.29. Same for the auth and model-resolution phases. Either:

the per-request work could be done once at gateway startup and cached, or
the hot path could be made async/parallel with the actual model call.

Happy to provide more traces if useful.

extent analysis

TL;DR

Downgrade to [email protected] to resolve the per-request startup regression.

Guidance

The issue is likely caused by new sync I/O or schema-resolution work added in v2026.4.29, specifically in createOpenClawCodingTools() and applyEmbeddedAttemptToolsAllow().
To verify, check the gateway logs for [trace:embedded-run] lines to see the stage breakdown and compare the latency between v2026.4.23 and v2026.4.29.
The core-plugin-tools, auth, and model-resolution phases are dominating the first phase, and making these phases async/parallel with the actual model call could potentially resolve the issue.
Consider providing more traces to further investigate the issue.

Notes

The provided information suggests that downgrading to v2026.4.23 resolves the issue, but it's unclear what changes were made in v2026.4.29 that caused the regression.

Recommendation

Apply the workaround by downgrading to [email protected], as it is a known fix and the config is forward-compatible.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#environment setup #docker error #permission error #memory optimization #batch processing

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

openclaw - 💡(How to fix) Fix Per-request startup regression in 4.29: ~25–50s overhead before model output (core-plugin-tools 11s, auth 4–12s, model-resolution 8s) [2 comments, 3 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Workaround

Code Example

Summary

Symptoms after upgrading 4.23 → 4.29

Embedded-run trace stages on 4.29

Hot path

Reproduction

Workaround

Hypothesis

extent analysis

TL;DR

Guidance

Notes

Recommendation

Still need to ship something?

TRENDING

openclaw - 💡(How to fix) Fix Per-request startup regression in 4.29: ~25–50s overhead before model output (core-plugin-tools 11s, auth 4–12s, model-resolution 8s) [2 comments, 3 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Workaround

Code Example

Summary

Symptoms after upgrading 4.23 → 4.29

Embedded-run trace stages on 4.29

Hot path

Reproduction

Workaround

Hypothesis

extent analysis

TL;DR

Guidance

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING