openclaw: [Bug] Cron isolated session first LLM call times out in 2026.4.2

GitHub stats
openclaw/openclaw#63489 (fetched 2026-04-09 07:53:11)
Comments: 0
Participants: 1
Timeline: 2 (labeled ×2)
Reactions: 0

Cron job with sessionTarget=isolated fails with LLM request timeout on first call, preventing all isolated cron tasks from executing

Error Message

2026-04-09T09:01:05.534 [agent/embedded] Profile minimax:cn timed out. Trying next account...
2026-04-09T09:01:05.642 [agent/embedded] embedded run failover decision: ... reason=timeout provider=minimax/MiniMax-M2.7-highspeed
2026-04-09T09:02:05.922 [diagnostic] lane task error: lane=nested durationMs=125759 error="FailoverError: LLM request timed out."

Root Cause

Root cause hypothesis: Isolated session bootstrap in 2026.4.2 has excessive fixed overhead (workspace file loading, skills snapshot processing) that causes the first LLM call to exceed the 60s timeout before it even gets to send the request.


Bug type

Regression (worked before, now fails)

Beta release blocker

No

Summary

Cron job with sessionTarget=isolated fails with LLM request timeout on first call, preventing all isolated cron tasks from executing

Steps to reproduce

  1. Create a cron job with sessionTarget: "isolated" (e.g., openclaw cron add with default isolated session target)
  2. Set payload.kind to agentTurn with a simple task (e.g., run a Python script and send a message)
  3. Schedule to trigger at a specific time (e.g., 0 9 * * 1-5)
  4. Wait for cron to fire
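
To make the schedule in step 3 concrete, here is a minimal Python sketch that checks whether a timestamp matches 0 9 * * 1-5 (minute 0, hour 9, Monday through Friday). It hard-codes this one expression instead of parsing cron syntax, and the function name is hypothetical, not part of openclaw.

```python
from datetime import datetime


def matches_weekday_9am(ts: datetime) -> bool:
    """Return True if ts falls on the schedule `0 9 * * 1-5`.

    Cron day-of-week 1-5 means Monday..Friday, which lines up with
    datetime.isoweekday() values 1..5.
    """
    return ts.minute == 0 and ts.hour == 9 and 1 <= ts.isoweekday() <= 5


# 2026-04-09 is a Thursday, so a 09:00 trigger on that date matches.
thursday_trigger = matches_weekday_9am(datetime(2026, 4, 9, 9, 0))
# 2026-04-11 is a Saturday, so the job would not fire.
saturday_trigger = matches_weekday_9am(datetime(2026, 4, 11, 9, 0))
print(thursday_trigger, saturday_trigger)
```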

Observed timeline on 2026-04-09:

  • 09:00:00 — cron triggered
  • 09:00:05 — first LLM call issued
  • 09:01:05 — exactly 60s later, timeout: "Profile minimax:cn timed out. Trying next account..."
  • 09:02:05 — second attempt also times out
  • Final error: "FailoverError: LLM request timed out."
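
The timeline can be cross-checked against the gateway log timestamps with a little arithmetic. The sketch below is only an illustration under two assumptions that the report does not confirm: that the lane=nested durationMs clock starts at the cron trigger, and that each attempt is bounded by the same 60-second request timeout.

```python
from datetime import datetime, timedelta

# Timestamps and duration copied from the gateway log excerpts in the report.
first_timeout = datetime.fromisoformat("2026-04-09T09:01:05.534+08:00")
second_timeout = datetime.fromisoformat("2026-04-09T09:02:05.915+08:00")
lane_error = datetime.fromisoformat("2026-04-09T09:02:05.922+08:00")
lane_duration_ms = 125759  # durationMs on the lane=nested error line

# Assumption: the lane duration is measured from the moment the cron fired.
lane_start = lane_error - timedelta(milliseconds=lane_duration_ms)

start_to_first_timeout = (first_timeout - lane_start).total_seconds()
between_timeouts = (second_timeout - first_timeout).total_seconds()

# Assumption: the first attempt ran for a full 60 s before timing out,
# so whatever remains is time spent before the first call was issued.
time_before_first_call = start_to_first_timeout - 60.0

print(f"lane start:              {lane_start.isoformat()}")
print(f"start to first timeout:  {start_to_first_timeout:.3f} s")
print(f"between the timeouts:    {between_timeouts:.3f} s")
print(f"time before first call:  {time_before_first_call:.3f} s")
```

Under those assumptions the numbers line up with the timeline: roughly 5 seconds pass before the first call, and each of the two attempts then consumes about 60 seconds.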

Expected behavior

The isolated session should complete bootstrap and execute the cron payload within a reasonable time. The first LLM call should not time out on a lightweight task with the same model that works normally in webchat/feishu sessions.

Actual behavior

Every isolated cron job times out on the first LLM call with "FailoverError: LLM request timed out." The first timeout fires about 65 seconds after the cron trigger: per the observed timeline, the first call is issued roughly 5 seconds in and then hits the 60s timeout, and the retry on the second profile times out after another 60s. This has been consistently reproducible across multiple dates (2026-03-30, 2026-04-06, 2026-04-08, 2026-04-09).

Key log evidence:

2026-04-09T09:01:05.534 [agent/embedded] Profile minimax:cn timed out. Trying next account...
2026-04-09T09:01:05.642 [agent/embedded] embedded run failover decision: ... reason=timeout provider=minimax/MiniMax-M2.7-highspeed
2026-04-09T09:02:05.922 [diagnostic] lane task error: lane=nested durationMs=125759 error="FailoverError: LLM request timed out."

OpenClaw version

2026.4.2 (d74a122)

Operating system

macOS (Darwin 24.6.0, x64)

Install method

npm global (npm install -g openclaw)

Model

minimax/MiniMax-M2.7-highspeed

Provider / routing chain

minimax direct API (auth profile: minimax:cn)

Additional provider/model setup details

Cron job configured with:

  • sessionTarget: "isolated"
  • agentId: "main"
  • payload.kind: "agentTurn"
  • timeoutSeconds: 300

The same minimax/MiniMax-M2.7-highspeed model works fine in webchat and feishu sessions on the same machine at the same time. The issue is specific to isolated session bootstrap. auth-profiles.json shows two minimax profiles (minimax:cn and minimax:default), both timing out during the isolated session first call.
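
A rough time budget helps show which timeout is actually firing. The sketch below is a back-of-the-envelope model, not openclaw's real scheduling logic: the function name is hypothetical, the 60-second per-request timeout is inferred from the logs, and the 5-second lead time comes from the observed timeline.

```python
def worst_case_seconds(lead_time_s: float,
                       per_request_timeout_s: float,
                       profiles: list[str]) -> float:
    """Time until the failover chain gives up if every profile times out."""
    return lead_time_s + per_request_timeout_s * len(profiles)


# Values taken from the report; the per-request timeout is an inference.
profiles = ["minimax:cn", "minimax:default"]
job_timeout_s = 300          # timeoutSeconds on the cron job
per_request_timeout_s = 60.0
lead_time_s = 5.0            # trigger at 09:00:00, first call at 09:00:05

total = worst_case_seconds(lead_time_s, per_request_timeout_s, profiles)
fits_in_job_timeout = total < job_timeout_s

print(f"failover chain exhausted after ~{total:.0f} s")
print(f"within the {job_timeout_s} s job timeout: {fits_in_job_timeout}")
```

The modeled total of about 125 seconds is close to the logged durationMs=125759 and well under the job's timeoutSeconds of 300, which suggests the failure comes from the per-request limit and not from the job-level timeout.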

Logs, screenshots, and evidence

Gateway error log excerpts:

2026-04-09T09:01:05.534+08:00 [agent/embedded] Profile minimax:cn timed out. Trying next account...
2026-04-09T09:01:05.642+08:00 [agent/embedded] embedded run failover decision: runId=5cf8ef8b-4ac9-46d9-8f3a-924996033fb7 stage=assistant decision=rotate_profile reason=timeout provider=minimax/MiniMax-M2.7-highspeed profile=sha256:c38c74a5066a
2026-04-09T09:02:05.915+08:00 [agent/embedded] Profile minimax:default timed out. Trying next account...
2026-04-09T09:02:05.919+08:00 [agent/embedded] embedded run failover decision: runId=5cf8ef8b-4ac9-46d9-8f3a-924996033fb7 stage=assistant decision=fallback_model reason=timeout provider=minimax/MiniMax-M2.7-highspeed profile=sha256:b58aa0cc713a
2026-04-09T09:02:05.922+08:00 [diagnostic] lane task error: lane=nested durationMs=125759 error="FailoverError: LLM request timed out."
2026-04-09T09:02:05.924+08:00 [diagnostic] lane task error: lane=session:agent:main:cron:8474672a-4354-4a95-aef9-9a4977bf2142 durationMs=125762 error="FailoverError: LLM request timed out."

Previous occurrences (same pattern):
- 2026-03-30T09:28: isolated cron timeout, durationMs=946218
- 2026-04-06T09:43: isolated cron timeout, durationMs=1161425
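
For anyone triaging similar logs, the failover decision lines can be split into key=value fields. This is a minimal sketch that assumes the line layout shown in the excerpts above; the helper name is hypothetical and the regex is not taken from openclaw.

```python
import re

# The two failover decision lines from the report, verbatim.
LOG_LINES = [
    "2026-04-09T09:01:05.642+08:00 [agent/embedded] embedded run failover decision: "
    "runId=5cf8ef8b-4ac9-46d9-8f3a-924996033fb7 stage=assistant decision=rotate_profile "
    "reason=timeout provider=minimax/MiniMax-M2.7-highspeed profile=sha256:c38c74a5066a",
    "2026-04-09T09:02:05.919+08:00 [agent/embedded] embedded run failover decision: "
    "runId=5cf8ef8b-4ac9-46d9-8f3a-924996033fb7 stage=assistant decision=fallback_model "
    "reason=timeout provider=minimax/MiniMax-M2.7-highspeed profile=sha256:b58aa0cc713a",
]

KEY_VALUE = re.compile(r"(\w+)=(\S+)")


def parse_decision(line: str) -> dict[str, str]:
    """Split one failover decision line into its timestamp and key=value fields."""
    timestamp, rest = line.split(" ", 1)
    fields = dict(KEY_VALUE.findall(rest))
    fields["timestamp"] = timestamp
    return fields


decisions = [parse_decision(line) for line in LOG_LINES]
for d in decisions:
    print(d["timestamp"], d["decision"], d["reason"], d["profile"])
```

Both decisions share one runId and the same reason=timeout: the first rotates to the next profile, and the second gives up on the model via fallback_model.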

Impact and severity

  • Affected: All cron jobs configured with sessionTarget="isolated" (background scheduled tasks)
  • Severity: High. Completely blocks the cron feature, making it unusable
  • Frequency: 100%. Every isolated cron attempt fails on first call
  • Consequence: Scheduled background tasks (monitoring, reports, alerts) never execute. Users must manually run tasks that should be automated.

Additional information

This is a regression: isolated cron sessions worked (or at least did not time out) in earlier versions. The bootstrap phase appears to have abnormally long overhead in the 2026.4.2 isolated session environment (the first timeout arrives ~65 seconds after the cron trigger).

Root cause hypothesis: Isolated session bootstrap in 2026.4.2 has excessive fixed overhead (workspace file loading, skills snapshot processing) that causes the first LLM call to exceed the 60s timeout before it even gets to send the request.

Note: The cron job has deleteAfterRun: true, yet failed sessions still leave zombie entries in sessions.json that are never cleaned up.

extent analysis

TL;DR

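The job in the report already sets timeoutSeconds: 300, so the 60s limit seen in the logs appears to be a per-request LLM timeout, not the job timeout. As a workaround on 2026.4.2, raise whichever timeout governs the first LLM request in isolated cron runs, if your configuration exposes one.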

Guidance

  1. Verify the timing: Confirm across multiple runs that the first timeout consistently arrives about 65 seconds after the cron trigger, and check how much of that time passes before the first LLM call is issued.
  2. Adjust the timeout configuration: Check which timeout actually fires. The reported job already has timeoutSeconds: 300 while each attempt fails after 60 seconds, so raising timeoutSeconds alone is unlikely to help. Raise the per-request LLM timeout instead, if your setup exposes one.
  3. Monitor and adjust: After changing the timeout, monitor the cron job execution and adjust the value as needed to ensure reliable execution.
  4. Investigate the root cause: Determine whether the overhead in isolated sessions on 2026.4.2 is a bug or a configuration issue.
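
Step 1 can be partly automated by pulling the durationMs values out of the gateway log and comparing them across runs. The sketch below assumes the "lane task error" line format shown in this report; the function name and regex are illustrative, not part of openclaw.

```python
import re

# Sample taken from the lane task error lines in the report.
SAMPLE_LOG = """\
2026-04-09T09:02:05.922+08:00 [diagnostic] lane task error: lane=nested durationMs=125759 error="FailoverError: LLM request timed out."
2026-04-09T09:02:05.924+08:00 [diagnostic] lane task error: lane=session:agent:main:cron:8474672a-4354-4a95-aef9-9a4977bf2142 durationMs=125762 error="FailoverError: LLM request timed out."
"""

LANE_ERROR = re.compile(r"lane task error: lane=(\S+) durationMs=(\d+)")


def lane_durations(log_text: str) -> list[tuple[str, float]]:
    """Return (lane, duration in seconds) for every lane task error in the log."""
    return [(lane, int(ms) / 1000) for lane, ms in LANE_ERROR.findall(log_text)]


durations = lane_durations(SAMPLE_LOG)
for lane, seconds in durations:
    print(f"{seconds:8.3f} s  {lane}")
```

Running this over logs from several dates would show whether the failure time stays near 125 seconds or varies from run to run.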

Example

// Cron job configuration as described in the report.
// timeoutSeconds is already 300, so the 60s limit in the logs is not this value.
{
  "sessionTarget": "isolated",
  "agentId": "main",
  "payload": {
    "kind": "agentTurn"
  },
  "timeoutSeconds": 300 // Job-level timeout; raise it only if the whole run needs longer
}

Notes

The increased timeout is a temporary workaround to mitigate the issue. It's essential to investigate and address the root cause of the excessive bootstrap overhead to prevent similar issues in the future.

Recommendation

Apply the workaround by increasing the timeout for isolated cron jobs, as this will allow the cron feature to function temporarily while the root cause is being investigated.
