openclaw - 💡(How to fix) Fix [Bug]: Scheduled cron runs timeout at ~60s in 2026.4.1 — manual triggers work fine (regression from 2026.3.31) [3 comments, 4 participants]

openclaw2026-04-02 09:21:52

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

openclaw/openclaw#59579•Fetched 2026-04-08 02:42:58

View on GitHub

Comments

Participants

Timeline

Reactions

Author

Participants

Timeline (top)

commented ×3labeled ×2subscribed ×2

After upgrading from 2026.3.31 to 2026.4.1, all scheduled isolated agentTurn cron jobs timeout at ~60s with profileFailureReason: "timeout" at the Anthropic API call level; manually triggering the same jobs via cron run succeeds every time.

Root Cause

The ~60s timeout appears to be an API HTTP request timeout applied only in the scheduled execution path, not the manual cron run path. The payload.timeoutSeconds (tested 300 and 600) and agents.defaults.timeoutSeconds (900) have no effect on this inner timeout. The 7:04 AM run succeeded — possibly because it ran before the gateway was fully loaded with sessions, suggesting the timeout may be related to session/context loading overhead in the scheduled path.

Fix Action

Fix / Workaround

Affected: All users running isolated agentTurn cron jobs on 2026.4.1 Severity: High (blocks all scheduled automation — security briefings, hourly check-ins, recurring tasks) Frequency: 100% of scheduled runs fail; 100% of manual triggers succeed (13/13 observed today) Consequence: All scheduled cron automation is non-functional after upgrading to 2026.4.1. Workaround is manual triggering only, which defeats the purpose of cron scheduling. Downgrade to 2026.3.31 is the only reliable workaround.

Workaround: Set LLM_REQUEST_TIMEOUT=120 as an environment variable in the systemd service file and restart. This overrides the ~60s default and restores scheduled cron functionality.

Code Example

Log entry from scheduled run (8:00 PM, Opus):

{
  "event": "embedded_run_failover_decision",
  "tags": ["error_handling", "failover", "assistant", "surface_error"],
  "stage": "assistant",
  "decision": "surface_error",
  "failoverReason": "timeout",
  "profileFailureReason": "timeout",
  "provider": "anthropic",
  "model": "claude-opus-4-6",
  "profileId": "sha256:154a23a3efe6",
  "fallbackConfigured": false,
  "timedOut": true,
  "aborted": true
}


Evidence table — same job, same day, same config:

| Time | Trigger | Model | Result | Duration |
|------|---------|-------|--------|----------|
| 7:04 AM | Scheduled | Sonnet | ✅ OK | 43s |
| 8:01 AM | Scheduled | Sonnet | ❌ Timeout | 64s |
| 9:01 AM | Scheduled | Sonnet | ❌ Timeout | 61s |
| 10:01 AM | Scheduled | Sonnet | ❌ Timeout | 64s |
| 11:27 AM | **Manual** | Sonnet | ✅ OK | 78s |
| 12:01 PM | Scheduled | Sonnet | ❌ Timeout | 64s |
| 1:01 PM | Scheduled | Sonnet | ❌ Timeout | 64s |
| 3:11 PM | **Manual** | Sonnet | ✅ OK | 39s |
| 4:01 PM | Scheduled | Sonnet | ❌ Timeout | 64s |
| 5:01 PM | Scheduled | Sonnet | ❌ Timeout | 93s |
| 6:01 PM | Scheduled | Sonnet | ❌ Timeout | 61s |
| 7:51 PM | **Manual** | Opus | ✅ OK | 30s |
| 8:00 PM | **Scheduled** | Opus | ❌ Timeout | 64s |

All runs use sessionTarget: "isolated", same Anthropic API key, same gateway.
On 2026.3.31 (previous version), identical crons ran successfully for weeks.

RAW_BUFFERClick to expand / collapse

Bug type

Regression (worked before, now fails)

Beta release blocker

Summary

Steps to reproduce

Have any agentTurn cron job with sessionTarget: "isolated" (tested with both claude-sonnet-4-20250514 and claude-opus-4-6)
Wait for scheduled execution
Job fails at ~60-64s with "Request timed out before a response was generated"
Manually trigger the exact same job via cron run → succeeds every time (30-78s)

Expected behavior

Scheduled cron runs should complete successfully with the same timeout behaviour as manual triggers.

Actual behavior

All scheduled runs abort at ~60s. All manual triggers succeed. Tested across both models, multiple prompts (complex and minimal), various timeoutSeconds values (300, 600), and multiple gateway restarts. The timeout occurs at the Anthropic API request level (profileFailureReason: "timeout"), not the cron job timeout level.

Time	Trigger	Model	Result	Duration
11:27 AM	Manual	Sonnet	✅ OK	78s
12:01 PM	Scheduled	Sonnet	❌ Timeout	64s
1:01 PM	Scheduled	Sonnet	❌ Timeout	64s
3:11 PM	Manual	Sonnet	✅ OK	39s
4:01 PM	Scheduled	Sonnet	❌ Timeout	64s
7:51 PM	Manual	Opus	✅ OK	30s
8:00 PM	Scheduled	Opus	❌ Timeout	64s

OpenClaw version

2026.4.1 (da64a97) — regression from 2026.3.31

Operating system

Ubuntu 24.04 (Linux 6.8.0-106-generic x64)

Install method

npm global

Model

anthropic/claude-sonnet-4-20250514 and anthropic/claude-opus-4-6 (both tested, both fail on schedule)

Provider / routing chain

openclaw -> anthropic (direct, no proxy)

Additional provider/model setup details

Default model: anthropic/claude-opus-4-6 Cron jobs use anthropic/claude-sonnet-4-20250514 (also tested with claude-opus-4-6) Auth: Anthropic API key via auth-profiles.json No proxy, no router, no Cloudflare — direct to Anthropic API Node 22.22.2, systemd-managed gateway service Worked perfectly on 2026.3.31 with identical config and API key

Logs, screenshots, and evidence

Log entry from scheduled run (8:00 PM, Opus):

{
  "event": "embedded_run_failover_decision",
  "tags": ["error_handling", "failover", "assistant", "surface_error"],
  "stage": "assistant",
  "decision": "surface_error",
  "failoverReason": "timeout",
  "profileFailureReason": "timeout",
  "provider": "anthropic",
  "model": "claude-opus-4-6",
  "profileId": "sha256:154a23a3efe6",
  "fallbackConfigured": false,
  "timedOut": true,
  "aborted": true
}


Evidence table — same job, same day, same config:

| Time | Trigger | Model | Result | Duration |
|------|---------|-------|--------|----------|
| 7:04 AM | Scheduled | Sonnet | ✅ OK | 43s |
| 8:01 AM | Scheduled | Sonnet | ❌ Timeout | 64s |
| 9:01 AM | Scheduled | Sonnet | ❌ Timeout | 61s |
| 10:01 AM | Scheduled | Sonnet | ❌ Timeout | 64s |
| 11:27 AM | **Manual** | Sonnet | ✅ OK | 78s |
| 12:01 PM | Scheduled | Sonnet | ❌ Timeout | 64s |
| 1:01 PM | Scheduled | Sonnet | ❌ Timeout | 64s |
| 3:11 PM | **Manual** | Sonnet | ✅ OK | 39s |
| 4:01 PM | Scheduled | Sonnet | ❌ Timeout | 64s |
| 5:01 PM | Scheduled | Sonnet | ❌ Timeout | 93s |
| 6:01 PM | Scheduled | Sonnet | ❌ Timeout | 61s |
| 7:51 PM | **Manual** | Opus | ✅ OK | 30s |
| 8:00 PM | **Scheduled** | Opus | ❌ Timeout | 64s |

All runs use sessionTarget: "isolated", same Anthropic API key, same gateway.
On 2026.3.31 (previous version), identical crons ran successfully for weeks.

Impact and severity

Additional information

Possibly related to #57581 (LiveSessionModelSwitch blocks isolated cron sessions with explicit model override) which was closed in this release cycle. Our cron jobs use explicit model overrides (model: "anthropic/claude-sonnet-4-20250514") in the payload. The fix for #57581 may have introduced or failed to fully resolve the timeout regression for scheduled isolated sessions.

Also see #59265 (agents working in secret on 2026.4.1) for potentially related execution path issues.

Workaround: Set LLM_REQUEST_TIMEOUT=120 as an environment variable in the systemd service file and restart. This overrides the ~60s default and restores scheduled cron functionality.

extent analysis

TL;DR

Setting the LLM_REQUEST_TIMEOUT environment variable to a higher value, such as 120, in the systemd service file and restarting the service may resolve the scheduled cron job timeouts.

Guidance

Verify that the LLM_REQUEST_TIMEOUT environment variable is not already set in the systemd service file or elsewhere in the environment, to ensure that setting it to 120 will have the desired effect.
Test the workaround with different values for LLM_REQUEST_TIMEOUT to determine the minimum value required to prevent timeouts, as setting it too high may introduce other issues.
Investigate the relationship between the gateway's session loading overhead and the scheduled execution path, as the successful 7:04 AM run suggests that this may be a contributing factor to the timeouts.
Review the changes made in the fix for #57581 and #59265 to understand how they may have introduced or failed to fully resolve the timeout regression for scheduled isolated sessions.

Example

To set the LLM_REQUEST_TIMEOUT environment variable in the systemd service file, add the following line to the [Service] section: Environment=LLM_REQUEST_TIMEOUT=120.

Notes

The workaround may not address the underlying cause of the timeouts, and further investigation may be necessary to determine the root cause and implement a permanent fix.
The LLM_REQUEST_TIMEOUT environment variable may have other effects on the system, and its value should be carefully considered to avoid introducing other issues.

Recommendation

Apply the workaround by setting LLM_REQUEST_TIMEOUT=120 as an environment variable in the systemd service file and restarting the service, as this has been shown to restore scheduled cron functionality.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

FAQ

Expected behavior

Scheduled cron runs should complete successfully with the same timeout behaviour as manual triggers.

#api #environment variable #logging issue #authentication issue #request timeout

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

openclaw - 💡(How to fix) Fix [Bug]: Scheduled cron runs timeout at ~60s in 2026.4.1 — manual triggers work fine (regression from 2026.3.31) [3 comments, 4 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Fix / Workaround

Code Example

Bug type

Beta release blocker

Summary

Steps to reproduce

Expected behavior

Actual behavior

OpenClaw version

Operating system

Install method

Model

Provider / routing chain

Additional provider/model setup details

Logs, screenshots, and evidence

Impact and severity

Additional information

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

FAQ

Expected behavior

Still need to ship something?

RELATED_DISCOVERY

TRENDING