openclaw - macOS launchd: plugin/config maintenance can leave Control UI in a broken partial-reload state; restart may require `doctor --fix`

Summary

On macOS launchd installs, doing plugin/config maintenance from a live session can leave Gateway in a partially reloaded state where:

- Control UI / webchat opens but shows "unable to connect" / unknown error
- `sessions.resolve` returns `INVALID_REQUEST: No session found`
- `openclaw gateway restart` can fail to recover cleanly
- `openclaw doctor --fix` is required to restore a healthy state

This looks like a product gap in restart/reload resilience, even though the triggering action was an operator mistake.

Environment

- Host OS: macOS
- OpenClaw: 2026.5.22
- Service mode: launchd / LaunchAgent
- Gateway port: 18789
- Control UI client: local webchat / openclaw-control-ui

What happened

I was fixing config/plugin warnings from a live front-channel session and performed:

- config update adding `gateway.auth.rateLimit`
- plugin uninstall/install cycles to pin versions:
  - @openclaw/[email protected]
  - @openclaw/[email protected]
  - @openclaw/[email protected]
  - @tencent-weixin/[email protected]

After that, Control UI became unreliable:

- webchat opened but showed connection failure / unknown error
- websocket/API activity showed repeated `sessions.resolve` failures
- one manual `openclaw gateway restart` reportedly failed
- `openclaw doctor --fix` was needed before restart/startup recovered reliably

Observed behavior

From logs on 2026-05-25:

- Gateway entered an intermediate startup with only 5 plugins loaded:
  - acpx, browser, codex, memory-core, openai
  - channel plugins were not yet fully back
- During that window, Control UI requests hit:
  - `sessions.resolve` -> `INVALID_REQUEST: No session found`
- Restart was deferred for an extended period while operations/runs were still active:
  - restart still deferred after 30092ms
  - restart still deferred after 60270ms
  - restart still deferred after 90315ms
  - restart still deferred after 120360ms
- A stale process had to be killed before restart:
  - killing 1 stale gateway process(es) before restart: 45762
- Only later did Gateway come back fully with 7 plugins:
  - acpx, browser, codex, feishu, memory-core, openai, openclaw-weixin

This left the user-facing Control UI in a bad state that surfaced as a generic connection failure instead of a clear "gateway is reloading / session index unavailable / retry shortly" state.

Expected behavior

- Plugin/config maintenance should not leave Control UI in a partially alive but broken state
- If Gateway is mid-reload, Control UI should get a clear degraded-state response instead of a generic unknown error
- `openclaw gateway restart` should recover from stale processes/port handoff more robustly
- `doctor --fix` should not be the normal escape hatch for restart failures
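
To make the second expectation concrete, here is a minimal sketch of what a degraded-state response could look like. It is not OpenClaw's actual code: `GatewayPhase`, `RpcError`, `resolve_session`, the `GATEWAY_RELOADING` code, and the `retry_after_ms` field are all hypothetical names chosen for illustration. Only `INVALID_REQUEST` / `No session found` comes from the logs.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class GatewayPhase(Enum):
    """Hypothetical lifecycle phases; not taken from OpenClaw's source."""
    READY = "ready"
    RELOADING = "reloading"


@dataclass
class RpcError:
    code: str
    message: str
    retry_after_ms: Optional[int] = None  # hint for the client, assumed field


def resolve_session(phase, sessions, session_id):
    """Return (session, error). Exactly one of the two is None."""
    if phase is GatewayPhase.RELOADING:
        # Tell the client the index is temporarily unavailable instead of
        # claiming the session does not exist.
        return None, RpcError(
            code="GATEWAY_RELOADING",  # hypothetical error code
            message="Gateway is reloading; session index unavailable, retry shortly",
            retry_after_ms=2000,
        )
    session = sessions.get(session_id)
    if session is None:
        # Only a fully ready gateway is allowed to say "not found".
        return None, RpcError(code="INVALID_REQUEST", message="No session found")
    return session, None
```

With a shape like this, the Control UI could distinguish "retry shortly" from a genuinely missing session and show a reloading banner rather than an unknown error.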

Why this seems like a product issue

Yes, the trigger here was operator error: maintenance was performed from a live user session and included disruptive plugin uninstall/install operations.

But the product behavior still seems wrong:

- partial startup is externally visible before runtime is truly healthy
- session APIs can fail with `No session found` during reload windows
- stale gateway processes can survive long enough to poison restart
- recovery path appears to depend on `doctor --fix`

Candidate areas to inspect

- launchd restart / handoff flow on macOS
- stale process cleanup before/after restart
- readiness gating before exposing Control UI as healthy
- behavior of session registry/index during partial reload
- plugin reload sequencing when channel plugins are temporarily absent
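
As an illustration of the readiness-gating idea, the sketch below compares the plugins that are configured against the plugins that are actually loaded, and reports "not ready" while anything is missing or a restart is still deferred. The function name `readiness` and the shape of its result are assumptions for this example, not an existing OpenClaw API. The plugin names are the ones from the logs.

```python
def readiness(expected_plugins, loaded_plugins, deferred_operations=0):
    """Hypothetical readiness check: ready only when every expected plugin
    is loaded and no restart-blocking operations are pending."""
    missing = sorted(set(expected_plugins) - set(loaded_plugins))
    return {
        "ready": not missing and deferred_operations == 0,
        "missing_plugins": missing,
        "deferred_operations": deferred_operations,
    }


EXPECTED = ["acpx", "browser", "codex", "feishu",
            "memory-core", "openai", "openclaw-weixin"]

# The intermediate startup from the logs: 5 of 7 plugins, 2 operations pending.
partial = readiness(
    EXPECTED,
    ["acpx", "browser", "codex", "memory-core", "openai"],
    deferred_operations=2,
)
print(partial)
```

A gate like this would keep the 5-plugin intermediate startup from being presented to Control UI as a healthy Gateway.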

Relevant log snippets

2026-05-25T17:26:25.777+08:00 [gateway] http server listening (5 plugins: acpx, browser, codex, memory-core, openai; 2.1s)
2026-05-25T17:26:56.412+08:00 [ws] ⇄ res ✗ sessions.resolve 1ms errorCode=INVALID_REQUEST errorMessage=No session found
2026-05-25T17:29:27.999+08:00 [gateway/reload] restart still deferred after 30092ms with 2 operation(s), 1 embedded run(s) active
2026-05-25T17:30:58.267+08:00 [gateway/reload] restart still deferred after 120360ms with 2 operation(s), 1 embedded run(s) active
2026-05-25T17:27:28.245+08:00 killing 1 stale gateway process(es) before restart: 45762
2026-05-25T17:32:03.528+08:00 [gateway] http server listening (7 plugins: acpx, browser, codex, feishu, memory-core, openai, openclaw-weixin; 2.7s)
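
For anyone triaging similar logs, the following is a small helper sketch that extracts the key numbers from the snippet above: plugin counts at each "listening" event, the longest deferral, and the time between the first and last "listening" events. The regular expressions are written against these six lines only and are an assumption about the log format, not a documented contract. Lines are sorted by timestamp because the snippet is not in strict chronological order.

```python
import re
from datetime import datetime

# Log lines copied verbatim from the snippet above.
LOG = """\
2026-05-25T17:26:25.777+08:00 [gateway] http server listening (5 plugins: acpx, browser, codex, memory-core, openai; 2.1s)
2026-05-25T17:26:56.412+08:00 [ws] ⇄ res ✗ sessions.resolve 1ms errorCode=INVALID_REQUEST errorMessage=No session found
2026-05-25T17:29:27.999+08:00 [gateway/reload] restart still deferred after 30092ms with 2 operation(s), 1 embedded run(s) active
2026-05-25T17:30:58.267+08:00 [gateway/reload] restart still deferred after 120360ms with 2 operation(s), 1 embedded run(s) active
2026-05-25T17:27:28.245+08:00 killing 1 stale gateway process(es) before restart: 45762
2026-05-25T17:32:03.528+08:00 [gateway] http server listening (7 plugins: acpx, browser, codex, feishu, memory-core, openai, openclaw-weixin; 2.7s)
"""

LISTEN = re.compile(r"http server listening \((\d+) plugins:")
DEFERRED = re.compile(r"restart still deferred after (\d+)ms")


def summarize(log_text):
    listens = []      # (timestamp, plugin_count) for each "listening" line
    deferred_ms = []  # every reported deferral duration
    for line in log_text.splitlines():
        if not line.strip():
            continue
        stamp, _, message = line.partition(" ")
        when = datetime.fromisoformat(stamp)
        match = LISTEN.search(message)
        if match:
            listens.append((when, int(match.group(1))))
        match = DEFERRED.search(message)
        if match:
            deferred_ms.append(int(match.group(1)))
    listens.sort()  # chronological order, regardless of input order
    window = 0.0
    if len(listens) > 1:
        window = (listens[-1][0] - listens[0][0]).total_seconds()
    return {
        "plugin_counts": [count for _, count in listens],
        "max_deferred_ms": max(deferred_ms, default=0),
        "partial_window_s": window,
    }


print(summarize(LOG))
```

On this snippet the helper shows that Gateway went from 5 to 7 plugins and that more than five minutes passed between the partial and the full "listening" events.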

Repro rough sketch

1. Use macOS launchd-managed Gateway
2. From an active front-channel session, perform config changes plus plugin uninstall/install operations affecting loaded plugins/channels
3. Open Control UI while Gateway is mid-reload
4. Observe possible unknown-error connection state and `sessions.resolve` failures
5. Try `openclaw gateway restart`
6. In some cases, recovery may require `openclaw doctor --fix`
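
While reproducing, it can help to record when the Gateway port starts accepting connections. The sketch below is a plain TCP probe using only the Python standard library; the helper name `port_is_listening` is made up for this example, and the default port is the 18789 from the Environment section. A successful connect only proves that something is listening. As the logs show, the HTTP server was already listening during the 5-plugin intermediate startup, so this is not a health check.

```python
import socket


def port_is_listening(host="127.0.0.1", port=18789, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds.

    This says nothing about whether plugins or the session index are ready.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    print("gateway port listening:", port_is_listening())
```

Comparing the moment this probe first succeeds with the moment Control UI becomes usable gives a rough measure of how long the partial-startup window is externally visible.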

Notes

I do not think the correct answer is "users should never do live maintenance"; that is good operational advice, but the runtime should still fail more clearly and recover more deterministically than this.
