openclaw - ✅(Solved) Fix subagent-registry: cleanupBrowserSessionsForLifecycleEnd wrapper invoked twice for same runId in embedded mode [1 pull requests, 1 participants]

Feelw00 · 2026-04-18T18:59:07Z

[openclaw] completeSubagentRun can be invoked in parallel from two completion paths for the same runId in embedded mode: - In-process listener on phase: "end"… `completeSubagentRun` can be invoked in parallel from two completion paths for the same `runId` in embedded mode: - In-process listener on `phase: "end"` — `src/agents/subagent-registry.ts` (via `onAgentEvent`). - Gateway `waitForSubagentCompletion` RPC — `src/agents/subagent-registry-run-manager.ts`. `registerSubagentRun` pairs both unconditionally. Both callers hit `cleanupBrowserSessionsForLifecycleEnd(...)` at `src/agents/subagent-registry-lifecycle.ts` for the same `childSessionKey`, so the wrapper runs twice. # PR #68669: fix(agents): dedupe subagent browser session cleanup wrapper with dispatch flag - Repository: openclaw/openclaw - Author: Feelw00 - State: open | merged: False - Link: https://github.com/openclaw/openclaw/pull/68669 ## Description (problem / solution / changelog) ## Summary - Problem: embedded subagent completion invokes `cleanupBrowserSessionsForLifecycleEnd` wrapper twice for the same `runId`. `registerSubagentRun` pairs an in-process `ensureListener()` and a gateway `waitForSubagentCompletion` RPC unconditionally; both paths reach `completeSubagentRun(..., triggerCleanup: true)` and the wrapper sits outside any dedup guard. - Why it matters: actual CDP `browserCloseTab` IPC is already idempotent via `extensions/browser/src/browser/session-tab-registry.ts` take-and-drain. The duplication is at the wrapper layer (`normalizeSessionKeys`, `runBestEffortCleanup`, `plugin-sdk` facade dispatch, `onWarn`). Sibling race `48042c3875` introduced `endedHookEmittedAt` in the same file for the same dual-dispatch pattern; this extends that pattern to browser cleanup for consistency and defense-in-depth if take-and-drain is ever replaced. - What changed: added `SubagentRunRecord.browserCleanupDispatchedAt?: number`, sync check-then-set guard in `completeSubagentRun` before the cleanup wrapper, and rearm reset in `subagent-registry-run-manager.ts` alongside `endedHookEmittedAt`. New regression test in `subagent-registry-lifecycle.test.ts`. - What did NOT change (scope boundary): no change to `startSubagentAnnounceCleanupFlow`, no change to the bundled browser extension, no change to `completeSubagentRun` behavior for single-caller paths. CDP IPC semantics unchanged. ## Change Type (select all) - [x] Bug fix ## Scope (select all touched areas) - [x] Gateway / orchestration ## Linked Issue/PR - Closes #68668 - Related 48042c3875 - [x] This PR fixes a bug or regression ## Root Cause (if applicable) - Root cause: `registerSubagentRun` fires both `ensureListener()` and `void waitForSubagentCompletion(runId, ...)` unconditionally for every subagent. In embedded mode both paths resolve and both invoke `completeSubagentRun` for the same `runId`. `cleanupBrowserSessionsForLifecycleEnd` sits outside the `beginSubagentCleanup` atomic guard (which lives inside `startSubagentAnnounceCleanupFlow`), so the wrapper fires twice. - Missing detection / guardrail: no per-entry dispatch flag for the browser cleanup side-effect, so double-dispatch was silent at runtime and not covered by any existing test. - Contributing context: `48042c3875` added `endedHookEmittedAt` for the sibling hook-emit race, but only covered the hook dispatch — the browser cleanup wrapper in the same function body remained unguarded. ## Regression Test Plan (if applicable) - Coverage level that should have caught this: - [x] Unit test - Target test or file: `src/agents/subagent-registry-lifecycle.test.ts` — new case `"dedupes browser cleanup when two callers complete the same run in parallel"`. - Scenario the test should lock in: `Promise.all([completeSubagentRun(p), completeSubagentRun(p)])` with the same `runId` + `triggerCleanup: true` must result in `cleanupBrowserSessionsForLifecycleEnd` being called exactly once, with `entry.browserCleanupDispatchedAt` set to a number. - Why this is the smallest reliable guardrail: both production completion paths (`ensureListener` callback in `subagent-registry.ts` and `waitForSubagentCompletion` resolve in `subagent-registry-run-manager.ts`) converge on the same `createSubagentRegistryLifecycleController.completeSubagentRun` helper, so a controller-level parallel invocation exercises the same branch without requiring a full gateway harness. - Existing test that already covers this (if any): none. ## User-visible / Behavior Changes None. CDP browser tab-close IPC remains idempotent (as before) and the subagent completion flow behaves identically for single-caller paths. ## Security Impact (required) - New permissions/capabilities? No - Secrets/tokens handling changed? No - New/changed network calls? No - Command/tool execution surface changed? No - Data access scope changed? No ## Repro + Verification ### Environment - OS: macOS 15.4 - Runtime/container: Node 22 (pnpm 10.33.0) - Model/provider: N/A (unit test only) - Integration/channel: N/A - Re

openclaw2026-04-18 18:59:07

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

openclaw/openclaw#68668•Fetched 2026-04-19 15:08:50

View on GitHub

Comments

Participants

Timeline

Reactions

Author

Feelw00

Participants

Feelw00

Timeline (top)

cross-referenced ×1

completeSubagentRun can be invoked in parallel from two completion paths for the same runId in embedded mode:

In-process listener on phase: "end" — src/agents/subagent-registry.ts (via onAgentEvent).
Gateway waitForSubagentCompletion RPC — src/agents/subagent-registry-run-manager.ts.

registerSubagentRun pairs both unconditionally. Both callers hit cleanupBrowserSessionsForLifecycleEnd(...) at src/agents/subagent-registry-lifecycle.ts for the same childSessionKey, so the wrapper runs twice.

Root Cause

completeSubagentRun can be invoked in parallel from two completion paths for the same runId in embedded mode:

In-process listener on phase: "end" — src/agents/subagent-registry.ts (via onAgentEvent).
Gateway waitForSubagentCompletion RPC — src/agents/subagent-registry-run-manager.ts.

Fix Action

Fix / Workaround

The remaining duplication is at the wrapper layer:

cleanupBrowserSessionsForLifecycleEnd wrapper call (runs twice)
normalizeSessionKeys + runBestEffortCleanup (runs twice)
plugin-sdk facade dispatch (runs twice; resolution itself is cached)
onWarn logging path in the non-empty window
Sibling race 48042c3875 fix(agents): avoid duplicate subagent ended hook loads already introduced endedHookEmittedAt in the same file for the same dual-dispatch pattern. Extending that pattern to browser cleanup keeps the dedup surface consistent.
Closes a defense-in-depth gap if the bundled browser extension ever removes or replaces the take-and-drain idempotency.
Trims wrapper overhead on every embedded subagent completion.

Add a dedicated dispatch flag browserCleanupDispatchedAt?: number on SubagentRunRecord and a sync check-then-set guard in completeSubagentRun that matches the endedHookEmittedAt pattern. Reset the flag on rearm alongside endedHookEmittedAt in subagent-registry-run-manager.ts.

PR fix notes

PR #68669: fix(agents): dedupe subagent browser session cleanup wrapper with dispatch flag

Repository: openclaw/openclaw
Author: Feelw00
State: open | merged: False
Link: https://github.com/openclaw/openclaw/pull/68669

Description (problem / solution / changelog)

Summary

Problem: embedded subagent completion invokes cleanupBrowserSessionsForLifecycleEnd wrapper twice for the same runId. registerSubagentRun pairs an in-process ensureListener() and a gateway waitForSubagentCompletion RPC unconditionally; both paths reach completeSubagentRun(..., triggerCleanup: true) and the wrapper sits outside any dedup guard.
Why it matters: actual CDP browserCloseTab IPC is already idempotent via extensions/browser/src/browser/session-tab-registry.ts take-and-drain. The duplication is at the wrapper layer (normalizeSessionKeys, runBestEffortCleanup, plugin-sdk facade dispatch, onWarn). Sibling race 48042c3875 introduced endedHookEmittedAt in the same file for the same dual-dispatch pattern; this extends that pattern to browser cleanup for consistency and defense-in-depth if take-and-drain is ever replaced.
What changed: added SubagentRunRecord.browserCleanupDispatchedAt?: number, sync check-then-set guard in completeSubagentRun before the cleanup wrapper, and rearm reset in subagent-registry-run-manager.ts alongside endedHookEmittedAt. New regression test in subagent-registry-lifecycle.test.ts.
What did NOT change (scope boundary): no change to startSubagentAnnounceCleanupFlow, no change to the bundled browser extension, no change to completeSubagentRun behavior for single-caller paths. CDP IPC semantics unchanged.

Change Type (select all)

Bug fix

Scope (select all touched areas)

Gateway / orchestration

Linked Issue/PR

Closes #68668
Related 48042c3875
This PR fixes a bug or regression

Root Cause (if applicable)

Root cause: registerSubagentRun fires both ensureListener() and void waitForSubagentCompletion(runId, ...) unconditionally for every subagent. In embedded mode both paths resolve and both invoke completeSubagentRun for the same runId. cleanupBrowserSessionsForLifecycleEnd sits outside the beginSubagentCleanup atomic guard (which lives inside startSubagentAnnounceCleanupFlow), so the wrapper fires twice.
Missing detection / guardrail: no per-entry dispatch flag for the browser cleanup side-effect, so double-dispatch was silent at runtime and not covered by any existing test.
Contributing context: 48042c3875 added endedHookEmittedAt for the sibling hook-emit race, but only covered the hook dispatch — the browser cleanup wrapper in the same function body remained unguarded.

Regression Test Plan (if applicable)

Coverage level that should have caught this:
- Unit test
Target test or file: src/agents/subagent-registry-lifecycle.test.ts — new case "dedupes browser cleanup when two callers complete the same run in parallel".
Scenario the test should lock in: Promise.all([completeSubagentRun(p), completeSubagentRun(p)]) with the same runId + triggerCleanup: true must result in cleanupBrowserSessionsForLifecycleEnd being called exactly once, with entry.browserCleanupDispatchedAt set to a number.
Why this is the smallest reliable guardrail: both production completion paths (ensureListener callback in subagent-registry.ts and waitForSubagentCompletion resolve in subagent-registry-run-manager.ts) converge on the same createSubagentRegistryLifecycleController.completeSubagentRun helper, so a controller-level parallel invocation exercises the same branch without requiring a full gateway harness.
Existing test that already covers this (if any): none.

User-visible / Behavior Changes

None. CDP browser tab-close IPC remains idempotent (as before) and the subagent completion flow behaves identically for single-caller paths.

Security Impact (required)

New permissions/capabilities? No
Secrets/tokens handling changed? No
New/changed network calls? No
Command/tool execution surface changed? No
Data access scope changed? No

Repro + Verification

Environment

OS: macOS 15.4
Runtime/container: Node 22 (pnpm 10.33.0)
Model/provider: N/A (unit test only)
Integration/channel: N/A
Relevant config: N/A

Steps

pnpm install in a fresh checkout.
Add the new regression case to src/agents/subagent-registry-lifecycle.test.ts (or run the existing branch).
Without the fix: pnpm test src/agents/subagent-registry-lifecycle.test.ts -t "dedupes browser cleanup" → expected "vi.fn()" to be called 1 times, but got 2 times.
With the fix: same command → passes.

Expected

cleanupBrowserSessionsForLifecycleEnd fires exactly once; entry.browserCleanupDispatchedAt is a number.

Actual

Matches expected after this PR.

Evidence

Failing test/log before + passing after

Before (on clean upstream/main):

FAIL  |agents| src/agents/subagent-registry-lifecycle.test.ts
  × dedupes browser cleanup when two callers complete the same run in parallel
  AssertionError: expected "vi.fn()" to be called 1 times, but got 2 times

After (this PR):

Test Files  1 passed (1)
Tests  10 passed (10)

Full scoped suite (subagent-registry-lifecycle, subagent-registry-completion, subagent-registry.steer-restart, subagent-registry): 41/41 passing. pnpm tsgo:all, pnpm check, pnpm build all green.

Human Verification (required)

Verified scenarios: controller-level Promise.all parallel invocation with same runId + triggerCleanup: true; pnpm tsgo:all, pnpm check, pnpm build all clean.
Edge cases checked:
- Rearm path (subagent-registry-run-manager.ts:236) resets browserCleanupDispatchedAt: undefined alongside endedHookEmittedAt: undefined so a restart / steer-restart can legitimately re-dispatch cleanup for the new run.
- Verified session-tab-registry.ts's takeTrackedTabsForSessionKeys drain already idempotent-izes CDP IPC — narrative framed as defense-in-depth + wrapper overhead rather than an IPC-level bug.
- Synchronous check-then-set (before any await) — no micro-race between check and set.
What I did not verify: real embedded subagent run against a live browser driver — unit test covers the controller branch both production paths converge on.

Review Conversations

I replied to or resolved every bot review conversation I addressed in this PR.
I left unresolved only the conversations that still need reviewer or maintainer judgment.

Compatibility / Migration

Backward compatible? Yes
Config/env changes? No
Migration needed? No

Risks and Mitigations

Risk: browserCleanupDispatchedAt persisting across disk persistence means a crashed process that already dispatched cleanup will skip the wrapper on restore.
- Mitigation: wrapper-level skip is benign because the underlying session-tab-registry tabs are drained on first call; restart-side orphan handling (subagent-registry.ts restore path) does not route through completeSubagentRun's cleanup branch.

[AI-assisted]

Changed files

src/agents/subagent-registry-lifecycle.test.ts (modified, +35/-0)
src/agents/subagent-registry-lifecycle.ts (modified, +9/-0)
src/agents/subagent-registry-run-manager.ts (modified, +1/-0)
src/agents/subagent-registry.types.ts (modified, +1/-0)

RAW_BUFFERClick to expand / collapse

Summary

completeSubagentRun can be invoked in parallel from two completion paths for the same runId in embedded mode:

In-process listener on phase: "end" — src/agents/subagent-registry.ts (via onAgentEvent).
Gateway waitForSubagentCompletion RPC — src/agents/subagent-registry-run-manager.ts.

Scope

Actual CDP browserCloseTab IPC is already idempotent: extensions/browser/src/browser/session-tab-registry.ts's takeTrackedTabsForSessionKeys drains trackedTabsBySession.delete(sessionKey), and closeTrackedBrowserTabsForSessions short-circuits with tabs.length === 0 on the second call.

The remaining duplication is at the wrapper layer:

cleanupBrowserSessionsForLifecycleEnd wrapper call (runs twice)
normalizeSessionKeys + runBestEffortCleanup (runs twice)
plugin-sdk facade dispatch (runs twice; resolution itself is cached)
onWarn logging path in the non-empty window

Why it matters

Sibling race 48042c3875 fix(agents): avoid duplicate subagent ended hook loads already introduced endedHookEmittedAt in the same file for the same dual-dispatch pattern. Extending that pattern to browser cleanup keeps the dedup surface consistent.
Closes a defense-in-depth gap if the bundled browser extension ever removes or replaces the take-and-drain idempotency.
Trims wrapper overhead on every embedded subagent completion.

Reproduction

Promise.all([completeSubagentRun(p), completeSubagentRun(p)]) for the same runId + triggerCleanup: true calls cleanupBrowserSessionsForLifecycleEnd twice without a guard. See accompanying PR's regression test.

Proposed fix

PR: coming.

[AI-assisted]

extent analysis

TL;DR

Add a dedicated dispatch flag and a sync check-then-set guard in completeSubagentRun to prevent duplicate cleanupBrowserSessionsForLifecycleEnd calls.

Guidance

Introduce a browserCleanupDispatchedAt flag on SubagentRunRecord to track when browser cleanup has been dispatched.
Implement a check-then-set guard in completeSubagentRun to prevent duplicate calls to cleanupBrowserSessionsForLifecycleEnd.
Reset the browserCleanupDispatchedAt flag when rearming the subagent, similar to the endedHookEmittedAt flag.
Verify the fix by running the regression test provided in the accompanying PR.

Example

// Add a dedicated dispatch flag on SubagentRunRecord
interface SubagentRunRecord {
  // ...
  browserCleanupDispatchedAt?: number;
}

// Implement a check-then-set guard in completeSubagentRun
function completeSubagentRun(runId: string) {
  const runRecord = getSubagentRunRecord(runId);
  if (runRecord.browserCleanupDispatchedAt) {
    return; // Cleanup already dispatched, skip
  }
  runRecord.browserCleanupDispatchedAt = Date.now();
  // Call cleanupBrowserSessionsForLifecycleEnd
}

Notes

This fix assumes that the endedHookEmittedAt pattern is already working correctly and can be extended to the browser cleanup logic.

Recommendation

Apply the proposed workaround by introducing the browserCleanupDispatchedAt flag and the check-then-set guard in completeSubagentRun, as it addresses the specific issue of duplicate cleanupBrowserSessionsForLifecycleEnd calls and provides a consistent deduplication surface.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #ssr #installation #tensor shape #request timeout

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

openclaw - ✅(Solved) Fix subagent-registry: cleanupBrowserSessionsForLifecycleEnd wrapper invoked twice for same runId in embedded mode [1 pull requests, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Fix / Workaround

PR fix notes

PR #68669: fix(agents): dedupe subagent browser session cleanup wrapper with dispatch flag

Description (problem / solution / changelog)

Summary

Change Type (select all)

Scope (select all touched areas)

Linked Issue/PR

Root Cause (if applicable)

Regression Test Plan (if applicable)

User-visible / Behavior Changes

Security Impact (required)

Repro + Verification

Environment

Steps

Expected

Actual

Evidence

Human Verification (required)

Review Conversations

Compatibility / Migration

Risks and Mitigations

Changed files

Summary

Scope

Why it matters

Reproduction

Proposed fix

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING