claude-code - 💡(How to fix) Fix [BUG] Claude Code parent process consumes ~143 GB of RAM during a long multi-agent iOS/Swift/backend session [2 comments, 3 participants]

vipinnader · 2026-04-20T03:03:35Z

[claude-code] The Claude Code desktop application consumed 143.52 GB of resident memory and triggered macOS's "Your system has run out of application memory" d… The Claude Code desktop application consumed **143.52 GB** of resident memory and triggered macOS's "Your system has run out of application memory" dialog during a prolonged development session involving multiple background subagents, iterative `xcodebuild` test runs, and a long-lived backend dev server run via the `Bash` tool. The machine reached a state where all user apps were paused and Claude was flagged for Force Quit. This reproduces reliably near the end of a long session and has now required the user to force-quit + restart Claude Code twice within the same day. ## Fix / Workaround ## Mitigations tried / discovered None of these are satisfying; they're workarounds for a harness-level issue. ### Preflight Checklist - [x] I have searched [existing issues](https://github.com/anthropics/claude-code/issues?q=is%3Aissue%20state%3Aopen%20label%3Abug) and this hasn't been reported yet - [x] This is a single bug report (please file separate reports for different bugs) - [x] I am using the latest version of Claude Code ### What's Wrong? ## Summary The Claude Code desktop application consumed **143.52 GB** of resident memory and triggered macOS's "Your system has run out of application memory" dialog during a prolonged development session involving multiple background subagents, iterative `xcodebuild` test runs, and a long-lived backend dev server run via the `Bash` tool. The machine reached a state where all user apps were paused and Claude was flagged for Force Quit. This reproduces reliably near the end of a long session and has now required the user to force-quit + restart Claude Code twice within the same day. ## Environment - **OS:** macOS (Apple silicon; Xcode 26.2 simulator runtime, iPhone 16 Pro simulator). - **Claude Code version:** current as of 2026-04-20. - **Session type:** single long conversation thread, 4+ hours of active use. - **Experimental flags set earlier in the session:** `CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1` in `.claude/settings.local.json` (team was created, failed to wake, then abandoned; session was kept running). - **Permission model:** `Bash(*)` blanket-allowed in `.claude/settings.local.json`. ## Observed macOS's Force Quit dialog shows: | App | State | Memory | | --- | --- | --- | | Claude | paused | **143.52 GB** | | Code (VS Code) | paused | 2.69 GB | | Activity Monitor | paused | 78.8 MB | | Terminal | paused | 48.9 MB | | Finder | running | 144.0 MB | Screenshot attached in session. ## Workload during the session The parent Claude Code session was orchestrating three iOS-app tracks with several amplifying patterns that probably compound memory use: 1. **~12–15 background subagents** spawned over the course of the session via the `Agent` tool, some re-used via `SendMessage`. Each produces a JSONL transcript written to `/private/tmp/claude-501/ /tasks/*.output`. Some of those files are 100+ KB. 2. **Repeated `xcodebuild` test runs** after iterative source changes. One specific failure mode amplifies output: an `XCTAssertEqual` on a complex `NatalChart` struct produces a **single-line ~30 KB** struct dump. I reran this test several times while iterating on a fix; each run returned the same giant-line output. 3. **Long-running `npm run dev`** backend server launched in the background (`run_in_background: true`) and deliberately kept running so an iOS simulator could hit it. Its stdout streams continuously to the harness's output file. 4. **Large Python outputs** from an ephemeris extractor that wrote ~23 K terms of generated Swift. The extractor runs quickly but produces long stdout listing all 36 file parse counts. 5. **`git diff` / `git status` / `git log` runs** on a working tree that has multiple large generated files (a 1.7 MB generated `ELP2000Data.swift`, a 3 MB JSON test corpus in `tools/test_corpus.json`). ## Expected Claude Code's parent process should stay in "usable desktop app" territory — hundreds of MB, not hundreds of GB — regardless of: - How many subagents have been spawned in a session - How long background `Bash` commands have been running - How large a single tool result is (with persisted-to-disk fallback for oversized outputs — which the harness seems to try for some outputs already) - How many repeated identical-output tool calls have been made ## Hypotheses about the root cause Listed by likelihood (without access to Claude Code internals): 1. **Per-tool-call output retained in-memory even when persisted to disk.** Several tool results during this session printed `Output too large (NNKB). Full output saved to: /Users/ /.claude/projects/.../tool-results/ .txt` — suggesting the harness has offload logic. But memory kept growing anyway. Possibly the offloaded content is still held in some other structure (UI history, undo buffer, rendered message DOM). 2. **Subagent transcri

claude-code2026-04-20 03:03:35

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

anthropics/claude-code#50980•Fetched 2026-04-20 12:07:44

View on GitHub

Comments

Participants

Timeline

Reactions

Author

Participants

Timeline (top)

labeled ×5commented ×2

The Claude Code desktop application consumed 143.52 GB of resident memory and triggered macOS's "Your system has run out of application memory" dialog during a prolonged development session involving multiple background subagents, iterative xcodebuild test runs, and a long-lived backend dev server run via the Bash tool. The machine reached a state where all user apps were paused and Claude was flagged for Force Quit.

This reproduces reliably near the end of a long session and has now required the user to force-quit + restart Claude Code twice within the same day.

Error Message

Avoid re-running failing tests with large dumps. Filter output with grep -E "(Executed|error:)" before streaming back.

Error Messages/Logs

Root Cause

Hypotheses about the root cause

Fix Action

Fix / Workaround

Mitigations tried / discovered

None of these are satisfying; they're workarounds for a harness-level issue.

RAW_BUFFERClick to expand / collapse

Preflight Checklist

I have searched existing issues and this hasn't been reported yet
This is a single bug report (please file separate reports for different bugs)
I am using the latest version of Claude Code

What's Wrong?

Summary

This reproduces reliably near the end of a long session and has now required the user to force-quit + restart Claude Code twice within the same day.

Environment

OS: macOS (Apple silicon; Xcode 26.2 simulator runtime, iPhone 16 Pro simulator).
Claude Code version: current as of 2026-04-20.
Session type: single long conversation thread, 4+ hours of active use.
Experimental flags set earlier in the session: CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1 in .claude/settings.local.json (team was created, failed to wake, then abandoned; session was kept running).
Permission model: Bash(*) blanket-allowed in .claude/settings.local.json.

Observed

macOS's Force Quit dialog shows:

App	State	Memory
Claude	paused	143.52 GB
Code (VS Code)	paused	2.69 GB
Activity Monitor	paused	78.8 MB
Terminal	paused	48.9 MB
Finder	running	144.0 MB

Screenshot attached in session.

Workload during the session

The parent Claude Code session was orchestrating three iOS-app tracks with several amplifying patterns that probably compound memory use:

~12–15 background subagents spawned over the course of the session via the Agent tool, some re-used via SendMessage. Each produces a JSONL transcript written to /private/tmp/claude-501/<session-uuid>/tasks/*.output. Some of those files are 100+ KB.
Repeated xcodebuild test runs after iterative source changes. One specific failure mode amplifies output: an XCTAssertEqual on a complex NatalChart struct produces a single-line ~30 KB struct dump. I reran this test several times while iterating on a fix; each run returned the same giant-line output.
Long-running npm run dev backend server launched in the background (run_in_background: true) and deliberately kept running so an iOS simulator could hit it. Its stdout streams continuously to the harness's output file.
Large Python outputs from an ephemeris extractor that wrote ~23 K terms of generated Swift. The extractor runs quickly but produces long stdout listing all 36 file parse counts.
git diff / git status / git log runs on a working tree that has multiple large generated files (a 1.7 MB generated ELP2000Data.swift, a 3 MB JSON test corpus in tools/test_corpus.json).

Expected

Claude Code's parent process should stay in "usable desktop app" territory — hundreds of MB, not hundreds of GB — regardless of:

How many subagents have been spawned in a session
How long background Bash commands have been running
How large a single tool result is (with persisted-to-disk fallback for oversized outputs — which the harness seems to try for some outputs already)
How many repeated identical-output tool calls have been made

Hypotheses about the root cause

Listed by likelihood (without access to Claude Code internals):

Per-tool-call output retained in-memory even when persisted to disk. Several tool results during this session printed Output too large (NNKB). Full output saved to: /Users/<user>/.claude/projects/.../tool-results/<id>.txt — suggesting the harness has offload logic. But memory kept growing anyway. Possibly the offloaded content is still held in some other structure (UI history, undo buffer, rendered message DOM).
Subagent transcript files opened but not closed. A dozen+ background subagent outputs were created under /private/tmp/claude-501/<session>/tasks/. If the harness mmaps or opens those transcripts and holds them for the lifetime of the session, that compounds.
Streaming stdout from long-running background processes is buffered without a cap. The backend npm run dev ran for hours, emitting a steady stream of request-response JSON logs. If its output file isn't periodically truncated from the harness's perspective, the in-memory tail grows without bound.
Rendered conversation turns retain raw tool-result payloads (not just references). A long session with many tool calls each emitting kilobytes of output would accumulate. The 30KB-per-failing-test dump reran N times is a clear amplifier.
Electron / renderer process leak unrelated to content size — e.g. DOM nodes for long-scrollback, syntax-highlighted monospace blocks, or message-list virtualization that doesn't recycle off-screen nodes.

Reproduction sketch

Not deterministic but reliably reproduces in my workflow. Approximate recipe:

Open a fresh Claude Code session in a real repo.
Spawn 3–4 background subagents via the Agent tool with run_in_background: true.
Start a long-running background process via Bash(run_in_background: true) — e.g. a Node dev server or tail -f.
Run a heavy tool repeatedly (e.g. xcodebuild ... test) 5–10 times, especially where one test case emits a large diff output.
Send 5–10 SendMessage calls to agents across iterations.
Watch macOS Activity Monitor → the Claude Code process.

Expected within 1–3 hours of active iteration: Claude Code memory rises into the tens of GB and eventually triggers the Force Quit prompt.

Impact

Forces the user to Force Quit and restart Claude Code, losing in-flight agent work that wasn't yet committed (mitigable by the agent aggressively committing — but that adds ceremony).
Machine becomes unresponsive for minutes during swap thrash.
Discourages long-session workflows that Claude Code is otherwise well-suited for.

Mitigations tried / discovered

Restart. Reclaims memory fully; session context survives via committed files + a written plan + memory files the agent maintains.
Avoid re-running failing tests with large dumps. Filter output with grep -E "(Executed|error:)" before streaming back.
Kill long-running background shells between uses rather than leaving them up.
Commit more frequently so a Force Quit costs less work.

None of these are satisfying; they're workarounds for a harness-level issue.

Requested

Confirmation whether this is a known issue with a tracking number.
Ideally: a per-session memory cap with a visible indicator + graceful warning before macOS takes over.
Harness-level backpressure: when a tool result exceeds N KB, persist to disk AND drop it from the active memory representation of the turn (keep only the pointer).
An explicit --debug-memory mode that dumps growth hotspots (per-turn, per-tool, per-subagent).

Notes for maintainers

Session uuid visible in task paths: 97187ef7-dba2-430a-b7ea-ec281e1680be (from /private/tmp/claude-501/-Users-<user>-Documents-GitHub-SkyWiserApp/97187ef7-dba2-430a-b7ea-ec281e1680be/tasks/). If Anthropic wants to inspect the transcript(s) server-side, that id + a rough wall-clock window should be enough to locate them.
Happy to capture an Activity Monitor "Sample" of the runaway process on the next repro if a memory-growth profile would help — just let me know where to send it.

What Should Happen?

Claude Code's parent process should stay in "usable desktop app" territory — hundreds of MB, not hundreds of GB — regardless of:

How many subagents have been spawned in a session
How long background Bash commands have been running
How large a single tool result is (with persisted-to-disk fallback for oversized outputs — which the harness seems to try for some outputs already)
How many repeated identical-output tool calls have been made

Error Messages/Logs

Steps to Reproduce

Reproduction sketch

Not deterministic but reliably reproduces in my workflow. Approximate recipe:

Open a fresh Claude Code session in a real repo.
Spawn 3–4 background subagents via the Agent tool with run_in_background: true.
Start a long-running background process via Bash(run_in_background: true) — e.g. a Node dev server or tail -f.
Run a heavy tool repeatedly (e.g. xcodebuild ... test) 5–10 times, especially where one test case emits a large diff output.
Send 5–10 SendMessage calls to agents across iterations.
Watch macOS Activity Monitor → the Claude Code process.

Expected within 1–3 hours of active iteration: Claude Code memory rises into the tens of GB and eventually triggers the Force Quit prompt.

Claude Model

Opus

Is this a regression?

I don't know

Last Working Version

No response

Claude Code Version

2.1.90

Platform

Anthropic API

Operating System

macOS

Terminal/Shell

Terminal.app (macOS)

Additional Information

No response

extent analysis

TL;DR

Implementing a per-session memory cap with a visible indicator and graceful warning before macOS takes over could help mitigate the issue of Claude Code's parent process consuming excessive memory.

Guidance

Investigate the hypothesis that per-tool-call output is retained in-memory even when persisted to disk, and consider implementing a mechanism to drop the output from active memory representation after a certain size threshold is exceeded.
Review the handling of subagent transcript files to ensure they are properly closed after use, and consider implementing a mechanism to periodically clean up unused files.
Consider implementing a cap on the buffering of stdout from long-running background processes to prevent unbounded memory growth.
Investigate the possibility of an Electron/renderer process leak unrelated to content size, and consider implementing measures to recycle off-screen nodes and reduce memory usage.

Example

No specific code example can be provided without access to the Claude Code internals, but a potential solution could involve modifying the tool result handling to persist large outputs to disk and drop them from memory, such as:

if output_size > MAX_OUTPUT_SIZE:
    # Persist output to disk
    with open(output_file, 'w') as f:
        f.write(output)
    # Drop output from memory
    output = None

Notes

The issue is complex and may require a combination of fixes to fully resolve. The provided guidance is based on the hypotheses presented in the issue report and may not be exhaustive. Further investigation and debugging may be necessary to identify the root cause and implement an effective solution.

Recommendation

Apply a workaround by implementing a per-session memory cap with a visible indicator and graceful warning, as this may help mitigate the issue until a more comprehensive solution can be developed.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #indexing error #inference speed #output truncation #response parsing

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

claude-code - 💡(How to fix) Fix [BUG] Claude Code parent process consumes ~143 GB of RAM during a long multi-agent iOS/Swift/backend session [2 comments, 3 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Error Messages/Logs

Root Cause

Hypotheses about the root cause

Fix Action

Fix / Workaround

Mitigations tried / discovered

Preflight Checklist

What's Wrong?

Summary

Environment

Observed

Workload during the session

Expected

Hypotheses about the root cause

Reproduction sketch

Impact

Mitigations tried / discovered

Requested

Notes for maintainers

What Should Happen?

Error Messages/Logs

Steps to Reproduce

Reproduction sketch

Claude Model

Is this a regression?

Last Working Version

Claude Code Version

Platform

Operating System

Terminal/Shell

Additional Information

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING