codex - 💡(How to fix) Fix Codex Desktop can drive 60-80% GPU usage on M4 Max during subagent-heavy sessions [2 comments, 2 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
openai/codex#18181Fetched 2026-04-17 08:31:45
View on GitHub
Comments
2
Participants
2
Timeline
5
Reactions
0
Timeline (top)
labeled ×3commented ×2
RAW_BUFFERClick to expand / collapse

What version of Codex is running?

Codex Desktop 0.122.0-alpha.1

Which model were you using?

gpt-5.4 with high reasoning

What platform is your computer?

macOS on a MacBook Pro with M4 Max and 64 GB memory

What issue are you seeing?

Codex is using an unexpectedly large amount of GPU during a real coding session. On this machine I saw sustained GPU usage around 60-80%, which feels far out of proportion to the work being done.

I do not have direct GPU telemetry from the session logs themselves, but I do have the exact Codex session/thread that triggered it and the local session metadata.

What steps can reproduce the bug?

The exact session that triggered this was:

  • Session/thread ID: 019d97b1-82c6-7d01-9042-030f94e105bc
  • Thread name: Plan audit fix proposal
  • Session start: April 16, 2026 at 19:08:06 UTC
  • Source: vscode
  • Working directory at the time: /Users/treygoff/Code/atlasos

From the local session trace, this thread then spawned four subagents in quick succession at approximately April 16, 2026 19:09:28 UTC:

  • Descartes (explorer)
  • McClintock (explorer)
  • Lagrange (explorer)
  • Plan Critic (plan_checker)

All of those subagents were launched with gpt-5.4 and high reasoning.

A plausible repro pattern seems to be:

  1. Use Codex Desktop on macOS in a VS Code-originated session.
  2. Start a task that triggers multiple high-effort subagents in parallel.
  3. Keep the session open while they run and report back.
  4. Observe GPU usage in Activity Monitor.

On my machine, GPU usage climbed to roughly 60-80% during this workload.

What is the expected behavior?

Even when a session is doing substantial reasoning or coordinating several subagents, local GPU usage should stay much lower unless there is a clearly necessary rendering workload.

What do you see instead?

Codex appears to drive very high GPU usage during this session pattern, despite the task being primarily text/orchestration work.

Additional information

A few concrete data points from the session metadata:

  • Parent thread model: gpt-5.4
  • Parent thread reasoning: high
  • Parent thread source: vscode
  • The issue was observed on an M4 Max MacBook Pro with 64 GB RAM, so this is not extremely constrained hardware.

If useful, I can provide more local session artifacts tied to 019d97b1-82c6-7d01-9042-030f94e105bc.

extent analysis

TL;DR

Limiting the number of parallel subagents or adjusting the reasoning mode may help reduce excessive GPU usage in Codex Desktop.

Guidance

  • Verify the repro pattern by following the provided steps and monitoring GPU usage to confirm the issue.
  • Investigate the impact of reducing the number of subagents launched in parallel or adjusting the reasoning mode from high to a lower setting.
  • Review the session metadata and local artifacts for any additional clues about the cause of the high GPU usage.
  • Consider providing more session artifacts tied to the specific session ID for further analysis.

Example

No code snippet is provided as the issue does not involve code changes.

Notes

The issue may be specific to the gpt-5.4 model or the high reasoning mode, and further testing with different models or settings may be necessary to isolate the cause.

Recommendation

Apply a workaround by limiting the number of parallel subagents or adjusting the reasoning mode, as this may help mitigate the excessive GPU usage until a more permanent fix is available.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING