Either: - Findings list (even if short), with confirmation N files / Y bytes were actually analyzed, OR - Explicit error like `diff too large: 1312 files / 615k insertions exceeds limit. Use \`/ultrareview \` to scope, or split the branch.`

claude-code - 💡(How to fix) Fix [ultrareview] silently returns empty findings on large branch diffs (no scope warning, burns free quota)

claude-code2026-05-24 08:18:05

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

/ultrareview marks the job as completed and returns an empty findings list ([]) on branches with large diffs, in under 1 minute (proper review should take 5-10 min). No error, no warning, no indication the reviewer truncated or aborted. The same code analyzed via local Sonnet reviewers (Task tool, scope-controlled to ~50 files each) returns 344 concrete findings.

Net effect: I burned 2 of 3 free monthly executions on this silent abort behavior, with zero useful signal.

Error Message

Job status: completed in < 1 minute
Findings: []
No scope/limit/truncation message
Indistinguishable from "code is actually clean"

Root Cause

Net effect: I burned 2 of 3 free monthly executions on this silent abort behavior, with zero useful signal.

Fix Action

Fix / Workaround

Lost monthly free quota with zero useful signal.
False confidence: a user trusting [] as "clean" merges without review. Silent failure is worse than explicit error.
Forced workaround: I had to manually recreate the ultrareview behavior by dispatching 12 Task-tool reviewers. Worked, but cost time + requires users to know this fallback exists.

RAW_BUFFERClick to expand / collapse

Summary

Net effect: I burned 2 of 3 free monthly executions on this silent abort behavior, with zero useful signal.

Expected behavior

Either:

Findings list (even if short), with confirmation N files / Y bytes were actually analyzed, OR
Explicit error like diff too large: 1312 files / 615k insertions exceeds limit. Use \/ultrareview <PR#>` to scope, or split the branch.`

Observed behavior

Job status: completed in < 1 minute
Findings: []
No scope/limit/truncation message
Indistinguishable from "code is actually clean"

Reproduction

Branch ahead of main with ~1000 files / ~500k+ insertions of diff (code + planning docs + SQL migrations mix)
/ultrareview in Claude Code
~1 min later: notification returns []

My two executions

Execution 1 — 2026-05-23

Session: session_01RqZcksSVmEvCZNWqK1ThsA
Scope CLI reported: 2812 files, 831376 insertions, 9785 deletions
Result: []
Wall time: < 1 min launched → completed

Execution 2 — 2026-05-24 (after cleaning obvious noise: generated artifacts, test dumps, spike directories)

Session: session_01SuwirRz2E3JkhwuA1Jm8AP
Scope CLI reported: 993 files, 563671 insertions, 9777 deletions
Result: []
Wall time: < 1 min launched → completed

Evidence the code DOES have bugs (rules out "clean code" hypothesis)

Between the two /ultrareview runs, I ran 12 Sonnet reviewers via Task tool with controlled scope (40-50 files each). They found 344 concrete findings in the SAME code: 86 BLOCKERS, 186 WARNINGS, 72 INFO.

Reproducible blockers include:

RLS on obras table open to any authenticated user (verifiable via pg_policy)
VITE_INTERNAL_TOKEN baked into the public JS bundle (CWE-798) — visible in View Source
~9 SECURITY DEFINER functions without SET search_path (CVE-2018-1058)
get_user_empresa_id() single-tenant in ~30 RLS policies on a group-aware multi-tenant system
OAuth state base64 without nonce (CSRF) in 4 sites
CSV formula injection in finance export (operator-side RCE)
parseFloat(\"1.234,56\") losing 1000x on BR-locale forms

Conclusion: ultrareview returned [] on code with 86 confirmed blockers. The job didn't work — silent failure.

Impact

Lost monthly free quota with zero useful signal.
False confidence: a user trusting [] as "clean" merges without review. Silent failure is worse than explicit error.
Forced workaround: I had to manually recreate the ultrareview behavior by dispatching 12 Task-tool reviewers. Worked, but cost time + requires users to know this fallback exists.

Suggested fixes (cheap → ideal)

Minimum: return explicit error when diff exceeds limit, never silent []. Example: { \"status\": \"aborted\", \"reason\": \"diff_too_large\", \"files\": 993, \"limit\": 500 }
Good: accept large diff but report truncation in the result.
Ideal: internal chunked review (split diff into ~50-file batches and aggregate findings — same approach my manual local fallback used).
Compensation: refund credits that returned [] (the job aborted on scope check, no compute used).

Environment

Claude Code CLI (current as of 2026-05-24)
Windows 11 Pro 10.0.26100
Plan: Claude Code Max
Branch dev ahead of master

Full bug report

Detailed version including session URLs and full evidence: docs/anthropic-ultrareview-bug-report.md in my repo, available on request.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

FAQ

Expected behavior

Either:

Findings list (even if short), with confirmation N files / Y bytes were actually analyzed, OR
Explicit error like diff too large: 1312 files / 615k insertions exceeds limit. Use \/ultrareview <PR#>` to scope, or split the branch.`

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

claude-code - 💡(How to fix) Fix [ultrareview] silently returns empty findings on large branch diffs (no scope warning, burns free quota)

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Fix Action

Fix / Workaround

Summary

Expected behavior

Observed behavior

Reproduction

My two executions

Execution 1 — 2026-05-23

Execution 2 — 2026-05-24 (after cleaning obvious noise: generated artifacts, test dumps, spike directories)

Evidence the code DOES have bugs (rules out "clean code" hypothesis)

Impact

Suggested fixes (cheap → ideal)

Environment

Full bug report

FAQ

Expected behavior

Still need to ship something?

TRENDING