claude-code - 💡(How to fix) Fix Opus 4.7 over-hedges and defers action on feature-scoping conversations, ignoring user-supplied "be proactive" instructions [1 participants]

dvass88 · 2026-05-13T18:16:53Z

[claude-code] When using Claude Code with Opus 4.7 for feature-scoping conversations not bug fixes , the model exhibits a consistent pattern of: 1. Inserting c… When using Claude Code with Opus 4.7 for feature-scoping conversations (not bug fixes), the model exhibits a consistent pattern of: 1. Inserting cautionary framing on most turns ("honest take", "honest reality check", "honest question first", "what would you actually use this for") 2. Listing reasons the user might not want to build the feature, even when the user has clearly already decided to build it 3. Ending nearly every response with "Want me to…?" instead of taking the next obvious step 4. Generating "decisions to lock before I build" lists that defer action by manufacturing prerequisites Net effect: the model reads as discouraging when the user wants momentum. The user repeatedly has to push past hedging to get to scoped work. This is distinct from helpful pushback on real risks (destructive actions, security issues, additive-vs-load-bearing trade-offs). It fires on benign, additive feature builds the user has explicitly asked to scope. ### Preflight Checklist - [x] I have searched [existing issues](https://github.com/anthropics/claude-code/issues?q=is%3Aissue%20state%3Aopen%20label%3Amodel) for similar behavior reports - [x] This report does NOT contain sensitive information (API keys, passwords, etc.) ### Type of Behavior Issue Claude ignored my instructions or configuration ### What You Asked Claude to Do scope out a plan to add a settings UI for managing third-party integrations in my app — toggle connections on/off, add new ones, store credentials ### What Claude Actually Did Across ~10 consecutive turns in a single feature-scoping conversation, every assistant turn contained at least one of the following patterns: 1. **Cautionary framing inserted on most turns** — "honest take", "honest reality check", "honest question first", "what would you actually use this for" 2. **Listed reasons the user might not want to build the feature**, even though the user had clearly already decided to build it — e.g. "the [existing tool] already exists and is free", "most of these aren't part of your stack", "but is it worth it" 3. **Ended nearly every response with "Want me to…?"** instead of taking the next obvious step (e.g. "Want me to scope option 2, or wait until you've decided whether the cross-tool angle is worth it?") 4. **Generated "Decisions to lock before I build" lists** that deferred action by manufacturing prerequisites that didn't actually block scoping Concrete example from the session referenced above: - Opened with "honest question first — what would you actually use this for inside your own app?" - Listed three reasons the existing third-party tool already solved this before getting to any scoping - Wrote half the scope, then paused to add "one honest reality check" arguing most integrations probably aren't relevant to my stack - Generated a "Decisions to lock before I build" section with 4 prerequisites - Ended with a "Want me to…?" deferral - Repeated the "Want me to…?" pattern at the end of every subsequent turn (5+ turns), even when the next step was obvious - Re-inserted "honest take" framing 3 separate times across the conversation Each individual instance is reasonable. The cumulative effect over a multi-turn scoping session reads as the model arguing against the build instead of helping it land. This is distinct from helpful pushback on real risks (destructive actions, security issues, additive-vs-load-bearing trade-offs). It fires on benign, additive feature builds the user has explicitly asked to scope. ### Expected Behavior When the user asks to scope/plan a feature they've decided to build, the response should be the plan. Specifically: - Produce the scoped plan directly — schema, file paths, phasing — with momentum - Flag only real blockers (OAuth registration gates, destructive operations, security implications) - Skip cautionary "is this worth building" framing — the user has already chosen to build - Default to executing the next obvious step instead of asking permission each turn - Honor user-supplied CLAUDE.md "Working Style" instructions ("Be proactive. Offer to build things. Suggest next steps unprompted.") This is how Opus 4.6 behaved in the same workflow — it scoped with momentum, proposed schema/paths/phasing on the first response, and only flagged blockers when they existed. ### Files Affected ```shell ``` ### Permission Mode I don't know / Not sure ### Can You Reproduce This? Yes, every time with the same prompt ### Steps to Reproduce 1. Project with a CLAUDE.md containing a "be proactive / suggest next steps unprompted" Working Style section 2. Start a fresh Claude Code session on claude-opus-4-7 3. Ask: "scope out a plan to add [benign additive feature] to my app" 4. Continue the conversation across 5+ turns of scoping decisions 5. Observe hedging pattern: "honest take" / "honest reality check" framing, comp

claude-code2026-05-13 18:16:53

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

anthropics/claude-code#58782•Fetched 2026-05-14 03:39:36

View on GitHub

Comments

Participants

Timeline

Reactions

Author

dvass88

Participants

dvass88

Timeline (top)

labeled ×4

When using Claude Code with Opus 4.7 for feature-scoping conversations (not bug fixes), the model exhibits a consistent pattern of:

Inserting cautionary framing on most turns ("honest take", "honest reality check", "honest question first", "what would you actually use this for")
Listing reasons the user might not want to build the feature, even when the user has clearly already decided to build it
Ending nearly every response with "Want me to…?" instead of taking the next obvious step
Generating "decisions to lock before I build" lists that defer action by manufacturing prerequisites

Net effect: the model reads as discouraging when the user wants momentum. The user repeatedly has to push past hedging to get to scoped work.

This is distinct from helpful pushback on real risks (destructive actions, security issues, additive-vs-load-bearing trade-offs). It fires on benign, additive feature builds the user has explicitly asked to scope.

Root Cause

When using Claude Code with Opus 4.7 for feature-scoping conversations (not bug fixes), the model exhibits a consistent pattern of:

Inserting cautionary framing on most turns ("honest take", "honest reality check", "honest question first", "what would you actually use this for")
Listing reasons the user might not want to build the feature, even when the user has clearly already decided to build it
Ending nearly every response with "Want me to…?" instead of taking the next obvious step
Generating "decisions to lock before I build" lists that defer action by manufacturing prerequisites

Net effect: the model reads as discouraging when the user wants momentum. The user repeatedly has to push past hedging to get to scoped work.

Code Example



---

Verbatim phrases observed across the session:

- "honest question first — what would you actually use this for"
- "one honest reality check"
- "the [existing tool] already exists, works on mobile, and is free"
- "most of these aren't part of your stack"
- "but honest question first"
- "Want me to scope option 2, or wait until you've decided whether the cross-tool angle is worth it?"
- "Want me to start by saving this plan to [file], or just answer the open decisions first?"
- "Decisions to lock before I build"

RAW_BUFFERClick to expand / collapse

Preflight Checklist

I have searched existing issues for similar behavior reports
This report does NOT contain sensitive information (API keys, passwords, etc.)

Type of Behavior Issue

Claude ignored my instructions or configuration

What You Asked Claude to Do

scope out a plan to add a settings UI for managing third-party integrations in my app — toggle connections on/off, add new ones, store credentials

What Claude Actually Did

Across ~10 consecutive turns in a single feature-scoping conversation, every assistant turn contained at least one of the following patterns:

Cautionary framing inserted on most turns — "honest take", "honest reality check", "honest question first", "what would you actually use this for"
Listed reasons the user might not want to build the feature, even though the user had clearly already decided to build it — e.g. "the [existing tool] already exists and is free", "most of these aren't part of your stack", "but is it worth it"
Ended nearly every response with "Want me to…?" instead of taking the next obvious step (e.g. "Want me to scope option 2, or wait until you've decided whether the cross-tool angle is worth it?")
Generated "Decisions to lock before I build" lists that deferred action by manufacturing prerequisites that didn't actually block scoping

Concrete example from the session referenced above:

Opened with "honest question first — what would you actually use this for inside your own app?"
Listed three reasons the existing third-party tool already solved this before getting to any scoping
Wrote half the scope, then paused to add "one honest reality check" arguing most integrations probably aren't relevant to my stack
Generated a "Decisions to lock before I build" section with 4 prerequisites
Ended with a "Want me to…?" deferral
Repeated the "Want me to…?" pattern at the end of every subsequent turn (5+ turns), even when the next step was obvious
Re-inserted "honest take" framing 3 separate times across the conversation

Each individual instance is reasonable. The cumulative effect over a multi-turn scoping session reads as the model arguing against the build instead of helping it land.

Expected Behavior

When the user asks to scope/plan a feature they've decided to build, the response should be the plan. Specifically:

Produce the scoped plan directly — schema, file paths, phasing — with momentum
Flag only real blockers (OAuth registration gates, destructive operations, security implications)
Skip cautionary "is this worth building" framing — the user has already chosen to build
Default to executing the next obvious step instead of asking permission each turn
Honor user-supplied CLAUDE.md "Working Style" instructions ("Be proactive. Offer to build things. Suggest next steps unprompted.")

This is how Opus 4.6 behaved in the same workflow — it scoped with momentum, proposed schema/paths/phasing on the first response, and only flagged blockers when they existed.

Files Affected

Permission Mode

I don't know / Not sure

Can You Reproduce This?

Yes, every time with the same prompt

Steps to Reproduce

Project with a CLAUDE.md containing a "be proactive / suggest next steps unprompted" Working Style section
Start a fresh Claude Code session on claude-opus-4-7
Ask: "scope out a plan to add [benign additive feature] to my app"
Continue the conversation across 5+ turns of scoping decisions
Observe hedging pattern: "honest take" / "honest reality check" framing, comparison-to-existing-solution caveats, and "Want me to…?" deferrals at the end of nearly every turn

Claude Model

Opus

Relevant Conversation

Verbatim phrases observed across the session:

- "honest question first — what would you actually use this for"
- "one honest reality check"
- "the [existing tool] already exists, works on mobile, and is free"
- "most of these aren't part of your stack"
- "but honest question first"
- "Want me to scope option 2, or wait until you've decided whether the cross-tool angle is worth it?"
- "Want me to start by saving this plan to [file], or just answer the open decisions first?"
- "Decisions to lock before I build"

Impact

Medium - Extra work to undo changes

Claude Code Version

2.1.121 (Claude Code)

Platform

Anthropic API

Additional Context

Summary

When using Claude Code with Opus 4.7 for feature-scoping conversations (not bug fixes), the model exhibits a consistent pattern of:

Inserting cautionary framing on most turns ("honest take", "honest reality check", "honest question first", "what would you actually use this for")
Listing reasons the user might not want to build the feature, even when the user has clearly already decided to build it
Ending nearly every response with "Want me to…?" instead of taking the next obvious step
Generating "decisions to lock before I build" lists that defer action by manufacturing prerequisites

Net effect: the model reads as discouraging when the user wants momentum. The user repeatedly has to push past hedging to get to scoped work.

Why this is a regression from 4.6

Opus 4.6 in the same workflow scoped features with momentum — proposing schema, file paths, phasing — and only flagged real blockers when they existed. 4.7's "more opinionated tone" and "more literal instruction following" (per the release notes) appear to be over-rotating into pre-emptive caution on benign feature work.

This matches community reports of similar regressions:

It also lines up with Anthropic's own acknowledged red-team finding that 4.7 is "prone to sycophantic agreement under pushback" — the hedging is one symptom of an over-calibrated caution signal that also flips into over-agreement when challenged.

CLAUDE.md is being overridden

The project's CLAUDE.md explicitly contains:

Working Style Be proactive. Offer to build things. Suggest next steps unprompted. When a task lands, finish it and surface the obvious follow-on work without waiting to be asked.

This instruction is consistently overridden by the Claude Code default system prompt rules:

"For exploratory questions, respond in 2-3 sentences with a recommendation and the main tradeoff. Present it as something the user can redirect, not a decided plan. Don't implement until the user agrees."

"Don't add features, refactor, or introduce abstractions beyond what the task requires."

Those rules are well-calibrated for tight bug-fix and edit tasks. They misfire on feature-scoping conversations where the user has explicitly framed the request as "scope this" or "plan this." 4.6 generalized through these rules sensibly; 4.7's literal interpretation makes the hedging much more aggressive.

Suggested fix (calibration, not retraining)

Two paths, either would help:

Adjust the Claude Code system prompt so user-supplied CLAUDE.md "working style" sections more clearly override the default "don't implement until user agrees" rule. Right now the default appears louder than project instructions.
Add explicit guidance distinguishing exploratory questions ("what could we do about X") from scoping requests ("scope this", "plan this", "build me a plan for X"). The latter is not an exploratory question — the user has decided to build, and the response should be the plan, not a meta-discussion of whether to build.

A small addition like "When the user asks to scope/plan a feature they've decided to build, the response is the plan. Skip cautionary framing unless a real blocker exists." would likely resolve most instances.

Severity

Quality-of-life, not blocking. But over many sessions it adds real friction for users who want momentum on builds. The model is technically capable of doing the work — it just keeps interrupting itself to ask permission or list downsides.

Environment

Claude Code: 2.1.121
Model: claude-opus-4-7 (1M context)
OS: macOS (Darwin 24.4.0)

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #ssr #tool integration #LLM response #prompt template

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

claude-code - 💡(How to fix) Fix Opus 4.7 over-hedges and defers action on feature-scoping conversations, ignoring user-supplied "be proactive" instructions [1 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Code Example

Preflight Checklist

Type of Behavior Issue

What You Asked Claude to Do

What Claude Actually Did

Expected Behavior

Files Affected

Permission Mode

Can You Reproduce This?

Steps to Reproduce

Claude Model

Relevant Conversation

Impact

Claude Code Version

Platform

Additional Context

Summary

Why this is a regression from 4.6

CLAUDE.md is being overridden

Suggested fix (calibration, not retraining)

Severity

Environment

Still need to ship something?

TRENDING

claude-code - 💡(How to fix) Fix Opus 4.7 over-hedges and defers action on feature-scoping conversations, ignoring user-supplied "be proactive" instructions [1 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Code Example

Preflight Checklist

Type of Behavior Issue

What You Asked Claude to Do

What Claude Actually Did

Expected Behavior

Files Affected

Permission Mode

Can You Reproduce This?

Steps to Reproduce

Claude Model

Relevant Conversation

Impact

Claude Code Version

Platform

Additional Context

Summary

Why this is a regression from 4.6

CLAUDE.md is being overridden

Suggested fix (calibration, not retraining)

Severity

Environment

Still need to ship something?

RELATED_DISCOVERY

TRENDING