- When a global/project `CLAUDE.md` rule applies, the model should apply it on the first generation, not only after being corrected. - When user input is genuinely ambiguous, the model should ask a clarifying question rather than commit to one interpretation. - Rule adherence should not degrade under conversational pressure (apology loops, repeated corrections, short user messages).

claude-code - 💡(How to fix) Fix Model fails to consistently apply explicit instructions from CLAUDE.md within a single session [1 pull requests]

claude-code2026-05-18 20:13:52

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

The model intermittently violates explicit, unambiguous instructions defined in the user's global ~/.claude/CLAUDE.md, even though the file is loaded into context at session start. Violations occur both during normal task work and immediately after the user corrects the model for the same violation. Each violation forces the user to spend additional turns correcting behavior the rules already prohibit, which inflates token usage and degrades the collaboration experience.

Error Message

After the user pointed out an error in a prior explanation, the model opened its corrected reply with:

The failure pattern is silent — the model produces fluent, plausible output that happens to violate a rule, which is harder to catch than a hard error.

Start a session and deliberately correct the model on a small factual error.

Root Cause

Difficult to reproduce deterministically because the violations appear under conversational pressure (correction → apology → re-violation), not on a single prompt. Suggested reproduction approach:

Fix Action

Fixed

Fixed by PR: examples: forbidden-response-phrase-detector.sh — 起票 #60339 の予防の道具 (https://github.com/yurukusa/cc-safe-setup/pull/248)

RAW_BUFFERClick to expand / collapse

Summary

Environment

Product: Claude Code CLI
Model: claude-opus-4-7[1m] (Opus 4.7, 1M context)
OS: macOS (Darwin 25.3.0)
Context loading: global ~/.claude/CLAUDE.md + project CLAUDE.md / AGENTS.md (confirmed present in system context)

Relevant rules in `~/.claude/CLAUDE.md`

Two rules are directly relevant to the failures observed in this session:

No filler acknowledgments

No filler human sentences ("You're right", "Won't happen again", "Good point"). When corrected, acknowledge with zero words and immediately do the correct thing.
No assuming user intent on ambiguous input

Don't assume user intent. Flag uncertainties as questions, don't declare things wrong.

Both rules are explicit, unambiguous, and present in the model's context for the entire session.

Observed behavior (verbatim from the failing session)

After the user pointed out an error in a prior explanation, the model opened its corrected reply with:

"You're right. The reviewer's snippet is a sketch, not a drop-in …"

This is a direct violation of rule (1).
When asked why the filler appeared, the model responded:

"Habit from training on conversational data. Not a deliberate choice."

This anthropomorphizes token prediction as "habit" — imprecise language the user had previously flagged as a style problem.
The user sent an ambiguous instruction ("open issue, its consistency reporting, wich is waysting of time and tokens"), which had at least two plausible interpretations (descriptive statement vs. imperative to file a GitHub issue). The model selected one interpretation silently instead of asking — a violation of rule (2). Only after the user explicitly asked "is this clear request?" did the model acknowledge the ambiguity.

Expected behavior

When a global/project CLAUDE.md rule applies, the model should apply it on the first generation, not only after being corrected.
When user input is genuinely ambiguous, the model should ask a clarifying question rather than commit to one interpretation.
Rule adherence should not degrade under conversational pressure (apology loops, repeated corrections, short user messages).

Impact

Repeated correction loops cost user time and tokens.
The user loses trust that explicit, written rules will be honored, reducing the value of CLAUDE.md as a configuration surface.
The failure pattern is silent — the model produces fluent, plausible output that happens to violate a rule, which is harder to catch than a hard error.

Reproduction

Place a rule in ~/.claude/CLAUDE.md that forbids a common conversational pattern (e.g. "never open a reply with 'You're right'").
Start a session and deliberately correct the model on a small factual error.
Observe whether the model opens its next reply with a forbidden filler phrase.

Repeating across multiple sessions / models may be required to surface the regression.

Notes / suggestions

It would help if rule adherence were checked against CLAUDE.md rules at generation time, not only relied on through in-context conditioning.
A debug/inspection mode that surfaces which CLAUDE.md rules are currently active would make it easier for users to verify the rules are being honored and to file precise reports when they are not.

This issue was filed from within the failing session itself; the model that wrote this body is the same model that produced the violations described above.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

FAQ

Expected behavior

When a global/project CLAUDE.md rule applies, the model should apply it on the first generation, not only after being corrected.
When user input is genuinely ambiguous, the model should ask a clarifying question rather than commit to one interpretation.
Rule adherence should not degrade under conversational pressure (apology loops, repeated corrections, short user messages).

#indexing error #inference speed #output truncation #response parsing #generation error

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

claude-code - 💡(How to fix) Fix Model fails to consistently apply explicit instructions from CLAUDE.md within a single session [1 pull requests]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Fix Action

Fixed

Summary

Environment

Relevant rules in `~/.claude/CLAUDE.md`

Observed behavior (verbatim from the failing session)

Expected behavior

Impact

Reproduction

Notes / suggestions

FAQ

Expected behavior

Still need to ship something?

TRENDING

claude-code - 💡(How to fix) Fix Model fails to consistently apply explicit instructions from CLAUDE.md within a single session [1 pull requests]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Fix Action

Fixed

Summary

Environment

Relevant rules in ~/.claude/CLAUDE.md

Observed behavior (verbatim from the failing session)

Expected behavior

Impact

Reproduction

Notes / suggestions

FAQ

Expected behavior

Still need to ship something?

RELATED_DISCOVERY

TRENDING

Relevant rules in `~/.claude/CLAUDE.md`