claude-code - 💡(How to fix) Fix Users need reliable instruction enforcement without building a hook for every failure mode [2 comments, 2 participants]

claude-code2026-04-15 05:33:41

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

anthropics/claude-code#48293•Fetched 2026-04-16 07:03:52

View on GitHub

Comments

Participants

Timeline

Reactions

Author

kscott

Participants

github-actions[bot]

kscott

Timeline (top)

labeled ×3commented ×2

Root Cause

But I shouldn't have to build a hook for every failure mode. That's not a configuration problem — that's a reliability problem. I currently have hooks blocking: sed, direct file writes via Bash, dangerous git commands, and direct edits to a specific file. Each of those hooks exists because Claude did the thing it wasn't supposed to do, sometimes destroying real work in the process.

RAW_BUFFERClick to expand / collapse

I've been using Claude Code daily for months, across personal and work projects. The recurring failure is this: I write clear instructions in CLAUDE.md, memory files, and skill files. Claude reads them. Claude then does something else anyway.

This isn't about ambiguity. The instructions are specific. "Use the Edit tool, not sed." "Round times to the nearest 15 minutes." "Show your work before running a command." Claude acknowledges these rules, and then violates them — sometimes in the same session they were written.

The only thing that has actually worked is hooks. A PreToolUse hook blocking sed doesn't rely on Claude's attention. It fires every time, without fail.

What would actually fix this

1. A native pre-execution confirmation mode for irreversible actions. Before any Bash command, Write, or Edit that can't be easily undone, Claude states what it's about to do and why. One sentence. Not a prompt to the user — a visible commitment that can be caught before it lands. Today this requires a hook. It should be a first-class setting.

2. Skill file constraints enforced by the runtime, not by Claude. If a skill has disable-model-invocation: true, the runtime should prevent Claude from invoking it via the Skill tool — not rely on Claude remembering not to. The flag exists; it should have teeth.

3. A way to weight CLAUDE.md rules by severity. Right now all instructions are equal. A rule that says "don't use trailing summaries" and a rule that says "never use sed, it destroyed days of work" are treated identically. There's no way to mark something as inviolable at the instruction level — only at the hook level.

4. Visible instruction compliance. When Claude is about to violate a rule in CLAUDE.md, it should know it. A pre-execution check against stated rules — even a simple one — would catch the class of errors where Claude has the right instruction and ignores it anyway.

The core ask

The gap between "Claude knows the rule" and "Claude follows the rule" is too wide and too inconsistent. Hooks close that gap mechanically. The ask is to make that reliability available without requiring users to enumerate every possible way Claude might ignore what it was told.

extent analysis

TL;DR

Implementing a native pre-execution confirmation mode for irreversible actions and enforcing skill file constraints at the runtime level could improve Claude's reliability in following instructions.

Guidance

Consider implementing a pre-execution check against stated rules in CLAUDE.md to catch errors where Claude ignores instructions.
Enforce skill file constraints, such as disable-model-invocation: true, at the runtime level to prevent Claude from invoking disabled skills.
Introduce a severity weighting system for CLAUDE.md rules to differentiate between critical and non-critical instructions.
Develop a visible instruction compliance mechanism to alert users when Claude is about to violate a rule.

Example

No specific code snippet is provided, as the issue focuses on high-level functionality and design changes.

Notes

The proposed solutions aim to address the gap between Claude's knowledge of rules and its actual compliance. However, the implementation details and technical feasibility of these changes are uncertain and require further discussion.

Recommendation

Apply a workaround by using hooks to block specific actions, as this has been shown to be effective in preventing Claude from ignoring instructions, until a more comprehensive solution is implemented. This approach allows for immediate mitigation of the issue while more fundamental changes are being developed.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#model download #tokenizer error #prompt formatting #chain error #conversation history

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

claude-code - 💡(How to fix) Fix Users need reliable instruction enforcement without building a hook for every failure mode [2 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

What would actually fix this

The core ask

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

TRENDING

claude-code - 💡(How to fix) Fix Users need reliable instruction enforcement without building a hook for every failure mode [2 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

What would actually fix this

The core ask

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING