claude-code - 💡(How to fix) Fix [Feedback] Claude.ai system prompt structurally deprioritizes user behavioral rules [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
anthropics/claude-code#46257Fetched 2026-04-11 06:25:05
View on GitHub
Comments
0
Participants
1
Timeline
2
Reactions
0
Participants
Timeline (top)
cross-referenced ×1labeled ×1

Related to #46239, #45738, #45731. This issue is specifically about the Claude.ai system prompt architecture, not Claude Code.

Root Cause

The model resolves this conflict in favor of the built-in rules because they have more weight and earlier position.

RAW_BUFFERClick to expand / collapse

Preflight Checklist

  • I have searched existing issues and this hasn't been reported yet
  • This is a single report
  • I am using the latest version of Claude Code

Summary

Related to #46239, #45738, #45731. This issue is specifically about the Claude.ai system prompt architecture, not Claude Code.

The structural problem

Claude.ai's system prompt (~30K tokens) contains extensive rules about:

  • Formatting and tone (~2K tokens)
  • Copyright compliance (~3K tokens)
  • Search behavior (~3K tokens)
  • Acting without clarifying ("pick the most plausible interpretation, proceed")
  • Warm tone, avoiding negative assumptions

It contains zero rules about:

  • Verifying assertions before stating them
  • Self-reviewing code before committing
  • Reading relevant skill/checklist files before starting work
  • Waiting for CI/review completion before declaring done
  • Checking cross-file consistency after fixing a pattern
  • Cross-checking bot reviewer verdicts before accepting
  • Not fabricating numbers/dates/durations

User override channels in Claude.ai

  1. userPreferences — in system prompt, but the docs say "only change responses when it doesn't sacrifice helpfulness." The built-in definition of "helpfulness" = quick action, which directly conflicts with user rules like "stop and verify before answering."

  2. userMemories (memory_user_edits) — in system prompt, 30 slots. We use all 30: 20 operational rules + 10 behavioral rules. These compete with 30K tokens of built-in instructions that structurally prioritize speed and pleasantness.

  3. Project knowledge — searched on demand via project_knowledge_search. NOT always present in context. Only loaded when the model decides to search — which it won't do for behavioral discipline, only for factual questions.

The competition

User behavioral rules ("stop and verify", "read the skill first", "wait for CI") directly contradict built-in rules ("act, don't clarify", "make a reasonable attempt now"). When these compete:

  • Built-in rules: 30K tokens, structural position at start of context
  • User rules: ~3K tokens in memory edits, lower structural position

The model resolves this conflict in favor of the built-in rules because they have more weight and earlier position.

Evidence from today's session

Memory edit #14 (always present): "каждое утверждение подкреплять конкретным доказательством" Behavior: fabricated "10 months" in a document about own unreliability

Memory edit #18 (always present): "запрещено создавать вид проверок"
Behavior: wrote SELF_AUDIT=PASS stamps without performing audit

Memory edit #12 (always present): "источник истины — origin/main, НЕ локальные файлы" Behavior: read proof_coverage.json from local branch, published wrong stats

All three rules were in the system prompt at the time of violation.

What would help

For Claude.ai specifically:

  1. A mechanism to mark user behavioral rules as high-priority — not just "preferences" that can be deprioritized in favor of "helpfulness"
  2. Project knowledge loaded automatically for behavioral rules — not just on-demand search. If a project has discipline documents, they should be in context at session start.
  3. Built-in self-review step before output — the system prompt should include a verification checkpoint, not just formatting and copyright rules.

For Claude Code

Claude Code has --system-prompt flag and tweakcc which can inject at the right priority level. Claude.ai has no equivalent mechanism.

Environment

  • Claude.ai web interface (Max plan)
  • Model: Claude Opus
  • Project with 30 memory edits + project knowledge files

extent analysis

TL;DR

To address the issue, consider implementing a mechanism to prioritize user behavioral rules in Claude.ai, such as marking them as high-priority or loading project knowledge automatically for behavioral rules.

Guidance

  • Review the system prompt architecture to identify opportunities to elevate the priority of user behavioral rules, such as "stop and verify" or "read the skill first", to ensure they are not overridden by built-in rules.
  • Investigate the possibility of loading project knowledge files automatically at the start of a session, rather than relying on on-demand searches, to provide the model with relevant context for behavioral rules.
  • Consider adding a built-in self-review step to the system prompt, which would require the model to verify its output before providing a response, to help prevent fabricated or inaccurate information.
  • Explore the use of Claude Code's --system-prompt flag and tweakcc feature to inject high-priority user behavioral rules into the system prompt, as a potential workaround for Claude.ai's limitations.

Example

No code snippet is provided, as the issue is focused on the system prompt architecture and behavioral rules, rather than specific code implementation.

Notes

The issue highlights the limitations of Claude.ai's current system prompt architecture, which prioritizes built-in rules over user behavioral rules. Addressing this issue will require changes to the system prompt or the introduction of new mechanisms to prioritize user rules.

Recommendation

Apply a workaround by utilizing Claude Code's --system-prompt flag and tweakcc feature to inject high-priority user behavioral rules into the system prompt, until a more permanent solution is implemented in Claude.ai.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING