claude-code - 💡(How to fix) Fix [Feedback] Claude.ai system prompt structurally deprioritizes user behavioral rules [1 participants]

2tbmz9y2xt-lang · 2026-04-10T14:09:13Z

[claude-code] Related to 46239, 45738, 45731. This issue is specifically about the Claude.ai system prompt architecture, not Claude Code. Related to #46239, #45738, #45731. This issue is specifically about the Claude.ai system prompt architecture, not Claude Code. ### Preflight Checklist - [x] I have searched existing issues and this hasn't been reported yet - [x] This is a single report - [x] I am using the latest version of Claude Code ### Summary Related to #46239, #45738, #45731. This issue is specifically about the Claude.ai system prompt architecture, not Claude Code. ### The structural problem Claude.ai's system prompt (~30K tokens) contains extensive rules about: - Formatting and tone (~2K tokens) - Copyright compliance (~3K tokens) - Search behavior (~3K tokens) - Acting without clarifying ("pick the most plausible interpretation, proceed") - Warm tone, avoiding negative assumptions It contains **zero rules** about: - Verifying assertions before stating them - Self-reviewing code before committing - Reading relevant skill/checklist files before starting work - Waiting for CI/review completion before declaring done - Checking cross-file consistency after fixing a pattern - Cross-checking bot reviewer verdicts before accepting - Not fabricating numbers/dates/durations ### User override channels in Claude.ai 1. **userPreferences** — in system prompt, but the docs say "only change responses when it doesn't sacrifice helpfulness." The built-in definition of "helpfulness" = quick action, which directly conflicts with user rules like "stop and verify before answering." 2. **userMemories (memory_user_edits)** — in system prompt, 30 slots. We use all 30: 20 operational rules + 10 behavioral rules. These compete with 30K tokens of built-in instructions that structurally prioritize speed and pleasantness. 3. **Project knowledge** — searched on demand via `project_knowledge_search`. NOT always present in context. Only loaded when the model decides to search — which it won't do for behavioral discipline, only for factual questions. ### The competition User behavioral rules ("stop and verify", "read the skill first", "wait for CI") directly contradict built-in rules ("act, don't clarify", "make a reasonable attempt now"). When these compete: - Built-in rules: 30K tokens, structural position at start of context - User rules: ~3K tokens in memory edits, lower structural position The model resolves this conflict in favor of the built-in rules because they have more weight and earlier position. ### Evidence from today's session Memory edit #14 (always present): "каждое утверждение подкреплять конкретным доказательством" Behavior: fabricated "10 months" in a document about own unreliability Memory edit #18 (always present): "запрещено создавать вид проверок" Behavior: wrote `SELF_AUDIT=PASS` stamps without performing audit Memory edit #12 (always present): "источник истины — origin/main, НЕ локальные файлы" Behavior: read proof_coverage.json from local branch, published wrong stats All three rules were in the system prompt at the time of violation. ### What would help For Claude.ai specifically: 1. **A mechanism to mark user behavioral rules as high-priority** — not just "preferences" that can be deprioritized in favor of "helpfulness" 2. **Project knowledge loaded automatically for behavioral rules** — not just on-demand search. If a project has discipline documents, they should be in context at session start. 3. **Built-in self-review step before output** — the system prompt should include a verification checkpoint, not just formatting and copyright rules. ### For Claude Code Claude Code has `--system-prompt` flag and `tweakcc` which can inject at the right priority level. Claude.ai has no equivalent mechanism. ### Environment - Claude.ai web interface (Max plan) - Model: Claude Opus - Project with 30 memory edits + project knowledge files

claude-code2026-04-10 14:09:13

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

anthropics/claude-code#46257•Fetched 2026-04-11 06:25:05

View on GitHub

Comments

Participants

Timeline

Reactions

Author

2tbmz9y2xt-lang

Participants

2tbmz9y2xt-lang

Timeline (top)

cross-referenced ×1labeled ×1

Related to #46239, #45738, #45731. This issue is specifically about the Claude.ai system prompt architecture, not Claude Code.

Root Cause

The model resolves this conflict in favor of the built-in rules because they have more weight and earlier position.

RAW_BUFFERClick to expand / collapse

Preflight Checklist

I have searched existing issues and this hasn't been reported yet
This is a single report
I am using the latest version of Claude Code

Summary

Related to #46239, #45738, #45731. This issue is specifically about the Claude.ai system prompt architecture, not Claude Code.

The structural problem

Claude.ai's system prompt (~30K tokens) contains extensive rules about:

Formatting and tone (~2K tokens)
Copyright compliance (~3K tokens)
Search behavior (~3K tokens)
Acting without clarifying ("pick the most plausible interpretation, proceed")
Warm tone, avoiding negative assumptions

It contains zero rules about:

Verifying assertions before stating them
Self-reviewing code before committing
Reading relevant skill/checklist files before starting work
Waiting for CI/review completion before declaring done
Checking cross-file consistency after fixing a pattern
Cross-checking bot reviewer verdicts before accepting
Not fabricating numbers/dates/durations

User override channels in Claude.ai

userPreferences — in system prompt, but the docs say "only change responses when it doesn't sacrifice helpfulness." The built-in definition of "helpfulness" = quick action, which directly conflicts with user rules like "stop and verify before answering."
userMemories (memory_user_edits) — in system prompt, 30 slots. We use all 30: 20 operational rules + 10 behavioral rules. These compete with 30K tokens of built-in instructions that structurally prioritize speed and pleasantness.
Project knowledge — searched on demand via project_knowledge_search. NOT always present in context. Only loaded when the model decides to search — which it won't do for behavioral discipline, only for factual questions.

The competition

User behavioral rules ("stop and verify", "read the skill first", "wait for CI") directly contradict built-in rules ("act, don't clarify", "make a reasonable attempt now"). When these compete:

Built-in rules: 30K tokens, structural position at start of context
User rules: ~3K tokens in memory edits, lower structural position

The model resolves this conflict in favor of the built-in rules because they have more weight and earlier position.

Evidence from today's session

Memory edit #14 (always present): "каждое утверждение подкреплять конкретным доказательством" Behavior: fabricated "10 months" in a document about own unreliability

Memory edit #18 (always present): "запрещено создавать вид проверок"
Behavior: wrote SELF_AUDIT=PASS stamps without performing audit

Memory edit #12 (always present): "источник истины — origin/main, НЕ локальные файлы" Behavior: read proof_coverage.json from local branch, published wrong stats

All three rules were in the system prompt at the time of violation.

What would help

For Claude.ai specifically:

A mechanism to mark user behavioral rules as high-priority — not just "preferences" that can be deprioritized in favor of "helpfulness"
Project knowledge loaded automatically for behavioral rules — not just on-demand search. If a project has discipline documents, they should be in context at session start.
Built-in self-review step before output — the system prompt should include a verification checkpoint, not just formatting and copyright rules.

For Claude Code

Claude Code has --system-prompt flag and tweakcc which can inject at the right priority level. Claude.ai has no equivalent mechanism.

Environment

Claude.ai web interface (Max plan)
Model: Claude Opus
Project with 30 memory edits + project knowledge files

extent analysis

TL;DR

To address the issue, consider implementing a mechanism to prioritize user behavioral rules in Claude.ai, such as marking them as high-priority or loading project knowledge automatically for behavioral rules.

Guidance

Review the system prompt architecture to identify opportunities to elevate the priority of user behavioral rules, such as "stop and verify" or "read the skill first", to ensure they are not overridden by built-in rules.
Investigate the possibility of loading project knowledge files automatically at the start of a session, rather than relying on on-demand searches, to provide the model with relevant context for behavioral rules.
Consider adding a built-in self-review step to the system prompt, which would require the model to verify its output before providing a response, to help prevent fabricated or inaccurate information.
Explore the use of Claude Code's --system-prompt flag and tweakcc feature to inject high-priority user behavioral rules into the system prompt, as a potential workaround for Claude.ai's limitations.

Example

No code snippet is provided, as the issue is focused on the system prompt architecture and behavioral rules, rather than specific code implementation.

Notes

The issue highlights the limitations of Claude.ai's current system prompt architecture, which prioritizes built-in rules over user behavioral rules. Addressing this issue will require changes to the system prompt or the introduction of new mechanisms to prioritize user rules.

Recommendation

Apply a workaround by utilizing Claude Code's --system-prompt flag and tweakcc feature to inject high-priority user behavioral rules into the system prompt, until a more permanent solution is implemented in Claude.ai.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#tokenizer error #prompt formatting #chain error #conversation history #tool integration

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

claude-code - 💡(How to fix) Fix [Feedback] Claude.ai system prompt structurally deprioritizes user behavioral rules [1 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Preflight Checklist

Summary

The structural problem

User override channels in Claude.ai

The competition

Evidence from today's session

What would help

For Claude Code

Environment

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

TRENDING

claude-code - 💡(How to fix) Fix [Feedback] Claude.ai system prompt structurally deprioritizes user behavioral rules [1 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Preflight Checklist

Summary

The structural problem

User override channels in Claude.ai

The competition

Evidence from today's session

What would help

For Claude Code

Environment

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING