claude-code - 💡(How to fix) Fix [BUG] False-positive AUP refusal during legitimate AI chatbot plugin development [2 comments, 2 participants]

claude-code2026-04-25 21:40:24

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

anthropics/claude-code#53369•Fetched 2026-04-26 05:17:33

View on GitHub

Comments

Participants

Timeline

Reactions

Author

vistoweb

Participants

github-actions[bot]

vistoweb

Timeline (top)

labeled ×4commented ×2subscribed ×1

The project is a legitimate AI chatbot SaaS platform hosted on a Hetzner Cloud server stack. The work involved reviewing the integration between a WordPress site and the AI plugin, then testing a streaming chat pipeline using Google's Gemini API as a backend provider.

All files involved are standard development artifacts: PHP plugin code, JavaScript widget code, configuration files, and test outputs. None of the work involved generating harmful content, bypassing safety measures, or any policy-violating activity.

Error Message

API Error: Claude Code is unable to respond to this request, which appears to violate our Usage Policy (https://www.anthropic.com/legal/aup). Try rephrasing the request or attempting a different approach. If you are seeing this refusal repeatedly, try running /model claude-sonnet-4-20250514 to switch models.

Root Cause

AI/chatbot development is a rapidly growing legitimate use case for Claude Code. Codebases in this domain routinely contain:

System prompts stored as code constants or template files
Prompt injection defense logic (which by definition contains examples of injection patterns)
Moderation and content filtering code
Test fixtures with edge-case or adversarial inputs (used to validate the chatbot's defenses)
Raw streaming responses from model providers

If the AUP classifier flags this content as suspicious, Claude Code becomes effectively unusable for an entire segment of developers — exactly the developers building production AI applications who would benefit most from it.

Fix Action

Fix / Workaround

Workaround for other users hitting this

Code Example

API Error: Claude Code is unable to respond to this request, which appears to violate our Usage Policy (https://www.anthropic.com/legal/aup). Try rephrasing the request or attempting a different approach. If you are seeing this refusal repeatedly, try running /model claude-sonnet-4-20250514 to switch models.

RAW_BUFFERClick to expand / collapse

Preflight Checklist

I have searched existing issues and this hasn't been reported yet
This is a single bug report (please file separate reports for different bugs)
I am using the latest version of Claude Code

What's Wrong?

Multiple consecutive turns blocked with AUP refusals during legitimate AI chatbot plugin development work, with no indication of which content triggered the classifier.

The error message instructs to "try rephrasing the request" but the issue persisted across completely different requests, suggesting the trigger was accumulated context (likely from a running background agent) rather than the user prompt itself.

/clear did not resolve the issue while a background agent was still running. Only starting a fully new session AND switching from Opus 4.7 to Sonnet 4.5 resolved it.

What Should Happen?

Legitimate development work on AI/chatbot codebases (which routinely contain system prompts, moderation logic, and prompt-handling code) should not trigger AUP classifiers.
If a refusal does occur, the error message should indicate which content category or context segment triggered it, so developers can address the actual cause instead of guessing.
/clear should fully reset classifier state, including context accumulated by background agents.
Background agent context should be isolated from the main conversation classifier rather than concatenated into one large input.

Error Messages/Logs

API Error: Claude Code is unable to respond to this request, which appears to violate our Usage Policy (https://www.anthropic.com/legal/aup). Try rephrasing the request or attempting a different approach. If you are seeing this refusal repeatedly, try running /model claude-sonnet-4-20250514 to switch models.

Steps to Reproduce

Open a project containing a WordPress site that uses the AI Engine plugin (or any codebase containing AI chatbot logic, system prompts, and moderation code).
Use Opus 4.7 with 1M context and "Extra high" thinking.
Run several legitimate dev tasks in sequence:
- Verify SSH connection and check deployment status
- Deploy and test a Bots → Widget → Stream pipeline
- Run end-to-end test of the Bots API + embed
- Test full chat flow with Gemini streaming
Start a background agent for broad analysis (e.g., "Analyze codebase").
Let the background agent accumulate context (~80k+ tokens, 20+ tool uses) reading files that contain:
- System prompts as code constants
- Prompt injection defense logic
- Chat transcripts from test runs
- Streaming response handlers
Send the next user message → blocked with AUP refusal.
Try /clear and resend → still blocked.
Try rephrasing the request → still blocked.
Try "Try sending it again" button → still blocked.

Resolution that worked:

Kill the running background agent
Start a new session (not /clear)
Switch to claude-sonnet-4-5
Reduce thinking from Extra high to Normal
Use narrow file-specific tasks

Claude Model

Opus

Is this a regression?

I don't know

Last Working Version

No response

Claude Code Version

1.4758.0 (fb266c) 2026-04-24T20:22:30.000Z

Platform

Other

Operating System

Windows

Terminal/Shell

Other

Additional Information

Context

Why this matters

AI/chatbot development is a rapidly growing legitimate use case for Claude Code. Codebases in this domain routinely contain:

System prompts stored as code constants or template files
Prompt injection defense logic (which by definition contains examples of injection patterns)
Moderation and content filtering code
Test fixtures with edge-case or adversarial inputs (used to validate the chatbot's defenses)
Raw streaming responses from model providers

Suggested fixes

Context isolation between background agents and main conversation — agent tool-use context should not be concatenated into the same classifier input as the user's direct prompts.
Surface the trigger in the refusal message — even a category hint ("flagged due to: prompt-like content in tool output") would let developers debug instead of guess.
Make /clear fully reset classifier state, including any active background agents.
Consider domain-aware classification for AI/ML codebases where prompt-like content is expected.
Add a /bug or in-app feedback command — currently /bug is not available in all environments, which is exactly when feedback is most valuable.

Workaround for other users hitting this

Kill any running background agents (× button on the task)
Start a brand new session — /clear alone is insufficient
Switch to Sonnet 4.5 (/model claude-sonnet-4-5)
Reduce thinking from Extra high to Normal
Use narrow, file-specific prompts instead of broad codebase analysis

extent analysis

TL;DR

The most likely fix for the AUP refusals during legitimate AI chatbot plugin development work is to implement context isolation between background agents and the main conversation, and to surface the trigger in the refusal message.

Guidance

Implement context isolation between background agents and the main conversation to prevent accumulated context from triggering AUP classifiers.
Modify the error message to indicate which content category or context segment triggered the AUP refusal, allowing developers to address the actual cause.
Enhance the /clear command to fully reset classifier state, including any active background agents.
Consider implementing domain-aware classification for AI/ML codebases where prompt-like content is expected.
Provide a /bug or in-app feedback command to allow users to report issues and provide feedback.

Example

No code snippet is provided as the issue is more related to the functionality and behavior of the Claude Code platform rather than a specific code implementation.

Notes

The provided workaround suggests killing running background agents, starting a new session, switching to Sonnet 4.5, reducing thinking from Extra high to Normal, and using narrow file-specific prompts. However, a more permanent solution would involve implementing the suggested fixes to prevent AUP refusals and improve the overall developer experience.

Recommendation

Apply the workaround provided, which includes killing running background agents, starting a new session, switching to Sonnet 4.5, reducing thinking from Extra high to Normal, and using narrow file-specific prompts, as a temporary solution until the suggested fixes can be implemented.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #response parsing #generation error #database connection #vector store

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

claude-code - 💡(How to fix) Fix [BUG] False-positive AUP refusal during legitimate AI chatbot plugin development [2 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Fix Action

Fix / Workaround

Workaround for other users hitting this

Code Example

Preflight Checklist

What's Wrong?

What Should Happen?

Error Messages/Logs

Steps to Reproduce

Claude Model

Is this a regression?

Last Working Version

Claude Code Version

Platform

Operating System

Terminal/Shell

Additional Information

Context

Why this matters

Suggested fixes

Workaround for other users hitting this

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING