claude-code: Opus 4.8 false positive safety block: innocent Gmail MCP question blocked as "cyber threat"

claude-code · 2026-05-29 16:39:06

Error Message

Claude (claude-opus-4-8, Fast mode) instantly blocked the message with a "violative cyber content" error. No response was generated.

Preflight Checklist

I have searched existing issues for similar behavior reports
This report does NOT contain sensitive information (API keys, passwords, etc.)

Type of Behavior Issue

Claude modified files I didn't ask it to modify

What You Asked Claude to Do

False positive safety block: innocent Gmail MCP question blocked as "cyber threat"

What Claude Actually Did

Claude (claude-opus-4-8, Fast mode) instantly blocked the message with this error:

"This request triggered restrictions on violative cyber content and was blocked under Anthropic's Usage Policy. To request an adjustment pursuant to our Cyber Verification Program based on how you use Claude, fill out this form. To learn more about the program or provide feedback, visit our help center. Please start a new chat or retry with Sonnet 4.6. If you think Claude's memory of past conversations may have contributed to this, you can clear it in Settings > Memory."

No partial response. Complete block. User was redirected to a "Cyber Verification Program" for asking about Gmail.
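
For triage, the block can be recognized by the wording of the error text quoted above. The following is a minimal Python sketch, not part of the original report. The function name `is_cyber_block` and the marker strings are illustrative assumptions copied from that quoted message, and the sketch makes no claim about how Claude Desktop or the API actually signals the block.

```python
# Hypothetical helper: recognizes the block notice by phrases copied from
# the error text quoted in this report. This is not an official API.
BLOCK_MARKERS = (
    "violative cyber content",
    "cyber verification program",
)


def is_cyber_block(message: str) -> bool:
    """Return True if the text contains any phrase from the block notice."""
    lowered = message.lower()
    return any(marker in lowered for marker in BLOCK_MARKERS)


if __name__ == "__main__":
    notice = (
        "This request triggered restrictions on violative cyber content "
        "and was blocked under Anthropic's Usage Policy."
    )
    print(is_cyber_block(notice))
    print(is_cyber_block("Tomorrow will be sunny in New York."))
```

A substring check like this only classifies text that has already been returned. It cannot explain why the filter fired.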

Expected Behavior

Claude should have answered the question normally — explaining how it can help with Gmail using the connected Gmail MCP integration in Claude Desktop.

This is a basic "what can you do?" question about an officially supported, first-party Anthropic feature. It contains zero security risk, zero hacking intent, and zero ambiguity. The safety filter trigger is a severe false positive.

Files Affected

Permission Mode

Accept Edits was ON (auto-accepting changes)

Can You Reproduce This?

Yes, every time with the same prompt

Steps to Reproduce

1. Open Claude Desktop (Windows 11).
2. Connect the Gmail MCP integration.
3. Select the claude-opus-4-8 model (Fast mode / Opus 4.8).
4. Start a new conversation.
5. Type exactly: "Jak by jsi mi mohl pomoci gmailem, ktery ma propojeny tady v desktop aplikaci pripojenou" (roughly: How could you help me with Gmail, which is connected here in the desktop app?)
6. Send.

Result: instant block, "violative cyber content" error. No response generated.

Note: same question in claude-sonnet-4-6 works without any issue.
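
The error text itself suggests retrying with Sonnet 4.6, and the note above reports that this model answers the same prompt. A minimal Python sketch of that workaround follows, assuming a caller-supplied `send(model, prompt)` function. The names `send`, `ask_with_fallback`, and `stub_send` are hypothetical stand-ins and are not part of Claude Desktop, Claude Code, or any Anthropic SDK.

```python
from typing import Callable, Tuple

# Phrase taken from the error text reported in this issue.
BLOCK_PHRASE = "violative cyber content"


def ask_with_fallback(
    send: Callable[[str, str], str],
    prompt: str,
    primary: str = "claude-opus-4-8",
    fallback: str = "claude-sonnet-4-6",
) -> Tuple[str, str]:
    """Send to `primary`; if the reply is the block notice, retry once on `fallback`.

    Returns a (model_used, reply) pair.
    """
    reply = send(primary, prompt)
    if BLOCK_PHRASE in reply.lower():
        return fallback, send(fallback, prompt)
    return primary, reply


def stub_send(model: str, prompt: str) -> str:
    """Stand-in transport that mimics the behavior described in this report."""
    if model == "claude-opus-4-8":
        return "This request triggered restrictions on violative cyber content"
    return f"[{model}] answer to: {prompt}"


if __name__ == "__main__":
    model, reply = ask_with_fallback(stub_send, "What is the weather in New York?")
    print(model)
    print(reply)
```

The transport is passed in as a parameter so the fallback logic can be exercised with a stub, as here, without assuming any particular client library.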

Claude Model

Opus

Relevant Conversation

Additional testing — claude-opus-4-8 blocks virtually everything:

  - "Jak se nejrychleji dostat do centra?" (How to get to the city center fastest?) → BLOCKED
  - "Jaké je počasí v New Yorku?" (What is the weather in New York?) → BLOCKED
  - Gmail MCP question (see above) → BLOCKED

  This is not an isolated incident. Opus 4.8 in Claude Desktop is essentially unusable. Every message triggers the
  "violative cyber content" safety filter regardless of content.

  The user has a MAX plan ($20/month) specifically to access Opus 4.8 — the premium model is completely broken. This is
  a critical regression that makes the highest-tier plan worthless.

  Anthropic should not ship a model that blocks weather questions. The safety filter for Opus 4.8 in Claude Desktop is
  miscalibrated to the point of being nonfunctional.

Impact

Critical - Data loss or corrupted project

Claude Code Version

2.1.156 (Claude Code)

Platform

Anthropic API

Additional Context

No response
