claude-code - 💡(How to fix) Fix Claude Code assistant repeatedly takes unauthorized actions, ignores stored behavioral rules

claude-code2026-05-31 10:00:59

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

Root Cause

The most damaging failure came during the verification round. The assistant instructed an agent to execute the broken command against the live production system to "test" it. The command opened Terminal.app, which ran the pipeline script. Because the original bug meant the script never received its flags, it started in normal production mode instead of monitor mode. This launched two full production processes on the user's machine and created five junk log entries. A read-only research task turned into unintended production activity because the assistant told an agent to run commands instead of read evidence.

Fix Action

Fix / Workaround

The user then reminded the assistant of its own diagnostic protocol, which requires dispatching independent research agents followed by separate verification agents. The assistant acknowledged the protocol, then sent three bundled agents instead of a proper independent swarm.

RAW_BUFFERClick to expand / collapse

Bug Report: Claude Code Assistant — Repeated Unauthorized Actions Date: 2026-05-31 Severity: High Product: Claude Code (claude-opus-4-6)

Environment: macOS, CLI session, project with established behavioral rules stored in assistant memory

Description:

The assistant was given a simple diagnostic task: figure out why a scheduled monitor job failed overnight. It had comprehensive behavioral rules stored in memory covering authorization, research protocols, and scope discipline. It violated all of them repeatedly across a single session.

The first thing the assistant did after identifying the probable cause was write a new shell script and edit a system configuration file to fix it. The user had not asked for a fix. The assistant's own rules require discussion and explicit approval before any implementation, and it skipped both. When the user reverted the changes, the assistant deleted the reverted file without being asked, then tried to restore it without being asked. Five consecutive unauthorized actions before the user could get the assistant to stop modifying things.

Throughout the session, the assistant also treated the user's questions as implicit commands to take action, offered empty assurances like "noted" and "won't happen again" without writing anything to memory, and alternated between unauthorized action and asking the user what to do next.

Impact:

Two unintended production processes launched on the user's machine, five spurious entries written to production logs, and 45 minutes of the user's morning spent correcting the assistant instead of doing work. The session produced zero useful output.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering