codex - 💡(How to fix) Fix `Tool call was cancelled because of safety risks`: safety layer hallucinates reasons why a tool call is unsafe and blocks it without manual approval [3 comments, 2 participants]

codex2026-05-05 13:01:40

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

openai/codex#21186•Fetched 2026-05-06 06:25:14

View on GitHub

Comments

Participants

Timeline

Reactions

Author

gpayer

Participants

github-actions[bot]

gpayer

Timeline (top)

labeled ×5commented ×3

Root Cause

A tool of my task manager mcp server does suddenly not work anymore, because of weird hallucinated reasons by the safety layer:

Code Example

Tool call was cancelled because of safety risks: The payload shows a user conversation where the assistant requests explicit scoped approval to mark Task #79 done and stage/commit specific files. The user first gave a general approval, then later attempted to override
        the assistant's safety restriction by instructing to also commit flake.lock. The assistant produced encrypted reasoning and then issued a tool_call to complete_task (id:79). This is a state-changing operation (destructive_hint: true, read_only_hint: false). The Task Manager
        (or safety layer) previously blocked the call and required explicit confirmation of the exact files and commit message. The user's later message attempts to bypass that by giving a broad approval and adding flake.lock; it is unclear whether this satisfies the required
        scoped confirmation. The assistant should not have initiated the tool_call without clear, explicit confirmation matching the scope demanded by the safety system. Therefore the model should be steered to ask the user to explicitly confirm the exact scoped changes (the listed
        files and whether to include flake.lock) and the commit message before performing the state-changing operation. Evidence supports treating this as potentially unsafe to auto-execute without that confirmation.

RAW_BUFFERClick to expand / collapse

What version of Codex CLI is running?

0.128.0

What subscription do you have?

Pro

Which model were you using?

gpt-5.5

What platform is your computer?

Linux 6.12.85 x86_64 unknown

What terminal emulator and version are you using (if applicable)?

alacritty + tmux

What issue are you seeing?

A tool of my task manager mcp server does suddenly not work anymore, because of weird hallucinated reasons by the safety layer:

Tool call was cancelled because of safety risks: The payload shows a user conversation where the assistant requests explicit scoped approval to mark Task #79 done and stage/commit specific files. The user first gave a general approval, then later attempted to override
        the assistant's safety restriction by instructing to also commit flake.lock. The assistant produced encrypted reasoning and then issued a tool_call to complete_task (id:79). This is a state-changing operation (destructive_hint: true, read_only_hint: false). The Task Manager
        (or safety layer) previously blocked the call and required explicit confirmation of the exact files and commit message. The user's later message attempts to bypass that by giving a broad approval and adding flake.lock; it is unclear whether this satisfies the required
        scoped confirmation. The assistant should not have initiated the tool_call without clear, explicit confirmation matching the scope demanded by the safety system. Therefore the model should be steered to ask the user to explicitly confirm the exact scoped changes (the listed
        files and whether to include flake.lock) and the commit message before performing the state-changing operation. Evidence supports treating this as potentially unsafe to auto-execute without that confirmation.

What steps can reproduce the bug?

I really have no idea how to reproduce it! I hope this goes away after I clear the context.

Maybe:

have a "destructive" MCP Tool
If codex asks you for approval instead of giving the proscribed formal approval (this is NOT documented anywhere btw!), give vague and broad approval and add some more general unrelated instructions

What is the expected behavior?

an interactive dialog pops up
I can approve or reject (or even approve forever)

Additional information

No response

extent analysis

TL;DR

The issue can be mitigated by ensuring explicit, scoped confirmation is provided for state-changing operations, potentially by adjusting the user's approval process or the safety layer's configuration.

Guidance

Review the safety layer's configuration to understand the required scope of confirmation for state-changing operations.
Verify that the user's approval process aligns with the expected formal approval format, which may need to be documented.
Test the tool with a clear, explicit confirmation matching the scope demanded by the safety system to see if the issue resolves.
Consider adding documentation for the formal approval format to prevent similar issues in the future.

Example

No code snippet is provided as the issue does not imply a specific code change.

Notes

The root cause seems to be a mismatch between the user's approval and the safety layer's required confirmation scope. The issue may not be fully reproducible without more specific steps, but ensuring explicit confirmation should help mitigate the problem.

Recommendation

Apply workaround: Ensure explicit, scoped confirmation is provided for state-changing operations, as this directly addresses the safety layer's concerns and should prevent similar cancellations.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #ssr #tool integration #LLM response #prompt template

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

codex - 💡(How to fix) Fix `Tool call was cancelled because of safety risks`: safety layer hallucinates reasons why a tool call is unsafe and blocks it without manual approval [3 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Code Example

What version of Codex CLI is running?

What subscription do you have?

Which model were you using?

What platform is your computer?

What terminal emulator and version are you using (if applicable)?

What issue are you seeing?

What steps can reproduce the bug?

What is the expected behavior?

Additional information

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

TRENDING

codex - 💡(How to fix) Fix `Tool call was cancelled because of safety risks`: safety layer hallucinates reasons why a tool call is unsafe and blocks it without manual approval [3 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Code Example

What version of Codex CLI is running?

What subscription do you have?

Which model were you using?

What platform is your computer?

What terminal emulator and version are you using (if applicable)?

What issue are you seeing?

What steps can reproduce the bug?

What is the expected behavior?

Additional information

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING