codex - 💡(How to fix) Fix `/feedback` is recommended after a false-positive safety block, but sending `/feedback` is blocked too [1 comments, 2 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
openai/codex#20497Fetched 2026-05-01 05:42:30
View on GitHub
Comments
1
Participants
2
Timeline
4
Reactions
0
Author
Timeline (top)
labeled ×3commented ×1

When a Codex/ChatGPT conversation is flagged with the “possible cybersecurity risk” banner, the UI tells the user to submit /feedback if the block seems incorrect. In practice, sending a /feedback ... message in that same conversation can be treated as normal chat content and blocked by the same classifier. This leaves the user with no working in-product path to report a false positive from the blocked conversation.

Root Cause

False positives are expected, but the current UX makes them hard to report:

  • the user is explicitly told to use /feedback,
  • the suggested command does not work in the blocked context,
  • there is no obvious alternative path from the same UI,
  • this creates a loop where the user cannot explain that the classification is incorrect.
RAW_BUFFERClick to expand / collapse

Summary

When a Codex/ChatGPT conversation is flagged with the “possible cybersecurity risk” banner, the UI tells the user to submit /feedback if the block seems incorrect. In practice, sending a /feedback ... message in that same conversation can be treated as normal chat content and blocked by the same classifier. This leaves the user with no working in-product path to report a false positive from the blocked conversation.

Environment

  • Product: Codex / ChatGPT conversation UI
  • Platform: macOS desktop browser/app context
  • Date observed: 2026-04-30
  • Screenshot evidence is available if maintainers need it; this was observed directly in the blocked conversation UI.

Steps To Reproduce

  1. In a Codex/ChatGPT conversation, trigger a false-positive block that shows the banner:
    • “This content was flagged for possible cybersecurity risk...”
    • The Polish UI says: “Jeśli to wydaje się nieodpowiednie, spróbuj inaczej sformułować prośbę lub wyślij /feedback.”
  2. In the same conversation, send a feedback message such as:
    • /feedback oracle i smoke to nie powinny być słowa problemy
  3. Observe that the feedback message is not accepted as a reporting action and the conversation remains blocked with the same warning state.

Expected Behavior

One of these should happen:

  • /feedback should open a dedicated feedback/report flow that bypasses the blocked chat response path, or
  • the warning banner should link to a separate report UI, or
  • the banner should not recommend /feedback unless that command is actually supported in the current blocked state.

Actual Behavior

The UI recommends /feedback, but sending /feedback ... in the blocked conversation appears to go through the same normal-message path and is blocked. The suggested recovery/reporting mechanism is therefore not usable.

Why This Matters

False positives are expected, but the current UX makes them hard to report:

  • the user is explicitly told to use /feedback,
  • the suggested command does not work in the blocked context,
  • there is no obvious alternative path from the same UI,
  • this creates a loop where the user cannot explain that the classification is incorrect.

Suggested Fix

Treat /feedback as a client-side command before normal model/safety routing, especially when the current conversation is already blocked. At minimum, the blocked banner should expose a direct feedback button/link that does not depend on sending another chat message.

Notes

The false positive in this case was about ordinary development/tooling terms in a discussion about a local Oracle CLI/Codex skill workflow. Terms like oracle and smoke were part of normal software testing language, not a request for harmful behavior.

extent analysis

TL;DR

The issue can be resolved by treating /feedback as a special client-side command that bypasses the normal chat message routing when a conversation is blocked due to a possible cybersecurity risk.

Guidance

  • Identify and modify the client-side logic to recognize /feedback as a special command in blocked conversations, allowing it to bypass the normal model/safety routing.
  • Consider adding a direct feedback button/link to the blocked banner that does not rely on sending another chat message, providing an alternative path for users to report false positives.
  • Verify that the modified logic correctly handles /feedback commands in blocked conversations, ensuring they are not treated as normal chat content and blocked by the classifier.
  • Test the changes with various scenarios, including different types of false positives and conversation contexts, to ensure the fix is robust and effective.

Example

No code snippet is provided as the issue does not include specific code details, but the solution would involve modifying the client-side logic to handle /feedback commands differently in blocked conversations.

Notes

The suggested fix assumes that the issue is primarily with the client-side handling of /feedback commands in blocked conversations. If the problem lies elsewhere, such as in the server-side routing or classifier logic, additional modifications may be necessary.

Recommendation

Apply the workaround by treating /feedback as a special client-side command in blocked conversations, as this approach directly addresses the reported issue and provides a clear path for users to report false positives.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING