openclaw - 💡(How to fix) Fix OpenClaw Codex runtime silently injects coding-agent base prompt into operational agents [2 comments, 2 participants]

openclaw2026-05-20 01:52:28

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

openclaw/openclaw#84393•Fetched 2026-05-20 03:40:58

View on GitHub

Comments

Participants

Timeline

Reactions

Author

cygnostik

Participants

clawsweeper[bot]

cygnostik

Timeline (top)

labeled ×10commented ×2cross-referenced ×1

OpenClaw can route normal operational agents through the @openclaw/codex app-server harness via agentRuntime.id = "codex". In this configuration, the active agent receives Codex coding-agent base instructions such as You are Codex, a coding agent based on GPT-5... even when the user is interacting with an OpenClaw operational agent in a non-repo workspace.

This appears to be an unintended and invasive prompt/runtime contamination path. It silently changes the agent's behavior profile and can cause repo/code-workflow assumptions to leak into operational direct-chat work.

Root Cause

Repo/worktree assumptions leak into non-repo operational workspace activity.
Agent may perform inappropriate Git verification or code-harness behaviors.
Debugging is difficult because the prompt source is stored in native Codex rollout logs, not surfaced clearly in normal OpenClaw context.
Multi-agent OpenClaw deployments can unknowingly inherit the same contamination wherever model mappings route to agentRuntime.id = "codex".

RAW_BUFFERClick to expand / collapse

Summary

Environment

OpenClaw: 2026.5.19-beta.1
Codex plugin: @openclaw/[email protected]
Codex CLI dependency: @openai/[email protected]
Agent: Argus
Workspace: /home/argus/.openclaw/workspace
Per-agent Codex home: /home/argus/.openclaw/agents/argus/agent/codex-home
Model involved: gpt-5.5
Approval policy in turn context: never
Sandbox policy in turn context: danger-full-access

Evidence

Native Codex rollout JSONL under per-agent codex-home contains:

type = "session_meta"
payload.originator = "openclaw"
payload.cwd = "/home/argus/.openclaw/workspace"
payload.base_instructions.text starts with You are Codex, a coding agent based on GPT-5...
base instructions include repo/Git/worktree guidance

OpenClaw config contains:

plugins.entries.codex.enabled = true
OPENCLAW_DEFAULT_MODEL = openai-codex/gpt-5.5
multiple agentRuntime.id = "codex" mappings under agent defaults and agent-specific models

Expected Behavior

Operational OpenClaw agents should not receive hidden Codex coding-agent base prompts unless the user explicitly opts into a Codex coding session. Runtime selection should be visible, auditable, and gated by context class.

Actual Behavior

Normal direct-chat operational turns can run under Codex app-server and receive Codex coding-agent instructions. The user-visible conversation does not clearly show that this behavioral layer has been injected.

Impact

Repo/worktree assumptions leak into non-repo operational workspace activity.
Agent may perform inappropriate Git verification or code-harness behaviors.
Debugging is difficult because the prompt source is stored in native Codex rollout logs, not surfaced clearly in normal OpenClaw context.
Multi-agent OpenClaw deployments can unknowingly inherit the same contamination wherever model mappings route to agentRuntime.id = "codex".

Requested Fixes

Do not use Codex app-server as the default runtime for operational agents.
Require explicit opt-in for Codex runtime per session/task.
Surface active harness/base-prompt identity in session diagnostics.
Add a hard config warning when an operational workspace is routed through Codex.
Add a runtime preflight that blocks Git verification commands in non-Git workspaces.
Provide a quarantine/disable command that removes Codex runtime mappings from all agents, not only plugins.entries.codex.enabled.
Add regression tests proving that OpenClaw direct-chat agents do not receive Codex coding-agent base instructions unless explicitly requested.

Proposed Security Classification

Treat as a prompt-boundary and runtime-confusion vulnerability. It is not necessarily malicious dependency behavior, but it creates a hostile-equivalent failure mode because hidden instructions override the expected operational-agent contract.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#conversation history #tool integration #LLM response #prompt template #agent execution

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

openclaw - 💡(How to fix) Fix OpenClaw Codex runtime silently injects coding-agent base prompt into operational agents [2 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Summary

Environment

Evidence

Expected Behavior

Actual Behavior

Impact

Requested Fixes

Proposed Security Classification

Still need to ship something?

TRENDING

openclaw - 💡(How to fix) Fix OpenClaw Codex runtime silently injects coding-agent base prompt into operational agents [2 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Summary

Environment

Evidence

Expected Behavior

Actual Behavior

Impact

Requested Fixes

Proposed Security Classification

Still need to ship something?

RELATED_DISCOVERY

TRENDING