Codex should allow skill authors or users to declare a skill-scoped model/effort override, and should apply it only while that skill is executing. Important constraints: - The override should be opt-in per skill, not global. - It should not permanently mutate the main session's model or effort. - The user should be able to inspect or disable these overrides in config. - The UI/status/debug output should make the temporary override visible enough to avoid confusion. - If a requested skill-scoped effort is unsupported by the active model/provider/tools, Codex should fail clearly or fall back according to a documented policy.

codex - 💡(How to fix) Fix Support skill-scoped temporary model and reasoning effort overrides

Root Cause

Some skills produce simple or static output. For example, a help skill may only need to render a deterministic block of terminal text. Running that through the same high-effort model setting used for planning, debugging, or implementation work adds latency and cost without improving output quality.

In a local experiment with a static $cat:help response, the current inline skill path using gpt-5.3-codex at low took around 40 seconds per first response. Flattening the skill body to avoid extra loading reduced that to about 26 seconds, which helps, but the underlying issue remains: the user must choose one reasoning effort for the whole main agent, even when a skill's work is much simpler than the rest of the session.

Temporarily lowering effort for a simple/static skill would let users keep high effort for the main conversation while avoiding unnecessary cost and latency for predictable skill output.

What feature would you like to see?

Please add a way for a Codex skill to temporarily override the model and/or reasoning effort used while executing that skill, without changing the model/effort for the rest of the user's main-agent interaction.

A possible shape would be skill frontmatter or skill config metadata such as:

---
name: help
model: gpt-5.3-codex
reasoning_effort: low
# or: reasoning_effort: none
---

or a config-based equivalent:

[[skills.config]]
name = "cat:help"
model = "gpt-5.3-codex"
reasoning_effort = "low"

The intended behavior is scoped and temporary:

User is in a main Codex session using their preferred model/effort, for example gpt-5.5 with xhigh.
User invokes a simple/static skill, for example $cat:help.
Codex executes just that skill's response generation at a cheaper/faster model or effort.
After the skill response is produced, the main session returns to the original model/effort automatically.

Why this matters

Temporarily lowering effort for a simple/static skill would let users keep high effort for the main conversation while avoiding unnecessary cost and latency for predictable skill output.

Expected behavior

Codex should allow skill authors or users to declare a skill-scoped model/effort override, and should apply it only while that skill is executing.

Important constraints:

The override should be opt-in per skill, not global.
It should not permanently mutate the main session's model or effort.
The user should be able to inspect or disable these overrides in config.
The UI/status/debug output should make the temporary override visible enough to avoid confusion.
If a requested skill-scoped effort is unsupported by the active model/provider/tools, Codex should fail clearly or fall back according to a documented policy.

Alternatives considered

Start the whole session with lower effort: not acceptable when the rest of the user's work needs higher reasoning.
Have the skill launch a nested codex exec with its own effort: works in principle, but adds tool/handoff overhead, complicates sandboxing, and is often slower for simple output.
Use subagents with different model settings: useful for delegated work, but overkill for simple/static skill rendering and not the same as temporarily changing the main skill execution context.
Optimize skill instructions only: helpful, but does not solve the broader mismatch between high-effort main-agent work and low-effort/static skill output.

Related issues

This is related to, but narrower than, subagent model/effort configuration requests such as #11795 and #11701. Those focus on separate agents. This request is specifically about a skill-scoped temporary override for the active main-agent interaction.

FAQ

Expected behavior

Codex should allow skill authors or users to declare a skill-scoped model/effort override, and should apply it only while that skill is executing.

Important constraints:

The override should be opt-in per skill, not global.
It should not permanently mutate the main session's model or effort.
The user should be able to inspect or disable these overrides in config.
The UI/status/debug output should make the temporary override visible enough to avoid confusion.
If a requested skill-scoped effort is unsupported by the active model/provider/tools, Codex should fail clearly or fall back according to a documented policy.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

codex - 💡(How to fix) Fix Support skill-scoped temporary model and reasoning effort overrides

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Code Example

What feature would you like to see?

Why this matters

Expected behavior

Alternatives considered

Related issues

FAQ

Expected behavior

Still need to ship something?

TRENDING

codex - 💡(How to fix) Fix Support skill-scoped temporary model and reasoning effort overrides

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Code Example

What feature would you like to see?

Why this matters

Expected behavior

Alternatives considered

Related issues

FAQ

Expected behavior

Still need to ship something?

RELATED_DISCOVERY

TRENDING