codex - 💡(How to fix) Fix Codex Desktop voice dictation intermittently outputs Simplified Chinese for Traditional Chinese users [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
openai/codex#20003Fetched 2026-04-29 06:23:53
View on GitHub
Comments
0
Participants
1
Timeline
3
Reactions
0
Author
Participants
Timeline (top)
labeled ×3

Error Message

  • if Simplified Chinese appears in the input, it should be treated as a voice transcription error

Root Cause

For Traditional Chinese users, this is not a minor formatting preference. Simplified and Traditional Chinese are different written standards, and automatic post-hoc conversion is not always a safe 1:1 mapping. Some Simplified characters map to multiple possible Traditional characters depending on context.

This also breaks workflows that rely on strict project instructions such as:

  • all replies must be in Traditional Chinese
  • all generated comments and documentation must use Taiwan terminology
  • if Simplified Chinese appears in the input, it should be treated as a voice transcription error

Those instructions only help after the agent receives the text. They do not fix the composer transcript itself.

Code Example

codex-cli 0.125.0

---

macOS
codex-cli 0.125.0
RAW_BUFFERClick to expand / collapse

What variant of Codex are you using?

Codex Desktop App on macOS.

Related local CLI version, for reference:

codex-cli 0.125.0

What issue are you seeing?

Codex Desktop built-in voice dictation frequently transcribes spoken Traditional Chinese / Taiwan Mandarin into Simplified Chinese.

This makes the voice input workflow unusable for Traditional Chinese users, especially in projects or workspaces where all communication, code comments, documentation, and user instructions must remain in Traditional Chinese.

The important point is that this happens before the message is submitted to the agent. System, developer, or AGENTS.md instructions can force the agent's reply to use Traditional Chinese, but they cannot reliably fix the raw voice transcript already inserted into the composer.

Steps to reproduce

  1. Open Codex Desktop App on macOS.
  2. Use the built-in voice input / dictation feature in the composer.
  3. Speak Mandarin intended to be written in Traditional Chinese, for example Taiwan usage.
  4. Observe the text inserted into the composer.

Expected behavior

Codex should provide a way to force voice dictation output to Traditional Chinese, ideally Taiwan Traditional Chinese:

  • zh-Hant
  • zh-Hant-TW
  • Traditional Chinese (Taiwan)

When the user has configured Traditional Chinese in system/developer/project instructions, Codex should not insert Simplified Chinese into the composer for Chinese voice dictation.

Actual behavior

The transcript intermittently uses Simplified Chinese. In practice this happens often enough that the built-in voice input is not usable for Traditional Chinese workflows.

Examples of undesired output include Simplified Chinese terms or characters being inserted into the composer even though the workspace instructions require Traditional Chinese.

Why this matters

For Traditional Chinese users, this is not a minor formatting preference. Simplified and Traditional Chinese are different written standards, and automatic post-hoc conversion is not always a safe 1:1 mapping. Some Simplified characters map to multiple possible Traditional characters depending on context.

This also breaks workflows that rely on strict project instructions such as:

  • all replies must be in Traditional Chinese
  • all generated comments and documentation must use Taiwan terminology
  • if Simplified Chinese appears in the input, it should be treated as a voice transcription error

Those instructions only help after the agent receives the text. They do not fix the composer transcript itself.

Requested fix

Please add one or more of the following:

  1. A voice dictation language/script setting for Chinese, including zh-Hant-TW.
  2. A transcription prompt or style hint specifically for Codex Desktop dictation.
  3. A post-transcription normalization layer before inserting text into the composer.
  4. An option to automatically normalize Chinese voice transcripts to Traditional Chinese (Taiwan), similar to OpenCC s2twp behavior.

Related context

This appears related to the general limitation where Chinese transcription may use a single zh language bucket and produce either Simplified or Traditional output depending on model behavior. However, Codex Desktop currently does not expose a practical control to users who require Traditional Chinese.

Related issues:

  • #12883 requested configurable voice transcription instructions.
  • #13714 requested post-transcription cleanup for voice-dictated drafts.

This issue is narrower: Traditional Chinese users need reliable script control for Codex Desktop voice dictation.

Additional information

Local environment:

macOS
codex-cli 0.125.0

I can provide repeated examples or screenshots if needed.

extent analysis

TL;DR

The most likely fix is to add a voice dictation language/script setting for Chinese, including zh-Hant-TW, to ensure Traditional Chinese output.

Guidance

  • Investigate the feasibility of implementing a transcription prompt or style hint specifically for Codex Desktop dictation to force Traditional Chinese output.
  • Consider adding a post-transcription normalization layer before inserting text into the composer to convert Simplified Chinese to Traditional Chinese.
  • Review related issues #12883 and #13714 to determine if their requested features can be adapted to address this specific problem.
  • Explore the possibility of integrating OpenCC s2twp behavior to automatically normalize Chinese voice transcripts to Traditional Chinese (Taiwan).

Example

No code snippet is provided as the issue does not imply a specific code-based solution.

Notes

The solution may require changes to the Codex Desktop App's voice dictation feature and potentially its integration with the codex-cli. The feasibility of these changes depends on the app's architecture and the codex-cli version.

Recommendation

Apply a workaround by exploring the implementation of a post-transcription normalization layer or integrating OpenCC s2twp behavior, as these solutions can potentially provide the required script control for Traditional Chinese users.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

FAQ

Expected behavior

Codex should provide a way to force voice dictation output to Traditional Chinese, ideally Taiwan Traditional Chinese:

  • zh-Hant
  • zh-Hant-TW
  • Traditional Chinese (Taiwan)

When the user has configured Traditional Chinese in system/developer/project instructions, Codex should not insert Simplified Chinese into the composer for Chinese voice dictation.

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING

codex - 💡(How to fix) Fix Codex Desktop voice dictation intermittently outputs Simplified Chinese for Traditional Chinese users [1 participants]