pytorch - ✅(Solved) Fix fix: data race on global `cached_tensorimpls_enabled` flag causes undefined behavior [1 pull requests, 1 participants]

pytorch2026-04-27 18:02:16

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

pytorch/pytorch#181619•Fetched 2026-04-28 06:24:28

View on GitHub

Comments

Participants

Timeline

Reactions

Author

Nam0101

Participants

Nam0101

Timeline (top)

labeled ×5mentioned ×4subscribed ×4cross-referenced ×1

cached_tensorimpls_enabled is a non-atomic global bool read from multiple threads (is_cached_tensor, add_cached_tensor, remove_cached_tensor) and written in set_cached_tensors_enabled without synchronization. This is a C++ data race (UB), which can manifest as intermittent crashes or incorrect cache-state decisions under concurrency. In practice this can also skew adjusted_use_count, affecting uniqueness/inplacing behavior.

Severity: high File: aten/src/ATen/CachedTensorUtils.cpp

Root Cause

Severity: high File: aten/src/ATen/CachedTensorUtils.cpp

Fix Action

Fixed

Fixed by PR: Quality: Data race on global cached_tensorimpls_enabled flag causes undefined behavior (https://github.com/pytorch/pytorch/pull/181620)

PR fix notes

PR #181620: Quality: Data race on global `cached_tensorimpls_enabled` flag causes undefined behavior

Repository: pytorch/pytorch
Author: Nam0101
State: open | merged: False
Link: https://github.com/pytorch/pytorch/pull/181620

Description (problem / solution / changelog)

✨ Code Quality

Problem

Severity: high File: aten/src/ATen/CachedTensorUtils.cpp

Solution

Make the flag atomic (or guard all reads/writes with the same mutex). A minimal fix is: static std::atomic<bool> cached_tensorimpls_enabled{false}; then use load(std::memory_order_acquire) in readers and store(enabled, std::memory_order_release) in set_cached_tensors_enabled. If you need strict consistency with the map state, also take cached_tensorimpl_mutex in set_cached_tensors_enabled (and optionally clear the map when disabling).

Changes

aten/src/ATen/CachedTensorUtils.cpp (modified)

Testing

Existing tests pass
Manual review completed
No new warnings/errors introduced

<details> <summary>🤖 About this PR</summary>

This pull request was generated by ContribAI, an AI agent that helps improve open source projects. The change was:

Discovered by automated code analysis
Generated by AI with context-aware code generation
Self-reviewed by AI quality checks

If you have questions or feedback about this PR, please comment below. We appreciate your time reviewing this contribution!

</details>

Closes #181619

Changed files

aten/src/ATen/CachedTensorUtils.cpp (modified, +6/-4)

RAW_BUFFERClick to expand / collapse

Description

Severity: high File: aten/src/ATen/CachedTensorUtils.cpp

Expected Behavior

The code should handle this case properly to avoid unexpected errors or degraded quality.

cc @albanD @pragupta

extent analysis

TL;DR

The most likely fix involves synchronizing access to the cached_tensorimpls_enabled global variable to prevent data races.

Guidance

Identify all read and write operations on cached_tensorimpls_enabled and ensure they are properly synchronized using atomic operations or mutexes.
Review the set_cached_tensors_enabled, is_cached_tensor, add_cached_tensor, and remove_cached_tensor functions to understand how they interact with cached_tensorimpls_enabled.
Consider using a thread-safe atomic boolean type for cached_tensorimpls_enabled to simplify synchronization.
Verify that the synchronization mechanism is correctly implemented by testing the code under concurrent access scenarios.

Example

std::atomic<bool> cached_tensorimpls_enabled = false;

Notes

The exact synchronization approach may depend on the specific requirements and constraints of the codebase.
Additional testing and verification may be necessary to ensure that the synchronization fix does not introduce new issues.

Recommendation

Apply workaround: Use atomic operations or mutexes to synchronize access to cached_tensorimpls_enabled, as this directly addresses the identified data race issue.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#tokenizer error #prompt formatting #chain error #conversation history #tool integration

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

pytorch - ✅(Solved) Fix fix: data race on global `cached_tensorimpls_enabled` flag causes undefined behavior [1 pull requests, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Fixed

PR fix notes

PR #181620: Quality: Data race on global `cached_tensorimpls_enabled` flag causes undefined behavior

Description (problem / solution / changelog)

✨ Code Quality

Problem

Solution

Changes

Testing

Changed files

Description

Expected Behavior

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

TRENDING

pytorch - ✅(Solved) Fix fix: data race on global `cached_tensorimpls_enabled` flag causes undefined behavior [1 pull requests, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Fixed

PR fix notes

PR #181620: Quality: Data race on global cached_tensorimpls_enabled flag causes undefined behavior

Description (problem / solution / changelog)

✨ Code Quality

Problem

Solution

Changes

Testing

Changed files

Description

Expected Behavior

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING

PR #181620: Quality: Data race on global `cached_tensorimpls_enabled` flag causes undefined behavior