transformers - 💡(How to fix) Fix [BUG] HF Hub is DOWN on AWS in Xet mode [1 participants]

Official PRs (…)
ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

Helpful · Quick feedback

Loading…
GitHub stats
huggingface/transformers#45797Fetched 2026-05-06 06:14:42
View on GitHub
Comments
0
Participants
1
Timeline
5
Reactions
0
Participants
Timeline (top)
mentioned ×2subscribed ×2labeled ×1

Downloading Qwen/Qwen3.6-35B-A3B on a p5en.48xlarge in us-east-2 on AWS hangs partway through when using the default Hugging Face download path. Clearing ~/.cache does not fix it.

The evidence suggests the hang is in the Hugging Face Xet-backed download path (hf-xet / CAS), not local disk, cache corruption, or general connectivity. Disabling Xet with HF_HUB_DISABLE_XET=1 works around the issue.

Error Message

Retry on 403 (Forbidden) enabled): "s3::get_range" api call failed ... HTTP status client error (403 Forbidden) for url (https://us.aws.cdn.hf.co/xorbs/...) The model download should either complete successfully or fail with a clear error. It should not stall indefinitely without surfacing the repeated 403 Forbidden failures to the CLI.

Root Cause

Downloading Qwen/Qwen3.6-35B-A3B on a p5en.48xlarge in us-east-2 on AWS hangs partway through when using the default Hugging Face download path. Clearing ~/.cache does not fix it.

The evidence suggests the hang is in the Hugging Face Xet-backed download path (hf-xet / CAS), not local disk, cache corruption, or general connectivity. Disabling Xet with HF_HUB_DISABLE_XET=1 works around the issue.

Fix Action

Workaround

Disabling Xet allows the download path to work:

HF_HUB_DISABLE_XET=1 uv run hf download Qwen/Qwen3.6-35B-A3B

A smaller shard test completed successfully with Xet disabled:

HF_HUB_DISABLE_XET=1 uv run hf download Qwen/Qwen3.6-35B-A3B \
  --include model-00007-of-00026.safetensors \
  --local-dir /tmp/qwen-no-xet-test

That downloaded a 1.1G safetensors shard successfully in a few seconds.

Code Example

uv run hf download Qwen/Qwen3.6-35B-A3B

---

uv run hf download Qwen/Qwen3.6-35B-A3B --dry-run

---

[dry-run] Will download 40 files (out of 40) totalling 71.9G.

---

Received CAS response ... status_code=403
Refreshing expired retrieval URLs
Retry on 403 (Forbidden) enabled): "s3::get_range" api call failed ... HTTP status client error (403 Forbidden) for url (https://us.aws.cdn.hf.co/xorbs/...)
Retrieval URLs refreshed successfully

---

HF_HUB_DISABLE_XET=1 uv run hf download Qwen/Qwen3.6-35B-A3B

---

HF_HUB_DISABLE_XET=1 uv run hf download Qwen/Qwen3.6-35B-A3B \
  --include model-00007-of-00026.safetensors \
  --local-dir /tmp/qwen-no-xet-test

---

OS: Linux 6.8.0-1050-aws
Python: 3.12.13
huggingface_hub: 1.13.0
hf-xet: 1.5.0rc0
requests: 2.34.0.dev1
urllib3: 2.6.3
CLI: hf 1.13.0

---

Filesystem: /dev/root
Size: 1.5T
Available: ~1.5T
Use: 3%
Inodes used: 1%

---

Warning: You are sending unauthenticated requests to the HF Hub. Please set a HF_TOKEN to enable higher rate limits and faster downloads.

---

uv run hf download Qwen/Qwen3.6-35B-A3B
RAW_BUFFERClick to expand / collapse

System Info

Xet-backed model download hangs midway for Qwen/Qwen3.6-35B-A3B

Summary

Downloading Qwen/Qwen3.6-35B-A3B on a p5en.48xlarge in us-east-2 on AWS hangs partway through when using the default Hugging Face download path. Clearing ~/.cache does not fix it.

The evidence suggests the hang is in the Hugging Face Xet-backed download path (hf-xet / CAS), not local disk, cache corruption, or general connectivity. Disabling Xet with HF_HUB_DISABLE_XET=1 works around the issue.

Reproduction

uv run hf download Qwen/Qwen3.6-35B-A3B

The dry run succeeds immediately:

uv run hf download Qwen/Qwen3.6-35B-A3B --dry-run

Output:

[dry-run] Will download 40 files (out of 40) totalling 71.9G.

Observed Behavior

The real download starts, writes several GB to the Hugging Face cache, and then stalls indefinitely with no visible progress.

In one reproduction:

  • The cache grew to about 4.6G.
  • The Python process stayed alive.
  • CPU usage continued, but disk writes stopped.
  • TCP sockets to Hugging Face / Xet endpoints stayed open but idle.
  • strace showed the process mostly waking on timers, with no meaningful network reads/writes.
  • The Xet log showed repeated 403 Forbidden responses from signed us.aws.cdn.hf.co/xorbs/... range URLs, followed by retrieval URL refresh/retry loops.

Representative Xet log lines:

Received CAS response ... status_code=403
Refreshing expired retrieval URLs
Retry on 403 (Forbidden) enabled): "s3::get_range" api call failed ... HTTP status client error (403 Forbidden) for url (https://us.aws.cdn.hf.co/xorbs/...)
Retrieval URLs refreshed successfully

Expected Behavior

The model download should either complete successfully or fail with a clear error. It should not stall indefinitely without surfacing the repeated 403 Forbidden failures to the CLI.

Workaround

Disabling Xet allows the download path to work:

HF_HUB_DISABLE_XET=1 uv run hf download Qwen/Qwen3.6-35B-A3B

A smaller shard test completed successfully with Xet disabled:

HF_HUB_DISABLE_XET=1 uv run hf download Qwen/Qwen3.6-35B-A3B \
  --include model-00007-of-00026.safetensors \
  --local-dir /tmp/qwen-no-xet-test

That downloaded a 1.1G safetensors shard successfully in a few seconds.

Environment

All tests run on a p5en.48xlarge instance in us-east-2 on AWS.

OS: Linux 6.8.0-1050-aws
Python: 3.12.13
huggingface_hub: 1.13.0
hf-xet: 1.5.0rc0
requests: 2.34.0.dev1
urllib3: 2.6.3
CLI: hf 1.13.0

No relevant HF_*, HUGGINGFACE_*, HTTP_PROXY, or HTTPS_PROXY environment variables were set during the test.

Disk space was not the issue:

Filesystem: /dev/root
Size: 1.5T
Available: ~1.5T
Use: 3%
Inodes used: 1%

Additional Context

The CLI warns that the request is unauthenticated:

Warning: You are sending unauthenticated requests to the HF Hub. Please set a HF_TOKEN to enable higher rate limits and faster downloads.

UPDATE: logging in did not fix it, this was not the issue.

However, the observed behavior looks like a Xet/CAS transfer stall rather than ordinary rate limiting:

  • Metadata/dry-run requests succeed quickly.
  • Downloads proceed partially before stalling.
  • The Xet backend logs repeated 403 Forbidden responses from signed range URLs.
  • Disabling Xet makes the same repository downloadable.

This may belong in huggingface_hub or xet-core rather than transformers, but it affects model loading/downloading workflows that Transformers users will hit by default.

Who can help?

@ArthurZucker @SunMarc

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

uv run hf download Qwen/Qwen3.6-35B-A3B

on a p5en.48xlarge instance in us-east-2 on AWS.

Expected behavior

Xet download works in us-east-2.

extent analysis

TL;DR

Disabling Xet by setting HF_HUB_DISABLE_XET=1 allows the model download to work, suggesting a potential issue with the Xet-backed download path.

Guidance

  • The repeated 403 Forbidden responses from signed range URLs in the Xet log indicate a possible authentication or permission issue with the Xet backend.
  • Disabling Xet allows the download to complete successfully, which suggests that the issue is specific to the Xet-backed download path.
  • To mitigate the issue, set HF_HUB_DISABLE_XET=1 before running the download command, as shown in the provided workaround.
  • Verify that the download completes successfully after disabling Xet and investigate further to determine the root cause of the Xet-backed download issue.

Example

HF_HUB_DISABLE_XET=1 uv run hf download Qwen/Qwen3.6-35B-A3B

Notes

The issue may be related to the Xet backend or the CAS transfer process, and further investigation is needed to determine the root cause. Disabling Xet is a temporary workaround, and a more permanent solution may require updates to the Xet backend or the Hugging Face download process.

Recommendation

Apply the workaround by setting HF_HUB_DISABLE_XET=1 to allow the model download to complete successfully, as this is a reliable and tested solution.

Vote matrix · Quick signals

Works
Did the solution work? Tap to confirm.
Easy Fix
Was it a quick fix?
Time Saver
Did it save you time?
Blocking
Was it severely blocking?
Common Issue
Are others likely hitting this too?
Flaky / Intermittent
Is it intermittent?
Verified / Reproducible
Can you reproduce it reliably?
Loading…

FAQ

Expected behavior

Xet download works in us-east-2.

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Back to top recommendations

TRENDING