transformers: [deepseekv4] Does Transformers provide a weight conversion script to convert the Hugging Face weights into a format that can be read by Transformers from_pretrained?


Feature request

For [deepseekv4], the weight names in the Hugging Face DeepSeek-V4-Flash checkpoint do not seem to match the weight names that Transformers expects. Does Transformers provide a weight conversion script that converts the Hugging Face weights into a format that Transformers from_pretrained can read?
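To make the mismatch concrete, here is a minimal sketch of how the two sets of names could be compared and, if needed, remapped. It is not an official Transformers conversion script. It assumes the checkpoint ships a sharded safetensors index file (a JSON file with a `weight_map` object); the helper names `checkpoint_keys`, `diff_keys`, and `rename_keys` are hypothetical, and any rename rules have to be worked out from the real checkpoint.

```python
import json
import re


def checkpoint_keys(index_path):
    """Return the tensor names listed in a sharded safetensors index file."""
    with open(index_path, encoding="utf-8") as f:
        index = json.load(f)
    # "weight_map" maps each tensor name to the shard file that stores it.
    return set(index["weight_map"])


def diff_keys(checkpoint, expected):
    """Return (names only in the checkpoint, names only in the model)."""
    checkpoint, expected = set(checkpoint), set(expected)
    return sorted(checkpoint - expected), sorted(expected - checkpoint)


def rename_keys(keys, rules):
    """Apply (pattern, replacement) regex rules, in order, to every key.

    The rules are an assumption supplied by the caller; none are built in.
    Returns a dict mapping each original name to its renamed form.
    """
    renamed = {}
    for key in keys:
        new_key = key
        for pattern, replacement in rules:
            new_key = re.sub(pattern, replacement, new_key)
        renamed[key] = new_key
    return renamed
```

The expected names would typically come from `model.state_dict().keys()` on a model built from the config. A rule such as `(r"^layers\.", "model.layers.")` is purely illustrative; the actual mapping for DeepSeek-V4-Flash is not given in this issue.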

<img width="3098" height="1744" alt="Image" src="https://github.com/user-attachments/assets/63eed220-9a7f-48dd-83dc-328b7b1ea22c" /> <img width="1792" height="524" alt="Image" src="https://github.com/user-attachments/assets/9d477055-40ad-4517-bef0-b5bdc5bba08f" />

Motivation

Does Transformers provide a weight conversion script to convert the Hugging Face weights into a format that can be read by Transformers from_pretrained?

Your contribution

If not, can we submit a PR?
