transformers - 💡(How to fix) Fix The torch.split() return values in GlmMoeDsaIndexer [1 comments, 1 participants]

transformers2026-02-24 16:32:23

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

huggingface/transformers#44263•Fetched 2026-04-08 00:29:29

View on GitHub

Comments

Participants

Timeline

Reactions

Author

Jintao-Huang

Participants

Jintao-Huang

Timeline (top)

closed ×1commented ×1cross-referenced ×1labeled ×1

RAW_BUFFERClick to expand / collapse

System Info

transformers:

https://github.com/huggingface/transformers/blob/e2bc54f29a58b2d2ee7e7d6eac949c959e063e0f/src/transformers/models/glm_moe_dsa/modular_glm_moe_dsa.py#L515

vllm:

https://github.com/vllm-project/vllm/blob/a0c70816956298f7dd1d0cf47cfa1a169a413692/vllm/model_executor/models/deepseek_v2.py#L746

deepseek_v3.2

https://github.com/deepseek-ai/DeepSeek-V3.2-Exp/blob/main/inference/model.py#L462

Who can help?

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
My own task or dataset (give details below)

Reproduction

Expected behavior

extent analysis

Problem Summary

Fixing the issue with DeepSeek-V3.2 model inference

Root Cause Analysis

The issue is likely related to the model architecture or configuration.

Fix Plan

Update Model Configuration

Check the model configuration file (model.py) for any potential issues.
Ensure that the model architecture is correctly defined and matches the expected behavior.
Verify that the input and output shapes are correctly configured.

Update Model Code

Update the inference function in model.py to correctly handle input data.
Use the transformers library to load the pre-trained model and fine-tune it if necessary.
Use the vllm library to load the VLLM model and integrate it with the DeepSeek-V3.2 model.

Example Code

import torch
from transformers import GLMForSequenceClassification
from vllm import VLLM

class DeepSeekV3_2Model(torch.nn.Module):
    def __init__(self):
        super(DeepSeekV3_2Model, self).__init__()
        self.glm = GLMForSequenceClassification.from_pretrained('glm-moe-dsa')
        self.vllm = VLLM.from_pretrained('vllm-model')

    def forward(self, input_ids, attention_mask):
        # Run the GLM model
        glm_output = self.glm(input_ids, attention_mask)
        
        # Run the VLLM model
        vllm_output = self.vllm(input_ids, attention_mask)
        
        # Combine the outputs
        output = torch.cat((glm_output, vllm_output), dim=1)
        
        return output

Verification

Run the updated model on a test dataset to verify that it produces the expected output.
Compare the output with the expected behavior to ensure that the fix is successful.

Extra Tips

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

FAQ

Expected behavior

#api #ssr #installation #tensor shape #autograd error #file not found #serialization error #model compatibility #GPU setup #container setup

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

transformers - 💡(How to fix) Fix The torch.split() return values in GlmMoeDsaIndexer [1 comments, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

extent analysis

Problem Summary

Root Cause Analysis

Fix Plan

Update Model Configuration

Update Model Code

Example Code

Verification

Extra Tips

FAQ

Expected behavior

Still need to ship something?

TRENDING

transformers - 💡(How to fix) Fix The torch.split() return values in GlmMoeDsaIndexer [1 comments, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

extent analysis

Problem Summary

Root Cause Analysis

Fix Plan

Update Model Configuration

Update Model Code

Example Code

Verification

Extra Tips

FAQ

Expected behavior

Still need to ship something?

RELATED_DISCOVERY

TRENDING