361 issues found
[New Model] Add Microsoft Samba - Hybrid SSM + Sliding Window Attention
5/25/2026
[New Model] Add TIGER — Recommender Systems with Generative Retrieval (Google DeepMind, NeurIPS 2023)
5/25/2026
cohere2_moe fails training + tensor parallel tests
5/25/2026
Qwen3.5 GatedDeltaNet: Large logit divergence between full-sequence forward and prefill+decode with cache
5/25/2026
return_tensors is silently ignored when text_kwargs is explicitly passed
5/25/2026
HIGH: Pickle model loading (torch.load) enabled by default — no warning when loading untrusted weights
5/25/2026
MEDIUM: All CI jobs use secrets:inherit — full org secrets passed to every reusable workflow
5/25/2026
Roundtrip Failure for Gemma Pipeline on "▁"
5/25/2026
ValueError: Backend should be defined in the BACKENDS_MAPPING. Offending backend: tensorflow_text
5/24/2026
[New model] Add Fun-ASR-Nano (FunAudioLLM/Fun-ASR-Nano-2512)
5/24/2026
ImportError / TypeError on Windows with AMD ROCm PyTorch due to torch.distributed dependency
5/23/2026
[BUG] deepseek-v4 `comb.to(dtype).transpose(-1, -2)`
5/23/2026
[Gemma4] `Gemma4VisionPatchEmbedder._position_embeddings` materializes a ~19 GiB one-hot tensor that's mathematically a 2-row embedding lookup
5/23/2026
[Feature Proposal] Cognitive Weighting Protocol: Hierarchical context management to mitigate "Contextual Drift" in long-running sessions.
5/22/2026
Regression in v5.x: `ProcessorMixin._load_tokenizer_from_pretrained` forces subfolder for non-primary sub-tokenizers, breaking repos that put tokenizer files at root
5/22/2026
[RT-DETRv2] MPS crash: build_2d_sinusoidal_position_embedding hardcodes torch.float64, breaking Apple Silicon / MPS inference
5/22/2026
Unable to resume model training with a locally downloaded model and trust remote code
5/22/2026
[deepseek_v4] save_pretrained silently downcasts FP32 tensors to BF16 (hc_*, attn_sink, ffn.gate.bias, compressor.ape, indexer.compressor.ape)
5/23/2026
Discussion: optional RankSEG-style decoding for Transformers semantic segmentation post-processing
5/21/2026
`**kwargs` not passed through methods of RoFormer models
5/21/2026