-
Notifications
You must be signed in to change notification settings - Fork 28.1k
Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Qwen2VLForConditionalGeneration doesn't work with MPS devices
bug
#36413
opened Feb 26, 2025 by
tonywu71
2 of 4 tasks
Make Mamba Export Compatible with ONNX
Feature request
Request for a new feature
#36403
opened Feb 25, 2025 by
AyoubMDL
Please add support for TransPixar AI
New model
#36398
opened Feb 25, 2025 by
jozefchutka
2 tasks done
[BUG]npu zero3 训练自定义模型时,报错Function SumBackward0 returned an invalid gradient at index 0
bug
#36387
opened Feb 25, 2025 by
Zane-Qbb
4 tasks
Set non_blocking=True When moving data from the CPU to the GPU
bug
#36384
opened Feb 25, 2025 by
Hukongtao
2 of 4 tasks
WARNING: transformers 4.49.0 does not provide the extra 'emu3'
Feature request
Request for a new feature
#36381
opened Feb 25, 2025 by
Prasaderp
Add EVEv2 : a series of Encoder-free VLM's
New model
#36379
opened Feb 24, 2025 by
sbucaille
2 tasks done
modeling_deformable_detr.py DeformableDetrMultiheadAttention.foward function report error for "hidden_states_original" if position_embeddings is None
bug
#36378
opened Feb 24, 2025 by
susanbao
4 tasks
目前使用Ktransformers进行DEEPSEEK-R1满血版和4bit量化版模型进行推理,推理速度有多少tokens/s?对应的计算资源配置分别是多少?
#36363
opened Feb 24, 2025 by
William-Cai123
The arguments in
utils/modular_model_converter.py
is different from those in docs
#36362
opened Feb 24, 2025 by
zhoubay
warning bug in Qwen2DecoderLayer in transformers ==4.49
bug
#36361
opened Feb 24, 2025 by
Kyrie666
2 of 4 tasks
Are there any plans to provide some performance analysis tools for transformers?
Feature request
Request for a new feature
#36360
opened Feb 24, 2025 by
Hukongtao
Allow setting a seed for DataCollatorForLanguageModeling
Feature request
Request for a new feature
#36357
opened Feb 23, 2025 by
capemox
Groq inference provider
Feature request
Request for a new feature
#36353
opened Feb 23, 2025 by
VladOS95-cyber
Implement Titans Architecture with GRPO Fine-Tuning
New model
#36352
opened Feb 23, 2025 by
rajveer43
2 tasks
Support sliding_window for sdpa in qwen2
Feature request
Request for a new feature
#36351
opened Feb 23, 2025 by
cyr0930
Failed to import transformers.models.auto.modeling_auto because numpy.core.multiarray failed to import
bug
#36343
opened Feb 22, 2025 by
forrestbao
4 tasks
Previous Next
ProTip!
Updated in the last three days: updated:>2025-02-23.