Sequence
### System Info transformers version: 4.43.0.dev0 (installed from source) ### Who can help? @ArthurZucker ### Information - [X] The official example scripts - [ ] My own modified scripts ###...
# What does this PR do? Fixes issue #31791. 1. Fixed bugs in the JetMoE causal language model and sequence classification model. 2. The original code reports errors when we...
## Summary I encountered a size mismatch problem like the one in https://github.com/EleutherAI/lm-evaluation-harness/issues/1707#issue-2244839671 when evaluating a PEFT model (DeepSeek-R1-Distill-Qwen-1.5B), but the root cause is the inherent mismatch between the model's `vocab_size` and...