xd2333
xd2333
in my case, [PAD151645] [PAD151644] [PAD151643] shows in output(Qwen-14B-Chat q4k)
类似问题,恢复检查点后显存占用正常
> You're correct! It seems like `max_seq_length`'s default of 4096 is auto scaling TinyLlama, causing bad outputs - I'll fix this asap - thanks for the report! Hi unslothai, thx...
> > ### Reminder > > > > * [x] I have read the README and searched the existing issues. > > > > ### System Info > > transformers>=4.43.0...
> > ### Reminder > > > > * [x] I have read the README and searched the existing issues. > > > > ### System Info > > transformers>=4.43.0...
for anyone saw this with Wan2.2-I2V-A14B, try use `arch: 'wan22_14b_i2v'`