Patrick Kenny

3 comments by Patrick Kenny

I'm gonna try taking a look at this one and will update if I get stuck (this is my first issue in this project).

I released the PR anyway; if anything, it might be a decent temporary fix.

Also seeing this issue with Qwen-based models: https://huggingface.co/dunzhang/stella_en_400M_v5 and the 1.5B variant both have this problem.