shhn1
Any updates on this? Thanks @yaozhewei
Hi, I ran into a similar problem. Did you solve it? If so, could you please tell me how you fixed it? Thank you so much!
> > I am also facing the same issue. I am not able to build from source either, and I still hit this issue even though I am using torch version 1.11.0. This...
> I also trained Qwen3-8B EAGLE3 using SpecForge on a Chinese dataset (Chinese-DeepSeek-R1-Distill-data-110k, with answers generated using Qwen3-8B), but after 10 epochs I only got a (Position 0) train acc of 0.7, and test using vllm...
I have the same problem. When I try to train the Qwen3 32B model with 32k data, an OOM error occurs after 2000 steps. How can I solve it? I...