Augenstern

Results 5 comments of Augenstern

> Thanks for your pointing out. However, that is no matter as they are symmetrical. OK, and I also have a question about that formulation. The original formulation has two...

> You can check the differences carefully to examine whether they are equivalent. Whatever, implementation details may be a little different, while the performance is the key. Do not sink...

The server in the copy_data.sh is down. How can we get the pretrian model and wiki.tar ?

I encountered the same problem, have you solved it now?

请问你解决这个问题了吗?我现在用qwen-vl也是lora时会训练很久之后遇到这个childfailederror,而且最后给的也是traceback : Signal 11 (SIGSEGV) received by PID xxx的错误,为了避免是显存的问题,特意调成了2b的模型,而且开了bf16,帧率和pixel,token数量都做了压缩,确认显存是足够的