Wen Yang
Results
3
comments of
Wen Yang
I got the same error, have you solve this problem?
@lixinliu1995 @yaozhewei @lw3259111 @jiacheng-ye @HeyangQin I have solved this problem by adjusting the version of deepspeed to 0.7.7 . Before that, the error will happen when the version of deepspeed...
@CrazyBoyM @chanel111 你好,请问一下你们在使用LLama3-Instruction直接在中文数据上进行DPO的过程中,有遇到DPO训练过后的模型response会出现生成重复的这种现象吗,有通用的稳定的解决方案吗?谢谢