Aleksandr Isakov

Results: 1 comment by Aleksandr Isakov

@yaozhewei I'm seeing the same error when training Llama: step 1 and step 2 run normally, but step 3 just won't converge.