Aleksandr Isakov
Results
1
comments of
Aleksandr Isakov
@yaozhewei Same error for training Llama, step1 and step 2 are normal, but step 3 just won't converge