Lesi Chen comments

Results: 9 comments by Lesi Chen

Is the recognition module usable yet?

Same question here; the training process does not seem to converge.

StopIteration: Caught StopIteration in replica 0 on device 0.

> When running `bash run_wt103_base.sh train --work_dir TRAIN_wt103`, the same problem happens to me as well. The PyTorch version is 1.12, the GPU is a 3090, and the CUDA version is 11.3....
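
A possible cause, stated as an assumption since the thread does not confirm it: in multi-GPU PyTorch code this error often appears when a replica calls `next(self.parameters())` and the parameter iterator turns out to be empty. The sketch below reproduces only the underlying Python behaviour without PyTorch. `FakeReplica`, `first_param_unsafe`, and `first_param_safe` are hypothetical names for illustration, not part of any library or of the repository discussed here.

```python
from typing import Iterable, Iterator, Optional


class FakeReplica:
    """Hypothetical stand-in for a module replica; NOT a PyTorch class."""

    def __init__(self, params: Iterable[float]) -> None:
        self._params = list(params)

    def parameters(self) -> Iterator[float]:
        # Returns a fresh iterator each call, like an iterator over parameters.
        return iter(self._params)


def first_param_unsafe(module: FakeReplica) -> float:
    # next() on an exhausted iterator raises StopIteration.
    return next(module.parameters())


def first_param_safe(module: FakeReplica,
                     default: Optional[float] = None) -> Optional[float]:
    # Passing a default to next() avoids the exception when nothing is left.
    return next(module.parameters(), default)
```

If this is indeed the cause, the usual workaround is to avoid relying on `next(self.parameters())` inside the forward pass, for example by taking the dtype and device from an input tensor instead.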

Model collapse

> Might be relevant to my observations as well. See #4 Thanks a lot.

Model collapse

> Hi, thank you for your interest in our work! Did it happen a lot over a wide range of queries after training or just a small number? > >...

Model collapse

Thanks a lot for your reply! I will keep trying.

Model collapse

I think I may have found the reason. The "use_gt_labels" in the file `train_cllm_global.py` should be set to **False**, instead of the default setting of **True**. After this modification, the...

Only has 0.44 accuracy on GSM8K after running the provided codes

I used exactly the same dataset and trained for a whole epoch, but regardless of whether `use_gt_labels` is set to True or False, I cannot get the desired result.

Only has 0.44 accuracy on GSM8K after running the provided codes

But after this modification, the accuracy becomes 0.0. It seems that this modification is not correct.

Only has 0.44 accuracy on GSM8K after running the provided codes

But it is strange that setting use_gt_labels=False still does not solve this problem.