Yifan Ding

2 comments by Yifan Ding

Hey, thanks for raising the question. Basically, hetseq generates HDF5 files from downloaded Wikipedia data, following the logic provided by NVIDIA at https://github.com/NVIDIA/DeepLearningExamples/blob/master/PyTorch/LanguageModeling/BERT/data/create_datasets_from_start.sh. You may need to adapt the code...

You definitely can do that. Since you have an A100, a single A100 is probably good enough to train a large language model.