Megatron-DeepSpeed
Megatron-DeepSpeed copied to clipboard
How to continue pre-training Bloom?
Hi I'm trying to continue pre-training the bloom-560m on my own dataset on a single GPU. I modified this script to fit my case. However, i cannot figure out how to load the checkpoint.
Is there any guide for what i'm doing?
Hi, did you find any solution on this?
@ShinoharaHare Hi, have you solved this problem?