Haoran1234567
Results
2
comments of
Haoran1234567
When I train train_sft with very small training data instructino_wild, it can run normally. When I increase some data, this happens, but the increased data is not much. i use...
Here is my run script: torchrun --standalone --nproc_per_node=8 train_sft.py \ --pretrain $PRETRAIN \ --model 'llama' \ --strategy colossalai_zero2 \ --log_interval 10 \ --save_path $SAVE_PATH \ --dataset $DATASET \ --batch_size 2...