Bince Qu
Results: 5 issues of Bince Qu
'''usage: tokenizer.py [-h] [-c CONFIG] [--print_config[=flags]] {fit,validate,test,predict} ... error: argument subcommand: invalid choice: 'train' (choose from 'fit', 'validate', 'test', 'predict')''' There's a bug in the script.
Does this support PPO/DPO on four 4090 GPUs? Which script should I use? train_ppo_llama.sh runs out of memory (OOM).
I wonder how the sim automatically judges success. Thanks.