Bince Qu

Results 5 issues of Bince Qu

'''usage: tokenizer.py [-h] [-c CONFIG] [--print_config[=flags]] {fit,validate,test,predict} ... error: argument subcommand: invalid choice: 'train' (choose from 'fit', 'validate', 'test', 'predict')''' there's bug with the script

请问支持4卡4090 PPO/DPO吗?用哪个脚本呢?train_ppo_llama.sh会OOM

i wonder how the sim automatically judge success? thanks