Kane
Results
1
issues of
Kane
**Environment** OS: [Ubuntu 22.04.5 LTS] Python Version: [3.10.14] Package Version: [openrlhf 0.6.0.post3] **Reproduction Steps** 1. Ran command: from Readme example `deepspeed --module openrlhf.cli.train_ppo \ --pretrain OpenRLHF/Llama-3-8b-sft-mixture \ --reward_pretrain OpenRLHF/Llama-3-8b-rm-mixture \...