samulew
Results
1
comments of
samulew
met the some problem , and i add the code at line 72 of train_reward_model.py , imitating train_sft.py : tokenizer = LlamaTokenizer.from_pretrained(args.pretrain) tokenizer.eos_token = '' tokenizer.pad_token = tokenizer.eos_token