t-sifanwu
Thanks for your reply! I have another question about training the bradley-terry-rm models. In bradley-terry-rm/llama3_rm.py, you use the dataset "hendrydong/preference_700K"; is that the same...
> Yes, you can use hendrydong/preference_700K and the script we provide to process it into the format used by the pairwise preference dataset!

Thanks for your reply! Since the provided data...