ImageReward
ImageReward copied to clipboard
w/o prertain loss in the code
Formula 3 is in the paper, but not in the code. If I missed it, please let me know.
We have simplified the code to better demonstrate the ReFL algorithm. You can add it directly inside the code. 😊