beta-DPO
[NeurIPS 2024] Official code for $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$