direct-preference-optimization

Implementation for Plackett-Luce rank model

Open • rohan598 opened this issue 1 year ago • 1 comment

@eric-mitchell Will you be adding an implementation of the Plackett-Luce ranking model in addition to the current Bradley-Terry model?

Looking forward to hearing from you!

rohan598 • Mar 04 '24 22:03

@rohan598 I was wondering if you made any headway in this direction? Thanks!

jdchang1 • Apr 26 '24 13:04