direct-preference-optimization
direct-preference-optimization copied to clipboard
Implementation for Plackett-Luce rank model
@eric-mitchell Will you be adding the implementation for Plackett-Luce rank model in addition to the current Bradley-Terry model?
Looking forward to hearing from you!
@rohan598 I was wondering if you made any headway in this direction? Thanks!