Seungone Kim
Thanks for your response @natolambert! I was trying to test generative reward modeling (with GPT-4, Prometheus, Auto-J) and it seems like `run_dpo.py` has a slightly different functionality than what I...
Sounds fair enough! I'll organize the code and upload the models by the end of this week. Thanks in advance!
@lintangsutawika Is there currently any initiative for this feature? I would love to help.
@baberabb Hello Baber, nice to meet you! I'd love to collaborate with you on this! Which platform would be best for reaching you (e.g., Slack, Discord)?
Hello @95jinchul, the model isn't specifically trained to evaluate Korean responses, but it does work to some extent. The trick here is to only convert the {instruction, response} into...