Instructions to reproduce training

Open sanagno opened this issue 3 years ago • 0 comments

Remaining TODOs:

  • Need to fix #1661
  • @theblackcat102 please provide the scripts you use to preprocess data for the RM

We also need:

  • A simpler RM based only on our dataset
  • Some refactoring of the RM code
  • More experiments with RL...

sanagno, Feb 20 '23 22:02