Open-Assistant
Instructions to reproduce training
Remaining TODOs:
- Need to fix #1661
- @theblackcat102 please provide the scripts you use to preprocess data for the RM
We also need:
- A simpler RM based only on our dataset
- Some refactoring of the RM code
- More experiments with RL...