Sean

Results 4 comments of Sean

Yes, I second this, in OpenRLHF the special tokens are provided to the custom reward function. I had a real problem that my model didn't know to generate EOS token...

Oh I see what happened here: V2 inherited V1, but the super() is supposed to be calling the function from `ChatTemplateStrategy`, but instead, it called the function from the parent...

My issue was actually my bad, I did not prepare data as "a list of dicts": https://openrlhf.readthedocs.io/en/latest/quick_start.html#prepare-datasets

Thanks! I'll close this.