Sean
Sean
Yes, I second this, in OpenRLHF the special tokens are provided to the custom reward function. I had a real problem that my model didn't know to generate EOS token...
Oh I see what happened here: V2 inherited V1, but the super() is supposed to be calling the function from `ChatTemplateStrategy`, but instead, it called the function from the parent...
My issue was actually my bad, I did not prepare data as "a list of dicts": https://openrlhf.readthedocs.io/en/latest/quick_start.html#prepare-datasets
Thanks! I'll close this.