Ko Sung

Results 3 comments of Ko Sung

@amulil 请问现在有DPO训练的模型指标对比吗?我想参考这个实现[RLHF-V](https://arxiv.org/abs/2312.00849) code: https://github.com/RLHF-V/RLHF-V, https://github.com/thunlp/Muffin

直接在config中定义好新加的special token,传到dataset和model中,model 需要resize_embedding ``` for special_token in special_tokens: if special_token not in tokenizer.get_vocab(): tokenizer.add_tokens([special_token], special_tokens=True) print(f'After adding special tokens, Vocabulary Size: {len(tokenizer)}') ```