Ko Sung
TSM/TSN... mmaction2 tensorrt
@amulil Are there any metric comparisons for models trained with DPO yet? I would like to use this implementation as a reference: [RLHF-V](https://arxiv.org/abs/2312.00849). Code: https://github.com/RLHF-V/RLHF-V, https://github.com/thunlp/Muffin
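Define the newly added special tokens directly in the config and pass them to both the dataset and the model. The model then needs its embeddings resized (resize_embedding) to match the new vocabulary size.

```
# Add each special token that the tokenizer does not already know.
for special_token in special_tokens:
    if special_token not in tokenizer.get_vocab():
        tokenizer.add_tokens([special_token], special_tokens=True)
print(f'After adding special tokens, Vocabulary Size: {len(tokenizer)}')
# Afterwards, resize the model embeddings to len(tokenizer); with a Hugging Face
# Transformers model (an assumption here) that is model.resize_token_embeddings(len(tokenizer)).
```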
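To illustrate why the embeddings have to be resized after adding tokens, here is a minimal, library-free sketch. It is not the code from this repo: `add_special_tokens`, `resize_embedding`, the toy vocabulary, and the `<image>` / `<box>` tokens are all hypothetical names chosen for illustration. It only shows the idea that every new token id needs a matching embedding row, while the existing rows stay unchanged.

```python
import random


def add_special_tokens(vocab, special_tokens):
    """Append tokens that are missing from vocab; return how many were added."""
    added = 0
    for token in special_tokens:
        if token not in vocab:
            vocab[token] = len(vocab)  # next free id
            added += 1
    return added


def resize_embedding(embedding, new_size, dim, seed=0):
    """Return a table with new_size rows: copy existing rows, randomly init new ones."""
    rng = random.Random(seed)
    resized = [row[:] for row in embedding[:new_size]]
    while len(resized) < new_size:
        resized.append([rng.gauss(0.0, 0.02) for _ in range(dim)])
    return resized


# Toy setup (hypothetical): two known tokens with 2-dimensional embeddings.
vocab = {"hello": 0, "world": 1}
embedding = [[0.1, 0.2], [0.3, 0.4]]
special_tokens = ["<image>", "<box>", "hello"]  # "hello" already exists and is skipped

num_added = add_special_tokens(vocab, special_tokens)
embedding = resize_embedding(embedding, len(vocab), dim=2)
print(f"Added {num_added} tokens, vocabulary size: {len(vocab)}, embedding rows: {len(embedding)}")
```

If the embedding table were left at its old size, the ids assigned to the new tokens would index past the end of the table, which is the failure the resize step prevents.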