MAxx8371
MAxx8371
Does reference model, proxy model and main model have to be initialized with the same method? When continue pretraining LlaMA2 with doremi, the weights of the main model are initialized...
In the paper, it mentions that the score is defined as follow. Is that calculated by summing the logprob of each token of the ground true y conditioned on (e,x)?...
### Description / 描述 仓库中提供的V1.0测试数据和脚本的链接失效了“https://cloud.tsinghua.edu.cn/f/71b5232264ae4833a4d0/?dl=1”,请问这份数据还会更新吗?感谢 ### Case Explaination / 案例解释 _No response_