
Results 4 comments of

Hey, your jumps are too accurate, so the leaderboard is visible only to you

WeChat can now detect cheating.

Unmatched counties are output as Chongqing

Change line 1054 of adcodes.csv.

BGE-M3 pre-training question: the loss occasionally increases

> There is no appropriate metric to evaluate the performance of the pre-training task. We recommend selecting the checkpoint based on its performance on the downstream fine-tuning task. After pretraining on my...

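Question about RetroMAE pre-training

> > So the pre-training log only shows the loss, and we pick the better pre-trained model by watching the loss curve, is that right? And why does the model need fine-tuning after training before it can be used to compute sentence similarity?
>
> The objective of pre-training is not to compute sentence similarity but to reconstruct the whole sentence from its sentence embedding. That is why fine-tuning is needed before use on downstream tasks.

A follow-up question: once pre-training is finished, how much data needs to be prepared for fine-tuning? How well does fine-tuning work with only a small amount of downstream task data?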