user2311717757
Results: 2 issues of user2311717757
PPO training raises a UserWarning: KL divergence is starting to become negative: -233.50 - this might be a precursor for failed training. sometimes this happens because the generation kwargs are not correctly set....
question
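Dear Data-Juicer framework developers, hello. We recently needed to process data for large language models. From the paper "Data-Juicer: A One-Stop Data Processing System for Large Language Models" we learned about Data-Juicer, the open-source data processing framework for large language models, and we would like to use and explore it further. We also saw that you ran the "FT-Data Ranker: Large Language Model Fine-Tuning Data Competition (7B Model Track)" on Tianchi. However, the competition has ended, so the original data can no longer be obtained. Could you provide the original data so that we can explore and use the Data-Juicer framework? Many thanks 🙏.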
question