chensongcan

Results 15 comments of chensongcan

could you provide the dataset?

@Sanqiang Could you please provide the file for dataset? /zfs1/hdaqing/saz31/dataset/

@DhairyaPatel7 have your solved your problem?

SO am I,could you tell me whether your problem has been solved or not?

我的数据加载是先处理完cache存起来加载的,目前已经是进入训练阶段,根据错误提示,应该是卡在了all_gather阶段 补充一下: 一样的环境跑其他的模型,如qwen1.5非moe模型没有问题,可以正常训练

Restricted text generation, I can think of the direction of landing is to make a sentence, I do not know what other direction of landing. Could you share some of...

使用预训练脚本,通过去掉overwrite_output_dir参数,加载保存在output的checkpoint-100文件夹,出现RuntimeError: Error(s) in loading state_dict for PeftModelForCausalLM:

请问目前公开的代码预训练的时候加上bias项了吗

这个项目是基于llama从0开始训练是吗