MOSS icon indicating copy to clipboard operation
MOSS copied to clipboard

使用SFT后的FP32模型进行生成,报错RuntimeError: where expected condition to be a boolean tensor, but got a tensor with dtype Half

Open ARIELDENG opened this issue 2 years ago • 4 comments

image

ARIELDENG avatar May 05 '23 13:05 ARIELDENG

通过zero_to_fp32.py文件将上述多组pt文件转成pytorch_model.bin,此外在index.json里将所有参数都指向了pytorch_model.bin

ARIELDENG avatar May 05 '23 13:05 ARIELDENG

您好,我这里也出现了同样的问题,请问您解决了嘛。如解决了,能给个提示吗,谢谢

hingkan avatar May 25 '23 06:05 hingkan

我也是这个问题,mark一下

cjrzh avatar May 26 '23 06:05 cjrzh

我比较简陋的在模型加载时指定的torch_dtype删除,如: raw_model = MossForCausalLM._from_config(config) model = load_checkpoint_and_dispatch( raw_model, model_path, device_map="auto", no_split_module_classes=["MossBlock"]) 如你们找到好的办法,希望告诉我,谢谢。

hingkan avatar May 26 '23 06:05 hingkan