Mengjie Wu

Results 3 comments of Mengjie Wu

> > @jaywongs , did upgrading deepspeed work for you? > > not work for me,i use the deepspeed 0.14.2 Hello, have you solved it? I also encountered the same...

> [@hungnvk54](https://github.com/hungnvk54) I think it is normal. The picture shows the training early, only 100 batches have passed, and no complete epoch has passed. Hi @cdliang11 I have a question...

> 您好,我的infer命令是: > > swift infer --model --val_dataset --result_path --max_pixels 131712 --max_new_tokens 32 --logprobs true --train_type full --torch_dtype bfloat16 \ > > 想获得的帮助是: > > 1. val_dataset 中有些url是不能下载的,导致整体程序报错,有无办法跳过这些(类似训练那样) > 2....