qiji2023
qiji2023
@chesik-amd Can you help me?
ok, thank you very much!
@chesik-amd can you help me?
@chesik-amd ok but i don't know how to give you my profile file
 I noticed a very large time gap here. Can you explain it @chesik-amd
@jeremyfelder I have some questions, could you give me some questions? 1. Can you explain `fast twiddle`? I can't understand your codes and your codes don't have any comments!!!! 2....
加了use_gradient_checkpointing_offload和training_strategy deepspeed_stage_3 同样会OOM
@Artiprocher @wenmengzhou Could you help me?thanks!
@Artiprocher 你们是否有支持(模型)流水并行微调的计划,把模型分割到多个显卡上来训练,普通玩家很难有80G显存的,拿出24GB都很不错了
@Artiprocher 有没有qlora的方法,起码让48G的卡跑起来呗