zhaobin Chu

Results 3 issues of zhaobin Chu

我在用Accelerator的deepspeed做u-net微调时,即使batch_size=1,仍会出现显存溢出

I tried using LoRA to fine-tune the U-Net with SVD, and even with a batch size of 1, memory overflow occurs on the A100-80G GPU when the dataset consists of...

将模型量化成int8后执行不了,不量化的模型就能运行