Lei Zuo

Results 1 issues of Lei Zuo

**Describe the bug** I'm working on a stable diffusion model. When I use torch.compile together with zero2 at 32 GPUs (4 machines 8 A100s each), the training hangs at the...

bug
training