Lei Zuo
Results: 1 issue by Lei Zuo
**Describe the bug** I'm working on a Stable Diffusion model. When I use torch.compile together with ZeRO-2 on 32 GPUs (4 machines with 8 A100s each), the training hangs at the...
bug
training