[BUG] DeepSpeed ZeRO-3 student+teacher memory leak

Open Quan-Sun opened this issue 2 years ago • 0 comments

Hi there,

When using ZeRO-3 together with `zero.Init` in a distillation setup (student + teacher), I observe a memory leak: the maximum allocated memory increases with every iteration. The leak does not occur when `zero.Init` is disabled.

[Image: `zero.Init` enabled]
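
For anyone trying to confirm the same pattern, here is a minimal sketch of how the per-iteration peak could be logged and checked. It assumes a CUDA device and a training step wrapped in a zero-argument callable; `record_peak_memory`, `grows_every_step`, and the `warmup` parameter are hypothetical names used only for illustration, not part of DeepSpeed. The only library call used is `torch.cuda.max_memory_allocated()`.

```python
from typing import Callable, List, Sequence


def record_peak_memory(step_fn: Callable[[], None], num_steps: int) -> List[int]:
    """Run step_fn num_steps times; return peak allocated bytes after each step.

    step_fn is a hypothetical callable that runs one student+teacher iteration.
    """
    import torch  # imported lazily so grows_every_step works without a GPU

    peaks = []
    for _ in range(num_steps):
        step_fn()
        # The peak counter is cumulative because it is never reset here,
        # so a healthy run should plateau after the first few iterations.
        peaks.append(torch.cuda.max_memory_allocated())
    return peaks


def grows_every_step(peaks: Sequence[int], warmup: int = 1) -> bool:
    """Return True if each reading after the warmup steps exceeds the previous one."""
    tail = list(peaks[warmup:])
    if len(tail) < 2:
        return False
    return all(later > earlier for earlier, later in zip(tail, tail[1:]))
```

Running the same step function once with `zero.Init` enabled and once with it disabled, then comparing the two lists of readings, should show whether the growth is tied to `zero.Init`.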

Quan-Sun, Apr 18 '23 08:04