DeepSpeed icon indicating copy to clipboard operation
DeepSpeed copied to clipboard

memory reallocation for bigger batch size

Open reymondzzzz opened this issue 3 years ago • 2 comments

reymondzzzz avatar Aug 18 '22 13:08 reymondzzzz

CLA assistant check
All CLA requirements met.

ghost avatar Aug 18 '22 13:08 ghost

Hi @reymondzzzz Thanks for the PR. I see this can fix some assumptions we have on model size or batch size during the runtime. But, would you mind give a description here to see what it solves? I also see that you opened an issue about using ds-inference for a GPT-based model, is this PR related to that? Thanks, Reza

RezaYazdaniAminabadi avatar Aug 18 '22 16:08 RezaYazdaniAminabadi

Closing due to age and lack of description.

jomayeri avatar Aug 25 '23 23:08 jomayeri