DeepSpeed
memory reallocation for bigger batch size
Hi @reymondzzzz, thanks for the PR. I can see this fixes some assumptions we make about model size or batch size at runtime, but would you mind giving a description here of what it solves? I also see that you opened an issue about using ds-inference for a GPT-based model; is this PR related to that? Thanks, Reza
Closing due to age and lack of description.
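For context on what "memory reallocation for bigger batch size" typically means, here is a minimal sketch of the general pattern: a pre-allocated workspace buffer that is grown when a request arrives with a larger batch than the one it was sized for. This is an illustrative assumption about the problem the PR title describes, not the PR's actual code or DeepSpeed's internal API; the class and parameter names (`GrowableBuffer`, `hidden_size`) are hypothetical.

```python
import torch

class GrowableBuffer:
    """Workspace tensor that is reallocated only when its capacity is exceeded."""

    def __init__(self, hidden_size: int, dtype=torch.float16, device="cuda"):
        self.hidden_size = hidden_size
        self.dtype = dtype
        self.device = device
        self.buf = None      # lazily allocated on first use
        self.capacity = 0    # max batch * seq tokens the buffer can hold

    def get(self, batch_size: int, seq_len: int) -> torch.Tensor:
        needed = batch_size * seq_len
        if needed > self.capacity:
            # Drop the old buffer first so the allocator can reuse its memory,
            # then allocate a larger one sized for the new request.
            self.buf = None
            self.buf = torch.empty(
                needed, self.hidden_size, dtype=self.dtype, device=self.device
            )
            self.capacity = needed
        # Return a view shaped for the current request.
        return self.buf[:needed].view(batch_size, seq_len, self.hidden_size)


# Usage: the first call allocates for batch 1; a later, larger batch triggers
# a single reallocation instead of failing on a fixed-size workspace.
ws = GrowableBuffer(hidden_size=1024, device="cpu", dtype=torch.float32)
a = ws.get(batch_size=1, seq_len=128)
b = ws.get(batch_size=8, seq_len=128)   # grows the buffer here
```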