snwen123
Use DeepSpeed to estimate the model's memory requirements. For the llama-7b model, ZeRO-2 requires more than 147 GB of CPU RAM, and ZeRO-3 requires more than 166 GB. This may be...
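For anyone who wants to sanity-check numbers like these, here is a rough sketch of the arithmetic. It is not DeepSpeed's code. It assumes full CPU offload on a single GPU, about 16 bytes of CPU RAM per parameter for ZeRO-2 and about 18 for ZeRO-3, and a 1.5x safety buffer; those constants are my reading of the DeepSpeed estimator defaults and should be treated as assumptions. The function names and the parameter count are hypothetical.

```python
GIB = 2**30  # bytes per GiB


def cpu_ram_bytes(total_params: int, bytes_per_param: int,
                  buffer_factor: float = 1.5) -> int:
    """Rough CPU RAM estimate for model states under full CPU offload.

    bytes_per_param and buffer_factor are assumed constants (see the note
    above), not values read from DeepSpeed at runtime.
    """
    return int(total_params * bytes_per_param * buffer_factor)


def to_gib(num_bytes: int) -> float:
    """Convert a byte count to GiB."""
    return num_bytes / GIB


if __name__ == "__main__":
    # Hypothetical parameter count for a ~7B model; the exact count depends
    # on the checkpoint and is not given in the comment above.
    params = 6_600_000_000
    for stage, bytes_per_param in (("zero2", 16), ("zero3", 18)):
        estimate = to_gib(cpu_ram_bytes(params, bytes_per_param))
        print(f"{stage}: ~{estimate:.0f} GiB CPU RAM")
```

With that assumed parameter count the sketch lands in the same range as the figures quoted above. For real numbers, DeepSpeed's own helpers (`estimate_zero2_model_states_mem_needs_all_live` and `estimate_zero3_model_states_mem_needs_all_live`) take the loaded model and print estimates for several offload configurations.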