Alex Su

Results 2 issues of Alex Su

Hi, I am training a Llama2-7b model with Megatron-LM on four H20 nodes, 32 GPUs in total. The parallel strategy is TP=8/PP=2/DP=2. Now, I want to know the data...
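
As a quick sanity check on the numbers in this post, the three parallel degrees should multiply to the total GPU count (8 × 2 × 2 = 32). The sketch below is plain Python arithmetic, not Megatron-LM code; the helper name `data_parallel_size` is hypothetical and only illustrates how the DP degree follows from the world size once TP and PP are fixed.

```python
def data_parallel_size(world_size: int, tp: int, pp: int) -> int:
    """Return the DP degree implied by the total GPU count, TP and PP.

    Hypothetical helper for illustration; it is not part of Megatron-LM.
    """
    model_parallel = tp * pp  # GPUs needed to hold one model replica
    if world_size % model_parallel != 0:
        raise ValueError(
            f"world_size={world_size} is not divisible by TP*PP={model_parallel}"
        )
    return world_size // model_parallel


# Values taken from the post: 32 GPUs, TP=8, PP=2.
dp = data_parallel_size(world_size=32, tp=8, pp=2)
print(f"DP = {dp}")  # DP = 2
```

With these values each model replica spans 16 GPUs, so 32 GPUs hold two replicas, which matches the DP=2 stated above.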

### Please ask your question Does the current PaddleNLP support multi-node deployment, and does it let users customize the DP/PP/TP partitioning?

question