Alex Su
Alex Su
hi bro, i got the same problem. did you solve it?
> 训练是支持的,请问具体是哪个模型和需求呢? 想使用两台P800机器16卡,跑满血版deepseek V3,启动命令: - python -m paddle.distributed.launch --devices=0,1,2,3,4,5,6,7 --master=192.168.0.16:8090 --nnodes 2 --nproc_per_node 8 --rank 0 deepseek_V3.py - python -m paddle.distributed.launch --devices=0,1,2,3,4,5,6,7 --master=192.168.0.16:8090 --nnodes 2 --nproc_per_node 8 --rank 1 deepseek_V3.py...
> 训练是支持的,请问具体是哪个模型和需求呢? 辛苦看下启动命令是否有问题,还是两机16卡的P800本身跑不了满血版的deepseek V3吗
The same problem comes for H20