zhongzhe
zhongzhe
Hi@SmorkalovME, I was running multi-nodes intel-caffe on amd cpus(x86)(64cores),when i use single node ,the cores are used 100%,  but when i run on multinodes(4nodes or more),only few cores are...
when i run intel caffe on multi-node(four node) with mlsl on AMD cpus,something is wrong ,the training stopped at the Iteration 0, when run on single node ,it is ok....
[03/07/24 14:50:30] INFO colossalai - colossalai - INFO: train.py:155 main INFO colossalai - colossalai - INFO: Dataset contains 105060 samples [03/07/24 14:52:00] INFO colossalai - colossalai - INFO: train.py:165 main...