mpiP
mpiP copied to clipboard
Incorrect timing values with Python Code
While using mpiP with a python based application that uses Horovod + NCCL, I am seeing wrong timings reported by mpiP. A sample report along with system details is attached for reference. As per the Linux time command (time mpiexec ....), the application finishes in 6950 seconds whereas mpiP reports show apptime of 13900 seconds for each rank. Does mpiP work correctly with Python + NCCL code or let me know if I am missing something?
Thanks, Amit