charm
charm copied to clipboard
CmiCheckAffinity causes hang with UCX build on Summit
On Summit, CmiCheckAffinity in _initCharm routine (src/ck-core/init.C) causes a hang when run on two nodes.
Observed with the UCX and PAMILRTS builds.
Runs fine when these routines are commented out.
Charm++ with UCX machine layer built with:
./build charm++ ucx-linux-ppc64le smp openpmix -j --with-production --basedir=$HOME/pmix-3.1.5/install --basedir=$HOME/ucx-1.11.1/install
Test program (1darray/hello) run with:
jsrun -n2 -a1 -c2 -K1 -r1 ./hello +ppn 1 +pemap L0 +commap L1
Still hangs on Summit.