charm
AMPI nbor colls: sort neighbor ranks by PE-locality for overlap
For neighborhood collectives, ensure that we post messages for non-PE-local ranks before PE-local ones, so that non-local sends/recvs overlap with the local ones, which happen inline on the PE.
We sort at each collective invocation rather than at topological communicator creation in order to handle migrations / load balancing.
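
A minimal sketch of the ordering described above, not the actual AMPI code. The function name `orderNeighborsRemoteFirst` and the `peOfRank` lookup are hypothetical stand-ins for whatever the runtime uses to map a rank to its current PE. It would be called on every collective invocation, so the result reflects rank placement after any migration.

```cpp
#include <algorithm>
#include <functional>
#include <vector>

// Hypothetical helper (an assumption for illustration, not AMPI's API):
// reorder neighbor ranks so that ranks residing on a PE other than `myPe`
// come first, followed by the PE-local ranks.
// `peOfRank` stands in for the runtime's rank-to-PE lookup.
std::vector<int> orderNeighborsRemoteFirst(
    std::vector<int> neighbors, int myPe,
    const std::function<int(int)>& peOfRank) {
  // stable_partition moves non-local ranks to the front while keeping the
  // original relative order within the non-local and local groups.
  std::stable_partition(neighbors.begin(), neighbors.end(),
                        [&](int rank) { return peOfRank(rank) != myPe; });
  return neighbors;
}
```

Posting sends/recvs in the returned order issues the non-local messages first, leaving the inline PE-local ones to run while the non-local ones are in flight.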