CONQUEST-release Run weak scaling tests

The weak scaling test in benchmarks can now be run automatically (tested on Archer2), using ReFrame and Spack. The spack package is upstreamed to spack, the ReFrame test is currently in a branch of excalibur-tests that I'll try to merge soon.

To run the weak scaling test (on Archer2), follow the setup instructions on excalibur-tests, and launch the test with reframe using

reframe -c ../excalibur-tests/benchmarks/apps/conquest/ -r -n SiWeakScaling --system=archer2 -J'--qos=standard'

Current result is below.

Questions raised by the results

Why are we not seeing much performance degradation from going from one full node to two or more full nodes? Performance actually improves from 256 cores (2 nodes) to 512 cores (4 nodes)
Why does thread scaling get so much worse going to 4 and 8 threads?
Why are we not seeing much performance degradation from filling up a node with MPI ranks?

Sep 20 '23 16:09 tkoskela

Briefly discussed this on Dec 5th, trying to write down what I remember

[ ] Try scaling up further to larger system size to better see the multi-node behaviour. @davidbowler needs to generate more input files
[x] Try running with more work per MPI rank. @tkoskela running with 16 atoms per rank

Dec 13 '23 15:12 tkoskela

Comparison of 16 atoms per rank (left) vs 8 atoms per rank (right).

Dec 14 '23 13:12 tkoskela

Archer2 budget has been increased by 3000 CU

Apr 15 '24 13:04 tkoskela