Y. Wang
Y. Wang
parallelCluster Manager 3.2.0 created with https://www.hpcworkshops.com/03-deploy-pcm/01-deploy-pcm.html does not propagate PerUnitStorageThroughput to the final Cluster Configuration. This caused the "Dry Run" failed. The workaround is adding a line manually to the...
The following query for the "AutoScaling Groups In-Service Capacity" displays different values depending on different Period chosen. When choosing one day interval from console, the period becomes 5 minutes, which...
The latest version of the [nccl-tests.Dockerfile based on NCCL 2.27.7](https://github.com/aws-samples/awsome-distributed-training/blob/main/micro-benchmarks/nccl-tests/nccl-tests.Dockerfile#L9) has a severe performance degradation compared with previous version based on NCCL 2.27.5. Please update to a newer version to...