Afzal issues

Results 4 issues of


                                            Afzal

GPU power consumption doesn't go back to idle state after CUDA finishes GeMM execution

I am running some experiments using NVML and CUDA GeMM implementation for power consumption. I measured the following trend of power consumption for multiplication of two 16384 sized square matrices....

Anchor Sizes for the NuScenes Dataset

Thanks for making your project open-source. The paper states that that you utilized small anchor sizes for the TUM dataset (Section V A. (b)) but it doesn't specify the anchor...

Clock Speed Set to 1530MHz when V100 has a max boost clock of 1380 MHz

In your GPU benchmark, you set the persistence mode to ON and then lock the GPU clocks to 1530,1530 as follows: ``` python3 process = subprocess.Popen( 'sudo nvidia-smi --lock-gpu-clocks=1530,1530'.split(' '),...

`all_reduce` does not apply `scale` when `xr.world_size == 1`

## ❓ Questions and Help Hi, I have noticed that when `world_size == 1`, `all_reduce` is a no-op and does not apply `scale`: In `torch_xla.core.xla_model` in `def all_reduce`: ``` #...

question

distributed