David Beer
@sanghvimanan I'm working on a fix here. Can you share details on how you created this container?
This issue has now been fixed and will be released with DCGM 3.2.6.
@luccabb what is the output of nvidia-smi? What GPU generation are you using?
@optyang can you post the Python code you're using that isn't working correctly?
We have updated the documentation. Instead of setting freq0, you can request certain patterns of parameters that test the GPU well. Please refer to the updated documentation.
Hi Ligeweiwu - the memory bandwidth test is unfortunately not yet releasable as open source. To run the test with open source, you can download a released version of DCGM...
That's correct, although the memory test should run with -r 2 and higher.
Hi, as of DCGM 2.4.7, you can turn off this check by adding '-p pcie.test_nvlink_status=false' to your dcgmi diag command line. For now, can you paste the output of nvidia-smi...
Adding that parameter is a good workaround if you do not have NVLinks on the board. If it's okay with you, then it should be alright. As I said before,...
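For anyone scripting the workaround above, here is a minimal sketch of assembling the dcgmi diag command line in Python. The helper name build_diag_command is hypothetical and not part of DCGM; the -r and -p flags and the pcie.test_nvlink_status=false parameter are the ones quoted in the comments above. The sketch only builds the argument list and does not run dcgmi.

```python
def build_diag_command(run_level, parameter=None):
    """Build an argv list for `dcgmi diag` (hypothetical helper, not a DCGM API).

    run_level: value passed to -r, e.g. 2.
    parameter: optional "test.name=value" string passed to -p,
               e.g. "pcie.test_nvlink_status=false".
    """
    argv = ["dcgmi", "diag", "-r", str(run_level)]
    if parameter is not None:
        # Assumption: a parameter is written as name=value, as in the comment above.
        if "=" not in parameter:
            raise ValueError("parameter must look like name=value")
        argv += ["-p", parameter]
    return argv


if __name__ == "__main__":
    # Prints the command line; pass the list to subprocess.run to execute it.
    print(" ".join(build_diag_command(2, "pcie.test_nvlink_status=false")))
```

The list form can be handed to subprocess.run directly, which avoids shell quoting problems with the parameter string.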