Contra_Z

Results 7 issues of Contra_Z

It's a great job. But when i test, I have a problem.Why is this problem, the figure1 isgreat, and the figure is bad. Is the downsampled mask not cover all...

If I register an XID through DCGM's policy and listen, when a certain XID (for example, 79) occurs, will the policy keep reporting that XID until it recovers, or will...

When running extended-level diagnostics on 8 cards simultaneously, 8 H20s may occupy approximately 8GB of memory at most, while 8 H800s may occupy up to 16GB of memory. What causes...

1. When I run diagnostics on GPU0 alone, it will fail. $ dcgmi diag -r 2 -g 7 ![image](https://github.com/NVIDIA/go-dcgm/assets/65804647/3b12bc0c-1898-4998-8c94-58534435e268) 2. When I run diagnostics on GPU1 alone, the diagnostics result...

When I run a long diagnostic on one GPU, the Memory Usage of the other GPUs goes from 0 to 3M, and then back to 0 when the diagnostic is...

I reviewed the code for the dcgmi tool and found that before querying metrics using the dcgmGetLatestValues_v function in the dmon feature, it first calls dcgmWatchFields and dcgmUpdateAllFields to start...

In the policy manager, I skipped the time interval check for XID to prevent XID events from being lost