DCGM
DCGM copied to clipboard
infoROM is corrupted. However, diagnostics of dcgm is all pass.
description of the problem
-
nvidia-smireport that infoROM is corrupted

-
However, diagnostics of dcgm is all pass

environment information
- Bare Metal Server : QuantaGrid D52G-4U
- GPU SKU(s) : Tesla V100-SXM2-32GB
- OS : CentOS 7.8
- DRIVER : 450.80.02
- GPU power settings (
nvidia-smi -q -d POWER) : nv_power.log - CPU(s) : Intel(R) Xeon(R) Gold 6154 CPU
- RAM : 768 GB
- Topology (
nvidia-smi topo -m) : nv_topo.log - The output of
nvidia-smi -q: nv_q.log - Full output of
dcgmi -v: dcgm_v.log
OS is Ubuntu 18.04.6 LTS