DCGM icon indicating copy to clipboard operation
DCGM copied to clipboard

infoROM is corrupted. However, diagnostics of dcgm is all pass.

Open likueimo opened this issue 4 years ago • 15 comments

description of the problem

  • nvidia-smi report that infoROM is corrupted
    image

  • However, diagnostics of dcgm is all pass image

environment information

  • Bare Metal Server : QuantaGrid D52G-4U
  • GPU SKU(s) : Tesla V100-SXM2-32GB
  • OS : CentOS 7.8
  • DRIVER : 450.80.02
  • GPU power settings (nvidia-smi -q -d POWER) : nv_power.log
  • CPU(s) : Intel(R) Xeon(R) Gold 6154 CPU
  • RAM : 768 GB
  • Topology (nvidia-smi topo -m) : nv_topo.log
  • The output of nvidia-smi -q : nv_q.log
  • Full output of dcgmi -v : dcgm_v.log

likueimo avatar Mar 19 '21 08:03 likueimo

OS is Ubuntu 18.04.6 LTS

jvschw avatar Oct 14 '21 16:10 jvschw