superbenchmark icon indicating copy to clipboard operation
superbenchmark copied to clipboard

why '/dev/nvidia-uvm' is a required file check for nvidia GPU?

Open LiweiPeng opened this issue 3 years ago • 2 comments

What's the issue, what's expected?: This is v0.6.0. In (superbench code gpu.py) [(https://github.com/microsoft/superbenchmark/blob/main/superbench/common/devices/gpu.py#L24), it checks whether a GPU is nvidia by checking both '/dev/nvidiactl' and '/dev/nvidia-uvm'. The question is: why does it require /dev/nvidia-uvm? I found some GPU type like 'Tesla K80' with cuda 11.4 doesn't have this.

How to reproduce it?: Logon to a machine with 'Tesla K80' + cuda 11.4.

LiweiPeng avatar Sep 09 '22 21:09 LiweiPeng

It's required when using gpus in docker (here's an example), and should be included as one step during nvidia driver installation

If you cannot find /dev/nvidia-uvm in your system, you can run following commands to create it accroding to above document:

/sbin/modprobe nvidia-uvm
D=`grep nvidia-uvm /proc/devices | awk '{print $1}'`
mknod -m 666 /dev/nvidia-uvm c $D 0

abuccts avatar Sep 10 '22 11:09 abuccts

Because K80 GPU is not supported by superbench, it is OK not to support it.

LiweiPeng avatar Sep 20 '22 14:09 LiweiPeng