vcuda-controller icon indicating copy to clipboard operation
vcuda-controller copied to clipboard

get "invalid device context" or "segmentation fault" problems in aarch64 machine

Open DennisYoung96 opened this issue 2 years ago • 4 comments

thanks for ur excellent codes and open source spirit.

Enviroment Info gpu-manager version: built on master vcuda version: built on master nvidia driver: 470.199.02 or 470.42.01 or 460.106.00 cpu : aarch64 gpu: Tesla T4

details these days i get "invalid device context" or "segmentation fault" problems in my aarch64 machine. when every app init ,it reports 5 functions not found image when i use CUDA samples image when use pytorch demo image when change to 460.x driver. it reports segmentation fault

but, it will works if i give whole gpu rates to one pod(set vcore=100) the last, it does well in x86 machine (same 470.x driver and T4 gpu card) at the same time.

so, are there any diffrences between aarch64 driver and x86 drivers? can any gentleman give advice on this? need ur help

DennisYoung96 avatar Aug 18 '23 06:08 DennisYoung96

@mYmNeo @hzliangbin please help

DennisYoung96 avatar Aug 18 '23 07:08 DennisYoung96

I meet the same problem, can anyone help

hiahia121 avatar Dec 26 '23 02:12 hiahia121