gpu-manager icon indicating copy to clipboard operation
gpu-manager copied to clipboard

kubelet_internal_checkpoint not exist

Open xing0821 opened this issue 3 years ago • 1 comments

gpu-manager daemonset start with reading gpu info from kubelet_internal_checkpoint, if the file is not exist, gpu-manager with stop start and log with an error. in my cluster(kubernetes 1.23), kubelet_internal_checkpoint only exist if the node has allocated at least one pod with third part device, such as nvidia/gpu. and i read the logic of kubelet, kubelet_internal_checkpoint will be created after pod allocated. so how to use gpu-manager without nvidia-gpu-device-plugin and additional initialization?

xing0821 avatar Mar 21 '22 06:03 xing0821

This seems a bug, it'll be fixed ASAP.

mYmNeo avatar Mar 23 '22 12:03 mYmNeo