Dennis96

Results 13 comments of Dennis96

@seanchen022 i had the same problem like u days ago. when i change the images/codes of gpu-manager newer, it solved. maybe the branch master, commit: f0669de works. @paragor by the...

i had save problem too. so i made this file(/var/lib/kubelet/device-plugins/kubelet_internal_checkpoint) before gpu-manager start; it works now

same with u bro, so do u solve it these days?

in my case, if process running exceeds the limits, is will return CUDA Out of Memory, but vcuda-core won't

save with me. i tried version 0.4.2 and 0.5.0, both end up restarting periodically

save with u. A100 80G did not be detected by gpu-manager, but A100 SXM 40G works good

i fixed it 1. before callPreStartContainerIfNeeded, u should allocate ur extend-resource. For some reason pre-allocate in there did not work. 2. the cgroup fs path in edgenode is also different,...