Dennis96
Same here. Did you solve it?
@seanchen022 I had the same problem as you a few days ago. It was solved when I switched to a newer gpu-manager image/code. Maybe branch master, commit f0669de, works. @paragor by the...
I had the same problem too, so I created this file (/var/lib/kubelet/device-plugins/kubelet_internal_checkpoint) before gpu-manager starts; it works now.
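For anyone who wants to script that workaround, here is a rough sketch. It assumes an empty file is enough, since the comment does not say what the file should contain; the helper name `ensure_checkpoint` is made up for illustration. Run it on the node (as root) before gpu-manager starts.

```python
from pathlib import Path

# Path quoted in the comment above.
CHECKPOINT = Path("/var/lib/kubelet/device-plugins/kubelet_internal_checkpoint")


def ensure_checkpoint(path: Path = CHECKPOINT) -> bool:
    """Create the checkpoint file if it is missing.

    Returns True if the file was created, False if it already existed.
    ASSUMPTION: an empty file is sufficient; the original comment does
    not state the expected contents.
    """
    if path.exists():
        return False
    # Make sure the device-plugins directory exists before touching the file.
    path.parent.mkdir(parents=True, exist_ok=True)
    path.touch()
    return True


if __name__ == "__main__":
    created = ensure_checkpoint()
    print("created" if created else "already present", CHECKPOINT)
```

An existing file is left untouched, so the script is safe to run on every boot.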
Same here, bro. Have you solved it since?
@mYmNeo @hzliangbin please help
In my case, if a running process exceeds the limits, it returns CUDA Out of Memory, but vcuda-core won't.
Same here. I tried versions 0.4.2 and 0.5.0; both end up restarting periodically.
Same here. The A100 80G was not detected by gpu-manager, but the A100 SXM 40G works fine.
How is it now? Were you able to solve it?
I fixed it: 1. Before callPreStartContainerIfNeeded, you should allocate your extended resource. For some reason, pre-allocating there did not work. 2. The cgroup fs path on the edge node is also different,...