Ze

Results 6 comments of Ze

I'm encountering the same issue. Training was killed because of CPU OOM after ~7000 iterations of training. Our machine has 512 GB of memory. @acver1 Were you able to solve this issue?...

> > I'm encountering the same issue. Training was killed because of CPU OOM after ~7000 iterations of training. Our machine has 512 GB of memory. @acver1 Were you able to solve...

Thank you! I'll run some comparisons and update my implementations. I don't think there is much we can do to improve the efficiency, though. The conv operation is heavily...

Thanks for sharing! One quick question: how should I interpret the 'setting' in the second table? For example, what does '360x512 (370k)' mean here? Thank you!

> +1, I run into the same problem. The code usually hangs after one or two hundred iterations. Have you solved this? I met the same issue with multi-node training. Thanks!

> Thanks for your insight, but these all seem to be workarounds. Eventually I will need to train on long videos. Hello! What's your progress on solving this? I'm...