ShapeFormer icon indicating copy to clipboard operation
ShapeFormer copied to clipboard

Problem about Multi-gpu Training Vqdif

Open yixuanli1230 opened this issue 1 year ago • 1 comments

Hello, this paper seems amazing! I encountered an error while executing the following command in VS Code: python -m shapeformer.trainer --opts configs/vqdif/shapenet_res16.yaml --gpu 0 1 2 3 4.

However, I received the following traceback: Traceback (most recent call last): File "", line 1, in File "/home/xx/anaconda3/envs/shapeformer/lib/python3.7/multiprocessing/spawn.py", line 105, in spawn_main exitcode = _main(fd) File "/home/xx/anaconda3/envs/shapeformer/lib/python3.7/multiprocessing/spawn.py", line 115, in _main self = reduction.pickle.load(from_parent) _pickle.UnpicklingError: state is not a dictionary

This error seems to be related to inter-process communication when using multiple GPUs. The state information is not being properly serialized and transferred between processes. Do you have any insights or suggestions on how to resolve this issue? Thanks!

yixuanli1230 avatar Jun 19 '24 08:06 yixuanli1230

Hello @yixuanli1230 , I am facing the same issue, did you find a solution to this issue?

kajalsanklecha avatar Oct 07 '24 09:10 kajalsanklecha