cuda execution failed with error 2
Could anyone tell me how to solve this problem?
Environment: python 3.6, torch 1.0, cuda 10.0, cudnn 7.1.4

Traceback (most recent call last):
  File "./pytorch/train.py", line 306, in train
    ret_dict = net_parallel(example_torch)
  File "/home/xxx/.conda/envs/dwzpy/lib/python3.6/site-packages/torch/nn/modules/module.py", line 489, in __call__
    result = self.forward(*input, **kwargs)
  File "/home/xxx/second.pytorch-master/second/pytorch/models/voxelnet.py", line 363, in forward
    preds_dict = self.network_forward(voxels, num_points, coors, batch_size_dev)
  File "/home/xxx/second.pytorch-master/second/pytorch/models/voxelnet.py", line 332, in network_forward
    voxel_features, coors, batch_size)
  File "/home/xxx/.conda/envs/dwzpy/lib/python3.6/site-packages/torch/nn/modules/module.py", line 489, in __call__
    result = self.forward(*input, **kwargs)
  File "/home/xxx/second.pytorch-master/second/pytorch/models/middle.py", line 203, in forward
    ret = self.middle_conv(ret)
  File "/home/xxx/.conda/envs/dwzpy/lib/python3.6/site-packages/torch/nn/modules/module.py", line 489, in __call__
    result = self.forward(*input, **kwargs)
  File "/home/xxx/.conda/envs/dwzpy/lib/python3.6/site-packages/spconv/modules.py", line 130, in forward
    input = module(input)
  File "/home/xxx/.conda/envs/dwzpy/lib/python3.6/site-packages/torch/nn/modules/module.py", line 489, in __call__
    result = self.forward(*input, **kwargs)
  File "/home/xxx/.conda/envs/dwzpy/lib/python3.6/site-packages/spconv/conv.py", line 170, in forward
    grid=input.grid)
  File "/home/xxx/.conda/envs/dwzpy/lib/python3.6/site-packages/spconv/ops.py", line 91, in get_indice_pairs
    stride, padding, dilation, out_padding, int(subm), int(transpose))
RuntimeError: /home/xxx/second.pytorch-master/spconv/src/spconv/indice.cu 125 cuda execution failed with error 2
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "./pytorch/train.py", line 663, in
Did you manage to solve this?
Make sure you have enough free memory on your GPU. I was facing this issue because my GPU was fully occupied. It started working when I cleared some memory.
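For anyone who wants to check this before launching training: CUDA runtime error code 2 is cudaErrorMemoryAllocation, which fits the out-of-memory explanation above. Here is a rough sketch that reads per-GPU memory usage from nvidia-smi. The helper names (parse_gpu_memory, query_gpu_memory) are made up for illustration, and it assumes nvidia-smi is on your PATH.

```python
import subprocess


def parse_gpu_memory(csv_text):
    """Parse 'used, total' rows in MiB (one row per GPU) into a list of dicts."""
    gpus = []
    for index, line in enumerate(csv_text.strip().splitlines()):
        used, total = (int(field.strip()) for field in line.split(","))
        gpus.append({
            "gpu": index,
            "used_mib": used,
            "total_mib": total,
            "free_mib": total - used,
        })
    return gpus


def query_gpu_memory():
    """Ask nvidia-smi for used/total memory per GPU (assumes it is installed)."""
    output = subprocess.check_output(
        [
            "nvidia-smi",
            "--query-gpu=memory.used,memory.total",
            "--format=csv,noheader,nounits",
        ],
        universal_newlines=True,
    )
    return parse_gpu_memory(output)
```

Call query_gpu_memory() just before starting train.py and look at free_mib for the device you plan to use. If another process is already holding most of the memory, that may well be the cause.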
I have the same issue. My model needs at most 16 GB, and my GPU has 32 GB of memory. Any idea how to solve this?
I ran into the same issue. Has anyone solved it?
Update: I solved this problem with spconv 1.2.1
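If you want to confirm which spconv version your environment actually picks up, something like the following might help. It is only a sketch: installed_at_least and version_tuple are hypothetical helpers, it assumes the package is installed under the distribution name "spconv", and it uses importlib.metadata, which needs Python 3.8 or newer (so not the Python 3.6 environment from the original post).

```python
import re
from importlib import metadata  # standard library from Python 3.8


def version_tuple(version):
    """Turn a version string such as '1.2.1' into a comparable tuple (1, 2, 1)."""
    return tuple(int(number) for number in re.findall(r"\d+", version)[:3])


def installed_at_least(package, minimum):
    """Return True/False for the version check, or None if the package is absent."""
    try:
        installed = metadata.version(package)
    except metadata.PackageNotFoundError:
        return None
    return version_tuple(installed) >= version_tuple(minimum)
```

For example, installed_at_least("spconv", "1.2.1") returns None when spconv is not installed at all, which is also worth ruling out if you built it from source in a different environment.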