Prabhat Kumar

Results 8 comments of Prabhat Kumar

Same issue on PyTorch 1.8.

Same issue here.

Hi, were you able to train the same using a 3090 GPU? Thanks.

Error encountered when replacing the A6000 with an RTX 3090: `RuntimeError: CUDA error: unspecified launch failure`, followed by `CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might...`

Similar to the issue [here](https://github.com/pytorch/pytorch/issues/27837), but it needs reopening and urgent attention.

A much simplified, runnable version of the code is in this [repo](https://github.com/kentaroy47/pytorch-mgpu-cifar10). We run the command `CUDA_LAUNCH_BLOCKING=1 python train_cifar10.py --net res101 --bs 256` and the traceback shows: `File "/home/avlabs_blue/.virtualenvs/ultralytics/lib/python3.8/site-packages/torch/_tensor.py", line 399, in backward` `torch.autograd.backward(self, gradient, retain_graph,...`
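For anyone reproducing this: `CUDA_LAUNCH_BLOCKING=1` makes CUDA kernel launches synchronous, so the Python stack trace points at the call that actually failed rather than a later one. Below is a minimal sketch of launching the same command from Python with that variable set. The helper names `blocking_launch_env` and `training_command` are hypothetical; only the script name and flags come from the command quoted above.

```python
import os


def blocking_launch_env(base_env=None):
    """Return a copy of the environment with synchronous CUDA launches enabled.

    Hypothetical helper: it only sets CUDA_LAUNCH_BLOCKING=1 on a copy,
    leaving the caller's mapping (or os.environ) untouched.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["CUDA_LAUNCH_BLOCKING"] = "1"
    return env


def training_command(net="res101", batch_size=256):
    """Build the argument list for the command quoted in the comment above."""
    return ["python", "train_cifar10.py", "--net", net, "--bs", str(batch_size)]


# Usage (not executed here: it needs the linked repo and a CUDA GPU):
#   import subprocess
#   subprocess.run(training_command(), env=blocking_launch_env(), check=True)
```

The variable has to be in the environment before the process initializes CUDA, which is why the sketch passes it to the child process instead of setting it partway through a running script.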

Hi @zwang233, I tried building from source as well as using the Docker image from [here](https://catalog.ngc.nvidia.com/orgs/nvidia/containers/pytorch), but I still get the same error.