Question about torch.distributed.elastic.multiprocessing.errors
I have the same problem
It could happen due to the mismatch of your cuda version and the version of cuda that your installed pytorch supports. Please run nvcc --version and python -c "import torch; print(torch.__version__); print(torch.version.cuda)" to check if they match. If they already match, you may update the LD_LIBRARY_PATH to include cuda libs by export LD_LIBRARY_PATH=/usr/local/cuda/lib64:$LD_LIBRARY_PATH. Otherwise, please reinstall pytorch via conda install pytorch torchvision torchaudio pytorch-cuda=YOUR_CUDA_VERSION_HERE -c pytorch -c nvidia.
This issue is stale because it has been open for 7 days with no activity.
This issue was closed because it has been inactive for 7 days since being marked as stale.