YZH0216
YZH0216
Same error, I found that the model cannot correctely call the GPU in the docker container, thus no model weight `.pkl` file is generated. I am still working on this...
I think I have right docker file, the codes are listed below. `FROM pytorch/pytorch:2.2.1-cuda12.1-cudnn8-runtime # For GPU support, please choose the proper tag from https://hub.docker.com/r/pytorch/pytorch/tags RUN apt-get clean && apt-get...
Besides, it seems the docker container can correctly detect the gpu device, the log detail are listed below. 2024-10-21 20:20:18.348 | INFO | rdagent.utils.env:_gpu_kwargs:269 - GPU Devices are available.