SimMTM icon indicating copy to clipboard operation
SimMTM copied to clipboard

GPU out of memory error

Open ajati opened this issue 2 years ago • 2 comments

https://github.com/thuml/SimMTM/blob/main/SimMTM_Forecasting/scripts/pretrain/ECL_script/ECL.sh fails with GPU out of memory error in a A100 80GB.

RuntimeError: CUDA out of memory. Tried to allocate 13.26 GiB (GPU 0; 79.15 GiB total capacity; 60.37 GiB already allocated; 7.43 GiB free; 70.33 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

ajati avatar Jan 04 '24 10:01 ajati

Hello, I have also encountered this issue. Have you resolved it now?

linxi20 avatar Jun 16 '24 13:06 linxi20

I also have this problem, why?

Hardware: 3090

Script: bash ./scripts/pretrain/ECL_script/ECL.sh

torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 207.37 GiB (GPU 0; 23.70 GiB total capacity; 8.14 GiB already allocated; 13.37 GiB free; 9.09 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation.  See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

iDestro avatar Jul 02 '24 07:07 iDestro