Yashar78
Yashar78
If it runs out of memory why the first epoch runs successfully?
This is the command: #!/bin/bash export CUDA_VISIBLE_DEVICES=1 python -m training.main \ --name test_run \ --zeroshot-frequency 0 \ --save-frequency 1 \ --report-to tensorboard \ --train-data="path to train data csv file" \...
I have 128GB of RAM on this machine and when runs in total 15% of it is taken by all processes. Also, I realized that at the end of the...