KPConv-PyTorch

Share Training Log

Open aoligei178 opened this issue 3 years ago • 2 comments

Hi HuguesTHOMAS! Thank you for your awesome work! I followed readme.md and successfully tested your pretrained model, but when I trained the model myself, I found that the validation results varied a lot during training. For example, at epoch 400 I got a mean IoU of 68.6% on the S3DIS validation data, but just one epoch later it dropped to 60.8%. Is this OK? Could anyone share their training log? Looking forward to your reply.

aoligei178 avatar May 09 '22 06:05 aoligei178

It could be normal, as the validation during training is performed on a small subset of the whole validation cloud. Depending on which part gets picked, the score can be quite different. You should not rely on the printed messages to evaluate your training performance.

Instead, you can use the plot_convergence.py script, which plots the training metrics (time, loss) and then the validation metrics. The validation metrics include the averaged score on the subsets picked at each epoch (the numbers you see printed), but also an overall score computed on the whole point cloud once in a while (only for epochs where a checkpoint is saved). This last score is the one you should refer to for real performance.
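To illustrate why the per-epoch numbers jump around, here is a minimal pure-Python sketch. It is not the KPConv-PyTorch code: the function names, the 80% accuracy of the fake classifier, and the subset sizes are all assumptions made for the example. It computes mIoU from a confusion matrix, once on a full synthetic "cloud" and once on several small random subsets of it.

```python
import random


def confusion_matrix(labels, preds, num_classes):
    """Count matrix with rows = ground-truth class, columns = predicted class."""
    conf = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(labels, preds):
        conf[t][p] += 1
    return conf


def mean_iou(conf):
    """Mean IoU over the classes that appear in the labels or the predictions."""
    n = len(conf)
    ious = []
    for c in range(n):
        tp = conf[c][c]
        row = sum(conf[c])                       # points labelled c
        col = sum(conf[r][c] for r in range(n))  # points predicted as c
        union = row + col - tp
        if union > 0:
            ious.append(tp / union)
    return sum(ious) / len(ious)


def simulate(num_points=20000, num_classes=13, subset_size=500,
             num_subsets=5, seed=0):
    """Compare mIoU on the full synthetic cloud with mIoU on small subsets."""
    rng = random.Random(seed)
    labels = [rng.randrange(num_classes) for _ in range(num_points)]
    # Hypothetical classifier: correct 80% of the time, otherwise a random class.
    preds = [t if rng.random() < 0.8 else rng.randrange(num_classes)
             for t in labels]
    full_score = mean_iou(confusion_matrix(labels, preds, num_classes))
    subset_scores = []
    for _ in range(num_subsets):
        idx = rng.sample(range(num_points), subset_size)
        sub_conf = confusion_matrix([labels[i] for i in idx],
                                    [preds[i] for i in idx], num_classes)
        subset_scores.append(mean_iou(sub_conf))
    return full_score, subset_scores


if __name__ == "__main__":
    full_score, subset_scores = simulate()
    print("full cloud mIoU:", round(full_score, 3))
    print("subset mIoU:", [round(s, 3) for s in subset_scores])
```

The subsets here are drawn uniformly at random, so the spread is probably smaller than in real training, where each subset is a spatial region of the cloud and may contain only a few of the classes.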

Finally, when your training is finished, you can use the test_models.py script to get a final evaluation of the score, with voting etc.

HuguesTHOMAS avatar May 09 '22 14:05 HuguesTHOMAS

Thank you!!! I got 67.0% mIOU, I think it is close enough!!

aoligei178 avatar May 10 '22 03:05 aoligei178