diffae

about the partition of training and validation sets

Open · string-ellipses opened this issue 1 year ago · 1 comment

I am confused about the division of training and validation sets, because I see "self.val_data = self.train_data" in experiment.py (line 184). This seems to mean the training set and validation set are not separated, which would prevent a fair evaluation of model performance.

Perhaps there are steps elsewhere in the code that separate the training and validation sets?

string-ellipses · Mar 01 '24 08:03

Validation in generative models is done a bit differently. For example, FID calculation needs about 50k images to give a reasonably stable result. If all 50k images were set aside for validation, only 20k images would be left for training, which is impractical.

konpatp · Mar 02 '24 07:03
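
The arithmetic behind the reply can be checked with a short sketch. This is only an illustration, not code from the repository: the function name `disjoint_split_sizes` is hypothetical, and the total of 70k images is an assumption inferred from the 50k and 20k figures quoted above.

```python
def disjoint_split_sizes(total_images: int, fid_images: int) -> tuple[int, int]:
    """Return (train_size, val_size) if the FID images were held out of training."""
    if fid_images > total_images:
        raise ValueError("cannot hold out more images than the dataset contains")
    return total_images - fid_images, fid_images


# Assumption: dataset size implied by the reply (50k held out + 20k remaining).
TOTAL_IMAGES = 70_000
# Number of images the reply says FID needs for a reasonably stable result.
FID_IMAGES = 50_000

train_size, val_size = disjoint_split_sizes(TOTAL_IMAGES, FID_IMAGES)
print(f"disjoint split: train={train_size}, val={val_size}")

# Reusing the training data for validation keeps every image for training.
print(f"shared data:    train={TOTAL_IMAGES}, val={TOTAL_IMAGES}")
```

Under these assumptions a disjoint split would leave fewer than a third of the images for training, which is the trade-off the reply describes.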