
Training dataset size

Open FraPochetti opened this issue 5 years ago • 2 comments

Hi! Thanks a lot for the great work here. Would you mind sharing how big the dataset you used to train your model was? Also, which strategy did you use to create the 440x440 training images: simply resizing the original pictures and alpha mattes, or random cropping to preserve the aspect ratio?
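To make the two options concrete, here is a minimal NumPy sketch of what I mean. The function names (`resize_pair`, `random_crop_pair`) and the nearest-neighbour resize are my own assumptions for illustration, not something taken from this repository; the only detail from the project is the 440x440 target size. The point in both cases is that the image and its matte go through the same geometric transform.

```python
import numpy as np

TARGET = 440  # training resolution mentioned above


def resize_pair(image, matte, size=TARGET):
    """Option 1: squash both arrays to size x size (nearest neighbour).

    The aspect ratio of the source is NOT preserved.
    """
    h, w = matte.shape
    rows = np.arange(size) * h // size  # source row for each output row
    cols = np.arange(size) * w // size  # source column for each output column
    return image[rows][:, cols], matte[rows][:, cols]


def random_crop_pair(image, matte, size=TARGET, rng=None):
    """Option 2: cut the same random size x size window out of both arrays.

    Pixels keep their original proportions, but part of the picture is lost.
    """
    rng = rng or np.random.default_rng()
    h, w = matte.shape
    if h < size or w < size:
        raise ValueError("source is smaller than the crop size")
    top = int(rng.integers(0, h - size + 1))
    left = int(rng.integers(0, w - size + 1))
    window = (slice(top, top + size), slice(left, left + size))
    return image[window], matte[window]


# Dummy 600x800 RGB image with a matching single-channel matte.
image = np.zeros((600, 800, 3), dtype=np.uint8)
matte = np.zeros((600, 800), dtype=np.uint8)
resized_image, resized_matte = resize_pair(image, matte)
cropped_image, cropped_matte = random_crop_pair(image, matte)
```
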

Thanks a lot again and have a great one!

FraPochetti avatar Apr 26 '21 16:04 FraPochetti

Hey @FraPochetti,

Thanks for your interest. The augmentation pipeline is defined in the configuration files (dataset.yaml). The published model was trained on roughly 30k samples from a synthesized, fairly noisy dataset. The model still picks up details despite that noise, but you will most likely not reach visually perfect quality with this approach.

dbpprt avatar Apr 26 '21 18:04 dbpprt

Hi @dennisbappert, in your project, are the training masks binary (0/255) or alpha masks (0~255)? Thanks and best regards.
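For clarity, this is the distinction I am asking about, as a small NumPy sketch. The helpers `mask_kind` and `binarize` and the threshold of 128 are hypothetical and only illustrate the two mask types; they are not taken from this project.

```python
import numpy as np


def mask_kind(mask):
    """Return 'binary' if the mask only contains 0 and 255, else 'alpha'."""
    values = np.unique(mask)
    return "binary" if np.isin(values, (0, 255)).all() else "alpha"


def binarize(mask, threshold=128):
    """Turn an alpha mask into a hard 0/255 mask (threshold is an assumption)."""
    return np.where(mask >= threshold, 255, 0).astype(np.uint8)


hard = np.array([[0, 255], [255, 0]], dtype=np.uint8)   # only 0 and 255
soft = np.array([[0, 128], [255, 64]], dtype=np.uint8)  # has in-between values
print(mask_kind(hard), mask_kind(soft), mask_kind(binarize(soft)))
```
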

xiongzhu666 avatar Jun 27 '21 11:06 xiongzhu666