YangZhaohui

Results 3 issues of YangZhaohui

Thank you for sharing this beautiful code! I use the default DARTS_V2 architecture, with approximately 3.3M parameters, to train on the CIFAR10 dataset. However, I found that it requires about 24h to...

Hi, in PyTorch BatchNorm2d, there is a track_running_stats parameter that records the number of iterations, and a momentum used to merge the previous mean and std with the current batch. I wonder this group...
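
For context on the question above, here is a minimal pure-Python sketch of how a momentum-style running statistic can be merged with each batch. It follows the convention documented for PyTorch batch norm layers, where the new running value is (1 - momentum) * running + momentum * batch_stat, and a cumulative average is used when momentum is None. The helper name update_running_stat and the sample batch means are hypothetical, chosen only for illustration; this is not the code of the repository the issue was filed against.

```python
def update_running_stat(running, batch_stat, num_batches_tracked, momentum=0.1):
    """Merge one batch statistic into a running statistic.

    Hypothetical helper that mirrors the documented BatchNorm convention:
    with a float momentum the update is an exponential moving average;
    with momentum=None it is a cumulative (simple) average over batches.
    """
    num_batches_tracked += 1  # count of batches seen so far
    if momentum is None:
        factor = 1.0 / num_batches_tracked
    else:
        factor = momentum
    running = (1.0 - factor) * running + factor * batch_stat
    return running, num_batches_tracked


# Running mean starts at zero, as it does in a freshly created BatchNorm layer.
running_mean, seen = 0.0, 0
for batch_mean in [1.0, 1.0, 1.0]:  # made-up per-batch means
    running_mean, seen = update_running_stat(running_mean, batch_mean, seen)

# Same data, but with momentum=None the result is the plain average of the batches.
cumulative_mean, seen_cumulative = 0.0, 0
for batch_mean in [1.0, 2.0, 3.0]:  # made-up per-batch means
    cumulative_mean, seen_cumulative = update_running_stat(
        cumulative_mean, batch_mean, seen_cumulative, momentum=None
    )
```

With the default momentum of 0.1, the running mean approaches the batch mean only gradually, which is why the number of tracked batches matters early in training.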

Thank you for sharing this so generously. In the author's application, EMA has been applied to all trainable parameters, but there are some statistics, such as the mean...