diffusion_models
EMA model usage
In the notebooks I see the EMA model being updated through def update, which changes the shadow model, but def ema is never called. Should it be called once every N epochs, or only once at the end of training? As far as I can tell, def update does not affect the actual model, so no smoothing happens.
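To make the question concrete, here is a minimal sketch of the pattern as I understand it, using plain floats instead of tensors. The class itself and the names register, mu, and shadow are my assumptions for illustration, not code taken from the notebooks.

```python
class EMA:
    """Hypothetical helper mirroring the update/ema split described above."""

    def __init__(self, mu=0.999):
        self.mu = mu        # decay rate (assumed name)
        self.shadow = {}    # smoothed copy of the parameters (assumed name)

    def register(self, params):
        # Start the shadow copy from the current parameter values.
        self.shadow = dict(params)

    def update(self, params):
        # Changes only the shadow copy; the live parameters are untouched.
        for name, value in params.items():
            self.shadow[name] = self.mu * self.shadow[name] + (1.0 - self.mu) * value

    def ema(self, params):
        # Writes the smoothed values into the given parameters.
        for name in params:
            params[name] = self.shadow[name]


params = {"w": 0.0}
helper = EMA(mu=0.9)
helper.register(params)

for step in range(3):
    params["w"] += 1.0      # stand-in for an optimizer step
    helper.update(params)   # called after every step

smoothed_w = helper.shadow["w"]

# Smoothed weights only reach a model when ema is called on it.
eval_params = dict(params)
helper.ema(eval_params)
```

In this sketch update runs after every optimizer step and ema runs only when the smoothed weights are wanted, for example before sampling or at the end of training. Whether the notebooks intend the same thing is what I am asking.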