pytorch-optimizer
pytorch-optimizer copied to clipboard
Adan optimiser: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Looks promising https://arxiv.org/abs/2208.06677
Hi, @iiSeymour
Here are the results for Adan.

It seems that many optimizers can reach the optimal point, but the practical performance varies greatly, such as Adam and Adabound. We may prefer to focus on the practical results.