DarshanDeshpande

Results 1 issues of DarshanDeshpande

# Description The MADGRAD algorithm is a reliably high-performing non-convex optimization algorithm that matches or outperforms the SGD and Adam algorithms on a variety of image-to-image and language tasks. This...

optimizers
cla: yes
github