Giyeong Oh
Giyeong Oh
--use_dadaptation_optimizer args added. with this option, dadaptation is avalilable. while using dadaptation, it recommend 1-digit float numbers( 0.1
Hi, how about [D-adaptation](https://github.com/facebookresearch/dadaptation)? This is a kind of algorithm that end-user doesn't need to set specific learning rate. In short, D-adaptation use boundedness to find proper learning rate. So,...
Same as title. ~I'm in investigating.~ With following modification, scripts can save optimizer state **only for small dataset**, _IDK why it does not work big dataset._ Error messages will be...