examples icon indicating copy to clipboard operation
examples copied to clipboard

the learning rate of word_language_model

Open zhangyingbit opened this issue 5 years ago • 3 comments

Hi, I have a question about the learning rate in the example "word_language_model", the init lr = 20, which seems very large, can you tell me why lr is set to equal 20? Thanks a lot! If you have some advices about improving the performance, please let me know and thanks

zhangyingbit avatar Apr 22 '20 09:04 zhangyingbit

i also have the same question. Look like nobody answers that

kienld3049 avatar Sep 12 '22 10:09 kienld3049

me too, when i used lr=20, then the loss was very strange and crazy......

AlbertMa123 avatar Mar 28 '24 09:03 AlbertMa123

I'm happy to accept contributions for a more reasonable learning rate

msaroufim avatar Apr 02 '24 21:04 msaroufim