big_transfer
Linear warmup learning rate is 0 when step = 0
Hi there
In the function get_lr() in bit_hyperrule.py, the returned learning rate is 0.0 when step == 0, so the first training step would apply no update.
Is this a bug, or is it intentional?
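
To illustrate what I mean, here is a minimal sketch of a linear warmup, assuming it has the form base_lr * step / warmup_steps. The function names, the warmup length of 500 steps, and the shifted variant are my own assumptions for illustration and are not copied from bit_hyperrule.py.

```python
def linear_warmup_lr(step, warmup_steps=500, base_lr=0.003):
    """Hypothetical linear warmup: scales base_lr by step / warmup_steps."""
    if step < warmup_steps:
        return base_lr * step / warmup_steps
    return base_lr


def shifted_warmup_lr(step, warmup_steps=500, base_lr=0.003):
    """Hypothetical variant: uses (step + 1) so the first step is non-zero."""
    if step < warmup_steps:
        return base_lr * (step + 1) / warmup_steps
    return base_lr


if __name__ == "__main__":
    # With zero-based step counting, the plain warmup starts at exactly 0.0.
    print(linear_warmup_lr(0))   # 0.0
    # The shifted variant gives a small positive learning rate at step 0.
    print(shifted_warmup_lr(0))
```

If steps are counted from 0, the plain version spends one step with a learning rate of zero, while the shifted version does not. Whether that matters in practice probably depends on how the training loop counts steps.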