Distilling-Object-Detectors icon indicating copy to clipboard operation
Distilling-Object-Detectors copied to clipboard

warm step

Open ShaniGam opened this issue 6 years ago • 3 comments

Can you please explain the intuition for using warm_step=200 for only 1 epoch? It doesn't seem like enough for meaningful training without distillation. What happens if I use the distillation loss from scratch?

ShaniGam avatar Dec 05 '19 12:12 ShaniGam

can you rephrase your question?

twangnh avatar Dec 18 '19 08:12 twangnh

The warm step is not mentioned in the paper. Does it improve the result?

ShaniGam avatar Dec 18 '19 08:12 ShaniGam

no, warm up is not related to distillation, it is used for stable training

twangnh avatar Dec 18 '19 11:12 twangnh