TracKit icon indicating copy to clipboard operation
TracKit copied to clipboard

Some problems about how to train network based on OCEAN.

Open nianlun4425 opened this issue 4 years ago • 2 comments

Hello, thanks for sharing the great work. I'd like to ask some questions about how to training network if I add small sub-networks to an existing model (e. g. OCEAN). Can you give me some advice on learning rate setting (e.g., if warm up)? Is it a good choice to freeze the original model and only unfix the added subnetwork at the beginning of the training? And do I need to train both original part and added subnetwork together after then? Besides, how do you quickly verify that if one idea is valid? I’d appreciate it if you share any experience or lectures!

nianlun4425 avatar Feb 06 '21 05:02 nianlun4425

Hello, thanks for sharing the great work. I'd like to ask some questions about how to training network if I add small sub-networks to an existing model (e. g. OCEAN). Can you give me some advice on learning rate setting (e.g., if warm up)? Is it a good choice to freeze the original model and only unfix the added subnetwork at the beginning of the training? And do I need to train both original part and added subnetwork together after then? Besides, how do you quickly verify that if one idea is valid? I’d appreciate it if you share any experience or lectures!

  1. For generality, you should train all modules end to end. If you can not (e.g. not convergence),try to fix the original modules. However, this would limit the model capacity.
  2. First, try to use the same learning rate as the baseline model. Then, try its 0.1x and 10x version.

JudasDie avatar Jun 18 '21 03:06 JudasDie

pls refer to our most recent work https://github.com/researchmm/Stark and https://github.com/researchmm/LightTrack

penghouwen avatar Aug 06 '21 03:08 penghouwen