why n_tasks * alpha?

Open • mlresh opened this issue 5 years ago • 1 comment

Hi,

I noticed that in the experiments, `epo_train.py` scales `alpha` (the task weights) by the number of tasks, but the other methods, such as `pmtl_train.py` and `linscalar_train.py`, do not. `epo_search.py` in `toy_experiments` also uses a different multiplier in its update: `x = x - 10. * max(lp.mu_rl, 0.1) * step_size * d_nd`.
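
To make the question concrete, here is a minimal sketch of the two behaviours as I understand them. It is not code from the repository: `weighted_loss` and `toy_step_multiplier` are hypothetical helpers, `alpha` is assumed to sum to 1, and `mu_rl` is treated as an opaque non-negative scalar.

```python
def weighted_loss(losses, alpha, scale_by_n_tasks=False):
    """Weighted sum of per-task losses.

    With scale_by_n_tasks=True the weights are multiplied by the number of
    tasks (the scaling asked about); otherwise alpha is used as-is.
    """
    n_tasks = len(losses)
    factor = float(n_tasks) if scale_by_n_tasks else 1.0
    return sum(factor * a * l for a, l in zip(alpha, losses))


def toy_step_multiplier(mu_rl, step_size):
    """Scalar in front of d_nd in the quoted toy update.

    Mirrors `10. * max(lp.mu_rl, 0.1) * step_size`; because of the 0.1
    floor, the result is never smaller than step_size itself.
    """
    return 10.0 * max(mu_rl, 0.1) * step_size


# Illustrative values only (assumption: alpha lies on the simplex).
losses = [0.8, 0.2]
alpha = [0.5, 0.5]

plain = weighted_loss(losses, alpha)                          # weighted average
scaled = weighted_loss(losses, alpha, scale_by_n_tasks=True)  # n_tasks * average

floor_step = toy_step_multiplier(mu_rl=0.0, step_size=0.01)   # floor is active
larger_step = toy_step_multiplier(mu_rl=0.5, step_size=0.01)  # scales with mu_rl
```

With uniform weights, the scaled version equals the plain sum of the task losses, so its gradient is `n_tasks` times larger than the unscaled one at the same learning rate. In the toy update, the multiplier is at least `step_size` and grows with `mu_rl`. Whether either effect is the intended reason is exactly what I am unsure about.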

Is there any particular reason behind these choices?

Thanks.

Aug 24 '20 07:08 mlresh

I have the same question. Have you solved it yet?

May 01 '22 07:05 wjm-wjm