Zhuliang Yao

Results 5 comments of Zhuliang Yao

Hi @skaarthik, have you decided to update the [zip-dataset](https://bertonazuremlwestus2.blob.core.windows.net/public2/bert_data.tar.gz) or the data prep instruction? Besides, I wonder what if I did as @usuyama suggested? Will there be any performance influence/drop?...

@carpedm20 As far as I know, the E[Reward(m, omega)] should be calculated under the meaning of expectation, which means you are supposed to sample several models and average those rewards...

1. This equation is initially described at Page.5, and we put the proof at Appendix.C. 2. λ is a normalization scaler to keep the sum over Ω equals 1.

'fixed BN' means that only BN is frozen (use the params from imagenet-pretrained model).