Weilong Hu

Results 2 issues of Weilong Hu

``` def ctr_eval(sess, model, data, batch_size): start = 0 auc_list = [] f1_list = [] while start + batch_size

I cannot get increasing rewards on REINFORCE and DDPG algorithms. Is this normal? Can you provide the final results of the three algorithms? Thank you!