Hieu Pham comments

Results: 30 comments of Hieu Pham

dot_product = s_loss_old - s_loss_new but s_loss_new - s_loss_old?

**About your derivations.** I do not see anything wrong with @dgedon's derivation.

> `f(x) = f(a) + f'(a)(x-a)`.
> So there is `f'(a)` instead of `f'(x)`. Then with the same...
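The first-order expansion quoted in this comment can be checked numerically. The sketch below is only an illustration under stated assumptions: the scalar function `f`, the point `a`, and the step size `lr` are hypothetical choices that do not come from the thread or the repository. It shows that when the derivative is evaluated at `a`, the loss before a small gradient step minus the loss after it is approximately `lr * f'(a)**2`, which is positive.

```python
import math

# Hypothetical smooth scalar "loss", chosen only for illustration.
def f(x):
    return math.sin(x) + 0.5 * x * x

def f_prime(x):
    return math.cos(x) + x

def taylor_first_order(a, x):
    """First-order expansion around a: the derivative is taken at a, not at x."""
    return f(a) + f_prime(a) * (x - a)

a = 1.0                    # assumed starting point
lr = 1e-3                  # assumed small step size
x = a - lr * f_prime(a)    # one gradient-descent step away from a

# Loss before the step minus loss after the step.
old_minus_new = f(a) - f(x)

# What the first-order expansion predicts for that difference:
# f(a) - [f(a) + f'(a) * (x - a)] = lr * f'(a)**2
predicted = lr * f_prime(a) ** 2

print(f"old - new : {old_minus_new:.8f}")
print(f"predicted : {predicted:.8f}")
```

With a small step the two printed values agree closely, and the gap between them shrinks quadratically as `lr` decreases.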

Reproducibility of the results from the paper (RNN)

Hi @nsmetanin, Thank you for your interest in our work. Given the description of your experiments, we suspect that you ran the script `ptb_search.sh` rather than `ptb_final.sh`. The script...

Reproducibility of the results from the paper (RNN)

@nsmetanin Thanks! Please let us know how it goes 😃

Reproducibility of the results from the paper (RNN)

Hi Hanxiao @quark0, That's definitely a bug. Thank you for spotting it. We have pushed [a commit](https://github.com/melodyguan/enas/blob/master/src/ptb/ptb_enas_child.py#L480) that fixes it. We have tried rerunning the code and the output looks...

Reproducibility of the results from the paper (RNN)

**Update on results:** @quark0 We finished rerunning the script with the fix and indeed got the test perplexity of `56.6`.

Reproducibility of the results from the paper (RNN)

Yes, that's what we got too. We think the reason is that we computed the validation perplexity using a `batch_size` of `35`. If we use `batch_size = 1` for validation,...

RNN results not reproducible

Hi, Thanks for your interest. Commit [2734eb2](https://github.com/melodyguan/enas/commit/2734eb2657847f090e1bc5c51c2b9cbf0be51887) fixed a bug in the evaluation process. After the fix, we had to further tune the model's hyper-parameters to reach good performance....

Expected performance

ValueError: Incompatible shapes between op input and calculated input gradient.