Videh Raj Nema comments

Repositories
Issues
Comments

Results 3 comments of


                                            Videh Raj Nema

Sorry,I can't find the training code!Is there any code that I can train my own model?

Yes, please. I look for the same.

Guessing the seed hyperparameter

Thanks, @alexis-jacq. The variance due to the Monte-Carlo rollouts is very high and I think using better advantage estimators can make the algorithm more robust. Unfortunately, the LOLA-DiCE objective, by...

Code with Actual Human Trainer

> Thank you very much for releasing the code. It looks like the current code only supports scripted teachers. Is there any plan to also release the part to support...