luomuqinghan
Thanks for sharing. In the RL setup, for ease of answering, is the reward calculated by the RL model itself rather than by another model? Why not feed the action into a separate pretrained model...
Thank you for your code! But in the original HRED, given k context utterances and one response, HRED generates k utterances. It seems that you only generate the final response during training...
Would you mind sharing more personality data?