DDQ icon indicating copy to clipboard operation
DDQ copied to clipboard

what is pre-training dqn model and world model ? initialize Q(s; a; θQ) and M(s; a; θM) via pre-training on human conversational data?

Open netrookiecn opened this issue 6 years ago • 1 comments

Hi I dont understand the pretraining of the world model because I can not find the pretraining process in your code, can you explain me what is that? and where is the pretraining dqn model and world model in your repo? thanks

netrookiecn avatar Dec 17 '19 07:12 netrookiecn

I have the same question. lol

Dr-Corgi avatar Apr 08 '20 03:04 Dr-Corgi