random-network-distillation icon indicating copy to clipboard operation
random-network-distillation copied to clipboard

intrinsic reward and extrinsic reward

Open rainbow979 opened this issue 5 years ago • 2 comments

rainbow979 avatar May 31 '20 17:05 rainbow979

Hi,Thanks for your great work. I wonder how large intrinsic reward is in a step when meeting a novel state.

rainbow979 avatar May 31 '20 17:05 rainbow979

And when I train then environment 'venture', the intrinsic reward seems normal(the loss would suddenly increase), but the extrinsic reward was always zero. Any solution?

rainbow979 avatar Jun 01 '20 06:06 rainbow979