random-network-distillation
intrinsic reward and extrinsic reward
Hi, thanks for your great work. I wonder how large the intrinsic reward is for a single step when the agent encounters a novel state.
Also, when I train on the 'venture' environment, the intrinsic reward seems normal (the loss suddenly increases at times), but the extrinsic reward is always zero. Is there any solution?