Why remove n-steps function?

Open RozenAstrayChen opened this issue 7 years ago • 0 comments

Hello, I have question. If I use n-step to updated episode-buffer which will work better?

I thinks off-policy will work better than on-policy. sorry I haven't understand meta-learning, I just want ask your opinion

Jan 27 '19 14:01 RozenAstrayChen