Implicit-Q-Learning
Bad results on AntMaze environment
It works well on the MuJoCo environments, but not on the AntMaze environment. It did not work even after I changed the parameters according to the paper (expectile=0.9, temperature=10). Can you help me, please?
I'm not sure what the problem is, but I will take a closer look once I have time. It could be caused by the missing cosine schedule for the actor learning rate, which I haven't added yet.
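For anyone who wants to try this in the meantime, here is a rough sketch of what a cosine schedule for the actor learning rate could look like. The function name `cosine_lr` and the values `base_lr=3e-4` and `total_steps=1_000_000` are placeholders chosen for illustration, not taken from this repository, so adjust them to your own training setup.

```python
import math


def cosine_lr(step: int, total_steps: int,
              base_lr: float = 3e-4, min_lr: float = 0.0) -> float:
    """Cosine-annealed learning rate.

    Starts at base_lr (step 0) and decays to min_lr (step == total_steps).
    base_lr and min_lr are illustrative defaults, not values from this repo.
    """
    # Clamp so that steps outside [0, total_steps] do not re-increase the rate.
    step = min(max(step, 0), total_steps)
    cosine = 0.5 * (1.0 + math.cos(math.pi * step / total_steps))
    return min_lr + (base_lr - min_lr) * cosine


if __name__ == "__main__":
    total_steps = 1_000_000  # hypothetical number of actor updates
    for step in (0, 250_000, 500_000, 750_000, 1_000_000):
        print(step, cosine_lr(step, total_steps))
```

In a training loop you would compute `cosine_lr(step, total_steps)` before each actor update and write it into the actor optimizer's learning rate, leaving the critic and value optimizers on a constant rate. If the code is PyTorch-based (an assumption on my side), `torch.optim.lr_scheduler.CosineAnnealingLR` gives the same decay shape.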