XQL icon indicating copy to clipboard operation
XQL copied to clipboard

Bad performance in MuJoCo benchmarks

Open jity16 opened this issue 2 years ago • 3 comments

Hello, I face some problems when running XQL on MuJoCo benchmarks (online).

  1. I have tested XQL on MuJoCo benchmarks, and none of the runs would return good performance, for example, Ant-v2: -100
  2. Only set the loss function as MSE, it will work well in MuJoCo benchmarks.

I don't really know if there are any sensitive parameters, but it seems that if the MSE loss (aka SAC backbone) mentioned in point 2 can work, the parameters might be reasonable

jity16 avatar Apr 12 '23 07:04 jity16

Hi!

Thanks for your interest in our work! We have not tried XQL on the Mujoco benchmark, only on DM Control. The reward structure of these environments is very different. If you are interested in getting XQL to work on these environments, I would suggest tuning the value of beta, which our method is extremely sensitive to. I suggest starting with a large beta = 10, and progressively lowering it until good performance is reached.

jhejna avatar Apr 14 '23 18:04 jhejna

Thank you very much! We will try to tune beta later on.

jity16 avatar Apr 14 '23 18:04 jity16

Hi @jity16, I'm currently also testing XQL on MuJoCo. Have you tuned XQL to work on MuJoCo, and could you share some parameter settings you've found effective? Thank you!

tianyyiii avatar Apr 08 '25 16:04 tianyyiii