skimo
skimo copied to clipboard
Skill-based Model-based Reinforcement Learning (CoRL 2022)
Hi, When training the high-level policy in skimo_agent.py, z_next_pred is initialized as the first observation(line 616) and it is not updated at all after that. Assuming from the comment and...
Dear authors, thank you for such a smooth-running code. I would greatly appreciate if you provided the SPiRL model pre-trained on CALVIN. If unavailable, please share the SPiRL hyperparameters and...
Hello. I'm trying to use MPI to speedup the pre-training, but the program crashes when syncing grads. Running without mpi (or with 1 process) is fine, but when trying with...
I unzipped the Calvin dataset, iterated through the dataset and was surprised to find that many of the 'obs' sequences were of length 500. This is strange because Calvin has...