Jack Vice
Results
13
comments of
Jack Vice
Running `python3 multiagent.py --env 'leaderfollower'`, the last set of actions and observation are below. The policy diverges and spits out [nan] after 3 hours. These are the last ten actions...
No I didn't. is it still a problem in new versions? I switched to Isaac gym.
I am getting the same `KeyError: 'rl_module'` with the following versions: python==3.11.9 ray==2.36.0 tensorflow==2.12.0 tensorflow-probability==0.19.0