tseyde
When loading the reacher-easy task, I received an error about the observation dimensionality (6 vs. 7). The original DeepMind paper states that the observation dimensionality is 7, which is also set...
Building mujoco-py fails. It initially failed due to missing 'swig' and 'patchelf', which I resolved manually. It now fails on 'glfw', even though calling 'import glfw' within the (rlkit) conda environment works. `File...
Hi Acme team, I think JAX DQN might set the evaluation epsilon to the exploration epsilon if deterministic evaluation is requested (eps=0.0, [here](https://github.com/deepmind/acme/blob/860dbab686042573569b84223d8da6d43d09c304/acme/agents/jax/dqn/builder.py#L183)). Replacing this with `self._config.eval_epsilon is not None`...
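To illustrate the kind of behavior described above, here is a minimal sketch. It assumes the fallback is written as a truthiness check, which I have not confirmed against the linked line. `Config`, `eval_epsilon_truthy`, and `eval_epsilon_none_check` are hypothetical stand-ins for illustration, not Acme's actual classes or functions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Config:
    """Hypothetical stand-in for a DQN config; not Acme's DQNConfig."""
    epsilon: float = 0.05                  # exploration epsilon
    eval_epsilon: Optional[float] = None   # None means "not specified"


def eval_epsilon_truthy(config: Config) -> float:
    # Assumed buggy pattern: 0.0 is falsy, so a requested eval_epsilon
    # of 0.0 silently falls back to the exploration epsilon.
    return config.eval_epsilon or config.epsilon


def eval_epsilon_none_check(config: Config) -> float:
    # Proposed pattern: fall back only when eval_epsilon was not set.
    if config.eval_epsilon is not None:
        return config.eval_epsilon
    return config.epsilon


deterministic = Config(epsilon=0.05, eval_epsilon=0.0)
print(eval_epsilon_truthy(deterministic))      # exploration epsilon is used
print(eval_epsilon_none_check(deterministic))  # requested 0.0 is kept
```

Both variants agree when `eval_epsilon` is `None` or a nonzero value; they only differ for `eval_epsilon=0.0`, which is exactly the deterministic evaluation case.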