coltra-rl
A modular implementation of PPO, and soon hopefully other algorithms.
Running into issues replicating your work; your help would be greatly appreciated
No point in switching formats back and forth; just put everything into a tensor.
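A minimal sketch of what "everything as a tensor" could mean, assuming observations arrive as NumPy arrays; the helper name `to_tensor` and its behaviour are illustrative, not coltra's actual API:

```python
import numpy as np
import torch


def to_tensor(obs: np.ndarray, device: str = "cpu") -> torch.Tensor:
    """Convert a NumPy observation to a float32 tensor once, at the boundary.

    Hypothetical helper: keeping everything as tensors downstream avoids
    repeated numpy <-> torch round-trips inside the training loop.
    """
    return torch.as_tensor(obs, dtype=torch.float32, device=device)


# Example: a batch of observations collected from several agents
batch = np.stack([np.zeros(4), np.ones(4)])
obs_tensor = to_tensor(batch)  # shape (2, 4), stays a tensor from here on
```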
With MultiGymEnv, it's `agent&env=0`. I need to figure this out in general, and for Unity specifically, because their `?team=0` suffix is weird.
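One possible way to handle such agent IDs; the `name&env=idx` format is taken from the note above, and `parse_agent_id` is a hypothetical helper rather than part of coltra:

```python
def parse_agent_id(agent_id: str) -> tuple[str, int]:
    """Split an ID like 'agent&env=0' into the agent name and its env index.

    Hypothetical parser for the naming scheme mentioned above; Unity's
    '?team=0' suffix would need its own handling.
    """
    name, _, env_part = agent_id.partition("&env=")
    return name, int(env_part) if env_part else 0


print(parse_agent_id("agent&env=0"))  # ('agent', 0)
```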
Jax seems pretty dope, and the realm of PyTorch RL libraries is somewhat saturated
Right now, there is a vague correspondence between `Box(...)` and `Action(continuous=...)`, and between `Discrete(...)` and `Action(discrete=...)`. In principle, all of these can support Dict action/observation spaces, but I don't want...
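A sketch of what making that correspondence explicit could look like. The `Action` dataclass here is a simplified stand-in for coltra's container, and `sample_action` is a hypothetical helper; only the Box/Discrete cases from the note above are covered:

```python
from dataclasses import dataclass
from typing import Optional

import numpy as np
from gym.spaces import Box, Discrete, Space


@dataclass
class Action:
    """Simplified stand-in for coltra's Action container (illustrative only)."""
    continuous: Optional[np.ndarray] = None
    discrete: Optional[int] = None


def sample_action(space: Space) -> Action:
    """Map a Gym space sample onto the matching Action field.

    Box -> continuous, Discrete -> discrete; Dict spaces are not handled here.
    """
    if isinstance(space, Box):
        return Action(continuous=space.sample())
    if isinstance(space, Discrete):
        return Action(discrete=int(space.sample()))
    raise NotImplementedError(f"Unsupported space: {type(space).__name__}")
```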
Right now it's "input_size", "num_actions", and "discrete". The first two are inconsistent in style; need to make them more intuitive.
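A hypothetical renaming sketch, just to show one consistent style for the three keys; the new field names are illustrative and not a decided interface:

```python
from dataclasses import dataclass


@dataclass
class ModelConfig:
    """Hypothetical config: the three keys from the note above in one consistent style."""
    input_size: int    # was "input_size"
    action_size: int   # was "num_actions"; illustrative rename only
    discrete: bool     # unchanged
```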
Need to make sure that if an agent with a wrapper is saved on a GPU, it can be gracefully loaded on a CPU, and the other way around.
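A minimal sketch of the GPU-to-CPU direction using `torch.load`'s `map_location`; the function name and path are illustrative, and the CPU-to-GPU direction would just be a `.to(device)` after loading:

```python
import torch


def load_agent_state(path: str) -> dict:
    """Load a saved agent's state dict onto whatever device is available.

    map_location remaps GPU-saved tensors to CPU when no CUDA device exists;
    the helper and path are hypothetical, not coltra's saving API.
    """
    device = "cuda" if torch.cuda.is_available() else "cpu"
    return torch.load(path, map_location=device)
```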
Currently the pytest tests use some arbitrary network architectures, environments, etc. By using `pytest.mark.parametrize`, this can be expanded into much more reliable tests, particularly across different sizes and discreteness; see the sketch below.
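An illustrative parametrized test in that spirit; the placeholder linear "policy" and the chosen size grids are assumptions, not coltra's real test fixtures:

```python
import pytest
import torch


@pytest.mark.parametrize("input_size", [4, 16, 64])
@pytest.mark.parametrize("num_actions", [2, 5])
@pytest.mark.parametrize("discrete", [True, False])
def test_policy_output_shape(input_size, num_actions, discrete):
    """Run the same shape check over every size/discreteness combination."""
    # Placeholder model: a discrete head outputs logits, a continuous head means + stds
    out_size = num_actions if discrete else 2 * num_actions
    model = torch.nn.Linear(input_size, out_size)
    obs = torch.zeros(8, input_size)
    assert model(obs).shape == (8, out_size)
```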
Self-explanatory