btaba
btaba
Hi @lebrice can you add gymnasium to the requirements, so that the builds pass? Thank you!
Ah ok, sorry I was misreading this. I think the right thing to do is what @lebrice is saying. Switch to gymnasium altogether and hash out differences in the API...
@giotherobot any updates? Will close the PR if there is no further activity
Acknowledged, this has been on our TODO for some time. I implemented orbax checkpointing in PPO https://github.com/google/brax/blob/d48b0b373a6478838eac325cadc6d8983837a968/brax/training/agents/ppo/train.py#L534 as a start. But a bigger refactor is needed to bubble up this...
8526f9a64ee02010615a57a026a7b6aad05cbda0 starts to address this, but we likely won't fully deprecate `brax.io.model` for some time
Hi @scott-yj-yang there aren't immediate plans to switch to `nnx` as `flax.linen` isn't being deprecated, but we'd be happy to review clean/minimal PRs. `nnx` does generally look like a cleaner...
Hi @nico-bohlinger thanks for the heads up on the missing feature. Would you like to implement it?
Hi @varunagrawal , nothing looks obviously wrong to me for the small code snippet you provided. What's different compared to the quadruped example? And which quadruped example are you using...
What example are you basing this on?
Hi @varunagrawal let us know when you have a reproducible example