stable-baselines3
stable-baselines3 copied to clipboard
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
hello I try to solve an AI problem related to a graph using RL and stable baselines. but it seems like the RL model cannot understand and communicate with the...
### 🚀 Feature PyTorch recently released support for GPU acceleration using the Apple Silicon chips. This should be supported in stable-baselines3 by the `"mps"` device (I believe). ### Minimal Example...
## Description - [x] I created a `HParam` data class in the same way as `Figure`, `Image` ones. It can take any number of distinct hyperparameters and metrics as input....
I have been spending quite some time reading the codes here and I have been learning quite a lot so far. I got a small question when I backtrack some...
I have created a gym wrapper around a snake game I made. I am using stable baseline 3 to train a snake to play the game. I have verified that...
### 🚀 Feature It would be nice if the used hyperparameters would be included in the tensorboard log for better comparability. ### Motivation Manually comparing the hyperparameters in a separate...
**Important Note: We do not do technical support, nor consulting** and don't answer personal questions per email. Please post your question on the [RL Discord](https://discord.com/invite/xhfNqQv), [Reddit](https://www.reddit.com/r/reinforcementlearning/) or [Stack Overflow](https://stackoverflow.com/) in...
## Description Make `HerReplayBuffer` compatible with Multiprocessing. ## Motivation and Context - [ ] I have raised an issue to propose this change ([required](https://github.com/DLR-RM/stable-baselines3/blob/master/CONTRIBUTING.md) for new features and bug fixes)...
**Important Note: We do not do technical support, nor consulting** and don't answer personal questions per email. Please post your question on the [RL Discord](https://discord.com/invite/xhfNqQv), [Reddit](https://www.reddit.com/r/reinforcementlearning/) or [Stack Overflow](https://stackoverflow.com/) in...
**See comment https://github.com/DLR-RM/stable-baselines3/pull/780#issuecomment-1164774401 to use this PR** ## Description Gym 0.24 has been released and with it breaking changes. The objective of this PR is to fix all the failing...