softlearning
softlearning copied to clipboard
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Bumps [lxml](https://github.com/lxml/lxml) from 4.6.2 to 4.9.1. Changelog Sourced from lxml's changelog. 4.9.1 (2022-07-01) Bugs fixed A crash was resolved when using iterwalk() (or canonicalize()) after parsing certain incorrect input. Note...
Bumps [numpy](https://github.com/numpy/numpy) from 1.19.5 to 1.22.0. Release notes Sourced from numpy's releases. v1.22.0 NumPy 1.22.0 Release Notes NumPy 1.22.0 is a big release featuring the work of 153 contributors spread...
Hi and thank you for such a genius algorithm. I wonder how by using mu of gaussian policy in sac in enviroments like lunar lander is it guranteed to converge...
Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 2.4.1 to 2.7.2. Release notes Sourced from tensorflow's releases. TensorFlow 2.7.2 Release 2.7.2 This releases introduces several vulnerability fixes: Fixes a code injection in saved_model_cli (CVE-2022-29216) Fixes...
Bumps [pillow](https://github.com/python-pillow/Pillow) from 7.2.0 to 9.0.1. Release notes Sourced from pillow's releases. 9.0.1 https://pillow.readthedocs.io/en/stable/releasenotes/9.0.1.html Changes In show_file, use os.remove to remove temporary images. CVE-2022-24303 #6010 [@radarhere, @hugovk] Restrict builtins within...
https://github.com/rail-berkeley/softlearning/blob/46f14436f62465a02b99f431bbcf57a7fa0fd09d/softlearning/algorithms/sac.py#L42 Are you planning to implement this? What would be a good value for a MultiDiscrete([3 3 2 3]) action space? Depending on how I calculate I get -4, -11...
Bumps [protobuf](https://github.com/protocolbuffers/protobuf) from 3.14.0 to 3.15.0. Release notes Sourced from protobuf's releases. Protocol Buffers v3.15.0 Protocol Compiler Optional fields for proto3 are enabled by default, and no longer require the...
Bumps [ipython](https://github.com/ipython/ipython) from 7.19.0 to 7.31.1. Commits e321e76 release 7.31.1 67ca2b3 Merge pull request from GHSA-pq7m-3gw7-gq5x 2794330 back to dev be343e7 release 7.31.0 0fcf2c4 Merge pull request #13428 from meeseeksmachine/auto-backport-of-pr-13427-on-7.x...
Bumps [pip](https://github.com/pypa/pip) from 20.3.3 to 21.1. Changelog Sourced from pip's changelog. 21.1 (2021-04-24) Process Start installation scheme migration from distutils to sysconfig. A warning is implemented to detect differences between...
Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.26.3 to 1.26.5. Release notes Sourced from urllib3's releases. 1.26.5 :warning: IMPORTANT: urllib3 v2.0 will drop support for Python 2: Read more in the v2.0 Roadmap Fixed...