Magnus
Results
2
issues of
Magnus
https://github.com/rail-berkeley/softlearning/blob/46f14436f62465a02b99f431bbcf57a7fa0fd09d/softlearning/algorithms/sac.py#L42 Are you planning to implement this? What would be a good value for a MultiDiscrete([3 3 2 3]) action space? Depending on how I calculate I get -4, -11...
enhancement
Opening a new issue because the [old issue](https://github.com/rail-berkeley/softlearning/issues/163) was closed but didn't really explain the differences in the softlearning implementation and the other issue was not reopened on request(yet, 1wk)....