Maxtoq
Results
2
issues of
Maxtoq
**Describe the bug** When using PER with QMIX, an issue arises with the idx_range returned by the insert function of RecPolicyBuffer: > line 267, in insert for idx in range(idx_range[0],...
Hi, I find something odd and I'd like to know if there's something I'm missing or if it's normal. In the buffers, you define the action_log_probs to have "act_shape" as...