Le Zhang comments

Results 7 comments of


                                            Le Zhang

EfficientZero high memory consumption / keeps increasing after replay buffer is full

in lines 240-241 of `core / reanalyze_worker.py`, try changing them to ``` trained_steps = ray.get(self.storage.get_counter.remote()) target_weights = None ``` and changing lines 252-253 to ``` if new_model_index > self.last_model_index: self.last_model_index...

EfficientZero high memory consumption / keeps increasing after replay buffer is full

btw, in train/mean_score of your posted plot, 100K in x-axis is not for Atari 100K, but for Atrai 10M (i.e., 10M interactions with the env)? is the understanding above right?

EfficientZero high memory consumption / keeps increasing after replay buffer is full

did not try the exp in the large scale as you discussed. but the change on codes relevant to `target_weights` makes the `train.sh` be runnable. and decreasing the `gpu_actor` really...

EfficientZero high memory consumption / keeps increasing after replay buffer is full

no. it is just because this would save RAM memory, so `train.sh` would run without breaking until the end.

SpaceInvaders(SAC_Discrete) : Error

+1 error also for SpaceInvaders + SAC_Discrete. So this now makes things harder to be verified for the results in the SAC_Discrete arXiv preprint paper. (Paper: [Soft Actor-Critic for Discrete...

Comparing between the different object (int == tuple)

> Thanks for your interest. I reviewed this part of the code carefully, it is about parent node update. These codes are used to implement the functionality where all the...

Code for computing the metrics

+1 Would appreciate if the code for computing DCS (esp. how the biophysical and statistical properties are implemented in the form of code when computing DCS) is given (rather than...