Rishabh Agarwal comments

Results 51 comments of


                                            Rishabh Agarwal

Asterix/1 dataset broken?

Hmm .. I'm also not sure what could be causing the issue but we have used `Asterix/1` for some of our recent ICLR/NeurIPS papers and it does seem to replicate...

Reading atari files directly.

Yes, the dataset is composed of (s, a, r, s') tuples like the way you indicated (you can visualize the data in a colab/jupyter notebook). For a Pytorch version of...

So, the evaluation is done every iteration in `dopamine` [here](https://github.com/google/dopamine/blob/master/dopamine/discrete_domains/run_experiment.py#L450). Also, if you launched the code in the repository, the evaluation results should be logged in your tensorboard logs as...

How to evaluate agents?

@Altriaex was this issue resolved?

Can a customized env be added to the current framework?

@psc might answer more details .. but Dopamine supports openAI gym environments too and there is data logging code in the repo (look in the baselines directory) that can be...

Raw results

Yes, I can release the raw results and I would try to do so by the end of this week. Which results do you specifically need?

Raw results

I think you meant the CQL paper? Actually, I can send you these results directly over email now (as I have them stored as zipped panda dataframes) -- can you...

Raw results

@GoingMyWay Yes, I'll post the raw results on github by next month. In the meantime, you can send me an email and I can send you some of those results...

Getting 7 as action for a game with 3 actions

> Any ideas what could be the cause of this? My hunch is that in each of the checkpoint files, there are `stack size` (typically 4) elements near the end...

How to train offline agent on the huge dataset (50 Million) ?

That's correct, it's not possible to load the entire dataset into memory. So, the way the dataset is stored is in 50 files of size 1M each corresponding to replay...