蒲源 comments

Results 60 comments of


                                            蒲源

Question about the effect of torch_amp

Hi, One more question about runtime. In the EfficientZero paper A.5 evaluation, the paper states that "To train an Atari agent for 100k steps, it only needs 4 GPUs to...

Adoption for single player game

> Hello I'm trying to adapt the code for the single-player game like 2048 but it seems that I got RecursionError often and if I set sys.sys.setrecursionlimit(...). The program may...

Single Player Game

Hello, thank you to the contributors for their outstanding work on this repository. Regarding the issue you've raised, you might be interested in the project "[LightZero: A Unified Benchmark for...

Chess and other non-trivial games

Hello, thank you to the contributors for their outstanding work on this repository. Regarding the issue you've raised, you might be interested in the project "[LightZero: A Unified Benchmark for...

Only One Player: Can we use MuZero?

Hello, thank you to the contributors for their outstanding work on this repository. Regarding the issue you've raised, you might be interested in the project "[LightZero: A Unified Benchmark for...

Hello! Regarding the support on the Go environment, we have been working on this PR (https://github.com/opendilab/LightZero/pull/65/files) for a while. This PR includes the initial implementation of the regular self-play version...

how to well model a grid env when it changes frequently?

To effectively model this game environment and train an agent to maximize the value of gold nuggets obtained from a 10x10 grid, you can take the following steps: ### 1....

how to well model a grid env when it changes frequently?

Hello! According to your description, first please confirm: is the episode length fixed at 5 steps? If MuZero is performing poorly and there's no issue with the environment part you've...

how to well model a grid env when it changes frequently?

Hello, I recommend starting by adjusting MuZero rather than EfficientZero, as the latter adds complexity, particularly in predicting the value prefix, which may not be advantageous in your environment. To...

how to well model a grid env when it changes frequently?

Hello, the action space is fixed at 100, which is considerably smaller compared to the maximum action space of 19*19 in Go. Therefore, theoretically, MuZero should manage this scale effectively....