rl icon indicating copy to clipboard operation
rl copied to clipboard

[Feature Request] Muzero and MCTS implementations

Open Prakyathkantharaju opened this issue 2 years ago • 1 comments

Motivation

It would be great to have an MCTS and Alphazero implementation, including other model-based RL for benchmarking and comparison.

Solution

I can write a loss function of this policy.

Alternatives

There are limited RL libraries that have a base implementation of Muzero.

Additional context

None.

Checklist

  • [x] I have checked that there is no similar issue in the repo (required)

Prakyathkantharaju avatar Jan 29 '24 12:01 Prakyathkantharaju

Interestingly someone just dropped a suggestion to help us implement alpha zero https://github.com/pytorch/rl/discussions/1844 If you want to collaborate or follow the progress, i'd suggest to join our discord challenge here, I just created an MCTS channel!

vmoens avatar Jan 29 '24 13:01 vmoens