Any example using AlphaZero with Ray's tuner and trainer?

Hi all,

I would like to use single-player AlphaZero, but it does not work as conveniently as other algorithms like PPO. I wonder if anyone has a script or a notebook using AlphZero with Ray’s tuner or trainer?


Hello @arturn @sven1977 ,
Sorry I wonder do you have any codes for this?

There is no example python script, but a tuned example. This tuned example obviously translated to a python script.

If you play around with it and procude a cool learning example using tune, please share it within the scope of a PR though.

1 Like