After AlphaZero, DeepMind went on to develop MuZero (https://en.wikipedia.org/wiki/MuZero, paper: https://arxiv.org/abs/1911.08265), which learns to play games without being given an explicit model of their rules. From the paper's abstract: "When evaluated on Go, chess and shogi, without any knowledge of the game rules, MuZero matched the superhuman performance of the AlphaZero algorithm that was supplied with the game rules."