Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

After AlphaZero, DeepMind went on to develop MuZero (https://en.wikipedia.org/wiki/MuZero, paper: https://arxiv.org/abs/1911.08265), which is able to learn to play games without being provided an explicit model of their rules. "When evaluated on Go, chess and shogi, without any knowledge of the game rules, MuZero matched the superhuman performance of the AlphaZero algorithm that was supplied with the game rules."


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: