How AlphaGo Zero works – Google DeepMind
In this episode I dive into the technical details of the AlphaGo Zero paper by Google DeepMind.
This AI system uses Reinforcement Learning to beat the world’s Go champion using only self-play, a remarkable display of clever engineering on the path to stronger AI systems.
DeepMind Blogpost: https://deepmind.com/blog/alphago-zero-learning-scratch/
AlphaGo Zero paper: https://storage.googleapis.com/deepmind-media/alphago/AlphaGoNaturePaper.pdf
If you want to support this channel, here is my patreon link:
https://patreon.com/ArxivInsights — You are amazing!! 😉
If you have questions you would like to discuss with me personally, you can book a 1-on-1 video call through Pensight: https://pensight.com/x/xander-steenbrugge