from the how-about-a-nice-game-of-global-thermonuclear-war? dept.
Move over AlphaGo: AlphaZero taught itself to play three different games
Google's DeepMind—the group that brought you the champion game-playing AIs AlphaGo and AlphaGo Zero—is back with a new, improved, and more-generalized version. Dubbed AlphaZero, this program taught itself to play three different board games (chess, Go, and shogi, a Japanese form of chess) in just three days, with no human intervention.
A paper describing the achievement was just published in Science. "Starting from totally random play, AlphaZero gradually learns what good play looks like and forms its own evaluations about the game," said Demis Hassabis, CEO and co-founder of DeepMind. "In that sense, it is free from the constraints of the way humans think about the game."
[...] As [chess grandmaster Garry] Kasparov points out in an accompanying editorial in Science, these days your average smartphone chess-playing app is far more powerful than Deep Blue. So AI researchers turned their attention in recent years to creating programs that can master the game of Go, a hugely popular board game in East Asia that dates back more than 2,500 years. It's a surprisingly complicated game, much more difficult than chess, despite only involving two players with a fairly simple set of ground rules. That makes it an ideal testing ground for AI.
AlphaZero is a direct descendant of DeepMind's AlphaGo, which made headlines worldwide in 2016 by defeating Lee Sedol, the reigning (human) world champion in Go. Not content to rest on its laurels, AlphaGo got a major upgrade last year, becoming capable of teaching itself winning strategies with no need for human intervention. By playing itself over and over again, AlphaGo Zero (AGZ) trained itself to play Go from scratch in just three days and soundly defeated the original AlphaGo 100 games to 0. The only input it received was the basic rules of the game.
[...] AGZ was designed specifically to play Go. AlphaZero generalizes this reinforcement-learning approach to three different games: Go, chess, and shogi, a Japanese version of chess. According to an accompanying perspective penned by Deep Blue team member Murray Campbell, this latest version combines deep reinforcement learning (many layers of neural networks) with a general-purpose Monte Carlo tree search method.
"AlphaZero learned to play each of the three board games very quickly by applying a large amount of processing power, 5,000 tensor processing units (TPUs), equivalent to a very large supercomputer," Campbell wrote.
[...] DOI: Science, 2018. 10.1126/science.aar6404 (About DOIs).
Related Stories
DeepMind's AI agents conquer human pros at Starcraft II
AI agents developed by Google's DeepMind subsidiary have beaten human pros at Starcraft II — a first in the world of artificial intelligence. In a series of matches streamed on YouTube and Twitch, AI players beat the humans 10 games in a row. In the final match, pro player Grzegorz "MaNa" Komincz was able to snatch a single victory for humanity.
[...] Beating humans at video games might seem like a sideshow in AI development, but it's a significant research challenge. Games like Starcraft II are harder for computers to play than board games like chess or Go. In video games, AI agents can't watch the movement of every piece to calculate their next move, and they have to react in real time.
These factors didn't seem like much of an impediment to DeepMind's AI system, dubbed AlphaStar. It first beat pro player Dario "TLO" Wünsch before moving on to take on MaNa. The games were originally played in December last year at DeepMind's London HQ, but a final match against MaNa was streamed live today, providing humans with their single victory.
Professional Starcraft commentators described AlphaStar's play as "phenomenal" and "superhuman." In Starcraft II, players start on different sides of the same map before building up a base, training an army, and invading the enemy's territory. AlphaStar was particularly good at what's called "micro," short for micromanagement, referring to the ability to control troops quickly and decisively on the battlefield.
[...] Experts have already begun to dissect the games and argue over whether AlphaStar had any unfair advantages. The AI agent was hobbled in some ways. For example, it was restricted from performing more clicks per minute than a human. But unlike human players, it was able to view the whole map at once, rather than navigating it manually.
Previously: Google DeepMind to Take on Starcraft II
Google's AI Declares Galactic War on Starcraft
Related: DeepMind's AI Agents Exceed Human-Level Gameplay in Quake III
(Score: 3, Interesting) by shrewdsheep on Tuesday December 11 2018, @12:23PM
If you are interested in some commented chess games of AlphaZero, you can find them here: https://www.youtube.com/channel/UCMBATpFb--uLNAODOVWvCTA [youtube.com]
(Score: 0) by Anonymous Coward on Tuesday December 11 2018, @12:26PM (10 children)
One step closer to general AI.
(Score: 2) by sgleysti on Tuesday December 11 2018, @02:39PM (4 children)
This is an impressive achievement. That said, I'd love to see a program that can play Myst.
(Score: 2) by takyon on Tuesday December 11 2018, @02:41PM (1 child)
I want to see the same system playing Go, Myst, Thief, and Starcraft 2 at the same time, while analyzing astronomy images to look for exoplanets.
[SIG] 10/28/2017: Soylent Upgrade v14 [soylentnews.org]
(Score: 0) by Anonymous Coward on Tuesday December 11 2018, @06:43PM
https://www.youtube.com/watch?v=fzuYEStsQxc [youtube.com]
This algorithm solves a bunch of Atari games, and Mario, and a 3D maze.
(Score: 2) by DannyB on Tuesday December 11 2018, @03:36PM (1 child)
It is indeed impressive. And I don't mean to downplay it.
But like the magician's trick, once you know how the trick works, it quickly becomes normalized. Look at the rapid normalization of other technology miracles:
* [things on recent SN poll]
* automobiles
* x-rays
* radio, tv (which must have seemed magical)
* moon landings
* cheap computers
* ubiquitous computers
* cell phones
* the intarweb tubes
* smart phones
* practical home automation
* excellent speech synthesis
* useful speech recognition
* computer vision
* deep fakes
* self driving cars
* Reddit
AlphaGo and AlphaZero will quickly become the norm. The magician's trick that we understand. I would also point out that hardware to directly support AI applications is now the norm. I would bet sooner than we think, AI "coprocessors" will become a thing in mainstream PCs.
To transfer files: right-click on file, pick Copy. Unplug mouse, plug mouse into other computer. Right-click, paste.
(Score: 3, Informative) by takyon on Tuesday December 11 2018, @05:27PM
It could be a while. There's a lot going on, the AI coprocessors are still in their infancy (deep learning tensor/machine learning accelerators are available, but not something like IBM TrueNorth), and there isn't a lot of focus on PCs (laptops and desktops).
Laptop APUs should include such hardware, but they don't, or at least they didn't until we started seeing Snapdragon ARM processors coming to laptops. The hyped Snapdragon 8cx [soylentnews.org] for laptops supposedly includes the same "integrated AI engine" [laptopmag.com] found in previous Snapdragon chips. More details here [qualcomm.com].
For desktop users, many Nvidia GPUs are now rated for a certain amount of low-precision tensor performance. While you could argue that this isn't a discrete "AI coprocessor", it should compare favorably with Google's TPUs. The TPUs might have higher tensor performance and lower power consumption, but the difference between the two lines shouldn't be an order of magnitude or something.
Arguably, smartphones benefit the most from AI hardware, since they are good at interacting with the world (especially using the camera [engadget.com]). But you could still see a laptop or desktop user doing something like running Mycroft locally, or training deep fakes, etc.
In the long run, we want coprocessors that don't just accelerate 8-bit operations, but use an entirely different neuromorphic, brain-inspired architecture, such as IBM TrueNorth [wikipedia.org]. If a new technology [soylentnews.org] can massively extend Moore's law by allowing 3D integrated circuits, then we could see dramatic improvements in neuromorphic chips. Go figure, a brain-inspired chip that simulates how neurons and synapses work would be improved by being 3D (layered), kind of like how our brains aren't flat discs.
We might see some movement on that soon. DARPA has been working on monolithic 3D ICs. They believe that a 90 nanometer process 3D chip could greatly outperform (35-75x) a "7nm" process chip [darpa.mil] at machine learning tasks, and that a "7nm" 3D chip would leave it in the dust (323-645x). And this is for a rudimentary 3D design. They could give an update on this in 2019, so keep your eyes open.
If all of the "AI coprocessors" in use, from consumer smartphones and discrete GPUs all the way up to Google's specialized chips in data centers, were to experience a 100x performance increase, then you would see some real magic.
(Score: 4, Insightful) by DannyB on Tuesday December 11 2018, @03:27PM (2 children)
When it can teach itself to get modded Funny, then I'll be impressed.
(Score: 0) by Anonymous Coward on Tuesday December 11 2018, @04:27PM (1 child)
sudo mod funny
(Score: 2) by takyon on Tuesday December 11 2018, @04:53PM
Must... resist...
(Score: 0) by Anonymous Coward on Tuesday December 11 2018, @11:46PM (1 child)
Shouldn't it be beating everyone in the stock market by now?
(Score: 2) by takyon on Wednesday December 12 2018, @12:02AM
It is. Mr. AI found out that high frequency trading is the way to make money.
(Score: 3, Interesting) by takyon on Tuesday December 11 2018, @01:03PM (1 child)
Finding cancer... the game!
Publishing science papers... the game!
...the game!
(Score: 0) by Anonymous Coward on Tuesday December 11 2018, @08:27PM
I lost the game.