AlphaGo Zero Makes AlphaGo Obsolete

posted by martyb on Thursday October 19 2017, @02:39PM

from the Zeroing-in-on-AI dept.

Google DeepMind researchers have made their old AlphaGo program obsolete:

The old AlphaGo relied on a computationally intensive Monte Carlo tree search to play through Go scenarios. The nodes and branches created a much larger tree than AlphaGo practically needed to play. A combination of reinforcement learning and human-supervised learning was used to build "value" and "policy" neural networks that used the search tree to execute gameplay strategies. The software learned from 30 million moves played in human-on-human games, and benefited from various bodges and tricks to learn to win. For instance, it was trained from master-level human players, rather than picking it up from scratch.
AlphaGo Zero did start from scratch with no experts guiding it. And it is much more efficient: it only uses a single computer and four of Google's custom TPU1 chips to play matches, compared to AlphaGo's several machines and 48 TPUs. Since Zero didn't rely on human gameplay, and a smaller number of matches, its Monte Carlo tree search is smaller. The self-play algorithm also combined both the value and policy neural networks into one, and was trained on 64 GPUs and 19 CPUs over a few days by playing nearly five million games against itself. In comparison, AlphaGo needed months of training and used 1,920 CPUs and 280 GPUs to beat Lee Sedol.
Though self-play AlphaGo Zero even discovered for itself, without human intervention, classic moves in the theory of Go, such as fuseki opening tactics, and what's called life and death. More details can be found in Nature, or from the paper directly here. Stanford computer science academic Bharath Ramsundar has a summary of the more technical points, here.

Go is an abstract strategy board game for two players, in which the aim is to surround more territory than the opponent.

Previously: Google's New TPUs are Now Much Faster -- will be Made Available to Researchers
Google's AlphaGo Wins Again and Retires From Competition

Original Submission

SoylentNews

SoylentNews is people

Navigation

Sections

SoylentNews

Log In

Related Links

AlphaGo Zero Makes AlphaGo Obsolete

Related Stories

The Singularity is Here The Singularity is Here (Score: 2) by turgid on Thursday October 19 2017, @03:00PM (16 children)

Re:The Singularity is Here Re:The Singularity is Here (Score: 3, Funny) by takyon on Thursday October 19 2017, @03:01PM (2 children)

Re:The Singularity is Here (Score: 3, Insightful) by looorg on Thursday October 19 2017, @03:24PM

Re:The Singularity is Here (Score: 2) by Bot on Saturday October 21 2017, @11:01AM

The singularity remains off in the future The singularity remains off in the future (Score: 5, Insightful) by fyngyrz on Thursday October 19 2017, @03:45PM (6 children)

Re:The singularity remains off in the future Re:The singularity remains off in the future (Score: 0) by Anonymous Coward on Thursday October 19 2017, @03:56PM (3 children)

Re:The singularity remains off in the future Re:The singularity remains off in the future (Score: 2) by fyngyrz on Thursday October 19 2017, @09:15PM (2 children)

Re:The singularity remains off in the future Re:The singularity remains off in the future (Score: 2, Disagree) by maxwell demon on Thursday October 19 2017, @10:18PM (1 child)

Re:The singularity remains off in the future (Score: 2) by rylyeh on Thursday October 19 2017, @10:22PM

Re:The singularity remains off in the future (Score: 2) by turgid on Thursday October 19 2017, @04:28PM

Re:The singularity remains off in the future (Score: 2) by rylyeh on Thursday October 19 2017, @10:19PM

Re:The Singularity is Here Re:The Singularity is Here (Score: 2) by HiThere on Thursday October 19 2017, @07:09PM (5 children)

Re:The Singularity is Here Re:The Singularity is Here (Score: 2) by turgid on Thursday October 19 2017, @07:14PM (2 children)

Re:The Singularity is Here Re:The Singularity is Here (Score: 0) by Anonymous Coward on Thursday October 19 2017, @07:39PM (1 child)

Re:The Singularity is Here (Score: 2) by takyon on Thursday October 19 2017, @08:09PM

Re:The Singularity is Here Re:The Singularity is Here (Score: 2) by takyon on Thursday October 19 2017, @07:40PM (1 child)

Re:The Singularity is Here (Score: 2) by HiThere on Thursday October 19 2017, @11:38PM

The self-play algorithm The self-play algorithm (Score: 0) by Anonymous Coward on Thursday October 19 2017, @03:05PM (1 child)

LOL (Score: 0) by Anonymous Coward on Thursday October 19 2017, @03:18PM

New and improved New and improved (Score: 3, Interesting) by looorg on Thursday October 19 2017, @03:21PM (17 children)

Oh....... Fuck off, already! Oh....... Fuck off, already! (Score: 0, Insightful) by Anonymous Coward on Thursday October 19 2017, @03:30PM (2 children)

Comment Below Threshold (1 child)

Re:Oh....... Fuck off, already! Re:Oh....... Fuck off, already! (Score: -1, Troll) by Anonymous Coward on Thursday October 19 2017, @04:54PM (1 child)

Re:Oh....... Fuck off, already! (Score: 0) by Anonymous Coward on Thursday October 19 2017, @06:50PM

Re:New and improved Re:New and improved (Score: 2) by vux984 on Thursday October 19 2017, @03:46PM (3 children)

Re:New and improved Re:New and improved (Score: 2) by looorg on Thursday October 19 2017, @04:08PM (2 children)

Re:New and improved Re:New and improved (Score: 4, Insightful) by vux984 on Thursday October 19 2017, @06:31PM (1 child)

Re:New and improved (Score: 2) by rylyeh on Thursday October 19 2017, @10:34PM

Re:New and improved Re:New and improved (Score: 2, Touché) by Anonymous Coward on Thursday October 19 2017, @03:47PM (3 children)

Re:New and improved Re:New and improved (Score: 2) by looorg on Thursday October 19 2017, @04:11PM (2 children)

Re:New and improved Re:New and improved (Score: 2) by HiThere on Thursday October 19 2017, @07:13PM (1 child)

Re:New and improved (Score: 2) by takyon on Thursday October 19 2017, @08:13PM

Comment Below Threshold

Oh....... Fuck off, already! (Score: -1, Redundant) by Anonymous Coward on Thursday October 19 2017, @03:59PM

Oh....... Fuck off, already! Oh....... Fuck off, already! (Score: -1, Redundant) by Anonymous Coward on Thursday October 19 2017, @04:56PM (2 children)

Re:Oh....... Fuck off, already! Re:Oh....... Fuck off, already! (Score: 2) by DeathMonkey on Thursday October 19 2017, @05:31PM (1 child)

As you can tell... (Score: 0) by Anonymous Coward on Thursday October 19 2017, @05:45PM

Re:New and improved (Score: 2) by DannyB on Thursday October 19 2017, @06:59PM

Re:New and improved (Score: 2, Interesting) by Meepy on Friday October 20 2017, @02:26PM

ok (Score: 0) by Anonymous Coward on Thursday October 19 2017, @09:58PM

It's a shame... (Score: 0) by Anonymous Coward on Friday October 20 2017, @11:12AM