Judan

Posts: 6576

Location: Cambridge, UK

Liked others: 405

Was liked: 3619

Rank: UK 4 dan

KGS: Uberdude 4d

OGS: Uberdude 7d



- After the Lee Sedol match they developed an "anti-AlphaGo" that was designed to pick moves AlphaGo didn't explore (but presumably not crap ones!) to fill in the gaps and avoid over-fitting. [It no longer makes the mistake against Lee's divine move, I don't know if due to this approach]. This adversarial learning is hot in computer vision atm.

- I asked about Master playing a more unconventional opening than AlphaGo, was this simply the evolution of its style or could they tweak parameters. He said there is a parameter they call 'heat' [I've seen another term for this elsewhere] which is how strictly or not it has to pick what it thinks is the best move and they changed this so it might pick the fifth best move if it's only a little worse and as moves in the opening are all pretty similar it did new things. [So is the small shimari very inferior?].

- Master only ran on 1 GPU!

- No comment on does an AlphaGo trained from scratch beat human-game trained AlphaGo, announcements soon. This evening I went to a talk in Cambridge by Demis. It was mostly the standard DeepMind story from beginnings through Atari to AlphaGo (the meat of the presentation with an intro to Go) and future plans which is probably familiar to followers of AlphaGo here (a lot of slides in common with the video above, but about 1 hour long). It was recorded (and relayed live to an overspill room, the theatre of 300 was packed) so check the host Cambridge Society for the Application of Research website for that later: http://www.csar.org.uk/ . There were however a few new titbits I found particularly interesting:- After the Lee Sedol match they developed an "anti-AlphaGo" that was designed to pick moves AlphaGo didn't explore (but presumably not crap ones!) to fill in the gaps and avoid over-fitting. [It no longer makes the mistake against Lee's divine move, I don't know if due to this approach]. This adversarial learning is hot in computer vision atm.- I asked about Master playing a more unconventional opening than AlphaGo, was this simply the evolution of its style or could they tweak parameters. He said there is a parameter they call 'heat' [I've seen another term for this elsewhere] which is how strictly or not it has to pick what it thinks is the best move and they changed this so it might pick the fifth best move if it's only a little worse and as moves in the opening are all pretty similar it did new things. [So is the small shimari very inferior?].- Master only ran on 1 GPU!- No comment on does an AlphaGo trained from scratch beat human-game trained AlphaGo, announcements soon.



Last edited by Uberdude on Mon Mar 20, 2017 4:34 pm, edited 1 time in total.

This post by Uberdude was liked by 6 people: Bonobo, jptavan, Kirby, maceta, Simba, Waylon