mastering the game of go with deep
play

Mastering the game of Go with deep neural networks and tree search - PowerPoint PPT Presentation

Mastering the game of Go with deep neural networks and tree search Nature, Jan, 2016 Roadmap What this paper is about? Deep Learning Search problem How to explore a huge tree (graph) AlphaGo Video


  1. Mastering the game of Go with deep neural networks and tree search Nature, Jan, 2016

  2. Roadmap What this paper is about? • Deep Learning • Search problem • How to explore a huge tree (graph)

  3. AlphaGo Video https://www.youtube.com/watch?v=53YLZBSS0cc https://www.youtube.com/watch?v=g-dKXOlsf98

  4. rank Al AlphaGo vs vs*European*Champion*(Fan*Hui 27Da Dan) * October$5$– 9,$2015 <Official$match> I Time+limit:+1+hour I AlphaGo Wins (5:0)

  5. Al AlphaGo vs vs*Wo World*Champion*(Lee*Se Sedol 97Da Dan) March$9$– 15,$2016 <Official$match> I Time+limit:+2+hours Venue :+ Seoul ,+Four+Seasons+Hotel

  6. Lee*Sedol wiki Photo+source: Maeil+Economics 2013/04

  7. Lee Sedol

  8. =$multiple$machines European$champion

  9. The Game

  10. Go Elo Ranking http://www.goratings.org/history/

  11. Lee Sedol VS Ke Jie

  12. How about Other Games?

  13. Tic Tac Toe

  14. Chess

  15. Chess (1996)

  16. Deep Blue (1996)

  17. AlphaGo is the Skynet?

  18. Go Game

  19. Simple Rules

  20. High Complexity

  21. High Complexity

  22. Different Games

  23. Search Problem (the search space)

  24. Tic Tac Toe

  25. Tic Tac Toe

  26. The “Tree” in Tic Tac Toe

  27. The “Tree” of Chess

  28. The “Tree” of Go Game

  29. Search Problem (how to search)

  30. MiniMax in Tic Tac Toe

  31. Adversarial"Search"–"MiniMax"" 1" 1" 1" 0" 0" J1" J1" 0" J1" 0" J1" J1" 5"

  32. Adversarial"Search"–"MiniMax"" J1" 1" J1" 0" 1" 1" 0" J1" J1" J1" 1" 1" 0" 1" 0" J1" J1" 0" J1" 0" J1" J1" 6"

  33. What is the problem? 1. Generate the Search Tree 2. use MinMax Search

  34. The Size of the Tree Tic Tac Toe: b = 9, d =9 Chess: b = 35, d =80 Go: b = 250, d =150 b : number of legal move per position d : its depth (game length)

  35. One Grain of Rice https://www.youtube.com/watch?v=byk3pA1GPgU

  36. The “Space” of GO Game

  37. How about other Games? • Flappy bird? • Angry Bird? Tic Tac Toe: • Starcraft? b = 9, d =9 • learning a language Chess: b = 35, d =80 • Write a paper Go: • Get a MS/PhD degree b = 250, d =150 • Finding a job • Life

  38. How to solve?

  39. Chess (1996)

  40. Monte Carlo

  41. Las Vegas

  42. Monte"Carlo"Tree"Search" Tree"search" ……." ……." ……." ……." Monte"Carlo"search" ……." ……." ……." ……." ……." 7"

  43. Monte"Carlo"Tree"Search" • Tree"Search"+"Monte"Carlo"Method"" – SelecIon" white"wins"/"total" 3/5" – Expansion" – SimulaIon" 2/3" 1/2" – BackJPropagaIon" 1/1" 1/2" 1/1" 0/1" 1/1" 0/1" 8"

Recommend


More recommend