Learning Artificial Intelligence in Large-Scale Video Games A - PowerPoint PPT Presentation

Learning Artificial Intelligence in Large-Scale Video Games — A First Case Study with Hearthstone: Heroes of WarCraft — Master Thesis Submitted for the Degree of MSc in Computer Science & Engineering David Taralla Prof. Damien Ernst Author Supervisor Academic year 2014 – 2015

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Video Games, Then and Now ◮ Then, the problems to solve were representable easily → Example: Pac-Man • Fully observable maze • Limited number of agents • Small, well-defined action space ◮ Now, the problems feature numerous variables → Example: StarCraft • Vast, partially observable map • Complex state representation • Prohibitively large action space, difficult to represent 2 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Video Games, Then and Now Games continue to feature richer environments... 3 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Video Games, Then and Now Games continue to feature richer environments... ... but designing robust AIs becomes increasingly difficult! 3 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Video Games, Then and Now Games continue to feature richer environments... ... but designing robust AIs becomes increasingly difficult! ⇓ Making AI learn instead of being taught: a better solution? 3 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Objectives of this Thesis 1. Design & study of a theory for creating autonomous agents in the case of large-scale video games → Study applied to the game Hearthstone: Heroes of Warcraft 2. Develop a modular and extensible clone of the game Hearthstone: HoW → Makes us able to test the theory practically 4 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Problem Statement 1. State Vectors ◮ World vector w ∈ W contains all information available in a given state → Everything is not relevant ◮ If σ ( · ) is the projection operator such that ∀ w ∈ W , s = σ ( w ) is the relevant part of w for the targeted application, we define S := { σ ( w ) | w ∈ W} the set of all state vectors. 5 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Problem Statement 2. Action Vectors ◮ Available actions have unknown consequences ◮ Let A be the set of available actions in the game ◮ Let A s be the set of actions that can be taken in state s ∈ S 6 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Problem Statement 3. State Scoring Function ◮ There should exist a bounded function ρ : S → R having the following properties:  ρ ( s ) < 0 if, from s info, the player is considered as likely to lose,   ρ ( s ) > 0 if, from s info, the player is considered as likely to win,  ρ ( s ) = 0 otherwise.  ◮ Based on expert knowledge 7 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Problem Statement 4. Problem Formalization ◮ Games follow discrete-time dynamics: τ : S × A → S | ( s t , a ) �→ s t +1 for a ∈ A s t , t = 0 , 1 , ... ◮ Let R ρ be an objective function whose analytical expression depends on ρ : R ρ : S × A → R | ( s , a ) �→ R ρ ( s , a ) for a ∈ A s . 8 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Problem Statement 4. Problem Formalization ◮ R ρ ( s , a ) is considered uncomputable from state s → Difficulty to simulate side-trajectories in large-scale games ◮ Find an action selection policy h such that h : S → A | s �→ argmax R ρ ( s , a ) . a ∈A s 9 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Getting Intuition on Actions from State Scoring Differences ◮ Our analytical expression for R ρ : R ρ ( s , a ) := ρ ( τ ( s , a )) − ρ ( s ) . Report erratum – In Figure 3.2, the classifier is asked to predict the sign of R ρ , and not ρ . 10 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Nora: Design & Results 11 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Action Selection Process Report erratum – In Figure 4.5, the classifiers are asked to predict the sign of R ρ , and not ρ . 12 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Caveats ◮ Memory usage → Approx. 14GB is needed to keep the models in RAM → Fix: tree pruning and parameters tuning ◮ Play actions classifier underestimates the value of some actions → Random target selection is assumed after playing an action that needs a target → Fix: Two-step training 13 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Results Matchup Win rate Nora vs. Random 93% Nora vs. Scripted 10%... But compared to the random player performance... 14 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Results Matchup Win rate Nora vs. Random 93% Nora vs. Scripted 10% Random vs. Scripted < 1% ! ◮ Nora applies some strategy the random player does not ◮ Qualitatively, this translates into a board control behavior → Never target her allies with harmful actions, even though it is allowed → Accurate understanding of the Fireblast special power 15 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Conclusion Any questions? Thank you for your attention. 16 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Appendix – Why Extremely Randomized Trees? ◮ Ensemble methods can often surpass single classifiers → From a statistical, computational and representational point of view ◮ Decision trees are particularly suited for ensemble methods → Low computational cost of the standard tree growing algorithm → But careful about memory... ◮ Random trees suited for problems with many features → Each node can be built with a random subset of features ◮ Feature importances → Useful for designing the projection operator σ : W → S 17 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Appendix – Computation of the ExtraTrees Classifier Confidence ◮ It is the predicted positive class probability of the classifier ◮ Computed as the mean predicted positive class probability of the trees in the forest ◮ Predicted positive class probability of a sample s in a tree: # { s ′ ∈ leaf in which s falls | s ′ labelled positive } # { s ′ ∈ leaf in which s falls } 18 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Appendix – Basics of Hearthstone: Heroes of WarCraft ◮ Stylized combat game ◮ Cards are obtained by drawing from your deck → Your hand is hidden to your opponent Goal: Make the enemy player’s hero health go to zero. 19 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Appendix – Basics of Hearthstone: Heroes of WarCraft ◮ Cards are played using a resource: the Mana → Minions that join the battle → Spells ◮ Rules are objects in the game → Game based on creating new and breaking/modifying rules 20 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Appendix – Basics of Hearthstone: Heroes of WarCraft 21 / 23

Learning Artificial Intelligence in Large-Scale Video Games David Taralla University of Liège 2nd Master in Computer Science & Engineering Appendix – Basics of Hearthstone: Heroes of WarCraft Things Might Get Tricky...! 22 / 23

Learning Artificial Intelligence in Large-Scale Video Games A - PowerPoint PPT Presentation

Learning Artificial Intelligence in Large-Scale Video Games A First Case Study with Hearthstone: Heroes of WarCraft Master Thesis Submitted for the Degree of MSc in Computer Science & Engineering David Taralla Prof. Damien Ernst

Artificial Intelligence Artificial Intelligence Artificial Intelligence Study and design of

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

Artificial Intelligence Course Presentation Summary Artificial Intelligence Motivations

Artificial intelligence Artificial Intelligence is the science of PHILOSOPHY OF ARTIFICIAL

Artificial Intelligence Intro (Chapter 1 of AIMA) Summary Artificial Intelligence What is AI?

What is Artificial Intelligence? CPSC 322 Lecture 1 September 5, 2007 What is Artificial

Traditional Definition of Artificial Intelligence Trends Artificial Intelligence (AI) is

A large-scale International IPv6 Network A large-scale International IPv6 Network www.6net.org

Learning from Unlabeled Video Carl Vondrick Columbia University Survivor Bias of Video Data

8th November 2019 Artificial Intelligence Finance Institute NYU Courant Artificial Intelligence

FINANCING LARGE SCALE SOLAR Large Scale Solar Conference - Sydney Gloria Chan Director, Large

Artificial Intelligence as Law Bart Verheij Department of Artificial Intelligence, Bernoulli

CSCI 446 ARTIFICIAL INTELLIGENCE EXAM 1 STUDY OUTLINE Introduction to Artificial Intelligence

Lecture Overview What is Artificial Intelligence? Agents acting in an environment

CSCI 446: Artificial Intelligence CSCI 446: Artificial Intelligence Course Website:

1.1 What is AI? 1. What is Artificial Intelligence? 2. AI Past and Present 3. Rational

Theory of Computer Games: An A.I. Oriented Introduction Tsan-sheng Hsu tshsu@iis.sinica.edu.tw

SMALL SCALE, LOW-DENSE CITY Urban planning, climate and energy policy Chris Butters, Ali

2016 Christmas Special Three Portraits of the Birth of the Messiah - Lesson #2 December 18,

Genesis 39-41 Ol Old 1) How did Joseph experience solitude? As a slave in Potiphars

Pipelined Computations In the pipeline technique, the problem is divided into a series of tasks

Sets Measures of Central Tendency Standard Deviation and Normal Distribution Two-Way Frequency

University of Helsinki Ask Questions Dont be shy, I like to answer them I tell lots

Chapter6 In AI we limit games to: - deterministic - turn-taking - two-player -