CS344M Autonomous Multiagent Systems Todd Hester Department or - PowerPoint PPT Presentation

CS344M Autonomous Multiagent Systems Todd Hester Department or Computer Science The University of Texas at Austin

Good Afternoon, Colleagues Are there any questions? Todd Hester

Logistics • Readings Todd Hester

Logistics • Readings – Specify which papers you read! Todd Hester

Logistics • Readings – Specify which papers you read! – 2 case studies and 1 TDP Todd Hester

Logistics • Readings – Specify which papers you read! – 2 case studies and 1 TDP • How to read a research paper Todd Hester

Logistics • Readings – Specify which papers you read! – 2 case studies and 1 TDP • How to read a research paper – Some have too few details... Todd Hester

Logistics • Readings – Specify which papers you read! – 2 case studies and 1 TDP • How to read a research paper – Some have too few details... – Others have too many. Todd Hester

Logistics • Readings – Specify which papers you read! – 2 case studies and 1 TDP • How to read a research paper – Some have too few details... – Others have too many. • Next week’s readings posted Todd Hester

Logistics • Readings – Specify which papers you read! – 2 case studies and 1 TDP • How to read a research paper – Some have too few details... – Others have too many. • Next week’s readings posted • Use the undergrad writing center! – Friday afternoon workshops (3 p.m.) Todd Hester

Overview of the Readings • Darwin: genetic programming approach Todd Hester

Overview of the Readings • Darwin: genetic programming approach • Stone and McAllester: Architecture for action selection Todd Hester

Overview of the Readings • Darwin: genetic programming approach • Stone and McAllester: Architecture for action selection • Riley et al: Coach competition, extracting models Todd Hester

Overview of the Readings • Darwin: genetic programming approach • Stone and McAllester: Architecture for action selection • Riley et al: Coach competition, extracting models • Kuhlmann et al: Learning for coaching Todd Hester

Overview of the Readings • Darwin: genetic programming approach • Stone and McAllester: Architecture for action selection • Riley et al: Coach competition, extracting models • Kuhlmann et al: Learning for coaching • Withopf and Riedmiller: Reinforcement learning Todd Hester

Overview of the Readings • Darwin: genetic programming approach • Stone and McAllester: Architecture for action selection • Riley et al: Coach competition, extracting models • Kuhlmann et al: Learning for coaching • Withopf and Riedmiller: Reinforcement learning • MacAlpine et al: UT Austin Villa 2011 Todd Hester

Overview of the Readings • Darwin: genetic programming approach • Stone and McAllester: Architecture for action selection • Riley et al: Coach competition, extracting models • Kuhlmann et al: Learning for coaching • Withopf and Riedmiller: Reinforcement learning • MacAlpine et al: UT Austin Villa 2011 • Barrett et al: SPL Kicking strategy Todd Hester

Evolutionary Computation • Motivated by biological evolution: GA, GP Todd Hester

Evolutionary Computation • Motivated by biological evolution: GA, GP • Search through a space Todd Hester

Evolutionary Computation • Motivated by biological evolution: GA, GP • Search through a space − Need a representation, fitness function − Probabilistically apply search operators to set of points in search space Todd Hester

Evolutionary Computation • Motivated by biological evolution: GA, GP • Search through a space − Need a representation, fitness function − Probabilistically apply search operators to set of points in search space • Randomized, parallel hill-climbing through space Todd Hester

Evolutionary Computation • Motivated by biological evolution: GA, GP • Search through a space − Need a representation, fitness function − Probabilistically apply search operators to set of points in search space • Randomized, parallel hill-climbing through space • Learning is an optimization problem (fitness) Todd Hester

Evolutionary Computation • Motivated by biological evolution: GA, GP • Search through a space − Need a representation, fitness function − Probabilistically apply search operators to set of points in search space • Randomized, parallel hill-climbing through space • Learning is an optimization problem (fitness) Some slides from Machine Learning [Mitchell, 1997] Todd Hester

Darwin United • More ambitious follow-up to Luke, 97 (made 2nd round) Todd Hester

Darwin United • More ambitious follow-up to Luke, 97 (made 2nd round) • Motivated in part by Peter’s detailed team construction Todd Hester

Darwin United • More ambitious follow-up to Luke, 97 (made 2nd round) • Motivated in part by Peter’s detailed team construction • Evolves whole teams — lexicographic fitness function Todd Hester

Darwin United • More ambitious follow-up to Luke, 97 (made 2nd round) • Motivated in part by Peter’s detailed team construction • Evolves whole teams — lexicographic fitness function • Evolved on huge (at the time) hypercube Todd Hester

Darwin United • More ambitious follow-up to Luke, 97 (made 2nd round) • Motivated in part by Peter’s detailed team construction • Evolves whole teams — lexicographic fitness function • Evolved on huge (at the time) hypercube • Lots of spinning, but figured out dribbling, offsides Todd Hester

Darwin United • More ambitious follow-up to Luke, 97 (made 2nd round) • Motivated in part by Peter’s detailed team construction • Evolves whole teams — lexicographic fitness function • Evolved on huge (at the time) hypercube • Lots of spinning, but figured out dribbling, offsides • 1-1-1 record. Tied a good team, but didn’t advance Todd Hester

Darwin United • More ambitious follow-up to Luke, 97 (made 2nd round) • Motivated in part by Peter’s detailed team construction • Evolves whole teams — lexicographic fitness function • Evolved on huge (at the time) hypercube • Lots of spinning, but figured out dribbling, offsides • 1-1-1 record. Tied a good team, but didn’t advance • Success of the method, but not pursued Todd Hester

Architecture for Action Selection • (other slides, video) Todd Hester

Architecture for Action Selection • (other slides, video) • downsides Todd Hester

Architecture for Action Selection • (other slides, video) • downsides • Keepaway Todd Hester

Coaching • Learn best strategy to play a fixed team Todd Hester

Coaching • Learn best strategy to play a fixed team • Give high level advice to players at low frequency Todd Hester

Coaching • Learn best strategy to play a fixed team • Give high level advice to players at low frequency • Focus on learning formations Todd Hester

Coaching • Learn best strategy to play a fixed team • Give high level advice to players at low frequency • Focus on learning formations • Learn when successful teams passed/kicked Todd Hester

Coaching • Learn best strategy to play a fixed team • Give high level advice to players at low frequency • Focus on learning formations • Learn when successful teams passed/kicked • Learn when opponent will pass and try to block Todd Hester

Coaching • Learn best strategy to play a fixed team • Give high level advice to players at low frequency • Focus on learning formations • Learn when successful teams passed/kicked • Learn when opponent will pass and try to block • What if players switch roles? Todd Hester

Coaching • Learn best strategy to play a fixed team • Give high level advice to players at low frequency • Focus on learning formations • Learn when successful teams passed/kicked • Learn when opponent will pass and try to block • What if players switch roles? • Why just imitate another team? Todd Hester

Coaching • Learn best strategy to play a fixed team • Give high level advice to players at low frequency • Focus on learning formations • Learn when successful teams passed/kicked • Learn when opponent will pass and try to block • What if players switch roles? • Why just imitate another team? • Other slides Todd Hester

Reinforcement Learning • RL Slides Todd Hester

Reinforcement Learning • RL Slides • Extend to grid soccer Todd Hester

Reinforcement Learning • RL Slides • Extend to grid soccer • Large state space, joint actions Todd Hester

UT Austin Villa 2011 • Other slides Todd Hester

UT Austin Villa 2011 • Other slides • Why not use CMA-ES on role positions as well? Todd Hester

UT Austin Villa 2011 • Other slides • Why not use CMA-ES on role positions as well? • Changes for 2012? Todd Hester

Kicking Under Uncertainty • Used by our SPL team Todd Hester

Kicking Under Uncertainty • Used by our SPL team • Kick engine to kick at various distances/headings Todd Hester

Kicking Under Uncertainty • Used by our SPL team • Kick engine to kick at various distances/headings • Adjust to seen ball location Todd Hester

CS344M Autonomous Multiagent Systems Todd Hester Department or - PowerPoint PPT Presentation

CS344M Autonomous Multiagent Systems Todd Hester Department or Computer Science The University of Texas at Austin Good Afternoon, Colleagues Are there any questions? Todd Hester Logistics Readings Todd Hester Logistics Readings

CHAPTER 11: MULTIAGENT INTERACTIONS An Introduction to Multiagent Systems

CHAPTER 6: MULTIAGENT INTERACTIONS An Introduction to Multiagent Systems

LECTURE 6: MULTIAGENT INTERACTIONS An Introduction to Multiagent Systems

LECTURE 6: MULTIAGENT INTERACTIONS An Introduction to MultiAgent Systems

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department of Computer Science The

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department or Computer Science The

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department of Computer Science The

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department of Computer Science The

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department of Computer Science The

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department or Computer Science The

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department or Computer Science The

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department of Computer Science The

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department or Computer Science The

CS344M Autonomous Multiagent Systems Todd Hester Department or Computer Science The University

CS344M Autonomous Multiagent Systems Todd Hester Department or Computer Science The University

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department of Computer Science The

IMPROVEMENT MICHAEL ORLOV SHAMOON COLLEGE OF ENGINEERING ISRAEL GECCO GI 2018, KYOTO, JAPAN 2018

THE EXPECTATION ECONOMY: EXCEEDING THE HIGH EXPECTATIONS OF TODAYS RENTERS H O W C A N W E

APNA 30th Annual Conference Session 3012: October 21, 2016 Building Resilience through Positive

Discovery of Activity Patterns using Topic Models Paper by Tm Hunh, Mario Fritz and Bernt

On the value of acquired information in gambling, thermodynamics and population dynamics Luca

Exemplar dynamics and the emergence of categories Gerhard J ager

Metaheuristic models for decision support in the software construction process Ph.D. Thesis

Evolutionary Algorithms for Complex Designs of Experiments and Data Analysis Irene Poli Dep. of