cs344m autonomous multiagent systems
play

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department - PowerPoint PPT Presentation

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department or Computer Science The University of Texas at Austin Good Afternoon, Colleagues Are there any questions? Patrick MacAlpine Logistics How to read a research paper Patrick


  1. CS344M Autonomous Multiagent Systems Patrick MacAlpine Department or Computer Science The University of Texas at Austin

  2. Good Afternoon, Colleagues Are there any questions? Patrick MacAlpine

  3. Logistics • How to read a research paper Patrick MacAlpine

  4. Logistics • How to read a research paper – Some have too few details... Patrick MacAlpine

  5. Logistics • How to read a research paper – Some have too few details... – Others have too many. Patrick MacAlpine

  6. Logistics • How to read a research paper – Some have too few details... – Others have too many. • Next week’s readings posted Patrick MacAlpine

  7. Logistics • How to read a research paper – Some have too few details... – Others have too many. • Next week’s readings posted Patrick MacAlpine

  8. Overview of the Readings Darwin: genetic programming approach Patrick MacAlpine

  9. Overview of the Readings Darwin: genetic programming approach Stone and McAllester: Architecture for action selection Patrick MacAlpine

  10. Overview of the Readings Darwin: genetic programming approach Stone and McAllester: Architecture for action selection Riley et al: Coach competition, extracting models Patrick MacAlpine

  11. Overview of the Readings Darwin: genetic programming approach Stone and McAllester: Architecture for action selection Riley et al: Coach competition, extracting models Kuhlmann et al: Learning for coaching Patrick MacAlpine

  12. Overview of the Readings Darwin: genetic programming approach Stone and McAllester: Architecture for action selection Riley et al: Coach competition, extracting models Kuhlmann et al: Learning for coaching Wihthop and Reidmiller: Reinforcement learning Patrick MacAlpine

  13. Overview of the Readings Darwin: genetic programming approach Stone and McAllester: Architecture for action selection Riley et al: Coach competition, extracting models Kuhlmann et al: Learning for coaching Wihthop and Reidmiller: Reinforcement learning MacAlpine, Price, and Stone: Role assignment Patrick MacAlpine

  14. Overview of the Readings Darwin: genetic programming approach Stone and McAllester: Architecture for action selection Riley et al: Coach competition, extracting models Kuhlmann et al: Learning for coaching Wihthop and Reidmiller: Reinforcement learning MacAlpine, Price, and Stone: Role assignment MacAlpine, Depinet, and Stone: Overlapping layered learning Patrick MacAlpine

  15. Evolutionary Computation • Motivated by biological evolution: GA, GP Patrick MacAlpine

  16. Evolutionary Computation • Motivated by biological evolution: GA, GP • Search through a space Patrick MacAlpine

  17. Evolutionary Computation • Motivated by biological evolution: GA, GP • Search through a space − Need a representation, fitness function − Probabilistically apply search operators to set of points in search space Patrick MacAlpine

  18. Evolutionary Computation • Motivated by biological evolution: GA, GP • Search through a space − Need a representation, fitness function − Probabilistically apply search operators to set of points in search space • Randomized, parallel hill-climbing through space Patrick MacAlpine

  19. Evolutionary Computation • Motivated by biological evolution: GA, GP • Search through a space − Need a representation, fitness function − Probabilistically apply search operators to set of points in search space • Randomized, parallel hill-climbing through space • Learning is an optimization problem (fitness) Patrick MacAlpine

  20. Evolutionary Computation • Motivated by biological evolution: GA, GP • Search through a space − Need a representation, fitness function − Probabilistically apply search operators to set of points in search space • Randomized, parallel hill-climbing through space • Learning is an optimization problem (fitness) Some slides from Machine Learning [Mitchell, 1997] Patrick MacAlpine

  21. Darwin United • More ambitious follow-up to Luke, 97 (made 2nd round) Patrick MacAlpine

  22. Darwin United • More ambitious follow-up to Luke, 97 (made 2nd round) • Motivated in part by Peter’s detailed team construction Patrick MacAlpine

  23. Darwin United • More ambitious follow-up to Luke, 97 (made 2nd round) • Motivated in part by Peter’s detailed team construction • Evolves whole teams — lexicographic fitness function Patrick MacAlpine

  24. Darwin United • More ambitious follow-up to Luke, 97 (made 2nd round) • Motivated in part by Peter’s detailed team construction • Evolves whole teams — lexicographic fitness function • Lots of spinning, but figured out dribbling, offsides Patrick MacAlpine

  25. Darwin United • More ambitious follow-up to Luke, 97 (made 2nd round) • Motivated in part by Peter’s detailed team construction • Evolves whole teams — lexicographic fitness function • Lots of spinning, but figured out dribbling, offsides • 1-1-1 record. Tied a good team, but didn’t advance Patrick MacAlpine

  26. Darwin United • More ambitious follow-up to Luke, 97 (made 2nd round) • Motivated in part by Peter’s detailed team construction • Evolves whole teams — lexicographic fitness function • Lots of spinning, but figured out dribbling, offsides • 1-1-1 record. Tied a good team, but didn’t advance • Success of the method, but not pursued Patrick MacAlpine

  27. Overlapping Layered Learning • Machine learning paradigms (not algorithms) Patrick MacAlpine

  28. Overlapping Layered Learning • Machine learning paradigms (not algorithms) • Useful for learning complex skills that work well together Patrick MacAlpine

  29. Overlapping Layered Learning • Machine learning paradigms (not algorithms) • Useful for learning complex skills that work well together • (slides) Patrick MacAlpine

Recommend


More recommend