Agent for Ms. Pac-Man vs. Ghost Team Competition , - PowerPoint PPT Presentation

Dec 12, 2022 •401 likes •504 views

Agent for Ms. Pac-Man vs. Ghost Team Competition , 2012030142 513 - Target Try to maximize your score by eating as many pills/power pills/ghosts as you can

Agent for Ms. Pac-Man vs. Ghost Team Competition Πλατανιώτης Στέργιος, 2012030142 ΠΛΗ513 - Αυτόνομοι Πράκτορες
Target • Try to maximize your score by eating as many pills/power pills/ghosts as you can • Available moves are UP/DOWN/LEFT/RIGHT/NEUTRAL • Partial observability(PO): Ms. Pac-Man can only see in a vertical and horizontal line • This yields many problems as it is more likely to get stuck in local maxima states when you can see no food or ghosts around you • Also, it is more difficult because the ghosts have internal communication and can get you trapped very easily without you even realizing it
Q-learning • Implementation of the reinforcement learning Q- learning algorithm • A table with a value for every pair of (state, move) • After every round we update the entry for the previous (state, move) • Takes as parameters a:learning rate and γ :discount factor • The values are proven to converge to an optimal policy for 0 <=α<= 1 and 0 <=γ<= 1
Move selection • ε -soft implementation: in each round we choose a random move with a small probability ε, this is used only during learning to encourage exploration • Otherwise, we choose our move greedily by choosing the move with the highest Q value • If multiple moves have the same best value, we can either keep our old move, if it is still optimal, or just choose a random from the optimal ones
State generalization • There is a huge amount of different states in Ms. Pac- Man game • We generalize the states based on specific features • We check: • If there is a wall up, down, left and right of Ms. Pac-Man • If there is an intimidating ghost approaching her • And finally, the direction of the nearest food or exit if we are being chased • This way we decrease the number of possible states dramatically and make the learning process faster
Reward Function • Reward function gives a positive value if Ms. Pac-Man did something good or negative if she did something bad • We encourage her to eat pills/power/pills/ghost (+20) • We give a penalty for being eaten by a ghost (-350), for hitting a wall (-100) , for doing an opposite move (-6) and for every step she takes (-2.5) to make her find a quickest optimal path
Results • Training for thousands of games using a decaying ε probability starting at 0.1 Score Average MAX MIN StarterGhostsComm 3671 13200 810 StarterGhosts 3650 14860 670
Future Work • In the future we can make use of a genetic algorithm to find an optimal pair of parameters α and γ • Also there can be implemented a Neural Network to better train our agent, this is also known as deep learning and is popular for its results
Thank you for your time!

Recommend

Mike New man Mike New man Mike New man Mike New man Mike New man Mike New man Mike New man

Mike New man Mike New man Mike New man Mike New man Mike New man Mike New man Mike New man Mike New man SVP & Chief Financial Officer SVP & Chief Financial Officer 4Q 2002 Sales 4Q 2002 Sales Comparable Store Sales +2%

478 views • 32 slides

Guiding Financial Controls and Practices for PACs and PAC Treasurers PAC Treasurers Workshop

Guiding Financial Controls and Practices for PACs and PAC Treasurers PAC Treasurers Workshop November 1, 2018 Agenda PAC Purpose and Structure Role of PAC Treasurer Treasurer Upon Taking Office PAC Bank Accounts

854 views • 36 slides

Pa C aml Pac-Man Game Programming Language Chun-Kang Chen/Hui-Hsiang Kuo/Wenxin Zhu/Shuwei Cao

Pa C aml Pac-Man Game Programming Language Chun-Kang Chen/Hui-Hsiang Kuo/Wenxin Zhu/Shuwei Cao Overview PaCaml = Pac-Man + Ocaml A game programming language facilitating the design of elements in PAC-MAN scene Simple people with

325 views • 10 slides

NAPSLO PAC Contributions How contributing to the NAPSLO PAC will benefit you, your company and the

NAPSLO PAC Contributions How contributing to the NAPSLO PAC will benefit you, your company and the surplus lines industry www.napslo.org What is a PAC? A PAC is a political action committee Political action committees are special

527 views • 11 slides

WELCOME June 2011 PAC Presentation Opening Remarks Introductions June 2011 PAC

PAC Meeting 1 WELCOME June 2011 PAC Presentation Opening Remarks Introductions June 2011 PAC Presentation 2 June 2011 PAC Presentation 3 Purpose of Meeting 4 Confirm Corridor Vision Confirm Goals and Objectives Confirm

483 views • 35 slides

AAOS Orthopaedic PAC The Orthopaedic PAC is the only national political action committee

AAOS Orthopaedic PAC The Orthopaedic PAC is the only national political action committee representing the interests of orthopaedic surgeons before Congress. It is your sole voice on Capitol Hill! The Orthopaedic PAC AAOS Orthopaedic PAC is

377 views • 25 slides

LArIAT Fermilab PAC Meeting November 11, 2016 Jen Raaf PAC Charge Fermilab PAC Meeting, J.

LArIAT Fermilab PAC Meeting November 11, 2016 Jen Raaf PAC Charge Fermilab PAC Meeting, J. Raaf Nov. 11, 2016 2 Motivation: Needs of Neutrino Experiments Typical neutrino event Outgoing lepton: Flavor: CC vs. NC, + vs. - , e vs.

979 views • 45 slides

Overview Multi-Agent Systems Introduction to multi-agent systems and agent societies Agent

CPE/CSC 580-S06 Artificial Intelligence Intelligent Agents Overview Multi-Agent Systems Introduction to multi-agent systems and agent societies Agent Communication knowledge exchange among agents Agent Interaction eliminates explicit

623 views • 26 slides

MAN ELECTRIC | ELECTRONIC SYSTEMS MAN TRUCK & BUS AG MAN Electric/Electronic Systems

395 views • 36 slides

Modeling Land Competition Modeling Land Competition Modeling Land Competition Ron Sands Ron

Modeling Land Competition Modeling Land Competition Modeling Land Competition Ron Sands Ron Sands Ron Sands Man- -Keun Keun Kim Kim Man Man-Keun Kim Joint Global Change Research Institute Joint Global Change Research Institute Joint

317 views • 29 slides

Ghost River State of the Watershed (2018) - Ghost Watershed Alliance Society Photo credit: R.

Ghost River State of the Watershed (2018) - Ghost Watershed Alliance Society Photo credit: R. Drury State of the Watershed Report Formal process by Alberta Government Followed Handbook for State of the Watershed Reporting Used

470 views • 44 slides

Erin Murray The general first idea was to have a playful ghost flying around a building for

Erin Murray The general first idea was to have a playful ghost flying around a building for their own enjoyment. The story of the ghost then became more prominent as I developed my storyboards. The ghost was looking for a place to haunt

336 views • 10 slides

A GHOST FROM POSTSCRIPT for RUXCON 2017 A GHOST FROM POSTSCRIPT WHO ARE WE redrain

REDRAIN & MIN(SPARK) ZHENG A GHOST FROM POSTSCRIPT for RUXCON 2017 A GHOST FROM POSTSCRIPT WHO ARE WE redrain min(spark) zheng Qihoo 360CERT CUHK PhD Alibaba Security Expert Low-level security researcher Pentester with interest in

818 views • 52 slides

The Ghost in the Machine Dawie van den Heever SSM with implants 2 SSM with implants 3 SSM

1 The Ghost in the Machine Dawie van den Heever SSM with implants 2 SSM with implants 3 SSM with implants 4 ECG reconstruction & classification 5 Robotic limbs 6 PANDAS 7 Ghost in the Shell 8 Ghost in the Machine 9

419 views • 17 slides

Ghost Peaks: How to Fix a Haunting Problem Jacob A. Rebholz Teledyne Tekmar VOC Product Line

Ghost Peaks: How to Fix a Haunting Problem Jacob A. Rebholz Teledyne Tekmar VOC Product Line Manager Outline of Topics What is a ghost peak? How to identify ghost peaks and their source How to correct various sources of

561 views • 43 slides

Piloting HCV GHOST in Michigan www.michigan.gov/hepatitis November 29, 2017 Joe Coyle, MPH

Piloting HCV GHOST in Michigan www.michigan.gov/hepatitis November 29, 2017 Joe Coyle, MPH Outline MDHHS HCV Surveillance Basics of HCV GHOST Example of GHOST in Action Implications for Public Health Surveillance MDHHS HCV

455 views • 34 slides

responsibilities as keepers of the earth Through united efforts to keep our mighty Fraser River

To continue our roles and responsibilities as keepers of the earth Through united efforts to keep our mighty Fraser River mighty ! Began September 24 at Lillooet and ended September 25, 2015 at Yale, BC. This two day Fraser River shore

353 views • 12 slides

The Ghost of Modality in Quantum Physics Abstract for Invited Presentation for Physics Beyond

The Ghost of Modality in Quantum Physics Abstract for Invited Presentation for Physics Beyond Relativity 2019 Akira Kanda Omega Mathematical Institute/ University of Toronto Mihai Prunescu University of Bucharest, Romanian Academy of

541 views • 9 slides

Nonlinear massive gravity and Cosmology Shinji Mukohyama (Kavli IPMU, U of Tokyo) Based on

Nonlinear massive gravity and Cosmology Shinji Mukohyama (Kavli IPMU, U of Tokyo) Based on collaboration with Antonio DeFelice, Emir Gumrukcuoglu, Chunshan Lin Happy Birthdays! I would like to congratulate Kodama-san, Sasaki-san and

841 views • 37 slides

Expanding Your Field of Vision What! Only one drink ticket?? Ghost of Christmas Past

Expanding Your Field of Vision What! Only one drink ticket?? Ghost of Christmas Past (Ghost of KYTC Past) KYTC Cash Model Christmas 2007 Cash Model Catastrophe! Cash Balance projected to be $10M in December 2009 Cash Balance

454 views • 19 slides

using a ghost-mode participation tool by Eguono Wayne Omagamre Kausik Das Outline of

Evaluation of impact of class participation on students performance using a ghost-mode participation tool by Eguono Wayne Omagamre Kausik Das Outline of Presentation 1.0 Introduction Value of STEM Education Challenges

890 views • 24 slides

EeIP, Brussels 15 th September 2017 Vitor Judcibus I_HeERO Portuguese Project Coordinator

I_HeERO - Ghost calls from mobile handsets EeIP, Brussels 15 th September 2017 Vitor Judcibus I_HeERO Portuguese Project Coordinator General Secretariat of Internal Administration Portugal Agenda Introduction ETSI Standard

119 views • 11 slides

Roger Hosein TEDU UWI St Augustine Outline of presentation a) background on Make work

Roger Hosein TEDU UWI St Augustine Outline of presentation a) background on Make work programs b) some simple economics of Make work programs in TT, Suriname and Guyana c) basic welfare calculations d) Make work programs as a

1.61k views • 11 slides

A Separation Logic to Verify Termination of Busy-Waiting for Abrupt Program Exit Tobias Reinhard 1

A Separation Logic to Verify Termination of Busy-Waiting for Abrupt Program Exit Tobias Reinhard 1 , Amin Timany 2 , Bart Jacobs 1 1 KU Leuven, imec-DistriNet Research Group 2 Aarhus University, Logic and Semantics Group 23rd July 2020 Motivation

460 views • 23 slides