Autonomous Agents (COMP513) Q-Learning Super Mario World Papathanasiou Theodoros 2011030058
Implementation Model Four Foundational Parts: 1. Q-value and Constant Initialization 2. Input Management 3. Move Selection 4. Q-Value & State Update OR State Update
Q-value and Constant Initialization After extensive experimental runs we settled on the following values for the constants of the Q-value update function: learning rate α = 0.5, discount factor γ = 0.8. We also settled on the value and annealing rate of the temperature parameter used in Boltzmann exploration: temperature T = 4, annealing rate 0.001 per run.
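A minimal Lua sketch of how these constants and an empty Q-table could be initialized; the variable names and the lazily-filled table layout are illustrative assumptions, not the exact MarI/O integration:

-- Q-learning constants (values chosen after experimentation, see above)
ALPHA = 0.5          -- learning rate α
GAMMA = 0.8          -- discount factor γ
TEMPERATURE = 4      -- Boltzmann exploration temperature T
ANNEAL_RATE = 0.001  -- temperature decrease applied after each run

-- Q-table: Q[state][action] -> value, entries created lazily with value 0
Q = {}
function getQ(state, action)
  Q[state] = Q[state] or {}
  Q[state][action] = Q[state][action] or 0
  return Q[state][action]
end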
Input Management In this part of our implementation, we use MarI/O's infrastructure to detect Mario's movement to the right. We keep track of the rightmost position Mario has reached, read from the pixel position in the emulated ROM, and compute a reward proportional to it.
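A hedged sketch of this rightmost-position tracking; readMarioX() is a hypothetical placeholder for the memory read that MarI/O's infrastructure performs to obtain Mario's horizontal pixel position:

rightmost = 0

-- readMarioX() is a placeholder, not a real MarI/O function name
function updateRightmost()
  local x = readMarioX()
  if x > rightmost then
    rightmost = x
  end
end

-- reward grows with horizontal progress (r = rightmost / 8, defined later)
function currentReward()
  return rightmost / 8
end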
Move Selection After each state advance, our agent considers what its next move should be. The first step is to compute each action's probability using Boltzmann (softmax) exploration, an exploitation-biased exploration technique: P(a) = exp(Q(s,a)/T) / Σ_a' exp(Q(s,a')/T). Then a pseudo-random number is generated and an action is chosen according to these probabilities. The larger the Q-value of an action, the larger its probability.
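A self-contained Lua sketch of the Boltzmann (softmax) selection step; qValues is assumed to be an array of Q-values for the actions available in the current state, and temperature is T from the initialization slide:

-- Pick an action index with probability proportional to exp(Q/T)
function boltzmannSelect(qValues, temperature)
  local weights, total = {}, 0
  for i, q in ipairs(qValues) do
    weights[i] = math.exp(q / temperature)
    total = total + weights[i]
  end
  local r = math.random() * total   -- pseudo-random draw in [0, total)
  local cumulative = 0
  for i, w in ipairs(weights) do
    cumulative = cumulative + w
    if r <= cumulative then
      return i
    end
  end
  return #qValues                   -- numerical safety fallback
end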
Q-Value & State Update OR State Update At each iteration of our game algorithm we check whether Mario has been stationary for too long. If the rightmost position has not changed, we increment a timeout counter. If the timeout counter reaches 60, we end the run and proceed to the Q-value update; otherwise we simply proceed to the next state.
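A sketch of this stagnation check, assumed to run once per iteration of the game loop and reusing the rightmost tracker from the earlier sketch:

TIMEOUT_LIMIT = 60
timeout = 0
previousRightmost = 0

-- called once per game-loop iteration
function stuckTooLong()
  if rightmost > previousRightmost then
    previousRightmost = rightmost
    timeout = 0
  else
    timeout = timeout + 1
  end
  return timeout >= TIMEOUT_LIMIT   -- true: end the run and update the Q-values
end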
Q-Value & State Update OR State Update cont. When the run comes to an end, the agent updates its Q-values. We iterate from the first state to the last and update the values based on the result we received:
Q(s,a) = Q(s,a) + α(r + γ·max_a' Q(s',a') - Q(s,a))
If s' is a terminal state then:
Q(s,a) = Q(s,a) + α(r - Q(s,a))
The reward is given as: r = Rightmost Pixel on X Axis / 8
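A sketch of the end-of-run update, reusing Q, getQ, ALPHA and GAMMA from the earlier sketches; history is assumed to hold the per-step {state, action, reward, nextState} records collected during the run, with nextState == nil marking a terminal state:

-- Largest Q-value available in a state (0 for unseen states; Q-values stay
-- non-negative here since rewards are non-negative)
function maxQ(state)
  local best = 0
  if Q[state] then
    for _, v in pairs(Q[state]) do
      if v > best then best = v end
    end
  end
  return best
end

-- End-of-run update, iterating from the first recorded step to the last
function updateRun(history)
  for _, step in ipairs(history) do
    local target
    if step.nextState == nil then        -- terminal state: no bootstrap term
      target = step.reward
    else
      target = step.reward + GAMMA * maxQ(step.nextState)
    end
    local old = getQ(step.state, step.action)
    Q[step.state][step.action] = old + ALPHA * (target - old)
  end
end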
Results With our current configuration we get the following statistics:
Avg. runs per obstacle: 1.47
Avg. runs to finish: 32
Future Work For future work, we suggest the following:
● Improvement in recognising when a run has ended, e.g. through the death animation.
● Improvement in the design of the state space and its representation, either through neural networks or a more efficient Lua-based architecture.
● Introduction of machine vision (pattern recognition) to create a more observable game and knowledge based on causes rather than only on the effects of events.
Questions? Thank you for your time Thodoris Papathanasiou