OOP Inheri -tance En- capsu- lation Learning to Play Tetris via - PowerPoint PPT Presentation

Poly- mor- phism Abstra ction Class OOP Inheri -tance En- capsu- lation Learning to Play Tetris via Kuan-Ting Lai 2020/5/25 Deep Reinforcement Learning

Deep Reinforcment Learning (DRL)

Establishing Communiation between Java and Python • Remote Procedure Call (RPC) • MsgPack RPC − MessagePack: It’s like JSON, but fast and small https://msgpack.org/index.html Action State

OpenAI Gym • https://gym.openai.com/

Hello Gym Example import gym env = gym.make("CartPole-v1") observation = env.reset() for _ in range(1000): env.render() # your agent here (this takes random actions) action = env.action_space.sample() observation, reward, done, info = env.step(action) if done: observation = env.reset() env.close() https://gym.openai.com/docs/

step() • observation (state 𝑇 𝑢 ): − Observation of the environment. Ex: pixel data from a camera, joint angles and joint velocities of a robot, or the board state in a board game. • reward ( 𝑠 𝑢 ) − amount of reward achieved by the previous action. • done: − True indicates the episode has terminated • info: − diagnostic information useful for debugging.

Creating Tetris DRL Project

Creating a Maven Project • New Project -> Maven − GroupId: org.aiotlab − ArtifactId: Tetris4DRL

<dependencies dependencies> Adding Dependencies <dependency dependency> <groupId groupId>org.msgpack</groupId groupId> <artifactId artifactId>msgpack</artifactId artifactId> to “pom.xml” <version version>0.6.12</version version> </dependency dependency> <dependency dependency> <groupId groupId>org.jboss.netty</groupId groupId> • Add the dependencies of <artifactId artifactId>netty</artifactId artifactId> <version version>3.2.1.Final</version version> <exclusions exclusions> msgpack-rpc-java to pom.xml <exclusion exclusion> <groupId groupId>javax.servlet</groupId groupId> <artifactId artifactId>servlet-api</artifactId artifactId> </exclusion exclusion> <exclusion exclusion> <groupId groupId>commons-logging</groupId groupId> <artifactId artifactId>commons-logging</artifactId artifactId> </exclusion exclusion> </exclusions exclusions> </dependency dependency> <dependency dependency> <groupId groupId>org.slf4j</groupId groupId> <artifactId artifactId>slf4j-api</artifactId artifactId> <version version>1.6.1</version version> </dependency dependency> <dependency dependency> <groupId groupId>org.slf4j</groupId groupId> <artifactId artifactId>slf4j-log4j12</artifactId artifactId> <version version>1.6.1</version version> </dependency dependency> </dependencies dependencies>

Reimporting Maven

MsgPack-RPC for Java

Download and Compile msgpack-rpc-java 1. Install Maven (link) 2. Download msgpack-rpc-java − https://github.com/msgpack-rpc/msgpack-rpc-java 3. Compile source code − mvn compile 4. Create jar file − mvn package 5. Fix a bug “type optional is ambiguous” ( link) 6. Copy and add the library to IntelliJ project − Add “target/msgpack -rpc-0.7.1- SNAPSHOT.jar” to our project

Fixing Error “type optional is ambiguous” • https://github.com/msgpack-rpc/msgpack-rpc-java/pull/12 • Issue: Maven can't compile the package because of ambiguous optional type in AnnotationTest • Solution: Explicitly Import org.msgpack.annotation.Optional

Add “msgpack -rpc- x.x.x.jar” • Create a folder “lib” • Copy and paste “msgpack -rpc- 0.7.1.jar” to the lib/ folder • Right click the jar file, and select “Add as Library…”

Remove Timer in Board.java private void private void initBoard(Tetris parent) { setFocusable(true true); /*timer = new Timer(); timer.scheduleAtFixedRate(new ScheduleTask(), INITIAL_DELAY, PERIOD_INTERVAL);*/ … } private void private void newPiece() { … if if (!tryMove(curPiece curPiece, curX curX, curY curY)) { curPiece curPiece.setShape(Tetrominoe. NoShape NoShape ); //timer.cancel(); isStarted isStarted = false false; statusbar statusbar.setText("Game over" "Game over"); } }

Create the RPC Server public class public class TetrisServer { public static void public static void main(String[] args) { // Create a RPC server EventLoop loop = EventLoop. defaultEventLoop (); Server svr = new new Server(); svr.serve(new new TetrisServer()); try try { svr.listen( server_port ); System. out out .println("Tetris RPC server is listening at " "Tetris RPC server is listening at " + server_port ); loop.join(); } catch catch (IOException exp1) { exp1.printStackTrace(); } catch catch (InterruptedException exp2) { exp2.printStackTrace(); } } }

public class TetrisServer { Add OpenAI Gym API static int server_port = 10612; static Tetris game ; // OpenAI GYM API public int [] reset() { game .restart(); return game .getBoardState(); } public int [] step(int action_type) { return game .step(action_type); } public int getTotalReward() { return game .getLinesRemoved(); } public boolean isDone() { return game .isGameOver(); } public static void main(String[] args) { game = new Tetris(); game .setVisible(true); // Create a RPC server EventLoop loop = EventLoop. defaultEventLoop (); Server svr = new Server(); svr.serve(new TetrisServer()); try { svr.listen( server_port ); System. out .println("Tetris RPC server is listening at " + server_port ); loop.join(); } catch (IOException exp1) { exp1.printStackTrace(); } catch (InterruptedException exp2) { exp2.printStackTrace(); } } }

getBoardState() public public int int [] getBoardState() { // Get board state int int b_size = boar oard.getBoardSize(); Shape.Tetrominoe[] b_data = boar board.getBoardState(); int int [] state = new int new int[b_size]; for for (int int i=0; i<b_size; i++) { state[i] = b_data[i].ordinal(); } return return state; }

public int [] step(int action_type) { int [] step(int action_type) if if (isGameOver()) return getBoardState(); switch(action_type) { case case 0: // Move left move(-1); break; case case 1: // Move right move(1); break; case case 2: // Rotate left rotate(false); break; case case 3: // Rotate right rotate(true); break; case case 4: // Drop dropDown(); break; } board.doGameCycle(); return getBoardState(); }

MsgPack-RPC for Python

Install msgpack-rpc • pip install msgpack-rpc-python

import msgpackrpc; Random Play import random import time server_ip = "localhost" server_port = 10612 client = msgpackrpc.Client(msgpackrpc.Address(server_ip, server_port)) client.call('reset') while True: action = random.randint(0, 4) state = client.call('step', action) #print(state) reward = client.call('getTotalReward') done = client.call('isDone') if done: print('Game over. Reward {}'.format(reward)) client.call('reset')

# Implement OpenAI Gym interface Inheriting import gym import msgpackrpc # install msgpack-rpc-python import numpy as np OpenAI Gym class TetrisGym(gym.Env): board_width = 10 board_height = 22 def __init__(self, ip='localhost', port=10612): self.action_space = gym.spaces.Discrete(5) self.observation_space = gym.spaces.Box( low=0, high=7, shape=(self.board_height*self.board_width, 1), dtype=np.float32); self.client = msgpackrpc.Client(msgpackrpc.Address(ip, port)) # initialize environment settings self.done = False def reset(self): return self.client.call('reset') def step(self, action): observation = self.client.call('step', action) reward = self.client.call('getTotalReward') self.done = self.client.call('isDone') info = None return observation, reward, self.done, info;

References • https://msgpack.org/index.html

OOP Inheri -tance En- capsu- lation Learning to Play Tetris via - PowerPoint PPT Presentation

Poly- mor- phism Abstra ction Class OOP Inheri -tance En- capsu- lation Learning to Play Tetris via Kuan-Ting Lai 2020/5/25 Deep Reinforcement Learning Deep Reinforcment Learning (DRL) Establishing Communiation between Java and

Why? Todays Goals: Introduce OOP philosophy Explain why OOP is useful Show OOP in

MCIS/UA PHP Training 2003 Chapter 5 Object Oriented Concepts Datasource/Database Classes OOP

OOP with Java Yuanbin Wu cs@ecnu OOP with Java Project 5: 5 9 9

Abstraction and OOP Tiziana Ligorio 1 Todays Plan Announcements Recap Abstraction OOP

CSCI-1101A Object-Oriented Programming (OOP) Mohammad T . Irfan Reading: Ch 16 (Guzdial)

OOP in Java ESOF 427: Software Design and Architecture Keith Vertanen Overview Object

JAVA Basics & OOP 2020/3/14 Write Once, Run Anywhere Java Virtual Machine (JVM)

Object Oriented Programming (OOP) and introduction to BioPerl Laurent Falquet (original course

Advanced Computer Programming Inf1BOb ject-Oriented Programming LAB 07 (week 8) - Widgets OOP

C++ Basics & OOP 2019/3/14 History of C++ Developed by Bjarne Stroustrup starting in

CSCI-1101A Object-Oriented Programming (OOP) Mohammad T . Irfan Reading: Handout (also on

Objects for the Masses Marcus Brger PHP Quebec 0 9 : http:/ / talks.som abo.de/ 2 0 0 9 0 3

CS 251 Fall 2019 CS 251 Fall 2019 Two world views Principles of Programming Languages

Contents Differences of OOP from procedural programming Chapter 1. Introduction to Objects

PROBLEM BASED APPROACH IN TEACHING OOP COURSES ANA MADEVSKA BOGDANOVA FCSE UKIM MACEDONIA

Inheritance Tiziana Ligorio 1 Todays Plan Recap Useful C++ / OOP Intro to Inheritance

DS595/CS525: Reinforcement Learning --Introduction & Logistics Prof. Yanhua Li Time: 6:00pm

Deep Learning and Mixed Integer Optimization Matteo Fischetti, University of Padova 1 Designing

CSE 291D/234 Data Systems for Machine Learning Fall 2020 Arun Kumar 1 About Myself 2009:

Adversarial Search (Game Playing) Chapter 5 Adapted from materials by Tim Finin, Marie

Classical Planning Systems Chapter 10 R&N ICS 271 Fall 2016 Outline: Planning Planning

The Situation Calculus and Golog Golog A Tutorial Gerhard Lakemeyer Dept. of Computer

The Situation Calculus and the Frame Problem Using Theorem Proving to Generate Plans 1

HSBC Finance Corporation IFRS Management Basis 9 November 2011 1 Disclosure Statement This

Sambuz

Useful Links

Newsletter

Mail Us

OOP Inheri -tance En- capsu- lation Learning to Play Tetris via - PowerPoint PPT Presentation

Poly- mor- phism Abstra ction Class OOP Inheri -tance En- capsu- lation Learning to Play Tetris via Kuan-Ting Lai 2020/5/25 Deep Reinforcement Learning Deep Reinforcment Learning (DRL) Establishing Communiation between Java and

Why? Todays Goals: Introduce OOP philosophy Explain why OOP is useful Show OOP in

MCIS/UA PHP Training 2003 Chapter 5 Object Oriented Concepts Datasource/Database Classes OOP

OOP with Java Yuanbin Wu cs@ecnu OOP with Java Project 5: 5 9 9

Abstraction and OOP Tiziana Ligorio 1 Todays Plan Announcements Recap Abstraction OOP

CSCI-1101A Object-Oriented Programming (OOP) Mohammad T . Irfan Reading: Ch 16 (Guzdial)

OOP in Java ESOF 427: Software Design and Architecture Keith Vertanen Overview Object

JAVA Basics &amp; OOP 2020/3/14 Write Once, Run Anywhere Java Virtual Machine (JVM)

Object Oriented Programming (OOP) and introduction to BioPerl Laurent Falquet (original course

Advanced Computer Programming Inf1BOb ject-Oriented Programming LAB 07 (week 8) - Widgets OOP

C++ Basics &amp; OOP 2019/3/14 History of C++ Developed by Bjarne Stroustrup starting in

CSCI-1101A Object-Oriented Programming (OOP) Mohammad T . Irfan Reading: Handout (also on

Objects for the Masses Marcus Brger PHP Quebec 0 9 : http:/ / talks.som abo.de/ 2 0 0 9 0 3

CS 251 Fall 2019 CS 251 Fall 2019 Two world views Principles of Programming Languages

Contents Differences of OOP from procedural programming Chapter 1. Introduction to Objects

PROBLEM BASED APPROACH IN TEACHING OOP COURSES ANA MADEVSKA BOGDANOVA FCSE UKIM MACEDONIA

Inheritance Tiziana Ligorio 1 Todays Plan Recap Useful C++ / OOP Intro to Inheritance

DS595/CS525: Reinforcement Learning --Introduction &amp; Logistics Prof. Yanhua Li Time: 6:00pm

Deep Learning and Mixed Integer Optimization Matteo Fischetti, University of Padova 1 Designing

CSE 291D/234 Data Systems for Machine Learning Fall 2020 Arun Kumar 1 About Myself 2009:

Adversarial Search (Game Playing) Chapter 5 Adapted from materials by Tim Finin, Marie

Classical Planning Systems Chapter 10 R&amp;N ICS 271 Fall 2016 Outline: Planning Planning

The Situation Calculus and Golog Golog A Tutorial Gerhard Lakemeyer Dept. of Computer

The Situation Calculus and the Frame Problem Using Theorem Proving to Generate Plans 1

HSBC Finance Corporation IFRS Management Basis 9 November 2011 1 Disclosure Statement This

Sambuz

Useful Links

Newsletter

Mail Us

JAVA Basics & OOP 2020/3/14 Write Once, Run Anywhere Java Virtual Machine (JVM)

C++ Basics & OOP 2019/3/14 History of C++ Developed by Bjarne Stroustrup starting in

DS595/CS525: Reinforcement Learning --Introduction & Logistics Prof. Yanhua Li Time: 6:00pm

Classical Planning Systems Chapter 10 R&N ICS 271 Fall 2016 Outline: Planning Planning