Super Mario Bros. problem Input space 5 buttons per frame 24000 - PowerPoint PPT Presentation

Super Mario Bros. problem

Input space ◮ 5 buttons per frame ◮ 24000 frames ◮ 5 24000 ≈ 1 . 9 × 10 16775 possible input sequences Exhaustive search won’t work here.

Tuning process

Naive Representation 1 http://youtu.be/nyYdq1jJQrw

Naive Representation ◮ Bad, because most configurations make no sense. ◮ Just mashing random buttons. ◮ Doesn’t work at all (Video 1 ). 1 http://youtu.be/nyYdq1jJQrw

Better Representation ◮ Movements (list): ◮ Direction (left, right, run left, or run right) ◮ Duration (frames)

Better Representation ◮ Movements (list): ◮ Direction (left, right, run left, or run right) ◮ Duration (frames) ◮ Jumps (list): ◮ Start frame ◮ Duration (frames)

Better Representation ◮ Movements (list): ◮ Direction (left, right, run left, or run right) ◮ Duration (frames) ◮ Jumps (list): ◮ Start frame ◮ Duration (frames) Choosing the right representation is critical ◮ Search space size 10 6328 ◮ Winning run found in 13641 ( ≈ 10 4 ) attempts ◮ Under 5 minutes of training time

Super Mario Bros Results 3500 Pixels Moved Right (Progress) 3000 2500 2000 1500 Win Level OpenTuner 1000 0 60 120 180 240 300 Autotuning Time (seconds)

StreamJIT input 5 LowPassFilter 1 1 (2) Synchronous dataflow programs FMDemodulator are graphs of (mostly) stateless 1 6 workers with statically-known DuplicateSplitter 1 x6 data rates. 1 x6 DuplicateSplitter 1 x2 1 (4) 1 (4) Using the data rates, the LowPassFilter LowPassFilter compiler can compute a schedule 1 1 1 x2 of worker executions, fuse RoundrobinJoiner 2 workers and introduce buffers to 2 Subtractor remove synchronization, then 1 1 choose a combination of data, Amplifier 1 task and pipeline parallelism to 1 x6 RoundrobinJoiner fit the machine. 6 6 Summer 1 output

Fusion, data-parallel fission and splitter/joiner removal BandPass BandPass BandPass BandPass Compress Compress Compress Compress Process Process Expand Expand Process Process Expand Expand BandStop BandStop BandStop BandStop Adder Adder Adder Adder Adder Adder

Autotuning StreamJIT delegates its optimization decisions to OpenTuner, which decides ◮ an overall schedule multiplier (to amortize synchronization) ◮ whether to fuse workers ◮ whether to remove splitters and joiners ◮ buffer implementations ◮ how to allocate fused groups to cores

Autotuning work allocation Equal distribution across all cores is usually the best, but we need to load-balance around stateful workers. ◮ Bitset per worker, one bit per core: exponentially hard to get equal distribution (all bits set). ◮ Array of floats summing to 1.0, one float per core: allows load-balancing, but equal distribution is even harder.

Autotuning work allocation Equal distribution across all cores is usually the best, but we need to load-balance around stateful workers. ◮ Bitset per worker, one bit per core: exponentially hard to get equal distribution (all bits set). ◮ Array of floats summing to 1.0, one float per core: allows load-balancing, but equal distribution is even harder. ◮ Permutation of cores, total count, bias count and bias fraction: equal division across cores, biased for load balancing.

Bias fraction work allocation Use the first count cores of the permutation, moving fraction of the work from the first bias count cores. Doesn’t cover all possibilities, but covers the good ones.

Custom techniques StreamJIT uses custom techniques that force the obvious defaults. Other techniques make some good and some bad changes: ↑ - ↓ -- ↑ - ↓↑↑ - ↓ Custom techniques will then force some of the bad changes back: ↑ ---- ↑ - ↓↑↑ -- Bandit will learn to stop using the custom techniques when they stop working or for unusual graphs where the obvious defaults are bad.

Super Mario Bros. problem Input space 5 buttons per frame 24000 - PowerPoint PPT Presentation

Super Mario Bros. problem Input space 5 buttons per frame 24000 frames 5 24000 1 . 9 10 16775 possible input sequences Exhaustive search wont work here. Tuning process Naive Representation 1 http://youtu.be/nyYdq1jJQrw

Super Mario Bros. Is Harder/Easier than We Thought FUN 2016 Erik D. Demaine, Giovanni Viglietta,

Social Media Audit: Dutch Bros. Coffee Maddie Beaulieu, Nicole Rideout, Taylor Davis and Joe

AnimeNintendo Tue 25 Feb 2020 10:39:30 AM CST

UNVEILING THE SUPER ORBITAL UNVEILING THE SUPER ORBITAL UNVEILING THE SUPER-ORBITAL UNVEILING

SYNTHESIS OF SUPER SYNTHESIS OF SUPER NANOPOROUS SYNTHESIS OF SUPER SYNTHESIS OF

File Input and Output File Input and Output 1 / 9 File input/output input function reads values

Input Input devices Text entry Positional input Input Devices 1 iPod Wheel Input Devices 2

Following the Packets: A Walk Through Bros Internal Processing Pipeline Robin Sommer

Tra ffi c Management as a Service | Ghent, Belgium INPUT PROCESS OUTPUT INPUT PROCESS OUTPUT

Input Input devices Text entry Positional input Input Devices 1 MacBook Wheel (The Onion) -

Super GPU & Super Kernels: Make programming of multi-GPU systems easy Michael Frumkin, May 8,

THE FALL 2018 NFL PRIMETIME SEASON & THE SUPER BOWL KTG CONTENT STRATEGY SUPER BOWL

Bigger is Better Trends in super computers, super software, and super data Michael L. Norman,

Super- -Kamiokande Kamiokande s s Solar Neutrino results Solar Neutrino results Super

Input Devices Managing text and positional input 1 CS 349 - Input Devices iPod Wheel 2 CS 349

Expanding the Reach of Fuzzing Caroline Lemieux September 8 th , 2020 Fuzzcon Europe

APIs and Mobile and Online Privacy Scene-setting, Regulations and Controversies W3C Device API

WS-* vs. REST: Mashing up the Truth from Facts, Myths and Lies Sanjiva Weerawarana, Ph.D.

Databases SQL, NoSQL, ORMs, REST/GraphQL, JSON/gRPC SQL QL (1974) 4) Initially, relational

and Motion Planning Multi robot motion planning: Extended review Dan Halperin School of Computer

Data- and Service-Centric Orchestration of the Internet of Things Overview Sheet Table of

Fuzzy-based Sensor Search in the Web of Things by Cuong Truong University of Lbeck, Germany

Section 1.4: Non-annual compounded interest MATH 105: Contemporary Mathematics University of

Definition of a subring Let R be a ring, and let S R be a subset. Idea We say S is a subring

Super Mario Bros. problem Input space 5 buttons per frame 24000 - PowerPoint PPT Presentation

Super Mario Bros. problem Input space 5 buttons per frame 24000 frames 5 24000 1 . 9 10 16775 possible input sequences Exhaustive search wont work here. Tuning process Naive Representation 1 http://youtu.be/nyYdq1jJQrw

Super Mario Bros. Is Harder/Easier than We Thought FUN 2016 Erik D. Demaine, Giovanni Viglietta,

Social Media Audit: Dutch Bros. Coffee Maddie Beaulieu, Nicole Rideout, Taylor Davis and Joe

AnimeNintendo Tue 25 Feb 2020 10:39:30 AM CST

UNVEILING THE SUPER ORBITAL UNVEILING THE SUPER ORBITAL UNVEILING THE SUPER-ORBITAL UNVEILING

SYNTHESIS OF SUPER SYNTHESIS OF SUPER NANOPOROUS SYNTHESIS OF SUPER SYNTHESIS OF

File Input and Output File Input and Output 1 / 9 File input/output input function reads values

Input Input devices Text entry Positional input Input Devices 1 iPod Wheel Input Devices 2

Following the Packets: A Walk Through Bros Internal Processing Pipeline Robin Sommer

Tra ffi c Management as a Service | Ghent, Belgium INPUT PROCESS OUTPUT INPUT PROCESS OUTPUT

Input Input devices Text entry Positional input Input Devices 1 MacBook Wheel (The Onion) -

Super GPU &amp; Super Kernels: Make programming of multi-GPU systems easy Michael Frumkin, May 8,

THE FALL 2018 NFL PRIMETIME SEASON &amp; THE SUPER BOWL KTG CONTENT STRATEGY SUPER BOWL

Bigger is Better Trends in super computers, super software, and super data Michael L. Norman,

Super- -Kamiokande Kamiokande s s Solar Neutrino results Solar Neutrino results Super

Input Devices Managing text and positional input 1 CS 349 - Input Devices iPod Wheel 2 CS 349

Expanding the Reach of Fuzzing Caroline Lemieux September 8 th , 2020 Fuzzcon Europe

APIs and Mobile and Online Privacy Scene-setting, Regulations and Controversies W3C Device API

WS-* vs. REST: Mashing up the Truth from Facts, Myths and Lies Sanjiva Weerawarana, Ph.D.

Databases SQL, NoSQL, ORMs, REST/GraphQL, JSON/gRPC SQL QL (1974) 4) Initially, relational

and Motion Planning Multi robot motion planning: Extended review Dan Halperin School of Computer

Data- and Service-Centric Orchestration of the Internet of Things Overview Sheet Table of

Fuzzy-based Sensor Search in the Web of Things by Cuong Truong University of Lbeck, Germany

Section 1.4: Non-annual compounded interest MATH 105: Contemporary Mathematics University of

Definition of a subring Let R be a ring, and let S R be a subset. Idea We say S is a subring

Super GPU & Super Kernels: Make programming of multi-GPU systems easy Michael Frumkin, May 8,

THE FALL 2018 NFL PRIMETIME SEASON & THE SUPER BOWL KTG CONTENT STRATEGY SUPER BOWL