Time-efficient Offloading for Machine Learning Tasks between Embedded Systems and Fog Nodes
Darren Saguil (darren.saguil@uoit.net), Akramul Azim (akramul.azim@uoit.ca)
Introduction
• Machine Learning allows you to gain insights and analysis from seemingly unrelated features
• Embedded Systems can leverage this to provide novel functions, such as automation, navigation, and classification
Motivation
• Machine learning is computationally expensive and embedded systems are resource-constrained, so the status quo is to perform all Machine Learning applications on external devices. This may lead to some problems, such as:
  – The runtime of the ML models is bottlenecked by the transmission time
  – Losing connection can impact the device's functionality [1]
• ML algorithms come in many different complexities, and embedded systems may be able to run some, but not all, of them
• Our contribution is a time-efficient distribution threshold that lessens the reliance of embedded systems on fog servers by running some inputs locally and offloading only when needed
Methodology
• A simulated sensor sent machine learning inputs to a component on the embedded system called the "Offloader". This component determined:
  – which model the input is being sent to
  – whether to send the input to the local or remote model
• Pre-runtime, the WCET of each model was measured using a validation set. The offloader compared this WCET against a threshold measured during runtime, consisting of:
  – T_L, the time to wirelessly transfer the data
  – T_F, the time to execute the model on the external device
• An input is therefore run locally when the local model's WCET is at most T_L + T_F, and offloaded otherwise; a minimal sketch of this decision logic follows below
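The slides do not include code, so the following is only a minimal Python sketch of the decision rule described above, assuming the local WCET table is measured pre-runtime on the validation set and T_L/T_F are refreshed from runtime measurements. All names here (Offloader, update_threshold, dispatch, remote_client.infer, predict) are illustrative assumptions, not the authors' implementation.

```python
class Offloader:
    """Routes each ML input to the local or remote model based on timing.

    wcet_local: dict mapping model name -> worst-case execution time,
    measured pre-runtime on a validation set (per the methodology slide).
    """

    def __init__(self, wcet_local, local_models, remote_client):
        self.wcet_local = wcet_local
        self.local_models = local_models
        self.remote_client = remote_client
        self.t_l = 0.0  # measured wireless transfer time (T_L)
        self.t_f = 0.0  # measured fog-node execution time (T_F)

    def update_threshold(self, t_l, t_f):
        # Called whenever fresh timing measurements arrive at runtime.
        self.t_l = t_l
        self.t_f = t_f

    def dispatch(self, model_name, x):
        threshold = self.t_l + self.t_f
        if self.wcet_local[model_name] <= threshold:
            # Running locally is no slower than offloading, even in the
            # worst case, so keep the input on the embedded device.
            return self.local_models[model_name].predict(x)
        # Otherwise ship the input to the fog node.
        return self.remote_client.infer(model_name, x)
```

For instance, if wcet_local["mlp"] is 4 ms while the measured T_L + T_F is 50 ms, MLP inputs stay on the device; a CNN with a 200 ms local WCET would be offloaded.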
Results and Analysis
• The system model was run using both linear (1D) and image datasets on a Multilayer Perceptron (MLP) and a Convolutional Neural Network (CNN)
• The top graphs show the time taken to run every model locally and the time taken to run each model externally; they show that the models using the MLP can be run locally
• The bottom graph shows the results of a theoretical device running multiple models of varying complexities. With the proposed offloader, offloading only the inputs whose local WCET exceeds the threshold reduces the total runtime
Conclusion
• The status quo of performing every embedded system's Machine Learning application on external devices can be improved. The transmission time is a severe bottleneck, and simple Machine Learning applications can avoid it by running locally
• One of the main factors that determines whether a model's input should be offloaded is the model's complexity. For example, the CNN used in this experiment had a large dense layer with 128 output nodes, which made its runtime much longer than that of the MLP, whose dense layer had only 32 output nodes (a rough sketch of the two architectures follows below)
• Runtime is not the only aspect to consider when running Machine Learning applications; energy consumption and temperature should also be examined
• The Machine Learning models themselves could also be partitioned, instead of offloading the functionality of the entire model [2]
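The exact architectures are not given in the slides; the Keras sketch below is a hypothetical reconstruction based only on the stated dense-layer widths (128 output nodes for the CNN, 32 for the MLP), so the input shapes, filter count, and class count are assumptions added for illustration.

```python
import tensorflow as tf
from tensorflow.keras import layers

# Hypothetical MLP for 1-D (linear) inputs; only the 32-node dense
# layer is taken from the slides, everything else is assumed.
mlp = tf.keras.Sequential([
    tf.keras.Input(shape=(64,)),             # assumed 1-D feature vector
    layers.Dense(32, activation="relu"),     # 32 output nodes (from slides)
    layers.Dense(10, activation="softmax"),  # assumed 10 classes
])

# Hypothetical CNN for image inputs; only the 128-node dense layer
# is taken from the slides.
cnn = tf.keras.Sequential([
    tf.keras.Input(shape=(28, 28, 1)),       # assumed image shape
    layers.Conv2D(16, 3, activation="relu"),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),    # 128 output nodes (from slides)
    layers.Dense(10, activation="softmax"),
])

# Printing the summaries shows why the CNN is slower on-device: the
# flattened convolutional output feeding a 128-wide dense layer yields
# far more weights than the MLP's 32-wide layer.
mlp.summary()
cnn.summary()
```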
References
1. Sam Leroux, Steven Bohez, Elias De Coninck, Tim Verbelen, Bert Vankeirsbilck, Pieter Simoens, and Bart Dhoedt. The cascading neural network: building the internet of smart things. Knowledge and Information Systems, 52:791–814, 2017.
2. Yiping Kang, Johann Hauswald, Cao Gao, Austin Rovinski, Trevor Mudge, Jason Mars, and Lingjia Tang. Neurosurgeon: Collaborative intelligence between the cloud and mobile edge. In Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS '17, pages 615–629, New York, NY, USA, 2017. ACM.