

  1. Shallow RNNs: A Method for Accurate Time-series Classification on Tiny Devices* Don Kurian Dennis, Durmus Alp Emre Acar, Vikram Mandikal, Vinu Sankar Sadasivan, Harsha Vardhan Simhadri, Venkatesh Saligrama, Prateek Jain *Slides to be updated.

  2. Outline • Introduction • Background • Shallow RNNs • Results

  3. Introduction
  • Time series classification:
    • Detecting events in a continuous stream of data.
    • The data is partitioned into overlapping windows (sliding windows).
    • Detection/classification is performed on each window.
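As an illustration of this sliding-window setup, here is a minimal NumPy sketch (the function name sliding_windows and the window/stride values are illustrative assumptions, not from the slides):

    import numpy as np

    def sliding_windows(stream, window_len, stride):
        # Split a 1-D stream into overlapping windows of length window_len, one every stride steps.
        n = (len(stream) - window_len) // stride + 1
        return np.stack([stream[i * stride : i * stride + window_len] for i in range(n)])

    # Example: 1000 sensor samples, 128-sample windows with 50% overlap.
    stream = np.random.randn(1000)
    windows = sliding_windows(stream, window_len=128, stride=64)
    print(windows.shape)  # (14, 128) -- each row is one window to classify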

  4. Introduction
  • Time series on tiny devices:
    • Resource scarcity (a few KB of RAM, tiny processors).
    • Cannot run standard DNN techniques.
  • Examples:
    • Interactive cane for people with visual impairment [24]: recognizes gestures arriving as time traces on a sensor; 32 KB RAM, 40 MHz processor.
    • Audio keyword classification on MXChip: detects speech commands and keywords; 100 MHz processor, 256 KB RAM.

  5. Background
  • How do we solve time-series problems on tiny devices?
  • RNNs:
    • A good fit for time-series problems with long dependencies.
    • Smaller models, but no parallelization [28, 14]; require O(T) time. Small, but too slow!
  • CNNs:
    • Can be adapted to time-series problems.
    • Higher parallelization [28, 14], but much larger working RAM. Fast, but too big!

  6. Shallow RNN - ShaRNN
  • Parallelization
  • Small size
  • Compute reuse

  7. Shallow RNN - ShaRNN
  • A hierarchical collection of RNNs organized in two levels.
  • The output of the first layer is the input of the second layer.
  • The input data y_{1:T} is split into bricks of size l.

  8. Shallow RNN - ShaRNN
  • The first-layer RNN R^(1) is applied to each brick; φ^(1) denotes the outputs of R^(1).
  • The R^(1) bricks:
    • operate completely in parallel,
    • fully share parameters.
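A minimal NumPy sketch of this two-level structure, assuming a plain tanh RNN cell (the function names and the cell choice are illustrative, not the authors' implementation): the same first-layer RNN is run over every brick with shared weights, and the second-layer RNN consumes the per-brick outputs.

    import numpy as np

    def rnn_layer(x, W, U, b):
        # Simple tanh RNN over x of shape (T, d_in); returns the final hidden state.
        h = np.zeros(U.shape[0])
        for t in range(x.shape[0]):
            h = np.tanh(W @ x[t] + U @ h + b)
        return h

    def sharnn_forward(x, params1, params2, brick_len):
        # Layer 1: the same RNN is applied to every length-brick_len brick (parallelizable, shared weights).
        # Layer 2: a second RNN runs over the sequence of per-brick outputs.
        T = x.shape[0]
        bricks = x.reshape(T // brick_len, brick_len, -1)          # assumes T is divisible by brick_len
        phi = np.stack([rnn_layer(b, *params1) for b in bricks])   # output of R^(1) on each brick
        return rnn_layer(phi, *params2)                            # feature fed to the final classifier

    # Example: T = 128 steps of 16-dim input, bricks of length 8, hidden size 32 in both layers.
    rng = np.random.default_rng(0)
    d_in, d1, d2 = 16, 32, 32
    p1 = (0.1 * rng.standard_normal((d1, d_in)), 0.1 * rng.standard_normal((d1, d1)), np.zeros(d1))
    p2 = (0.1 * rng.standard_normal((d2, d1)), 0.1 * rng.standard_normal((d2, d2)), np.zeros(d2))
    feat = sharnn_forward(rng.standard_normal((128, d_in)), p1, p2, brick_len=8)
    print(feat.shape)  # (32,)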

  9. Shallow RNN - ShaRNN
  • l is a hyperparameter that controls inference time:
    • R^(1) runs on bricks of length l,
    • R^(2) runs on the resulting length-T/l series,
    • overall O(T/l + l) inference time.
  • If l = O(√T), the overall time is O(√T) instead of O(T).
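A quick back-of-the-envelope check of this trade-off (assuming the first-layer bricks run fully in parallel, so the sequential cost is roughly T/l + l RNN steps):

    T = 10_000                   # series length
    for l in (10, 50, 100, 200, 1000):
        print(l, T // l + l)     # T/l second-layer steps plus l steps for one first-layer brick
    # l = 100 (about sqrt(T)) gives ~200 sequential steps, versus T = 10,000 for a single flat RNN.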

  10. Results - Datasets
  • Our method achieves similar or better accuracy than the baselines on all but one dataset.
  • Different model sizes (different hidden-state sizes) are the numbers in brackets; MI-ShaRNN reports two numbers, for the first and the second layer.
  • Computational cost (amortized number of FLOPs required per data-point inference) is reported for each method.
  • MI refers to the method of [10], which leads to smaller models and is orthogonal to ShaRNN.

  11. Results - Deployment
  • Accuracy of different methods vs. inference-time cost (ms).
  • Deployment on a Cortex-M4: 256 KB RAM and a 100 MHz processor; total inference-time budget of 120 ms.
  • Low-latency keyword spotting (Google-13).
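The compute-reuse point from slide 6 is what keeps the amortized streaming cost low: consecutive overlapping windows share most of their bricks, so first-layer brick outputs can be cached and only the newest brick is recomputed when the window slides. A hypothetical sketch, reusing rnn_layer from the ShaRNN sketch above (the cache layout is an assumption, not the deployed firmware):

    import numpy as np

    brick_cache = {}  # brick start index -> cached first-layer output

    def window_phi(stream, start, window_len, brick_len, params1):
        # First-layer outputs for the window starting at index start, reusing cached bricks.
        phis = []
        for s in range(start, start + window_len, brick_len):
            if s not in brick_cache:
                brick_cache[s] = rnn_layer(stream[s:s + brick_len], *params1)
            phis.append(brick_cache[s])
        return np.stack(phis)  # only one new brick is computed when the window advances by brick_len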

  12. Demo Video Here: dkdennis.xyz/static/sharnn-neurips19-demo.mp4

  13. Thank you!
