Exploring Sparsity in Recurrent Neural Networks
Sharan Narang
May 9, 2017
Silicon Valley AI Lab
Speech Recognition with Deep Learning (English)
Scaling with Data: Comparison of Speech Recognition Approaches
[Chart: Accuracy vs. Data + Model Size (Speed), comparing Deep Learning with Traditional methods]
Model Sizes (Baidu Speech Models)

Model               | Number of Parameters (millions) | Size (MB)
Deep Speech 1       | 8.14                            | 32.56
Deep Speech 2 (RNN) | 67.70                           | 270.79
Deep Speech 2 (GRU) | 115.47                          | 461.87
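The size column follows directly from the parameter counts, assuming 4 bytes per single-precision weight. A quick sketch of that arithmetic:

```python
# Rough check: model size in MB from parameter count,
# assuming single-precision (4-byte) floating-point weights.
MODELS = {
    "Deep Speech 1": 8.14e6,
    "Deep Speech 2 (RNN)": 67.70e6,
    "Deep Speech 2 (GRU)": 115.47e6,
}

for name, params in MODELS.items():
    size_mb = params * 4 / 1e6  # bytes -> megabytes
    print(f"{name}: {params / 1e6:.2f}M params -> {size_mb:.2f} MB")
```

Running this reproduces the 32.56 MB, 270.79 MB, and 461.87 MB figures in the table above.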
Future Vision
Sparse Neural Networks
Pruning Weights
[Diagram: a dense network at the start of training has weights progressively pruned during training, leaving a sparse network at the end of training; x-axis: epochs]
Pruning Approach
[Chart: prune threshold (0 to 0.5) vs. epoch (0 to 20); the threshold ramps up over training, with separate curves for recurrent and linear layers]
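A minimal sketch of the threshold-based pruning idea shown above: the threshold ramps up over training, weights whose magnitude falls below it are zeroed, and a mask keeps them at zero afterwards. The linear ramp and the start_epoch / end_epoch / final_threshold values here are illustrative placeholders, not the exact per-layer schedule used in the work.

```python
import numpy as np

def prune_threshold(epoch, start_epoch=1, end_epoch=20, final_threshold=0.4):
    """Illustrative monotonically increasing threshold schedule:
    zero before pruning starts, then a linear ramp to final_threshold."""
    if epoch < start_epoch:
        return 0.0
    frac = min(1.0, (epoch - start_epoch) / (end_epoch - start_epoch))
    return final_threshold * frac

def apply_pruning(weights, mask, threshold):
    """Zero out weights whose magnitude is below the current threshold.
    Once a weight is pruned (mask == False) it stays pruned."""
    mask &= np.abs(weights) >= threshold
    return weights * mask, mask

# Toy usage: one recurrent weight matrix pruned over 20 epochs.
rng = np.random.default_rng(0)
W = rng.normal(scale=0.2, size=(1760, 1760))
mask = np.ones_like(W, dtype=bool)

for epoch in range(20):
    # ... run a training epoch that updates W ...
    W, mask = apply_pruning(W, mask, prune_threshold(epoch))

print("final sparsity:", 1.0 - mask.mean())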
Pruning Layers
[Chart: sparsity (pruned percent, 70% to 100%) for each of the 14 layers]
Results

Model      | Layer Size | # of Params  | CER   | Relative Perf
RNN Dense  | 1760       | 67 million   | 10.67 | 0.0%
RNN Sparse | 1760       | 8.3 million  | 12.88 | -20.71%
RNN Sparse | 2560       | 11.1 million | 10.59 | 0.75%
RNN Sparse | 3072       | 16.7 million | 10.25 | 3.95%
GRU Dense  | 2560       | 115 million  | 9.55  | 0.0%
GRU Sparse | 2560       | 13 million   | 10.87 | -13.82%
GRU Sparse | 3568       | 17.8 million | 9.76  | -2.2%
Equal Parameter Networks
[Chart: CTC cost (20 to 60) vs. epoch (0 to 20) for small_dense_train, small_dense_dev0, large_sparse_train, and large_sparse_dev0]
Sparsity vs. Accuracy
[Chart: relative accuracy vs. sparsity (0% to 100%), with points annotated at 10.89 CER (near the baseline), 13.0 CER (around -20% relative accuracy), and 17.4 CER (around -60%)]
Models don’t need to be retrained
Compression (RNN Model)
[Chart: compression factor for the sparse RNN models: 8.11x for 1760 Sparse, 6.06x for 2560 Sparse, 4.04x for 3072 Sparse]
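The compression factors are simply the dense-to-sparse parameter ratios from the results table; a quick check (the small differences from the chart values come from rounding in the table):

```python
# Compression factor = dense parameter count / sparse parameter count,
# using the rounded counts from the results table.
dense_params = 67e6
sparse_models = {"1760 Sparse": 8.3e6, "2560 Sparse": 11.1e6, "3072 Sparse": 16.7e6}

for name, params in sparse_models.items():
    print(f"{name}: {dense_params / params:.2f}x")
```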
Speedup
[Chart: measured vs. expected speedup for the 1760, 2560, and 3072 sparse models; the measured speedup is consistently lower than the expected speedup]
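A rough way to see where the measured/expected gap comes from is to time a dense matrix-vector multiply against a CSR sparse one at roughly 90% sparsity. The library (SciPy), sizes, and density below are assumptions for illustration, not the benchmark behind the chart.

```python
import time
import numpy as np
import scipy.sparse as sp

n = 1760          # layer size of the smallest sparse model
density = 0.1     # ~90% sparsity, in line with the pruning results

W_dense = np.random.randn(n, n).astype(np.float32)
W_sparse = sp.random(n, n, density=density, format="csr", dtype=np.float32)
x = np.random.randn(n).astype(np.float32)

def bench(fn, reps=1000):
    """Average wall-clock time per call over `reps` repetitions."""
    start = time.perf_counter()
    for _ in range(reps):
        fn()
    return (time.perf_counter() - start) / reps

t_dense = bench(lambda: W_dense @ x)
t_sparse = bench(lambda: W_sparse @ x)
print(f"measured speedup: {t_dense / t_sparse:.2f}x (ideal ~{1 / density:.0f}x)")
```

Indexing overhead and irregular memory access typically keep the measured speedup well below the 1/density ideal, which is the gap the chart illustrates.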
Conclusion
• Sparse neural networks can achieve good accuracy while significantly reducing the number of parameters
• The threshold-based pruning approach works for fully connected, recurrent, and GRU layers
• Improvements in sparse matrix-vector libraries can yield higher speedups for sparse neural networks
Thank You!
Sharan Narang sharan@baidu.com http://research.baidu.com Silicon Valley AI Lab