Trained Rank Pruning for Efficient Deep Neural Networks
EMC2 Workshop @ NeurIPS 2019
Outline
• Low Rank (LR) Models
• Methods for obtaining LR models
  • Decompose a pre-trained model
  • Retrain an LR decomposed model
• Challenges of existing methods
• Trained Rank Pruning
  • Training an LR model directly with two interleaved steps:
    • Step A: rank conditioning with a nuclear norm constraint and its sub-gradient
    • Step B: rank pruning with LR decomposition
• Experimental Results
LR Models
• Rank pruning with LR decomposition
  • Decompose a pre-trained model (see the sketch below)
    • Small approximation errors can compound into a large prediction loss; fine-tuning is required to recover part of the accuracy drop.
  • Retrain an LR decomposed model
    • It is hard to select the optimal rank for each layer to achieve a good balance between model capacity and compression.
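As a concrete illustration of the first route, a pre-trained convolution can be decomposed channel-wise with a truncated SVD. The following is a minimal PyTorch sketch, not the paper's implementation; the function name and the fixed-rank interface are our assumptions.

```python
import torch

def channelwise_decompose(weight, rank):
    """Channel-wise low-rank decomposition of a pre-trained conv weight.

    weight: (C_out, C_in, k, k) tensor. Returns the weights of a
    rank x C_in k x k conv followed by a C_out x rank 1 x 1 conv
    whose composition approximates the original filters.
    """
    c_out, c_in, kh, kw = weight.shape
    # Flatten everything except the output channels into one matrix.
    mat = weight.reshape(c_out, c_in * kh * kw)
    U, S, Vh = torch.linalg.svd(mat, full_matrices=False)
    # Keep only the top-`rank` singular triplets (rank pruning).
    first = (S[:rank, None] * Vh[:rank]).reshape(rank, c_in, kh, kw)
    second = U[:, :rank].reshape(c_out, rank, 1, 1)
    return first, second
```

Truncating the SVD gives the best Frobenius-norm approximation at a given rank, but when the network was trained without any rank constraint, even a small weight-approximation error can still cause the large prediction loss mentioned above, which is why fine-tuning is needed.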
Trained Rank Pruning
Our trained rank pruning (TRP) method has two interleaved steps:
(A) Conventional SGD training with nuclear norm regularization and its sub-gradient, conditioning the network to be LR-compatible
  • Nuclear norm regularized objective: $\min_{W} \; f(x; W) + \lambda \sum_{l=1}^{L} \lVert W_l \rVert_*$
  • Sub-gradient descent [1]: $T_{sub} = \nabla f + \lambda \, U_{tru} V_{tru}^{T}$, where $W = U \Sigma V^{T}$ is the SVD of the weight matrix and $U_{tru}$, $V_{tru}$ are $U$, $V$ truncated to the first $\mathrm{rank}(W)$ columns (see the sketch below).
(B) Training with LR decomposition, obtaining the LR network through rank pruning
  • forward: decompose the original filters T into LR filters T_low;
  • backward: update the decomposed LR filters T_low with SGD, then substitute them for the original filters.
[1] H. Avron, S. Kale, S. P. Kasiviswanathan, and V. Sindhwani. Efficient and practical stochastic subgradient descent for nuclear norm regularization. In ICML, 2012.
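Step A only needs the sub-gradient of the nuclear norm, which is $U_{tru} V_{tru}^{T}$ from the SVD of each weight [1]. A minimal PyTorch sketch follows, assuming a simple threshold on singular values to estimate $\mathrm{rank}(W)$; the helper name and the tolerance are our assumptions.

```python
import torch

def nuclear_norm_subgradient(weight, tol=1e-5):
    """Sub-gradient of ||W||_*: U_tru @ V_tru^T, with U, V truncated
    to the numerical rank of W (cf. Avron et al. [1])."""
    mat = weight.reshape(weight.shape[0], -1)
    U, S, Vh = torch.linalg.svd(mat, full_matrices=False)
    r = max(int((S > tol * S[0]).sum()), 1)   # numerical rank(W)
    return (U[:, :r] @ Vh[:r]).reshape(weight.shape)

# Step A update direction: T_sub = grad(f) + lam * U_tru V_tru^T, e.g.
#   weight.grad.add_(lam * nuclear_norm_subgradient(weight.data))
```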
Trained Rank Pruning
• Step B is inserted into the training process after every m SGD iterations of Step A (see the training-loop sketch below):
  [Pipeline: SGD with nuclear norm regularization → (m SGD iterations) → SGD with nuclear norm regularization → training with low-rank decomposition]
• Capable of generating LR model parameters with diverse optimal ranks across layers.
• Applicable to most existing decompositions, e.g. channel-wise and spatial-wise decompositions.
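Putting the two steps together, the schedule could look like the loop below. This is a hypothetical sketch that reuses nuclear_norm_subgradient from the previous snippet; the energy-based rank-selection rule in low_rank_project is our assumption, not necessarily the paper's criterion.

```python
import torch
import torch.nn as nn

def conv_weights(model):
    # Yield the weight tensors of all conv layers in the model.
    for mod in model.modules():
        if isinstance(mod, nn.Conv2d):
            yield mod.weight

def low_rank_project(weight, energy=0.98):
    # Reconstruct W from the smallest rank that retains the given
    # fraction of singular-value energy (rank pruning).
    mat = weight.reshape(weight.shape[0], -1)
    U, S, Vh = torch.linalg.svd(mat, full_matrices=False)
    r = max(int((torch.cumsum(S, 0) / S.sum() < energy).sum()), 1)
    return (U[:, :r] @ torch.diag(S[:r]) @ Vh[:r]).reshape(weight.shape)

def trp_epoch(model, loader, optimizer, criterion, lam=3e-4, m=20):
    for step, (images, labels) in enumerate(loader, 1):
        optimizer.zero_grad()
        criterion(model(images), labels).backward()
        # Step A: add the nuclear-norm sub-gradient to each conv gradient.
        with torch.no_grad():
            for w in conv_weights(model):
                w.grad.add_(lam * nuclear_norm_subgradient(w))
        optimizer.step()
        # Step B, every m iterations: decompose each filter to low rank
        # and substitute the reconstructed filters back into the model.
        if step % m == 0:
            with torch.no_grad():
                for w in conv_weights(model):
                    w.copy_(low_rank_project(w))
```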
Experimental Results
All decomposition and pruning baselines compared here are fine-tuned to improve accuracy, while our results come from direct decomposition after training.
• TRP_spatial: our trained rank pruning method with spatial-wise decomposition;
• TRP_channel: our trained rank pruning method with channel-wise decomposition;
• Nu: nuclear norm regularization during training;
• Speedup: the reduction ratio of model FLOPs (a worked example follows below).
On both CIFAR-10 and ImageNet, our TRP methods outperform existing methods in both channel-wise and spatial-wise decomposition formats, achieving a better balance between accuracy and complexity.
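To make the Speedup column concrete, the FLOPs reduction of a channel-wise decomposed conv layer can be computed directly. A small sketch with hypothetical layer shapes; the helper names are ours, not from the paper.

```python
def conv_flops(c_in, c_out, k, h, w):
    # Multiply-accumulate count of a k x k convolution on an h x w output map.
    return c_out * c_in * k * k * h * w

def channelwise_speedup(c_in, c_out, k, h, w, rank):
    # Decomposed layer: a rank x C_in k x k conv + a C_out x rank 1 x 1 conv.
    orig = conv_flops(c_in, c_out, k, h, w)
    low = conv_flops(c_in, rank, k, h, w) + conv_flops(rank, c_out, 1, h, w)
    return orig / low

# e.g. a 256 -> 256 3x3 conv on a 14 x 14 map, pruned to rank 64:
print(channelwise_speedup(256, 256, 3, 14, 14, 64))  # 3.6x speedup
```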