Lightweight Neural Networks from PCA & LDA Based Distilled Dense - PowerPoint PPT Presentation

Oct 25, 2023 •184 likes •243 views

Lightweight Neural Networks from PCA & LDA Based Distilled Dense Neural Networks ICIP 2020 MEA. Seddik 1 , 2 , , H. Essafi 1 , A. Benzine 1 , 3 , M. Tamaazousti 1 1 CEA List, France 2 CentraleSuplec, L2S, France 3 Sorbonne University,

Lightweight Neural Networks from PCA & LDA Based Distilled Dense Neural Networks ICIP 2020 MEA. Seddik 1 , 2 , ∗ , H. Essafi 1 , A. Benzine 1 , 3 , M. Tamaazousti 1 1 CEA List, France 2 CentraleSupélec, L2S, France 3 Sorbonne University, CNRS, France ∗ http://melaseddik.github.io/ August 21, 2020 1 / 5
/ 2/5 Abstract Context: ◮ Compression of dense neural networks with the teacher-student approach. Motivation: ◮ Build lightweight neural networks that can fit into edge and IoT devices with limited resources (memory and computation). Proposed methods: ◮ We proposed two methods which rely on dimension reduction techniques (PCA and LDA). ◮ The dimension reduction is applied at each layer of the teacher net and then mapped to the layers of the student net using a multi-task loss function. 2 / 5
/ 3/5 Setting Given a Teacher Network (TN) trained on a dataset D with loss L TN � h (0) = x ∈ R p 0 (TN) : � W ( ℓ ) h ( ℓ − 1) + b ( ℓ ) � ∀ ℓ ∈ [ L ] h ( ℓ ) = f ℓ ∈ R p ℓ Construct a Student Network (SN) to train on D � h (0) = x ∈ R p 0 ˜ � b ( ℓ ) � (SN) : ∀ ℓ ∈ [ L ] ˜ h ( ℓ ) = f ℓ W ( ℓ )˜ ˜ h ( ℓ − 1) + ˜ ∈ R k ℓ Such that k ℓ ≪ p ℓ & Performance (SN) � Performance (TN) 3 / 5
/ 4/5 Proposed Methods (Net-PCAD & Net-LDAD) Given (TN) , a data matrix X and (TN) loss function L TN For each layer ℓ : 1. Extract the representations H ℓ of X from (TN) 2. Compute a projection matrix U ℓ ∈ R p ℓ × k ℓ through PCA or LDA on H ℓ Train (SN) as a multi-task 1 problem with L − 1 � ℓ h ( ℓ ) � � h ( ℓ ) , U ⊺ ˜ L SN = e − σ L TN + σ e − σ ℓ L mse + + σ ℓ � �� ℓ =1 Learning Task � �� (SN) Hidden Layers Task where σ and { σ ℓ } L − 1 ℓ =1 are learnable parameters. 1 Using the Homoscedastic loss function: A. Kendall et al. “Multitask learning using uncertainty to weigh losses for scene geometry and semantics” in Proceedings of IEEE CVPR, 2018. 4 / 5
/ 5/5 Experimental Setting & Results Layer (TN) (SN) Dense 1 p 0 × 1024 p 0 × k Dense 2 1024 × 512 k × k Dense 3 512 × 256 k × k Dense 4 256 × 10 k × 10 Table: Networks architectures. (SN) Datasets (TN) k = 50 100 200 MNIST 2 . 23s 0 . 38s 0 . 45s 0 . 65s 98% 97% 97 . 5% 97 . 8% FASHION 2 . 23s 0 . 38s 0 . 45s 0 . 65s 88% 87 . 5% 88 . 5% 88 . 5% CIFAR10 4 . 63s 0 . 75s 0 . 92s 1 . 35s 45% 50% 50 . 1% 50 . 3% Table: Networks performances. ⇒ k ℓ ≪ p ℓ & Performance (SN) � Performance (TN) 5 / 5

Recommend

ECS231 PCA, revisited May 28, 2019 1 / 18 Outline 1. PCA for lossy data compression 2. PCA for

ECS231 PCA, revisited May 28, 2019 1 / 18 Outline 1. PCA for lossy data compression 2. PCA for learning a representation of data 3. Extra: learning XOR 2 / 18 1. PCA for lossy data compression 1 Data compression: given data points { x (1)

347 views • 18 slides

SALT LAKE LEGAL DEFENDER (LDA) AND SOCIAL SERVICES Who we are, what we do, court system and how LDA

SALT LAKE LEGAL DEFENDER (LDA) AND SOCIAL SERVICES Who we are, what we do, court system and how LDA can assist in working with the DSPD LDA Salt Lake City Established in 1965 Provides indigent/court appointed criminal defense to SL

690 views • 37 slides

Learning Neural Networks Learning Neural Networks Neural Networks can represent complex Neural

Learning Neural Networks Learning Neural Networks Neural Networks can represent complex Neural Networks can represent complex decision boundaries decision boundaries Variable size. Any boolean function can be Variable size. Any boolean

358 views • 14 slides

MLCC 2015 Dimensionality Reduction and PCA Lorenzo Rosasco UNIGE-MIT-IIT June 25, 2015 Outline

MLCC 2015 Dimensionality Reduction and PCA Lorenzo Rosasco UNIGE-MIT-IIT June 25, 2015 Outline PCA & Reconstruction PCA and Maximum Variance PCA and Associated Eigenproblem Beyond the First Principal Component PCA and Singular Value

942 views • 54 slides

Neural Networks and Handwriting Recognition Background Neural Networks Neural Network Steven

Neural Networks and Handwriting Recognition Steven Sloss Math 164 Neural Networks and Handwriting Recognition Background Neural Networks Neural Network Steven Sloss Structure Training Neural Networks Math 164 Motivation Problem

889 views • 41 slides

Neural Networks Neural networks arise from attempts to model Neural Networks human/animal

Feed-forward Networks Network Training Error Backpropagation Deep Learning Feed-forward Networks Network Training Error Backpropagation Deep Learning Neural Networks Neural networks arise from attempts to model Neural Networks

380 views • 9 slides

Understanding Landscape Visualisation for Visual Impact Assessments Lock, David.J. 1 1 LDA Design,

Understanding Landscape Visualisation for Visual Impact Assessments Lock, David.J. 1 1 LDA Design, Worton Rectory Park, Oxford OX29 4SX Tel. +44 865 887050 Fax +44 865 887055 David.lock@lda-design.co.uk, web http://www.lda-design.co.uk/ Summary:

279 views • 7 slides

Your local partner of choice THE ENGCO GROUP ENGCO Group consists of six companies: ENGCO, Lda

Your local partner of choice THE ENGCO GROUP ENGCO Group consists of six companies: ENGCO, Lda (Holding Company), founded in Mozambique in 2001; ENGCO Investimentos, Lda founded in 2001; PIERLITE Moambique, Lda founded in 2002; ENGCO

404 views • 21 slides

Linking words to topics Pavel Oleinikov Associate Director DataCamp Topic Modeling in R LDA

DataCamp Topic Modeling in R TOPIC MODELING IN R Linking words to topics Pavel Oleinikov Associate Director DataCamp Topic Modeling in R LDA and random numbers LDA call mod = LDA(x=dtm, k=2, method="Gibbs",control=list(alpha=1,

515 views • 30 slides

LDA 1 [Credits: Mike Smith, Las Vegas Sun 2013] LDA 2 [Credits: IITD Library] 4 5 6 In

LDA 1 [Credits: Mike Smith, Las Vegas Sun 2013] LDA 2 [Credits: IITD Library] 4 5 6 In text, the hidden variables are the thematic structure. What are the topics that describe this collection? How does a new document fit into the topic

1.15k views • 64 slides

PLUGIN CLASSIFIERS: NAIVE BAYES, LDA, PLUGIN CLASSIFIERS: NAIVE BAYES, LDA, LOGISTIC REGRESSION

PLUGIN CLASSIFIERS: NAIVE BAYES, LDA, PLUGIN CLASSIFIERS: NAIVE BAYES, LDA, LOGISTIC REGRESSION LOGISTIC REGRESSION Matthieu R Bloch Tuesday, January 28, 2020 1 LOGISTICS LOGISTICS TAs and Office hours Monday: Mehrdad (TSRB 523a) -

261 views • 11 slides

SVD-LDA: Topic Modeling for Full-Text Recommender Systems Sergey Nikolenko Steklov Mathematical

Recommender systems From LDA to SVD-LDA SVD-LDA: Topic Modeling for Full-Text Recommender Systems Sergey Nikolenko Steklov Mathematical Institute at St. Petersburg Laboratory for Internet Studies, National Research University Higher School of

311 views • 30 slides

Sequential Data with Neural Networks Recurrent Neural Networks Sequential input / output Greg

Recurrent Neural Networks Long Short-Term Memory Temporal Convolutional Networks Examples Recurrent Neural Networks Long Short-Term Memory Temporal Convolutional Networks Examples Sequential Data with Neural Networks Recurrent Neural

303 views • 4 slides

Ive Got You Under My Skin: A Comparison of IV and s/c PCA Nick Williamson Clinical Nurse

Ive Got You Under My Skin: A Comparison of IV and s/c PCA Nick Williamson Clinical Nurse Specialist How did PCA get under my skin? Started in 2009 when I started working at KCH Subcut PCA ! ! ! PCA refers to an electronically

447 views • 44 slides

Exploratory Factor Analysis PCA Analysis A Review Precipitation Temperature Ecosystems PCA

Multivariate Fundamentals: Rotation Exploratory Factor Analysis PCA Analysis A Review Precipitation Temperature Ecosystems PCA Analysis with Spatial Data Proportion of variance explained Comp.1 + Comp.2 + Comp.3 ~= 95% Loadings PCA

1.01k views • 16 slides

Lecture 25: Autoencoders Kernel PCA Aykut Erdem January 2017 Hacettepe University Today

Lecture 25: Autoencoders Kernel PCA Aykut Erdem January 2017 Hacettepe University Today Motivation PCA algorithms Applications PCA shortcomings Autoencoders Kernel PCA 2 Autoencoders 3

550 views • 34 slides

1 last time SIMD (single instruction multiple data) hardware idea: wider ALUs and registers

1 last time SIMD (single instruction multiple data) hardware idea: wider ALUs and registers Intels interface _mm sharing the CPU: context switching context = visible CPU state (registers, condition codes, PC, ) exceptions = OS gets run

1.42k views • 129 slides

CENG3420 Lecture 01: Introduction Bei Yu (Latest update: January 9, 2019) Spring 2019 1 / 50

CENG3420 Lecture 01: Introduction Bei Yu (Latest update: January 9, 2019) Spring 2019 1 / 50 Overview Course Information Background Organization First Glance Summary 2 / 50 Overview Course Information Background Organization

754 views • 60 slides

Introduction 2 A Modern Computer iPhone XS Computer Systems and Networks Spring 2019 3

Computer Systems and Networks ECPE 170 Jeff Shafer University of the Pacific Introduction 2 A Modern Computer iPhone XS Computer Systems and Networks Spring 2019 3 Applications Computer Systems and Networks Spring 2019 4

644 views • 53 slides

CSE141: Introduction to Computer Architecture Hung-Wei Tseng CSE141: Lets say something!

CSE141: Introduction to Computer Architecture Hung-Wei Tseng CSE141: Lets say something! Whats your name? Whats the most exciting thing you did so far this summer? Whats the most interesting computer science topic for you?

947 views • 68 slides

System Structuring with Threads Example: A Transcoding Web Proxy Appliance Proxy clients

System Structuring with Threads Example: A Transcoding Web Proxy Appliance Proxy clients Interposed between Web (HTTP) clients and servers. Masquerade as (represent) the server to the client. Masquerade as (represent) the client to the

617 views • 14 slides

FloWaveNet: A Generative Flow for Raw Audio Sungwon Kim 1 , Sang-gil Lee 1 , Jongyoon Song 1 ,

ICML 2019 FloWaveNet: A Generative Flow for Raw Audio Sungwon Kim 1 , Sang-gil Lee 1 , Jongyoon Song 1 , Jaehyeon Kim 2 , Sungron Yoon 1,3 1 Seoul National University, 2 Kakao Corporation, 3 ASRI, INMC, Institute of Engineering Research, Seoul

346 views • 16 slides

Form over Function Teaching Beginners How to Construct Programs Michael Sperber Collaborators:

Form over Function Teaching Beginners How to Construct Programs Michael Sperber Collaborators: Marcus Crestani, Martin Gasbichler, Herbert Klaeren, Eric Knauel @ University of Tbingen Wednesday, September 12, 12 Back at the Ranch ...

791 views • 58 slides

Another Dynamic Algorithm: Scoreboard Summary Tomasulo Algorithm Speedup 1.7 from compiler;

Another Dynamic Algorithm: Scoreboard Summary Tomasulo Algorithm Speedup 1.7 from compiler; 2.5 by hand For IBM 360/91 about 3 years after CDC 6600 BUT slow memory (no cache) limits benefit Goal: High Performance without special

303 views • 3 slides