Model Compression
Presented by: Ashutosh Adhikari
Neural Networks Can Be Too Huge!!
- NNs have been growing much more complex over time
- Objective: learn efficient NNs by pruning redundant parameters and connections
- Reduces processing time
- Reduces run-time memory requirements
Categories of Model Compression
- Parameter pruning and sharing
- Low-rank factorization
- Transferred/compact convolutional filters
- Knowledge distillation
Parameter Pruning and Sharing
- One of the oldest techniques
- Optimal Brain Damage:
  - Uses the objective function to characterize the importance (saliency) of parameters
  - Deletes the least important parameters
  - Saliency is estimated from second derivatives plus further approximations (see the sketch below)
- Quantization and binarization
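A minimal sketch of OBD-style saliency pruning, assuming PyTorch; the diagonal Hessian `hessian_diag` is assumed to be supplied externally (the original paper estimates it with a further approximation), and the function names are illustrative rather than from any library:

```python
import torch

def obd_saliency(weight, hessian_diag):
    # OBD saliency: s_i = 0.5 * H_ii * w_i^2, a second-order Taylor
    # estimate of the increase in loss when parameter w_i is set to zero.
    return 0.5 * hessian_diag * weight.pow(2)

def prune_lowest_saliency(weight, hessian_diag, fraction=0.5):
    # Zero out the `fraction` of parameters with the smallest saliency
    # and return the pruned weights plus the binary mask.
    s = obd_saliency(weight, hessian_diag)
    k = max(1, int(fraction * s.numel()))
    threshold = s.flatten().kthvalue(k).values
    mask = (s > threshold).to(weight.dtype)
    return weight * mask, mask
```

In practice the pruned network is retrained for a few epochs to recover accuracy before pruning again.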
Model Compression & Computer Vision - A lot of work on Model Compression for Computer Vision problems - Many Convolutional Neural Network specific approaches developed - Channel Pruning has been successful - CondenseNets - Group the features; prune the less important - Device a methodology to learn the groups - Network Slimming
Pretrained Models for NLP
- Paradigm of pre-trained models for NLP
- Transformer-based models (BERT, GPT)
- BERT ⇒ Transformer-based model with ~300M parameters!!
- Pre-trained models are huge and cumbersome
- All the major compression works for these models use knowledge distillation
- Future work!!
Knowledge Distillation
- Model-agnostic approach
- Student-teacher setup
  - Teacher ⇒ larger model, knows more
  - Student ⇒ smaller model, limited capacity
- Let the student learn “rich” representations from the teacher
  - Using the class probabilities produced by the teacher (see the sketch below)
- Add a regression objective for “distilling knowledge”
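A minimal sketch of the soft-target distillation loss that trains the student on the teacher's class probabilities, assuming PyTorch; the temperature and mixing weight `alpha` are illustrative hyperparameters (a regression term on logits or hidden states can be added on top, as the slide mentions):

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    # Soft-target term: KL divergence between the temperature-softened
    # class probabilities of teacher and student, scaled by T^2 so
    # gradient magnitudes stay comparable across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * (temperature ** 2)
    # Hard-target term: ordinary cross-entropy against the true labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```

The student is trained on this combined loss while the teacher's parameters stay frozen.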
Questions?
References
1. Y. LeCun, J. S. Denker, S. A. Solla, R. E. Howard, and L. D. Jackel. Optimal brain damage. In NIPS, volume 2, pages 598–605, 1989.
2. Zhuang Liu, Jianguo Li, Zhiqiang Shen, Gao Huang, Shoumeng Yan, and Changshui Zhang. Learning efficient convolutional networks through network slimming. In 2017 IEEE International Conference on Computer Vision (ICCV), pages 2755–2763, 2017.
3. G. Huang, S. Liu, L. van der Maaten, and K. Q. Weinberger. CondenseNet: An efficient DenseNet using learned group convolutions. CVPR, 2018.
4. Yu Cheng, Duo Wang, Pan Zhou, and Tao Zhang. A survey of model compression and acceleration for deep neural networks. arXiv preprint arXiv:1710.09282, 2017.