1. CMSC5743 Lab05 Introduction to Distiller
Qi Sun (Latest update: October 13, 2020)
Fall 2020

2. Distiller
◮ Distiller is an open-source Python package (built on PyTorch) for neural network compression research.
◮ Comprehensive documentation and a mature forum.
◮ Example implementations of state-of-the-art compression algorithms.
◮ A friendly framework to which you can easily add your own pruning, regularization, and quantization algorithms.
◮ Supports many mainstream DNN models and datasets, e.g., SqueezeNet and ImageNet.

3. Using The Sample Application
An example Python file is provided: ./examples/classifier_compression/compress_classifier.py
◮ Check all of the program options via python ./compress_classifier.py -h, including the pretrained models.
◮ You can try the Jupyter notebooks to learn the usage of Distiller.
◮ Specify the algorithm configuration in a YAML file, e.g.:

    version: 1
    pruners:
      my_pruner:
        class: 'SensitivityPruner'
        sensitivities:
          'features.module.0.weight': 0.25
          'features.module.3.weight': 0.35
          'classifier.1.weight': 0.875
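As a rough sketch (not from the slides), a YAML schedule like the one above typically drives training as follows: distiller.file_config parses it into a CompressionScheduler whose callbacks wrap each epoch and minibatch. Here model, optimizer, criterion, train_loader, and num_epochs are placeholders assumed to exist already.

    import distiller

    # Parse the YAML schedule and attach its pruning policies to the model.
    scheduler = distiller.file_config(model, optimizer, 'schedule.yaml')
    steps_per_epoch = len(train_loader)

    for epoch in range(num_epochs):
        scheduler.on_epoch_begin(epoch)
        for step, (inputs, targets) in enumerate(train_loader):
            scheduler.on_minibatch_begin(epoch, step, steps_per_epoch)
            loss = criterion(model(inputs), targets)
            # Lets the scheduler add regularization terms to the loss, if any.
            scheduler.before_backward_pass(epoch, step, steps_per_epoch, loss)
            loss.backward()
            optimizer.step()
            scheduler.on_minibatch_end(epoch, step, steps_per_epoch)
        scheduler.on_epoch_end(epoch)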

4. Pruning Sensitivity Analysis
Command flag: --sense=element or --sense=filter
◮ Distiller supports element-wise and filter-wise pruning sensitivity analysis.
◮ In both cases, the L1-norm is used to rank which elements or filters to prune.
◮ For example, when running filter-pruning sensitivity analysis, the L1-norms of the filters in each layer's weights tensor are calculated, and the bottom x% are set to zero.
◮ Using a small dataset here can save much time, provided it still gives sufficiently representative results.
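A rough sketch of running this analysis from Python, assuming the perform_sensitivity_analysis and sensitivities_to_csv helpers from Distiller's sensitivity module; model is assumed to exist, and test_func is a placeholder callback that evaluates the model and returns (top1, top5, loss).

    import numpy as np
    import distiller

    # Analyze only weight tensors; each is pruned at every sparsity level
    # in turn, and test_func measures the resulting accuracy drop.
    param_names = [name for name, _ in model.named_parameters()
                   if name.endswith('.weight')]
    sensitivity = distiller.perform_sensitivity_analysis(
        model,
        net_params=param_names,
        sparsities=np.arange(0.0, 0.95, 0.05),
        test_func=test_func,
        group='filter')   # 'element' for element-wise analysis
    distiller.sensitivities_to_csv(sensitivity, 'sensitivity.csv')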

5. Pruning Algorithms

6. Pruning Algorithms
◮ All of the pruning algorithms are defined in ./distiller/pruning.
◮ Channel and filter pruning are supported; see the schedule fragment below.
◮ Pay attention to the model structure to guarantee that the pruning strategies are mutually compatible.
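As an illustrative schedule fragment in the same YAML style as slide 3, using Distiller's L1RankedStructureParameterPruner for filter pruning; the layer name and sparsity value are placeholders chosen for illustration.

    pruners:
      filter_pruner:
        class: 'L1RankedStructureParameterPruner'
        group_type: Filters
        desired_sparsity: 0.6
        weights: ['module.conv1.weight']  # placeholder layer name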

7. Magnitude Pruner
◮ It applies a thresholding function, thresh(·), to each element w_i of a weights tensor.
◮ Because the threshold is applied to individual elements, this pruner belongs to the element-wise family of pruning algorithms.

    \text{thresh}(w_i) =
    \begin{cases}
      w_i & \text{if } |w_i| > \lambda \\
      0   & \text{if } |w_i| \le \lambda
    \end{cases}
    \qquad (1)
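A minimal PyTorch sketch of Eq. (1); the tensor shape and the threshold value lam (λ) are arbitrary illustrative choices.

    import torch

    w = torch.randn(64, 3, 3, 3)      # a weights tensor
    lam = 0.1                         # threshold λ (illustrative value)
    mask = (w.abs() > lam).float()    # keep elements with |w_i| > λ
    w_pruned = w * mask               # zero out the rest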

8. Sensitivity Pruner
◮ The model weights approximately follow a Gaussian distribution, with standard deviation σ and mean value µ.
◮ 3-σ rule (the 68-95-99.7 rule):

    \Pr(\mu - \sigma \le X \le \mu + \sigma) \approx 0.6827 \qquad (2)

◮ If we set the threshold to s × σ, then basically we are thresholding s × 68% of the tensor elements.
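A minimal sketch of this thresholding in PyTorch: the threshold is λ = s × σ, where σ is the standard deviation of the layer's weights and s is the per-layer sensitivity hyperparameter (e.g. the 0.25 in the YAML on slide 3). The tensor shape is an arbitrary illustrative choice.

    import torch

    w = torch.randn(64, 3, 3, 3)
    s = 0.25                              # per-layer sensitivity
    lam = s * w.std()                     # λ = s * σ for this tensor
    w_pruned = w * (w.abs() > lam).float()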

9. Automated Gradual Pruner (AGP)
◮ The sparsity is increased from an initial sparsity value s_i (usually 0) to a final sparsity value s_f over a span of n pruning steps.
◮ The intuition behind this sparsity function is to prune the network rapidly in the initial phase, when the redundant connections are abundant, and to gradually reduce the number of weights being pruned each time as fewer and fewer weights remain in the network.
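For reference, the sparsity schedule AGP implements comes from Zhu & Gupta, "To prune, or not to prune" (2017): starting at training step t_0 and pruning every Δt steps, the sparsity at step t is

    s_t = s_f + (s_i - s_f)\left(1 - \frac{t - t_0}{n\,\Delta t}\right)^{3},
    \qquad t \in \{t_0,\ t_0 + \Delta t,\ \ldots,\ t_0 + n\,\Delta t\}

The cubic term makes the per-step increase in sparsity large early on and small near the end, matching the intuition above.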

10. Post-training Quantization
◮ It does not require any Policies or a Scheduler.
◮ A checkpoint with the quantized model will be dumped in the run directory.
◮ It will contain the quantized model parameters (the data type will still be FP32, but the values will be integers).
◮ The calculated quantization parameters (scale and zero-point) are stored in each quantized layer as well.
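A rough sketch of doing this from Python, assuming the PostTrainLinearQuantizer class in distiller.quantization; argument names and the dummy-input requirement may differ across Distiller versions, and the 8-bit settings are illustrative.

    import torch
    import torchvision.models as models
    from distiller.quantization import PostTrainLinearQuantizer

    model = models.resnet18(pretrained=True)
    quantizer = PostTrainLinearQuantizer(model,
                                         bits_activations=8,
                                         bits_parameters=8)
    # Replaces supported layers in place with quantized wrappers;
    # the dummy input traces the model (assumed requirement).
    quantizer.prepare_model(torch.randn(1, 3, 224, 224))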

11. Check Model Parameters
◮ Use Netron. If a prototxt file is available, you can visualize the model.
◮ Use checkpoint['state_dict'].items() on a loaded checkpoint.
◮ Use model.named_parameters().
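A minimal sketch of both inspection routes; the checkpoint path is a placeholder, and the small Sequential model stands in for any real network.

    import torch
    import torch.nn as nn

    # Inspect a live model via named_parameters().
    model = nn.Sequential(nn.Conv2d(3, 16, 3), nn.ReLU(), nn.Conv2d(16, 32, 3))
    for name, param in model.named_parameters():
        print(name, tuple(param.shape))

    # Inspect a saved checkpoint ('checkpoint.pth.tar' is a placeholder path).
    ckpt = torch.load('checkpoint.pth.tar', map_location='cpu')
    for name, tensor in ckpt['state_dict'].items():
        print(name, tuple(tensor.shape))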

12. Experiment Reproducibility
To guarantee the reproducibility of your results:
◮ Set -j 1 to use only one data-loading worker.
◮ Use the --deterministic flag.
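For example (the model name and dataset path are placeholders):

    python ./compress_classifier.py -a simplenet_cifar ../data.cifar10 -j 1 --deterministic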
