Same, Same But Different: Recovering Neural Network Quantization Error Through Weight Factorization. Eldad Meller, ICML 2019
Neural Network Quantization • Quantization of Neural Networks is needed for efficient inference • Quantization adds noise to the network and degrades its performance
Quantization Dynamic Range • The most common quantization setting is layer-wise quantization, where all the channels in a layer are quantized using the same dynamic range • Equalizing the dynamic ranges of the channels in a layer, by amplifying channels with a small dynamic range, reduces the overall quantization noise
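As a rough illustration of this point (not taken from the slides), the NumPy sketch below quantizes a layer with one large-range and one small-range channel using a single shared dynamic range, then repeats the experiment after amplifying the small-range channel. The 8-bit symmetric quantizer and the channel statistics are illustrative assumptions; in a real network the "undo" division is absorbed into the next layer's weights, which is exactly the trick described on the next slide.

```python
import numpy as np

def quantize(w, n_bits=8):
    """Symmetric uniform quantization with a single (layer-wise) dynamic range."""
    scale = np.abs(w).max() / (2 ** (n_bits - 1) - 1)
    return np.round(w / scale) * scale

rng = np.random.default_rng(0)
# Two channels of the same layer with very different dynamic ranges.
w = np.stack([rng.normal(0.0, 1.0, 64),     # large-range channel
              rng.normal(0.0, 0.01, 64)])   # small-range channel

# Layer-wise quantization: the small-range channel is crushed by the shared step size.
err_plain = np.abs(w - quantize(w)).mean(axis=1)

# Equalized: amplify the small-range channel before quantization, undo it afterwards.
scales = np.abs(w).max() / np.abs(w).max(axis=1, keepdims=True)
err_eq = np.abs(w - quantize(w * scales) / scales).mean(axis=1)

print(err_plain, err_eq)  # the small-range channel's error drops by roughly its scale factor
```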
A simple trick to amplify channels • For any homogeneous activation function (e.g. ReLU), any channel in the network can be scaled by any positive scalar, as long as the weights in the consecutive layer that consume that channel are inversely scaled • The network's output remains unchanged
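A minimal sketch of this equivalence, assuming a two-layer fully connected network with a ReLU in between (the network, the channel index, and the scale factor are arbitrary choices for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
relu = lambda x: np.maximum(x, 0.0)

# A tiny two-layer network: y = W2 @ relu(W1 @ x + b1) + b2
W1, b1 = rng.normal(size=(4, 8)), rng.normal(size=4)
W2, b2 = rng.normal(size=(3, 4)), rng.normal(size=3)
x = rng.normal(size=8)

y_ref = W2 @ relu(W1 @ x + b1) + b2

# Scale channel 2 of the first layer by c > 0 and inversely scale
# the next layer's weights that consume that channel.
c = 10.0
W1_eq, b1_eq, W2_eq = W1.copy(), b1.copy(), W2.copy()
W1_eq[2] *= c
b1_eq[2] *= c
W2_eq[:, 2] /= c

y_eq = W2_eq @ relu(W1_eq @ x + b1_eq) + b2
print(np.allclose(y_ref, y_eq))  # True: the output is unchanged
```

The equality holds because ReLU is positively homogeneous: relu(c * z) = c * relu(z) for any c > 0, so the factor c introduced in the first layer is exactly cancelled by the 1/c in the second.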
Network Equalization
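The details of the equalization slide are not in the extracted text, but combining the two previous slides gives a plausible sketch: choose a positive per-channel scale that amplifies small-range channels, apply it to a layer's output channels, and fold the inverse scale into the following layer so the network stays equivalent. The scale rule below (match every channel's range to the layer maximum) and the helper name equalize_pair are illustrative assumptions, not necessarily the paper's exact formulation.

```python
import numpy as np

def equalize_pair(W1, b1, W2, eps=1e-9):
    """Equalize the output channels of a layer (W1, b1) that is followed by a
    homogeneous activation and a second layer W2.

    Hypothetical scale choice: amplify every channel so its dynamic range
    matches the layer maximum; the inverse scaling of W2 keeps the network
    functionally unchanged.
    """
    ranges = np.abs(np.concatenate([W1, b1[:, None]], axis=1)).max(axis=1)
    scales = ranges.max() / (ranges + eps)
    return W1 * scales[:, None], b1 * scales, W2 / scales[None, :]

# Sanity check: the equalized pair computes the same function.
rng = np.random.default_rng(2)
relu = lambda x: np.maximum(x, 0.0)
W1, b1, W2 = rng.normal(size=(16, 32)), rng.normal(size=16), rng.normal(size=(10, 16))
x = rng.normal(size=32)

W1e, b1e, W2e = equalize_pair(W1, b1, W2)
print(np.allclose(W2 @ relu(W1 @ x + b1), W2e @ relu(W1e @ x + b1e)))  # True
```

After this transformation all output channels of the first layer share (approximately) the same dynamic range, so layer-wise quantization no longer penalizes the small-range channels.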
Quantization Degradation on ImageNet [%]
Summary • Equalization is an easy-to-use post-training quantization method for recovering quantization error in neural networks • Can be applied to any network • A novel approach to quantization: searching for the best equivalent representation of the network • The method can be combined with other quantization methods, e.g. quantization-aware training and smart clipping