using monte carlo methods
play

using Monte Carlo Methods EMC2 Workshop Gonalo Mordido Hasso - PowerPoint PPT Presentation

Instant Quantization of Neural Networks using Monte Carlo Methods EMC2 Workshop Gonalo Mordido Hasso Plattner Institute @ NeurIPS 2019 Matthijs Van Keirsbilck NVIDIA Alexander Keller NVIDIA 1 Motivation and idea neural network


  1. Instant Quantization of Neural Networks using Monte Carlo Methods EMC2 Workshop Gonçalo Mordido Hasso Plattner Institute @ NeurIPS 2019 Matthijs Van Keirsbilck NVIDIA Alexander Keller NVIDIA 1

  2. Motivation and idea ● neural network quantization/sparsity lower cost: compute, memory, power, bandwidth, ... ○ ● quantization usually requires retraining ● idea: use importance sampling fast and efficient due to stratified sampling ○ sparsity and bit-width adjustable by the number of samples ○ no additional training ○ Gonçalo Mordido, Matthijs Van keirsbilck, Alexander Keller 2

  3. Monte Carlo Quantization (MCQ) ... full precision values PDF ... Gonçalo Mordido, Matthijs Van keirsbilck, Alexander Keller 3

  4. Monte Carlo Quantization (MCQ) ... CDF integer values ... Gonçalo Mordido, Matthijs Van keirsbilck, Alexander Keller 4

  5. Results Gonçalo Mordido, Matthijs Van keirsbilck, Alexander Keller 5

  6. Monte Carlo Neural Networks simple method to quantize/sparsify models ● ○ low accuracy loss ○ no retraining general applicability ● ○ weights and/or activations ○ related to random walks future work ● ○ quantized gradients ○ integer neural networks Gonçalo Mordido, Matthijs Van keirsbilck, Alexander Keller 6

Recommend


More recommend