the dark side of dnn pruning
play

The Dark Side of DNN Pruning Reza Yazdani Marc Riera Jose-Maria - PowerPoint PPT Presentation

45 th International Symposium on Computer Architecture, Los Angeles, US, June 2018 The Dark Side of DNN Pruning Reza Yazdani Marc Riera Jose-Maria Arnau Antonio Gonzlez DNN Pruning Efficient reduction of DNN size Higher


  1. 45 th International Symposium on Computer Architecture, Los Angeles, US, June 2018 The Dark Side of DNN Pruning Reza Yazdani Marc Riera Jose-Maria Arnau Antonio González

  2. DNN Pruning ● Efficient reduction of DNN size ✔ Higher performance ✔ Significant energy-saving ✔ Ultra-low power ✔ Lower area 2 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  3. Side-Effect of DNN Pruning ● Lack of confidence in DNN classification – Speech network of acoustic modeling 1 Baseline 0.8 Pruned Model Probability 0.6 0.4 0.2 0 Output Class 3 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  4. Confidence Issue ● DNN dependent applications – Automatic Speech Recognition (ASR) – Machine Translation ● Example: ASR evaluation for pruned DNN Normalized Decoding Time (%) 140 100 Dnn Viterbi Word-Error-Rate (%) 90 120 80 WER 100 70 60 80 50 60 40 30 40 20 20 10 0 0 4 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  5. Outline ● Motivation ● DNN pruning & Confidence loss ● ASR using pruned DNN ● Accelerator's baseline ● Efficient design with DNN pruning ● Experimental results ● Conclusions

  6. DNN Pruning: Accuracy ● Maintaining top-5 accuracy 100% T op 1 T op 5 80% 60% 40% 20% 0% 5 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  7. Loss of Confidence ● The more the pruning rate in DNNs, the lower the classification probability 0.7 Average Confidence 0.65 0.6 0.55 0.5 6 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  8. Outline ● Motivation ● DNN pruning & Confidence loss ● ASR using pruned DNN ● Accelerator's baseline ● Efficient design with DNN pruning ● Experimental results ● Conclusions

  9. ASR ● ASR systems include two phases – DNN: computes probabilities of different phonemes at each frame Frame i 7 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  10. ASR ● ASR systems include two phases – DNN: computes probabilities of different phonemes at each frame DNN Frame i Hidden . . n . Layers . m . . 7 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  11. ASR ● ASR systems include two phases – DNN: computes probabilities of different phonemes at each frame DNN Frame i Hidden . . n . Layers . m . . 1 DNN Score 0.8 0.6 0.4 0.2 0 Output Class 7 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  12. ASR ● ASR systems include two phases – DNN: computes probabilities of different phonemes at each frame – Viterbi search: explores WFST based on DNN scores Frame 0 Frame 1 Frame 2 ... ... S1 ... S2 S1 ... S2 S2 ... S3 S4 ... 7 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  13. ASR Evaluation ● Viterbi search under pruned DNN model DNN Scores of Frame 2 Frame 2 8 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  14. ASR Evaluation ● Viterbi search under pruned DNN model Frame 2 DNN Scores of Frame 2 8 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  15. Viterbi Workload ● Increase in Viterbi's search activity 9 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  16. Outline ● Motivation ● DNN pruning & Confidence loss ● ASR using pruned DNN ● Accelerator's baseline ● Efficient design with DNN pruning ● Experimental results ● Conclusions

  17. Hardware Baseline ● UNFOLD: state-of-the-art Viterbi accelerator 10 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  18. Hardware Baseline ● UNFOLD: state-of-the-art Viterbi accelerator st0 st1 10 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  19. Hardware Baseline ● UNFOLD: state-of-the-art Viterbi accelerator st0 st1 10

  20. Hardware Baseline ● UNFOLD: state-of-the-art Viterbi accelerator st0 st1 10 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  21. Hardware Baseline ● UNFOLD: state-of-the-art Viterbi accelerator Likelihoods st0 0.00015 0.31 st1 0.0014 0.0002 0.00005 10 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  22. Hardware Baseline ● UNFOLD: state-of-the-art Viterbi accelerator Likelihoods st0 0.00015 0.31 st1 0.0014 0.0002 0.00005 10 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  23. Hardware Baseline ● UNFOLD: state-of-the-art Viterbi accelerator Hash Bottlenecks Collision handling ● Backup buffer Overflows ● Overflow buffer Access delay ● Backup ● Overflow 10 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  24. Outline ● Motivation ● DNN pruning & Confidence loss ● ASR using pruned DNN ● Accelerator's baseline ● Efficient design with DNN pruning ● Experimental results ● Conclusions

  25. Efficient Hash Design ● Keeping the best N hypotheses at each frame – Known as Histogram Pruning 11 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  26. Efficient Hash Design ● Keeping the best N hypotheses at each frame – Known as Histogram Pruning ● Implementation issue – Sorting tokens at every frame – Expensive: O(m*log(m)) for m hypotheses 11 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  27. Efficient Hash Design ● Keeping the best N hypotheses at each frame – Known as Histogram Pruning ● Implementation issue – Sorting tokens at every frame – Expensive: O(m*log(m)) for m hypotheses ● Our scheme – Loosely keeping N-best using hash mechanism 11 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  28. Efficient Hash Design ● Direct-mapped 12 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  29. Efficient Hash Design ● Direct-mapped ● Way-Associative 12 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  30. Efficient Hash Design ● Our scheme efficiency 13 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  31. Efficient Hash Design ● Way-associative main challenge – Replace when set is full – Finding hypothesis with max cost 14 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  32. Efficient Hash Design ● Way-associative main challenge – Replace when set is full – Finding hypothesis with max cost ● Our solution – Store index of each set based on max-heap – Replace with the root of tree – Updating max-heap fits in one cycle 14 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  33. Outline ● Motivation ● DNN pruning & Confidence loss ● ASR using pruned DNN ● Accelerator's baseline ● Efficient design with DNN pruning ● Experimental results ● Conclusions

  34. Evaluation Methodology ● Cycle-accurate simulation of DNN and Viterbi ● Model accelerator's components in hardware – Verilog implementation of logic parts – Synthesized by design compiler – Cacti: Cache and memory components – Micron: main memory ● Combine simulation results with hardware models – Decoding time – Decoding power and energy consumption – Accelerator's area usage 15 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  35. Accelerator's Parameters ● DNN and Viterbi accelerators 16 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  36. Experiment Configs ● Viterbi Search: – Baseline: Unfold's design – Beam: reduce beam without changing baseline – N-Best: our proposal ● DNN: – Non-pruned version – Pruned version: 70%, 80% and 90% pruning 17 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  37. Experimental Results ● Decoding time 18 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  38. Experimental Results ● Decoding time ● Energy consumption 18 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  39. Experimental Results ● Decoding time ● Energy consumption ● Area usage: 10.74 mm2 (2x reduction) 18 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  40. Outline ● Motivation ● DNN pruning & Confidence loss ● ASR using pruned DNN ● Accelerator's baseline ● Efficient design with DNN pruning ● Experimental results ● Conclusions

  41. Conclusions ● Major side effect of DNN pruning – Confidence loss: top-1's low likelihood ● DNN pruning in ASR systems – 20% confidence loss, 33% slowdown ● Our solution: A novel Viterbi accelerator – Resilient to DNN pruning – Less search activity while maintaining accuracy ● Compared to state-of-art ASR accelerated system – 9x energy-saving, 4.5x speedup, 2x area reduction 19 The Dark Side of DNN Pruning, Session 9A, Wednesday June 6th, ISCA'18

  42. 45 th International Symposium on Computer Architecture, Los Angeles, US, June 2018 The Dark Side of DNN Pruning Reza Yazdani Marc Riera Jose-Maria Arnau Antonio González

Recommend


More recommend