neuroscience
play

Neuroscience The Study of Neurons and Brains 2 /32 DNN did this - PowerPoint PPT Presentation

Flexon: A Flexible Digital Neuron for Efficient Spiking Neural Network Simulations Dayeol Lee , Gwangmu Lee * , Dongup Kwon * , Sunghwa Lee * , Youngsok Kim * , and Jangwoo Kim * * Dept. of Electrical and Computer Engineering, Seoul National


  1. Flexon: A Flexible Digital Neuron for Efficient Spiking Neural Network Simulations Dayeol Lee † , Gwangmu Lee * , Dongup Kwon * , Sunghwa Lee * , Youngsok Kim * , and Jangwoo Kim * * Dept. of Electrical and Computer Engineering, Seoul National University † Dept. of Electrical Engineering and Computer Sciences, University of California, Berkeley

  2. Recognition Degeneration Parkinson's Disease, Object detection, CJD, Dementia Classification Consciousness Emotion Morality, Sympathy, Social Value Happiness, Apathy Neuroscience The Study of Neurons and Brains 2 /32

  3. DNN did this Recognition Degeneration Parkinson's Disease, Object detection, CJD, Dementia Classification Consciousness Emotion Morality, Sympathy, Social Value Happiness, Apathy Neuroscience The Study of Neurons and Brains Unexplored yet 3 /32

  4. Modeling a Brain: Spiking Neural Network How to compute spiking neural network? for each time-step: Synapse Spike Gen. Spike 1 Gen. Time-step Spike Neuron Neuron 4 /32 Generator

  5. Modeling a Brain: Spiking Neural Network How to compute spiking neural network? for each time-step: 1) stimulus generation Synapse Spike Gen. Spike 1 Gen. Time-step Spike Neuron Neuron 5 /32 Generator

  6. Modeling a Brain: Spiking Neural Network How to compute spiking neural network? for each time-step: 1) stimulus generation 2) neuron computation Synapse Spike Gen. Spike 1 Gen. Time-step Spike Neuron Neuron 6 /32 Generator

  7. Modeling a Brain: Spiking Neural Network How to compute spiking neural network? for each time-step: 1) stimulus generation 2) neuron computation Synapse 3) synapse calculation Spike Gen. … Spike 1 Gen. Time-step Spike Neuron Neuron 7 /32 Generator

  8. Where Does Time Go? 10 Representative Benchmarks on CPU/GPU CPU: Intel Xeon E5-2630 v4 CPU (12-core, 2.2 GHz) / GPU: NVIDIA Titan X (Pascal) GPU ~50% of overheads coming from Neuron Computation 8 /32

  9. How Does a Neuron Behave? Threshold Membrane Voltage Resting (millivolt scale) Potential Time (millisecond scale) Neuron Biological Neuron 9 /32

  10. How Does a Neuron Behave? Threshold Membrane Voltage Resting (millivolt scale) Potential Time (millisecond scale) Neuron Biological Neuron 10 /32

  11. How Does a Neuron Behave? Threshold Membrane Voltage Resting (millivolt scale) Potential Time (millisecond scale) Neuron Biological Neuron 11 /32

  12. How Does a Neuron Behave? Threshold Membrane Voltage Resting (millivolt scale) Potential Time (millisecond scale) Neuron Biological Neuron 12 /32

  13. Various Neuron Behaviors Membrane Voltage (millivolt scale) Time Input spike (millisecond scale) Membrane decay Spike inhibition Spike initiation accumulation Dendrite Soma Axon Tons of variants exist, depending on their feature set. We need to support various features for accurate brain simulations. Biological Neuron 13 /32

  14. Solutions and Limitations Flexibility Custom Software Hardware Accuracy Simulation Framework High Performance Low Energy 14 /32

  15. Design Goals & Key Ideas Hardware-based Goal 1 High Performance Feature-driven Goal 2 High Flexibility Spatially-folded Goal 3 Low cost 15 /32

  16. Neuron Feature #1 : Input Spike Accumulation Conductance-based Conductance-based Current-based (Exponential-shaped) (Alpha function-shaped) 16 /32

  17. Neuron Feature #1 : Input Spike Accumulation Conductance-based Conductance-based Current-based (Exponential-shaped) (Alpha function-shaped) 17 /32

  18. Neuron Feature #2: Spike Initiation (without feature) (with feature) Exponential Quadratic 18 /32

  19. Neuron Feature #3: Spike-triggered Current (without) (with) (without) (with) Sub-threshold Oscillation Adaptation 19 /32

  20. Neuron Feature #4: Refractory Period (without) (with) (without) (with) Relative Absolute 20 /32

  21. Neuron Feature: Flexible Feature Support “Feature-driven” Flexon supports 11 major neuron models ( LLIF, SLIF, DSRM0, DLIF, QIF, EIF, Izhikevich, AdEx, …) (without) (with) (without) (with) Relative Absolute 21 /32

  22. Evaluation ( 12x Feature-driven Design) 8 CPU + 2 GPU Representative Benchmarks Flexon : TSMC 45nm, Synopsys Design Compiler (neuron), CACTI 6.5 (SRAM) Brette et al. Brunel 87.4x CPU Destexhe-LTS Destexhe-UpDown CPU Over CPU Muller et al. Intel Xeon E5-2630 v4 Potjans-Diesmann (12-core, 2.2 GHz) Vogels-Abbott 8.19x Vogels et al. Geomean Izhikevich Over GPU GPU GPU Nowotny et al. NVIDIA Titan X (Pascal) Geomean 1 10 100 1000 Speed-up (Normalized to the baseline) 22 /32

  23. Evaluation ( 12x Feature-driven Design) 8 CPU + 2 GPU Representative Benchmarks Flexon : TSMC 45nm, Synopsys Design Compiler (neuron), CACTI 6.5 (SRAM) Brette et al. Brunel 6,186x CPU Destexhe-LTS Destexhe-UpDown CPU Over CPU Muller et al. Intel Xeon E5-2630 v4 Potjans-Diesmann (12-core, 2.2 GHz) Vogels-Abbott 422x Vogels et al. Geomean Izhikevich Over GPU GPU GPU Nowotny et al. NVIDIA Titan X (Pascal) Geomean 1 10 100 1000 10000 100000 Energy Efficiency (Normalized to the baseline) 23 /32

  24. Intrinsic Space-inefficiency Lots of redundant units (multiplier, adder, …) “Feature-driven” Flexon supports 11 major neuron models ( LLIF, SLIF, DSRM0, DLIF, QIF, EIF, Izhikevich, AdEx, …) (without) (with) (without) (with) Absolute Relative 24 /32

  25. Constructing Spatially-folded Flexon Spatially-folded design è reduce area - Remove redundant MAC operators Conductance-based (Exponential) Adaptation Quadratic 25 /32

  26. Constructing Spatially-folded Flexon Spatially-folded design è reduce area - Remove redundant MAC operators Modifications from the baseline - 2-stage pipeline, multi-cycle implementation Relative Exponential (ADT: Adaptation) 26 /32

  27. Constructing Spatially-folded Flexon Spatially-folded design è reduce area - Remove redundant MAC operators What we should change - 2-stage pipeline, multi-cycle implementation “Spatially-folded” Flexon supports various major neuron models 6x area saving 27 /32

  28. Evaluation ( 72x Spatially-folded Design ) 8 CPU + 2 GPU Representative Benchmarks Flexon : TSMC 45nm, Synopsys Design Compiler (neuron), CACTI 6.5 (SRAM) Brette et al. Brunel 122x 9.83x CPU Destexhe-LTS Destexhe-UpDown Over CPU Over GPU CPU Muller et al. Intel Xeon E5-2630 v4 Potjans-Diesmann (12-core, 2.2 GHz) Vogels-Abbott Vogels et al. Feature-driven Geomean Spatially-folded Izhikevich GPU GPU Nowotny et al. NVIDIA Titan X (Pascal) Geomean 1 10 100 1000 Speed-up (Normalized to the baseline) 28 /32

  29. Evaluation ( 72x Spatially-folded Design ) 8 CPU + 2 GPU Representative Benchmarks Flexon : TSMC 45nm, Synopsys Design Compiler (neuron), CACTI 6.5 (SRAM) Brette et al. Brunel 5,413x 135x CPU Destexhe-LTS Destexhe-UpDown Over CPU Over GPU CPU Muller et al. Intel Xeon E5-2630 v4 Potjans-Diesmann (12-core, 2.2 GHz) Vogels-Abbott Vogels et al. Geomean Feature-driven Izhikevich GPU GPU Nowotny et al. Spatially-folded NVIDIA Titan X (Pascal) Geomean 1 10 100 1000 10000 100000 Energy Efficiency (Normalized to the baseline) 29 /32

  30. Baseline Flexon vs. Spatially-folded Flexon • Baseline “Feature-driven” Flexon − Fast: 87.4x over CPUs, 8.19x over GPUs − Energy-efficient: 6,186x over CPUs, 422x over GPUs • “Spatially-folded” Flexon − Fast: 122x over CPUs, 9.83x over GPUs − Energy-efficient: 5,413x over CPUs, 135x over GPUs 30 /32

  31. Conclusion • Flexon is a flexible feature-driven digital neuron design, capable of realizing various major neuron models. − Flexible & power-efficient (6,186x over CPU) • Spatially-folded Flexon makes features share units, reducing 6x circuit area. − Flexible & fast when integrated (122x over CPU) 31 /32

  32. Flexon A Flexible Digital Neuron for Efficient Spiking Neural Network Simulations Thank you for listening 32 /32

Recommend


More recommend