Tsing nghua hua University versity Introduction Deep learning - PowerPoint PPT Presentation

Deep ep500 500 BOF 2018 Jidong ong Zhai Tsing nghua hua University versity

Introduction • Deep learning has widely used in lots of areas

Introduction • A lot of deep learning frameworks, compute libraries and acceleration devices CNTK Frameworks ··· Compute BLAS ··· Libraries Compute TPU ··· Devices

Introduction • However, how to evaluate? ? ? ? Benchmark CNTK Frameworks ··· Compute BLAS ··· Libraries Compute TPU ··· Devices

Introduction • However, how to evaluate? ? ? ? Benchmark Set Which is better? Optimization Target CNTK Frameworks ··· Compute Running Time BLAS ··· Libraries Resource Use Promote Scalability Development Efficiency Compute … TPU ··· Devices

Related Deep Learning Benchmarks convnet- TensorFlow DeepBench 2 DAWNBench 3 benchmarks 1 Benchmark 4 Framework Compute Library Compute Library Target Framework Compute Library Compute Device Framework Granularity Neural Network Basic Operation Neural Network Neural Network Models Training Low Diversity Diversity Only CNN 2 CNN + 1 RNN 4 CNN Inference CIFAR10 、 ImageNet Limited Dataset Dataset ImageNet Dummy Data ImageNet SQuAD Training Time and Single Metric Metrics Time Per Iteration Time Cost to certain Total Training Time Accuracy 1. convnet-benchmarks: https://github.com/soumith/convnet-benchmarks 2. Baidu DeepBench: https://github.com/baidu-research/DeepBench 3. Cody A. Coleman et al. DAWNBench: An End-to-End Deep Learning Benchmark and Competition . NIPS 2017 4. TensorFlow Benchmark https://www.tensorflow.org/performance/benchmarks

Related Deep Learning Benchmarks MLPerf 1 Framework Evaluation Target Compute Device Granularity Neural Network 1. Image(Classification, Detection) Characteristics 2. NLP(Translation, Sentiment Analysis) Various Applications Diversity 3. Speech(Recognition) 4. Reinforcement Learning & Recommendation Dataset ImageNet, COCO, WMT, Librispeech, MovieLens , … Various Datasets Evaluation Metrics Training Time, Power Use and Cost to certain Accuracy 1. https://mlperf.org/

How to evaluate HPC systems for machine learning?

Our Work on Workload Analysis for Deep Learning • Preliminary workload analysis Applications Image Machine Language Question Classification Translation Model Answering Models VGG ResNet Seq2seq RNN LM AoA Reader WikiText-2 Easy to obtain Cifar Real time Real Data Dummy Data Dataset CBTest Tatoeba Controllable Generative

Our Work • Time • Time of every operation type within one iteration • Time of phases within one iteration Seq2seq AoA Reader RNN LM ResNet VGG 0 100 200 300 400 500 600 700 Time(ms) Data Forward Backward Loss Update

Workload Analysis 18,432 1.0 • Memory Usage 16,384 0.8 Memory Use(MB) 14,336 • Memory Usage Break Down 12,288 0.6 Ratio • Memory Usage – Input Size 10,240 0.4 8,192 6,144 0.2 4,096 2,048 0.0 0 50000 100000 150000 200000 Pic Area(Pixel 2 ) Traning Inference Training/Inference Seq2seq 18,432 1.0 16,384 AoA Reader 0.8 14,336 Memory Use(MB) 12,288 RNN LM 0.6 10,240 Ratio 8,192 0.4 ResNet 6,144 4,096 0.2 VGG 2,048 0 0.0 0 2000 4000 6000 8000 10000 12000 14000 16000 0 200 400 600 800 1000 1200 Memory Use(MB) Sequence Length Weight Mediate Result + Temp Training Inference Training/Inference

Workload Characterization • Hardware Counters • For GPU GPU Warp Execution Warp Non-Pred Execution Bandwidth TFLPOS Occupancy Efficiency Efficiency Utilization Normalized 1 0.46 1.00 1.00 4.02 5.65

Questions about an HPC Oriented Deep Learning Benchmark • Questions we need to think: • Model Selection • Various application areas? • A synthetic model with main features? • Dataset • Fixed data set (Imagenet)? • A Generative Data? • Metrics • Time for training? • Gflops? • AI operations per second?

Tsing nghua hua University versity Introduction Deep learning - PowerPoint PPT Presentation

Deep ep500 500 BOF 2018 Jidong ong Zhai Tsing nghua hua University versity Introduction Deep learning has widely used in lots of areas Introduction A lot of deep learning frameworks, compute libraries and acceleration devices

N.C .C. . A&T A&T STATE TE UNI UNIVE VERSITY: UNIVE UNI VERSITY O OVE VERVI

Commercial MLaaSPlatforms Yun-Yun Tsai & Tsung-Yi Ho National Tsing Hua University #BHUSA

HEI HEIR PROPER ER TY TY Robe ber t t Za Zabawa, Tuskegee egee University versity THE

opportunities of FinTech in the insurance industry Prof. Che Lin National Tsing Hua University

Chapter 4. Markov Chains Prof. Shun-Ren Yang Department of Computer Science, National Tsing Hua

Space GW Detection Proposals Wei-Tou Ni National Tsing Hua University Refs: WTN, GW

2012 International Symposium on Physical Design C. L. Liu National Tsing Hua University Design

Co-authors: C-F Chien, Y-J Chen National Tsing Hua University ISMI 2015, 16 th -18 th Oct. KAIST,

POPULATIONS OF GAMMA-RAY POINT SOURCES Ting-Ni Lu IoA, National Tsing Hua University

Competition and coexistence of two species for one nutrient with internal storage and predation

A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss Project

Very High-Energy Gamma-ray Astronomy Thomas P.H. Tam (National Tsing Hua Univ) University of Hong

National Tsing Hua University Taiwan Advisor: Prof. Jerry Chou Computer Science Department

BI BIODI ODIVERSITY VERSITY CO CONS NSERVATI ERVATION ON IN IN CO COCO COA A PR PROD

UNICA CAC - UNiversi versity ty Coo ooperati peration on Fra rame mework work for or Kn

Convolutional Neural Networks Hwann-Tzong Chen Naitonal Tsing Hua University 3 Januray 2017 1 /

Structured Algorithms for Palindromic Quadratic Eigenvalue Problems : Vibration of Fast Trains

We Weste stern rn Go Gover vernors nors Uni Univer versity sity WGU Background

Scaling Up Passivhaus THE CENTRE TRE FOR MEDIC ICINE, INE, UNIVER VERSITY SITY OF LEICES

mirror --- A retrospect Prof. Shiuh Chao Institute of Photonics Technologies National Tsing Hua

Running vacuum model in non-flat universe Yan-Ting Hsu National Tsing Hua University NCTS Dark

2013 A collaboration betwe ween Prienu Ziburio Gimnazija and The Unive versity of San Diego

Lock H Have ven U Univer versity Introductions Michael Hall, Associate Director of MaryJo

PGY1C PGY 1CaR aRMS Re Results Un Unive versity of To Toronto to Pr Pres esen entatio

Tsing nghua hua University versity Introduction Deep learning - PowerPoint PPT Presentation

Deep ep500 500 BOF 2018 Jidong ong Zhai Tsing nghua hua University versity Introduction Deep learning has widely used in lots of areas Introduction A lot of deep learning frameworks, compute libraries and acceleration devices

N.C .C. . A&amp;T A&amp;T STATE TE UNI UNIVE VERSITY: UNIVE UNI VERSITY O OVE VERVI

Commercial MLaaSPlatforms Yun-Yun Tsai &amp; Tsung-Yi Ho National Tsing Hua University #BHUSA

HEI HEIR PROPER ER TY TY Robe ber t t Za Zabawa, Tuskegee egee University versity THE

opportunities of FinTech in the insurance industry Prof. Che Lin National Tsing Hua University

Chapter 4. Markov Chains Prof. Shun-Ren Yang Department of Computer Science, National Tsing Hua

Space GW Detection Proposals Wei-Tou Ni National Tsing Hua University Refs: WTN, GW

2012 International Symposium on Physical Design C. L. Liu National Tsing Hua University Design

Co-authors: C-F Chien, Y-J Chen National Tsing Hua University ISMI 2015, 16 th -18 th Oct. KAIST,

POPULATIONS OF GAMMA-RAY POINT SOURCES Ting-Ni Lu IoA, National Tsing Hua University

Competition and coexistence of two species for one nutrient with internal storage and predation

A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss Project

Very High-Energy Gamma-ray Astronomy Thomas P.H. Tam (National Tsing Hua Univ) University of Hong

National Tsing Hua University Taiwan Advisor: Prof. Jerry Chou Computer Science Department

BI BIODI ODIVERSITY VERSITY CO CONS NSERVATI ERVATION ON IN IN CO COCO COA A PR PROD

UNICA CAC - UNiversi versity ty Coo ooperati peration on Fra rame mework work for or Kn

Convolutional Neural Networks Hwann-Tzong Chen Naitonal Tsing Hua University 3 Januray 2017 1 /

Structured Algorithms for Palindromic Quadratic Eigenvalue Problems : Vibration of Fast Trains

We Weste stern rn Go Gover vernors nors Uni Univer versity sity WGU Background

Scaling Up Passivhaus THE CENTRE TRE FOR MEDIC ICINE, INE, UNIVER VERSITY SITY OF LEICES

mirror --- A retrospect Prof. Shiuh Chao Institute of Photonics Technologies National Tsing Hua

Running vacuum model in non-flat universe Yan-Ting Hsu National Tsing Hua University NCTS Dark

2013 A collaboration betwe ween Prienu Ziburio Gimnazija and The Unive versity of San Diego

Lock H Have ven U Univer versity Introductions Michael Hall, Associate Director of MaryJo

PGY1C PGY 1CaR aRMS Re Results Un Unive versity of To Toronto to Pr Pres esen entatio

N.C .C. . A&T A&T STATE TE UNI UNIVE VERSITY: UNIVE UNI VERSITY O OVE VERVI

Commercial MLaaSPlatforms Yun-Yun Tsai & Tsung-Yi Ho National Tsing Hua University #BHUSA