Parallelized Training of Deep NN: Comparison of Current Concepts and Frameworks
Sebastian Jäger, Hans-Peter Zorn, Stefan Igel, Christian Zirpins
Rennes, Dec 10, 2018
Motivation
› Need to scale the training of neural networks horizontally
› Kubernetes-based technology stack
› Compare the scalability of distributed-training concepts and frameworks
Distributed Training Methods: Data Parallelism
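The slides introduce data parallelism only by name, so here is a minimal, framework-agnostic sketch of one synchronous data-parallel step: each worker computes gradients on its own data shard, the gradients are averaged, and every replica applies the same update. The toy linear model, shard layout, and learning rate are illustrative assumptions, not material from the talk.

```python
# Conceptual sketch of synchronous data parallelism (illustrative only).
import numpy as np

def worker_gradient(weights, x_shard, y_shard):
    """Toy linear-regression gradient computed on one worker's data shard."""
    errors = x_shard @ weights - y_shard
    return x_shard.T @ errors / len(x_shard)

def synchronous_step(weights, shards, lr=0.01):
    """Average the per-worker gradients and apply one identical update."""
    grads = [worker_gradient(weights, x, y) for x, y in shards]
    return weights - lr * np.mean(grads, axis=0)
```

In the setups compared on the following slides, a parameter server takes over gradient aggregation and weight storage; the two concepts differ in how that server role is organized (centralized vs. decentralized).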
Data Parallelism: Centralized Parameter Server
TensorFlow: https://www.tensorflow.org
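As a rough illustration of the centralized parameter-server concept with the TensorFlow version used later in the setup (1.8.0), the sketch below uses TF 1.x between-graph replication: variables live on the ps job, and each worker runs its own copy of the graph. Host names, task indices, and the tiny placeholder model are assumptions, not the talk's actual training code.

```python
# Hedged sketch of TensorFlow 1.x training with a central parameter server.
import tensorflow as tf

cluster = tf.train.ClusterSpec({
    "ps": ["ps0:2222"],                          # parameter-server task(s)
    "worker": ["worker0:2222", "worker1:2222"],  # placeholder worker hosts
})
server = tf.train.Server(cluster, job_name="worker", task_index=0)
# A ps task would instead run: tf.train.Server(cluster, "ps", 0).join()

# Variables are placed on the ps job, compute ops on the local worker.
with tf.device(tf.train.replica_device_setter(
        worker_device="/job:worker/task:0", cluster=cluster)):
    x = tf.placeholder(tf.float32, [None, 784])
    y = tf.placeholder(tf.int64, [None])
    logits = tf.layers.dense(x, 10)              # toy model, not LeNet-5
    loss = tf.losses.sparse_softmax_cross_entropy(labels=y, logits=logits)
    step = tf.train.get_or_create_global_step()
    train_op = tf.train.GradientDescentOptimizer(0.01).minimize(
        loss, global_step=step)

with tf.train.MonitoredTrainingSession(master=server.target,
                                       is_chief=True) as sess:
    pass  # sess.run(train_op, feed_dict=...) inside the training loop
```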
Data Parallelism: Decentralized Parameter Server
Apache MXNet: http://mxnet.apache.org
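For the decentralized side, MXNet (version 1.3.0 in the setup) exposes distributed training through its key-value store. The sketch below is a hedged outline, assuming Gluon and a "dist_sync" KVStore; the model, data loading, and how the scheduler/server processes are launched (e.g. on Kubernetes) are placeholders.

```python
# Hedged sketch of MXNet data parallelism via the distributed KVStore.
# Launching requires scheduler/server/worker roles (DMLC env variables),
# which the cluster tooling is assumed to provide.
import mxnet as mx
from mxnet import autograd, gluon

kv = mx.kvstore.create("dist_sync")      # synchronous distributed updates

net = gluon.nn.Dense(10)                 # toy model, not the talk's networks
net.initialize(mx.init.Xavier())
trainer = gluon.Trainer(net.collect_params(), "sgd",
                        {"learning_rate": 0.01}, kvstore=kv)
loss_fn = gluon.loss.SoftmaxCrossEntropyLoss()

# Each worker iterates over its own data shard (e.g. split by kv.rank);
# the Trainer pushes gradients to and pulls weights from the KVStore.
for data, label in []:                   # replace with a sharded DataLoader
    with autograd.record():
        loss = loss_fn(net(data), label)
    loss.backward()
    trainer.step(batch_size=data.shape[0])
```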
Experimental Setup: Environment
› Google Kubernetes Engine
› CPU: 2.6 GHz
› Ubuntu 16.04
› TensorFlow 1.8.0
› MXNet 1.3.0
Experimental Setup: Networks
Convolutional NN (see the sketch below)
› LeNet-5
› 5 layers
› 10 classes
› Fashion-MNIST, 28x28 gray-scale
Recurrent NN
› LSTM
› 2 layers
› 200 units
› Penn Treebank, 1,000,000 words
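The convolutional network from this setup can be written down compactly; below is a hedged Gluon sketch of a LeNet-5-style model for 28x28 gray-scale inputs and 10 classes. The slides only state "5 layers", so the filter counts and activations are assumed from the classic LeNet-5 layout; the 2-layer, 200-unit LSTM is not sketched here.

```python
# Hedged sketch of a LeNet-5-style CNN (5 weight layers, 10 classes).
from mxnet.gluon import nn

def lenet5(num_classes=10):
    net = nn.HybridSequential()
    net.add(
        nn.Conv2D(channels=6, kernel_size=5, padding=2, activation="tanh"),
        nn.AvgPool2D(pool_size=2, strides=2),
        nn.Conv2D(channels=16, kernel_size=5, activation="tanh"),
        nn.AvgPool2D(pool_size=2, strides=2),
        nn.Flatten(),
        nn.Dense(120, activation="tanh"),
        nn.Dense(84, activation="tanh"),
        nn.Dense(num_classes),
    )
    return net
```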
Experimental Setup: Metrics
Results: Convolutional Neural Network
Results: Convolutional Neural Network (cont.)
Results: Recurrent Neural Network
Summarizing the Experiments
The decentralized parameter server ...
› is more robust to increasing communication effort
› scales better for small NNs
For bigger/more complex NNs ...
› no significant difference between the concepts
Conclusion
MXNet ...
› better scalability and throughput for small NNs
› higher throughput for bigger NNs
› less and simpler code
› easier to scale up training
Thank you
Sebastian Jäger (@se_jaeger)
inovex GmbH
Ludwig-Erhard-Allee 6
76131 Karlsruhe
sebastian.jaeger@inovex.de