BREAKING THE BARRIERS TO AI SCALE IN THE ENTERPRISE
Charlie Boyle, Senior Director, DGX Systems
DEEP LEARNING DATA CENTER
Reference Architecture
DISTRIBUTED DEEP LEARNING
• Single GPU: 1x GPU
• Data parallel: 4x GPU
• Model parallel: 4x GPU
• Data and model parallel: 16x GPU
[Diagram: model layers A–D replicated and/or split across GPUs in each configuration]
Data & model parallel training yields increasingly faster time-to-solution
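The data-parallel mode above can be sketched in plain Python. This is a toy model (a one-parameter linear fit with squared loss), not the DGX software stack: in a real multi-GPU system the gradient averaging is an NCCL all-reduce across GPU replicas, as the deck notes later.

```python
# Toy sketch of data-parallel training. Assumptions: model y = w*x,
# loss L = mean((w*x - y)^2); each "GPU" is just a shard of the batch.

def grad(w, batch):
    # dL/dw for the squared loss over one shard
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)

def data_parallel_step(w, shards, lr=0.1):
    # Each replica computes a gradient on its own data shard ...
    local_grads = [grad(w, s) for s in shards]
    # ... then an all-reduce-style average keeps every replica in sync.
    g = sum(local_grads) / len(local_grads)
    return w - lr * g
```

With equal-sized shards, averaging per-shard gradients reproduces the full-batch gradient, which is why data parallelism scales without changing the optimization result.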
THE FASTEST PATH TO AI SCALE ON A WHOLE NEW LEVEL
Today's businesses need to scale out AI without scaling up cost or complexity
• Powered by DGX software
• Accelerated AI-at-scale deployment and effortless operations
• Unrestricted model parallelism and faster time-to-solution
DESIGNED TO TRAIN THE PREVIOUSLY IMPOSSIBLE
1. NVIDIA Tesla V100 32GB
2. Two GPU boards: 8 V100 32GB GPUs per board, 6 NVSwitches per board, 512GB total HBM2 memory, interconnected by plane card
3. Twelve NVSwitches: 2.4 TB/sec bisection bandwidth
4. Eight EDR InfiniBand/100 GigE: 1600 Gb/sec total bi-directional bandwidth
5. PCIe switch complex
6. Two Intel Xeon Platinum CPUs
7. 1.5 TB system memory
8. 30 TB NVMe SSDs internal storage
9. Dual 10/25 Gb/sec Ethernet
10X PERFORMANCE GAIN IN LESS THAN A YEAR
Time to train (days):
• DGX-1 with V100 (Sep '17): 15 days
• DGX-2 (Q3 '18): 1.5 days, 10 times faster
Includes software improvements across the stack (NCCL, cuDNN, etc.). Workload: FairSeq, 55 epochs to solution; PyTorch training performance.
NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.
THE WORLD'S FIRST 16 GPU AI PLATFORM
• Revolutionary SXM3 GPU package design
• Innovative 2-GPU board interconnect
• 32GB HBM2 stacked memory per GPU
NVSWITCH: THE REVOLUTIONARY AI NETWORK FABRIC
• Inspired by leading-edge research that demands unrestricted model parallelism
• Like the evolution from dial-up to broadband, NVSwitch delivers a networking fabric for the future, today
• Delivers 2.4 TB/s bisection bandwidth, equivalent to a PCIe bus with 1,200 lanes
• The NVSwitches on DGX-2 could move all of Netflix in HD in under 45 seconds
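A quick sanity check of the "1,200 lanes" equivalence. The assumption here is PCIe Gen3 throughput of roughly 1 GB/s usable per lane per direction, i.e. about 2 GB/s bidirectional per lane; the 2.4 TB/s figure comes from the slide.

```python
# Back-of-the-envelope check (assumption: ~2 GB/s bidirectional per
# PCIe Gen3 lane; 2.4 TB/s bisection bandwidth is from the slide).
PCIE_GEN3_GBPS_PER_LANE = 2            # GB/s bidirectional, approximate
nvswitch_bisection_gbps = 2_400        # 2.4 TB/s expressed in GB/s

equivalent_lanes = nvswitch_bisection_gbps / PCIE_GEN3_GBPS_PER_LANE
print(equivalent_lanes)  # 1200.0, matching the "1,200 lanes" claim
```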
NVME SSD STORAGE
Rapidly ingest the largest datasets into cache
• Faster than SATA SSD, optimized for transferring huge datasets
• Dramatically larger user scratch space
• The protocol of choice for next-gen storage technologies
• 8 x 3.84TB NVMe in RAID 0 (data)
• 25.5 GB/sec sequential read bandwidth (vs. 2 GB/sec for 7TB of SAS SSDs on DGX-1)
LATEST GENERATION CPU AND 1.5TB SYSTEM MEMORY
• Faster, more resilient boot and storage management
• More system memory to handle larger DL and HPC applications
• 2x Intel Skylake Xeon Platinum 8168, 2.7GHz, 24 cores
• 24 x 64GB DIMM system memory
THE ULTIMATE IN NETWORKING FLEXIBILITY
Grow your DL cluster effortlessly, using the connectivity you prefer
• Support for RDMA over Converged Ethernet (RoCE)
• 8 EDR InfiniBand / 100 GigE
• 1600 Gb/sec total bi-directional bandwidth with low latency
• Also supports Ethernet mode: dual 10/25 Gb/sec
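The 1600 Gb/sec figure follows directly from the port count: eight adapters, each running EDR InfiniBand or 100 GigE at 100 Gb/s per direction, counted bidirectionally.

```python
# Where the 1600 Gb/s total comes from (assumption: bandwidth is counted
# bidirectionally, i.e. send + receive per adapter).
ports = 8                   # eight EDR IB / 100 GigE adapters
gbps_per_direction = 100    # 100 Gb/s each way per adapter

total_bidirectional_gbps = ports * gbps_per_direction * 2
print(total_bidirectional_gbps)  # 1600
```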
FLEXIBILITY WITH VIRTUALIZATION
Enable your own private DL training cloud for your enterprise
• KVM hypervisor for Ubuntu Linux
• Enable teams of developers to simultaneously access DGX-2
• Flexibly allocate GPU resources to each user and their experiments
• Full GPU and NVSwitch access within VMs, either all GPUs or as few as 1
KUBERNETES ON NVIDIA GPUs
Container orchestration for DL training & inference
• Scale up to thousands of GPUs instantly
• Self-healing cluster orchestration
• GPU-optimized out of the box
• Powered by NVIDIA Container Runtime
• Included with enterprise support on DGX
• Available end of April 2018
[Stack diagram: NVIDIA GPU Cloud containers, orchestrated by Kubernetes, on the NVIDIA Container Runtime over NVIDIA GPUs; runs on AWS EC2 | GCP | Azure | DGX]
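As an illustration of how Kubernetes schedules work onto NVIDIA GPUs, here is a pod manifest sketched as a Python dict. GPUs are requested through the NVIDIA device plugin's `nvidia.com/gpu` extended resource; the pod name and image tag are placeholder assumptions, not values from the deck.

```python
# Hedged sketch: a Kubernetes pod manifest requesting 4 GPUs via the
# NVIDIA device plugin's "nvidia.com/gpu" resource name. The pod name
# and image tag below are illustrative assumptions.
pod = {
    "apiVersion": "v1",
    "kind": "Pod",
    "metadata": {"name": "dl-training"},            # hypothetical name
    "spec": {
        "containers": [{
            "name": "trainer",
            "image": "nvcr.io/nvidia/pytorch:latest",  # assumed NGC image tag
            "resources": {"limits": {"nvidia.com/gpu": 4}},
        }]
    },
}
print(pod["spec"]["containers"][0]["resources"]["limits"])
```

Because GPUs are exposed as a schedulable resource, the cluster can pack jobs across DGX nodes or cloud GPU instances without the job needing to know where it runs.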
NVIDIA DGX-2
LIMITLESS DEEP LEARNING FOR EXPLORATION WITHOUT BOUNDARIES
The World's Most Powerful Deep Learning System for the Most Complex Deep Learning Challenges
• Performance to Train the Previously Impossible
• Revolutionary AI Network Fabric
• Fastest Path to AI Scale
• Powered by NVIDIA GPU Cloud
For More Information: nvidia.com/dgx-2