GPU-Based Deep Learning in Cloud and Embedded Systems - PowerPoint PPT Presentation

Aug 16, 2022 •138 likes •320 views

GPU-Based Deep Learning in Cloud and Embedded Systems
Frederick Soo, CTO
April 4, 2016
Confidential and Proprietary
Nauto is launching a connected camera for professional drivers
• Drive more than most consumers
• Exposed to passenger and driver liability
• Driver quality unknown - small number of very bad drivers
Massive shift in transportation due to synergistic technologies
• Autonomous: 90% reduction in accidents
• Connected: fleet optimization
• Shared: 50-70% utilization
• Electric: 85% efficient drivetrain
• $0.08 / mile
Why use deep learning?
• Good at visual tasks
• Scalable
Most important for NAUTO
• Deployable
Small brains have a lot of functionality
• 1 million neurons: 1 mW
• 10 million neurons: 10 mW
• 100 million neurons: 100 mW
• 26 billion neurons: 20 watts
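The figures on this slide can be checked with a little arithmetic. The sketch below assumes the pairing of neuron counts and power budgets reconstructed from the flattened slide text (each count paired with the power value next to it); under that assumption, power per neuron stays close to 1 nW at every scale.

```python
# (neuron count, power in watts) pairs, as read from the slide.
# The pairing is an assumption reconstructed from the flattened text.
BRAINS = [
    (1e6, 1e-3),      # 1 million neurons, 1 mW
    (10e6, 10e-3),    # 10 million neurons, 10 mW
    (100e6, 100e-3),  # 100 million neurons, 100 mW
    (26e9, 20.0),     # 26 billion neurons, 20 watts
]


def nanowatts_per_neuron(neurons: float, watts: float) -> float:
    """Power budget per neuron, expressed in nanowatts."""
    return watts / neurons * 1e9


per_neuron_nw = [nanowatts_per_neuron(n, w) for n, w in BRAINS]

for (neurons, watts), nw in zip(BRAINS, per_neuron_nw):
    print(f"{neurons:>14,.0f} neurons  {watts:>7.3f} W  {nw:.2f} nW/neuron")
```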
Required performance depends on use case
Small changes in F1 with size
• Order of magnitude improvements in speed with basic exploration
• Always worth measuring performance/size tradeoff
• Large networks can be used in later stages of cascade
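The cascade idea in the last bullet can be sketched as follows. This is a minimal illustration, not Nauto's implementation: `cascade_predict`, the `low`/`high` thresholds, and the two model callables are all hypothetical names, and each model is assumed to return a score between 0 and 1. The small network handles every frame; the large network runs only when the small one is unsure.

```python
from typing import Callable, List, Sequence


def cascade_predict(
    frames: Sequence[object],
    small_model: Callable[[object], float],
    large_model: Callable[[object], float],
    low: float = 0.2,
    high: float = 0.8,
) -> List[bool]:
    """Return one decision per frame.

    The small model scores every frame. Only frames whose score falls
    strictly between `low` and `high` are passed to the large model.
    """
    decisions = []
    for frame in frames:
        score = small_model(frame)
        if low < score < high:
            # Uncertain case: pay for the slower, larger network.
            score = large_model(frame)
        decisions.append(score >= 0.5)
    return decisions
```

With this structure the expensive network's cost scales with the number of uncertain frames rather than with the full frame rate, which is one reason to measure the performance/size tradeoff before choosing a single network.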
Test your chipsets - algorithm speed important but not entire story
• Chipsets released in 2014, 2015 and 2016
• Pricing varying from $25 to $60+
• Varying degrees of HW/SW support
[Bar chart: Nauto CNN forward pass (msec), axis 0 to 150, for embedded SoCs A through E]
Algorithm is not the bottleneck
• Image processing: … msec
• Conversion to CNN space: 15 msec
• CNN forward pass: 30 msec
• Other steps: 30 msec
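One way to get a per-stage breakdown like the one on this slide is to time each stage of the pipeline separately. The sketch below is illustrative only: `time_stages` and `share_of_total` are hypothetical helpers, and the stage functions passed in stand for whatever the real pipeline runs.

```python
import time
from typing import Callable, Dict, List, Tuple


def time_stages(
    stages: List[Tuple[str, Callable[[object], object]]],
    frame: object,
    repeats: int = 10,
) -> Dict[str, float]:
    """Run the stages in order and return mean milliseconds per stage."""
    totals = {name: 0.0 for name, _ in stages}
    for _ in range(repeats):
        data = frame
        for name, fn in stages:
            start = time.perf_counter()
            data = fn(data)  # output of one stage feeds the next
            totals[name] += (time.perf_counter() - start) * 1000.0
    return {name: total / repeats for name, total in totals.items()}


def share_of_total(timings: Dict[str, float]) -> Dict[str, float]:
    """Fraction of total pipeline time spent in each stage."""
    total = sum(timings.values())
    if total == 0:
        return {name: 0.0 for name in timings}
    return {name: t / total for name, t in timings.items()}
```

Using only the three stage times that are legible on the slide (15, 30 and 30 msec), the CNN forward pass accounts for 30 of 75 msec, or 40%. Because the image processing time is elided in the source, the true share is at most 40%.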
Entire system must be optimized
                   Collect data    Train     Label    Deploy
  Pre-GPU          months/years    months    years    months
  Post-GPU         months/years    months    weeks    months
  Nauto prototype  weeks           weeks     days     weeks
  Nauto at-scale   ?               ?         ?        ?
Easy to think of optimization; hard to think of system
"Programmers waste enormous amounts of time thinking about, or worrying about, the speed of noncritical parts of their programs, and these attempts at efficiency actually have a strong negative impact when debugging and maintenance are considered. We should forget about small efficiencies, say about 97% of the time: premature optimization is the root of all evil. Yet we should not pass up our opportunities in that critical 3%."
Donald Knuth
Lessons
• Match algorithm performance to use case
• Embedded pipeline is as important as raw CNN performance
• Overall system performance (data acquisition, labeling, training) is where big progress is to be made
The future is in distributed awareness
• Real world search
Team
• Ludmila Levkova
• Nikhil Deshmukh
• Joe Virzi
• Jonathan Soo
