Towards 1000x with Heterogeneous, Programmable Hardware Datacenter - PowerPoint PPT Presentation

Apr 17, 2023 •183 likes •295 views

Towards 1000x with Heterogeneous, Programmable Hardware Datacenter Name: Anton Burtsev, UC Irvine Summary: 1 Related work: What will hardware look like in 10-20 years? Massively heterogeneous Not just many-cores GPUs, Xeon

Towards 1000x with Heterogeneous, Programmable Hardware Datacenter ● Name: Anton Burtsev, UC Irvine ● Summary: 1 ● Related work:
What will hardware look like in 10-20 years? ● Massively heterogeneous ○ Not just many-cores ■ GPUs, Xeon Phi, Tilera TILE, PowerEN ○ But also ■ Fine-grained hardware ASICs accelerators ■ Programmable hardware (FPGA) 2
Ubiquitous, fine-grained, heterogeneous hardware-acceleration ● Execution will no longer stay on 1 CPU 3
Ubiquitous, fine-grained, heterogeneous hardware-acceleration ● A chain of hardware accelerators (ASIC/FPGA) ■ On-chip, and over PCIe ○ Co-located with storage and network devices ● A single machine is a distributed system ○ Yet you have to use it efficiently 4
Even your memory is distributed ● Your memory is not local either ● We will see large memories ○ 6TB are possible today (Dell R930, 96x64GB DIMMs) ○ 10x higher density in the near future [Meena et al.] ■ ~100TB of NVM on the memory bus ■ 20-80 ns latency of access 5
Big/New Ideas of 1000x ● Your biggest problem is ... ○ Latency and parallelism ■ Sent a request to another core/accelerator ● 355ns on a cache-coherent Intel HARP [Choi, DAC’16] ■ Have to find something to do… ○ Parallelism ■ Expressing, and running the graph of the computation on a set of execution units 6
Big/New Ideas of 1000x ● Your have more problems... ○ Reliability ■ A single bug can destroy your in-memory dataset ● 100TB of non-volatile memory are cache-coherent Any FPGA unit, or core can wipe it ● 7
Indicated R&D for 1000x ● OS/VMM support for heterogeneous hardware ○ Novel execution runtime ■ Spatial scheduling, preemption, load-balancing ● Sharing across multiple users One host and in a virtual datacenter ● ■ Unified OS platform for GPU, multi-cores, FPGA Proprietary stacks and device drivers should go… ● Direct (low-latency) access to hardware ● 8
Indicated R&D for 1000x ● Language support ○ Programmable hardware ■ C/C++/Rust to FPGA ○ Parallelism ■ Async & delegate [Grappa, USENIX’16] ● Works good for analytical workloads ■ Streaming languages ■ Your favorite model here Well, MPI will work too ● 9
Questions for the Software Institute ● Analyze potential performance gains for HEP workloads ■ Assume a clean-slate ideal slate software stack ■ Only hardware limitations ■ Can we get to 1000x? ■ What are the bottlenecks? 10
Questions for the Software Institute ● Encouraging example: ○ D.E. Shaw Anton/Anton 2 dynamic molecular simulation machine ■ Custom ASIC ■ 1000x speedup ● Same acceleration is possible for HEP 11

Recommend

Workloads with Heterogeneous Programmable Datacenters Anton Burtsev, Alex Veidenbaum

Towards 1000x Speedup for HEP Workloads with Heterogeneous Programmable Datacenters Anton Burtsev, Alex Veidenbaum aburtsev@uci.edu, alexv@ics.uci.edu University of California, Irvine March, 2018 Compute Ex #1: Exploratory Data Analysis

382 views • 23 slides

ROMs, PLAs and FPGAs October 5, 2006 Typeset by Foil T EX Why Programmable Logic?

ROMs, PLAs and FPGAs October 5, 2006 Typeset by Foil T EX Why Programmable Logic? Programmable logic technologies: Read-Only Memory (ROM) Programmable Logic Array (PLA) Programmable Array Logic (PAL) Field Programmable

284 views • 13 slides

Hardware Observability Framework Hardware Observability Framework Hardware Observability

Hardware Observability Framework Hardware Observability Framework Hardware Observability Framework Hardware Observability Framework Hardware Observability Framework Hardware Observability Framework Hardware Observability Framework Hardware

746 views • 22 slides

Programmable Switch Hardware ECE/CS598HPN Radhika Mittal Conventional SDN Programmable

Programmable Switch Hardware ECE/CS598HPN Radhika Mittal Conventional SDN Programmable control plane . Data plane can support high bandwidth. But has limited flexibility. Restricted to conventional packet protocols. Software

560 views • 40 slides

PROGRAMMABLE LOGIC CONTROLLER Control Systems Types Programmable Logic Controllers

PROGRAMMABLE LOGIC CONTROLLER Control Systems Types Programmable Logic Controllers Distributed Control System PC- Based Controls Programmable Logic Controllers PLC Sequential logic solver PID Calculations. Advanced

297 views • 27 slides

Field Programmable Gate Arrays by Ketil Red Field Programmable Gate Array Integrated

A brief introduction to Field Programmable Gate Arrays by Ketil Red Field Programmable Gate Array Integrated circuit including a matrix of general-purpose programmable logic I I I I I I I I I I I I I I I I I I I I

440 views • 14 slides

Coverage in Heterogeneous Coverage in Heterogeneous Networks Xiaoli Chu King s College

Coverage in Heterogeneous Coverage in Heterogeneous Networks Xiaoli Chu King s College London UC4G Beijing Workshop August 2010 Outline Introduction Heterogeneous networks Heterogeneous networks Challenges Coverage in

346 views • 32 slides

A Crash Course on Programmable Graphics Hardware Li-Yi Wei Microsoft Research Asia Abstract

A Crash Course on Programmable Graphics Hardware Li-Yi Wei Microsoft Research Asia Abstract Application Recent years have witnessed tremendous growth for programmable graphics hardware (GPU), both in terms of performance and func- tionality.

331 views • 5 slides

VC. VC. Hardware Startup The Hardware Revolu/on The Hardware Revolution Removing Barriers to

The Business of Making Strategies for Success from Startup to Exit Hardware and Robotic Startup Accelerator what hardware used to be . VC. VC. Hardware Startup The Hardware Revolu/on The Hardware Revolution Removing Barriers to Entry Open

606 views • 32 slides

Sec Secure ure Hardware Hardware and Hardware and Hardware- En Enabled abled Security

SP SPACE ACE 201 2016 Sec Secure ure Hardware Hardware and Hardware and Hardware- En Enabled abled Security Security: : New Front New Frontiers iers Swarup Bhunia Professor Electrical & Computer Engineering SPACE | Dec 2016 1

610 views • 29 slides

12.2 Programmable Graphics Hardware Kyle Morgenroth http://cs420.hao-li.com 1

Fall 2018 CSCI 420: Computer Graphics 12.2 Programmable Graphics Hardware Kyle Morgenroth http://cs420.hao-li.com 1 Introduction Recent major advance in real time graphics is the programmable pipeline: - First introduced by NVIDIA

717 views • 35 slides

12.2 Programmable Graphics Hardware Kyle Olszewski http://cs420.hao-li.com 1 Introduction

Fall 2017 CSCI 420: Computer Graphics 12.2 Programmable Graphics Hardware Kyle Olszewski http://cs420.hao-li.com 1 Introduction Recent major advance in real time graphics is the programmable pipeline: - First introduced by NVIDIA

622 views • 35 slides

8.2 Programmable Graphics Hardware Kyle Olszewski http://cs420.hao-li.com 1 Introduction

Fall 2014 CSCI 420: Computer Graphics 8.2 Programmable Graphics Hardware Kyle Olszewski http://cs420.hao-li.com 1 Introduction Recent major advance in real time graphics is the programmable pipeline: - First introduced by NVIDIA

694 views • 35 slides

Unifying Heterogeneous Cray Unifying Heterogeneous Cray Resources and Systems into an

Unifying Heterogeneous Cray Unifying Heterogeneous Cray Resources and Systems into an Intelligent Single-scheduled Environment Scott Jackson Engineering Confidential and Proprietary Overview Introduction Heterogeneous Resources

692 views • 34 slides

Regulatory Guidance on the Use of Field Programmable Gate of Field Programmable Gate Arrays in

Regulatory Guidance on the Use of Field Programmable Gate of Field Programmable Gate Arrays in the U.S. October 13, 2015 Steven A. Arndt, Ph.D., P.E. Office of Nuclear Reactor Regulation The views expressed in this presentation are solely

542 views • 23 slides

Outline FPGA clocking Programmable clocks Dynamic programmable oscillators EMI

Enhance FPGA- -based Systems with based Systems with Enhance FPGA Programmable Oscillators Programmable Oscillators Sassan Tabatabaei Sassan Tabatabaei Director, Strategic Applications Director, Strategic Applications SiTime Corporation

320 views • 11 slides

Low mass dark matter Christopher M c Cabe Effective Theories and Dark Matter, Mainz 19 th

Low mass dark matter Christopher M c Cabe Effective Theories and Dark Matter, Mainz 19 th March 2015 1. General considerations 2. A peculiar neutralino model Results from: Boehm, Dolan, CM, Increasing N eff with particles in thermal

485 views • 32 slides

Prospects for dark matter detection with inelastic transitions of xenon Christopher M c Cabe

Prospects for dark matter detection with inelastic transitions of xenon Christopher M c Cabe preliminary results work in progress TeVPA, Tokyo, Japan - 27th October 2015 An old idea The original direct detection paper: Christopher

445 views • 30 slides

DJ Distributed JIT Matthew Francis-Landau UC Berkeley September, 2015 Structure of DJ

DJ Distributed JIT Matthew Francis-Landau UC Berkeley September, 2015 Structure of DJ Runtime Performs dynamic code rewriting Remote memory access Distributed locks **Ensure correctness of program regardless of distributed

497 views • 21 slides

Dark Matter Detection with Angular Power Spectrum Marco Chianese 5 March 2020, 1st Joint

Dark Matter Detection with Angular Power Spectrum Marco Chianese 5 March 2020, 1st Joint Nikhef+Grappa Neutrino Meeting MC, Fiorillo, Miele, Morisi, Pisanti, JCAP 1911 [arXiv:1907.11222] Dekker, MC, Ando, arXiv:1910:12917 <latexit

754 views • 46 slides

Introductory talk on Beyond the Standard Models of Particle Physics and Cosmology Steve King

Introductory talk on Beyond the Standard Models of Particle Physics and Cosmology Steve King SHEP BSM Group 18/19 Steve King Rome Samanta Ye-Ling Zhou Pasquale Di Bari Josu Adam Adam Hernandez Huchan Lee Murphy Forster Susana Elena

355 views • 22 slides

Comparative Performance and Optimization of Chapel in Modern Manycore Architectures* Engin

Comparative Performance and Optimization of Chapel in Modern Manycore Architectures* Engin Kayraklioglu , Wo Chang, Tarek El-Ghazawi *This work is partially funded through an Intel Parallel Computing Center gift. Outline Introduction &

868 views • 48 slides

A new era in the quest for Dark Matter Gianfranco Bertone GRAPPA center of excellence, U. of

A new era in the quest for Dark Matter Gianfranco Bertone GRAPPA center of excellence, U. of Amsterdam Astrophysics and MAGIC workshop, 26-29 June 2018 ~ based on a review article (to appear soon!) with T. Tait A problem with a long history

608 views • 36 slides

T h e a n g u l a r p o we r s p e c t r u m a n d e R O S I T A '

T h e a n g u l a r p o we r s p e c t r u m a n d e R O S I T A ' s p o t e n t i a l r o l e f o r s t e r i l e n e u t r i n o s e a r c h e s Christoph Weniger GRAPPA,

440 views • 17 slides