GPGPU Introduction (Alan Gray, EPCC, The University of Edinburgh)

Jan 28, 2024

Introduction
• The Central Processing Unit (CPU) of a computer system must be able to perform a wide variety of tasks efficiently.
• Until relatively recently, most CPUs comprised one sophisticated compute core (for arithmetic) plus a complex arrangement of controllers, memory caches, etc.
• Increases in CPU performance were achieved through increases in the clock frequency of the core.
  – This has now reached its limit, mainly due to power requirements.
• Today, processor cores are not getting any faster; instead, we are getting increasing numbers of cores per chip.
  – Plus other forms of parallelism, such as SSE and AVX vector instruction support.
• It is harder for applications to exploit such technology advances.
Introduction
Meanwhile...
• In recent years, the computer gaming industry has driven development of a different type of chip: the Graphics Processing Unit (GPU).
• Silicon is largely dedicated to high numbers (hundreds) of simplistic cores,
  – at the expense of controllers, caches, sophistication, etc.
• GPUs work in tandem with the CPU (communicating over PCIe) and are responsible for generating the graphical output display.
  – Computing pixel values.
• Inherently parallel: each core computes a certain set of pixels.
• The architecture has evolved for this purpose.
Introduction
• GPU performance has been increasing much more rapidly than CPU performance.
• Can we use GPUs for general-purpose computation?
  – Yes (with some effort).
GPGPU
• GPGPU: General Purpose computation on Graphics Processing Units.
• The GPU acts as an “accelerator” to the CPU (heterogeneous system).
  – Most lines of code are executed on the CPU (serial computing).
  – Key computational kernels are executed on the GPU (stream computing),
  – taking advantage of the large number of cores and high graphics memory bandwidth.
  – AIM: the code performs better than with use of the CPU alone.
• GPUs are now firmly established in the HPC industry.
  – Each node of a parallel system can be augmented with GPUs.
GPGPU: Stream Computing
• The data set is decomposed into a stream of elements.
• A single computational function (kernel) operates on each element.
  – A “thread” is defined as the execution of the kernel on one data element.
• Multiple cores can process multiple elements in parallel,
  – i.e. many threads running in parallel.
• Suitable for data-parallel problems.
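The stream computing model above can be illustrated with a minimal sketch. This is a CPU-only emulation in Python, not GPU code: the names `kernel` and `run_stream`, the arithmetic inside the kernel, and the use of a thread pool are all assumptions chosen for illustration. It shows only the shape of the model: one kernel function, applied independently to every element of the stream.

```python
from concurrent.futures import ThreadPoolExecutor


def kernel(x):
    """The single computational function applied to every stream element.

    The arithmetic here is a hypothetical placeholder.
    """
    return 2.0 * x + 1.0


def run_stream(data, workers=4):
    """Apply `kernel` to each element of `data`.

    Each call kernel(x) plays the role of one "thread" in the slide's sense:
    the execution of the kernel on one data element. The calls are
    independent, so they can be handed to a pool of workers in any order.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as the inputs
        return list(pool.map(kernel, data))


if __name__ == "__main__":
    stream = [0.0, 1.0, 2.0, 3.0]
    print(run_stream(stream))  # prints [1.0, 3.0, 5.0, 7.0]
```

Because no element depends on any other, the work is data-parallel; on a GPU the same pattern would be spread across hundreds of cores rather than a handful of CPU worker threads.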
Programming Considerations
• Standard (CPU) code will not run on a GPU unless it is adapted.
• The programmer must
  – decompose the problem onto the hardware in a specific way (e.g. using a hierarchical thread/grid model in CUDA);
  – manage data transfers between the separate CPU and GPU memory spaces.
  – Traditional languages (C, C++, Fortran, etc.) are not enough; extensions, directives, or a new language are needed.
• Once code is ported to the GPU, optimization work is usually required to tailor it to the hardware and achieve good performance.
• Many researchers are now successfully exploiting GPUs
  – across a wide range of application areas.
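The two programmer responsibilities listed above (hierarchical decomposition and explicit data transfers) can be sketched as follows. This is again a serial, CPU-only Python emulation under stated assumptions, not real CUDA: `launch`, `vector_add_kernel` and `vector_add` are hypothetical names, the "device" arrays are ordinary Python lists, and the block size of 4 is arbitrary. Only the index arithmetic mirrors the usual CUDA convention of block index times block size plus thread index.

```python
import math


def vector_add_kernel(block_idx, thread_idx, block_dim, a, b, out):
    """Runs once per logical thread and computes at most one output element."""
    i = block_idx * block_dim + thread_idx  # global element index
    if i < len(out):  # guard: threads in the last block may have no element
        out[i] = a[i] + b[i]


def launch(kernel, grid_dim, block_dim, *args):
    """Emulate a kernel launch by visiting every (block, thread) pair serially.

    On a real GPU these iterations would run in parallel.
    """
    for block_idx in range(grid_dim):
        for thread_idx in range(block_dim):
            kernel(block_idx, thread_idx, block_dim, *args)


def vector_add(host_a, host_b, block_dim=4):
    n = len(host_a)
    # 1. Copy inputs from "host" to "device" memory (here: separate lists).
    dev_a, dev_b = list(host_a), list(host_b)
    dev_out = [0.0] * n
    # 2. Decompose the problem: enough blocks of block_dim threads to cover n.
    grid_dim = math.ceil(n / block_dim)
    launch(vector_add_kernel, grid_dim, block_dim, dev_a, dev_b, dev_out)
    # 3. Copy the result back from "device" to "host".
    return list(dev_out)


if __name__ == "__main__":
    a = [1.0, 2.0, 3.0, 4.0, 5.0]
    b = [10.0, 20.0, 30.0, 40.0, 50.0]
    print(vector_add(a, b))  # prints [11.0, 22.0, 33.0, 44.0, 55.0]
```

With 5 elements and 4 threads per block, two blocks (8 logical threads) are launched and the bounds check leaves the last 3 idle. The explicit copies in steps 1 and 3 stand in for the transfers between the separate CPU and GPU memory spaces.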
