Extending MPI to Accelerators*
Jeff A. Stuart, John D. Owens
University of California, Davis
cpunerd@gmail.com, jowens@ece.ucdavis.edu
Pavan Balaji
Argonne National Laboratory
balaji@mcs.anl.gov
* For this presentation, "accelerators" means GPUs
Outline ● Motivation ● Previous Work ● Proposal ● Challenges
Motivation ● HPC is no longer (just) CPUs ● GPUs have problems ● Slave device (the host must drive all communication) ● No system calls (a kernel cannot invoke MPI itself)
Previous Work ● Three Main Works ● cudaMPI ● GAMPI ● DCGN
Previous Work ● cudaMPI ● Handles buffer movement ● No ranks for GPUs
Previous Work ● GAMPI ● GPUs have ranks* ● More communicators ● Handles buffer movement
Previous Work ● DCGN ● GPUs have ranks ● GPUs source/sink communication* ● Doesn't implement standard MPI
Proposal ● Several Ideas ● No Ranks for GPUs ● Multiple Ranks per GPU Context ● One Rank per GPU Context ● New MPI Function(s) to Spawn Kernels
Proposal ● No Ranks for GPUs ● The way things work right now ● No changes necessary to MPI
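A minimal sketch of this status quo, assuming one GPU per MPI process; buffer names and sizes are illustrative. The host process owns the rank and must stage device data through host memory before MPI can move it:

    /* Status quo: only the host process has an MPI rank; GPU data is
     * staged through host memory around every MPI call. */
    #include <mpi.h>
    #include <cuda_runtime.h>
    #include <stdlib.h>

    int main(int argc, char **argv)
    {
        MPI_Init(&argc, &argv);
        int rank;
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);

        const int n = 1 << 20;
        float *h_buf = (float *)malloc(n * sizeof(float));
        float *d_buf;
        cudaMalloc((void **)&d_buf, n * sizeof(float));

        if (rank == 0) {
            /* ... a kernel fills d_buf ... */
            cudaMemcpy(h_buf, d_buf, n * sizeof(float), cudaMemcpyDeviceToHost);
            MPI_Send(h_buf, n, MPI_FLOAT, 1, 0, MPI_COMM_WORLD);
        } else if (rank == 1) {
            MPI_Recv(h_buf, n, MPI_FLOAT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            cudaMemcpy(d_buf, h_buf, n * sizeof(float), cudaMemcpyHostToDevice);
            /* ... a kernel consumes d_buf ... */
        }

        cudaFree(d_buf);
        free(h_buf);
        MPI_Finalize();
        return 0;
    }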
Proposal
● Multiple Ranks Per Accelerator Context
● Ranks exist for the lifetime of the application
  – Number of ranks chosen at runtime by the user
● Modifications to MPI
  – Bind GPU threads to ranks; MPI functions take a source rank
  – Host must listen for requests
● Extra threads on the CPU (one for each GPU)
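An interface sketch of how this proposal could look from device code. None of these functions exist: the MPIX_* names, their signatures, and the one-rank-per-thread-block mapping are assumptions made purely to illustrate the idea.

    /* Hypothetical device-side interface for the multiple-ranks-per-
     * context proposal; the MPIX_* names and signatures are
     * illustrative only, not real MPI. */
    #include <cuda_runtime.h>

    /* Many ranks share one GPU context, so each call names its source
     * (or destination) rank explicitly rather than relying on the
     * identity of the owning process. */
    __device__ int MPIX_Send_from(int src_rank, const void *buf, int count,
                                  int type, int dest, int tag, int comm);
    __device__ int MPIX_Recv_at(int dst_rank, void *buf, int count,
                                int type, int src, int tag, int comm);

    __global__ void exchange(float *buf, int count, int first_rank)
    {
        /* Assumed mapping: one MPI rank per thread block. */
        int my_rank = first_rank + blockIdx.x;

        if (threadIdx.x == 0) {
            if (my_rank % 2 == 0)
                MPIX_Send_from(my_rank, buf, count, /*type*/ 0, my_rank + 1, 0, 0);
            else
                MPIX_Recv_at(my_rank, buf, count, /*type*/ 0, my_rank - 1, 0, 0);
        }
    }

    /* On the host, one extra CPU thread per GPU polls for these requests
     * and performs the real MPI communication on the ranks' behalf. */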
Proposal ● One Rank per Accelerator Context ● Ranks exist for lifetime of application ● What is the mapping of processes to contexts? ● Can CPU processes use MPI communication?
Proposal
● New MPI Function(s) to Spawn Kernels
● New communicators and ranks after every spawn
  – Cleaned up after all kernels finish
● Intercommunicator(s) available upon request
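A host-side sketch of what such a function might look like. MPIX_Kernel_spawn does not exist; its name and signature are assumptions, loosely modeled on MPI_Comm_spawn, used only to illustrate the proposal.

    /* Hypothetical kernel-spawning call; name and signature are
     * illustrative, not part of any MPI implementation. */
    #include <mpi.h>
    #include <stddef.h>

    /* Launch `grid` blocks of `block` threads. Each block (or thread,
     * depending on the chosen mapping) gets a fresh rank in a new
     * communicator that is cleaned up once all spawned kernels finish. */
    int MPIX_Kernel_spawn(const void *kernel, int grid, int block, void **args,
                          MPI_Comm parent, MPI_Comm *intercomm);

    void example(void)
    {
        extern const void *my_kernel;  /* device entry point, illustrative */
        MPI_Comm gpu_comm;             /* intercommunicator, returned on request */

        /* While the kernels run, host ranks can use gpu_comm to
         * communicate with the GPU-side ranks. */
        MPIX_Kernel_spawn(my_kernel, 128, 256, NULL, MPI_COMM_WORLD, &gpu_comm);
    }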
Challenges ● Threads vs Processes ● Extra Communicators? ● Collectives ● Source/Sink Communication
Looking Forward
● GPU-Direct is good
● GPU-Direct 2 is great
● We want GPU-Direct 3 to
  ● Let the GPU source and sink communication
  ● Use GPU-Direct 2 to interface with the NIC
  ● Administer MPI ranks without CPU interference
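For comparison, a sketch of what GPU-Direct-style integration already allows, assuming a CUDA-aware MPI build that accepts device pointers (an implementation feature, not part of the MPI standard):

    /* Assumes a CUDA-aware MPI implementation: device pointers can be
     * passed straight to MPI, with no explicit staging through host
     * memory. The CPU still has to make the call, which is exactly the
     * limitation the GPU-Direct 3 wish list above would remove. */
    #include <mpi.h>
    #include <cuda_runtime.h>

    void exchange_device_buffers(int rank, int n)
    {
        float *d_buf;
        cudaMalloc((void **)&d_buf, n * sizeof(float));

        if (rank == 0)
            MPI_Send(d_buf, n, MPI_FLOAT, 1, 0, MPI_COMM_WORLD);
        else if (rank == 1)
            MPI_Recv(d_buf, n, MPI_FLOAT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);

        cudaFree(d_buf);
    }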
One Last Note ● Graduating with a Ph.D. in June 2012 ● Resume at http://jeff.bleugris.com/resume.pdf