Introduction to Parallel Computing (CMSC498X / CMSC818X)
Lecture 2: Terminology and Definitions
Abhinav Bhatele, Department of Computer Science
Announcements
• Piazza space for the course is live. Sign-up link: https://piazza.com/umd/fall2020/cmsc498xcmsc818x
• Slides from the previous class are posted on the course website
• Recorded video is available via Panopto or ELMS
Summary of last lecture
• Need for parallel and high performance computing
• Parallel architecture: nodes, memory, network, storage
Cores, sockets, nodes
• CPU: processor; may be single-core or multi-core
• Core: a processing unit; multiple such units on a single chip make it a multi-core processor
• Socket: same as chip or processor
• Node: the packaging of one or more sockets
Reference: https://www.glennklockwood.com/hpc-howtos/process-affinity.html
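As a quick illustration (not from the lecture), the sketch below uses the Linux-specific sched_getcpu() call to report which core a process is currently running on; it assumes a Linux node, and the program itself is hypothetical.

    /* Minimal sketch, assuming a Linux node: report which core the calling
     * process is currently running on. */
    #define _GNU_SOURCE
    #include <sched.h>
    #include <stdio.h>
    #include <unistd.h>

    int main(void) {
        int core = sched_getcpu();                     /* core executing this process */
        long ncores = sysconf(_SC_NPROCESSORS_ONLN);   /* cores visible to the OS */
        printf("PID %d is running on core %d of %ld online cores\n",
               (int) getpid(), core, ncores);
        return 0;
    }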
Job scheduling
• HPC systems use job or batch scheduling
• Each user submits their parallel programs for execution to a “job” scheduler
• The scheduler decides:
  • what job to schedule next (based on an algorithm: FCFS, priority-based, …; see the sketch below)
  • what resources (compute nodes) to allocate to the ready job
• Compute nodes: dedicated to each job
• Network, filesystem: shared by all jobs

Job Queue
  #   Nodes Requested   Time Requested
  1   128               30 mins
  2   64                24 hours
  3   56                6 hours
  4   192               12 hours
  5   …                 …
  6   …                 …
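The slides do not include scheduler code; the sketch below is a hypothetical illustration of a strict first-come-first-served (FCFS) decision, with made-up names (Job, fcfs_next, free_nodes), just to show the kind of choice a scheduler makes.

    /* Hypothetical FCFS sketch: pick the next job from the queue only if the
     * job at the head fits in the currently free compute nodes. */
    #include <stdio.h>

    typedef struct {
        int id;
        int nodes_requested;
        int minutes_requested;
    } Job;

    /* Return the index of the next job to run under strict FCFS, or -1 to wait. */
    int fcfs_next(const Job *queue, int njobs, int free_nodes) {
        if (njobs > 0 && queue[0].nodes_requested <= free_nodes)
            return 0;        /* head of the queue fits: schedule it */
        return -1;           /* otherwise wait (no backfilling in this sketch) */
    }

    int main(void) {
        Job queue[] = { {1, 128, 30}, {2, 64, 24 * 60}, {3, 56, 6 * 60} };
        int next = fcfs_next(queue, 3, 100);   /* only 100 nodes currently free */
        if (next < 0)
            printf("Head job needs more nodes than are free; waiting\n");
        else
            printf("Scheduling job %d\n", queue[next].id);
        return 0;
    }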
Compute nodes vs. login nodes
• Compute nodes: dedicated nodes for running jobs
  • Can only be accessed when they have been allocated to a user by the job scheduler
• Login nodes: nodes shared by all users to compile their programs, submit jobs, etc.
Supercomputers vs. commodity clusters
• Supercomputer: a large, expensive installation, typically built with custom hardware
  • High-speed interconnect
  • Examples: IBM Blue Gene, Cray XT, Cray XC
• Cluster: a collection of nodes, typically put together using commodity (off-the-shelf) hardware
Serial vs. parallel code
• Thread: a light-weight path of execution managed by the OS
  • Threads of the same process share memory
• Process: heavy-weight; processes do not share resources such as memory, file descriptors, etc.
• Serial or sequential code: can only run on a single thread or process
• Parallel code: can be run on one or more threads or processes
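As a minimal sketch (not from the slides), the program below creates two threads within one process; both update the same global counter, illustrating that threads share memory.

    /* Two threads of one process share the same global counter,
     * so both increments are visible in main.  Compile with -pthread. */
    #include <pthread.h>
    #include <stdio.h>

    static int shared_counter = 0;                 /* visible to every thread */
    static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

    static void *worker(void *arg) {
        (void) arg;
        pthread_mutex_lock(&lock);                 /* serialize the update */
        shared_counter++;
        pthread_mutex_unlock(&lock);
        return NULL;
    }

    int main(void) {
        pthread_t t1, t2;
        pthread_create(&t1, NULL, worker, NULL);
        pthread_create(&t2, NULL, worker, NULL);
        pthread_join(t1, NULL);
        pthread_join(t2, NULL);
        printf("shared_counter = %d\n", shared_counter);   /* prints 2 */
        return 0;
    }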
Scaling and scalable
• Scaling: running a parallel program on 1 to n processes
  • 1, 2, 3, …, n
  • 1, 2, 4, 8, …, n
• Scalable: a program is scalable if its performance improves when using more resources

[Figure: execution time (minutes, log scale) vs. number of cores (1 to 16K), showing actual measurements and an extrapolation]
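A hedged sketch of how one might produce such a scaling curve: run the same MPI program with different process counts and record the elapsed time; do_work() is a made-up placeholder for the real computation.

    /* Time one compute step; run with 1, 2, 4, 8, ... processes and compare. */
    #include <mpi.h>
    #include <stdio.h>
    #include <unistd.h>

    static void do_work(void) {
        sleep(1);   /* stand-in for the real computation */
    }

    int main(int argc, char **argv) {
        int rank, nprocs;
        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);
        MPI_Comm_size(MPI_COMM_WORLD, &nprocs);

        MPI_Barrier(MPI_COMM_WORLD);        /* start everyone together */
        double start = MPI_Wtime();
        do_work();
        MPI_Barrier(MPI_COMM_WORLD);        /* wait for the slowest process */
        double elapsed = MPI_Wtime() - start;

        if (rank == 0)
            printf("%d processes: %.3f seconds\n", nprocs, elapsed);

        MPI_Finalize();
        return 0;
    }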
Weak versus strong scaling
• Strong scaling: fixed total problem size as we run on more processes
  • e.g., sorting n numbers on 1 process, 2 processes, 4 processes, …
• Weak scaling: fixed problem size per process, but the total problem size increases as we run on more processes
  • e.g., sorting n numbers on 1 process, 2n numbers on 2 processes, 4n numbers on 4 processes
(A sketch contrasting the two modes follows below.)
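A minimal sketch (my own illustration, not the lecture's code): under strong scaling each process gets a shrinking share of a fixed total, while under weak scaling each process keeps a fixed share and the total grows; N_TOTAL and strong_scaling are made-up names.

    #include <mpi.h>
    #include <stdio.h>

    #define N_TOTAL 1000000   /* base problem size (n numbers to sort, say) */

    int main(int argc, char **argv) {
        int rank, nprocs;
        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);
        MPI_Comm_size(MPI_COMM_WORLD, &nprocs);

        int strong_scaling = 1;   /* flip to 0 for a weak-scaling run */
        long my_work;
        if (strong_scaling)
            my_work = N_TOTAL / nprocs;   /* total fixed, per-process share shrinks */
        else
            my_work = N_TOTAL;            /* per-process fixed, total grows to n*p */

        if (rank == 0)
            printf("%s scaling on %d processes: %ld elements per process\n",
                   strong_scaling ? "Strong" : "Weak", nprocs, my_work);

        MPI_Finalize();
        return 0;
    }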
Speedup and efficiency
• Speedup: ratio of the execution time on one process to that on p processes

    Speedup = t_1 / t_p

• Efficiency: speedup per process

    Efficiency = t_1 / (t_p × p)
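A small worked example of the two formulas, using made-up timings:

    #include <stdio.h>

    int main(void) {
        double t1 = 100.0;   /* seconds on 1 process (made-up numbers) */
        double tp = 30.0;    /* seconds on p processes */
        int p = 4;

        double speedup = t1 / tp;            /* Speedup = t_1 / t_p */
        double efficiency = t1 / (tp * p);   /* Efficiency = t_1 / (t_p * p) */

        /* Prints: Speedup = 3.33x, Efficiency = 0.83 */
        printf("Speedup = %.2fx, Efficiency = %.2f\n", speedup, efficiency);
        return 0;
    }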
Amdahl’s law
• Speedup is limited by the serial portion of the code
  • Often referred to as the serial “bottleneck”
• Let’s say only a fraction f of the code can be parallelized on p processes:

    Speedup = 1 / ((1 − f) + f / p)
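A small sketch (my own example) evaluating the formula: with a parallel fraction of f = 0.6, the speedup saturates at 1 / (1 − f) = 2.5 no matter how many processes are used.

    #include <stdio.h>

    /* Speedup = 1 / ((1 - f) + f / p) */
    static double amdahl_speedup(double f, int p) {
        return 1.0 / ((1.0 - f) + f / p);
    }

    int main(void) {
        double f = 0.6;   /* 60% of the run time is parallelizable */
        for (int p = 1; p <= 1024; p *= 4)
            printf("p = %4d  speedup = %.3f\n", p, amdahl_speedup(f, p));
        /* As p grows the speedup approaches 1 / (1 - f) = 2.5: the serial
         * 40% limits the speedup no matter how many processes we add. */
        return 0;
    }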
Amdahl’s law: example
• Suppose a program runs in 100 s on 1 process, of which p = 60 s can be parallelized and 100 − p = 40 s is serial
• The parallel fraction is then 0.6, so on n processes:

    Speedup = 1 / ((1 − 0.6) + 0.6 / n)

• Example MPI program computing pi by numerical integration (each process handles every numprocs-th rectangle, and the partial sums are combined with a reduction):

    fprintf(stdout, "Process %d of %d is on %s\n",
            myid, numprocs, processor_name);
    fflush(stdout);

    n = 10000;                     /* default # of rectangles */
    if (myid == 0)
        startwtime = MPI_Wtime();

    MPI_Bcast(&n, 1, MPI_INT, 0, MPI_COMM_WORLD);

    h   = 1.0 / (double) n;
    sum = 0.0;
    /* A slightly better approach starts from large i and works back */
    for (i = myid + 1; i <= n; i += numprocs) {
        x = h * ((double) i - 0.5);
        sum += f(x);               /* f(x): the integrand, defined elsewhere */
    }
    mypi = h * sum;

    MPI_Reduce(&mypi, &pi, 1, MPI_DOUBLE, MPI_SUM, 0, MPI_COMM_WORLD);
Communication and synchronization
• Each process may execute serial code independently for a while
• When data is needed from other (remote) processes, messaging occurs
  • Referred to as communication or synchronization (e.g., MPI messages)
• Intra-node vs. inter-node communication
• Bulk synchronous programs: all processes compute simultaneously, then synchronize together
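A minimal bulk synchronous sketch (my own illustration): each process computes on its local data, then all processes communicate and synchronize together through a collective.

    #include <mpi.h>
    #include <stdio.h>

    int main(int argc, char **argv) {
        int rank, nprocs;
        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);
        MPI_Comm_size(MPI_COMM_WORLD, &nprocs);

        /* Compute phase: purely local, no communication. */
        double local = (double)(rank + 1);

        /* Communication/synchronization phase: everyone participates. */
        double global = 0.0;
        MPI_Allreduce(&local, &global, 1, MPI_DOUBLE, MPI_SUM, MPI_COMM_WORLD);

        if (rank == 0)
            printf("Sum over %d processes = %.1f\n", nprocs, global);

        MPI_Finalize();
        return 0;
    }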
Different models of parallel computation
• SIMD: Single Instruction Multiple Data
• MIMD: Multiple Instruction Multiple Data
• SPMD: Single Program Multiple Data
  • Typical in HPC
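A minimal SPMD sketch (my own illustration): every process runs the same program and branches on its MPI rank to act on its own data.

    #include <mpi.h>
    #include <stdio.h>

    int main(int argc, char **argv) {
        int rank;
        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);

        if (rank == 0)
            printf("Rank 0: coordinating (e.g., reading input)\n");
        else
            printf("Rank %d: working on its own piece of the data\n", rank);

        MPI_Finalize();
        return 0;
    }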
Abhinav Bhatele
5218 Brendan Iribe Center (IRB) / College Park, MD 20742
phone: 301.405.4507 / e-mail: bhatele@cs.umd.edu