Cetus Compiler Status Today
 – Written in Java
 – Parses C
 – Announced new release late last night

Synchronization-Free Parallelism
 – SPMD and OpenMP programming models
 – Synchronization-free affine partitioning algorithm
 – Deriving primitive affine transformations

CS553 Lecture Synchronization-Free Parallelism

Two Parallel Programming Models
SPMD – single program multiple data
 – program should check which processor it is running on and execute some subset of the iterations based on that

    MPI_Init(&argc, &argv);
    // p is the processor id
    MPI_Comm_rank(MPI_COMM_WORLD, &p);

OpenMP
 – shared memory, thread-based parallelism
 – pragmas indicate that a loop is fully parallel

    #pragma omp parallel for
    for (i = 0; i < N; i++) { }

Diagonal Partitioning Example

    do i = 1, 6
      do j = 1, 5
        A(i,j) = A(i-1,j-1)+1
      enddo
    enddo

(Figure: the i-j iteration space.)

Goal – Determine an affine space partitioning that results in no synchronization needed between processors.
Space-Partition Constraints
Accesses sharing a dependence should be mapped to the same processor
 – loop bounds
 – equality constraints on dependence
 – equality constraints on space partition

Solving the Space-Partition Constraints
Ad-hoc approach
 – Reduce the number of unknowns
 – Simplify
 – Determine independent solutions for space partition matrix
 – Find constant terms (would like min of mapping to be non-negative)

Generate Simple Code
Algorithm 11.45: Generate code that executes partitions of a program sequentially
 – for each statement
 – use unioned p bounds, the original loop bounds, and space partition constraints
 – use FM-based code gen algorithm to determine bounds over partitions
 – For example, did this when determining bounds on p
 – insert space partition predicate before each statement

    do p = 0, 9
      do i = 1, 6
        do j = 1, 5
          if (i-j+4 == p) A(i,j) = A(i-1,j-1)+1

Eliminate Empty Iterations
Apply FM-based code generation algorithm to resulting iteration space
 – for each statement, project out all loop index variables from the system with the statement iteration space and the space partition constraints
 – determine new bounds for the statement iteration space
 – union the iteration space for all statements in the same loop – not needed for example
 – union the partition bounds over all statements – not needed for example

    do p = 0, 9
      do i = max(1,-3+p), min(6,p+1)
        do j = max(1,i+4-p), min(5,i+4-p)
          if (i-j+4 == p) A(i,j) = A(i-1,j-1)+1
Eliminate Tests from Innermost Loops
General approach: apply the following repeatedly
 – select an inner loop with statements with different bounds
 – split the loop using a condition that causes a statement to be in only one of the splits
 – generate code for the split iteration spaces

    do p = 0, 9
      do i = max(1,-3+p), min(6,p+1)
        do j = max(1,i+4-p), min(5,i+4-p)
          A(i,j) = A(i-1,j-1)+1

Using the Two Programming Models
SPMD and MPI

    MPI_Comm_rank(MPI_COMM_WORLD, &p);
    for (i = max(1,-3+p); i <= min(6,p+1); i++)
      for (j = max(1,i+4-p); j <= min(5,i+4-p); j++)
        A[i][j] = A[i-1][j-1]+1;

OpenMP

    #pragma omp parallel for
    for (p = 0; p <= 9; p++)
      for (i = max(1,-3+p); i <= min(6,p+1); i++)
        for (j = max(1,i+4-p); j <= min(5,i+4-p); j++)
          A[i][j] = A[i-1][j-1]+1;

Derive Re-indexing by using Space Partition Constraints
Source Code

    for (i = 1; i <= N; i++) {
      Y[i] = Z[i];     /* s1 */
      X[i] = Y[i-1];   /* s2 */
    }

Transformed Code

    if (N >= 1) X[1] = Y[0];
    for (p = 1; p <= N-1; p++) {
      Y[p] = Z[p];
      X[p+1] = Y[p];
    }
    if (N >= 1) Y[N] = Z[N];

Concepts
Two Parallel Programming Models
 – SPMD
 – OpenMP
Deriving a Synchronization-Free Affine Partitioning
 – setting up the space partition constraints (keep iterations involved in a dependence on the same processor)
 – solving the space partition constraints (linear algebra)
 – eliminating empty iterations (Fourier-Motzkin)
 – eliminating tests from inner loops (more Fourier-Motzkin)
 – using the above to derive primitive affine transformations
Next Time
Lecture
 – Tiling!
Suggested Exercises
 – Be able to derive the synchronization-free affine partitioning for Example 11.41 in the book.
 – Show how the other primitive affine transformations are derived.