
Parallelization of Jacobi Iteration: Solving the 2-D Laplace Equation



  1. Parallelization of Jacobi Iteration: Solving the 2-D Laplace Equation. Abolfazl Ziaeemehr, Department of Physics, Institute for Advanced Studies in Basic Sciences (IASBS). Introductory School on Parallel Programming and Parallel Architecture for High-Performance Computing (Oct. 2016).

  2. Outline
     1. Background: Laplace equation
     2. Exercise 1: Starting Out - serial version
     3. Exercise 2: Getting Your Feet a Little Wet - OpenMP
     4. Exercise 3: MPI - 1D decomposition


  4. Laplace equation
     ∂²φ/∂x² + ∂²φ/∂y² = 0
     1. Initialise φ to some initial guess.
     2. Apply the boundary conditions.
     3. For each internal mesh point set φ(i,j) = 0.25 [F(i-1,j) + F(i+1,j) + F(i,j-1) + F(i,j+1)].
     4. Replace the old solution F with the new estimate φ.
     5. If the solution does not satisfy the tolerance, repeat from step 2.
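As a concrete illustration of step 3, here is a minimal serial sketch of one Jacobi sweep; the names phi_old/phi_new and the use of std::vector are assumptions for illustration, not the workshop code:

#include <algorithm>
#include <cmath>
#include <vector>

// One Jacobi sweep over the interior of an (n+2) x (n+2) grid.
// phi_old holds the current estimate F (boundary values already applied);
// phi_new receives the updated estimate. The return value is the largest
// pointwise change, which can be tested against the tolerance in step 5.
double jacobi_sweep(const std::vector<std::vector<double>>& phi_old,
                    std::vector<std::vector<double>>& phi_new, int n)
{
    double diff = 0.0;
    for (int i = 1; i <= n; ++i)
        for (int j = 1; j <= n; ++j) {
            phi_new[i][j] = 0.25 * (phi_old[i-1][j] + phi_old[i+1][j]
                                  + phi_old[i][j-1] + phi_old[i][j+1]);
            diff = std::max(diff, std::fabs(phi_new[i][j] - phi_old[i][j]));
        }
    return diff;
}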


  6. Serial
     1. Download the serial version of the code in your language of choice.
     2. Compile the code with optimization level -O3.
     3. Test the code on a very small matrix.
     4. Make a plot of matrix dimension vs. the reported time to determine the scaling of the algorithm.
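The codes on the later slides print an elapsed time from a seconds() helper. A minimal sketch of such a helper using std::chrono is shown below; the workshop code may use gettimeofday or omp_get_wtime instead, so treat this as an assumption:

#include <chrono>

// Wall-clock time in seconds since the first call; the difference of two
// calls gives the elapsed time for the dimension-vs-time scaling plot.
double seconds()
{
    using clock = std::chrono::steady_clock;
    static const clock::time_point t0 = clock::now();
    return std::chrono::duration<double>(clock::now() - t0).count();
}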

  7. Serial
     [Figure: serial run time (s) vs. grid size, log-log scale, grid sizes 10 to 10000.]

  8. OpenMP
     1. Insert an OpenMP pragma at the appropriate spot to parallelize the loop.
     2. Test and plot the performance of the code over 1, 2, 4, 8 and 16 threads, with matrix sizes of 128, 256, 512, 1024, 2048 and 4096.

  9. OpenMP
     [Figure: OpenMP run time (s) vs. grid size N for 1, 2, 4, 8 and 16 threads (omp1.txt ... omp16.txt), log-log scale.]

  10. OpenMP

#pragma omp parallel
{
    // Iterate
    double TimeStart = seconds();
    for (int iCount = 1; iCount <= Iterations; iCount++) {
        #pragma omp for private(i, j)
        for (i = 1; i <= Dimension; i++)
            for (j = 1; j <= Dimension; j++)
                SurfaceMatrix_t[i][j] = 0.25 * (SurfaceMatrix[i-1][j] + SurfaceMatrix[i][j+1]
                                              + SurfaceMatrix[i+1][j] + SurfaceMatrix[i][j-1]);
        // PrintSurfaceMatrix(SurfaceMatrix_t, Dimension);

        // Swap the old and new grids; done by one thread only to avoid a
        // race on the shared pointers (the preceding omp for already ends
        // with an implicit barrier).
        #pragma omp single
        {
            double **tmp = SurfaceMatrix;
            SurfaceMatrix = SurfaceMatrix_t;
            SurfaceMatrix_t = tmp;
        }
    }
    double TimeEnd = seconds();
    #pragma omp master
    {
        cout << Dimension << "\t" << TimeEnd - TimeStart;
        cout << "\n";
    }
}
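Two design points in this kernel are worth noting: the old and new grids are exchanged by swapping pointers rather than copying the matrix, and the time-stepping loop sits inside a single omp parallel region so the thread team is created once rather than once per iteration; only the worksharing omp for, with its implicit barrier, separates successive sweeps.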

  11. OpenMP

rm -f omp*.txt
g++ -o jacobi_omp jacobi_omp.cpp -fopenmp
for i in 1 2 4 8 16
do
    export OMP_NUM_THREADS=${i}
    for j in 128 256 512 1024 2048 4096
    do
        ./jacobi_omp ${j} 100 5 5 >> omp${i}.txt
    done
done
gnuplot plot.gp
display scaling.png
rm -f *.txt

  12. MPI - 1D Decomposition
     1. The grid matrix must be completely distributed.
     2. The whole process must be parallel.
     3. Only the asynchronous MPI_Isend and MPI_Irecv may be used for communication between processors.
     4. Only use a 1-dimensional decomposition.

  13. MPI - 1D Decomposition
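The code on the next slide loops over rows i_first to i_last of a local slab xlocal without showing how these are set up. Below is a minimal sketch of that setup for a 1-D row decomposition, following the variable names on the slide; the fixed grid size and the handling of the first and last rank are assumptions:

#include <mpi.h>

#define maxn 12   /* global grid size; assumed divisible by the number of ranks */

int main(int argc, char *argv[])
{
    int rank, size;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    /* Local slab: maxn/size owned rows plus one ghost row below and one above
       (sized here for size = 4, purely to keep the array static). */
    double xlocal[(maxn/4) + 2][maxn];

    /* Rows this rank actually updates: skip the ghost rows, and on the first
       and last rank also skip the fixed physical boundary row. */
    int i_first = 1;
    int i_last  = maxn / size;
    if (rank == 0)        i_first++;
    if (rank == size - 1) i_last--;

    /* ... fill xlocal, exchange ghost rows, and iterate as on the next slide ... */
    (void)xlocal; (void)i_first; (void)i_last;

    MPI_Finalize();
    return 0;
}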

  14. MPI - 1D Decomposition

/* Send up unless I'm at the top, then receive from below. */
/* Note the use of xlocal[i] for &xlocal[i][0]. */
for (int itcnt = 0; itcnt < 100; itcnt++) {   /* time loop, 100 cycles */
    if (rank < size - 1)
        MPI_Send(xlocal[maxn/size], maxn, MPI_DOUBLE, rank + 1, 0,
                 MPI_COMM_WORLD);
    if (rank > 0)
        MPI_Recv(xlocal[0], maxn, MPI_DOUBLE, rank - 1, 0,
                 MPI_COMM_WORLD, &status);

    /* Send down unless I'm at the bottom. */
    if (rank > 0)
        MPI_Send(xlocal[1], maxn, MPI_DOUBLE, rank - 1, 1, MPI_COMM_WORLD);
    if (rank < size - 1)
        MPI_Recv(xlocal[maxn/size + 1], maxn, MPI_DOUBLE, rank + 1, 1,
                 MPI_COMM_WORLD, &status);

    /* Compute new values (but not on the boundary). */
    for (i = i_first; i <= i_last; i++)
        for (j = 1; j < maxn - 1; j++)
            xnew[i][j] = (xlocal[i][j+1] + xlocal[i][j-1]
                        + xlocal[i+1][j] + xlocal[i-1][j]) / 4.0;

    /* Only transfer the interior points. */
    for (i = i_first; i <= i_last; i++)
        for (j = 1; j < maxn - 1; j++)
            xlocal[i][j] = xnew[i][j];
}
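Slide 12 asks for the asynchronous MPI_Isend/MPI_Irecv, whereas the exchange above uses the blocking MPI_Send/MPI_Recv. A minimal sketch of the same ghost-row exchange with nonblocking calls is given below, reusing the slide's variable names; maxn is assumed to be a compile-time constant, and the placement of the wait is an assumption:

#include <mpi.h>

/* Nonblocking ghost-row exchange; a drop-in replacement for the four blocking
   calls inside the time loop above. The message tags match the blocking version. */
void exchange_ghost_rows(double xlocal[][maxn], int rank, int size)
{
    MPI_Request reqs[4];
    int nreq = 0;

    /* Post the receives for the two ghost rows first. */
    if (rank > 0)
        MPI_Irecv(xlocal[0], maxn, MPI_DOUBLE, rank - 1, 0,
                  MPI_COMM_WORLD, &reqs[nreq++]);
    if (rank < size - 1)
        MPI_Irecv(xlocal[maxn/size + 1], maxn, MPI_DOUBLE, rank + 1, 1,
                  MPI_COMM_WORLD, &reqs[nreq++]);

    /* Send this rank's top and bottom owned rows to the neighbours above and below. */
    if (rank < size - 1)
        MPI_Isend(xlocal[maxn/size], maxn, MPI_DOUBLE, rank + 1, 0,
                  MPI_COMM_WORLD, &reqs[nreq++]);
    if (rank > 0)
        MPI_Isend(xlocal[1], maxn, MPI_DOUBLE, rank - 1, 1,
                  MPI_COMM_WORLD, &reqs[nreq++]);

    /* The Jacobi sweep reads the ghost rows, so wait for all transfers here. */
    MPI_Waitall(nreq, reqs, MPI_STATUSES_IGNORE);
}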

  15. MPI - 1D Decomposition
     [Figure: MPI run time (s) vs. grid size N for 2 and 4 cores, log-log scale.]
