On the Variety of Static Control Parts in Real-World Applications: - PowerPoint PPT Presentation

On the Variety of Static Control Parts in Real-World Applications: from Affine via Multi-dimensional to Polynomial and Just-in-Time Andreas Simbürger Armin Größlinger 4th International Workshop on Polyhedral Compilation Techniques 1 / 25

Defining the Real World 2 / 25

Defining the Real World ◮ LLVM ( llvm.org ) 2 / 25

Defining the Real World ◮ LLVM ( llvm.org ) ◮ Polly ( polly.llvm.org ) 2 / 25

Defining the Real World ◮ LLVM ( llvm.org ) ◮ Polly ( polly.llvm.org ) ◮ PolyJIT ( www.infosun.fim.uni-passau.de/cl/PolyJIT ) 2 / 25

Automatic Detection of SCoPs in LLVM Loop Scop Polly IR LLVM IR detection detection Loop Scalar normalization evolution 3 / 25

Effectiveness of Automatic Polyhedral Optimization Polly SCoP optimized LLVM IR optimizer detection LLVM IR 4 / 25

Effectiveness of Automatic Polyhedral Optimization Polly SCoP optimized LLVM IR optimizer detection LLVM IR Exploitation of parallelism (transformations) 4 / 25

Effectiveness of Automatic Polyhedral Optimization Polly SCoP optimized LLVM IR optimizer detection LLVM IR Detection: Applicability and Exploitation of parallelism potential of valid loops (transformations) 4 / 25

Effectiveness of Automatic Polyhedral Optimization Polly SCoP optimized LLVM IR optimizer detection LLVM IR Detection: Applicability and Exploitation of parallelism potential of valid loops (transformations) The detection process lacks thorough empirical evaluation! 4 / 25

PolyJIT: pprof ◮ Set of 50 programs commonly used in various domains. ◮ 8 domains (Multimedia, Scientific, Simulation, Encryption, Compilation, Compression, Databases, Verification). ◮ Extract run time and compile time statistics. 5 / 25

Measuring a SCoP’s fraction of the total run time What fraction of a program’s total run time is spent inside SCoPs? for ( int i=0; i<=n; ++i) for ( int j=i; j<=n; ++j) if (i >= n-j) { S: A[i+n][j+i] = B[n+2*i-1][j]; T: B[i+n][j-i] = A[n-2*i+1][j]; } Definition (Execution SCoP coverage) ExecCov = Time spent inside SCoPs Total program run time 6 / 25

Measuring a SCoP’s fraction of the total run time What fraction of a program’s total run time is spent inside SCoPs? Definition (Execution SCoP coverage) ExecCov = Time spent inside SCoPs Total program run time 6 / 25

Static Control Parts: Class Static Detection at compile time for ( int i=0; i<=n; ++i) for ( int j=i; j<=n; ++j) if (i >= n-j) { S: A[i+n][j+i] = B[n+2*i-1][j]; T: B[i+n][j-i] = A[n-2*i+1][j]; } 7 / 25

Static Control Parts: Class Static Detection at compile time for ( int i=0; i<=n; ++i) for ( int j=i; j<=n; ++j) if (i >= n-j) { S: A[i+n][j+i] = B[n+2*i-1][j]; T: B[i+n][j-i] = A[n-2*i+1][j]; } 1. Affine expressions in ◮ Loop bounds ◮ Conditions ◮ Memory accesses 7 / 25

Static Control Parts: Class Static Detection at compile time for ( int i=0; i<=n; ++i) for ( int j=i; j<=n; ++j) if (i >= n-j) { S: A[i+n][j+i] = B[n+2*i-1][j]; T: B[i+n][j-i] = A[n-2*i+1][j]; } 1. Affine expressions in ◮ Loop bounds ◮ Conditions ◮ Memory accesses 2. Static control flow 7 / 25

Static Control Parts: Class Static Detection at compile time for ( int i=0; i<=n; ++i) for ( int j=i; j<=n; ++j) if (i >= n-j) { S: A[i+n][j+i] = B[n+2*i-1][j]; T: B[i+n][j-i] = A[n-2*i+1][j]; } 1. Affine expressions in ◮ Loop bounds ◮ Conditions ◮ Memory accesses 2. Static control flow 3. Side-effect known function calls 7 / 25

Static Control Parts: Class Static Detection at compile time for ( int i=0; i<=n; ++i) for ( int j=i; j<=n; ++j) if (i >= n-j) { S: A[i+n][j+i] = B[n+2*i-1][j]; T: B[i+n][j-i] = A[n-2*i+1][j]; } 1. Affine expressions in ◮ Loop bounds ◮ Conditions ◮ Memory accesses 2. Static control flow 3. Side-effect known function calls What can we do, if it is not a static (affine) SCoP? 7 / 25

Problem 1: Multi-dimensional array accesses Contiguous A[i][j]; 8 / 25

Problem 1: Multi-dimensional array accesses Contiguous clang -O0 %0 = mul nsw i32 %i, %n %idx = getelementptr float * %A, i32 %0 %idx1 = getelementptr float * %idx, i32 %j A[i][j] 8 / 25

Problem 1: Multi-dimensional array accesses Contiguous clang -O1 %0 = mul nsw i32 %i, %n %idx.s = add i32 %0, %j %idx1 = getelementptr float * %A, i32 %idx.s A[n*i+j] 8 / 25

Delinearization of array accesses A[n*i+i+j] n ∗ i + i + j = ( n + 1 ) ∗ i + j 9 / 25

Delinearization of array accesses A[n*i+i+j] n ∗ i + i + j = ( n + 1 ) ∗ i + j A[i][i+j] i j 0 n 9 / 25

Delinearization of array accesses A[n*i+i+j] n ∗ i + i + j = ( n + 1 ) ∗ i + j A[i][i+j] A[i][j] i i j j 0 0 n+1 n 9 / 25

Static Control Parts: Class Multi Let’s allow delinearizeable accesses! A[(n+2+m)*i] A[i’+2*m+2*n] 10 / 25

Static Control Parts: Class Multi Let’s allow delinearizeable accesses! A[(n+2+m)*i] A[i’+2*m+2*n] ni + 2 i + mi = i ′ + 2 m + 2 n a = a ′ 10 / 25

Static Control Parts: Class Multi Let’s allow delinearizeable accesses! A[(n+2+m)*i] A[i’+2*m+2*n] ni + 2 i + mi = i ′ + 2 m + 2 n a = a ′ a − a ′ = 0 ni + 2 i + mi − i ′ − 2 m − 2 n = 0 10 / 25

Static Control Parts: Class Multi Let’s allow delinearizeable accesses! A[(n+2+m)*i] A[i’+2*m+2*n] ni + 2 i + mi = i ′ + 2 m + 2 n a = a ′ a − a ′ = 0 ni + 2 i + mi − i ′ − 2 m − 2 n = 0 Split into terms ni , 2 i , mi , − i ′ , − 2 m , − 2 n 10 / 25

Static Control Parts: Class Multi Let’s allow delinearizeable accesses! A[(n+2+m)*i] A[i’+2*m+2*n] ni + 2 i + mi = i ′ + 2 m + 2 n a = a ′ a − a ′ = 0 ni + 2 i + mi − i ′ − 2 m − 2 n = 0 Split into terms ni , 2 i , mi , − i ′ , − 2 m , − 2 n Group by parameters n ( i − 2 ) + m ( i − 2 ) + 1 ( 2 i − i ′ ) 10 / 25

Static Control Parts: Class Multi Let’s allow delinearizeable accesses! A[(n+2+m)*i] A[i’+2*m+2*n] ni + 2 i + mi = i ′ + 2 m + 2 n a = a ′ a − a ′ = 0 ni + 2 i + mi − i ′ − 2 m − 2 n = 0 Split into terms ni , 2 i , mi , − i ′ , − 2 m , − 2 n Group by parameters n ( i − 2 ) + m ( i − 2 ) + 1 ( 2 i − i ′ ) Factor out common expressions ( n + m )( i − 2 ) + ( 1 )( 2 i − i ′ ) 10 / 25

Static Control Parts: Class Multi Let’s allow delinearizeable accesses! A[(n+2+m)*i] A[i’+2*m+2*n] ni + 2 i + mi = i ′ + 2 m + 2 n a = a ′ a − a ′ = 0 ni + 2 i + mi − i ′ − 2 m − 2 n = 0 Split into terms ni , 2 i , mi , − i ′ , − 2 m , − 2 n Group by parameters n ( i − 2 ) + m ( i − 2 ) + 1 ( 2 i − i ′ ) Factor out common expressions ( n + m )( i − 2 ) + ( 1 )( 2 i − i ′ ) 0 i -n-m n+m | 1 ( 2 i − i ′ ) | ≤ | n + m | − 1 Bounds check 10 / 25

Static Control Parts: Class Multi Let’s allow delinearizeable accesses! A[(n+2+m)*i] A[i’+2*m+2*n] ni + 2 i + mi = i ′ + 2 m + 2 n a = a ′ a − a ′ = 0 ni + 2 i + mi − i ′ − 2 m − 2 n = 0 Split into terms ni , 2 i , mi , − i ′ , − 2 m , − 2 n Group by parameters n ( i − 2 ) + m ( i − 2 ) + 1 ( 2 i − i ′ ) Factor out common expressions ( n + m )( i − 2 ) + ( 1 )( 2 i − i ′ ) 0 i -n-m n+m | 1 ( 2 i − i ′ ) | ≤ | n + m | − 1 Bounds check a − a ′ = 0 ⇔ i − 2 = 0 ∧ 2 i − i ′ = 0 10 / 25

Static Control Parts: Class Multi Let’s allow delinearizeable accesses! A[(n+2+m)*i] A[i’+2*m+2*n] ni + 2 i + mi = i ′ + 2 m + 2 n a = a ′ a − a ′ = 0 ni + 2 i + mi − i ′ − 2 m − 2 n = 0 Split into terms ni , 2 i , mi , − i ′ , − 2 m , − 2 n Group by parameters n ( i − 2 ) + m ( i − 2 ) + 1 ( 2 i − i ′ ) Factor out common expressions ( n + m )( i − 2 ) + ( 1 )( 2 i − i ′ ) 0 i -n-m n+m | 1 ( 2 i − i ′ ) | ≤ | n + m | − 1 Bounds check a − a ′ = 0 ⇔ i − 2 = 0 ∧ 2 i − i ′ = 0 i = 2 and i ′ = 4 10 / 25

On the Variety of Static Control Parts in Real-World Applications: - PowerPoint PPT Presentation

On the Variety of Static Control Parts in Real-World Applications: from Affine via Multi-dimensional to Polynomial and Just-in-Time Andreas Simbrger Armin Grlinger 4th International Workshop on Polyhedral Compilation Techniques 1 / 25

Parts Tracking and Parts Room Management System Parts Tracking and Parts Room Management System

Static and Method Overloading static One per class, not per object static variables

Parts of a Plant Plants have different parts to them, just like you. We have different body parts

Real graduates, Real graduates, real transitions, real transitions, real stories: real

Static and dynamic verification Static and dynamic V&V Software inspections Concerned

P bli P bli Public Soybean Public Soybean S S b b Breeding Research in Breeding Research

1 Static Equilibrium From Static Eq. to Dynamic Eq. System of mass points Static

STARTS: STARTS: STARTS: STARTS: STAtic STAtic Regression Test Selection Regression Test

static vs automatic storage classes Three types of memory allocations static storage class

Wrap Up Static, Packages, Exceptions Static methods // Example: // Java's built in Math class

Static name control for FreshML Franc ois Pottier Franc ois Pottier Static name control

Variety in Content Presentation It is a good idea to have variety in lecture content presentation:

GEometryANdTracking Simulation toolkit in C++: Variety of geometries choose your own setup

Parts of the Eye and Eye Disorders Pretest By: Colby Tharp Parts of the Eye Parts of the Eye

Best Practices Workshop: Parts & Mesh-Based Operations Overview What are Parts and Mesh

EXPLORING NEW HORIZONS TREX.PARTS THE FORKLIFT PARTS EXPERTS www.trex.parts OUR IDEA A

The lattice of linear Malcev conditions Jakub Opr sal Charles University in Prague

Cohomology and support varieties: two points Stratification theorem Extensions of view Support

OPTIMIZING A PARAMETRIC LINEAR FUNCTION OVER A NON-COMPACT REAL VARIETY Feng Guo 1 Mohab Safey El

Bertini irreducibility theorems via statistics Bjorn Poonen (joint work with Kaloyan Slavov of

Stratifications of affine Deligne-Lusztig varieties Defjnition (DeligneLusztig variety) Fix

Maximal Fermat Varieties Iwan Duursma RICAM Workshop Algebraic curves over finite fields Linz,

Variety of semi-conformal vectors in a vertex operator algebra Zongzhu Lin Kansas State

Singularities in characteristic zero and singularities in characteristic p Karl Schwede 1 1

On the Variety of Static Control Parts in Real-World Applications: - PowerPoint PPT Presentation

On the Variety of Static Control Parts in Real-World Applications: from Affine via Multi-dimensional to Polynomial and Just-in-Time Andreas Simbrger Armin Grlinger 4th International Workshop on Polyhedral Compilation Techniques 1 / 25

Parts Tracking and Parts Room Management System Parts Tracking and Parts Room Management System

Static and Method Overloading static One per class, not per object static variables

Parts of a Plant Plants have different parts to them, just like you. We have different body parts

Real graduates, Real graduates, real transitions, real transitions, real stories: real

Static and dynamic verification Static and dynamic V&amp;V Software inspections Concerned

P bli P bli Public Soybean Public Soybean S S b b Breeding Research in Breeding Research

1 Static Equilibrium From Static Eq. to Dynamic Eq. System of mass points Static

STARTS: STARTS: STARTS: STARTS: STAtic STAtic Regression Test Selection Regression Test

static vs automatic storage classes Three types of memory allocations static storage class

Wrap Up Static, Packages, Exceptions Static methods // Example: // Java's built in Math class

Static name control for FreshML Franc ois Pottier Franc ois Pottier Static name control

Variety in Content Presentation It is a good idea to have variety in lecture content presentation:

GEometryANdTracking Simulation toolkit in C++: Variety of geometries choose your own setup

Parts of the Eye and Eye Disorders Pretest By: Colby Tharp Parts of the Eye Parts of the Eye

Best Practices Workshop: Parts &amp; Mesh-Based Operations Overview What are Parts and Mesh

EXPLORING NEW HORIZONS TREX.PARTS THE FORKLIFT PARTS EXPERTS www.trex.parts OUR IDEA A

The lattice of linear Malcev conditions Jakub Opr sal Charles University in Prague

Cohomology and support varieties: two points Stratification theorem Extensions of view Support

OPTIMIZING A PARAMETRIC LINEAR FUNCTION OVER A NON-COMPACT REAL VARIETY Feng Guo 1 Mohab Safey El

Bertini irreducibility theorems via statistics Bjorn Poonen (joint work with Kaloyan Slavov of

Stratifications of affine Deligne-Lusztig varieties Defjnition (DeligneLusztig variety) Fix

Maximal Fermat Varieties Iwan Duursma RICAM Workshop Algebraic curves over finite fields Linz,

Variety of semi-conformal vectors in a vertex operator algebra Zongzhu Lin Kansas State

Singularities in characteristic zero and singularities in characteristic p Karl Schwede 1 1

Static and dynamic verification Static and dynamic V&V Software inspections Concerned

Best Practices Workshop: Parts & Mesh-Based Operations Overview What are Parts and Mesh