Chapel With Polyhedral Transformation Using Autotuning
Tuowen Zhao and Mary Hall
The 3rd Annual Chapel Implementers and Users Workshop, 2016

Loop Transformation
• Manipulation of loop nest
  • Structure
  • Schedule
• Prior work: manually applied loop transformations in Chapel
  • I. J. Bertolacci et al. Parameterized diamond tiling for stencil computations with Chapel parallel iterators. ICS 2015
  • A. Sharma et al. Affine loop optimization based on modulo unrolling in Chapel. PGAS 2014
• This work: loop transformations are applied automatically, using recipes from a script, which enables integration with an autotuning framework

Contribution
• Uses C code to capture sequential computation
• Generates Chapel programs by composing polyhedral transformations on the sequential computation and mapping from iteration spaces to Chapel domains and iterators
• Demonstrates, with a simple example in Chapel, the benefits of applying such transformations in conjunction with autotuning

Chapel Language

    proc mm(A: [] real, B: [] real, an: int, ambn: int, bm: int) {
      const D = {0..an-1, 0..bm-1};      // Domain
      var C: [D] real;                   // Domain-mapped array
      forall (i, j) in D do {            // Iterator
        C[i, j] = 0;
        for k in {0..ambn-1} do
          C[i, j] += A[i, k] * B[k, j];
      }
      return C;
    }

Polyhedral Framework
• Iteration spaces
  • A set of iteration vectors represented as integer tuples
  • Direct mapping from Chapel domain
• Transformation done by linear mapping
• Affine loop bounds, conditional expressions, array subscripts
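To make the bullets above concrete, here is a minimal C sketch of one linear mapping on an iteration space. It assumes a two-deep rectangular nest and applies a loop skew, (i, j) -> (i, i + j); the function names are illustrative only and are not part of CHiLL or Chapel. The point is that the transformed bounds stay affine and the mapped nest still visits every integer tuple exactly once.

```c
#include <assert.h>
#include <stdlib.h>

/* Original nest: iteration space { (i, j) : 0 <= i < n, 0 <= j < m }. */
static void visit_original(int n, int m, int *count) {
    for (int i = 0; i < n; i++)
        for (int j = 0; j < m; j++)
            count[i * m + j]++;
}

/* Skewed nest: (i, j) -> (i, jj) with jj = i + j.
   The new inner bounds are still affine in i: i <= jj < i + m. */
static void visit_skewed(int n, int m, int *count) {
    for (int i = 0; i < n; i++)
        for (int jj = i; jj < i + m; jj++) {
            int j = jj - i;              /* invert the mapping to recover j */
            count[i * m + j]++;
        }
}

/* Returns 1 if both nests visit every point of the space exactly once. */
int skew_preserves_space(int n, int m) {
    assert(n > 0 && m > 0);
    int *a = calloc((size_t)n * (size_t)m, sizeof *a);
    int *b = calloc((size_t)n * (size_t)m, sizeof *b);
    int ok = (a != NULL && b != NULL);
    if (ok) {
        visit_original(n, m, a);
        visit_skewed(n, m, b);
        for (int p = 0; p < n * m; p++)
            if (a[p] != 1 || b[p] != 1)
                ok = 0;
    }
    free(a);
    free(b);
    return ok;
}
```

Visiting the same points is only half of the story: whether the new visiting order is valid depends on the dependences between iterations.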

Dependence analysis
• Ensures validity of the transformation and correctness of the program
• Has to know the order of references to each array element
• Cannot be applied to a Chapel iterator without programmer intervention or runtime information
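One common textbook way to state the validity condition is with dependence distance vectors: a linear transformation T is legal if every transformed distance vector T·d remains lexicographically positive, so that each value is still produced before it is consumed. The C sketch below checks this for 2-deep nests. It is an illustration under that assumption, not a description of how CHiLL implements its analysis, and all names in it are hypothetical.

```c
#include <assert.h>

/* 1 if v is lexicographically positive (its first nonzero entry is > 0). */
static int lex_positive(const int v[2]) {
    if (v[0] != 0)
        return v[0] > 0;
    return v[1] > 0;
}

/* Legality test for a 2x2 linear transformation T: every loop-carried
   distance vector must stay lexicographically positive after mapping. */
int transformation_is_legal(const int T[2][2], const int dist[][2], int ndeps) {
    assert(ndeps >= 0);
    for (int d = 0; d < ndeps; d++) {
        int t[2] = {
            T[0][0] * dist[d][0] + T[0][1] * dist[d][1],
            T[1][0] * dist[d][0] + T[1][1] * dist[d][1],
        };
        if (!lex_positive(t))
            return 0;
    }
    return 1;
}

/* Example: a[i][j] = a[i-1][j+1] + 1 reads at (i, j) the value written
   at (i-1, j+1), which gives the distance vector (1, -1). */
const int example_deps[1][2] = { {1, -1} };
const int interchange[2][2]  = { {0, 1}, {1, 0} };   /* (i, j) -> (j, i)     */
const int skew[2][2]         = { {1, 0}, {1, 1} };   /* (i, j) -> (i, i + j) */
```

For this example, interchange maps (1, -1) to (-1, 1) and is rejected, while the skew maps it to (1, 0) and is accepted.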

CHiLL
• Composable High-Level Loop transformation framework
• A polyhedral transformation and code generation framework
• Relies on autotuning to generate highly tuned implementations for a specific target architecture
• Uses a transformation recipe to express the optimization strategy (the recipe may be generated by a compiler)

Architecture Overview

Experiment – matrix multiply
• Input in C

    for (i = 0; i < an; i++)
      for (j = 0; j < bm; j++) {
        C[i][j] = 0.0f;
        for (n = 0; n < ambn; n++)
          C[i][j] += A[i][n] * B[n][j];
      }

• Tile sizes {8, 16, 32, 64, 128, 256}
• Distribution of the initialization code
• Tile sizes
  • Chapel's configuration variable
  • Literal constant
• Intel Haswell i7-4790K
• 16GB DDR3 RAM
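The experiment's two transformations, distributing the initialization out of the nest and tiling the loops, can be sketched by hand in C as follows. This is a hand-written illustration of what such a transformed nest might look like, not output from CHiLL; the flat row-major array layout, the single shared tile size, and the function names are assumptions made for the example.

```c
#include <assert.h>
#include <stdlib.h>

/* Reference kernel, as on the slide: initialization fused with the reduction. */
void mm_reference(const float *A, const float *B, float *C,
                  int an, int ambn, int bm) {
    for (int i = 0; i < an; i++)
        for (int j = 0; j < bm; j++) {
            C[i * bm + j] = 0.0f;
            for (int n = 0; n < ambn; n++)
                C[i * bm + j] += A[i * ambn + n] * B[n * bm + j];
        }
}

/* Initialization distributed into its own nest, then i, j and n tiled. */
void mm_tiled(const float *A, const float *B, float *C,
              int an, int ambn, int bm, int tile) {
    assert(tile > 0);
    for (int i = 0; i < an; i++)
        for (int j = 0; j < bm; j++)
            C[i * bm + j] = 0.0f;

    for (int ii = 0; ii < an; ii += tile)
        for (int jj = 0; jj < bm; jj += tile)
            for (int nn = 0; nn < ambn; nn += tile)
                for (int i = ii; i < an && i < ii + tile; i++)
                    for (int j = jj; j < bm && j < jj + tile; j++)
                        for (int n = nn; n < ambn && n < nn + tile; n++)
                            C[i * bm + j] += A[i * ambn + n] * B[n * bm + j];
}

/* Fills A and B with small integers (exact in float) and compares kernels. */
int mm_tiled_matches_reference(int an, int ambn, int bm, int tile) {
    size_t na = (size_t)an * ambn, nb = (size_t)ambn * bm, nc = (size_t)an * bm;
    float *A = malloc(na * sizeof *A), *B = malloc(nb * sizeof *B);
    float *C1 = malloc(nc * sizeof *C1), *C2 = malloc(nc * sizeof *C2);
    int ok = (A && B && C1 && C2);
    if (ok) {
        for (size_t p = 0; p < na; p++) A[p] = (float)(p % 5);
        for (size_t p = 0; p < nb; p++) B[p] = (float)(p % 3);
        mm_reference(A, B, C1, an, ambn, bm);
        mm_tiled(A, B, C2, an, ambn, bm, tile);
        for (size_t p = 0; p < nc; p++)
            if (C1[p] != C2[p])
                ok = 0;
    }
    free(A); free(B); free(C1); free(C2);
    return ok;
}
```

Here `tile` is a run-time parameter, loosely analogous to supplying the tile size through a Chapel configuration variable; replacing it with a compile-time constant corresponds to the literal-constant variant listed on the slide. An autotuner would time `mm_tiled` for each candidate tile size and keep the fastest.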

Result

Stencil Computations
• Operations on structured grids
• MiniGMG
  • Geometric multigrid benchmark
  • Uses stencil computations extensively, especially in smooth and residual operators
• CHiLL on MiniGMG
  • P. Basu (2015) Compiler Optimizations and Autotuning for Stencils and Geometric Multigrid. PhD thesis. University of Utah

Stencil Optimizations
• Communication-avoiding optimizations
  • Wavefront (loop fusion)
  • Deeper ghost zones with redundant computation
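The deeper-ghost-zone idea can be illustrated with a small, single-process C sketch. It assumes a 1D three-point stencil with fixed physical boundaries and simulates two "locales" as two local buffers; each buffer copies `steps` ghost cells from its neighbor once, then runs `steps` sweeps with no further exchange, recomputing some neighbor cells redundantly. The stencil, the two-way split, and all names are assumptions for illustration and are not taken from MiniGMG or CHiLL.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* One 3-point sweep: out[i] = u[i-1] + u[i] + u[i+1] for lo <= i < hi.
   Cells outside [lo, hi) are copied unchanged. Needs lo >= 1, hi <= n - 1. */
static void sweep(const long *u, long *out, int n, int lo, int hi) {
    memcpy(out, u, sizeof *u * (size_t)n);
    for (int i = lo; i < hi; i++)
        out[i] = u[i - 1] + u[i] + u[i + 1];
}

/* Runs `steps` sweeps in place. On a ghost side the updated range shrinks
   by one cell per sweep, because the outermost ghost cells go stale. */
static void run_sweeps(long *buf, int len, int steps,
                       int ghost_left, int ghost_right) {
    long *tmp = malloc(sizeof *tmp * (size_t)len);
    assert(tmp != NULL);
    for (int t = 1; t <= steps; t++) {
        int lo = ghost_left ? t : 1;
        int hi = ghost_right ? len - t : len - 1;
        sweep(buf, tmp, len, lo, hi);
        memcpy(buf, tmp, sizeof *buf * (size_t)len);
    }
    free(tmp);
}

/* Returns 1 if two halves with ghost depth `steps` reproduce the result
   of `steps` global sweeps on the cells each half owns. */
int ghost_zone_matches_global(int n, int steps) {
    int mid = n / 2;
    assert(steps >= 1 && mid - steps >= 0 && mid + steps <= n);
    int llen = mid + steps, rlen = n - mid + steps;
    long *global = malloc(sizeof *global * (size_t)n);
    long *left = malloc(sizeof *left * (size_t)llen);
    long *right = malloc(sizeof *right * (size_t)rlen);
    assert(global != NULL && left != NULL && right != NULL);

    for (int i = 0; i < n; i++)
        global[i] = (long)((i * 7 + 3) % 11);
    /* Single "communication" step: copy owned cells plus ghost cells. */
    memcpy(left, global, sizeof *left * (size_t)llen);
    memcpy(right, global + (mid - steps), sizeof *right * (size_t)rlen);

    run_sweeps(global, n, steps, 0, 0);
    run_sweeps(left, llen, steps, 0, 1);    /* ghost cells on the right */
    run_sweeps(right, rlen, steps, 1, 0);   /* ghost cells on the left  */

    int ok = 1;
    for (int i = 0; i < mid; i++)
        if (left[i] != global[i]) ok = 0;
    for (int i = mid; i < n; i++)
        if (right[i - mid + steps] != global[i]) ok = 0;
    free(global); free(left); free(right);
    return ok;
}
```

The trade-off is that each half does extra arithmetic on cells it does not own in exchange for exchanging data once per `steps` sweeps instead of once per sweep.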

User-defined library
• StencilDist library
• Problems
  • Can't guarantee correctness (dependence)
  • Hand-written optimized code
  • Generality concern

Multi-locale Stencil

Multi-locale Stencil
• Programmer writes simple serial code fragments
• Recipes provided by the programmer or generated by the autotuner
• Behind-the-scenes generation of distributed computation and distributed data
• Produces fine-tuned code without the programmer rewriting it

Conclusion
• Integrating Chapel with CHiLL
  • Instantly enables many different optimization techniques that can be composed in complex sequences
  • Autotuning can be used to find the best-performing combination of transformations for the target architecture

Future work
• Expanding the domain of autotuning by generating and tuning domain maps and iterators
• Relaxing the transformation requirements by generalizing to non-affine loop bounds and subscripts that employ indirection through an index array

Questions?
