The PyOP2 abstraction, its role in Firedrake, and some optimisations - PowerPoint PPT Presentation

The PyOP2 abstraction, its role in Firedrake, and some optimisations that it enables Paul H J Kelly Group Leader, Software Performance Optimisation Co-Director, Centre for Computational Methods in Science and Engineering Department of Computing, Imperial College London Joint work with : David Ham (Imperial Computing/Maths/Grantham Inst for Climate Change) Gerard Gorman, (Imperial Earth Science Engineering – Applied Modelling and Computation Group) Mike Giles, Gihan Mudalige, Istvan Reguly (Mathematical Inst, Oxford) Doru Bercea, Fabio Luporini, Graham Markall, Lawrence Mitchell, Florian Rathgeber, George Rokos (Software Perf Opt Group, Imperial Computing) Spencer Sherwin (Aeronautics, Imperial), Chris Cantwell (Cardio-mathematics group, Mathematics, Imperial) Michelle Mills Strout, Chris Krieger, Cathie Olschanowsky (Colorado State University) Carlo Bertolli (IBM Research) Ram Ramanujam (Louisiana State University) 1

What we Vectorisation, PyOP2/OP2 Aeroengine Finite- parametric turbo- Unstructured- are volume CFD polyhedral tiling machinery mesh stencils doing…. Tiling for Firedrake unstructured- Finite- Weather and mesh stencils Finite-element element climate assembly Lazy, data-driven compute- PAMELA Real-time 3D communicate scene Tidal turbines Dense SLAM understanding – 3D vision Runtime code Targetting generation PRAgMaTIc Domestic MPI, Adaptive- robotics, Dynamic OpenMP, mesh CFD augmented mesh Multicore graph OpenCL, reality adaptation worklists Dataflow/ Unsteady GiMMiK FPGA, from CFD - higher- Formula-1, Small-matrix Massive common UAVs order flux- supercomp multiplication sub-expressions reconstruction uters to Ab-initio mobile, TINTL Optimisation of computational Solar energy, embedded Fourier composite chemistry drug design interpolation transforms and (ONETEP) wearable Projects Contexts Technologies Applications

This talk OP2 and PyOP2: A stencil “DSL” for unstructured meshes An instance of a “decoupled access-execute” model Firedrake: a compiler for a higher-level DSL That uses PyOP2 as an intermediate representation (IR) COFFEE: a domain-specific compiler for a kernels This talk’s message: Optimise at the right level of abstraction Stencil ideas generalise The “DSL” can be an IR (and can look like a library) Runtime code generation can be incredibly powerful 5

From DSL to loop chains 4 Firedrake provides a DSL for finite element methods phi, p = Function ( mesh , …) Loop over the mesh! … while not convergence : { … Loop over the mesh! phi -= dt / 2 * p if …: ! p += ( assemble (dt *inner ( nabla_grad (v),…))*dx) ! else: Call to third party library! solve (…) … phi += dt / 2 * p … Loop over the mesh! } … Each of these loops is implemented in PyOP2

The OP2/PyOP2 programming model The OP2 programming model 5 void incrVertices ( double* e_weight, double* v1, double* v2) { *v1 += f(e_weight) *v2 += f(e_weight) } op_par_loop ( incrVertices , edges, op_arg_dat (edgeWeight, -1, OP_ID, OP_READ), op_arg_dat (vertexDat, 0, edges2vertices, OP_INC), op_arg_dat (vertexDat, 1, edges2vertices, OP_INC));

The OP2/PyOP2 programming model The OP2 programming model 5 void incrVertices ( double* e_weight, double* v1, double* v2) { *v1 += f(e_weight) *v2 += f(e_weight) } op_par_loop ( incrVertices , edges, op_arg_dat (edgeWeight, -1, OP_ID, OP_READ), op_arg_dat (vertexDat, 0, edges2vertices, OP_INC), op_arg_dat (vertexDat, 1, edges2vertices, OP_INC)); INDIRECT MEMORY ACCESSES ( A[B[i]] )!

The OP2/PyOP2 programming model The OP2 programming model 5 void incrVertices ( double* e_weight, double* v1, double* v2) { *v1 += f(e_weight) *v2 += f(e_weight) } op_par_loop ( incrVertices , edges, op_arg_dat (edgeWeight, -1, OP_ID, OP_READ), op_arg_dat (vertexDat, 0, edges2vertices, OP_INC), op_arg_dat (vertexDat, 1, edges2vertices, OP_INC)); INDIRECT MEMORY ACCESSES ( A[B[i]] )! op_par_loop (X, cells, …)

Loop chains in OP2/PyOP2 The OP2 programming model 5 void incrVertices ( double* e_weight, double* v1, double* v2) { *v1 += f(e_weight) *v2 += f(e_weight) } op_par_loop ( incrVertices , edges, op_arg_dat (edgeWeight, -1, OP_ID, OP_READ), op_arg_dat (vertexDat, 0, edges2vertices, OP_INC), op_arg_dat (vertexDat, 1, edges2vertices, OP_INC)); INDIRECT MEMORY ACCESSES ( A[B[i]] )! op_par_loop (X, cells, …) Synchronization point (function call e.g., PETSc) op_par_loop (Y, vertices, …)

6 Implementation of an op_par_loop in CUDA void incrVertices ( op_par_loop ( incrVertices , edges, double* e, op_arg_dat (edgeWeight, -1, OP_ID, OP_READ), double* v1, op_arg_dat (vertexDat, 0, edges2vertices, OP_INC), double* v2) { op_arg_dat (vertexDat, 1, edges2vertices, OP_INC)); *v1 += *e; *v2 += *e; } Coloring used for avoiding race conditions in shared memory parallel execution ��

Implementation of an op_par_loop in CUDA 6 void incrVertices ( op_par_loop ( incrVertices , edges, double* e, op_arg_dat (edgeWeight, -1, OP_ID, OP_READ), double* v1, op_arg_dat (vertexDat, 0, edges2vertices, OP_INC), double* v2) { op_arg_dat (vertexDat, 1, edges2vertices, OP_INC)); *v1 += *e; *v2 += *e; } Coloring used for avoiding race conditions in shared memory parallel execution �� Each partition assigned ! � �� to a Thread Block and ! �� further colored ��

The PyOP2 abstraction, its role in Firedrake, and some optimisations - PowerPoint PPT Presentation

The PyOP2 abstraction, its role in Firedrake, and some optimisations that it enables Paul H J Kelly Group Leader, Software Performance Optimisation Co-Director, Centre for Computational Methods in Science and Engineering Department of Computing,

Anisotropic Goal-Oriented Mesh Adaptation in Firedrake Joe Wallwork 1 Nicolas Barral 2 David Ham 1

Extruded meshes for high aspect ratio simulations in Firedrake and PyOP2 Gheoghe-Teodor (Doru)

Data Abstraction Announcements Data Abstraction Data Abstraction 4 Data Abstraction

Data Abstraction Announcements Data Abstraction Data Abstraction Programmers Compound

Predicate Abstraction with SATABS Existential Abstraction Predicate Abstraction for Software

Chapter 3: Data Abstraction Modularity and Abstraction Abstraction, modularity, information

Partitioning and numbering meshes for efficient MPI-parallel execution in PyOP2 Lawrence Mitchell,

Point, Line, & Plane 1 Abstraction Abstraction is the act of considering something as a

Exploiting Performance Benefits of Extruded Meshes in PyOP2 Department of Computing - Software

On Visual Abstraction Ivan Viola Visual Abstraction Fundamental concept in visualization and

Managing Water Abstraction Reforming Abstraction and Modernising Regulation Richard Austen Water

Announcements Data Abstraction Data Abstraction Compound values combine other values together

Predicate Abstraction with SATABS Version 1.0, 2010 Outline Introduction Existential

61A Lecture 18 Announcements Sequences The Sequence Abstraction 4 The Sequence Abstraction

CS 1331 Introduction to Object Oriented Programming Data Abstraction Christopher Simpkins

Lecture 11: Abstraction I ntro. to Programming, lecture 11: Abstraction 2 Topics for today

Ergodic Mean-Payoff Games for the Analysis of Attacks in Crypto-Currencies Krishnendu Chatterjee 1

Telling your story with data. John E. Adams ms Director john.e.adams@vermont.gov

Networking Overview: Everything you need to know, in 50 minutes CS 161: Computer Security

Ontology-based automatic generation of computerized cognitive exercises Giorgio Leonardi a,c ,

Tabernacle from above Tabernacle with altar and bronze laver Tabernacle court with altar and

XS-Stabilizer Xiaotong Ni joint work with Buerschaper, Van den Nest QEC14

Tracking in City Traffic Scenarios Hamma Tadjine, Daniel Goehring Hamma Tadjine IAV GmbH, 08.

Facilitating the Education of Gam e Developm ent Defense of the Diploma Thesis of Lennart Nacke

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

The PyOP2 abstraction, its role in Firedrake, and some optimisations - PowerPoint PPT Presentation

The PyOP2 abstraction, its role in Firedrake, and some optimisations that it enables Paul H J Kelly Group Leader, Software Performance Optimisation Co-Director, Centre for Computational Methods in Science and Engineering Department of Computing,

Anisotropic Goal-Oriented Mesh Adaptation in Firedrake Joe Wallwork 1 Nicolas Barral 2 David Ham 1

Extruded meshes for high aspect ratio simulations in Firedrake and PyOP2 Gheoghe-Teodor (Doru)

Data Abstraction Announcements Data Abstraction Data Abstraction 4 Data Abstraction

Data Abstraction Announcements Data Abstraction Data Abstraction Programmers Compound

Predicate Abstraction with SATABS Existential Abstraction Predicate Abstraction for Software

Chapter 3: Data Abstraction Modularity and Abstraction Abstraction, modularity, information

Partitioning and numbering meshes for efficient MPI-parallel execution in PyOP2 Lawrence Mitchell,

Point, Line, &amp; Plane 1 Abstraction Abstraction is the act of considering something as a

Exploiting Performance Benefits of Extruded Meshes in PyOP2 Department of Computing - Software

On Visual Abstraction Ivan Viola Visual Abstraction Fundamental concept in visualization and

Managing Water Abstraction Reforming Abstraction and Modernising Regulation Richard Austen Water

Announcements Data Abstraction Data Abstraction Compound values combine other values together

Predicate Abstraction with SATABS Version 1.0, 2010 Outline Introduction Existential

61A Lecture 18 Announcements Sequences The Sequence Abstraction 4 The Sequence Abstraction

CS 1331 Introduction to Object Oriented Programming Data Abstraction Christopher Simpkins

Lecture 11: Abstraction I ntro. to Programming, lecture 11: Abstraction 2 Topics for today

Ergodic Mean-Payoff Games for the Analysis of Attacks in Crypto-Currencies Krishnendu Chatterjee 1

Telling your story with data. John E. Adams ms Director john.e.adams@vermont.gov

Networking Overview: Everything you need to know, in 50 minutes CS 161: Computer Security

Ontology-based automatic generation of computerized cognitive exercises Giorgio Leonardi a,c ,

Tabernacle from above Tabernacle with altar and bronze laver Tabernacle court with altar and

XS-Stabilizer Xiaotong Ni joint work with Buerschaper, Van den Nest QEC14

Tracking in City Traffic Scenarios Hamma Tadjine, Daniel Goehring Hamma Tadjine IAV GmbH, 08.

Facilitating the Education of Gam e Developm ent Defense of the Diploma Thesis of Lennart Nacke

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Point, Line, & Plane 1 Abstraction Abstraction is the act of considering something as a