Interfaces for Runtime Correctness Checking of Parallel Programs - PowerPoint PPT Presentation

Interfaces for Runtime Correctness Checking of Parallel Programs Joachim Protze (protze@itc.rwth-aachen.de)

Motivation • OpenMP 3 introduced tasks (2008) • Several data race detection tools for OpenMP tasks popped up just last year • How can we effectively reduce the porting effort for new programming paradigms? Memory accesses Concurrency Synchronization 2 Generic Tool Interface for Runtime Correctness Checking Joachim Protze

Synchronization in OpenMP Parallel region parallel-begin • Encountering a parallel directive happens before execution of the implicit-task-begin parallel region • Encountering a barrier directive barrier-begin happens before execution of code barrier-end following the barrier region • Encountering the implicit barrier happens before the master barrier-begin continues code following the implicit-task-end ! parallel region parallel-end 3 Generic Tool Interface for Runtime Correctness Checking Joachim Protze

Synchronization in OpenMP Task region task task depend(out:a) task-create task task depend(in:a) • Encountering a task directive +task-dependencies happens before execution of the task-begin task region • Finishing execution of a child task task-end happens before execution of code task-begin following a taskwait, barrier, or taskgroup region • Finishing a predecessor task task-end happens before a dependent task starts execution taskwait-end taskwait • Deferring a task happens before scheduling the task again 4 Generic Tool Interface for Runtime Correctness Checking Joachim Protze

Archer based on ThreadSanitizer • ThreadSanitizer comes with clang and gcc (-fsanitize=thread) • Compiler instrumentation of memory accesses − Less overhead than binary instrumentation (e.g., PIN, valgrind) • ThreadSanitizer is not aware of OpenMP synchronization • Happens before analysis with simplified fast track algorithm. − 4 records of memory access to a word, storing (epoch,tid,r/w) • Archer annotates OpenMP synchronization − Initially instrumentation of the LLVM/OpenMP runtime − Now based on OMPT events 5 Generic Tool Interface for Runtime Correctness Checking Joachim Protze

Data race analysis overhead for SPEC OMP 2012 (train) • Expected overhead according to base tool: 2-20x • 359.botsspar and 370.mgrid331 > 20x − Both run <1 second with high synchronization rate ▪ 359.botsspar: 353400 task switches ▪ 370.mgrid331: 6383 parallel regions 50 99.8 40 Tool Slowdown 30 2 Threads 4 Threads 20 12 Threads 10 0 350 351 352 357 358 359 360 362 363 367 370 371 372 376 6 Generic Tool Interface for Runtime Correctness Checking Joachim Protze

Concurrency for OpenMP Tasks • Observed actual • Lamport • Separating • Execution of execution with happens- the logical the thread as HB before slices observed by a tool thread Wallclock time Logical clock Wallclock time Wallclock time Happens-before Observed execution order 7 Generic Tool Interface for Runtime Correctness Checking Joachim Protze

TLC: Marking execution within a thread as concurrent • Observed actual • Lamport • Separating • Execution of execution with happens- the logical the thread as HB before slices observed by the tool thread Wallclock time Logical clock Wallclock time Wallclock time Happens-before Observed execution order Fork / spawn 8 Generic Tool Interface for Runtime Correctness Checking Joachim Protze

Generic events • Fork(curr, *new) − Fork(curr, *new, *msg) • Join(curr, next) • Switch(curr, next) − Switch(curr, next, msg) • Send(curr, *msg) • Recv(curr, msg) 9 Generic Tool Interface for Runtime Correctness Checking Joachim Protze

Concurrency / Synchronization in Shared Memory Parallel, Tasks, Loops Fork(curr, *new) • Fork → P2P synchronization, concurrency Join(curr, next) • Join → P2P synchronization Switch(curr, next) • Barrier → global synchronization Send(curr, *msg) − Can translate into N2N synchronization Recv(curr, msg) • Dependencies → P2P synchronization • Locks → ? − Should be flexible to enable lock-set and HB analysis • Parallel loop → concurrency for each iteration • Doacross loops → P2P synchronization 10 Generic Tool Interface for Runtime Correctness Checking Joachim Protze

Applying this semantics to MPI MPI Non-Blocking • MPI_Isend / MPI_Irecv → concurrency, P2P synchronization − Bind the new execution unit handle to the request • MPI_Wait → synchronize task MPI_Irecv • Buffer access → read/write task thread MPI_Wait 11 Generic Tool Interface for Runtime Correctness Checking Joachim Protze

Applying this semantics to MPI MPI One-sided • MPI One-sided epochs → concurrency, P2P synchronization • MPI One-sided target completion → synchronize • Remote memory access → read/write 12 Generic Tool Interface for Runtime Correctness Checking Joachim Protze

Device Offloading

Basic memory operations in device offloading • Memory access • Alloc/release memory • (Dis-)Associate memory • Update memory (memcopy) OpenMP mapping semantics: • Alloc alloc + associate • Map-to ((alloc +) associate +) update to device • Map-from update from device (+ disassociate (+ release)) • Update-to/from update to/from device • Release disassociate + release Challenge: semantics of global/static memory 14 Generic Tool Interface for Runtime Correctness Checking Joachim Protze

Distributed Memory ?

Thank you for your attention.

Interfaces for Runtime Correctness Checking of Parallel Programs - PowerPoint PPT Presentation

Interfaces for Runtime Correctness Checking of Parallel Programs Joachim Protze (protze@itc.rwth-aachen.de) Motivation OpenMP 3 introduced tasks (2008) Several data race detection tools for OpenMP tasks popped up just last year How can

Proving Program Correctness The Axiomatic Approach What is Correctness? Correctness:

Another approach to runtime checking Typical runtime checking is by duplicating entire CPU

Checking & Spot-Checking the Correctness of Priority Queues Matthew Chu & Sampath Kannan

T Topic 7 i 7 Interfaces and Abstract Interfaces and Abstract Classes Interfaces Interfaces

From Model Checking to Proof Checking ... and Back Kedar Namjoshi Bell Labs April 29, 2005

CSSE 220 Interfaces and Polymorphism Check out Interfaces from SVN Interfaces What, When,

The History of Interaction Batch Interfaces Command-Line Interfaces Graphical User

Virtual xfrm interfaces Steffen Klassert secunet Security Networks AG Dresden Linux IPsec

Testing Concurrency Runtime via a Testing Concurrency Runtime via a Stochastic Stress Framework

Real Real Real Time Real-Time Time Time Model Checking Model Model Checking Model

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Statistical Statistical Statistical Model Statistical Model Model Checking Model Checking

3. Satisfiability Checking 3.1 SAT-Checking Procedures Verification Technology

Hoare Logic and Model Checking Model Checking Lecture 11: Model checking for Computation Tree

Optimal Prices in the Towards a Precise . . . Towards a Precise . . . Presence of Discounts:

Interval Computations as Why Intervals? Applied Constructive Interval Computations . . . Wiener

Apply the Gospel Shame and Honor Shame: A sense of

on Reading Skills 24 March 2018 Outline of Sharing Overview of Reading Extensive Reading

Shar Shared Memory ed Memory Pr Programming Paradigm ogramming Paradigm Ivan Girotto

OpenMP parallelization of the complex magnetohydrodynamic model BATS-R-US Gbor Tth Hongyang

Language Models Philipp Koehn 8 September 2020 Philipp Koehn Machine Translation: Language

The Axiomatic Method in Social Choice Theory: Preference Aggregation, Judgment Aggregation, Graph

Interfaces for Runtime Correctness Checking of Parallel Programs - PowerPoint PPT Presentation

Interfaces for Runtime Correctness Checking of Parallel Programs Joachim Protze (protze@itc.rwth-aachen.de) Motivation OpenMP 3 introduced tasks (2008) Several data race detection tools for OpenMP tasks popped up just last year How can

Proving Program Correctness The Axiomatic Approach What is Correctness? Correctness:

Another approach to runtime checking Typical runtime checking is by duplicating entire CPU

Checking &amp; Spot-Checking the Correctness of Priority Queues Matthew Chu &amp; Sampath Kannan

T Topic 7 i 7 Interfaces and Abstract Interfaces and Abstract Classes Interfaces Interfaces

From Model Checking to Proof Checking ... and Back Kedar Namjoshi Bell Labs April 29, 2005

CSSE 220 Interfaces and Polymorphism Check out Interfaces from SVN Interfaces What, When,

The History of Interaction Batch Interfaces Command-Line Interfaces Graphical User

Virtual xfrm interfaces Steffen Klassert secunet Security Networks AG Dresden Linux IPsec

Testing Concurrency Runtime via a Testing Concurrency Runtime via a Stochastic Stress Framework

Real Real Real Time Real-Time Time Time Model Checking Model Model Checking Model

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Software Model Checking Using Bogor Software Model Checking Using Bogor a Modular and

Statistical Statistical Statistical Model Statistical Model Model Checking Model Checking

3. Satisfiability Checking 3.1 SAT-Checking Procedures Verification Technology

Hoare Logic and Model Checking Model Checking Lecture 11: Model checking for Computation Tree

Optimal Prices in the Towards a Precise . . . Towards a Precise . . . Presence of Discounts:

Interval Computations as Why Intervals? Applied Constructive Interval Computations . . . Wiener

Apply the Gospel Shame and Honor Shame: A sense of

on Reading Skills 24 March 2018 Outline of Sharing Overview of Reading Extensive Reading

Shar Shared Memory ed Memory Pr Programming Paradigm ogramming Paradigm Ivan Girotto

OpenMP parallelization of the complex magnetohydrodynamic model BATS-R-US Gbor Tth Hongyang

Language Models Philipp Koehn 8 September 2020 Philipp Koehn Machine Translation: Language

The Axiomatic Method in Social Choice Theory: Preference Aggregation, Judgment Aggregation, Graph

Checking & Spot-Checking the Correctness of Priority Queues Matthew Chu & Sampath Kannan