Dataflow analysis Theory and Applications cs6463 1 Control-flow - PowerPoint PPT Presentation

Dataflow analysis Theory and Applications cs6463 1

Control-flow graph  Graphical representation of runtime control-flow paths  Nodes of graph: basic blocks (straight-line computations)  Edges of graph: flows of control  Useful for collecting information about computation  Detect loops, remove redundant computations, register allocation, instruction scheduling…  Alternative CFG: Each node contains a single statement i =0; …… i = 0 if I < 50 while (i < 50) { t1 = b * 2; a = a + t1; t1 := b * 2; i = i + 1; …… a := a + t1; } i = i + 1; …. cs6463 2

Building control-flow graphs Identifying basic blocks Input: a sequence of three-address statements  Output: a list of basic blocks  Method:  Determine each statement that starts a new basic block, including   The first statement of the input sequence  Any statement that is the target of a goto statement  Any statement that immediately follows a goto statement Each basic block consists of   A starting statement S0 (leader of the basic block)  All statements following S0 up to but not including the next starting statement (or the end of input) …… Starting statements: i := 0 i := 0 s0: if i < 50 goto s1 S0, goto s2 goto S2 s1: t1 := b * 2 S1, a := a + t1 goto s0 S2 S2: … cs6463 3

Building control-flow graphs Identify all the basic blocks  Create a flow graph node for each basic block  For each basic block B1  If B1 ends with a jump to a statement that starts basic block B2,  create an edge from B1 to B2 If B1 does not end with an unconditional jump, create an edge from  B1 to the basic block that immediately follows B1 in the original evaluation order i :=0 …… i := 0 S0: if i < 50 goto s1 s0: if i < 50 goto s1 goto s2 s1: t1 := b * 2 s1: t1 := b * 2 a := a + t1 goto s2 a := a + t1 goto s0 goto s0 S2: … S2: …… cs6463 4

Example Dataflow Live variable analysis  A data-flow analysis problem  A variable v is live at CFG point p iff there is a path from p to a use of v along which v is not redefined  At any CFG point p, what variables are alive?  Live variable analysis can be used in  Global register allocation  Dead variables no longer need to be in registers  Useless-store elimination  Dead variable don’t need to be stored back to memory  Uninitialized variable detection  No variable should be alive at program entry point cs6463 5

Computing live variables  For each basic block n, let  UEVar(n)=variables used before any definition in n  VarKill(n)=variables defined (modified) in n (killed by n) for each basic block n:S1;S2;S3;…;Sk VarKill := ∅ M UEVar(n) := ∅ for i = 1 to k S1: m := y * z suppose Si is “x := y op z” S2: y := y -z if y ∉ VarKill S3: o := y * z UEVar(n) = UEVar(n) ∪ {y} if z ∉ VarKill UEVar(n) = UEVar(n) ∪ {z} VarKill = VarKill ∪ {x} cs6463 6

Computing live variables  Domain m:=a+b A  All variables inside a function n:=a+b  For each basic block n, let B q:=a+b UEVar(n) p:=c+d  C vars used before defined r:=c+d r:=c+d VarKill(n)  vars defined (killed by n) e:=b+18 e:=a+17 Goal: evaluate vars alive on D s:=a+b E t:=c+d entry to and exit from n u:=e+f u:=e+f LiveOut(n)= ∪ m ∈ succ(n) LiveIn(m) LiveIn(m)=UEVar(m) ∪ v:=a+b (LiveOut(m) - VarKill(m)) F w:=c+d ==> LiveOut(n)= ∪ m ∈ succ(n) (UEVar(m) ∪ m:=a+b (LiveOut(m) - VarKill(m)) G n:=c+d cs6463 7

Algorithm: computing live variables  For each basic block n, let UEVar(n)=variables used before any definition in n  VarKill(n)=variables defined (modified) in n (killed by n)  Goal: evaluate names of variables alive on exit from n LiveOut(n)= ∪ (UEVar(m) ∪ (LiveOut(m) - VarKill(m))  m ∈ succ(n) for each basic block bi compute UEVar(bi) and VarKill(bi) LiveOut(bi) := ∅ for (changed := true; changed; ) changed = false for each basic block bi old = LiveOut(bi) LiveOut(bi)= ∪ (UEVar(m) ∪ (LiveOut(m) - VarKill(m)) m ∈ succ(bi) if (LiveOut(bi) != old) changed := true cs6463 8

Solution Computing live variables  Domain  a,b,c,d,e,f,m,n,p,q,r,s,t,u,v,w m:=a+b A n:=a+b UE Vark Live LiveOu LiveOut ill Out t var B q:=a+b p:=c+d C A a,b m,n a,b,c,d a,b,c,d, ∅ r:=c+d r:=c+d ,f f B c,d p,r a,b,c,d a,b,c,d ∅ e:=b+18 e:=a+17 C a,b, q,r a,b,c,d a,b,c,d, D E ∅ s:=a+b t:=c+d c,d ,f f u:=e+f u:=e+f D a,b, e,s, a,b,c,d a,b,c,d, ∅ f u f v:=a+b E a,c, e,t,u a,b,c,d a,b,c,d, ∅ F w:=c+d d,f f F a,b, v,w a,b,c,d a,b,c,d, ∅ c,d f m:=a+b G a,b, m,n G ∅ ∅ ∅ n:=c+d c,d cs6463 9

Another Example Available Expressions Analysis  The aim of the Available Expressions Analysis is to determine  For each program point, which expressions must have already been computed, and not later modified, on all paths to the program point.  Example Optimized code: [x:= a+b ]1; [x:= a+b]1; [y:=a*b]2; [y:=a*b]2; while [y> a+b ]3 { while [y> x ]3 { [a:=a+1]4; [a:=a+1]4; [x:= a+b ]5 [x:= a+b]5 } } cs6463 10

Available Expression Analysis  Domain of analysis m:=a+b  All expressions within a A b:=c+d function  For each basic block n, let B q:=a+b p:=a+b DEexp(n) C  e:=c+d Exps evaluated without any e:=c+d operand redefined ExpKill(n)  e:=b+18 e:=a+17 Exps whose operands are D s:=a+b E t:=c+d redefined (exps killed by n) a:=e+f b:=e+f Goal: evaluate exps available on all paths entering n AvailIn(n)= ∩ m ∈ pred(n) AvailOut(m) w:=a+b F AvailOut(m) = DEexp(m) ∪ X:=e+f (AvailIn(m) - ExpKill(m)) ==> AvailIn(n)= ∩ m ∈ pred(n) y:=a+b G (DEexp(m) ∪ c:=c+d (AvailIn(m) - ExpKill(m)) cs6463 11

Algorithm: computing available expressions For each basic block n, let  DEexp(n)=expressions evaluated without any operand redefined  ExpKill(n)=expressions whose operands are redefined in n  Goal: evaluate expressions available from entry to n AvailIn(n)= ∩ m ∈ pred(n) ( DEexp(m) ∪ (AvailIn(m) - ExpKill(m)) for each basic block bi compute DEexp(bi) and ExpKill(bi) AvailIn(bi) := isEntry(bi)? ∅ : Domain(Exp); for (changed := true; changed; ) changed = false for each basic block bi old = Avail(bi) AvailIn(bi)= ∩ m ∈ pred(bi) ( DEexp(m) ∪ (AvailIn(m) - ExpKill(m)) if (AvailIn(bi) != old) changed := true cs6463 12

Solution Available Expression Analysis Domain: a+b(1), c+d(2), m:=a+b b+18(3),e+f(4), a+17(5) A b:=c+d DEexp ExpKil Avail Avail B q:=a+b A 2 1,3 ∅ ∅ p:=a+b C e:=c+d e:=c+d B 1,2 4 12345 2 e:=b+18 e:=a+17 C 1,2 4 12345 2 D s:=a+b E t:=c+d a:=e+f b:=e+f D 3,4 1,4,5 12345 1,2 w:=a+b E 2,4,5 1,3,4 12345 1,2 F X:=e+f F 1,4 12345 2,4 ∅ G 1,2 2 12345 1,2 y:=a+b G c:=c+d cs6463 13

Iterative dataflow algorithm  Iterative evaluation of result for each basic block bi sets until a fixed point is compute Gen(bi) and Kill(bi) reached Result(bi) := ∅ or Domain  Does the algorithm always for (changed := true; changed; ) terminate? changed = false  If the result sets are bounded and grow for each basic block bi monotonically, then yes; old = Result(bi) Otherwise, no. Result(bi)=  Fixed-point solution is independent of evaluation ∩ or ∪ order [m ∈ pred(bi) or succ(bi)]  What answer does the (Gen(m) ∪ (Result(m)-Kill(m)) algorithm compute? if (Result(bi) != old)  Unique fixed-point solution changed := true  The meet-over-all-paths solution  How long does it take the algorithm to terminate?  Depends on traversing order of basic blocks cs6463 14

Traversing order of basic blocks  Facilitate fast convergence to the fixed point 4  Postorder traversal postorder 2 3  Visits as many of a nodes successors as possible before visiting the node 1  Used in backward data-flow analysis  Reverse postorder traversal  Visits as many of a node’s predecessors as possible 1 before visiting the node  Used in forward data-flow 3 2 Reverse analysis postorder 4 cs6463 15

The Overall Pattern  Each data-flow analysis takes the form Input(n) := ∅ if n is program entry/exit := Λ m ∈ Flow(n) Result(m) otherwise Result(n) = ƒ n (Input(n))  where Λ is ∩ or ∪ (m ay vs. must analysis)  May analysis: detect properties satisfied by at least one path ( ∪ )  Must analysis: detect properties satisfied by all paths( ∩ )  Flow(n) is either pred(n) or succ(n) (forward vs. backward flow)  Forward flow: data flow forward along control-flow edges.  Input(n) is data entering n, Result is data exiting n  Input(n) is ∅ if n is program entry  Backward flow: data flow backward along control-flow edges.  Input(n) is data exiting n, Result is data entering n  Input(n) is ∅ if n is program exit  Function ƒ n is the transfer function associated with each block n cs6463 16

The Mathematical Foundation of Dataflow Analysis  Mathematical formulation of dataflow analysis  The property space L is used to represent the data flow domain information  The combination operator Λ : P(L) → L is used to combine information from different paths  A set P is an ordered set if a partial order ≤ can be defined s.t. ∀ x,y,z ∈ P  x ≤ x (reflexive)  If x ≤ y and y ≤ x, then x = y (asymmetric)  If x ≤ y and y ≤ z implies x ≤ z (transitive)  Example: Power(L) with ⊆ define the partial order cs6463 17

Dataflow analysis Theory and Applications cs6463 1 Control-flow - PowerPoint PPT Presentation

Dataflow analysis Theory and Applications cs6463 1 Control-flow graph Graphical representation of runtime control-flow paths Nodes of graph: basic blocks (straight-line computations) Edges of graph: flows of control Useful for

Naiad (Timely Dataflow) & Streaming Systems CS 848: Models and Applications of Distributed

Quantifying Dataflow Analysis with Gradients in LLVM Gabriel Ryan 1 , Abhishek Shah 1 , Dongdong

Google Cloud Dataflow Cosmin Arad , Senior Software Engineer carad@google.com August 7, 2015

CO444H Dataflow Dataflow frameworks Ben Livshits Masters Projects Available 1. Crashes to

Chapter 8 Dataflow Descriptions in VHDL 1 benyamin@mehr.sharif.edu Dataflow Description

Dataflow Testing Chapter 10 Dataflow Testing Testing All-Nodes and All-Edges in a control

Dataflow Testing Chapter 10 Dataflow Testing Testing All-Nodes and All-Edges in a control

WaveScalar Dataflow machine good at exploiting ILP dataflow parallelism + traditional

Dataflow computation, tree transformations and comonads Tarmo Uustalu, Tallinn Joint work with

Biggest Challenge: Dataflow in Meetup for Android Mike Castleman Meetup New York Android

Dataflow Supercomputers Michael J. Flynn Maxeler T echnologies and Stanford University Outline

Oversampling in a Dataflow Synchronous Language (Heptagon) erard 1 L eonard G 1 PARKAS team

approach to parallelism www.pervasivedatarush.com Agenda Background Dataflow Overview

Dataflow Execution Dataflow Execution Craig Knoblock University of Southern California This

Differential Dataflow McSherry, Frank D., Murray, Derek G., Isaacs, Rebecca, Isard, Michael

Dataflow Process Network Goals Formalize dataflow process network Widely used in signal

A characterization of Random Variables Paolo Leonetti (based on joint work with Simone

Galois Connections Roland Backhouse 3rd December, 2002 2 Fusion Many problems are expressed in

Independence Posets Nathan Williams (UTD) Joint with Hugh Thomas (UQAM) Triangle Lectures in

On scattered convex geometries joint work with Maurice Pouzet Universit Claude-Bernard, Lyon

Course Script IN 5110: Specification and Verification of Parallel Sys- tems IN5110, autumn 2019

Completeness Christophe Ritzenthaler Institut de Mathmatiques de Luminy, CNRS Montral 04-10

INF3170 / INF4171 Soundness and completeness of sequent calculus Andreas Nakkerud September 14,

Completeness of Online Planners for Partially Observable Deterministic Tasks Blai Bonet Gabriel

Dataflow analysis Theory and Applications cs6463 1 Control-flow - PowerPoint PPT Presentation

Dataflow analysis Theory and Applications cs6463 1 Control-flow graph Graphical representation of runtime control-flow paths Nodes of graph: basic blocks (straight-line computations) Edges of graph: flows of control Useful for

Naiad (Timely Dataflow) &amp; Streaming Systems CS 848: Models and Applications of Distributed

Quantifying Dataflow Analysis with Gradients in LLVM Gabriel Ryan 1 , Abhishek Shah 1 , Dongdong

Google Cloud Dataflow Cosmin Arad , Senior Software Engineer carad@google.com August 7, 2015

CO444H Dataflow Dataflow frameworks Ben Livshits Masters Projects Available 1. Crashes to

Chapter 8 Dataflow Descriptions in VHDL 1 benyamin@mehr.sharif.edu Dataflow Description

Dataflow Testing Chapter 10 Dataflow Testing Testing All-Nodes and All-Edges in a control

Dataflow Testing Chapter 10 Dataflow Testing Testing All-Nodes and All-Edges in a control

WaveScalar Dataflow machine good at exploiting ILP dataflow parallelism + traditional

Dataflow computation, tree transformations and comonads Tarmo Uustalu, Tallinn Joint work with

Biggest Challenge: Dataflow in Meetup for Android Mike Castleman Meetup New York Android

Dataflow Supercomputers Michael J. Flynn Maxeler T echnologies and Stanford University Outline

Oversampling in a Dataflow Synchronous Language (Heptagon) erard 1 L eonard G 1 PARKAS team

approach to parallelism www.pervasivedatarush.com Agenda Background Dataflow Overview

Dataflow Execution Dataflow Execution Craig Knoblock University of Southern California This

Differential Dataflow McSherry, Frank D., Murray, Derek G., Isaacs, Rebecca, Isard, Michael

Dataflow Process Network Goals Formalize dataflow process network Widely used in signal

A characterization of Random Variables Paolo Leonetti (based on joint work with Simone

Galois Connections Roland Backhouse 3rd December, 2002 2 Fusion Many problems are expressed in

Independence Posets Nathan Williams (UTD) Joint with Hugh Thomas (UQAM) Triangle Lectures in

On scattered convex geometries joint work with Maurice Pouzet Universit Claude-Bernard, Lyon

Course Script IN 5110: Specification and Verification of Parallel Sys- tems IN5110, autumn 2019

Completeness Christophe Ritzenthaler Institut de Mathmatiques de Luminy, CNRS Montral 04-10

INF3170 / INF4171 Soundness and completeness of sequent calculus Andreas Nakkerud September 14,

Completeness of Online Planners for Partially Observable Deterministic Tasks Blai Bonet Gabriel

Naiad (Timely Dataflow) & Streaming Systems CS 848: Models and Applications of Distributed