Interprocedural control-flow analysis Nate Nystrom CS 711 6 Sep - PowerPoint PPT Presentation

Interprocedural control-flow analysis Nate Nystrom CS 711 6 Sep 05

Call graphs • Statically compute a precise call graph • Maps call sites to functions called • Challenge: • Methods • Higher-order functions • Can use precise call graph for: • optimization • reduce dispatch overhead • convert calls to lambdas to direct jumps • reduce code size • program understanding 2

Various techniques • Unique Name [Calder and Grunwald, POPL’94] • Class Hierarchy Analysis [Dean, Grove, Chambers, ECOOP’95] [Fernandez, PLDI’95] • Optimistic Reachability Analysis • Rapid Type Analysis [Bacon and Sweeney, OOPSLA’96] • Propagation-based analysis • 0-CFA [Shivers, PLDI’88] • k -CFA [Shivers ‘91] • Unification-based analysis [Steensgaard, POPL’96] • Interprocedural Class Analysis [DeFouw, Grove, Chambers, POPL’98] 3

Unique Name • Does not build call graph, but does resolve virtual calls class A { int foo() { return 1; } • If only one method named m in } entire program class B extends A { • Replace all virtual calls to a int foo() { return 2; } int bar(int i) { return i+1; } method named m with a non- } virtual call void main() { • Do at link time on object files B p = new B(); • Can resolve (1) only int r1 = p.bar(1); // 1: B.bar int r2 = p.foo(); // 2: B.foo • For C++ benchmarks, resolves 15% A q = p; of virtual calls int r3 = q.foo(); // 3: B.foo • Can’t handle same method name in } different classes 4

Class Hierarchy Analysis class A { int foo() { return 1; } • Use static type of receiver and the } class hierarchy to narrow set of class B extends A { possible targets int foo() { return 2; } int bar(int i) { return i+1; } • Whole program analysis } • Flow insensitive void main() { • O ( N ) B p = new B(); • Can resolve (1) and (2) int r1 = p.bar(1); // 1: B.bar int r2 = p.foo(); // 2: B.foo • For C++ benchmarks, resolves 51% A q = p; of virtual calls int r3 = q.foo(); // 3: B.foo } 5

Rapid Type Analysis class A { int foo() { return 1; } • Do CHA to build call graph } • If no object of class C allocated in class B extends A { the program, int foo() { return 2; } int bar(int i) { return i+1; } • Remove edges to methods of C } • O ( N ) void main() { • Slightly more expensive than CHA B p = new B(); • Can resolve (1), (2), and (3) int r1 = p.bar(1); // 1: B.bar int r2 = p.foo(); // 2: B.foo • For C++ benchmarks, resolves 71% A q = p; of virtual calls int r3 = q.foo(); // 3: B.foo } 6

Disjoint polymorphism • Multiple related object types used class Base { independently void m() { assert(false); } • e.g., Square and Circle objects void p() { assert(false); } are never mixed together in, say, } a Collection of Shapes class Derived1 extends Base { • Pathological case: void m() { ... } • Derived1 and Derived2 are } disjoint class Derived2 extends Base { • No Base objects allocated void p() { ... } • All calls are through Base } pointers 7

Unification-based analysis • Partitions variables in program and maps each partition to a set of classes class A { • Initialize with each variable in own int foo() { return 1; } partition } • If classes can flow between variables, class B extends A { int foo() { return 2; } unify the classes for those variables } target = source; void main() { A p = new B(); T1 m(T2 target) { ... } int r1 = p.foo(); // 4: B.foo m(source); A q = new A(); q = new B(); • Resolves (4), but not (5) int r2 = q.foo(); // 5: B.foo • O ( Nα ( N , N )) } 8

Interprocedural class analysis • Framework integrates • propagation-based analysis (0-CFA) • unification-based analysis • optimistic reachability analysis (RTA) • Computes set of classes for each program variable • Builds call graph as side effect 9

Flow graph representation • Node for each variable, method, new, call • Algorithm computes set of classes for each node • Edge between two nodes if classes can flow between them target = source; source target T1 m(T2 target) { ... } m(source); 10

Basic algorithm (0-CFA) • Construct nodes and edges for top-level variables, statements, and expressions (e.g., main) • Propagate classes through flow graph starting with main and top-level new expressions • When call encountered, add edge to target and construct flow graph for target method (if not already done) • If method not reachable, it will be pruned (as in RTA) 11

Edge filters • Edges may have a filter set • encode constraints ensured by type declarations or by dynamic dispatch • Don’t propagate class if filter does not include that class • Makes algorithm more precise than 0-CFA B.m() filter: {B} class B { m() { ... this ... } } this B class C ext B { m() { ... this ... } } o this C filter: {C} B o = new C(); C.m() o.m() 12

Call merging • Analysis parameterized by MergeCalls • When MergeCalls = false: o1 this B o1.m(a1) B.m(x1) { ... } a1 x1 o2 this C o2.m(a2) C.m(x2) { ... } a2 x2 • When MergeCalls = true: o1 this B o1.m(a1) B.m(x1) { ... } a1 m0 x1 o2 m1 this C o2.m(a2) C.m(x2) { ... } a2 x2 13

Node merging • Can speedup analysis by merging nodes into supernodes • Nodes merged with successors source target target = source; T1 m(T2 target) { ... } m(source); source target • Always merging is equivalent to unification-based analysis 14

Merging parameters • Analysis parameterized by P and MergeWithGlobal • When P = k , merge node with its successors if node visited more than k times • When P = 0, always merge • When P = N , never merge • When MergeWithGlobal = true, use only one global supernode 15

Instantiations Algorithm P MergeWithGlobal MergeCalls Complexity 0-CFA N N/A false O ( N 3 ) linear-edge 0-CFA N N/A true O ( N 2 ) bounded 0-CFA O (1) false false O ( N 2 α ( N , N )) bounded linear-edge 0-CFA O (1) false true O ( Nα ( N , N )) simply bounded 0-CFA O (1) true false O ( N 2 ) simply bounded linear-edge 0-CFA O (1) true true O ( N ) equivalence class analysis 0 false true O ( Nα ( N , N )) RTA 0 true true O ( N ) 16

Analysis time • Analysis time increases slightly with P • Mostly flat when P small, finite • MergeWithGlobal = true (simply bounded) • saves ~10% on Cecil • negligible improvement for Java • but all the benchmarks are Java compilers • 250% for one case when P = N • MergeCalls = true (linear edge) • up to 3x for Cecil, or more • only 5-20% savings for Java • no multimethods, so less edge filtering? • some programs can only be analyzed with linear edge (or small P ) 17

Precision • Larger P more precise (less merging) • Run-time speedup 0-10% for P = 0, 10-350% for P = N • MergeCalls = true (linear edge) • About as precise as quadratic edge • Less so for Java, but no difference in speedup • MergeWithGlobal = true (simply bounded) • Slightly less precision • but on some Cecil benchmarks, improved precision of MergeWithGlobal = false caused 2.5x speedup • precision lost on hot virtual calls? 18

Questions • All of these analyses are whole-program • Can they be modularized? • Integrating alias analysis, or more precise points to analysis • Extend class analysis to incorporate context as in k -CFA 19

Interprocedural control-flow analysis Nate Nystrom CS 711 6 Sep - PowerPoint PPT Presentation

Interprocedural control-flow analysis Nate Nystrom CS 711 6 Sep 05 Call graphs Statically compute a precise call graph Maps call sites to functions called Challenge: Methods Higher-order functions Can use precise call

Interprocedural Analysis and Abstract Interpretation cs6463 1 Outline Interprocedural

Interprocedural Analysis Last time Interprocedural analysis Today Interprocedural alias

Interprocedural Analysis 15819M 15819M Program Analysis Jonathan Aldrich Interprocedural

Interprocedural Analysis Last time Alias analysis Today Interprocedural analysis CS553

1 What Is Control-Flow Analysis? Loop Concepts Control-flow analysis discovers the flow of

Interprocedural Analysis and Optimization Mod/Ref Analysis Alias Analysis Constant Propagation

Class 9 Review; questions Discussion of Semester Project Arbitrary interprocedural

Flow Analysis Data-flow analysis, Control-flow analysis, Abstract interpretation, AAM Helpful

Class 10 Review; questions Questions about project Arbitrary interprocedural control

Program Analysis Chris Hankin(clh) Efficiency Type and Effect Control Flow Analysis Systems

Control-Flow Analysis and Loop Detection Last time PRE Today Control-flow

Alias Analysis Last time Interprocedural analysis Today Intro to alias analysis (pointer

1 Liveness by Example (cont) Control Flow Graphs (CFGs) Live range of a Definition a is live

Dataflow analysis Theory and Applications cs6463 1 Control-flow graph Graphical

Motivation Intra-procedural analysis depends upon accurate control-flow information. In the

Towards Dynamic Interprocedural Analysis in JVMs Feng Qian and Laurie Hendren { fqian,hendren }

HW/SW Codesign Analysis of Control & Data Flow I ECE 522 Control and Data Edges C programs

Control-flow analysis ? ? ? ? ? Discovering information about how control (e.g. the program

Flow and Congestion Control CS 218 F2003 Oct 29, 03 Flow control goals Classification

Interprocedural Heap Analysis using Access Graphs and Value Contexts with applications to

Fundamentals of (Static ic and Dynamic ic) ) Software Verif ific icatio ion Control-flow

Diagnostic Information for Control-Flow Analysis of Workflow Graphs (aka Free-Choice Workflow Nets)

Control flow Conditionals and Control Flow l A conditional branch

1 Why Why flow flow control? control? sender

Interprocedural control-flow analysis Nate Nystrom CS 711 6 Sep - PowerPoint PPT Presentation

Interprocedural control-flow analysis Nate Nystrom CS 711 6 Sep 05 Call graphs Statically compute a precise call graph Maps call sites to functions called Challenge: Methods Higher-order functions Can use precise call

Interprocedural Analysis and Abstract Interpretation cs6463 1 Outline Interprocedural

Interprocedural Analysis Last time Interprocedural analysis Today Interprocedural alias

Interprocedural Analysis 15819M 15819M Program Analysis Jonathan Aldrich Interprocedural

Interprocedural Analysis Last time Alias analysis Today Interprocedural analysis CS553

1 What Is Control-Flow Analysis? Loop Concepts Control-flow analysis discovers the flow of

Interprocedural Analysis and Optimization Mod/Ref Analysis Alias Analysis Constant Propagation

Class 9 Review; questions Discussion of Semester Project Arbitrary interprocedural

Flow Analysis Data-flow analysis, Control-flow analysis, Abstract interpretation, AAM Helpful

Class 10 Review; questions Questions about project Arbitrary interprocedural control

Program Analysis Chris Hankin(clh) Efficiency Type and Effect Control Flow Analysis Systems

Control-Flow Analysis and Loop Detection Last time PRE Today Control-flow

Alias Analysis Last time Interprocedural analysis Today Intro to alias analysis (pointer

1 Liveness by Example (cont) Control Flow Graphs (CFGs) Live range of a Definition a is live

Dataflow analysis Theory and Applications cs6463 1 Control-flow graph Graphical

Motivation Intra-procedural analysis depends upon accurate control-flow information. In the

Towards Dynamic Interprocedural Analysis in JVMs Feng Qian and Laurie Hendren { fqian,hendren }

HW/SW Codesign Analysis of Control &amp; Data Flow I ECE 522 Control and Data Edges C programs

Control-flow analysis ? ? ? ? ? Discovering information about how control (e.g. the program

Flow and Congestion Control CS 218 F2003 Oct 29, 03 Flow control goals Classification

Interprocedural Heap Analysis using Access Graphs and Value Contexts with applications to

Fundamentals of (Static ic and Dynamic ic) ) Software Verif ific icatio ion Control-flow

Diagnostic Information for Control-Flow Analysis of Workflow Graphs (aka Free-Choice Workflow Nets)

Control flow Conditionals and Control Flow l A conditional branch

1 Why Why flow flow control? control? sender

HW/SW Codesign Analysis of Control & Data Flow I ECE 522 Control and Data Edges C programs