Refinement-Based Context-Sensitive Points-To Analysis for Java Manu - PowerPoint PPT Presentation

Refinement-Based Context-Sensitive Points-To Analysis for Java Manu Sridharan, Rastislav Bodík UC Berkeley PLDI 2006 1

What Does Refinement Buy You? Increased scalability: enable new clients • Memory: orders of magnitude savings • Time: answer for a variable comes back in 1 second • ) Suitable for IDE Cast Safety Client Precision : 2

Approach: Focus on the Client Demand-driven: only do requested work Client-driven refinement: stop when client satisfied Example: • client asks: “can x point to o?” • we refine until we answer NO (the good answer) or we time out 3

Context-Sensitive Analysis Costly Context-sensitive analysis (def): • Compute result as if all calls inlined • But, collapse recursive methods Exponential blowup (code growth) 4

Why Not Existing Technique? Most analyses approximate same way in all code • E.g., k-CFA • Precision lost, esp. for data structures Our analysis focuses precision where it matters • Fully precise in the limit • Only small amount of code analyzed precisely • First refinement algorithm for Java 5

Points-To Analysis Overview Compute objects each variable can point to For each var x, points-to set pt(x) Model objects with abstract locations 1: x = new Foo() yields pt(x) = { o 1 } Flow-insensitive: statements in any order 6

Points-To Analysis as CFL-Reachability 1) Assignments x = new Obj(); // o 1 y = new Obj(); // o 2 z = x; o 1 x z a 2) Method calls ( 1 [ f ) 1 id(p) { return p; } ] f a = id(x); d c p id ret id b = id(y); [ g ) 2 ( 2 3) Heap accesses o 2 y b c.f = x; c.g = y; d = c.f; pt(x) = { o | o flowsTo x } flowsTo : path exists flowsTo : balanced call and field parens flowsTo : balanced call parens 7

Summary of Formulation Graph represents program Compute reachability with two filters • Language of balanced call parens • Language of balanced field parens 8

Single path problem … … t 9 t 7 ] j [ p t 8 ) 8 t 6 … t 5 ) 5 ( 7 ( 1 ) 1 o t 0 t 1 t 2 x [ h [ f t 12 ] k [ f ( 1 ) 1 [ h ] g o 2 t 10 Problem: show path is unbalanced t 11 Goal: reduce number of visited edges Insight: enough to find one unbalanced paren 9

Approximation via Match Edges o t 0 t 1 t 2 t 3 t 4 x [ g ] h [ f [ h ] j ] f [ f [ g [ h ] h ] j ] f Match edges connect matched field parens • From source of open to sink of close • Initially, all pairs connected Use match edges to skip subpaths 10

Refining the Approximation o t 0 t 1 t 3 t 4 x [ g [ f ] j ] f [ f [ g [ h ] h ] j ] f Refine by removing some match edges • Exposes more of original path for checking Soundness: Traverse match edge ) assume field parens balanced on skipped path Remove where unbalanced parens expected • Explore deeper levels of pointer indirection 11

Refinement With Both Languages ( 2 ) 3 ( 1 ) 1 o t 0 t 1 t 2 t 3 t 4 t 5 t 6 x [ g ] g ] f [ f Fields: [ f [ g ] g ] f Calls: ( 1 ) 1 ( 2 ) 3 Match edges enable approximation of calls • Only can check calls on match-free subpaths Match edge removal ) more call checking • Key point: refine heap and calls together 12

Evaluation 13

Experimental Configuration Implemented in Soot framework Tested on large benchmarks x 2 clients • SPECjvm98, Dacapo suite • Downcast checking, factory method props Refine context-insensitive result Timeout for long-running queries 14

Precision: Cast Checking 15

Scalability: Time and Memory Average query time less than 1 second • Interactive performance (for IDE) • At most 13 minutes for casts, 4 minutes for factory client Very low memory usage: at most 35MB • Of this, 30MB for context-insensitive result • Compare with >2GB for 1-ObjSens analysis 16

Demand-Driven vs. Exhaustive Demand advantage: no caching required • Hence, low memory overhead • No engineering of efficient sets • Good for changing code; just re-compute Demand advantage: faster for many clients • Often only care about some variables Demand disadvantage: slower querying all vars • At most 90 minutes for all app. vars • But, still good precision, memory 17

Conclusions Novel refinement-based analysis • More precise for tested clients • Interactive performance for queries • Low memory: could scale even more • Relatively easy to implement Insight: refine heap and calls together • Useful for other balanced-paren analyses? 18

Refinement-Based Context-Sensitive Points-To Analysis for Java Manu - PowerPoint PPT Presentation

Refinement-Based Context-Sensitive Points-To Analysis for Java Manu Sridharan, Rastislav Bodk UC Berkeley PLDI 2006 1 What Does Refinement Buy You? Increased scalability: enable new clients Memory: orders of magnitude savings Time:

Context Sensitivity Example of a CSG Informatics 2A: Lecture 26 2 Context in Programming

A6: Sensitive Data Exposure A6 Sensitive Data Exposure Sensitive data stored or transmitted

Context Sensitive Solutions Context Context Sensitive Solutions (CSS) is a collaborative approach

Context-sensitive Analysis Attribute Grammar And Type Checking cs5363 1 Context-Sensitive

Adaptive Mesh Refinement CS 101 - Meshing Winter 2007 1 Mesh Refinement Applications

SAT based Abstraction-Refinement using ILP and Machine Learning Techniques Edmund Clarke Anubhav

Oak Hill Parkway Oak Hill Parkway Context Sensitive Solutions CSS Workshop No. 1 October 9,

Context-sensitive languages Informatics 2A: Lecture 28 Alex Simpson School of Informatics

Context-sensitive languages Informatics 2A: Lecture 28 John Longley School of Informatics

Quadratic Interval Refinement Nikolaos Arvanitopoulos Seminar on Computational Geometry and

Points-to Analysis y = &z; y z Points-to Analysis y = &z; x = &y; x y z

Context-sensitive languages Informatics 2A: Lecture 28 John Longley School of Informatics

7 Refinement Options November 3, 2016 Overview Recap the HS Boundary Refinement Process

Crystallographic refinement Roberto A. Steiner roberto.steiner@kcl.ac.uk with many slides

Data Refinement: model-oriented proof methods and their comparison Willem-Paul de Roever

A Refinement of Cayley Graphs Associated to A. R. Naghipour Rings Shahrekord University,

events at ProtoDUNE with PandoraPFA Pantelis Melas, Niki Saoulidou 17/10/2019 2 2 Outline

Semi-Online Bipartite Matching Zoya Svitkina with Ravi Kumar, Manish Purohit, Aaron Schild, Erik

Bring it to Pitch: Combining Video and Movement Data to Enhance Team Sport Analysis Presenter:

NFC Payments: The Art of Relay & Replay Attacks Who are we? Troopers 2018? NFC

ISPD 2006 Arjun Rajagopal Arjun Rajagopal Dallas DSP Design Dallas DSP Design Texas

Compiler Construction Lecture 4: Lexical Analysis III (Practical Aspects) Thomas Noll Lehrstuhl

Navigating Government Support Financial Resources for BC Not-For-Profits Presenting from the

Coordinating Supply and Demand on an On-demand Service Platform with Impatient Customers

Refinement-Based Context-Sensitive Points-To Analysis for Java Manu - PowerPoint PPT Presentation

Refinement-Based Context-Sensitive Points-To Analysis for Java Manu Sridharan, Rastislav Bodk UC Berkeley PLDI 2006 1 What Does Refinement Buy You? Increased scalability: enable new clients Memory: orders of magnitude savings Time:

Context Sensitivity Example of a CSG Informatics 2A: Lecture 26 2 Context in Programming

A6: Sensitive Data Exposure A6 Sensitive Data Exposure Sensitive data stored or transmitted

Context Sensitive Solutions Context Context Sensitive Solutions (CSS) is a collaborative approach

Context-sensitive Analysis Attribute Grammar And Type Checking cs5363 1 Context-Sensitive

Adaptive Mesh Refinement CS 101 - Meshing Winter 2007 1 Mesh Refinement Applications

SAT based Abstraction-Refinement using ILP and Machine Learning Techniques Edmund Clarke Anubhav

Oak Hill Parkway Oak Hill Parkway Context Sensitive Solutions CSS Workshop No. 1 October 9,

Context-sensitive languages Informatics 2A: Lecture 28 Alex Simpson School of Informatics

Context-sensitive languages Informatics 2A: Lecture 28 John Longley School of Informatics

Quadratic Interval Refinement Nikolaos Arvanitopoulos Seminar on Computational Geometry and

Points-to Analysis y = &amp;z; y z Points-to Analysis y = &amp;z; x = &amp;y; x y z

Context-sensitive languages Informatics 2A: Lecture 28 John Longley School of Informatics

7 Refinement Options November 3, 2016 Overview Recap the HS Boundary Refinement Process

Crystallographic refinement Roberto A. Steiner roberto.steiner@kcl.ac.uk with many slides

Data Refinement: model-oriented proof methods and their comparison Willem-Paul de Roever

A Refinement of Cayley Graphs Associated to A. R. Naghipour Rings Shahrekord University,

events at ProtoDUNE with PandoraPFA Pantelis Melas, Niki Saoulidou 17/10/2019 2 2 Outline

Semi-Online Bipartite Matching Zoya Svitkina with Ravi Kumar, Manish Purohit, Aaron Schild, Erik

Bring it to Pitch: Combining Video and Movement Data to Enhance Team Sport Analysis Presenter:

NFC Payments: The Art of Relay &amp; Replay Attacks Who are we? Troopers 2018? NFC

ISPD 2006 Arjun Rajagopal Arjun Rajagopal Dallas DSP Design Dallas DSP Design Texas

Compiler Construction Lecture 4: Lexical Analysis III (Practical Aspects) Thomas Noll Lehrstuhl

Navigating Government Support Financial Resources for BC Not-For-Profits Presenting from the

Coordinating Supply and Demand on an On-demand Service Platform with Impatient Customers

Points-to Analysis y = &z; y z Points-to Analysis y = &z; x = &y; x y z

NFC Payments: The Art of Relay & Replay Attacks Who are we? Troopers 2018? NFC