Finding Latent Code Errors via Machine Learning over Program - PowerPoint PPT Presentation

Finding Latent Code Errors via Machine Learning over Program Executions Yuriy Brun Michael D. Ernst University of Southern Massachusetts Institute California of Technology

Bubble Sort // Return a sorted copy of the argument double[] bubble_sort(double[] in) { double[] out = array_copy(in); for (int x = out.length - 1; x >= 1; x--) for (int y = x - 1; y >= 1; y--) if (out[y] > out[y+1]) swap (out[y], out[y+1]); return out; } 2

Bubble Sort Faulty (?) Code: Fault-revealing properties // Return a sorted copy of the argument double[] bubble_sort(double[] in) { out[0] = in[0] double[] out = array_copy(in); out[1] ≤ in[1] for (int x = out.length - 1; x >= 1; x--) for (int y = x - 1; y >= 1; y--) if (out[y] > out[y+1]) swap (out[y], out[y+1]); return out; } 3

Bubble Sort Faulty Code: Fault-revealing properties // Return a sorted copy of the argument double[] bubble_sort(double[] in) { out[0] = in[0] double[] out = array_copy(in); out[1] ≤ in[1] for (int x = out.length - 1; x >= 1; x--) // lower bound should be 0, not 1 for (int y = x - 1; y >= 1; y--) if (out[y] > out[y+1]) swap (out[y], out[y+1]); return out; } 4

Outline • Intuition for Fault Detection • Latent Error Finding Technique • Fault Invariant Classifier Implementation • Accuracy Experiment • Usability Experiment • Conclusion 5

Outline ! Intuition for Fault Detection • Latent Error Finding Technique • Fault Invariant Classifier Implementation • Accuracy Experiment • Usability Experiment • Conclusion 6

Targeted Errors • Latent Errors – unknown errors • may be discovered later • no manifestation – not discovered by test suite 7

Targeted Programs • Programs that contain latent errors • Test inputs are easy to generate, but test outputs can be hard to compute, e.g.: – Complex computation programs – GUI programs – Programs without formal specification 8

Learning from Fixes Program A: Fixed Program A: … … print (a[a.size] + “elements”); print (a[a.size - 1] + “elements”); … … Program B: … if (store[store.length] > 0); … 9

Outline • Intuition for Fault Detection ! Latent Error Finding Technique • Fault Invariant Classifier Implementation • Accuracy Experiment • Usability Experiment • Conclusion 10

Program Description Mapping 11

Machine Learning Approach • Extracts knowledge from a training set • Creates a model that classifies new objects • Requires a numerical description of the samples 12

Training a Model Examples: out[1] ≤ in[1] 〈 1,0,0,2 〉 13

Training a Model Examples: out[1] ≤ in[1] 〈 1,0,0,2 〉 14

Classifying Properties user program program analysis properties characteristic extractor features model machine classifier fault-revealing properties 15

Related Work • Redundancy in • Relevance: source code [Xie et al. 2002] • same goal – find an error • we have 50 times – 1.5-2 times improvement over improvement over random sampling (for C random sampling programs) 16

Related Work • [Xie et al. 2002] • Relevance: • Partial invariant violation • similar program analysis [Hangal et al. 2002] • similar goal – is there an error? 17

Related Work • [Xie et al. 2002] • Relevance: • [Hangal et al. 2002] • uses machine learning • Clustering of function call profiles [Dickinson et al. 2001, Podgurski et al. 2003] – find relevant tests – select faulty executions 18

Latent Error-Finding Technique program with program with errors removed known errors • Abstract properties program program • Abstract features analysis analysis • Generalizes to new properties properties properties and characteristic characteristic programs extractor extractor features features machine learner model 19

Model • A function: – {set of features} " {fault-revealing, non-fault-revealing} • Examples: – Linear combination functions – If-Then rules 20

Outline • Intuition for Fault Detection • Latent Error Finding Technique ! Fault Invariant Classifier Implementation • Accuracy Experiment • Usability Experiment • Conclusion 21

Tools Required for Fault Invariant Classifier • Program Property Extractor program with program with known errors errors removed – Daikon: Dynamic analysis tool program program analysis analysis • Property to Characteristic Vector properties properties Converter characteristic characteristic extractor extractor • Machine Learning features features – Support Vector Machines (SVMfu) machine learner model • technique is equally applicable to static and dynamic analysis 22

Daikon: Program Property Extractor • Daikon – Dynamic analysis tool – Reports properties that are true over program executions – Examples: • myPositiveInt > 0 • length = data.size 23

Characteristic Vector Extractor • Daikon uses Java objects to represent properties • Converter extracts all possible numeric information from those objects e.g. x>5 " 1 x ∈ array " 2 – # of variables e.g. x>5 " 1 x ∈ array " 0 – is inequality? – involves an array? e.g. x>5 " 0 x ∈ array " 1 • Total: 388 features 24

Support Vector Machine Model • Predictive power • But not explicative power • Consists of thousands of support vectors that define a separating area of the search space 25

Outline • Intuition for Fault Detection • Latent Error Finding Technique • Fault Invariant Classifier Implementation ! Accuracy Experiment • Usability Experiment • Conclusion 26

Subject Programs • 12 Programs – C and Java programs – Largest: 9500 lines – 373 errors (132 seeded, 241 real) • with corrected versions – Authors (at least 132): • Students • Industry • Researchers 27

Accuracy Experiment • Goal: – Test if machine learning can extrapolate knowledge from some programs to others • Train on errors from all but one program • Classify properties for each version of that one program • Compare to expected results 28

Measurements and Definitions • Fault-revealing property: – property of an erroneous program but not of that program with the error corrected – indicative of an error • Brevity: – average number of properties one must examine to find a fault-revealing property – best possible brevity is 1 29

Accuracy Experiment Results • C programs (single-error) – brevity = 2.2 – improvement = 49.6 times • Java programs (mostly multiple-error) – brevity = 1.7 – improvement = 4.8 times 30

Outline • Intuition for Fault Detection • Latent Error Finding Technique • Fault Invariant Classifier Implementation • Accuracy Experiment ! Usability Experiment • Conclusion 31

Fault Invariant Classifier Usability Study • Would properties identified by the fault invariant classifier lead a programmer to errors in code? • Preliminary experimentation: – 1 programmer’s evaluation – 2 programs (41 errors, 410 properties) 32

Usability Study Results • Replace (32 errors) – 68% of properties reported fault-revealing would lead a programmer to the error • Schedule (9 errors) – 58% of properties reported fault-revealing would lead a programmer to the error The majority of the reported properties were effective in indicating errors 33

Outline • Intuition for Fault Detection • Latent Error Finding Technique • Fault Invariant Classifier Implementation • Accuracy Experiment • Usability Experiment ! Conclusion 34

Conclusion • Designed a technique for finding latent errors • Implemented a fully automated Fault Invariant Classifier • Fault Invariant Classifier revealed fault-revealing properties with brevity around 2 • Most of the fault-revealing properties are expected to lead a programmer to the error • Overall, examining 3 properties is expected to lead a programmer to the error in our tests 35

Backup Slides • Works Cited • Explicative Machine Learning Model 36

Works Cited [Dickinson et al. 2001] W. Dickinson, D. Leon, and A. Podgurski. Finding failures by clust execution profiles. In ICSE, pages 339–348, May 2001. [Hangal at al. 2002] S. Hangal and M. S. Lam. Tracking down software bugs using autom detection. In ICSE, pages 291–301, May 2002. [Podgurski at al. 2003] A. Podgurski, D. Leon, P. Francis, W. Masri, M. Minch, J. Sun, an Automated support for classifying software failure reports. In ICSE, pages 465–475, May 2003. [Xie et al. 2002] Y. Xie and D. Engler. Using redundancies to find errors. In FSE, pages 5 Nov. 2002. 37

Explicative Machine Learning Model • C5.0 decision tree machine learner • Examples: • Based on large number of samples and neither an equality nor a linear relationship of three variables # likely fault-revealing • Sequences contains no duplicates or always contains an element # likely fault-revealing – No field accesses # even more likely fault-revealing 38

Finding Latent Code Errors via Machine Learning over Program - PowerPoint PPT Presentation

Finding Latent Code Errors via Machine Learning over Program Executions Yuriy Brun Michael D. Ernst University of Southern Massachusetts Institute California of Technology Bubble Sort // Return a sorted copy of the argument double[]

Basic Errors Compiling in Unix Syntax errors Common Errors, and Debugging Run-Time errors

Code Generation Machine code generation cs4713 1 Machine code generation machine Intermediate

Unified error reporting -- A worthy goal? Andi Kleen, Intel Corporation Sep 2009

Review Ch 1-5 Executing code Compile code (convert from C++ to computer code) - Syntax errors

Introduction Detecting Errors in Effects of Annotation Errors Detecting Errors in Corpus

EventRacer: Finding Concurrency Errors in Event-Driven Applications Pavol Bielik Android Errors

Finding your way in a graph Finding your way in a graph Finding your way in a graph Finding your

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

1 Latent variable models In the next section we will discuss latent variable models for

Part III: Latent Tree Models Le Song ICML 2012 Tutorial on Spectral Algorithms for Latent

Instruction Selection and Scheduling Machine code generation cs5363 1 Machine code generation

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

ELO TRANSLATION PROJECT SARAH **** SOME VOCAB Errors Logic Errors Runtime Errors

Learning Latent Dynamics for Planning from Pixels Danijar Hafner, Timothy Lillicrap, Ian Fischer,

A Discriminative Latent Variable Model for Online Clustering Rajhans Samdani, Kai-Wei Chang , Dan

Poster #24 1 Applied AI Lab, Oxford Robotics Institute 2 Department of Statistics, University of

Variational Sequential Labelers for Semi-Supervised Learning Mingda Chen, Qingming Tang, Karen

Case Study: Approximate Bayesian Inference for Latent Gaussian Models by Using Integrated Nested

Part 3: Latent representations and unsupervised learning Dale Schuurmans University of Alberta

The semnova Package for Latent Repeated Measures ANOVA Benedikt Langenberg, RWTH Aachen

POIR 613: Computational Social Science Pablo Barber a School of International Relations