Verification-Aided Regression Testing
Fabrizio Pastore (1), Leonardo Mariani (1), Grigory Fedyukovich (2), Antti E. J. Hyvärinen (2), Natasha Sharygina (2), Ondrej Sery (2), Stephan Sehestedt (3), Ali Muhammed (4)
(1) University of Milano-Bicocca, Italy
(2) University of Lugano, Switzerland
(3) ABB Corporate Research, Ladenburg, Germany
(4) VTT Technical Research Centre, Tampere, Finland
October 17, 2013

Motivation
• Regression testing is an integral part of many software development processes
• Given an upgrade of a software system, does it satisfy the validation test suite passed by the base version?
• The detection of faults depends critically on the quality of the validation test suite
• This work aims at reducing the dependency on the test suite by
  (i) automatically producing properties that hold for the base version
  (ii) automatically identifying, and checking on the upgraded program, only the properties that the developer intends the upgrade to preserve
  (iii) reporting faults not revealed by the regression tests
• We use dynamic property generation together with bounded model checking to achieve this goal

Regression Testing & Dynamic Property Detection
[Figure: program and tests → monitoring and inference → dynamic properties]
• The main purpose of regression testing is to validate that already tested code has not been broken by an upgrade
• Property detection aims at identifying "likely invariants" by observing the program behavior on the validation suite
• This work deals with properties expressed as assertions

Bounded Model Checking
• Given the C source code of a program P, we generate a Boolean representation φ_P of an unwound version of the program
• Each loop is unwound up to a fixed bound k
• Each function call is inlined
• The inlined version is converted to a bit-precise representation as an instance of the propositional satisfiability problem
• Heap operations and reference arguments are mostly ignored
• Any assertion a in the source code is converted into a Boolean formula φ_a, negated, and conjoined with the program, resulting in φ_P ∧ ¬φ_a
• The satisfying truth assignments of φ_P ∧ ¬φ_a correspond to the executions of P that repeat each loop at most k times and violate the assertion a
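
The idea of searching for an execution that satisfies the program and violates the assertion can be sketched on a toy scale. The following is a minimal illustration, not how a real bounded model checker works internally: it replaces the SAT encoding with brute-force enumeration over a small input range, and the loop being modelled, the function names, and the 4-bit input width are all assumptions made for this example.

```python
def unwound_sum(n, k):
    """Model of `s = 0; for (i = 0; i < n; i++) s += i;` unwound k times.

    Returns (s, complete). `complete` is False when the original loop
    would need more than k iterations, i.e. the execution lies outside
    the bound and is not considered.
    """
    s, i = 0, 0
    for _ in range(k):      # k copies of the guarded loop body
        if i < n:
            s += i
            i += 1
    return s, not (i < n)


def find_violation(assertion, k, width=4):
    """Look for an input that satisfies the program model and violates
    the assertion (the analogue of a model of phi_P and not phi_a).

    Enumeration over `width`-bit inputs stands in for the SAT solver.
    """
    for n in range(2 ** width):
        s, complete = unwound_sum(n, k)
        if complete and not assertion(n, s):
            return n, s     # counterexample: input and final state
    return None             # "unsatisfiable" within the bound
```

With bound k = 8, the assertion `s <= 10` is violated by the input n = 6 (final s = 15), so `find_violation(lambda n, s: s <= 10, k=8)` returns `(6, 15)`. With k = 3 the same call returns `None`, because every execution that fits within three iterations keeps s at most 3: the answer is only valid up to the chosen bound.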

Verification-Aided Regression Testing (VART)
[Figure: VART workflow]
Phase 1, property generation: base program and base tests → monitoring and inference → dynamic properties for base → intra-version property verification → verified properties for base
Phase 2, checking the upgraded program: upgraded program, upgrade tests, and verified properties for base → monitoring and filtering → non-regression properties → inter-version property verification → regression problems (counterexamples)

VART Phase 1: monitoring and inference
[Figure: base program, upgraded program, and base tests → monitoring and inference → dynamic properties for base]
• Generates a large number of dynamic properties
• Based on observing the base program behavior in the regression test suite
• To limit the number of generated properties, only locations "likely affected by the change" are monitored
• Uses the Daikon invariant generator
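
To give a feel for what inferring properties from observed behavior means, here is a deliberately small sketch. It is not Daikon's algorithm or API: Daikon works with a much richer set of property templates, while this example assumes only constant, range, and ordering templates over integer variables, and the function and variable names are hypothetical.

```python
def infer_properties(samples):
    """Return candidate properties that hold on every observed sample.

    samples: non-empty list of dicts mapping a variable name to the
    integer value observed at one program point during a test run.
    """
    names = sorted(samples[0])
    props = []
    # Unary templates: constant value or observed range.
    for v in names:
        values = [s[v] for s in samples]
        lo, hi = min(values), max(values)
        if lo == hi:
            props.append(f"{v} == {lo}")
        else:
            props.append(f"{v} >= {lo}")
            props.append(f"{v} <= {hi}")
    # Binary template: ordering between two variables.
    for a in names:
        for b in names:
            if a < b and all(s[a] <= s[b] for s in samples):
                props.append(f"{a} <= {b}")
    return props
```

For the three observations `{"len": 3, "idx": 0}`, `{"len": 5, "idx": 4}`, and `{"len": 3, "idx": 2}` this yields `idx >= 0`, `idx <= 4`, `len >= 3`, `len <= 5`, and `idx <= len`. Bounds such as `idx <= 4` merely reflect the inputs that happened to be exercised, which is why inferred properties are only "likely" invariants.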

VART Phase 1: Detecting Dynamic Properties
• Dynamic properties are collected by monitoring the base version while it executes its regression test suite
• To keep the number of generated assertions manageable, property generation is localized to places affected by the change
• The modified functions are identified, and monitoring is done on unchanged statements in functions
  • that contain changes;
  • that call functions that contain changes; and
  • that are called by the functions that contain changes.
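
The three rules for choosing which functions to monitor can be sketched over a call graph. This is an illustration only: the dictionary representation of the call graph and the function names are assumptions, and the further restriction to unchanged statements inside the selected functions is not modelled.

```python
def monitored_functions(call_graph, changed):
    """Select the functions whose statements are candidates for monitoring.

    call_graph: dict mapping each function name to the set of functions
                it calls directly.
    changed:    set of function names that contain changes.
    """
    # Functions that call a changed function.
    callers = {f for f, callees in call_graph.items() if callees & changed}
    # Functions that are called by a changed function.
    callees = set()
    for f in changed:
        callees |= call_graph.get(f, set())
    return changed | callers | callees
```

For a call graph in which `main` calls `parse` and `report`, and `parse` calls `read_token`, a change in `parse` selects `parse`, `main`, and `read_token`, while `report` is left unmonitored.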

VART Phase 1: Generating Verified Properties
[Figure: base program P and dynamic properties a for base → intra-version property verification (φ_P ∧ ¬φ_a) → verified properties for base]
• Dynamic properties often overfit the regression tests, resulting in a large number of false positives
• We reduce the number of false positives with BMC, passing forward only true assertions a (those for which the SAT check of φ_P ∧ ¬φ_a returns unsatisfiable)
• The scope of BMC is limited to the call trees rooted at the callers of the function containing the changes
• The rest of the program is treated non-deterministically
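
The filtering step can be sketched as follows, with the caveat that exhaustive evaluation over a small input range is used here in place of bounded model checking. The toy base program, the property set, and all names are hypothetical.

```python
def base_program(x):
    """Toy stand-in for the base program P: clamps x into [0, 10]."""
    y = min(max(x, 0), 10)
    return {"x": x, "y": y}


def verified_properties(program, properties, domain):
    """Keep the properties that no input in `domain` can violate.

    properties: dict mapping a property name to a predicate over the
                program state. A property is kept only when the search
                for a violating input fails, mirroring an unsatisfiable
                phi_P and not phi_a.
    """
    kept = {}
    for name, holds in properties.items():
        if all(holds(program(x)) for x in domain):
            kept[name] = holds
    return kept


# Dynamic properties as they might be inferred from tests using x = 1 and x = 5.
dynamic = {
    "y >= 0": lambda s: s["y"] >= 0,
    "y <= 10": lambda s: s["y"] <= 10,
    "y == x": lambda s: s["y"] == s["x"],   # overfits the tests
    "y <= 5": lambda s: s["y"] <= 5,        # overfits the tests
}
verified = verified_properties(base_program, dynamic, range(-16, 16))
```

Here `verified` retains only `y >= 0` and `y <= 10`. The two overfitted properties are discarded because inputs outside the tested values (for example x = -1 and x = 6) violate them.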

VART Phase 2: Filtering Verified Properties
[Figure: upgraded program, upgrade tests, and verified properties for base → monitoring and filtering → non-regression properties]
• Some properties that hold for the previous version might be intentionally broken by the developer
• The regression test suite for the upgrade is used to filter out such verified but outdated properties
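
One way to read this filtering step is sketched below. It assumes that a verified property is considered outdated when a passing upgrade test violates it on the upgraded program; the slides do not spell out the exact criterion, so this rule, the toy program, and the names are assumptions.

```python
def upgraded_program(x):
    """Toy upgraded program P': the upper clamp is intentionally raised to 20."""
    y = min(max(x, 0), 20)
    return {"x": x, "y": y}


def non_regression_properties(program, properties, upgrade_test_inputs):
    """Drop properties that are violated while running the upgrade tests.

    properties: dict mapping a property name to a predicate over the
                program state (the properties verified for the base).
    """
    kept = {}
    for name, holds in properties.items():
        if all(holds(program(x)) for x in upgrade_test_inputs):
            kept[name] = holds
    return kept


verified_for_base = {
    "y >= 0": lambda s: s["y"] >= 0,
    "y <= 10": lambda s: s["y"] <= 10,
}
# The upgrade test with input 15 exercises the new upper limit.
non_regression = non_regression_properties(upgraded_program, verified_for_base, [3, 15])
```

The property `y <= 10` is violated by the upgrade test with input 15, so it is treated as intentionally changed and dropped. Only `y >= 0` remains as a non-regression property.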

VART Phase 2: Upgrade Checking
[Figure: upgraded program P′ and non-regression properties a → inter-version property verification (φ_P′ ∧ ¬φ_a) → regression problems (counterexamples)]
• Finally, the non-regression properties are checked against the upgrade P′ using BMC
• Properties reported as false or unreachable indicate the presence of faults
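
The final check can be sketched in the same toy setting. Exhaustive evaluation over a small input range again stands in for BMC, the faulty upgrade is invented for the example, and the handling of unreachable properties is not modelled.

```python
def faulty_upgrade(x):
    """Toy upgraded program P' with a regression: the lower clamp was lost."""
    y = min(x, 20)
    return {"x": x, "y": y}


def check_upgrade(program, properties, domain):
    """Return a counterexample input for each property the upgrade violates.

    properties: dict mapping a property name to a predicate over the
                program state (the non-regression properties).
    """
    problems = {}
    for name, holds in properties.items():
        for x in domain:
            if not holds(program(x)):
                problems[name] = x      # first violating input found
                break
    return problems


non_regression = {"y >= 0": lambda s: s["y"] >= 0}
problems = check_upgrade(faulty_upgrade, non_regression, range(-16, 32))
```

The result maps `y >= 0` to the counterexample input -16. Tests that only use inputs such as 3 and 15 satisfy the property and would not reveal this fault, which is the situation the checking phase is meant to catch.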

Implementation
• VART is implemented for C programs
• Generation of dynamic properties is implemented on top of the RADAR tool [PMG13] using GDB and Daikon [ECGN01]
• Model checking is done with eVolCheck [FSS13]
• CBMC is also supported
[PMG13] F. Pastore, L. Mariani, and A. Goffi. RADAR: a tool for debugging regression problems in C/C++ software. ICSE Tool Demo Track, 2013.
[ECGN01] M. D. Ernst, J. Cockrell, W. G. Griswold, and D. Notkin. Dynamically discovering likely program invariants to support program evolution. IEEE Transactions on Software Engineering, 27(2): 99-123, 2001.
[FSS13] G. Fedyukovich, O. Sery, and N. Sharygina. eVolCheck: Incremental Upgrade Checker for C. TACAS 2013.

Empirical Evaluation: Insufficient test suite
• We evaluate VART on detecting faults in the implementation of the Grep utility
• Different degrees of coverage are obtained from the Grep regression test suite
• Faults are injected from the SIR repository (11 in total)
Test suite   Revealed faults       TP   FP
             Testing   VART
Cov20        3         5           5    0
Cov50        7         8           2    0
MRT          10        10          0    0
• Cov20: 20% coverage; Cov50: 50% coverage; MRT: smallest subset of tests that gives the same coverage as the full test suite
• TP: true positives; FP: false positives

Empirical evaluation: case studies
Subject   App. size (LOC)   Test suite size   Dyn. prop.   Non-reg. prop.   TP   FP
VTT       488               1000              1045         658              15   0
Sort      4653              427               356          2                1    0
Grep      590               817               3303         51               3    0
• VTT is a motion trajectory control system executed by a robotic arm designed to perform maintenance tasks in the ITER fusion reactor
• Its regression tests consist of random inputs, each given as 12 numbers
• Grep and Sort are GNU utilities, used with their respective test suites
• Faults are inserted from mailing lists and SIR
• The identified faults are not revealed by the available test suites

Conclusions
• Regression testing is widely used, but compelling test suites are difficult to design
• VART can detect faults that are undetected by the test suites by
  • automatically producing properties from the base version test suite
  • filtering out the properties intentionally broken by the upgrade
  • reporting faults and counterexamples not revealed by tests
• Empirical evaluation shows that VART complements and increases the effectiveness of regression testing
