Mutation Testing Reid Holmes Key questions Is a test suite: Su - PowerPoint PPT Presentation

Mutation Testing Reid Holmes

Key questions Is a test suite: Su ffi ciently broad ? Su ffi ciently deep ? 2

Test suite depth Mutation testing 3

Program Generate Mutants 4

Program Generate Mutants 5

Program Generate Mutants Mutant 6 6

Program Test Suite Generate Mutants Execute Kill Mutant Suites Score 9

what mutations? flip boolean boundaries (<, >=, etc) remove conditional increment to decrement 10

mutation operators Conditional Boundary < —> <= <= —> < > —> >= >= —> > if (a<b) {..} —> if (a<=b) {..} 11

mutation operators Negate Conditionals == —> != != —> == … if (a==b) {..} —> if (a!=b) {..} 12

mutation operators Remove Conditionals if (a==b) {..} —> if (true) {..} 13

mutation operators Math + —> - * —> / | —> & int a = b + c; … —> int a = b - c; 14

mutation operators Increments/Decrements ++ —> - - - - —> ++ i++ —> i— 15

mutation operators Inline Constant int i = 0; —> int i = 3; 16

mutation operators Return mutator return o; —> return null; 17

mutation operators Skip void calls void somethingImportant(){..} int foo() { int i = 5; somethingImportant(); return i; } —> int foo() { int i = 5; // somethingImportant(); return i; 18 }

public float avg(float[] data){ float sum = 0;   for (float num : data){ sum += num; } return sum * data.length; }

Test suite: ✔ assertEq(avg([1]), 1); public float avg(float[] data){ float sum = 0;   for (float num : data){ sum += num; } return sum * data.length; }

Test suite: ✖ assertEq(avg([1]), 1); public float avg(float[] data){ float sum = 1;   for (float num : data){ sum += num; } return sum * data.length; }

Test suite: ✖ assertEq(avg([1]), 1); public float avg(float[] data){ float sum = 0;   for (float num : data){ sum -= num; } return sum * data.length; }

Test suite: ✔ assertEq(avg([1]), 1); public float avg(float[] data){ float sum = 0;   for (float num : data){ sum += num; } return sum / data.length; }

Test suite: Kill Score: ✔ assertEq(avg([1]), 1); 66% ✖ sum = 0 —> sum = 1 ✖ sum += num —> sum += num ✔ sum * length —> sum / length

Test suite: New test: ✔ ✔ assertEq(avg([1]), 1); assertEq(avg([1,1]), 1); ✖ sum = 0 —> sum = 1 ✖ sum += num —> sum += num ✔ sum * length —> sum / length should have been / not * all along

Test suite: New test: ✔ ✔ assertEq(avg([1]), 1); assertEq(avg([1,1]), 1); public float avg(float[] data){ float sum = 0;   for (float num : data){ sum += num; } return sum / data.length; }

Test suite: New test: ✔ ✔ assertEq(avg([1]), 1); assertEq(avg([1,1]), 1); From the expected return of this function, this test should pass in the program; instead it reveals a fault in the program itself.

mutation assumptions 1) Competent Programmer Hypothesis: —>Most programs are nearly correct. 2) Coupling Hypothesis: —> Big bugs are composed of a series of small errors. 28

Assessing quality of the test suites 29

“If the program works … on specified data , then it will always work on any data . — Hoare Mutation testing

31 + Programmatic oracle Synthetic Past studies: Correctness Small programs focus Few faults Few mutants

ISSTA ICSE FSE 1996 2005 2014 321 KLOC 1 6 Faults 12 38 357 230,000 Mutants 24 1,100 developer-written & Tests generated generated generated Coverage ✖ ✖ ✔ controlled Examine ✖ ✖ ✔ shortcomings

ISSTA ICSE FSE 1996 2005 2014 Do stronger tests detect more mutants? 321 KLOC 1 6 Faults 12 38 357 Is mutant detection correlated with 230,000 Mutants 24 1,100 fault detection ? developer-written & Tests generated generated generated Coverage Can mutants describe ✖ ✖ ✔ controlled all real faults? Examines ✖ ✖ ✔ shortcomings

Experimental method Define Compilable Triggering Analyze Generate Candidates Faults Tests Misses Suites 34

Experimental Do stronger tests detect results more mutants? Statement Mutant coverage detection 27% 40% Increased 60% 73% Unchanged 35

Experimental results What kinds of faults are not represented by mutants? if (x) { … if (cK.length != return; sD[0].length) } 10% 17% if (x) { … if (cK.length != 73% // del getCatCount()) } Increased Weak/missing No operator 36

Mutation takeaway A correlation exists between mutant detection and real fault detection . 37

Impact Adding tests can on be more testing impactful than increasing coverage Mutants can serve as effective Mutants can describe proxies for real faults many real faults Kill score is a better predictor of test quality than coverage 60% of real faults are Stronger coverage already criteria o ff er little 38 covered additional insight

Mutation Testing Reid Holmes Key questions Is a test suite: Su - PowerPoint PPT Presentation

Mutation Testing Reid Holmes Key questions Is a test suite: Su ffi ciently broad ? Su ffi ciently deep ? 2 Test suite depth Mutation testing 3 Program Generate Mutants 4 Program Generate Mutants 5 Program Generate Mutants Mutant

1 Mutation pressure Let = the mutation rate from A a Let = the mutation rate from a

1 Mutation pressure Let = the mutation rate from A a Let = the mutation rate from a

Mutation Testing in Python Austin Bingham @austin_bingham @sixty_north 1 Sunday, October 4, 15

Milu: A Higher Order Mutation Testing Tool Yue Jia University College London Joint work with

Advances in Mutation Testing Research for C++ Pedro Delgado-Prez TAROT: Intro Talk June 2015

Drift, mutation, selection, and the evolutionary dispersion of mean phenotypes recombination

Mutation Testing of Memory- Related Operators Jay Nanavati, Fan Wu, Mark Harman, Yue Jia, Jens

Multi-Objective Higher Order Mutation Testing with Genetic Programming W. B. Langdon Kings

Levels of Testing Chapter 12 Beyond unit testing Developer Testing stages Unit testing

Testing Terminology System testing Types of errors Function testing Structure

Using Conditional Mutation to Increase the Efficiency of Mutation Analysis e Just 1 & Gregory

BRCA Founder Mutations Ashkenazi Jewish (Hungarian and Russian): BRCA1 - 185delAG and

Equivalence Modulo States Bo Wang, Yingfei Xiong , Yangqingwei Shi, Lu Zhang, Dan Hao Peking

Mutation models: probabilistic study and parameter estimation Adrien Mazoyer, supervised by

CSE 331 Mutation and immutability slides created by Marty Stepp based on materials by M. Ernst,

2: Old English Sound Laws and Verbs Proto-Germanic fts , dative fti , plural *ftiz Front

CS 251 Fall 2019 CS 251 Fall 2019 Principles of Programming Languages Principles of

Mutate and Test Your Tests Benoit Baudry KTH, Sweden 1 baudry@kth.se Test Your Tests What

Teaching Software Testing with Automated Feedback James Perretta and Andrew DeOrio, University of

Evolutionary Graph Theory J. D az LSI-UPC Nice, May, 2014 Population Genetics Models

Applying a pairwise coverage criterion to scenario-based testing Lydie du Bousquet, Michael

tt rs

CS61A Lecture 43 Amir Kamil UC Berkeley May 1, 2013 Announcements HW13 due tonight

Tame quivers have finitely many m-maximal green sequences Kiyoshi Igusa 1 Ying Zhou 2 1 Department

Mutation Testing Reid Holmes Key questions Is a test suite: Su - PowerPoint PPT Presentation

Mutation Testing Reid Holmes Key questions Is a test suite: Su ffi ciently broad ? Su ffi ciently deep ? 2 Test suite depth Mutation testing 3 Program Generate Mutants 4 Program Generate Mutants 5 Program Generate Mutants Mutant

1 Mutation pressure Let = the mutation rate from A a Let = the mutation rate from a

1 Mutation pressure Let = the mutation rate from A a Let = the mutation rate from a

Mutation Testing in Python Austin Bingham @austin_bingham @sixty_north 1 Sunday, October 4, 15

Milu: A Higher Order Mutation Testing Tool Yue Jia University College London Joint work with

Advances in Mutation Testing Research for C++ Pedro Delgado-Prez TAROT: Intro Talk June 2015

Drift, mutation, selection, and the evolutionary dispersion of mean phenotypes recombination

Mutation Testing of Memory- Related Operators Jay Nanavati, Fan Wu, Mark Harman, Yue Jia, Jens

Multi-Objective Higher Order Mutation Testing with Genetic Programming W. B. Langdon Kings

Levels of Testing Chapter 12 Beyond unit testing Developer Testing stages Unit testing

Testing Terminology System testing Types of errors Function testing Structure

Using Conditional Mutation to Increase the Efficiency of Mutation Analysis e Just 1 &amp; Gregory

BRCA Founder Mutations Ashkenazi Jewish (Hungarian and Russian): BRCA1 - 185delAG and

Equivalence Modulo States Bo Wang, Yingfei Xiong , Yangqingwei Shi, Lu Zhang, Dan Hao Peking

Mutation models: probabilistic study and parameter estimation Adrien Mazoyer, supervised by

CSE 331 Mutation and immutability slides created by Marty Stepp based on materials by M. Ernst,

2: Old English Sound Laws and Verbs Proto-Germanic *fts , dative *fti , plural *ftiz Front

CS 251 Fall 2019 CS 251 Fall 2019 Principles of Programming Languages Principles of

Mutate and Test Your Tests Benoit Baudry KTH, Sweden 1 baudry@kth.se Test Your Tests What

Teaching Software Testing with Automated Feedback James Perretta and Andrew DeOrio, University of

Evolutionary Graph Theory J. D az LSI-UPC Nice, May, 2014 Population Genetics Models

Applying a pairwise coverage criterion to scenario-based testing Lydie du Bousquet, Michael

tt rs

CS61A Lecture 43 Amir Kamil UC Berkeley May 1, 2013 Announcements HW13 due tonight

Tame quivers have finitely many m-maximal green sequences Kiyoshi Igusa 1 Ying Zhou 2 1 Department

Using Conditional Mutation to Increase the Efficiency of Mutation Analysis e Just 1 & Gregory

2: Old English Sound Laws and Verbs Proto-Germanic fts , dative fti , plural *ftiz Front