1. An Empirical Study on the Use of Defect Prediction for Test Case Prioritization

   David Paterson, University of Sheffield
   Jose Campos, University of Washington
   Rui Abreu, University of Lisbon
   Gregory M. Kapfhammer, Allegheny College
   Gordon Fraser, University of Passau
   Phil McMinn, University of Sheffield

   International Conference on Software Testing, Verification and Validation
   Xi'an, China, April 22-27, 2019

   DPATERSON1@SHEFFIELD.AC.UK

2. Defect Prediction

   In software development, our goal is to minimize the impact of faults.

   If we know that a fault exists, we can use fault localization to pinpoint the code unit responsible.

   If we don't know that a fault exists, we can use defect prediction to estimate which code units are likely to be faulty.

3. Defect Prediction

   ClassA: 33%
   ClassB: 10%
   ClassC: 72%
   ClassD: 3%

4. Defect Prediction

   Code Smells:
   - Feature Envy
   - God Class
   - Inappropriate Intimacy

   Code Features:
   - Cyclomatic Complexity
   - Method Length
   - Class Length

   Version Control Information:
   - Number of Changes
   - Number of Authors
   - Number of Fixes

5. Why Do We Prioritize Test Cases?

   Regression testing can account for up to 80% of the total testing budget, and up to 50% of the cost of software maintenance.

   In some situations, it may not be possible to re-run all test cases on a system.

   By prioritizing test cases, we aim to ensure that faults are detected in the smallest amount of time, irrespective of program changes.

6. How Do We Prioritize Test Cases?

                 t1  t2  t3  t4  ...  tn-3  tn-2  tn-1  tn
   Version 1     ✅  ✅  ✅  ❌       ✅    ✅    ✅    ✅
   Version 2     ✅  ✅  ✅  ❌       ✅    ✅    ✅    ✅
   Version 3     ✅  ✅  ✅  ❌       ✅    ✅    ✅    ✅
   Version 4     ✅  ✅  ✅  ❌       ❌    ✅    ✅    ✅
   Version 5     ✅  ✅  ✅  ✅       ✅    ✅    ✅    ✅
   Version 6     ✅  ✅  ✅  ✅       ✅    ❌    ✅    ✅
   Version 7     ✅  ✅  ❌  ✅       ✅    ❌    ✅    ✅
   Version 8     ✅  ✅  ✅  ✅       ✅    ❌    ✅    ✅
   Version 9     ❌  ✅  ✅  ✅       ✅    ✅    ✅    ✅
   ...
   Version n     ❓  ❓  ❓  ❓       ❓    ❓    ❓    ❓
   Version n+1   ❓  ❓  ❓  ❓       ❓    ❓    ❓    ❓

7. How Do We Prioritize Test Cases?

   Code Coverage: "How many lines of code are executed by this test case?"
   Test History: "Has this test case failed recently?"
   Defect Prediction (this paper): "What is the likelihood that this code is faulty?"

   public int abs(int x) {
       if (x >= 0) {
           return x;
       } else {
           return -x;
       }
   }
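To make the three signals concrete, here is a minimal sketch (not from the paper: the TestCase record and the scoring rules are illustrative assumptions) of how each question could be turned into a numeric priority:

    import java.util.List;
    import java.util.Map;

    // Minimal sketch, not from the paper: the TestCase record and scoring
    // rules are illustrative assumptions showing how each question could
    // become a numeric priority for ordering test cases.
    record TestCase(String name, int linesCovered, List<Boolean> passHistory) {}

    class PrioritySignals {
        // Code coverage: "How many lines of code are executed by this test case?"
        static int coverageScore(TestCase t) {
            return t.linesCovered();
        }

        // Test history: "Has this test case failed recently?" Results are
        // oldest first, so a failure closer to the end of the list scores
        // higher (assuming all tests share the same history length).
        static int historyScore(TestCase t) {
            List<Boolean> history = t.passHistory();
            for (int i = history.size() - 1; i >= 0; i--) {
                if (!history.get(i)) {
                    return i + 1;
                }
            }
            return 0; // never failed
        }

        // Defect prediction: "What is the likelihood that this code is faulty?"
        // Unlike the other two signals, the score attaches to the class under
        // test rather than to the test case itself.
        static double defectScore(Map<String, Double> predictions, String classUnderTest) {
            return predictions.getOrDefault(classUnderTest, 0.0);
        }
    }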

8. Defect Prediction for Test Case Prioritization

   ClassA: 33%
   ClassB: 10%
   ClassC: 72%
   ClassD: 3%

9. Defect Prediction for Test Case Prioritization

   ClassC: 72% (the class with the highest predicted likelihood of being faulty is considered first)

10. Defect Prediction for Test Case Prioritization

    ClassC: 72%

    Test cases that execute code in ClassC:
    - TestClass.testOne
    - TestClass.testSeventy
    - OtherTestClass.testFive
    - OtherTestClass.testThirteen
    - TestClassThree.test165

    How do we order these test cases before placing them in the prioritized suite?

11. Secondary Objectives

    Test cases that execute code in ClassC:
    - TestClass.testOne
    - TestClass.testSeventy
    - OtherTestClass.testFive
    - OtherTestClass.testThirteen
    - TestClassThree.test165

    We can use one of the features described earlier (e.g. code coverage) as a way of ordering the subset of test cases.

12. Secondary Objectives

    Test cases that execute code in ClassC (lines covered):
    - TestClass.testOne (25)
    - TestClass.testSeventy (32)
    - OtherTestClass.testFive (144)
    - OtherTestClass.testThirteen (8)
    - TestClassThree.test165 (39)

    We can use one of the features described earlier (e.g. code coverage) as a way of ordering the subset of test cases.

13. Secondary Objectives

    Test cases that execute code in ClassC, ordered by lines covered:
    - OtherTestClass.testFive (144)
    - TestClassThree.test165 (39)
    - TestClass.testSeventy (32)
    - TestClass.testOne (25)
    - OtherTestClass.testThirteen (8)

    We can use one of the features described earlier (e.g. code coverage) as a way of ordering the subset of test cases.
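As a sketch, this secondary objective amounts to sorting the subset in descending order of lines covered (the test names and counts are the hypothetical ones from the example):

    import java.util.*;

    // Minimal sketch of the coverage-based secondary objective: sort the
    // test cases that execute a class in descending order of lines covered.
    // The test names and counts are the hypothetical ones from the example.
    public class SecondaryObjective {
        public static void main(String[] args) {
            Map<String, Integer> linesCovered = new LinkedHashMap<>();
            linesCovered.put("TestClass.testOne", 25);
            linesCovered.put("TestClass.testSeventy", 32);
            linesCovered.put("OtherTestClass.testFive", 144);
            linesCovered.put("OtherTestClass.testThirteen", 8);
            linesCovered.put("TestClassThree.test165", 39);

            List<String> ordered = new ArrayList<>(linesCovered.keySet());
            ordered.sort(Comparator.comparingInt(linesCovered::get).reversed());

            // Prints testFive, test165, testSeventy, testOne, testThirteen.
            ordered.forEach(System.out::println);
        }
    }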

14. Defect Prediction for Test Case Prioritization

    ClassC: 72%

    Test cases that execute code in ClassC:
    - OtherTestClass.testFive
    - TestClassThree.test165
    - TestClass.testSeventy
    - TestClass.testOne
    - OtherTestClass.testThirteen

    Prioritized test suite: (currently empty)

15. Defect Prediction for Test Case Prioritization

    ClassC: 72%

    Test cases that execute code in ClassC: (moved into the prioritized suite)

    Prioritized test suite:
    - OtherTestClass.testFive
    - TestClassThree.test165
    - TestClass.testSeventy
    - TestClass.testOne
    - OtherTestClass.testThirteen

16. Defect Prediction for Test Case Prioritization

    ClassA: 33%

    Test cases that execute code in ClassA (lines covered):
    - ClassATest.testA (14)
    - ClassATest.testB (27)
    - ClassATest.testC (9)

    Prioritized test suite:
    - OtherTestClass.testFive
    - TestClassThree.test165
    - TestClass.testSeventy
    - TestClass.testOne
    - OtherTestClass.testThirteen

17. Defect Prediction for Test Case Prioritization

    ClassA: 33%

    Test cases that execute code in ClassA, ordered by lines covered:
    - ClassATest.testB (27)
    - ClassATest.testA (14)
    - ClassATest.testC (9)

    Prioritized test suite:
    - OtherTestClass.testFive
    - TestClassThree.test165
    - TestClass.testSeventy
    - TestClass.testOne
    - OtherTestClass.testThirteen

18. Defect Prediction for Test Case Prioritization

    ClassA: 33%

    Prioritized test suite:
    - OtherTestClass.testFive
    - TestClassThree.test165
    - TestClass.testSeventy
    - TestClass.testOne
    - OtherTestClass.testThirteen
    - ClassATest.testB
    - ClassATest.testA
    - ClassATest.testC

19. Defect Prediction for Test Case Prioritization

    By repeating this process for all classes in the system, we generate a fully prioritized test suite based on defect prediction.
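Putting the walkthrough together, a minimal sketch of the overall strategy might look as follows (this is an illustration, not the Kanonizo implementation; the score and coverage maps are hypothetical inputs):

    import java.util.*;

    // Minimal sketch of the overall strategy, not the Kanonizo implementation:
    // visit classes in descending order of predicted defect score, and within
    // each class order its not-yet-scheduled tests by lines covered.
    public class DefectPredictionPrioritizer {

        public static List<String> prioritize(
                Map<String, Double> defectScores,         // class -> predicted score
                Map<String, List<String>> testsForClass,  // class -> tests executing it
                Map<String, Integer> linesCovered) {      // test  -> lines covered

            List<String> suite = new ArrayList<>();
            Set<String> scheduled = new HashSet<>();

            // Classes with the highest predicted likelihood of being faulty first.
            List<String> classes = new ArrayList<>(defectScores.keySet());
            classes.sort(Comparator.comparingDouble(defectScores::get).reversed());

            for (String cls : classes) {
                List<String> tests = new ArrayList<>(
                        testsForClass.getOrDefault(cls, List.of()));
                tests.removeAll(scheduled); // each test appears only once
                // Secondary objective: order the subset by code coverage.
                tests.sort(Comparator.comparingInt(
                        (String t) -> linesCovered.getOrDefault(t, 0)).reversed());
                suite.addAll(tests);
                scheduled.addAll(tests);
            }
            return suite;
        }
    }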

20. Empirical Evaluation

21. Empirical Evaluation

    Defect prediction: Schwa [1]
    Uses version control information to produce defect prediction scores, comprised of the weighted number of commits, authors, and fixes related to a file.

    [1] https://github.com/andrefreitas/schwa
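As a rough sketch of what such a score combines (an illustration only: Schwa's real model is more involved, and the field names and normalization here are assumptions):

    // Illustrative sketch only: Schwa's real model is more involved, but its
    // score for a file is a weighted combination of revision, author, and fix
    // information mined from version control. Field names are assumptions.
    public class SchwaStyleScore {
        double revisions; // normalized number of commits touching the file
        double authors;   // normalized number of distinct authors
        double fixes;     // normalized number of fix-related commits

        // The three weights are later constrained to sum to 1 (see slides 25-26).
        double score(double revisionsWeight, double authorsWeight, double fixesWeight) {
            return revisionsWeight * revisions
                 + authorsWeight * authors
                 + fixesWeight * fixes;
        }
    }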

22. Empirical Evaluation

    Defect prediction: Schwa [1]
    Uses version control information to produce defect prediction scores, comprised of the weighted number of commits, authors, and fixes related to a file.

    Faults: Defects4J [2]
    Repository containing 395 real faults collected across 6 open-source Java projects.

    [1] https://github.com/andrefreitas/schwa
    [2] https://github.com/rjust/defects4j

23. Empirical Evaluation

    Defect prediction: Schwa [1]
    Uses version control information to produce defect prediction scores, comprised of the weighted number of commits, authors, and fixes related to a file.

    Faults: Defects4J [2]
    Repository containing 395 real faults collected across 6 open-source Java projects.

    Test prioritization: Kanonizo [3]
    Test case prioritization tool built for Java applications.

    [1] https://github.com/andrefreitas/schwa
    [2] https://github.com/rjust/defects4j
    [3] https://github.com/kanonizo/kanonizo

24. Research Objectives

    1. Discover the best parameters for defect prediction in order to predict faulty classes as soon as possible.
    2. Compare our approach against existing coverage-based approaches.
    3. Compare our approach against existing history-based approaches.

25. Parameter Tuning (Research Objective 1)

    Parameters:
    1. Revisions Weight
    2. Authors Weight
    3. Fixes Weight
    4. Time Weight

    Revisions Weight + Authors Weight + Fixes Weight = 1

26. Parameter Tuning (Research Objective 1)

    Revisions Weight + Authors Weight + Fixes Weight = 1

    Revisions Weight   Authors Weight   Fixes Weight   Time Range
    1.0                0.0              0.0            0.0
    0.9                0.1              0.0            0.0
    0.8                0.2              0.0            0.0
    ...
    0.0                0.0              1.0            0.9
    0.0                0.0              1.0            1.0

    726 valid configurations
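The 726 comes from enumerating the three weights in steps of 0.1 under the sum-to-1 constraint (66 combinations) crossed with 11 time-range values. A throwaway sketch (a reconstruction, not the paper's tooling) that confirms the count:

    // Sketch (a reconstruction, not the paper's tooling): enumerate every
    // weight configuration in steps of 0.1 where the three Schwa weights sum
    // to 1, crossed with 11 time-range values, and confirm there are 726.
    public class ValidConfigurations {
        public static void main(String[] args) {
            int count = 0;
            // Work in integer tenths to avoid floating-point drift.
            for (int revisions = 0; revisions <= 10; revisions++) {
                for (int authors = 0; authors <= 10 - revisions; authors++) {
                    int fixes = 10 - revisions - authors; // forces the sum to 1.0
                    for (int time = 0; time <= 10; time++) {
                        count++; // configuration: (revisions, authors, fixes, time) / 10.0
                    }
                }
            }
            System.out.println(count); // prints 726
        }
    }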

27. Parameter Tuning (Research Objective 1)

    - Select 5 bugs from each project at random
    - For each bug / valid configuration:
      - Initialize Schwa with the configuration and run it
      - Collect the "true" faulty class from Defects4J
      - Calculate the index of the "true" faulty class according to the prediction
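In code form, the tuning loop might look like the following sketch (a reconstruction; sampleBugs, runSchwa, and trueFaultyClass are hypothetical stand-ins for steps the real experiment performs with Schwa and Defects4J):

    import java.util.List;

    // Sketch of the tuning loop. The helper methods are hypothetical
    // stand-ins, stubbed so the sketch compiles.
    public class TuningLoop {

        static void tune(List<String> projects, List<double[]> validConfigurations) {
            for (String project : projects) {
                for (String bug : sampleBugs(project, 5)) {       // 5 random bugs per project
                    for (double[] config : validConfigurations) { // the 726 configurations
                        List<String> ranking = runSchwa(bug, config); // classes, most suspicious first
                        String faulty = trueFaultyClass(bug);         // ground truth from Defects4J
                        int index = ranking.indexOf(faulty);
                        // A lower index means this configuration predicted the
                        // faulty class sooner, i.e. it performed better.
                        recordResult(config, index);
                    }
                }
            }
        }

        // Hypothetical helpers.
        static List<String> sampleBugs(String project, int n) { return List.of(); }
        static List<String> runSchwa(String bug, double[] config) { return List.of(); }
        static String trueFaultyClass(String bug) { return ""; }
        static void recordResult(double[] config, int index) { }
    }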
