Achieving Software Reliability Without Breaking the Budget
Bojan Cukic
Lane Department of CSEE, West Virginia University
University of Houston, September 2013
CITeR, The Center for Identification Technology Research: an NSF I/UCR Center advancing ID management research (www.citer.wvu.edu)
Software Engineering (I)maturity
• 35% of large applications are cancelled,
• 75% of the remainder run late and are over budget,
• Defect removal efficiency is only about 85%.
• Software needs better measures of results and better quality control.
• Right now, various methods act more like religious cults than technical disciplines.
– Capers Jones, Feb. 3, 2012, in Data & Analysis Center for Software (DACS), LinkedIn Discussion Forum
Software Engineering (I)maturity
• Major cost drivers for software in the U.S., in rank order:
1) The cost of finding and fixing bugs
2) The cost of cancelled projects
3) The cost of producing / analyzing English words
4) The cost of security flaws and attacks
5) The cost of requirements changes
6) The cost of programming or coding
7) The cost of customer support
…
11) The cost of innovation and new kinds of software
12) The cost of litigation for failures and disasters
13) The cost of training and learning
14) The cost of avoiding security flaws
15) The cost of assembling reusable components
• This list is based on analysis of ~13,000 projects.
– Capers Jones, Feb. 4, 2012, in DACS
Outline – Software Engineering as Data Science
• Fault prediction
– Early in the life cycle.
– Lower the cost of V&V by directing the effort to places that most likely hide faults.
• Effort prediction
– With few data points from past projects.
• Problem report triage
• Summary
Software Reliability Prediction
• Probability of failure given known operational usage.
– Reliability growth
• Extrapolates reliability from test failure frequency.
• Applicable late in the life cycle.
– Statistical testing and sampling
• Requires a prohibitively large number of test cases.
– Formal analysis
• Applied to software models.
• All prohibitively expensive -> predict where faults hide and optimize verification.
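To make the reliability growth idea concrete, here is a minimal sketch that is not from the talk: it fits the Goel-Okumoto model m(t) = a(1 - e^(-bt)) to hypothetical cumulative failure counts and extrapolates the remaining faults. The data values, parameter guesses, and the 12-week horizon are assumptions chosen purely for illustration.

```python
import numpy as np
from scipy.optimize import curve_fit

# Goel-Okumoto mean value function: expected cumulative failures by time t
def goel_okumoto(t, a, b):
    return a * (1.0 - np.exp(-b * t))

# Hypothetical test data: weeks of testing vs. cumulative failures observed
weeks = np.array([1, 2, 3, 4, 5, 6, 7, 8], dtype=float)
cum_failures = np.array([12, 21, 27, 32, 35, 37, 38, 39], dtype=float)

# Fit model parameters (a = total expected faults, b = fault detection rate)
(a, b), _ = curve_fit(goel_okumoto, weeks, cum_failures, p0=[40.0, 0.5])

# Extrapolate: faults expected to remain after 12 weeks of testing
remaining = a - goel_okumoto(12.0, a, b)
print(f"estimated total faults a={a:.1f}, remaining after week 12: {remaining:.1f}")
```

The point of the slide stands: this kind of extrapolation needs substantial late-life-cycle failure data, which is exactly what makes it expensive.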
Fault Prediction Research
• Extensive research in software quality prediction.
– Faulty modules identified through the analysis and modeling of static code metrics.
• Significant payoff in software engineering practice by concentrating V&V resources on problem areas.
• Are all the prediction methods practical?
– Predominantly applied to multiple-version systems.
• A wealth of historical information from previous versions.
– What if we are creating Version 1.0?
Prediction within V1.0
• Not as rare a problem as some tend to believe.
– Customized products are developed regularly.
– One-of-a-kind applications:
• Embedded systems, space systems, defense applications.
• Typically high-dependability domains.
– NASA MDP data sets fall into this category.
• Labeling modules for fault content is COSTLY!
– The fewer labels needed to build a model, the cheaper the prediction task.
• The absence of a problem report does not imply a fault-free module.
– Standard fault prediction literature assumes massive amounts of labeled data are available for training.
Goals
• How much data does one need to build a fault prediction model?
– What happens when most modules do not have a label?
• Explore suitable machine learning techniques and compare results with previously published approaches.
– Semi-supervised learning (SSL).
– An intermediate approach between supervised and unsupervised learning.
– Labeled and unlabeled data used to train the model.
– No specific assumptions on label distributions.
SSL: Basic idea
Basic idea
• Iteratively train a supervised learning algorithm from "currently labeled" modules.
– Predict the labels of unlabeled modules.
– Migrate instances with "high confidence" predictions into the pool of labeled modules (FTcF algorithm).
– Repeat until all modules are labeled.
• Large number of independent variables (>40).
– Dimensionality reduction (not feature selection).
– Multidimensional scaling (MDS) as the data preprocessing technique.
Algorithm
• A variant of the self-training approach and Yaworski's algorithm.
• An unlabeled module may change its label in each iteration…
• Base learner φ: Random Forest, robust to noise.
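As a rough illustration of the two previous slides, here is a minimal self-training sketch with a Random Forest base learner and an MDS preprocessing step. It is a simplified one-way migration variant assumed for illustration: unlike the talk's algorithm, once a module is migrated its label no longer changes, and the confidence threshold, component count, and forest size are arbitrary choices rather than values from the study.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.manifold import MDS

def mds_reduce(X, n_components=10):
    """Project the >40 static code metrics onto a lower-dimensional
    embedding with multidimensional scaling (dimensionality reduction,
    not feature selection)."""
    return MDS(n_components=n_components, random_state=0).fit_transform(X)

def self_train(X_labeled, y_labeled, X_unlabeled, confidence=0.75, max_iter=50):
    """Iteratively fit a Random Forest on the current labeled pool, predict the
    unlabeled modules, and migrate high-confidence predictions into the pool."""
    X_l, y_l, X_u = np.array(X_labeled), np.array(y_labeled), np.array(X_unlabeled)
    clf = RandomForestClassifier(n_estimators=500, random_state=0)
    for _ in range(max_iter):
        if len(X_u) == 0:
            break
        clf.fit(X_l, y_l)
        proba = clf.predict_proba(X_u)
        conf = proba.max(axis=1)              # confidence of each prediction
        move = conf >= confidence             # modules confident enough to migrate
        if not move.any():                    # nothing confident enough: stop early
            break
        X_l = np.vstack([X_l, X_u[move]])
        y_l = np.concatenate([y_l, clf.classes_[proba[move].argmax(axis=1)]])
        X_u = X_u[~move]
    clf.fit(X_l, y_l)                         # final model on the enlarged pool
    return clf
```

In the SSL.MDS variant, mds_reduce would be applied to the full metric matrix before splitting it into labeled and unlabeled pools.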
Fault Prediction Data Sets
• Large NASA MDP projects (> 1,000 modules).
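For orientation, a minimal sketch of loading one such data set, assuming a local CSV export of module-level static code metrics with a binary "defective" column; the file name and the column name are placeholders, not part of the MDP distribution.

```python
import pandas as pd

# Hypothetical local export of a NASA MDP project (e.g., PC4); the file name
# and the "defective" label column are assumptions for illustration.
df = pd.read_csv("pc4.csv")

X = df.drop(columns=["defective"]).to_numpy()   # static code metrics per module
y = df["defective"].astype(int).to_numpy()      # 1 = faulty module, 0 = fault-free

print(f"{len(df)} modules, {X.shape[1]} metrics, {y.mean():.1%} labeled faulty")
```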
Experimentation
• Compare the performance of four fault prediction approaches, all using RF as the base learner:
– Supervised learning (SL)
– Supervised learning with dimensionality reduction (SL.MDS)
– Semi-supervised learning (SSL)
– Semi-supervised learning with dimensionality reduction (SSL.MDS)
• Assume 2% - 50% of modules are labeled.
– Randomly selected, repeated 10 times.
• Performance evaluation: area under the ROC curve (AUC) and probability of detection (PD), with PD reported at cut-off values of 0.1, 0.5, and 0.75.
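A minimal sketch of the supervised (SL) baseline in this protocol, assuming X and y hold the module metrics and fault labels as in the earlier sketches; the estimator settings and the single 0.5 cut-off are illustrative, and the SSL variants would replace the plain fit with the self-training loop shown earlier.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

def evaluate_supervised(X, y, labeled_fraction, repeats=10):
    """SL baseline: train a Random Forest on a small random labeled subset and
    evaluate on the remaining modules, which stand in for the unlabeled pool."""
    aucs, pds = [], []
    for r in range(repeats):
        X_l, X_u, y_l, y_u = train_test_split(
            X, y, train_size=labeled_fraction, stratify=y, random_state=r)
        clf = RandomForestClassifier(n_estimators=500, random_state=r).fit(X_l, y_l)
        scores = clf.predict_proba(X_u)[:, 1]
        aucs.append(roc_auc_score(y_u, scores))
        pds.append(((scores >= 0.5)[y_u == 1]).mean())   # PD at a 0.5 cut-off
    return float(np.mean(aucs)), float(np.mean(pds))

# Illustrative usage over the range of labeled fractions considered in the study:
# for frac in (0.02, 0.05, 0.10, 0.25, 0.50):
#     print(frac, evaluate_supervised(X, y, frac))
```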
Results on PC4
Comparing Techniques: AUC
Comparing Techniques: PD
Statistical Analysis
• H0: There is no difference between the 4 algorithms across all data sets.
• Ha: Prediction performance of at least one algorithm is significantly better than the others across all data sets.
• The p-value from ANOVA measures evidence against H0.
• Which approaches differ significantly? Use post-hoc Tukey's "honestly significant difference" (HSD) test.
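To illustrate the mechanics of this step, a minimal sketch assuming the per-data-set AUC values have already been collected into a long-format table; the column names and all numbers below are placeholders, not results from the talk, and a fuller treatment might also block on data set.

```python
import pandas as pd
from scipy import stats
from statsmodels.stats.multicomp import pairwise_tukeyhsd

# Placeholder AUC results: one row per (algorithm, data set) combination.
results = pd.DataFrame({
    "algorithm": ["SL", "SL.MDS", "SSL", "SSL.MDS"] * 3,
    "auc":       [0.70, 0.72, 0.74, 0.77,
                  0.68, 0.69, 0.73, 0.75,
                  0.71, 0.73, 0.76, 0.78],
})

# One-way ANOVA: evidence against H0 (no difference among the four algorithms)
groups = [g["auc"].values for _, g in results.groupby("algorithm")]
f_stat, p_value = stats.f_oneway(*groups)
print(f"ANOVA: F = {f_stat:.2f}, p = {p_value:.4f}")

# Post-hoc Tukey HSD: which pairs of algorithms differ significantly
print(pairwise_tukeyhsd(results["auc"], results["algorithm"]))
```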
Benchmarking
• Lessmann (TSE 2008) and Menzies (TSE 2007) offer benchmark performance for NASA MDP data sets.
– Lessmann et al. train on 66% of the data; Menzies trains on 90%.