Method Based on Morphological Analysis, Clustering and the - PowerPoint PPT Presentation

A Test Case Recommendation Method Based on Morphological Analysis, Clustering and the Mahalanobis-Taguchi Method Hirohisa Aman 1) Takashi Nakano 2) Hideto Ogasawara 2) Minoru Kawahara 1) 1) Ehime University, Japan 2) Toshiba Corporation, Japan (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 1

Overview Purpose To recommend similar but different test cases in order to reduce the risk of overlooking regressions Method Quantify the similarity between test cases through the morphological analysis , and categorized them ( clustering ) Once a test case is selected by a test engineer , the proposed method automatically recommends additional test cases based on the results of clustering Result The proposed method is about six times more effective than the random test case selection; it would be useful in making a regression test plan (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 2

Outline  Background, Motivation & Situation  Test Case Recommendation ◦ Morphological Analysis ◦ Test Case Clustering ◦ Test Case Prioritization  Empirical Study  Related Work  Conclusion & Future Work (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 3

Background: Regression Testing  In fact, it is difficult to always make a one- shot release of a perfect product which has no need to be modified in the future test instal retest l reinstall modification report  Program modifications may cause other failures (regressions) (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 5

Motivation: Unexpected Failures & Testing Cost  We may encounter unexpected failures in unexpected functions after modifications modification modification Unexpected failure in another function which seemed to be independent of the modified functions!  While it is ideal to rerun all test cases every time, we have the restriction of cost … (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 6

Motivation: Risk of Overlooking regressions  We have a lot of test cases , and it's unrealistic to rerun all of them whenever a modification is made  We have to select test cases , but there is the risk of overlooking regressions since we might miss rerunning important test cases missed test cases selected test cases set of all test cases (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 7

Our Available Data versions (revisions) current version V1 V2 V3 V4 V5 V6 V7 V8 V9 test cases T1 P T2 P T3 F P T4 P F P T5 F F P T6 F P … (P: pass, F: fail, Blank: no run) (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 9

Scenario for Our Test Case Recommendation 1. For each version, a practitioner decides on a set of test cases to rerun ( 𝑆 0 ) 2. We recommend another set of test cases similar to the ones in 𝑆 0 in regards to their priorities 10 12 6 11 recommends 7 practitioner's selection set of all test cases (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 11

Morphological Analysis  A morphological analysis is used to analyze texts written in a natural language  It divides text strings into component words and detects their parts of speech (noun, verb, …) This is a simple example. This is a simple example . this be a simple example . noun verb adjective determiner determiner  There are many applications of it like machine translations (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 13

Analysis of Our Test Case  Our test case is written in Japanese  A test engineer performs his/her test according to the test case An example of a test case (translated into English) A project creation: Enter a name of project, and check if we can successfully create a new project on the system. The length of project's name should be around 10 characters.  We used MeCab (one of the most popular morphological analysis tool for Japanese), and extracted a set of words (nouns, adjectives and verbs) (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 14

Similarity between Test Cases  We compute the similarity between test cases 𝑢 𝑗 and 𝑢 𝑘 by using the Jaccard index : 𝑋 𝑗 ∩ 𝑋 𝑘 𝐾 𝑢 𝑗 , 𝑢 𝑘 = 𝑋 𝑗 ∪ 𝑋 𝑘 ◦ 𝑋 𝑗 : the set of words in test case 𝑢 𝑗 ◦ 𝑋 𝑘 : the set of words in test case 𝑢 𝑘  This is a simple but useful index; it has been widely used in the natural language processing world (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 15

Example  Suppose our sets of words are 𝑋 𝑋 1 button, click, chronological, date, display, 1 button, click, chronological, date, display, download, file, log, order download, file, log, order 𝑋 𝑋 2 archive, button, click, chronological, date, 2 archive, button, click, chronological, date, download, file, order download, file, order 𝑋 1 ∩ 𝑋 1 button, click, chronological, date, 7 download, file, order 𝑋 1 ∪ 𝑋 2 archive, button, click, chronological, date, 10 display, download, file, log, order 𝐾 𝑢 1 , 𝑢 2 = 0.7 (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 16

Clustering  Clustering is the task of grouping a set of objects together (making a cluster )  Objects belonging to the same group are more similar to each other than they are to objects of other groups (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 18

Test Case Clustering  Define the distance between test cases 𝑒 𝑢 𝑗 , 𝑢 𝑘 = 1 − 𝐾 𝑢 𝑗 , 𝑢 𝑘 This is referred to as Jaccard distance  Then, perform a clustering ◦ We used hclust function in R (a popular statistical computing environment) ◦ The function performs a hierarchical cluster analysis with the complete linkage method (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 19

Dendrogram (tree diagram)  We can obtain the results of clustering Jaccard distance cut level we will group test cases whose distances are less than the cut level in the same cluster  We empirically set 0.3 as the cut level : we consider that two test cases are similar when their Jaccard index ≥ 0.7 (= 1 − 0.3) (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 20

Test Case Prioritization  After our test case clustering, we select test cases to rerun  Within a cluster, we prioritize certain test cases  We have empirically used two criteria: I. Gap between the Last run version and the Current version ( GLC ) II. Failure Rate ( FR ) (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 22

Priority of a Test Case: Type-I Gap between the Last run version and the Current version ( GLC ) current version versions (revisions) V1 V2 V3 V4 V5 V6 V7 V8 V9 1 test cases T1 P 8 T2 P T3 F P 6 T4 P F P 2 T5 F F P 3 T6 F P 0 A greater GLC value means it’s not been tested for more … versions. Ignoring such a test case has a higher risk of overlooking regressions. (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 23

Priority of a Test Case: Type-II Failure Rate ( FR ) current version versions (revisions) V1 V2 V3 V4 V5 V6 V7 V8 V9 0/1 T1 P test cases 0/1 T2 P T3 F P 1/2 T4 P F P 1/3 T5 F F P 2/3 T6 F P 1/2 … A higher FR value means a better track record for finding a failure in the past. Such a test case may test a part which is fault-prone and we might expect a higher ability to find a regression. (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 24

How should we combine them? We have to consistently combine two different criteria for all test cases To implement such an integration, we adopt the notion of the Mahalanobis-Taguchi Method close to (it looks abnormal) normal objects far from normal objects objects working normally (C) 2017 Hirohisa Aman TAIC PART 2017 in Tokyo 25

Method Based on Morphological Analysis, Clustering and the - PowerPoint PPT Presentation

A Test Case Recommendation Method Based on Morphological Analysis, Clustering and the Mahalanobis-Taguchi Method Hirohisa Aman 1) Takashi Nakano 2) Hideto Ogasawara 2) Minoru Kawahara 1) 1) Ehime University, Japan 2) Toshiba Corporation, Japan

An Unsupervised Method for Uncovering Morphological Chains Karthik Narasimhan Regina Barzilay

Supervised Learning of Complete Morphological Paradigms Greg Durrett and John DeNero UC

Morphology & Transducers Intro to morphological analysis of languages Motivation for

Russian Morphological Processing for ICALL System architecture Exercise design Error types

A New Universal Morphological Feature Schema for Rich Morphological Annotation and Cross-Lingual

Morphological Analysis Morphological Analysis and Generation for Pali and Generation for Pali

Maca a configurable tool to Maca a configurable tool to integrate Polish morphological

Morphological Types Ling 203 Languages of the World Based on Lyovin (1997:26) Typology Exercise

The Scientific Method The Scientific Method The Scientific Method involves 6 steps: Problem

Feature-Based Tagging The Task, Again Recall: tagging ~ morphological disambiguation

Why do we make mistakes in morphological diagnosis how can we improve? Michelle Brereton

Combining Data-Intense and Compute-Intense Methods for Fine-Grained Morphological Analyses Petra

Some morphological operators in graph spaces Jean Cousty , Laurent Najman, and Jean Serra Workshop

Contents List of algorithms iii 13 Mathematical morphology 1 13.1 Basic morphological

MORPHOLOGICAL OPERATORS INEL 6088 - M. Toledo Ref: Sec. 2.6, 2.7 Jain et. al, Ch. 8 Davies

Yayi A generic framework for morphological image processing IPOL Raffi Enficiaud June 2012

s rts t ss

First Quarter Results 2010 Zurich April 22, 2010 Cautionary statement Cautionary statement

Market Timing Is ... Mark Pankin MDP Associates LLC Registered Investment Advisor November 15,

2018-04-02 Attention PSY 525.001 Vision Science 2018 Spring Rick Gilmore 2018-04-02

Continuous Improvement Toolkit Design of Experiment (Introduction) Continuous Improvement Toolkit

Jai Narayan Tripathi -IIT Bombay, INDIA.Rajkumar Nagpal - STMicroelectronics, INDIA.Rakesh Malik -

Learning morphology and phonology John Goldsmith University of Chicago MoDyCo/Paris X Learning

Architectural Fractals Daniel Lordick Institute of Geometry Dresden University of Technology,

Method Based on Morphological Analysis, Clustering and the - PowerPoint PPT Presentation

A Test Case Recommendation Method Based on Morphological Analysis, Clustering and the Mahalanobis-Taguchi Method Hirohisa Aman 1) Takashi Nakano 2) Hideto Ogasawara 2) Minoru Kawahara 1) 1) Ehime University, Japan 2) Toshiba Corporation, Japan

An Unsupervised Method for Uncovering Morphological Chains Karthik Narasimhan Regina Barzilay

Supervised Learning of Complete Morphological Paradigms Greg Durrett and John DeNero UC

Morphology &amp; Transducers Intro to morphological analysis of languages Motivation for

Russian Morphological Processing for ICALL System architecture Exercise design Error types

A New Universal Morphological Feature Schema for Rich Morphological Annotation and Cross-Lingual

Morphological Analysis Morphological Analysis and Generation for Pali and Generation for Pali

Maca a configurable tool to Maca a configurable tool to integrate Polish morphological

Morphological Types Ling 203 Languages of the World Based on Lyovin (1997:26) Typology Exercise

The Scientific Method The Scientific Method The Scientific Method involves 6 steps: Problem

Feature-Based Tagging The Task, Again Recall: tagging ~ morphological disambiguation

Why do we make mistakes in morphological diagnosis how can we improve? Michelle Brereton

Combining Data-Intense and Compute-Intense Methods for Fine-Grained Morphological Analyses Petra

Some morphological operators in graph spaces Jean Cousty , Laurent Najman, and Jean Serra Workshop

Contents List of algorithms iii 13 Mathematical morphology 1 13.1 Basic morphological

MORPHOLOGICAL OPERATORS INEL 6088 - M. Toledo Ref: Sec. 2.6, 2.7 Jain et. al, Ch. 8 Davies

Yayi A generic framework for morphological image processing IPOL Raffi Enficiaud June 2012

s rts t ss

First Quarter Results 2010 Zurich April 22, 2010 Cautionary statement Cautionary statement

Market Timing Is ... Mark Pankin MDP Associates LLC Registered Investment Advisor November 15,

2018-04-02 Attention PSY 525.001 Vision Science 2018 Spring Rick Gilmore 2018-04-02

Continuous Improvement Toolkit Design of Experiment (Introduction) Continuous Improvement Toolkit

Jai Narayan Tripathi -IIT Bombay, INDIA.Rajkumar Nagpal - STMicroelectronics, INDIA.Rakesh Malik -

Learning morphology and phonology John Goldsmith University of Chicago MoDyCo/Paris X Learning

Architectural Fractals Daniel Lordick Institute of Geometry Dresden University of Technology,

Morphology & Transducers Intro to morphological analysis of languages Motivation for