Evaluation of Example Tools For Hairy Tasks Presenter: Changsheng - PowerPoint PPT Presentation

Evaluation of Example Tools For Hairy Tasks Presenter: Changsheng chen CS 846 project Presentation Department of Computer Science

Outline ▪ Motivation ▪ Introduction ▪ Related works ▪ Case study 1 ▪ Case study 2 ▪ Conclusion

Motivation ▪ In some scenarios, for some tasks, any tool with less than 100% recall is not helpful and the user may be better off doing the task entirely manually. ▪ The trade off between precision and recall may make it difficult to interpret the true result. ▪ Improper use of precision and recall may affect evaluation. ▪ Different tasks need different weight for F-measure

Introduction – Recall and Precision ▪ Precision (P) is the ▪ Recall (R) is the percentage of the tool- percentage of the returned answers that correct answers that are correct. the tool returns ▪ Precision is the ▪ Which is the percentage of the found percentage of the right stuff that is right stuff that is found.

Introduction – F-Measure ▪ F-measure: harmonic mean of Precision and Recall ▪ Weighted F-Measure: For situations in which R and P are not equally important. β is the ratio by which it is desired to weight Recall more than Precision.

Case Study 1: ▪ Using Tools to Assist Identification of Non-requirements in Requirements Specifications – A Controlled Experiment(Jonas Paul Winkler and Andreas Vogelsang) ▪ Categorizing textual fragments into requirements and non-requirements. ▪ In practice, this categorization is performed manually ▪ Developed a tool to assist users in this task by providing warnings based on classification. ▪ Performed a controlled experiment with two groups of students. ▪ The results show that the application of an automated classification approach may provide benefits, given that the accuracy is high enough.

Case Study 1: ▪ Using Tools to Assist Identification of Non-requirements in Requirements Specifications – A Controlled Experiment(Jonas Paul Winkler and Andreas Vogelsang) ▪ Investigation of the effectiveness of automated tools for RE tasks ▪ Their experiment supports that claim that the accuracy of the tool may have an effect on the observed performance. ▪ A human working with the tool on the task should at least achieve better recall than a human working on the task entirely manually. ▪ The experimental setup follows this idea by comparing tool-assisted and manual reviews.

Case Study 2: ▪ Evaluation of Techniques to Detect Wrong Interaction Based Trace Links(Paul Hubner and Barbara Paech) ▪ Trace links are created and used continuously during the development ▪ Support developers with an automatic trace link creation approach with high precision. ▪ In their previous study we showed an interaction based trace link creation approach which is better than traditional IR based approaches. Performed a controlled experiment with two groups of students. ▪ Performed the study within a student project. ▪ Evaluated different techniques to identify relevant trace link candidates such as focus on edit interactions or thresholds for frequency and duration of trace link candidates.

Conclusion ▪ Most RE and SE tasks involving NL documents are hairy tasks and need tools support. ▪ We may evaluate these tools with the different F-measure because the importance of recall and precision may be different for different tasks. ▪ We must to research and understand which measures are appropriate to evaluate any tool for the task.

THANK YOU! QUESTIONS?

Evaluation of Example Tools For Hairy Tasks Presenter: Changsheng - PowerPoint PPT Presentation

Evaluation of Example Tools For Hairy Tasks Presenter: Changsheng chen CS 846 project Presentation Department of Computer Science Outline Motivation Introduction Related works Case study 1 Case study 2 Conclusion

SENSORY EVALUATION .. Basics of Sensory evaluation, Tools, Techniques, Methods and

Title goes here Tools for Performance Evaluation Timing and performance evaluation has been

Quota Assessment Tools Evaluation April 4, 2017 Agenda 1. Opening Remarks 2. History of Quota

The Tools of the Trade: How to The Tools of the Trade: How to Find or Create the Evaluation Find

Performance Evaluation for Petascale Quantum Simulation Tools

2008 Ryder Scott Reserves Conference Evaluation Challenges in a Changing World Tools for

Evaluation of validation tools of Java Agata Gruza and Ramya Krishna Koricherla Department of

Roadmap for Section 11.1 Performance Evaluation and Prediction Tools for Monitoring Windows

Evaluation of example tools for hairy tasks. Presenter: Hardik Sahi (20743327) Outline

Evaluation An overview of tools that can be used to improve programme efficiency, document impact

Evaluation & Engineering Tools WORKING DRAFT PRESENTATION TO THE ACTC ON MAY 1, 2019 1

Evaluation of Textual Knowledge Acquisition Tools: a Challenging Task Ha fa Zargayouna,

Monitoring and Evaluation The Nuts and Bolts of Monitoring and Evaluation How the scheme is

Evaluation of Ontology Evaluation of Ontology Merging Tools in Merging Tools in Bioinformatics

Integrating Device Registries and Innovative Tools for Enhanced Medical Device Evaluation and

Integrating Device Registries, UDI and Innovative Tools for Medical Device Evaluation An Update to

Supporting SRF Programs through Applied Research and Program Evaluation Tools CIFA Election Day

Heritage Grants Program: Evaluation Plan Presenters: Nancy Hewat, Ph.D. Synthesis Evaluation

Mapping to the Milestones A CONCEPTUAL PRESENTATION ON HOW TO THINK ABOUT LINKING YOUR

Why Cant Johnny Fix Vulnerabilities: A Usability Evaluation of Static Analysis Tools for

EVALUATING FASD PREVENTION & SUPPORT PROGRAMS Tools to Support Planning and Evaluation Nancy

IPM Evaluation Tools for Fruit and Field Crops Peter Werts Project Assistant IPM Institute of

A Systematic Literature Review on Evaluation of Digital Tools for Authoring Evidence - based

Evaluation CS294-184: Building User-Centered Programming Tools UC Berkeley Sarah E. Chasins

Evaluation of Example Tools For Hairy Tasks Presenter: Changsheng - PowerPoint PPT Presentation

Evaluation of Example Tools For Hairy Tasks Presenter: Changsheng chen CS 846 project Presentation Department of Computer Science Outline Motivation Introduction Related works Case study 1 Case study 2 Conclusion

SENSORY EVALUATION .. Basics of Sensory evaluation, Tools, Techniques, Methods and

Title goes here Tools for Performance Evaluation Timing and performance evaluation has been

Quota Assessment Tools Evaluation April 4, 2017 Agenda 1. Opening Remarks 2. History of Quota

The Tools of the Trade: How to The Tools of the Trade: How to Find or Create the Evaluation Find

Performance Evaluation for Petascale Quantum Simulation Tools

2008 Ryder Scott Reserves Conference Evaluation Challenges in a Changing World Tools for

Evaluation of validation tools of Java Agata Gruza and Ramya Krishna Koricherla Department of

Roadmap for Section 11.1 Performance Evaluation and Prediction Tools for Monitoring Windows

Evaluation of example tools for hairy tasks. Presenter: Hardik Sahi (20743327) Outline

Evaluation An overview of tools that can be used to improve programme efficiency, document impact

Evaluation &amp; Engineering Tools WORKING DRAFT PRESENTATION TO THE ACTC ON MAY 1, 2019 1

Evaluation of Textual Knowledge Acquisition Tools: a Challenging Task Ha fa Zargayouna,

Monitoring and Evaluation The Nuts and Bolts of Monitoring and Evaluation How the scheme is

Evaluation of Ontology Evaluation of Ontology Merging Tools in Merging Tools in Bioinformatics

Integrating Device Registries and Innovative Tools for Enhanced Medical Device Evaluation and

Integrating Device Registries, UDI and Innovative Tools for Medical Device Evaluation An Update to

Supporting SRF Programs through Applied Research and Program Evaluation Tools CIFA Election Day

Heritage Grants Program: Evaluation Plan Presenters: Nancy Hewat, Ph.D. Synthesis Evaluation

Mapping to the Milestones A CONCEPTUAL PRESENTATION ON HOW TO THINK ABOUT LINKING YOUR

Why Cant Johnny Fix Vulnerabilities: A Usability Evaluation of Static Analysis Tools for

EVALUATING FASD PREVENTION &amp; SUPPORT PROGRAMS Tools to Support Planning and Evaluation Nancy

IPM Evaluation Tools for Fruit and Field Crops Peter Werts Project Assistant IPM Institute of

A Systematic Literature Review on Evaluation of Digital Tools for Authoring Evidence - based

Evaluation CS294-184: Building User-Centered Programming Tools UC Berkeley Sarah E. Chasins

Evaluation & Engineering Tools WORKING DRAFT PRESENTATION TO THE ACTC ON MAY 1, 2019 1

EVALUATING FASD PREVENTION & SUPPORT PROGRAMS Tools to Support Planning and Evaluation Nancy