Software Testing E6891 Lecture 5 2014-02-26 Todays plan Overview - PowerPoint PPT Presentation

Software Testing E6891 Lecture 5 2014-02-26

Today’s plan ● Overview of software testing ○ adapted from Software Carpentry ● Best practices for numerical computation ● Examples

Software testing ● Automatic failure detection ● NOT “correctness detection” ● Bugs are inevitable, but we’d like to find them quickly ● Do this right, and the tests will dictate program behavior

Why do we need tests? Correctness ● Does implementation match specification? ○ Implementation = code ○ Specification = equation, algorithm, paper, etc. ● Important for both ends of research ○ Accurately reporting your own method ○ Ensuring accurate replication of a reported method

Why do we need tests? Software design ● Thinking about failure modes improves software design ○ Isolate critical functions ○ Specify behavior ○ Explicit error handling ● Test each component ● End result ○ simplified high-level functions

Why do we need tests? Debugging ● Something’s wrong in my code! ● But submodules X and Y pass tests... ● so the bug must be in Z! ○ well, probably...

Why do we need tests? Optimization, refactoring ● My experiments are taking too long! ● Maybe I can optimize my algorithm… ● Is the faster version equivalent?

Unit testing ● Software is built from small components ○ input parser ○ feature extractor ○ number crunching ○ ... ● Don’t try to test the whole thing at once ● Test each component ( unit ) independently

Unit testing ● Unit ● Expected result ○ function being tested ○ What should the action produce? ● Fixture ○ return value? ○ Maybe an exception ○ the test input ○ many for each unit ● Actual result ● Action ○ Did it match expectation? ○ How to combine the unit + fixture ○ that is, test code

Example: computing norms ● Unit ● Action ○ ○ n = norm(fix[0],fix[1]) def norm(x, p): assert n == result n = 0 for xi in x: n += xi**p n = n**(1.0/p) return p ● Fixtures + results ○ ( ([ 1, 0, 0], 1), 1.0) ○ ( ([ 1, 0, 0], 2), 1.0) ○ ( ([-1, 0, 0], 1), 1.0) ○ ( ([-1, 0, 0], 2), 1.0) ○ ...

Example: computing norms ● Unit ● Action ○ ○ n = norm(fix[0],fix[1]) def norm(x, p): assert n == result n = 0 for xi in x: n += xi**p ● Report n = n**(1.0/p) return p ○ Pass ○ Pass ● Fixtures + results ○ Fail ○ ( ([ 1, 0, 0], 1), 1.0) ○ Pass ○ ( ([ 1, 0, 0], 2), 1.0) ○ ... ○ ( ([-1, 0, 0], 1), 1.0) ○ ( ([-1, 0, 0], 2), 1.0) ○ ...

Example: computing norms ● Unit ● Action ○ ○ n = norm(fix[0],fix[1]) def norm(x, p): assert n == result n = 0 for xi in x: n += abs(xi)**p ● Report n = n**(1.0/p) return p ○ Pass ○ Pass ● Fixtures + results ○ Pass ○ ( ([ 1, 0, 0], 1), 1.0) ○ Pass ○ ( ([ 1, 0, 0], 2), 1.0) ○ ... ○ ( ([-1, 0, 0], 1), 1.0) ○ ( ([-1, 0, 0], 2), 1.0) ○ ...

Designing test cases ● Exhaustive testing is generally impossible ● But don’t just use a single case either ● Seek out corner cases and assumptions ○ anywhere there’s a condition (if-then-else) ○ calls to other functions

Designing test cases: exercise def norm(x, p): n = 0 for xi in x: n += abs(xi)**p n = n**(1.0/p) return p ● What are the assumptions in this code? ● What are good test cases?

Designing test cases: exercise ● p > 0 def norm(x, p): ● p finite n = 0 ● len(x) > 0 for xi in x: ● xi +, -, 0? n += abs(xi)**p ● xi finite n = n**(1.0/p) ● Others? return p ● What are the assumptions in this code? ● What are good test cases?

Success vs failure? ● Tests can only identify incorrect behavior ○ Tests are never 100% complete ● Failure is correct behavior if the input is bad ○ Silent failure is a debugging nightmare ○ Use exceptions ! ○ Even MATLAB has exceptions now... ● If you only test success cases, failure may not be identified in practice

Don’t go overboard... ● Not all failures need to be def norm(x, p): ```Requires: handled type(x) = ndarray ○ what if p is a string? type(p) = float ○ what if x is a matrix? p > 0 ``` if p <= 0: ● Use test cases to guide raise ValueError() development n = 0 for xi in x: n += abs(xi)**p ● Testing makes n = n**(1.0/p) return p documentation easier

Testing numerical methods ● Some numerical routines are complicated ○ integral(f, a, b) ● Tests should depend on the interface ○ Not the implementation! ● Try to design test cases with known answers ○ ([f(x) = 1.0, a=0, b=2], 2.0) ○ ([f(x) = abs(x), a=-1, b=1], 1.0)

Testing numerical methods ● In floating point, things are rarely identical ● BAD : too strict, relies on machine precision ○ assert f(x) == result ● BETTER : allows small absolute differences ○ assert abs(f(x) - result) < 1e-10 ● BEST : allows small relative differences ○ assert np.allclose(f(x), result)

What if solutions are not unique? ● Examples: ○ sqrt(x), positive or negative? ○ eigenvectors ○ k-means, mixture models, etc. ● Don’t test quantitatively ○ assert np.allclose(sqrt_x, exp_sqrt_x) ● Test qualitatively ○ assert np.allclose(sqrt_x**2, x)

What is correct anyway? ● Often, behavior is not clearly specified ○ e.g.: automatic beat tracking ○ no right answer for a given input ● Maybe we’re just matching a previous implementation ○ while refactoring or optimizing code ○ or porting/re-implementing in a new language ● Generate fixture/result pairs by running the old version

Testing frameworks ● Writing test code is no fun ● Fortunately, most languages have test suites ○ (yes, even MATLAB) ● We’ll talk about python’s nosetest module

nosetest ● Implement actions as functions test_* ● Automatic report generation ● Advanced features ○ exception handling ○ fixture setup/teardown ○ function/class/module/package support ○ test generators: iterate over fixtures

Using nosetest

An example: librosa

Wrap up ● Automated testing will make your life easier (in the long run) ● It’s not difficult ● Your code will be better ● No (fewer?) late-night panic attacks

Software Testing E6891 Lecture 5 2014-02-26 Todays plan Overview - PowerPoint PPT Presentation

Software Testing E6891 Lecture 5 2014-02-26 Todays plan Overview of software testing adapted from Software Carpentry Best practices for numerical computation Examples Software testing Automatic failure detection

Software Testing Software testing 1 V model Software testing 2 Program testing goals To

A review of software testing P DAVID COWARD 200511347 Software testing Software

Software Testing Overview What is software testing? General testing criteria Testing

Software testing Software Testing Introduction Testing levels Automated testing Principles and

Introduction to Software Testing Software Testing - Module 1 Part 1 The Software Engineering

UI TDD COCOAHEADS AUG 2018 TDD UI TDD SOFTWARE TESTING SOFTWARE TESTING Repeatability

Levels of Testing Chapter 12 Beyond unit testing Developer Testing stages Unit testing

Testing Terminology System testing Types of errors Function testing Structure

Property-Based Testing Matt Bachmann @mattbachmann Testing is Important Testing is Important

1. Test page This page is for testing. This page is for testing. This page is for testing.

Software Testing Outline Software Quality Unit Testing Integration Testing

TESTING SOFTWARE TESTING "Software testing is an investigation conducted to provide

Software Testing Techniques Chapter 17 Software Testing Strategies Chapter 18 1 Software

Overview Objective Types of testing ECE 553: TESTING AND Verification testing

Object Oriented Testing Chapter 23 1 OO Testing Class Testing: Equivalent to unit testing

Software Testing Strategies Chapter 18 1 Review SW Testing Techniques Chapter 17 2 Software

RF Sources Ralph J. Pasquinelli PIP-II Machine Advisory Committee Meeting 15-17 March 2016 High

AGENDA TTO follow up Run 11 -12 Goals Efficiency TTO FOLLOW UP P . Krejcik wrote

Predicting Computer System Failures Using Support Vector Machines Errin W. Fulp a Glenn A. Fink b

volatile unsigned short DMA1SA @ 0x01eau; void iar_buggy_func(unsigned char ch) { DMA1SA =

ECE590-03 Enterprise Storage Architecture Fall 2016 Failures in hard disks and SSDs Tyler

Cisco Security Authentication Failure Rate Cisco Security Authentication Failure Rate or SHIT

Resilient Data Collection of Wireless Sensor Networks in Oil and Gas Refineries Tianyuan Liu,

Doomsday Anwesha Das, Frank Mueller, Paul Hargrove, Eric Roman, Scott Baden Lawrence

Software Testing E6891 Lecture 5 2014-02-26 Todays plan Overview - PowerPoint PPT Presentation

Software Testing E6891 Lecture 5 2014-02-26 Todays plan Overview of software testing adapted from Software Carpentry Best practices for numerical computation Examples Software testing Automatic failure detection

Software Testing Software testing 1 V model Software testing 2 Program testing goals To

A review of software testing P DAVID COWARD 200511347 Software testing Software

Software Testing Overview What is software testing? General testing criteria Testing

Software testing Software Testing Introduction Testing levels Automated testing Principles and

Introduction to Software Testing Software Testing - Module 1 Part 1 The Software Engineering

UI TDD COCOAHEADS AUG 2018 TDD UI TDD SOFTWARE TESTING SOFTWARE TESTING Repeatability

Levels of Testing Chapter 12 Beyond unit testing Developer Testing stages Unit testing

Testing Terminology System testing Types of errors Function testing Structure

Property-Based Testing Matt Bachmann @mattbachmann Testing is Important Testing is Important

1. Test page This page is for testing. This page is for testing. This page is for testing.

Software Testing Outline Software Quality Unit Testing Integration Testing

TESTING SOFTWARE TESTING &quot;Software testing is an investigation conducted to provide

Software Testing Techniques Chapter 17 Software Testing Strategies Chapter 18 1 Software

Overview Objective Types of testing ECE 553: TESTING AND Verification testing

Object Oriented Testing Chapter 23 1 OO Testing Class Testing: Equivalent to unit testing

Software Testing Strategies Chapter 18 1 Review SW Testing Techniques Chapter 17 2 Software

RF Sources Ralph J. Pasquinelli PIP-II Machine Advisory Committee Meeting 15-17 March 2016 High

AGENDA TTO follow up Run 11 -12 Goals Efficiency TTO FOLLOW UP P . Krejcik wrote

Predicting Computer System Failures Using Support Vector Machines Errin W. Fulp a Glenn A. Fink b

volatile unsigned short DMA1SA @ 0x01eau; void iar_buggy_func(unsigned char ch) { DMA1SA =

ECE590-03 Enterprise Storage Architecture Fall 2016 Failures in hard disks and SSDs Tyler

Cisco Security Authentication Failure Rate Cisco Security Authentication Failure Rate or SHIT

Resilient Data Collection of Wireless Sensor Networks in Oil and Gas Refineries Tianyuan Liu,

Doomsday Anwesha Das, Frank Mueller, Paul Hargrove, Eric Roman, Scott Baden Lawrence

TESTING SOFTWARE TESTING "Software testing is an investigation conducted to provide