DM811 Heuristics for Combinatorial Optimization
Lecture 15: Methods for Experimental Analysis
Marco Chiarandini
Department of Mathematics & Computer Science, University of Southern Denmark
Course Overview

✔ Combinatorial Optimization, Methods and Models
✔ CH and LS: overview
✔ Working Environment and Solver Systems
˜ Methods for the Analysis of Experimental Results
✔ Construction Heuristics
✔ Local Search: Components, Basic Algorithms
✔ Local Search: Neighborhoods and Search Landscape
✔ Efficient Local Search: Incremental Updates and Neighborhood Pruning
✔ Stochastic Local Search & Metaheuristics
˜ Configuration Tools: F-race
Very Large Scale Neighborhoods

Examples: GCP, CSP, TSP, SAT, MaxIndSet, SMTWP, Steiner Tree, Unrelated Parallel Machines, p-median, set covering, QAP, ...
Outline

1. Experimental Methods: Inferential Statistics
   Statistical Tests
   Experimental Designs
   Applications to Our Scenarios
2. Race: Sequential Testing
Inferential Statistics

We work with samples (instances, solution quality), but we want sound conclusions: generalization over a given population (all runs, all possible instances). Thus we need statistical inference.

[Diagram: a random sample X_n drawn from a population with distribution P(x, θ); from the sample, a statistical estimator θ̂ is computed for the population parameter θ.]

Since the analysis is based on finite-sized sampled data, statements like "the cost of solutions returned by algorithm A is smaller than that of algorithm B" must be completed by "at a level of significance of 5%".
A Motivating Example

There is a competition and two stochastic algorithms A_1 and A_2 are submitted. We run both algorithms once on n instances. On each instance either A_1 wins (+), or A_2 wins (−), or they tie (=).

Questions:
1. If we have only 10 instances and algorithm A_1 wins 7 times, how confident are we in claiming that algorithm A_1 is the best?
2. How many instances and how many wins should we observe to gain a confidence of 95% that algorithm A_1 is the best?
A Motivating Example

p: probability that A_1 wins on each instance (+)
n: number of runs without ties
Y: number of wins of algorithm A_1

If each run is independent and consistent:

    Y ~ B(n, p):   Pr[Y = y] = C(n, y) p^y (1 − p)^(n−y)

[Figure: binomial probability mass function, Trials = 30, Probability of success = 0.5; probability mass vs. number of successes.]
1. If we have only 10 instances and algorithm A_1 wins 7 times, how confident are we in claiming that algorithm A_1 is the best?

Under these conditions, we can check how unlikely the observed outcome is if it were p(+) ≤ p(−).

If p = 0.5, then the chance that algorithm A_1 wins 7 or more times out of 10 is 17.2%: quite high!

[Figure: binomial probability mass function Pr[Y = y], probability of success 0.5, for y = 0, ..., 10.]
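A quick sanity check of this number in R (a minimal sketch, assuming the sign-test setup above): the upper-tail probability Pr[Y ≥ 7] under Y ~ B(10, 0.5).

    # Pr[Y >= 7] for Y ~ Binomial(n = 10, p = 0.5)
    sum(dbinom(7:10, size = 10, prob = 0.5))               # 0.171875
    pbinom(6, size = 10, prob = 0.5, lower.tail = FALSE)   # same value via the upper tail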
2. How many instances and how many wins should we observe to gain a confidence of 95% that algorithm A_1 is the best?

To answer this question, we compute the 95% quantile, i.e., the smallest y such that Pr[Y ≥ y] < 0.05 with p = 0.5, at different values of n:

    n:  10  11  12  13  14  15  16  17  18  19  20
    y:   9   9  10  10  11  12  12  13  13  14  15

This is an application of the sign test, a special case of the binomial test in which p = 0.5.
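The table can be reproduced with a short R sketch (my own illustration, not part of the slides): for each n, take the smallest y whose upper-tail probability under B(n, 0.5) drops below 0.05.

    crit <- sapply(10:20, function(n) {
      y <- 0:n
      tail.prob <- pbinom(y - 1, size = n, prob = 0.5, lower.tail = FALSE)  # Pr[Y >= y]
      min(y[tail.prob < 0.05])                                              # smallest y below the 5% level
    })
    names(crit) <- 10:20
    crit
    # 10 11 12 13 14 15 16 17 18 19 20
    #  9  9 10 10 11 12 12 13 13 14 15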
Statistical Tests

General procedure:
Assume that the data are consistent with a null hypothesis H_0 (e.g., the sample data are drawn from distributions with the same mean value).
Use a statistical test to compute how likely this is to be true, given the data collected. This "likelihood" is quantified as the p-value.
Do not reject H_0 if the p-value is larger than a user-defined threshold called the level of significance α.
Otherwise (p-value < α), H_0 is rejected in favor of an alternative hypothesis H_1, at a level of significance of α.
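Applied to the motivating example, the whole procedure is a single call in R (a sketch of the one-sided sign test for 7 wins out of 10):

    binom.test(x = 7, n = 10, p = 0.5, alternative = "greater")
    # p-value = 0.1719 > 0.05, so H_0 (p = 0.5) is not rejected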
Inferential Statistics

Two kinds of errors may be committed when testing hypotheses:

    α = P(type I error) = P(reject H_0 | H_0 is true)
    β = P(type II error) = P(fail to reject H_0 | H_0 is false)

General rule:
1. specify the type I error, or level of significance, α
2. seek the test with suitably large statistical power, i.e., 1 − β = P(reject H_0 | H_0 is false)
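For normally distributed data, R's power.t.test makes the trade-off between α, power and sample size concrete (a sketch; the effect size delta and standard deviation sd below are made-up values):

    # sample size needed to detect a difference of 1 with sd = 2,
    # at significance level 0.05 and power 0.9 (two-sample t test)
    power.t.test(delta = 1, sd = 2, sig.level = 0.05, power = 0.9)
    # returns n of roughly 85 observations per group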
Theorem (Central Limit Theorem)
If X_1, ..., X_n is a random sample from an arbitrary distribution with mean µ and variance σ², then the average X̄_n is asymptotically normally distributed, i.e.,

    X̄_n ≈ N(µ, σ²/n)    or    z = (X̄_n − µ) / (σ/√n) ≈ N(0, 1)

Consequences:
- allows inference from a sample
- allows modelling errors in measurements: X = µ + ε

Issues:
- n should be large enough
- µ and σ must be known
[Figure: Weibull density dweibull(x, shape = 1.4), and histograms of the standardized mean z = (X̄ − µ)/(σ/√n) for samples of size n = 1, 5, 15, 50, each repeated 100 times; as n grows the histograms approach N(0, 1).]
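The figure can be reproduced by a short simulation (a sketch; shape = 1.4 and the 100 replications follow the plot labels above):

    set.seed(1)
    shape <- 1.4
    mu    <- gamma(1 + 1/shape)                 # mean of Weibull(shape, scale = 1)
    sigma <- sqrt(gamma(1 + 2/shape) - mu^2)    # standard deviation
    par(mfrow = c(1, 4))
    for (n in c(1, 5, 15, 50)) {
      z <- replicate(100, (mean(rweibull(n, shape)) - mu) / (sigma / sqrt(n)))
      hist(z, freq = FALSE, main = paste("n =", n), xlab = "x")
      curve(dnorm(x), add = TRUE)               # reference N(0, 1) density
    }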
Hypothesis Testing and Confidence Intervals

A test of hypothesis determines how likely a sampled estimate θ̂ is to occur under some assumptions on the parameter θ of the population:

    Pr( µ − z_1 σ/√n ≤ X̄ ≤ µ + z_2 σ/√n ) = 1 − α

A confidence interval contains all those values that a parameter θ is likely to assume with probability 1 − α:

    Pr( θ̂_1 < θ < θ̂_2 ) = 1 − α,    e.g.,    Pr( X̄ − z_1 σ/√n ≤ µ ≤ X̄ + z_2 σ/√n ) = 1 − α

[Figure: several sample means X̄_1, X̄_2, X̄_3 around µ, with the corresponding intervals.]
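As a numeric illustration (my own example, assuming σ known as in the formula above), a 95% confidence interval for µ from a sample mean:

    x     <- c(42.1, 39.8, 41.3, 40.7, 43.0, 38.9)  # made-up sample
    sigma <- 1.5                                    # assumed known standard deviation
    n     <- length(x)
    z     <- qnorm(0.975)                           # two-sided 5% level
    mean(x) + c(-1, 1) * z * sigma / sqrt(n)        # lower and upper bound of the interval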
Statistical Tests

The Procedure of Test of Hypothesis

1. Specify the parameter θ and the test hypotheses, e.g., θ = µ_1 − µ_2 with
       H_0: θ = 0
       H_1: θ ≠ 0
2. Obtain P(θ | θ = 0), the null distribution of θ.
3. Compare θ̂ with the α/2-quantiles (for two-sided tests) of P(θ | θ = 0) and reject H_0 or not according to whether θ̂ is larger or smaller than this value.
The Confidence Intervals Procedure

1. Specify the parameter θ and the test hypotheses, e.g., θ = µ_1 − µ_2 with
       H_0: θ = 0
       H_1: θ ≠ 0
2. Obtain P(θ | θ = 0), the null distribution of θ, in correspondence of the observed estimate θ̂ of the sample X; e.g., for two normal samples,
       T = [ (X̄_1 − X̄_2) − (µ_1 − µ_2) ] / √( S²_{X_1}/n_1 + S²_{X_2}/n_2 )
   follows a Student's t distribution (in a permutation approach, the statistic is θ* = X̄*_1 − X̄*_2).
3. Determine (θ̂⁻, θ̂⁺) such that Pr{ θ̂⁻ ≤ θ ≤ θ̂⁺ } = 1 − α.
4. Do not reject H_0 if θ = 0 falls inside the interval (θ̂⁻, θ̂⁺); otherwise reject H_0.
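In R, both the test decision and the confidence interval for θ = µ_1 − µ_2 come out of a single call (a sketch on made-up data):

    x1 <- rnorm(20, mean = 10, sd = 2)   # results of algorithm A_1 (made-up)
    x2 <- rnorm(20, mean = 11, sd = 2)   # results of algorithm A_2 (made-up)
    tt <- t.test(x1, x2)                 # Welch two-sample t test, H_0: mu_1 - mu_2 = 0
    tt$p.value                           # reject H_0 if below alpha
    tt$conf.int                          # 95% CI for mu_1 - mu_2; reject H_0 if it excludes 0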
Kolmogorov-Smirnov Tests

The test compares empirical cumulative distribution functions.

[Figure: two empirical cumulative distribution functions F_1(x) and F_2(x).]

It uses the maximal difference between the two curves, sup_x |F_1(x) − F_2(x)|, and assesses how likely this value is under the null hypothesis that the two curves come from the same distribution.

The test can be used as a two-sample or a one-sample test (in the latter case, to test against a theoretical distribution: goodness of fit).
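A quick R illustration on made-up solution-cost samples (a sketch, not part of the slides):

    cost1 <- rnorm(50, mean = 35, sd = 3)        # costs returned by algorithm A_1 (made-up)
    cost2 <- rnorm(50, mean = 37, sd = 3)        # costs returned by algorithm A_2 (made-up)
    ks.test(cost1, cost2)                        # two-sample test: same distribution?
    ks.test(cost1, "pnorm", mean = 35, sd = 3)   # one-sample goodness-of-fit test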
Parametric vs. Nonparametric

Parametric assumptions:
- independence
- homoscedasticity
- normality N(µ, σ)

Nonparametric assumptions:
- independence
- homoscedasticity

Nonparametric tests:
- rank-based tests
- permutation tests: exact, conditional Monte Carlo
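When the normality assumption is doubtful, a rank-based test is the usual drop-in replacement; a sketch, reusing the made-up cost samples from the Kolmogorov-Smirnov example above:

    t.test(cost1, cost2)        # parametric: assumes (approximate) normality
    wilcox.test(cost1, cost2)   # rank-based (Wilcoxon-Mann-Whitney): no normality assumed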