POWER AND DESIGN Business Statistics
CONTENTS Two types of error The power of a test Experimental design Choosing sample size Power curves Power and โbig dataโ Old exam question Further study
TWO TYPES OF ERROR What is the precise meaning of the significance level ๐ฝ ? โช ๐ฝ is the maximum acceptable probability of rejecting the null hypothesis when it is in fact true โช so ๐ reject ๐ผ 0 ๐ผ 0 To be read as: โthe conditional probability There are two possible decisions: of rejecting ๐ผ 0 , given that โช reject ๐ผ 0 ๐ผ 0 is trueโ โช do not reject ๐ผ 0 And there are two possible realities: โช ๐ผ 0 is true โช ๐ผ 0 is not true
TWO TYPES OF ERROR Organize the situations in a 2 ร 2 -table: Correct decision โช no further concern, because OK Wrong decision โช type I error โช type II error
TWO TYPES OF ERROR Probability of a type I error: ๐ reject ๐ผ 0 ๐ผ 0 โค ๐ฝ โช conventionally ๐ฝ = 5% โช but you may choose another significance level if you think that is appropriate โช e.g., aircraft safety: better use ๐ฝ = 1% or even ๐ฝ = 0.1% โช e.g., choosing a colour of your shampoo flask: ๐ฝ = 10% is OK โช Anyhow, you control the maximum type I error by choosing ๐ฝ in advance โช and accept that you will once in a while reject ๐ผ 0 while it is true
TWO TYPES OF ERROR Probability of a type II error: ๐ do not reject ๐ผ 0 specific ๐ผ 1 = ๐พ for instance, ๐ผ 0 : ๐ = 10 vs. โช how to choose ๐พ ? ๐ผ 1 : ๐ = 12 โช You cannot simply control the maximum type II error ๐พ โช because it depends on the true value of the unknown (!) parameter (that is to be tested itself) โช as well as on ๐ฝ โช but it could be good to know it, at least โช usually, this is done through the power concept
TWO TYPES OF ERROR Controlling ๐ฝ and ๐พ is important โช Example: airport security โช ๐ผ 0 : bag does not contain a weapon โช Type I error: โช the bag did not contain a weapon, but the machine said it did โช loss of time in manual search, offended clients, delayed flights โช management will try to minimize the probability of this error type โช Type II error: โช the bag did contain a weapon, but the machine did not detect it โช hijacks, loss of crew and aircraft, liability claims, loss of credibility โช management will try to minimize the probability of this error type
TWO TYPES OF ERROR There is a trade-off: โช minimizing ๐ฝ typically leads to increasing ๐พ โช minimizing ๐พ typically leads to increasing ๐ฝ ๐ฝ and ๐พ can only be decreased simultaneously by changing the set-up of the research โช most importantly, by increasing sample size
EXERCISE 1 What error do we make? a. A population has ๐ = 120 . We reject ๐ผ 0 : ๐ โค 125 . b. A population has ๐ = 0.3 . We do not reject ๐ผ 0 : ๐ โค 0.4 . c. A population has ๐ 2 = 2.5 . We do not reject ๐ผ 0 : ๐ 2 โค 2 .
EXPERIMENTAL DESIGN In business and politics, a lot depends on what clients, the market and the public require So you do experiments: โช market surveys โช questionnaires โช customer cards โช polls โช website analysis (tracking cookies, etc) โช etc.
EXPERIMENTAL DESIGN How to set up such experiments โช qualitative research methods (interview techniques, etc.) โช quantitative research methods (choosing sample size, etc.) Here we will focus on sample size for ๐ and ๐
าง CHOOSING SAMPLE SIZE Recall that the confidence interval of a mean ๐ is ๐ ๐ ๐ฆ โ ๐จ ๐ฝ/2 ๐ , าง ๐ฆ + ๐จ ๐ฝ/2 ๐ โช This means that the width of the confidence interval scales 1 with a factor ๐ If we need to estimate ๐ with a (minimal) precision of ยฑ๐น (the allowable error), you would need a certain (minimal) sample size
CHOOSING SAMPLE SIZE This gives ๐ ๐น = ๐จ ๐ฝ/2 ๐ โช so 2 ๐ ๐ = ๐จ ๐ฝ/2 ๐น Example โช to realize a 95% confidence interval for the mean of a population with ๐ = 3 with precision ๐น = 1 , ๐ = 34.5 , so use ๐ = 35 always round to the higher value in determining sample size
CHOOSING SAMPLE SIZE Observe that we need to know ๐ โช how to know it? Three suggestions: โช take a small preliminary sample and use the sample ๐ก instead of ๐ in the formula โช estimate rough upper and lower limits ๐ and ๐ and set ๐ = ๐โ๐ based on a uniform 12 distribution โช estimate rough upper and lower limits ๐ and ๐ and set ๐ = ๐โ๐ based on the fact that most of the values od a normal distribution are 4 between ๐ โ 2๐ and ๐ + 2๐
CHOOSING SAMPLE SIZE Likewise, the confidence interval of a proportion ๐ is ๐ 1 โ ๐ ๐ 1 โ ๐ ๐ โ ๐จ ๐ฝ/2 , ๐ + ๐จ ๐ฝ/2 ๐ ๐ This gives ๐ 1 โ ๐ ๐น = ๐จ ๐ฝ/2 ๐ โช so 2 ๐จ ๐ฝ/2 ๐ = ๐ 1 โ ๐ ๐น
CHOOSING SAMPLE SIZE Observe that we need to know ๐ โช so to determine the sample size to estimate ๐ , you need ๐ Four suggestions โช take a small preliminary sample and use the sample ๐ instead of ๐ in the sample size formula โช take a small preliminary sample, find a confidence interval and from this interval use the value closest to 0.5 instead of ๐ in the sample size formula โช use a prior sample or historical data this conservative method ensures the โช assume that ๐ = 0.50 desired precision ( ๐ 1 โ ๐ has a maximum at ๐ = 0.50 )
EXERCISE 2 We ask a sample of persons if they are in favor or against Brexit. We want to deduce the proportion in favor, with a margin of no more than ยฑ2% . What sample size to use?
POWER โช Suppose we test a one-sample mean โช the null hypothesis is ๐ผ 0 : ๐ = ๐ โ๐ง๐ = 3 โช at a significance level ๐ฝ = 0.05 โช If the true parameter is ๐ = ๐ ๐ข๐ ๐ฃ๐ = 3 โช there is a probability ๐ฝ = 0.05 to reject ๐ผ 0 โช so this is the probability to make a type I error
POWER โช But if the true parameter is ๐ = ๐ ๐ข๐ ๐ฃ๐ = 3.1 instead โช there is a larger probability to reject ๐ผ 0 โช which is the correct decision ๐ โช how large depends on ๐ and ๐ (recall ๐จ ๐ฝ/2 ๐ ) โช And if the true parameter is ๐ = ๐ ๐ข๐ ๐ฃ๐ = 10 instead โช there is an even larger probability to reject ๐ผ 0 โช which is the correct decision
POWER So, the probability of a rejecting an incorrect ๐ผ 0 on the mean depends on โช the pre-defined probability ๐ฝ of not rejecting a correct ๐ผ 0 โช the sample size ๐ โช the standard deviation of the popolution ๐ โช the difference between the hypothesized ๐ ( ๐ โ๐ง๐ ) and the true ๐ ( ๐ ๐ข๐ ๐ฃ๐ )
POWER Power is defined as the probability of rejecting ๐ผ 0 when it should indeed be rejected โช when a specific ๐ผ 1 is true So: power = ๐ reject ๐ผ 0 specific ๐ผ 1 Therefore: power = 1 โ ๐ do not reject ๐ผ 0 specific ๐ผ 1 = 1 โ ๐พ
POWER To calculate the power of a test for the mean, you need โช the significance level ๐ฝ (you choose it) โช the sample size ๐ (you choose it) โช the standard deviation ๐ โช the hypothesized mean ๐ โ๐ง๐ (you choose it) โช the true mean ๐ ๐ข๐ ๐ฃ๐ (you have no clue) Therefore, we typically do not calculate power โช but rather calculate a power function or power curves for different values of ๐ ๐ข๐ ๐ฃ๐
POWER CURVES For a fixed ๐ผ 0 : ๐ = ๐ โ๐ง๐ what happens for different values P (R e j e c t H o ) 1 of ๐ ๐ข๐ ๐ฃ๐ ๏ข = P (t y p e I I e r r o r ) ๏ฎ ๐ -axis: different options for ๐ ๐ข๐ ๐ฃ๐ ๐ 0 = ๐ โ๐ง๐ ๏ก 0 ๏ญ ๏ฎ ๏ญ 1 ๏ญ 0 one specific ๐ 1 = ๐ ๐ข๐ ๐ฃ๐
POWER CURVES Effect of different values of ๐ ๐ข๐ ๐ฃ๐ : ๐ ๐ข๐ ๐ฃ๐ = 6 ๏ญ =3 ๏ญ =6 0.4 0.3 0.2 0.1 0 - 1 0 1 2 3 4 5 6 7 8 9 10 1 p o w e r 0.8 0.6 0.4 0.2 0 - 1 0 1 2 3 4 5 6 7 8 9 10 ๏ก =0.05 T W O - S I D E D T E S T I N G
POWER CURVES Effect of different values of ๐ ๐ข๐ ๐ฃ๐ : ๐ ๐ข๐ ๐ฃ๐ = 5 ๏ญ =3 ๏ญ =5 0.4 0.3 0.2 0.1 0 - 1 0 1 2 3 4 5 6 7 8 9 10 1 0.8 0.6 p o w e r 0.4 0.2 0 - 1 0 1 2 3 4 5 6 7 8 9 10 ๏ก =0.05 T W O - S I D E D T E S T I N G
POWER CURVES Effect of different values of ๐ ๐ข๐ ๐ฃ๐ : ๐ ๐ข๐ ๐ฃ๐ = 4 ๏ญ =3 ๏ญ =4 0.4 0.3 0.2 0.1 0 - 1 0 1 2 3 4 5 6 7 8 9 10 1 0.8 0.6 0.4 0.2 p o w e r 0 - 1 0 1 2 3 4 5 6 7 8 9 10 ๏ก =0.05 T W O - S I D E D T E S T I N G
POWER CURVES Effect of different values of ๐ ๐ข๐ ๐ฃ๐ : ๐ ๐ข๐ ๐ฃ๐ = 3.4 ๏ญ =3 ๏ญ =3.4 0.4 0.3 0.2 0.1 0 - 1 0 1 2 3 4 5 6 7 8 9 10 1 0.8 0.6 0.4 0.2 p o w e r 0 - 1 0 1 2 3 4 5 6 7 8 9 10 ๏ก =0.05 T W O - S I D E D T E S T I N G
POWER CURVES Effect of different values of ๐ฝ : ๐ฝ = 0.05 ๏ญ =3 ๏ญ =5.5 0.4 0.3 0.2 0.1 0 - 1 0 1 2 3 4 5 6 7 8 9 10 1 0.8 p o w e r 0.6 0.4 0.2 0 - 1 0 1 2 3 4 5 6 7 8 9 10 ๏ก =0.05 T W O - S I D E D T E S T I N G
POWER CURVES Effect of different values of ๐ฝ : ๐ฝ = 0.1 ๏ญ =3 ๏ญ =5.5 0.4 0.3 0.2 0.1 0 - 1 0 1 2 3 4 5 6 7 8 9 10 1 p o w e r 0.8 0.6 0.4 0.2 0 - 1 0 1 2 3 4 5 6 7 8 9 10 ๏ก =0.1 T W O - S I D E D T E S T I N G
Recommend
More recommend