business statistics
play

Business Statistics CONTENTS Comparing the variance of two - PowerPoint PPT Presentation

TWO 2 S: COMPARISONS Business Statistics CONTENTS Comparing the variance of two populations The -distribution The -test Levenes test Old exam question Further study COMPARING THE VARIANCE OF TWO POPULATIONS So far, the


  1. TWO 𝜏 2 S: COMPARISONS Business Statistics

  2. CONTENTS Comparing the variance of two populations The 𝐺 -distribution The 𝐺 -test Levene’s test Old exam question Further study

  3. COMPARING THE VARIANCE OF TWO POPULATIONS β–ͺ So far, the emphasis was on differences in centrality β–ͺ There are also questions on differences in dispersion β–ͺ Context: β–ͺ you can choose between two drilling machines β–ͺ both make holes of the specified size β–ͺ but the precision of the two may be different β–ͺ so you do an experiment (intended hole size: 3 mm)

  4. COMPARING THE VARIANCE OF TWO POPULATIONS β–ͺ A second case β–ͺ Recall that we can compare two population means under the assumption of equal population variances β–ͺ using the pooled variance β–ͺ Thus, we may need to check if the populations variances are indeed equal

  5. COMPARING THE VARIANCE OF TWO POPULATIONS Test statistic to consider 2 and S 𝑍 2 β–ͺ A combination of S π‘Œ 2 is problematic 2 βˆ’ 𝑇 𝑍 β–ͺ but the null distribution of 𝑇 π‘Œ 2 𝑇 π‘Œ β–ͺ Instead, 2 is possible 𝑇 𝑍 2 𝑇 𝑍 β–ͺ or 2 (sometimes easier) 𝑇 π‘Œ These hypotheses are β–ͺ What is the hypothesis? 2 equivalent to 𝑇 π‘Œ 2 = 𝜏 𝑍 2 = 1 2 β–ͺ 𝐼 0 : 𝜏 π‘Œ 𝑇 𝑍 2 β‰₯ 𝜏 𝑍 (or β‰₯ 1 or ≀ 1 ) 2 β–ͺ 𝐼 0 : 𝜏 π‘Œ 2 ≀ 𝜏 𝑍 2 β–ͺ 𝐼 0 : 𝜏 π‘Œ 2 = 𝜏 𝑍 2 + 3 etc. is not possible! β–ͺ but 𝜏 π‘Œ

  6. THE 𝐺 -DISTRIBUTION β–ͺ For normally distributed populations, it is known that: 2 = 𝜏 𝑍 β–ͺ under 𝐼 0 : 𝜏 π‘Œ 2 : 2 𝑇 π‘Œ 2 ~𝐺 π‘œ π‘Œ βˆ’1,π‘œ 𝑍 βˆ’1 𝑇 𝑍 β–ͺ where 𝐺 df 1 ,df 2 is the F -distribution β–ͺ with df 1 and df 2 degrees of freedom β–ͺ note: 𝐺 is a ratio of two variances, use df 1 for the numerator and df 2 for the denominator

  7. THE 𝐺 -DISTRIBUTION β–ͺ So, we compute the value of a test statistic 2 𝐺 calc = 𝑑 π‘Œ 2 𝑑 𝑍 β–ͺ and expect it to be around 1 if 𝐼 0 is true β–ͺ and reject 𝐼 0 if 𝐺 calc is β€œtoo” small or β€œtoo” large β–ͺ we need to look up the critical values of the 𝐺 -distribution 𝐺 crit,lower 1 𝐺 𝑑𝑠𝑗𝑒,π‘£π‘žπ‘žπ‘“π‘  Of course (!) you expect 𝐺 = 1 when 𝐼 0 is true

  8. THE 𝐺 -DISTRIBUTION β–ͺ 𝐺 -distribution β–ͺ like πœ“ 2 not symmetrical and strictly positive β–ͺ need to find 𝐺 crit,lower and 𝐺 crit,upper

  9. THE 𝐺 -DISTRIBUTION df 1 Looking up critical values for 𝐺 𝛽/2 Is this 𝐺 crit,lower or 𝐺 crit,upper ? df 2

  10. THE 𝐺 -DISTRIBUTION β–ͺ Finding 𝐺 crit,lower when you know 𝐺 crit,upper 1 𝐺 crit,lower df 1 , df 2 = 𝐺 crit,upper df 2 , df 1 1 β–ͺ so 𝐺 crit,lower = 3.85 = 0.26 2 2 𝑇 1 2 > 𝑏 ⇔ 𝑇 2 2 < 1 𝑇 2 𝑇 1 𝑏

  11. THE 𝐺 -TEST Step 1: 2 = 𝜏 2 2 β‰  𝜏 2 2 ; 𝐼 1 : 𝜏 1 2 ; 𝛽 = 0.05 β–ͺ 𝐼 0 : 𝜏 1 Step 2: 2 𝑇 1 β–ͺ sample statistic: 𝐺 = 2 ; reject for β€œtoo small” and β€œtoo large” 𝑇 2 values Step 3: β–ͺ null distribution: 𝐺~𝐺 df 1 ,df 2 β–ͺ both populations must be normally distributed (no CLT here!) Step 4: 2 𝑑 1 2 ; 𝐺 crit,lower = β‹― ; 𝐺 crit,upper = β‹― (use 𝛽/2 ) β–ͺ 𝐺 calc = 𝑑 2 Step 5: β–ͺ reject 𝐼 0 if 𝐺 calc < 𝐺 crit,lower or 𝐺 calc > 𝐺 crit,upper

  12. THE 𝐺 -TEST Example: 2 = 0.012 with π‘œ 1 = 6 β–ͺ machine 1 gives 𝑑 1 2 = 0.016 with π‘œ 2 = 7 β–ͺ machine 2 gives 𝑑 2 Computations: upper critical values 0.012 can be read in the 𝐺 - β–ͺ 𝐺 calc = 0.016 = 0.75 table, so easier to look up 𝐺 crit,upper β–ͺ swap the two machines 0.016 β–ͺ 𝐺 calc = 0.012 = 1.33 β–ͺ null distribution: 𝐺~𝐺 6,5 β–ͺ 𝐺 crit;upper = 𝐺 6,5;0.025 = 6.98 β–ͺ 𝐺 calc < 𝐺 crit;upper , do not reject 𝐼 0

  13. THE 𝐺 -TEST 2 = 𝜏 2 Trick to avoid the use of 𝐺 crit,lower in 𝐼 0 : 𝜏 1 2 β–ͺ put largest sample variance in numerator 2 𝑑 1 β–ͺ so calculated value of test statistic is 𝐺 calc = 2 or 𝐺 calc = 𝑑 2 2 𝑑 2 2 𝑑 1 β–ͺ with this, 𝐺 π‘‘π‘π‘šπ‘‘ > 1 , so we need only consider 𝐺 𝑑𝑠𝑗𝑒,π‘£π‘žπ‘žπ‘“π‘  β–ͺ because 𝐺 𝑑𝑠𝑗𝑒,π‘šπ‘π‘₯𝑓𝑠 is always < 1 β–ͺ formally reject for β€œtoo large” and β€œtoo small” β–ͺ so keep using 𝛽/2 and not 𝛽 (it is still a two-sided test)

  14. THE 𝐺 -TEST One-sided tests: what is different compared to two-sided? Step 1: 2 β‰₯ 𝜏 2 2 < 𝜏 2 2 ; 𝐼 1 : 𝜏 1 2 ; 𝛽 = 0.05 β–ͺ 𝐼 0 : 𝜏 1 2 ≀ 𝜏 1 2 > 𝜏 1 β–ͺ reformulate as 𝐼 0 : 𝜏 2 2 ; 𝐼 1 : 𝜏 2 2 ; 𝛽 = 0.05 Step 2: 2 𝑇 2 β–ͺ sample statistic: 𝐺 = 2 ; reject for β€œtoo large” values 𝑇 1 Step 3: β–ͺ null distribution: 𝐺~𝐺 df 2 ,df 1 β–ͺ both populations must be normally distributed Step 4: 2 𝑑 2 2 ; 𝐺 crit,upper = β‹― (use 𝛽 ) β–ͺ 𝐺 calc = 𝑑 1 Step 5: β–ͺ reject 𝐼 0 if 𝐺 π‘‘π‘π‘šπ‘‘ > 𝐺 crit,upper

  15. EXERCISE 1 Given a sample 1 with π‘œ 1 = 9 and 𝑑 1 = 4 and a sample 2 2 = 𝜏 2 2 . with π‘œ 2 = 7 and 𝑑 2 = 5 . We want to test 𝐼 0 : 𝜏 1 a. What is step 2? b. What is step 3?

  16. LEVENE’S TEST Recall that SPSS also did a comparison of two variances when we asked for comparing two means β–ͺ Levene’s test

  17. LEVENE’S TEST The Levene test β–ͺ is based on a different principle β–ͺ requires no normal populations (!) β–ͺ also yields an 𝐺 calc β–ͺ but with different values of df β–ͺ reject for large values only 2 = 𝜏 2 2 β–ͺ yields a π‘ž -value for 𝐼 0 : 𝜏 1

  18. LEVENE’S TEST β–ͺ When comparing two 𝜈 s with the 𝑒 -test, we needed to 2 and 𝜏 2 2 estimate 𝜏 1 2 by 𝑑 1 2 and 𝜏 2 2 by 𝑑 2 β–ͺ we could estimate 𝜏 1 2 2 to estimate both 2 and use the pooled 𝑑 P 2 = 𝜏 2 β–ͺ or assume 𝜏 1 β–ͺ The second is only allowed if the two sample variances 𝑑 1 2 2 are not β€œtoo unequal” and 𝑑 2 Usually, a low π‘ž - β–ͺ Levene’s test tests this value means bingo, but not so here ... β–ͺ a low π‘ž - value means: unequal, so don’t do it β–ͺ do not necessarily use the same 𝛽 Rule of thumb: β–ͺ Why Levene and not the β€œnormal” 𝐺 -test? use 𝛽 = 0.1 Levene is a non-parametric test ...

  19. OLD EXAM QUESTION 26 March 2015, Q1c

  20. FURTHER STUDY Doane & Seward 5/E 10.7 Tutorial exercises week 3 Fisher 𝐺 -test

Recommend


More recommend