on combining significances trivial examples
play

On Combining Significances. Trivial examples. S.I. Bityukov and N.V. - PDF document

PhyStat 2011 CERN, Geneva, January 18, 2011 On Combining Significances. Trivial examples. S.I. Bityukov and N.V. Krasnikov Institute for nuclear research RAS, Moscow, Russia Suppose one experiment sees a 3-sigma effect and another sees a


  1. PhyStat 2011 CERN, Geneva, January 18, 2011 On Combining Significances. Trivial examples. S.I. Bityukov and N.V. Krasnikov Institute for nuclear research RAS, Moscow, Russia “Suppose one experiment sees a 3-sigma effect and another sees a 4-sigma effect. What is combined sig- nificance?” (R. Cousins (2008). His conclusion - the question is not well-posed.) In this talk we discuss the problem of significances combining. Plan • Introduction • Combining methods • Examples • Weighted combination • The case of small statistics • Systematics • Conclusion 1

  2. PhyStat 2011 CERN, Geneva, January 18, 2011 Introduction Significance S is related with the probability by the relation (one-sided tail probability) S = Φ − 1 (1 − p ) = − Φ − 1 ( p ) , 1 + erf ( S 2 ) √ 1 � S − t 2 2 dt = √ Φ( S ) = −∞ e , 2 2 π √ 2 erf − 1 (1 − 2 p ) . S = For example, S = 5 corresponds to a p − value of 2 . 9 · 10 − 7 . ⇒ p = 3 . 2 · 10 − 5 . S = 4 = S = 3 = ⇒ p = 0 . 0013 . S = 2 = ⇒ p = 0 . 023 . S = 1 = ⇒ p = 0 . 16 . So the problem – how to combine a set of a p − values or a set of of S − significances ? 2

  3. PhyStat 2011 CERN, Geneva, January 18, 2011 Combining methods I There are several methods for significances combin- ing: 1. Fisher’s (R.A. Fisher, 1932) method based on the N choice of P = i =1 p i . A simple way to combine p i is � the use of relation i ln p i = χ 2 2 N,p , where χ 2 − 2 ν,p denotes the upper p � point of the probability integral of a central chi- squared of ν degrees of freedom. For two p 1 and p 2 combining (for example, F. James (2006)) p ( p 1 , p 2 ) = p 1 p 2 (1 − ln ( p 1 p 2 )) . Plus a lot of generalizations: I.J. Good (1955), H. Lancaster (1961) et al. 2. Tippett’s (L. Tippett, 1931) method using the small- est p i p = 1 − (1 − ( min p i )) N ≈ N · min p i . Plus generalizations: B. Wilkinson (1991), . . . . 3

  4. PhyStat 2011 CERN, Geneva, January 18, 2011 Combining methods II 3. Stouffer’s (S. Stouffer et al., 1949) method adding the inverse normal of the p i ’s √ � Φ − 1 ( p i ) = N Φ − 1 ( p ) , equivalently � S i √ S = N . Plus generalizations: F. Mosteller and R. Bush (1954), T. Liptak (1958), . . . , S. Bityukov et al. (2006). Here we shall compare the Fisher and Stouffer ap- proaches. Our conclusion: For high energy physics with Pois- son distribution in many cases the Stouffer approach is more natural. 4

  5. PhyStat 2011 CERN, Geneva, January 18, 2011 Example I Suppose the CMS experiment for some signature measures in july (2010) 10300 events and in august it detects 9700 events with the theoretical expectation λ july = λ august = 10000 in Poisson distribution Pois ( n, λ ) = λ n n ! e − λ . For λ ≫ 1 , n obs ≫ 1 Poisson distribution is approx- imated by normal distribution with mean µ = λ and variance σ 2 = λ and we find that S july = | 10300 − 10000 | √ = 3 , 10000 S august = | 9700 − 10000 | √ = 3 . 10000 If we analyze data for july plus august we find (using the fact that the sum of two Poisson processes with λ 1 and λ 2 is a Poisson process with λ = λ 1 + λ 2 ) that S july + august = | n obs,july + n obs,august − λ july − λ august | = 0 � λ july + λ august in perfect agreement with a theory. 5

  6. PhyStat 2011 CERN, Geneva, January 18, 2011 Example I If we combine significances using formula p ( p 1 , p 2 ) = p 1 p 2 (1 − ln ( p 1 p 2 )) , we find that (remember that S = 3 = ⇒ p i = 0 . 0013 , i = 1 , 2 ) p = 0 . 000026 ; S Fisher july + august = 4 . 05 Example II For other example with n july = 10300 , n august = 10400 λ july = λ august = 10000 we find that S july = 3 , S august = 4 , and S july + august = 3 + 4 √ 2 ≈ 4 . 95 . Fisher’s method gives: p july + august = 0 . 00000077 ; S Fisher july + august = 4 . 80 6

  7. PhyStat 2011 CERN, Geneva, January 18, 2011 Weighted combination For Poisson distribution with n obs ≫ 1 and λ ≫ 1 (when we can approximate this distribution by Gaus- sian) we can define significance as S 1 = n obs 1 − n 0 1 , S 2 = n obs 2 − n 0 2 . σ 1 σ 2 Here S i > 0 corresponds for excess of events, S i < 0 corresponds for shortage of events and σ i = √ λ i , i = 1 , 2 . Usualy people use S = | n obs − n 0 | . σ The rule for significances combining is S ( S 1 , S 2 ) = S 1 σ 1 + S 2 σ 2 . � σ 2 1 + σ 2 2 For general case S ( S 1 , S 2 , . . . , S n ) = S 1 σ 1 + . . . + S n σ 2 . σ 2 1 + . . . + σ 2 � 2 7

  8. PhyStat 2011 CERN, Geneva, January 18, 2011 The case of small statistics Consider now the case of small statistics. Namely, as an example, consider CMS experiment with n obs,july = n obs,august = 1 and λ = λ august = λ ≪ 1 . Then the proba- bility to observe n ≥ 1 events is determined by ∞ Pois ( n ≥ 1 , λ ) = i =1 Pois ( i, λ ) ≈ λ . � Correspondingly, i =2 Pois ( i, λ ) ≈ λ 2 ∞ Pois ( n ≥ 2 , λ ) = � 2 and we find that P july = λ july , P august = λ august , ( λ july + λ august ) 2 ( P july + P august ) 2 P july + august = = . 2 2 A. For P july = P august = 0 . 023 (2 σ : one − side ) P july + august = 0 . 001 (3 . 07 σ ) . Compare with Fisher’s formula P Fisher july + august = P july · P august (1 − ln ( P july · P august )) . Fisher’s method gives the value P Fisher july + august = 0 . 0044 (2 . 62 σ ) . 8

  9. PhyStat 2011 CERN, Geneva, January 18, 2011 The case of small statistics B. For the case S july = 3 σ , S august = 4 σ , n july = 1 and n august = 1 P july + august = 0 . 95 · 10 − 6 (4 . 76 σ ) july + august = 0 . 77 · 10 − 6 (4 . 81 σ ) P Fisher C. For the case S july = 3 σ , S august = 3 σ , n july = 1 and n august = 1 P july + august = 0 . 36 · 10 − 5 (4 . 49 σ ) july + august = 0 . 26 · 10 − 4 (4 . 05 σ ) P Fisher In general for n obs ≫ λ n = n 1 P ( n, λ 1 ) ≈ λ n 1 ∞ 1 n 1 ! e − λ 1 ; � n = n 2 P ( n, λ 2 ) ≈ λ n 2 ∞ 2 n 2 ! e − λ 2 � and P ( n, λ 1 + λ 2 ) ≈ ( λ 1 + λ 2 ) n 1 + n 2 ∞ e − ( λ 1 + λ 2 ) . � n 1 + n 2 ! n = n 1 + n 2 9

  10. PhyStat 2011 CERN, Geneva, January 18, 2011 Systematics Let us consider the influence of systematic effects related with nonexact knowledge of parameter λ in Poisson formula. Suppose n obs,july = n obs,august = 600 , λ july = λ august = 300 , and ǫ = 1 3 (uncertainty in the parameter λ determina- tion). For such case the significance is determined by ap- proximate formula n obs − λ S = λ + ( ǫλ ) 2 . � According to this formula 300 S july = S august = 300 + (100) 2 ≈ 3 . � For july+august 600 S july + august = 600 + (2 · 100) 2 ≈ 3 . � So we find that july+august combining does not help to increase significance, since the systematic er- ror ∼ ( ǫλ ) 2 dominates. 10

  11. PhyStat 2011 CERN, Geneva, January 18, 2011 Conclusions To conclude we think that in many cases for Poisson statistics the most natural rules of significances com- bining is the use of the fact that the sum of Poisson processes is the Poisson process.  Pois ( n 1 , λ 1 )      Pois ( n 2 , λ 2 )     � n i , � λ i ) = ⇒ Pois ( . . .       Pois ( n n , λ n )    In fact it is natural generalization of the original Stouffer method. 11

  12. PhyStat 2011 CERN, Geneva, January 18, 2011 References [1] R.D. Cousins, Annotated Bibliography of Some Papers on Combining Significances or p -values , arXiv:0705.2209 [physics.data-an] . [2] R.A. Fisher, Statistical Methods for Research Workers , Hafner, Darien, Connecticut, 14th edition, 1970. The method of combining significances seems to have appeared in the 4th edition of 1932. [3] F. James, Statistical Methods in Experimental Physics , 2nd edition, World Scientific, 2006. [4] I.J. Good, On the weighted combination of significance tests , Journal of the Royal Statistical Society. Series B (Methodological) 17(2):264-265. [5] H. Lancaster, The combination of probabilities: an applica- tion of orthinormal functions . Australian Journal of Statis- tics, 3 , 20-33, 1961. [6] L. Tippett, The Methods of Statistics , Williams and Nor- gate, Ltd., London, 1st edition. Sec. 3.5, 53-6, 1931, as cited by Birnbaum and by Westberg. [7] B. Wilkinson, A statistical consideration in psychological research , Psychological Bulletin , 48 , 156-158, 1951. [8] S. Stouffer, E. Suchman, L. DeVinnery, S. Star, and R.W. Jr, The American Soldier , volume I: Adjustment during Army Life. Princeton University Press, 1949. [9] F. Mosteller and R. Bush, Selected quantitative techniques , in ed. G. Lindzey, Handbookof Social Psychology vol. I , 289-334, Addison-Wesley, Cambridge, Mass., 1954. 12

  13. PhyStat 2011 CERN, Geneva, January 18, 2011 [10] T. Liptak, On the combination of independent tests , Maguar Tud. Akad. Mat. Kutato Int. Kozl. , 3 , 171-197, 1958. [11] S.Bityukov, N.Krasnikov and A.Nikitenko, On the combin- ing significances, arXiv:physics/0612178(2006); S.Bityukov, N.Krasnikov, A.Nikitenko and V.Smirnova, Two approaches to combining significances, PoS ACAT08:118(2008). 13

Recommend


More recommend