On Combining Significances. Trivial examples. S.I. Bityukov and N.V. - PDF document

PhyStat 2011 CERN, Geneva, January 18, 2011 On Combining Significances. Trivial examples. S.I. Bityukov and N.V. Krasnikov Institute for nuclear research RAS, Moscow, Russia “Suppose one experiment sees a 3-sigma effect and another sees a 4-sigma effect. What is combined significance?” (R. Cousins (2008). His conclusion - the question is not well-posed.) In this talk we discuss the problem of significances combining. Plan • Introduction • Combining methods • Examples • Weighted combination • The case of small statistics • Systematics • Conclusion 1

PhyStat 2011 CERN, Geneva, January 18, 2011 Introduction Significance S is related with the probability by the relation (one-sided tail probability) S = Φ − 1 (1 − p ) = − Φ − 1 ( p ) , 1 + erf ( S 2 ) √ 1 � S − t 2 2 dt = √ Φ( S ) = −∞ e , 2 2 π √ 2 erf − 1 (1 − 2 p ) . S = For example, S = 5 corresponds to a p − value of 2 . 9 · 10 − 7 . ⇒ p = 3 . 2 · 10 − 5 . S = 4 = S = 3 = ⇒ p = 0 . 0013 . S = 2 = ⇒ p = 0 . 023 . S = 1 = ⇒ p = 0 . 16 . So the problem – how to combine a set of a p − values or a set of of S − significances ? 2

PhyStat 2011 CERN, Geneva, January 18, 2011 Combining methods I There are several methods for significances combining: 1. Fisher’s (R.A. Fisher, 1932) method based on the N choice of P = i =1 p i . A simple way to combine p i is � the use of relation i ln p i = χ 2 2 N,p , where χ 2 − 2 ν,p denotes the upper p � point of the probability integral of a central chi- squared of ν degrees of freedom. For two p 1 and p 2 combining (for example, F. James (2006)) p ( p 1 , p 2 ) = p 1 p 2 (1 − ln ( p 1 p 2 )) . Plus a lot of generalizations: I.J. Good (1955), H. Lancaster (1961) et al. 2. Tippett’s (L. Tippett, 1931) method using the small- est p i p = 1 − (1 − ( min p i )) N ≈ N · min p i . Plus generalizations: B. Wilkinson (1991), . . . . 3

PhyStat 2011 CERN, Geneva, January 18, 2011 Combining methods II 3. Stouffer’s (S. Stouffer et al., 1949) method adding the inverse normal of the p i ’s √ � Φ − 1 ( p i ) = N Φ − 1 ( p ) , equivalently � S i √ S = N . Plus generalizations: F. Mosteller and R. Bush (1954), T. Liptak (1958), . . . , S. Bityukov et al. (2006). Here we shall compare the Fisher and Stouffer approaches. Our conclusion: For high energy physics with Pois- son distribution in many cases the Stouffer approach is more natural. 4

PhyStat 2011 CERN, Geneva, January 18, 2011 Example I Suppose the CMS experiment for some signature measures in july (2010) 10300 events and in august it detects 9700 events with the theoretical expectation λ july = λ august = 10000 in Poisson distribution Pois ( n, λ ) = λ n n ! e − λ . For λ ≫ 1 , n obs ≫ 1 Poisson distribution is approx- imated by normal distribution with mean µ = λ and variance σ 2 = λ and we find that S july = | 10300 − 10000 | √ = 3 , 10000 S august = | 9700 − 10000 | √ = 3 . 10000 If we analyze data for july plus august we find (using the fact that the sum of two Poisson processes with λ 1 and λ 2 is a Poisson process with λ = λ 1 + λ 2 ) that S july + august = | n obs,july + n obs,august − λ july − λ august | = 0 � λ july + λ august in perfect agreement with a theory. 5

PhyStat 2011 CERN, Geneva, January 18, 2011 Example I If we combine significances using formula p ( p 1 , p 2 ) = p 1 p 2 (1 − ln ( p 1 p 2 )) , we find that (remember that S = 3 = ⇒ p i = 0 . 0013 , i = 1 , 2 ) p = 0 . 000026 ; S Fisher july + august = 4 . 05 Example II For other example with n july = 10300 , n august = 10400 λ july = λ august = 10000 we find that S july = 3 , S august = 4 , and S july + august = 3 + 4 √ 2 ≈ 4 . 95 . Fisher’s method gives: p july + august = 0 . 00000077 ; S Fisher july + august = 4 . 80 6

PhyStat 2011 CERN, Geneva, January 18, 2011 Weighted combination For Poisson distribution with n obs ≫ 1 and λ ≫ 1 (when we can approximate this distribution by Gaus- sian) we can define significance as S 1 = n obs 1 − n 0 1 , S 2 = n obs 2 − n 0 2 . σ 1 σ 2 Here S i > 0 corresponds for excess of events, S i < 0 corresponds for shortage of events and σ i = √ λ i , i = 1 , 2 . Usualy people use S = | n obs − n 0 | . σ The rule for significances combining is S ( S 1 , S 2 ) = S 1 σ 1 + S 2 σ 2 . � σ 2 1 + σ 2 2 For general case S ( S 1 , S 2 , . . . , S n ) = S 1 σ 1 + . . . + S n σ 2 . σ 2 1 + . . . + σ 2 � 2 7

PhyStat 2011 CERN, Geneva, January 18, 2011 The case of small statistics Consider now the case of small statistics. Namely, as an example, consider CMS experiment with n obs,july = n obs,august = 1 and λ = λ august = λ ≪ 1 . Then the probability to observe n ≥ 1 events is determined by ∞ Pois ( n ≥ 1 , λ ) = i =1 Pois ( i, λ ) ≈ λ . � Correspondingly, i =2 Pois ( i, λ ) ≈ λ 2 ∞ Pois ( n ≥ 2 , λ ) = � 2 and we find that P july = λ july , P august = λ august , ( λ july + λ august ) 2 ( P july + P august ) 2 P july + august = = . 2 2 A. For P july = P august = 0 . 023 (2 σ : one − side ) P july + august = 0 . 001 (3 . 07 σ ) . Compare with Fisher’s formula P Fisher july + august = P july · P august (1 − ln ( P july · P august )) . Fisher’s method gives the value P Fisher july + august = 0 . 0044 (2 . 62 σ ) . 8

PhyStat 2011 CERN, Geneva, January 18, 2011 The case of small statistics B. For the case S july = 3 σ , S august = 4 σ , n july = 1 and n august = 1 P july + august = 0 . 95 · 10 − 6 (4 . 76 σ ) july + august = 0 . 77 · 10 − 6 (4 . 81 σ ) P Fisher C. For the case S july = 3 σ , S august = 3 σ , n july = 1 and n august = 1 P july + august = 0 . 36 · 10 − 5 (4 . 49 σ ) july + august = 0 . 26 · 10 − 4 (4 . 05 σ ) P Fisher In general for n obs ≫ λ n = n 1 P ( n, λ 1 ) ≈ λ n 1 ∞ 1 n 1 ! e − λ 1 ; � n = n 2 P ( n, λ 2 ) ≈ λ n 2 ∞ 2 n 2 ! e − λ 2 � and P ( n, λ 1 + λ 2 ) ≈ ( λ 1 + λ 2 ) n 1 + n 2 ∞ e − ( λ 1 + λ 2 ) . � n 1 + n 2 ! n = n 1 + n 2 9

PhyStat 2011 CERN, Geneva, January 18, 2011 Systematics Let us consider the influence of systematic effects related with nonexact knowledge of parameter λ in Poisson formula. Suppose n obs,july = n obs,august = 600 , λ july = λ august = 300 , and ǫ = 1 3 (uncertainty in the parameter λ determina- tion). For such case the significance is determined by approximate formula n obs − λ S = λ + ( ǫλ ) 2 . � According to this formula 300 S july = S august = 300 + (100) 2 ≈ 3 . � For july+august 600 S july + august = 600 + (2 · 100) 2 ≈ 3 . � So we find that july+august combining does not help to increase significance, since the systematic er- ror ∼ ( ǫλ ) 2 dominates. 10

PhyStat 2011 CERN, Geneva, January 18, 2011 Conclusions To conclude we think that in many cases for Poisson statistics the most natural rules of significances combining is the use of the fact that the sum of Poisson processes is the Poisson process.  Pois ( n 1 , λ 1 )      Pois ( n 2 , λ 2 )     � n i , � λ i ) = ⇒ Pois ( . . .       Pois ( n n , λ n )    In fact it is natural generalization of the original Stouffer method. 11

PhyStat 2011 CERN, Geneva, January 18, 2011 References [1] R.D. Cousins, Annotated Bibliography of Some Papers on Combining Significances or p -values , arXiv:0705.2209 [physics.data-an] . [2] R.A. Fisher, Statistical Methods for Research Workers , Hafner, Darien, Connecticut, 14th edition, 1970. The method of combining significances seems to have appeared in the 4th edition of 1932. [3] F. James, Statistical Methods in Experimental Physics , 2nd edition, World Scientific, 2006. [4] I.J. Good, On the weighted combination of significance tests , Journal of the Royal Statistical Society. Series B (Methodological) 17(2):264-265. [5] H. Lancaster, The combination of probabilities: an applica- tion of orthinormal functions . Australian Journal of Statis- tics, 3 , 20-33, 1961. [6] L. Tippett, The Methods of Statistics , Williams and Nor- gate, Ltd., London, 1st edition. Sec. 3.5, 53-6, 1931, as cited by Birnbaum and by Westberg. [7] B. Wilkinson, A statistical consideration in psychological research , Psychological Bulletin , 48 , 156-158, 1951. [8] S. Stouffer, E. Suchman, L. DeVinnery, S. Star, and R.W. Jr, The American Soldier , volume I: Adjustment during Army Life. Princeton University Press, 1949. [9] F. Mosteller and R. Bush, Selected quantitative techniques , in ed. G. Lindzey, Handbookof Social Psychology vol. I , 289-334, Addison-Wesley, Cambridge, Mass., 1954. 12

PhyStat 2011 CERN, Geneva, January 18, 2011 [10] T. Liptak, On the combination of independent tests , Maguar Tud. Akad. Mat. Kutato Int. Kozl. , 3 , 171-197, 1958. [11] S.Bityukov, N.Krasnikov and A.Nikitenko, On the combining significances, arXiv:physics/0612178(2006); S.Bityukov, N.Krasnikov, A.Nikitenko and V.Smirnova, Two approaches to combining significances, PoS ACAT08:118(2008). 13

On Combining Significances. Trivial examples. S.I. Bityukov and N.V. - PDF document

PhyStat 2011 CERN, Geneva, January 18, 2011 On Combining Significances. Trivial examples. S.I. Bityukov and N.V. Krasnikov Institute for nuclear research RAS, Moscow, Russia Suppose one experiment sees a 3-sigma effect and another sees a

JUST THE MATHS SLIDES NUMBER 7.4 DETERMINANTS 4 (Homogeneous linear equations) by

Hillas Plot: trivial and non-trivial implications Felix Aharonian 1984 Hillas Plot : B L

Combining Models Oliver Schulte - CMPT 726 Bishop PRML Ch. 14 Combining Models: Some Theory

About trivial software patents: the IsNot case Jan Bergstra

Computing Shortest Non-Trivial Cycles on Orientable Surfaces of Bounded Genus in Almost Linear

Data Dependence in Data Dependence in Combining Classifiers Combining Classifiers Mohamed

Validity-preservation properties of rules for combining inferential models combining

Feature Extraction Combining Feature Extraction Combining Spectral Noise Reduction and Spectral

General Schemes of Combining Rules and the Quality Characteristics of Combining Alexander Lepskiy

Combining GLM and ABI Data for Enhanced GOES-R Rainfall Estimates A New GOES-R3 Project (combining

Combining Local and Global History for High Combining Local and Global History for High

Combining XML querying Combining XML querying with ontology reasoning: with ontology reasoning:

Reducing Label Cost by Combining Feature Labels and Crowdsourcing Combining Learning Strategies

Combining Point and Line Samples for Direct Illumination Points only Points + Lines Katherine

Combining Combining Constraint Programming Constraint Programming and Integer Programming and

The Calculus of Computation: Decision Procedures with 10. Combining Decision Procedures

Projects: IoT based on Cloud Ruixuan Dai Spring, 2020 Internet-of-Things Things (Devices) q

Twins 47% A. Bill Clinton B. Jeremy Irons A Generalists 28% C. Scarlett Johansson 24%

Range Maxima Queries (presented at ICALP 2011) Gerth Stlting Brodal Aarhus University Kostas

real large-scale randomised controlled trial Vic Menzies Research Trial Officer Centre for

Day 2: Diving Deeper into Day 2: Diving Deeper into Data Visualization with R Data Visualization

Statistics of the multiplicative group Greg Martin University of British Columbia joint work

Measurement and Analysis of Traffic in a Hybrid Satellite-Terrestrial Network Qing (Kenny) Shao

Incorporating Intra-flow Dependencies and Inter-flow Correlations for Traffic Matrix Prediction

Sambuz

Useful Links

Newsletter

Mail Us