cherry picking multiple testing for exploratory research
play

Cherry-picking Multiple Testing for Exploratory Research Jelle - PowerPoint PPT Presentation

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman Aldo Solari Leiden University Medical Center University of Milano-Bicocca Journ


  1. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman Aldo Solari Leiden University Medical Center University of Milano-Bicocca Journ´ ees de Statistique, 2012-05-24 Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  2. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion A genomics data analysis result Top 10 genes Gene p-value multiplicity-corrected p-value OCIAD2 5.5e-6 0.015 NEK3 6.7e-6 0.019 TAF5 7.1e-6 0.020 FOXD4L6 7.5e-6 0.021 ADIG 8.8e-6 0.025 ZNF19 1.3e-5 0.038 ERICH1 1.5e-5 0.044 SKP1 1.7e-5 0.050 GDF3 2.0e-5 0.059 CCDC25 2.0e-5 0.059 . . . . . . . . . Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  3. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion The empirical cycle Confirmatory data analysis Limited number of research questions Research questions well-defined a priori Focus: strict error control Traditionally: (multiple) testing is important Exploratory data analysis Many possible research questions Research questions not well-defined a priori Focus: finding promising research avenues Traditionally: (multiple) testing not so important Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  4. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Microarray data analysis More like exploratory than confirmatory research Probing many genes simultaneously Decision which questions are interesting taken a posteriori Findings are subject to follow up validation Still: multiple testing performed Reason: prevent unsuccessful validation experiments Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  5. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Exploratory data analysis Mild It is not bad to select some true null hypotheses Flexible Procedures should not completely prescribe what to reject Post hoc Decide what/how much to follow up after seeing the data Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  6. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Exploratory data analysis Mild It is not bad to select some true null hypotheses Flexible Procedures should not completely prescribe what to reject Post hoc Decide what/how much to follow up after seeing the data Multiple testing in exploratory research Should sanction mild, flexible, post hoc inference Should advise, not prescribe Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  7. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Set-up Hypotheses H 1 , . . . , H n True hypotheses T ⊆ { 1 , . . . , n } indices of true hypotheses Rejections R ⊆ { 1 , . . . , n } set of rejected hypotheses (usually random) Type I errors T ∩ R ⊆ { 1 , . . . , n } Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  8. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion FWER, FDR, k-FWER User role Before seeing the data Choose error rate to be controlled : P ( T ∩ R = ∅ ) FWER: � #( T ∩ R ) � FDR : E # R ∨ 1 � � #( R ∩ T ) ≥ k k-FWER : P Procedure Chooses R that controls the chosen error rate Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  9. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Alterative: exploratory inference Role of the user In complete freedom the user rejects collection of hypotheses R . Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  10. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Alterative: exploratory inference Role of the user In complete freedom the user rejects collection of hypotheses R . Role of the multiple testing procedure Inform user of the number of false rejections incurred Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  11. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Alterative: exploratory inference Role of the user In complete freedom the user rejects collection of hypotheses R . Role of the multiple testing procedure Inform user of the number of false rejections incurred Number of false rejections = #( T ∩ R ) = function of the model parameters = something we can estimate or make a confidence interval for Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  12. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Alterative: exploratory inference Role of the user In complete freedom the user rejects collection of hypotheses R . Role of the multiple testing procedure Inform user of the number of false rejections incurred Number of false rejections = #( T ∩ R ) = function of the model parameters = something we can estimate or make a confidence interval for Post hoc If we make a simultaneous CI, post hoc choice of R is allowed Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  13. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Closed Testing: ingredients Marcus, Peritz and Gabriel (1976) Fundamental principle of FWER control Intersection hypothesis H C = � i ∈ C H i , for C ⊆ { 1 , . . . , n } Closure Collection of all intersection hypotheses � � C = H C : C ⊆ { 1 , . . . , n } Local test Valid α -level test for every intersection hypothesis Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  14. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Closed testing (graphically) A B C Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  15. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Closed testing (graphically) A ∩ B A B A ∩ B ∩ C A ∩ C B ∩ C C Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  16. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Closed testing: procedure Raw rejections Hypotheses U ⊆ C rejected by the local test Multiplicity-rejected rejections Reject H ∈ C if J ∈ U for every J ⊆ H Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  17. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Closed testing: procedure Raw rejections Hypotheses U ⊆ C rejected by the local test Multiplicity-rejected rejections Reject H ∈ C if J ∈ U for every J ⊆ H Statement P ( R ∩ T = ∅ ) ≥ 1 − α with R = { C ∈ C : C rejected } and T = { C ∈ C : C true } Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  18. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Closed testing: procedure Raw rejections Hypotheses U ⊆ C rejected by the local test Multiplicity-rejected rejections Reject H ∈ C if J ∈ U for every J ⊆ H Statement P ( R ∩ T = ∅ ) ≥ 1 − α with R = { C ∈ C : C rejected } and T = { C ∈ C : C true } Proof {R ∩ T = ∅} ⊇ { H T / ∈ U} Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  19. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Consonance Traditionally, only rejection of elementary hypotheses is of interest A ∩ B ∩ C A ∩ B A ∩ C B ∩ C A B C The closed graph of hypotheses A , B and C Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  20. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Consonance Traditionally, only rejection of elementary hypotheses is of interest A ∩ B ∩ C A ∩ B ∩ C A ∩ B ∩ C A ∩ B A ∩ B A ∩ C A ∩ C A ∩ C B ∩ C A B C Consonant rejections Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  21. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Consonance Traditionally, only rejection of elementary hypotheses is of interest A ∩ B ∩ C A ∩ B ∩ C A ∩ B ∩ C A ∩ B A ∩ B A ∩ C A ∩ C A ∩ C B ∩ C B ∩ C B ∩ C A B C Non-consonant rejection of B ∩ C Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  22. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Parameter, confidence bound and coverage Parameter τ ( R ) = #( T ∩ R ) for a fixed set R Closed testing Let X be the collection of hypotheses rejected Confidence bound t α ( R ) = max(# C : C ⊆ R , H C / ∈ X} Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

  23. Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion In the example A ∩ B ∩ C A ∩ B ∩ C A ∩ B ∩ C A ∩ B A ∩ B A ∩ C A ∩ C A ∩ C B ∩ C B ∩ C B ∩ C A B C t α ( { B , C } ) = 1 Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Recommend


More recommend