Cherry-picking Multiple Testing for Exploratory Research Jelle - PowerPoint PPT Presentation

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman Aldo Solari Leiden University Medical Center University of Milano-Bicocca Journ´ ees de Statistique, 2012-05-24 Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion A genomics data analysis result Top 10 genes Gene p-value multiplicity-corrected p-value OCIAD2 5.5e-6 0.015 NEK3 6.7e-6 0.019 TAF5 7.1e-6 0.020 FOXD4L6 7.5e-6 0.021 ADIG 8.8e-6 0.025 ZNF19 1.3e-5 0.038 ERICH1 1.5e-5 0.044 SKP1 1.7e-5 0.050 GDF3 2.0e-5 0.059 CCDC25 2.0e-5 0.059 . . . . . . . . . Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion The empirical cycle Confirmatory data analysis Limited number of research questions Research questions well-defined a priori Focus: strict error control Traditionally: (multiple) testing is important Exploratory data analysis Many possible research questions Research questions not well-defined a priori Focus: finding promising research avenues Traditionally: (multiple) testing not so important Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Microarray data analysis More like exploratory than confirmatory research Probing many genes simultaneously Decision which questions are interesting taken a posteriori Findings are subject to follow up validation Still: multiple testing performed Reason: prevent unsuccessful validation experiments Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Exploratory data analysis Mild It is not bad to select some true null hypotheses Flexible Procedures should not completely prescribe what to reject Post hoc Decide what/how much to follow up after seeing the data Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Exploratory data analysis Mild It is not bad to select some true null hypotheses Flexible Procedures should not completely prescribe what to reject Post hoc Decide what/how much to follow up after seeing the data Multiple testing in exploratory research Should sanction mild, flexible, post hoc inference Should advise, not prescribe Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Set-up Hypotheses H 1 , . . . , H n True hypotheses T ⊆ { 1 , . . . , n } indices of true hypotheses Rejections R ⊆ { 1 , . . . , n } set of rejected hypotheses (usually random) Type I errors T ∩ R ⊆ { 1 , . . . , n } Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion FWER, FDR, k-FWER User role Before seeing the data Choose error rate to be controlled : P ( T ∩ R = ∅ ) FWER: � #( T ∩ R ) � FDR : E # R ∨ 1 � � #( R ∩ T ) ≥ k k-FWER : P Procedure Chooses R that controls the chosen error rate Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Alterative: exploratory inference Role of the user In complete freedom the user rejects collection of hypotheses R . Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Alterative: exploratory inference Role of the user In complete freedom the user rejects collection of hypotheses R . Role of the multiple testing procedure Inform user of the number of false rejections incurred Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Alterative: exploratory inference Role of the user In complete freedom the user rejects collection of hypotheses R . Role of the multiple testing procedure Inform user of the number of false rejections incurred Number of false rejections = #( T ∩ R ) = function of the model parameters = something we can estimate or make a confidence interval for Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Alterative: exploratory inference Role of the user In complete freedom the user rejects collection of hypotheses R . Role of the multiple testing procedure Inform user of the number of false rejections incurred Number of false rejections = #( T ∩ R ) = function of the model parameters = something we can estimate or make a confidence interval for Post hoc If we make a simultaneous CI, post hoc choice of R is allowed Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Closed Testing: ingredients Marcus, Peritz and Gabriel (1976) Fundamental principle of FWER control Intersection hypothesis H C = � i ∈ C H i , for C ⊆ { 1 , . . . , n } Closure Collection of all intersection hypotheses � � C = H C : C ⊆ { 1 , . . . , n } Local test Valid α -level test for every intersection hypothesis Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Closed testing (graphically) A B C Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Closed testing (graphically) A ∩ B A B A ∩ B ∩ C A ∩ C B ∩ C C Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Closed testing: procedure Raw rejections Hypotheses U ⊆ C rejected by the local test Multiplicity-rejected rejections Reject H ∈ C if J ∈ U for every J ⊆ H Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Closed testing: procedure Raw rejections Hypotheses U ⊆ C rejected by the local test Multiplicity-rejected rejections Reject H ∈ C if J ∈ U for every J ⊆ H Statement P ( R ∩ T = ∅ ) ≥ 1 − α with R = { C ∈ C : C rejected } and T = { C ∈ C : C true } Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Closed testing: procedure Raw rejections Hypotheses U ⊆ C rejected by the local test Multiplicity-rejected rejections Reject H ∈ C if J ∈ U for every J ⊆ H Statement P ( R ∩ T = ∅ ) ≥ 1 − α with R = { C ∈ C : C rejected } and T = { C ∈ C : C true } Proof {R ∩ T = ∅} ⊇ { H T / ∈ U} Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Consonance Traditionally, only rejection of elementary hypotheses is of interest A ∩ B ∩ C A ∩ B A ∩ C B ∩ C A B C The closed graph of hypotheses A , B and C Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Consonance Traditionally, only rejection of elementary hypotheses is of interest A ∩ B ∩ C A ∩ B ∩ C A ∩ B ∩ C A ∩ B A ∩ B A ∩ C A ∩ C A ∩ C B ∩ C A B C Consonant rejections Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Consonance Traditionally, only rejection of elementary hypotheses is of interest A ∩ B ∩ C A ∩ B ∩ C A ∩ B ∩ C A ∩ B A ∩ B A ∩ C A ∩ C A ∩ C B ∩ C B ∩ C B ∩ C A B C Non-consonant rejection of B ∩ C Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Parameter, confidence bound and coverage Parameter τ ( R ) = #( T ∩ R ) for a fixed set R Closed testing Let X be the collection of hypotheses rejected Confidence bound t α ( R ) = max(# C : C ⊆ R , H C / ∈ X} Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion In the example A ∩ B ∩ C A ∩ B ∩ C A ∩ B ∩ C A ∩ B A ∩ B A ∩ C A ∩ C A ∩ C B ∩ C B ∩ C B ∩ C A B C t α ( { B , C } ) = 1 Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman, Aldo Solari

Cherry-picking Multiple Testing for Exploratory Research Jelle - PowerPoint PPT Presentation

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman Aldo Solari Leiden University Medical Center University of Milano-Bicocca Journ

LIC-16-16.50 SR 16-Cherry Valley Interchange (PID 80704) SR 16/Cherry Valley Road Interchange

Example Suppose there are five kinds of bags of candies: 10% are h 1 : 100% cherry candies 20% are

Add Steak to Exploratory Add Steak to Exploratory Testing's Parlor Parlor- -Trick Sizzle Trick

Exploratory Data Analysis Paul Cohen ISTA 370 Spring, 2012 Paul Cohen ISTA 370 () Exploratory

Introduction to Data Science: x (1) x 1 x 2 x ( n ) x i n 1 1 Size: size

CME/STATS 195 CME/STATS 195 Lecture 5: Exploratory Data Analysis Lecture 5: Exploratory Data

Exploratory Monitoring at Bing AUTOMATED SYNTHETIC EXPLORATORY MONITORING OF DYNAMIC WEB SITES

Warehouse Operations Pallet Rack Replenish Block Stacking (20 lanes) Forward Picking Reserve

Picking up the pieces A guide to Post Incident Review @kleeut Picking up the pieces A guide to

Picking the Low- -Hanging Fruit: Hanging Fruit: Picking the Low Saving Money and Energy?

Cherry Picking: A New Robustness Tool David Banks and Leanna House Institute of Statistics &

Cognitive cherry picking: the patchwork process of examining A level essays

Summer Reading June 2020 CHERRY HILL PUBLIC SCHOOLS CHERRY HILL PUBLIC SCHOOLS Summer Reading

Summer Reading June 2020 CHERRY HILL PUBLIC SCHOOLS CHERRY HILL PUBLIC SCHOOLS Summer Reading

Cherry Tree Inn Address: 1 Cherry Tree Lane Debenham IP14 6QT Slide 2 Verbal Updates: -

Speaker 13 Mr James Cherry Environmental Manager Greencore Group james.cherry@greencore.com

Towards robust feature selection for high-dimensional, small sample settings Yvan Saeys

Factor Analysis for Multiple Testing : an R package for large-scale significance testing under

STK-IN4300 The bet on sparsity principle Statistical Learning Methods in Data Science

Hommels Method for False Discovery Proportions Jelle Goeman Joint work with: Aldo Solari,

Hypertension in Renal Tx Transplants 100% Hypertension most common modifiable CV risk factor

Nonparametric Density Estimation October 1, 2018 Introduction If we cant fit a

Building Community-Based Research Opportunities December 5th, 2018 Hallie Ford Center #115

Modelling Biochemical Reaction Networks Lecture 10: Glycerol metabolism, Part II Marc R. Roussel

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Cherry-picking Multiple Testing for Exploratory Research Jelle - PowerPoint PPT Presentation

Exploratory data ananlysis Closed testing A Confidence Set Applications Discussion Cherry-picking Multiple Testing for Exploratory Research Jelle Goeman Aldo Solari Leiden University Medical Center University of Milano-Bicocca Journ

LIC-16-16.50 SR 16-Cherry Valley Interchange (PID 80704) SR 16/Cherry Valley Road Interchange

Example Suppose there are five kinds of bags of candies: 10% are h 1 : 100% cherry candies 20% are

Add Steak to Exploratory Add Steak to Exploratory Testing's Parlor Parlor- -Trick Sizzle Trick

Exploratory Data Analysis Paul Cohen ISTA 370 Spring, 2012 Paul Cohen ISTA 370 () Exploratory

Introduction to Data Science: x (1) x 1 x 2 x ( n ) x i n 1 1 Size: size

CME/STATS 195 CME/STATS 195 Lecture 5: Exploratory Data Analysis Lecture 5: Exploratory Data

Exploratory Monitoring at Bing AUTOMATED SYNTHETIC EXPLORATORY MONITORING OF DYNAMIC WEB SITES

Warehouse Operations Pallet Rack Replenish Block Stacking (20 lanes) Forward Picking Reserve

Picking up the pieces A guide to Post Incident Review @kleeut Picking up the pieces A guide to

Picking the Low- -Hanging Fruit: Hanging Fruit: Picking the Low Saving Money and Energy?

Cherry Picking: A New Robustness Tool David Banks and Leanna House Institute of Statistics &amp;

Cognitive cherry picking: the patchwork process of examining A level essays

Summer Reading June 2020 CHERRY HILL PUBLIC SCHOOLS CHERRY HILL PUBLIC SCHOOLS Summer Reading

Summer Reading June 2020 CHERRY HILL PUBLIC SCHOOLS CHERRY HILL PUBLIC SCHOOLS Summer Reading

Cherry Tree Inn Address: 1 Cherry Tree Lane Debenham IP14 6QT Slide 2 Verbal Updates: -

Speaker 13 Mr James Cherry Environmental Manager Greencore Group james.cherry@greencore.com

Towards robust feature selection for high-dimensional, small sample settings Yvan Saeys

Factor Analysis for Multiple Testing : an R package for large-scale significance testing under

STK-IN4300 The bet on sparsity principle Statistical Learning Methods in Data Science

Hommels Method for False Discovery Proportions Jelle Goeman Joint work with: Aldo Solari,

Hypertension in Renal Tx Transplants 100% Hypertension most common modifiable CV risk factor

Nonparametric Density Estimation October 1, 2018 Introduction If we cant fit a

Building Community-Based Research Opportunities December 5th, 2018 Hallie Ford Center #115

Modelling Biochemical Reaction Networks Lecture 10: Glycerol metabolism, Part II Marc R. Roussel

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Cherry Picking: A New Robustness Tool David Banks and Leanna House Institute of Statistics &