

  1. Statistics and Imaging Jon Clayden <j.clayden@ucl.ac.uk> DIBS Teaching Seminar, 11 Nov 2015 Photo by José Martín Ramírez Carrasco https://www.behance.net/martini_rc

  2. “Statistics is a subject that many medics find easy, but most statisticians find difficult” — Stephen Senn (attrib.)

  3. Purposes
  • Summarising data, describing features such as central tendency and dispersion
  • Making inferences about the population that a given sample was drawn from

  4. Hypothesis testing
  • A null hypothesis is a default position (no effect, no difference, no relationship, etc.)
  • This is set against an alternative hypothesis, generally the opposite of the null
  • A hypothesis test estimates the probability, p, of observing data at least as extreme as the sample, under the assumption that the null is true
  • If this p-value is less than a threshold, α, usually 0.05, then the null is rejected and treated as false
  • When the null is in fact true, it will therefore be falsely rejected 5% of the time
  • The rate at which the null hypothesis is correctly rejected is the power
  • NB: Failing to reject the null hypothesis does not constitute strong evidence in support of it
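The α = 0.05 false-positive rate described above can be checked by simulation. A minimal sketch, in Python rather than the R used elsewhere in these slides, and using a normal approximation to the test for simplicity (function and variable names are ours, for illustration):

```python
import math
import random

def z_test_p(sample, mu0=0.0):
    """Two-tailed p-value for H0: mean == mu0, using a normal approximation."""
    n = len(sample)
    m = sum(sample) / n
    var = sum((x - m) ** 2 for x in sample) / (n - 1)
    z = (m - mu0) / math.sqrt(var / n)
    return math.erfc(abs(z) / math.sqrt(2))  # P(|Z| > |z|) for standard normal Z

random.seed(1)
n_tests, alpha = 2000, 0.05
false_positives = 0
for _ in range(n_tests):
    sample = [random.gauss(0, 1) for _ in range(50)]  # null is true by construction
    if z_test_p(sample) < alpha:
        false_positives += 1

print(false_positives / n_tests)  # close to alpha
```

Because every simulated sample is drawn under the null, the observed rejection rate is the false-positive rate, and it lands near the nominal 5%.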

  5. The t-test
  • A test for a difference in means…
  • …which may be of a particular sign (one-tailed) or either sign (two-tailed)…
  • …either between two groups of observations (two-sample), or one group and a fixed value, often zero (one-sample)…
  • …which is valid under the assumptions that the groups are approximately normally distributed, independently sampled and (for some implementations) have equal population variance

  6. Anatomy of a test

  t = \frac{\bar{X}_1 - \bar{X}_2}{\sqrt{s_1^2/n_1 + s_2^2/n_2}}

  \nu = \frac{\left( s_1^2/n_1 + s_2^2/n_2 \right)^2}{\dfrac{(s_1^2/n_1)^2}{n_1 - 1} + \dfrac{(s_2^2/n_2)^2}{n_2 - 1}}

  [Plot: the t distribution P(t | ν), with markers on the axis at −t, 0 and t]
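The Welch statistic and degrees of freedom above translate directly into code. A sketch in Python (the slides use R; `welch` and the two sample vectors are ours, for illustration):

```python
import math
from statistics import mean, variance

def welch(a, b):
    """Welch's t statistic and degrees of freedom for two samples."""
    se2_a = variance(a) / len(a)  # squared standard error of each mean
    se2_b = variance(b) / len(b)
    t = (mean(a) - mean(b)) / math.sqrt(se2_a + se2_b)
    nu = (se2_a + se2_b) ** 2 / (
        se2_a ** 2 / (len(a) - 1) + se2_b ** 2 / (len(b) - 1)
    )
    return t, nu

t, nu = welch([1, 2, 3, 4, 5], [2, 4, 6, 8, 10])
print(round(t, 3), round(nu, 3))  # -1.897 5.882
```

Note that ν is generally not an integer, and falls between min(n₁, n₂) − 1 and n₁ + n₂ − 2.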

  7. In R

  > t.test(a, b)

  	Welch Two Sample t-test

  data:  a and b
  t = -2.6492, df = 197.232, p-value = 0.008722
  alternative hypothesis: true difference in means is not equal to 0
  95 percent confidence interval:
   -0.63820792 -0.09351402
  sample estimates:
   mean of x  mean of y
  -0.1366332  0.2292278

  …or, computing the same test manually:

  > se2.a <- var(a) / length(a)
  > se2.b <- var(b) / length(b)
  > t <- (mean(a) - mean(b)) / sqrt(se2.a + se2.b)
  > t
  [1] -2.6492
  > df <- (se2.a + se2.b)^2 / ((se2.a^2)/(length(a)-1) + (se2.b^2)/(length(b)-1))
  > df
  [1] 197.2316
  > pt(t, df) * 2
  [1] 0.00872208

  8. Effect of sample size [Plot: mean of 1000 p-values at each sample size, n]

  9. Other common hypothesis tests
  • t-test for significant correlation coefficient
  • t-test for significant regression coefficient
  • F-test for difference between multiple means
  • F-test for model comparison
  • Nonparametric equivalents, e.g. signed-rank test
  • Robustness to violations of assumptions varies

  10. Issues with significance tests
  • Arbitrary p-value threshold
  • Significance vs effect size, especially with many observations
  • Publication bias: non-significant results are rarely published
  • Choice of null hypothesis can be controversial
  • Ignores any prior information
  • Probability of data (obtained) vs probability that hypothesis is correct (often desired)

  11. The big-picture problem (The Economist, 19th October 2013)

  12. Multiple comparisons See R’s p.adjust function for p-value adjustments
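R's p.adjust offers several correction methods; two common ones can be sketched in a few lines. This Python sketch is our own illustration of the standard Bonferroni and Benjamini-Hochberg procedures, not R's implementation:

```python
def bonferroni(pvals):
    """Bonferroni adjustment: scale each p-value by the number of tests, capped at 1."""
    m = len(pvals)
    return [min(1.0, p * m) for p in pvals]

def benjamini_hochberg(pvals):
    """Benjamini-Hochberg step-up adjustment, controlling the false discovery rate."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m - 1, -1, -1):  # walk from largest p-value to smallest
        i = order[rank]
        running_min = min(running_min, pvals[i] * m / (rank + 1))
        adjusted[i] = running_min
    return adjusted

print(bonferroni([0.01, 0.02, 0.04]))          # ≈ [0.03, 0.06, 0.12]
print(benjamini_hochberg([0.01, 0.02, 0.04]))  # ≈ [0.03, 0.03, 0.04]
```

Bonferroni controls the family-wise error rate and is conservative with many tests; Benjamini-Hochberg controls the false discovery rate instead, which is often more appropriate for mass univariate imaging analyses.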

  13. The picture in imaging
  • Hypothesis tests may be performed on a variety of scales
  • Worth carefully considering the appropriate scale for the research question
  • Dimensionality reduction can be helpful
  • Mass univariate testing (e.g. voxelwise) produces a major multiple comparisons issue

  14. Linear (regression) models
  • We have some measurement, y, for each subject
  • We have some predictor variables, x_1, x_2, x_3, etc., for which we have measurements for each subject
  • We want to know β_1, β_2, β_3, etc., the influences of each x on y
  • We use the model
      y_i = β_0 + β_1 x_{i1} + … + β_p x_{ip} + ε_i
    where the errors (or residuals), ε_i, are assumed to be normally distributed with zero mean
  • Typically fitted with ordinary least squares, a simple matrix operation
  • Assumes constant variance, independent errors, noncollinearity in predictors
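For a single predictor, the ordinary least squares fit mentioned above has a simple closed form. A Python sketch (the slides use R; `ols_fit` and the toy data are ours, for illustration):

```python
from statistics import mean

def ols_fit(x, y):
    """Ordinary least squares for one predictor: y = b0 + b1*x + error."""
    x_bar, y_bar = mean(x), mean(y)
    # Slope: covariance of x and y divided by variance of x
    b1 = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / \
         sum((xi - x_bar) ** 2 for xi in x)
    b0 = y_bar - b1 * x_bar  # intercept makes the fit pass through the mean point
    return b0, b1

x = [1, 2, 3, 4]
y = [3.1, 4.9, 7.2, 8.8]  # roughly y = 1 + 2x, with noise
b0, b1 = ols_fit(x, y)
print(b0, b1)
```

With more predictors the same idea becomes the matrix operation β̂ = (XᵀX)⁻¹Xᵀy, which is what R's lm computes (via a QR decomposition).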

  15. A versatile tool
  • With one predictor, a regression model is closely related to (Pearson) correlation or the t-test
  • With more predictors, also covers analysis of (co)variance
  • Extension to multivariate outcomes (general linear model) covers MANOVA, MANCOVA

  16. Anscombe’s quartet, or, why you should look at your data
  • Same mean
  • Same variance
  • Same correlation coefficient
  • Same regression line
  Anscombe, Amer Stat, 1973

  17. SPM (Savitz et al., Sci Reports, 2012)

  18. Beyond hypothesis tests
  • Models of data as outcomes, plus derivatives such as reference ranges
  • Parameter estimates, confidence intervals, etc.
  • Model comparison via likelihood, information theory approaches
  • Clustering
  • Predictive power, e.g. ROC analysis
  • Measures of uncertainty via resampling methods
  • Bayesian inference: prior and posterior distributions
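One of the resampling methods mentioned above, the percentile bootstrap, can be sketched very compactly. A Python illustration of the general technique (`bootstrap_ci` and the data are ours, not from the slides):

```python
import random
from statistics import mean

def bootstrap_ci(data, n_resamples=5000, level=0.95):
    """Percentile bootstrap confidence interval for the mean (a simple sketch)."""
    means = []
    for _ in range(n_resamples):
        resample = [random.choice(data) for _ in data]  # sample with replacement
        means.append(mean(resample))
    means.sort()
    lo = means[int((1 - level) / 2 * n_resamples)]
    hi = means[int((1 + level) / 2 * n_resamples)]
    return lo, hi

random.seed(42)
data = [4.1, 5.0, 3.8, 6.2, 5.5, 4.7, 5.9, 4.4, 5.1, 4.9]
lo, hi = bootstrap_ci(data)
print(lo, hi)
```

The appeal is that the same recipe works for statistics with no convenient analytic standard error (medians, correlations, image-derived summaries), at the cost of computation. R's boot package provides this and more refined variants.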

  19. Regression to the mean [Excerpt from Senn, The Write Stuff, 2009: values selected for being extreme (e.g. diastolic blood pressure above a 95 mmHg cut-off at baseline) tend to be less extreme when measured again]

  20. Some advice
  • Plan ahead
  • Be clear what you really want to know
  • Use R
  • Visualise and understand your data
  • Save scripts
  • Keep statistical tests to a minimum
  • Be aware of sources of bias
  • Use available resources at ICH and beyond
