Welcome to the course!
FOUNDATIONS OF INFERENCE
Jo Hardin, Instructor

What is statistical inference?
The process of making claims about a population based on information from a sample.

What is statistical inference?

Assume two populations prefer cola at the same rate

The sample data

The sample data (take 2)

Vocabulary
Null hypothesis (H_0): The claim that is not interesting
Alternative hypothesis (H_A): The claim corresponding to the research hypothesis
The "goal" is to disprove the null hypothesis

Example: cheetah speed
Compare the speed of two different subspecies of cheetah
H_0: Asian and African cheetahs run at the same speed, on average
H_A: African cheetahs are faster than Asian cheetahs, on average

Example: election
From a sample, the researchers would like to claim that Candidate X will win
H_0: Candidate X will get half the votes
H_A: Candidate X will get more than half the votes
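
In symbols, the two examples might be written as follows. The notation is an assumption and does not appear on the slides: \mu stands for a subspecies' average running speed and p for the true proportion of votes for Candidate X.

```latex
\begin{align*}
\text{Cheetah speed:} \quad & H_0: \mu_{\text{African}} = \mu_{\text{Asian}}, \qquad H_A: \mu_{\text{African}} > \mu_{\text{Asian}} \\
\text{Election:} \quad & H_0: p = 0.5, \qquad H_A: p > 0.5
\end{align*}
```

Both alternatives are one-sided, matching the direction of the research claim in each example.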

Let's practice!

Randomized distributions

Logic of inference

Understanding the null distribution
Generating a distribution of the statistic from the null population gives information about whether the observed data are inconsistent with the null hypothesis.

Understanding the null distribution
Original data
    Location  Cola  Orange
    East        28       6
    West        19       7
$\hat{p}_{east} = 28/(28 + 6) = 0.82$
$\hat{p}_{west} = 19/(19 + 7) = 0.73$
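
The two sample proportions can be checked with a few lines of base R. This is a minimal sketch: the slides do not show how `soda` is built, so the data frame below is a hypothetical reconstruction from the table counts, with one row per respondent.

```r
# Hypothetical reconstruction of the data from the table counts
# (28 + 6 respondents in the East, 19 + 7 in the West).
soda <- data.frame(
  drink    = rep(c("cola", "orange", "cola", "orange"), times = c(28, 6, 19, 7)),
  location = rep(c("East", "West"), times = c(34, 26))
)

# Proportion preferring cola within each location
p_hat <- tapply(soda$drink == "cola", soda$location, mean)
round(p_hat, 2)

# Observed difference in proportions, West minus East
diff_obs <- p_hat[["West"]] - p_hat[["East"]]
diff_obs
```

The difference is taken as West minus East so that it lines up with `order = c("west", "east")` in the infer code later in the deck.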

Understanding the null distribution
First shuffle, same as original
    Location  Cola  Orange
    East        28       6
    West        19       7

Understanding the null distribution
Second shuffle
    Location  Cola  Orange
    East        27       7
    West        20       6

Understanding the null distribution
Third shuffle
    Location  Cola  Orange
    East        26       8
    West        21       5

Understanding the null distribution
Fourth shuffle
    Location  Cola  Orange
    East        25       9
    West        22       4

Understanding the null distribution
Fifth shuffle
    Location  Cola  Orange
    East        29       5
    West        18       8
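
A single shuffle like the ones above can be sketched in base R by permuting the drink labels while leaving the locations fixed. The `soda` data frame is again a hypothetical reconstruction from the original counts, and the seed is arbitrary.

```r
# Hypothetical reconstruction of the data from the original table counts
soda <- data.frame(
  drink    = rep(c("cola", "orange", "cola", "orange"), times = c(28, 6, 19, 7)),
  location = rep(c("East", "West"), times = c(34, 26))
)

set.seed(1)  # arbitrary seed, only for reproducibility

# Shuffle the drink labels; locations stay where they are
shuffled <- soda
shuffled$drink <- sample(soda$drink)

# Cross-tabulate the shuffled data
tab <- table(shuffled$location, shuffled$drink)
tab

# The margins do not change: 34 East, 26 West, 47 cola, 13 orange
rowSums(tab)
colSums(tab)
```

Because shuffling only reassigns labels, every shuffled table keeps the same row and column totals as the original data; only the four cell counts vary.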

Understanding the null distribution

One random permutation
    library(dplyr)  # group_by(), summarize(), %>%
    library(infer)  # specify(), hypothesize(), generate(), calculate()

    # Observed difference in proportions
    soda %>%
      group_by(location) %>%
      summarize(prop_cola = mean(drink == "cola")) %>%
      summarize(diff(prop_cola))
    # A tibble: 1 x 1
      `diff(prop_cola)`
                  <dbl>
    1       -0.09276018

    # Difference in proportions after one permutation
    soda %>%
      specify(drink ~ location, success = "cola") %>%
      hypothesize(null = "independence") %>%
      generate(reps = 1, type = "permute") %>%
      calculate(stat = "diff in props", order = c("west", "east"))
    # A tibble: 1 x 2
      replicate        stat
          <int>       <dbl>
    1         1 -0.02488688

Many random permutations
    soda %>%
      specify(drink ~ location, success = "cola") %>%
      hypothesize(null = "independence") %>%
      generate(reps = 5, type = "permute") %>%
      calculate(stat = "diff in props", order = c("west", "east"))
    # A tibble: 5 x 2
      replicate        stat
          <int>       <dbl>
    1         1  0.04298643
    2         2 -0.09276018
    3         3  0.11085973
    4         4  0.17873303
    5         5 -0.16063348
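
To see what repeated permutation amounts to, here is a base R sketch that shuffles the drink labels several times and recomputes the West-minus-East difference each time. It illustrates the idea only and is not how infer is implemented; `soda` is a hypothetical reconstruction from the table counts, `perm_diff()` is a helper made up for this example, and the seed is arbitrary.

```r
# Hypothetical reconstruction of the data from the original table counts
soda <- data.frame(
  drink    = rep(c("cola", "orange", "cola", "orange"), times = c(28, 6, 19, 7)),
  location = rep(c("East", "West"), times = c(34, 26))
)

# Illustrative helper: shuffle the drink labels once and return the
# difference in cola proportions, West minus East
perm_diff <- function(data) {
  shuffled <- sample(data$drink)
  p <- tapply(shuffled == "cola", data$location, mean)
  p[["West"]] - p[["East"]]
}

set.seed(42)  # arbitrary seed, only for reproducibility

# Five permuted differences, analogous to reps = 5
perm_stats <- replicate(5, perm_diff(soda))
perm_stats
```

Each value is one draw from the null distribution of the statistic; increasing the number of replicates fills in that distribution.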

Random distribution

Using the randomization distribution

Understanding the null distribution

Data consistent with null?
    table(soda)
            location
    drink    East West
      cola     28   19
      orange    6    7

    soda %>%
      group_by(location) %>%
      summarize(mean(drink == "cola"))
    # A tibble: 2 × 2
      location `mean(drink == "cola")`
        <fctr>                   <dbl>
    1     East               0.8235294
    2     West               0.7307692

Significance

How extreme are the observed data?
    # Observed difference in proportions
    diff_orig <- soda %>%
      group_by(location) %>%
      summarize(prop_cola = mean(drink == "cola")) %>%
      summarize(diff(prop_cola)) %>%
      pull()

    # Differences in proportions from 100 permutations
    soda_perm <- soda %>%
      specify(drink ~ location, success = "cola") %>%
      hypothesize(null = "independence") %>%
      generate(reps = 100, type = "permute") %>%
      calculate(stat = "diff in props", order = c("west", "east"))

    # Proportion of permuted differences at or below the observed one
    soda_perm %>%
      summarize(proportion = mean(diff_orig >= stat))
    # A tibble: 1 x 1
      proportion
           <dbl>
    1      0.380
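
The proportion computed above is a one-sided permutation p-value: the share of permuted differences at or below the observed West-minus-East difference. The base R sketch below repeats that calculation with more replicates and, as a cross-check, compares it with the exact hypergeometric probability that 19 or fewer of the 47 cola responses land in the West group of 26. The `soda` reconstruction, the helper `diff_west_east()`, the seed and the replicate count are all assumptions made for illustration.

```r
# Hypothetical reconstruction of the data from the original table counts
soda <- data.frame(
  drink    = rep(c("cola", "orange", "cola", "orange"), times = c(28, 6, 19, 7)),
  location = rep(c("East", "West"), times = c(34, 26))
)

# Illustrative helper: difference in cola proportions, West minus East
diff_west_east <- function(drink, location) {
  p <- tapply(drink == "cola", location, mean)
  p[["West"]] - p[["East"]]
}

diff_obs <- diff_west_east(soda$drink, soda$location)

set.seed(2024)  # arbitrary seed, only for reproducibility

# Permuted differences: shuffle the drink labels, keep locations fixed
perm_stats <- replicate(2000, diff_west_east(sample(soda$drink), soda$location))

# One-sided permutation p-value; the small tolerance guards against
# floating-point ties with the observed value
p_perm <- mean(perm_stats <= diff_obs + 1e-12)

# Exact counterpart: P(at most 19 of the 47 colas fall among 26 West rows)
p_exact <- phyper(19, m = 47, n = 13, k = 26)

c(permutation = p_perm, exact = p_exact)
```

With only 100 replicates the estimate is noisy, so the permutation value can differ noticeably from run to run; the exact probability gives a fixed reference point.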

Study conclusions

Significance
We fail to reject the null hypothesis: There is no evidence that our data are inconsistent with the null hypothesis.

NHANES: random sample
Representative sample of US population
Conclusions from sample may apply to population
Nothing to report in this case
