Null Hypothesis Significance Testing and the Problem of - PowerPoint PPT Presentation

Null Hypothesis Significance Testing and the Problem of Underpowered Studies in Economics
Le (Lyla) Zhang, Curtin University (with Andreas Ortmann, UNSW)
2015 workshop in Experimental Methods: The replicability crisis in the social sciences and how to address it
November 2015

Outline
• Null Hypothesis Significance Testing (NHST)
• Commonly Used Procedure
• Two Types of Errors
• The Statistical Power Analysis
• A Meta-analysis (to calculate effect size)
• Statistical power of dictator game experiments

Null Hypothesis Significance Testing
• Widely used routine: Null Hypothesis → Calculate Statistics → Reject / Fail to Reject
• Set “no treatment effect” as the null hypothesis
• A commonly used (“conventional”) criterion: α = 5% (10%, 1%)

Two Types of Errors

                  Null is true (H0)                   Null is false (H1)
Reject            α: Type I error (false positive)    1 − β (power)
Fail to reject    1 − α                               β: Type II error (false negative)
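
The four cells above can be illustrated with a small simulation. The sketch below is not from the slides: it assumes a two-sided two-sample z-test with known unit variance, and the sample size (30 per group), the true difference (0.5), the number of repetitions, and the function names are all hypothetical choices made for illustration.

```python
import random
from statistics import NormalDist, mean


def z_test_rejects(n, true_diff, rng, alpha=0.05):
    """Simulate one experiment; return True if H0 (no difference) is rejected.

    Assumption: both groups are normal with known standard deviation 1.
    """
    control = [rng.gauss(0.0, 1.0) for _ in range(n)]
    treated = [rng.gauss(true_diff, 1.0) for _ in range(n)]
    # Standard error of the difference in means is sqrt(1/n + 1/n).
    z = (mean(treated) - mean(control)) / (2.0 / n) ** 0.5
    p_value = 2.0 * (1.0 - NormalDist().cdf(abs(z)))
    return p_value < alpha


def rejection_rate(n, true_diff, reps=2000, seed=1):
    """Share of simulated experiments in which H0 is rejected."""
    rng = random.Random(seed)
    return sum(z_test_rejects(n, true_diff, rng) for _ in range(reps)) / reps


# Null is true: the rejection rate estimates alpha (Type I error rate).
type_i_rate = rejection_rate(n=30, true_diff=0.0)
# Null is false: the rejection rate estimates power, 1 - beta.
power = rejection_rate(n=30, true_diff=0.5)
print(f"Type I error rate ~ {type_i_rate:.3f}, power ~ {power:.3f}")
```

Under these assumptions the first rate should land close to the chosen α, while the second falls well short of 100%, which is the Type II error problem the rest of the talk addresses.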

Dictator Game Experiments
[Figure: illustration of the dictator game, e.g., $10]

Dictator Game Experiments
• Over the past 15 years, hundreds of dictator game experiments have been conducted (Engel, 2010; Zhang & Ortmann, 2014).
• These studies vary in experimental design variables (e.g., asset legitimacy, real money, etc.) and substantive variables (e.g., country, student, age).
• Some of them are published, while others are not.

A meta-analysis of dictator game experiments
Variables:
• Group Decision
• Paper Quality
• Uncertainty
• Incentive
• Identification
• Asset Legitimacy
• Action Space
• Deserving Recipient
• Social cue
• Efficiency
• Country
• Communication
• Age
• Student
• Double Blind
• Repeated Game

Dictator Game Experiments
[Figure; annotation: often-used threshold]

The severe situation of under-powered studies
• Large variation in the statistical power of the studies included in the meta-analysis of dictator game (DG) experiments (130 studies): min 5%, max 100%, median 22.5%.
• The majority of them are under-powered (that is, unlikely to detect an effect that exists).
• Statistical power depends on the sample size and the variables of interest (various design and implementation characteristics).
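
How power depends on sample size and effect size can be sketched with a normal approximation for a two-sided comparison of two equal-sized groups. This is an illustrative assumption, not the calculation used in the meta-analysis; the function name `power_two_sample` and the example inputs are hypothetical, and `d` denotes a standardized mean difference.

```python
from statistics import NormalDist


def power_two_sample(d, n_per_group, alpha=0.05):
    """Approximate power of a two-sided two-sample test (normal approximation).

    d           : standardized effect size (difference in means / common SD)
    n_per_group : observations in each of the two groups
    """
    nd = NormalDist()
    z_crit = nd.inv_cdf(1.0 - alpha / 2.0)
    # Expected value of the z statistic when the true effect is d.
    ncp = d * (n_per_group / 2.0) ** 0.5
    # Probability that |z| exceeds the critical value.
    return nd.cdf(ncp - z_crit) + nd.cdf(-ncp - z_crit)


# Hypothetical inputs: the same effect size at increasing sample sizes.
for n in (10, 30, 64, 200):
    print(n, round(power_two_sample(0.5, n), 3))
```

With a true effect of zero the function returns α itself, so under this approximation power cannot fall below the significance level.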

Dictator Game Experiments
• Large ES: High statistical power
• Medium ES: Statistical power varies and depends on sample size
• Small ES: Need a large sample to achieve the required statistical power
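
To make the contrast between effect-size classes concrete, the sketch below inverts the normal-approximation power formula to get the sample size per group needed for a target power. The slides do not state numeric cut-offs for small, medium, and large ES; the values 0.2, 0.5, and 0.8 are Cohen's conventional benchmarks for a standardized mean difference and are used here only as an assumption, as are the 80% power target and the function name.

```python
from math import ceil
from statistics import NormalDist


def required_n_per_group(d, power=0.80, alpha=0.05):
    """Observations per group for a two-sided two-sample test (normal approximation)."""
    nd = NormalDist()
    z_alpha = nd.inv_cdf(1.0 - alpha / 2.0)
    z_power = nd.inv_cdf(power)
    return ceil(2.0 * ((z_alpha + z_power) / d) ** 2)


# Assumed benchmarks (Cohen's conventions), not values taken from the slides.
benchmarks = {"small": 0.2, "medium": 0.5, "large": 0.8}
needed = {label: required_n_per_group(d) for label, d in benchmarks.items()}
for label, n in needed.items():
    print(f"{label} ES: about {n} per group")
```

Because the required sample size scales with 1/d², halving the effect size roughly quadruples the number of participants needed. An exact t-test calculation gives slightly larger numbers than this approximation.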

Dictator Game Experiments

What can we do?
• Rules of thumb: List et al. (EE, 2010). However, they do not guarantee a high level of statistical power.
• Include a meta-analysis in the literature review, if possible.
• Use the average effect size from the meta-analysis for the power analysis of future projects.
• This requires open data.
• If there is no extant study, pilot sessions would be helpful.
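
The suggestion to plan future studies around a meta-analytic average effect size might look like the following sketch. It assumes a fixed-effect (inverse-variance weighted) average, which the slides do not specify, and a normal-approximation sample size formula; the three effect sizes and variances are made-up placeholders, not results from the dictator game meta-analysis.

```python
from math import ceil
from statistics import NormalDist


def pooled_effect(effects, variances):
    """Inverse-variance weighted (fixed-effect) average of study effect sizes."""
    weights = [1.0 / v for v in variances]
    return sum(w * e for w, e in zip(weights, effects)) / sum(weights)


def required_n_per_group(d, power=0.80, alpha=0.05):
    """Observations per group for a two-sided two-sample test (normal approximation)."""
    nd = NormalDist()
    z_sum = nd.inv_cdf(1.0 - alpha / 2.0) + nd.inv_cdf(power)
    return ceil(2.0 * (z_sum / d) ** 2)


# Hypothetical prior studies: standardized effect sizes and their sampling variances.
effects = [0.30, 0.50, 0.40]
variances = [0.04, 0.02, 0.01]

d_bar = pooled_effect(effects, variances)
n_planned = required_n_per_group(d_bar)
print(f"average effect size {d_bar:.3f} -> plan about {n_planned} per group")
```

If the prior studies differ substantially in design, a random-effects average may be the more defensible input; the planning step stays the same.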

Thank you!
