Welcome to the course! EX P ERIMEN TAL DES IGN IN P YTH ON Luke - PowerPoint PPT Presentation

Welcome to the course! EX P ERIMEN TAL DES IGN IN P YTH ON Luke Hayden Instructor

Experimental design Data Allows us to answer questions How do we get answers? Need rigorous methods Approach Build hypotheses with exploratory data analysis T est hypotheses with statistical tests EXPERIMENTAL DESIGN IN PYTHON

Mapping variables Variable types Discrete: Finite set of possible values (Ex: True or False) Continuous: Any value (Ex: Measurement) Mapping X or Y axes Change color with fill or color arguments EXPERIMENTAL DESIGN IN PYTHON

Making plots with plotnine 1. Call ggplot() function and give it a import plotnine as p9 DataFrame (p9.ggplot([pandas DataFrame])+ 2. Assign mapping of variables with aes() p9.aes( 3. Specify a geometry x='variable to put on X-axis', y='variable to put on Y-axis', color='variable ')+ p9.geom_point() ) EXPERIMENTAL DESIGN IN PYTHON

Scatter plot geom_point() import plotnine as p9 import pandas as pd df = pd.DataFrame(data= {'Sex': ["Male", "Male", "Female","Female"] , "Height (cm)": [183, 179, 160, 172], "Weight (kg)": [82,75.1, 50, 58.7]}) print(p9.ggplot(df)+ p9.aes(x='Height (cm)',y='Weight (kg)', color='Sex')+ p9.geom_point()) EXPERIMENTAL DESIGN IN PYTHON

EXPERIMENTAL DESIGN IN PYTHON

Boxplot geom_boxplot() import plotnine as p9 import pandas as pd df = pd.DataFrame(data= {'Sex': ["Male", "Male","Male", "Male","Male", "Male", "Female","Female", "Female","Female", "Female","Female"] , "Height": [183, 179, 190, 181, 170, 175, 160, 165, 158, 154, 170, 160]}) (p9.ggplot(df)+ p9.aes(x='Sex',y='Height', fill='Sex')+ p9.geom_boxplot()) EXPERIMENTAL DESIGN IN PYTHON

Density plot geom_density() import plotnine as p9 import pandas as pd df = pd.DataFrame(data= {'Sex': ["Male", "Male","Male", "Male","Male", "Male", "Female","Female", "Female","Female", "Female","Female"] , "Height": [183, 179, 190, 181, 170, 175, 160, 165, 158, 154, 170, 160]}) (p9.ggplot(df)+ p9.aes(x='Height', fill='Sex') + p9.geom_density(alpha=0.5)) EXPERIMENTAL DESIGN IN PYTHON

Let's practice! EX P ERIMEN TAL DES IGN IN P YTH ON

Our �rst hypothesis test - Student's t-test EX P ERIMEN TAL DES IGN IN P YTH ON Luke Hayden Instructor

From observed pattern to reliable result Data contains patterns Some expected Others surprising Random variation also Dealing with this How do we go from observation to result? EXPERIMENTAL DESIGN IN PYTHON

Are these groups different? Weights of two groups of adults (p9.ggplot(df)+ p9.aes('Value', fill='Sample')+ Sample A: p9.geom_density(alpha=0.5)) [66.1, 69.8,67.7,69.6,71.1] Sample B: [83.7,81.5, 80.6, 83.9, 84.4] EXPERIMENTAL DESIGN IN PYTHON

Two hypotheses Null hypothesis A = B Observed patterns are the product of random chance Alternative hypothesis A != B Difference between samples represents a real difference between the populations EXPERIMENTAL DESIGN IN PYTHON

Some statistical terms p-value Likelihood of pattern under null hypothesis alpha Crucial threshold of p-value Usually alpha < 0.05: reject null hypothesis EXPERIMENTAL DESIGN IN PYTHON

Student's t-test Invented by William Sealy Gosset Coding a t-test Two basic types: from scipy import stats One-sample: Mean of population different from a given value? stats.ttest_ind(Sample_A, Sample_B) Two-sample: Two means equal? EXPERIMENTAL DESIGN IN PYTHON

Implementing a one-sample t-test from scipy import stats Sample_A = df[df.Sample == "A"] t_result = stats.ttest_1sample(Sample_A, 65) alpha = 0.05 if (t_result[1] < alpha): print("mean(A) != 65") mean(A) != 65 EXPERIMENTAL DESIGN IN PYTHON

Implementing a two-sample t-test from scipy import stats Sample_A = df[df.Sample == "A"] Sample_B = df[df.Sample == "B"] t_result = stats.ttest_ind(Sample_A, Sample_B) alpha = 0.05 if (t_result[1] < alpha): print("A and B are different!") A and B are different! EXPERIMENTAL DESIGN IN PYTHON

Now let's try it out! EX P ERIMEN TAL DES IGN IN P YTH ON

Testing proportion and correlation EX P ERIMEN TAL DES IGN IN P YTH ON Luke Hayden Instructor

Hypothesis tests t-test: Compare means of continuous variables Chi-square: Examine proportions of discrete categories Fisher exact test: Examine proportions of discrete categories Pearson test: Examine if continuous variables are correlated EXPERIMENTAL DESIGN IN PYTHON

Chi-square T est distinguishes between: Example Null hypothesis: Coin �ipped 30 times Observed outcomes �t distribution Expected: 15 heads, 15 tails coin is not biased Observed: 24 heads, 6 tails Alternative hypothesis: Expected outcomes signi�cantly different from Observed outcomes doesn't �t distribution expected? coin is biased EXPERIMENTAL DESIGN IN PYTHON

Implementing a simple Chi-square test from scipy import stats coins = df['Flip'].value_counts() chi = stats.chisquare(coins) print(chi) Power_divergenceResult(statistic=10.8, pvalue=0.0010150009471130682) EXPERIMENTAL DESIGN IN PYTHON

Fisher exact test Two-sample version of Chi-square test Example T est distinguishes between: Two coins each �ipped 30 times Null hypothesis: Expected outcomes signi�cantly differ? Two samples have same distribution of Are these two discrete variables related? outcomes Alternative hypothesis: Two samples have different distribution of outcomes EXPERIMENTAL DESIGN IN PYTHON

Implementing a Fisher exact test from scipy import stats import pandas as pd table = pd.crosstab(df.Coin,df.Flip) print(table) Flip heads tails Coin 1 22 8 2 17 13 EXPERIMENTAL DESIGN IN PYTHON

Implementing a Fisher exact test chi = stats.fisher_exact(table, alternative='two-sided') print(chi[1]) 0.421975381019902 EXPERIMENTAL DESIGN IN PYTHON

Correlation import plotnine as p9 (p9.ggplot(olyAmericans)+ p9.aes(x='Weight',y='Height')+ p9.geom_point()) EXPERIMENTAL DESIGN IN PYTHON

Pearson test for correlation from scipy import stats import pandas as pd pearson = stats.pearsonr(df.Weight, df.Height) print(pearson) (0.7922545330545416, 0.0) (Correlation coef�cient, p-value) EXPERIMENTAL DESIGN IN PYTHON

Let's practice! EX P ERIMEN TAL DES IGN IN P YTH ON

Welcome to the course! EX P ERIMEN TAL DES IGN IN P YTH ON Luke - PowerPoint PPT Presentation

Welcome to the course! EX P ERIMEN TAL DES IGN IN P YTH ON Luke Hayden Instructor Experimental design Data Allows us to answer questions How do we get answers? Need rigorous methods Approach Build hypotheses with exploratory data

Course Orientation q Course Description q Course Outcomes q Course Requirements q Course Outline

Course Search Widget Topics StudyLink Course Search Widget Demo Generic Course Search

Course Specifications/Detailed Course Outline Course code : STA 331 2.0 Course title :

Welcome to CS 61A About the Course Parts of the Course 4 Parts of the Course Lecture : Videos

DPD Basic Bicycle Course Course Objectives COURSE GOAL: The course will provide the trainee with

CANVAS COURSE PROFILE STUDENT PERFORMANCE COURSE OVERVIEW ASSIGNMENT AND SUBMISSION ANALYSIS

Leadplane Training Course Leadplane Training Course Course Objectives Describe procedures for

Statistics II Xavier Vil Course 2004-2005 1.- Course Contents 2.- Course Resources 3.-

ARM Microcontroller Course June 3, 2015 ARM Microcontroller Course The Course Direct Digital

Course Home Page Course Design Course Structure main source reading-intensive course

Level 1, V2.0 Level 1, V2.0 1 Course Contents Course Contents Course Contents Course

Welcome back... Welcome back... ..to me. Welcome back... ..to me. Test out Welcome back...

Welcome to the Course on Negotiation Skills Course Objectives By the end of this course, you

Welcome to MA 16010! 1 / Course Information All basic course information can be found on the

Welcome! Welcome Welcome to the Copenhagen FreeSurfer course 2016! 3 days of lectures and

to the 1 year Foundation Course Aims of the Foundation course The course has four distinct

Fundamentals Tamuno Alfred, PhD Biostatistician DataCamp Designing and Analyzing Clinical

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture18: Alternative tests and

Testing proportions BIO5312 FALL2017 STEPHANIE J. SPIELMAN, PHD Estimation An estimator is a

Statistical Methods: Lecture 10 Dennis Dobler Vrije Universiteit Amsterdam December 6, 2017

Sta$s$cal Hypothesis Tes$ng Ghostbusters Ghostbusters How many

Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics

Section 3: Permutation Inference Yotam Shem-Tov Fall 2015 Yotam Shem-Tov STAT 239/ PS 236A

Gov 2002: 3. Randomization Inference Matthew Blackwell September 10, 2015 Where are we? Where