applied statistical analysis
play

Applied Statistical Analysis EDUC 6050 Week 13 Finding clarity - PowerPoint PPT Presentation

Applied Statistical Analysis EDUC 6050 Week 13 Finding clarity using data Today Categorical Outcomes 2 Categorical Outcomes For simple research questions Not controlling for other factors Doesnt provide a lot of information (ie.,


  1. Applied Statistical Analysis EDUC 6050 Week 13 Finding clarity using data

  2. Today Categorical Outcomes 2

  3. Categorical Outcomes For simple research questions Not controlling for other factors Doesn’t provide a lot of information (ie., only tells us difference or not) Logistic Chi Regression Square Simple Complex 3

  4. ID X Y General 1 0 0 Requirements 2 2 1 3 1 0 1. One or more 4 2 1 categorical 5 0 1 variables 6 0 1 Test of Independence 7 2 0 Goodness of Fit 8 1 0 4

  5. Hypothesis Testing with Chi Square (Independence) The same 6 step approach! 1. Examine Variables to Assess Statistical Assumptions 2. State the Null and Research Hypotheses (symbolically and verbally) 3. Define Critical Regions 4. Compute the Test Statistic 5. Compute an Effect Size and Describe it 6. Interpreting the results 5

  6. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data 2. Appropriate measurement of variables for the analysis 3. Expected frequency 5+ 6

  7. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data 2. Appropriate measurement of variables Individuals are independent of for the analysis each other (one person’s scores 3. Expected frequency 5+ does not affect another’s) 7

  8. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data 2. Appropriate measurement of variables for the analysis 3. Expected frequency 5+ Here we need interval/ratio outcome 8

  9. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions Variance around the line should 1. Independence of data be roughly equal across the 2. Appropriate measurement of variables whole line for the analysis 3. Expected frequency 5+

  10. 1 Examine Variables to Assess Statistical Assumptions Examining the Basic Assumptions 1. Independence: random sample 2. Appropriate measurement: know what your variables are 3. Expected frequency 5+: Check expected frequencies

  11. 2 State the Null and Research Hypotheses (symbolically and verbally) Hypothesis Symbolic Verbal Difference between Type means created by: Research Observed frequency is True relationship 𝑃𝐺 ≠ 𝐹𝐺 Hypothesis not equal to expected frequency Null Observed frequency is Random chance 𝑃𝐺 = 𝐹𝐺 Hypothesis the same as the (sampling error) expected frequency 11

  12. 3 Define Critical Regions How much evidence is enough to believe the null is not true? generally based on an alpha = .05 Use software’s p-value to judge if it is below .05 12

  13. 4 Compute the Test Statistic Jamovi Tutorial 13

  14. 5 Compute an Effect Size and Describe it 𝝍 𝟑 𝝍 𝟑 𝝔 = 𝝔 = Cramer’s 𝒐(𝒆𝒈) 𝒐 “Phi” 𝝔 Cramer’s 𝝔 Estimated Size of the Effect Close to .1 Depends Small Close to .3 on df Moderate Close to .5 (pg 557) Large 14

  15. 6 Interpreting the results “The voters’ opinions of the president’s policies were associated with the voters’ political affiliations, 𝝍 𝟑 (2, N = 58) = 16.40, p = .02, 𝝔 = .53. More democrats and fewer republicans approved of the president’s policies than would be expected by chance.” – pg 577. 15

  16. Logistic Regression 16

  17. Intro to Logistic Regression So far, we have always wanted continuous outcome variables But what if our outcome is a categorical variable?? Logistic Regression is just like linear regression but works with binary (dichotomous) outcomes • Substance Use or Not • Cancer or Not • Buy it or Not 17

  18. Logic of Logistic Regression We are trying to 1 find the best fitting S curve Y 0 X 18

  19. Logic of Logistic Regression We are trying to 1 find the best fitting S curve Y The curve is the model estimated probability of Y = 1 0 X 19

  20. Logistic Regression Simple Multiple • • Only one predictor in More than one variable in the model the model • • Tells you if that one Tells you if, while predictor is associated holding the other with the odds of Y = 1 variables constant, if that predictor is associated with the odds of Y = 1 20

  21. Logistic Regression • Logistic does what regression does but with a little bit of mathematical magic 𝒎𝒑𝒉𝒋𝒖(𝒁) = 𝜸 𝟏 + 𝜸 𝟐 𝒀 + 𝝑 21

  22. Logistic Regression • Logistic does what regression does but with a little bit of mathematical magic slope 𝒎𝒑𝒉𝒋𝒖(𝒁) = 𝜸 𝟏 + 𝜸 𝟐 𝒀 + 𝝑 intercept 22

  23. Logistic Regression • Logistic does what regression does but with a little bit of mathematical magic slope 𝒎𝒑𝒉𝒋𝒖(𝒁) = 𝜸 𝟏 + 𝜸 𝟐 𝒀 + 𝝑 unexplained stuff intercept in the odds of Y 23

  24. Logistic Regression 𝒎𝒑𝒉𝒋𝒖(𝒁) = 𝜸 𝟏 + 𝜸 𝟐 𝒀 + 𝝑 We have two variables, X and Y. X is continuous, Y is binary. We want Example to know if increases/decreases in X are associated (or predict) changes in the chance of Y equaling 1. 24

  25. Logistic Regression • It is trying to predict the outcome accurately using the information from the predictor • Better prediction tells us that the predictor(s) is/are more strongly related to the outcome 25

  26. ID X Y General 1 8 0 Requirements 2 6 1 3 9 1 1. Two or more 4 7 1 variables, 5 7 0 2. Outcome needs to be 6 8 0 binary 3. Others can be 7 5 1 continuous or 8 5 0 categorical 26

  27. Hypothesis Testing with Logistic Regression The same 6 step approach! 1. Examine Variables to Assess Statistical Assumptions 2. State the Null and Research Hypotheses (symbolically and verbally) 3. Define Critical Regions 4. Compute the Test Statistic 5. Compute an Effect Size and Describe it 6. Interpreting the results 27

  28. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data 2. Appropriate measurement of variables for the analysis 3. Normality of distributions 4. Homoscedastic 28

  29. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data 2. Appropriate measurement of variables Individuals are independent of for the analysis each other (one person’s scores 3. Normality of distributions does not affect another’s) 4. Homoscedastic 29

  30. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data 2. Appropriate measurement of variables for the analysis 3. Normality of distributions 4. Homoscedastic Here we need nominal outcome 30

  31. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions Residuals should be normally 1. Independence of data distributed 2. Appropriate measurement of variables for the analysis 3. Normality of distributions 4. Homoscedastic

  32. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data Variance around the line should 2. Appropriate measurement of variables be roughly equal across the for the analysis whole line 3. Normality of distributions 4. Homoscedastic 32

  33. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data 2. Appropriate measurement of variables for the analysis 3. Normality of distributions 4. Homoscedastic 5. Logistic Relationship 6. No omitted variables 33

  34. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data 2. Appropriate measurement of variables The “S-shaped” curve should fit for the analysis to the data 3. Normality of distributions 4. Homoscedastic 5. Logistic Relationships 6. No omitted variables 34

  35. 1 Examine Variables to Assess Statistical Assumptions Basic Assumptions 1. Independence of data 2. Appropriate measurement of variables Any variable that is related to for the analysis both the predictor and the 3. Normality of distributions outcome should be included in 4. Homoscedastic the regression model 5. Logistic Relationships 6. No omitted variables 35

  36. 1 Examine Variables to Assess Statistical Assumptions Examining the Basic Assumptions 1. Independence: random sample 2. Appropriate measurement: know what your variables are 3. Normality: Histograms, Q-Q, skew and kurtosis 4. Homoscedastic: Scatterplots 5. Logistic: Scatterplots 6. No Omitted: check correlations, know the theory

  37. 2 State the Null and Research Hypotheses (symbolically and verbally) Hypothesis Symbolic Verbal Difference between Type means created by: Research X predicts Y True relationship 𝛾 ≠ 0 Hypothesis Null There is no real Random chance 𝛾 = 0 Hypothesis relationship. (sampling error) 37

  38. 3 Define Critical Regions How much evidence is enough to believe the null is not true? generally based on an alpha = .05 Use software’s p-value to judge if it is below .05 38

  39. 4 Compute the Test Statistic Click on “2 Outcomes Binomial” 39

  40. 4 Compute the Test Statistic Results Outcome goes here Continuous predictors go here Other model options Categorical predictors go here 40

Recommend


More recommend