comparative judgement
play

Comparative Judgement An alternative approach to essay grading MEET - PowerPoint PPT Presentation

Randomly Distributed Comparative Judgement An alternative approach to essay grading MEET THE research team Dr. Cox Mornie Sims Dr. Eckstein Dr. Hartshorn Judson Hart Dr. Wilcox Col Reliability consistency d Validity authenticity


  1. Randomly Distributed Comparative Judgement An alternative approach to essay grading

  2. MEET THE research team Dr. Cox Mornie Sims Dr. Eckstein Dr. Hartshorn Judson Hart Dr. Wilcox

  3. Col Reliability consistency d Validity authenticity War

  4. reliability? 1880s – inconsistent scoring reliability → ? validity indirect → MC testing component skills highly reliable strongly correlated with writing grades

  5. validity? 1961 Study – opposite effect spurious correlations (# of bathrooms) teacher focus on component skills (Braddock, et al.) writing → active skill MC → passive, undue attention to less important features

  6. direct RELIABILITY IN writing assessme Rubrics Training nt Double-rating Adjudication MFRM

  7. THE rubric METHOD • Absolute judgment • External standard • Training/calibration

  8. comparativ RANDOMLY DISTRIBUTED e judgment • Comparison • Relative choice • Instinctual skill

  9. “There is no absolute judgment. All judgments are comparisons of one thing to another.” [Donald Laming]

  10. RDCJ RR & Implicit comparison Explicit comparison Training for consensus Minimizes training Unavoidable bias Minimizes bias MFRM Inherent algorithm

  11. HOW IT works.

  12. demo nomoremarking.com https://www.nomoremarking.com/demo1

  13. test it! nomoremarking.com https://www.nomoremarking.com/judges/reg/sLRRwmGAe65Wx3mbv

  14. CJ CJ eliminates common scoring biases Strictness vs leniency Central or extreme tendencies Additionally RATIONALE it is less cognitively demanding/time consuming per judgment Steedle and Ferrara, 2016 it requires less training evidence suggests that it is highly accurate (Gill & Bramley, 2008)

  15. comparative judgment …is a promising alternative, BUT is it… Reliable and Practical? and Can we trust the results?

  16. research question How does traditional rubric rating compare with MFRM (many facet Rasch model) and RDCJ (randomly distributed Comparative Judgment) in an ESL setting in terms of reliability , validity , and practicality ?

  17. Rater Group A Rater Group B Raters 4 Novice 4 Novice 4 Experienced 4 Experienced Analysis ANCOVA Essay Set 1 Essay Set 2 Essays 20% I. Samples t Tests (n=37) (n=38) Spearman's Rho Randomly Distributed Comparative Rubric Rating (RR) Ratings Judgment (RDCJ) MFRM Fair Average RDCJ True Score Figure 2. Study design to compare traditional rubric rating (RR) to multi-facet Rasch modeling (MFRM) and randomly distributed comparative judgment (RDCJ). Analysis of variance (ANOVA) run to test for effects on rating time and Spearman’s rho used to correlate between MFRM adjusted fair average, the study rubric rating fair averages, and RDCJ true scores to show evidence of validity.

  18. SELECTED Essays

  19. Rubric Ratin g WITHOUT MFRM

  20. Evidence RELIABILITY & VALIDITY

  21. Practicality DATA

  22. d COHEN’S

  23. t TESTS

  24. Covarianc ANALYSIS OF e

  25. Covarianc ANALYSIS OF e

  26. essay LENGTH & RATINGS

  27. CJ APPLICATIONS Especially suited to productive tasks Portfolios, essays, short answer Barkhaoui, 2016 Many subject areas Bramley, 2015 English, ESL, History, Geography Christodolou, 2016 Interesting Applications Mathematical problem solving Heldsinger & Humphrey, Peer Assessment (highly 2013 reliable & correlated with expert ratings)

  28. SUBJECT Areas

  29. Peer ASSESSMENT

  30. Peer ASSESSMENT (cont)

  31. calibrate d EXEMPLARS

  32. Comparative Judgment thank you! Mornie Sims Dr. Grant Eckstein eslmornie@gmail.com grant_eckstein@byu.edu Dr. Troy Cox Dr. K. James Hartshorn Troy_cox@byu.edu James_Hartshorn@byu.edu Dr. Matthew Wilcox Judson Hart wilcoxmp@byu.edu hatuhart@gmail.com

  33. essay prompt Identify one improvement that would make your city a better place to live for people your age and explain why people your age would benefit from this change. Use specific reasons and examples to support your opinion and describe the potential immediate and long-term consequences of this improvement. You have 30 minutes to write your response.

  34. Rubric STUDY

Recommend


More recommend