Decisions to be made in developing an adaptive testing system for K–12 education
G. Gage Kingsbury, March 9, 2012 (PowerPoint PPT presentation)


  1. Decisions to be made in developing an adaptive testing system for K–12 education. G. Gage Kingsbury, March 9, 2012

  2. Welcome and Introduction

  3. Presenter: G. Gage Kingsbury, Vice President for the International Association for Computerized Adaptive Testing (IACAT) and Senior Research Fellow at the Northwest Evaluation Association (NWEA)

  4. Decisions to be made in developing an adaptive testing system for K–12 education

  5. The Idea: An adaptive test is a test that adjusts its characteristics based on the performance of a test taker.
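That adjustment can be sketched as a simple up/down rule on the difficulty of the next item (a minimal illustration of the idea only, not NWEA's actual algorithm; the function name and the 10-point step are invented):

```python
def next_difficulty(current: float, correct: bool, step: float = 10.0) -> float:
    """Raise the target difficulty of the next item after a correct
    answer and lower it after an incorrect one (RIT-like scale)."""
    return current + step if correct else current - step

# Two correct answers followed by a miss: 200 -> 210 -> 220 -> 210
difficulty = 200.0
for correct in (True, True, False):
    difficulty = next_difficulty(difficulty, correct)
```

Operational systems select items by IRT information rather than a fixed step, but the adapt-on-response principle is the same.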

  6. Questions and Answers

  7. Computerized Adaptive Testing: [chart of a 20-item test showing the achievement score (RIT scale, 150 to 250) converging on the student's level question by question, with Advanced, Proficient, and Basic cut bands marked]

  8. Pioneers of adaptive testing • Alfred Binet • Frederick Lord • David J. Weiss • Fumiko Samejima • Mark Reckase

  9. First implementers • David Foster • Jim McBride • Tony Zara • Gage Kingsbury

  10. You have chosen to use an adaptive test because … • It can be more efficient than a fixed-form test • It provides good information across a broader spectrum of student performance • It can provide immediate scoring and reporting • It can provide better security than a fixed-form test • It can be designed to measure growth

  11. Since the first implementations • We have seen international growth in the use of CAT for – Educational testing – Medical outcomes assessment – Certification and licensure

  12. Accuracy of adaptive tests • Compared to a fixed-form test • As a function of test length • Depending on termination procedure

  13. Relationship between Spring and Fall Reading Scores: [scatter plot of Fall RIT vs. Spring RIT, both 150 to 250, comparing paper-to-CAT with paper-to-paper testing]

  14. Test Information Functions for Grade 4 Mathematics: [plot of information (.00 to .12) against RIT (165 to 245); students' mean = 211.7, s.d. = 11.11; Proficient cut = 205, Basic cut = 192]
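The information function on this slide can be computed directly from item parameters. A sketch under the Rasch (one-parameter logistic) model, where an item contributes p(1 - p) information at ability theta (the difficulty values below are invented for illustration):

```python
import math

def rasch_test_information(theta: float, difficulties: list[float]) -> float:
    """Test information at ability theta under the Rasch model:
    I(theta) = sum over items of p * (1 - p)."""
    info = 0.0
    for b in difficulties:
        p = 1.0 / (1.0 + math.exp(-(theta - b)))  # probability of a correct answer
        info += p * (1.0 - p)
    return info

pool = [-1.0, -0.5, 0.0, 0.5, 1.0]
info = rasch_test_information(0.0, pool)
sem = 1.0 / math.sqrt(info)  # standard error of measurement at theta = 0
```

Information peaks where item difficulties cluster near theta, which is why a well-built test's function tops out near the score region that matters most.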

  15. Choosing to use an adaptive test requires making a series of decisions in the areas of… • Psychometrics • Interface (including accommodations) • Item designs • Test designs • Test distribution • Item usage • Item and test security • Proctor training • Reporting

  16. Basics of a theoretical CAT • IRT model • Item pool • Select first item • Select next item • Terminate test • Score
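Those pieces can be wired together in a toy simulation. This is a hedged sketch, not a production design: it assumes a Rasch model, picks the unused item nearest the current estimate (where Rasch information peaks), uses a crude shrinking-step update in place of real scoring, and stops after a fixed 10 items.

```python
import math
import random

def rasch_p(theta: float, b: float) -> float:
    """Probability of a correct response under the Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def run_cat(pool: list[float], true_theta: float, n_items: int = 10,
            seed: int = 0) -> float:
    """Minimal CAT loop: select, administer (simulated), update, stop."""
    rng = random.Random(seed)
    theta, used = 0.0, set()
    for n in range(1, n_items + 1):
        # Select the unused item whose difficulty is closest to theta.
        idx, b = min(((i, d) for i, d in enumerate(pool) if i not in used),
                     key=lambda item: abs(item[1] - theta))
        used.add(idx)
        correct = rng.random() < rasch_p(true_theta, b)  # simulated response
        theta += (1.0 / n) if correct else -(1.0 / n)    # crude step update
    return theta

pool = [i / 10.0 for i in range(-30, 31)]  # difficulties from -3.0 to 3.0
estimate = run_cat(pool, true_theta=1.0)
```

Operational systems replace the step update with maximum-likelihood or Bayesian scoring and layer blueprint constraints on selection, but the select/respond/update/terminate skeleton is the same.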

  17. Decision areas for an operational CAT for measuring student achievement • Before the test (Test stuff) – How will we develop the measurement scale? – What mix of item styles will we need? – Which IRT model is appropriate? – What depth do we need in our item bank? – How will we choose an operational item pool? – What will our test blueprint include? – How will we QA everything involved?

  18. Questions and Answers

  19. Decision areas for an operational CAT for measuring student achievement • Before the test (School stuff) – School, teacher, and student identification – Establishing a testing environment – Teacher training – Software/hardware setup – Proctor training – Student familiarization – Student scheduling – QA

  20. Decision areas for an operational CAT for measuring student achievement • Test administration – Student verification process – Test selection – Proctor throughout – Identify previously used items

  21. Decision areas for an operational CAT for measuring student achievement • Test event – Apply test blueprint – Select first item or set of items – Check for effort – Update item-selection theta-hat – Update constraints – Select next item – Terminate test
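The "apply test blueprint" and "update constraints" steps amount to bookkeeping over content areas during item selection. A sketch of one simple greedy rule (the area names and target shares are invented; real blueprints carry many more constraint types):

```python
def next_content_area(blueprint: dict[str, float],
                      counts: dict[str, int]) -> str:
    """Return the content area whose administered share lags its
    blueprint target the most, so the next item is drawn from it."""
    total = sum(counts.values()) or 1  # avoid dividing by zero at the start
    return max(blueprint,
               key=lambda area: blueprint[area] - counts.get(area, 0) / total)

blueprint = {"number": 0.4, "algebra": 0.3, "geometry": 0.3}  # target shares
counts = {"number": 5, "algebra": 2, "geometry": 1}           # items given so far
area = next_content_area(blueprint, counts)  # geometry is furthest behind
```

Constraining selection this way trades a little statistical information for blueprint fidelity, which is the central tension in operational CAT design.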

  22. Decision areas for an operational CAT for measuring student achievement • After the test – Calculate final score – Calculate growth – Terminate test session – Store data – Identify student as completing test – Compare to norms, growth norms, content, etc. – Create individual student report – Add information to teacher/administrator reports

  23. Measuring growth and adaptive testing • Measuring at multiple points in time • The standard deviation of growth • The standard error of growth • Reduction of uncertainty • Growth and instruction
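Because the two test events are scored independently, the standard error of a growth score follows from the fall and spring standard errors by adding in quadrature. A small worked sketch (the scores and SEs below are invented):

```python
import math

def growth_with_se(fall: float, fall_se: float,
                   spring: float, spring_se: float) -> tuple[float, float]:
    """Observed growth and its standard error:
    SE_growth = sqrt(SE_fall^2 + SE_spring^2) for independent errors."""
    return spring - fall, math.sqrt(fall_se ** 2 + spring_se ** 2)

growth, se = growth_with_se(fall=200.0, fall_se=3.0, spring=208.0, spring_se=3.0)
# 8 points of growth with SE = sqrt(18), about 4.24: a difference score is
# always less certain than either single score
```

This is why "reduction of uncertainty" matters: shrinking the SEM of each test event shrinks the growth SE directly.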

  24. Adaptive testing and idiosyncratic knowledge patterns • Can there be multiple thetas without multidimensionality? • Selecting items to reveal knowledge patterns • A simple algorithm • The impact on instruction

  25. Field testing within an adaptive testing system • Calibration differences from paper to CAT • Random sampling for calibration in CAT • Using provisional calibrations in CAT field tests

  26. Cautionary notes • Adaptive testing needs to be well tuned to avoid bad tests. • The item pool must support the stakes. • Adaptive testing changes, but doesn’t eliminate, security issues. – Brain dump sites • Limit desire. No test can do everything. • Adaptive test development is never done.

  27. Have fun • The decisions to be made should consider the good of the students for whom the test is designed. • Don’t try to build the perfect test; it won’t be. • Consider a “dry eye” policy: making kids cry isn’t the purpose of the test.

  28. Thank you. Gage Kingsbury, gagekingsbury@comcast.net
