fleshing out systems of assessment in the context of the
play

Fleshing out Systems of Assessment in the Context of the Next - PowerPoint PPT Presentation

Fleshing out Systems of Assessment in the Context of the Next Generation Science Standards Nathan Dadey Center for Assessment National Conference On Student Assessment San Diego, CA This work is licensed under June 27 th , 2018 a Creative


  1. Fleshing out Systems of Assessment in the Context of the Next Generation Science Standards Nathan Dadey Center for Assessment National Conference On Student Assessment San Diego, CA This work is licensed under June 27 th , 2018 a Creative Commons Attribution 4.0 International License.

  2. ` 1 Systems of Assessment • Definition & Context • Re-framing 6/27/2018 2

  3. Why? 6/27/2018 3

  4. Why? Why systems of assessment? 6/27/2018 4

  5. Too many assessments given by too many people for too many purposes, all providing too little information. The alternative? Develop a system of assessments. ^Work towards 6/27/2018 Definition & Context | Re-framing 5

  6. One Starting Point Call for “coordinated systems of multiple assessments that work together, along with curriculum and instruction, to promote learning” (p. 242). Pellegrino, Chudowsky & Glaser, 2001 6/27/2018 Definition & Context | Re-framing 6

  7. Conceptual work defining “balanced”, “next generation” or “comprehensive” assessment systems followed. 6/27/2018 Definition & Context | Re-framing 7

  8. Conceptual work defining “balanced”, “next generation” or “comprehensive” assessment systems followed. Which led to the National Research Council Report, Developing Assessments for the Next Generation Science Standards. 6/27/2018 Definition & Context | Re-framing 8

  9. Systems & the NGSS A system of assessments will be needed to measure the NGSS performance expectations and provide students, teachers, administrators, policy makers, and the public with the information each needs about student learning. National Research Council, 2014 6/27/2018 Definition & Context | Re-framing 9

  10. Systems & the NGSS A system of assessments will be needed to measure “It will not be feasible to assess all of the NGSS performance the performance expectations for a expectations and provide given grade level during a single students, teachers, assessment occasion. Students will need administrators, policy multiple—and varied—assessment makers, and the public with opportunities” the information each needs about student learning. National Research Council, 2014 6/27/2018 Definition & Context | Re-framing 10

  11. So what exactly is a system of assessments? Here’s my working definition. 6/27/2018 11

  12. Multiple assessments with potentially different designs , sponsored by different people, who are at different levels of control Coordinated by a common theory of learning Working together to meet a specific use or uses. 6/27/2018 12

  13. Despite all of this work, much of the vision laid out by the NRC report and others has not been realized. 6/27/2018 13

  14. A Response • Instead of working from the perspective that a system is tightly defied and purpose built, • Consider all of the assessments students take as a system – Then consider the ways which the levels, or layers, of that system can be made to work together, i.e., complement one another 6/27/2018 Definition & Context | Re-framing 14

  15. Operationalizing in Terms of Layers Let’s consider one way to visualize the layers of an assessment system. 6/27/2018 Definition & Context | Re-framing 15

  16. Operationalizing in Terms of Layers Purpose Level or Layer Timing 6/27/2018 Definition & Context | Re-framing 16

  17. Operationalizing in Terms of Layers Purpose Level or Layer There is no one “system.” Instead, within each layer decision makers build, or at least implement, their own set of assessments. Timing 6/27/2018 Definition & Context | Re-framing 17

  18. Operationalizing in Terms of Layers Purpose Level or Layer Thus each classroom’s, school’s or district’s layer may look different. Meaning within any given state, there are numerous “systems,” when taken as a whole. Timing 6/27/2018 Definition & Context | Re-framing 18

  19. Operationalizing in Terms of Layers Purpose Level or Layer By juxtaposing the layers, hopefully we can start thinking about ways the layers can complement one another. Timing 6/27/2018 Definition & Context | Re-framing 19

  20. This approach doesn’t fully meet the vison for a system of assessments, but it is a practical start. 6/27/2018 Definition & Context | Re-framing 20

  21. By showing what is actually going on, we can then start to think about ways in which each layer can be made more complementary, along the lines of comprehensiveness, coherence and continuity. 6/27/2018 Definition & Context | Re-framing 21

  22. A Starting Point • Define the state layer as concretely as possible, then examine where and how the district layer can be made more complementary (and vice-versa). • Consider key chunks of each layer: – Use – Assessment Design – Assessment Framework (i.e., Domain) – Measurement 6/27/2018 Definition & Context | Re-framing 22

  23. A Starting Point • Define the district layer as concretely as possible, then examine where and how the district layer can be made more complementary (and vice-versa). • Consider key chunks of each layer: – Use – Assessment Framework (i.e., Domain) – Assessment Design – Measurement 6/27/2018 Definition & Context | Re-framing 23

  24. 2 The District Layer • Assessment Framework • Assessment Design 6/27/2018 24

  25. Assessment Framework What does the assessment measure and how does it do so? Possible focuses of district assessment: • The same domain as the state-level… – exactly – at a lesser degree of complexity • A different domain at the state level… – progressions or precursors to the state-level domain – parts of the standards not covered by the domain 6/27/2018 25

  26. Complementarity District State The Same Domain Model 6/27/2018 26

  27. Complementarity District State The Different Domain Model 6/27/2018 27

  28. Assessment Design What does the assessment(s) look like? Often, sensitivity to curriculum and instruction is paramount for district-level assessments. One way to make assessments better match local curriculum and instruction is through modular design. 6/27/2018 28

  29. Fixed Designs Block Designs Modular Designs Multiple A single Multiple assessment that assessments, each assessments, each measures the measuring a chunk measuring a very of the domain entire domain small chunk of the domain Fixed Modular 6/27/2018

  30. Supplemental Slides 6/28/2018 3D Science Assessment Reporting 30

  31. The State Layer • Uses • Assessment Framework • Measurement 6/27/2018 31

  32. Uses How are the assessment results used? • For better or worse, state-level assessments are often “one use, and one use only” – As part of a system of school identification • Unlike Math and Reading/English Language Arts, Science is not a mandatory part of state accountability plans 1 under ESSA. – And, possibly, to signal the importance of a given set of standards • Any other uses are likely to not be directly supported via the state-layer alone. – The massive use of assessments by districts is testament to this. 1 Although an initial look at state plans did suggest many states were including science in their accountability systems . 6/27/2018 32

  33. Uses How are the assessment results used? • For any intended use, a key question is on sufficiency: – Is the use of the state-level assessment results likely to lead to the intended outcome (hopefully when couched in terms of a theory of action)? 6/27/2018 33

  34. Defining the use should help us determine what we want to measure, and vice versa. 6/27/2018 34

  35. Assessment Framework What does the assessment measure and how? • At the state-level, the domain is almost always premised on an approximation of the domain sampling approach • Even under this approach, defining the domain in any subject requires a judgmental process to translate standards into assessment specifications – As a reflection of the nature of the NGSS, the judgements for the NGSS seem particularly complex 6/27/2018 35

  36. Assessment Framework In terms of the NGSS, much of this translation NGSS Specific involves the Performance Expectations (PEs), including whether: • The state considers the PEs as the standards or as examples of valuable intersections – If the latter, will new PEs be defined? • All PEs vs. Full Sampling (aka Matrix, within or across years) vs. a Limited Sample are assessed 6/27/2018 36

  37. Assessment Framework Also key is in which the PEs are mapped to NGSS Specific items, and item clusters, including – Whether and how PEs are bundled to create item clusters – Phenomenon also play a substantial role in cluster design 6/27/2018 37

  38. These types of questions help better define the domain. A detailed domain definition comes from a back and forth between design choices and domain definition. 6/27/2018 38

  39. These types of questions help better define the domain. A detailed domain definition comes from a back Design Design Domain Domain and forth between design choices and domain Definition Definition Decisions Decisions definition. 6/27/2018 39

Recommend


More recommend