Week 3 Video 2 Data Synchronization and Grain-Sizes

You have ground truth training labels… ◻ How do you connect them to your log files? ◻ The problem of synchronization ◻ Turns out to be intertwined with the question of what grain-size to use

Grain-size ◻ What level do you want to detect the construct at?

Orienting Example ◻ Let’s say that you want to detect whether a student is gaming the system, and you have field observations of gaming ◻ Each observation has an entry time (e.g. when the coder noted the observation), but no start of observation time ◻ The problem is similar even if you have a time for the start of each observation

Data [Figure: observation timeline from Monday 8am through Monday 3pm to Friday 3pm; observations marked Gaming or Not Gaming]

Data [Figure: observation timeline, Monday 8am to Friday 3pm] ◻ Notice the gap; maybe students were off this day… or maybe the observer couldn’t make it

Orienting Example ◻ What grain-size do you want to detect gaming at? ◻ Student-level? ◻ Day-level? ◻ Lesson-level? ◻ Problem-level? ◻ Observation-level? ◻ Action-level?

Student level ◻ Average across all of your observations of the student, to get the percent of observations that were gaming

Student level [Figure: observation timeline, Monday 8am to Friday 3pm] ◻ 5 Gaming ◻ 10 Not Gaming ◻ This student is 33.33% Gaming
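The student-level calculation can be sketched in a few lines of Python. This is a minimal illustration, not the course's own code: the `(student_id, label)` tuple format, the label strings, and the function name `percent_gaming_by_student` are all assumptions made for the example.

```python
from collections import defaultdict


def percent_gaming_by_student(observations):
    """Return {student_id: percent of that student's observations coded as gaming}.

    `observations` is an iterable of (student_id, label) pairs; this layout
    is a hypothetical one chosen for the sketch.
    """
    totals = defaultdict(int)
    gaming = defaultdict(int)
    for student_id, label in observations:
        totals[student_id] += 1
        if label == "gaming":
            gaming[student_id] += 1
    return {s: 100.0 * gaming[s] / totals[s] for s in totals}


# One student with 5 gaming and 10 not-gaming observations, as on the slide.
obs = [("s1", "gaming")] * 5 + [("s1", "not gaming")] * 10
rates = percent_gaming_by_student(obs)
print(round(rates["s1"], 2))  # 33.33
```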

Notes ◻ Seen early in behavior detection work, when synchronization was difficult (cf. Baker et al., 2004) ◻ Makes sense sometimes ○ When you want to know how much students engage in a behavior ○ To drive overall reporting to teachers, administrators ○ To drive very coarse-level interventions ■ For example, if you want to select six students to receive additional tutoring over the next month

Day level ◻ Average across all of your observations of the student on a specific day, to get the percent of observations that were gaming

Day level [Figure: observation timeline, Monday 8am to Friday 3pm] ◻ Monday: 40% ◻ Tuesday: 0% ◻ Wednesday: 20% ◻ Thursday: 0% ◻ Friday: 40%
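Day-level aggregation is the same averaging with a finer grouping key. The sketch below assumes each observation is a hypothetical `(student_id, entered_at, label)` tuple, where `entered_at` is the time the coder entered the observation; the dates are made up for illustration.

```python
from collections import defaultdict
from datetime import datetime


def percent_gaming_by_day(observations):
    """Return {(student_id, date): percent of observations coded as gaming}."""
    counts = defaultdict(lambda: [0, 0])  # key -> [gaming count, total count]
    for student_id, entered_at, label in observations:
        key = (student_id, entered_at.date())
        counts[key][1] += 1
        if label == "gaming":
            counts[key][0] += 1
    return {key: 100.0 * g / n for key, (g, n) in counts.items()}


# Hypothetical data: 2 of 5 observations are gaming on day one, 0 of 4 on day two.
day_one = datetime(2024, 1, 1, 8, 0)
day_two = datetime(2024, 1, 2, 8, 0)
day_obs = (
    [("s1", day_one, "gaming")] * 2
    + [("s1", day_one, "not gaming")] * 3
    + [("s1", day_two, "not gaming")] * 4
)
day_rates = percent_gaming_by_day(day_obs)
print(day_rates[("s1", day_one.date())])  # 40.0
```

Swapping the date in the key for a lesson or problem identifier would give the lesson-level or problem-level percentages in the same way, provided each observation has already been matched to a lesson or problem.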

Notes ◻ Affords finer intervention than student-level ◻ Still better suited to coarse-level interventions

Lesson level ◻ Average across all of your observations of the student within a specific lesson, to get the percent of observations that were gaming

Lesson level [Figure: observation timeline, Monday 8am to Friday 3pm] ◻ Lesson 1: 40% gaming ◻ Lesson 2: 30% gaming

Notes ◻ Can be used for end-of-lesson interventions ◻ Can be used for evaluating lesson quality

Problem level ◻ Average across all of your observations of the student within a specific problem, to get the percent of observations that were gaming

Problem level [Figure: observation timeline, Monday 8am to Friday 3pm]

Notes ◻ Can be used for end-of-problem or between-problem interventions ○ Fairly common type of intervention ◻ Can be used for evaluating problem quality

Challenge ◻ Sometimes observations cut across problems ◻ You can assign the observation to ○ the problem in progress when the observation was entered ○ the problem which had the majority of the observation time ○ both problems
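The three assignment options can be compared side by side in a short sketch. Everything here is an assumption for illustration: times are plain seconds, problems are hypothetical `(problem_id, start, end)` tuples, and the strategy names `"entered"`, `"majority"`, and `"all"` are invented labels. With more than two problems, `"majority"` below simply picks the problem with the largest overlap.

```python
def overlap(a_start, a_end, b_start, b_end):
    """Length of the overlap between two time intervals (0 if they are disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_problems(obs_start, obs_end, problems, strategy):
    """Return the list of problem ids an observation is assigned to."""
    overlaps = [(pid, overlap(obs_start, obs_end, s, e)) for pid, s, e in problems]
    overlaps = [(pid, o) for pid, o in overlaps if o > 0]
    if strategy == "entered":
        # Problem in progress at the moment the observation was entered.
        return [pid for pid, s, e in problems if s <= obs_end < e]
    if strategy == "majority":
        # Problem that covers the largest share of the observation window.
        return [max(overlaps, key=lambda x: x[1])[0]] if overlaps else []
    if strategy == "all":
        # Every problem the observation window touches.
        return [pid for pid, _ in overlaps]
    raise ValueError(f"unknown strategy: {strategy}")


# Hypothetical observation from t=85s to t=105s spanning two problems.
problems = [("p1", 0, 100), ("p2", 100, 200)]
print(assign_problems(85, 105, problems, "entered"))   # ['p2']
print(assign_problems(85, 105, problems, "majority"))  # ['p1']
print(assign_problems(85, 105, problems, "all"))       # ['p1', 'p2']
```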

Observation level ◻ Take each observation, and try to predict it

Observation level [Figure: observation timeline, Monday 8am to Friday 3pm; observations marked Gaming or Not Gaming]

Notes ◻ “Most natural” mapping ◻ Affords close-to-immediate intervention ◻ Also supports fine-grained discovery-with-models analyses

Challenge ◻ Synchronizing observations with log files ◻ Need to determine the time window in which the observation occurred ○ Usually there is only an end-time for field observations; you have to guess the start-time ○ Even if you have a start-time, exactly where in the window did the desired behavior occur? ○ How much do you trust your synchronization between observations and logs? ■ If you don’t trust it very much, you may want to use a wider window
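One way to make the window explicit is to guess a start time from the entry time and widen the window when the clocks are not well synchronized. The sketch below is only an illustration: the 20-second default duration, the `slack_s` parameter, and the `{"time": ...}` action format are assumptions, not values from the lecture.

```python
from datetime import datetime, timedelta


def clip_window(entry_time, assumed_duration_s=20.0, slack_s=0.0):
    """Guess the (start, end) window for an observation with only an entry time.

    `assumed_duration_s` is a guessed observation length; `slack_s` widens
    the window on both sides when synchronization is less trusted.
    """
    start = entry_time - timedelta(seconds=assumed_duration_s + slack_s)
    end = entry_time + timedelta(seconds=slack_s)
    return start, end


def actions_in_window(actions, start, end):
    """Select logged actions whose timestamp falls inside the window."""
    return [a for a in actions if start <= a["time"] <= end]


# Hypothetical log actions around an observation entered at 10:00:30.
base = datetime(2024, 1, 1, 10, 0, 0)
log = [{"time": base + timedelta(seconds=s)} for s in (5, 15, 28, 40)]
entry = base + timedelta(seconds=30)

tight = actions_in_window(log, *clip_window(entry))
wide = actions_in_window(log, *clip_window(entry, slack_s=10.0))
print(len(tight), len(wide))  # 2 4
```

A wider window captures more of the actions that may belong to the observation, at the cost of pulling in actions that may not.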

Challenge ◻ How do you transform from action-level logs to time-window-level clips? ○ You can conduct careful feature engineering to create meaningful features out of all the actions in a clip ○ Or you can just hack together counts, averages, standard deviations, minima, and maxima from the features of the actions in a clip (cf. Sao Pedro et al., 2012; Baker et al., 2012)
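The quick summary-statistics approach might look like the following sketch. The feature names `time_taken` and `is_hint` are hypothetical, and the choice of population standard deviation (`statistics.pstdev`, which is defined even for a single action) is an assumption made so that one-action clips do not raise an error.

```python
import statistics


def clip_features(actions, feature_names):
    """Summarize the actions in one clip as count, mean, std, min, and max."""
    feats = {"n_actions": len(actions)}
    for name in feature_names:
        values = [a[name] for a in actions]
        if not values:
            continue  # empty clip: only the action count is reported
        feats[f"{name}_mean"] = statistics.fmean(values)
        feats[f"{name}_std"] = statistics.pstdev(values)
        feats[f"{name}_min"] = min(values)
        feats[f"{name}_max"] = max(values)
    return feats


# Hypothetical clip with three actions.
clip = [
    {"time_taken": 2.0, "is_hint": 1},
    {"time_taken": 4.0, "is_hint": 0},
    {"time_taken": 6.0, "is_hint": 1},
]
features = clip_features(clip, ["time_taken", "is_hint"])
print(features["n_actions"], features["time_taken_mean"])  # 3 4.0
```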

Action level ◻ You could also apply your observation labels to each action in the time window ◻ And then fit a model at the level of actions ○ Treating actions from the same clip as independent from one another ◻ Offers the potential for truly immediate intervention
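Applying observation labels to actions amounts to copying each clip's label onto every action inside its window. A minimal sketch, assuming times in seconds, hypothetical `(clip_id, start, end, label)` clip tuples, and dictionary-style actions:

```python
def label_actions(actions, clips):
    """Copy each clip's label onto every action that falls inside its window.

    Actions outside every window are left out; an action that falls in two
    overlapping windows appears once per window.
    """
    rows = []
    for clip_id, start, end, label in clips:
        for action in actions:
            if start <= action["time"] <= end:
                rows.append({**action, "clip_id": clip_id, "label": label})
    return rows


# Hypothetical actions (times in seconds) and two labeled clips.
logged = [{"time": 1}, {"time": 5}, {"time": 12}, {"time": 30}]
labeled_clips = [("c1", 0, 10, "gaming"), ("c2", 10, 20, "not gaming")]
labeled = label_actions(logged, labeled_clips)
print([(r["time"], r["label"]) for r in labeled])
# [(1, 'gaming'), (5, 'gaming'), (12, 'not gaming')]
```

Keeping `clip_id` on each row makes it possible to group actions by clip again later, for example when evaluating predictions at the clip level.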

Action level ◻ Some models identify the overall construct at the action level, but validate at the clip level (Paquette et al., 2015) ◻ Less certain, action by action, but allows more rapid and targeted intervention
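Validating at the clip level requires rolling action-level predictions back up to clips. The sketch below uses mean predicted probability with a 0.5 threshold; that aggregation rule is an assumption chosen for illustration and is not claimed to be the procedure used by Paquette et al. (2015).

```python
from collections import defaultdict


def clip_level_accuracy(action_preds, clip_labels, threshold=0.5):
    """Aggregate action-level probabilities per clip and score against clip labels.

    `action_preds` is an iterable of (clip_id, probability) pairs and
    `clip_labels` maps clip_id to True/False; both layouts are hypothetical.
    """
    by_clip = defaultdict(list)
    for clip_id, prob in action_preds:
        by_clip[clip_id].append(prob)
    correct = 0
    for clip_id, probs in by_clip.items():
        predicted = sum(probs) / len(probs) >= threshold
        correct += predicted == clip_labels[clip_id]
    return correct / len(by_clip)


# Hypothetical action-level predictions for three clips.
preds = [("c1", 0.9), ("c1", 0.4), ("c1", 0.8),
         ("c2", 0.2), ("c2", 0.6),
         ("c3", 0.7), ("c3", 0.9)]
truth = {"c1": True, "c2": False, "c3": False}
accuracy = clip_level_accuracy(preds, truth)
print(round(accuracy, 3))  # 0.667
```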

Bottom-line ◻ There are several grain-sizes you can build models at ◻ Which grain-size you use determines ○ How much work you have to put in (coarser grain-sizes are less work to set up) ○ When you can use your models (more immediate use requires finer grain-sizes) ◻ It also influences how good your models are, although not in a perfectly deterministic way

Next Lecture ◻ Feature Engineering
