automatic classifiers as scientific instruments one step
play

Automatic Classifiers as Scientific Instruments: One Step Further - PowerPoint PPT Presentation

Automatic Classifiers as Scientific Instruments: One Step Further Away from Ground-Truth Jacob Whitehill and Anand Ramakrishnan Worcester Polytechnic Institute (WPI), Massachusetts, USA ICML2019 Machine learning to advance basic


  1. Automatic Classifiers as Scientific Instruments: 
 One Step Further Away from Ground-Truth Jacob Whitehill and Anand Ramakrishnan 
 Worcester Polytechnic Institute (WPI), Massachusetts, USA ICML’2019

  2. Machine learning to advance basic science • Machine perception can advance basic science in: • Psychology • Education • Medicine • …by providing automatic classifiers as new scientific instruments, e.g.: • Automatic stress detectors from wrist monitors 
 Empatica E4 EDA instead of questionnaires. • Facial action unit detectors from video 
 Emotient/ iMotions instead of electromyography. • Student engagement detectors from video 
 Kaur et al. instead of observational protocols. 2018 ICML’2019

  3. Correlation study • Suppose a researcher wishes to measure the relationship between two constructs U and V , e.g.: • U = stress • V = academic performance. • Standard methodology: • Use a standard measurement tool (e.g., survey, observational protocol) to estimate the values of U and V from a sample of n participants. • This produces two vectors , which we can u , v ∈ R n assume w.l.o.g. have 0-mean and 1-length. • Estimate the correlation between U and V as: r = ρ ( u , v ) = u > v = cos ∠ ( u , v ) Only the angle between the two vectors determines their correlation. ICML’2019

  4. Correlation study • But what if the researcher instead uses an automatic stress detector d whose correlation with ground-truth measurements is q (known from prior validation)? • Instead of , the researcher obtains a vector . u ˆ u • What kind of spurious deductions about the correlation between U and V could result? ICML’2019

  5. Trivariate correlation v 1.00 0.75 0.50 r 0.25 u 0.00 −0.25 −0.50 −0.75 −1.00 −1.0 −0.5 0.0 0.5 1.0 • Suppose and are ground-truth values of U and V . u v • The correlation between and is r = cos(105°) = -.259. u v ICML’2019

  6. Trivariate correlation v 1.00 0.75 0.50 0.25 u 0.00 q −0.25 −0.50 u ˆ −0.75 −1.00 −1.0 −0.5 0.0 0.5 1.0 • Using a detector d , the researcher might obtain , whose u ˆ correlation with is q . u • The correlation between and is cos(135°)= -.707 — u ˆ v much larger than, but same sign as, the ground-truth correlation. ICML’2019

  7. Trivariate correlation v 1.00 0.75 u 0 ˆ 0.50 q 0.25 u 0.00 −0.25 −0.50 u ˆ −0.75 −1.00 −1.0 −0.5 0.0 0.5 1.0 • But they might also obtain vector , whose correlation 
 u 0 ˆ with is also q . u • The correlation between and is cos(75°)= +.259 — u 0 ˆ v this is the opposite sign as the ground-truth correlation. We call this a false correlation. ICML’2019

  8. Main results 1.The set of all vectors whose correlation with is q , is an u T n ∈ R n ( n -3)-sphere . 2.If the correlation between and is r , then the expected sample correlation between and , where is drawn uniformly at random from , is qr . 3.We derive a formula h ( n , q , r ) for the probability of a false correlation. 4.We show that h is monotonically decreasing in q and n . ICML’2019

  9. Main results 1.The set of all vectors whose correlation with is q , is an u T n ∈ R n ( n -3)-sphere . 2.If the correlation between and is r , then the expected u v sample correlation between and , where is drawn u ˆ u ˆ v uniformly at random from , is qr . T n 3.We derive a formula h ( n , q , r ) for the probability of a false correlation. 4.We show that h is monotonically decreasing in q and n . ICML’2019

  10. Main results 1.The set of all vectors whose correlation with is q , is an u T n ∈ R n ( n -3)-sphere . 2.If the correlation between and is r , then the expected u v sample correlation between and , where is drawn u ˆ u ˆ v uniformly at random from , is qr . T n 3.We derive a formula h ( n , q , r ) for the probability of a false correlation. 4.We show that h is monotonically decreasing in q and n . ICML’2019

  11. Main results 1.The set of all vectors whose correlation with is q , is an u T n ∈ R n ( n -3)-sphere . 2.If the correlation between and is r , then the expected u v sample correlation between and , where is drawn u ˆ u ˆ v uniformly at random from , is qr . T n 3.We derive a formula h ( n , q , r ) for the probability of a false correlation. 4.We show that h is monotonically decreasing in q and n . But it can still be non-negligible for values of n , q used in recent a ff ective computing studies — despite a small p-value. ICML’2019

  12. Case study: Student engagement vs. cognitive task performance V : Cognitive task U : Engagement performance • Whitehill et al. 2014 measured student engagement using (1) observational protocol and (2) automatic engagement detector d ( q =0.50). • Using hand-coded labels, corr( U , V ) was estimated as r =0.37. • Given n , q , r , what is probability of false correlation from d ? ICML’2019

  13. Case study: Student engagement vs. cognitive task performance (ngagePent new: 3roEaEility of "false negative" correlation ( q 0.5, r 0.37) 0.5 V : Cognitive task U : Engagement performance 0.4 3roEaEility 0.3 0.2 0.1 0.0 0 25 50 75 100 125 150 175 200 n # participants ( n ) • Whitehill et al. 2014 measured student engagement using (1) observational protocol and (2) automatic engagement detector d ( q =0.50). • Using hand-coded labels, corr( U , V ) was estimated as r =0.37. • Given n , q , r , what is probability of false correlation from d ? ICML’2019

Recommend


More recommend