empirical user studies
play

EMPIRICAL USER-STUDIES human-computer interaction CSE 440 WINTER - PowerPoint PPT Presentation

Maya Cakmak, Matt Kay, Brad Jacobson, King Xia EMPIRICAL USER-STUDIES human-computer interaction CSE 440 WINTER 2015 University of FEB 19 - WEEK 7 - THURSDAY Washington Methods for observing interaction hmmmm blah blah blah bla Passive


  1. Maya Cakmak, Matt Kay, Brad Jacobson, King Xia EMPIRICAL USER-STUDIES human-computer interaction CSE 440 WINTER 2015 University of FEB 19 - WEEK 7 - THURSDAY Washington

  2. Methods for observing interaction hmmmm blah blah blah bla Passive observation Think-aloud protocol Comparative study Last week University of 2 Washington

  3. Methods for observing interaction hmmmm blah blah blah bla Passive observation Think-aloud protocol Comparative study “Empirical user study” Last week “Controlled experiment” Today University of 2 Washington

  4. Evaluation Techniques (re-cap) •Asking users –Questionnaires, interviews, focus groups •Observing users –Passive observation, think-aloud protocol, ethnography, empirical user studies •Make users observe themselves –Diaries, experience sampling •Ask experts –Heuristic evaluation, cognitive walkthrough University of 3 Washington

  5. Evaluation Techniques (re-cap) •Asking users –Questionnaires, interviews, focus groups •Observing users –Passive observation, think-aloud protocol, ethnography, empirical user studies •Make users observe themselves –Diaries, experience sampling •Ask experts –Heuristic evaluation, cognitive walkthrough University of 3 Washington

  6. Designing an empirical study University of 4 Washington

  7. Designing an empirical study •What is being compared? –Independent variables University of 4 Washington

  8. Designing an empirical study •What is being compared? –Independent variables •What are they being compared in? –Dependent variables (“metrics”) University of 4 Washington

  9. Designing an empirical study •What is being compared? –Independent variables •What are they being compared in? –Dependent variables (“metrics”) •What (else) is being varied? What is kept constant? –Extraneous variables University of 4 Washington

  10. Designing an empirical study •What is being compared? –Independent variables •What are they being compared in? –Dependent variables (“metrics”) •What (else) is being varied? What is kept constant? –Extraneous variables University of 4 Washington

  11. What is being compared? “conditions” University of 5 Washington

  12. What is being compared? interval Continuous values Independent variable ordinal Ordered discrete values categorical Unordered discrete values “conditions” University of 5 Washington

  13. What is being compared? •Example: Interval independent variable –What is the effect of height on telepresence systems? Rae et al. University of 6 Washington

  14. Robotic telepresence University of 7 Washington

  15. What is being compared? •Example: Interval independent variable –What is the effect of height on telepresence systems? Rae et al. University of 8 Washington

  16. What is being compared? •Example: Ordinal independent variable –What is the effect of educational background on acceptance of robots in the workplace? high school < college < graduate degree Rae et al. University of 9 Washington

  17. What is being compared? •Example: Categorical independent variable –What is the effect of input modality on telepresence systems? – keyboard – mouse – joystick Rae et al. University of 10 Washington

  18. Within-subject vs. between subject Participant-1 Participant-2 Same participant within between University of 11 Washington

  19. Within-subject vs. between subject Participant-1 Participant-2 Same participant within between + allows comparison + requires less participants - subject to ordering effects University of 11 Washington

  20. Within-subject vs. between subject Participant-1 Participant-2 Same participant within between + allows comparison + requires less participants - subject to ordering effects > Order counterbalancing University of 11 Washington

  21. Designing an empirical study •What is being compared? –Independent variables •What are they being compared in? –Dependent variables (“metrics”) •What (else) is being varied? What is kept constant? –Extraneous variables University of 12 Washington

  22. Independent vs. dependent variable •Example: –What is the effect of height on telepresence systems? in terms of what? Rae et al. University of 13 Washington

  23. What to measure or observe? (objective) Data Source (subjective) Data type University of 14 Washington

  24. What to measure or observe? What were the How accurately is communication information (objective) challenges? remembered? Data Source How highly do What frustrated participants rate the participants? the system? (subjective) Data type University of 14 Washington

  25. Dependent variables what people do.. what people say.. University of 15 Washington

  26. What is being measured? •Example: Interval dependent variable –What is the effect of height on conversation control? -ratio of time speaking -ratio of decisions influenced -self assessment of control ... Rae et al. University of 16 Washington

  27. What is being measured? •Example: Ordinal dependent variable –What is the effect of height on user preference? -user rating of the system Rae et al. University of 17 Washington

  28. What is being measured? •Example: Categorical dependent variable –What is the effect of height on conversation control? -choose one: “I felt like the leader” “I felt like the follower” Rae et al. University of 18 Washington

  29. Designing an empirical study •What is being compared? –Independent variables •What are they being compared in? –Dependent variables (“metrics”) •What (else) is being varied? •(What is kept constant?) –Extraneous variables University of 19 Washington

  30. Extraneous variables •Similar to independent variables but we are not looking for an effect –What is the effect of on conversation control? - things that vary unless you control for them gender, age, background of participants - things that you explicitly vary to demonstrate lack of effect tasks performed using the system University of 20 Washington

  31. Interpreting the results •What is being compared? –Independent variables •What are they being compared in? –Dependent variables (“metrics”) University of 21 Washington

  32. Interpreting the results •What is being compared? –Independent variables •What are they being compared in? –Dependent variables (“metrics”) Main question: Does <independent variable> cause differences in <dependent variable>? University of 21 Washington

  33. Interpreting the results Does height effect ratio of time speaking? Yes/No? University of 22 Washington

  34. Analyzing the data •Factors –Within vs. between groups –Number of variables –Type of dependent variables –Type of independent variables University of 23 Washington

  35. A common case: A/B testing •Two categorical independent variables (A vs. B) •One interval dependent variable –key performance indicator A: control key performance indicator B: treatment T-Test A B University of 24 Washington

  36. (Student’s) T-tests •Check if two means (averages) are reliably different from each other –t = (variance between groups)/(variance within groups) –Large t means different groups –Small t means similar groups University of 25 Washington

  37. (Student’s) T-tests Example https://www.youtube.com/watch?v=0Pd3dc1GcHc University of 26 Washington

  38. (Student’s) T-tests Example University of 27 Washington

  39. (Student’s) T-tests Example t = 2/6 University of 28 Washington

  40. (Student’s) T-tests p-value: probability that our data could be produced randomly University of 29 Washington

  41. (Student’s) T-tests p-value: probability that our data could be produced randomly p<0.05 University of 29 Washington

  42. (Student’s) T-tests p-value: probability that our data could be produced randomly p<0.05 This means that there is only a 5% chance that there is no real difference between the two groups. University of 29 Washington

  43. (Student’s) T-tests p-value: probability that our data could be produced randomly –depends on number of participants University of 30 Washington

  44. (Student’s) T-tests p-value: probability that our data could be produced randomly –depends on number of participants bigger samples help but with diminishing returns University of 30 Washington

  45. Types of t-tests “independent” “dependent” “unpaired” “paired” “between samples” “within subjects” “repeated measures” University of 31 Washington

  46. Limitations of t-tests •Generalizes to similar population •Assumes that your data has Normal (Gaussian) distribution •Sample size should be roughly the same •All data should be independent/ not influenced by each other •Interval type variables (will not work for rankings) University of 32 Washington

  47. Lots of statistical tools available http://www.graphpad.com/quickcalcs/ttest1.cfm University of 33 Washington

  48. Which statistical test to use? http://www.ats.ucla.edu/stat/mult_pkg/whatstat/ University of 34 Washington

Recommend


More recommend