MTurk Unscrubbed: Dealing with the good, the Super, and the - PowerPoint PPT Presentation

MTurk ‘Unscrubbed’: Dealing with the good, the ‘Super’, and the unreliable on Amazon’s Mechanical Turk Jea Jeanett ette D Deetlef etlefs M. Chylinski, A. Ortmann Motivation Research Results Discussion 1

Motivation Research Results Discussion Amazon’s Mechanical Turk Low-cost Fast turnaround Acceptable validity But…. Super-Turkers (the experienced) & Spammers (the unreliable) 2

Motivation Research Results Discussion We know they’re out there, but we swim on  About one third of all MTurk research has between 3% and 37% of subjects removed (Chandler et al. 2014)  The unreliable  create misleading results  The experienced = practice effects  Standard objective measures become unreliable  May strategize unnaturally  Speed up response times (Camerer & Loewenstein 2004; Chandler et al. 2014, 2015)  No set protocol to remove the unreliable and the experienced 3

Motivation Motivation Research Research Results Results Discussion Discussion Our research…  12 studies with 2736 subjects  9% are experienced with our risk-type experiment (Super- Turkers)  11% are unreliable (Spammers) with faster response times and poorer completion  Detailed analysis at overall (n=505) and sub-sample level (n=17 to n=42)  Comparison of a Bizlab (n=149) and MTurk (n=154) study 4

Motivation Motivation Research Research Results Results Discussion Discussion What we found…  Objective measures are most influenced e.g.,  the experienced have response times that are 38% faster  the unreliable score 10% lower on financial literacy measures 5

Motivation Motivation Research Research Results Results Discussion Discussion What we found… Education and employment related demographics contrast one another, as does time on choice 1.50 Indexed to mean of Excluding 1.25 1.00 0.75 Excluding 'Experienced' 0.50 'Unreliable' Figure shows Experienced and Unreliable means indexed to mean of 'Excluding'. For demographics: female=1, full-time employment=1, highest education is high school=1, earn <$75000p.a.=1. Financial-literacy (FL) indexed mean of correct responses. 6

Motivation Motivation Research Research Results Results Discussion Discussion What we found ctd…  Objective measures are most influenced e.g.,  the experienced have response times that are 38% faster  the unreliable score 10% lower on financial literacy measures  Little difference in outcomes when both are included BUT …  Exclusion doubles our effect sizes 7

Motivation Motivation Research Research Results Results Discussion Discussion MTurk excl. MTurk incl. F 23.90 14.80 Obs 104 135 Adj R-squared 0.395 0.236 Coefficient Coefficient (time on choice^L-1)/L (std. err) (std. err) eta-squared eta-squared treatment 0.342 0.349 (0.271) (0.254) 0.01 0.01 prime -1.459*** -0.956*** (0.257) (0.243) 0.19 0.09 8 treatment x prime -0.335 -0.522 (0 390) (0 367)

Motivation Motivation Research Research Results Results Discussion Discussion Implications  The problem is probably larger than we found  Our participation hurdle was high  99% acceptance rate for Turkers  Not rewarded if participated more than once  Lotteries are possibly less common  This problem will grow  Academic preference for the tried and tested  No way to track subjects collectively  55% of Turkers report that they follow particular Requesters (Chandler et al. 2014) 9

Motivation Research Results Discussion Staying safe… 10

Motivation Research Results Discussion Include a bonus 11

Motivation Research Results Discussion Add time-limited instructions at the start of the experiment to eliminate Spammers or ‘bots’ 12

Motivation Research Results Discussion Record the Turker id number and IP address 13

Motivation Research Results Discussion Maintain a master database of Turker identity numbers and IP addresses 14

Motivation Research Results Discussion Stringently clean the data using a multi-pronged approach 15

Motivation Research Results Discussion q496_7> q487_7> q487_9== q496_8 Poor Quest q487_8 (diff q487_11 (diff 3 q496_9==q496_11 comple- Inattentive Lottery Choice Choice Total id q49==2 3 plus) (diff==0) plus) (diff==0) q48<>q8 tion Score time 1 time 2 time Duration Unreliable a b c d e f g h i j k l m n 92 92 92 1 2 458 1 119 119 1 3.515 1 129 129 1 9.619 1 185 185 1 5.205 1 213 213 213 2 8.779 1 301 301 1 9.026 1 361 361 1 1 9.176 434 1 370 370 1 9.762 1 379 379 1 9.128 1 380 380 380 2 3.771 2.458 320 1 1 449 449 1 9.798 1 509 509 1 5.143 1 578 578 578 2 6.386 1 621 621 1 467 1 636 636 1 1 8.24 457 1 Table shows an example spreadsheet used to identify Unreliable subjects. Columns b to g identify subjects who have been flagged on validation questions. ‘Poor completion’ flags subjects for poor scale completion identified in the database of responses. ‘Inattentive score’ sums flags in columns b to g. Extreme response times to risky choices are recorded in columns j to l. Extremes for total duration of survey are recorded in column m. Subjects tagged as Unreliable are recorded in column n. 16

Motivation Research Results Discussion Over-sample 17

Thank you – Questions? 18

MTurk Unscrubbed: Dealing with the good, the Super, and the - PowerPoint PPT Presentation

MTurk Unscrubbed: Dealing with the good, the Super, and the unreliable on Amazons Mechanical Turk Jea Jeanett ette D Deetlef etlefs M. Chylinski, A. Ortmann Motivation Research Results Discussion 1 Motivation Research

Dealing With The Irate Customer Dealing With The Irate Customer Dealing with difficult

UNVEILING THE SUPER ORBITAL UNVEILING THE SUPER ORBITAL UNVEILING THE SUPER-ORBITAL UNVEILING

SYNTHESIS OF SUPER SYNTHESIS OF SUPER NANOPOROUS SYNTHESIS OF SUPER SYNTHESIS OF

An NFR Pattern Approach to Dealing An NFR Pattern Approach to Dealing An NFR Pattern Approach to

Bigger is Better Trends in super computers, super software, and super data Michael L. Norman,

Super- -Kamiokande Kamiokande s s Solar Neutrino results Solar Neutrino results Super

Super GPU & Super Kernels: Make programming of multi-GPU systems easy Michael Frumkin, May 8,

THE FALL 2018 NFL PRIMETIME SEASON & THE SUPER BOWL KTG CONTENT STRATEGY SUPER BOWL

Architecture Aromatique Good Taste Good Food Good Health Based on sustainability Technical

How Well Do My Results Generalize? Comparing Security and Privacy Survey Results from MTurk, Web,

Dealing Dealing with the News with the News Media in Media in Crisis Crisis Response

Cross Border Update Dermot Corry Dealing/Transaction Accounts Dealing/transaction accounts

Dealing with Winter Neighbourhood Operations 1 Dealing with Winter Background to

Spelling, Punctuation and Grammar Prefixes super- anti- auto- Year One SPaG | Prefixes super-

2018 SUPER Conference Biddenham Sharing research and practice from SUPER Members INTRODUCING

SUPER CHIA HAIR Super Foods are nutrient-rich foods that are considered to be especially

The Role of Housing and Financial Wealth on Healthcare Expenditure Nayan Krishna Joshi

WHAT IS A TRIPLE MOTION Philip B. Schwartz Parliamentarian Business Law Section Florida Bar

OMFS Partnering and City Services Agreement with the City of Kent Motion No. M2020-26 System

Feature-Sensitive Motion Planning Terra Horton http://parasol.tamu.edu/~tth4515 Motion Planning

Custom Writing Service - Special Prices Literature review magazine presentation slideshare

Key Findings for Creative Strategy November 14, 2018 0 Authorized Use Only Outli line 1

Cultivating Institutional Support for Zero Waste Jennifer Hobson Zero Waste Senior Program

Engaging Adult Learners March 10, 2016 Aina G. Irbe Ph.D., ACDI/VOCA Senior Director, Learning

Sambuz

Useful Links

Newsletter

Mail Us

MTurk Unscrubbed: Dealing with the good, the Super, and the - PowerPoint PPT Presentation

MTurk Unscrubbed: Dealing with the good, the Super, and the unreliable on Amazons Mechanical Turk Jea Jeanett ette D Deetlef etlefs M. Chylinski, A. Ortmann Motivation Research Results Discussion 1 Motivation Research

Dealing With The Irate Customer Dealing With The Irate Customer Dealing with difficult

UNVEILING THE SUPER ORBITAL UNVEILING THE SUPER ORBITAL UNVEILING THE SUPER-ORBITAL UNVEILING

SYNTHESIS OF SUPER SYNTHESIS OF SUPER NANOPOROUS SYNTHESIS OF SUPER SYNTHESIS OF

An NFR Pattern Approach to Dealing An NFR Pattern Approach to Dealing An NFR Pattern Approach to

Bigger is Better Trends in super computers, super software, and super data Michael L. Norman,

Super- -Kamiokande Kamiokande s s Solar Neutrino results Solar Neutrino results Super

Super GPU &amp; Super Kernels: Make programming of multi-GPU systems easy Michael Frumkin, May 8,

THE FALL 2018 NFL PRIMETIME SEASON &amp; THE SUPER BOWL KTG CONTENT STRATEGY SUPER BOWL

Architecture Aromatique Good Taste Good Food Good Health Based on sustainability Technical

How Well Do My Results Generalize? Comparing Security and Privacy Survey Results from MTurk, Web,

Dealing Dealing with the News with the News Media in Media in Crisis Crisis Response

Cross Border Update Dermot Corry Dealing/Transaction Accounts Dealing/transaction accounts

Dealing with Winter Neighbourhood Operations 1 Dealing with Winter Background to

Spelling, Punctuation and Grammar Prefixes super- anti- auto- Year One SPaG | Prefixes super-

2018 SUPER Conference Biddenham Sharing research and practice from SUPER Members INTRODUCING

SUPER CHIA HAIR Super Foods are nutrient-rich foods that are considered to be especially

The Role of Housing and Financial Wealth on Healthcare Expenditure Nayan Krishna Joshi

WHAT IS A TRIPLE MOTION Philip B. Schwartz Parliamentarian Business Law Section Florida Bar

OMFS Partnering and City Services Agreement with the City of Kent Motion No. M2020-26 System

Feature-Sensitive Motion Planning Terra Horton http://parasol.tamu.edu/~tth4515 Motion Planning

Custom Writing Service - Special Prices Literature review magazine presentation slideshare

Key Findings for Creative Strategy November 14, 2018 0 Authorized Use Only Outli line 1

Cultivating Institutional Support for Zero Waste Jennifer Hobson Zero Waste Senior Program

Engaging Adult Learners March 10, 2016 Aina G. Irbe Ph.D., ACDI/VOCA Senior Director, Learning

Sambuz

Useful Links

Newsletter

Mail Us

Super GPU & Super Kernels: Make programming of multi-GPU systems easy Michael Frumkin, May 8,

THE FALL 2018 NFL PRIMETIME SEASON & THE SUPER BOWL KTG CONTENT STRATEGY SUPER BOWL