Crowdsourcing Quality Control Madalina and Jao-ke CS 286r '12

Motivation What do we do when we want to answer a question?

Motivation What do we do when we want to answer a question? We ask the questions to lots of people! ●

Motivation What do we do when we want to answer a question? We ask the questions to lots of people! ● What do we want to do when some workers are better than others?

Motivation What do we do when we want to answer a question? We ask the questions to lots of people! ● What do we want to do when some workers are better than others? ● We want to weight the good workers' answers more heavily than those of the bad workers.

Motivation What do we do when we want to answer a question? We ask the questions to lots of people! ● What do we want to do when some workers are better than others? ● We want to weight the good workers' answers more heavily than those of the bad workers. How can we tell which workers are "good" and which workers are "bad"? Proof by majority? ●

Motivation What do we do when we want to answer a question? We ask the questions to lots of people! ● What do we want to do when some workers are better than others? ● We want to weight the good workers' answers more heavily than those of the bad workers. How can we tell which workers are "good" and which workers are "bad"? Proof by majority? ● Nope. (Sorry, democracy.) ●

Set-up Tasks have binary answers ● Each worker performs same number of tasks; each task performed ● by same number of workers Each worker w j has a certain reliability p j ●

Set-up Tasks have binary answers ● Each worker performs same number of tasks; each task performed ● by same number of workers Each worker w j has a certain reliability p j or probability of answering ● a task correctly, for all tasks Example: ○

Set-up Tasks have binary answers ● Each worker performs same number of tasks; each task performed ● by same number of workers Each worker w j has a certain reliability p j or probability of answering ● a task correctly, for all tasks Example: spammer-hammer model ○ Reasonable? How so? How not? ○

Set-up Tasks have binary answers ● Each worker performs same number of tasks; each task performed ● by same number of workers Each worker w j has a certain reliability p j or probability of answering ● a task correctly, for all tasks Example: spammer-hammer model ○ Reasonable? How so? How not? ○ Crowd has average quality ●

Set-up Tasks have binary answers ● Each worker performs same number of tasks; each task performed ● by same number of workers Each worker w j has a certain reliability p j or probability of answering ● a task correctly, for all tasks Example: spammer-hammer model ○ Reasonable? How so? How not? ○ Crowd has average quality q := E[(2 p j -1) 2 ] ●

Set-up Tasks have binary answers ● Each worker performs same number of tasks; each task performed ● by same number of workers Each worker w j has a certain reliability p j or probability of answering ● a task correctly, for all tasks Example: spammer-hammer model ○ Reasonable? How so? How not? ○ Crowd has average quality q := E[(2 p j -1) 2 ] ● Roles of reliability and quality in algorithm? ●

Graph Theory G(V,E) Bipartite Graph: ● ● ● ● ● ● ● ● ● ●

Graph Theory Tasks Workers ● ● ● ● ● ● ● ● ● ● m = 4 n = 6 l = ? r = ?

Graph Theory Workers Tasks ● 1. ● ● 1. 2. ● ● 2. 3. ● ● 3. 4. ● ● 4. 5. ● 6.

Algorithm

Algorithm Example: Updates Workers Tasks ● 1. ● ● 1. 2. ● ● 2. 3. ● ● 3. 4. ● ● 4. 5. ● 6.

Commercial Break Say you have a biased coin that with probability p =/= 1/2 comes up Heads and with probability 1- p comes up Tails. How can you estimate p ?

Algorithm Example: Updates

Optimality Discussion Oracle error: The minimax error rate achieved by the best possible graph G in G(m; l) using the best possible inference algorithm is at least Majority vote: Iterative algorithm:

Algorithm Properties does not require prior, unlike belief propagation ● performance guarantees, unlike expectation maximization ● Performance: ●

Worker Bias Worker A always reverses answers; Worker B always gives same ● answer

Worker Bias Worker A always reverses answers; Worker B always gives same ● answer How can we separate out bias from error? ●

Worker Bias Main idea: Given each worker's error rates, error costs, and priors for the correct answer distributions, we transform workers' "hard" answers into "soft" answers that have minimal (error-associated) costs.

Thank You!

Crowdsourcing Quality Control Madalina and Jao-ke CS 286r '12 - PowerPoint PPT Presentation

Crowdsourcing Quality Control Madalina and Jao-ke CS 286r '12 Motivation What do we do when we want to answer a question? Motivation What do we do when we want to answer a question? We ask the questions to lots of people! Motivation

A/B Testing Crowdsourcing and Human Computation Instructor: Chris Callison-Burch Website:

Crowdsourcing and Human Computer Interaction Design Crowdsourcing and Human Computation

How Crowdsourcing Enabled Computer Vision Crowdsourcing and Human Computation Instructor: Chris

Rise of Crowdsourcing Crowdsourcing = Harvesting societys wisdom, skill, creativity, and scale

Crowdsourcing and HCI 2: Privacy and Latency Crowdsourcing and Human Computation Instructor:

Quality Control - part 1 Crowdsourcing and Human Computation Instructor: Chris Callison-Burch

Quality Control - part 2 Crowdsourcing and Human Computation Instructor: Chris Callison-Burch

crowdsourcing workflow control Nate Tucker and Perry Green barriers to effective crowdsourcing

Crowdsourcing Nickolai Riabov, Kenneth Tiong Brown University Fall 2013 Nickolai Riabov,

Crowdsourcing of Weather Data on Mobile App and Deep Learning Lior Perez 99th AMS annual

Crowdsourcing Cytogenetic Biodosimetry Dose Estimation Crowdsourcing Cytogenetic Biodosimetry Dose

Using CrowdSourcing for Data Analytics Hector Garcia-Molina (work with Steven Whang, Peter

Crowdsourcing and Human Computation Instructor: Chris Callison-Burch Website:

Speech Transcrip-on with Crowdsourcing Crowdsourcing and Human Computa2on Instructor: Chris

A Micro Crowdsourcing Architecture to Localize A Micro Crowdsourcing Architecture to Localize Web

Incentives in Crowdsourcing: A Game-theoretic Approach ARPITA GHOSH Cornell University NIPS

with Low-Income Community Groups on Solar September 25, 2019 Housekeeping Join audio:

Web Discovery

Rebuilding local discs with gas-rich major mergers Mathieu PUECH Coll.: F. Hammer, H. Flores, M.

Components of a Hammer for Type Theory Goal Translation and Proof Reconstruction ukasz Czajka

GNU/Hurd AKA Extensibility from the Ground Samuel Thibault 2011 August 26th 1 <marcus>

If I had a hammer The role of infrastructure in creative, innovative clusters and the

Verbal VP-modifiers in Samoan verb serialization Jens Hopperdietzel Leibniz-ZAS Berlin

CSCI 104 Qt Intro Mark Redekopp David Kempe 2 Qt What is QT? Pronounced cute