Learning from Crowds in the Presence of Schools of Thought Yuandong - PowerPoint PPT Presentation

Learning from Crowds in the Presence of Schools of Thought Yuandong Tian 1 and Jun Zhu 2 1 Carnegie Mellon University 2 Tsinghua University 1

Crowd-sourcing Worker 1 Worker 2 Worker 3 Worker 4 Task 1 x x x x x Task 2 Task 3 x x x 2

Crowd-sourcing Objective Tasks Subjective Tasks E.g. Labeling dataset E.g. Demographical Survey Knowledge Test Personal Opinions Creative thoughts Ill-designed ambiguous tasks. 3

Crowd-sourcing Objective Tasks Subjective Tasks Noise 4

Crowd-sourcing Objective Tasks Subjective Tasks Noise Worker reliability Task clarity 5

Previous works Objective Tasks Subjective Tasks Majority Voting [J. Whitehill et al., NIPS’09] [V.C. Raykar et al., JMLR’10] Gold Worker [P. Welinder et al., NIPS’10] standard Reliability ….. 6

Our Contribution Objective Tasks Subjective Tasks Contributions: 1. Applicable to both objective and subjective tasks. 2. Simple , no iterative procedure, no initial guess. 7

Two Principles A worker is reliable if he agrees with other workers in many tasks. A task is clear if it has only a few answers. 8

Clustering Analysis Task k Workers A B C D E F G H L 1 0 1 1 1 1 0 1 0 9

Group-size Matrix #Z Task k Worker Assign. Cluster size A D 5 A I I E L 3 B II G C II 3 D I 5 B E I 5 C II 3 F II F 5 G I H III 1 H III L I 5

Group-size Matrix #Z # Z Task 1 Task 2 Task 3 Task 4 Task 5 Task 6 Task 7 Worker A 5 3 2 3 4 2 6 Worker B 3 3 4 5 4 3 6 Worker C 3 2 2 5 2 4 6 Worker D 5 3 4 5 4 4 6 Worker E 5 2 2 5 2 3 2 Worker F 3 2 2 5 2 4 2 Worker G 5 2 4 3 1 3 6 Worker H 1 1 1 1 2 2 1 Worker L 5 1 4 3 4 4 6 12

Worker Reliability Task 1 Task 2 Task 3 Task 4 Task 5 Task 6 Task 7 Worker A 5 3 2 3 4 2 6 Worker B 3 3 4 5 4 3 6 Worker C 3 2 2 5 2 4 6 Worker D 5 3 4 5 4 4 6 Worker E 5 2 2 5 2 3 2 Worker F 3 2 2 5 2 4 2 Worker G 5 2 4 3 1 3 6 Worker H 1 1 1 1 2 2 1 Worker L 5 1 4 3 4 4 6 13

Task Clarity Task 1 Task 2 Task 3 Task 4 Task 5 Task 6 Task 7 Worker A 5 3 2 3 4 2 6 Worker B 3 3 4 5 4 3 6 Worker C 3 2 2 5 2 4 6 Worker D 5 3 4 5 4 4 6 Worker E 5 2 2 5 2 3 2 Worker F 3 2 2 5 2 4 2 Worker G 5 2 4 3 1 3 6 Worker H 1 1 1 1 2 2 1 Worker L 5 1 4 3 4 4 6 14

Factorization # Z T 1 T 2 T 3 T4 T5 T6 T 7 WA 5 3 2 3 4 2 6 WB 3 3 4 5 4 3 6 WC 3 2 2 5 2 4 6 = WD 5 3 4 5 4 4 6 WE 5 2 2 5 2 3 2 WF 3 2 2 5 2 4 2 WG 5 2 4 3 1 3 6 Task clarity WH 1 1 1 1 2 2 1 WL 5 1 4 3 4 4 6 Worker Reliability Perron-Frobenius theorem: # Z > 0  λ > 0 and μ > 0 18

Clustering Model Task k 19

Clustering Model Task k M N 20

Clustering Model Task k cluster centers cluster labels M answers N 21

Clustering Model Task k M N 22

Clustering Model A D T 1 T 2 T 3 T4 T5 T6 T 7 E L W1 5 3 2 3 4 2 6 G W2 3 3 4 5 4 3 6 W3 3 2 2 5 2 4 6 W4 5 3 4 5 4 4 6 B C W5 5 2 2 5 2 3 2 F W6 3 2 2 5 2 4 2 W7 5 2 4 3 1 3 6 W8 1 1 1 1 2 2 1 H W9 5 1 4 3 4 4 6 Clustering Label #Z Model assignment 24

Clustering Model A D T 1 T 2 T 3 T4 T5 T6 T 7 E L W1 5 3 2 3 4 2 6 G W2 3 3 4 5 4 3 6 W3 3 2 2 5 2 4 6 W4 5 3 4 5 4 4 6 B C W5 5 2 2 5 2 3 2 F W6 3 2 2 5 2 4 2 W7 5 2 4 3 1 3 6 W8 1 1 1 1 2 2 1 H W9 5 1 4 3 4 4 6 Clustering Label #Z Model assignment 25

Close form solution to #Z 30

Close form solution to #Z Squared Euclidean Distance between worker i and worker j in task k 31

Hyper-Parameters Estimation Hyper-parameters: σ σ = 0.2 32

Experiments Setting Mission I: Image Classification (Sky/Building/Computer) Do these images contain sky? Mission II: Counting Objects Mission III: Images Aesthetics Do these images look pretty? 33

Statistics Mission I Mission II Mission III Sky Building Computer Counting Images Aesthetics (12) (12) (12) (4) (12 + 12) 402 workers Dataset link: http://www.cs.cmu.edu/~yuandong/kdd2012-dataset.zip 34

The Groupsize Matrix Tasks Workers Small Large Group Size Group Size 35

Rank-1 Factorization = 0.27 36

Rank-1 Factorization Worker Reliability 37

Tasks’ clarity Count 2: Clarity = 69.4 38

Task’s clarity Beauty1 and Beauty2: Clarity = 12.4/11.8 39

Task’s clarity Count 4: Clarity = 10.2 40

Workers’ Reliability Count 65 workers ~ 20% 337 workers ~ 80% 70 60 50 40 30 20 1.52 6.62 10 0 1.5 6.5 5 41

Ranking Workers Mission I Mission II Mission III Sky Building Computer Counting Images Aesthetics (12) (12) (12) (4) (12 + 12) D most unreliable D most reliable 42

Ranking Workers Std. 18 16 14 12 10 Std of D best 8 Std of D worst 6 4 2 0 Count1 Count2 Count3 Count4 D = 10 43

Ranking Workers Std. 16 14 12 10 8 Std of D best 6 Std of D worst 4 2 0 Count1 Count2 Count3 Count4 D = 30 44

Comparison with Clustering Difference in Variance (a) Our Approach (c) PCA-Kmeans (b) Spectral Clustering (d) Gibbs Sampling 45

Time Cost Methods Time (sec) 1.41 ± 0.05 (a) Our approach 3.90 ± 0.36 (b) Spectral Clustering 0.19 ± 0.06 (c) PCA-Kmeans 53.63 ± 0.19 (d) Gibbs Sampling 46

Predicting Ground truth Count1 Count2 Count3 Count4 65 5 8 26 Ours, D = 5/10 53.7 5.0 9.9 22.9 Majority Voting 60 5.0 8 24 Majority Voting (Median) Learning from Crowd 56 5 8 24 [JMLR’10] Multidimensional Wisdom of 63.7 5 8 26.0 Crowds [NIPS’10] 65 5 8 27 Ground truth 47

Conclusion and Future Work Conclusion 1. Estimating workers’ reliability and tasks’ clarity in the presence of schools of thought . 2. Applicable to both objective and subjective tasks. 3. Simple solution without iteration, no initial guess. Future Work Handling possible missing entries Improving the scalability. 49

Thanks! 50

Learning from Crowds in the Presence of Schools of Thought Yuandong - PowerPoint PPT Presentation

Learning from Crowds in the Presence of Schools of Thought Yuandong Tian 1 and Jun Zhu 2 1 Carnegie Mellon University 2 Tsinghua University 1 Crowd-sourcing Worker 1 Worker 2 Worker 3 Worker 4 Task 1 x x x x x Task 2 Task 3 x x x 2

THOUGHT> thought > experience > thought > experience > thought >

The Wisdom of Crowds The Wisdom of Crowds and Social Innovation in the and Social Innovation in

The Wisdom of Crowds: attacks and optimal constructions George Danezis 1 Claudia Diaz 2 asper 2

Coins, Clubs, and Crowds: Coins, Clubs, and Crowds: Scaling and Decentralization in Scaling and

Greek Thought Says: What Do You Believe? Hebrew Thought Says: Who Are You? and

SPEED OF THOUGHT SPEED OF THOUGHT 120m/s SPEED OF THOUGHT COMMUNICATIVE The Artist is Absent:

Presence Presence Presence When we wake up in the morning we may automatically leave our

IMPLICIT CROWDS: OPTIMIZATION INTEGRATOR FOR ROBUST CROWD SIMULATION Ioannis Karamouzas 1 , Nick

Crowds: Anonymity for Web Transactions Paper by: Michael K. Reiter & Aviel D. Rubin of

The Wisdom of Crowds: Network effects, and the Importance of Experts Aris Anagnostopoulos

Optimal Control in the space of probability measures Claudia Totzeck joint work with M. Burger,

Please pick up a syllabus and a notecard. On one side of the notecard write: Your name. If

Getting Crowds to Work Leah Birch Naor Brown October 24, 2012 Leah Birch Naor Brown Getting

Marijuana In Seattle Our Schools 95 total schools 59 elementary schools 12 high

Gateway Church FOOD FOR THOUGHT FOOD FOR THOUGHT Healthy Cook & Ea Healthy Cook & Eat

Consciousness and Thought: Wrap-Up Talk David Chalmers The Critique of Pure Thought David

Color Categorization in Bilingual Populations: Korean-English Bilinguals Prutha S. Deshpande

Mission Purposes Describes the Colleges aspirations for student access, equity, and success

ACTION-ORIENTED TASKS: The Basics The motivation to communicate is enhanced through having a

2014 Federal Grant Writing Workshop: Dissecting the Work Plan Kathryn Miller September 30, 2014

Creating Rapid-Cycle Change Jeff Capobianco, PhD, LLP National Council for Behavioral Health

TracDat Electronic Planning & Assessment Reporting System (AP) OFFICE OF PLANNING,

2019 Research Experience for Undergraduates Predicting hospital readmission for patients with

Sustaina nabl ble A e Agricul ultur ural Lands ds Conser ervation on Program GSA Advi

Sambuz

Useful Links

Newsletter

Mail Us

Learning from Crowds in the Presence of Schools of Thought Yuandong - PowerPoint PPT Presentation

Learning from Crowds in the Presence of Schools of Thought Yuandong Tian 1 and Jun Zhu 2 1 Carnegie Mellon University 2 Tsinghua University 1 Crowd-sourcing Worker 1 Worker 2 Worker 3 Worker 4 Task 1 x x x x x Task 2 Task 3 x x x 2

THOUGHT&gt; thought &gt; experience &gt; thought &gt; experience &gt; thought &gt;

The Wisdom of Crowds The Wisdom of Crowds and Social Innovation in the and Social Innovation in

The Wisdom of Crowds: attacks and optimal constructions George Danezis 1 Claudia Diaz 2 asper 2

Coins, Clubs, and Crowds: Coins, Clubs, and Crowds: Scaling and Decentralization in Scaling and

Greek Thought Says: What Do You Believe? Hebrew Thought Says: Who Are You? and

SPEED OF THOUGHT SPEED OF THOUGHT 120m/s SPEED OF THOUGHT COMMUNICATIVE The Artist is Absent:

Presence Presence Presence When we wake up in the morning we may automatically leave our

IMPLICIT CROWDS: OPTIMIZATION INTEGRATOR FOR ROBUST CROWD SIMULATION Ioannis Karamouzas 1 , Nick

Crowds: Anonymity for Web Transactions Paper by: Michael K. Reiter &amp; Aviel D. Rubin of

The Wisdom of Crowds: Network effects, and the Importance of Experts Aris Anagnostopoulos

Optimal Control in the space of probability measures Claudia Totzeck joint work with M. Burger,

Please pick up a syllabus and a notecard. On one side of the notecard write: Your name. If

Getting Crowds to Work Leah Birch Naor Brown October 24, 2012 Leah Birch Naor Brown Getting

Marijuana In Seattle Our Schools 95 total schools 59 elementary schools 12 high

Gateway Church FOOD FOR THOUGHT FOOD FOR THOUGHT Healthy Cook &amp; Ea Healthy Cook &amp; Eat

Consciousness and Thought: Wrap-Up Talk David Chalmers The Critique of Pure Thought David

Color Categorization in Bilingual Populations: Korean-English Bilinguals Prutha S. Deshpande

Mission Purposes Describes the Colleges aspirations for student access, equity, and success

ACTION-ORIENTED TASKS: The Basics The motivation to communicate is enhanced through having a

2014 Federal Grant Writing Workshop: Dissecting the Work Plan Kathryn Miller September 30, 2014

Creating Rapid-Cycle Change Jeff Capobianco, PhD, LLP National Council for Behavioral Health

TracDat Electronic Planning &amp; Assessment Reporting System (AP) OFFICE OF PLANNING,

2019 Research Experience for Undergraduates Predicting hospital readmission for patients with

Sustaina nabl ble A e Agricul ultur ural Lands ds Conser ervation on Program GSA Advi

Sambuz

Useful Links

Newsletter

Mail Us

THOUGHT> thought > experience > thought > experience > thought >

Crowds: Anonymity for Web Transactions Paper by: Michael K. Reiter & Aviel D. Rubin of

Gateway Church FOOD FOR THOUGHT FOOD FOR THOUGHT Healthy Cook & Ea Healthy Cook & Eat

TracDat Electronic Planning & Assessment Reporting System (AP) OFFICE OF PLANNING,