Aggregating and Predicting Sequence Labels from Crowd Annotations

An T. Nguyen (1,*), Byron C. Wallace (2), Jessy Li (1,3), Ani Nenkova (3), Matthew Lease (1)
(1) University of Texas at Austin  (2) Northeastern University  (3) University of Pennsylvania
ACL 2017   (* Presenter)
Problem: Sequence Labeling with Crowd Labels

Example: Named Entity Recognition.

X     U.N.   official   Ekeus   heads   for   Baghdad
Y     Org    O          Per     O       O     Loc
W1    Org    O          Org     O       O     Loc
W2    Org    Per        Per     O       O     Loc
W3    Org    O          Per     O       O     Loc

Two tasks:
◮ Aggregation: given (X, W1, W2, W3), estimate Y.
◮ Prediction: given training data (X, W1, W2, W3), predict Y_test for X_test.
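The simplest aggregation baseline, token-level majority voting, ignores the sequence structure entirely. A minimal sketch on the example above (tie-breaking by first occurrence is an arbitrary choice, not from the paper):

```python
from collections import Counter

def majority_vote(worker_labels):
    """Aggregate per-token labels from several workers by majority vote.

    worker_labels: list of label sequences, one per worker.
    Returns one aggregated label sequence."""
    aggregated = []
    for token_labels in zip(*worker_labels):  # labels for one token position
        # most_common(1) breaks ties by first occurrence
        aggregated.append(Counter(token_labels).most_common(1)[0][0])
    return aggregated

w1 = ["Org", "O",   "Org", "O", "O", "Loc"]
w2 = ["Org", "Per", "Per", "O", "O", "Loc"]
w3 = ["Org", "O",   "Per", "O", "O", "Loc"]
print(majority_vote([w1, w2, w3]))  # recovers the gold sequence here
```

On this toy example the vote happens to recover Y exactly; the results tables later show why more is needed on real data.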
Our work

Contribution: two joint models of sequences and crowd labels.

1. Aggregation:
◮ Hidden Markov Model (HMM) + crowd confusion matrices.
2. Prediction:
◮ Long Short-Term Memory (LSTM) + crowd embedding vectors.

Evaluation:
◮ News NER + biomedical IE.
◮ A range of baselines.

Code + data on GitHub.
HMM-Crowd (for task 1 - aggregation)

HMM (position i):
  h_{i+1} | h_i ~ Discrete(τ_{h_i})        (transitions)
  v_i | h_i ~ Discrete(Ω_{h_i})            (emissions)

Crowd model (worker j):
  l_{ij} | h_i ~ Discrete(C^{(j)}_{h_i})

C^{(j)}: confusion matrix for worker j.
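The generative story above can be sketched as forward sampling. All parameter values here are illustrative placeholders, not learned values from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)
K = 3   # number of hidden labels, e.g. {O, Per, Loc}
V = 5   # toy vocabulary size

# Illustrative placeholder parameters (each row is a probability distribution).
tau = np.full((K, K), 1.0 / K)     # tau[h]: distribution over the next hidden label
Omega = np.full((K, V), 1.0 / V)   # Omega[h]: distribution over emitted tokens
# C[j][h]: worker j's distribution over reported labels given true label h;
# diagonal mass 0.8 makes both toy workers mostly accurate.
C = np.stack([np.eye(K) * 0.7 + 0.1 for _ in range(2)])

def sample_sequence(length, n_workers=2):
    """Sample hidden labels, tokens, and per-worker crowd labels."""
    h = rng.integers(K)            # initial hidden label (uniform, for simplicity)
    hs, vs, ls = [], [], []
    for _ in range(length):
        hs.append(h)
        vs.append(rng.choice(V, p=Omega[h]))   # v_i | h_i ~ Discrete(Omega_{h_i})
        ls.append([rng.choice(K, p=C[j][h])    # l_ij | h_i ~ Discrete(C^{(j)}_{h_i})
                   for j in range(n_workers)])
        h = rng.choice(K, p=tau[h])            # h_{i+1} | h_i ~ Discrete(tau_{h_i})
    return hs, vs, ls
```

In the real model the parameters are learned from the crowd labels rather than fixed; sampling just makes the conditional structure concrete.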
HMM-Crowd: Parameter Learning

Expectation-Maximization (EM) algorithm:

E-step:
◮ Estimate the posterior p(h).
◮ Extends the Forward-Backward algorithm.

M-step:
◮ Estimate the parameters τ, Ω, C.
◮ Variational Bayes estimates.
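The E-step above can be sketched as standard Forward-Backward in which each position's emission term also multiplies in the crowd-label likelihoods. This is a simplified sketch (no scaling for numerical stability, and without the paper's Variational Bayes M-step); variable names are illustrative:

```python
import numpy as np

def posterior_marginals(v, labels, tau, Omega, C, pi):
    """Forward-backward with crowd labels folded into the emission term.

    v: token ids; labels[i][j]: worker j's label at position i;
    tau: K x K transitions; Omega: K x V emissions;
    C[j]: K x K confusion matrix for worker j; pi: initial distribution.
    Returns p(h_i | v, labels) for each position i."""
    T, K = len(v), tau.shape[0]
    # Emission at position i: Omega[h, v_i] * prod_j C[j, h, l_ij]
    emit = np.empty((T, K))
    for i in range(T):
        emit[i] = Omega[:, v[i]]
        for j, l in enumerate(labels[i]):
            emit[i] *= C[j][:, l]
    alpha = np.empty((T, K))   # forward messages
    beta = np.empty((T, K))    # backward messages
    alpha[0] = pi * emit[0]
    for i in range(1, T):
        alpha[i] = (alpha[i - 1] @ tau) * emit[i]
    beta[-1] = 1.0
    for i in range(T - 2, -1, -1):
        beta[i] = tau @ (emit[i + 1] * beta[i + 1])
    gamma = alpha * beta
    return gamma / gamma.sum(axis=1, keepdims=True)
```

The M-step would then re-estimate τ, Ω, and each C^{(j)} from these expected counts.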
LSTM for NER (Lample et al. 2016)

◮ LSTM: word representations → sentence representation.
◮ Hidden layer: fully connected.
◮ Tag scores: ~ probability of each label for each word.
◮ CRF: word predictions → sentence prediction.
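The final CRF layer turns per-word tag scores into a single best sentence-level tag sequence via Viterbi decoding. A minimal sketch (toy scores and transition values, not the trained model's):

```python
import numpy as np

def viterbi(scores, trans):
    """Decode the highest-scoring tag sequence.

    scores: T x K per-word tag scores (from the hidden layer);
    trans: K x K tag-to-tag transition scores (from the CRF)."""
    T, K = scores.shape
    best = np.empty((T, K))           # best path score ending in each tag
    back = np.zeros((T, K), dtype=int)  # backpointers
    best[0] = scores[0]
    for t in range(1, T):
        # cand[a, b]: best score ending at t-1 in tag a, then transitioning to b
        cand = best[t - 1][:, None] + trans + scores[t]
        back[t] = cand.argmax(axis=0)
        best[t] = cand.max(axis=0)
    path = [int(best[-1].argmax())]
    for t in range(T - 1, 0, -1):
        path.append(int(back[t][path[-1]]))
    return path[::-1]
```

Training maximizes the CRF's sequence-level likelihood; decoding at test time only needs this max-sum recursion.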
LSTM-Crowd (for task 2 - prediction)

◮ Embedding vectors represent each worker's label noise.
◮ v(good worker) ≈ 0.
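The crowd layer's idea can be sketched as adding a learnable per-worker noise vector to the model's tag scores during training, so that a reliable worker's vector stays near zero. This is a simplified additive variant for illustration, not the paper's exact parameterization:

```python
import numpy as np

K = 4           # number of tags
n_workers = 3
# One learnable noise vector per worker, initialized at zero: before
# training, every worker is treated as perfectly reliable.
worker_vecs = np.zeros((n_workers, K))

def worker_tag_scores(base_scores, worker_id):
    """Shift the model's tag scores toward what this worker tends to report.

    base_scores: T x K scores for the true tags. The worker's vector absorbs
    that worker's systematic bias during training, so the base model is
    trained against (approximately) denoised targets. For a good worker the
    vector remains ~0 and the scores pass through unchanged."""
    return base_scores + worker_vecs[worker_id]
```

At test time the worker vectors are dropped and only the base scores are used.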
Data

Dataset    Application    Documents    Gold Labels    Crowd Labels
CoNLL'03   NER            1393         All            400
Medical    IE             5000         200            All
Evaluation: Task 1 - aggregation

Baselines:
1. Non-sequential:
◮ Majority Voting
◮ Dawid & Skene (1979)
◮ MACE (Hovy et al. 2013)
2. Sequential:
◮ CRF-MA (Rodrigues et al. 2014)
Results: NER task 1 - aggregation

Method                            F1
Majority Vote                     65.71
MACE (Hovy et al. 2013)           67.37
Dawid-Skene (DS)                  71.39
CRF-MA (Rodrigues et al. 2014)    62.53
HMM-Crowd                         74.76
Evaluation: Task 2 - prediction

Baselines:
1. Aggregate, then train:
◮ Majority Vote then CRF
◮ Dawid-Skene then LSTM
2. Train directly on crowd labels:
◮ CRF-MA (Rodrigues et al. 2014)
◮ LSTM (original, Lample et al. 2016)
Results: NER task 2 - prediction

Method                               F1
Majority Vote then CRF               58.20
CRF-MA (Rodrigues et al. 2014)       62.60
LSTM (Lample et al. 2016)            67.73
Dawid-Skene then LSTM                66.27
LSTM-Crowd                           70.82
HMM-Crowd then LSTM                  70.87
LSTM on Gold Labels (upper bound)    84.22
Conclusion

◮ Joint models of sequences and crowd labels.
◮ HMMs work well for aggregation; LSTMs work well for prediction.

In the paper:
◮ An alternative LSTM-Crowd model.
◮ Results for biomedical IE.

Acknowledgments: reviewers, crowd workers, NSF & NIH.

Questions?