Collective Annotation of Linguistic Resources:
Basic Principles and a Formal Model

Ulle Endriss
Institute for Logic, Language and Computation
University of Amsterdam

joint work with Raquel Fernández

BNAIC-2013
Outline

• Annotation and Crowdsourcing in Linguistics
• Proposal: Use Social Choice Theory
• Two New Methods of Aggregation
• Results from a Case Study on Textual Entailment
Annotation and Crowdsourcing in Linguistics

To test theories in linguistics and to benchmark algorithms in NLP, we require information on the linguistic judgments of speakers.
Examples: grammaticality, word senses, speech acts, ...

People need corpora with gold standard annotations:
• a set of items (e.g., a text fragment with one utterance highlighted)
• an assignment of a category to each item (e.g., it is an agreement act)

The modern approach is to use crowdsourcing (e.g., Mechanical Turk) to collect annotations: fast, cheap, and more judgments from more speakers.

But: how do we aggregate individual annotations into a gold standard?
• some work on maximum likelihood estimators
• dominant approach: for each item, adopt the majority choice
Social Choice Theory

Aggregating information from individuals is what social choice theory (SCT) is all about. Example: aggregation of preferences in an election.

F : vector of individual preferences ↦ election winner
F : vector of individual annotations ↦ collective annotation

Research agenda:
• develop a variety of aggregation methods for collective annotation
• analyse those methods in a principled manner, as in SCT
• understand features specific to linguistics via empirical studies

For this talk: assume there are just two categories (0 and 1).
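As a baseline for what follows, here is a minimal Python sketch of the per-item majority rule mentioned above. The data layout (a dictionary mapping items to lists of 0/1 labels), the function name, and the handling of ties are assumptions made for illustration, not part of the talk.

from collections import Counter

def majority_rule(profile):
    """Baseline aggregator: for each item, adopt the category chosen most often.
    `profile` maps each item to the list of 0/1 annotations it received.
    Ties are reported as None and left to a separate tie-breaking policy."""
    collective = {}
    for item, labels in profile.items():
        counts = Counter(labels)
        if counts[0] == counts[1]:
            collective[item] = None  # tie: needs tie-breaking
        else:
            collective[item] = max(counts, key=counts.get)
    return collective

# Example: three annotators label two items
profile = {"item1": [1, 1, 0], "item2": [0, 0, 1]}
print(majority_rule(profile))  # {'item1': 1, 'item2': 0}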
Proposal 1: Bias-Correcting Rules

If an annotator appears to be biased towards a particular category, then we could try to correct for this bias during aggregation.

• Freq_i(k): relative frequency of annotator i choosing category k
• Freq(k): relative frequency of k across the full profile

Freq_i(k) > Freq(k) suggests that i is biased towards category k.

A bias-correcting rule tries to account for this by varying the weight given to k-annotations provided by annotator i:
• difference-based: 1 + Freq(k) − Freq_i(k)
• ratio-based: Freq(k) / Freq_i(k)

For comparison: the simple majority rule always assigns weight 1.

Ongoing work: axiomatise this class of rules à la SCT.
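A possible implementation sketch of these two weighting schemes. Only the two weight formulas come from the slide; the data layout (item → {annotator: category}), the function name, and the way weighted votes are summed per item are assumptions for illustration.

def bias_correcting_rule(profile, mode="difference"):
    """Bias-correcting aggregation for binary categories 0/1.
    `profile[item][annotator]` is the category chosen by that annotator."""
    # Relative frequency of each category per annotator and across the full profile
    per_annotator, overall = {}, {0: 0, 1: 0}
    for labels in profile.values():
        for annotator, k in labels.items():
            per_annotator.setdefault(annotator, {0: 0, 1: 0})[k] += 1
            overall[k] += 1
    total = sum(overall.values())
    freq = {k: overall[k] / total for k in (0, 1)}
    freq_i = {a: {k: c[k] / sum(c.values()) for k in (0, 1)}
              for a, c in per_annotator.items()}

    collective = {}
    for item, labels in profile.items():
        score = {0: 0.0, 1: 0.0}
        for annotator, k in labels.items():
            if mode == "difference":
                w = 1 + freq[k] - freq_i[annotator][k]
            else:  # ratio-based; Freq_i(k) > 0 since i chose k at least once
                w = freq[k] / freq_i[annotator][k]
            score[k] += w
        collective[item] = max(score, key=score.get)  # ties go to 0 here
    return collective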
Proposal 2: Greedy Consensus Rules

If there is (near-)consensus on an item, we should adopt that choice. And: we might want to classify annotators who disagree as unreliable.

The greedy consensus rule GreedyCR_t (with tolerance threshold t) repeats two steps until all items are decided:
(1) Lock in the majority decision for the item with the strongest majority not yet locked in.
(2) Eliminate any annotator who disagrees with more than t decisions.

Greedy consensus rules appear to be good at recognising item difficulty.

Ongoing work: try to better understand this phenomenon.
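One possible reading of GreedyCR_t as code. The slide fixes only the two steps above; details such as recomputing majorities over the annotators that remain active, breaking ties towards category 1, and counting disagreements only against locked-in decisions are assumptions of this sketch.

def greedy_consensus_rule(profile, t):
    """One reading of GreedyCR_t for binary categories 0/1.
    `profile[item][annotator]` is the category chosen by that annotator.
    Assumes every undecided item retains at least one active annotator."""
    active = {a for labels in profile.values() for a in labels}
    locked = {}
    disagreements = {a: 0 for a in active}
    while len(locked) < len(profile):
        # Step 1: find the undecided item with the strongest majority
        # among the annotators that have not been eliminated
        best_item, best_margin, best_choice = None, -1, None
        for item, labels in profile.items():
            if item in locked:
                continue
            votes = [k for a, k in labels.items() if a in active]
            margin = abs(votes.count(1) - votes.count(0))
            if margin > best_margin:
                choice = 1 if votes.count(1) >= votes.count(0) else 0  # ties go to 1
                best_item, best_margin, best_choice = item, margin, choice
        locked[best_item] = best_choice
        # Step 2: eliminate annotators disagreeing with more than t locked-in decisions
        for a, k in profile[best_item].items():
            if k != best_choice:
                disagreements[a] += 1
        active = {a for a in active if disagreements[a] <= t}
    return locked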
Case Study: Recognising Textual Entailment

In RTE tasks, the goal is to develop algorithms that decide whether a given piece of text (T) entails a given hypothesis (H). Examples:

Text: Eyeing the huge market potential, currently led by Google, Yahoo took over search company Overture Services Inc last year.
Hypothesis: Yahoo bought Overture.
Gold standard: 1 (entailment)

Text: The National Institute for Psychobiology in Israel was established in May 1971 as the Israel Center for Psychobiology.
Hypothesis: Israel was established in May 1971.
Gold standard: 0 (no entailment)

We used a dataset collected by Snow et al. (2008):
• Gold standard: 800 items (T-H pairs) with an 'expert' annotation
• Crowdsourced data: 10 MTurk annotations per item (164 people)

R. Snow, B. O'Connor, D. Jurafsky, and A.Y. Ng. Cheap and fast—but is it good? Evaluating non-expert annotations for natural language tasks. Proc. EMNLP-2008.
Case Study: Results

How did we do? Observed agreement with the gold standard:

• Simple Majority Rule (produced 65 ties for 800 items):
  – 89.7% under uniform tie-breaking
  – 85.6% if ties are counted as misses
• Bias-Correcting Rules (no ties encountered):
  – 91.5% for the difference-based rule
  – 90.8% for the ratio-based rule
• Greedy Consensus Rules (for certain implementation choices):
  – 86.6% for tolerance threshold 0 (found a coalition of 46/164 annotators)
  – 92.5% for tolerance threshold 15 (found a coalition of 156/164 annotators)

Ongoing work: understand better what performance depends on.
Example

An example where GreedyCR_15 correctly overturns a 7-3 majority against the gold standard (0, i.e., T does not entail H):

T: The debacle marked a new low in the erosion of the SPD's popularity, which began after Mr. Schröder's election in 1998.
H: The SPD's popularity is growing.

The item ends up being the 631st to be considered:

Annotator        Choice  Disagreements  In/Out
AXBQF8RALCIGV    1       83             ×
A14JQX7IFAICP0   1       34             ×
A1Q4VUJBMY78YR   1       81             ×
A18941IO2ZZWW6   1       148            ×
AEX5NCH03LWSG    1       19             ×
A3JEUXPU5NEHXR   0       2              ✓
A11GX90QFWDLMM   1       143            ×
A14WWG6NKBDWGP   1       1              ✓
A2CJUR18C55EF4   0       2              ✓
AKTL5L2PJ2XCH    0       1              ✓
Last Slide

• We took inspiration from social choice theory to formulate a model for aggregating the expertise of speakers in annotation projects.
• We proposed two families of aggregation methods that are more sophisticated than the standard majority rule, by accounting for the reliability of individual annotators.
• Our broader aim is to reflect on the methods used to aggregate annotation information: social choice theory can help.