vox populi annotation measuring intensity of ideological
play

Vox Populi Annotation: Measuring Intensity of Ideological - PowerPoint PPT Presentation

Vox Populi Annotation: Measuring Intensity of Ideological Perspectives by Aggregating Group Judgments LREC, Marrakech, Morocco, May 28-30, 2008 Wei-Hao Lin and Alexander Hauptmann Language Technologies Institute School of Computer Science


  1. Vox Populi Annotation: Measuring Intensity of Ideological Perspectives by Aggregating Group Judgments LREC, Marrakech, Morocco, May 28-30, 2008 Wei-Hao Lin and Alexander Hauptmann Language Technologies Institute School of Computer Science Carnegie Mellon University 1

  2. Goal: Annotating Intensity of Expressing Ideology at the Sentence Level 2

  3. Sentence of High Intensity • In the first weeks of the Intifada, for example, Palestinian public protests and civilian demonstrations were answered brutally by Israel, which killed tens of unarmed protesters. 3

  4. Sentence of Low Intensity • The Rhodes aggrements of 1949 set them as the ceasefire lines between Israel and the Arab states. 4

  5. Annotating Intensity is Hard • Hard to define Strong, Medium, and Weak • Hard to train annotators • Hard to achieve high inter-rater agreement 5

  6. Solution: Vox Populi Annotation • Aggregate group judgments on a simple, forced binary question • “Which side do you think the sentence was written from?” 6

  7. Two Problems • How many annotators are needed? • Are these group judgments random? 7

  8. Number of Annotators • A statistical testing problem • The more annotators, the finer difference in intensity we can discern. 8

  9. Number of Annotators 1.0 � 0.9 � 0.75 0.8 0.6 0.6 p value � 0.4 � 0.2 � � � 0.0 � � � � � � � � � � � � � � � � � � � 5 10 15 20 25 sample size 9

  10. Reliability • Reliable = two groups agree with each other • Measure Pearson’s correlation coefficient 10

  11. Annotation Study • 250 sentences from editorials on the Israeli- Palestinian conflict • 18 participants • “Do you think the sentence is written from the Israeli or Palestinian perspective?” 11

  12. Distribution of Intensity 50 40 Frequency 30 20 10 0 0.0 0.2 0.4 0.6 0.8 1.0 Vox Populi Intensity 12

  13. Reliability Assessment 0.5 Vox Populi � � � random 0.5 0.4 � random 0.99 � 0.3 correlation 0.2 � � 0.1 0.0 � 0.1 1 2 3 4 5 6 group size 13

  14. Where to recruit many annotators? 14

  15. Conclusion • Vox Populi Annotation for hard annotation tasks • Solution to two problems in VPA • Positive correlation observed in an empirical annotation study 15

Recommend


More recommend