crowdtruth metrics for capturing ambiguity
play

CrowdTruth Metrics for Capturing Ambiguity Interlinking Workers, - PowerPoint PPT Presentation

CrowdTruth Metrics for Capturing Ambiguity Interlinking Workers, Annotations and Input Data Traditional Humans provide annotations establishing the ground Human truth = the correct output for each example (the gold standard ) Annotation


  1. CrowdTruth Metrics for Capturing Ambiguity Interlinking Workers, Annotations and Input Data

  2. Traditional • Humans provide annotations establishing the ground Human truth = the correct output for each example (the gold standard ) Annotation • Machines learn the ground truth • Ground Truth Quality: typically measured by inter-annotator agreement (e.g. majority vote); founded on the ideal for single, universally constant truth • which means - ambiguity of textual interpretation is often lost Anca Dumitrache CrowdTruth.org @anca_dmtrch #CrowdTruth

  3. CrowdTruth Methodology • Annotator disagreement is signal, not noise. • It can indicate of the variation in human semantic interpretation • Can be used to capture ambiguity, vagueness, similarity, over-generality, as well as quality What causes disagreement to happen? Anca Dumitrache CrowdTruth.org @anca_dmtrch #CrowdTruth

  4. Disagreement because of Low Quality Workers Do the sentences express a relation? Anca Dumitrache CrowdTruth.org @anca_dmtrch #CrowdTruth

  5. Disagreement because of Sentence Clarity Do the sentences express a relation between and ? Anca Dumitrache CrowdTruth.org @anca_dmtrch #CrowdTruth

  6. Disagreement because of Sentence Clarity Do the sentences express a relation between and ? → agreement 95% → agreement 75% → agreement 50% Anca Dumitrache CrowdTruth.org @anca_dmtrch #CrowdTruth

  7. Disagreement because of an Ambiguous Annotation Task What is the relation expressed? or ? Anca Dumitrache CrowdTruth.org @anca_dmtrch #CrowdTruth

  8. Triangle of disagreement as model for crowdsourcing systems e.g. sentence, paragraph, image, sound etc. Ambiguity at any corner disseminates in the other corners Anca Dumitrache CrowdTruth.org @anca_dmtrch #CrowdTruth

  9. CrowdTruth quality metrics ● ● ● Anca Dumitrache CrowdTruth.org @anca_dmtrch #CrowdTruth

  10. CrowdTruth.org github.com/CrowdTruth/CrowdTruth-core pypi.org/project/CrowdTruth data.CrowdTruth.org

Recommend


More recommend