combining distant and partial supervision for relation
play

Combining Distant and Partial Supervision for Relation Extraction - PowerPoint PPT Presentation

Combining Distant and Partial Supervision for Relation Extraction Gabor Angeli , Julie Tibshirani, Jean Y. Wu, Christopher D. Manning Stanford University October 28, 2014 Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial


  1. Combining Distant and Partial Supervision for Relation Extraction Gabor Angeli , Julie Tibshirani, Jean Y. Wu, Christopher D. Manning Stanford University October 28, 2014 Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 1 / 19

  2. Motivation: Knowledge Base Completion Unstructured Text Structured Knowledge Base ⇒ Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 2 / 19

  3. Motivation: Question Answering Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 3 / 19

  4. Motivation: Question Answering Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 3 / 19

  5. Motivation: Question Answering Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 3 / 19

  6. Motivation: Question Answering Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 3 / 19

  7. Relation Extraction Input : Sentences containing (entity, slot value). Output : Relation between entity and slot value. Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 4 / 19

  8. Relation Extraction Input : Sentences containing (entity, slot value). Output : Relation between entity and slot value. Consider two approaches: Supervised: Trivial as a supervised classifier. Training data: { (sentence, relation) } . But ... Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 4 / 19

  9. Relation Extraction Input : Sentences containing (entity, slot value). Output : Relation between entity and slot value. Consider two approaches: Supervised: Trivial as a supervised classifier. Training data: { (sentence, relation) } . But ... this training data is expensive to produce. Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 4 / 19

  10. Relation Extraction Input : Sentences containing (entity, slot value). Output : Relation between entity and slot value. Consider two approaches: Supervised: Trivial as a supervised classifier. Training data: { (sentence, relation) } . But ... this training data is expensive to produce. Distantly Supervised: Artificially produce “supervised” data. Training data: { (entity, relation, slot value) } . But ... Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 4 / 19

  11. Relation Extraction Input : Sentences containing (entity, slot value). Output : Relation between entity and slot value. Consider two approaches: Supervised: Trivial as a supervised classifier. Training data: { (sentence, relation) } . But ... this training data is expensive to produce. Distantly Supervised: Artificially produce “supervised” data. Training data: { (entity, relation, slot value) } . But ... this training data is much more noisy. Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 4 / 19

  12. Contribution: Combine Benefits of Both Adding carefully selected supervision improves distantly supervised relation extraction. Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 5 / 19

  13. Contribution: Combine Benefits of Both Adding carefully selected supervision improves distantly supervised relation extraction. What is “carefully selected”: Propose new active learning criterion. Evaluate a number of questions: Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 5 / 19

  14. Contribution: Combine Benefits of Both Adding carefully selected supervision improves distantly supervised relation extraction. What is “carefully selected”: Propose new active learning criterion. Evaluate a number of questions: Is the proposed criterion better than other methods? Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 5 / 19

  15. Contribution: Combine Benefits of Both Adding carefully selected supervision improves distantly supervised relation extraction. What is “carefully selected”: Propose new active learning criterion. Evaluate a number of questions: Is the proposed criterion better than other methods? Where is the supervision helping? Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 5 / 19

  16. Contribution: Combine Benefits of Both Adding carefully selected supervision improves distantly supervised relation extraction. What is “carefully selected”: Propose new active learning criterion. Evaluate a number of questions: Is the proposed criterion better than other methods? Where is the supervision helping? How far can we get with a supervised classifier? Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 5 / 19

  17. Distant Supervision (Barack Obama, EmployedBy , United States) Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 6 / 19

  18. Multiple-Instance Multiple-Label (MIML) Learning (Barack Obama, EmployedBy , United States) Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 6 / 19

  19. Distant Supervision ↓ EmployedBy y y y x x x ↑ Barack Obama is the 44th and current president of the United States Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 7 / 19

  20. Distant Supervision ↓ EmployedBy y y y x x x ↑ Barack Obama is the 44th and current president of the United States Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 7 / 19

  21. Multiple-Instance y Latent per-mention relation → z 1 z 2 z 3 x 3 x 1 x 2 Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 8 / 19

  22. Multiple-Instance y Latent per-mention relation → z 1 z 2 z 3 x 3 x 1 x 2 Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 8 / 19

  23. Multiple-Instance Multiple-Label (MIML-RE) y n − 1 y 1 y 2 y n ... z 1 z 2 z 3 x 1 x 2 x 3 Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 8 / 19

  24. Active Learning Old problem: Supervision is expensive, but very useful. Old solution: Active learning! Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 9 / 19

  25. Active Learning Old problem: Supervision is expensive, but very useful. Old solution: Active learning! Select a subset of latent z to annotate. Fix these labels during training. Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 9 / 19

  26. Active Learning Old problem: Supervision is expensive, but very useful. Old solution: Active learning! Select a subset of latent z to annotate. Fix these labels during training. Bonus: this creates a supervised training set. We initialize from a supervised classifier on this training set. Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 9 / 19

  27. Active Learning Old problem: Supervision is expensive, but very useful. Old solution: Active learning! Select a subset of latent z to annotate. Fix these labels during training. Bonus: this creates a supervised training set. We initialize from a supervised classifier on this training set. Some Statistics 1,208,524 latent z which we could annotate. $0.13 per annotation. $160,000 to annotate everything. Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 9 / 19

  28. Active Learning Old problem: Supervision is expensive, but very useful. Old solution: Active learning! Select a subset of latent z to annotate. Fix these labels during training. Bonus: this creates a supervised training set. We initialize from a supervised classifier on this training set. Some Statistics 1,208,524 latent z which we could annotate. $0.13 per annotation. $160,000 to annotate everything. New spin: Have to get it right the first time. Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 9 / 19

  29. Example Selection Criteria Train k MIML-RE models on k subsets of the data. 1 y 1 y 2 y n − 1 y n y 1 y 2 y n − 1 y n y 1 y 2 y n − 1 y n y 1 y 2 y n − 1 y n y 1 y 2 y n − 1 y n ... ... ... ... ... z 1 z 2 z 3 z 1 z 2 z 3 z 1 z 2 z 3 z 1 z 2 z 3 z 1 z 2 z 3 x 1 x 2 x 3 x 1 x 2 x 3 x 1 x 2 x 3 x 1 x 2 x 3 x 1 x 2 x 3 Angeli, Tibshirani, Wu, Manning (Stanford) Combining Distant and Partial Supervision ... October 28, 2014 10 / 19

Recommend


More recommend