representing a concept by the distribution of names of
play

Representing a concept by the distribution of names of its instances - PowerPoint PPT Presentation

Representing a concept by the distribution of names of its instances Matthijs Westera, Gemma Boleda and Sebastian Pad Representing a concept by the distribution of names of its instances A b h i j e e t G u p t a & Matthijs


  1. Existing data/model we use ● The Instantiation dataset (Boleda, Gupta, and Padó, 2017, EACL) : – e.g., <Emmy Noether, scientist>, <Edinburgh, capital> – derived from WordNet’s ‘instance hyponym’ relation. 52

  2. Existing data/model we use ● The Instantiation dataset (Boleda, Gupta, and Padó, 2017, EACL) : – e.g., <Emmy Noether, scientist>, <Edinburgh, capital> – derived from WordNet’s ‘instance hyponym’ relation. ● We focus on the 159 categories that have at least 5 entities. 53

  3. Existing data/model we use ● The Instantiation dataset (Boleda, Gupta, and Padó, 2017, EACL) : – e.g., <Emmy Noether, scientist>, <Edinburgh, capital> – derived from WordNet’s ‘instance hyponym’ relation. ● We focus on the 159 categories that have at least 5 entities. ● As DS representations of the entities’ names and categories’ predicates we use the Google News embeddings (Mikolov, Sutskever, et al., 2013, ANIPS) . 54

  4. Evaluation: gathering human judgments 55

  5. Evaluation: gathering human judgments Following Bruni, Tran and Baroni’s MEN benchmark (2012, JAIR): 56

  6. Evaluation: gathering human judgments Following Bruni, Tran and Baroni’s MEN benchmark (2012, JAIR): ● We semi-randomly sampled 1000 category pairs (out of 12.5K). 57

  7. Evaluation: gathering human judgments Following Bruni, Tran and Baroni’s MEN benchmark (2012, JAIR): ● We semi-randomly sampled 1000 category pairs (out of 12.5K). ● ‘Comparative’ task : which pair of categories are more related to each other? 58

  8. Evaluation: gathering human judgments Following Bruni, Tran and Baroni’s MEN benchmark (2012, JAIR): ● We semi-randomly sampled 1000 category pairs (out of 12.5K). ● ‘Comparative’ task : which pair of categories are more related to each other? ● Also same way of computing aggregated ‘relatedness’ scores. 59

  9. Crowdsource task 60

  10. Main result 61

  11. Main result ● Spearman (ranking) correlations between: 62

  12. Main result ● Spearman (ranking) correlations between: – cosine similarities from Name-based / Predicate-based and – aggregate scores from our human judgments 63

  13. Main result ● Spearman (ranking) correlations between: – cosine similarities from Name-based / Predicate-based and – aggregate scores from our human judgments ● Result: – Predicate-based: 0.56 64

  14. Main result ● Spearman (ranking) correlations between: – cosine similarities from Name-based / Predicate-based and – aggregate scores from our human judgments ● Result: – Predicate-based: 0.56 – Name-based: 0.74 65

  15. Artist’s impression 66

  16. Artist’s impression 67

  17. How many names do we need? 68

  18. How many names do we need? 69

  19. How many names do we need? S u r p r i s i n g l y f e w ! 70

  20. Entities need to be representative 71

  21. Entities need to be representative ● E.g., the Name-based model overestimates surgeon ~ siege ... 72

  22. Entities need to be representative ● E.g., the Name-based model overestimates surgeon ~ siege ... ● Instances of surgeon in the Instantiation dataset: – William Cowper – James Parkinson – Alexis Carrel – Walter Reed – William Beaumont – Joseph Lister 73

  23. Entities need to be representative ● E.g., the Name-based model overestimates surgeon ~ siege ... ● Instances of surgeon in the Instantiation dataset: – William Cowper – James Parkinson – Alexis Carrel – Walter Reed I n v o l v e d i n WW1 – William Beaumont – Joseph Lister 74

  24. Entities need to be representative ● E.g., the Name-based model overestimates surgeon ~ siege ... ● Instances of surgeon in the Instantiation dataset: – William Cowper – James Parkinson – Alexis Carrel – Walter Reed I n v o l v e d i n WW1 – William Beaumont M e mb e r s o f U S mi l i t a r y c o r p s – Joseph Lister 75

  25. Entities need to be representative ● E.g., the Name-based model overestimates surgeon ~ siege ... ● Instances of surgeon in the Instantiation dataset: – William Cowper – James Parkinson Wr o t e “ t h e s i e g e o f c h e s t e r ” ( ? ) – Alexis Carrel – Walter Reed I n v o l v e d i n WW1 – William Beaumont M e mb e r s o f U S mi l i t a r y c o r p s – Joseph Lister 76

  26. Discussion 77

  27. Discussion 78

  28. Discussion ● Main finding: 79

  29. Discussion ● Main finding: – Name-based representations of category concepts align better with ‘the world’ than Predicate-based representations. 80

  30. Discussion ● Main finding: – Name-based representations of category concepts align better with ‘the world’ than Predicate-based representations. – Even a small number of (representative) names can be enough. 81

  31. Discussion ● Main finding: – Name-based representations of category concepts align better with ‘the world’ than Predicate-based representations. – Even a small number of (representative) names can be enough. ● Outlook: – Not every category has named instances... 82

  32. Discussion ● Main finding: – Name-based representations of category concepts align better with ‘the world’ than Predicate-based representations. – Even a small number of (representative) names can be enough. ● Outlook: – Not every category has named instances... – NLP relevance? Vs. sense disambiguation? Contextualized word embeddings (ELMo, BERT, …)? 83

  33. Discussion ● Main finding: – Name-based representations of category concepts align better with ‘the world’ than Predicate-based representations. – Even a small number of (representative) names can be enough. ● Outlook: – Not every category has named instances... – NLP relevance? Vs. sense disambiguation? Contextualized word embeddings (ELMo, BERT, …)? – Cognitive relevance? E.g., prototype theory? 84

  34. Acknowledgments This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 715154). This paper reflects the authors’ view only, and the EU is not responsible for any use that may be made of the information it contains. 85

  35. Image sources https://ui-ex.com/explore/whale-transparent-dark/ https://commons.wikimedia.org/wiki/File:Cowicon.svg https://commons.wikimedia.org/wiki/File:Bird_1010720_drawing.svg https://commons.wikimedia.org/wiki/File:Dog_silhouette.svg https://commons.wikimedia.org/wiki/File:Cat_silhouette_darkgray.svg https://commons.wikimedia.org/wiki/File:Frog_(example).svg https://commons.wikimedia.org/wiki/File:PeregrineFalconSilhouettes.svg https://commons.wikimedia.org/wiki/File:Common_goldfish_silhouette.svg https://commons.wikimedia.org/wiki/File:Six_weeks_old_cat_(aka).jpg https://nl.m.wikipedia.org/wiki/Bestand:Kooikerhondje_puppy.jpg https://nl.m.wikipedia.org/wiki/Bestand:Golden_Retriever_eating_crust_of_pizza.jpg https://commons.wikimedia.org/wiki/File:Cat-eating-prey.jpg 86

  36. Where are predicates and names, anyway? name predicate 87

  37. Where are predicates and names, anyway? name predicate 88

  38. Crowdsource task 89

  39. Crowdsource task 90

  40. Crowdsource task instructions 91

  41. Crowdsource task instructions 92

  42. Why definitions? 93

  43. Why definitions? ● The same words can often be used to denote various categories. 94

  44. Why definitions? ● The same words can often be used to denote various categories. ● To properly evaluate the Name-based approach, the human judgments should be about the categories as intended by the Instantiation dataset we use. 95

  45. Why definitions? ● The same words can often be used to denote various categories. ● To properly evaluate the Name-based approach, the human judgments should be about the categories as intended by the Instantiation dataset we use. ● (Would be good practice more generally – e.g., vs. the good subject effect. ) 96

  46. Why definitions? ● The same words can often be used to denote various categories. ● To properly evaluate the Name-based approach, the human judgments should be about the categories as intended by the Instantiation dataset we use. ● (Would be good practice more generally – e.g., vs. the good subject effect. ) ● This may give the Predicate-based approach a disadvantage… 97

  47. Why definitions? ● The same words can often be used to denote various categories. ● To properly evaluate the Name-based approach, the human judgments should be about the categories as intended by the Instantiation dataset we use. ● (Would be good practice more generally – e.g., vs. the good subject effect. ) ● This may give the Predicate-based approach a disadvantage… – but this disadvantage is not an unfair one. 98

  48. A closer look per ontological domain 99

  49. A closer look per ontological domain Predicate -based: 100

Recommend


More recommend