lexical sense alignment using weighted bipartite b
play

Lexical sense alignment using weighted bipartite b -matching Sina - PowerPoint PPT Presentation

Lexical sense alignment using weighted bipartite b -matching Sina Ahmadi (ULD) Supervisors: John McCrae, Mihael Arcan 16th Ph.D. day on April 8, 2018 Lexical resources 2 Lexical resources Expert-made Collaboratively-curated 3 Why


  1. Lexical sense alignment using weighted bipartite b -matching Sina Ahmadi (ULD) Supervisors: John McCrae, Mihael Arcan 16th Ph.D. day on April 8, 2018

  2. Lexical resources �2

  3. Lexical resources Expert-made Collaboratively-curated �3

  4. Why combining resources? • To improve word and concept coverage • e.g., named entities, new senses • To improve domain coverage • To improve multilinguality • Creating resources for new language pairs • To combine expert-made semantic relations • e.g., Hypernymy, meronymy, etc. �4

  5. A few applications • Semantic parsing • Word-sense disambiguation and entity linking bat: or • Semantic role labeling �5

  6. Difficulty of resource alignment Spring (n) 6 senses 18 senses �6

  7. How does our resource alignment work? Wiktionary:spring a leap; a bound; a jump WordNet:spring traditionally the first of the the season of growth four seasons of the year in temperate regions the elasticity of something that can be stretched and a place where water or oil returns to its original length emerges from the ground a natural flow of ground water elasHc power or force. the property of a body of a light, self-propelled movement springing to its original form aJer upwards or forwards being compressed, stretched, etc. the Hme of growth and progress; early porHon; first stage. �7

  8. Step 1. Alignment problem as a graph V a leap; a bound; a jump U traditionally the first of the the season of growth four seasons of the year in temperate regions the elasticity of something that can be stretched and a place where water or oil returns to its original length emerges from the ground a natural flow of ground water elasHc power or force. the property of a body of a light, self-propelled movement springing to its original form aJer upwards or forwards being compressed, stretched, etc. the Hme of growth and progress; early porHon; first stage. U and V are disjoint and independent → bipartite �8

  9. Step 2. Extract similarity scores Determine how similar two senses are by training a model using textual and definitional similarity features such as: • Word length ratio * • Longest common subsequence • Jaccard measure • Word embeddings • Forward precision * McCrae, John P., and Paul Buitelaar. "Linking Datasets Using SemanHc Textual �9 Similarity." Cyberne)cs and Informa)on Technologies 18.1 (2018): 109-123.

  10. Step 2. Extract similarity scores V a leap; a bound; a jump U 0.00187 traditionally the first of the the season of growth four seasons of the year in 0.84391 temperate regions the elasticity of something that can be stretched and a place where water or oil 0.01021 returns to its original length emerges from the ground a natural flow of ground water 0.38672 elasHc power or force. the property of a body of a light, self-propelled movement 0.00021 springing to its original form aJer upwards or forwards being compressed, stretched, etc. 0.98951 the Hme of growth and progress; early porHon; first stage. Weighted bipartite graph �10

  11. Step 3. Graph matching

  12. Exhaustive matching V → Use a threshold a leap; a bound; a jump U 0.00187 traditionally the first of the the season of growth four seasons of the year in 0.84391 temperate regions the elasticity of something that can be stretched and a place where water or oil 0.01021 returns to its original length emerges from the ground a natural flow of ground water 0.38672 elasHc power or force. the property of a body of a light, self-propelled movement 0.00021 springing to its original form aJer upwards or forwards being compressed, stretched, etc. 0.98951 0.98951 • High precision, low recall the Hme of growth and progress; early porHon; first stage. • Difficult to find optimal threshold • Uniform matching �12

  13. Exact matching V → 1-to-1 links maximising overall weight a leap; a bound; a jump U 0.00187 0.00091 traditionally the first of the the season of growth four seasons of the year in 0.84391 0.00363 temperate regions the elasticity of something that can be stretched and a place where water or oil 0.01021 0.89191 0.89191 returns to its original length emerges from the ground a natural flow of ground water 0.38672 0.00362 elasHc power or force. the property of a body of a light, self-propelled movement 0.00021 0.02012 springing to its original form aJer upwards or forwards being compressed, stretched, etc. 0.98951 0.98951 0.00101 the Hme of growth and progress; • High precision, not so high recall early porHon; first stage. • Bijective mapping restriction �13

  14. Weighted bipartite b -matching (WB b M) V → maximising overall weight + node capacity [l, b] a leap; a bound; a jump U 0.00187 0.00091 traditionally the first of the the season of growth four seasons of the year in 0.84391 0.84391 0.00363 temperate regions l = 0 l = 0 the elasticity of something b = 1 b = 1 that can be stretched and a place where water or oil 0.01021 0.89191 0.89191 0.89191 returns to its original length l = 1 emerges from the ground l = 1 b = 2 b = 2 a natural flow of ground water 0.38672 0.00362 elasHc power or force. the property of a body of a light, self-propelled movement 0.00021 0.02012 springing to its original form aJer upwards or forwards being compressed, stretched, etc. 0.98951 0.98951 0.98951 0.00101 • High precision, high recall the Hme of growth and progress; early porHon; first stage. • Efficient in linking polysemous items • Still difficult to tune the parameters �14

  15. Our resource alignment mechanism: schema �15

  16. Alignment Experiments: Datasets WordNet synsets manually mapped to their corresponding concepts * * Matuschek, Michael, and Iryna Gurevych. "Dijkstra-wsa: A graph-based approach to word sense alignment." Transac)ons of the Associa)on for �16 Computa)onal Linguis)cs 1 (2013): 151-164.

  17. Experiments: WordNet-Wiktionary alignment Previous work: Our current method: �17

  18. Conclusion • Graph matching algorithms can be efficiently applied to lexical alignment problems • WB b M • includes more possible linking combinations by defining capacity • efficient in lexical alignment providing high precision and recall • difficult to find optimal parameters • highly dependent on the textual and definitional similarities �18

  19. Future directions • Exploring link prediction methods for lexical alignment • Extend our researches to multi-lingual resources • Using graph neural networks (GNNs) �19

Recommend


More recommend