decomposing generalization
play

decomposing generalization models of generic, habitual and episodic - PowerPoint PPT Presentation

decomposing generalization models of generic, habitual and episodic statements Venkata S Govindarajan, Benjamin Van Durme, Aaron Steven White Transactions of the ACL Volume 7, 2019 p.501-517 generalization 1 The service at that restaurant was


  1. decomposing generalization models of generic, habitual and episodic statements Venkata S Govindarajan, Benjamin Van Durme, Aaron Steven White Transactions of the ACL Volume 7, 2019 p.501-517

  2. generalization 1 The service at that restaurant was good How to capture linguistic generalization like in the above in a framework for research and annotation? The ability to capture different modes of generalization is key to building systems with robust commonsense reasoning . (Zhang, Rudinger, Duh, et al. 2017, Bauer et al. 2018, McCarthy 1960, 1980, Minsky 1974, Hobbs et al. 1987) 1

  3. our claim Linguistic generalizations should be captured in a continuous multi-label system , using simple real-valued referential properties. Our framework is based on Decompositional Semantics . (White et al. 2016) 2

  4. background

  5. standard classification stative G. N. Carlson et al. 1995, Carlson 2005 meat. generic eat kind Lions 6 from Asia. episodic disappeared kind The lion 5 in the cage. is 2 individual The lion 4 oatmeal for breakfast. habitual eats individual Mary 3 lunch. episodic ate individual Mary 3

  6. problems Know where crimes usually happen, and be safe . That bureaucrat takes the 90 bus to work. b. Open the window , will you please? a. Indefjnite defjnites (G. Carlson et al. 2006) 9 The atmosphere may not be for everyone. b. a. Arguments and Predicates do not always fall under such well defjned categories Abstract Reference (Grimm 2014, 2016) 8 That vintner makes three different wines. b. One whale , namely the blue whale, is nearly extinct. a. Taxonomic Reference (G. N. Carlson et al. 1995) 7 as described. 4

  7. current corpora The ACE-2 program (Doddington et al. 2004, Reiter et al. 2010) associated entity mentions with two classes - specifjc and generic. The ACE-2005 (Walker et al. 2006) corpus adds data and provides two additional classes - neg (empty sets), and usp (underspecifjed). The EventCorefBank (ECB) (Bejan et al. 2010, Lee et al. 2012) annotates event and entity mentions with a generic class. SitEnt – the Situational Entities Corpus (Friedrich et al. 2016, 2015, 2014) annotates NPs and clauses separately for their genericity, habituality, and lexical aspectual class of main verb. They fail to deal with taxonomic reference, abstract reference and indefjnite defjnites. 5 All of these frameworks employ multi-class annotation schemes.

  8. annotation framework and data collection

  9. annotation framework Decompose arguments and predicates into simple referential properties. Collect annotations for argument and predicate properties separately, with confjdence ratings for each annotation. annotation schema. 6 Multiple properties can be true of a predicate/argument – multi-label

  10. axes of reference Spatiotemporal Type Tangible 7

  11. You wonder if he was manipulating the market with his bombing targets . with (True,4), (False, 3), (True,2), ... Annotation on Mechanical Turk wonder, manipulating, you, market, targets Filtering PredPatt (Zhang, Rudinger & Durme 2017) extracts Arguments & Predicates . bombing his targets Universal Dependencies (Bies et al. 2012) the market was he if manipulating You wonder 8

  12. You wonder if he was manipulating the market with his bombing targets . with (True,4), (False, 3), (True,2), ... Annotation on Mechanical Turk wonder, manipulating, you, market, targets Filtering PredPatt (Zhang, Rudinger & Durme 2017) extracts Arguments & Predicates . bombing his targets Universal Dependencies (Bies et al. 2012) the market was he if manipulating You wonder 8

  13. You wonder if he was manipulating the market with his bombing targets . with (True,4), (False, 3), (True,2), ... Annotation on Mechanical Turk wonder, manipulating, you, market, targets Filtering PredPatt (Zhang, Rudinger & Durme 2017) extracts Arguments & Predicates . bombing his targets Universal Dependencies (Bies et al. 2012) the market was he if manipulating You wonder 8

  14. You wonder if he was manipulating the market with his bombing targets . with (True,4), (False, 3), (True,2), ... Annotation on Mechanical Turk wonder, manipulating, you, market, targets Filtering PredPatt (Zhang, Rudinger & Durme 2017) extracts Arguments & Predicates . bombing his targets Universal Dependencies (Bies et al. 2012) the market was he if manipulating You wonder 8

  15. You wonder if he was manipulating the market with his bombing targets . with (True,4), (False, 3), (True,2), ... Annotation on Mechanical Turk wonder, manipulating, you, market, targets Filtering PredPatt (Zhang, Rudinger & Durme 2017) extracts Arguments & Predicates . bombing his targets Universal Dependencies (Bies et al. 2012) the market was he if manipulating You wonder 8

  16. argument annotation 9

  17. predicate annotation 10

  18. You wonder if he was manipulating the market with his bombing targets . with (True,4), (False, 3), (True,2), ... Annotation on Mechanical Turk wonder, manipulating, you, market, targets Filtering PredPatt (Zhang, Rudinger & Durme 2017) extracts Arguments & Predicates . bombing his targets Universal Dependencies (Bies et al. 2012) the market was he if manipulating You wonder 11

  19. You wonder if he was manipulating the market with his bombing targets . his Normalization (True,4), (False, 3), (True,2), ... Annotation on Mechanical Turk wonder, manipulating, you, market, targets Filtering PredPatt (Zhang, Rudinger & Durme 2017) extracts Arguments & Predicates . bombing with Universal Dependencies (Bies et al. 2012) targets the market was he if manipulating You wonder 12

  20. data normalization The need to adjust annotation bias has long been recognized in psycholinguistics literature (Baayen 2008). We employ such procedures to arrive at a single real-valued score . Confjdence Normalization To adjust for annotator bias while using confjdence scales, we use ridit scoring (Agresti 2003). It reweights confjdences based on frequency. Binary Normalization To adjust for annotator bias while assigning labels to properties, we use a mixed effects logistic model (Gelman et al. 2014) We thus estimate a real-valued score for each property and each token based on the average annotator . 13

  21. You wonder if he was manipulating the market with his bombing targets . his 3.2, -2.3, 1.1, ... Normalization (True,4), (False, 3), (True,2), ... Annotation on Mechanical Turk wonder, manipulating, you, market, targets Filtering PredPatt (Zhang, Rudinger & Durme 2017) extracts Arguments & Predicates . bombing with Universal Dependencies (Bies et al. 2012) targets the market was he if manipulating You wonder 14

  22. Universal Decompositional Semantics-Genericity (UDS-G) dataset: 37,146 Arguments , 33,114 Predicates Data (and code) available at decomp.io 15

  23. preliminary analysis

  24. argument normalized distribution 16

  25. argument normalized distribution 17

  26. argument normalized distribution 18

  27. 11 Some places do the registration right at the hospital... 19

  28. 12 Meanwhile, his reputation seems to be improving... 20

  29. predicate normalized distribution 21

  30. predicate normalized distribution 22

  31. 13 I have faxed to you the form of Bond... 23

  32. 14 Is gare montparnasse storage still available ? 24

  33. 15 Who knows what the future might hold, and it might still be expensive? 25

  34. 16 I have tryed to give him water but he wont take it..what should i do ? 26

  35. modeling

  36. feature representations To predict the real-valued properties using a computational model, arguments and predicates need rich feature representations. • Hand engineered: • Type level VerbNet classes, FrameNet frames, WordNet supersenses, Concreteness ratings (Brysbaert et al. 2014) • Token level Part-of-Speech tags, Infmectional features, Syntactic Relations • Learned (word embeddings): • Type level GloVe static embeddings (Pennington et al. 2014) • Token level ELMO contextual embeddings (Peters et al. 2018) 27

  37. labelling model Multi-Layer Neural Network that takes as input one (or more) of the feature representations of the argument/predicate token that was annotated, and outputs 3 real values corresponding to the 3 properties. 28

  38. results - argument Hand Type Hand Token Learned Type Learned Token 20 30 40 50 60 Pearson Correlation 𝜍 29

  39. results - argument Hand Type Hand Token Learned Type Learned Token 20 30 40 50 60 Pearson Correlation 𝜍 particular kind 30

  40. results - argument Hand Type Hand Token Learned Type Learned Token 20 30 40 50 60 Pearson Correlation 𝜍 particular kind abstract 30

  41. results - predicate Hand Type Hand Token Learned Type Learned Token 0 10 20 30 40 50 Pearson Correlation 𝜍 31

  42. results - predicate Hand Type Hand Token Learned Type Learned Token 0 10 20 30 40 50 Pearson Correlation 𝜍 particular hypothetical 32

  43. results - predicate Hand Type Hand Token Learned Type Learned Token 0 10 20 30 40 50 Pearson Correlation 𝜍 particular hypothetical dynamic 32

Recommend


More recommend