International Joint Conference on Natural Language Processing November 29, 2017 Towards Bootstrapping a Polarity Shifter Lexicon using Linguistic Features Marc Schulder Michael Wiegand Josef Ruppenhofer Benjamin Roth Spoken Language Institute for German Center for Information and Systems Language Language Processing Saarland University Mannheim LMU Munich
What are Polarity Shifters? Shifters , like negation words, move the polarity of a phrase towards the opposite of the polar term they contain. Negation Peter [did not [pass] + the exam] - . They [did not [destroy] - the temple] + . Verbal Shifter Peter [ failed to [pass] + the exam] - . They [ failed to [destroy] - the temple] + . Saarland University Marc Schulder 2
Overview • Motivation • Bootstrapping a Lexicon • Features • Classification • Output Verification • Extrinsic Evaluation • Conclusion Saarland University Marc Schulder 3
Negation VS Verbal Shifters Negation Verbal Shifters Word Type Function words Content Words Saarland University Marc Schulder 4
Negation VS Verbal Shifters Negation Verbal Shifters Word Type Function words Content Words Large Vocabulary Size Small (15% of verbs) Saarland University Marc Schulder 4
Negation VS Verbal Shifters Negation Verbal Shifters Word Type Function words Content Words Large Vocabulary Size Small (15% of verbs) Individual High Low Frequency Saarland University Marc Schulder 4
Negation VS Verbal Shifters Negation Verbal Shifters Word Type Function words Content Words Large Vocabulary Size Small (15% of verbs) Individual High Low Frequency Full Coverage Yes No Saarland University Marc Schulder 4
Negation VS Verbal Shifters Negation Verbal Shifters Word Type Function words Content Words Large Vocabulary Size Small (15% of verbs) Individual High Low Frequency Full Coverage Yes No Saarland University Marc Schulder 4
Pipeline Bootstrapping Large Shifter Lexicon Saarland University Marc Schulder 5
Pipeline Expensive Bootstrapping Large Shifter Lexicon Saarland University Marc Schulder 5
Pipeline Bootstrapping Large Base Shifter Shifter Lexicon Lexicon Saarland University Marc Schulder 5
Pipeline Bootstrapping Large Base Shifter Shifter Lexicon Lexicon Saarland University Marc Schulder 6
Pipeline Bootstrapping Large Base Shifter Classifier Shifter Lexicon Lexicon (labelled) Saarland University Marc Schulder 7
Pipeline Bootstrapping Large Base Shifter Features Classifier Shifter Lexicon Lexicon (labelled) Saarland University Marc Schulder 7
Pipeline Bootstrapping Large Base Shifter Features Classifier Shifter Lexicon Lexicon (labelled) WordNet verbs (unlabelled) Saarland University Marc Schulder 7
Pipeline Bootstrapping Large Base Verify Shifter Features Classifier Shifter Shifters Lexicon Lexicon (labelled) WordNet verbs (unlabelled) Saarland University Marc Schulder 8
Pipeline Bootstrapping Large Base Verify Shifter Features Classifier Shifter Shifters Lexicon Lexicon (labelled) WordNet verbs (unlabelled) Saarland University Marc Schulder 9
Generic Features WordNet • Glosses: Word definition (bag-of-words). • Hypernyms: Words with more general meaning. • Supersenses: Coarse semantic categories. FrameNet • Verb Frames: Semantic verb groups. Frame AVOIDING: desist, dodge, evade, shun, shirk,... Saarland University Marc Schulder 10
Task-specific Features 1. Distributional Similarity Choose verbs similar to negation words like not , no , etc. 2. Polarity Clash Negative verb with positive object. She [ lost [hope] + ] - . 3. Particle Verbs Some particles indicate "loss" (e.g. aside, down, o ff ,... ). Please [ lay aside all your [worries] - ] + . 4. any- Heuristic The word any co-occurs with negation/shifters. They did [ not give us any [help] + ] - . Best They [ denied us any [help] + ] - . Saarland University Marc Schulder 11
Anti-Shifter Feature Anti-Shifter: Co-occurrence with adverbs that are • attracted to verbs of creation; • repelled by verbs of destruction. Black bears exclusively live anti-shifter on fish. Keyboards on phones were first introduced anti-shifter in 1997. These buildings have been newly constructed anti-shifter . They specially prepared anti-shifter vegan dishes for me. Saarland University Marc Schulder 12
Pipeline Bootstrapping 2000 verbs Large Base Verify Shifter Features Classifier Shifter Shifters Lexicon Lexicon 304 shifters (labelled) 8581 verbs WordNet verbs (unlabelled) Saarland University Marc Schulder 13
Classifier Setup SVM • Training: Base Lexicon 2000 verbs, incl. 304 shifters • Labels : Shifter, non-shifter • Evaluation: 10-fold cross validation Saarland University Marc Schulder 14
Classifier Setup SVM • Training: Base Lexicon 2000 verbs, incl. 304 shifters • Labels : Shifter, non-shifter • Evaluation: 10-fold cross validation Baselines • Majority Label: All verbs are non-shifters Saarland University Marc Schulder 14
Classifier Setup SVM • Training: Base Lexicon 2000 verbs, incl. 304 shifters • Labels : Shifter, non-shifter • Evaluation: 10-fold cross validation Baselines • Majority Label: All verbs are non-shifters • Graph Clustering (Approach with no labelled training data) • Input: Word Embedding Graph + Seeds • Positive Seeds: ANY (best shifter feature) • Negative Seeds: ANTI Saarland University Marc Schulder 14
Classifier Performance 1.00 0.79 0.75 Macro F1 0.62 0.50 0.46 0.25 Majority Graph Clustering SVM Saarland University Marc Schulder 15
Pipeline Bootstrapping 2000 verbs Large Base 1043 shifters Verify Shifter Features Classifier Shifter Shifters Lexicon Lexicon 304 shifters (labelled) 8581 verbs WordNet verbs (unlabelled) Saarland University Marc Schulder 16
Shifter Verification • Task: Human annotator verifies predicted shifters. • Input: 1043 verbs predicted as shifters. • Output: 676 verbs confirmed as shifters. 1.00 0.93 0.73 0.67 Precision 0.62 0.33 0.33 0.00 1-250 251-500 501-750 751-1043 Classifier Confidence Ranking Saarland University Marc Schulder 17
Pipeline Bootstrapping 2000 verbs 304 shifters 676 shifters Large Base Verify Shifter Features Classifier Shifter Shifters Lexicon Lexicon (labelled) 8581 verbs WordNet verbs (unlabelled) Saarland University Marc Schulder 18
Pipeline Bootstrapping 2000 verbs 304 shifters 676 shifters Large Base Verify Shifter Features Classifier Shifter Shifters Lexicon Lexicon 304 + 676 = (labelled) 980 shifters 8581 verbs WordNet verbs (unlabelled) Saarland University Marc Schulder 18
Pipeline Bootstrapping 2000 verbs 304 shifters 676 shifters Large Base Verify Shifter Features Classifier Shifter Shifters Lexicon Lexicon 304 + 676 = (labelled) 980 shifters 8581 verbs Fine-grained WordNet Sentiment verbs Analysis (unlabelled) Saarland University Marc Schulder 18
Extrinsic Evaluation Sentiment Analysis Task: Given a verb phrase with a polar noun, decide whether phrase polarity has shifted from the polarity of the noun. Input: Norah Jones’ smooth voice could [soothe V any savage [beast N ] - ] ? . VP Output Labels: Shifted, not shifted Gold Data: Amazon Product Review Corpus (Jindal and Liu, 2008) 2631 phrases Balanced for ratio of shifters among verbs. Saarland University Marc Schulder 19
Extrinsic Evaluation Classifiers Proposed Classifier using Bootstrapped Lexicon • If verb in shifter lexicon ⇒ Shifted Saarland University Marc Schulder 20
Extrinsic Evaluation Classifiers Proposed Classifier using Bootstrapped Lexicon • If verb in shifter lexicon ⇒ Shifted Baselines • Majority Label: All sentences are not shifted. Saarland University Marc Schulder 20
Extrinsic Evaluation Classifiers Proposed Classifier using Bootstrapped Lexicon • If verb in shifter lexicon ⇒ Shifted Baselines • Majority Label: All sentences are not shifted. • Recursive Neural Tensor Network (Socher et al., 2013) • Compositional sentence-level polarity classifier. • Provides polarities for each constituency tree node. • No explicit knowledge of shifters. Saarland University Marc Schulder 20
Extrinsic Evaluation Results 1.00 0.8 0.75 Macro F1 0.50 0.51 0.44 0.25 Majority RNTN LEX Saarland University Marc Schulder 21
Conclusion • Produced a large lexicon of 980 shifters. Available at https://github.com/marcschulder/ijcnlp2017 Saarland University Marc Schulder 22
Conclusion • Produced a large lexicon of 980 shifters. Available at https://github.com/marcschulder/ijcnlp2017 • Bootstrapping reduces cost of high quality annotation. Saarland University Marc Schulder 22
Recommend
More recommend