
Unsupervised Knowledge-Free Word Sense Disambiguation (PowerPoint presentation by Dr. Alexander Panchenko)



  1. Unsupervised Knowledge-Free Word Sense Disambiguation
Dr. Alexander Panchenko, University of Hamburg, Language Technology Group
23 February 2017

  2. Overview
◮ Introduction
◮ Dense Sense Representations
◮ Sparse Sense Representations
◮ Future Work

  3. About me
◮ 2008: Engineering degree (M.S.) in Computer Science, Moscow State Technical University
◮ 2009: Research intern, Xerox Research Centre Europe
◮ 2013: PhD in Natural Language Processing, University of Louvain
◮ 2013: Research engineer at a start-up related to social network analysis (Digsolab)
◮ 2015: Postdoc at Technical University of Darmstadt
◮ 2017: Postdoc at University of Hamburg
Topics: computational lexical semantics (semantic similarity/relatedness, semantic relations, sense induction, sense disambiguation), NLP for social network analysis, text categorization
Papers, presentations, datasets: http://panchenko.me

  4. Publications Related to the Talk
◮ Pelevina M., Arefiev N., Biemann C., Panchenko A. (2016). Making Sense of Word Embeddings. In Proceedings of the 1st Workshop on Representation Learning for NLP, ACL 2016, Berlin, Germany. Best Paper Award
◮ Panchenko A., Simon J., Riedl M., Biemann C. (2016). Noun Sense Induction and Disambiguation using Graph-Based Distributional Semantics. In Proceedings of KONVENS 2016, Bochum, Germany
◮ Panchenko A., Ruppert E., Faralli S., Ponzetto S. P., Biemann C. (2017). Unsupervised Does Not Mean Uninterpretable: The Case for Word Sense Induction and Disambiguation. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017), Valencia, Spain

  5-7. Motivation for Unsupervised Knowledge-Free WSD (built up incrementally on slides 5-7)
◮ A word sense disambiguation (WSD) system:
  ◮ Input: a word and its context.
  ◮ Output: a sense of this word.
◮ Existing approaches (Navigli, 2009):
  ◮ Knowledge-based approaches rely on hand-crafted resources, such as WordNet.
  ◮ Supervised approaches learn from hand-labeled training data, such as SemCor.
◮ Problem 1: hand-crafted lexical resources and training data are expensive to create, often inconsistent, and domain-dependent.
◮ Problem 2: these methods assume a fixed sense inventory, but:
  ◮ senses emerge and disappear over time;
  ◮ different applications require different sense granularities.
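The input/output contract of a WSD system described above can be illustrated with a toy disambiguator. This is a hedged sketch, not the deck's method: the sense inventory, the sense names, and the Lesk-style gloss-overlap heuristic are all made up for illustration.

```python
def lookup_based_wsd(word, context):
    """Toy WSD system with the interface above: takes a target word
    and its context words, returns a sense ID. The inventory below
    is hand-made for illustration only (Lesk-style gloss overlap)."""
    inventory = {
        "table": {
            "table#furniture": {"chair", "dinner", "wooden", "sit"},
            "table#data": {"row", "column", "data", "figure"},
        }
    }
    senses = inventory.get(word)
    if not senses:
        return word + "#unknown"
    # choose the sense whose description words overlap the context most
    return max(senses, key=lambda s: len(senses[s] & set(context)))

print(lookup_based_wsd("table", ["the", "data", "in", "row", "three"]))
```

Knowledge-based systems of this kind depend on the hand-crafted inventory, which is exactly the dependency the unsupervised approach in this talk removes.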

  8. Motivation for Unsupervised Knowledge-Free WSD (cont.)
◮ An alternative route is the unsupervised knowledge-free approach:
  ◮ learn an interpretable sense inventory;
  ◮ learn a disambiguation model.

  9. Dense Sense Representations for WSD
◮ Pelevina M., Arefiev N., Biemann C., Panchenko A. (2016). Making Sense of Word Embeddings. In Proceedings of the 1st Workshop on Representation Learning for NLP, ACL 2016, Berlin, Germany.
◮ An approach to learn word sense embeddings.

  10-11. Overview of the Contribution (built up incrementally on slides 10-11)
Prior methods:
◮ Induce an inventory by clustering word instances (Li and Jurafsky, 2015)
◮ Use existing inventories (Rothe and Schütze, 2015)
Our method:
◮ Input: word embeddings
◮ Output: word sense embeddings
◮ Word sense induction by clustering of word ego-networks
◮ Word sense disambiguation based on the induced sense representations
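The word ego-networks mentioned above can be sketched as follows: the nodes are a word's nearest neighbours in embedding space, and edges connect neighbour pairs that are themselves similar; the ego word is excluded so the graph can fall apart into sense clusters. The function name, the neighbourhood size, and the similarity threshold below are illustrative assumptions, not the paper's settings.

```python
import numpy as np

def ego_network(word, vectors, n=5, sim_threshold=0.4):
    """Build the ego-network of `word`: nodes are its n nearest
    neighbours by cosine similarity; edges connect neighbour pairs
    whose similarity exceeds the threshold. The ego word itself is
    left out so that the graph can split into sense communities.
    (Sketch; parameter values are illustrative.)"""
    def cos(a, b):
        return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))
    neighbours = sorted(
        (w for w in vectors if w != word),
        key=lambda w: cos(vectors[word], vectors[w]),
        reverse=True,
    )[:n]
    edges = {
        (u, v)
        for i, u in enumerate(neighbours)
        for v in neighbours[i + 1:]
        if cos(vectors[u], vectors[v]) > sim_threshold
    }
    return neighbours, edges
```

On toy 2-D vectors where "chair"/"desk" point one way and "row"/"column" another, the ego-network of "table" splits into two connected components, one per sense.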

  12. Learning Word Sense Embeddings
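Once sense clusters are induced, each cluster can be pooled into a single sense embedding. A minimal sketch, assuming plain averaging of the members' word vectors (the paper also considers similarity-weighted pooling; the function name is hypothetical):

```python
import numpy as np

def sense_vector(cluster_words, vectors):
    """Pool one induced sense cluster into a sense embedding by
    averaging its members' word vectors (unweighted mean; a
    similarity-weighted average is a common alternative)."""
    return np.mean([vectors[w] for w in cluster_words], axis=0)
```

The resulting sense vectors live in the same space as the word vectors, which is what makes the nearest-neighbour comparison on the next slides possible.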

  13. Word Sense Induction: Ego-Network Clustering
◮ Graph clustering using the Chinese Whispers algorithm (Biemann, 2006).
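Chinese Whispers (Biemann, 2006) can be sketched in a few lines: every node starts in its own class, then nodes repeatedly adopt the class that is most frequent among their neighbours. This is a minimal unweighted variant for illustration; the real algorithm uses edge weights and randomized tie-breaking.

```python
import random

def chinese_whispers(nodes, edges, iterations=20, seed=0):
    """Minimal Chinese Whispers sketch (unweighted edges):
    start with one class per node, then repeatedly let each node,
    in random order, adopt the most frequent class among its
    neighbours. Labels never cross between components."""
    rng = random.Random(seed)
    adj = {n: set() for n in nodes}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    label = {n: n for n in nodes}  # each node is its own class
    order = list(nodes)
    for _ in range(iterations):
        rng.shuffle(order)
        for n in order:
            if adj[n]:
                counts = {}
                for m in adj[n]:
                    counts[label[m]] = counts.get(label[m], 0) + 1
                label[n] = max(counts, key=counts.get)
    return label
```

Run on an ego-network, each resulting class is one induced word sense: two disconnected triangles, for instance, end up with two distinct labels.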

  14. Neighbours of Word and Sense Vectors

Vector    Nearest neighbours
table     tray, bottom, diagram, bucket, brackets, stack, basket, list, parenthesis, cup, trays, pile, playfield, bracket, pot, drop-down, cue, plate
table#0   leftmost#0, column#1, randomly#0, tableau#1, top-left#0, indent#1, bracket#3, pointer#0, footer#1, cursor#1, diagram#0, grid#0
table#1   pile#1, stool#1, tray#0, basket#0, bowl#1, bucket#0, box#0, cage#0, saucer#3, mirror#1, birdcage#0, hole#0, pan#1, lid#0

◮ Neighbours of the word "table" and of its senses produced by our method.
◮ The neighbours of the initial word vector belong to both senses.
◮ The neighbours of the sense vectors are sense-specific.

  15. Word Sense Disambiguation
1. Context extraction: use context words around the target word.
2. Context filtering: based on each context word's relevance for disambiguation.
3. Sense choice: maximise similarity between the context vector and the sense vectors.
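The three steps above can be sketched as follows. This is a simplified, hypothetical implementation: the filtering step here just keeps in-vocabulary words (a stand-in for the relevance-based filtering on the slide), the context vector is a plain mean, and all names and the window size are assumptions.

```python
import numpy as np

def disambiguate(target, context, sense_vectors, word_vectors, window=5):
    """Sketch of the three-step WSD procedure above:
    1) extract context words in a window around the target,
    2) filter to words with a known vector (simplified filtering),
    3) pick the sense vector most similar (cosine) to the mean
       context vector."""
    def cos(a, b):
        return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))
    idx = context.index(target)
    window_words = context[max(0, idx - window):idx] + context[idx + 1:idx + 1 + window]
    ctx = [w for w in window_words if w in word_vectors]  # step 2 (simplified)
    ctx_vec = np.mean([word_vectors[w] for w in ctx], axis=0)
    senses = sense_vectors[target]
    return max(senses, key=lambda s: cos(ctx_vec, senses[s]))
```

With toy vectors where the context mentions "data" and "row", the data-like sense of "table" wins, matching the neighbour table on the previous slide.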

  16. Word Sense Disambiguation: Example

  17. Evaluation on SemEval-2013 Task 13: Comparison to the State of the Art

Model                        Jacc.   Tau     WNDCG   F.NMI   F.B-Cubed
AI-KU (add1000)              0.176   0.609   0.205   0.033   0.317
AI-KU                        0.176   0.619   0.393   0.066   0.382
AI-KU (remove5-add1000)      0.228   0.654   0.330   0.040   0.463
Unimelb (5p)                 0.198   0.623   0.374   0.056   0.475
Unimelb (50k)                0.198   0.633   0.384   0.060   0.494
UoS (#WN senses)             0.171   0.600   0.298   0.046   0.186
UoS (top-3)                  0.220   0.637   0.370   0.044   0.451
La Sapienza (1)              0.131   0.544   0.332   –       –
La Sapienza (2)              0.131   0.535   0.394   –       –
AdaGram, α = 0.05, 100 dim   0.274   0.644   0.318   0.058   0.470
w2v                          0.197   0.615   0.291   0.011   0.615
w2v (nouns)                  0.179   0.626   0.304   0.011   0.623
JBT                          0.205   0.624   0.291   0.017   0.598
JBT (nouns)                  0.198   0.643   0.310   0.031   0.595
TWSI (nouns)                 0.215   0.651   0.318   0.030   0.573
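One of the columns above, B-Cubed, can be sketched for the simpler crisp (hard-clustering) case: per item, precision and recall compare the item's induced cluster with its gold cluster, and the scores are averaged. Note this is a hedged simplification; SemEval-2013 Task 13 actually uses a fuzzy generalisation of B-Cubed, and the function below is illustrative only.

```python
def bcubed(pred, gold):
    """Crisp B-Cubed precision/recall/F1 for comparing an induced
    clustering with a gold one (dicts: item -> cluster label).
    Simplified sketch; the SemEval task uses a fuzzy variant."""
    items = list(pred)
    p = r = 0.0
    for i in items:
        cluster = {j for j in items if pred[j] == pred[i]}
        gold_cl = {j for j in items if gold[j] == gold[i]}
        correct = len(cluster & gold_cl)
        p += correct / len(cluster)
        r += correct / len(gold_cl)
    n = len(items)
    p, r = p / n, r / n
    return p, r, 2 * p * r / (p + r)
```

A perfect clustering scores 1.0 on all three values; splitting or merging gold clusters lowers recall or precision respectively.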
