SemEval 2014 Task-3 Cross-Level Semantic Similarity MultiJEDI ERC 259234
Semantic Similarity
Semantic Similarity: mostly focused on similar types of lexical items
Semantic Similarity: what if we have different types of inputs?
CLSS: Cross-Level Semantic Similarity, a new type of similarity task
CLSS: Comparison Types: Paragraph to Sentence, Sentence to Phrase, Phrase to Word, Word to Sense
Task Data: 4000 pairs in total, split into training and test sets
Task Data: a wide range of domains and text styles
Word-to-Sense pairs
Rating Scale
Crafting an idealized similarity distribution (figure: ratings 0-4 assigned between the smaller side and the larger side of each pair)
Test and Training data IAA: Krippendorff's α per comparison type (Paragraph-Sentence, Sentence-Phrase, Phrase-Word, Word-Sense), for Training (all), Training (unadjudicated), Test (all), and Test (unadjudicated)
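The agreement statistic named on the slide, Krippendorff's α, can be sketched for the simple balanced case. This is a minimal illustration assuming interval-scale ratings and no missing values (the function name and input shape are my own; the full coefficient also handles missing data and other distance metrics):

```python
from itertools import combinations

def krippendorff_alpha_interval(ratings):
    """Approximate Krippendorff's alpha for interval data.

    ratings: list of per-item lists of rater scores (>= 2 raters per item,
    no missing values). alpha = 1 - D_o / D_e, where D_o is the mean squared
    difference between ratings of the same item and D_e is the mean squared
    difference between all ratings regardless of item.
    """
    # Observed disagreement: squared differences within each item
    within = [(a - b) ** 2
              for item in ratings
              for a, b in combinations(item, 2)]
    # Expected disagreement: squared differences over all rating values
    values = [v for item in ratings for v in item]
    across = [(a - b) ** 2 for a, b in combinations(values, 2)]
    d_o = sum(within) / len(within)
    d_e = sum(across) / len(across)
    return 1.0 - d_o / d_e
```

Perfect agreement yields α = 1, while systematic disagreement drives α toward (and below) 0, which is why the slide reports α separately for adjudicated and unadjudicated pairs.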
The annotation procedure produces a balanced rating distribution
Experimental Setup
Baselines: "The quick brown fox" • "The brown fox was quick"; "The quick brown fox" • "The brown foxes were quick"
Evaluation Measure:
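The string-overlap baselines on this slide can be sketched in a few lines. Below is a minimal normalized longest-common-substring (LCS) similarity, assuming the score is the LCS length divided by the shorter text's length, scaled to the task's 0-4 range (the exact normalization used by the task baseline may differ):

```python
def longest_common_substring(a: str, b: str) -> int:
    """Length of the longest contiguous substring shared by a and b."""
    best = 0
    prev = [0] * (len(b) + 1)  # prev[j]: common suffix length at a[i-1], b[j-1]
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best

def lcs_similarity(a: str, b: str, scale: float = 4.0) -> float:
    """Map LCS length onto the task's 0-4 similarity scale."""
    if not a or not b:
        return 0.0
    return scale * longest_common_substring(a, b) / min(len(a), len(b))
```

On the slide's example pair, the shared span " brown fox" gives a mid-range score, matching the intuition that surface overlap captures some, but not all, of the semantic similarity.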
Number of participants Paragraph-Sentence Sentence-Phrase Phrase-Word Word-Sense
Top 5 Systems and Baselines (chart, scores 0-4, per comparison type: paragraph-sentence, sentence-phrase, phrase-word, word-sense): Gold, LCS Baseline, GST Baseline, SemantiKLUE run1, UNAL-NLP run2, ECNU run1, SimCompass run1, Meerkat Mafia pw*
Where do the baselines stand? (chart, per comparison type: paragraph-sentence, sentence-phrase, phrase-word, word-sense): LCS Baseline, GST Baseline, SemantiKLUE run1, UNAL-NLP run2, ECNU run1, SimCompass run1, Meerkat Mafia pw*
Correlation per genre: paragraph-to-sentence
Correlation per genre: phrase-to-word
What makes the task difficult?
Handling OOV words and novel usages
Dealing with social media text
CLSS: Cross-Level Semantic Similarity
Similarity across different types of lexical items
High-quality dataset: 4000 pairs over four comparison types
38 systems from 19 teams
Thank you! MultiJEDI ERC 259234 David Jurgens Mohammad Taher Pilehvar Roberto Navigli