SemEval-2013 Task 2: Sentiment Analysis in Twitter Preslav Nakov - PowerPoint PPT Presentation

SemEval-2013 Task 2: Sentiment Analysis in Twitter Preslav Nakov Sara Rosenthal Zornitsa Kozareva Veselin Stoyanov Alan Ritter Theresa Wilson

Task 2 - Overview Sentiment Analysis Social Media • Understanding how opinions • Short messages • Informal language and sentiments are expressed in language • Creative spelling, punctuation, • Extracting opinions and words, and word use • Genre-specific terminology sentiments from human (#hashtags) and discourse (RT) language data Task Goal : Promote sentiment analysis research in Social Media  SemEval Tweet Corpus • Publically available (within Twitter TOS) • Phrase and message-level sentiment 1 From NUS SMS • Tweets and SMS 1 for evaluating generalizability Corpus (Chen and Kan, 2012)

Task Description Two subtasks: A. Phrase-level sentiment B. Message-level sentiment Classify as positive, negative, neutral/objective : – Words and phrases identified as subjective [Subtask A] – Messages (tweets/SMS) [Subtask B]

Data Collection Extract NEs (Ritter et al., 2011) Identify Popular Topics (Ritter et al., 2012) • NEs frequently associated with specific dates Extract Messages Mentioning Topics Filter Messages for Sentiment • Keep if ≥ pos/neg term from SentiWordNet (>0.3) Data for Annotation

Annotation Task Mechanical Turk HIT (3-5 workers per tweet) Instructions: Subjective words are ones which convey an opinion. Given a sentence, identify whether it is objective, positive, negative, or neutral. Then, identify each subjective word or phrase in the context of the sentence and mark the position of its start and end in the text boxes below. The number above each word indicates its position. The word/phrase will be generated in the adjacent textbox so that you can confirm that you chose the correct range. Choose the polarity of the word or phrase by selecting one of the radio buttons: positive, negative, or neutral. If a sentence is not subjective please select the checkbox indicating that ”There are no subjective words/phrases”. Please read the examples and invalid responses before beginning if this is your first time answering this hit.

Data Annotations Final annotations determined using majority vote I would love to watch Vampire Diaries tonight :) and Worker 1 some Heroes! Great combination I would love to watch Vampire Diaries tonight :) and Worker 2 some Heroes! Great combination I would love to watch Vampire Diaries tonight :) and Worker 3 some Heroes! Great combination I would love to watch Vampire Diaries tonight :) and Worker 4 some Heroes! Great combination I would love to watch Vampire Diaries tonight :) and Worker 5 some Heroes! Great combination I would love to watch Vampire Diaries tonight :) and Intersection some Heroes! Great combination

Distribution of Classes Subtask A Train Dev Test-TWEET Test-SMS Positive 5,895 648 2,734 (60%) 1,071 (46%) Negative 3,131 430 1,541 (33%) 1,104 (47%) Neutral 471 57 160 (3%) 159 (7%) Total 4,635 2,334 Subtask B Train Dev Test-TWEET Test-SMS Positive 3,662 575 1,573 (41%) 492 (23%) Negative 1,466 340 601 (16%) 394 (19%) Neutral/O 4,600 739 1,640 (43%) 1,208 (58%) bjective Total 3,814 2,094

Options for Participation 1. Subtask A and/or Subtask B 2. Constrained* and/or Unconstrained • Refers to data used for training 3. Tweets and/or SMS * Used for ranking

Participation Unconstrained (15) Constrained (21) Unconstrained (7) Constrained (36) Submissions (148)

Scoring • Recall, Precision, F-measure calculated for pos/neg classes for each run submitted Score = Ave(Pos F, Neg F)

Subtask A (words/phrases) Results SMS Tweets 100 100 90 90 80 80 70 70 60 60 50 50 40 40 30 30 Top Systems Top Systems 1. NRC-Canada 1. GU-MLT-LT 20 20 2. AVAYA 2. NRC-Canada 10 10 3. Bounce 3. AVAYA 0 0 Constrained Unconstrained Constrained Unconstrained

Subtask B (messages) Results SMS Tweets 80 80 70 70 60 60 50 50 40 40 30 30 Top Systems Top Systems 20 20 1. NRC-Canada 1. NRC-Canada 2. GU-MLT-LT 2. GU-MLT-LT 10 10 3. teragram 3. KLUE 0 0 Constrained Unconstrained Constrained Unconstrained

Observations Majority of systems were supervised and constrained • 5 semi-supervised, 1 fully unsupervised Systems that made best use of unconstrained option: • Subtask A : senti.ue-en • Subtask B Tweet: AVAYA, bwbaugh, ECNUCS, OPTIMA, sinai • Subtask B SMS: bwbaugh, nlp.cs.aueb.gr, OPTIMA, SZTE-NLP Most popular classifiers • SVM, MaxEnt , linear classifier, Naive Bayes

Thank You! Special thanks to co-organizers: Preslav Nakov, Sara Rosenthal, Alan Ritter Zonitsa Kozareva, Veselin Stoyanov SemEval Tweet Corpus • Funding for annotations provided by: • JHU Human Language Technology Center of Excellence • ODNI IARPA

SemEval-2013 Task 2: Sentiment Analysis in Twitter Preslav Nakov - PowerPoint PPT Presentation

SemEval-2013 Task 2: Sentiment Analysis in Twitter Preslav Nakov Sara Rosenthal Zornitsa Kozareva Veselin Stoyanov Alan Ritter Theresa Wilson Task 2 - Overview Sentiment Analysis Social Media Understanding how opinions Short

SemEval 2012 STS task http://www.cs.york.ac.uk/semeval-2012/task6/ Eneko Agirre Daniel Cer

SemEval-2013 Task 4: Free Paraphrases of Noun Compounds Iris Hendrickx, Zornitsa Kozareva,

SemEval-2013 Task 13: Word Sense Induction for Graded and Non-Graded Senses David Jurgens

7/8/2013 1 7/8/2013 2 7/8/2013 3 7/8/2013 4 7/8/2013 5 7/8/2013 6 7/8/2013 7 7/8/2013

SemEval 2019 Task 1: Cross-lingual Semantic Parsing with UCCA Daniel Hershcovich, Leshem

LIMSI-COT at SemEval-2016 Task 12: Temporal relation identification using a pipeline of

IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Question Answering and

SemEval-2019 Task 4: Hyperpartisan News Detection Johannes Maria Rishabh Emmanuel Payam David

Bond Task Force Draft Bond Task Force Recommendations Tuesday, February 27 , 2018 Bond Task

Task 1d: River basin management Task leader: LNEC; Involved partners EU: ISPRA, DTU, EWA Task

p wered Yva productivity AI Task Manager @nerdybff Task Management Task Management Todoist

Revised: March 4, 2013 3/19/2013 3/19/2013 2 3/19/2013 3 3/19/2013 4 3/19/2013 5

CGO Task Presentation CGO Task Presentation CGO Task Presentation Effective Task Presentation

WASC 2019 Findings Presentation to stakeholders February 2019 2 3 Process Task 1 Task 2

IEA Bioenergy IEA BIOENERGY Task 42 Biorefinery 5 th Task Meeting Dublin, Ireland, 25/26 March

Telematics Task Force Telematics Task Force Charlie Gorman Charlie Gorman Talking Points

formal futures ubiquity and physicality From Formalism to Physicality, Alan Dix,

How to adapt your marketing during a crisis WARNING! The content of this talk is more marketing

Stephen'Hamill ' Associate'Director,'Communica6ons'and'Advocacy ' World'Lung'Founda6on '

Cache Coherency Cache coherent processors most current value for an address is the last

Methods & Expressions Venkat Subramaniam svenkat@cs.uh.edu 1 Methods Take list of

Turk workflow and tools . mitcho + Hadas mitcho,hkotek@mit.edu Hackl Lab Turkshop March 2013

Network/Internet Threats/Attacks 1 Internet Structure backbone ISP local network Internet

Building Java Programs Chapter 3 Lecture 3-1: Parameters and Scope reading: 3.1 2 Scope

Sambuz

Useful Links

Newsletter

Mail Us

SemEval-2013 Task 2: Sentiment Analysis in Twitter Preslav Nakov - PowerPoint PPT Presentation

SemEval-2013 Task 2: Sentiment Analysis in Twitter Preslav Nakov Sara Rosenthal Zornitsa Kozareva Veselin Stoyanov Alan Ritter Theresa Wilson Task 2 - Overview Sentiment Analysis Social Media Understanding how opinions Short

SemEval 2012 STS task http://www.cs.york.ac.uk/semeval-2012/task6/ Eneko Agirre Daniel Cer

SemEval-2013 Task 4: Free Paraphrases of Noun Compounds Iris Hendrickx, Zornitsa Kozareva,

SemEval-2013 Task 13: Word Sense Induction for Graded and Non-Graded Senses David Jurgens

7/8/2013 1 7/8/2013 2 7/8/2013 3 7/8/2013 4 7/8/2013 5 7/8/2013 6 7/8/2013 7 7/8/2013

SemEval 2019 Task 1: Cross-lingual Semantic Parsing with UCCA Daniel Hershcovich, Leshem

LIMSI-COT at SemEval-2016 Task 12: Temporal relation identification using a pipeline of

IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Question Answering and

SemEval-2019 Task 4: Hyperpartisan News Detection Johannes Maria Rishabh Emmanuel Payam David

Bond Task Force Draft Bond Task Force Recommendations Tuesday, February 27 , 2018 Bond Task

Task 1d: River basin management Task leader: LNEC; Involved partners EU: ISPRA, DTU, EWA Task

p wered Yva productivity AI Task Manager @nerdybff Task Management Task Management Todoist

Revised: March 4, 2013 3/19/2013 3/19/2013 2 3/19/2013 3 3/19/2013 4 3/19/2013 5

CGO Task Presentation CGO Task Presentation CGO Task Presentation Effective Task Presentation

WASC 2019 Findings Presentation to stakeholders February 2019 2 3 Process Task 1 Task 2

IEA Bioenergy IEA BIOENERGY Task 42 Biorefinery 5 th Task Meeting Dublin, Ireland, 25/26 March

Telematics Task Force Telematics Task Force Charlie Gorman Charlie Gorman Talking Points

formal futures ubiquity and physicality From Formalism to Physicality, Alan Dix,

How to adapt your marketing during a crisis WARNING! The content of this talk is more marketing

Stephen'Hamill ' Associate'Director,'Communica6ons'and'Advocacy ' World'Lung'Founda6on '

Cache Coherency Cache coherent processors most current value for an address is the last

Methods &amp; Expressions Venkat Subramaniam svenkat@cs.uh.edu 1 Methods Take list of

Turk workflow and tools . mitcho + Hadas mitcho,hkotek@mit.edu Hackl Lab Turkshop March 2013

Network/Internet Threats/Attacks 1 Internet Structure backbone ISP local network Internet

Building Java Programs Chapter 3 Lecture 3-1: Parameters and Scope reading: 3.1 2 Scope

Sambuz

Useful Links

Newsletter

Mail Us

Methods & Expressions Venkat Subramaniam svenkat@cs.uh.edu 1 Methods Take list of