

  1. Ibrk at the NTCIR-14 QA Lab-PoliInfo Classification Task • Minoru Sasaki and Tetsuya Nogami • Ibaraki University

  2. Introduction • Stance Classification: automatically identify a speaker's position on a specific target or topic from text. • The speaker's position is one of three labels: • Support (favour/favor, agree, pro) • Against (oppose, disagree, con) • Neutral (none, unrelated, neither) • For example, we may want to know whether former president Barack Obama is in favor of stricter gun laws from his speeches.

  3. Introduction • Previous research has demonstrated many approaches to stance classification. • (Rajadesingan 2014) used semi-supervised learning on online forums. • (Bamman 2015) used an unsupervised method. • (Ebrahimi 2016) used supervised probabilistic classification on tweets.

  4. Stance Classification Using Machine Learning • In a supervised approach, this task is difficult due to imbalanced class sizes. • Stance classification usually requires a large amount of training data to obtain many sentiment expressions. • We propose to use a sentiment dictionary for stance classification: each word is labeled with the polarity information recorded in the dictionary.

  5. Purpose of This Study • We propose a stance classification system using a sentiment dictionary. • To evaluate the effectiveness of our system, we conduct experiments comparing it with the results of a baseline method using a Support Vector Machine (SVM).

  6. System Description • [System diagram] The input sentence is fed to three components: a stance classifier that matches words against the sentiment dictionary and counts positive and negative labels to output the stance, a relevance classifier that outputs the relevance label, and a fact-checkability classifier that outputs the fact-checkability label.

  7. Stance Classifier (1/2) • If an extracted word exists in the sentiment dictionary, the polarity of the word is looked up to assign a sentiment polarity label (positive or negative). • The system counts the number of positive and negative labels in the sentence.

  8. Stance Classifier (2/2) • If the number of positive labels is greater than the number of negative labels, the system assigns the “support” label to the sentence; otherwise the system assigns the “against” label.
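A minimal Python sketch of this counting rule is shown below. It is an illustration only, not the authors' implementation: the toy dictionary and the whitespace split stand in for the Japanese Sentiment Polarity Dictionary and for Japanese morphological analysis.

    # Sketch of the dictionary-based stance rule (illustrative, not the authors' code).
    # `polarity` is assumed to map a word to "positive" or "negative".
    def classify_stance(sentence, polarity):
        words = sentence.split()   # stand-in for morphological analysis of Japanese text
        pos = sum(1 for w in words if polarity.get(w) == "positive")
        neg = sum(1 for w in words if polarity.get(w) == "negative")
        # More positive labels than negative labels -> "support", otherwise "against".
        return "support" if pos > neg else "against"

    # Toy dictionary for illustration only.
    toy_dict = {"improve": "positive", "promote": "positive", "harm": "negative"}
    print(classify_stance("the plan will improve tourism", toy_dict))   # -> support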

  9. Relevance Classifier and Fact-Checkability Classifier • We extract nouns, verbs and adjectives from each input sentence in the training data. • Each sentence is represented as a feature vector by calculating the frequencies of these features. • We construct two classifiers with Support Vector Machines (SVM) from the labeled feature vectors. • Both classifiers are used to predict labels.
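A rough scikit-learn sketch of these two classifiers is given below, as an illustration under assumptions rather than the authors' actual configuration: the sentences are assumed to be already reduced to space-joined content words (nouns, verbs, adjectives) by a Japanese morphological analyzer, and LinearSVC with word-frequency features stands in for whatever SVM setup was actually used.

    # Sketch of the relevance classifier; the fact-checkability classifier is trained the same way.
    from sklearn.feature_extraction.text import CountVectorizer
    from sklearn.svm import LinearSVC

    # Assumed input: sentences already segmented into content words and space-joined.
    train_texts = ["カジノ 誘致 進める", "議会 日程 変更 する"]   # toy examples
    relevance_labels = ["relevant", "not_relevant"]                # toy labels

    vectorizer = CountVectorizer()                # word-frequency feature vectors
    X_train = vectorizer.fit_transform(train_texts)
    relevance_clf = LinearSVC().fit(X_train, relevance_labels)

    # A second SVM trained on the same feature vectors with fact-checkability labels
    # would predict the fact-checkability label.
    X_test = vectorizer.transform(["カジノ 誘致 反対"])
    print(relevance_clf.predict(X_test))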

  10. Experiments • NTCIR-14 QA Lab-PoliInfo Classification Task Dataset • 14 topics • about 30,000 sentences in the training data • 3,412 sentences in the test data • Sentiment Dictionary • Japanese Sentiment Polarity Dictionary, created by Tohoku University • We use this dictionary to obtain the sentiment polarity of each word.
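For completeness, loading such a dictionary into a word-to-polarity lookup table might look like the sketch below. The file name and the tab-separated word/label layout are assumptions for illustration; the actual distribution files of the Japanese Sentiment Polarity Dictionary have their own formats and would need matching parsing.

    # Sketch of loading a word -> polarity lookup table (assumed file layout).
    def load_polarity_dictionary(path):
        polarity = {}
        with open(path, encoding="utf-8") as f:
            for line in f:
                fields = line.rstrip("\n").split("\t")
                if len(fields) < 2:
                    continue
                word, label = fields[0], fields[1]   # assumed: word <TAB> p/n label
                if label == "p":
                    polarity[word] = "positive"
                elif label == "n":
                    polarity[word] = "negative"
        return polarity

    # polarity = load_polarity_dictionary("pn_dictionary.tsv")   # hypothetical file name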

  11. Experimental Results (1/6)
  • Precision for the topic “Integrated Resort”:
      Method            Support   Against   Neutral
      Our System         7.19%    15.63%    92.10%
      Baseline System     0%        0%      90.73%
  • Precision, recall and F-measure for this topic:
      Method            Precision   Recall   F-measure
      Our System         77.80%     77.80%    77.80%
      Baseline System    90.70%     90.70%    90.73%

  12. Experimental Results (2/6)
  • Precision for the topic “Integrated Resort”:
      Method            Support   Against   Neutral
      Our System         7.19%    15.63%    92.10%
      Baseline System     0%        0%      90.73%
  • The proposed system obtained higher precision than the baseline system using SVM.
  • These results show that the sentiment dictionary is effective for stance classification.
  • When we use the baseline system, all samples are classified into “neutral”.

  13. Experimental Results (3/6)
  • Precision, recall and F-measure on the test data for this topic:
      Method            Precision   Recall   F-measure
      Our System         77.80%     77.80%    77.80%
      Baseline System    90.70%     90.70%    90.73%
  • All scores decreased by about 13% in comparison to the baseline system, because there are many neutral samples in the training and test data.

  14. Experimental Results (4/6)
  • Results for the “relevance” classification label:
      Method            Relevance              Not Relevance
                        Precision   Recall     Precision   Recall
      Our System         86.50%      100%        NaN         0%
  • All data were classified as relevant to the topic.
  • It is difficult to detect sentences that are not related to the topic by using SVM.

  15. Experimental Results (5/6)
  • Results for the “fact-checkability” classification label:
      Method            fact-checkable         not fact-checkable
                        Precision   Recall     Precision   Recall
      Our System          NaN         0%         64.6%      100%
  • All data were classified as “not fact-checkable”.
  • It is difficult to detect sentences that can be fact-checked by using SVM.

  16. Experimental Results (6/6)
  • Results for each class label using our system:
      Label                 Precision   Recall   F-measure
      fact-check-support      6.3%       17.8%     9.3%
      fact-check-against      4.5%       20.2%     7.4%
      class-other            93.4%       77.0%    84.4%
  • Only a small number of the test samples are classified correctly.
  • In the future, we will improve our system to classify “class-other” samples effectively.

  17. Conclusions • We proposed a new method for stance classification using a sentiment dictionary. • The effectiveness of the proposed method was evaluated on the NTCIR-14 QA Lab-PoliInfo classification task formal run dataset. • The experimental results show that the proposed method obtains higher precision than the baseline method using SVM. • However, the precision of our system decreased by about 13% in comparison to the baseline system for the “neutral” samples.
