NTCIR-7 Almost-Unsupervised Cross-Language Opinion Analysis NLCL - PowerPoint PPT Presentation

NTCIR-7 Almost-Unsupervised Cross-Language Opinion Analysis NLCL group Taras Zagibalov* T.Zagibalov@sussex.ac.uk John Carroll J.A.Carroll@sussex.ac.uk Department of Informatics University of Sussex * supported by the Ford Foundation International Fellowships Program.

Overview ● Introduction ● Tasks ● Our Approach ● Lexical Item Extraction ● Relevance Classification ● Subjectivity Classification ● Results ● Error Analysis and Conclusion 19/12/2008 NLCL 2

Introduction ● Our main focus is portability of natural language processing systems across languages ● Our basic approach is an almost unsupervised approach 19/12/2008 NLCL 3

Tasks ● Japanese ● English ● Simplified Chinese ● Traditional Chinese 19/12/2008 NLCL 4

Tasks ● Relevance Classification ● Subjectivity Classification ● Opinion Classification ● Target Detection ● Opinion Holder Detection 19/12/2008 NLCL 5

Our Approach ● Lexical Item Extraction ● Relevance Classification ● Subjectivity Classification 19/12/2008 NLCL 6

Lexical Item Extraction Lexical Item (LI) extraction problems: ● A problem of the word boundary detection in Chinese and Japanese. ● A problem of idioms / collocations 19/12/2008 NLCL 7

Lexical Item Extraction LI extraction technique used: ● Any sequence of characters that occurs at least three times is a candidate to be a LI ● If the frequency of a LI is the same as that of a shorter sub-unit then the latter is deleted. 19/12/2008 NLCL 8

Lexical Item Extraction LI extraction technique used: ● Any sequence of characters that occurs at least three times is a candidate to be a LI ● If the frequency of a LI is the same as that of a shorter sub-unit then the latter is deleted. LI candidate Frequency Length 31 4 √ 美国司法 31 3 X 美国司 519 1 √ 司 19/12/2008 NLCL 9

Relevance Classification ● All LI are ranked according to their frequency in each document ● LI frequency ranks are compared across all the documents ● LI with the biggest rank differences are selected as relevance indicators 19/12/2008 NLCL 10

Relevance Classification ● All LI are ranked according to their frequency in each document ● LI frequency ranks are compared across all the documents ● LI with the biggest rank differences are selected as relevance indicators LI Topic 1 rank Topic 2 rank Difference the 2 3 1 X netscape 0 10 10 √ √ law 24 6 18 19/12/2008 NLCL 11

Relevance Classification  Example:  Topic: 'What is the relationship between AOL and Netscape?' (N11)  Relevance indicators: america online, appliances, designed, dominant, link, maker, netscape, online, services, start-ups, sun, technological change, they have, windows operating 19/12/2008 NLCL 12

Subjectivity Classification ● For each LI we found immediate neighbours: 第五次缔约方大会的中国代表团 19/12/2008 NLCL 13

Subjectivity Classification ● For each LI we found immediate neighbours: 第五次缔约方大会的中国代表团中国 : 的 _0, 大会的 _0, 代表团 _1 19/12/2008 NLCL 14

Subjectivity Classification ● For each neighbour word we calculated chi- 2 ) score square ( χ ● LI with χ 2 > 3.84 were included into the list ● All such words were ranked according to their score ● Lists of every two headwords were compared to find how much of context words they shared 19/12/2008 NLCL 15

Subjectivity Classification ● Syntactic and Semantic relations separated: 跟中国经济的快速信心对美国经济的 Syntactic relations Semantic relations + 中国中国 + 美国跟中国 + 经济美国 + 经济经济 + 的 19/12/2008 NLCL 16

Subjectivity Classification Headwords 中国美国经济的 Context 经济经济中国经济 words Context 跟对的快速 words ● Good pairs: 中国 + 美国 ● Bad pairs: 中国 + 经济 ; 美国 + 经济 ; 经济 + 的 19/12/2008 NLCL 17

Subjectivity Classification ● Syntactic and Semantic relations separated: there are good years and bad years stable and good conditions Syntactic relations Semantic relations are + good good + bad good + years and + bad and + good 19/12/2008 NLCL 18

Subjectivity Classification Headwords good bad and years Context and and bad bad words Context years years good and words ● Good pairs: good + bad ● Bad pairs: and + bad; and + good; and + years; years + bad; good + years 19/12/2008 NLCL 19

Subjectivity Classification  Filtering the paired headwords:  Filter 1: Excluded all pairs with a too small association x  score (the score value less than -1.96σ)  Filter 2: Deleted all words that occurred in too many x  pairs ( LI that occurred in more than +1.96σ pairs); 19/12/2008 NLCL 20

Subjectivity Classification  RunID1:  Use manually filtered words: important, difficult, effective, popular, successful, easily, troubled, striking, best, bad, painful, strong, good Result: low recall 19/12/2008 NLCL 21

Subjectivity Classification  RunID1:  Use manually filtered words  RunID2:  RunID1 + (χ 2 >average)  RunID3:  RunID1 + (χ 2 >3.84) 19/12/2008 NLCL 22

Subjectivity Classification Classification algorithm: 1. If a sentence contains a relevance marker > RELEVANT 2. If a sentence is RELEVANT and contains a subjectivity marker > OPINIONATED 3. Otherwise > NA 19/12/2008 NLCL 23

Results: Trad. Chinese (lenient) 100 90 80 70 P -rel 60 R -rel F -rel 50 P -opin 40 R -opin F -opin 30 20 10 0 1 2 3 19/12/2008 NLCL 24

Results: Simp. Chinese (lenient) 100 90 80 70 P -rel 60 R -rel F -rel 50 P -opin 40 R -opin F -opin 30 20 10 0 1 2 3 19/12/2008 NLCL 25

Results: Japanese (lenient) 100 90 80 70 P -rel 60 R -rel F -rel 50 P -opin 40 R -opin F -opin 30 20 10 0 1 2 3 19/12/2008 NLCL 26

Results: English (lenient) 100 90 80 70 P -rel 60 R -rel F -rel 50 P -opin 40 R -opin F -opin 30 20 10 0 1 2 3 19/12/2008 NLCL 27

Results Best results (lenient ) Language Sub-task (RunID) Precision Recall F-value T. Chinese Relevance (3) 48.2 68.9 56.7 Opinion (3) 27.7 84.6 41.7 S. Chinese Relevance (3) 97.1 58.5 73.0 Opinion (3) 43.2 69.9 53.4 Japanese Relevance (3)* 47.7 63.8 54.6 Opinion (3)* 30.2 91.0 45.3 English Relevance (3) 87.5 41.1 55.6 Opinion (3) 47.6 74.2 58.0 *Note that the RunID3 results were obtained after the official submission. 19/12/2008 NLCL 28

Error Analysis ● Small amount of data ● More noise with higher recall ● Word segmentation for the Asian languages  发展中国家 : 发展中 + 国家 / 发展 + 中国 + 家 ● POS tagging 19/12/2008 NLCL 29

Conclusion ● Simple almost unsupervised cross-lingual system ● Satisfactory results for the Japanese and English tasks ● Rather poor performance for the Chinese (both) 19/12/2008 NLCL 30

Future Work ● Reduce noise ● Automate subjectivity marker selection ● Develop unsupervised language independent (quasi-)POS tagging technique 19/12/2008 NLCL 31

ありがとうございます謝謝谢谢 Thank you 19/12/2008 NLCL 32

Results Traditional Chinese (lenient ) Sub-task (RunID) Precision Recall F-value Relevance (1) 84.9 14.5 24.8 Opinion (1) 53.6 26.8 35.7 Relevance (2) 86.4 28.6 43.0 Opinion (2) 49.4 50.6 50.0 Relevance (3) 85.7 41.1 55.6 Opinion (3) 47.6 74.2 58.0 19/12/2008 NLCL 33

Results Simplified Chinese (lenient ) Sub-task (RunID) Precision Recall F-value Relevance (1) 96.3 32.6 48.7 Opinion (1) 44.3 39.9 42.0 Relevance (2) 97.5 28.0 43.5 Opinion (2) 48.2 36.9 41.8 Relevance (3) 97.1 58.5 73.0 Opinion (3) 43.2 69.9 53.4 19/12/2008 NLCL 34

Results Japanese (lenient ) Sub-task (RunID) Precision Recall F-value Relevance (1) 53.7 18.9 28.0 Opinion (1) 42.6 22.3 29.3 Relevance (2) - - - Opinion (2) - - - Relevance (3)* 47.7 63.8 54.6 Opinion (3)* 30.2 91.0 45.3 *Note that the RunID3 results were obtained after the official submission. 19/12/2008 NLCL 35

Results English (lenient ) Sub-task (RunID) Precision Recall F-value Relevance (1) 13.0 6.8 9.0 Opinion (1) 37.8 10.1 16.0 Relevance (2) 17.5 14.4 15.8 Opinion (2) 33.8 18.6 24.0 Relevance (3) 48.2 68.9 56.7 Opinion (3) 27.7 84.6 41.7 19/12/2008 NLCL 36

NTCIR-7 Almost-Unsupervised Cross-Language Opinion Analysis NLCL - PowerPoint PPT Presentation

NTCIR-7 Almost-Unsupervised Cross-Language Opinion Analysis NLCL group Taras Zagibalov* T.Zagibalov@sussex.ac.uk John Carroll J.A.Carroll@sussex.ac.uk Department of Informatics University of Sussex * supported by the Ford Foundation

NTCIR-9 Kick-Off Event ff 2010.10.05 : 13:30- English Session: 15:30-

Neuchatel at NTCIR-4 From CLEF to NTCIR Jacques Savoy University of Neuchatel, Switzerland

I t Introduction to NTCIR-7 d ti t NTCIR 7 N Noriko Kando k K d National Institute of

Overview of the Sixth NTCIR Workshop Noriko Kando National Institute of Informatics

NTCIR 2014 Slides - TUW-IMP at the NTCIR-11 Math-2 Presentation February 2015 CITATIONS READS

KSU Teams QA System for World History Exams at the NTCIR-13 QA Lab-3 Task Tasuku Kimura, Ryo

Kyoto-U: Syntactical EBMT System for NTCIR 7 Patent System for NTCIR-7 Patent Translation Task

Analysis of Similarity Measures between Short Text for the NTCIR-12 Short Text Conversation Task

IASL System for NTCIR-6 Korean-Chinese CLIR Yu-Chun Wang Cheng-Wei Lee Richard Tzong-Han Tsai

RMIT at the NTCIR-13 We Want Web Task Luke Gallagher with Joel Mackenzie, Rodger Benham,

NiCT/ATR in NTCIR-7 CCLQA Track Youzheng WU, Wenliang CHEN, Hideki KASHIOKA NiCT/ATR, Japan

SG01 at the NTCIR-13 STC-2 task Haizhou Zhao , Yi Du, Hangyu Li, Qiao Qian, Hao Zhou, Minlie

Overview of the 7 th NTCIR f Workshop N Noriko Kando k K d National Institute of

CUTKB at NTCIR-14 QALab-PoliInfo Task Toshiki Tomihira and Yohei Seki University of Tsukuba,

Overview of NTCIR-14 Makoto P. Kato Yiqun Liu University of Tsukuba Tsinghua University

POSTECH at NTCIR-4: CJKE Monolingual and Korean-related Cross-Language Retrieval Experiments

Conclusions No one approach will work for every setting Good to remember all approaches

CS 309: Autonomous Intelligent Robotics Instructor: Jivko Sinapov

Let ' s talk abo u t o u r feelings SE N TIME N T AN ALYSIS IN R Ted K w artler Data D u de

Cosmo: a concurrent separation logic for Multicore OCaml Glen Mvel , Jacques-Henri Jourdan,

Probabilistic Fitting Marcel Lthi, University of Basel Slides based on presentation by Sandro

Spring 2015 Cycle: Large Pragmatic Studies to Evaluate Patient-Centered Outcomes Applicant Town

PHPE 400 Individual and Group Decision Making Eric Pacuit University of Maryland 1 / 14

Case-oriented research strategies: Are intended to show how specific social processes develop