A Few Bad Votes Too Many? Towards Robust Ranking in Social Media
Jiang Bian (Georgia Tech), Yandong Liu (Emory University), Eugene Agichtein (Emory University), Hongyuan Zha (Georgia Tech)
Outline
• Background and Motivation
• Learning Ranking Functions in Social Media
• Vote Spam in Social Media
• Experiments on Community Question Answering
Online Social Media
• Users share information and express information needs through online social media
• User interactions: voting/rating the content
[Diagram: thumbs up or down votes to answers, news stories, videos, or comments]
Community Question Answering (CQA)
• Users can express specific information needs by posting questions, and get direct responses authored by other web users
• Both questions and answers are stored for future use
• Searchers can attempt to locate an answer to their question in the archive
• Existing answers can be voted on by any user who wants to share her evaluation of the answers
• The quality of the content in QA portals varies drastically [Agichtein et al. 2008]
• User votes can provide crucial indicators of the quality and reliability of the content
• User votes can help improve the quality of ranking CQA content [Bian et al. 2008]
Vote Spam
• Not all user votes are reliable
– Many “thumbs up” or “thumbs down” votes are generated without much thought
– In some cases, users intend to game the system by promoting specific answers for fun or profit
– We refer to these bad or fraudulent votes as vote spam
• How to handle vote spam for robust ranking of social media content?
– The Yahoo! team semi-automatically removes some of the more obvious vote spam after the fact
– This is not adequate:
• The amount and the patterns of vote spam evolve
• Vote spam methods can change significantly with the varying popularity of content and the specifics of the media and topic
• Challenge: a robust method to train a ranking function that remains resilient to evolving vote spam attacks
Outline
• Background and Motivation
• Learning Ranking Functions in Social Media
• Vote Spam in Social Media
• Experiments on Community Question Answering
Social Content and User Votes in Social Media
[Diagram: a topic thread consists of a topic posted by its creator and responses contributed by posters; voters give each response i a count of p_i thumbs-up and n_i thumbs-down votes]
Learning-based Approach
• Extract content (textual) features, community interaction features, and quality features from each <query, topic, response> tuple
• Extract preference data from user votes
• Train a ranking function with GBrank from the features, relevance labels, and preferences (see the sketch below)
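As a rough illustration of the preference-extraction step, here is a minimal Python sketch (not the authors' code) that turns the thumbs-up/down votes within a thread into pairwise preferences for a GBrank-style learner; the dict field names and the vote-margin threshold are illustrative assumptions.

```python
def vote_score(response):
    """Net vote count used to order responses within a thread."""
    return response["thumbs_up"] - response["thumbs_down"]

def preference_pairs(thread, min_margin=2):
    """Yield (preferred_features, other_features) pairs for one thread.

    A pair is emitted only when the vote margin is large enough,
    so near-ties do not produce noisy training preferences.
    """
    for a in thread["responses"]:
        for b in thread["responses"]:
            if vote_score(a) - vote_score(b) >= min_margin:
                yield (a["features"], b["features"])
```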
Outline
• Background and Motivation
• Learning Ranking Functions in Social Media
• Vote Spam in Social Media
• Experiments on Community Question Answering
Vote Spam Attack Models
• Two main types of vote spam
– Incorrect votes: the voter is not an expert
– Malicious votes: promote some specific responses
• General attack model:
1. Choose β% of topic threads to attack
2. Choose one response to promote in each chosen thread
3. Choose the number of attackers for each chosen thread based on N(µ, σ²)
• Thumbs-up vote spam: each attacker gives one thumbs-up vote to the chosen response
• Thumbs-down vote spam: each attacker gives one thumbs-up vote to the chosen response AND one thumbs-down vote to each of the other responses
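A minimal sketch of this parameterized attack model, assuming each thread is a dict holding a list of response dicts with vote counts (the data layout is an assumption, not the authors' code):

```python
import random

def inject_vote_spam(threads, beta=0.10, mu=3.0, sigma=1.0,
                     thumbs_down=False):
    """Inject simulated vote spam into a fraction of topic threads.

    beta        -- fraction of topic threads to attack
    mu, sigma   -- attackers per thread drawn from N(mu, sigma^2)
    thumbs_down -- if True, each attacker also gives one thumbs-down
                   vote to every competing response ("up&down" spam)
    """
    attacked = random.sample(threads, int(beta * len(threads)))
    for thread in attacked:
        target = random.choice(thread["responses"])  # response to promote
        n_attackers = max(0, round(random.gauss(mu, sigma)))
        target["thumbs_up"] += n_attackers
        if thumbs_down:
            for other in thread["responses"]:
                if other is not target:
                    other["thumbs_down"] += n_attackers
    return threads
```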
Outline
• Background and Motivation
• Learning Ranking Functions in Social Media
• Vote Spam in Social Media
• Experiments on Community Question Answering
Robust Ranking Method
• GBrank in QA retrieval [Bian et al. 2008]
– Promising performance
– User vote information contributes substantially to the high accuracy (when there is no vote spam)
• Robust ranking method: GBrank-robust
– Apply the general vote spam model to inject vote spam into unpolluted QA data
– Train the ranking function on the polluted data
– The learner transfers more weight to other content and community interaction features
[Diagram: content, community interaction, and quality features of <qr, qst, ans> tuples, together with relevance labels and vote-derived preferences, are fed into the ranking function]
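Putting the two earlier sketches together, training the robust variant amounts to polluting a copy of the clean training data before extracting preferences. In the sketch below, `train_gbrank` stands in for any pairwise gradient-boosted ranker and is an assumption, not the actual GBrank implementation.

```python
import copy

def train_gbrank_robust(clean_threads, train_gbrank,
                        beta=0.10, mu=3.0, sigma=1.0):
    """Train on spam-polluted data so the learner shifts weight away
    from vote-derived features toward content/community features."""
    polluted = inject_vote_spam(copy.deepcopy(clean_threads),
                                beta=beta, mu=mu, sigma=sigma,
                                thumbs_down=True)
    pairs = [pair for t in polluted for pair in preference_pairs(t)]
    return train_gbrank(pairs)
```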
Experimental Setup
• Dataset
– Factoid questions from the TREC QA benchmarks
• Total question set: 3000 factoid questions from 1999 to 2006
• 1250 of these factoid questions have at least one similar question in the Yahoo! Answers archive
– Question-answer collection dataset
• To simulate a user’s experience with a community QA site
• Submit each TREC query to Yahoo! Answers and retrieve up to 10 top-ranked questions according to Yahoo! Answers ranking
• For each of these Yahoo! questions, retrieve all of its answers
• 89642 <qr, qst, ans> tuples
– Relevance judgments
• Automatically labeled using the TREC factoid answer patterns (see the sketch below)
• 17711 tuples (19.8%) are labeled as “relevant”
• 71931 tuples (80.2%) are labeled as “non-relevant”
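The automatic labeling step can be pictured as simple pattern matching; a hedged sketch, assuming the TREC answer patterns are available as regular expressions (loading them from the benchmark files is omitted):

```python
import re

def is_relevant(answer_text, answer_patterns):
    """Label a <qr, qst, ans> tuple relevant if the answer text matches
    any of the TREC factoid answer patterns for that query."""
    return any(re.search(p, answer_text, re.IGNORECASE)
               for p in answer_patterns)
```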
Experimental Setup
• Evaluation metrics
– Precision at K
• For a given query, P(K) reports the fraction of answers ranked in the top K results that are labeled as relevant
– Mean Reciprocal Rank (MRR)
• The reciprocal rank of an individual query is the reciprocal of the rank at which the first relevant answer was returned; MRR averages this over all queries
– Mean Average Precision (MAP)
• The mean of the average precision of all queries in the test set
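For concreteness, all three metrics can be computed from a ranked list of binary relevance labels; a straightforward sketch, not tied to any particular evaluation toolkit:

```python
def precision_at_k(ranked_labels, k):
    """Fraction of the top-k results that are labeled relevant (1)."""
    return sum(ranked_labels[:k]) / k

def reciprocal_rank(ranked_labels):
    """1/rank of the first relevant result; 0 if none is retrieved."""
    for rank, rel in enumerate(ranked_labels, start=1):
        if rel:
            return 1.0 / rank
    return 0.0

def average_precision(ranked_labels):
    """Mean of P@k taken at each rank k where a relevant result occurs."""
    hits, total = 0, 0.0
    for rank, rel in enumerate(ranked_labels, start=1):
        if rel:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0

# MRR and MAP are the means of reciprocal_rank and average_precision
# over all queries in the test set.
```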
Ranking Methods Compared
• Baseline:
– The “best answer” is always placed on top
– The following answers are ranked in decreasing order of (thumbs-up votes − thumbs-down votes)
• GBrank:
– Ranking function trained with textual and community interaction features and preferences extracted from voting information
• GBrank-robust:
– Similar to GBrank, but the training data is polluted according to the chosen spam model
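The baseline fits in a few lines; a sketch assuming each response dict records its votes and whether the asker chose it as the best answer:

```python
def baseline_rank(thread):
    """Best answer first, then decreasing net votes (up minus down).

    Relies on Python's stable sort: the second pass moves the best
    answer to the top while preserving the vote ordering of the rest.
    """
    ranked = sorted(thread["responses"],
                    key=lambda r: r["thumbs_up"] - r["thumbs_down"],
                    reverse=True)
    ranked.sort(key=lambda r: not r.get("is_best_answer", False))
    return ranked
```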
Experimental Results
• QA Retrieval
– Vote spam model: β = 10%; N(µ, σ²) = N(3, 1²)
– Training data: randomly selected 800 TREC queries and all related QA
– Testing data (polluted): the remaining 450 TREC queries and all related QA
[Plot: retrieval accuracy of GBrank-robust (clean testing data), GBrank (clean testing data), GBrank-robust, GBrank, and Baseline]
Robustness to Vote Spam
[Two plots: ranking accuracy under thumbs-up vote spam and thumbs-up&down vote spam, comparing GBrank-robust, GBrank, and Baseline]
Analyzing Feature Contribution

Feature Name                                   Info Gain (no spam)   Info Gain (with vote spam)
Similarity between query and question          0.048
Number of resolved questions of the answerer   0.045
Length ratio between query and answer          0.043
Number of thumbs-down votes                    0.032                 0.003
Number of stars for the answerer               0.030                 0.029
Number of thumbs-up votes                      0.021                 0.002
Similarity between query and qst+ans           0.014                 0.026
Number of answer terms                         0.013                 0.018

[Ablation: performance with no textual features vs. no community interaction features]
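Info gain figures like those in the table follow the standard entropy-based definition; a self-contained sketch for a discretized feature and binary relevance labels:

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy of a label sequence, in bits."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n)
                for c in Counter(labels).values())

def info_gain(feature_values, labels):
    """IG(feature) = H(labels) - H(labels | feature), discrete values."""
    n = len(labels)
    conditional = 0.0
    for v in set(feature_values):
        subset = [l for f, l in zip(feature_values, labels) if f == v]
        conditional += (len(subset) / n) * entropy(subset)
    return entropy(labels) - conditional
```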
Contributions and Future Work
• Contributions
– A parameterized vote spam model to describe and analyze some common forms of vote spam
– A method for increasing the robustness of ranking by injecting noise at training time
– A comprehensive evaluation of ranking performance for community question answering under a variety of simulated vote spam attacks, demonstrating the robustness of our ranking method
• Future work
– Further explore different spam strategies and corresponding robust ranking methods
Thank you!
Related Work
• Robustness of web search ranking to click spam
– [Jansen 2006] reveals the influence of malicious clicks on online advertising
– [Radlinski 2006] shows how click spam biases ranking results
– [Immorlica et al. 2005] demonstrate that a particular class of learning algorithms is resistant to click fraud in some sense
• Ranking content in social media sites [Bian et al. 2008]
– Presents a ranking framework that utilizes user interaction information (including user votes) to retrieve high-quality relevant content in social media