Like Sheep Among Wolves: Characterizing Hateful Users on Twitter
Manoel Horta Ribeiro, Pedro H. Calais, Yuri A. Santos, Virgílio A. F. Almeida, Wagner Meira Jr.
Motivation
• In recent years, plenty of work has been done on characterizing and detecting hate speech [Burnap and Williams 2017; Waseem and Hovy 2016; Davidson et al. 2016].
[Pipeline diagram: tweets/posts/comments → annotation by Turks → characterization and detection]
Motivation
• The meaning of such content is often not self-contained, e.g.:
  "Time's up, you all getting what should have happened long ago"
Motivation
• Hate speech != offensive speech [Davidson et al. 2016], e.g.:
  "You stupid {insert racial slur here}"
Motivation
• Previous work focuses on content and has shortcomings related to context.
• Idea: change the focus from the content to the user. This:
  - allows for more sophisticated data collection;
  - gives annotators context, not isolated tweets;
  - yields a richer feature space: activity, network analysis.
Data Collection
• We begin by sampling Twitter's retweet network with the Directed Unbiased Random Walk (DURW) algorithm [Ribeiro, Wang and Towsley 2010].
• We obtained 100,386 users, along with up to 200 tweets from each of their timelines.
[Pipeline sidebar: hate-related words → diffusion → stratified sampling → annotators]
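To make the sampling step concrete, below is a minimal sketch of a random walk with random jumps over a retweet graph. It is not the full DURW estimator, which also corrects the bias a walk has towards high-degree users; the function and parameter names are ours.

```python
import random
import networkx as nx

def random_walk_sample(G: nx.DiGraph, n_users: int,
                       jump_prob: float = 0.1, seed: int = 42) -> set:
    """Simplified random walk with random jumps over a retweet graph.

    NOT the full DURW estimator; this only illustrates the traversal idea.
    """
    rng = random.Random(seed)
    nodes = list(G.nodes)
    current = rng.choice(nodes)
    sampled = {current}
    target = min(n_users, len(nodes))
    while len(sampled) < target:
        # Treat edges as undirected, as DURW does when walking a directed graph.
        neighbors = list(G.successors(current)) + list(G.predecessors(current))
        if not neighbors or rng.random() < jump_prob:
            current = rng.choice(nodes)      # random jump escapes sinks/small components
        else:
            current = rng.choice(neighbors)
        sampled.add(current)
    return sampled
```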
Data Collection
• Given the graph, we employ a hate-related lexicon, tagging the users who used its words.
• We use these users as seeds in a diffusion process based on DeGroot's learning model [Golub and Jackson 2010].
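A minimal NumPy sketch of this diffusion, under our assumption that each user's belief is repeatedly replaced by the average of their neighbors' beliefs, starting from 1 for lexicon-flagged seeds and 0 otherwise:

```python
import numpy as np

def degroot_diffusion(A: np.ndarray, seeds: np.ndarray, n_iter: int = 100) -> np.ndarray:
    """DeGroot-style belief averaging on adjacency matrix A.

    seeds: initial beliefs, 1.0 for users who used lexicon words, else 0.0.
    Returns one value in [0, 1] per user, used later for stratified sampling.
    """
    n = A.shape[0]
    A = A + np.eye(n)                      # self-loops (a modeling choice) so every
                                           # user retains part of their own belief
    T = A / A.sum(axis=1, keepdims=True)   # row-stochastic transition matrix
    p = seeds.astype(float)
    for _ in range(n_iter):
        p = T @ p                          # each user averages over their neighbors
    return p
```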
Data Collection
• After the diffusion, each user in the graph is associated with a real number in the range [0, 1].
• We then perform stratified sampling, obtaining up to 1,500 users in each of the intervals [0, .25), [.25, .5), [.5, .75), [.75, 1).
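A minimal sketch of the stratified sampling step; the handling of the boundary value 1.0 is our assumption:

```python
import numpy as np

def stratified_sample(beliefs: np.ndarray, per_stratum: int = 1500,
                      seed: int = 0) -> np.ndarray:
    """Draw up to `per_stratum` user indices from each belief interval."""
    rng = np.random.default_rng(seed)
    strata = [(0.00, 0.25), (0.25, 0.50), (0.50, 0.75), (0.75, 1.01)]  # top bin closed at 1
    picks = []
    for lo, hi in strata:
        members = np.where((beliefs >= lo) & (beliefs < hi))[0]
        k = min(per_stratum, len(members))
        picks.append(rng.choice(members, size=k, replace=False))
    return np.concatenate(picks)
```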
Data Collection
• We ask annotators to determine whether users are hateful or not, using Twitter's hateful conduct guidelines.
• With 3-5 annotators per user, we obtained 4,972 annotated users, of which 544 were considered hateful.
Data Collection
• Lastly, we also collect the users who had been suspended 4 months after the data collection.
• Using Twitter's API, we find 686 suspended users.
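One hedged way to detect suspensions, assuming the tweepy 3.x client and Twitter's v1.1 error code 63 ("User has been suspended"); the slides do not say which endpoint was used, and rate limiting is omitted:

```python
import tweepy

def find_suspended(api: tweepy.API, user_ids: list) -> list:
    """Return the subset of user_ids that Twitter reports as suspended.

    Assumes tweepy 3.x, where a lookup of a suspended account raises
    TweepError with api_code 63; verify against the current API.
    """
    suspended = []
    for uid in user_ids:
        try:
            api.get_user(user_id=uid)
        except tweepy.TweepError as e:
            if e.api_code == 63:
                suspended.append(uid)
    return suspended
```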
Results
[Groups compared: Hateful User / Normal User / Hateful Neigh. / Normal Neigh. / Suspended / Active]
• We analyze how hateful and normal users differ w.r.t. their activity, vocabulary, and network centrality.
• We also compare the neighbors of hateful and of normal users, and suspended vs. active users, to reinforce our findings.
• We compare these populations in pairs, as the sampling mechanism for each of them is different.
• We argue that each of these pairs contains a proxy for hateful speech on Twitter.
Results: Hateful users are power users
[Bar charts: #statuses/day, #followers/day, #followees/day, #favorites, avg(interval)]
• Hateful users tweet more, in shorter intervals, favorite more tweets by other people, and follow more users (p-values < 0.01).
• We observe similar results when comparing their neighborhoods and when comparing active vs. suspended users.
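The slides do not name the significance test; below is a minimal sketch assuming a two-sided Mann-Whitney U test, a common choice for heavy-tailed activity distributions:

```python
from scipy.stats import mannwhitneyu

def groups_differ(hateful_values, normal_values, alpha: float = 0.01) -> bool:
    """Two-sided Mann-Whitney U test between two activity distributions.

    Returns True when the difference is significant at level alpha.
    The slides report p < 0.01 but do not say which test was used.
    """
    _, p_value = mannwhitneyu(hateful_values, normal_values, alternative="two-sided")
    return p_value < alpha
```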
Results: Hateful users have newer accounts
[Histogram: account creation dates, 2006-03 through 2017-03]
• Hateful users' accounts were created later than normal users' (p-value < 0.001).
• A hypothesis for this difference is that hateful users are banned more often for infringing Twitter's guidelines, so older hateful accounts are less likely to survive.
Results: The median hateful user is more central
[Bar charts: median and average of betweenness, eigenvector centrality, and out-degree]
• The median hateful user is more central in all three measures.
• The average hateful user is not, as the averages are skewed by a few very influential users.
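A sketch of how these three centralities could be computed with networkx; the slides do not say which implementation was used, and the sampling parameter `k` for betweenness is our addition to keep the computation tractable on a graph of ~100K users:

```python
import networkx as nx

def centrality_features(G: nx.DiGraph) -> dict:
    """Compute the three centrality measures compared in the slides."""
    return {
        # Exact betweenness is O(V*E); sampling k pivot nodes approximates it.
        "betweenness": nx.betweenness_centrality(G, k=1000, seed=0),
        "eigenvector": nx.eigenvector_centrality(G, max_iter=500),
        "out_degree": dict(G.out_degree()),
    }
```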
Results: Hateful users use non-trivial vocabulary
[Bar charts: average usage of EMPATH lexical categories (Sadness, Swearing, Independence, Positive Emotions, Negative Emotions, Government, Love, Ridicule, Masculine, Feminine, Violence, Suffering, Dispute, Anger, Envy, Work, Politics, Terrorism, Shame, Confusion, Hate)]
• Average values for the usage of EMPATH lexical categories.
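These categories come from the Empath library (`pip install empath`). A minimal sketch of scoring a user's tweets with it; the category names passed below are our guesses and should be checked against Empath's built-in list:

```python
from empath import Empath

lexicon = Empath()

def empath_scores(tweets: list) -> dict:
    """Normalized Empath category usage over a user's tweets.

    normalize=True divides each category count by the total word count,
    mirroring the per-category averages shown in the slide.
    """
    text = " ".join(tweets)
    return lexicon.analyze(text,
                           categories=["violence", "hate", "anger", "politics"],
                           normalize=True)
```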
Future Stuff/Discussion
[Graph visualization: retweet network with Suspended/Active and Hateful/Normal users highlighted]
• Hateful users are 71x more likely to retweet another hateful user.
• Suspended users are 11x more likely to retweet another suspended user.
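One way such a lift could be computed; the exact definition (probability that a hateful user's retweet targets another hateful user, divided by the same probability for normal users) is our assumption, not stated on the slide:

```python
def retweet_lift(retweets: list, is_hateful: dict) -> float:
    """Lift of hateful->hateful retweets over normal->hateful retweets.

    retweets: (retweeter, original_author) pairs; is_hateful: user -> bool.
    """
    def hateful_target_rate(source_hateful: bool) -> float:
        edges = [(u, v) for u, v in retweets if is_hateful[u] == source_hateful]
        return sum(is_hateful[v] for _, v in edges) / len(edges) if edges else 0.0

    base = hateful_target_rate(False)
    return float("inf") if base == 0 else hateful_target_rate(True) / base
```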
Future Stuff/Discussion
• We can also bring the focus to the user in the task of classification.
• Features:
  - GloVe vectors for the tweets (averaged);
  - activity/network centrality attributes.
• Beyond new features, we may use the very structure of the network in the classification task.
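A minimal sketch of the per-user feature construction described above; the GloVe dimension and the exact attribute set are assumptions:

```python
import numpy as np

def user_features(tweet_vectors: np.ndarray, activity: np.ndarray) -> np.ndarray:
    """Build a per-user feature vector.

    tweet_vectors: (n_tweets, 300) GloVe embeddings, one row per tweet.
    activity: activity/centrality attributes (e.g. statuses/day, betweenness).
    """
    glove_avg = tweet_vectors.mean(axis=0)         # average GloVe vector over tweets
    return np.concatenate([glove_avg, activity])   # concatenated final feature vector
```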
Future Stuff/Discussion: Summary
1. Proposed changing the focus from content to user;
2. Proposed a data collection method with less bias towards a specific lexicon;
3. Observed significant differences w.r.t. activity, lexicon, and network centrality between hateful and normal users;
4. Showed how the network structure of users can be used to improve the detection of hateful and suspended users.
GitHub: manoelhortaribeiro | Twitter: manoelribeiro | Mail: manoelribeiro at dcc.ufmg.br
EXTRA: Hateful users don't behave like spammers
[Bar charts: #followers/#followees, #URLs/tweet, #hashtags/tweet]
• We analyze metrics that have been used to detect spammers.
• Hateful users in our dataset do not seem to be abusing hashtags or mentions, and do not have higher followers-per-followees ratios.