Hate Speech Detection is Not as Easy as You May Think: A Closer Look at Model Validation Aymé Arango, Jorge Pérez and Bárbara Poblete
UNDETECTED HATE SPEECH IN SOCIAL MEDIA vs. ALMOST PERFECT STATE-OF-THE-ART RESULTS:
94% F1 [Agrawal and Awekar] ECIR 2018
93% F1 [Badjatiya et al.] WWW 2017
92% F1 [Zeerak Waseem] NAACL 2016
Hate Speech Detection is Not as Easy as You May Think
We show that state-of-the-art results are highly overestimated due to experimental issues in the models:
- Including the testing set during the training phase
- Oversampling the data before splitting
- User-biased datasets
State-of-the-art replication User distribution Generalization
DATASET 1 [Waseem and Hovy] NAACL 2016: tweets labeled Hate / Non-Hate
Model 1 [Badjatiya et al.] 2017 on DATASET 1 [Waseem and Hovy] NAACL 2016, reported at 93% F1:
PHASE 1 (Feature Extraction): Embeddings → LSTM → Fully Connected → Softmax → Prediction
PHASE 2 (Classification Method): after splitting into TRAIN and TEST, AVG(Embeddings) → GBDT → Prediction
This looks great! But there is a problem.
The problem: in PHASE 1, Model 1's embeddings are trained on all of DATASET 1, so the tweets later held out as TEST for PHASE 2 have already been seen during feature extraction.
Let’s create the model only with the training set.
New PHASE 1, same splitting: Model 1 [Badjatiya et al.] 2017 now trains the feature extraction (Embeddings → LSTM → Fully Connected → Softmax) on TRAIN only; PHASE 2 (AVG(Embeddings) → GBDT → Prediction) is unchanged.
Result: 73% F1, down from the reported 93% F1.
The result is overestimated due to the inclusion of the testing set during the training phase.
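This kind of leak is easy to sketch in a few lines of Python. Below, a word-vocabulary "feature extractor" is a toy stand-in for the embedding training in Phase 1, and the tweets are invented; the point is only the order of operations, not the actual model:

```python
import random

tweets = ["you are awful", "have a nice day", "awful people everywhere",
          "what a lovely morning", "nice awful lovely", "morning people"]

def fit_vocabulary(corpus):
    """Stand-in for the embedding-training step: the words the feature
    extractor knows are determined by the corpus it was fit on."""
    return {word for tweet in corpus for word in tweet.split()}

# Leaky setup (as in the replicated Phase 1): the extractor is fit on the
# whole dataset, so it has already seen every test tweet.
leaky_vocab = fit_vocabulary(tweets)

# Correct setup: split first, then fit on the training fold only.
random.seed(0)
shuffled = random.sample(tweets, k=len(tweets))
train, test = shuffled[2:], shuffled[:2]
clean_vocab = fit_vocabulary(train)

# Words appearing only in test tweets are unknown at training time,
# which is exactly the situation the model faces on truly unseen data.
assert clean_vocab <= leaky_vocab
```

With learned embeddings instead of a vocabulary, the same principle applies: any parameter fit on test tweets silently transfers information from the test set into the model.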
Model 2 [Agrawal and Awekar] 2018 on DATASET 1 [Waseem and Hovy] NAACL 2016, reported at 94% F1:
Oversampling → Splitting into TRAIN and TEST → Embeddings → LSTM → Fully Connected → Softmax → Prediction
This also looks great! But there is another problem.
Swapping the order for Model 2 [Agrawal and Awekar] 2018 (Splitting first, then Oversampling of the training data only): 79% F1, down from the reported 94% F1.
The result is overestimated because the oversampling phase occurs before splitting the data: duplicated copies of the same tweet end up in both the training and testing sets.
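A minimal sketch of why the order matters, using invented tweets and naive random oversampling in place of the actual method:

```python
import random
random.seed(0)

hate = [f"hate_{i}" for i in range(10)]    # minority class (unique tweets)
ok = [f"ok_{i}" for i in range(90)]        # majority class

def oversample(minority, target):
    """Naive random oversampling: add duplicates until `target` size."""
    return minority + [random.choice(minority)
                       for _ in range(target - len(minority))]

# Flawed order (as replicated): oversample the whole dataset, then split.
data = oversample(hate, len(ok)) + ok
random.shuffle(data)
train, test = data[:140], data[140:]
# Copies of the same minority tweet land in both folds, so the "unseen"
# test set contains tweets the model could simply memorize.
assert set(train) & set(test)

# Correct order: split the raw data first, oversample the training fold only.
raw = hate + ok
random.shuffle(raw)
train_raw, test_raw = raw[:70], raw[70:]
minority_tr = [t for t in train_raw if t.startswith("hate")]
majority_tr = [t for t in train_raw if t.startswith("ok")]
train_balanced = oversample(minority_tr, len(majority_tr)) + majority_tr
assert not set(train_balanced) & set(test_raw)   # test fold stays unseen
```

The inflation is largest for the minority (hateful) class, which is precisely the class oversampling multiplies, so per-class metrics on the hate label are the most distorted.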
However, there is another issue to take into account.
State-of-the-art replication User distribution Generalization
[Bar chart] % of tweets contributed by the most prolific user per class: values of 96%, 44%, 38%, and 25% across the Non-Hate, Sexism, Racism, and Hate classes.
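Measuring this bias only requires author metadata for each tweet. A sketch with hypothetical (user, class) records (the user names and counts below are invented for illustration, not the dataset's real figures):

```python
from collections import Counter

# Hypothetical (author, class_label) records standing in for a labeled corpus.
records = ([("user_a", "Racism")] * 96 + [("user_b", "Racism")] * 4
           + [("user_c", "Sexism")] * 44 + [("user_d", "Sexism")] * 30
           + [("user_e", "Sexism")] * 26)

def top_user_share(records, label):
    """Fraction of a class's tweets written by its most prolific author."""
    per_user = Counter(u for u, c in records if c == label)
    return max(per_user.values()) / sum(per_user.values())

# In this toy data, one author produces 96% of the "Racism" tweets:
assert top_user_share(records, "Racism") == 0.96
assert top_user_share(records, "Sexism") == 0.44
```

A class dominated by one author lets a classifier learn that author's writing style rather than hate speech itself.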
DATASET 1 [Waseem and Hovy] NAACL 2016, splitting into TRAIN and TEST without overlapped users:
Model 1 [Badjatiya et al.] 2017: 44% F1 (no user overlap) vs 73% F1 (corrected) vs 93% F1 (reported)
Model 2 [Agrawal and Awekar] 2018: 35% F1 (no user overlap) vs 79% F1 (corrected) vs 94% F1 (reported)
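A split without overlapped users can be sketched as follows, assuming each tweet carries its author (the `user_…`/`tweet_…` names are invented placeholders):

```python
import random
random.seed(1)

# Hypothetical (author, tweet) pairs; a real dataset carries this metadata.
dataset = [(f"user_{i % 5}", f"tweet_{i}") for i in range(50)]

def split_without_user_overlap(dataset, test_frac=0.3):
    """Assign whole authors (not individual tweets) to train or test,
    so no author appears on both sides of the split."""
    users = sorted({u for u, _ in dataset})
    random.shuffle(users)
    n_test = max(1, int(len(users) * test_frac))
    test_users = set(users[:n_test])
    train = [(u, t) for u, t in dataset if u not in test_users]
    test = [(u, t) for u, t in dataset if u in test_users]
    return train, test

train, test = split_without_user_overlap(dataset)
assert not {u for u, _ in train} & {u for u, _ in test}   # disjoint authors
assert len(train) + len(test) == len(dataset)
```

Splitting by author rather than by tweet prevents the model from scoring well merely by recognizing a prolific user's style in the test fold.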
What happens if we have a dataset with a better user distribution?
NEW DATASET: DATASET 1 combined with hateful tweets from DATASET 2 [Davidson et al.] ICWSM 2017, limited to 250 tweets per user per class.
NEW DATASET, splitting into TRAIN and TEST without overlapped users:
Model 1 [Badjatiya et al.] 2017: 78% F1 (vs 44%, 73%, and 93% F1 on DATASET 1)
Model 2 [Agrawal and Awekar] 2018: 76% F1 (vs 35%, 79%, and 94% F1 on DATASET 1)
The user distribution of a dataset has an impact on the classification results.
State-of-the-art replication User distribution Generalization
Generalization: training and testing sets drawn from different datasets, with DATASET 3 [Basile et al.] SemEval 2019 as the testing set.
Model 1 [Badjatiya et al.] 2017: trained on DATASET 1 [Waseem and Hovy] NAACL 2016 → 47% F1; trained on the NEW DATASET → 51% F1
Model 2 [Agrawal and Awekar] 2018: trained on DATASET 1 → 51% F1; trained on the NEW DATASET → 54% F1
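The evaluation protocol above can be sketched with a toy example. The tweets are invented and a trivial keyword matcher stands in for Models 1 and 2; only the protocol, training on one dataset and measuring on a different one, mirrors the slides:

```python
# Hypothetical corpora: each is a list of (tweet, label) pairs, label 1 = hate.
dataset_1 = [("you people are awful", 1), ("lovely weather today", 0),
             ("awful awful people", 1), ("have a great day", 0)]
dataset_3 = [("nobody wants them here", 1), ("great game last night", 0),
             ("you are all awful", 1), ("what a lovely view", 0)]

def train_keyword_model(train_set):
    """Toy stand-in for a classifier: memorize words that appear only
    in hateful training tweets and flag any tweet containing one."""
    hate_words = {w for tweet, y in train_set if y == 1 for w in tweet.split()}
    ok_words = {w for tweet, y in train_set if y == 0 for w in tweet.split()}
    cues = hate_words - ok_words
    return lambda tweet: int(bool(set(tweet.split()) & cues))

def accuracy(model, data):
    return sum(model(t) == y for t, y in data) / len(data)

model = train_keyword_model(dataset_1)
assert accuracy(model, dataset_1) == 1.0    # in-domain: looks perfect
assert accuracy(model, dataset_3) == 0.75   # cross-dataset: misses hate
                                            # expressed with unseen words
```

The toy model aces its own dataset but misses hateful tweets phrased with words it never saw, which is the same qualitative gap the slides report between in-dataset and cross-dataset F1.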
Better user-distributed datasets lead to better generalization.
Conclusions
Hate Speech Detection is Not as Easy as You May Think
We show that state-of-the-art results are highly overestimated due to experimental issues in the models:
- Including the testing set during the training phase
- Oversampling the data before splitting
- User-biased datasets
Hate Speech Detection is Not as Easy as You May Think: A Closer Look at Model Validation Aymé Arango, Jorge Pérez and Bárbara Poblete