A Crowd-Annotated Spanish Corpus for Humor Analysis
Santiago Castro, Luis Chiruzzo, Aiala Rosá, Diego Garat and Guillermo Moncecchi
July 20th, 2018
Grupo de Procesamiento de Lenguaje Natural, Universidad de la República — Uruguay
Outline
• Background
• Extraction
• Annotation
• Dataset
• Analysis
• Conclusion
• HAHA Task
Background
Background i
• Humor Detection is about telling whether a text is humorous (e.g., a joke):
“My grandpa came to America looking for freedom, but it didn’t work out, in the next flight my grandma was coming.”
“IT’S REALLY HOT”
Background ii
• Some previous work, such as Barbieri and Saggion (2014), Mihalcea and Strapparava (2005), and Sjöbergh and Araki (2007), created binary humor classifiers for short texts written in English.
• They extracted one-liners from the Internet and from Twitter, such as: “Beauty is in the eye of the beer holder.”
• Castro et al. (2016) worked on Spanish tweets, since our group is interested in developing tools for Spanish.
• Back then, we built the first and only Spanish dataset for studying humor.
Background iii
• The Castro et al. (2016) corpus provided 40k tweets from 18 accounts, with 34k annotations. The annotators decided whether the tweets were humorous and, if so, rated them from 1 to 5.
• However, the dataset has some issues:
1. low inter-annotator agreement (Fleiss’ κ = 0.3654)
2. limited variety of sources (humorous: 9 Twitter accounts; non-humorous: 3 news accounts, 3 inspirational-thought accounts and 3 curious-fact accounts)
3. very few annotations per tweet (fewer than 2 on average, and only around 500 tweets with ≥ 5 annotations)
4. only 6k tweets were considered humorous by the crowd
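For reference, Fleiss’ κ (the agreement measure quoted above) can be computed with statsmodels. This is a minimal sketch with a made-up ratings matrix, not the dataset’s actual annotation data:

```python
# Minimal sketch of Fleiss' kappa with statsmodels; the ratings matrix
# below is made up for illustration, not the corpus's real annotations.
import numpy as np
from statsmodels.stats.inter_rater import aggregate_raters, fleiss_kappa

# Rows: items (tweets); columns: raters. 0 = not humorous, 1 = humorous.
# Fleiss' kappa requires the same number of ratings per item.
ratings = np.array([
    [1, 1, 1],
    [0, 0, 1],
    [0, 0, 0],
    [1, 0, 1],
])

# aggregate_raters turns the item x rater matrix into item x category counts.
counts, _ = aggregate_raters(ratings)
print(fleiss_kappa(counts))  # 1.0 = perfect agreement, 0 = chance level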
Related work
Potash, Romanov, and Rumshisky (2017) built a corpus of English tweets that aims to distinguish the degree of funniness of a given tweet. They used tweets issued in response to a TV game show, labeling which ones the show considered humorous. It was used in SemEval-2017 Task 6 — #HashtagWars.
Extraction
Extraction i
1. We wanted at least 20k tweets, as balanced as possible, with at least 5 annotations each.
2. We fetched tweets from 50 humorous accounts from Spanish-speaking countries, taking 12k at random.
3. We fetched samples of tweets written in Spanish throughout February 2018, taking 12k at random.
Extraction ii
4. As expected, both sources contained a mix of humorous and non-humorous tweets (a sketch of the fetching step is shown below).
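The slides do not include the extraction code; a rough sketch of step 2 using Tweepy might look like the following. The account list, credentials, and per-account limit are placeholders:

```python
# Hedged sketch of fetching timelines with Tweepy (not the authors' code);
# HUMOROUS_ACCOUNTS and the credential strings are placeholders.
import random
import tweepy

auth = tweepy.OAuthHandler("API_KEY", "API_SECRET")
auth.set_access_token("TOKEN", "TOKEN_SECRET")
api = tweepy.API(auth, wait_on_rate_limit=True)

HUMOROUS_ACCOUNTS = ["some_humor_account"]  # 50 accounts in the paper

tweets = []
for account in HUMOROUS_ACCOUNTS:
    # Cursor pages through each account's timeline.
    for status in tweepy.Cursor(api.user_timeline, screen_name=account,
                                tweet_mode="extended").items(1000):
        tweets.append(status.full_text)

sample = random.sample(tweets, min(12_000, len(tweets)))  # 12k at random
```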
Annotation
Annotation i
We built a web page, similar to the one used by Castro et al. (2016):
Annotation ii
[Screenshot of the annotation web page: clasificahumor.com]
Annotation iii
• Tweets were shown to annotators in random order, avoiding duplicates (by using web cookies).
• We wanted the UI to be as intuitive and self-explanatory as possible, trying not to induce any bias and letting users come up with their own definition of humor.
• The simple and friendly interface is meant to keep users engaged and having fun while classifying tweets.
Annotation iv
• The first tweets shown in every session were the same: 3 tweets for which we know a clear answer.
• During the annotation process, we added around 4,500 tweets coming from humorous accounts to help the balance.
• People annotated from March 8th to 27th, 2018.
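The slides describe the tweet-selection logic only at a high level. Below is a minimal Flask sketch of how a server could implement it with a cookie-based session; it is an illustration, not the actual clasificahumor.com code, and the IDs and routes are invented:

```python
# Minimal sketch of the annotation server's tweet selection, assuming a
# session cookie stores the IDs already shown; not the real implementation.
import random
from flask import Flask, jsonify, session

app = Flask(__name__)
app.secret_key = "change-me"  # placeholder; signs the session cookie

TEST_TWEETS = [1, 2, 3]                # the 3 tweets with a known, clear answer
ALL_TWEET_IDS = list(range(4, 27283))  # hypothetical ID range

@app.route("/next-tweet")
def next_tweet():
    seen = set(session.get("seen", []))
    # Every session starts with the same 3 test tweets.
    pending_tests = [t for t in TEST_TWEETS if t not in seen]
    if pending_tests:
        tweet_id = pending_tests[0]
    else:
        # Otherwise pick a random tweet this session has not annotated yet.
        tweet_id = random.choice([t for t in ALL_TWEET_IDS if t not in seen])
    session["seen"] = list(seen | {tweet_id})
    return jsonify({"tweet_id": tweet_id})
```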
Dataset
Dataset i
• The dataset consists of two CSV files: tweets and annotations.

tweets:
| tweet ID | origin |
| 24 | humorous account |

annotations:
| tweet ID | session ID | date | value |
| 24 | YOH113F…C4R | 2018-03-15 19:30:34 | 2 |
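To make the schema concrete, here is a short sketch of loading and joining the two files with pandas; the file names and exact column labels are assumptions based on the table above:

```python
# Sketch: load the two CSV files and join annotations to tweets with pandas.
# File and column names follow the schema above but are assumptions.
import pandas as pd

tweets = pd.read_csv("tweets.csv")            # columns: tweet ID, origin
annotations = pd.read_csv("annotations.csv")  # tweet ID, session ID, date, value

merged = annotations.merge(tweets, on="tweet ID", how="left")
print(merged.groupby("origin")["value"].mean())  # mean rating per source
```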
Dataset ii
• 27,282 tweets
• 117,800 annotations (including 2,959 skips)
• 107,634 “high quality” annotations (excluding skips)
Analysis
Annotation Distribution
[Histogram: number of tweets (0–12,000) by number of annotations per tweet (0–10)]
Class Distribution
• Not Humorous: 65.2%
• Not Funny: 13.3%
• Little Funny: 10.3%
• Regular: 7%
• Good: 3.2%
• Excellent: 1%
Annotators Distribution
[Chart: cumulative annotations (0–100k) vs. annotators on a log scale (1–1,000)]
Agreement
• Krippendorff’s α = 0.5710 (vs. 0.3654 for the previous dataset)
• If we include the “low quality” annotations, α = 0.5512
• Funniness: α = 0.1625
• If we only consider the 11 annotators who tagged more than 1,000 times (50,939 annotations in total), the humor and funniness agreement values are 0.6345 and 0.2635, respectively.
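The α values above can be reproduced in principle with the open-source krippendorff Python package (pip install krippendorff), though we cannot confirm that is the tooling the authors used. A minimal sketch with a made-up reliability matrix:

```python
# Sketch of computing Krippendorff's alpha; the toy matrix is made up.
import numpy as np
import krippendorff

# Rows: annotators; columns: tweets. np.nan marks missing annotations,
# which alpha handles natively (unlike Fleiss' kappa).
reliability = np.array([
    [1,      1, 0, np.nan],
    [1,      0, 0, 1],
    [np.nan, 1, 0, 1],
], dtype=float)

# Humorous/not humorous is a nominal (categorical) judgment.
print(krippendorff.alpha(reliability_data=reliability,
                         level_of_measurement="nominal"))
```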
Conclusion
Conclusion
• We created a better version of a dataset to study humor in Spanish: 27,282 tweets coming from multiple sources, with 107,634 “high quality” annotations.
• Significant inter-annotator agreement value.
• It is also a first step towards studying subjectivity. Although more annotations per tweet would be appropriate, there is a subset of a thousand tweets with at least six annotations that could be used to study people’s opinions on the same instances.
HAHA Task
HAHA Task
• An IberEval 2018 task.
• Two subtasks: Humor Classification and Funniness Average Prediction.
• Subset of 20k tweets.
• 3 participants, with 7 and 2 submissions respectively.
Analysis

| Category | Votes | Hits |
| Humorous | 3/5 | 52.25% |
| Humorous | 4/5 | 75.33% |
| Humorous | 5/5 | 85.04% |
| Not humorous | 3/5 | 68.54% |
| Not humorous | 4/5 | 80.83% |
| Not humorous | 5/5 | 82.42% |
References
Barbieri, Francesco and Horacio Saggion (2014). “Automatic Detection of Irony and Humour in Twitter”. In: ICCC, pp. 155–162.
Castro, Santiago et al. (2016). “Is This a Joke? Detecting Humor in Spanish Tweets”. In: Ibero-American Conference on Artificial Intelligence. Springer, pp. 139–150. doi: 10.1007/978-3-319-47955-2_12.
Fleiss, Joseph L (1971). “Measuring nominal scale agreement among many raters”. In: Psychological Bulletin 76.5, p. 378. doi: 10.1037/h0031619.
Krippendorff, Klaus (2012). Content Analysis: An Introduction to Its Methodology. Sage. doi: 10.1111/j.1468-4446.2007.00153_10.x.
Mihalcea, Rada and Carlo Strapparava (2005). “Making Computers Laugh: Investigations in Automatic Humor Recognition”. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing. HLT ’05. Vancouver, British Columbia, Canada: Association for Computational Linguistics, pp. 531–538. doi: 10.3115/1220575.1220642.
Potash, Peter, Alexey Romanov, and Anna Rumshisky (2017). “SemEval-2017 Task 6: #HashtagWars: Learning a Sense of Humor”. In: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pp. 49–57. doi: 10.18653/v1/s17-2004.
Sjöbergh, Jonas and Kenji Araki (2007). “Recognizing Humor Without Recognizing Meaning”. In: WILF. Ed. by Francesco Masulli, Sushmita Mitra, and Gabriella Pasi. Vol. 4578. Lecture Notes in Computer Science. Springer, pp. 469–476. isbn: 978-3-540-73399-7. doi: 10.1007/978-3-540-73400-0_59.
Questions?
https://pln-fing-udelar.github.io/humor/