Fools Gold: Understanding the Linguistic Features of Deception and - PowerPoint PPT Presentation

Fools’ Gold: Understanding the Linguistic Features of Deception and Humour Through April Fools’ Hoaxes Ed Dearden e.dearden@lancaster.ac.uk

Hell Planet

Why do we care about April Fools’?

False Information

But where does April Fools’ day fit into this?

April Fools’ Day

What’s the Difference?

Deceptive Intent: Is the author trying to deceive me? Not Deceive? Deceive?

Research Questions

What are the Linguistic features of an April Fools’ article compared to regular news?

How similar are the features of April Fools’ to those of “Fake News”?

I need some background

Deception • Exaggeration. • Vagueness. • Details.

Humour • Contextual Imbalance. • Emotional Language. • Ambiguity.

Irony • Part humour, part deception. • Negative Emotional Language. • Polarity Contrast.

How about the data?

Catching Fools’! 519 April Fools’ articles. • 371 websites. • 213776 words. • 2004-2018 •

Matching Fools’! 519 regular news articles. • 240 Websites. • 344927 Words. • 2004-2018 •

Fake News! Flagged as fake by Buzzfeed. • 2016 Election. • Horne and Adali, 2017. •

But what are you going to do with it?

Vagueness Details Imagination Building a feature set Deception Humour Formality Complexity

CLAWS Ambiguity USAS Ambiguity Wordnet Ambiguity Vague Degree Vagueness Superlatives Degree Adverbs Comparative Adverbs Exaggeration

Time Related Sense Terms Motion Terms Proper Nouns Details Spatial Terms Numbers Dates

Imaginative Informative Verbs Imaginative Verbs Conjunctions Prepositions Imagination Articles Imaginative Adjectives Determiners

First Person Pronouns Deception Negative Negations Emotional Terms

Head Contextual Positive Emotion Relationships Imbalance Body Contextual Alliteration Humour Imbalance Profanity

Associated Press Number Guidelines Associated Press Associated Press Formality Date Guidelines Title Guidelines Spelling Errors

Average Body Punctuation Head Punctuation Sentence Length Readability Complexity Lexical Diversity Function Words Lexical Density

Corpus Feature 1 … Feature N Class 0.111 … 0.552 AF … … … … Create feature matrix 0.444 … 0.654 NAF Feature Selection Which features are most informative? Can we learn to automatically differentiate? Classification What do the results mean? Analysis

Feature Selection Chi-squared test • ANOVA • Mutual Information • Recursive Feature Elimination • Logistic Regression Coefficients •

Feature Selection Formali lity ty Details ls • • Time Rela lated Term rms Associated Press Date • • Associated Press Number Sense Terms Compl plexity ty • Proper Nouns • Avg Sentence Length • Decepti tion Body Punctuation • Readabili lity • First Person n Prono nouns uns Imagination Im • Lexical l Diversity • Preposition • Adjectives Vag agueness • Imagination Conjunctions • Degree Adverb rbs

Classification Feature 1 … Feature N Class 0.111 … 0.552 AF … … … … 0.444 … 0.654 NAF Artjcle Predictjon Truth 1 AF AF 2 NAF AF … … … n-1 NAF NAF n AF NAF

Classification Accuracies for all Feature Sets Hoax Set: 74% Bag-of-Words: 80% Complexity: 71% + Detail

What are we seeing so far?

Our feature set can differentiate between hoax and genuine.

Most individual feature groups don’t do so well.

Complexity and Detail are Important.

How does this compare to Fake News?

Classifying Fakes 1. One classifier trained on Fake News. 2. Second Classifier trained on April Fools’ and tested on Fake News.

Classification Accuracies for Fake News Hoax Set: 76.9% Bag-of-Words: 77.7% Complexity: 78.1% + Detail

Classification Accuracies for Fake News Hoax Set: 64.5% Bag-of-Words: 49.4% Complexity: 65.7% + Detail Complexity: 75.7%

What does this suggest?

Our feature set differentiates fake news similarly well to April Fools’.

Some feature groups perform much worse.

Complexity and Detail remain the most important feature groups.

Our classifier trained on AF seems to work (to some extent) on Fake News.

But what does the data say?

Readability (Complexity)

Lexical Diversity (Complexity)

Time Related Vocabulary (Detail)

Proper Nouns (Detail)

Dates (Detail)

First Person Pronouns (Deception)

Can you sum it all up?

Conclusions – Part 1 Cr Created a a ne new c corp rpus us of April F ril Fools ls’ h hoax axes. • Used features from deception, humour, and irony • detection to classify hoaxes with moderate success. Showed that features relating to complexity and detail • seem to be the most important.

Conclusions – Part 1  Created a new corpus of April Fools’ hoaxes.  Us Used f d features f from d m deceptio tion, humo umour, an r, and ir d irony ny de detectio tion t n to c clas assif ify h hoax axes w with ith mo mode derate s suc uccess.  Showed that features relating to complexity and detail seem to be the most important.

Conclusions – Part 1 Created a new corpus of April Fools’ hoaxes. • Used features from deception, humour, and irony • detection to classify hoaxes with moderate success. Sh Showed d that f t featu atures r rela lating ting t to compl mplexit ity an and d d detail ail • seem t m to be be the mo most t impo important. ant.

Conclusions – Part 2 Found und th that s at simil imilar ar featur atures ar are us useful in l in ide identif tifyin ying Apr April il • Fools ls’ an ’ and d Fake Ne News. Some of these features manifest themselves similarly • for both AF Hoaxes and Fake News.

Conclusions – Part 2 Found that similar features are useful in identifying April • Fools’ and Fake News. So Some o of th these featur tures ma manif nifest th t thems mselv lves s simil imilarly arly • for bo both th AF H AF Hoax axes and and F Fak ake N News.

Future Work

Questions? Thanks for listening! e.dearden@lancaster.ac.uk

Fools Gold: Understanding the Linguistic Features of Deception and - PowerPoint PPT Presentation

Fools Gold: Understanding the Linguistic Features of Deception and Humour Through April Fools Hoaxes Ed Dearden e.dearden@lancaster.ac.uk Hell Planet Why do we care about April Fools? False Information But where does April

Gold and Fools Gold: Successes, Failures, and Futures in Computer Systems Research Butler

Over the last quarter of a century something fundamental seems to have changed in the way in

FIN 48 Quantifying the unknowable Serious people shortage hits tax departments Sovereign wealth

6/18/2018 Sunday, April 1 This is Easter AND April Fools Day! If you believe in the resurrection

Proverbs 1:32 For the waywardness of the simple will kill them, and the complacency of fools will

Investor Presentation 1 Creek Important notice and disclaimer This presentation has been

My tables of elements project My element is gold. By: Lauren Goldsborough Gold I chose gold

~ The Iceberg Gold Deposit ~ One of the best gold prospects in Nevada Discovering gold

- or how Microsoft's WOW64 technology unintentionally fools IT Security analysts Christian

Gold and the Gold Standard Gold and the Gold Standard Nathan Lewis Principal, Kiku Capital

GOLD FORUM Process 2019/20 Information presentation WELCOME GOLD FORUM Process 2019/20 2 GOLD

Proverbs 1:7 NIV The fear of the LORD is the beginning of knowledge, but fools despise wisdom and

In the Kingdom of Fools by A.K. Ramanujan Presentation Module - I A. K. Ramanujan Indian

Living in a fools wireless - secured paradise Stefan Kiese Topics Wireless (consumer)

Our next presentation is from Gold Fields. Gold Fields is a globally diversified gold producer

New Gold Play New Gold Player er November 2011 Disclaimer Certain statements in this

New New Gold Pr Gold Province vince New New Gold Gold Play Player er June 2012 Disclaimer

1 Properties of Matter and Solutions 2 Properties of Matter and Solutions Pyrite, otherwise

Linux Pranks RITlug April Fools Edition Disclaimer: These pranks range from funny, to annoying,

Bounded independence plus noise fools products Chin Ho Lee Northeastern University Elad

Scotgold: Developin ing the Cononis ish Gold ld Min ine and Exp xplorin ing for Gold ld in

GOLD GOLD WARS WARS A Golden Renaissance A Tribute to Ferdinand Lips (1931 2005) WHAT ARE

TUDOR GOLD CORP. Turning Gold Claims into Gold Mines November, 2018 | TSX-V: TUD | Frankfurt:

Expanding Gold Resources & Advancing Multiple New Gold Discoveries at Eagle Mountain Gold

Fools Gold: Understanding the Linguistic Features of Deception and - PowerPoint PPT Presentation

Fools Gold: Understanding the Linguistic Features of Deception and Humour Through April Fools Hoaxes Ed Dearden e.dearden@lancaster.ac.uk Hell Planet Why do we care about April Fools? False Information But where does April

Gold and Fools Gold: Successes, Failures, and Futures in Computer Systems Research Butler

Over the last quarter of a century something fundamental seems to have changed in the way in

FIN 48 Quantifying the unknowable Serious people shortage hits tax departments Sovereign wealth

6/18/2018 Sunday, April 1 This is Easter AND April Fools Day! If you believe in the resurrection

Proverbs 1:32 For the waywardness of the simple will kill them, and the complacency of fools will

Investor Presentation 1 Creek Important notice and disclaimer This presentation has been

My tables of elements project My element is gold. By: Lauren Goldsborough Gold I chose gold

~ The Iceberg Gold Deposit ~ One of the best gold prospects in Nevada Discovering gold

- or how Microsoft's WOW64 technology unintentionally fools IT Security analysts Christian

Gold and the Gold Standard Gold and the Gold Standard Nathan Lewis Principal, Kiku Capital

GOLD FORUM Process 2019/20 Information presentation WELCOME GOLD FORUM Process 2019/20 2 GOLD

Proverbs 1:7 NIV The fear of the LORD is the beginning of knowledge, but fools despise wisdom and

In the Kingdom of Fools by A.K. Ramanujan Presentation Module - I A. K. Ramanujan Indian

Living in a fools wireless - secured paradise Stefan Kiese Topics Wireless (consumer)

Our next presentation is from Gold Fields. Gold Fields is a globally diversified gold producer

New Gold Play New Gold Player er November 2011 Disclaimer Certain statements in this

New New Gold Pr Gold Province vince New New Gold Gold Play Player er June 2012 Disclaimer

1 Properties of Matter and Solutions 2 Properties of Matter and Solutions Pyrite, otherwise

Linux Pranks RITlug April Fools Edition Disclaimer: These pranks range from funny, to annoying,

Bounded independence plus noise fools products Chin Ho Lee Northeastern University Elad

Scotgold: Developin ing the Cononis ish Gold ld Min ine and Exp xplorin ing for Gold ld in

GOLD GOLD WARS WARS A Golden Renaissance A Tribute to Ferdinand Lips (1931 2005) WHAT ARE

TUDOR GOLD CORP. Turning Gold Claims into Gold Mines November, 2018 | TSX-V: TUD | Frankfurt:

Expanding Gold Resources &amp; Advancing Multiple New Gold Discoveries at Eagle Mountain Gold

Expanding Gold Resources & Advancing Multiple New Gold Discoveries at Eagle Mountain Gold