Natural Language Processing (CSEP 517): Computational Pragmatics - PowerPoint PPT Presentation

Natural Language Processing (CSEP 517): Computational Pragmatics Chenhao Tan � 2017 c University of Washington chenhao@chenhaot.com May 22, 2017 1 / 67

What do we use language for? 2 / 67

What do we use language for? Communicating with other humans ◮ exchanging emails ◮ talking to friends ◮ writing ◮ giving lectures ◮ ... 3 / 67

Throw back Monday Can you pass me the salt? 4 / 67

Pragmatics The study of meaning as communicated by a speaker to a listener (Yule, 1996). 5 / 67

Pragmatics The study of meaning as communicated by a speaker to a listener (Yule, 1996). Or, contextual meaning 6 / 67

Pragmatics The study of meaning as communicated by a speaker to a listener (Yule, 1996). Or, contextual meaning Pragmatics is important for building conversational agents, understanding human decision making, understanding language, etc. 7 / 67

Pragmatics vs. Syntax, Semantics (Yule, 1996) ◮ Syntax: the relationships between linguistic forms, how they are arranged in sequences, and which sequences are well-formed. ◮ Semantics: the relationships between linguistic forms and entities in the world. ◮ Pragmatics: the relationships between linguistic forms and the users of those forms. 8 / 67

Outline Speech act theory The effect of wording choices (big data pragmatics) Modeling conversations: dialogue act categorization Rational speech acts model 9 / 67

Speech act theory We do not simply produce utterances containing grammatical structures; we perform actions via those utterances. 11 / 67

Speech act theory We do not simply produce utterances containing grammatical structures; we perform actions via those utterances. Actions performed via utterances are generally called speech acts (Austin, 1975). 12 / 67

Speech act theory ◮ locutionary act (the actual utterance and its ostensible meaning) ◮ illocutionary act (its real, intended meaning) ◮ perlocutionary act (its actual effect, whether intended or not) 13 / 67

Wording matters Motivate voter turnout (Bryan et al., 2011) “How important is it to you to be a voter “How important is it to you to vote in the in the upcoming election?” upcoming election?” 15 / 67

Wording matters Motivate voter turnout (Bryan et al., 2011) “How important is it to you to be a voter “How important is it to you to vote in the in the upcoming election?” upcoming election?” 16 / 67

Large-scale natural experiments 17 / 67

Large-scale natural experiments A large number of social interactions in the format of texts ⇓ Potential opportunities for natural experiments 18 / 67

Large-scale natural experiments The effect of wording on message propagation on Twitter (Tan et al., 2014) 19 / 67

Large-scale natural experiments The effect of wording on message propagation on Twitter (Tan et al., 2014) 20 / 67

Large-scale natural experiments ◮ Millions of topic-author controlled pairs ◮ Ranking within a pair (classification) ◮ Evaluation: the accuracy of predicting which one was retweeted more (random → 50%) ◮ Classifier: logistic regression 21 / 67

Features Pronouns first person singular (i) first person plural (we) second person (you) third person singular (she, he) third person plural (they) 22 / 67

Features Pronouns first person singular (i) ——– first person plural (we) ——– second person (you) ——– third person singular (she, he) ↑↑ ↑↑ third person plural (they) ↑ ↑↑↑ 23 / 67

Features Pronouns first person singular (i) ——– first person plural (we) ——– second person (you) ——– third person singular (she, he) ↑↑ ↑↑ third person plural (they) ↑ ↑↑↑ Referring to other people helps 24 / 67

Features Generality indefinite articles (a,an) definite articles (the) 25 / 67

Features Generality indefinite articles (a,an) ↑↑↑ ↑ definite articles (the) ——– 26 / 67

Features Generality indefinite articles (a,an) ↑↑↑ ↑ definite articles (the) ——– Generality helps 27 / 67

Features Language model scores ◮ similarity with overall Twitter users twitter unigram twitter bigram 28 / 67

Features Language model scores ◮ similarity with overall Twitter users twitter unigram ↑↑↑ ↑ twitter bigram ↑↑↑ ↑ 29 / 67

Features Language model scores ◮ similarity with overall Twitter users twitter unigram ↑↑↑ ↑ twitter bigram ↑↑↑ ↑ ◮ similarity with personal history personal unigram personal bigram 30 / 67

Features Language model scores ◮ similarity with overall Twitter users twitter unigram ↑↑↑ ↑ twitter bigram ↑↑↑ ↑ ◮ similarity with personal history personal unigram ↑↑↑ ↑ personal bigram ——– Be like the community & be true to yourself 31 / 67

Baseline without “natural experiments” Supervised classification without control ◮ most-retweeted tweets vs. least-retweeted tweets 32 / 67

Prediction performance Human performance 33 / 67

Prediction performance Human performance ◮ Controlling for context is important ◮ Big data can help understand pragmatics https://chenhaot.com/retweetedmore 34 / 67

Beyond retweeting ◮ Persuasive arguments (Tan et al., 2016) ◮ Memorable (movie) quotes (Danescu-Niculescu-Mizil et al., 2012a) ◮ Power dynamics (Danescu-Niculescu-Mizil et al., 2012b; Prabhakaran et al., 2014) ◮ Newsworthiness of research articles and political speeches (Zhang et al., 2016) 35 / 67

Dialogue act classification/tagging Define categories and label corpora (Stolcke et al., 2000) ◮ statement ◮ question ◮ backchannel ◮ agreement ◮ apology ◮ ... 37 / 67

Dialogue act classification/tagging Supervised classification ◮ SVM ◮ logistic classification Structure prediction (sequence tagging) ◮ Hidden Markov model ◮ Conditional random field 38 / 67

Speech act theory The effect of wording choices (big data pragmatics) Modeling conversations: dialogue act categorization Rational speech acts model 39 / 67

Cooperative Principle Make your contribution as is required, when it is required, by the conversation in which you are engaged (Grice, 1975). 40 / 67

Conversational Implicatures ◮ Maxims of quality (Do not say what you believe to be false; do not say that for which you lack adequate evidence) e.g., Noah is a nice person 41 / 67

Conversational Implicatures ◮ Maxims of quality (Do not say what you believe to be false; do not say that for which you lack adequate evidence) e.g., Noah is a nice person ⇒ I believe that Noah is a nice person 42 / 67

Conversational Implicatures ◮ Maxims of quality (Do not say what you believe to be false; do not say that for which you lack adequate evidence) e.g., Noah is a nice person ⇒ I believe that Noah is a nice person ◮ Maxims of quantity Make you contribution as informative as is required (for the current purposes of the exchange); do not make your contribution more informative than is required ◮ I have two hands 43 / 67

Conversational Implicatures ◮ Maxims of quality (Do not say what you believe to be false; do not say that for which you lack adequate evidence) e.g., Noah is a nice person ⇒ I believe that Noah is a nice person ◮ Maxims of quantity Make you contribution as informative as is required (for the current purposes of the exchange); do not make your contribution more informative than is required ◮ I have two hands ⇒ I have no more than two hands 44 / 67

Rational Speech Acts Model Reference games (Wittgenstein, 1953; Frank and Goodman, 2012) ◮ Speaker. Imagine you are talking to someone and want to refer to the middle object. Would you say “blue” or “circle”? 45 / 67

Rational Speech Acts Model Reference games (Wittgenstein, 1953; Frank and Goodman, 2012) ◮ Speaker. Imagine you are talking to someone and want to refer to the middle object. Would you say “blue” or “circle”? ◮ Listener. Someone uses the word “blue” to refer to one of these objects. Which object are they talking about? 46 / 67

Rational Speech Acts Model Literal listener ( l 0 ) P l 0 ( s | u ) ∝ P ( s ) � u � ( s ) ◮ P ( s ) : the prior over states ◮ � u � ( s ) : a mapping from states of the world to truth values 47 / 67

Rational Speech Acts Model Literal listener ( l 0 ) P l 0 ( s | u ) ∝ P ( s ) � u � ( s ) ◮ P ( s ) : the prior over states ◮ � u � ( s ) : a mapping from states of the world to truth values ∀ s, P ( s ) = 1 / 3 blue 0.5 0.5 0 green 0 0 1 square 0.5 0 0.5 circle 0 1 0 48 / 67

Natural Language Processing (CSEP 517): Computational Pragmatics - PowerPoint PPT Presentation

Natural Language Processing (CSEP 517): Computational Pragmatics Chenhao Tan 2017 c University of Washington chenhao@chenhaot.com May 22, 2017 1 / 67 What do we use language for? 2 / 67 What do we use language for? Communicating with

CSEP 517 Natural Language Processing Language Models Luke Zettlemoyer Slides adapted from Dan

CSEP 517 Natural Language Processing Introduction Luke Zettlemoyer Slides adapted from Dan

CSEP 517: Natural Language Processing New PMP Course! Instructor: Luke Zettlemoyer Autumn 2013

CSEP 517 Natural Language Processing Autumn 2018 Introduction Luke Zettlemoyer Slides adapted

Natural Language Processing (CSEP 517): Introduction & Language Models Noah Smith c 2017

CSEP 517 Natural Language Processing Autumn 2015 Parsing (Trees) Yejin Choi - University of

CSEP 517 Natural Language Processing Frame Semantics Luke Zettlemoyer Slides adapted from Yejin

CSEP 517: Natural Language Processing Recurrent Neural Networks Autumn 2018 Luke Zettlemoyer

CSEP 517 Natural Language Processing Autumn 2015 Introduction Yejin Choi Slides adapted

CSEP 517 Natural Language Processing Luke Zettlemoyer Machine Translation, Sequence-to-sequence

Natural Language Processing (CSEP 517): Distributional Semantics Roy Schwartz 2017 c

Natural Language Processing (CSEP 517): Machine Translation (Continued), Summarization, &

CSEP 517 Natural Language Processing Coreference Resolution Luke Zettlemoyer University of

Natural Language Processing (CSEP 517): Dependency Syntax and Parsing Noah Smith 2017 c

CSEP 517 Natural Language Processing Text Classification Linear Models Luke Zettlemoyer -

CSEP 517 Natural Language Processing Autumn 2018 Distributed Semantics & Embeddings Luke

Ruth Wageman, Pauline Willis & Krister Lowe Agenda for today Introducing our team Ruth

Commission: Live the Story Keynote Session 3 Sarah Agnew 1 2 Gods Story: Ours for the

How good is your eyesight ? Can you see and hope for the human soul the way God does? Mat t hew

Urban Being The Future of City Living and the European Beds Sector Clare Thomas, Partner,

Fluctuation studies in Phobos Constantin Loizides for the

UML: Unified Modeling Language 1 Objectives of UML UML is a general purpose notation that is

CMP224 Cap on the total TNUoS target revenue to be recovered from generation users Place your

Clothing, Insulation, Clothing can keep you cool in very hot places Insulation controls

Natural Language Processing (CSEP 517): Computational Pragmatics - PowerPoint PPT Presentation

Natural Language Processing (CSEP 517): Computational Pragmatics Chenhao Tan 2017 c University of Washington chenhao@chenhaot.com May 22, 2017 1 / 67 What do we use language for? 2 / 67 What do we use language for? Communicating with

CSEP 517 Natural Language Processing Language Models Luke Zettlemoyer Slides adapted from Dan

CSEP 517 Natural Language Processing Introduction Luke Zettlemoyer Slides adapted from Dan

CSEP 517: Natural Language Processing New PMP Course! Instructor: Luke Zettlemoyer Autumn 2013

CSEP 517 Natural Language Processing Autumn 2018 Introduction Luke Zettlemoyer Slides adapted

Natural Language Processing (CSEP 517): Introduction &amp; Language Models Noah Smith c 2017

CSEP 517 Natural Language Processing Autumn 2015 Parsing (Trees) Yejin Choi - University of

CSEP 517 Natural Language Processing Frame Semantics Luke Zettlemoyer Slides adapted from Yejin

CSEP 517: Natural Language Processing Recurrent Neural Networks Autumn 2018 Luke Zettlemoyer

CSEP 517 Natural Language Processing Autumn 2015 Introduction Yejin Choi Slides adapted

CSEP 517 Natural Language Processing Luke Zettlemoyer Machine Translation, Sequence-to-sequence

Natural Language Processing (CSEP 517): Distributional Semantics Roy Schwartz 2017 c

Natural Language Processing (CSEP 517): Machine Translation (Continued), Summarization, &amp;

CSEP 517 Natural Language Processing Coreference Resolution Luke Zettlemoyer University of

Natural Language Processing (CSEP 517): Dependency Syntax and Parsing Noah Smith 2017 c

CSEP 517 Natural Language Processing Text Classification Linear Models Luke Zettlemoyer -

CSEP 517 Natural Language Processing Autumn 2018 Distributed Semantics &amp; Embeddings Luke

Ruth Wageman, Pauline Willis &amp; Krister Lowe Agenda for today Introducing our team Ruth

Commission: Live the Story Keynote Session 3 Sarah Agnew 1 2 Gods Story: Ours for the

How good is your eyesight ? Can you see and hope for the human soul the way God does? Mat t hew

Urban Being The Future of City Living and the European Beds Sector Clare Thomas, Partner,

Fluctuation studies in Phobos Constantin Loizides for the

UML: Unified Modeling Language 1 Objectives of UML UML is a general purpose notation that is

CMP224 Cap on the total TNUoS target revenue to be recovered from generation users Place your

Clothing, Insulation, Clothing can keep you cool in very hot places Insulation controls

Natural Language Processing (CSEP 517): Introduction & Language Models Noah Smith c 2017

Natural Language Processing (CSEP 517): Machine Translation (Continued), Summarization, &

CSEP 517 Natural Language Processing Autumn 2018 Distributed Semantics & Embeddings Luke

Ruth Wageman, Pauline Willis & Krister Lowe Agenda for today Introducing our team Ruth