Neural Conversational Models
Human: What is the purpose of living? Machine: To live forever.
Berkay Antmen, March 8, 2016
Conversational model • Purpose: Given previous sentences of the dialogue and context, output a response
• Why?
  • goal-driven dialogue systems (e.g. tech support)
  • non-goal-driven dialogue systems (e.g. language learning, video game characters)
• How?
  • discriminative or generative
  • heavily hand-crafted or data-driven systems
Demo (Cleverbot)
• http://www.cleverbot.com/
• http://www.cleverbot.com/conv/201603150055/VWU01366204_Hi-can-you-help-me (Troubleshooting)
• http://www.cleverbot.com/conv/201603150111/VWU01366307_Hello (Basic)
• http://www.cleverbot.com/conv/201603150120/VWU01366357_What-is-the-purpose-of-life (Philosophical)
• http://www.cleverbot.com/conv/201603150204/VWU01366635_We-are-no-strangers-to-love (extra)
Frameworks • sequence-to-sequence (seq2seq) • a classification problem over a known vocabulary at each output step • input: sequence of tokens • output: sequence of tokens image: Sutskever et al. 2014
Frameworks: seq2seq
• the goal: estimate p(T|S), factored as ∏_i p(w_i | S, w_1, …, w_{i−1}) where T = (w_1, …, w_N)
• problem: sequence boundaries; solution: a special end-of-sequence token (<EOS>)
• training: maximize Σ log p(T|S) over the training pairs (target given source)
• inference: T̂ = arg max_T p(T|S), approximated by beam search
equations: Sutskever et al. 2014
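A minimal Python sketch of the factorization behind this objective (not from the paper; `step_prob` is a hypothetical callback that returns the model's probability of a token given the encoded source and the already-decoded prefix): log p(T|S) is the sum of per-token conditional log-probabilities, with <EOS> closing the sequence.

```python
import math

def sequence_log_prob(step_prob, source_tokens, target_tokens):
    """log p(T|S) = sum_i log p(w_i | S, w_1..w_{i-1})."""
    log_p = 0.0
    prefix = []
    for token in list(target_tokens) + ["<EOS>"]:  # <EOS> marks the sequence boundary
        log_p += math.log(step_prob(source_tokens, prefix, token))
        prefix.append(token)
    return log_p

# Training maximizes the sum of sequence_log_prob over all (S, T) pairs;
# inference searches for argmax_T log p(T|S), approximated by beam search.
```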
Beam Search w=3 image: http://bit.ly/251bIfl
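A compact sketch of beam search with width w = 3, assuming a hypothetical `next_token_log_probs(source, prefix)` callback that returns (token, log-probability) pairs from the model: only the w highest-scoring partial hypotheses survive each step.

```python
import heapq

def beam_search(next_token_log_probs, source, width=3, max_len=20, eos="<EOS>"):
    """Approximate argmax_T log p(T|S) by keeping the `width` best prefixes."""
    beams = [(0.0, [])]                     # (cumulative log-prob, token prefix)
    finished = []
    for _ in range(max_len):
        candidates = []
        for score, prefix in beams:
            if prefix and prefix[-1] == eos:
                finished.append((score, prefix))      # hypothesis already ended
                continue
            for token, log_p in next_token_log_probs(source, prefix):
                candidates.append((score + log_p, prefix + [token]))
        if not candidates:
            break                                     # every beam has finished
        beams = heapq.nlargest(width, candidates, key=lambda c: c[0])
    finished.extend(beams)
    return max(finished, key=lambda c: c[0])[1]
```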
A Neural Conversational Model • IT helpdesk dataset of conversations (closed-domain) • OpenSubtitles movie transcript dataset (open-domain) • Experiments: troubleshooting, general knowledge, philosophical etc.
A Neural Conversational Model
• training: maximize the log-probability of the correct sequence given its context (equivalently, minimize the cross entropy)
• (aside) how is the cross entropy measured when the true distribution of the words in the corpus is not known? Monte Carlo estimation: the training set is treated as samples from the true distribution
• inference: greedy search
image: Chris Olah
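The greedy search mentioned above is just the width-1 case: take the argmax token at every step. A minimal sketch with the same hypothetical `next_token_log_probs` callback as in the beam search example:

```python
def greedy_decode(next_token_log_probs, source, max_len=20, eos="<EOS>"):
    """Greedy search: pick the single most probable token at each step
    (equivalent to beam search with width 1)."""
    prefix = []
    for _ in range(max_len):
        token, _ = max(next_token_log_probs(source, prefix), key=lambda c: c[1])
        if token == eos:
            break
        prefix.append(token)
    return prefix
```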
Some results (troubleshooting) Password issues Browser issues Cleverbot: http://www.cleverbot.com/conv/201603150055/VWU01366204_Hi-can-you-help-me
Some more results Basic Contexts and multiple choice Cleverbot: http://www.cleverbot.com/conv/201603150111/VWU01366307_Hello
Some more results Philosophical Opinions Cleverbot: http://www.cleverbot.com/conv/201603150120/VWU01366357_What-is-the-purpose-of-life
Evaluation
• Perplexity measures how well a model predicts the given samples
• perplexity = 2^{H_q(x_1, …, x_n)} = 2^( −Σ_j q(x_j) · log₂ q(x_j) ), where q is the model distribution and x_1, …, x_n are the samples

Experiment                     Model                         Perplexity
IT Helpdesk Troubleshooting    N-grams                       18
IT Helpdesk Troubleshooting    Neural conversational model    8
OpenSubtitles                  N-grams                       28
OpenSubtitles                  Neural conversational model   17
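A small sketch of how such test-set perplexities are estimated in practice, following the Monte Carlo aside from the training slide: the exponent is the average negative log2-probability the model assigns to held-out tokens. `model_prob` is a hypothetical callback giving q(token | history).

```python
import math

def perplexity(model_prob, tokens):
    """perplexity = 2 ** (average -log2 q(token | history)) over the samples."""
    bits = sum(-math.log2(model_prob(tokens[:i], tok)) for i, tok in enumerate(tokens))
    return 2 ** (bits / len(tokens))

# Reading the table: perplexity 8 corresponds to log2(8) = 3 bits of uncertainty
# per token (neural model, IT Helpdesk) versus ~4.2 bits for n-grams (18).
```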
Evaluation
• human evaluation against a rule-based bot (CleverBot)
• asked a list of questions to both models
• judges picked the bot they preferred
• Mechanical Turk

# questions   # judges   # prefer neural model   # prefer CleverBot   # tie   # disagreement
200           4          97                      60                   20      23
Wrong objective function?
• the answers are not diverse, i.e. the model tends to give the most probable (generic) answers without conveying much information
• e.g. S = “How old are you?”, T = “I don’t know.” → p(T|S) high, p(S|T) low
• e.g. S = “How old are you?”, T = “I am 10 years old” → p(T|S) lower, p(S|T) higher
• not really obvious from the selected examples in the paper
A Diversity-Promoting Objective Function for Neural Conversation Models Li et al. 2015
A Diversity-Promoting Objective Function for Neural Conversation Models
• an alternative objective function: Maximum Mutual Information (MMI)
• maximize mutual information between source (S) and target (T)
• I(S, T) = log( p(S, T) / (p(S) · p(T)) )
• T̂ = arg max_T { log p(T|S) − λ · log p(T) }
• recall the previous objective: T̂ = arg max_T log p(T|S)
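One simple way to use the MMI objective is as a reranking step over an N-best list produced under the standard log p(T|S) objective. This sketch assumes hypothetical scorers `log_p_t_given_s` (the seq2seq model) and `log_p_t` (a language model); Li et al. 2015 describe the practical variants (MMI-antiLM, MMI-bidi) in more detail.

```python
def mmi_rerank(candidates, log_p_t_given_s, log_p_t, source, lam=0.5):
    """Pick the response maximizing log p(T|S) - lambda * log p(T).

    Penalizing high-probability (generic) targets such as "I don't know"
    favors responses that carry more information about the source."""
    def mmi_score(target):
        return log_p_t_given_s(target, source) - lam * log_p_t(target)
    return max(candidates, key=mmi_score)
```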
Some results (OpenSubtitles)
Some results (Twitter)
Frameworks • Hierarchical Recurrent Encoder-Decoder (HRED) image: Serban et al. 2015
Frameworks: HRED • Motivation?
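A structural sketch of the HRED idea and its usual motivation: carry dialogue state across utterance boundaries instead of conditioning on one flat token sequence. All function names here are hypothetical placeholders, not the authors' code.

```python
def hred_respond(dialogue_turns, encode_utterance, context_step,
                 decode_response, initial_context):
    """dialogue_turns: list of token lists, one per previous utterance."""
    context = initial_context
    for turn in dialogue_turns:
        turn_vector = encode_utterance(turn)           # utterance-level encoder RNN
        context = context_step(context, turn_vector)   # dialogue-level context RNN
    return decode_response(context)                    # decoder conditioned on context
```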
Hierarchical Neural Network Generative Models for Movie Dialogues • Non-goal driven: can be easily adapted to specific tasks • Bootstrapping • from word embeddings OR • from a large non-dialogue corpus (the Q-A SubTle corpus, containing ~5.5M Q-A pairs) • Interactive dialogue structure • end-of-utterance token • continued-utterance token
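A minimal sketch of the word-embedding bootstrapping option (assumptions: `pretrained` is a hypothetical word-to-vector dictionary, e.g. word2vec output; words it does not cover keep a small random initialization):

```python
import numpy as np

def init_embeddings(vocab, pretrained, dim, scale=0.1, seed=0):
    """Copy pretrained vectors into the embedding matrix where available."""
    rng = np.random.RandomState(seed)
    emb = rng.uniform(-scale, scale, size=(len(vocab), dim))
    for i, word in enumerate(vocab):
        if word in pretrained:
            emb[i] = pretrained[word]    # bootstrapped row
    return emb
```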
Dataset • why movie scripts? • large dataset • wide range of topics • long dialogues with few participants • relatively few spelling mistakes and acronyms • similar to human spoken conversations • mostly a single dialogue thread • atomic entries are triples (three consecutive utterances) • 13M words total; 10M in training
Evaluations (movie dialogue generation) • test set perplexity and classification errors when bootstrapping from SubTle corpus
Evaluations
Future work? • study longer dialogues (as opposed to triples) • bootstrapping on other large non-dialogue datasets
Thank you! Questions?
References
• seq2seq: Sutskever et al. 2014, http://arxiv.org/abs/1409.3215
• neural conversational model: Vinyals & Le 2015, http://arxiv.org/abs/1506.05869
• hierarchical (HRED): Sordoni et al. 2015, http://arxiv.org/abs/1507.02221
• hierarchical conversational: Serban et al. 2015, http://arxiv.org/abs/1507.04808
• MMI: Li et al. 2015, http://arxiv.org/abs/1510.03055