Generative Deep Neural Networks for Dialogue Presented By Shantanu Kumar Adapted from slides by Iulian Vlad Serban
What are Dialogue Systems?
• A computer system that can hold a coherent, sensible conversation with a human
• Types of Dialogue
  • Open Domain
  • Task Oriented
Applications of Dialogue Systems • Technical Support • Product enquiry • Website navigation • HR helpdesk • Error diagnosis • IVR system in Call Centres • Entertainment • IoT interface • Virtual Assistants • Siri, Cortana, Google Assistant • Assistive technology • Simulate human conversations
How do we build such a system??
Traditional Pipeline models
End-To-End models with DL
Dialogue Context → Neural Network → Response
The same end-to-end network can additionally consult a Knowledge Database and emit Actions alongside the Response.
What is a good Chatbot? The responses should be • Grammatical • Coherent • In Context • Ideally non-Generic responses
How can we learn the model?
- Unsupervised Learning (Generative Models)
  - Maximise likelihood w.r.t. words
- Supervised Learning
  - Maximise likelihood w.r.t. annotated labels
- Reinforcement Learning
  - Learning from real users
  - Learning from simulated users
  - Learning with a given reward function
Generative Dialogue Modeling
Decomposing dialogue probability: $P(U_1, \dots, U_T) = \prod_{t=1}^{T} P(U_t \mid U_1, \dots, U_{t-1})$
Decomposing utterance probability: $P(U_t \mid U_{<t}) = \prod_{n=1}^{N_t} P(w_{t,n} \mid w_{t,1}, \dots, w_{t,n-1}, U_{<t})$
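A minimal Python sketch (assumed `model.token_log_prob` interface, not the paper's code) of how the two factorisations above become a training objective:

```python
def dialogue_log_likelihood(model, dialogue):
    """Chain-rule factorisation from the slide:
    log P(U_1..U_T) = sum_t sum_n log P(w_{t,n} | w_{t,<n}, U_{<t}).
    `model.token_log_prob(context, prefix, token)` is a hypothetical interface."""
    total = 0.0
    for t, utterance in enumerate(dialogue):       # utterance = list of tokens
        context = dialogue[:t]                     # U_{<t}
        for n, token in enumerate(utterance):
            prefix = utterance[:n]                 # w_{t,<n}
            total += model.token_log_prob(context, prefix, token)
    return total

# Maximum-likelihood training maximises this quantity summed over the corpus;
# word perplexity (later slides) is exp(-log-likelihood / number of tokens).
```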
Generative Dialogue Modeling Maximising likelihood on fixed corpora - Imitating human dialogues
Generative Dialogue Modeling
Models proposed with three inductive biases
• Long-term memory
  - Recurrent units (GRU)
• High-level compositional structure
  - Hierarchical structure
  - Multi-resolution representation (MrRNN paper)
• Representing uncertainty and ambiguity
  - Latent variables (MrRNN and VHRED)
Hierarchical Recurrent Encoder-Decoder (HRED)
- Encoder RNN
  - Encodes each utterance independently into an utterance vector
- Context RNN
  - Encodes the topic/context of the dialogue up to the current utterance from the utterance vectors
- Decoder RNN
  - Predicts the next utterance
[Akshay: Can be applied to arbitrary lengths]
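A minimal PyTorch sketch of the three RNNs just described; the layer sizes and the way the decoder is conditioned on the context state are illustrative assumptions, not the exact published architecture:

```python
import torch
import torch.nn as nn

class HRED(nn.Module):
    """Sketch of HRED: utterance encoder, context RNN, decoder (all GRUs)."""

    def __init__(self, vocab_size, emb_dim=300, enc_dim=512, ctx_dim=1024):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.encoder = nn.GRU(emb_dim, enc_dim, batch_first=True)   # per utterance
        self.context = nn.GRUCell(enc_dim, ctx_dim)                 # across utterances
        self.decoder = nn.GRU(emb_dim, ctx_dim, batch_first=True)   # next utterance
        self.out = nn.Linear(ctx_dim, vocab_size)

    def forward(self, utterances, target):
        # utterances: list of (batch, len_t) token tensors; target: (batch, len) tokens
        batch = target.size(0)
        ctx_state = torch.zeros(batch, self.context.hidden_size)
        for utt in utterances:
            _, h = self.encoder(self.embed(utt))            # h: (1, batch, enc_dim)
            ctx_state = self.context(h.squeeze(0), ctx_state)
        # condition the decoder on the dialogue context and predict the next utterance
        dec_out, _ = self.decoder(self.embed(target), ctx_state.unsqueeze(0))
        return self.out(dec_out)                            # (batch, len, vocab) logits
```

The logits are trained with token-level cross-entropy against the reference next utterance, i.e. the likelihood objective from the earlier slide.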
Hierarchical Recurrent Encoder-Decoder (HRED)
Hierarchical Recurrent Encoder-Decoder (HRED)
Bidirectional HRED
- Encoder RNN -> Bidirectional
  - Forward and backward RNNs combined into a fixed-length representation
    - Concat of the last state of each RNN
    - Concat of L2 pooling over the temporal dimension
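A hedged sketch of the two combination strategies listed above; sizes are illustrative and "L2 pooling" is read as the per-dimension root-mean-square over time:

```python
import torch
import torch.nn as nn

emb_dim, enc_dim = 300, 512
encoder = nn.GRU(emb_dim, enc_dim, batch_first=True, bidirectional=True)

def encode_utterance(embedded):                      # embedded: (batch, len, emb_dim)
    outputs, h_n = encoder(embedded)                 # h_n: (2, batch, enc_dim)
    # Option 1: concatenate the last state of the forward and backward RNNs
    concat_last = torch.cat([h_n[0], h_n[1]], dim=-1)            # (batch, 2*enc_dim)
    # Option 2: concatenate L2 pooling of each direction over the time dimension
    fwd, bwd = outputs[..., :enc_dim], outputs[..., enc_dim:]
    l2_fwd = fwd.pow(2).mean(dim=1).sqrt()                       # (batch, enc_dim)
    l2_bwd = bwd.pow(2).mean(dim=1).sqrt()
    concat_l2 = torch.cat([l2_fwd, l2_bwd], dim=-1)              # (batch, 2*enc_dim)
    return concat_last, concat_l2
```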
Hierarchical Recurrent Encoder-Decoder (HRED)
Bootstrapping
- Initialising with Word2Vec embeddings
  - Trained on the Google News dataset
- Pre-training on the SubTle Q-A dataset
  - 5.5M Q-A pairs
  - Converted to 2-turn dialogues D = {U1 = Q, U2 = A}
[Prachi: 2-stage training]
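A hedged sketch of the first bootstrapping step, initialising the embedding layer from the pre-trained Google News Word2Vec vectors; the gensim/PyTorch calls are an assumption of this write-up, not the paper's code:

```python
import numpy as np
import torch
import torch.nn as nn
import gensim.downloader as api

# Pre-trained 300-d Word2Vec vectors trained on Google News, as on the slide.
word2vec = api.load("word2vec-google-news-300")

def init_embeddings(vocab):
    """vocab: list of the 10K most popular tokens from the corpus."""
    emb = np.random.normal(scale=0.1, size=(len(vocab), 300)).astype("float32")
    for idx, token in enumerate(vocab):
        if token in word2vec:                 # keep the random init for OOV tokens
            emb[idx] = word2vec[token]
    # freeze=False so the embeddings are fine-tuned during dialogue training
    return nn.Embedding.from_pretrained(torch.from_numpy(emb), freeze=False)
```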
Dataset - MovieTriples dataset
• Open Domain - Wide variety of topics covered
• Names and numbers replaced with <person> and <number> tokens
• Vocab of 10K most popular tokens
• Special <continued-utterance> and <end-of-utterance> tokens to capture breaks
[Gagan, Rishabh, Dinesh: Why only triples?]
[Anshul: Split train/val on movies?]
Dialogue Modeling
Ubuntu Dialog Corpus
- Goal-driven: users resolve technical problems
- ~0.5M dialogues
Twitter Dialog Corpus
- Open-domain: social chit-chat
- ~0.75M dialogues in Train, 100K for Val and Test
- On average 6.27 utterances and 94 tokens per dialogue
Example - Ubuntu Corpus
User: Hello! Recently I updated to ubuntu 12.04 LTS and I am unsatisfied by its performance. I am facing a bug since the upgrade to 12.04 LTS. Can anyone help??????????
Expert: You need to give more details on the issue.
User: Every time I login it gives me "System Error" pop up. It is happing since I upgraded to 12.04.
Expert: Send a report, or cancel it.
User: I have already done that but after few min, it pops up again...
Example - Twitter Corpus
Person A: Hanging out in the library for the past couple hours makes me feel like I'll do great on this test!
Person B: @smilegirl400 wow, what a nerd lol jk haha =p what!? you changed your bio =(
Person A: @smileman400 Do you like my bio now? I feel bad for changing it but I like change. =P
Person B: @smilegirl400 yes I do =) It definitely sums up who you are lisa.
Person A: Yay! you still got me =)
Evaluation Metric
- Word Perplexity
  - Measures the probability of generating the exact reference utterance
- Word Error-Rate
  - Number of words in the dataset the model predicted incorrectly, divided by the total number of words in the dataset
  - Penalises diversity [Akshay]
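A short sketch of both metrics as defined on this slide, assuming the model exposes summed reference log-probabilities and per-position token predictions (hypothetical inputs):

```python
import math

def word_perplexity(total_log_prob, num_tokens):
    """total_log_prob: summed log P over all reference tokens in the test set."""
    return math.exp(-total_log_prob / num_tokens)

def word_error_rate(predicted, reference):
    """Slide's definition: wrongly predicted words / total words
    (position-wise comparison, not edit distance)."""
    errors = sum(p != r for p, r in zip(predicted, reference))
    errors += abs(len(predicted) - len(reference))   # length mismatch counted as errors
    return errors / max(len(reference), 1)
```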
Evaluation Metric
- Word Perplexity
  - Can only be used with generative models
  - Given an utterance, what is its probability under the model?
Evaluation Metric
How do we evaluate a given output utterance?
- Multi-modal output: the space of possible valid utterances is huge
- Human annotation is expensive and slow
Automatic Evaluation Metrics
- Word-overlap measures (BLEU, ROUGE, Levenshtein dist.)
- Embedding-based measures
- Poor correlation with human annotation
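For the word-overlap measures, a common reference implementation is NLTK's sentence-level BLEU; a hedged usage sketch (NLTK is this write-up's choice, not something the paper prescribes):

```python
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

reference = ["you", "need", "to", "give", "more", "details", "on", "the", "issue"]
hypothesis = ["please", "give", "more", "details", "about", "the", "issue"]

# Smoothing avoids zero scores when a higher-order n-gram has no overlap,
# which is common for short dialogue responses.
score = sentence_bleu([reference], hypothesis,
                      smoothing_function=SmoothingFunction().method1)
print(f"BLEU: {score:.3f}")
```

High overlap with a single reference is a weak proxy here, since many valid responses share no words with it, which is exactly the poor-correlation problem noted above.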
Results
[Annotation: Lack of error analysis]
MAP Output
- Most probable last utterance
- Found using beam search for a better approximation
- Generic responses observed
- Stochastic sampling gives more diverse dialogues
[Nupur: MAP vs Stochastic Sampling]
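A minimal sketch contrasting the two decoding strategies; greedy argmax stands in for beam search, and `model.next_token_distribution` / `model.eos_token` are assumed interfaces:

```python
import torch

def decode(model, context, max_len=30, sample=False):
    """Approximate MAP (greedy) vs. stochastic decoding of the next utterance.
    `model.next_token_distribution(context, generated)` is assumed to return
    a (vocab,) tensor of next-token probabilities."""
    generated = []
    for _ in range(max_len):
        probs = model.next_token_distribution(context, generated)
        if sample:
            token = torch.multinomial(probs, num_samples=1).item()  # diverse outputs
        else:
            token = torch.argmax(probs).item()                      # generic outputs
        if token == model.eos_token:
            break
        generated.append(token)
    return generated
```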
Extensions Model • [Barun][Rishabh] Attention model during decoding for long contexts • [Prachi] Dialogue systems with multiple participants • Different decoders for each participant? • Order of speaking • [Rishabh] Incorporating outside knowledge using KB
Extensions Data
• [Akshay][Surag] Use bigger datasets like Reddit for dialogue
• [Rishabh] Using dialogue scripts from films like "Ek ruka hua fasla" might be useful
• [Barun] Artificially scoring generic responses
• [Surag] Prune generic responses from training data
Extensions • [Prachi] Automatic generation of dialogue for movie given storyline and character description • [Gagan] Pre-train word embeddings on SubTle • [Arindam] RL is the best bet to avoid generic responses • [Arindam] Adversarial evaluation • [Arindam] Train additional context to add consistency?
Thank You