End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning Authors: Jason D. Williams and Geoffrey Zweig Speaker: Hamidreza Shahidi
Outline ● Introduction ● Model description ● Optimizing with supervised learning ● Optimizing with reinforcement learning ● Conclusion
Task-oriented dialogue systems A dialog system for: ● Initiating phone calls to a contact in an address book ● Ordering a taxi ● Reserving a table at a restaurant
Reinforcement learning setting State = (user's goal, dialogue history) Actions = text actions (e.g., "Do you want to call <name>?") or API calls (e.g., PlacePhoneCall(<name>)) Reward = 1 for successfully completing the task, and 0 otherwise
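To make the formulation concrete, here is a minimal Python sketch of this setting; the dataclass, the example actions, and the reward helper are illustrative assumptions, not the paper's implementation.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class DialogState:
    """State = (user's goal, dialogue history); fields are placeholders."""
    user_goal: str                                      # e.g. "call Jason Williams"
    history: List[str] = field(default_factory=list)    # turns observed so far

# Actions are either text actions or API calls.
TEXT_ACTION = 'Do you want to call <name>?'
API_ACTION = 'PlacePhoneCall(<name>)'

def reward(task_completed: bool) -> float:
    """Reward = 1 for successfully completing the task, and 0 otherwise."""
    return 1.0 if task_completed else 0.0
```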
Model description
Model
User Input
Entity Extraction For example: identifying “Jason Williams” as a <name> entity
Entity Input For example: Maps from the text “Jason Williams” to a specific row in a database
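A minimal sketch of these two entity steps, assuming a toy in-memory address book; `ADDRESS_BOOK`, `extract_name`, and `resolve_name` are hypothetical names, since the paper relies on developer-provided entity code.

```python
import re

ADDRESS_BOOK = [
    {"row": 0, "name": "Jason Williams", "phone": "555-0100"},
    {"row": 1, "name": "Geoffrey Zweig", "phone": "555-0101"},
]

def extract_name(utterance: str):
    """Entity extraction: find a span of text tagged as a <name> entity."""
    for contact in ADDRESS_BOOK:
        if re.search(re.escape(contact["name"]), utterance, re.IGNORECASE):
            return contact["name"]
    return None

def resolve_name(name: str):
    """Entity input: map the <name> text to a specific database row."""
    for contact in ADDRESS_BOOK:
        if contact["name"].lower() == name.lower():
            return contact["row"]
    return None

name = extract_name("Please call Jason Williams on his cell")  # -> "Jason Williams"
row = resolve_name(name)                                        # -> 0
```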
Feature Vector
Recurrent Neural Network An LSTM is used because it can retain past observations over arbitrarily long spans of the dialogue.
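As a rough illustration of this component (an assumed PyTorch sketch, not the authors' network; dimensions are placeholders), an LSTM reads the per-turn feature vector and emits a distribution over dialog actions while carrying its hidden state across turns:

```python
import torch
import torch.nn as nn

class LstmPolicy(nn.Module):
    def __init__(self, feature_dim=128, hidden_dim=32, n_actions=16):
        super().__init__()
        self.lstm = nn.LSTM(feature_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, n_actions)

    def forward(self, features, state=None):
        # features: (batch, turns, feature_dim); state carries memory across turns
        h, state = self.lstm(features, state)
        logits = self.out(h)                    # (batch, turns, n_actions)
        probs = torch.softmax(logits, dim=-1)   # distribution over actions per turn
        return probs, state
```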
Action Mask If a target phone number has not yet been identified, the API action to place a phone call may be masked.
Re-normalization The probability of masked actions is set to 0, and the remaining probabilities are re-normalized into a distribution.
Sample Action RL: sample an action from the distribution. SL: select the action with the highest probability.
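The three steps above (mask, re-normalize, select) can be sketched in a few lines of numpy; the function name and the toy numbers are illustrative assumptions:

```python
import numpy as np

def apply_mask_and_choose(probs, action_mask, mode="RL", rng=np.random.default_rng()):
    """probs: model's distribution over actions; action_mask: 1 = allowed, 0 = masked."""
    masked = probs * action_mask          # Pr{masked actions} = 0
    masked = masked / masked.sum()        # re-normalize into a probability distribution
    if mode == "RL":                      # RL: sample from the distribution
        return int(rng.choice(len(masked), p=masked))
    return int(np.argmax(masked))         # SL: select the highest-probability action

probs = np.array([0.5, 0.3, 0.2])
mask = np.array([0.0, 1.0, 1.0])          # e.g. PlacePhoneCall masked: no number identified yet
action = apply_mask_and_choose(probs, mask, mode="SL")   # -> 1
```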
Entity Output
Taking Action
Training the Model
Optimizing with supervised learning
Prediction accuracy ● Loss = categorical cross entropy ● Training sets = 1, 2, 5, 10, and 20 dialogues ● Test set = one held-out dialogue
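A hedged sketch of this supervised step, reusing the `LstmPolicy` sketch from earlier (placeholder data and hyperparameters, not the authors' training code): the loss is the categorical cross entropy between the model's action distribution and the labeled action at each turn.

```python
import torch
import torch.nn as nn

policy = LstmPolicy(feature_dim=128, hidden_dim=32, n_actions=16)
optimizer = torch.optim.Adam(policy.parameters(), lr=1e-3)
loss_fn = nn.NLLLoss()   # with log-probabilities, this is categorical cross entropy

features = torch.randn(1, 10, 128)        # one training dialogue of 10 turns (placeholder)
labels = torch.randint(0, 16, (1, 10))    # labeled action at each turn (placeholder)

for epoch in range(20):
    probs, _ = policy(features)
    log_probs = torch.log(probs + 1e-8)
    loss = loss_fn(log_probs.flatten(0, 1), labels.flatten())
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```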
Active Learning The current model is run on unlabeled instances. The unlabeled instances for which the model is most uncertain are labeled. The model is rebuilt.
Active learning ● For active learning to be effective, the scores output by the model must be a good indicator of correctness. ● 80% of the actions with the lowest scores are incorrect. ● Re-training the LSTM is fast, so labeling low-scoring actions will rapidly improve performance.
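One way the selection step could look in code (an illustrative numpy sketch; `select_for_labeling` is a hypothetical helper, not the paper's tooling): run the current model over unlabeled turns, then surface the turns where its chosen action has the lowest score.

```python
import numpy as np

def select_for_labeling(action_probs, k=10):
    """action_probs: (n_turns, n_actions) distribution per unlabeled turn.
    Returns indices of the k turns whose top action has the lowest score."""
    top_scores = action_probs.max(axis=1)    # model's confidence in its chosen action
    return np.argsort(top_scores)[:k]        # least-confident turns first

probs = np.random.dirichlet(np.ones(16), size=100)   # placeholder model outputs
to_label = select_for_labeling(probs, k=10)
```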
Optimizing with reinforcement learning
Policy gradient The update is expressed in terms of: the dialog history at time t, the return of the dialogue, the weights of the LSTM, and the LSTM itself, which outputs a distribution over actions.
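The callouts above correspond to the pieces of a REINFORCE-style policy gradient update. A reconstruction consistent with those labels (notation assumed, not copied verbatim from the paper):

```latex
% h_t : dialog history at time t
% R   : return of the dialogue
% w   : weights of the LSTM
% \pi : the LSTM, which outputs a distribution over actions a_t
\[
  w \leftarrow w + \alpha \, R \sum_{t} \nabla_{w} \log \pi(a_t \mid h_t; w)
\]
```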
RL Evaluation
Conclusion 1. This paper takes a first step toward end-to-end learning for task-oriented dialog systems. 2. The LSTM automatically extracts a representation of the dialogue state (no hand-crafting). 3. Code provided by the developer can enforce business rules on the policy. 4. The model is trained using both SL and RL.
Thank you