Learning to Control the Specificity in Neural Response Generation
Ruqing Zhang, Jiafeng Guo, Yixing Fan, Yanyan Lan, Jun Xu, Xueqi Cheng
1. CAS Key Lab of Network Data Science and Technology, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
2. University of Chinese Academy of Sciences, Beijing, China
Background - Dialog
Task-Oriented Dialog
• Personal assistant, helps people complete specific tasks
• Combination of rules and statistical components
Chit-Chat Dialog
• No specific goal, attempts to produce natural responses
• Uses variants of the seq2seq model
Background – Neural Model
Seq2Seq framework
• treats all the utterance-response pairs uniformly
• employs a single model to learn the mapping between utterance and response
• favors general responses with high frequency
Rank-frequency distribution
• utterance-response is an n-to-1 relationship
• e.g., the response "Must support! Cheer!" is used for 1216 different input utterances (other frequent general responses: "Support!", "It's good.", "My friends and I are shocked!")
TA-Seq2Seq
• pre-defines a set of topics from an external corpus
• relies on an external corpus
MARM
• introduces latent responding factors to model multiple responding mechanisms
• lacks interpretation
How to capture different utterance-response relationships?
Existing heuristics: conversation context, topic information, keywords, coherence, scenarios
Our motivation comes from the human conversation process.
Human Conversation Process
Utterance: "Do you know a good eating place for Australian special food?"
• general response: "I don't know."
• specific response: "Good Australian eating places include steak, seafood, cake, etc. What do you want to choose?"
Latent factors behind the choice: knowledge state, dialogue partner, current mood.
Key Idea
• introduce an explicit specificity control variable s to represent the response purpose
  - s summarizes many latent factors (knowledge state, dialogue partner, current mood) into one variable
  - s has an explicit meaning on specificity
  - s actively controls the generation of the response
Model Architecture
• the specificity control variable s is introduced into the Seq2Seq model
• single model -> multiple models: different <utterance, response> pairs, different s, different models
• each word representation is split into two parts:
  - semantic representation: relates to the semantic meaning
  - usage representation: relates to the usage preference
• semantic-based & specificity-based generation, e.g. P("John") = α · P_sem("John") + β · P_spec("John")
[Architecture diagram: the encoder reads "what is your name?" into semantic and usage representations; the decoder, with attentive read and a Gaussian kernel layer driven by the specificity control variable s, generates "my name is John <eos>"]
Model - Encoder
• Bi-RNN: models the utterance from both forward and backward directions
  - forward states {h→_1, …, h→_T}, backward states {h←_T, …, h←_1}
  - each position is represented by the concatenation h_t = [h→_t ; h←_{T−t+1}]
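The bidirectional encoding above can be sketched with a toy one-dimensional RNN cell; the cell, its weights, and the inputs are all illustrative assumptions, not the paper's actual parameterization:

```python
import math

def rnn_step(h, x, w_h=0.5, w_x=0.5):
    # toy one-dimensional RNN cell: h' = tanh(w_h * h + w_x * x)
    return math.tanh(w_h * h + w_x * x)

def bi_rnn(xs):
    """Run the toy RNN over the utterance in both directions and
    concatenate the per-position states, as a Bi-RNN encoder does."""
    fwd, h = [], 0.0
    for x in xs:                  # left-to-right pass
        h = rnn_step(h, x)
        fwd.append(h)
    bwd, h = [], 0.0
    for x in reversed(xs):        # right-to-left pass
        h = rnn_step(h, x)
        bwd.append(h)
    bwd.reverse()                 # align backward states with positions
    return [(f, b) for f, b in zip(fwd, bwd)]

# hypothetical scalar "embeddings" of the four tokens of "what is your name"
states = bi_rnn([0.1, 0.9, 0.3, 0.7])
```

Each element of `states` plays the role of h_t = [h→_t ; h←_t′]: the forward component summarizes the prefix, the backward component the suffix.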
Model - Decoder
• predict the target word based on a mixture of two probabilities: the semantic-based and the specificity-based generation probability

  p(w_t) = α · p_sem(w_t) + β · p_spec(w_t)

• semantic-based probability: decides what to say next given the input

  p_sem(w_t = x) ∝ exp( x_sem^T (W_M f_{t−1} + b_M) )

  where f_{t−1} is the decoder hidden state and x_sem is the word's semantic representation
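The mixture above can be sketched numerically; the vocabulary, scores, and mixture weights below are made-up illustrative values, not outputs of the trained model:

```python
import math

def softmax(scores):
    # numerically stable softmax over a list of scores
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

# hypothetical vocabulary and decoder scores for one time step
vocab = ["my", "name", "is", "john", "<eos>"]
semantic_scores = [1.2, 0.4, 0.1, 2.0, -0.5]          # assumed x_sem^T (W_M f + b_M)
specificity_probs = [0.05, 0.10, 0.05, 0.70, 0.10]    # assumed output of the Gaussian kernel layer

alpha, beta = 0.6, 0.4   # mixture weights, alpha + beta = 1

p_sem = softmax(semantic_scores)
p_mix = [alpha * ps + beta * pu for ps, pu in zip(p_sem, specificity_probs)]
# p_mix is a valid distribution: both components sum to 1 and alpha + beta = 1
```

Since both components already sum to one, the mixture stays a proper distribution for any alpha + beta = 1.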
Model - Decoder
• specificity-based probability: decides how specific we should reply
• Gaussian Kernel layer
  - the specificity control variable interacts with the usage representation of words through the layer
  - it lets the word usage representation regress to the variable s through a mapping function (sigmoid)

  p_spec(w_t = x) = (1 / √(2πσ²)) · exp( −(Ψ_U(v, x) − s)² / (2σ²) ),  with Ψ_U(v, x) = sigmoid( x_u^T (W_U v + c_U) )

  where x_u is the word's usage representation and σ² is the variance of the kernel
• specificity control variable s ∈ [0, 1]
  - 0 denotes the most general response
  - 1 denotes the most specific response
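A minimal sketch of the Gaussian kernel scoring, assuming a toy vocabulary with hand-picked usage scores (the projection x_u^T(W_U v + c_U) is replaced by a single assumed number per word, and the bandwidth sigma is arbitrary):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def gaussian_kernel_score(usage_score, s, sigma=0.1):
    """Unnormalized specificity-based score for one word:
    psi = sigmoid(usage_score) maps the word into [0, 1], then a
    Gaussian kernel centred at the control variable s scores it."""
    psi = sigmoid(usage_score)
    return (1.0 / math.sqrt(2 * math.pi * sigma ** 2)) * \
        math.exp(-(psi - s) ** 2 / (2 * sigma ** 2))

# hypothetical projected usage scores for a toy vocabulary
usage_scores = {"good": -2.0, "steak": 1.5, "seafood": 2.0, "know": -1.0}

picks = {}
for s in (0.0, 1.0):
    scores = {w: gaussian_kernel_score(u, s) for w, u in usage_scores.items()}
    z = sum(scores.values())
    probs = {w: v / z for w, v in scores.items()}   # normalize over the vocabulary
    picks[s] = max(probs, key=probs.get)
```

With s = 0 the kernel favors words whose mapped usage value is near 0 (general words); with s = 1 it favors words near 1 (specific words).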
Model Training
• Objective function – log likelihood

  ℒ = Σ_{(Y, Z)∈D} log P(Z | Y, s; θ)

• Training data: triples (Y, Z, s)
• s is not directly available in the raw conversation corpus
How to obtain s to learn our model? We propose to acquire distant labels for s.
Distant Supervision
• Normalized Inverse Response Frequency (NIRF)
  - a response is more general if it corresponds to more input utterances
  - based on the Inverse Response Frequency (IRF) in a conversation corpus
• Normalized Inverse Word Frequency (NIWF)
  - a response is more specific if it contains more specific words
  - based on the maximum Inverse Word Frequency (IWF) over all the words in a response
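The NIWF label can be sketched as follows on a toy corpus. The exact IWF formula and normalization in the paper may differ; here IWF(w) = log(total / count(w)) with min-max normalization is an assumption for illustration:

```python
import math
from collections import Counter

# toy response corpus (echoing the slides' examples)
corpus = [
    "i don't know",
    "i don't know what to say",
    "good australian eating places include steak and seafood",
    "must support cheer",
]

# corpus-level word frequencies
word_counts = Counter(w for resp in corpus for w in resp.split())
total = sum(word_counts.values())

def max_iwf(response):
    """Take the maximum inverse word frequency over the response's words:
    one rare word is enough to make the response specific."""
    return max(math.log(total / word_counts[w]) for w in response.split())

raw = [max_iwf(r) for r in corpus]
lo, hi = min(raw), max(raw)
labels = [(v - lo) / (hi - lo) for v in raw]   # min-max normalize to [0, 1]
```

Under this sketch the bland "i don't know" gets the lowest label, while responses containing corpus-rare words get labels near 1, matching the intended use of s as a distant supervision signal.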
Specificity Controlled Response Generation
• Given a new input utterance, we can generate responses at different specificity levels by varying the control variable s
• Different s, different models, different responses
  - s = 1: the most informative (specific) response
  - s ∈ (0, 1): more dynamic, enriches the styles of the response
  - s = 0: the most general response
Experiments - Dataset
• Short Text Conversation (STC) dataset
  - released in NTCIR-13
  - a large repository of post-comment pairs from Sina Weibo
  - 3.8 million post-comment pairs
  - segmented with the Jieba Chinese word segmenter
Experiments – Model Analysis
1. We vary the control variable s by setting it to five different values (0, 0.2, 0.5, 0.8, 1)
2. NIWF (word-based) is a good distant label for the response specificity
Experiments – Model Analysis
1. Varying the variable s from 0 to 1, the generated responses turn from general to specific
2. Different s -> different models -> different focus
Experiments – Comparisons
When s = 1, our SC-Seq2Seq_NIWF model achieves the best specificity performance
Experiments – Comparisons
1. Our SC-Seq2Seq_NIWF model best fits the ground-truth data
2. There are diverse responses in real data in terms of specificity
Experiments – Comparisons
1. SC-Seq2Seq_{NIWF, s=1} generates the most informative and interesting responses, and the least general responses, among all the baseline models
2. The largest kappa value is achieved by SC-Seq2Seq_{NIWF, s=0}
Experiments – Case study The responses generated by the four baselines are often quite general and short
Experiments – Case study
As s decreases from 1 to 0, SC-Seq2Seq_NIWF moves from very long and specific responses to more general and shorter responses.
Experiments – Analysis
1. Neighbors based on semantic representations are semantically related
2. Neighbors based on usage representations are less semantically related, but share similar specificity levels
Conclusion
• We argue
  - employing a single model to learn the mapping between the utterance and response will inevitably favor general responses
• We propose
  - an explicit specificity control variable introduced into the Seq2Seq model to handle different utterance-response relationships in terms of specificity
• Future work
  - employ reinforcement learning techniques to learn to adjust the control variable depending on users' feedback
  - apply the approach to other tasks, such as summarization, QA, etc.
Thanks Q & A • Name: Ruqing Zhang | Email: zhangruqing@software.ict.ac.cn