Dialogue Summarization
Presenter: Wang Chen
Mentor: Piji Li
Outline
• Introduction
  • Task Definition & Applications
  • Taxonomy Based on Data Type
  • Challenges
• Recent Work
  • Keep Meeting Summaries on Topic: Abstractive Multi-Modal Meeting Summarization. ACL short, 2019.
  • Automatic Dialogue Summary Generation for Customer Service. KDD 2019.
• Conclusion
Task Definition & Applications
• Definition: Given an input dialogue, the goal is to generate a summary that captures the highlights of the dialogue.
• Example datasets: DiDi Customer Service [2], SAMSum [4], Dial2Desc [3]
Task Definition & Applications
• Applications:
  • Automatic meeting summarization (AMI [7])
  • Medical conversation summarization [8]
  • Customer service summarization (DiDi [6])
  • …
Taxonomy Based on Data Type
• Video & Audio
  • Datasets: AMI, meeting sum [7]; …
  • Models: TopicSeg + VFOA [1]; HAS + RL [9]; …
• Audio Only
  • Datasets: Nurse-to-Patient Dialogue Data [10]; …
  • Models: PG-Net + TA [5]; …
• Text Only
  • Datasets: DialSum [6]; SAMSum [4]; DiDi Customer Service [2]; Dial2Desc [3]; …
  • Models: Leader-Writer [2]; Enhanced Interaction Dialogue Encoder [3]; Sentence-Gated [6]; …
Challenges
• Logicality: the summary should be organized in a readable order.
• Integrity: all the important facts should be covered.
• Correctness: the summary should be consistent with the facts in the dialogue.
• Other challenges shared with general text generation:
  • Fluency
  • Evaluation metrics
  • …
Taxonomy Based on Data Type (recap: Video & Audio, TopicSeg + VFOA [1])
Keep Meeting Summaries on Topic
• Main Contributions:
  • Proposed a novel hierarchical attention mechanism across three levels: topic segment, utterance, and word.
  • Introduced a multi-modal feature, the Visual Focus of Attention (VFOA), to help recognize the important utterances.
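As a rough illustration of the three-level attention (topic segment, utterance, word), the sketch below combines softmax weights multiplicatively, so a word's final weight is P(segment) · P(utterance | segment) · P(word | utterance). The raw scores would come from learned query-key comparisons in the real model; the function and its inputs here are hypothetical placeholders, not the paper's parameterization.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def hierarchical_attention(seg_scores, utt_scores, word_scores):
    """Combine attention at segment, utterance, and word level.

    seg_scores:  (S,) raw scores for topic segments
    utt_scores:  list of (U_s,) arrays, one per segment
    word_scores: list of lists of (W_u,) arrays, one per utterance
    Returns nested word weights that sum to 1 over the whole transcript.
    """
    a_seg = softmax(seg_scores)
    weights = []
    for s, utts in enumerate(utt_scores):
        a_utt = softmax(utts)
        seg_w = []
        for u, words in enumerate(word_scores[s]):
            a_word = softmax(words)
            # final weight of a word = P(segment) * P(utterance|segment) * P(word|utterance)
            seg_w.append(a_seg[s] * a_utt[u] * a_word)
        weights.append(seg_w)
    return weights
```

Because each level is separately normalized, the product is automatically a valid distribution over all words in the transcript.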
Keep Meeting Summaries on Topic
• Why is the Visual Focus of Attention (VFOA) feature useful?
  • A widely accepted assumption is that an utterance is more important if its speaker receives more attention.
• One data sample from the AMI corpus: the color indicates the attention received by the speaker PM (Project Manager). Other roles: ME (Marketing Expert), ID (Industrial Designer), UI (User Interface).
Keep Meeting Summaries on Topic
• TopicSeg + VFOA
• We formulate a meeting transcript as a list of triples X = {(p_j, f_j, u_j)}:
  • p_j ∈ S^|Q| is a one-hot vector identifying the speaker of utterance u_j, where Q denotes the set of participants.
  • f_j ∈ S^(|Q|·|G|) contains the VFOA target sequence over the course of utterance u_j for each participant, where G = {p_0, …, p_|Q|, table, whiteboard, projection screen, unknown} (in AMI, |Q| = 4 and |G| = 8).
  • u_j is the sequence of words in the utterance.
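The triple formulation above can be mirrored in a small data structure. `Turn`, `PARTICIPANTS`, and `VFOA_TARGETS` are hypothetical names for illustration, using the AMI roles and gaze targets listed above:

```python
from dataclasses import dataclass
from typing import List

PARTICIPANTS = ["PM", "ME", "ID", "UI"]                         # |Q| = 4
VFOA_TARGETS = PARTICIPANTS + ["table", "whiteboard",
                               "projection screen", "unknown"]  # |G| = 8

@dataclass
class Turn:
    """One (p_j, f_j, u_j) triple of a meeting transcript."""
    speaker: List[int]        # p_j: one-hot vector over PARTICIPANTS
    vfoa: List[List[str]]     # f_j: per participant, gaze targets during the utterance
    words: List[str]          # u_j: the utterance itself

def one_hot(name: str) -> List[int]:
    # encode a speaker name as a one-hot vector over PARTICIPANTS
    return [int(p == name) for p in PARTICIPANTS]
```

A transcript is then simply a `list[Turn]`, one entry per utterance.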
Keep Meeting Summaries on Topic
• TopicSeg + VFOA: Hierarchical Attention in the Summary Decoder
• The probability of generating summary word y_i follows the decoder in PGNet, and β_ij^sum is the copying probability.
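The PGNet-style mixture referenced above can be sketched as follows: the final word distribution blends a vocabulary generation distribution with attention-weighted copying from the source, where (1 - p_gen) plays the role of the copying probability. `final_distribution` is a hypothetical helper for illustration, not the paper's code:

```python
import numpy as np

def final_distribution(p_vocab, attn, src_ids, p_gen):
    """Pointer-generator mixture: with probability p_gen generate from the
    vocabulary distribution, otherwise copy a source token via attention.

    p_vocab: (V,) generation distribution over the vocabulary
    attn:    (T,) attention weights over the T source tokens
    src_ids: (T,) vocabulary id of each source token
    p_gen:   scalar in [0, 1]
    """
    p = p_gen * np.asarray(p_vocab, dtype=float)
    for t, wid in enumerate(src_ids):
        p[wid] += (1.0 - p_gen) * attn[t]   # scatter-add copy mass onto vocab ids
    return p
```

If both input distributions are normalized, the mixture is again a valid distribution, which is what lets the decoder copy rare names directly from the transcript.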
Keep Meeting Summaries on Topic
• TopicSeg + VFOA
Keep Meeting Summaries on Topic
• Dataset
  • 97 meetings for training, 20 for validation, and 20 for testing.
  • Each meeting lasts 30 minutes.
• Experiment Results
Taxonomy Based on Data Type (recap: Text Only, DiDi Customer Service [2])
Dialogue Summarization for Customer Service
• Main Contributions:
  • Proposed auxiliary key point sequences to ensure the logic and integrity of dialogue summaries.
  • Proposed a novel hierarchical decoder architecture, the Leader-Writer net, to generate both the key point sequences and the summaries.
Dialogue Summarization for Customer Service
• What is a key point sequence?
  • A key point is the theme of a contiguous set of one or more summary sentences.
  • One example
  • The key point sequence can be used to enhance the logic and integrity of the generated summary.
Dialogue Summarization for Customer Service
• How are key point sequences generated for the training dataset?
  • Summary → (rules designed by domain experts) → key point sequence
  • There are 51 key points in total.
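The expert-rule labeling can be pictured as keyword matching over summary sentences, collapsing consecutive sentences that share a theme into one key point. The three rules below are invented stand-ins; the paper's actual 51 key points and rules are not public, so nothing here should be read as the real rule set:

```python
import re

# Illustrative rules only; patterns and key point names are made up.
RULES = [
    ("User complaint", re.compile(r"complain|dissatisf")),
    ("Solution",       re.compile(r"refund|coupon|suggest")),
    ("User approval",  re.compile(r"accept|agree|approv")),
]

def key_point_sequence(summary_sentences):
    """Map each summary sentence to its first matching key point,
    merging adjacent sentences that fall under the same key point."""
    seq = []
    for sent in summary_sentences:
        for kp, pat in RULES:
            if pat.search(sent.lower()):
                if not seq or seq[-1] != kp:   # contiguous sentences share one label
                    seq.append(kp)
                break
    return seq
```

This automatic labeling is what makes it cheap to supervise the key point decoder on a large training set.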
Dialogue Summarization for Customer Service
• Leader-Writer net: Overall Architecture

Dialogue Summarization for Customer Service
• Leader-Writer net: Hierarchical Encoder
Dialogue Summarization for Customer Service
• Leader-Writer net: Hierarchical Encoder
  • The relative position of the j-th utterance is computed from N, the number of utterances in the dialogue, and L, the maximum number of relative positions; L is set to 30 in the experiments.
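The slide does not reproduce the formula itself, so the bucketing below is only one plausible reconstruction: map the j-th of N utterances into one of L = 30 position buckets, so dialogues of any length share one small position-embedding table.

```python
import math

def relative_position(j, N, L=30):
    """Bucket the j-th utterance (1-based) of an N-utterance dialogue
    into one of L relative positions. The exact formula is not shown on
    the slide; this ceiling-based scaling is an assumed reconstruction."""
    return min(math.ceil(j * L / N), L)
```

The bucket index would then select a learned position embedding for the utterance encoder.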
Dialogue Summarization for Customer Service
• Leader-Writer net: Hierarchical Decoder
  • The hidden state of the key point decoder, rather than the predicted key point itself, is used to guide the generation of each sub-summary.
  • This avoids identical initial states for two sub-summaries that share the same key point within one key point sequence, e.g., […, Solution, User disapproval, Solution, User approval, …].
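The design choice can be demonstrated with a toy recurrence: because each writer is initialized from the leader's hidden state (which evolves across the sequence) rather than from the key-point embedding, two occurrences of "Solution" receive different initial states. Everything below (`leader_step`, the embeddings, the tanh recurrence) is a hypothetical stand-in for the paper's recurrent leader decoder:

```python
import numpy as np

rng = np.random.default_rng(0)

def leader_step(h, kp_embed, W=None):
    """One stand-in recurrent step of the 'leader' over key points."""
    W = np.eye(len(h)) if W is None else W
    return np.tanh(W @ h + kp_embed)

kp_embed = {"Solution": rng.normal(size=4),
            "User disapproval": rng.normal(size=4)}

h = np.zeros(4)
states = []
for kp in ["Solution", "User disapproval", "Solution"]:
    h = leader_step(h, kp_embed[kp])
    states.append(h.copy())   # writer init = leader hidden state, not kp_embed[kp]
```

Had the writers been initialized from `kp_embed["Solution"]` directly, positions 1 and 3 would start from identical states and tend to emit the same sub-summary twice.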
Dialogue Summarization for Customer Service
• Leader-Writer net: Training
  • Cross-entropy loss
  • Reinforcement loss (using ROUGE-L as the reward)
  • Joint loss
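A hedged sketch of the joint objective: cross-entropy mixed with a self-critical RL term whose reward is ROUGE-L. The LCS-based ROUGE-L F score below is a toy reimplementation, and `lam` is a hypothetical mixing weight; the paper's exact weighting scheme may differ.

```python
def lcs_len(a, b):
    # longest common subsequence length, the core of ROUGE-L
    m, n = len(a), len(b)
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m):
        for j in range(n):
            dp[i + 1][j + 1] = dp[i][j] + 1 if a[i] == b[j] else max(dp[i][j + 1], dp[i + 1][j])
    return dp[m][n]

def rouge_l_f1(hyp, ref, beta=1.2):
    """Toy ROUGE-L F score over token sequences, used as the RL reward."""
    lcs = lcs_len(hyp, ref)
    if lcs == 0:
        return 0.0
    p, r = lcs / len(hyp), lcs / len(ref)
    return (1 + beta ** 2) * p * r / (r + beta ** 2 * p)

def joint_loss(ce_loss, sample_logprob, sample_reward, greedy_reward, lam=0.5):
    """Self-critical mixing: pushing up sampled sequences that beat the
    greedy baseline, blended with cross-entropy by the weight lam."""
    rl_loss = (greedy_reward - sample_reward) * sample_logprob
    return lam * rl_loss + (1 - lam) * ce_loss
```

When the sampled summary scores no better than the greedy baseline, the RL term vanishes or flips sign, which is what stabilizes pure-reward training.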
Dialogue Summarization for Customer Service
• Dataset

Dialogue Summarization for Customer Service
• Experiments
Conclusions
• A brief introduction to dialogue summarization was given.
• Two recent papers were introduced:
  • One in meeting summarization, which introduced VFOA features to enhance performance.
  • The other in customer service summarization, which introduced the key point sequence to improve the logicality and integrity of the generated summary.
Q&A