Trainable Approaches for Surface NLG*
Adwait Ratnaparkhi, WhizBang! Labs -- Research
*Funded by IBM TJ Watson Research Center
What is surface NL generation?
A module that produces a grammatical NL phrase to describe an input semantic representation.
For our purposes:
  what information to say is determined elsewhere (deep generation)
  how to say the information is determined by NLG systems (surface generation)
Existing Traditional Methods
Canned Phrases & Templates
  Simple to implement
  Scalability is limited
NLG Packages
  FUF/SURGE (Columbia Univ.), ILEX (Edinburgh Univ.), PENMAN (ISI), REALPRO (CogenTex), ...
  Advantages
    Input: abstract semantic representation
    Output: NLG package turns it into English
  Disadvantages
    Requires many rules to map semantics to NL
    Writing the rules, as well as the input representation, requires linguistic expertise
Trainable NLG Motivation
Avoid manually writing rules that map semantics to English
Data driven
  Base NL generation on real data, instead of the preferences of the grammar writer
Portability to other languages & domains
Solve the lexical choice problem: if there are many correct ways to say the same thing, which is best?
Trainable NLG for air travel
Generate a noun phrase for a flight description
Input to NLG: meaning of the flight phrase
  { $air = "USAIR", $city-fr = "Miami", $dep-time = "evening", $city-to = "Boston", $city-stp = "New York" }
NLG produces:
  $air flight leaving $city-fr in the $dep-time and arriving in $city-to via $city-stp
After substitution:
  "USAIR flight leaving Miami in the evening and arriving in Boston via New York"
System learns to generate from a corpus of (meaning, phrase) pairs, e.g.
  Meaning: { $city-fr, $city-to, $air }
  Phrase:  flight from $city-fr to $city-to on $air
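As a concrete illustration of the final substitution step only, here is a minimal Python sketch; the fill_template helper and the hard-coded template and attribute values are ours, not the original system's code.

```python
# Minimal sketch of substituting attribute values into a generated template.
# The helper name and hard-coded strings are illustrative, not from the
# original system.

def fill_template(template: str, attributes: dict) -> str:
    """Replace each $key placeholder in the template with its value."""
    phrase = template
    for key, value in attributes.items():
        phrase = phrase.replace(key, value)
    return phrase

attributes = {
    "$air": "USAIR",
    "$city-fr": "Miami",
    "$dep-time": "evening",
    "$city-to": "Boston",
    "$city-stp": "New York",
}
template = ("$air flight leaving $city-fr in the $dep-time "
            "and arriving in $city-to via $city-stp")
print(fill_template(template, attributes))
# "USAIR flight leaving Miami in the evening and arriving in Boston via New York"
```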
What is so difficult about generating flight descriptions?
Flight phrases are necessary in a dialog response, e.g., "There are 5 flights ..., which do you prefer?"
Combinatorial explosion of ways to present flight information: we use 26 attributes
  Given n attributes, there are n! possible orderings
NLG must solve:
  What is the optimal ordering of attributes?
  What words do we use to "glue" attributes together, so that the phrase is well-formed?
  What is the optimal way to choose between multiple ways of saying the same flight, i.e., lexical choice?
Three methods for trainable surface NLG
NLG1: Baseline model
  Find the most common phrase to express an attribute set
  Surprisingly effective: over 80% accuracy
  Cannot generate phrases for novel attribute sets
NLG2: Consecutive n-gram model
  Predict words left-to-right
NLG3: Dependency-based model
  Predict words in dependency tree order (not necessarily left-to-right)
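A rough sketch of the NLG1-style baseline idea, assuming a toy corpus of (attribute set, phrase) pairs (the entries below are invented): remember the most frequent phrase for each exact attribute set, and fail on unseen sets.

```python
from collections import Counter, defaultdict

# Toy corpus of (attribute set, template phrase) pairs; entries are invented.
corpus = [
    (frozenset({"$city-fr", "$city-to"}), "flights from $city-fr to $city-to"),
    (frozenset({"$city-fr", "$city-to"}), "flights from $city-fr to $city-to"),
    (frozenset({"$city-fr", "$city-to"}), "flights between $city-fr and $city-to"),
    (frozenset({"$city-fr", "$city-to", "$air"}),
     "$air flights from $city-fr to $city-to"),
]

# Count how often each phrase expresses each attribute set.
phrase_counts = defaultdict(Counter)
for attr_set, phrase in corpus:
    phrase_counts[attr_set][phrase] += 1

def nlg1(attr_set):
    """Return the most frequent training phrase for this exact attribute set,
    or None for a novel (unseen) attribute set."""
    key = frozenset(attr_set)
    if key not in phrase_counts:
        return None
    return phrase_counts[key].most_common(1)[0][0]

print(nlg1({"$city-fr", "$city-to"}))       # "flights from $city-fr to $city-to"
print(nlg1({"$airport-fr", "$day-dep"}))    # None: cannot handle a novel set
```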
NLG2: n-gram based generation
Predict the sentence one word at a time
  Associate a probability with each word
  Use information in the previous 2 words & the attributes
  Simultaneously search many hypotheses
Probability model for a sentence:
  A = initial attribute list
  A_i = attributes remaining when predicting the i-th word
  P(w_1 ... w_n | A) = ∏_i P(w_i | w_{i-1}, w_{i-2}, A_i)
NLG2 outputs the best sentence W*
  W* = w_1* ... w_n* = argmax_{w_1 ... w_n} P(w_1 ... w_n | A)
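A sketch of the left-to-right search under this model, with a stubbed next-word distribution standing in for the trained model; the probabilities, vocabulary, and beam settings below are invented for illustration.

```python
import heapq
import math

def next_word_probs(prev2, prev1, remaining_attrs):
    """Stub for P(w_i | w_{i-1}, w_{i-2}, A_i). A real system would query the
    trained maximum entropy model; these numbers are invented."""
    if not remaining_attrs:
        return {"</s>": 1.0}
    dist = {"flights": 0.1, "from": 0.1, "to": 0.1}
    for attr in remaining_attrs:            # favor attributes not yet expressed
        dist[attr] = 0.7 / len(remaining_attrs)
    return dist

def nlg2_generate(attrs, beam_size=5, max_len=10):
    """Beam search for W* = argmax P(w_1 ... w_n | A), one word at a time."""
    beam = [(0.0, ["<s>", "<s>"], frozenset(attrs))]   # (neg log prob, words, A_i)
    complete = []
    for _ in range(max_len):
        candidates = []
        for score, words, remaining in beam:
            for w, p in next_word_probs(words[-2], words[-1], remaining).items():
                new_score = score - math.log(p)
                if w == "</s>":
                    complete.append((new_score, words[2:]))
                else:
                    candidates.append((new_score, words + [w], remaining - {w}))
        beam = heapq.nsmallest(beam_size, candidates)
        if not beam:
            break
    best = min(complete, default=None)
    return " ".join(best[1]) if best else None

# With the stub the output is just the attribute placeholders in some order;
# the trained model supplies the "glue" words.
print(nlg2_generate({"$city-fr", "$city-to"}))
```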
Combine local & non-local information to predict the next word
Implement information in the context as features in a maximum entropy framework
  f_j(w_i, w_{i-1}, w_{i-2}, A_i) = 1 if <w_i, w_{i-1}, w_{i-2}, A_i> is interesting, 0 otherwise
Derive the feature set by applying patterns to the training data, e.g.
  f_j(w_i, w_{i-1}, w_{i-2}, A_i) = 1 if w_i = "from", w_{i-1} = "flights", $city-fr ∈ A_i; 0 otherwise
P(w_i | w_{i-1}, w_{i-2}, A_i) = ∏_{j=1...k} α_j^{f_j(w_i, w_{i-1}, w_{i-2}, A_i)} / Z(w_{i-1}, w_{i-2}, A_i)
Each feature has a weight: α_j > 0
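A small sketch of such a maximum entropy distribution in the product form above; the vocabulary, the two features, and the α values are invented for illustration, whereas the real system learns thousands of features and their weights from data.

```python
# Sketch of P(w_i | w_{i-1}, w_{i-2}, A_i) as a normalized product of feature
# weights. Vocabulary, features, and alpha values are invented.

VOCAB = ["flights", "from", "to", "$city-fr", "$city-to", "</s>"]

def f1(w, prev1, prev2, attrs):
    """Fires when 'from' follows 'flights' and a departure city is still unexpressed."""
    return 1 if w == "from" and prev1 == "flights" and "$city-fr" in attrs else 0

def f2(w, prev1, prev2, attrs):
    """Fires when the departure-city attribute directly follows 'from'."""
    return 1 if w == "$city-fr" and prev1 == "from" and "$city-fr" in attrs else 0

FEATURES = [(f1, 9.0), (f2, 7.5)]   # (feature, alpha_j) with alpha_j > 0

def next_word_distribution(prev1, prev2, attrs):
    """P(w | w_{i-1}, w_{i-2}, A_i) = prod_j alpha_j^{f_j(...)} / Z(...)."""
    scores = {}
    for w in VOCAB:
        score = 1.0
        for f, alpha in FEATURES:
            score *= alpha ** f(w, prev1, prev2, attrs)
        scores[w] = score
    z = sum(scores.values())        # the normalizer Z(w_{i-1}, w_{i-2}, A_i)
    return {w: s / z for w, s in scores.items()}

dist = next_word_distribution(prev1="flights", prev2="evening",
                              attrs={"$city-fr", "$city-to"})
print(dist["from"])   # boosted relative to words with no active feature
```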
NLG2 Sample output
A = { $city-to = "Boston", $day-dep = "Tuesday", $airport-fr = "JFK", $time-depint = "morning" }
NLG2 produces:
  0.137  flights from JFK to Boston on Tuesday morning
  0.084  flights from JFK to Boston Tuesday morning
  0.023  flights from JFK to Boston leaving Tuesday morning
  0.013  flights between JFK and Boston on Tuesday morning
  0.002  flights from JFK to Boston Tuesday morning flights
NLG2 Summary
Advantages
  Automatic determination of attribute ordering, connecting English, and lexical choice
  Minimally annotated data
  86-88% correct
Disadvantages
  Current word depends on only the previous 2 words
  May not scale to longer sentences with long-distance dependencies
  Difficult to implement number agreement
NLG3: Predict dependency tree
Links indicate grammatical dependency: "USAIR flights to NY from Boston in the afternoon"
Links form a tree (+/- indicates the dependent's direction relative to its head):
  flights
    USAIR(-)
    to(+)
      NY(+)
    from(+)
      Boston(+)
    in(+)
      afternoon(+)
        the(-)
NLG3 Model for Dependency generation
Testing: given an attribute list A, find the most probable dependency tree T*
  T* = argmax_T p(T | A)
  p(T | A) = ∏_{child} p(child | parent, grandparent, 2 siblings, A_child)
The form of p(child | ...) is a maximum entropy model
Use a beam-like search to find T*
Assumption: it is easier to predict new words when conditioning on grammatically related words together with the attributes
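A sketch of the factorization only (not the beam-like search), with a stub standing in for the trained maximum entropy model; the Node class and the constant probability are ours, and the tree is the example from the previous slide.

```python
import math

class Node:
    """One word in the dependency tree, with its dependents as children."""
    def __init__(self, word, children=None):
        self.word = word
        self.children = children or []

def node_prob(word, parent, grandparent, prev_siblings, attrs):
    """Stub for p(child | parent, grandparent, 2 siblings, A_child).
    A trained model returns a learned, context-dependent probability."""
    return 0.5   # invented constant

def tree_log_prob(node, attrs, parent=None):
    """log p(T | A) as a sum over nodes of log p(child | parent, grandparent,
    two previous siblings, attributes)."""
    logp = 0.0
    siblings = []
    for child in node.children:
        p = node_prob(child.word, node.word, parent, tuple(siblings[-2:]), attrs)
        logp += math.log(p)
        siblings.append(child.word)
        logp += tree_log_prob(child, attrs, parent=node.word)
    return logp

# "USAIR flights to NY from Boston in the afternoon", rooted at "flights"
tree = Node("flights", [
    Node("USAIR"),
    Node("to", [Node("NY")]),
    Node("from", [Node("Boston")]),
    Node("in", [Node("afternoon", [Node("the")])]),
])
attrs = {"$air", "$city-to", "$city-fr", "$time-depint"}
print(tree_log_prob(tree, attrs))   # 8 nodes x log(0.5) with the stub
```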
NLG3 Summary
Automatic determination of attribute ordering, connecting English, and lexical choice
Annotated data semi-automatically derived from NLU training data
Easier to implement number agreement
Should scale to longer sentences with long-distance dependencies
88-90% correct on test sentences
Evaluation
Training: 6k flight phrases
  NLG1, NLG2: train from text only
  NLG3: train from text & grammatical dependencies
Testing: 2k flight phrases
  Test data consists of 190 unique attribute sets
Evaluate NLG output by hand (2 judges):
  1 = perfectly acceptable [Perfect]
  2 = acceptable except for tense or agreement [OK]
  3 = not acceptable (extra or missing words) [Bad]
  4 = no output from NLG [Nothing]
Accuracy improves with more sophisticated methods
[Chart: Accuracy Improvement (Category = "Perfect"); y-axis: % Perfect (81-91); x-axis: Method (NLG1, NLG2, NLG3); series: Judge A, Judge B]
Fewer cases of no output with more sophisticated models
[Chart: Error Reduction (Category = "No output"); y-axis: % No output (0-3.5); x-axis: Method (NLG1, NLG2, NLG3)]
Conclusions
Learning reduces error from the baseline system by 33% - 37%
  attribute ordering, connecting English, lexical choice
(Langkilde & Knight, 1998) uses corpus statistics to rerank the output of a hand-written grammar
  NLG3 can be viewed as inducing a probabilistic dependency grammar
(Berger et al., 1996) does statistical MT (and hence generation) straight from source text
  Our systems use a statistical approach with an "interlingua" (attribute-value pairs)