  1. Question answering. CS685 Fall 2020: Advanced Natural Language Processing. Mohit Iyyer, College of Information and Computer Sciences, University of Massachusetts Amherst. Some slides from Jordan Boyd-Graber, Jacob Devlin, and Chris Manning.

  2. Stuff from last time • HW0 grades published, good job! • HW1 coming soon :) • Project proposal feedback in early October • Exam pushed back to end of October • Thanks to whoever posted all those Notability tips in the anonymous form!

  3. Who wrote the song "Kiss from a Rose"? → Seal. [Diagram of a traditional QA pipeline: question analysis (POS tagging / parsing / NER, answer type detection), query formulation, evidence retrieval / knowledge base search, candidate answer generation, candidate scoring, and final answer ranking and selection.]

  4. Can we replace all of these modules with a single neural network? [Diagram: the question "Who wrote the song 'Kiss from a Rose'?" goes into a neural network / classifier with access to external knowledge, which outputs "Seal".]

  5. Types of QA:
  • factoid QA: the answer is a single entity / number ("Who wrote the book 'Dracula'?")
  • non-factoid QA: the answer is free text ("Why is Dracula so evil?")
  • QA subtypes (could be factoid or non-factoid):
    • semantic parsing: the question is mapped to a logical form, which is then executed over some database ("How many people did Dracula bite?")
    • reading comprehension: the answer is a span of text within a document
    • community-based QA: the question is answered by multiple web users (e.g., Yahoo! Answers)
    • visual QA: questions about images

  6. Machine reading (“reading comprehension”)

  7. SQuAD

  8. Let’s look at the DrQA model (Chen et al., ACL 2017) (pre-BERT)

  9. Big idea

  10. Start and end probabilities: P_start(i) ∝ exp(p̃_i W_s q̃) and P_end(i) ∝ exp(p̃_i W_e q̃), where (1) q̃ is a vector representing the question, (2) p̃_i is a vector representing word i of the passage, and (3) W_s and W_e are learned parameters that score each position as the start / end of the answer.

  15. Start and end probabilities (continued): how does this work at test time?
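
A minimal PyTorch sketch of these bilinear start/end probabilities and of span selection at test time; the function and variable names are illustrative, not taken from the DrQA codebase. At test time the usual trick is to pick the span (i, j) with i ≤ j ≤ i + max_len that maximizes P_start(i) · P_end(j).

```python
import torch

def answer_span(p, q, W_s, W_e, max_len=15):
    """p: (L, d) passage token vectors; q: (d,) question vector;
    W_s, W_e: (d, d) learned bilinear parameters. Illustrative sketch."""
    # bilinear scores for each passage position being the start / end
    start_scores = p @ W_s @ q            # (L,)
    end_scores = p @ W_e @ q              # (L,)
    P_start = torch.softmax(start_scores, dim=0)
    P_end = torch.softmax(end_scores, dim=0)

    # test time: choose the span (i, j) with i <= j <= i + max_len
    # that maximizes P_start(i) * P_end(j)
    best, best_span = -1.0, (0, 0)
    L = p.size(0)
    for i in range(L):
        for j in range(i, min(i + max_len + 1, L)):
            score = (P_start[i] * P_end[j]).item()
            if score > best:
                best, best_span = score, (i, j)
    return best_span, best

# toy usage with random tensors (dimensions are placeholders)
d, L = 128, 40
p, q = torch.randn(L, d), torch.randn(d)
W_s, W_e = torch.randn(d, d), torch.randn(d, d)
print(answer_span(p, q, W_s, W_e))
```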

  16. Stanford Attentive Reader++ (figure from SLP3, Chapter 23). [Diagram: question words ("When did ...") are run through an LSTM and combined into a single question vector q; each passage word ("Beyonce’s debut album ...") is represented by its GloVe embedding plus exact-match, POS, and NER features and an attention-weighted aligned question embedding (q-align), then encoded by stacked LSTMs into vectors p̃_1, p̃_2, p̃_3, ...; a similarity between q and each p̃_i gives p_start(i) and p_end(i). Training objective: the log-likelihood of the gold start and end positions.]

  17. BiDAF: Bi-Directional Attention Flow for Machine Comprehension (Seo, Kembhavi, Farhadi, Hajishirzi, ICLR 2017)

  18. Coattention encoder. [Diagram: document states D and question states Q attend to each other through an affinity matrix, producing attention weights A^D and A^Q and context summaries C^D and C^Q; the concatenated document states and contexts are fed through a bi-LSTM to produce the coattention encoding U = {u_t}.]
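
A rough sketch of what a coattention-style encoder computes: an affinity matrix between document and question states, attention in both directions, and a fusion bi-LSTM. Names and dimensions are illustrative, and details of the original model (e.g., sentinel vectors) are omitted.

```python
import torch
import torch.nn as nn

def coattention(D, Q, fusion_lstm):
    """D: (m, h) document states, Q: (n, h) question states,
    fusion_lstm: nn.LSTM over 3h-dim inputs. Illustrative only."""
    A = D @ Q.T                               # (m, n) affinity matrix
    A_Q = torch.softmax(A, dim=0)             # attention over document positions
    A_D = torch.softmax(A, dim=1)             # attention over question positions
    C_Q = A_Q.T @ D                           # (n, h) document context per question word
    C_D = A_D @ torch.cat([Q, C_Q], dim=1)    # (m, 2h) question + context summaries
    U, _ = fusion_lstm(torch.cat([D, C_D], dim=1).unsqueeze(0))
    return U.squeeze(0)                       # (m, 2 * hidden) coattention encoding

m, n, h = 30, 8, 64
D, Q = torch.randn(m, h), torch.randn(n, h)
lstm = nn.LSTM(input_size=3 * h, hidden_size=h, bidirectional=True, batch_first=True)
print(coattention(D, Q, lstm).shape)          # torch.Size([30, 128])
```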

  19. SQuAD v1.1 leaderboard, end of 2016 (Dec 6). [Table of systems with EM and F1 scores.]

  20. All of these models are trained from scratch on the SQuAD training set!!!

  21. ���������������������

  22. Fine-tuning BERT for SQuAD: simply concatenate the question and paragraph into a single sequence, pass it through BERT, and apply a softmax over the final-layer token representations to predict the start/end boundaries of the answer span.
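
As a concrete illustration (not part of the original slides), here is roughly what that looks like with the Hugging Face transformers library; the checkpoint name is one publicly released SQuAD-finetuned BERT model.

```python
import torch
from transformers import AutoTokenizer, AutoModelForQuestionAnswering

# a publicly available BERT model fine-tuned on SQuAD
name = "bert-large-uncased-whole-word-masking-finetuned-squad"
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForQuestionAnswering.from_pretrained(name)

question = "Who wrote the song 'Kiss from a Rose'?"
passage = "'Kiss from a Rose' is a song written and performed by Seal."

# question and passage are concatenated into one sequence:
# [CLS] question [SEP] passage [SEP]
inputs = tokenizer(question, passage, return_tensors="pt")
with torch.no_grad():
    out = model(**inputs)

# one logit per token for being the answer start / end
start = out.start_logits.argmax(dim=-1)
end = out.end_logits.argmax(dim=-1)
tokens = inputs["input_ids"][0][start : end + 1]
print(tokenizer.decode(tokens))   # expected: "seal"
```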

  23. SQuAD v1.1 leaderboard, 2019-02-07 – it’s solved!

  24. Transfer learning via BERT made most of the task-specific QA architectures obsolete

  25. SQuAD 2.0 example. Question: "When did Genghis Khan kill Great Khan?" Gold answer: <No Answer>. Prediction: "1234" [from Microsoft nlnet]

  26. SQuAD 2.0 leaderboard, 2019-02-07. [Table of systems with EM and F1 scores.]

  28. Good systems are great, but they still make basic NLU errors. Question: "What dynasty came before the Yuan?" Gold answers: "Song dynasty", "Mongol Empire", "the Song dynasty". Prediction: "Ming dynasty" [BERT (single model), Google AI]

  29. SQuAD limitations
  • SQuAD has a number of other key limitations too:
    • only span-based answers (no yes/no, counting, implicit why)
    • questions were constructed looking at the passages
    • not genuine information needs
    • generally greater lexical and syntactic matching between questions and answer spans than you get IRL
    • barely any multi-fact/sentence inference beyond coreference
  • Nevertheless, it is a well-targeted, well-structured, clean dataset
    • it has been the most used and competed-on QA dataset
    • it has also been a useful starting point for building systems in industry (though in-domain data always really helps!)

  30. Several variants of the SQuAD-style setup (all easily portable to BERT :)

  31. Conversational question answering: multiple questions about the same document (answers are still spans from the document). Datasets: QuAC, CoQA, CSQA, etc. How do we use BERT to solve this task?
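
One common baseline, sketched below under the assumption that we simply flatten the dialogue: prepend the previous question/answer turns to the current question, keep the passage as the second segment, and reuse the same start/end span prediction head. This is an illustrative sketch, not any specific QuAC/CoQA system.

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

# hypothetical conversation history: (question, answer) turns so far
history = [("Who is the poem about?", "Emily Dickinson"),
           ("Where did she live?", "Amherst")]
current_question = "What did she write about?"
passage = "..."   # the shared document the conversation is grounded in

# flatten the dialogue history into the question segment;
# the passage stays in the second segment, as in single-turn SQuAD
history_text = " ".join(q + " " + a for q, a in history)
inputs = tokenizer(history_text + " " + current_question,
                   passage,
                   truncation="only_second",
                   max_length=384,
                   return_tensors="pt")
# from here, the same start/end span prediction head as before applies
```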

  32. Multi-hop question answering: requires models to perform more “reasoning” over the document. Datasets: HotpotQA, QAngaroo

  33. Long-form question answering: answers must be generated, not extracted. Datasets: ELI5, NarrativeQA, etc. More on these later!

  34. Open-domain question answering: no supporting documents (no evidence!) are given to the model; it must retrieve relevant documents itself and use them to generate an answer. The future of QA?
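
A toy retrieve-then-read sketch of that idea (an assumed setup, not any particular system): TF-IDF retrieval over a tiny in-memory corpus, followed by an extractive reader like the BERT span predictor shown earlier.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# tiny illustrative corpus standing in for a real document collection
corpus = [
    "'Kiss from a Rose' is a song written and performed by Seal.",
    "Dracula is an 1897 Gothic horror novel by Bram Stoker.",
    "Emily Dickinson lived most of her life in Amherst, Massachusetts.",
]
question = "Who wrote the song 'Kiss from a Rose'?"

vectorizer = TfidfVectorizer()
doc_vecs = vectorizer.fit_transform(corpus)
q_vec = vectorizer.transform([question])

# retrieve the most relevant document, then hand it to a reader
best_doc = corpus[cosine_similarity(q_vec, doc_vecs).argmax()]
print(best_doc)
# answer = reader(question, best_doc)   # e.g., the BERT span predictor above
```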

  35. All of these QA tasks are very similar… can we share information across different datasets to improve our performance across the board? (more next time!)

  36. finally… a real-world example of deploying QA models

  37. Quiz Bowl

  38. What is quiz bowl? • a trivia game that contains questions about famous entities (e.g., novels, battles, countries) • we developed a deep learning system, QANTA, to play quiz bowl • one of the first applications of deep learning to question answering (Iyyer et al., EMNLP 2014 & ACL 2015)

  39. This author described a "plank in reason" breaking and hitting a "world at every plunge" in a poem which opens "I felt a funeral in my brain." She wrote that "the stillness round my form was like the stillness in the air" in "I heard a fly buzz when I died." She wrote about a scarcely visible roof and a cornice that was "but a mound" in a poem about a carriage ride with Immortality and Death. For 10 points, name this reclusive "Belle of Amherst" who wrote "Because I could not stop for Death." A: Emily Dickinson

  40. [Diagram: "… name this reclusive 'Belle of Amherst' …" → NN classifier → Emily Dickinson]

  41. Dependency-tree NNs. [Diagram: "… name this reclusive belle …" is encoded by a dependency-tree NN; a softmax predicts Emily Dickinson out of a set of ~5000 answers.] Iyyer et al., EMNLP 2014

  42. Simple discourse-level representations by averaging: av = (1/n) Σ_{i=1}^{n} c_i. Example question: "In one novel, one of these figures antagonizes an impoverished family before leaping into an active volcano. Another of these figures titles a novella in which General Spielsdorf describes the circumstances of his niece Bertha Reinfeldt's death to the narrator, Laura. In addition to Varney and Carmilla, another of these figures sails on the Russian ship Demeter in order to reach London. That figure bites Lucy Westenra before being killed by a coalition including Jonathan Harker and Van Helsing. For 10 points, identify these bloodsucking beings most famously exemplified by Bram Stoker’s Dracula."
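
A minimal sketch of this averaging approach feeding a classifier over the answer set; the embedding size, vocabulary size, and the 5000-answer output layer are illustrative placeholders, not the actual QANTA configuration.

```python
import torch
import torch.nn as nn

class AveragingClassifier(nn.Module):
    """Average word embeddings, then predict one of ~5000 quiz bowl answers.
    A simplified sketch in the spirit of QANTA, not its actual code."""
    def __init__(self, vocab_size, embed_dim=300, num_answers=5000):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.out = nn.Linear(embed_dim, num_answers)

    def forward(self, token_ids):             # token_ids: (batch, length)
        vectors = self.embed(token_ids)       # (batch, length, embed_dim)
        av = vectors.mean(dim=1)              # av = (1/n) * sum_i c_i
        return self.out(av)                   # answer logits

model = AveragingClassifier(vocab_size=50_000)
logits = model(torch.randint(0, 50_000, (2, 120)))   # two questions, 120 tokens each
print(logits.shape)    # torch.Size([2, 5000])
```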

  43. Of course, nowadays we would just put these questions into BERT and place a classifier over the [CLS] token to predict the answer!
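
Concretely, that could look like the following with the transformers sequence-classification head, which puts a linear classifier on top of the [CLS] representation; the 5000-label answer set is carried over from the earlier slides, and the checkpoint and sizes are illustrative.

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# the classification head sits on top of the [CLS] representation
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=5000)

question = "... name this reclusive 'Belle of Amherst' ..."
inputs = tokenizer(question, return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**inputs).logits          # (1, 5000) scores over the answer set
predicted_answer_id = logits.argmax(dim=-1).item()
```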

  44. 2015: defeated Ken Jennings 300-160

  45. 2016: lost to top quiz bowlers 345-145

  46. 2017: beat top quiz bowlers 260-215

  47. late 2017: crushed top team 475-185

  48. Deep learning ≈ memorization: during training, QANTA becomes very good at associating named entities in questions with answers… "That figure bites Lucy Westenra before being killed by a coalition including Jonathan Harker and Van Helsing." → Vampire
