Phrase-Indexed Question Answering : A New Challenge for Scalable - PowerPoint PPT Presentation

Phrase-Indexed Question Answering : A New Challenge for Scalable Document Comprehension Minjoon Seo, Tom Kwiatkowski, Ankur Parikh, Ali Farhadi, Hannaneh Hajishirzi

Question Answering?

1961 Model “Barack Obama (1961-present) was the 44 th When was Obama born? President of the United States.” Document (context) Question

1961 Model Extractive “Barack Obama (1961-present) was the 44 th When was Obama born? President of the United States.” Document (context) Question

Extractive QA Datasets • SQuAD (Rajpurkar et al., 2016) • NewsQA (Trischler et al., 2016) • TriviaQA (Joshi et al., 2017) • QuAC (Choi et al., 2018) • CoQA (Reddy & Chen & Manning, 2018) • HotpotQA (Yang et al., 2018) • And more…

Open-domain QA?

1961 Model “Barack Obama (1961-present) was the 44 th When was Obama born? President of the United States.” Document (context) Question

1961 Model When was Obama born? Question

4 Million documents 3 Billion tokens 0.1s / doc * 4M docs = 6 days !

Choi et al., 2017; Chen et al., 2017; Clark & Gardner, 2017 TF-IDF, BM 25, LSA Information Retrieval Model 1961 When was Obama born? Pipelined

Choi et al., 2017; Chen et al., 2017; Clark & Gardner, 2017 Wrong TF-IDF, document! BM 25, LSA Information Retrieval Model 1961 When was Obama born?

Choi et al., 2017; Chen et al., 2017; Clark & Gardner, 2017 Wrong TF-IDF, document! BM 25, LSA Wrong Information answer! Retrieval Model 1911 When was Obama born? Error propagation

Ideally…

TF-IDF, BM 25, LSA Information Retrieval Model 1961 When was Obama born?

? Model 1961 When was Obama born? End-to-end & elegant… But how?

Solution: Index phrases!

[-3, 0.1, …] When was [0.5, 0.1, …] Obama born? [0.3, -0.2, …] Nearest [0.5, 0.1, …] neighbor search [0.7, -0.4, …] Document Indexing [0.5, 0.0, …] - Locality Sensitive Hashing - aLSH (Shrivastava & Li, 2014) [3.3, -2.2, …] - …

“Barack Obama (1961-present) was the 44 th President of the United States.” Who is the 44 th Barack Obama … President of the U.S.? Nearest … ( 1961 -present … neighbor … 44 th President … search When was … United States . Obama born? Question Phrase encoding encoding

Model phrase question document " = argmax ! * + ", -, . ) Decompose " = argmax ! / + (-) 2 3 + (", .) ) Phrase encoder Question encoder

Decomposability is a strong constraint

Phrase-Indexed QA (PIQA) Challenge • Open-domain QA is hard to setup or evaluate • Instead, benchmark on existing datasets (e.g. SQuAD) • Create two models: • Phrase (document) encoder • Question encoder • Phrase encoder must be question-agnostic , and vice versa • Answer must be obtained via nearest neighbor search (NNS)

PI-SQuAD Evaluation

Is it too easy or too hard?

BERT (Devlin et al., 2018) 92% F1 SQuAD v1.1 Red color is phrase- SA+ELMo (Peters et al., 2018) 86% F1 indexed. Decomposability gap SA+ELMo (Seo et al., 2018) 64% F1 Feature-based (Rajpurkar et al., 2018) 50% F1

BERT (Devlin et al., 2018) 92% F1 SQuAD v1.1 Red color is phrase- SA+ELMo (Peters et al., 2018) 86% F1 indexed. Sparse+SA+ELMo 70% F1 Match-LSTM (Wang & Jiang., 2017) First neural model 68% F1 SA+ELMo (Seo et al., 2018) 64% F1 Feature-based (Rajpurkar et al., 2018) 50% F1

Phrase Representation Learning • Not just about scalability, but also about comprehension • Standalone representations of phrases (document) PIQA can be viewed as: • A phrase embedding evaluation method • Sentence embedding in SNLI (Bowman et al., 2015) • Constructing a memory of knowledge • Memory Networks (Weston et al., 2014)

According to the American Library Association , this makes … … tasked with drafting a European Charter of Human Rights , … Named Entities

The LM engines were successfully test- fired and restarted, … Steam turbines were extensively applied … Lexical & Syntactic Similarity

… primarily accomplished through the ductile stretching and thinning . … directly derived from the homogeneity or symmetry of space … Syntactic Similarity

Demo on my Macbook Corpus size: 300k Tokens (SQuAD dev set) 16 CPUs: 100s+ GPU: 10s+

A lot of things to do • Closing the gap due to decomposability constraint • BERT (Devlin et al., 2018)? • Reducing index storage (100TB+ for Wikipedia) • Reducing phrase embedding dimension (1024) • Extending to open-domain QA • Analyzing phrase representations • And more!

http://pi-qa.com Thank you!

Phrase-Indexed Question Answering : A New Challenge for Scalable - PowerPoint PPT Presentation

Phrase-Indexed Question Answering : A New Challenge for Scalable Document Comprehension Minjoon Seo, Tom Kwiatkowski, Ankur Parikh, Ali Farhadi, Hannaneh Hajishirzi Question Answering? 1961 Model Barack Obama (1961-present) was the 44 th

A Method of Cross-Lingual Question-Answering Based on Machine Translation and Noun Phrase

Semantic Question Answering on Big Data Tatiana Erekhinskaya July, 2016 The Goal Challenge:

Question Answering What is Ques+on Answering? Dan Jurafsky Ques%on

A Multilingual Hybrid Question-Answering System Cross-Lingual Open-Domain Question Answering

Question Answering and AnswerFinder Diego Moll a Centre for Language Technology Department of

Statistical NLP Spring 2011 Lecture 26: Question Answering Dan Klein UC Berkeley Question

Question Answering Statistical NLP Following largely from Chris Mannings slides, which

Designing deep architectures for Visual Question Answering Matthieu Cord Sorbonne University

Question Answering and Reading Comprehension Kevin Duh Fall 2019, Intro to HLT, Johns Hopkins

Towards End-to-End Reasoning for Question Answering Minjoon Seo Department of Computer Science

Question-Answering: Shallow & Deep Techniques for NLP Deep Processing Techniques for NLP

Factoid Question Answering Roy Aslan (ra2752@Columbia.edu) A Neural Network for Factoid

Question-Answering: Shallow & Deep Techniques for NLP Ling571 Deep Processing Techniques

Question Classification in English-Chinese Cross-Language Question Answering: An Integrated

Self-Critical Reasoning for Robust Visual Question Answering Jialin Wu and Raymond J. Mooney

Question Answering Spring 2020 2020-04-02 Adapted from slides from Danqi Chen and Karthik

Chess Q&A : Question Answering on Chess Games Reasoning, Attention, Memory NIPS Workshop 12

CQARank:Jointly Model Topics and Expertise in Community Question Answering Liu Yang, Minghui Qiu,

An Question Recommendation System for Question Answer Community (Stackoverflow) Presenter: Haoyu

Question Answering Alexander Solovyev Bauman Moscow Sate Technical University a-soloviev@mail.ru

Neural Question Answering at BioASQ 5B Georg Wiese, Dirk Weissenborn, Mariana Neves Motivation

Question Answering & the Semantic Web Gnter Neumann Language Technology-Lab DFKI,

Paper Reading Jun Gao June 26, 2018 Tencent AI Lab Neural Generative Question Answering

Building a Smart Question Answering System from Scratch Minjoon Seo PhD Student University of

Phrase-Indexed Question Answering : A New Challenge for Scalable - PowerPoint PPT Presentation

Phrase-Indexed Question Answering : A New Challenge for Scalable Document Comprehension Minjoon Seo, Tom Kwiatkowski, Ankur Parikh, Ali Farhadi, Hannaneh Hajishirzi Question Answering? 1961 Model Barack Obama (1961-present) was the 44 th

A Method of Cross-Lingual Question-Answering Based on Machine Translation and Noun Phrase

Semantic Question Answering on Big Data Tatiana Erekhinskaya July, 2016 The Goal Challenge:

Question Answering What is Ques+on Answering? Dan Jurafsky Ques%on

A Multilingual Hybrid Question-Answering System Cross-Lingual Open-Domain Question Answering

Question Answering and AnswerFinder Diego Moll a Centre for Language Technology Department of

Statistical NLP Spring 2011 Lecture 26: Question Answering Dan Klein UC Berkeley Question

Question Answering Statistical NLP Following largely from Chris Mannings slides, which

Designing deep architectures for Visual Question Answering Matthieu Cord Sorbonne University

Question Answering and Reading Comprehension Kevin Duh Fall 2019, Intro to HLT, Johns Hopkins

Towards End-to-End Reasoning for Question Answering Minjoon Seo Department of Computer Science

Question-Answering: Shallow &amp; Deep Techniques for NLP Deep Processing Techniques for NLP

Factoid Question Answering Roy Aslan (ra2752@Columbia.edu) A Neural Network for Factoid

Question-Answering: Shallow &amp; Deep Techniques for NLP Ling571 Deep Processing Techniques

Question Classification in English-Chinese Cross-Language Question Answering: An Integrated

Self-Critical Reasoning for Robust Visual Question Answering Jialin Wu and Raymond J. Mooney

Question Answering Spring 2020 2020-04-02 Adapted from slides from Danqi Chen and Karthik

Chess Q&amp;A : Question Answering on Chess Games Reasoning, Attention, Memory NIPS Workshop 12

CQARank:Jointly Model Topics and Expertise in Community Question Answering Liu Yang, Minghui Qiu,

An Question Recommendation System for Question Answer Community (Stackoverflow) Presenter: Haoyu

Question Answering Alexander Solovyev Bauman Moscow Sate Technical University a-soloviev@mail.ru

Neural Question Answering at BioASQ 5B Georg Wiese, Dirk Weissenborn, Mariana Neves Motivation

Question Answering &amp; the Semantic Web Gnter Neumann Language Technology-Lab DFKI,

Paper Reading Jun Gao June 26, 2018 Tencent AI Lab Neural Generative Question Answering

Building a Smart Question Answering System from Scratch Minjoon Seo PhD Student University of

Question-Answering: Shallow & Deep Techniques for NLP Deep Processing Techniques for NLP

Question-Answering: Shallow & Deep Techniques for NLP Ling571 Deep Processing Techniques

Chess Q&A : Question Answering on Chess Games Reasoning, Attention, Memory NIPS Workshop 12

Question Answering & the Semantic Web Gnter Neumann Language Technology-Lab DFKI,