Co Commonsense for r Generative Mu Multi-Ho Hop Ques p Questio - PowerPoint PPT Presentation

Co Commonsense for r Generative Mu Multi-Ho Hop Ques p Questio ion n An Answering Tasks EMNLP2018 UNC Chapel Hill （北卡罗来纳大学教堂山分校） Lisa Bauer* Yicheng Wang* Mohit Bansal Xiachong Feng

Au Author Lisa Bauer • Second year Ph.D. UNC Chapel Hill • B.A. Johns Hopkins University • natural language generation 、 QA • Dialogue 、 deep reasoning • knowledge-based inference Mohit Bansal • Director of the UNC-NLP Lab • Assistant Professor • Ph.D. from the UC Berkeley

Co Commonsense fo for Gener Generativ ive e Mu Multi-Ho Hop Ques p Questio ion n An Answering Tasks

QA QA Da Datas aset • Task Machine reading comprehension (MRC) based QA , • asking it to answer a question based on a passage of relevant content. • Dataset • bAbI ： smaller lexicons and simpler passage structures • CNN/DM 、 SQuAD ： fact-based 、 answer extraction 、 select a context span • Qangaroo(WikiHop): extractive dataset 、 multi-hop reasoning bAbI

QA QA Da Datas aset • Dataset • NarrativeQA generative dataset • includes fictional stories, which are 1,567 complete stories from books and movie scripts , with human written questions and answers based solely on human- generated abstract summaries. • There are 46,765 pairs of answers to questions written by humans and includes mostly the more complicated variety of questions such as “when / where / who / why”. • Requiring multi-hop reasoning for long, complex stories • Experiment • Qangaroo: extractive dataset 、 multi-hop reasoning • NarrativeQA: generative dataset 、 multi-hop reasoning

Common Commonsense Dataset • ConceptNet • Large-scale graphical commonsense databases

Ta Task • generative QA • Input: • Context • Query • Output ： • series of answer tokens ：

Mod Model ov overview • Multi-Hop Pointer-Generator Model (MHPGM) • baseline model • Baseline reasoning cell • multiple hops of bidirectional attention • self-attention • pointer-generator decoder • Necessary and Optional Information Cell (NOIC) • NOIC Reasoning Cell • Choose knowledge • pointwise mutual information (PMI) • term-frequency-based scoring function • Insert knowledge • Selectively gated attention mechanism

Mul Multi ti-Ho Hop Pointer er-Ge Generator or Mod odel

Em Embe beddi dding ng Layer • learned embedding space of dimension d • pretrained embedding from language models (ELMo) • The embedded representation for each word in the context or question ：

Re Reasoning layer • k reasoning cells • The reasoning cell’s inputs are the previous step’s output and the embedded question • First creates step-specific context and query encodings via cell-specific bidirectional LSTMs:

Re Reasoning layer • Use bidirectional attention to emulate a hop of resoning by focusing on relevant aspects of the context. • Context-to-query attention About Query • Query-to-context attention About Context • Final

Se Self-At Attention Layer • Residual static self-attention mechanism • Input ： output of the last reasoning cell 1. fully-connected layer 2. a bi-directional LSTM Self attention representation • • Output of the self-attention layer is generated by another layer of bidirectional LSTM. • Final encoded context:

Po Pointer-Ge Generator De Decodin ing Layer • embedded representation of last timestep’s output • the last time step’s hidden state • context vector

Mu Multi-Ho Hop p Poin inter er-Ge Generator Model BiDAF • cell-specific bidirectional LSTMs • • Attention context-to-query attention Copy • • query-to-context attention Generate • • Word embedding • • fully-connected layer • ELMo a bi-directional LSTM • Self attention • • a bi-directional LSTM residually •

Commonsense Select ction Representation • QA tasks often needs knowledge of relations not directly stated in the context • Key idea • Introducing useful connections between concepts in the context and question via ConceptNet 1. collect potentially relevant concepts via a tree construction method 2. rank and filter these paths to ensure both the quality and variety of added via a 3-step scoring strategy

Tr Tree Construction (2)Multi-Hop (1)Direct Interaction select relations in select relations r1 from ConceptNet r2 that link ConceptNet that directly c2 to another concept in link c1 to a concept the context, c3 ∈ C. within the context c2 ∈ C For each concept c1 in the question (3)Outside Knowledge an unconstrained hop into c3 ’s neighbors in (4)Context-Grounding ConceptNet connecting c4 to c5 ∈ C

Ex Exampl ple

Ra Rank k and Fi Filter er(3 (3-st step scoring method) • Initial Node Scoring • For c2 、 c3 、 c5 • Term frequency • Heuristic: important concepts occur more frequently • |C| is the context length and count() is the number of times a concept appears in the con text. • For c4 • want c4 to be a logically consistent next step in reasoning following the path of c1 to c3 • Heuristic: logically consistent paths occur more frequently • Pointwise Mutual Information (PMI)

Ra Rank k and Fi Filter er(3 (3-st step scoring method) • Initial Node Scoring • For c4 • Pointwise Mutual Information (PMI) • normalized PMI (NPMI) • Normalize each node’s score against its siblings

Ra Rank k and Fi Filter er(3 (3-st step scoring method) • Cumulative Node Scoring • re-score each node based not only on its relevance and saliency but also that of its tree descendants . • When at the leaf nodes • c-score = n-score • for cl not a leaf node • c-score(cl) = n-score(cl) + f(cl) • f of a node is the average of the c-scores of its top 2 highest scoring children lady → mother → daughter(high) lady → mother → married(high) lady → mother → book(low) example

Ra Rank k and Fi Filter er(3 (3-st step scoring method) 1. Starting at the root 2. recursively take two of its children with the highest cumulative scores 3. until reach a leaf Final: directly give these paths to the model as sequences of tokens.

Common Commonsense Mod Model Incorp orpor oration on Given: • list of commonsense logic paths as sequences of words • Example : <lady, AtLocation, church, RelatedTo, house, • RelatedTo, child, RelatedTo, their> Necessary and Optional Information Cell (NOIC) • concatenating the embedded • commonsense project it to the same dimension • as attention between • commonsense and the context

To Total Model

Expe Experiment • Dataset • generative NarrativeQA • extractive QAngaroo WikiHop • For multiple-choice WikiHop, we rank candidate responses by their generation probability. • Metric • NarrativeQA • Bleu-1 、 Bleu-4 、 METEOR 、 RougeL 、 CIDEr • WikiHop • Accuracy

Re Result • NarrativeQA • WikiHop

Mod Model A Ablation ons

Common Commonsense Ab Ablation ons NumberBatch :naively add ConceptNet information by • initializing the word embeddings with the ConceptNet-trained embeddings In-domain noise :giving each context-query pair a set of random • relations grounded in other context-query pairs Using a single hop from the query to the context. •

Hum Human an Evalua aluatio tion n Analy nalysis is Commonsense Selection • Model Performance •

Con Conclusion on Effective reasoning-generative QA architecture • 1. multiple hops of bidirectional attention and a pointer- generator decoder 2. select grounded, useful paths of commonsense knowledge 3. Necessary and Optional Information Cell (NOIC) New state-of-the-art on NarrativeQA. •

Th Thank yo you!

Co Commonsense for r Generative Mu Multi-Ho Hop Ques p Questio - PowerPoint PPT Presentation

Co Commonsense for r Generative Mu Multi-Ho Hop Ques p Questio ion n An Answering Tasks EMNLP2018 UNC Chapel Hill Lisa Bauer* Yicheng Wang* Mohit Bansal Xiachong Feng Au Author Lisa Bauer

Commonsense for Generative Multi-Hop Question Answering Tasks Lisa Bauer* Yicheng Wang* Mohit

Commonsense benchmarks Or how to measure that your model is actually doing some commonsense

Commonsense Knowledge in Pre-trained Language Models Vered Shwartz July 5th, 2020 Commonsense

generative design systems Generative Brief Design Definitions Workshop Processes

Multi-Hop RC, HotpotQA & GNNs Select, Answer and Explain: Interpretable Multi-hop Reading

Which Material Design Is Commonsense . . . Possible Under Additive Commonsense . . . How

Representing Knowledge Dustin Smith MIT Media Lab July 2008 Commonsense Computing MIT MediaLab

Agenda 08:00 PST 1 hr 50 mins Part I - Review of CSKGs 15 min Introduction to commonsense

Agenda 08:00 PST 1 hr 50 mins Part I - Review of CSKGs 15 min Introduction to commonsense

HIP HOP NARRATIVES: POWER, PRIVILEGE AND PREJUDICE Diego R. Mancha THE CONCEPTS Hip Hop

Use of OSPF-MDR in Single-Hop Broadcast Networks draft-ogier-ospf-manet-single-hop-00 Richard

One Hop Lookups Plugin for RELOAD IETF81@Quebec, Canada draft-peng-p2psip-one-hop-plugin-00 Jin

MPLS Basics Penultimate Hop Popping How a router determines the outgoing interface: Last Hop

Generative networks part 2: GANs 23 / 54 Recap on generative networks Generative networks provide

Question Answering What is Ques+on Answering? Dan Jurafsky Ques%on

Latency-Reliability Tradeoff for Different Hop-Level ARQ-based Error Recovery in a Multi-Hop

Vision and Language Learning with Graph Neural Networks Linchao Zhu 22 Apr, 2020 Recognition,

Byzantine Fault Tolerance Consensus Strikes Back (continued) Announcements Lab 2 Due in

POIR 613: Computational Social Science Pablo Barber a School of International Relations

Track 1 Lightning Talk: Forking as a Tool for Software SustainabilityAn Empirical Study. SA

The AI Future of Math, Logic, and Humanity AITP-2019 Assume a future where AI does

The Complexity of Reasoning for Fragments of Default Logic Heribert Vollmer Joint work with O.

How Far are We from Effective Context Modeling? An Exploratory Study on Semantic Parsing in

Calculi for Reasoning About Action and Knowledge Dimitris Plexousakis, Theodore Patkos {dp,

Co Commonsense for r Generative Mu Multi-Ho Hop Ques p Questio - PowerPoint PPT Presentation

Co Commonsense for r Generative Mu Multi-Ho Hop Ques p Questio ion n An Answering Tasks EMNLP2018 UNC Chapel Hill Lisa Bauer* Yicheng Wang* Mohit Bansal Xiachong Feng Au Author Lisa Bauer

Commonsense for Generative Multi-Hop Question Answering Tasks Lisa Bauer* Yicheng Wang* Mohit

Commonsense benchmarks Or how to measure that your model is actually doing some commonsense

Commonsense Knowledge in Pre-trained Language Models Vered Shwartz July 5th, 2020 Commonsense

generative design systems Generative Brief Design Definitions Workshop Processes

Multi-Hop RC, HotpotQA &amp; GNNs Select, Answer and Explain: Interpretable Multi-hop Reading

Which Material Design Is Commonsense . . . Possible Under Additive Commonsense . . . How

Representing Knowledge Dustin Smith MIT Media Lab July 2008 Commonsense Computing MIT MediaLab

Agenda 08:00 PST 1 hr 50 mins Part I - Review of CSKGs 15 min Introduction to commonsense

Agenda 08:00 PST 1 hr 50 mins Part I - Review of CSKGs 15 min Introduction to commonsense

HIP HOP NARRATIVES: POWER, PRIVILEGE AND PREJUDICE Diego R. Mancha THE CONCEPTS Hip Hop

Use of OSPF-MDR in Single-Hop Broadcast Networks draft-ogier-ospf-manet-single-hop-00 Richard

One Hop Lookups Plugin for RELOAD IETF81@Quebec, Canada draft-peng-p2psip-one-hop-plugin-00 Jin

MPLS Basics Penultimate Hop Popping How a router determines the outgoing interface: Last Hop

Generative networks part 2: GANs 23 / 54 Recap on generative networks Generative networks provide

Question Answering What is Ques+on Answering? Dan Jurafsky Ques%on

Latency-Reliability Tradeoff for Different Hop-Level ARQ-based Error Recovery in a Multi-Hop

Vision and Language Learning with Graph Neural Networks Linchao Zhu 22 Apr, 2020 Recognition,

Byzantine Fault Tolerance Consensus Strikes Back (continued) Announcements Lab 2 Due in

POIR 613: Computational Social Science Pablo Barber a School of International Relations

Track 1 Lightning Talk: Forking as a Tool for Software SustainabilityAn Empirical Study. SA

The AI Future of Math, Logic, and Humanity AITP-2019 Assume a future where AI does

The Complexity of Reasoning for Fragments of Default Logic Heribert Vollmer Joint work with O.

How Far are We from Effective Context Modeling? An Exploratory Study on Semantic Parsing in

Calculi for Reasoning About Action and Knowledge Dimitris Plexousakis, Theodore Patkos {dp,

Multi-Hop RC, HotpotQA & GNNs Select, Answer and Explain: Interpretable Multi-hop Reading