Discourse Marker Augmented Network with Reinforcement Learning for - PowerPoint PPT Presentation

Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference
Authors: Boyuan Pan, Yazheng Yang, Zhou Zhao, Yueting Zhuang, Deng Cai, Xiaofei He
Organization: Zhejiang University, China

What is Natural Language Inference (NLI)?
Given a premise and a hypothesis, decide which relation holds between them:
• Entailment
• Neutral
• Contradiction

Applications
• Question Answering
• Machine Translation
• Semantic Search
• Text Summarization
• …

Discourse Marker
• A discourse marker is a word or a phrase that plays a role in managing the flow and structure of discourse.
• Examples: so, because, and, but, or, …

Discourse Marker & NLI?
[Figure: the discourse markers But, Because, If, Although, And, So shown alongside the NLI labels Entailment, Neutral, Contradiction.]

Related Works
• Datasets: SNLI (Bowman et al., 2015), MultiNLI (Williams et al., 2017)
• SOTA Neural Network Models: CAFE (Tay et al., 2017), KIM (Chen et al., 2017), DIIN (Gong et al., 2018)
• Transfer Learning for NLI: Skip-thoughts (Vendrov et al., 2016), CoVe (McCann et al., 2017)
• Discourse Marker Applications: DisSent (Nie et al., 2017)

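Discourse Marker Prediction (DMP)
Example: “It’s rainy outside but we will not take the umbrella” is split into S1 = “It’s rainy outside”, marker = “But”, and S2 = “We will not take the umbrella”.
Given the pair (S1, S2), a neural network predicts the marker M from candidates such as So, Because, But, …, If.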
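The split in the example above could be produced along the following lines. This is only a minimal sketch: the function name `split_on_marker` and the marker list are illustrative assumptions (the list reuses markers named on the slides, not necessarily the markers used to build the training set), and a real pipeline would likely need more careful linguistic filtering.

```python
# Hypothetical marker list, taken from the markers named on the slides.
MARKERS = ("so", "because", "but", "if", "although", "and")

def split_on_marker(sentence, markers=MARKERS):
    """Return (s1, marker, s2) for the first marker found mid-sentence, else None."""
    tokens = sentence.split()
    for i, token in enumerate(tokens):
        word = token.lower().strip(",.;")
        # The marker must have text on both sides to yield a sentence pair.
        if word in markers and 0 < i < len(tokens) - 1:
            s1 = " ".join(tokens[:i])
            s2 = " ".join(tokens[i + 1:])
            return s1, word, s2
    return None

example = split_on_marker("It's rainy outside but we will not take the umbrella")
```

Here `example` holds the two halves of the sentence and the marker that joined them; the pair (S1, S2) is the model input and the marker is the prediction target.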

Discourse Marker Prediction (DMP)
[Figure: Sentence1 and Sentence2 → GloVe embeddings → BiLSTM → sentence representations (last hidden state; max pooling over all the hidden states) → Prediction. The figure is labelled “To Be Transferred”.]
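The two pooling operations named on this slide can be illustrated without any deep learning library. The sketch below assumes the BiLSTM has already produced one hidden vector per token, and it assumes the two pooled vectors are concatenated; the slide does not say how they are combined, and the function name is hypothetical.

```python
def sentence_representation(hidden_states):
    """Combine the last hidden state with a max pool over all hidden states.

    hidden_states: list of per-token vectors (lists of floats), all the same length.
    Concatenation is an assumption made for this sketch.
    """
    last_state = list(hidden_states[-1])
    # zip(*hidden_states) groups values by dimension, so max() runs over time steps.
    max_pooled = [max(dimension) for dimension in zip(*hidden_states)]
    return last_state + max_pooled

states = [[0.1, -0.5], [0.7, 0.2], [0.3, 0.4]]
representation = sentence_representation(states)
```

For the three toy states above, the first half of `representation` is the final state and the second half is the per-dimension maximum.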

Discourse Marker Augmented Network (NLI Model): Encoding Layer
[Figure: the premise and the hypothesis are each represented with GloVe, Char, POS, NER, and EM features and passed through a BiLSTM.]

Discourse Marker Augmented Network (NLI Model)
[Figure: the same encoding layer, with the pre-trained DMP model added; its BiLSTM produces sentence representations for the premise and the hypothesis.]

Discourse Marker Augmented Network (NLI Model): Interaction
Similarity Matrix
[Equation: the similarity matrix, annotated with these terms: the sentence representation of the premise, the sentence representation of the hypothesis, the i-th word of the premise, the j-th word of the hypothesis.]
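As a rough illustration of what a word-by-word similarity matrix looks like, the sketch below scores the i-th premise word against the j-th hypothesis word with a plain dot product. This is a stand-in and not the slide's formula: the slide's similarity function also involves the two sentence representations, which are left out here, and the function name is hypothetical.

```python
def similarity_matrix(premise_words, hypothesis_words):
    """Entry [i][j] is the dot product of premise word i and hypothesis word j.

    Stand-in scoring function (assumption); the sentence representations
    mentioned on the slide are deliberately omitted from this sketch.
    """
    return [
        [sum(p * h for p, h in zip(p_vec, h_vec)) for h_vec in hypothesis_words]
        for p_vec in premise_words
    ]

premise = [[1.0, 0.0], [0.0, 2.0]]                   # 2 premise words
hypothesis = [[1.0, 1.0], [0.5, 0.0], [0.0, 1.0]]    # 3 hypothesis words
matrix = similarity_matrix(premise, hypothesis)      # 2 rows, 3 columns
```

The result has one row per premise word and one column per hypothesis word, which is the shape an attention mechanism would then normalise over.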

Discourse Marker Augmented Network (NLI Model)
[Figure components: similarity matrix; attention mechanism; modeling vector of the premise; modeling vector of the hypothesis; the sentence representation of the premise; the sentence representation of the hypothesis; prediction.]

Training
Cross Entropy Loss
Correct Label: neutral
Original Labels: neutral, neutral, entailment, entailment, neutral
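The relationship between the original labels and the correct label on this slide is consistent with a majority vote, and the cross entropy loss then depends only on the probability assigned to that one label. The sketch below shows both steps; the function names and the predicted probabilities are made up for illustration.

```python
import math
from collections import Counter

def majority_label(original_labels):
    """Pick the most frequent annotator label (assumed to be the correct label)."""
    return Counter(original_labels).most_common(1)[0][0]

def cross_entropy(predicted_probs, correct_label):
    """Negative log-probability that the model assigns to the correct label."""
    return -math.log(predicted_probs[correct_label])

original = ["neutral", "neutral", "entailment", "entailment", "neutral"]
correct = majority_label(original)

# Hypothetical model output, for illustration only.
probs = {"entailment": 0.4, "neutral": 0.5, "contradiction": 0.1}
loss = cross_entropy(probs, correct)
```

Note that the loss is the same whether the annotators voted 3 to 2 or 5 to 0, so the disagreement visible in the original labels is not used by this objective.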

Training
Previous action policy: the policy that predicts the label given P (the premise) and H (the hypothesis).
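One way a policy's predicted label could be scored against all of the original labels, rather than only the majority label, is by the fraction of annotators who chose it. The sketch below assumes that reward definition; the slide's own formula did not survive, so this is an illustration of the idea and not necessarily the authors' exact objective.

```python
def agreement_reward(predicted_label, original_labels):
    """Fraction of annotators whose label matches the prediction (assumed reward)."""
    return original_labels.count(predicted_label) / len(original_labels)

original = ["neutral", "neutral", "entailment", "entailment", "neutral"]
rewards = {
    label: agreement_reward(label, original)
    for label in ("entailment", "neutral", "contradiction")
}
```

With the five original labels from the Training slide, a prediction of "neutral" earns the largest reward, "entailment" earns a smaller but non-zero one, and "contradiction" earns none, so partial annotator agreement is reflected in the training signal.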

Experiments (Datasets)
• Stanford Natural Language Inference (SNLI) (Bowman et al., 2015): 570k human-annotated sentence pairs
• Multi-Genre Natural Language Inference (MultiNLI) (Williams et al., 2017): 433k human-annotated sentence pairs
• BookCorpus (Zhu et al., 2015): 6.5M pairs of sentences for 8 discourse markers

Experiments (Results)
[Table: results grouped into Sentence Encoding-Based Models, Other Neural Network Models, and Ensemble Models.]

Experiments (Analysis)

Experiments (Analysis)
Premise: “3 young man in hoods standing in the middle of a quiet street facing the camera.”
Hypothesis: “Three people sit by a busy street bare-headed.”

Conclusion
• We address natural language inference by transferring knowledge from another supervised task.
• We propose a new objective function that makes full use of the labels’ information.
• In future work, we would like to explore other transfer learning sources.

Thank You!
