Multi-task Attention-based Neural Networks for Implicit Discourse Relationship Representation and Identification
Man Lan, Jianxiang Wang, Yuanbin Wu, Zheng-Yu Niu, Haifeng Wang
Presented by: Aidan San
Implicit Discourse Relation
● “to recognize how two adjacent text spans without explicit discourse marker (i.e., connective, e.g., because or but) between them are logically connected to one another (e.g., cause or contrast)”
Sense Tags
Implicit Discourse Relation - Motivations
● Discourse Analysis
● Language Generation
● QA
● Machine Translation
● Sentiment Analysis
Summary
● Attention-based neural network conducts discourse relationship representation learning
● Multi-task learning framework leverages knowledge from an auxiliary task
Recap - Attention
● Use a vector to scale certain parts of the input so you can “focus” more on that part of the input
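To make the recap concrete, here is a minimal dot-product attention sketch in NumPy (illustrative only, not the paper's formulation): similarity scores against a query vector are normalized with softmax and used to form a weighted summary of the inputs.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def attend(inputs, query):
    """Weight each input vector by its similarity to a query vector.

    inputs: (n, d) array of input representations
    query:  (d,) vector that decides where to "focus"
    """
    scores = inputs @ query            # similarity of each position to the query
    weights = softmax(scores)          # normalize to a distribution over positions
    return weights @ inputs, weights   # weighted summary and the attention weights

# Toy usage: three 4-dimensional inputs, focus driven by a query vector.
rng = np.random.default_rng(0)
inputs = rng.normal(size=(3, 4))
query = rng.normal(size=4)
summary, weights = attend(inputs, query)
print(weights)   # larger weight = more "focus" on that input position
```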
Recap - Multi-Task Learning
● Simultaneously train your model on another task to augment your model with additional information
● PS: Nothing crazy in this paper like training with images
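As a quick illustration of the multi-task idea (hard parameter sharing), here is a hypothetical PyTorch sketch: one shared encoder feeds two task-specific heads. Names and sizes are illustrative, not the paper's model.

```python
import torch
from torch import nn

class SharedEncoderMTL(nn.Module):
    """Minimal hard-parameter-sharing setup: one shared encoder, one head per task.

    Illustrative sketch of multi-task learning in general, not the paper's model.
    """
    def __init__(self, in_dim=50, hidden=80, n_main=4, n_aux=4):
        super().__init__()
        self.shared = nn.Sequential(nn.Linear(in_dim, hidden), nn.Tanh())
        self.main_head = nn.Linear(hidden, n_main)   # main task head
        self.aux_head = nn.Linear(hidden, n_aux)     # auxiliary task head

    def forward(self, x, task="main"):
        h = self.shared(x)                           # shared representation
        return self.main_head(h) if task == "main" else self.aux_head(h)

model = SharedEncoderMTL()
x = torch.randn(8, 50)
print(model(x, task="main").shape, model(x, task="aux").shape)
```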
Motivation - Attention
● Contrast information can come from different parts of a sentence
○ Tenses - Previous vs. Now
○ Entities - Their vs. Our
○ Whole arguments
● Attention selects the most important part of the arguments
Motivation - Multi-Task Learning
● Lack of labeled data
● Information from unlabeled data may be helpful
LSTM Neural Network
Bi-LSTM (diagram: Concatenate, Sum-Up Hidden States, Concatenate)
Attention Neural Network
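A hypothetical sketch of the argument encoder shown on these slides: a Bi-LSTM runs over word embeddings and attention over its hidden states produces a single argument vector. The scorer (tanh plus a learned vector) and the dimensions are assumptions; the paper's exact equations may differ.

```python
import torch
from torch import nn

class AttnBiLSTMEncoder(nn.Module):
    """Encode one discourse argument: Bi-LSTM over word embeddings, then
    attention over the hidden states to produce a single argument vector.

    Sketch of the general recipe; the paper's exact scoring function may differ.
    """
    def __init__(self, emb_dim=50, hidden=50):
        super().__init__()
        self.bilstm = nn.LSTM(emb_dim, hidden, bidirectional=True, batch_first=True)
        self.score = nn.Linear(2 * hidden, 1, bias=False)   # attention scorer

    def forward(self, emb):                   # emb: (batch, seq_len, emb_dim)
        H, _ = self.bilstm(emb)               # (batch, seq_len, 2 * hidden)
        scores = self.score(torch.tanh(H))    # (batch, seq_len, 1)
        alpha = torch.softmax(scores, dim=1)  # attention weights over positions
        return (alpha * H).sum(dim=1)         # weighted sum: (batch, 2 * hidden)

enc = AttnBiLSTMEncoder()
arg = torch.randn(2, 12, 50)                  # batch of 2 arguments, 12 tokens each
print(enc(arg).shape)                         # torch.Size([2, 100])
```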
What is the other task?
● Not really a different task
● Using the explicit data for the same task
Multi-task Attention-based Neural Network
Knowledge Sharing Methods
1. Equal Share
2. Weighted Share
3. Gated Interaction
Gated Interaction Cont.
● Acts as a gate to control how much information goes to the end result
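A hedged sketch of the three sharing schemes from the previous slide, with hypothetical names: equal share adds the two representations, weighted share scales the auxiliary one by a fixed weight, and gated interaction learns a sigmoid gate that controls how much auxiliary information reaches the end result. The paper's exact equations may differ.

```python
import torch
from torch import nn

class GatedShare(nn.Module):
    """Combine a main-task representation r_main with an auxiliary one r_aux.

    Illustrative sketch of the three knowledge-sharing methods; not the paper's
    exact formulas. The sigmoid gate lies in [0, 1] and controls information flow.
    """
    def __init__(self, dim=80):
        super().__init__()
        self.gate = nn.Linear(2 * dim, dim)   # learned gate (gated interaction)

    def forward(self, r_main, r_aux, mode="gated", w=0.5):
        if mode == "equal":        # 1. Equal Share
            return r_main + r_aux
        if mode == "weighted":     # 2. Weighted Share (fixed weight w)
            return r_main + w * r_aux
        # 3. Gated Interaction: gate decides how much of r_aux flows through
        g = torch.sigmoid(self.gate(torch.cat([r_main, r_aux], dim=-1)))
        return r_main + g * r_aux

share = GatedShare()
r_main, r_aux = torch.randn(4, 80), torch.randn(4, 80)
print(share(r_main, r_aux, mode="gated").shape)
```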
Datasets - PDTB 2.0
● Largest annotated corpus of discourse relations
● 2,312 Wall Street Journal (WSJ) articles
● Four top-level senses: Comparison (denoted as Comp.), Contingency (Cont.), Expansion (Exp.), and Temporal (Temp.)
Datasets - CoNLL-2016
● Test set - from PDTB
● Blind set - from English Wikinews
● Merges labels to reduce sparsity
Datasets - BLLIP
● The North American News Text corpus
● Unlabeled data
● Removing explicit discourse connectives -> synthetic implicit relations (sketch below)
● 100,000 relations from random sampling
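To make the synthetic-implicit idea concrete: drop the explicit connective and keep the sense it signals as the label. The connective-to-sense mapping and function name below are illustrative only, not the actual PDTB lexicon or the authors' pipeline.

```python
# Illustrative mapping; the real connective-sense lexicon is much larger.
CONNECTIVE_SENSE = {"because": "Contingency", "but": "Comparison"}

def to_synthetic_implicit(arg1, connective, arg2):
    """Turn an explicit relation into a synthetic implicit training example."""
    sense = CONNECTIVE_SENSE.get(connective.lower())
    if sense is None:
        return None                                           # unknown connective: skip
    return {"arg1": arg1, "arg2": arg2, "sense": sense}       # connective removed

print(to_synthetic_implicit("It was raining", "because", "the storm moved in"))
```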
Parameters
● Word2Vec dimension: 50
● PDTB
○ Hidden state dimension: 50
○ Multi-task framework hidden layer size: 80
● CoNLL-2016
○ Hidden state dimension: 100
○ Multi-task framework hidden layer size: 80
Parameters (cont.)
● Dropout: 0.5 (applied to the penultimate layer)
● Cross-entropy loss
● AdaGrad
○ Learning rate: 0.001
● Minibatch size: 64
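A sketch of how the listed hyperparameters fit together in one training step. The small model here is a stand-in, not the paper's architecture; only the settings (cross-entropy, AdaGrad with learning rate 0.001, dropout 0.5 on the penultimate layer, minibatch size 64) come from the slides.

```python
import torch
from torch import nn

# Stand-in classifier: the architecture is illustrative, the hyperparameters are
# the ones listed on the slides.
model = nn.Sequential(
    nn.Linear(100, 80),   # e.g. multi-task framework hidden layer size 80
    nn.Tanh(),
    nn.Dropout(0.5),      # dropout 0.5 applied to the penultimate layer
    nn.Linear(80, 4),     # 4 top-level senses
)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adagrad(model.parameters(), lr=0.001)

features = torch.randn(64, 100)            # one minibatch of 64 examples
labels = torch.randint(0, 4, (64,))
optimizer.zero_grad()
loss = criterion(model(features), labels)
loss.backward()
optimizer.step()
print(float(loss))
```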
Results
Effect of Weight Parameter
● A low value of W reduces the weight of the auxiliary task and makes the model pay more attention to the main task
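One way to read this slide: if the joint objective scales the auxiliary loss by the weight parameter, a small W shrinks the auxiliary contribution. The form below is assumed for illustration, not quoted from the paper.

```python
# Assumed joint objective: main loss plus W times the auxiliary loss.
def joint_loss(loss_main, loss_aux, w):
    return loss_main + w * loss_aux

print(joint_loss(1.2, 0.8, w=0.2))   # small W: auxiliary task contributes little
```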
Conclusion
● Multi-task attention-based neural network
● Implicit discourse relationship
● Discourse arguments and interactions between annotated and unannotated data
● Outperforms state-of-the-art