Neural Classification of Linguistic Coherence using Long Short-Term Memories
Pashutan Modaresi, Matthias Liebeck and Stefan Conrad
Order of Sentences
The order of sentences is what makes a text semantically meaningful.
Correct order:
  Hi! My name is Alan. I am a computer scientist!
Permuted orders:
  My name is Alan. Hi! I am a computer scientist!
  I am a computer scientist! My name is Alan. Hi!
Humans vs. Machines
Discourse coherence, linguistic contradiction, linguistic redundancy, pragmatics.
Question: Is there a need to teach all these abilities to a machine?
Sentence Ordering
Given a pair of sentences s and s' (of sizes m and m'), predict their order: binary labels {0, 1} or ternary labels {-1, 0, +1}.
Example pair: "Hi!" / "My name is Alan."
Question: What about the sizes of m and m'? Should they be equal?
Many Applications!
Our focus was text summarization in the news domain.
Question: What are the other applications of sentence ordering?
Treat the problem as a classification task and minimize the negative log-likelihood
    L = -(1/N) Σ_{n=1}^{N} log p_n
where N is the number of instances and p_n is the class probability of the n-th pair.
Question: Why do we use the negative log-likelihood and not the log-likelihood?
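A minimal sketch of this loss on a toy batch (the probabilities, labels, and variable names below are illustrative, not the authors' code):

```python
import torch

# probs[n, c]: predicted probability of class c for the n-th pair (rows sum to 1)
probs = torch.tensor([[0.7, 0.2, 0.1],
                      [0.1, 0.8, 0.1]])
labels = torch.tensor([0, 1])          # gold class index of each pair

# p_n = probability assigned to the correct class of the n-th pair
p_n = probs[torch.arange(len(labels)), labels]
loss = -p_n.log().mean()               # negative log-likelihood, averaged over N
print(loss.item())
```

(Minimizing the negative log-likelihood is equivalent to maximizing the log-likelihood; the sign simply matches the convention that optimizers minimize.)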
Deep Neural Architecture
[Diagram: sentences S1 and S2 are fed through LSTM and Dropout layers and classified into {+1, 0, -1}]
Deep Neural Architecture: One-Hot Encoding
The input sentences S1 and S2 enter the network as one-hot encoded word vectors.
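A small sketch of one-hot encoding a tokenized sentence (the toy vocabulary and tokenization are assumptions for illustration):

```python
import torch

vocab = {"hi": 0, "my": 1, "name": 2, "is": 3, "alan": 4}  # toy vocabulary
tokens = ["hi", "my", "name", "is", "alan"]

ids = torch.tensor([vocab[t] for t in tokens])
one_hot = torch.nn.functional.one_hot(ids, num_classes=len(vocab)).float()
print(one_hot.shape)  # (sequence length, vocabulary size)
```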
Deep Neural Architecture: Embedding
Tip: An embedding is a simple matrix multiplication of the embedding matrix E with the input vector; initialize the matrix E.
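The tip can be checked directly: multiplying a one-hot vector by an embedding matrix E selects one row of E, which is exactly what an embedding lookup does (sizes below are arbitrary):

```python
import torch

vocab_size, emb_dim = 5, 3
E = torch.randn(vocab_size, emb_dim)           # embedding matrix E, to be initialized/learned

one_hot = torch.zeros(vocab_size)
one_hot[2] = 1.0                               # one-hot vector for word id 2

by_matmul = one_hot @ E                        # simple matrix multiplication
by_lookup = E[2]                               # equivalent row lookup
print(torch.allclose(by_matmul, by_lookup))    # True
```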
Deep Neural Architecture: Merge
Tip: Concatenate the embeddings of S1 and S2.
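A one-line sketch of the merge step, assuming both sentences have already been embedded; whether the concatenation runs along the time axis (shown here) or the feature axis is an implementation choice not fixed by the slide:

```python
import torch

emb_s1 = torch.randn(4, 8)   # embedded sentence S1 (4 tokens, dim 8)
emb_s2 = torch.randn(6, 8)   # embedded sentence S2 (6 tokens, dim 8)

merged = torch.cat([emb_s1, emb_s2], dim=0)  # concatenate along the time axis
print(merged.shape)          # torch.Size([10, 8])
```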
Deep Neural Architecture: Long Short-Term Memory
Tip: An LSTM is just a special kind of RNN that addresses the difficulties of plain RNNs.
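A minimal usage sketch of an LSTM layer over the merged sequence (all sizes are placeholders):

```python
import torch
import torch.nn as nn

lstm = nn.LSTM(input_size=8, hidden_size=16, batch_first=True)

x = torch.randn(2, 10, 8)            # (batch, time steps, features)
outputs, (h_n, c_n) = lstm(x)        # outputs: per-step hidden states; h_n: final hidden state
print(outputs.shape, h_n.shape)      # torch.Size([2, 10, 16]) torch.Size([1, 2, 16])
```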
Deep Neural Architecture: Dropout
Tip: Dropout sets a random subset of its inputs to zero; it is a form of regularization.
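A sketch of dropout in isolation: during training it zeroes a random subset of activations (and rescales the survivors), while at evaluation time it is the identity:

```python
import torch
import torch.nn as nn

drop = nn.Dropout(p=0.5)   # each element is zeroed with probability 0.5 during training

x = torch.ones(1, 8)
drop.train()
print(drop(x))             # roughly half the entries are 0, survivors scaled by 1/(1-p)
drop.eval()
print(drop(x))             # identity at evaluation time
```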
Deep Neural Architecture: Softmax
Tip: The final softmax layer turns the network's output scores into a probability distribution over the classes {+1, 0, -1}.
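Putting the pieces together, a minimal PyTorch sketch of such an architecture; the layer sizes, the stacking of two LSTM+Dropout blocks, and the merge strategy are assumptions read off the slides, not the authors' exact configuration:

```python
import torch
import torch.nn as nn

class CoherenceClassifier(nn.Module):
    def __init__(self, vocab_size=10000, emb_dim=128, hidden=64, num_classes=3):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, emb_dim)
        self.lstm1 = nn.LSTM(emb_dim, hidden, batch_first=True)
        self.drop1 = nn.Dropout(0.5)
        self.lstm2 = nn.LSTM(hidden, hidden, batch_first=True)
        self.drop2 = nn.Dropout(0.5)
        self.out = nn.Linear(hidden, num_classes)     # classes: +1, 0, -1

    def forward(self, s1_ids, s2_ids):
        merged = torch.cat([s1_ids, s2_ids], dim=1)   # merge the two sentences along time
        x = self.embedding(merged)
        x, _ = self.lstm1(x)
        x = self.drop1(x)
        _, (h_n, _) = self.lstm2(x)
        x = self.drop2(h_n[-1])                       # final hidden state as sequence summary
        return torch.softmax(self.out(x), dim=-1)     # class probabilities

model = CoherenceClassifier()
s1 = torch.randint(0, 10000, (2, 5))   # two toy sentence pairs, word ids only
s2 = torch.randint(0, 10000, (2, 7))
print(model(s1, s2).shape)             # torch.Size([2, 3])
```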
Data
How to collect the required data to train the network?
Binary: correct order vs. wrong order
Ternary: correct order (+1), wrong order (-1), missing context (0)
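Labels of this kind can be produced automatically from any corpus of well-formed texts, simply by keeping or permuting the original sentence order. A sketch of such a self-supervised labelling scheme; how the "missing context" class is constructed here is an assumption for illustration:

```python
import random

def make_pairs(sentences):
    """Build labelled pairs from a document given as an ordered list of sentences."""
    pairs = []
    for a, b in zip(sentences, sentences[1:]):
        pairs.append((a, b, +1))                      # correct order
        pairs.append((b, a, -1))                      # wrong order
    # missing context: two sentences that are not adjacent in the original text
    if len(sentences) > 2:
        i = random.randrange(len(sentences) - 2)
        j = random.randrange(i + 2, len(sentences))
        pairs.append((sentences[i], sentences[j], 0))
    return pairs

doc = ["Hi!", "My name is Alan.", "I am a computer scientist!"]
for s1, s2, label in make_pairs(doc):
    print(label, "|", s1, "|", s2)
```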
Baseline - SVM (Macro-Averaged F1)
          English              German
          Binary   Ternary     Binary   Ternary
SVM       0.24     0.16        0.25     0.16
SVMs: not really appropriate for sequential modeling.
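Macro-averaged F1 computes the F1 score per class and then takes their unweighted mean, so frequent classes do not dominate the score. A quick scikit-learn sketch on toy labels:

```python
from sklearn.metrics import f1_score

y_true = [+1, -1, 0, +1, 0, -1]
y_pred = [+1, -1, 0, -1, 0, +1]

print(f1_score(y_true, y_pred, average="macro"))
```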
Lessons Learned
● Use appropriate tools for sequence modeling
● RNNs are slow. First train on a subset of the data
● Train deep models with lots of data points
● Find a way to automatically annotate data
● Use regularization (be generous)
Thank You For Your Attention