


  1. Attention for Machine Comprehension Made by : Rishab Goel Based on slides by: Alex Graves, Hien Quoc, Renjie Liao

  2. Highway Networks

  3. Benefits ...

  4. Benefits ...

  5. Importance ... For training very deep architectures: allows better information flow and better optimization. Intuition: does a linear transformation of the input suffice for learning language at a higher level of abstraction? (See http://colah.github.io/posts/2014-03-NN-Manifolds-Topology/)
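The gating idea behind highway networks can be sketched in a few lines of NumPy. This is a minimal sketch, not the authors' code: it assumes tanh for the transform H, a single layer, and illustrative weight names.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def highway_layer(x, W_h, b_h, W_t, b_t):
    """One highway layer: y = T * H(x) + (1 - T) * x.

    H is a nonlinear transform and T a learned "transform gate";
    the complementary (1 - T) "carry gate" lets the input flow
    through unchanged, which is what eases optimization of very
    deep stacks (better information flow).
    """
    H = np.tanh(x @ W_h + b_h)    # candidate transform of the input
    T = sigmoid(x @ W_t + b_t)    # transform gate in (0, 1)
    return T * H + (1.0 - T) * x  # gated mix of transform and carry
```

With a strongly negative gate bias the layer starts out close to the identity, which is the usual initialization trick for training deep highway stacks.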

  6. Hien Quoc Dang

  7. Idea of Maxout Hien Quoc Dang

  8. Intuitions Inspired by dropout; similar to bagging, but integrated as part of a single network Hien Quoc Dang

  9. Idea of Maxout ... Hien Quoc Dang

  10. Idea of Maxout ... Hien Quoc Dang

  11. Comparison to Rectifiers Hien Quoc Dang

  12. Why Maxout Work ? Hien Quoc Dang
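The maxout idea from the slides above can be sketched directly: each output unit takes the max over k affine "pieces", giving a learned convex piecewise-linear activation. A minimal NumPy sketch with illustrative shapes:

```python
import numpy as np

def maxout(x, W, b):
    """Maxout unit: max over k affine pieces per output unit.

    x is (n, d_in), W is (d_in, d_out, k), b is (d_out, k).
    With k >= 2 a maxout unit can represent ReLU, absolute value,
    or the identity, which is one reason it pairs well with dropout.
    """
    z = np.einsum('nd,dok->nok', x, W) + b  # evaluate all k pieces at once
    return z.max(axis=-1)                   # keep the best piece per unit
```

For comparison with rectifiers: setting the two pieces to x and 0 recovers ReLU exactly, while pieces x and -x give the absolute value.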

  13. Slides : Santi Pascual

  14. LSTMs ... Chris Olah’s blog

  15. Need for Attention A fixed-size embedding is not sufficient to encode information over long distances Helps the model attend to the important patch of the data Adds interpretability to the model
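The attention mechanism motivated above can be sketched as simple dot-product attention: instead of compressing a sequence into one embedding, the model forms a weighted sum over all positions, with weights showing which patch it attends to (this is what gives interpretability). A minimal sketch, not any specific paper's variant:

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)  # shift for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def attend(query, keys, values):
    """Dot-product attention over a sequence.

    query is (d,), keys is (T, d), values is (T, d_v).
    Each position is weighted by how well its key matches the
    query, so distant but relevant positions are not lost.
    """
    scores = keys @ query       # (T,) similarity of query to each position
    weights = softmax(scores)   # (T,) distribution over positions
    context = weights @ values  # weighted sum: the attended summary
    return context, weights
```

The returned weights can be plotted per example to inspect what the model focused on.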

  16. Attentive Reader

  17. DYNAMIC COATTENTION NETWORKS FOR QUESTION ANSWERING Authors : Caiming Xiong, Victor Zhong, Richard Socher

  18. Introduction Machine comprehension: no knowledge base required Until SQuAD, no large-scale natural dataset Cloze-style datasets like CNN/Daily Mail are synthetic or small

  19. About SQuAD Consists of questions on a set of Wikipedia articles (wh-type questions) The answer is a segment of text, or span Source: Rajpurkar et al.

  20. Model in a nutshell ... Socher et al

  21. Doc and Query Encoder Socher et al
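The coattention step on top of the document and query encoders can be sketched as follows. This is a rough NumPy sketch of the paper's equations under assumed shapes (D is (m, d) document encodings, Q is (n, d) question encodings; sentinel vectors and the question projection are omitted for brevity):

```python
import numpy as np

def _softmax(z, axis):
    z = z - z.max(axis=axis, keepdims=True)  # shift for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def coattention(D, Q):
    """Coattention: attend in both directions at once.

    The affinity matrix L scores every document/question word pair.
    Normalizing it along each axis gives question-to-document
    attention (A_Q) and document-to-question attention (A_D), which
    are combined into a question-aware document representation.
    """
    L = D @ Q.T                # (m, n) affinity of every doc/question word pair
    A_Q = _softmax(L, axis=0)  # per question word: attention over doc words
    A_D = _softmax(L, axis=1)  # per doc word: attention over question words
    C_Q = A_Q.T @ D            # (n, d) doc summaries attended by each question word
    C_D = A_D @ np.concatenate([Q, C_Q], axis=1)  # (m, 2d) coattention context
    return C_D
```

In the full model this context is concatenated with D and passed through a bidirectional LSTM to produce the encoding U that the decoder scores.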

  22. Liked ● Gagan Socher et al

  23. Dynamic Decoder (Liked: all) Socher et al

  24. Highway Maxout Network ... Socher et al
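A rough sketch of the Highway Maxout Network scoring function, combining the maxout and highway ideas from the earlier slides: two maxout layers, with a highway-style connection that feeds m1 alongside m2 into the output layer. Shapes and weight names are illustrative and pooling-size bookkeeping is simplified; this is a sketch of the paper's equations, not the authors' implementation.

```python
import numpy as np

def _maxout(x, W, b):
    """Max over p affine pieces; W is (d_in, d_out, p), b is (d_out, p)."""
    return (np.einsum('md,dop->mop', x, W) + b).max(axis=-1)

def hmn_score(h, u_s, u_e, U, W_d, W_1, b_1, W_2, b_2, W_3, b_3):
    """Score every document position given decoder state h and the
    current start/end word encodings u_s, u_e.

    U is (m, d): the coattention encoding of each document word.
    Returns an (m,) vector of scores over positions.
    """
    # r conditions the scorer on the decoder state and current span estimate
    r = np.tanh(np.concatenate([h, u_s, u_e]) @ W_d)
    m_in = np.concatenate([U, np.tile(r, (U.shape[0], 1))], axis=1)
    m1 = _maxout(m_in, W_1, b_1)                   # first maxout layer
    m2 = _maxout(m1, W_2, b_2)                     # second maxout layer
    cat = np.concatenate([m1, m2], axis=1)         # highway connection [m1; m2]
    return _maxout(cat, W_3, b_3).ravel()          # (m,) position scores
```

In the dynamic decoder, one HMN scores start positions and another scores end positions; the argmax of each gives the next span estimate, iterated until convergence.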

  25. Socher et al

  26. Socher et al

  27. Implementation (Disliked ● Gagan: pt. 3 ● Akshay: pt. 4, claim not proven) 1. CoreNLP for preprocessing 2. GloVe word vectors pretrained on the 840B-token Common Crawl corpus 3. OOV words set to 0 4. Sentinel vectors randomly initialized and optimized during training

  28. Iterative process visualisation ... Socher et al

  29. Socher et al

  30. Results (Disliked ● Haroun: ensemble gain too much) Socher et al

  31. Liked ● Barun ● Nupur Socher et al

  32. Performance across different types of questions (Liked ● Shantanu) Socher et al

  33. Ablation studies ... (Liked ● Prachi) Socher et al

  34. Predictions Socher et al

  35. Logistic Regression Prediction : Theatre Museum Socher et al

  36. Comments: Trouble decoding when multiple answers are intuitive Socher et al

  37. Cons Lacks error analysis; needs more ablation studies [Barun, Surag] System gives extractive answers, not abstractive ones [Nupur] Does not compare HMN and MN [all] Unintuitive decoder [Dinesh]

  38. Doubts ... Why does HMN work? What is the role of the sentinel vectors? Error propagation through the argmax function Maxout for LSTMs as well (not clear) Use multiple initialisations of the start and end pointers (how?)

  39. Extensions ... Use the approach on other datasets like CNN/Daily Mail and MS COCO QA [Barun] Use different attention, e.g. Match-LSTM [Barun] Bi-directional attention [Gagan] Apply the iterative idea to visual QA, classification, NER, SRL, etc. [Akshay, Surag] Find synonyms [Haroun]

  40. Extensions ... Combine char2vec and word2vec embeddings to represent the document and query

  41. Thanks!
