Breaking NLI Systems with Sentences that Require Simple Lexical Inferences


  1. Breaking NLI Systems with Sentences that Require Simple Lexical Inferences. Max Glockner (TU Darmstadt), Vered Shwartz and Yoav Goldberg (Bar-Ilan University). July 18, 2018

  2. SNLI [Bowman et al., 2015]: a large-scale dataset for NLI (Natural Language Inference; Recognizing Textual Entailment [Dagan et al., 2013]). Premises are image captions; hypotheses were generated by crowdsourcing workers:
     Premise: Street performer is doing his act for kids
     Hypotheses:
     1. A person performing for children on the street ⇒ ENTAILMENT
     2. A juggler entertaining a group of children on the street ⇒ NEUTRAL
     3. A magician performing for an audience in a nightclub ⇒ CONTRADICTION
     Labels follow an event co-reference assumption: premise and hypothesis are read as describing the same event, so a description of a different event counts as a contradiction.
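The three-way labeling scheme above can be sketched as a small data structure. This is a hypothetical minimal representation for illustration, not the actual SNLI release format:

```python
from dataclasses import dataclass

# The three SNLI labels.
LABELS = ("entailment", "neutral", "contradiction")

@dataclass
class NLIExample:
    premise: str
    hypothesis: str
    label: str

    def __post_init__(self):
        # SNLI uses exactly three labels; reject anything else.
        if self.label not in LABELS:
            raise ValueError(f"unknown label: {self.label}")

# The slide's example, as three premise/hypothesis pairs.
examples = [
    NLIExample("Street performer is doing his act for kids",
               "A person performing for children on the street", "entailment"),
    NLIExample("Street performer is doing his act for kids",
               "A juggler entertaining a group of children on the street", "neutral"),
    NLIExample("Street performer is doing his act for kids",
               "A magician performing for an audience in a nightclub", "contradiction"),
]
```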

  3. Neural NLI Models: end-to-end, either sentence-encoding or attention-based.
     [Architecture diagrams: premise and hypothesis each pass through an encoder (the attention-based variant adds attention between them), features are extracted, and a classifier outputs the label.]
     Lexical knowledge comes only from pre-trained word embeddings, as opposed to using resources like WordNet.
     SOTA exceeds human performance... [Gururangan et al., 2018, Poliak et al., 2018]: by learning “easy clues”
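The sentence-encoding pipeline on this slide (encode both sentences, extract features, classify) can be sketched roughly as follows. This is a pure-Python toy with random embeddings and an untrained linear classifier; the feature combination [u; v; |u − v|; u ⊙ v] is one common choice in the literature, not necessarily what every model discussed here uses:

```python
import random

random.seed(0)
DIM = 8
LABELS = ("entailment", "neutral", "contradiction")
# Toy "pre-trained" embeddings: the model's only source of lexical knowledge.
VOCAB = {w: [random.gauss(0, 1) for _ in range(DIM)]
         for w in "a man is holding saxophone guitar the".split()}

def encode(sentence):
    """Sentence encoder: average the word embeddings (bag of embeddings)."""
    vecs = [VOCAB[w] for w in sentence.lower().split() if w in VOCAB]
    return [sum(col) / len(vecs) for col in zip(*vecs)]

def extract_features(u, v):
    """Combine the two sentence vectors: [u; v; |u - v|; u * v]."""
    return (u + v
            + [abs(a - b) for a, b in zip(u, v)]
            + [a * b for a, b in zip(u, v)])

def classify(premise, hypothesis, weights):
    """Linear classifier over the extracted features -> predicted label."""
    f = extract_features(encode(premise), encode(hypothesis))
    scores = [sum(w * x for w, x in zip(row, f)) for row in weights]
    return LABELS[scores.index(max(scores))]

# Untrained (random) weights, just to show the data flow end to end.
W = [[random.gauss(0, 1) for _ in range(4 * DIM)] for _ in LABELS]
label = classify("a man is holding a saxophone", "a man is holding a guitar", W)
```

With trained weights this would be a (very weak) sentence-encoding NLI model; attention-based models instead compare the two sentences word by word before extracting features.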

  4. Do neural NLI models implicitly learn lexical semantic relations?

  5. New Test Set: we constructed a new test set to answer this question.
     Premise: sentences from the SNLI training set.
     Hypothesis: generated by replacing a single term w in the premise with a related term w′, where w′ is in the SNLI vocabulary and in the pre-trained embeddings.
     Labels collected by crowdsourcing (mostly contradictions!)
     Contradiction: The man is holding a saxophone → The man is holding an electric guitar
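The construction recipe above, replacing a single term w with a related term w′, can be sketched like this. The substitution table here is hand-written for illustration only; in the actual test set the related terms come from lexical resources and the gold labels are verified by crowdsourcing workers rather than assigned automatically:

```python
# Tiny hand-written lexical table: w -> (w', typical label for that relation).
# Illustrative only; the real test set crowdsources the labels.
SUBSTITUTIONS = {
    "saxophone": ("guitar", "contradiction"),  # co-hyponyms (both instruments)
    "kids": ("children", "entailment"),        # synonyms
}

def generate_examples(premise):
    """Yield (premise, hypothesis, label) by swapping one known word."""
    words = premise.split()
    for i, w in enumerate(words):
        if w in SUBSTITUTIONS:
            w2, label = SUBSTITUTIONS[w]
            hypothesis = " ".join(words[:i] + [w2] + words[i + 1:])
            yield premise, hypothesis, label

pairs = list(generate_examples("The man is holding a saxophone"))
more = list(generate_examples("Street performer is doing his act for kids"))
```

A model that has learned the relevant lexical relations should label the first pair a contradiction and the second an entailment; premise and hypothesis differ by exactly one word, so nothing else distinguishes them.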
