Relation Extraction II Luke Zettlemoyer CSE 517 Winter 2013 [with - PowerPoint PPT Presentation

Relation Extraction II Luke Zettlemoyer CSE 517 Winter 2013 [with slides adapted from many people, including Bill MacCartney, Raphael Hoffmann, Dan Jurafsky, Rion Snow, Jim Martin, Chris Manning, William Cohen, and others]

Supervised RE: summary • Supervised approach can achieve high accuracy o At least, for some relations o If we have lots of hand-labeled training data • But has significant limitations! o Labeling 5,000 relations (+ named entities) is expensive o Doesn’t generalize to different relations • Next: beyond supervised relation extraction o Distantly supervised relation extraction o Unsupervised relation extraction

Relation extraction: 5 easy methods 1. Hand-built patterns 2. Bootstrapping methods 3. Supervised methods 4. Distant supervision 5. Unsupervised methods

Extracting structured knowledge Each article can contain hundreds or thousands of items of knowledge “The Lawrence Livermore National Laboratory (LLNL) in Livermore, California is ascientific research laboratory founded by the University of California in 1952.” LLNL EQ Lawrence Livermore National Laboratory LLNL LOC-IN California Livermore LOC-IN California LLNL IS-A scientific research laboratory LLNL FOUNDED-BY University of California LLNL FOUNDED-IN 1952

Distant supervision Snow, Jurafsky, Ng. 2005. Learning syntactic patterns for automatic hypernym discovery. NIPS 17 Mintz, Bills, Snow, Jurafsky. 2009. Distant supervision for relation extraction without labeled data. ACL-2009. • Hypothesis: If two entities belong to a certain relation, any sentence containing those two entities is likely to express that relation • Key idea: use a database of relations to get lots of noisy training examples o instead of hand-creating seed tuples (bootstrapping) o instead of using hand-labeled corpus (supervised)

Benefits of distant supervision • Has advantages of supervised approach o leverage rich, reliable hand-created knowledge o relations have canonical names o can use rich features (e.g. syntactic features) • Has advantages of unsupervised approach o leverage unlimited amounts of text data o allows for very large number of weak features o not sensitive to training corpus: genre- independent

Hypernyms via distant supervision We construct a noisy training set consisting of occurrences from our corpus that contain a hyponym-hypernym pair from WordNet. This yields high-signal examples like: “...consider authors like Shakespeare...” “Some authors (including Shakespeare)...” “Shakespeare was the author of several...” “Shakespeare, author of The Tempest... ” slide adapted from Rion Snow

Hypernyms via distant supervision We construct a noisy training set consisting of occurrences from our corpus that contain a hyponym-hypernym pair from WordNet. This yields high-signal examples like: “...consider authors like Shakespeare...” “Some authors (including Shakespeare)...” “Shakespeare was the author of several...” “Shakespeare, author of The Tempest... ” But also noisy examples like: “The author of Shakespeare in Love ...” “...authors at the Shakespeare Festival...” slide adapted from Rion Snow

Learning hypernym patterns Key idea: work at corpus level (entity pairs), instead of sentence level! 1. Take corpus sentences ... doubly heavy hydrogen atom called deuterium ... 2. Collect noun pairs e.g. (atom, deuterium) 752,311 pairs from 6M sentences of newswire 3. Is pair an IS-A in WordNet? 14,387 yes; 737,924 no 4. Parse the sentences 5. Extract patterns 69,592 dependency paths with >5 pairs 6. Train classifier on patterns logistic regression with 70K features (converted to 974,288 bucketed binary features) slide adapted from Rion Snow

One of 70,000 patterns Pattern: <superordinate> called <subordinate> Learned from cases such as: (sarcoma, cancer) … an uncommon bone cancer called osteogenic sarcoma and to … (deuterium, atom) … heavy water rich in the doubly heavy hydrogen atom called deuterium. New pairs discovered: (efflorescence, condition) … and a condition called efflorescence are other reasons for … (O’neal_inc, company) … The company, now called O'Neal Inc., was sole distributor of … (hat_creek_outfit, ranch) … run a small ranch called the Hat Creek Outfit. (hiv-1, aids_virus) … infected by the AIDS virus, called HIV-1. (bateau_mouche, attraction) … local sightseeing attraction called the Bateau Mouche...

Syntactic dependency paths Patterns are based on paths through dependency parses generated by MINIPAR (Lin, 1998) Example word pair: (Shakespeare, author) Example sentence: “Shakespeare was the author of several plays...” Minipar parse: Extract shortest path: -N:s:VBE, be, VBE:pred:N slide adapted from Rion Snow

Hearst patterns to dependency paths Hearst Pattern MINIPAR Representation Y such as X … -N:pcomp-n:Prep,such_as,such_as,-Prep:mod:N Such Y as X … -N:pcomp-n:Prep,as,as,-Prep:mod:N,(such,PreDet:pre:N)} X … and other Y (and,U:punc:N),N:conj:N, (other,A:mod:N) slide adapted from Rion Snow

P/R of hypernym extraction patterns slide adapted from Rion Snow 14

P/R of hypernym classifier logistic regression 10-fold Cross Validation on 14,000 WordNet-Labeled Pairs slide adapted from Rion Snow 18

P/R of hypernym classifier F-score logistic regression 10-fold Cross Validation on 14,000 WordNet-Labeled Pairs slide adapted from Rion Snow 19

What about other relations? Mintz, Bills, Snow, Jurafsky (2009). Distant supervision for relation extraction without labeled data. Training set Corpus 102 relations 1.8 million articles 940,000 entities 25.7 million sentences 1.8 million instances slide adapted from Rion Snow

Frequent Freebase relations

Collecting training data Corpus text Training data Bill Gates founded Microsoft in 1975. Bill Gates, founder of Microsoft, … Bill Gates attended Harvard from … Google was founded by Larry Page … Freebase Founder: (Bill Gates, Microsoft) Founder: (Larry Page, Google) CollegeAttended: (Bill Gates, Harvard)

Collecting training data Corpus text Training data (Bill Gates, Microsoft) Bill Gates founded Microsoft in 1975. Label: Founder Bill Gates, founder of Microsoft, … Feature: X founded Y Bill Gates attended Harvard from … Google was founded by Larry Page … Freebase Founder: (Bill Gates, Microsoft) Founder: (Larry Page, Google) CollegeAttended: (Bill Gates, Harvard)

Collecting training data Corpus text Training data (Bill Gates, Microsoft) Bill Gates founded Microsoft in 1975. Label: Founder Bill Gates, founder of Microsoft, … Feature: X founded Y Bill Gates attended Harvard from … Feature: X, founder of Y Google was founded by Larry Page … Freebase Founder: (Bill Gates, Microsoft) Founder: (Larry Page, Google) CollegeAttended: (Bill Gates, Harvard)

Collecting training data Corpus text Training data (Bill Gates, Microsoft) Bill Gates founded Microsoft in 1975. Label: Founder Bill Gates, founder of Microsoft, … Feature: X founded Y Bill Gates attended Harvard from … Feature: X, founder of Y Google was founded by Larry Page … (Bill Gates, Harvard) Label: CollegeAttended Feature: X attended Y Freebase Founder: (Bill Gates, Microsoft) Founder: (Larry Page, Google) CollegeAttended: (Bill Gates, Harvard)

Collecting training data Corpus text Training data (Bill Gates, Microsoft) Bill Gates founded Microsoft in 1975. Label: Founder Bill Gates, founder of Microsoft, … Feature: X founded Y Bill Gates attended Harvard from … Feature: X, founder of Y Google was founded by Larry Page … (Bill Gates, Harvard) Label: CollegeAttended Feature: X attended Y Freebase Founder: (Bill Gates, Microsoft) (Larry Page, Google) Founder: (Larry Page, Google) Label: Founder CollegeAttended: (Bill Gates, Harvard) Feature: Y was founded by X

Negative training data Can’t train a classifier with only positive data! Training data Need negative training data too! (Larry Page, Microsoft) Label: NO_RELATION Solution? Feature: X took a swipe at Y Sample 1% of unrelated pairs of entities. (Larry Page, Harvard) Label: NO_RELATION Feature: Y invited X Corpus text Larry Page took a swipe at Microsoft... (Bill Gates, Google) ...after Harvard invited Larry Page to... Label: NO_RELATION Feature: Y is X's worst fear Google is Bill Gates' worst fear ...

Preparing test data Test data Corpus text Henry Ford founded Ford Motor Co. in … Ford Motor Co. was founded by Henry Ford … Steve Jobs attended Reed College from …

Preparing test data Test data (Henry Ford, Ford Motor Co.) Corpus text Label: ??? Feature: X founded Y Henry Ford founded Ford Motor Co. in … Ford Motor Co. was founded by Henry Ford … Steve Jobs attended Reed College from …

Preparing test data Test data (Henry Ford, Ford Motor Co.) Corpus text Label: ??? Feature: X founded Y Feature: Y was founded by X Henry Ford founded Ford Motor Co. in … Ford Motor Co. was founded by Henry Ford … Steve Jobs attended Reed College from …

Relation Extraction II Luke Zettlemoyer CSE 517 Winter 2013 [with - PowerPoint PPT Presentation

Relation Extraction II Luke Zettlemoyer CSE 517 Winter 2013 [with slides adapted from many people, including Bill MacCartney, Raphael Hoffmann, Dan Jurafsky, Rion Snow, Jim Martin, Chris Manning, William Cohen, and others] Supervised RE:

Mohamed Thahir Traditional and Open Relation Extraction Read the Web Relation Extraction

Relation Extraction CSCI 699 Instructor: Xiang Ren USC Computer Science Relation extraction

? (entity type) Apr 23, 2007 NAACL-HLT 2 1 What Is Relation Extraction? hundreds of

CORE: Context-Aware Open Relation Extraction with Factorization Machines Fabio Petroni Luciano

Relation Extraction Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 February 23, 2017

IN4080 2020 FALL NATURAL LANGUAGE PROCESSING Jan Tore Lnning 2 IE: Relation extraction,

Zero-Shot Relation Extraction via Reading Comprehension Omer Levy Minjoon Seo Eunsol Choi Luke

Language Technology I 2011/2012 Relation Extraction Exercises 1. What are the motivations of

An Integrated Approach for Large-scale Relation Extraction from the Web Naimdjon Takhirov, Fabien

Portuguese Relation Extraction in the Organization Domain Sandra

Lecture 24: Relation Extraction Kai-Wei Chang CS @ University of Virginia kw@kwchang.net Couse

Understanding Social Tags: Relation Extraction and Tag Annotation Presentation at NLP@UoL, Mar

Neural Distant Superv rvision for Relation Ext xtraction Deepanshu Jindal Elements and Images

A Study of Hybrid Similarity Measures for Semantic Relation Extraction Alexander Panchenko and

Relation Extraction Luke Zettlemoyer CSE 517 Winter 2013 [with slides adapted from many people,

APCNN : Tackling Class Imbalance in Relation Extraction through Aggregated Piecewise Convolutional

Relation Extraction Bill MacCartney CS224U 14-16 April 2014 [with slides adapted from many

Towards an Efficient Combination of Similarity Measures for Semantic Relation Extraction

Unsupervised Relation Extraction from Web -Bhavishya Mittal (11198) - Vempati Anurag Sai

A SURVEY ON RELATION EXTRACTION Nguyen Bach & Sameer Badaskar Language Technologies Institute

REET Joint Relation Extraction and Entity Typing via Multi-task Learning ADVISOR: JIA-LING, KOH

Exploring the Efficiency of Batch Active Learning for Human-in-the-Loop Relation Extraction

Introduction The automatic detection and extraction of Semantic Relations is a crucial step to

RExtractor Entity Relation Extraction from Unstructured Texts Intelligent library (INTLIB,

Relation Extraction II Luke Zettlemoyer CSE 517 Winter 2013 [with - PowerPoint PPT Presentation

Relation Extraction II Luke Zettlemoyer CSE 517 Winter 2013 [with slides adapted from many people, including Bill MacCartney, Raphael Hoffmann, Dan Jurafsky, Rion Snow, Jim Martin, Chris Manning, William Cohen, and others] Supervised RE:

Mohamed Thahir Traditional and Open Relation Extraction Read the Web Relation Extraction

Relation Extraction CSCI 699 Instructor: Xiang Ren USC Computer Science Relation extraction

? (entity type) Apr 23, 2007 NAACL-HLT 2 1 What Is Relation Extraction? hundreds of

CORE: Context-Aware Open Relation Extraction with Factorization Machines Fabio Petroni Luciano

Relation Extraction Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 February 23, 2017

IN4080 2020 FALL NATURAL LANGUAGE PROCESSING Jan Tore Lnning 2 IE: Relation extraction,

Zero-Shot Relation Extraction via Reading Comprehension Omer Levy Minjoon Seo Eunsol Choi Luke

Language Technology I 2011/2012 Relation Extraction Exercises 1. What are the motivations of

An Integrated Approach for Large-scale Relation Extraction from the Web Naimdjon Takhirov, Fabien

Portuguese Relation Extraction in the Organization Domain Sandra

Lecture 24: Relation Extraction Kai-Wei Chang CS @ University of Virginia kw@kwchang.net Couse

Understanding Social Tags: Relation Extraction and Tag Annotation Presentation at NLP@UoL, Mar

Neural Distant Superv rvision for Relation Ext xtraction Deepanshu Jindal Elements and Images

A Study of Hybrid Similarity Measures for Semantic Relation Extraction Alexander Panchenko and

Relation Extraction Luke Zettlemoyer CSE 517 Winter 2013 [with slides adapted from many people,

APCNN : Tackling Class Imbalance in Relation Extraction through Aggregated Piecewise Convolutional

Relation Extraction Bill MacCartney CS224U 14-16 April 2014 [with slides adapted from many

Towards an Efficient Combination of Similarity Measures for Semantic Relation Extraction

Unsupervised Relation Extraction from Web -Bhavishya Mittal (11198) - Vempati Anurag Sai

A SURVEY ON RELATION EXTRACTION Nguyen Bach &amp; Sameer Badaskar Language Technologies Institute

REET Joint Relation Extraction and Entity Typing via Multi-task Learning ADVISOR: JIA-LING, KOH

Exploring the Efficiency of Batch Active Learning for Human-in-the-Loop Relation Extraction

Introduction The automatic detection and extraction of Semantic Relations is a crucial step to

RExtractor Entity Relation Extraction from Unstructured Texts Intelligent library (INTLIB,

A SURVEY ON RELATION EXTRACTION Nguyen Bach & Sameer Badaskar Language Technologies Institute