Event Detection and Coreference
TAC KBP 2015
Sean Monahan, Michael Mohler, Marc Tomlinson, Amy Book, Mary Brunson, Maxim Gorelkin, Kevin Crosby
Overview
• Event Detection (Task 1)
  – What worked and what didn't
  – Lexical Knowledge
  – Annotation Ideas
• Event Hoppers (Task 2 / 3)
Event Detection – Problem Description
• Find the text which indicates the event
  – Trigger: "Find the smallest extent of text (usually a word or short phrase) that expresses the occurrence of an event"
  – Nugget: find the maximal extent of a textual event indicator
• Event Types
  – 38 different event types (subtypes), each with a different definition and different requirements
  – Highly varying performance per type
• Difficult Cases
  – Unclear context – "The politician attacked his rivals"
  – Unclear event – "There's murder in his blood"
Event Detection – All Strategies
• We experimented with a lot of different strategies
[Diagram of candidate strategies: Semantic Patterns, Cicero Custom, Lexicon (Word, Word+POS, Lemma, Lemma+POS), Doc2Vec, WSD, Active Learning, Unknowns, Trigger Data, Trigger ML, Voting]
Event Detection – Working Strategies
• Many of the strategies didn't work
[The same strategy diagram, marking which strategies survived: Semantic Patterns, Cicero Custom, Lexicon (Word, Word+POS, Lemma, Lemma+POS), Doc2Vec, WSD, Active Learning, Unknowns, Trigger Data, Trigger ML, Voting]
Event Detection – Lexicon Strategy
• Build a lexicon from training sources for nuggets
  – C_P(word): count of the times the word/phrase occurs as a positive example
  – C_T(word): count of the times the word/phrase occurs as a string (total occurrences)
  – Lexicon_score(word) = C_P(word) / C_T(word)
• Also experimented with
  – Lexicon_score_lemma: attack, attacks, attackers
  – Lexicon_score_pos: attack#n, attack#v
  – Lexicon_score_lemma_pos: attacked, attacking -> attack#v; attackers, the attack -> attack#n
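A minimal sketch of the lexicon computation described above, assuming a simple (token, label) layout for the training data; this is illustrative, not the authors' code:

```python
from collections import Counter

def build_lexicon(training_tokens):
    """Build a nugget lexicon from (token, is_positive) training pairs.

    Returns a dict mapping each token to C_P(token) / C_T(token): the
    fraction of its training occurrences annotated as an event nugget.
    """
    c_p = Counter()  # occurrences as a positive example
    c_t = Counter()  # all occurrences of the string
    for token, is_positive in training_tokens:
        c_t[token] += 1
        if is_positive:
            c_p[token] += 1
    return {tok: c_p[tok] / c_t[tok] for tok in c_t}

# "attack" is a nugget in 2 of its 3 occurrences -> score 0.67
lexicon = build_lexicon([("attack", True), ("attack", True),
                         ("attack", False), ("sentence", False)])
print(lexicon["attack"])  # 0.666...
```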
Event Detection – Lexical Priors
[Histogram: number of observed examples (negative vs. positive) binned by percent observed correct, from 1% to 96%. Observations:]
• Lexicon entries with a score of 0 or no score are not shown
• Unseen in train: 931 correct / 5,475 occurrences (14% accuracy)
• 0 correct in train: 955 / 146,918 occurrences (0.6% accurate)
• 100% accuracy occurs a lot, mostly 1/1 or 2/2, and is less accurate on test than neighboring bins
• 50% accuracy occurs a lot, mostly 1/2 or 2/4
• 33% accuracy occurs a lot, mostly 1/3
• Why does 8% occur so often…?
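A sketch of how such a prior histogram could be assembled from the lexicon counts; the {token: (C_P, C_T)} layout is an assumption for illustration:

```python
from collections import Counter

def prior_histogram(lexicon_counts):
    """Bin lexicon entries by observed accuracy (C_P / C_T), tallying the
    positive and negative examples that fall into each percent bin."""
    pos_by_bin, neg_by_bin = Counter(), Counter()
    for c_p, c_t in lexicon_counts.values():
        if c_t == 0:
            continue
        bin_pct = round(100 * c_p / c_t)
        pos_by_bin[bin_pct] += c_p
        neg_by_bin[bin_pct] += c_t - c_p
    return pos_by_bin, neg_by_bin

# 1-of-1 entries pile up in the 100% bin, 1-of-3 entries at 33%, and a
# word that is a nugget once in 12 occurrences lands near the 8% spike.
pos, neg = prior_histogram({"attack": (2, 3), "said": (1, 12)})
```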
Event Detection – Selecting Threshold
[Plot of F-measure against the lexicon-score threshold]
• The F-measure plateau is maximized around a threshold of 0.3
• The lexicon-only strategy achieves around 56% on mention_type
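A minimal sketch of the threshold sweep implied by these slides, assuming (token, is_event) dev pairs and a lexicon like the one above; illustrative only:

```python
def best_threshold(dev_items, lexicon, step=0.05):
    """Return (f_measure, threshold) maximizing F on held-out dev data."""
    best = (0.0, 0.0)
    threshold = 0.0
    while threshold <= 1.0:
        tp = fp = fn = 0
        for token, is_event in dev_items:
            predicted = lexicon.get(token, 0.0) >= threshold
            if predicted and is_event:
                tp += 1
            elif predicted:
                fp += 1
            elif is_event:
                fn += 1
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f = (2 * precision * recall / (precision + recall)
             if precision + recall else 0.0)
        best = max(best, (f, threshold))
        threshold += step
    return best  # on this task the plateau peaked near 0.3
```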
Event Detection – High Precision Types
[Plot of recall, precision, F-measure, and a precision trendline against the lexicon threshold]
• Maximum F-measure achieved at a low lexicon threshold
Event Detection – Medium Precision Types
[Plot of recall, precision, F-measure, and a precision trendline against the lexicon threshold]
• Maximum F-measure achieved at a higher lexicon threshold
Event Detection – Low Precision Types
[Plot of recall, precision, F-measure, and a precision trendline against the lexicon threshold]
• Maximum F-measure achieved… somewhere ???
• There's that 8% again
Event Detection – Context Modelling
• Example: the Justice.Sentence event type
  – "John wrote a sentence about life." / "The sentence had 17 words." (negative contexts)
  – "John was given a life sentence." (positive context)
  – "Peter's life sentence was almost over." (new mention to classify)
• Each context gets a vector representation (Doc2Vec; Le and Mikolov, 2014)
• A density function estimated over the negative contexts drives the positive/negative contextual classification
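The pipeline could be sketched as below; the slide names Doc2Vec and a density estimate over negatives, but the toy data, kernel-density choice, and threshold here are illustrative assumptions rather than the authors' configuration:

```python
import numpy as np
from gensim.models.doc2vec import Doc2Vec, TaggedDocument
from sklearn.neighbors import KernelDensity

# Toy contexts for the Justice.Sentence type, mirroring the slide.
negatives = ["john wrote a sentence about life", "the sentence had 17 words"]
positives = ["john was given a life sentence"]

docs = [TaggedDocument(s.split(), [i]) for i, s in enumerate(negatives + positives)]
model = Doc2Vec(docs, vector_size=20, min_count=1, epochs=100)

# Estimate a density over the negative contexts only.
neg_vecs = np.array([model.infer_vector(s.split()) for s in negatives])
kde = KernelDensity(bandwidth=1.0).fit(neg_vecs)

# A new mention whose context is unlikely under the negative density
# is classified as a positive (true event) context.
test_vec = model.infer_vector("peter 's life sentence was almost over".split())
log_density = kde.score_samples(test_vec.reshape(1, -1))[0]
print("positive" if log_density < -30.0 else "negative")  # threshold is illustrative
```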
Event Detection – Winning Strategies
• Pick the best combination of strategies for each event type (see the selection sketch below)
  – Watch out for Micro- vs. Macro-F-measure
  – In order to optimize Micro-F, we use the No-op strategy for some types
[Strategy diagram repeated with No-op added: Semantic Patterns, Cicero Custom, Lexicon (Word, Word+POS, Lemma, Lemma+POS), Doc2Vec, WSD, No-op, Active Learning, Unknowns, Trigger Data, Trigger ML, Voting]
• End-Org, Manufacture.Artifact, and Transaction.Transaction occur too rarely to model
• Contact.Contact and Contact.Broadcast are too noisy to output at all
  – "said" occurs ~8% of the time as Contact, ~8% as Broadcast, and 84% as no event
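A minimal sketch of per-type strategy selection, assuming a dev-set F-measure per (event type, strategy); the floor value and data layout are illustrative:

```python
def pick_strategies(dev_f, floor=0.1):
    """Choose, for each event type, the strategy with the best dev F-measure.

    Types whose best strategy still scores below `floor` get "no-op"
    (emit nothing): for rare or hopelessly noisy types such as
    Contact.Contact, staying silent maximizes micro-F.
    """
    chosen = {}
    for event_type, by_strategy in dev_f.items():
        name, f = max(by_strategy.items(), key=lambda kv: kv[1])
        chosen[event_type] = name if f >= floor else "no-op"
    return chosen

scores = {
    "Conflict.Attack": {"lexicon": 0.62, "doc2vec": 0.55},
    "Contact.Contact": {"lexicon": 0.04, "doc2vec": 0.02},  # too noisy
}
print(pick_strategies(scores))
# {'Conflict.Attack': 'lexicon', 'Contact.Contact': 'no-op'}
```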
Event Detection – Evaluation

Task 1 test   Event (mention_type)        +realis_status
              P      R      F             P      R      F
LCC1          66.86  53.31  59.32         49.80  39.71  44.18
LCC2          73.95  45.61  57.18         49.22  31.02  38.06

Eval          Event (mention_type)        +realis_status
              P      R      F             P      R      F
Rank1                       58.41                       44.24
LCC1          72.92  45.91  56.35         48.92  30.81  37.81
Median                      48.79                       34.78
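As a sanity check on the table, F is the harmonic mean of P and R; the LCC1 test row can be reproduced directly:

```python
def f1(p, r):
    """Harmonic mean of precision and recall."""
    return 2 * p * r / (p + r) if p + r else 0.0

print(round(f1(66.86, 53.31), 2))  # 59.32, matching the LCC1 test row
```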
Event Detection – Challenge
• Data is one-dimensional
  – This text is a trigger for this event type
• Problem is multi-dimensional
  1. Does this meet the minimum threshold to be considered an "event"?
  2. Is this text describing the appropriate event type?
• Could access to extra annotation data provide a solution?
Event Detection – Eventiveness
Examples ranked from HIGH to LOW eventiveness:
1. The man bombed the building.
2. The comedian bombed on stage last night.
3. The bomber destroyed the building.
4. The FBI discovered the man had planned to build a bomb.
5. The agent is an expert in bomb disposal.
6. The B-52 bomber took off.
7. He is wearing a bomber jacket.
Event Detection – Word Sense Appropriateness
Examples ranked from HIGH to LOW word sense appropriateness:
1. The man bombed the building.
2. The bomber destroyed the building.
3. The FBI discovered the man had planned to build a bomb.
4. The agent is an expert in bomb disposal.
5. The B-52 bomber took off.
6. He is wearing a bomber jacket.
7. The comedian bombed on stage last night.
Event Detection – Multi-Dimensional
[Scatter plot placing the examples along two axes: eventiveness (vertical) and word sense appropriateness (horizontal)]
• High on both axes: "man bombed", "bomber destroyed", "planned to build a bomb"
• High eventiveness, low appropriateness: "comedian bombed"
• Low eventiveness, high appropriateness: "expert in bomb disposal"
• Low on both axes: "B-52 bomber", "Alan Turing's bombe", "bomber jacket"
Event Detection – Detailed Annotations
1. One-dimensional outcome: Positive / Negative
2. Two-dimensional outcome: Negative + Not Eventive, or Negative + Not Relevant
3. Three-dimensional outcome:
   – B-52 bomber: Negative, Not Eventive, Function
   – Abusive husband: Negative, Not Eventive, Descriptor
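One way such richer labels could be represented; the schema and field names below are hypothetical, sketching the three-dimensional outcome described above:

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional

class Polarity(Enum):
    POSITIVE = "positive"
    NEGATIVE = "negative"

class NegativeReason(Enum):
    NOT_EVENTIVE = "not eventive"
    NOT_RELEVANT = "not relevant"

class NonEventiveRole(Enum):
    FUNCTION = "function"      # e.g. "B-52 bomber"
    DESCRIPTOR = "descriptor"  # e.g. "abusive husband"

@dataclass
class TriggerAnnotation:
    """A hypothetical schema for the proposed multi-dimensional annotation."""
    text: str
    polarity: Polarity
    reason: Optional[NegativeReason] = None  # only for negatives
    role: Optional[NonEventiveRole] = None   # only for non-eventive negatives

ann = TriggerAnnotation("B-52 bomber", Polarity.NEGATIVE,
                        NegativeReason.NOT_EVENTIVE, NonEventiveRole.FUNCTION)
```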
Overview
• Event Detection (Task 1)
• Event Hoppers (Task 2 / 3)
  – Compatibility Modules
  – Hopperator
  – Scores on Diagnostic vs. System events
Event Hoppers – Description
• Event hoppers consist of event mentions that refer to the same event occurrence.
• For this purpose, we define a more inclusive, less strict notion of event coreference as compared to ACE and Light ERE.
• Event hoppers contain mentions of events that "feel" coreferential to the annotator.
• Event mentions with the following features go into the same hopper:
  – They have the same event type and subtype (with exceptions for Contact.Contact and Transaction.Transaction)
  – They have the same temporal and location scope
• The following do not represent an incompatibility between two events:
  – Trigger specificity can be different (assaulting 32 people vs. wielded a knife)
  – Event arguments may be non-coreferential or conflicting (18 killed vs. dozens killed)
  – Realis status may be different (will travel [OTHER] to Europe next week vs. is on a 5-day trip [ACTUAL])
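A minimal sketch of a hopper-compatibility test under these rules; the mention fields and the handling of the Contact.Contact / Transaction.Transaction exceptions are assumptions, not the actual system:

```python
def compatible(m1, m2):
    """Decide whether two event mentions may share a hopper.

    Requires matching type/subtype and matching temporal/location scope;
    realis status and argument conflicts are deliberately ignored, per the
    hopper guidelines. Mentions are dicts with event_type, subtype,
    time_scope, and location_scope keys (an assumed data model).
    """
    # Catch-all subtypes relax the subtype match within their type.
    catch_all = {("Contact", "Contact"), ("Transaction", "Transaction")}
    if m1["event_type"] != m2["event_type"]:
        return False
    subtype_ok = (m1["subtype"] == m2["subtype"]
                  or (m1["event_type"], m1["subtype"]) in catch_all
                  or (m2["event_type"], m2["subtype"]) in catch_all)
    scope_ok = (m1["time_scope"] == m2["time_scope"]
                and m1["location_scope"] == m2["location_scope"])
    return subtype_ok and scope_ok
```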