TAC KBP 2016 Linguistic Resources: Event Arguments (EA), Event - PowerPoint PPT Presentation

TAC KBP 2016 Linguistic Resources: Event Arguments (EA), Event Nuggets (EN) and Belief/Sentiment (BeSt) Joe Ellis (presenter), Jennifer Tracey (presenter), Jeremy Getman, Zhiyi Song, Ann Bies, Stephanie Strassel Linguistic Data Consortium University of Pennsylvania

Introduction and Overview  Linguistic resources for TAC KBP 2016  Eighth year LDC produced KBP resources  Twenty-nine new data sets  Two primary goals  Increase coordination across tracks  Increase multi-lingual evaluation tracks  Yesterday  Doc selection, ED&L, and Cold Start  Today  Event Arguments, Event Nuggets, and Belief/Sentiment (BeSt) TAC KBP 2016 Evaluation Workshop - NIST, November 14-15

Entities, Relations, & Events (ERE)  Entities, Relations, and Events (ERE)  Ongoing annotation task developed by LDC for DARPA’s Deep Exploration and Filtering of Text program (DEFT)  Exhaustive labeling of entities, relations and events and their attributes.  ERE annotation performed as upstream task  Provided inputs for multiple downstream tasks supporting ED&L, EA, EN, and BeSt  Primary means of meeting increased coordination of data goal TAC KBP 2016 Evaluation Workshop - NIST, November 14-15

ERE Annotation Entity Event Type.Subtype Realis Trigger Arguments The Bo Xilai event was ignited NAM Filler (Rich ERE) Bo Xilai, Bo Xilai H1 Movement. Actual running Lijun PER PRO his Relation SPC IND Time Transport 2012-XX-XX US by Lijun running into the US Person PER NAM Crime bribery and corruption Lijun SPC IND R1 Physical.Located Lijun consulate NAM Bogu Kailai consulate in 2012 to bring Bogu US consulate into 2012 PER NOM his wife SPC IND his R2 Personal.Family H2 Conflict. Actual killing Bogu Kailai Kailai’s killing to light. Will Bo SPC IND GPE NAM US wife Attack his wife Xilai end up in jail due to SPC IND LOC NOM H3 Justice. US consulate Other jail Bo Xilai Jail Non bribery and bribery and corruption; what will corruption LOC NOM jail SPC his wife end up with? H4 Life.Die Actual killing Bogu Kailai H5 Movement. Other end up Bo Xilai Transport jail Person TAC KBP 2016 Evaluation Workshop - NIST, November 14-15

ERE Event Types  ERE event type inventory reduced for 2016  8 types and 18 subtypes in 2016 (listed below)  9 types and 38 subtypes in 2015 (listed in overview paper)  Most of the dropped event types and subtypes are scarce in existing data Conflict.Attack Manufacture.Artifact Justice.ArrestJail Conflict.Demonstrate Movement.TransportArtifact Life.Die Contact.Broadcast Movement.TransportPerson Life.Injure Contact.Contact Personnel.Elect Transaction.Transaction Contact.Correspondence Personnel.EndPosition Transaction.TransferMoney Contact.Meet Personnel.StartPosition Transaction.TransferOwnership TAC KBP 2016 Evaluation Workshop - NIST, November 14-15

ERE annotation counts 4000 3500 3000 2500 2000 1500 1000 500 0 NW DF NW DF NW DF Chinese Chinese English English Spanish Spanish Entities Fillers Relations Event Mentions Event Hoppers TAC KBP 2016 Evaluation Workshop - NIST, November 14-15

ERE event mentions 700 600 500 400 300 200 100 0 CMN ENG SPA TAC KBP 2016 Evaluation Workshop - NIST, November 14-15

Event Argument (EA) Overview  Given new approach to evaluating EA in 2016, data development procedure overhauled  2014-2015  Manual run  Argument-level assessment  2016  Gold Standard  Event-level cross-document task • Queries, manual run, assessment TAC KBP 2016 Evaluation Workshop - NIST, November 14-15

EA Gold Standard  ERE annotations on core source corpus  Augmentation pass  BBN script run over ERE data  Annotators review results  Inferred arguments  Locational containment  Not annotated in ERE  Baghdad as Place of Conflict.Attack  Iraq added as 2 nd Place TAC KBP 2016 Evaluation Workshop - NIST, November 14-15

EA Cross-Document Queries & Manual Run  Query selection  51 simple, low-granularity queries  Event arguments in EAL Gold Standard  Annotators reviewed over 1300 potential queries  Manual Run  Exhaustive across full 30K English source corpus  Justification strings indicating presence of event hopper in doc Personnel.End-Position Person – Thabo Mbeki TAC KBP 2016 Evaluation Workshop - NIST, November 14-15

EA Cross-document Assessment and Results  Does justification prove presence of query in document?  Correct: Response contains query event  Event Type Match: Contains event of same type as query, but not query event  Wrong: Doesn’t contain query event or event of same type  Low system recall on manually selected queries  BBN produced 249 “derived” queries based on system responses  No LDC manual run for these queries TAC KBP 2016 Evaluation Workshop - NIST, November 14-15

EA Cross-Document Results 7000 700 6000 600 5000 500 4000 400 300 3000 2000 200 1000 100 0 0 Systems LDC Systems CORRECT ET_MATCH WRONG CORRECT ET_MATCH WRONG Manual Queries Derived Queries TAC KBP 2016 Evaluation Workshop - NIST, November 14-15

Event Nuggets and Linking (ENL)  No separate ENL annotation task  Data are entirely produced by running a script over ERE data to extract and reformat a subset for use by ENL 3000 2500 2000 1500 1000 500 0 NW DF NW DF NW DF CMN CMN ENG ENG SPA SPA Event Nuggets Event Hoppers TAC KBP 2016 Evaluation Workshop - NIST, November 14-15

Belief and Sentiment Annotation  Only ERE entities are holders of belief and sentiment  Only ERE entities, relations and events are targets of belief and sentiment  For events only, belief marked for each argument as well as the event itself  Belief values: committed, non-committed, reported, n/a  Polarity also marked  Sentiment values (polarity): positive, negative  Sarcasm flag indicated when polarity annotated is opposite of literal meaning (based on context) TAC KBP Workshop, November 14-15, 2016

Belief Annotation Example Ominous new action by U KRAINE ’ S SECURITY FORCES on Monday, including a raid on AN OPPOSITION PARTY ’ S HEADQUARTERS , appeared to diminish prospects for talks between THE GOVERNMENT and Event: raid PROTEST LEADERS , as W ESTERN OFFICIALS grasped for CB a way to defuse THE COUNTRY ’s intensifying political Event: talks crisis. Arguments also NA Relation: Ukraine’s CB Arguments also security forces NA CB TAC KBP Workshop, November 14-15, 2016

BeSt Data Overview Language Belief annotations Sentiment annotations Training Evaluation Training Evaluation # Ann #/Doc # Ann #/Doc # Ann #/Doc # Ann #/Doc Chinese 13,192 66 12,163 76 27,982 140 18,982 118 English 18,915 77 21,188 128 38,664 157 25,358 154 Spanish 9,406 99 12,546 75 14,299 151 17,353 103  English evaluation data notably more dense in belief annotations than training data  Spanish evaluation data less dense than training data in both belief and sentiment TAC KBP Workshop, November 14-15, 2016

Comparison of Training and Eval Data - Belief 16000 14000 12000 10000 8000 6000 4000 2000 0 Committed Non-Committed Reported N/A Committed Non-Committed Reported N/A Committed Non-Committed Reported N/A CMN CMN CMN CMN ENG ENG ENG ENG SPA SPA SPA SPA Training Evaluation TAC KBP Workshop, November 14-15, 2016

Comparison of Training and Eval Data - Sentiment 35000 30000 25000 20000 15000 10000 5000 0 Positive Negative None Positive Negative None Positive Negative None CMN CMN CMN ENG ENG ENG SPA SPA SPA Training Evaluation DEFT PI Meeting, May 28-29, 2015 – Boulder, CO

TAC KBP 2016 Linguistic Resources: Event Arguments (EA), Event - PowerPoint PPT Presentation

TAC KBP 2016 Linguistic Resources: Event Arguments (EA), Event Nuggets (EN) and Belief/Sentiment (BeSt) Joe Ellis (presenter), Jennifer Tracey (presenter), Jeremy Getman, Zhiyi Song, Ann Bies, Stephanie Strassel Linguistic Data Consortium

Overview of Event Nugget Track TAC KBP 2016 Teruko Mitamura Zhengzhong Liu Eduard Hovy

Events Detection, Coreference and Sequencing: Whats next? Overview of TAC KBP 2017 Event

KBP 2017 Cold Start KB Construction and Slot Filling Hoa Dang Shahzad Rajput U.S. National

Overview of 2015 TAC KBP Event Nugget Tasks Teruko Mitamura Zhengzhong Liu Eduard Hovy

Event Detection and Coreference TAC KBP 2015 Sean Monahan, Michael Mohler, Marc Tomlinson Amy

Command Line Arguments ECE2893 Lecture 20 ECE2893 Command Line Arguments Spring 2011 1 / 5

Joe Ellis (presenter), Jeremy Getman, Zhiyi Song, Ann Bies, Stephanie Strassel Linguistic Data

Text Analysis Conference TAC 2016 Sponsored by: Hoa Trang Dang National Institute of Standards

New York University 2016 System for KBP Event Nugget: A Deep Learning Approach Thien Huu Nguyen,

The BeSt Eval at the 2016 NIST TAC KBP Overview BeSt Eval Task

The Columbia-GWU System at the 2016 TAC KBP BeSt Evaluation Owen Rambow, Tao Yu, Axinia Radeva,

Joe Ellis (presenter), Jeremy Getman, Stephanie Strassel Linguistic Data Consortium University of

TAC 2018 Streaming Multimedia KBP Pilot Hoa Trang Dang National Institute of Standards and

The BeSt Eval at the 2017 NIST TAC KBP BeSt: Evaluating Mind Reading People in real world:

Stanford-UBC at TAC-KBP Eneko Agirre , Angel Chang, Dan Jurafsky, Christopher Manning, Valentin

UTD at the KBP 2016 Event Track Jing Lu and Vincent Ng Human Language Technology Research

Hardware Security Modules: Attacks and Secure Configuration Graham Steel Graham Steel April

20 16 CEO Address to Shareholders B USINESS OVERVIEW Leading >1 million scans per

Agenda What is The Access Point? Integra3on of

Transforming emotional wellbeing and mental health services for children and young people across

The Economy is reliant on the Internet The state of Internet security is eroding quickly. Trust

CyberTruck Challenge 1 Connecting next generation talent with the heavy duty industry to keep

Integrating Device Registries and Innovative Tools for Enhanced Medical Device Evaluation and

New Jersey Sustainable Business Program NJDEP Sustainable Business Initiative March 3, 2015 NJ

Sambuz

Useful Links

Newsletter

Mail Us

TAC KBP 2016 Linguistic Resources: Event Arguments (EA), Event - PowerPoint PPT Presentation

TAC KBP 2016 Linguistic Resources: Event Arguments (EA), Event Nuggets (EN) and Belief/Sentiment (BeSt) Joe Ellis (presenter), Jennifer Tracey (presenter), Jeremy Getman, Zhiyi Song, Ann Bies, Stephanie Strassel Linguistic Data Consortium

Overview of Event Nugget Track TAC KBP 2016 Teruko Mitamura Zhengzhong Liu Eduard Hovy

Events Detection, Coreference and Sequencing: Whats next? Overview of TAC KBP 2017 Event

KBP 2017 Cold Start KB Construction and Slot Filling Hoa Dang Shahzad Rajput U.S. National

Overview of 2015 TAC KBP Event Nugget Tasks Teruko Mitamura Zhengzhong Liu Eduard Hovy

Event Detection and Coreference TAC KBP 2015 Sean Monahan, Michael Mohler, Marc Tomlinson Amy

Command Line Arguments ECE2893 Lecture 20 ECE2893 Command Line Arguments Spring 2011 1 / 5

Joe Ellis (presenter), Jeremy Getman, Zhiyi Song, Ann Bies, Stephanie Strassel Linguistic Data

Text Analysis Conference TAC 2016 Sponsored by: Hoa Trang Dang National Institute of Standards

New York University 2016 System for KBP Event Nugget: A Deep Learning Approach Thien Huu Nguyen,

The BeSt Eval at the 2016 NIST TAC KBP Overview BeSt Eval Task

The Columbia-GWU System at the 2016 TAC KBP BeSt Evaluation Owen Rambow, Tao Yu, Axinia Radeva,

Joe Ellis (presenter), Jeremy Getman, Stephanie Strassel Linguistic Data Consortium University of

TAC 2018 Streaming Multimedia KBP Pilot Hoa Trang Dang National Institute of Standards and

The BeSt Eval at the 2017 NIST TAC KBP BeSt: Evaluating Mind Reading People in real world:

Stanford-UBC at TAC-KBP Eneko Agirre , Angel Chang, Dan Jurafsky, Christopher Manning, Valentin

UTD at the KBP 2016 Event Track Jing Lu and Vincent Ng Human Language Technology Research

Hardware Security Modules: Attacks and Secure Configuration Graham Steel Graham Steel April

20 16 CEO Address to Shareholders B USINESS OVERVIEW Leading &gt;1 million scans per

Agenda What is The Access Point? Integra3on of

Transforming emotional wellbeing and mental health services for children and young people across

The Economy is reliant on the Internet The state of Internet security is eroding quickly. Trust

CyberTruck Challenge 1 Connecting next generation talent with the heavy duty industry to keep

Integrating Device Registries and Innovative Tools for Enhanced Medical Device Evaluation and

New Jersey Sustainable Business Program NJDEP Sustainable Business Initiative March 3, 2015 NJ

Sambuz

Useful Links

Newsletter

Mail Us

20 16 CEO Address to Shareholders B USINESS OVERVIEW Leading >1 million scans per