Avi Sil Joint work with: Georgiana Dinu, Gourab Kundu and - PowerPoint PPT Presentation

Avi Sil
Joint work with: Georgiana Dinu, Gourab Kundu and Radu Florian
IBM Research AI

• Architecture for the IBM Entity Discovery & Linking (EDL) System
  ◦ Model & Results
    ▪ Mention Detection
    ▪ In-doc Coref Resolution
    ▪ Entity Linking & Clustering

• Architecture for the IBM Entity Discovery & Linking (EDL) System
  ◦ Model & Results
    ▪ Mention Detection
    ▪ In-doc Coref Resolution
    ▪ Entity Linking & Clustering
  (side annotation on the slide: Neural & Traditional Models)

MD Coref EL Experiments Conclusion

• Standard IOB sequence classifier, trained on the task
• 2 main classifiers: CRF and Neural Network-based
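To make the IOB encoding concrete, here is a minimal sketch of how a tag sequence produced by such a classifier could be decoded into mention spans. The function name `iob_to_spans`, the entity types `PER` and `ORG`, and the example sentence are illustrative assumptions, not taken from the slides.

```python
from typing import List, Tuple

def iob_to_spans(tags: List[str]) -> List[Tuple[int, int, str]]:
    """Convert IOB tags into (start, end_exclusive, type) mention spans."""
    spans = []
    start, label = None, None
    # A trailing "O" sentinel flushes a span that runs to the end.
    for i, tag in enumerate(tags + ["O"]):
        inside = tag.startswith("I-") and label == tag[2:]
        if not inside:
            if label is not None:
                spans.append((start, i, label))
                start, label = None, None
            if tag.startswith("B-") or tag.startswith("I-"):
                # ASSUMPTION: a stray I- tag without a matching B- opens a new span.
                start, label = i, tag[2:]
    return spans

# Illustrative input (not from the slides).
tokens = ["Avi", "Sil", "works", "at", "IBM", "Research"]
tags = ["B-PER", "I-PER", "O", "O", "B-ORG", "I-ORG"]
spans = iob_to_spans(tags)
for start, end, label in spans:
    print(label, " ".join(tokens[start:end]))
```

Treating a stray `I-` tag as a span start is one common leniency; a stricter decoder could drop such tags instead.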

• Model probability: $P(y_t \mid X, y_{t-1})$
• Additional features: Gazetteers, Character-level LSTMs
• Recurrence: previous 2 labels are embedded and added as input

• Both systems (CRF, NN) have high precision
• We combine them as follows
  ◦ Start with the “best” system
  ◦ For each subsequent system
    ▪ Add any mentions that do not overlap with the current output

Year labels on the slide: 2017, 2016

           CRF (dev)   NN (dev)   NN+CRF (tst)
English    0.803       0.843      0.806
Spanish    0.785       0.809      0.785
Chinese    0.811       0.843      0.699 (0.75*)

* CharCNNs

The Lample model didn’t produce better results on our dev data.
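The combination rule on this slide can be sketched as follows. This is a minimal illustration under stated assumptions: mentions are represented as `(start, end_exclusive, type)` tuples, "overlap" means sharing at least one token, and the names `overlaps` and `combine` as well as the toy outputs are hypothetical.

```python
from typing import List, Tuple

Mention = Tuple[int, int, str]  # (start, end_exclusive, type)

def overlaps(a: Mention, b: Mention) -> bool:
    """True if the two token spans share at least one position."""
    return a[0] < b[1] and b[0] < a[1]

def combine(systems: List[List[Mention]]) -> List[Mention]:
    """Merge system outputs; `systems` is ordered best system first."""
    merged = list(systems[0]) if systems else []
    for output in systems[1:]:
        for mention in output:
            # Keep a mention only if it does not overlap the current output.
            if not any(overlaps(mention, kept) for kept in merged):
                merged.append(mention)
    return sorted(merged)

# Toy outputs (illustrative): the NN system is taken as the "best" system.
nn_output = [(0, 2, "PER"), (4, 6, "ORG")]
crf_output = [(1, 2, "PER"), (8, 9, "GPE")]
combined = combine([nn_output, crf_output])
print(combined)
```

Because both systems are described as high precision, adding only non-overlapping mentions mainly raises recall while leaving the best system's decisions untouched.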

• Train monolingual embeddings in En and the foreign language
• Use a small dictionary to train a map from the foreign language into the English embedding space (Mikolov et al., 13)
• Train an En mention detection model
• Decode new languages using the En model and mapped embeddings
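The mapping step can be illustrated with a small sketch: a linear map W is fitted so that W applied to a foreign word vector lands near the English vector of its dictionary translation, by minimizing the squared error with stochastic gradient descent. The 2-dimensional toy vectors, the learning rate, and the names `apply_map` and `train_map` are assumptions for illustration; the slides do not give the training details.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]

def apply_map(W: Matrix, x: Vector) -> Vector:
    """Project a foreign-language vector into the English space."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def train_map(src: List[Vector], tgt: List[Vector],
              lr: float = 0.1, epochs: int = 500) -> Matrix:
    """Fit W to minimize sum ||W x - z||^2 over dictionary pairs (x, z)."""
    d_out, d_in = len(tgt[0]), len(src[0])
    W = [[0.0] * d_in for _ in range(d_out)]
    for _ in range(epochs):
        for x, z in zip(src, tgt):
            pred = apply_map(W, x)
            for r in range(d_out):
                err = pred[r] - z[r]
                for c in range(d_in):
                    # Gradient step (the constant factor 2 is folded into lr).
                    W[r][c] -= lr * err * x[c]
    return W

# Toy dictionary (illustrative): English vectors are the foreign ones rotated by 90 degrees.
src = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]    # foreign-language vectors
tgt = [[0.0, 1.0], [-1.0, 0.0], [-1.0, 1.0]]  # English vectors of their translations
W = train_map(src, tgt)
mapped = apply_map(W, [2.0, 1.0])  # an unseen foreign vector
print([round(v, 3) for v in mapped])
```

Once every foreign vector is mapped this way, the English mention detection model can be run on the foreign text without retraining, which is the decoding step the last bullet describes.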

• Weak classifiers:
  ◦ Silver-data (Pan et al., 16) trained NN models
  ◦ Cross-lingual transfer of models with:
    1. TAC data
    2. In-house mention detection data
• Train a NN classifier to combine all the weak classifier outputs
• Use Spanish as a test case, apply to all other languages

           Silver-trained   Best transfer   Combination   Supervised
Spanish    0.335            0.609           0.704         0.809

Pan et al., ACL 16

• All mentions in a document are clustered into entities using an in-document coreference system
• The canonical mention of an entity is linked using the EL system
• The link of the canonical mention is assigned to all mentions in the entity
• We use 2 different coreference systems in this evaluation
  ◦ MaxEnt Model
  ◦ Neural network based Model
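The link propagation described on this slide can be sketched as below. The slides do not say how the canonical mention is chosen, so this sketch assumes the longest surface string; the toy knowledge base, the `"NIL"` fallback and the function name `propagate_links` are likewise hypothetical.

```python
from typing import Callable, Dict, List, Tuple

Mention = Tuple[int, str]  # (mention id, surface string)

def propagate_links(clusters: List[List[Mention]],
                    link_fn: Callable[[str], str]) -> Dict[int, str]:
    """Link each cluster's canonical mention and copy the link to all members."""
    links: Dict[int, str] = {}
    for cluster in clusters:
        # ASSUMPTION: the longest surface string is the canonical mention.
        canonical = max(cluster, key=lambda m: len(m[1]))
        target = link_fn(canonical[1])
        for mention_id, _ in cluster:
            links[mention_id] = target
    return links

# Toy knowledge base and coreference clusters (illustrative only).
toy_kb = {"Stuart Broad": "en/Stuart_Broad"}
clusters = [
    [(0, "Broad"), (1, "Stuart Broad"), (2, "he")],
    [(3, "Bresnan")],
]
links = propagate_links(clusters, lambda surface: toy_kb.get(surface, "NIL"))
print(links)
```

Linking only the canonical mention means the EL system is called once per entity, and short or pronominal mentions inherit a link they could not have obtained on their own.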

• This model is used for languages without any gold standard training data
  ◦ low-resource languages like Nepali
• This model is trained over English coreference data using multilingual embeddings
• Subsequently, the model is tested over data from a new language without any retraining

[Figure: neural entity-pair coreference model. For two entities E1 = {m1, m2} and E2 = {m3, m4}, each cross-entity mention pair contributes features ϕ(mi, mj) and a vector v_{mi,mj}, for the pairs (m1, m3), (m2, m3), (m1, m4), (m2, m4). Feature labels: embed, gen. Layer labels: hidden layer, weighted average layer, softmax layer. Outputs: P(y=1|E1,E2) and P(y=0|E1,E2).]

• Model is trained with multilingual embeddings over
  ◦ TAC 15 training portion of English coreference data
  ◦ TAC 16 test portion of English coreference data
• Model is tested over
  ◦ TAC 15 test portion of 3 languages

Language          MUC    B3     CEAF
TAC 15-test-Eng   0.9    0.89   0.84
TAC 15-test-Spa   0.91   0.92   0.88
TAC 15-test-Cmn   0.97   0.96   0.91

• Language Independent EL system: LIEL (Sil & Florian, 16)
  ◦ Collective disambiguation model based on Maximum Entropy
• Example (Chinese): [查理周刊] 记者 [洛朗·莱热] 捍卫杂志的时候，他说的漫画并不是要挑起愤怒或暴力行为。
  (English: "When defending the magazine, [Charlie Hebdo] journalist [Laurent Léger] said the cartoons were not meant to provoke anger or violence.")
  ◦ [查理周刊]: Chinese WP → English WP: Charlie_Hebdo
  ◦ [洛朗·莱热]: NIL009
• SOTA performance on TAC evaluation & other benchmarks

• New system
• Neural Cross-lingual Entity Linking
  ◦ Zero-shot model
  ◦ Avi Sil, Gourab Kundu, Radu Florian, Wael Hamza
  ◦ AAAI 2018

• Given: Query mention $m$ and a document $D \in \text{en}$ and Wikipedia $\text{KB}_{\text{en}}$
• Step 1 (Fast Search): Extract the most likely list of links $l_1, \ldots, l_k$ for $m$ in $D$
• Step 2 (Ranking): Estimate: [equation not recoverable from the extracted slide text]
• where “$C$” is the consistency measure for matching contexts between:
  ◦ the pair $(m, D)$ and a Wikipedia link $l_j$

• Given: Query mention $m$ and a document $D \in \text{tr}$ and Wikipedia $\text{KB}_{\text{en}}$
• Step 1 (Fast Search): Extract the most likely list of links $l_1, \ldots, l_k$ for $m$ in $D$
• Step 2 (Ranking): Estimate: [equation not recoverable from the extracted slide text]
• where “$C$” is the consistency measure for matching contexts between:
  ◦ the pair $(m, D)$ and a Wikipedia link $l_j$

Tayvan, ABD ve İngiltere'de hukuk okuması, Tsai'ye bir LL.B. kazandırdı …
(English: "Studying law in Taiwan, the US and the UK earned Tsai an LL.B. …")
Example by Tsai & Roth ’16

• Challenges:
  • Link to the English Wikipedia
  • Comparing non-English words to English Wikipedia titles

• Problem Formulation
  ◦ Fast Search
• Word Embeddings
• Modeling Contexts
• Cross-Lingual Entity Linking
  ◦ Model
  ◦ Feature Abstraction layer
• Experiments

Example sentences containing the mention "Cruise":
• "On June 29, 2012, Holmes had filed for divorce from Cruise in New York after five years of marriage."
• "Ethan Hunt (Cruise) while vacationing is alerted…"
• "Cruise joined in and made his debut for Arsenal F.C. Reserves…"

Wikipedia pages: Tom Cruise, Thomas Cruise (footballer)

Cruise:
• en/Tom_Cruise (probability: 0.66)
• en/Thomas_Cruise_(footballer) (probability: 0.33)
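One way to obtain candidate probabilities like those in the "Cruise" example is to count how often an anchor text points to each Wikipedia page. The sketch below assumes the probabilities are relative anchor-text frequencies, which is consistent with the 0.66 and 0.33 shown (two of three occurrences versus one of three) but is not stated on the slide. The names `build_prior` and `candidates` are hypothetical.

```python
from collections import Counter, defaultdict
from typing import Dict, List, Tuple

def build_prior(anchors: List[Tuple[str, str]]) -> Dict[str, Dict[str, float]]:
    """Estimate P(link | surface form) from (anchor text, target page) pairs."""
    counts: Dict[str, Counter] = defaultdict(Counter)
    for surface, target in anchors:
        counts[surface][target] += 1
    prior = {}
    for surface, counter in counts.items():
        total = sum(counter.values())
        prior[surface] = {t: n / total for t, n in counter.most_common()}
    return prior

def candidates(prior: Dict[str, Dict[str, float]], surface: str,
               k: int = 2) -> List[Tuple[str, float]]:
    """Return the k most likely links for a mention string."""
    ranked = sorted(prior.get(surface, {}).items(), key=lambda kv: -kv[1])
    return ranked[:k]

# Anchor occurrences matching the three example sentences.
anchors = [
    ("Cruise", "en/Tom_Cruise"),
    ("Cruise", "en/Tom_Cruise"),
    ("Cruise", "en/Thomas_Cruise_(footballer)"),
]
prior = build_prior(anchors)
for link, p in candidates(prior, "Cruise"):
    print(link, round(p, 2))
```

A lookup of this kind is cheap, which fits its role as a fast first pass that hands a short candidate list to the ranking step.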

"..a los Premios Óscar y en cuatro a los Premios Globo de Oro, su significativa presencia.."
(English, approximately: "..for the Academy Awards and four times for the Golden Globe Awards, their significant presence..")

Interlanguage Links
• Premios Óscar: en/Academy_Awards (probability: 1.0)
• Premios Globo de Oro: en/Golden_Globe_Awards (probability: 1.0)

• Mono-lingual (English)
  ◦ CBOW Word2Vec
• Multi-lingual
  ◦ Canonical Correlation Analysis (CCA) (Faruqui & Dyer, 14; Tsai & Roth, 16):
    ▪ Alignment using Wikipedia title mapping obtained from inter-language links
  ◦ Multi-CCA (Ammar et al., 16)
    ▪ Project pre-trained monolingual embeddings in each language (except English) to the vector space of pre-trained English word embeddings
  ◦ Weighted Least Squares (LS) (Mikolov et al., 13)

• Get all sentences from the entity coref chain
  “[Broad] catapulted [England] to a 74-run win over [Australia] …
  [Broad] sent captain [Michael Clarke]'s off stump cart-wheeling before [Steve Smith] ..
  [Broad] and [Bresnan] found their stride in the evening session..”
• Concatenate them together
  ◦ Get a variable length representation
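A minimal sketch of this context-building step: collect the sentences that contain a mention from the entity's coreference chain and concatenate their tokens. The whitespace tokenization, the filler sentence, and the name `chain_context` are assumptions for illustration.

```python
from typing import List

def chain_context(sentences: List[List[str]],
                  chain_sentence_ids: List[int]) -> List[str]:
    """Concatenate the sentences that contain a mention of the entity.

    `chain_sentence_ids` holds one sentence index per mention in the chain;
    each sentence is used once, in document order.
    """
    ordered = sorted(set(chain_sentence_ids))
    return [token for idx in ordered for token in sentences[idx]]

# Sentences 0 and 2 come from the slide; sentence 1 is a hypothetical filler.
sentences = [
    "Broad catapulted England to a 74-run win over Australia".split(),
    "Rain delayed the start".split(),
    "Broad and Bresnan found their stride in the evening session".split(),
]
# One entry per mention of "Broad" (two mentions assumed in sentence 2).
context = chain_context(sentences, [0, 2, 2])
print(len(context), context[:3])
```

The result grows with the number of sentences in the chain, which is why the slide calls it a variable length representation.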

[Figure: context encoder. Context from the source document → convolution layer → mean pooling → tanh.]
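To make the layer stack concrete, here is a small pure-Python sketch of a convolution over word embeddings followed by mean pooling and tanh, assuming that is the order in which the figure's layers are applied. The filter width, the fixed filter weights (which would be learned in practice), the absence of a bias term, and the name `conv_mean_tanh` are all assumptions.

```python
import math
from typing import List

def conv_mean_tanh(words: List[List[float]],
                   filters: List[List[float]],
                   width: int) -> List[float]:
    """Turn a variable-length context into a fixed-size vector.

    words:   n word embeddings of dimension d
    filters: each filter has width * d weights (no bias, for brevity)
    """
    if len(words) < width:
        raise ValueError("context is shorter than the filter width")
    # Each window is the concatenation of `width` consecutive embeddings.
    windows = [sum(words[i:i + width], []) for i in range(len(words) - width + 1)]
    output = []
    for f in filters:
        activations = [sum(w * x for w, x in zip(f, win)) for win in windows]
        pooled = sum(activations) / len(activations)  # mean pooling over positions
        output.append(math.tanh(pooled))
    return output

# Toy 2-dimensional embeddings for a 3-word context (illustrative).
words = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
filters = [[1.0, 0.0, 0.0, 1.0], [0.5, 0.5, 0.5, 0.5]]
vector = conv_mean_tanh(words, filters, width=2)
print([round(v, 4) for v in vector])
```

The output has one value per filter regardless of how many words the context contains, so documents of different lengths can be compared in the same space.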

• Get all possible links of the mention from the KB
  “[Broad] catapulted [England] to a 74-run win over [Australia] …”

• Extract the first paragraph of the current link/page
• Run CNNs on them

• Objective: Model the whole Wikipedia page for an entity
• We compute the embeddings $e_p$ of the page $p$: [equation not recoverable from the extracted slide text]

[Figure: fine-grained context model. LSTMs run over the words w of each left context (Left Context 1 … Left Context n) and each right context around the mention m; the hidden states h are mean-pooled into an Overall Left Context and an Overall Right Context, which feed Slices of NTN to produce the Final Context Vector.]
