for Cross-lingual Knowledge Alignment Muhao Chen 1 , Yingtao Tian 2 , - - PowerPoint PPT Presentation

for cross lingual knowledge alignment
SMART_READER_LITE
LIVE PREVIEW

for Cross-lingual Knowledge Alignment Muhao Chen 1 , Yingtao Tian 2 , - - PowerPoint PPT Presentation

Multilingual Knowledge Graph Embeddings for Cross-lingual Knowledge Alignment Muhao Chen 1 , Yingtao Tian 2 , Mohan Yang 1 , and Carlo Zaniolo 1 University of California, Los Angeles 1 Stony Brook University 2 Outline Background MTransE


slide-1
SLIDE 1

Multilingual Knowledge Graph Embeddings for Cross-lingual Knowledge Alignment

Muhao Chen1, Yingtao Tian2, Mohan Yang1, and Carlo Zaniolo1 University of California, Los Angeles1 Stony Brook University2

slide-2
SLIDE 2

Outline

  • Background
  • MTransE—A multilingual knowledge graph embedding model
  • Evaluation
  • Open Challenges and Future Work
slide-3
SLIDE 3

Knowledge Graphs

  • Symbolic representation of entities and relations

Monolingual knowledge: triples (relation facts of entities) Cross-lingual knowledge: alignment of monolingual knowledge across languages (California, capital city, Sacramento) (カリフォルニア, 首都,サクラメント)

slide-4
SLIDE 4

Knowledge Graph Embeddings

  • Encode entities as vectors

Bach Male Germany Eisenach

Knowledge Graph Encode Embeddings Enable

Relational inferences as vector algebra

– France – Paris ≈ capital – US – USD ≈ currency – Bach – German ≈ nationality – …

Applications

  • KG Completion
  • Relation extraction from text
  • Question answering

Capture

Semantic similarity of entities

Paris (0.036, -0.12, ..., 0.323) capital (0.102, 0.671, …, -0.101) France (0.138, 0.551, …, 0.222) …

slide-5
SLIDE 5

Current KG Embedding Approaches

TransE: h+r≈t

  • Focused on embedding monolingual triples (h, r, t)

Later approaches

– TransH [Wang et al. 2014] – TransR [Lin et al. 2015] – TransD [Ji et al. 2015] – HolE [Nickle et al. 2016] – ComplEx [Trouillon et al. 2016] – …

Embedding of monolingual knowledge seems to be well-addressed. What about cross-lingual knowledge?

slide-6
SLIDE 6

Emerging challenge

  • Existing works do not characterize cross-lingual knowledge

– Entity inter-lingual links (ILLs): (ambulance --- krankenwagen) – Triple-wise alignment (TWA): ((State of California, capital city, Sacramento) --- (カリフォ ルニア, 首都,サクラメント)) – Many KGs store such knowledge Why important?

  • Enables multilingual

semantic representations

  • Benefits cross-lingual NLP

– Knowledge alignment – Machine translation – Cross-lingual Q&A – … Difficult to characterize:

  • Fewer samples: Cross-lingual knowledge currently

accounts for a small portion of each KB

  • Larger domains: Cross-lingual knowledge applies on the

entire spaces of involved languages

  • Incoherence: Language-specific versions of KG are

usually incoherent

  • Heterogeneity: Applies to both entities and

monolingual relations with inconsistent vocabularies

slide-7
SLIDE 7

What does MTransE use and enable?

  • Corpora: (partially-aligned) multilingual KGs
  • Enabling: inferable embeddings of

multilingual semantics

  • Can be applied to:

– Knowledge alignment – Cross-lingual Q&A – Multilingual chat-bots – …

France Capital Paris

+

フラ ンス 首都 パリ

+

slide-8
SLIDE 8

MTransE Model Components

  • Knowledge model
  • Alignment model
  • Objective of learning

– Minimizing 𝐾(𝜄) = 𝑇𝐿 + 𝛽𝑇𝐵 𝑇𝐿 = ෍

𝑀∈{𝑀𝑗,𝑀𝑘}

𝑈∈𝐻𝑀

||𝐢 + 𝐬 − 𝐮||

(h, r, t) (h , r , t ) Space L1 Space L2 Alignment model Knowledge model

𝑇𝐵 = ෍

𝑈,𝑈′ ∈𝜀(𝑀𝑗,𝑀𝑘)

𝑇𝑏(𝑈, 𝑈′)

All aligned triples

slide-9
SLIDE 9

Dif ifferent alignment techniques

Space Li Space Lj

Space Li Space Lj Translate Translate Translate

Space Li Space Lj Transformations Mij

Axis calibration

  • Cross-lingual counterparts

have close embeddings Translation vectors

  • Encoding cross-lingual

transitions just like monolingual relations Linear Transformations

  • Transformations across

embedding spaces of different languages

slide-10
SLIDE 10

Ali lignment Scores and Five Model Variants

  • Vari combines the ith alignment model with the knowledge model

Variant Alignment Score Remark Var1 𝑇𝑏1 = 𝒊 − 𝒊′ + 𝒖 − 𝒖′ Var2 𝑇𝑏2 = 𝒊 − 𝒊′ + 𝒔 − 𝒔′ + 𝒖 − 𝒖′ Var3 𝑇𝑏3 = 𝒊 + 𝒘𝒋𝒌

𝒇 − 𝒊′ + 𝒔 + 𝒘𝒋𝒌 𝒔 − 𝒔′

+ 𝒖 + 𝒘𝒋𝒌

𝒇 − 𝒖′

𝒘𝒋𝒌

𝒇 =−𝒘𝒌𝒋 𝒇 , 𝒘𝒋𝒌 𝒔 =−𝒘𝒌𝒋 𝒔

Var4 𝑇𝑏4 = 𝑵𝑗𝑘

𝑓 𝒊 − 𝒊′ + 𝑵𝑗𝑘 𝑓 𝒖 − 𝒖′

𝑵𝑗𝑘

𝑓 ∈ ℝ𝒍×𝒍, 𝑵𝑗𝑘 𝑠 ∈ ℝ𝒍×𝒍

Var5 𝑇𝑏5 = 𝑵𝑗𝑘

𝑓 𝒊 − 𝒊′ + 𝑵𝑗𝑘 𝑠 𝒔 − 𝒔′

+ 𝑵𝑗𝑘

𝑓 𝒖 − 𝒖′

Axis Calibration Linear Transforms Translation Vector

slide-11
SLIDE 11

Experimental Evaluation

  • Cross-lingual knowledge alignment tasks

– Entity Matching – Triple-wise Alignment (TWA) Verification

  • Monolingual relation extraction task
  • Trilingual data sets

– Wiki-based (WK3l-15k, WK3l-120k) – ConceptNet-based (CN3l)

  • Baselines

– LM [Mikolov et al. 2013] + Knowledge models – CCA [Faruqui et al. 2014] + Knowledge models – OT [Xing et al. 2015] + Knowledge models These three data sets are available at https://github.com/muhaochen/MTransE

slide-12
SLIDE 12

Entity Matching

  • Evaluation protocol

– For each (e, e’), rank e’ in the neighborhood of 𝜐 𝒇

  • Training sets

– Pairs of language-specific graphs and corresponding alignment sets

  • Test data

– Entity Inter-lingual links {(e, e’)} (Unidirectional) What is the German entity for the English entity “Regulation of Property”?

slide-13
SLIDE 13

Entity Matching

20 40 60 80 100 Hits@10/En-Fr Hits@10/Fr-En Hits@10/En-De Hits@10/De-En

Hits@10 on WK3l-15k

LM CCA OT Var1 Var2 Var3 Var4 Var5

20 40 60 80 100 Hits@10/En-Fr Hits@10/Fr-En Hits@10/En-De Hits@10/De-En

Hits@10 on WK3l-120k

LM CCA OT Var1 Var2 Var3 Var4 Var5

1 10 100 1000 10000 Mean/En-Fr Mean/Fr-En Mean/En-De Mean/De-En

Mean on WK3l-15k

LM CCA OT Var1 Var2 Var3 Var4 Var5

1 10 100 1000 10000 Mean/En-Fr Mean/Fr-En Mean/En-De Mean/De-En

Mean on CN3l

LM CCA OT Var1 Var2 Var3 Var4 Var5

Var4≈Var5>Var1≈Var3≈OT>Var2≫CCA>LM

Axis Calibration

Var1, Var2

  • Trans. Vectors

Var3

Linear Transforms

Var4, Var5

20 40 60 80 100 Hits@10/En-Fr Hits@10/Fr-En Hits@10/En-De Hits@10/De-En

Hits@10 on CN3l

LM CCA OT Var1 Var2 Var3 Var4 Var5

slide-14
SLIDE 14

Triple-wise Ali lignment Verifi fication

Var4≈Var5>Var1>Var2>Var3≈OT ≫CCA>LM

We receive similar evaluation conclusions in all settings.

Axis Calibration

Var1, Var2

  • Trans. Vectors

Var3

Linear Transforms

Var4, Var5

10 20 30 40 50 60 70 80 90 100

Accuracy of TWA Verification

LM CCA OT Var1 Var2 Var3 Var4 Var5

slide-15
SLIDE 15

Monolingual Relation Ext xtraction (E (English, French)

  • Train/Test

– Train Sets: 90% triples and intersecting alignment sets – Test Sets: 10% triples

  • MTransE preserves well the

monolingual relations

5 10 15 20 25 30 35 40 45 WK3l-15k/EN WK3l-15k/FR WK3l-120k/EN WK3l-120k/FR

Predicting Missing Tails (Hits@10)

TransE Var1 Var2 Var3 Var4 Var5

10 20 30 40 50 60 70 80 WK3l-15k/EN WK3l-15k/FR WK3l-120k/EN WK3l-120k/FR

Predicting Missing Relations (Hits@10)

TransE Var1 Var2 Var3 Var4 Var5

Axis Calibration

Var1, Var2

  • Trans. Vectors

Var3

Linear Transforms

Var4, Var5

slide-16
SLIDE 16

Applications based on MTransE

  • Multilingual Q&A
  • Cross-lingual relation prediction
  • Improving monolingual KG completion using multilingual correlation
  • Knowledge alignment across knowledge bases
slide-17
SLIDE 17

Examples of f Cross-lingual Question Answering

Bold-faced ones are correct answers, italic ones are close answers.

slide-18
SLIDE 18

Improve the embedding model

  • Other forms of knowledge models and alignment models

– Neural knowledge models such as HolE and ComplEx – Other alignment models such as affine transformations – Alignment models which consider disambiguation

  • Encoding more information from multilingual KGs

– Entity domains, class templates, entity descriptions, etc – Cross-lingual disambiguation

  • Jointly embedding with other forms of corpora such as multilingual

documents

slide-19
SLIDE 19

References

  • [Bordes et al., 2013] Antoine Bordes, Nicolas Usunier, Alberto Garcia-Duran, Jason Weston, and

Oksana Yakhnenko. Translating embeddings for modeling multi-relational data. In NIPS, pages 2787–2795, 2013.

  • [Nickel et al., 2016] Maximilian Nickel, Lorenzo Rosasco, Tomaso Poggio, et al. Holographic

embeddings of knowledge graphs. In AAAI, 2016.

  • [Saxe et al., 2014] Andrew M Saxe, James L McClelland, and Surya Ganguli. Exact solutions to the

nonlinear dynamics of learning in deep linear neural networks. ICLR, 2014.

  • [Wang et al., 2014] Zhen Wang, Jianwen Zhang, Jianlin Feng, and Zheng Chen. Knowledge graph

embedding by translating on hyperplanes. In AAAI, 2014.

  • [Lin et al., 2015] Yankai Lin, Zhiyuan Liu, Maosong Sun, Yang Liu, and Xuan Zhu. Learning entity

and relation embeddings for knowledge graph completion. In AAAI, 2015.

  • [Ji et al., 2015] Guoliang Ji, Shizhu He, Liheng Xu, Kang Liu, and Jun Zhao. Knowledge graph

embedding via dynamic mapping matrix. In ACL, pages 687–696, 2015.

  • [Mikolov et al., 2013] Tomas Mikolov, Quoc V Le, and Ilya Sutskever. Exploiting similarities among

languages for machine translation. arXiv, 2013.

  • [Faruqui and Dyer, 2014] Manaal Faruqui and Chris Dyer. Improving vector space word

representations using multilingual correlation. EACL, 2014.

  • [Xing et al., 2015] Chao Xing, Dong Wang, Chao Liu, and Yiye Lin. Normalized word embedding

and orthogonal transform for bilingual word translation. In NAACL HLT, pages 1006–1011, 2015.

slide-20
SLIDE 20

Thank You

20