Automatic Category Label Coarsening for Syntax-Based Machine Translation
Greg Hanneman and Alon Lavie
Language Technologies Institute, Carnegie Mellon University
Fifth Workshop on Syntax and Structure in Statistical Translation
June 23, 2011
Motivation
• SCFG-based MT:
  – Training data annotated with constituency parse trees on both sides
  – Extract labeled SCFG rules:
    A::JJ → [bleues]::[blue]
    NP::NP → [D₁ N₂ A₃]::[DT₁ JJ₃ NNS₂]
• We think syntax on both sides is best
• But the default joint label set is too large
Motivation
• Labeling ambiguity: the same RHS is extracted with many LHS labels
    JJ::JJ → [快速]::[fast]
    AD::JJ → [快速]::[fast]
    JJ::RB → [快速]::[fast]
    VA::JJ → [快速]::[fast]
    VP::ADJP → [VV₁ VV₂]::[RB₁ VBN₂]
    VP::VP → [VV₁ VV₂]::[RB₁ VBN₂]
Motivation
• Rule sparsity: a label mismatch blocks rule application
    VP::VP → [VV₁ 了 PP₂ 的 NN₃]::[VBD₁ their NN₃ PP₂]
    VP::VP → [VV₁ 了 PP₂ 的 NN₃]::[VB₁ their NNS₃ PP₂]
  ✓ saw their friend from the conference
  ✓ see their friends from the conference
  ✘ saw their friends from the conference
Motivation
• Solution: modify the label set
• Preference grammars [Venugopal et al. 2009]
  – An X rule specifies a distribution over SAMT labels
  – Avoids score fragmentation, but the original labels are still used for decoding
• Soft matching constraint [Chiang 2010]
  – Substituting A::Z at B::Y incurs model costs subst(B, A) and subst(Y, Z)
  – Avoids application sparsity, but each subst(s₁, s₂) and subst(t₁, t₂) must be tuned separately
Our Approach
• A difference in translation behavior ⇒ different category labels
    la grande voiture → the large car
    la plus grande voiture → the larger car
    la voiture la plus grande → the largest car
• Simple measure: how a category is aligned to the other language
    A::JJ → [grande]::[large]
    AP::JJR → [plus grande]::[larger]
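The "simple measure" above can be estimated directly from the extracted grammar: for each source-side label, count which target-side labels it co-occurs with in rule left-hand sides, then normalize. A minimal sketch (the function name and the rule data are illustrative, not from the paper):

```python
from collections import Counter, defaultdict

def alignment_distributions(rule_lhs_pairs):
    """Given (source_label, target_label) pairs taken from SCFG rule
    left-hand sides (e.g. A::JJ yields ("A", "JJ")), return each source
    label's probability distribution over aligned target labels."""
    counts = defaultdict(Counter)
    for src, tgt in rule_lhs_pairs:
        counts[src][tgt] += 1
    return {src: {tgt: n / sum(c.values()) for tgt, n in c.items()}
            for src, c in counts.items()}

# Illustrative French-English LHS pairs, in the spirit of the slide.
dists = alignment_distributions(
    [("A", "JJ"), ("A", "JJ"), ("A", "JJR"), ("AP", "JJR")])
# dists["A"] puts 2/3 of its mass on JJ and 1/3 on JJR;
# dists["AP"] aligns only to JJR.
```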
L₁ Alignment Distance
[figure: alignment distributions for the English labels JJ, JJR, and JJS]
L₁ Alignment Distance
• Pairwise L₁ distances between the labels' alignment distributions:

            JJ       JJR
    JJR   0.9941
    JJS   0.8730   0.3996
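The distance in the table above is the standard L₁ (Manhattan) distance between two labels' alignment distributions. A minimal sketch, with toy distributions whose values are illustrative rather than taken from the slide:

```python
def l1_distance(p, q):
    """L1 (Manhattan) distance between two alignment distributions,
    each a dict mapping other-side labels to probabilities."""
    labels = set(p) | set(q)
    return sum(abs(p.get(l, 0.0) - q.get(l, 0.0)) for l in labels)

# Toy distributions over target labels (illustrative values).
p_JJR = {"JJ": 0.2, "JJR": 0.8}
p_JJS = {"JJ": 0.1, "JJR": 0.1, "JJS": 0.8}
d = l1_distance(p_JJR, p_JJS)  # |0.2-0.1| + |0.8-0.1| + |0-0.8| = 1.6
```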
Label Collapsing Algorithm
• Extract a baseline grammar from aligned tree pairs (e.g. Lavie et al. [2008])
• Compute label alignment distributions
• Repeat until a stopping point:
  – Compute the L₁ distance between all pairs of source labels and all pairs of target labels
  – Merge the label pair with the smallest distance
  – Update the label alignment distributions
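The loop above can be sketched as greedy agglomerative clustering over alignment counts. This simplified version handles only one side's labels (the actual algorithm considers both source- and target-side pairs and re-derives joint distributions after each merge); all names and counts are hypothetical:

```python
import itertools

def normalize(counts):
    total = sum(counts.values())
    return {k: v / total for k, v in counts.items()}

def l1(p, q):
    keys = set(p) | set(q)
    return sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)

def collapse(label_counts, num_merges):
    """Greedily merge the pair of labels whose alignment distributions
    are closest in L1 distance, pooling their counts, for num_merges
    iterations. One-sided sketch of the collapsing algorithm."""
    counts = {lab: dict(c) for lab, c in label_counts.items()}
    for _ in range(num_merges):
        if len(counts) < 2:
            break
        a, b = min(itertools.combinations(counts, 2),
                   key=lambda pair: l1(normalize(counts[pair[0]]),
                                       normalize(counts[pair[1]])))
        pooled = counts.pop(a)
        for k, v in counts.pop(b).items():
            pooled[k] = pooled.get(k, 0) + v
        counts[a + "+" + b] = pooled  # joint label records the merge
    return counts

# Toy example: VB and VBP align similarly, so they merge first.
toy = {"VB":  {"VV": 8, "NN": 2},
       "VBP": {"VV": 7, "NN": 3},
       "NN":  {"NN": 9, "VV": 1}}
result = collapse(toy, 1)
```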
Experiment 1
• Goal: explore the effect of collapsing with respect to the stopping point
• Data: Chinese–English FBIS corpus (302k sentence pairs)
• Pipeline: Parallel Corpus → Parse → Word Align → Extract Grammar → Collapse Labels → Build MT System
Effect on Label Set
• Number of unique labels in the grammar:

              Zh    En    Joint
  Baseline    55    71    1556
  Iter. 29    46    51    1035
  Iter. 45    38    44     755
  Iter. 60    33    34     558
  Iter. 81    24    22     283
  Iter. 99    14    14     106
Effect on Grammar
• Split the grammar into three partitions:
  – Phrase pair rules: NN::NN → [友好]::[friendship]
  – Partially lexicalized grammar rules: NP::NP → [2000 年 NN₁]::[the 2000 NN₁]
  – Fully abstract grammar rules: VP::ADJP → [VV₁ VV₂]::[RB₁ VBN₂]
Effect on Metric Scores
• NIST MT '03 Chinese–English test set
• Results averaged over four tune/test runs (BLEU and METEOR: higher is better; TER: lower is better)

             BLEU    METR    TER
  Baseline   24.43   54.77   68.02
  Iter. 29   27.31   55.27   63.24
  Iter. 45   27.10   55.24   63.41
  Iter. 60   27.52   55.32   62.67
  Iter. 81   26.31   54.63   63.53
  Iter. 99   25.89   54.76   64.82
Effect on Decoding
• Different outputs produced:
  – Collapsed-grammar 1-best found in the baseline 100-best: 3.5%
  – Baseline 1-best found in the collapsed-grammar 100-best: 5.0%
• Different hypergraph entries explored in cube pruning:
  – 90% of collapsed-grammar entries are not in the baseline
  – Overlapping entries tend to be short
• Hypothesis: different rule possibilities lead the search in a complementary direction
Experiment 2
• Goal: explore the effect of collapsing across language pairs
• Data: Chinese–English FBIS corpus; French–English WMT 2010 data
• The same pipeline is run independently for each pair:
  – Zh–En Corpus → Parse → Word Align → Extract Grammar → Collapse Labels → Build MT System
  – Fr–En Corpus → Parse → Word Align → Extract Grammar → Collapse Labels → Build MT System
Effect on English Collapsing
• Adverbs:
  – Zh–En: RB, RBR
  – Fr–En: RBR, RBS
• Verbs:
  – Zh–En: VB, VBG, VBN
  – Fr–En: VB, VBD, VBN, VBP, VBZ, MD
• Wh-phrases:
  – Zh–En: ADJP, WHADJP; ADVP, WHADVP
  – Fr–En: PP, WHPP
Effect on Label Set
• Full subtype collapsing: VNV, VSB, VRD, VPT, VCD, VCP, VC
• Partial subtype collapsing: NN, NNS, NNPS, NNP, N
• Combination by syntactic function: RRC, WHADJP, INTJ, INS
Conclusions
• Labels can be effectively coarsened based on alignment distributions
• Metric scores significantly improved at all attempted stopping points
• Collapsing reduces rule sparsity more than labeling ambiguity
• Points the decoder's search in a different direction
• Different results for different language pairs or grammars
Future Work
• Take rule context into account:
    NP::NP → [D₁ N₂]::[DT₁ NN₂]  (la voiture / the car)
    NP::NP → [les N₂]::[NNS₂]  (les voitures / cars)
• Try finer-grained label sets [Petrov et al. 2006]:
    NP → NP-0, NP-1, …, NP-30
    VBN → VBN-0, VBN-1, …, VBN-25
    RBS → RBS-0
• Non-greedy collapsing
References
• Chiang (2010). "Learning to translate with source and target syntax." ACL.
• Lavie, Parlikar, and Ambati (2008). "Syntax-driven learning of sub-sentential translation equivalents and translation rules from parsed parallel corpora." SSST-2.
• Petrov, Barrett, Thibaux, and Klein (2006). "Learning accurate, compact, and interpretable tree annotation." ACL-COLING.
• Venugopal, Zollmann, Smith, and Vogel (2009). "Preference grammars: Softening syntactic constraints to improve statistical machine translation." NAACL.