Using Parallel Features in Parsing of Machine-Translated Sentences - PowerPoint PPT Presentation

Rudolf Rosa, Ondřej Dušek, David Mareček, Martin Popel {rosa,odusek,marecek,popel}@ufal.mff.cuni.cz Using Parallel Features in Parsing of Machine-Translated Sentences for Correction of Grammatical Errors Charles University in Prague Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics SSST, Jeju, 12th July 2012

Parsing of SMT Outputs  can be useful in many applications  automatic classification of translation errors  automatic correction of translation errors (Depfix)  confidence estimation, multilingual question answering... ✔ we have the source sentence available  Can we use it to help parsing? ✗ SMT outputs noisy (errors in fluency, grammar...)  parsers trained on gold standard treebanks  Can we adapt parser to noisy sentences?

MST Parser  Maximum Spanning Tree dependency parser  by Ryan McDonald

(1) Words and Tags # words = nodes root relaxes VBZ Rudolph abroad NNP RB

(2) (Nearly) Complete Graph # all possible edges = root directed edges relaxes VBZ Rudolph abroad NNP RB

(3) Assign Edge Weights # edge weight = Margin Infused root Relaxed Algorithm sum of (MIRA) edge features weights 325 -1659 -7 1490 1154 relaxes 24 -263 VBZ 368 185 Rudolph abroad NNP RB

(4) Maximum Spanning Tree # non-projective trees: (projective trees: root Chu-Liu-Edmonds Eisner algorithm) algorithm 325 -1659 -7 1490 1154 relaxes 24 -263 VBZ 368 185 Rudolph abroad NNP RB

(5) Unlabeled Dependency Tree # dependency tree = root maximum spanning tree relaxes VBZ Rudolph abroad NNP RB

(6) Labeled Dependency Tree # labels asigned root by a second stage labeler Pred icate Subj ect Adv erbial relaxes VBZ Rudolph abroad NNP RB

RUR Parser  reimplementation of MST Parser  (so far only) first-order, non-projective  adapted for SMT outputs parsing  parallel features  ”worsening” the training treebank

English-to-Czech SMT  Czech language  highly flective  4 genders, 2 numbers, 7 cases, 3 persons...  Czech grammar requires agreement in related words  word order relatively free: word order errors not crucial  Phrase-Based SMT often makes inflection errors: ➔ Rudolph's car is black. ✗ Rudolfov a / fem auto/ neut je čern ý / masc . ✔ Rudolfov o / neut auto/ neut je čern é / neut .

Parser Training Data  Prague Czech-English Dependency Treebank  parallel treebank  50k sentences, 1.2M words  morphological tags, surface syntax, deep syntax  word alignment

Parallel Features  word alignment (using GIZA++)  additional features (if aligned node exists):  aligned tag (NNS, VBD...)  aligned dependency label (Subject, Attribute...)  aligned edge existence (0/1)

Parallel Features Example Pred # root AuxP Subj relaxuje VB S 3 Adv v RR 6 Rudolf zahraničí NN M S 1 NN N S 6 Pred # root Adv Subj relaxes VBZ Rudolph abroad NNP RB

Worsening the Treebank  treebank used for training contains correct sentences  SMT output is noisy  grammatical errors  incorrect word order  missing/superfluous words  …  let's introduce similar errors into the treebank!  so far, we have only tried inflection errors

Worsen (1): Apply SMT  translate English side of PCEDT to Czech  by an SMT system (we used Moses)  now we have (e.g.):  Gold English  Rudolph's car is black.  Gold Czech  Rudolfovo NEUT auto NEUT je černé NEUT .  SMT Czech  Rudolfova FEM auto NEUT je černý MASC .

Worsen (2): Align SMT to Gold  align SMT Czech to Gold Czech  Monolingual Greedy Aligner  alignment link score = linear combination of:  similarity of word forms (or lemmas)  similarity of morphological tags (fine-grained)  similarity of positions in the sentence  indication whether preceding/following words aligned  repeat: align best scoring pair until below threshold  no training: weights and threshold set manually

Worsen (3): Create Error Model  for each tag:  estimate probabilities of SMT system using an incorrect tag instead of the correct tag (Maximum Likelihood Estimate)  Czech tagset: fine-grained morphological tags  part-of-speech, gender, number, case, person, tense, voice...  1500 different tags in training data

Worsen (3): Error Model  Adjective, Masculine, Plural, Instrumental case ( AAMP7 ), e.g. lingvistickými (linguistic) ➔ 0.2 Adjective, Masculine, Singular, Nominative case ➔ e.g. lingvistický ➔ 0.1 Adjective, Masculine, Plural, Nominative case ➔ e.g. lingvističtí ➔ 0.1 Adjective, Neuter, Singular, Accusative case ➔ e.g. lingvistické  … altogether 2000 such change rules

Worsen (4): Apply Error Model  take Gold Czech  for each word:  assign a new tag randomly sampled according to Tag Error Model  generate a new word form  rule-based generator, generates even unseen forms  new_form = generate_form(lemma, tag) || old_form  → get Worsened Czech  use resulting Gold English-Worsened Czech parallel treebank to train the parser

Direct Evaluation by Inspection  manual inspection of several parse trees  comparing baseline and adapted parser ouputs  examples of improvements:  subject identification even if not in nominative case  adjective-noun dependence identification even if agreement violated (gender, number, case)  hard to do reliably  trying to find a correct parse tree for an (often) incorrect sentence – not well defined

Indirect Evaluation: in Depfix  rule-based grammar correction of SMT outputs  input = aligned, tagged and parsed sentences:  target (Czech) sentence – to be corrected  source (English) sentence – additional information  applies 20 correction rules:  noun – adjective agreement (gender, number, case)  subject – predicate agreement (gender, number)  preposition – noun agreement (case)  …

Depfix: Rudolph's Car auto auto car NN neut NN neut NN Atr Atr Atr Rudolph Rudolfov o Rudolfov a NNP AA neut AA fem Atr 's POS Adjective – Noun Agreement

Indirect Evaluation Results  differences in Depfix corrections evaluated by humans: better / worse / indefinite  three different parsers  RUR + parallel features + worsened treebank  – original McDonald's MST Parser  RUR – our baseline setup RUR + parallel features + worsened treebank better worse indefinite 51% 30% 18% RUR 54% 28% 18%

Conclusion  SMT outputs often hard to parse  RUR parser – adapted to parsing SMT outputs  parallel features (tag, dep. label, edge existence)  worsening the training treebank (tag error model)  outputs of English-to-Czech translation  evaluated in Depfix  SMT errors correction system

Future Work  more sophisticated parallel features  more experiments on worsening  more languages  parallel tagging

Thank you for your attention For this presentation and other information, visit: http://ufal.mff.cuni.cz/~rosa/depfix/ Rudolf Rosa, Ondřej Dušek, David Mareček, Martin Popel {rosa,odusek,marecek,popel}@ufal.mff.cuni.cz Charles University in Prague Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics

Using Parallel Features in Parsing of Machine-Translated Sentences - PowerPoint PPT Presentation

Rudolf Rosa, Ondej Duek, David Mareek, Martin Popel {rosa,odusek,marecek,popel}@ufal.mff.cuni.cz Using Parallel Features in Parsing of Machine-Translated Sentences for Correction of Grammatical Errors Charles University in Prague

Introduction to Bottom-Up Parsing Shift-reduce parsing The LR parsing algorithm

CSC 4181 Compiler Construction Parsing 1 1 Outline Top-down v.s. Bottom-up Top-down parsing

Robust Incremental Neural Semantic Graph Parsing Jan Buys and Phil Blunsom Dependency Parsing vs

Basic Parsing Algorithms Chart Parsing Seminar Recent Advances in Parsing Technology WS

COMPANY PROFILE WATER FEATURES 1 WATER FEATURES 2 WATER FEATURES 3 WATER FEATURES 4 WATER

Outline LR Parsing Review of bottom-up parsing LALR Parser Generators Computing the

Models of Human Parsing Experimental Data 2 Informatics 2A: Lecture 22 Eye-tracking Reading

Graph-Based Parsing Joakim Nivre Uppsala University Department of Linguistics and Philology

Dependency Parsing II CMSC 470 Marine Carpuat Graph-based Dependency Parsing Slides credit:

Generalised Parsing and Combinator Parsing A Happy Marriage? L. Thomas van Binsbergen

Parsing as Deduction Joseph K uhner March 24, 2007 Joseph K uhner Parsing as Deduction

Bottom-up parsing LR parsing Construct parse tree for input from leaves up LR( k ) parsing

Compilers Shift-Reduce Parsing Alex Aiken Shift-Reduce Parsing Important Fact #1 about

Parsing, Part I Jim Royer April 2, 2019 CIS 352 Parsing, Part I 1 Miss Teen South

Programming Languages: Parsing Onur Tolga S ehito glu Computer Engineering,METU 27 May

* 07/16/96 Plan for Today Shift-reduce parsing The problem with predictive top down parsing

The Commonwealth of Massachusetts Rating Agency Update & 2013 Series C & D (Green

RB81-19 Presenta/on (stairways) to ICC Appeals Commi<ee

2018-19 Budget Public Hearing Presented by: Susan Smith Leland, Assistant Superintendent of

KINDERGARTEN ENROLLMENT KINDERGARTEN ENROLLMENT School Year Live Birth to Number As of 1/3/11

The story since 2000 Towards more effective, fairer use of alternatives to prison July 2014

Creating Success Through Creative Sentencing Matthew Buesing Assistant City Prosecutor III

Justice Reinvestment in Arkansas 2nd Presentation to the Legislative Criminal Justice

LipNet End-to-End Sentence-level Lipreading Yannis Assael, Brendan Shillingford, Shimon Whiteson,

Using Parallel Features in Parsing of Machine-Translated Sentences - PowerPoint PPT Presentation

Rudolf Rosa, Ondej Duek, David Mareek, Martin Popel {rosa,odusek,marecek,popel}@ufal.mff.cuni.cz Using Parallel Features in Parsing of Machine-Translated Sentences for Correction of Grammatical Errors Charles University in Prague

Introduction to Bottom-Up Parsing Shift-reduce parsing The LR parsing algorithm

CSC 4181 Compiler Construction Parsing 1 1 Outline Top-down v.s. Bottom-up Top-down parsing

Robust Incremental Neural Semantic Graph Parsing Jan Buys and Phil Blunsom Dependency Parsing vs

Basic Parsing Algorithms Chart Parsing Seminar Recent Advances in Parsing Technology WS

COMPANY PROFILE WATER FEATURES 1 WATER FEATURES 2 WATER FEATURES 3 WATER FEATURES 4 WATER

Outline LR Parsing Review of bottom-up parsing LALR Parser Generators Computing the

Models of Human Parsing Experimental Data 2 Informatics 2A: Lecture 22 Eye-tracking Reading

Graph-Based Parsing Joakim Nivre Uppsala University Department of Linguistics and Philology

Dependency Parsing II CMSC 470 Marine Carpuat Graph-based Dependency Parsing Slides credit:

Generalised Parsing and Combinator Parsing A Happy Marriage? L. Thomas van Binsbergen

Parsing as Deduction Joseph K uhner March 24, 2007 Joseph K uhner Parsing as Deduction

Bottom-up parsing LR parsing Construct parse tree for input from leaves up LR( k ) parsing

Compilers Shift-Reduce Parsing Alex Aiken Shift-Reduce Parsing Important Fact #1 about

Parsing, Part I Jim Royer April 2, 2019 CIS 352 Parsing, Part I 1 Miss Teen South

Programming Languages: Parsing Onur Tolga S ehito glu Computer Engineering,METU 27 May

* 07/16/96 Plan for Today Shift-reduce parsing The problem with predictive top down parsing

The Commonwealth of Massachusetts Rating Agency Update &amp; 2013 Series C &amp; D (Green

RB81-19 Presenta/on (stairways) to ICC Appeals Commi&lt;ee

2018-19 Budget Public Hearing Presented by: Susan Smith Leland, Assistant Superintendent of

KINDERGARTEN ENROLLMENT KINDERGARTEN ENROLLMENT School Year Live Birth to Number As of 1/3/11

The story since 2000 Towards more effective, fairer use of alternatives to prison July 2014

Creating Success Through Creative Sentencing Matthew Buesing Assistant City Prosecutor III

Justice Reinvestment in Arkansas 2nd Presentation to the Legislative Criminal Justice

LipNet End-to-End Sentence-level Lipreading Yannis Assael, Brendan Shillingford, Shimon Whiteson,

The Commonwealth of Massachusetts Rating Agency Update & 2013 Series C & D (Green

RB81-19 Presenta/on (stairways) to ICC Appeals Commi<ee