ADAPTIVE QUALITY ESTIMATION FOR MACHINE TRANSLATION AND AUTOMATIC SPEECH RECOGNITION
José G. C. de Souza
Advisors: Matteo Negri, Marco Turchi, Marcello Federico
EAMT 2017, 30/05/2017
What is MT Quality Estimation?
[Diagram: source sentences + translated sentences → QE model → quality scores]
• Quality control when there are no references
• Real-time estimations
Applications
• Informing the reader of the target language whether the translation is reliable
Applications
• Deciding whether the translation is good enough to be published
• Selecting the best MT output out of a pool of MT systems
• Deciding whether the translation needs to be post-edited: the computer-assisted translation (CAT) scenario
CAT scenario
• Translation memory suggestions come with a fuzzy match score
• MT suggestions require comparable scores: MT QE
Outline
• Quality Estimation: Quality Judgments, Quality Indicators
• Current (static) MT QE approaches
• Adaptive approaches: Online, Multitask, Online Multitask
Quality Estimation (QE)
• Supervised learning task: train a QE model from (source, translated) segment pairs
• Quality Judgments (labels): a proxy for correctness and usefulness
• Quality Indicators (features)
• Granularity: word, sentence, document
Quality Judgments
• Perceived post-editing effort (Specia, 2011): two levels of ambiguity
• Post-editing time (O'Brien, 2005): high variability
• Actual post-editing effort, HTER (Tatsumi, 2009): does not capture cognitive effort
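In code, HTER boils down to a word-level edit distance between the MT output and its post-edit, normalized by the post-edit length. A minimal sketch (full TER also allows block shifts, omitted here for brevity):

```python
def hter(mt_tokens, pe_tokens):
    """Simplified HTER: word-level Levenshtein distance between the MT
    output and its post-edit, divided by the post-edit length.
    (True TER additionally allows block shifts.)"""
    m, n = len(mt_tokens), len(pe_tokens)
    # Standard dynamic program over words.
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        d[i][0] = i
    for j in range(n + 1):
        d[0][j] = j
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if mt_tokens[i - 1] == pe_tokens[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,        # deletion
                          d[i][j - 1] + 1,        # insertion
                          d[i - 1][j - 1] + cost)  # substitution / match
    return d[m][n] / max(n, 1)

# Two of four words need editing -> HTER = 0.5
score = hter("the cat sat down".split(), "a cat sat up".split())
```

An HTER of 0 means the post-editor changed nothing; values near 1 mean the translation was almost entirely rewritten.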
Quality indicators
Four families of features over (source, translation) pairs (QuEst [ACL13a]):
• complexity of the source sentence
• fluency of the translation
• adequacy of the translation
• MT confidence
Quality indicators: source complexity
• Sentences that are complex at the syntactic, semantic, discursive or pragmatic level are harder to translate
• Examples:
  – n-gram language model perplexity
  – average source token length
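Both example features are cheap to compute. A minimal sketch, using a unigram language model for self-containedness (real QE systems use higher-order n-gram LMs, e.g. via a toolkit such as KenLM):

```python
import math
from collections import Counter

def avg_token_length(tokens):
    """Average number of characters per source token."""
    return sum(len(t) for t in tokens) / max(len(tokens), 1)

def unigram_perplexity(tokens, counts, total, vocab_size):
    """Perplexity under an add-one-smoothed unigram LM.
    A higher value suggests a less typical, harder source sentence."""
    log_prob = 0.0
    for t in tokens:
        p = (counts.get(t, 0) + 1) / (total + vocab_size)
        log_prob += math.log(p)
    return math.exp(-log_prob / max(len(tokens), 1))

# Hypothetical tiny corpus standing in for real LM training data.
corpus = "the cat sat on the mat".split()
counts = Counter(corpus)
length_feat = avg_token_length("the cat sat".split())   # 3.0
ppl_feat = unigram_perplexity("the cat".split(), counts, len(corpus), len(counts))
```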
Quality indicators: translation fluency
• Related to grammatical correctness in the target language
• Example: n-gram language model perplexity of the translation
Quality indicators: translation adequacy
• Related to the meaning equivalence between the source and its translation
• Examples:
  – ratios of aligned word classes [ACL13b, WMT13, WMT14]
  – topic-model-based features [MTSummit13]
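As a rough illustration of the aligned-word-class idea: given a word alignment, check which fraction of source tokens of a given class (nouns here) are aligned to anything on the target side. This is a simplified stand-in, not the exact feature set of [ACL13b]:

```python
def aligned_ratio(src_tags, alignment, word_class="NOUN"):
    """Fraction of source tokens of `word_class` that are aligned to
    some target token. `src_tags` is a list of (token, POS) pairs;
    `alignment` is a set of (src_idx, tgt_idx) pairs, e.g. from an
    automatic word aligner. Unaligned content words hint at meaning
    being dropped in the translation."""
    aligned_src = {i for i, _ in alignment}
    members = [i for i, (_, pos) in enumerate(src_tags) if pos == word_class]
    if not members:
        return 1.0
    return sum(1 for i in members if i in aligned_src) / len(members)

tags = [("the", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")]
full = aligned_ratio(tags, {(0, 0), (1, 1)})   # the only noun is aligned
none = aligned_ratio(tags, {(0, 0)})           # the noun is unaligned
```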
Quality indicators: MT confidence
• Related to the difficulty of the MT process
• Examples:
  – log-likelihood scores (normalized by source length)
  – average distance between n-best hypotheses [WMT13, WMT14]
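The length normalization matters because longer sentences accumulate lower raw log-likelihoods regardless of quality. A minimal sketch (the per-token log-probabilities would come from the MT decoder; the values below are hypothetical):

```python
def normalized_loglik(token_logprobs):
    """Decoder log-likelihood of the translation divided by its length,
    so that scores for long and short sentences are comparable."""
    return sum(token_logprobs) / max(len(token_logprobs), 1)

feat = normalized_loglik([-0.5, -1.5, -1.0])  # -1.0
```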
Outline
• Quality Estimation: Quality Judgments, Quality Indicators
• Current (static) MT QE approaches
• Adaptive approaches: Online, Multitask, Online Multitask
Problems in current MT QE approaches
• Systems assume ideal conditions: a single MT system, text type and user
• The best setting is task-dependent
• Scarcity of labeled data
• Models are static
MT QE in real conditions
• QE in the CAT scenario typically requires dealing with diverse input:
  – different genres/text types/projects
  – different MT systems
  – different post-editors
• Here, users + text type + MT system = domain/task
Outline
• Quality Estimation: Quality Judgments, Quality Indicators
• Current (static) MT QE approaches
• Adaptive approaches: Online, Multitask, Online Multitask
Adaptive QE
Copes with variability in:
• post-editors
• text types
• MT quality
Online QE [ACL14]
[Diagram: for each incoming sentence pair, the online QE model emits a quality prediction and is then updated from human feedback. Two settings: Empty (no prior training, learns on the test domain only) and Adaptive (trained on Domain 1, then adapted on Domain 2).]
Online QE
• Exploits user corrections to adapt to different post-editing styles and text types
• Online learning for MT QE:
  – Passive Aggressive (PA) (Crammer et al., 2006)
  – Online Support Vector Machines (Parrella, 2007)
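The PA update is simple enough to sketch in full: predict, observe the true label (e.g. the HTER of the user's post-edit), then correct the weights just enough to bring the example inside an epsilon-insensitive band. This follows the PA-I regression variant of Crammer et al. (2006); the hyperparameter values are illustrative:

```python
import numpy as np

class PARegressor:
    """Passive Aggressive regression with an epsilon-insensitive loss
    (PA-I variant of Crammer et al., 2006)."""

    def __init__(self, n_features, C=1.0, eps=0.1):
        self.w = np.zeros(n_features)
        self.C = C      # cap on the update magnitude (aggressiveness)
        self.eps = eps  # insensitivity band around the target

    def predict(self, x):
        return float(self.w @ x)

    def update(self, x, y):
        """One online step on a (feature vector, quality label) pair."""
        y_hat = self.predict(x)
        loss = max(0.0, abs(y_hat - y) - self.eps)
        if loss > 0.0:
            tau = min(self.C, loss / (x @ x))   # PA-I step size
            self.w += np.sign(y - y_hat) * tau * x
        return y_hat

# Stream of labeled sentence pairs, as produced in the CAT scenario.
model = PARegressor(n_features=2)
for x, y in [(np.array([1.0, 0.0]), 0.6), (np.array([1.0, 0.0]), 0.6)]:
    model.update(x, y)
```

Because each update touches only one example, the model adapts continuously as post-editors work, with no re-training from scratch.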
Results
• Train = L cons, Test = IT rad
• Systems compared: Empty (OSVR), Adaptive (OSVR), Mean baseline, SVR (batch)
• Online QE improves over batch on very different domains
• Empty is more accurate than Adaptive
MT QE across multiple domains
• A single online MT QE model is not able to deal with several domains at the same time
MT QE across multiple domains [Coling14a]
• Multitask learning (Caruana, 1997)
• Leverages different domains: knowledge transfer between domains
Experimental Setting
• Data: 363 (source, target, post-edit) sentence triples
  – TED talk transcripts, IT manuals, newswire texts
  – 181/182 training/test split
• Baselines:
  – concatenation of domains (SVR pooling)
  – single-task learning (SVR in-domain)
  – Frustratingly Easy Domain Adaptation (SVR FEDA) (Daumé, 2007)
MT QE across multiple domains
[Figure: learning curves of MAE for different amounts of training data (95% conf. bands), one panel per domain: News, IT, TED]
• Pooling and FEDA are worse than the Mean baseline
• Improvements over in-domain models
• RMTL usually requires less in-domain data
What have we learnt so far?
• Online QE methods: continuous learning from user feedback, but they do not exploit similarities between domains
• Batch multitask learning: models similarities between domains, but requires complete re-training
Online Multitask MT QE (PAMTL) [ACL15]
• Combines online learning and multitask learning
• Based on Passive Aggressive algorithms (Crammer et al., 2006) with an epsilon-insensitive loss (regression)
• Identifies task relationships (Saha et al., 2011)
Online Multitask MT QE (PAMTL)
• The interaction matrix is initialized so that tasks are learnt independently
• After a given number of instances, the matrix is updated by computing divergences over the task weights t_1 … t_N
[Diagram: per-task models (feature weights) for D1…DN coupled through the interaction matrix]
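A simplified sketch of the idea (not the exact PAMTL update): one weight vector per task, plus an interaction matrix A that propagates each PA update to related tasks. A starts as the identity, so tasks are learnt independently, as on the slide; the periodic refresh from weight-vector cosine similarity is a hypothetical stand-in for the divergence-based update of Saha et al. (2011):

```python
import numpy as np

class MultitaskPA:
    """Online multitask PA regression sketch with a task interaction
    matrix, loosely following the PAMTL setup of [ACL15]."""

    def __init__(self, n_tasks, n_features, C=1.0, eps=0.1, refresh=50):
        self.W = np.zeros((n_tasks, n_features))  # one weight row per task
        self.A = np.eye(n_tasks)   # identity: tasks start out independent
        self.C, self.eps = C, eps
        self.refresh = refresh     # instances between interaction updates
        self.seen = 0

    def update(self, task, x, y):
        y_hat = float(self.W[task] @ x)
        loss = max(0.0, abs(y_hat - y) - self.eps)
        if loss > 0.0:
            tau = min(self.C, loss / (x @ x))
            step = np.sign(y - y_hat) * tau * x
            # Propagate the update to every task, scaled by relatedness.
            self.W += np.outer(self.A[:, task], step)
        self.seen += 1
        if self.seen % self.refresh == 0:
            self._refresh_interactions()
        return y_hat

    def _refresh_interactions(self):
        # Cosine similarity between task weight vectors as a cheap proxy
        # for a divergence-based relatedness estimate.
        norms = np.linalg.norm(self.W, axis=1, keepdims=True) + 1e-12
        Wn = self.W / norms
        self.A = 0.5 * np.eye(len(self.W)) + 0.5 * (Wn @ Wn.T)

mtl = MultitaskPA(n_tasks=2, n_features=2)
mtl.update(0, np.array([1.0, 0.0]), 0.6)
```

With A = I the update above only touches task 0's weights; once A is refreshed, an example from one domain also nudges the models of related domains.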
Experimental Setting (data)
• 1,000 En-Fr (source, translation, post-edit) tuples:
  – TED talks (TED)
  – Educational Material (EM)
  – software manual (IT LSP1), automotive software manual (IT LSP2)
• 700/300 train/test split
Experimental Setting (baselines)
• Online learning for QE: Passive Aggressive (PA-I), two usages:
  – concatenation of domains (STL pool): one model for all domains
  – single-task learning (STL in): one model per domain
Results (stream of domains)
[Figure: learning curves of MAE for different amounts of training data (95% conf. bands)]
• Pooling presents very poor performance
• PAMTL outperforms all baselines
• PAMTL with 20% of the data reaches the MAE of in-domain training with 100% of the data
Conclusion
• Before the work presented here: static QE systems serving one domain
• After the work presented here: adaptive QE systems serving diverse domains
Conclusion
Adaptive approaches that can be used for domain adaptation:
• single-domain adaptation: online QE
• multi-domain adaptation: batch MTL QE
• multi-domain with online updates: online MTL QE
Conclusion
• State-of-the-art MT QE features for post-editing time and effort prediction
• Introduction of QE for ASR
• Adaptive QE for ASR shows improvements over in-domain models in both classification and regression scenarios
• New online multitask algorithm for multi-domain, large-scale regression problems
Thank you!
Publications
[WMT13] José G. C. de Souza, Christian Buck, Marco Turchi, and Matteo Negri. FBK-UEdin participation to the WMT13 Quality Estimation shared task. In Proceedings of the Eighth Workshop on Statistical Machine Translation, pages 352–358, 2013.
[ACL13b] José G. C. de Souza, Miquel Esplá-Gomis, Marco Turchi, and Matteo Negri. Exploiting Qualitative Information from Automatic Word Alignment for Cross-lingual NLP Tasks. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pages 771–776, 2013.
[MTSummit13] Raphael Rubino, José G. C. de Souza, and Lucia Specia. Topic Models for Translation Quality Estimation for Gisting Purposes. In Machine Translation Summit XIV, pages 295–302, 2013.
[ACL13a] Lucia Specia, Kashif Shah, José G. C. de Souza, and Trevor Cohn. QuEst – A translation quality estimation framework. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pages 79–84, 2013.