Comparative human and automatic evaluation of glass-box and - PowerPoint PPT Presentation

Comparative human and automatic evaluation of glass-box and black-box approaches to interactive translation prediction Daniel Torregrosa , Juan Antonio P´ erez-Ortiz, Mikel L. Forcada Departament de Llenguatges i Sistemes Inform` atics Universitat d’Alacant, Spain EAMT2017 Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 1 / 37

Outline Introduction 1 Automatic evaluation 2 Human evaluation 3 Summary 4 Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 2 / 37

Abstract Interactive translation prediction (ITP) offers suggestions as the translation is being written by the translator We compare black-box and glass-box ITP for the first time Translators can potentially save 20–50% keystrokes using ITP All software used for the comparison is free/open-source Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 3 / 37

Outline Introduction 1 Translation technologies Glass-box interactive translation prediction Black-box interactive translation prediction Automatic evaluation 2 Human evaluation 3 Summary 4 Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 4 / 37

Translation technologies Professional translators often use translation technologies such as ◮ dictionaries ◮ bilingual concordancers (Context Reverso, Linguee) ◮ translation memories ◮ machine translation ◮ post-editing ◮ interactive translation prediction to achieve better, faster translations Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 5 / 37

Computer assisted translation Interactive translation prediction Interactive translation prediction (ITP) focuses on offering suggestions as the translator types the translation The approaches in the literature use a glass-box approach, where the inner workings of a SMT system are queried to provide ITP We have proposed a black-box approach 1 that can use any bilingual resource to provide ITP 1 Torregrosa Rivero, Daniel, Mikel L. Forcada, and Juan Antonio P´ erez-Ortiz. “An Open-Source Web-Based Tool for Resource-Agnostic Interactive Translation Prediction.” (2014). Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 6 / 37

Glass-box ITP Glass-box ITP typically uses a modified or tailor-made SMT system that is also able to provide additional information, such as word alignments, alternative translations and translation probabilities Recently, neural MT has been used to provide ITP ◮ Unlike with SMT, access to the internals is not needed ◮ The decoding process is modified so it can accept a prefix Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 7 / 37

Glass-box ITP Example Source sentence er geht ja nicht nach hause Target translation he does not go home In this example, we will use the decoder of a modified statistical machine translation system The translator types the prefix of the translation, and gets the best path as a suggestion Based on Statistical Machine Translation (2009) by Philipp Koehn Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 8 / 37

Statistical Machine Translation Decoder Source sentence er geht ja nicht nach hause Based on Statistical Machine Translation by Philipp Koehn Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 9 / 37

Glass-box ITP Example I Typed prefix he Based on Statistical Machine Translation (2009) by Philipp Koehn Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 9 / 37

Glass-box ITP Example II Typed prefix he d Based on Statistical Machine Translation (2009) by Philipp Koehn Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 9 / 37

Glass-box ITP Example III Typed prefix he does not go home Based on Statistical Machine Translation (2009) by Philipp Koehn Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 9 / 37

Black-box ITP To generate translation suggestions, black-box ITP can use any kind of bilingual resource that provides one or more translations for a sentence This lets us to seamlessly integrate any kind of resource without needing to know how they work Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 10 / 37

Black-box ITP Generating suggestions Source sentence This studio is spacious Subsegments of length 1 este estudio es espacioso este estudio Subsegments of length 2 estudio es es amplio este estudio est´ a Subsegments of length 3 estudio es amplio Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 11 / 37

Black-box ITP Offering suggestions Typed prefix Este e estudio es estudio es amplio estudio Proposals este es amplio espacioso We need to rank and select which suggestions to show. Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 12 / 37

Black-box ITP Example Source sentence this studio is spacious Target sentence este estudio es amplio Prefix e este estudio est´ a este Suggestions estudio es amplio estudio Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 12 / 37

Black-box ITP Example Source sentence this studio is spacious Target sentence este estudio es amplio Prefix e este estudio est´ a este Suggestions estudio es amplio estudio Prefix e ste e estudio es amplio estudio Suggestions es amplio es Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 12 / 37

Black-box ITP Example Source sentence this studio is spacious Target sentence este estudio es amplio Prefix e este estudio est´ a este Suggestions estudio es amplio estudio Prefix e ste e estudio es amplio estudio Suggestions es amplio es Prefix e ste e studio es amplio Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 12 / 37

Outline Introduction 1 Automatic evaluation 2 Software Method Metrics Results Human evaluation 3 Summary 4 Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 13 / 37

Software Glass-box ITP As a glass-box implementation, we use Thot Suggests one translation completion that automatically updates as the user types the prefix Can also be used as an SMT system Trained using 1 000 000 sentences from the United Nations Parallel Corpus v1.0 2 ◮ Motivated by lack of resources ◮ A bilingual domain adaptation technique 2 has been used to minimize the impact of reducing the size of the corpus ◮ Excerpts of this corpus will be used for testing 2 http://conferences.unite.un.org/UNCorpus 2 Axelrod, Amittai, Xiaodong He, and Jianfeng Gao. “Domain adaptation via pseudo in-domain data selection.” (EMNLP 2011) Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 14 / 37

Software Black-box ITP As a black-box implementation, we use Forecat Using Thot SMT as the only bilingual resource We use a multilayer perceptron 2 for ranking the black-box model suggestions. Some features ◮ Source and target position and lengths of the suggestion ◮ Alignment model ◮ Position with respect the last used suggestion: before, after, overlapping... 2 With ≈ 10 4 parameters. Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 15 / 37

Automatic evaluation Method We simulate the behaviour of a professional translator ◮ who has a planned, immutable translation in mind ◮ who writes monotonically ◮ who makes no mistakes ◮ who reads all the proposed suggestions, evaluates them all, and uses the longest suggestion or suggestion prefix that fits (if any) Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 16 / 37

Automatic evaluation Metrics We measure the keystroke ratio (KSR), the ratio between the number of keys typed and the length of the final translation ◮ KSR < 1 means we saved some of the keystrokes by using suggestions ◮ If we type the translation without mistakes and use no suggestions, we get KSR= 1 ◮ KSR > 1 means we used extra keystrokes, e.g. the user mistyped or rethought the translation halfway Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 17 / 37

Automatic evaluation Results: KSR 0.9 [Better] KSR [Worse] 0.8 0.7 0.6 0.5 0.4 en → ar ar → en en → es es → en en → zh zh → en Black-box M=1 Black-box M=4 Glass-box M = maximum number of suggestions. All differences between the values are statistically significant ( p ≤ 0 . 05). Daniel Torregrosa (Univ. Alacant) Comparing black and glass-box ITP EAMT2017 18 / 37

Comparative human and automatic evaluation of glass-box and - PowerPoint PPT Presentation

Comparative human and automatic evaluation of glass-box and black-box approaches to interactive translation prediction Daniel Torregrosa , Juan Antonio P erez-Ortiz, Mikel L. Forcada Departament de Llenguatges i Sistemes Inform` atics

SCULPTURE MATERIALS SCULPTURE MATERIALS PATTERNS PLATE GLASS GLASS BOTTLES GLASS BOTTLES

Paradoxes in Probability How probability continues to amuse me! Let's play a game! Box A Box B

3M Glass Bubbles iM16K 3M Glass Bubbles iM16K 3M Glass Bubbles and Other Additives

Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat

Dependency Dependency- -Based Automatic Evaluation Based Automatic Evaluation Dependency

Glass Packaging Institute Overview and Activity Update Bryan Vickers Glass Packaging Institute

Knowledge Seminar Series : High Performance Glass October 18, 2017 Jim Larsen Director,

An Image Capture Application For The Google Glass Framework Oliver Nina (UCF), Roger Pack (FS)

RUSSIAN FLAT GLASS COATINGS MARKET OVERVIEW Dmitriy D. Bernt Glass Coating Technology

Glass Container Manufacturing & Recycling Overview Scott DeFife President Glass Packaging

3. A vacuum Glass Catch attachment for broken glass GLASS OTHERS HOME $2.35 Million

A Brief Introduction to Automatic Speech Recognition Jim Glass (glass@mit.edu) MIT Computer

Automatic Verification of Automatic Verification of Automatic Verification of Automatic

Flux Box Flux Box A concept by Flux Laboratory Flux box : concept Flux box : concept What is Flux

Kid s Box American English Level 1 Presentation Plus: Kid s Box American English Kid s Box

Glass Transformation- -Range Range Glass Transformation Behavior- - Odds and Ends Odds and

Verification of PCP-Related Computational Reductions in Coq Yannick Forster, Edith Heiter, Gert

Intelligent Truckparking in Nodes and harbours Henk Jansma, Rijkswaterstaat, The Netherlands.

A survey on Interactive Theorem Proving Andrea Asperti Department of Computer Science, University

!"#$%&'"()%(+,%-+.%"/0%112 3/4/"#5567772

7 Consistency projector Definition (Initial trajectory problem (ITP)) . Given past trajectory x 0 :

breakdown of perturbation approach ! Fermi liquid theory one-dimensional systems 28 distribution

HOL Light: future wishes John Harrison Intel Corporation Workshop on Interactive Theorem Proving

MATH 100 Introduction to the Profession Applied Mathematicians Who are we, and what do we

Comparative human and automatic evaluation of glass-box and - PowerPoint PPT Presentation

Comparative human and automatic evaluation of glass-box and black-box approaches to interactive translation prediction Daniel Torregrosa , Juan Antonio P erez-Ortiz, Mikel L. Forcada Departament de Llenguatges i Sistemes Inform` atics

SCULPTURE MATERIALS SCULPTURE MATERIALS PATTERNS PLATE GLASS GLASS BOTTLES GLASS BOTTLES

Paradoxes in Probability How probability continues to amuse me! Let's play a game! Box A Box B

3M Glass Bubbles iM16K 3M Glass Bubbles iM16K 3M Glass Bubbles and Other Additives

Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat

Dependency Dependency- -Based Automatic Evaluation Based Automatic Evaluation Dependency

Glass Packaging Institute Overview and Activity Update Bryan Vickers Glass Packaging Institute

Knowledge Seminar Series : High Performance Glass October 18, 2017 Jim Larsen Director,

An Image Capture Application For The Google Glass Framework Oliver Nina (UCF), Roger Pack (FS)

RUSSIAN FLAT GLASS COATINGS MARKET OVERVIEW Dmitriy D. Bernt Glass Coating Technology

Glass Container Manufacturing &amp; Recycling Overview Scott DeFife President Glass Packaging

3. A vacuum Glass Catch attachment for broken glass GLASS OTHERS HOME $2.35 Million

A Brief Introduction to Automatic Speech Recognition Jim Glass (glass@mit.edu) MIT Computer

Automatic Verification of Automatic Verification of Automatic Verification of Automatic

Flux Box Flux Box A concept by Flux Laboratory Flux box : concept Flux box : concept What is Flux

Kid s Box American English Level 1 Presentation Plus: Kid s Box American English Kid s Box

Glass Transformation- -Range Range Glass Transformation Behavior- - Odds and Ends Odds and

Verification of PCP-Related Computational Reductions in Coq Yannick Forster, Edith Heiter, Gert

Intelligent Truckparking in Nodes and harbours Henk Jansma, Rijkswaterstaat, The Netherlands.

A survey on Interactive Theorem Proving Andrea Asperti Department of Computer Science, University

!&quot;#$%&amp;'&quot;()%(*+,*%-+.%&quot;/0%112 3/4/&quot;#5567772

7 Consistency projector Definition (Initial trajectory problem (ITP)) . Given past trajectory x 0 :

breakdown of perturbation approach ! Fermi liquid theory one-dimensional systems 28 distribution

HOL Light: future wishes John Harrison Intel Corporation Workshop on Interactive Theorem Proving

MATH 100 Introduction to the Profession Applied Mathematicians Who are we, and what do we

Glass Container Manufacturing & Recycling Overview Scott DeFife President Glass Packaging

!"#$%&'"()%(+,%-+.%"/0%112 3/4/"#5567772