The Efficacy of Human Post-Editing for Language Translation Spence - PowerPoint PPT Presentation

The Efficacy of Human Post-Editing for Language Translation Spence Green Jeffrey Heer Christopher D. Manning Stanford University CHI 2013 // 29 April 2013

Ngarrka-ngku ka wawirri panti-rni

Ngarrka-ngku ka wawirri panti-rni man kangaroo spear

Ngarrka-ngku ka wawirri panti-rni man kangaroo spear The man is spearing the kangaroo Ngarrka-ngku ka wawirri panti-rni man kangaroo spear

Scaling up language translation NLP —fully automatic translation (MT) Not yet human quality HCI —collaborative and crowdsourced translation Cost-effective but slow 3

Scaling up language translation NLP —fully automatic translation (MT) Not yet human quality HCI —collaborative and crowdsourced translation Cost-effective but slow Our work: NLP + HCI = interactive translation 3

NLP + HCI: Interactive translation [ Bisbey and Kay 1972 ] 4

Interactive MT: Caitra [ Koehn 2009 ] 5

Interactive MT: YouTube captions 6

Does interactive MT enhance productivity? Mixed prior results Faster or slower? Higher or lower translation quality? 7

Does interactive MT enhance productivity? Mixed prior results Faster or slower? Higher or lower translation quality? Expert translator skepticism of MT Low quality? You want to pay me less!? 7

“Advantages” of post-editing machine translation

Our view: MT improving rapidly

This work: Post-editing user study Simplest interactive MT: Post-editing 10

This work: Post-editing user study Simplest interactive MT: Post-editing Hypotheses: 1. Post-edit reduces translation time 10

This work: Post-editing user study Simplest interactive MT: Post-editing Hypotheses: 1. Post-edit reduces translation time 2. Post-edit increases quality 10

This work: Post-editing user study Simplest interactive MT: Post-editing Hypotheses: 1. Post-edit reduces translation time 2. Post-edit increases quality 3. Suggestions prime the translator 10

This work: Post-editing user study Simplest interactive MT: Post-editing Hypotheses: 1. Post-edit reduces translation time 2. Post-edit increases quality 3. Suggestions prime the translator 4. Post-edit reduces drafting 10

This work: Post-editing user study Simplest interactive MT: Post-editing Hypotheses: 1. Post-edit reduces translation time 2. Post-edit increases quality 3. Suggestions prime the translator 4. Post-edit reduces drafting Exploratory and confirmatory analysis 10

Post-editing experimental design Task translate an English sentence to ... 11

Post-editing experimental design Task translate an English sentence to ... Target languages Arabic, French, German 11

Post-editing experimental design Task translate an English sentence to ... Target languages Arabic, French, German Conditions Unaided and post-edit 11

Post-editing experimental design Task translate an English sentence to ... Target languages Arabic, French, German Conditions Unaided and post-edit Expert Subjects 16 per target language 11

Experimental design Two-way, mixed design Translation conditions (within subjects) Source sentences (between subjects) 12

Experimental design Two-way, mixed design Translation conditions (within subjects) Source sentences (between subjects) Two timed translation efforts Untimed break Total time: about 60 min. per subject 12

Experimental design Two-way, mixed design Translation conditions (within subjects) Source sentences (between subjects) Two timed translation efforts Untimed break Total time: about 60 min. per subject MT from Google [March 2012] 12

Unaided UI 13

Post-edit UI 14

Experimental setup: Linguistic data Topic selections from Wikipedia 1. Flag of Japan easy 2. 1896 Olympic Games easy 3. Schizophrenia hard 4. Infinite Monkey Theorem hard One easy, one hard per condition 15

It was the first international Olympic Games held in the Modern era.

The chance of their doing so is decidedly more favourable than the chance of the molecules returning to one half of the vessel.

Experimental setup: Human subjects Expert freelance translators on oDesk Ecological validity Fair payment: subjects bid on job 18

Experimental setup: Human subjects Expert freelance translators on oDesk Ecological validity Fair payment: subjects bid on job Lots of subject data oDesk language skills tests Hours worked per week Demographic information 18

Experimental setup: Quality rating Same setup as annual Workshop on Machine Translation 19

Experimental setup: Quality rating Same setup as annual Workshop on Machine Translation Crowdsourced, pairwise evaluation on MTurk 19

Experimental setup: Quality rating Same setup as annual Workshop on Machine Translation Crowdsourced, pairwise evaluation on MTurk Three judgments per translation pair 19

Results

Fixed effects fallacies Fixed effect —Data includes all factor levels Gender Machine configuration 22

Fixed effects fallacies Fixed effect —Data includes all factor levels Gender Machine configuration Random effect —sampled levels Human subjects (RM-ANOVA) 22

Fixed effects fallacies Fixed effect —Data includes all factor levels Gender Machine configuration Random effect —sampled levels Human subjects (RM-ANOVA) English source sentences Target languages “Language as fixed-effect fallacy” [ Clark 1973 ] 22

Mixed effects models Random effects structure �� x ⊺ β z ⊺ b y = η + + �� Linear predictor Error term 23

Post-editor variance �� 24

Recap: Experimental hypotheses 1. Post-edit reduces translation time 2. Post-edit increases quality 3. Suggestions prime the translator 4. Post-edit reduces drafting 25

Hypothesis #1: Reduced time �� 26

Hypothesis #1: Reduced time Post-edit reduces translation time ? 27

Hypothesis #1: Reduced time Post-edit reduces translation time ? Yes! p < 0 . 001 Significant covariates Source length % nouns in sentence 27

The Efficacy of Human Post-Editing for Language Translation Spence - PowerPoint PPT Presentation

The Efficacy of Human Post-Editing for Language Translation Spence Green Jeffrey Heer Christopher D. Manning Stanford University CHI 2013 // 29 April 2013 Ngarrka-ngku ka wawirri panti-rni Ngarrka-ngku ka wawirri panti-rni man

I n t e r n s L i g h t n i n g T a l k s Proxy editing PiTiVi Proxy editing

Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat

Efficacy and Crop Safety Data Development Issues David Richardson Pesticides Safety Directorate

Non Linear Editing Programmable Solutions for the Broadcast Industry Non Linear Editing

Developmental Editing What is developmental editing? Who does the developmental edit?

Batch Metadata Editing in DSpace 1.6+ Maureen P. Walsh, The Ohio State University Libraries

RGBN IMAGE EDITING SIBGRAPI 2009 THIAGO PEREIRA LUIZ VELHO IMPA OUTLINE RGBN LINEAR EDITING

Yo: A video editing language Mengqing Wang, Munan Cheng, Tiezheng Li, Yufei Ou Introduction -

Drug Efficacy and Response Minutes of the TEG on Drug Efficacy and Response Malaria Policy

Human Language vs. Animal Communication Linguistics 101 Human Language vs. Animal Communication

Estimating post-editing effort State-of-the-art systems and open issues Lucia Specia University

EFFICACY TOPICS EFFICACY TOPICS Public ICH meeting - Brussels 14 th November 2008 International

editing technique Emma de Pater CGEC Cancer Genome Editing Center CRISPR/Cas9 CRISPR/Cas9

Photoshopping and Video Editing By Mitchell Schirmers History of photo and video editing

Photo-editing and presentation: a guide to image editing and presentation for photographers and

SNAPSEED, a Photo Editing App for Mobile Devices Nancy Matheson Snapseed is a photo-editing

Compilers & Translator Writing Systems Prof. R. Eigenmann ECE573, Fall 2005

Head Finalization: Translation from SVO to SOV Hideki Isozaki Okayama

Introduction to Machine Translation CMSC 723 / LING 723 / INST 725 Marine Carpuat Slides &

DIFFUSION PROCESS IN NETWORKS THE CASE OF GMO SOYBEAN IN ARGENTINA THE CASE OF GMO SOYBEAN IN

Cross-ISA Machine Instrumentation Cross-ISA Machine Instrumentation using Fast and Scalable

Tree-based and Forest-Based Translation Liang Huang Joint work with Kevin Knight (ISI), Aravind

Machine Translation Luke Zettlemoyer (Slides adapted from Karthik Narasimhan, Chris Manning, Dan

Translation from SQL into the relational algebra Consider the following relational schema:

Sambuz

Useful Links

Newsletter

Mail Us

The Efficacy of Human Post-Editing for Language Translation Spence - PowerPoint PPT Presentation

The Efficacy of Human Post-Editing for Language Translation Spence Green Jeffrey Heer Christopher D. Manning Stanford University CHI 2013 // 29 April 2013 Ngarrka-ngku ka wawirri panti-rni Ngarrka-ngku ka wawirri panti-rni man

I n t e r n s L i g h t n i n g T a l k s Proxy editing PiTiVi Proxy editing

Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat Chicken Human 1 Human 2 Rat

Efficacy and Crop Safety Data Development Issues David Richardson Pesticides Safety Directorate

Non Linear Editing Programmable Solutions for the Broadcast Industry Non Linear Editing

Developmental Editing What is developmental editing? Who does the developmental edit?

Batch Metadata Editing in DSpace 1.6+ Maureen P. Walsh, The Ohio State University Libraries

RGBN IMAGE EDITING SIBGRAPI 2009 THIAGO PEREIRA LUIZ VELHO IMPA OUTLINE RGBN LINEAR EDITING

Yo: A video editing language Mengqing Wang, Munan Cheng, Tiezheng Li, Yufei Ou Introduction -

Drug Efficacy and Response Minutes of the TEG on Drug Efficacy and Response Malaria Policy

Human Language vs. Animal Communication Linguistics 101 Human Language vs. Animal Communication

Estimating post-editing effort State-of-the-art systems and open issues Lucia Specia University

EFFICACY TOPICS EFFICACY TOPICS Public ICH meeting - Brussels 14 th November 2008 International

editing technique Emma de Pater CGEC Cancer Genome Editing Center CRISPR/Cas9 CRISPR/Cas9

Photoshopping and Video Editing By Mitchell Schirmers History of photo and video editing

Photo-editing and presentation: a guide to image editing and presentation for photographers and

SNAPSEED, a Photo Editing App for Mobile Devices Nancy Matheson Snapseed is a photo-editing

Compilers &amp; Translator Writing Systems Prof. R. Eigenmann ECE573, Fall 2005

Head Finalization: Translation from SVO to SOV Hideki Isozaki Okayama

Introduction to Machine Translation CMSC 723 / LING 723 / INST 725 Marine Carpuat Slides &amp;

DIFFUSION PROCESS IN NETWORKS THE CASE OF GMO SOYBEAN IN ARGENTINA THE CASE OF GMO SOYBEAN IN

Cross-ISA Machine Instrumentation Cross-ISA Machine Instrumentation using Fast and Scalable

Tree-based and Forest-Based Translation Liang Huang Joint work with Kevin Knight (ISI), Aravind

Machine Translation Luke Zettlemoyer (Slides adapted from Karthik Narasimhan, Chris Manning, Dan

Translation from SQL into the relational algebra Consider the following relational schema:

Sambuz

Useful Links

Newsletter

Mail Us

Compilers & Translator Writing Systems Prof. R. Eigenmann ECE573, Fall 2005

Introduction to Machine Translation CMSC 723 / LING 723 / INST 725 Marine Carpuat Slides &