Why Neural Translations are the Right Length
Xing Shi, Kevin Knight, and Deniz Yuret; EMNLP 2016
What is the fundamental question for a PhD student?
How to publish a lot of high-quality papers?
How to graduate in 5 years?

The same questions, for MT: PhD Life || MT
publish a lot of high-quality papers: H-index || BLEU
graduate in 5 years: 5 years || right length
2-layer, 1000-hidden-unit, non-attentional LSTM seq2seq:
Language Pair        BLEU   Length Ratio (MT output / reference)
English => Spanish   31.0   0.97
English => French    29.8   0.96
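As a quick sketch of how the length-ratio column can be computed (illustrative only, not the paper's evaluation script; the file names and whitespace tokenization are assumptions):

# Sketch: corpus-level length ratio = total MT output tokens / total reference tokens.
def length_ratio(output_path, reference_path):
    with open(output_path, encoding="utf-8") as out_f, open(reference_path, encoding="utf-8") as ref_f:
        out_tokens = sum(len(line.split()) for line in out_f)
        ref_tokens = sum(len(line.split()) for line in ref_f)
    return out_tokens / ref_tokens

# Example usage (hypothetical file names):
# print(length_ratio("mt_output.fr", "reference.fr"))   # e.g. ~0.96 for English => French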
English: does he know about phone hacking?
French reference: a-t-il connaissance du piratage téléphonique ?
French translation: <UNK> <UNK> <UNK> <UNK> ?
When to stop / How to generate the right length?
Statistical MT (PBMT): [- - - -] → [- x - -] → [x x x x]
  explicit word-penalty feature; weights tuned with MERT; heavy beam search
Neural MT: Word → Word → <EOS>
  no explicit length penalty; trained with MLE; light beam search (beam = 10)
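To make the contrast concrete, here is a minimal Python sketch of the two stopping styles; it is not the paper's code, and pbmt_score / step_fn are made-up placeholders:

# Statistical MT style: output length is shaped by an explicit word-penalty feature,
# whose weight (like the other feature weights) is tuned with MERT.
def pbmt_score(features, weights, num_words, word_penalty_weight):
    return sum(w * f for w, f in zip(weights, features)) + word_penalty_weight * num_words

# Neural MT style: no explicit length feature; decoding simply stops when <EOS> is produced.
def greedy_decode(step_fn, state, max_len=100):
    output = []
    for _ in range(max_len):
        token, state = step_fn(state)   # step_fn returns the next token and the new decoder state
        if token == "<EOS>":
            break
        output.append(token)
    return output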
Toy Example: String Copy
a a a b b <EOS> → a a a b b <EOS>
b b a <EOS> → b b a <EOS>
Training data: 2,500 random strings
Model: single-layer LSTM with 4 hidden units
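A minimal sketch of how such training data could be generated; the alphabet {a, b}, the length range, and the file format are assumptions, since the slides only give the two examples above:

import random

random.seed(0)
with open("copy_train.txt", "w", encoding="utf-8") as f:
    for _ in range(2500):
        length = random.randint(1, 20)
        tokens = [random.choice(["a", "b"]) for _ in range(length)]
        line = " ".join(tokens + ["<EOS>"])
        f.write(f"{line}\t{line}\n")   # source and target are identical in the copy task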
Toy Example: String Copy
Figure: the seq2seq model unrolled over input <s> b a and output b a <EOS>, with the 4-dimensional cell state shown at one step, e.g. C_t = [-2.1, 2, 0.5, 0.6].
Toy Example: String Copy
The cell state C_t is updated with only elementwise addition and multiplication, so each of its 4 units can be inspected independently.
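For reference, the standard LSTM update c_t = f_t * c_{t-1} + i_t * g_t acts elementwise, so one unit can move without disturbing the others; the numpy sketch below uses made-up gate values purely for illustration:

import numpy as np

c_prev = np.array([-2.1, 2.0, 0.5, 0.6])   # previous cell state (4 hidden units)
f_t = np.array([1.0, 1.0, 1.0, 1.0])       # forget gate: keep everything
i_t = np.array([1.0, 0.0, 0.0, 0.0])       # input gate: only unit_1 is written
g_t = np.array([-1.0, 0.0, 0.0, 0.0])      # candidate values
c_t = f_t * c_prev + i_t * g_t             # elementwise only -> [-3.1, 2.0, 0.5, 0.6]
print(c_t)                                 # unit_1 dropped by 1.0, the other units are untouched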
Plots: cell-state trajectories during string copy (x-axis: unit_1, y-axis: unit_2).
Observation: unit_1 = -len(input_string).
Toy Example: String Copy
<s> b b b a b a → <s> b b b a b a <EOS>
Encoding: cell-state unit_1 decreases by 1.0 per input token.
Decoding: cell-state unit_1 increases by 1.0 per output token.
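One way to verify this counting behaviour is to log unit_1 of the cell state at every step; in this hedged sketch, model.initial_state, model.encode_step, and model.decode_step are hypothetical hooks standing in for whatever seq2seq toolkit is used:

def trace_unit(model, source_tokens, unit=0):
    trace, state = [], model.initial_state()
    for tok in source_tokens:                  # encoding: unit_1 should fall by about 1.0 per token
        state = model.encode_step(state, tok)
        trace.append(state.cell[unit])
    tok = "<s>"
    while tok != "<EOS>":                      # decoding: unit_1 should rise by about 1.0 per token
        tok, state = model.decode_step(state, tok)
        trace.append(state.cell[unit])
    return trace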
Full Scale NMT
English => French
2-layer LSTM, 1000 hidden units per layer, non-attentional
BLEU = 29.8
Full Scale NMT
Linear regression: Y = w_1*X_1 + w_2*X_2 + ... + w_1000*X_1000 + b
For each position of each sentence (e.g. Sentence_i: "It is raining right now"), Y is the time step (1, 2, 3, 4, 5) and X is the 1000 cell states of one layer at that position.
In total: 143,379 (Y, X) pairs.
Full Scale NMT
Y = w_1*X_1 + w_2*X_2 + ... + w_1000*X_1000 + b
                                R²
1000 units in the lower layer   0.990
1000 units in the upper layer   0.981
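A minimal sketch of fitting this regression, assuming the (time step, cell state) pairs have already been dumped to arrays; scikit-learn is an assumption here, as the slides do not say which tool was used:

import numpy as np
from sklearn.linear_model import LinearRegression

# Placeholder random data; the real experiment uses 143,379 (Y, X) pairs,
# where each X is the 1000 cell states of one layer and Y is the time step.
X = np.random.randn(10000, 1000)
y = np.random.randint(1, 50, size=10000)

reg = LinearRegression().fit(X, y)
print("R^2 =", reg.score(X, y))   # the slides report 0.990 (lower layer) and 0.981 (upper layer)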
Full Scale NMT
Figure: cell-state units 109 and 334 over the course of a sentence.
Encoding: units 109 and 334 decrease from above zero.
Decoding: they increase again; once they are back above zero, the model is ready to generate <EOS>.
Conclusion
Toy Example: Unit 1 controls the length (counts down during encoding, back up during decoding).
Full Scale NMT: Unit 109 and Unit 334 contribute to the length (decrease during encoding, increase during decoding, <EOS> once back above zero).
Thanks and Q&A