Machine Translation Contd Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 March 7, 2017 Based on slides from Richard Socher, Chris Manning, Philipp Koehn, and everyone else they copied from.
Upcoming… Status report due tonight: March 7, 2017 • Project Almost final report, only 5 pages • Paper summaries: March 14 • Summaries • Summary 2 graded Homework 4 is due on March 13 • Homework • Write-up, code, and data released. Lowest grade of the homeworks will be dropped • CS 295: STATISTICAL NLP (WINTER 2017) 2
Outline Decoding Algorithms Syntax-Based MT Neural MT Models CS 295: STATISTICAL NLP (WINTER 2017) 3
Outline Decoding Algorithms Syntax-Based MT Neural MT Models CS 295: STATISTICAL NLP (WINTER 2017) 4
Phrase Decoding: Stacks CS 295: STATISTICAL NLP (WINTER 2017) 5
Monotonic Decoding CS 295: STATISTICAL NLP (WINTER 2017) 6
Monotonic Phrase Decoding CS 295: STATISTICAL NLP (WINTER 2017) 7
Monotonic Phrase Decoding (Mary) CS 295: STATISTICAL NLP (WINTER 2017) 8
Monotonic Phrase Decoding (Mary) (did not) CS 295: STATISTICAL NLP (WINTER 2017) 9
Monotonic Phrase Decoding (Mary) (did not) (slap) CS 295: STATISTICAL NLP (WINTER 2017) 10
Monotonic Phrase Decoding (Mary) (did not) (slap) (the) CS 295: STATISTICAL NLP (WINTER 2017) 11
Monotonic Phrase Decoding (Mary) (did not) (slap) (the) (green witch) CS 295: STATISTICAL NLP (WINTER 2017) 12
Non-Monotonic Decoding CS 295: STATISTICAL NLP (WINTER 2017) 13
Non-Monotonic Decoding CS 295: STATISTICAL NLP (WINTER 2017) 14
Non-Monotonic Decoding CS 295: STATISTICAL NLP (WINTER 2017) 15
Non-Monotonic Decoding CS 295: STATISTICAL NLP (WINTER 2017) 16
Non-Monotonic Decoding CS 295: STATISTICAL NLP (WINTER 2017) 17
Non-Monotonic Decoding CS 295: STATISTICAL NLP (WINTER 2017) 18
Non-Monotonic Decoding CS 295: STATISTICAL NLP (WINTER 2017) 19
Comparing Partial Translations CS 295: STATISTICAL NLP (WINTER 2017) 20
Hypothesis Recombination CS 295: STATISTICAL NLP (WINTER 2017) 21
Hypothesis Recombination CS 295: STATISTICAL NLP (WINTER 2017) 22
Multi-Stack Decoding function S TACK D ECODING (source sentence) initialize stacks from 0..n for i = 0 .. n-1 for h in k-best hypothesis from stack[ i ] for h’ in possible expansion of h assign a score to h’ push h’ onto stack[len( h’ )] return max(stack[ n ]) CS 295: STATISTICAL NLP (WINTER 2017) 23
Outline Decoding Algorithms Syntax-Based MT Neural MT Models CS 295: STATISTICAL NLP (WINTER 2017) 24
Limits of the Phrase Model Ich habe das Auto gekauft Non-Contiguous Phrases I bought the car Den Antrag verabschiedet das Parlament Syntactic Transformations The draft approves the Parliament CS 295: STATISTICAL NLP (WINTER 2017) 25
Syntax leads to Good English CS 295: STATISTICAL NLP (WINTER 2017) 26
The Vauquios Triangle CS 295: STATISTICAL NLP (WINTER 2017) 27
String to Tree Translation CS 295: STATISTICAL NLP (WINTER 2017) 28 [Yamada and Knight, 2001]
String to Tree Translation CS 295: STATISTICAL NLP (WINTER 2017) 29
Synchronous CFGs CS 295: STATISTICAL NLP (WINTER 2017) 30
Outline Decoding Algorithms Syntax-Based MT Neural MT Models CS 295: STATISTICAL NLP (WINTER 2017) 31
Neural MT Models CS 295: STATISTICAL NLP (WINTER 2017) 32
Recurrent Neural Networks CS 295: STATISTICAL NLP (WINTER 2017) 33
Recurrent Neural Networks Bee’s knees rodillas de abejas CS 295: STATISTICAL NLP (WINTER 2017) 34
Different Weights Bee’s knees rodillas de abejas CS 295: STATISTICAL NLP (WINTER 2017) 35
More Connections Bee’s knees rodillas de abejas CS 295: STATISTICAL NLP (WINTER 2017) 36
CS 295: STATISTICAL NLP (WINTER 2017) 37
Other Extensions Bee’s knees Reverse rodillas de abejas abejas de rodillas CS 295: STATISTICAL NLP (WINTER 2017) 38
Other Extensions Bee’s knees Stacking rodillas de abejas CS 295: STATISTICAL NLP (WINTER 2017) 39
Other Extensions Bi-directional de rodillas abejas Bee’s knees rodillas de abejas CS 295: STATISTICAL NLP (WINTER 2017) 40
Regular Recurrent Units h t-1 h t x t CS 295: STATISTICAL NLP (WINTER 2017) 41
Gated Recurrent Units h t-1 h t ĥ t r t z t x t CS 295: STATISTICAL NLP (WINTER 2017) 42
Long Short-Term Memory h t-1 o t h t ĉ t c t-1 c t f t i t x t CS 295: STATISTICAL NLP (WINTER 2017) 43
Neural MT Results CS 295: STATISTICAL NLP (WINTER 2017) 44
Trend in Machine Translation CS 295: STATISTICAL NLP (WINTER 2017) 45
Multilingual Neural MT: Naïve English-Spanish English-Spanish English Spanish Encoder Decoder English-Hindi English-Hindi English Hindi Encoder Decoder English-Chinese English-Chinese English Chinese Encoder Decoder CS 295: STATISTICAL NLP (WINTER 2017) 46
Multilingual NMT: Decoder English-Spanish Spanish Encoder English-Hindi Shared English Hindi Encoder Decoder English-Chinese Chinese Encoder CS 295: STATISTICAL NLP (WINTER 2017) 47
Multilingual NMT: Encoder English-Spanish Spanish Decoder English English-Hindi English Hindi Encoder Decoder English-Chinese Chinese Decoder CS 295: STATISTICAL NLP (WINTER 2017) 48
Google Neural MT Spanish Spanish Spanish Encoder Spanish Decoder Hindi Hindi Encoder Hindi Decoder Hindi GNMT Chinese Chinese Encoder Chinese Decoder Chinese English English Encoder English Decoder English CS 295: STATISTICAL NLP (WINTER 2017) 49
Out of Control! CS 295: STATISTICAL NLP (WINTER 2017) 50 From Ryan Cotterell @_shrdlu_
Out of Control! CS 295: STATISTICAL NLP (WINTER 2017) 51 From Iain Murray @driainmurray
Out of Control! CS 295: STATISTICAL NLP (WINTER 2017) 52 From Iain Murray @driainmurray
Recommend
More recommend