Machine Translation Contd Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 March 2, 2017 Based on slides from Dan Klein, Philipp Koehn, Jacob Eisenstein, and everyone else they copied from.
Omer Levy AI/ML Seminar Monday, March 6 th 1pm-2pm DBH 4011 Understanding Word Embeddings Meeting with Graduate Students 4:00-4:45pm Room TBA (email me) CS 295: STATISTICAL NLP (WINTER 2017) 2
Upcoming… Status report due in 1 weeks: March 7, 2017 • Project Instructions coming today! • Almost final report, only 5 pages • Paper summaries: March 14 • Summaries • Summary 1 graded Homework 4 is due on March 13 • Homework Write-up and data releasing soon. • CS 295: STATISTICAL NLP (WINTER 2017) 3
Outline EM-Algorithm for Alignments Phrase-Based MT Decoding Algorithms Syntax-Based MT CS 295: STATISTICAL NLP (WINTER 2017) 4
Outline EM-Algorithm for Alignments Phrase-Based MT Decoding Algorithms Syntax-Based MT CS 295: STATISTICAL NLP (WINTER 2017) 5
Parameters of the IBM Models CS 295: STATISTICAL NLP (WINTER 2017) 6
Parameters of the IBM Models CS 295: STATISTICAL NLP (WINTER 2017) 7
Translation from Alignments CS 295: STATISTICAL NLP (WINTER 2017) 8
Alignments from Translation CS 295: STATISTICAL NLP (WINTER 2017) 9
Expectation Maximization Expectation Maximization CS 295: STATISTICAL NLP (WINTER 2017) 10
Example CS 295: STATISTICAL NLP (WINTER 2017) 11
Example CS 295: STATISTICAL NLP (WINTER 2017) 12
Word-based MT: Problems Multi-word Alignments Non-compositionality Phrasal Translations CS 295: STATISTICAL NLP (WINTER 2017) 13
Outline EM-Algorithm for Alignments Phrase-Based MT Decoding Algorithms Syntax-Based MT CS 295: STATISTICAL NLP (WINTER 2017) 14
The Vauquios Triangle CS 295: STATISTICAL NLP (WINTER 2017) 15
Phrase-based MT Mary did not slap the green witch CS 295: STATISTICAL NLP (WINTER 2017) 16
Phrase Lexicon CS 295: STATISTICAL NLP (WINTER 2017) 17
Learning Phrasal Alignments CS 295: STATISTICAL NLP (WINTER 2017) 18
Learning Phrasal Alignments CS 295: STATISTICAL NLP (WINTER 2017) 19
Learning Phrasal Alignments CS 295: STATISTICAL NLP (WINTER 2017) 20
Phrasal Alignments Should contain all the alignment points for covered words CS 295: STATISTICAL NLP (WINTER 2017) 21
Learning Phrasal Alignments (Maria, Mary), (no, did not), (slap, daba una bofetada), (a la, the), (bruja, witch), (verde, green) CS 295: STATISTICAL NLP (WINTER 2017) 22
Learning Phrasal Alignments (Maria, Mary), (no, did not), (slap, daba una bofetada), (a la, the), (bruja, witch), (verde, green) (Maria no, Mary did not), (no daba una bofetada, did not slap), (daba una bofetada a la, slap the), (bruja verde, green witch) CS 295: STATISTICAL NLP (WINTER 2017) 23
Learning Phrasal Alignments (Maria, Mary), (no, did not), (slap, daba una bofetada), (a la, the), (bruja, witch), (verde, green) (Maria no, Mary did not), (no daba una bofetada, did not slap), (daba una bofetada a la, slap the), (bruja verde, green witch) (Maria no daba una bofetada, Mary did not slap), (no daba una bofetada a la, did not slap the), (a la bruja verde, the green witch) CS 295: STATISTICAL NLP (WINTER 2017) 24
Learning Phrasal Alignments (Maria, Mary), (no, did not), (slap, daba una bofetada), (a la, the), (bruja, witch), (verde, green) (Maria no, Mary did not), (no daba una bofetada, did not slap), (daba una bofetada a la, slap the), (bruja verde, green witch) (Maria no daba una bofetada, Mary did not slap), (no daba una bofetada a la, did not slap the), (a la bruja verde, the green witch) (Maria no daba una bofetada a la, Mary did not slap the), (daba una bofetada a la bruja verde, slap the green witch) CS 295: STATISTICAL NLP (WINTER 2017) 25
Phrase Translation Scores CS 295: STATISTICAL NLP (WINTER 2017) 26
Phrases for a Sentence wir müssen auch diese kritik ernst nehmen (wir müssen, we must) (wir müssen auch, we must also) (ernst, seriously) …. CS 295: STATISTICAL NLP (WINTER 2017) 27
Derivations for a Sentence CS 295: STATISTICAL NLP (WINTER 2017) 28
Distortion Limits CS 295: STATISTICAL NLP (WINTER 2017) 29
Distortion Scores CS 295: STATISTICAL NLP (WINTER 2017) 30
Scoring Derivations CS 295: STATISTICAL NLP (WINTER 2017) 31
The Translation Problem CS 295: STATISTICAL NLP (WINTER 2017) 32
A Secret of Statistical MT CS 295: STATISTICAL NLP (WINTER 2017) 33
Outline EM-Algorithm for Alignments Phrase-Based MT Decoding Algorithms Syntax-Based MT CS 295: STATISTICAL NLP (WINTER 2017) 34
The Decoding Task CS 295: STATISTICAL NLP (WINTER 2017) 35
Monotonic Word Decoding CS 295: STATISTICAL NLP (WINTER 2017) 36
Monotonic Word Decoding CS 295: STATISTICAL NLP (WINTER 2017) 37
Monotonic Word Decoding Mary CS 295: STATISTICAL NLP (WINTER 2017) 38
Monotonic Word Decoding Mary did not CS 295: STATISTICAL NLP (WINTER 2017) 39
Monotonic Word Decoding Mary did not give CS 295: STATISTICAL NLP (WINTER 2017) 40
Monotonic Word Decoding Mary did not give a CS 295: STATISTICAL NLP (WINTER 2017) 41
Monotonic Word Decoding Mary did not give a slap CS 295: STATISTICAL NLP (WINTER 2017) 42
Monotonic Word Decoding Mary did not give a slap to CS 295: STATISTICAL NLP (WINTER 2017) 43
Monotonic Word Decoding Mary did not give a slap to the CS 295: STATISTICAL NLP (WINTER 2017) 44
Monotonic Word Decoding Mary did not give a slap to the witch CS 295: STATISTICAL NLP (WINTER 2017) 45
Monotonic Word Decoding Mary did not give a slap to the witch green CS 295: STATISTICAL NLP (WINTER 2017) 46
Monotonic Word Decoding CS 295: STATISTICAL NLP (WINTER 2017) 47
Phrase Decoding: Stacks CS 295: STATISTICAL NLP (WINTER 2017) 48
Phrase Decoding: Stacks CS 295: STATISTICAL NLP (WINTER 2017) 49
Monotonic Phrase Decoding CS 295: STATISTICAL NLP (WINTER 2017) 50
Monotonic Phrase Decoding CS 295: STATISTICAL NLP (WINTER 2017) 51
Monotonic Phrase Decoding (Mary) CS 295: STATISTICAL NLP (WINTER 2017) 52
Monotonic Phrase Decoding (Mary) (did not) CS 295: STATISTICAL NLP (WINTER 2017) 53
Monotonic Phrase Decoding (Mary) (did not) (slap) CS 295: STATISTICAL NLP (WINTER 2017) 54
Monotonic Phrase Decoding (Mary) (did not) (slap) (the) CS 295: STATISTICAL NLP (WINTER 2017) 55
Monotonic Phrase Decoding (Mary) (did not) (slap) (the) (green witch) CS 295: STATISTICAL NLP (WINTER 2017) 56
Recommend
More recommend