Machine Translation Machine Translation Berlin Chen 2003 - PowerPoint PPT Presentation

Machine Translation Machine Translation Berlin Chen 2003 References: 1. Natural Language Understanding, chapter 13 2. W. A. Gale and K. W. Church, A Program for Aligning Sentences in Bilingual Corpora, Computational Linguistics 1993 1

Machine Translation (MT) • Definition – Automatic translation of text or speech from one language to another • Goal – Produce close to error-free output that reads fluently in the target language – Far from it ? • Current Status – Existing systems are used in restricted domains – A mix of probabilistic and non-probabilistic components 2

Issues • Build high-quality semantic-based MT systems in circumscribed domains • Abandon automatic MT, build software to assist human translators instead – Post-edit the output of a buggy translation • Develop automatic knowledge acquisition techniques for improving general-purpose MT 3

Different Strategies for MT Interlingua (knowledge representation) knowledge-based translation English French (semantic (semantic semantic transfer representation) representation) English French (syntactic parse) (syntactic parse) syntactic transfer English Text French Text (word string) (word string) word-for-word 4

Word for Word MT 1950 • Translate words one-by-one from one language to another • Problems – No one-to-one correspondence between words in different languages (lexical ambiguity) • Need to look at the context larger than individual word ( → phrase or clause ) – Languages have different word orders 5

Syntactic Transfer MT • Parse the source text, then transfer the parse tree of the source text into a syntactic tree in the target language, and then generate the translation from this syntactic tree • Problems – Syntactic ambiguity – The target syntax will likely mirror that of the source text Adv V N German : Ich esse gern (I like to eat) English : I eat readily 6

Semantic Transfer MT • Represent the meaning of the source sentence and then generate the translation from the meaning • Problems – Still be unnatural to the point of being unintelligible – Difficult to build the translation system for all pairs of languages Spanish : La botella entró a la cueva flotando (The bottle floated into the cave) English : The bottle entered the cave floating 7

Knowledge-Based MT • The translation is performed by way of a knowledge representation formulism called “ interlingua” – Independence of the way particular language s express meaning • Problems – Difficult to design an efficient and comprehensive knowledge representation formulism – Large amount of ambiguity needed to be solved to translate from a natural language to a knowledge representation language 8

Text Alignment • Definition – Align paragraphs, sentences or words in one language to paragraphs, sentences or words in another languages • Thus can learn which words tend to be translated by which other words in another language bilingual dictionaries, MT , parallel grammars … • Applications – Bilingual lexicography – Machine translation – Multilingual information retrieval – … 9

Text Alignment • Sources of Parallel texts or bitexts – Parliamentary proceedings (Hansards) with less literal – Newspapers and magazines translation – Religious and literary works • Two levels of alignment – Gross large scale alignment • Learn which paragraphs or sentences correspond to which paragraphs or sentences in another language – Word alignment • Learn which words tend to be translated by which words in another language 10

Text Alignment 2:2 alignment 11

Text Alignment 2:2 alignment 1:1 alignment 1:1 alignment 2:1 alignment 12

Sentence Alignment Length-based method • Rationale : the short sentences will be translated as short sentences and long sentences as long sentences – Length is defined as the number of words or the number of characters • Approach 1 ( Gale & Church 1993 ) – Assumptions • The paragraph structure was clearly marked in the s 1 t 1 s 2 corpus, confusions are checked by hand t 2 s 3 t 3 s 4 t 4 • Crossing dependences are not handled here . . . . – The order of sentences are not changed in the . . translation s I . 13 t J

Sentence Alignment Length-based method Most cases are 1:1 alignments. 14

Sentence Alignment Length-based method source target s 1 t 1 B 1 = S s s s L s 2 t 2 1 2 I s 3 t 3 = B 2 possible alignments: T t t t L 1 2 J s 4 t 4 {1:1, 1:0, 0:1, 2:1,1:2, 2:2,…} B 3 . . . . . . probability independence s I . B k a bead between beads t J ( ) K ( ) ( ) ∏ = ≈ arg max P A S , T arg max P A , S , T P B k = A A k 1 ( ) = where A B , B ,..., B 15 1 2 k

Sentence Alignment Length-based method – Dynamic Programming • The cost function (Distance Measure) Bayes’ Law ( ) ( ) ( ) α = − α δ µ 2 cost align l , l log P align l , l , , s 1 2 1 2 [ ] ( ( ) ) ( ) ≈ − α δ µ α 2 log P align P l , l , , s align ( ) − 1 2 log P B k ( ) ( ( ) ) δ ⋅ δ µ 2 = − µ 2 is a normal distribution l , l , , s l l l s 1 2 2 1 1 square difference of two L Ratio of texts in two languages = µ 2 paragraphs L 1 ( ( ) ) ( ( ) ) δ µ α = − δ 2 P l , l , , s align 2 1 prob 1 2 The prob. distribution • Sentence is the unit of alignment of standard normal distribution • Statistically modeling of character lengths 16

Sentence Alignment Length-based method • The priori probability Or P ( α align) Source ( ) ( ) − + φ  D i , j 1 cost 0 : 1 align , t j  ( ) ( ) − + φ D i 1 , j cost 1 : 0 align s , s i  i ( ) ( )  − − + D i 1 , j 1 cost 1 : 1 align s , t s i-1  ( ) = D i , j i j  ( ) ( ) − − + s i-2 D i 1 , j 2 cost 1 : 2 align s , t , t  i j − 1 j ( ) ( ) t j-2 t j-1  t j − − + D i 2 , j 1 cost 2 : 1 align s , s , t  − i 1 i j ( ) ( )  − − + D i 2 , j 2 cost 2 : 2 align s , s , t , t  Target − − i 1 i j 1 j 17

Sentence Alignment Length-based method – A simple example L 1 alignment 1 L 1 alignment 2 s 1 t 1 cost(align( s 1 , t 1 )) cost(align( s 1 , s 2 , t 1 )) t 1 + s 2 t 2 + cost(align( s 2 , t 2 )) + s 3 t 2 cost(align( s 3 , t 2 )) cost(align( s 3 ,Ø)) + + s 4 t 3 t 3 cost(align( s 4 , t 3 )) cost(align( s 4 , t 3 )) 18

Sentence Alignment Length-based method – The experimental results 19

Sentence Alignment Length-based method – 4% error rate was achieved – Problems : • Can not handle noisy and imperfect input – E.g., OCR output or file containing unknown markup conventions – Finding paragraph or sentence boundaries is difficult – Solution : just align text (position) offsets in two parallel texts (Church 1993) • Questionable for languages with few cognates or different writing systems – E.g., English ←→ Chinese eastern European languages ←→ Asian languages 20

Sentence Alignment Length-based method • Approach 2 ( Brown 1991 ) – Compare sentence length in words rather than characters • However, variance in number of words us greater than that of characters – EM training for the model parameters • Approach 3 ( Wu 1994 ) – Apply the method of Gale and Church(1993) to a corpus of parallel English and Cantonese text – Also explore the use of lexical cues 21

Sentence Alignment Lexical method • Rationale : the lexical information gives a lot of confirmation of alignments – Use a partial alignment of lexical items to induce the sentence alignment – That is, a partial alignment at the word level induces a maximum likelihood at the sentence level – The result of the sentence alignment can be in turn to refine the word level alignment 22

Sentence Alignment Lexical method • Approach 1 (Kay and Röscheisen 1993) – First assume the first and last sentences of the text were align as the initial anchors – Form an envelope of possible alignments • Alignments excluded when sentences across anchors or their respective distance from an anchor differ greatly – Choose word pairs their distributions are similar in most of the sentences – Find pairs of source and target sentences which contain many possible lexical correspondences • The most reliable of pairs are used to induce a set Iterations of partial alignment (add to the list of anchors) 23

Sentence Alignment Lexical method • Approach 1 – Experiments • On Scientific American articles – 96% coverage achieved after 4 iterations, the reminders is 1:0 and 0:1 matches • On 1000 Hansard sentences – Only 7 errors (5 of them are due to the error of sentence boundary detection) were found after 5 iterations – Problem • If a large text is accompanied with only endpoints for anchors, the pillow must be set to large enough, or the correct alignments will be lost – Pillow is treated as a constraint 24

Machine Translation Machine Translation Berlin Chen 2003 - PowerPoint PPT Presentation

Machine Translation Machine Translation Berlin Chen 2003 References: 1. Natural Language Understanding, chapter 13 2. W. A. Gale and K. W. Church, A Program for Aligning Sentences in Bilingual Corpora, Computational Linguistics 1993 1

Statistical Machine Translation Nadir Durrani 21-November-2014 Machine Translation

Introd u ction to machine translation MAC H IN E TR AN SL ATION IN P YTH ON Th u shan

Machine Translation Machine Translation February 13, 2008 Andreas Eisele UdS Computerlinguistik

Neural Machine Translation Gongbo Tang 8 October 2018 Outline Neural Machine Translation 1

11-731 Machine Translation Speech 2 Speech Translation Speech Translation Three part systems

Machine Translation Philipp Koehn 28 April 2020 Philipp Koehn Artificial Intelligence: Machine

Statistical Machine Translation Statistical Machine Translation p Lecture 2 Theory and Praxis of

Computer Aided Translation Philipp Koehn 30 April 2015 Philipp Koehn Machine Translation:

Computer Aided Translation Philipp Koehn 15 November 2018 Philipp Koehn Machine Translation:

Machine Translation: Going Deep Philipp Koehn 4 June 2015 Philipp Koehn Machine Translation:

Machine Translation Philipp Koehn 1 December 2015 Philipp Koehn Artificial Intelligence:

Neural Machine Translation II Refinements Philipp Koehn 17 October 2017 Philipp Koehn Machine

Representing Huge Translation Models Statistical Machine Translation parallel text + alignment

Global Translation Services Website translation using post-edited machine translation and

Community Translation By Willem Stoeller Examples Community Translation Virtual Teams Powering

Machine Translation 12: (Non-neural) Statistical Machine Translation Rico Sennrich University of

One Multidisciplinary Approach to IT Education J. Michael Dunn, Dean

Cross-Language Retrieval LBSC 796/INFM 718R Douglas W. Oard Session 12: April 27, 2011 Agenda

Lexical representations in theories of word reading Stphan Tulkens Overview Color legend

21 Advanced Topics 3: Sub-word MT Up until this point, we have treated words as the atomic unit

Dictionary and Monolingual Corpus-based Query Translation for Basque-English CLIR Xabier Saralegi

Asse ssessing F ssing Frenc nch a h at K t KS3 S3 without le without levels ls results

Q2-2020 Earnings July 22, 2020 Forward Looking Statements This communication may be deemed to

WWTF Financing Rev It Up Loan Program January 13, 2016 Rev It Up Loan Program Legislative

Sambuz

Useful Links

Newsletter

Mail Us