ListNet-based MT Rescoring Jan Niehues, Quoc Khanh Do, Alexandre - PowerPoint PPT Presentation

ListNet-based MT Rescoring Jan Niehues, Quoc Khanh Do, Alexandre Allauzen and Alex Waibel KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS 0 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and KIT – University of the State of Baden-Wuerttemberg and www.kit.edu Robotics and LIMSI-CNRS National Research Center of the Helmholtz Association

Motivation Log-linear model is widely used in SMT Use during decoding Use in MT rescoring MT Rescoring Easy and efficient way to integrate of complex models Machine learning view Ranking problem Promising approach: ListNet algorithm Apply ListNet algorithm to SMT 1 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

Related Work Optimization in Machine translation Minimum Error Rate Training (MERT) (Och, 2003) Standard in most machine translation systems MIRA (Watanabe et al., 2007; Chiang et al., 2008) PRO (Hopkins and May, 2011) Expected BLEU (Rosti et al, 2011; He and Deng, 2012) Ranking in machine learning ListNet algorithm (Cao et al., 2007) Overview over different ranking algorithms (Chen et al., 2009) 2 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

Overview Motivation ListNet Algorithm MT Rescoring MT specific problems Evaluation WMT IWSLT - TED 3 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

ListNet -Ranking Input: List Model score Metric for reference ranking Hypothesis Model Metric A 7.4 24.4 B 7.8 24.2 C 7.2 24.5 D 7.1 24.1 4 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

ListNet -Ranking Input: List Model score Metric for reference ranking Hypothesis Model Metric B 7.8 24.2 A 7.4 24.4 C 7.2 24.5 D 7.1 24.1 According to the model 4 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

ListNet -Ranking Input: List Model score Metric for reference ranking Hypothesis Model Metric C 7.2 24.5 A 7.4 24.4 B 7.8 24.2 D 7.1 24.1 According to the metric 4 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

ListNet -Ranking Input: List Model score Metric for reference ranking Hypothesis Model Metric Hypothesis Model Metric B 7.8 24.2 C 7.2 24.5 A 7.4 24.4 A 7.4 24.4 ⇒ C 7.2 24.5 B 7.8 24.2 D 7.1 24.1 D 7.1 24.1 Aim: Learn a model to rank like the metric 4 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

ListNet - Idea Define a probability distribution over possible rankings Learn model that produces a distribution similar to the one defined by the metric Problem: large number of possible rankings Define a probability distribution associated to the model ranking based on first ranked object exp ( s j ) P s ( j ) = (1) ∑ n k = 1 exp ( s k ) 5 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

ListNet - Distribution 1 Model Metric 0.8 Probability 0.6 0.4 0.2 0 A B C D Hypothesis Minimize cross-entropy difference 6 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

MT Rescoring Use ListNet to rescore N-Best list Train log-linear model Input: N-Best list Additional features Learn new weights for log-linear model 8 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

Model Define probability distribution associated to the model ranking exp ( s j ) P s ( j ) = (2) ∑ n k = 1 exp ( s k ) Problem: Many scores are small probabilities Log-probabilities are very small negative values exp ( s ) calculation may be erroneous Feature normalization: Linear transform all features to the range [ − 1 , 1 ] Score normalization: Linear transform the final score of the model to the range [ − r , r ] 9 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

Metric Define probability distribution associated to the reference ranking Reference ranking for every sentence needed Ranking induced by MT metric Sentence-wise MT metric Metric: BLEU+1 (Liang et al. 2006) Smoothed version of BLEU score exp ( BLEU ( x ( i ) )) P y ( i ) ( x ( i ) j ) = (3) j j ′ = 1 exp ( BLEU ( x ( i ) ∑ n i j ′ ) 10 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

Training Minimize cross-entropy difference between model-based and BLEU+1-based probability distribution Use ListNet algorithm to calculate derivation Stochastic gradient descent 100,000 batches Batch size of 10 11 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

Evaluation WMT 2015 EN-DE PBMT System Additional features based on neural network translation models WMT 2015 DE-EN PBMT System Additional features using RBM-based translation models and source DWL TED 2014 EN-DE Translation of TED talks 13 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

WMT – English to German 22 ListNet PRO KBMira MERT 21.5 No Resco. BLEU 21 20.5 20 Baseline NCE SOUL SOUL+NCE Feature 14 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

WMT – German to English 29 ListNet PRO KBMira MERT 28.5 No Resco. BLEU 28 27.5 27 Baseline SDWL SDWL+RBMTM Feature 15 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

Convergence 15 Dev score 14.8 14.6 14.4 BLEU+1 14.2 14 13.8 13.6 13.4 13.2 13 0 200 400 600 800 1000 Samples (x1000) 16 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

Score normalization 22 Score Feature 21.5 21 BLEU 20.5 20 19.5 19 0.1 1 10 100 Range 17 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

TED – English to German 25 ListNet PRO 24.5 KBMira MERT No Resco. 24 BLEU 23.5 23 22.5 22 Baseline extra Dev Data Feature 18 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

Conclusion Presented a new technique to train log-linear model Scale to many features Consider whole list Technique can also be applied to more complex models Evaluated using different tasks and languages WMT English – German WMT German – English IWSLT –TED English – German Translation quality improvements measured in BLEU score Outperform MERT in all configurations Less prone to overfitting 19 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

WMT – English to German Baseline NCE SOUL SOUL+NCE System Dev Test Dev Test Dev Test Dev Test Baseline 20.19 MERT 20.63 20.52 21.24 20.92 21.36 20.84 21.36 20.94 KB-MIRA 20.64 20.38 21.51 20.96 21.65 20.83 21.71 21.06 PRO 20.17 21.01 21.04 21.25 21.18 21.31 21.14 21.34 ListNet 19.95 20.98 21.00 21.51 21.02 21.54 21.14 21.63 20 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

WMT – German to English Baseline SDWL SDWL+RBMTM System Dev Test Dev Test Dev Test Baseline 27.77 MERT 28.18 27.80 28.24 27.65 28.23 27.64 KB-MIRA 28.23 28.06 28.18 28.00 28.00 27.88 PRO 27.38 28.01 27.56 28.14 28.68 28.04 ListNet 28.00 27.87 27.89 28.18 27.94 28.28 21 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

TED – English to German Baseline extra Dev Data System Dev Test Dev Test Baseline 23.67 MERT 27.69 23.46 25.63 23.36 KB-MIRA 27.47 23.19 25.65 23.76 PRO 26.67 23.10 25.00 23.65 ListNet 27.37 23.51 25.49 24.08 22 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS

ListNet-based MT Rescoring Jan Niehues, Quoc Khanh Do, Alexandre - PowerPoint PPT Presentation

ListNet-based MT Rescoring Jan Niehues, Quoc Khanh Do, Alexandre Allauzen and Alex Waibel KIT - Institute for Anthropomatics and Robotics and LIMSI-CNRS 0 2015-09-17 Jan Niehues - ListNet-based MT Rescoring KIT - Institute for Anthropomatics

A Stack-based Algorithm for Neural Lattice Rescoring Gaurav Kumar Center for Language and Speech

EXEMPLAR-BASED SPEECH RECOGNITION IN A RESCORING APPROACH Georg Heigold, Google, USA Joint work

Forest Rescoring Faster Decoding with Integrated Language Models Liang Huang David Chiang ACL

based results vs based results based results based results based results based results vs

Lecture 3: reflex-based, model-based, goal-based, utility-based, learning-based Systematic

SCA Based SCA Based SCA Based SCA Based Wideband Networking Wideband Networking Waveforms

Dichotomy between Rights-based and Market- based Dichotomy between Rights-based and Market- based

Detecting Attacks Anomaly-based Detection Signature-based Signature-based (Misuse)

Attribute-Based Cryptography Lecture 21 And Pairing-Based Cryptography 1 Identity-Based

Designing and Developing Person-Based and Topic-Based Person-Based and Topic-Based Data

Landmark Landmark-based routing based routing Landmark Landmark-based routing based routing

Attribute-Based Cryptography Lecture 21 And Pairing-Based Cryptography Identity-Based

Rare Rare Rare-earth Rare-earth earth-based half earth-based half based half-Heusler based

Outcome Based Approach in Outcome Based Approach in Outcome Based Approach in Outcome Based

Using Rule-Based Activity Using Rule-Based Activity Using Rule-Based Activity Using Rule-Based

Trust based Clustering for Group Trust based Clustering for Group Trust based Clustering for

Convolutional over Recurrent Encoder for Neural Machine Translation Praveen Dakwale and Christof

and Configuration Selection Tei-Wei Kuo Embedded Systems and Wireless Networking Laboratory Dept

Spectral Clustering Guokun Lai 2016/10 1 / 37 Organization Graph Cut Fundamental

Binomial Arrays and Generalized Vandermonde Identities Robert W. Donley, Jr. (CUNY-QCC) March

Marketing Liverpool Update Strategic - Discover England Fund / SIF / Northern Cultural

Marketing/Outreach Update April 14 2014 Approach Increase awareness of deadline/federal

Content Development Content Creation No Matter the Resources Eric Webb Pamela Markey Senior

Get Out of Overwhelm: Using Delegation, Systems, and Automation to Get More Done in Less Time