Using Word Embeddings to Enforce Document-Level Lexical Consistency

Using Word Embeddings to Enforce Document-Level Lexical Consistency in Machine Translation
Eva Martínez Garcia, Carles Creus, Cristina España-Bonet, Lluís Màrquez
EAMT 2017 – May 30th – Prague

Outline
1. Motivation
2. Lexical Consistency
3. Experiments
4. Conclusions & Future Work

Outline
1. Motivation
   - Document-Level Decoding
2. Lexical Consistency
3. Experiments
4. Conclusions & Future Work

MOTIVATION
- Traditionally, MT systems are designed at the sentence level
- Discourse information helps produce more coherent translations
- SMT: recent work at the document level
  - Usually focused on a specific phenomenon: pronominal anaphora, topic cohesion/coherence, lexical consistency, discourse connectives
  - Post-processing and re-ranking approaches
  - Document-level SMT decoders: Docent (Hardmeier et al. 2012, 2013) and Lehrer
- NMT: only some work introducing context information or tackling document-level phenomena

MOTIVATION: Sentence-Level Decoding [figure]

MOTIVATION: Document-Level Decoding [figure]

Outline
1. Motivation
2. Lexical Consistency
   - Semantic Space Lexical Consistency Feature (SSLC)
   - Lexical Consistency Change Operation (LCCO)
3. Experiments
4. Conclusions & Future Work

Lexical Consistency: Our Approach
Translations are more consistent when the same word is translated into the same form, or into different forms with similar or related meaning, throughout a document.
Goals:
- Avoid inconsistent translations of the same word
- Handle the lexical-choice problem
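A minimal sketch of how candidate inconsistencies could be spotted, assuming word-aligned (source, target) pairs are available for the whole document. The input format and the function name `find_inconsistent` are illustrative assumptions, not part of the presented system. The sketch only lists source words with more than one translation; it does not judge whether the differing forms are related in meaning.

```python
from collections import defaultdict


def find_inconsistent(aligned_pairs):
    """Return source words that received more than one target form.

    aligned_pairs: iterable of (source_word, target_word) tuples gathered
    over one document (hypothetical input format).
    """
    options = defaultdict(set)
    for source, target in aligned_pairs:
        options[source].add(target)
    # Keep only words with two or more distinct translations in the document.
    return {src: sorted(tgts) for src, tgts in options.items() if len(tgts) > 1}


# Toy document-level alignment, loosely based on the "desk" example.
pairs = [
    ("desk", "mostrador"), ("desk", "mesa"), ("desk", "escritorio"),
    ("week", "semana"), ("week", "semana"),
]
inconsistent = find_inconsistent(pairs)
print(inconsistent)
```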

Lexical Consistency: Example [figure]

SSLC Feature
Semantic Space Lexical Consistency Feature
Inspired by Semantic Space Language Models (SSLM):
- based on word embeddings
- maximize the similarity between a word and its context
Uses CBOW word2vec word embeddings trained on:
- bilingual tokens (target__source)
- monolingual tokens (target)
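As an illustration of the `target__source` token format, the sketch below builds bilingual tokens from a word alignment. The alignment format (a list of source-index/target-index pairs), the handling of unaligned and multiply aligned words, and the name `bilingual_tokens` are all assumptions made for this example; the slides do not specify how the tokens were produced.

```python
def bilingual_tokens(target_words, source_words, alignment):
    """Build target__source tokens from a word alignment.

    alignment: list of (source_index, target_index) pairs (assumed format).
    Unaligned target words are kept as plain target tokens (assumption).
    """
    aligned = {}
    for s_idx, t_idx in alignment:
        # If a target word has several links, keep the first one (assumption).
        aligned.setdefault(t_idx, source_words[s_idx])
    return [
        f"{word}__{aligned[i]}" if i in aligned else word
        for i, word in enumerate(target_words)
    ]


tokens = bilingual_tokens(
    ["el", "mostrador", "especial"],
    ["the", "special", "desk"],
    [(0, 0), (2, 1)],
)
print(tokens)
```

Sequences of such tokens could then be fed to a CBOW trainer in place of plain words.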

SSLC Feature
SSLC scores each occurrence of an inconsistently translated source word depending on:
- how distant the proposed translation is from the context of the occurrence
- the best adequacy that could be obtained using another translation option (seen in the document)

\[
\mathrm{score}(w) = \mathrm{sim}(\vec{w}, \vec{ctxt}_w) - \max_{k \in \mathrm{occ}(w)} \mathrm{sim}(\vec{w}_k, \vec{ctxt}_w)
\]
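The score can be sketched in a few lines of Python. This is a hedged illustration, not the authors' implementation: it assumes cosine similarity for sim, represents the context as the sum of the context word vectors, and uses made-up two-dimensional toy embeddings. The names `sslc_score` and `context_vector` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def context_vector(context_words, embeddings):
    """Sum of the context word vectors (assumed context representation)."""
    dim = len(next(iter(embeddings.values())))
    total = [0.0] * dim
    for word in context_words:
        for i, x in enumerate(embeddings.get(word, [0.0] * dim)):
            total[i] += x
    return total


def sslc_score(proposed, options, context_words, embeddings):
    """sim(proposed, context) minus the best sim over the document's options."""
    ctxt = context_vector(context_words, embeddings)
    best = max(cosine(embeddings[o], ctxt) for o in options)
    return cosine(embeddings[proposed], ctxt) - best


# Toy embeddings, invented for illustration only.
emb = {
    "mostrador": [1.0, 0.0],
    "mesa": [0.0, 1.0],
    "abre": [1.0, 0.0],
    "preguntas": [0.8, 0.6],
}
options = ["mostrador", "mesa"]
context = ["abre", "preguntas"]
score_good = sslc_score("mostrador", options, context, emb)
score_bad = sslc_score("mesa", options, context, emb)
print(score_good, score_bad)
```

When the proposed translation is itself among `options`, the score is at most 0, and it reaches 0 only when no other option seen in the document fits the context better.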

SSLC Feature [figure]

LCCO Change Operation
Lexical Consistency Change Operation
Boosts the decoding process by applying several changes at a time and producing more consistent translation candidates.
LCCO works as follows:
- Randomly chooses an inconsistently translated word
- Randomly chooses one of its translation options used in the document
- Retranslates its occurrences throughout the document
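The three steps above can be sketched as follows. This is a simplified illustration under stated assumptions: the document state is reduced to a dictionary mapping each source word to the list of target words chosen for its occurrences, and the name `lcco_step` is hypothetical. How the decoder scores and accepts the resulting candidate is not modeled here.

```python
import random


def lcco_step(doc_translations, rng=random):
    """Apply one LCCO-style change to a simplified document state.

    doc_translations: dict mapping a source word to the list of target words
    used for its occurrences in the document (assumed representation).
    Returns a new dict; the input is left unchanged.
    """
    updated = {w: list(ts) for w, ts in doc_translations.items()}
    # Step 1: randomly choose an inconsistently translated word.
    inconsistent = sorted(w for w, ts in updated.items() if len(set(ts)) > 1)
    if not inconsistent:
        return updated
    word = rng.choice(inconsistent)
    # Step 2: randomly choose one of the options already used in the document.
    option = rng.choice(sorted(set(updated[word])))
    # Step 3: retranslate every occurrence with that option.
    updated[word] = [option] * len(updated[word])
    return updated


state = {
    "desk": ["mostrador", "mesa", "escritorio"],
    "week": ["semana", "semana"],
}
candidate = lcco_step(state, random.Random(0))
print(candidate)
```

Because all occurrences change in a single step, the candidate is consistent for the chosen word, which single-word changes would reach only after several accepted steps.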

LCCO Change Operation [figure]

Outline
1. Motivation
2. Lexical Consistency
3. Experiments
   - Automatic Evaluation
   - Manual Evaluation
4. Conclusions & Future Work

Experiments - Settings
Word embeddings:
- CBOW word2vec implementation
- trained on: europarlv7, UN, MultiUN, subtitles2012
Corpus:
- training: europarlv7
- development: newscommentary2009
- test: newscommentary2010 (119 documents)
Baselines: Moses, Lehrer
Extended systems:
- using LCCO
- using document-level features: SSLMs, SSLC, SSLMs+SSLC

Automatic Evaluation

                    Development set              Test set
System              TER↓    BLEU↑   METEOR↑      TER↓    BLEU↑   METEOR↑
Moses               58.28   24.27   46.84        53.70   27.52   50.02
Lehrer              58.34   24.28   46.92        53.78   27.58   50.08
  +SSLMs            58.01   24.36   46.91        53.49   27.48   50.10
  +SSLC             58.38   24.26   46.90        53.77   27.61   50.07
  +SSLMs+SSLC       57.99   24.39   46.95        53.50   27.50   50.07
Lehrer+LCCO         58.36   24.27   46.92        53.77   27.57   50.07
  +SSLMs            58.04   24.35   46.92        53.43   27.60   50.15
  +SSLC             58.36   24.25   46.89        53.81   27.59   50.07
  +SSLMs+SSLC       58.06   24.34   46.93        53.46   27.57   50.12

- Differences are not statistically significant at the 95% confidence level
- Sentences that differ: between 8% and 42%
- LCCO applied to 8% of the documents

Manual Evaluation: task 1
- 100 sentences, randomly selected and presented in random order
- Translated by 17 different systems:
  - Moses
  - 8 Lehrer systems
  - 8 Lehrer + LCCO systems
- Task: rank the translations from best to worst by sentence-level translation quality (ties allowed)
- 3 annotators, with 70% to 72% pairwise annotator agreement
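The slides do not say how pairwise agreement was computed. One common way to measure it for rankings with ties, shown here only as an assumption, is to check for every pair of systems whether two annotators agree on the relation between them (better, worse, or tied). The function names and the toy rankings are hypothetical.

```python
from itertools import combinations


def relation(a, b):
    """Return -1, 0 or 1 for rank a being lower than, equal to, or higher than b."""
    return (a > b) - (a < b)


def pairwise_agreement(ranks_a, ranks_b):
    """Fraction of system pairs on which two annotators agree.

    ranks_a, ranks_b: dicts mapping a system name to its rank for one
    sentence (1 = best, equal ranks = tie). Assumed input format.
    """
    systems = sorted(set(ranks_a) & set(ranks_b))
    pairs = list(combinations(systems, 2))
    if not pairs:
        return 0.0
    agree = sum(
        relation(ranks_a[x], ranks_a[y]) == relation(ranks_b[x], ranks_b[y])
        for x, y in pairs
    )
    return agree / len(pairs)


# Toy rankings for one sentence, invented for illustration.
annotator_1 = {"Moses": 2, "Lehrer": 2, "Lehrer+LCCO": 1}
annotator_2 = {"Moses": 3, "Lehrer": 2, "Lehrer+LCCO": 1}
agreement = pairwise_agreement(annotator_1, annotator_2)
print(round(agreement, 3))
```

In this toy case the annotators disagree only on whether Moses and Lehrer are tied, so they agree on two of the three system pairs.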

Manual Evaluation: task 1
Results:
- Lehrer baselines are equivalent to Moses
- Lehrer+SSLC systems surpass Moses
- Bilingual information helps SSLC
- Best system: using SSLMs and SSLCbi together
- Same patterns when introducing LCCO

Manual Evaluation: task 2
- Comparison between systems with and without LCCO: baseline, SSLC, SSLMs+SSLC
- 10 selected documents with lexical changes made by LCCO
- Task: choose the document translation with the best lexical consistency and adequacy
Results:
- LCCO variants were preferred 60% of the time
- 20% of the comparisons were ties
- Systems with LCCO provided better translations

Manual Evaluation: example
source: [...] Due to the choice of the camera and the equipment, these portraits remember the classic photos. [...] The passion for the portrait led Bauer to repeat the idea [...]
reference: [...] Son retratos que, debido a la selección de la cámara y del material recuerdan la fotografía clásica. [...] La pasión por los retratos de Bauer le llevó a repetir la idea [...]
Moses: [...] Debido a la elección de la cámara y el equipo, estos retratos recordar el clásico fotos. [...] la pasión por el cuadro conducido Bauer a repetir la idea [...]
Lehrer+LCCO: [...] Debido a la elección de la cámara y el equipo, estos retratos recordar el clásico fotos. [...] la pasión por el retrato conducido Bauer a repetir la idea [...]

Manual Evaluation: example
source: A special desk was opened [...] “It has been in operation for over a week” respond the clerks at the desk [...] The desk is not overwhelmed with questions.
reference: [...] se abre una ventanilla especial [...] “Lleva funcionando una semana” responden los trabajadores tras ella [...] La ventanilla no logra disipar la avalancha de dudas.
Moses: [...] un mostrador especial se inició [...] “Funciona desde hace más de una semana” responder los ujieres en la mesa [...] El escritorio no es, sin duda, cargado con preguntas.
Lehrer+SSLC: [...] una mesa especial se abre [...] “Funciona desde hace más de una semana” responder los ujieres en la mesa [...] El escritorio no es, sin duda, cargado con preguntas.
Lehrer+LCCO: [...] un mostrador especial se abre [...] “Funciona desde hace más de una semana” responder los ujieres en la ventanilla [...] El mostrador no es abrumado con preguntas.

Outline
1. Motivation
2. Lexical Consistency
3. Experiments
4. Conclusions & Future Work

Conclusions
- We tackled lexical consistency at decoding time
- Introduced a new feature (SSLC) and a new change operation (LCCO)
  - SSLC uses word embeddings to measure lexical selection consistency
  - LCCO performs simultaneous lexical changes in a translation step, thus generating more consistent translation candidates
- Results:
  - Automatic evaluation metrics do not capture system differences
  - Human evaluators prefer the systems that use our strategies

Future Work
- Use information at the lemma and seme level to identify inconsistent translations
- Work with NMT systems:
  - Develop post-processing or re-ranking strategies
  - Introduce document-level information as input features
  - Explore new neural network architectures

Thank You!
