Cross-lingual language model pretraining




  1. Cross-lingual Language Model Pretraining. Alexis Conneau and Guillaume Lample, Facebook AI Research

  2. Why learn cross-lingual representations? [Figure: the same sentence in English, French, and German: “This is great.” / “C’est super.” / “Das ist toll.”]

  3. Cross-lingual language models

  4. Multilingual Masked Language Modeling (MLM). Similar to BERT, we pretrain a Transformer model with MLM, but on many languages: multilingual representations emerge from a single MLM model trained on many languages. [Figure: multilingual masked language modeling pretraining] Devlin et al. – BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (+ mBERT)
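
A minimal sketch of the masking step behind multilingual MLM, assuming a BERT-style 80/10/10 recipe; the [MASK] symbol, the toy sentences, and the high masking rate in the demo are illustrative assumptions, not the actual XLM code:

    import random

    MASK = "[MASK]"  # assumed mask symbol; the real tokenizer's special tokens may differ

    def mask_for_mlm(tokens, vocab, mask_prob=0.15, seed=0):
        """BERT-style masking: of the selected positions, 80% become MASK,
        10% become a random vocabulary token, 10% are left unchanged."""
        rng = random.Random(seed)
        inputs, targets = list(tokens), [None] * len(tokens)
        for i, tok in enumerate(tokens):
            if rng.random() < mask_prob:
                targets[i] = tok                   # the model must predict the original token
                r = rng.random()
                if r < 0.8:
                    inputs[i] = MASK
                elif r < 0.9:
                    inputs[i] = rng.choice(vocab)  # random replacement
        return inputs, targets

    # One shared model is trained on monolingual streams from many languages.
    streams = {
        "en": "this is great".split(),
        "fr": "c'est super".split(),
        "de": "das ist toll".split(),
    }
    vocab = [t for toks in streams.values() for t in toks]
    for lang, toks in streams.items():
        # mask_prob raised to 0.5 only so the tiny demo visibly masks something
        print(lang, mask_for_mlm(toks, vocab, mask_prob=0.5))

Because every language passes through the same model (and, in XLM, a shared subword vocabulary), the objective itself is unchanged from BERT; the cross-lingual behaviour comes from the shared parameters.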

  5. Translation Language Modeling (TLM). Multilingual MLM is unsupervised; with TLM we also leverage parallel data: pairs of parallel sentences are concatenated and masked together, to encourage the model to leverage cross-lingual context when making predictions. [Figure: translation language modeling (TLM) pretraining]
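
As a companion sketch, here is the TLM variant under the same toy setup: a translation pair is concatenated and masked on both sides, so a masked word can be recovered from context in either language. The "</s>" separator is an assumption, and the language embeddings and position resets described in the paper are omitted:

    import random

    def make_tlm_example(src_tokens, tgt_tokens, mask_prob=0.3, seed=0):
        """Concatenate a parallel sentence pair and mask tokens on both sides,
        so that a masked English word can be predicted from the French context
        and vice versa."""
        rng = random.Random(seed)
        pair = src_tokens + ["</s>"] + tgt_tokens      # "</s>" separator is an assumption
        inputs, targets = list(pair), [None] * len(pair)
        for i, tok in enumerate(pair):
            if tok != "</s>" and rng.random() < mask_prob:
                inputs[i], targets[i] = "[MASK]", tok
        return inputs, targets

    inputs, targets = make_tlm_example("this is great".split(), "c'est super".split())
    print(inputs)   # which positions are masked depends on the seed
    print(targets)  # non-None entries are the tokens the model must predict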

  6. Results on XLU (cross-lingual understanding) benchmarks

  7. Results on Cross-lingual Classification (XNLI). The pretrained encoder is fine-tuned on the English XNLI(*) training data and then tested on the 15 XNLI languages for zero-shot cross-lingual classification. Average XNLI accuracy over the 15 languages:
     XNLI baseline: 65.6 | mBERT: 66.3 | LASER: 70.2 | XLM (MLM): 71.5 | XLM (MLM+TLM): 75.1
     (*) Conneau et al. – XNLI: Evaluating Cross-lingual Sentence Representations (EMNLP 2018)
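
To make the zero-shot protocol concrete, here is a minimal sketch assuming a pretrained cross-lingual sentence encoder is available: a classification head is fine-tuned on English data only and then applied unchanged to other languages. The bag-of-embeddings encoder, toy vocabulary, single-sentence inputs, and labels below are stand-ins so the example runs (XNLI actually classifies premise-hypothesis pairs into three classes):

    import torch
    import torch.nn as nn

    VOCAB = {w: i for i, w in enumerate(
        "this is great bad c'est super mauvais das ist toll schlecht".split())}

    class StandInEncoder(nn.Module):
        """Stand-in for the pretrained cross-lingual encoder: mean of word embeddings."""
        def __init__(self, dim=16):
            super().__init__()
            self.emb = nn.Embedding(len(VOCAB), dim)
        def forward(self, tokens):
            ids = torch.tensor([VOCAB[t] for t in tokens])
            return self.emb(ids).mean(dim=0)

    encoder, head = StandInEncoder(), nn.Linear(16, 3)   # 3 NLI classes
    opt = torch.optim.Adam(list(encoder.parameters()) + list(head.parameters()), lr=0.05)

    # 1) Fine-tune on English training data only (toy labels).
    english_train = [("this is great".split(), 0), ("this is bad".split(), 2)]
    for _ in range(100):
        for toks, label in english_train:
            loss = nn.functional.cross_entropy(head(encoder(toks)).unsqueeze(0),
                                               torch.tensor([label]))
            opt.zero_grad(); loss.backward(); opt.step()

    # 2) Zero-shot evaluation: no French or German labels were ever seen.
    #    This only works to the extent that the encoder maps translations close
    #    together, which the randomly initialized stand-in does not guarantee.
    for sent in ["c'est super", "das ist toll"]:
        print(sent, "->", head(encoder(sent.split())).argmax().item())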

  8. Results on Unsupervised Machine Translation. Initialization is key in unsupervised MT to bootstrap the iterative back-translation (BT) process. Embedding-layer initialization is essential for neural unsupervised MT(*); initializing the full Transformer model significantly improves performance (+7 BLEU).
     BLEU: Embeddings pretrained: 27.3 | Full model pretrained (CLM): 30.5 | Full model pretrained (MLM): 34.3 | Supervised 2016 SOTA (Edinburgh): 36.2
     (*) Lample et al. – Phrase-based and neural unsupervised machine translation (EMNLP 2018)
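
A sketch of what "full model pretrained" means for the MT system: both the encoder and the decoder of the translation model are initialized from the pretrained cross-lingual encoder, while the decoder's encoder-decoder cross-attention has no pretrained counterpart and stays randomly initialized. Using torch.nn Transformer modules and relying on their matching parameter names is an assumption for illustration, not the actual XLM code:

    import torch.nn as nn

    d_model, nhead, num_layers = 512, 8, 6

    def build_encoder():
        layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        return nn.TransformerEncoder(layer, num_layers)

    # Pretend this state dict is the MLM-pretrained cross-lingual encoder checkpoint.
    pretrained_state = build_encoder().state_dict()

    encoder = build_encoder()
    decoder = nn.TransformerDecoder(
        nn.TransformerDecoderLayer(d_model, nhead, batch_first=True), num_layers)

    # Encoder of the MT model: load the pretrained weights directly.
    encoder.load_state_dict(pretrained_state)

    # Decoder: copy the tensors that exist in both (self-attention, feed-forward,
    # layer norms); the cross-attention and its layer norm keep their random init.
    dec_state = decoder.state_dict()
    copied = {k: v for k, v in pretrained_state.items()
              if k in dec_state and dec_state[k].shape == v.shape}
    dec_state.update(copied)
    decoder.load_state_dict(dec_state)
    print(f"{len(copied)}/{len(dec_state)} decoder tensors initialized from pretraining")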

  9. Results on Supervised Machine Translation. We also show the importance of pretraining for generation:
     • Pretraining both the encoder and the decoder improves the BLEU score
     • MLM pretraining is better than CLM (language model) pretraining
     • Back-translation + pretraining leads to the best BLEU score
     • Pretraining is more important when the supervised data is small
     [Chart: BLEU for no pretraining vs. full model pretrained (CLM) vs. full model pretrained (MLM), with and without back-translation]
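
Since back-translation appears in both the unsupervised and the supervised results, here is a toy sketch of the iterative back-translation idea: a translation direction is trained on synthetic pairs produced by the opposite direction from monolingual data. The word-for-word tables, translate, and train_step are stand-ins so the loop runs (and only one direction is updated, for brevity); this is not the actual NMT training procedure:

    def translate(sentence, table):
        """Stub model: word-by-word lookup, copying unknown words unchanged."""
        return [table.get(w, w) for w in sentence]

    def train_step(table, source, target):
        """Stub update: memorize the word pairs seen in this synthetic pair."""
        table.update(dict(zip(source, target)))

    fr_en = {"ceci": "this", "est": "is", "super": "great"}   # fr->en "model"
    en_fr = {}                                                # en->fr starts untrained
    fr_mono = [["ceci", "est", "super"]]                      # monolingual French only

    for _ in range(2):                                        # iterative BT rounds
        for fr in fr_mono:
            synthetic_en = translate(fr, fr_en)     # back-translate fr -> en
            train_step(en_fr, synthetic_en, fr)     # train en->fr on (synthetic en, real fr)

    print(translate("this is great".split(), en_fr))          # ['ceci', 'est', 'super']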

  10. Conclusion
     • Cross-lingual language model pretraining is very effective for XLU
     • New state of the art for cross-lingual classification on XNLI
     • Reduces the gap between unsupervised and supervised MT
     • Recent developments have improved XLM/mBERT models

  11. Thank you! Code and models available at github.com/facebookresearch/XLM. Lample & Conneau – Cross-lingual Language Model Pretraining (NeurIPS 2019)
