Meta-Learning for Low-Resource NMT
Introduction ● Historically, machine translation was statistical ● Neural Machine Translation has recently outperformed statistical systems ● However, statistical models still outperform NMT on low-resource language pairs
NMT: Previous Work ● Monolingual corpora ● Single task ● Mixed datasets ● Direct transfer learning
Meta-Learning in NMT ● Idea: improve on direct transfer learning through better fine-tuning
MAML for NMT ● 17 high-resource languages (meta-train on these, e.g. Spanish→English): Spanish, Italian, French, Portuguese, Danish, Greek, Polish, ... ● 4 low-resource languages (meta-test on these, e.g. Turkish→English): Turkish, Finnish, Romanian, Latvian ● Note: they simulate low-resource settings by sub-sampling
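The next two slides give the update rules; as a toy sketch of the loop they plug into, here is first-order MAML on simulated tasks (a linear model stands in for the NMT model; every name and constant below is an assumption for illustration, not the paper's setup):

import numpy as np

rng = np.random.default_rng(0)

def make_task():
    """Simulate one 'language pair': a random linear mapping with a small, sub-sampled dataset."""
    w_true = rng.normal(size=5)
    x = rng.normal(size=(32, 5))
    y = x @ w_true
    return x[:16], y[:16], x[16:], y[16:]   # support (inner update) / query (meta update) split

def grad(theta, x, y):
    """Gradient of mean squared error for the linear stand-in model."""
    return 2 * x.T @ (x @ theta - y) / len(x)

theta = np.zeros(5)          # meta-learned initialization
alpha, beta = 0.05, 0.01     # learner and meta learning rates (assumed values)

for step in range(1000):
    x_s, y_s, x_q, y_q = make_task()                 # sample a simulated low-resource task
    theta_k = theta - alpha * grad(theta, x_s, y_s)  # inner "gradient update"
    # First-order approximation: treat the query gradient at theta_k as the meta-gradient.
    theta = theta - beta * grad(theta_k, x_q, y_q)   # outer "meta-gradient update"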
MAML updates ● Gradient update (inner loop) ● Meta-gradient update (outer loop) ● First-order approximate meta-gradient update
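The equations themselves did not survive extraction; as a sketch, the standard MAML updates (with $\alpha$ and $\beta$ as the learner and meta learning rates, $D_{T_k}$ and $D'_{T_k}$ as the support and query sets of task $T_k$, and $\mathcal{L}$ the translation loss) are:

Gradient update: $\theta'_k = \theta - \alpha \nabla_\theta \mathcal{L}^{D_{T_k}}(\theta)$

Meta-gradient update: $\theta \leftarrow \theta - \beta \nabla_\theta \sum_k \mathcal{L}^{D'_{T_k}}(\theta'_k)$

First-order approximation: $\theta \leftarrow \theta - \beta \sum_k \nabla_{\theta'_k} \mathcal{L}^{D'_{T_k}}(\theta'_k)$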
Issue: meta-train and meta-test input spaces should match! ● Meta-train (Spanish→English): "En un lugar de la mancha, de cuyo nombre no puedo..." → "In some place in the Mancha, whose name..."; uses Spanish word embeddings (e.g. the embedding for "nombre") ● Meta-test (Turkish→English): "Benim adım kırmızı..." → "My name is Red..."; uses Turkish word embeddings (e.g. the embedding for "adım"), trained independently of the Spanish ones
Universal Lexical Representation ● Word embeddings trained independently on monolingual corpora: Spanish, English, French, and Turkish word embeddings ● Alongside these, the ULR defines universal embedding keys and universal embedding values
Universal Lexical Representation ● Key idea: we represent "nombre" as a linear combination of tokens in the ULR! ● The Spanish word embedding for "nombre" is passed through the universal transformation matrix and matched against the (transposed) universal embedding keys; the resulting weights are the weights of the linear combination over the universal embedding values
Universal Lexical Representation ● The Turkish word embedding for "adım" goes through the same pipeline (universal transformation matrix, universal embedding keys, linear combination of universal embedding values) ● Result: same embedding space as Spanish!
Training ● The universal transformation matrix and the universal embedding values are trainable ● The pre-trained language-specific word embeddings (e.g. the Spanish embedding for "nombre") and the universal embedding keys are kept fixed
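As a rough sketch of the linear-combination idea on these slides (a minimal NumPy version; the names, shapes, softmax temperature, and the trainable/fixed split noted in the comments are assumptions, not the authors' exact formulation):

import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def ulr_embed(word_vec, A, U_key, U_val, tau=0.05):
    """Represent a language-specific word vector as a mixture of universal tokens.

    word_vec : (d_q,)     pre-trained monolingual embedding, e.g. Spanish "nombre" (assumed fixed)
    A        : (d_q, d_k) per-language transformation matrix (assumed trainable)
    U_key    : (M, d_k)   universal embedding keys (assumed fixed)
    U_val    : (M, d_v)   universal embedding values (assumed trainable)
    tau      : softmax temperature (assumed)
    """
    query = word_vec @ A            # project the word into the shared key space
    scores = U_key @ query / tau    # match the query against every universal key
    weights = softmax(scores)       # mixture weights over universal tokens
    return weights @ U_val          # linear combination of universal embedding values

Because the Spanish "nombre" and the Turkish "adım" are both expressed as mixtures over the same universal embedding values, they land in one shared input space, which is what the meta-learner needs.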
Experiments
Experiments ● Comment: best to leave the decoder be! Why?
Comment: Gap narrows as more training examples are included
Critique: they don't evaluate on any real low-resource languages! ● Critique: we don't know how many training examples there are per task; it's k-shot, but what is k?