translation model interpolation for domain adaptation in
play

Translation Model Interpolation for Domain Adaptation in TectoMT - PowerPoint PPT Presentation

Rudolf Rosa , Ondej Duek, Michal Novk, Martin Popel {rosa,odusek,mnovak,popel}@ufal.mff.cuni.cz Translation Model Interpolation for Domain Adaptation in TectoMT Charles University in Prague Faculty of Mathematics and Physics Institute


  1. Rudolf Rosa , Ondřej Dušek, Michal Novák, Martin Popel {rosa,odusek,mnovak,popel}@ufal.mff.cuni.cz Translation Model Interpolation for Domain Adaptation in TectoMT Charles University in Prague Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics DMTW, Praha, 3 September 2015

  2. Why domain adaptation?  not to do MT as “Europarl science”  reasonably good for e.g. economic news  not particularly good for IT helpdesk Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 2/29

  3. Why domain adaptation?  not to do MT as “Europarl science”  reasonably good for e.g. economic news  not particularly good for IT helpdesk Jak pošlu přílohu mailem? How do I send an attachment by e-mail? Click on the paperclip icon and select the file. Klikněte na ikonu sponky a vyberte soubor. Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 3/29

  4. Why model interpolation?  available parallel data  out-of-domain (Europarl): 2 000 000 sentences  in-domain (IT helpdesk): 1 000 sentences Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 4/29

  5. Why model interpolation?  available parallel data  out-of-domain (Europarl): 2 000 000 sentences  in-domain (IT helpdesk): 1 000 sentences  train on out + in concatenation  effect of in negligible (overpowered by out ) Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 5/29

  6. Why model interpolation?  available parallel data  out-of-domain (Europarl): 2 000 000 sentences  in-domain (IT helpdesk): 1 000 sentences  train on out + in concatenation  effect of in negligible (overpowered by out )  train on in only  too weak, low coverage, worse than out alone Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 6/29

  7. Why model interpolation?  available parallel data  out-of-domain (Europarl): 2 000 000 sentences  in-domain (IT helpdesk): 1 000 sentences  train on out + in concatenation  effect of in negligible (overpowered by out )  train on in only  too weak, low coverage, worse than out alone  train 2 translation models, one on in , one on out  final TM = (weighted) interpolation of in TM and out TM Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 7/29

  8. Why TectoMT (deep transfer MT)? ✔ better meaning preservation ✔ better generalization from small data ✗ limited support for TM interpolation ✗ no support for tuning of TM weights Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 8/29

  9. Why TectoMT (deep transfer MT)? ✔ better meaning preservation ✔ better generalization from small data ✗ limited support for TM interpolation – solved ✗ no support for tuning of TM weights – TODO Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 9/29

  10. Outline  Motivation  TectoMT (deep transfer)  Translation model interpolation in TectoMT  Evaluation  Conclusion Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 10/29

  11. TectoMT deep transfer Jak pošlu přílohu mailem? Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 11/29

  12. TectoMT deep transfer Jak pošlu přílohu mailem? poslat v:fin Jak #PersPron příloha mail adv drop n:4 n:7 Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 12/29

  13. TectoMT deep transfer Jak pošlu přílohu mailem? poslat v:fin Jak #PersPron příloha mail adv drop n:4 n:7 Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 13/29

  14. TectoMT deep transfer Jak pošlu přílohu mailem? poslat v:fin Jak #PersPron příloha mail adv drop n:4 n:7 send v:fin Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 14/29

  15. TectoMT deep transfer Jak pošlu přílohu mailem? poslat v:fin Jak #PersPron příloha mail adv drop n:4 n:7 send v:fin How adv Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 15/29

  16. TectoMT deep transfer Jak pošlu přílohu mailem? poslat v:fin Jak #PersPron příloha mail adv drop n:4 n:7 send v:fin How #PersPron adv n:subj Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 16/29

  17. TectoMT deep transfer Jak pošlu přílohu mailem? poslat v:fin Jak #PersPron příloha mail adv drop n:4 n:7 send v:fin How #PersPron attachment adv n:subj n:obj Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 17/29

  18. TectoMT deep transfer Jak pošlu přílohu mailem? poslat v:fin Jak #PersPron příloha mail adv drop n:4 n:7 send v:fin How #PersPron attachment e-mail adv n:subj n:obj n:by+X Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 18/29

  19. TectoMT deep transfer Jak pošlu přílohu mailem? poslat v:fin Jak #PersPron příloha mail adv drop n:4 n:7 send v:fin How #PersPron attachment e-mail adv n:subj n:obj n:by+X How do I send an attachment by e-mail? Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 19/29

  20. T-lemma and formeme transfer poslat v:fin mail ? n:7 Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 20/29

  21. T-lemma and formeme transfer e-mail (n) 0.32 online (adv) 0.25 poslat address (n) 0.11 v:fin online (adj) 0.05 ... mail n:7 n:by+X 0.23 n:with+X 0.22 adv 0.13 n:via+X 0.04 ... Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 21/29

  22. T-lemma and formeme transfer e-mail (n) 0.32 online (adv) 0.25 poslat address (n) 0.11 v:fin online (adj) 0.05 ... mail e-mail n:7 n:by+X n:by+X 0.23 n:with+X 0.22 adv 0.13 n:via+X 0.04 ... Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 22/29

  23. Translation model interpolation P ( e-mail ∣ mail )= 2 3 MaxEnt mail ( e-mail ,context )+ 1 3 Static ( e-mail , mail ) Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 23/29

  24. Translation model interpolation  past: P ( e-mail ∣ mail )= 2 3 MaxEnt mail ( e-mail ,context )+ 1 3 Static ( e-mail , mail )  present: P ( e-mail ∣ mail )= 2 6 OutMaxEnt mail ( e-mail , context )+ 1 6 OutStatic ( e-mail , mail ) + 2 6 InMaxEnt mail ( e-mail , context ) + 1 6 InStatic ( e-mail , mail ) Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 24/29

  25. Translation model interpolation  future: P ( e-mail ∣ mail )= w 1 ⋅ OutMaxEnt mail ( e-mail ,context )+ w 2 ⋅ OutStatic ( e-mail , mail ) + w 3 ⋅ InMaxEnt mail ( e-mail ,context ) + w 4 ⋅ InStatic ( e-mail , mail )  interpolation weights tuned on development set Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 25/29

  26. Translation model interpolation  future: P ( e-mail ∣ mail )= w 1 ⋅ OutMaxEnt mail ( e-mail ,context )+ w 2 ⋅ OutStatic ( e-mail , mail ) + w 3 ⋅ InMaxEnt mail ( e-mail ,context ) + w 4 ⋅ InStatic ( e-mail , mail )  interpolation weights tuned on development set  not available at the moment:  tuning in TectoMT  development set Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 26/29

  27. Evaluation 40 38.9 out 38 37.0 in 36 interpol 33.7 34 32 31.3 30.6 30 28.4 28.3 28 27.1 26.5 26 24.3 24 23.3 23.0 21.5 22 21.4 20.4 20.4 20 18.5 18.5 18 16 EN→CS CS→EN EN→ES ES→EN EN→NL NL→EN Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 27/29

  28. Conclusion  TectoMT deep transfer system  IT helpdesk domain (1000 parallel sentences)  domain adaptation by TM interpolation  no tuning of interpolation weights  consistently positive results, up to +3 BLEU  future work  tuning of interpolation weights  other domain adaptation methods Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 28/29

  29. Thank you for your attention Rudolf Rosa , Ondřej Dušek, Michal Novák, Martin Popel {rosa,odusek,mnovak,popel}@ufal.mff.cuni.cz Translation Model Interpolation for Domain Adaptation in TectoMT Charles University in Prague Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics http://ufal.mff.cuni.cz/rudolf-rosa/ Rosa, Dušek, Novák, Popel: Translation Model Interpolation for Domain Adaptation in TectoMT 29/29

Recommend


More recommend