neural reranking improves subjective quality of machine
play

Neural Reranking Improves Subjective Quality of Machine Translation: - PowerPoint PPT Presentation

Neural Reranking Improves Subjective Quality of Machine Translation Neural Reranking Improves Subjective Quality of Machine Translation: NAIST at WAT 2015 Graham Neubig, Makoto Morishita, Satoshi Nakamura Nara Institute of Science and


  1. Neural Reranking Improves Subjective Quality of Machine Translation Neural Reranking Improves Subjective Quality of Machine Translation: NAIST at WAT 2015 Graham Neubig, Makoto Morishita, ○Satoshi Nakamura Nara Institute of Science and Technology (NAIST) 2015-10-16 1

  2. Neural Reranking Improves Subjective Quality of Machine Translation Statistical Translation Frameworks Symbolic Models Continuous-space (Neural) Models Encoder-Decoder [Sutskever+ 14] Phrase-based MT [Koehn+ 03] 引 い て he has a cold <s> 彼 は 風 邪 を he has a cold い る he has a cold 彼 は 引いている 風邪 を he a cold has 彼 は 風邪 を 引いている 引 い て 彼 は 風邪 を 引いている 彼 は 風 邪 を <s> い る Attentional [Bahdanau+ 15] Tree-to-String MT [Liu+ 06] he has a cold S VP NP g 1 ,...,g 4 PRP VBZ DET NN a 1 a 2 a 3 a 4 he has a cold h i-1 h i 彼 は 風邪 を 引いている P(e i |F,e 1 ,...,e i-1 ) 2 r i-1

  3. Neural Reranking Improves Subjective Quality of Machine Translation Relative Merits/Demerits ● Symbolic Models ✔ Inner workings well understood ✔ Better at translating low-frequency words ● Continuous-space Models ✔ Easier to implement ✔ Produce more fluent output ✔ Probabilistic model – can score output of other systems! 3

  4. Neural Reranking Improves Subjective Quality of Machine Translation Reranking with Neural MT Models Input N-best w/MT Features Neural Features he has a cold 1. 彼 は 寒 さ を 持 っ て い る t=-0.5 l=-5.6 | -6.1 nmt=-5.8 2. 彼 は 風 邪 を 持 っ て い る t=-0.9 l=-5.8 | -6.7 nmt=-5.5 T2S/ 3. 彼 は 風 邪 を 引 い た t=-1.5 l=-5.3 | -6.8 nmt=-3.4 PBMT 4. 彼 は 風 邪 が あ る t=-1.9 l=-5.4 | -7.3 nmt=-5.2 Neural Model Reranking Rescored/Reranked N-best 1. 彼 は 風 邪 を 引 い た t=-1.5 l=-5.3 nmt=-3.4 | -9.2 2. 彼 は 寒 さ を 持 っ て い る t=-0.5 l=-5.6 nmt=-5.8 | -10.9 3. 彼 は 風 邪 を 持 っ て い る t=-0.9 l=-5.8 nmt=-5.5 | -11.2 4 4. 彼 は 風 邪 が あ る t=-1.9 l=-5.4 nmt=-5.2 | -12.5

  5. Neural Reranking Improves Subjective Quality of Machine Translation What Do We Know About Reranking? ● Reranking greatly improves BLEU score, even over strong baseline systems: Sutskever+ 2014 Alkhouli+ 2015 de-en ar-en en-fr BLEU BLEU BLEU Baseline 30.6 26.4 Base 33.3 Reranked 32.3 27.0 Rerank 36.5 5

  6. Neural Reranking Improves Subjective Quality of Machine Translation What Don't We Know About Reranking? ● Does reranking improve subjective impressions of results? ● What are the qualitative differences before/after reranking with neural MT models? 6

  7. Neural Reranking Improves Subjective Quality of Machine Translation Experiments 7

  8. Neural Reranking Improves Subjective Quality of Machine Translation Experimental Setup ● Data: ASPEC Scientific Abstracts ● Japanese ↔ English, Chinese ● Baseline: NAIST WAT2014 Tree-to-String System ● Strong baseline achieving high scores ● Implemented using Travatar (http://phontron.com/travatar) ● Neural MT Model: Attentional model ● Trained ~500k sent., 256 hidden nodes, 2 model ensemble ● Use words occurring 3+ times (vocab 50,000~80,000) ● Trained w/ lamtram (http://github.com/neubig/lamtram) ● Automatic Evaluation: BLEU, RIBES ● Manual Evaluation: WAT 2015 HUMAN Score 8

  9. Neural Reranking Improves Subjective Quality of Machine Translation Results +2.5 90 50 +1.4 +1.6 +1.8 +1.5 40 +1.8 85 +2.8 30 RIBES Base BLEU 80 +2.7 20 Rerank 75 10 0 70 en-ja ja-en zh-ja ja-zh en-ja ja-en zh-ja ja-zh Confirm what we know: Neural reranking helps automatic evaluation. +12.5 70 60 50 +23.7 +10.0 HUMAN 40 Base 30 Rerank 20 +4.2 10 0 en-ja ja-en zh-ja ja-zh 9 Show what we didn't know: Also help manual evaluation.

  10. Neural Reranking Improves Subjective Quality of Machine Translation What is Getting Better? ● Perform detailed categorization of the changes in Japanese-English results: 1. Is the sentence better/worse after ranking? 2. What is the main error corrected: insertion, deletion, substitution, reordering, or conjugation? 3. What is the detailed subcategory? 10

  11. Neural Reranking Improves Subjective Quality of Machine Translation Main Types of Errors Corrected/Caused Type Improved Degraded % Impr. Reordering 55 9 86% Deletion 20 10 67% Insertion 19 2 90% Substitution 15 11 58% Conjugation 8 1 89% Total 117 33 78% Overall improvements re-confirmed In particular fixing reordering, insertion, and conjugation errors 11

  12. Neural Reranking Improves Subjective Quality of Machine Translation #1 Detailed Improvement Category: Phrasal Reordering (+26, -4) Source 症例2 においては、直腸がんの肝転移に対する化学療法中に、 発赤、硬結、皮膚潰ようを生じた。 Ref In case 2, reddening, induration, and skin ulcer appeared during chemical therapy for liver metastasis of rectal cancer. Base In case 2, occurred during chemotherapy for liver metastasis of rectal cancer, flare, induration, skin ulcer. Rerank In case 2, the flare, induration, skin ulcer was produced during the chemotherapy for hepatic metastasis of rectal cancer. 12

  13. Neural Reranking Improves Subjective Quality of Machine Translation #2 Detailed Improvement Category: Auxiliary Verb Ins./Del. (+15, -0) Source これにより得られる支配方程式は壁面乱流のようなせん断乱流に も有用て゚ある。 Ref Governing equation derived by this method is useful for turbulent shear flow like turbulent flow near wall. Base The governing equation is obtained by this is also useful for such as wall turbulence shear flow. Rerank The governing equation obtained by this is also useful for shear flow such as wall turbulence. 13

  14. Neural Reranking Improves Subjective Quality of Machine Translation #3 Detailed Improvement Category: Coordinate Structures (+13, -2) Source レーザー加工は高密度光束による局所的な加熱とアブレーション により行う。 Ref Laser work is done by local heating and ablation with high density light flux. Base The laser processing is carried out by local heating by high- density luminous flux and ablation. Rerank The laser processing is carried out by local heating and ablation by high-density flux. 14

  15. Neural Reranking Improves Subjective Quality of Machine Translation #4 Detailed Improvement Category: Verb Agreement (+6, 0) Source ラングミュア‐ ブロジェット法や包接化にも触れた。 Ref Langmuir-Blodgett method and inclusion compounds are mentioned. Base Langmuir-Blodgett method and inclusion is also discussed. Rerank Langmuir-Blodgett method and inclusion are also mentioned. 15

  16. Neural Reranking Improves Subjective Quality of Machine Translation What Wasn't Helped: Terminology (+2, -4) Source 放射熱を利用する赤外線応用計測が応力解析に役立っている Ref Infrared ray applied measurement using radiant heat is useful for stress analysis. Base The infrared application measurement using radiant heat is useful in the stress analysis. Rerank Infrared ray application measurement using radiation heat is useful for stress analysis. 16

  17. Neural Reranking Improves Subjective Quality of Machine Translation Conclusion 17

  18. Neural Reranking Improves Subjective Quality of Machine Translation What Do We Know Now? ● Neural reranking improves subjective quality of machine translation output. ● Main gains are from grammatical factors, and not lexical selection. 18

  19. Neural Reranking Improves Subjective Quality of Machine Translation What Do We Still Not Know Yet? ● How do neural translation models compare with neural language models? ● How does reranking compare with pure neural MT? 19

  20. Neural Reranking Improves Subjective Quality of Machine Translation Thank You! 20

Recommend


More recommend