localizing html web pages for francophone audiences with
play

Localizing HTML Web Pages for Francophone Audiences with Machine - PowerPoint PPT Presentation

Localizing HTML Web Pages for Francophone Audiences with Machine Translation By Johnny Driscoll Introduction What is Localization? Why Localize? Machine Translation (MT) Research began in the 1950s Statistical MT


  1. Localizing HTML Web Pages for Francophone Audiences with Machine Translation By Johnny Driscoll

  2. Introduction ● What is Localization? ● Why Localize?

  3. Machine Translation (MT) ● Research began in the 1950’s ● Statistical MT ● Human evaluation ● Automated evaluation (BLEU) ● Google Translate (2006) ● Neural MT (2016)

  4. Can MT provide high-quality automated Localization? ● Python Implementation Python middleware Beautifulsoup for parsing the text HTML Scraping in HTML ● Translation pipeline Python Code Write translated which sends text text to new .txt file through google for evaluation translate and returns result

  5. Data ● 3 different static web pages a. Union College CS Homepage b. Political article from Yahoo News c. Blog post from MIT’s Technology Revie ● Reference Translations

  6. Outline of Methods ● 14 Participants ● Participants’ Background in French ● Participants evaluated each translation ● No access to reference translations ● Quantitative and Qualitative feedback ○ Adequacy + Fluency Scores ○ Post-Editing Marks

  7. Preliminary Questions ● Native French speaker? ● Did you grow up around French speakers? ● Highest level French course?

  8. Quantitative Results ● Adequacy ● Fluency

  9. Qualitative Results ● Word choice errors ● Verb tense errors ● Word Placement errors ● Examples from translations

  10. Conclusion ● French-Speakers surprised with quality of translations ● Use NMT as preliminary translation service ● Reduce work for human editors

  11. Future Work ● Get results for automated metric (BLEU score) ● Evaluate on more diverse set of websites/translations

  12. Acknowledgements I would like to thank my advisors Nick Webb and Charles Batson, as well as my parents and siblings for their support and guidance throughout this project. I would also like to thank David Frey for always being a helpful and positive presence in the department.

Recommend


More recommend