How to Speak a Language without Knowing It Xing Shi, Kevin Knight 1 - PowerPoint PPT Presentation

How to Speak a Language without Knowing It Xing Shi, Kevin Knight 1 Heng Ji 2 1 Information Sciences Institute Computer Science Department University of Southern California { xingshi, knight } @isi.edu 2 Computer Science Department Rensselaer Polytechnic Institute Troy, NY 12180, USA jih@rpi.edu June 24, 2014 Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 1 / 31

Overview Introduction 1 Data 2 Evaluation 3 Model 4 Training 5 Phoneme-based model Phoneme-phrase-based model Word-based model Hybrid training/decoding Experiments 6 Conclusion and Future work 7 Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 2 / 31

Introduction Can people speak a language they don’t know ? Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 3 / 31

Yes, use a phrasebook Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 4 / 31

Yes, use a phrasebook What if we want to say something beyond the phrasebook ? Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 7 / 31

Or, a speech-to-speech translator from: proto-knowledge.blogspot.com However, direct Human interactivity is much more fun ! Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 8 / 31

Our solution Easily pronounceable Both input T(S) and output T’(S) are in speaker’s language. Understandable by listener T’(S) sounds like T(F). T(F) and T(S) has the same meaning. Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 9 / 31

Our solution Demo Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 10 / 31

Our solution Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 11 / 31

Data A collection of 1312 < Chinese, English, Chinglish > phrasebook tuples. 1 1182 for training, 65 for development and 65 for test. 已经八点了 Chinese English It’s eight o’clock now 意思埃特额克劳克闹 (yi si ai te e ke lao ke nao) Chinglish 这件衬衫又时髦又便宜 Chinese English this shirt is very stylish and not very expensive 迪思舍特意思危锐思掉利失安的闹特危锐伊克思班西五 Chinglish 1 Dataset at http://www.isi.edu/natural-language/mt/chinglish-data.txt Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 12 / 31

Data Frequency Rank Chinese Chinglish 1 de si 2 shi te 3 yi de 4 ji yi 5 zhi fu Table : Top 5 frequent syllables in Chinese and Chinglish Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 13 / 31

Evaluation Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 14 / 31

Model: Cascade FSTs Chinese 谢谢你 translate.google.com MT Eword Thank you CMU Pron Dict FST A (Weide,2007) Epron TH EY N K Y UW wFST B wFST E Pinyin-split s an k e y ou FST C Deterministic Rules san ke you Pinyin FST D Pron Dict Chinglish 三可由 Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 17 / 31

Model: Cascade FSTs Chinese 谢谢你 translate.google.com MT Eword Thank you CMU Pron Dict FST A (Weide,2007) Epron TH EY N K Y UW wFST B Need to learn from data wFST E Pinyin-split s an k e y ou FST C Deterministic Rules san ke you Pinyin FST D Pron Dict Chinglish 三可由 Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 18 / 31

Phoneme-based model Construct < Epron, Pinyin-split > training pairs. Chinese Mapping schema: 1-to-1, 1-to-2 and 2-to-1. MT g r ae n Eword FST A g e r uan Epron EM to learn parameters in wFST B, e.g. wFST E wFST B P (g e | g ). Pinyin-split Viterbi alignments: FST C grand Pinyin g r ae n d FST D g e r uan d e 哥软的 Chinglish Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 19 / 31

Phoneme-based model labeled Epron Pinyin-split P ( p | e ) d d 0.46 d e 0.40 d i 0.06 s 0.01 ao r u 0.26 o 0.13 ao 0.06 ou 0.01 Table : Learned translation tables for the phoneme based model Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 20 / 31

Phoneme-based model Chinese MT Alignment using phoneme-based model is fine. Eword When decoding test data, choices of target FST A phonemes are context sensitive. Epron Decoding “grandmother”: wFST E wFST B g r ae n d m ah dh er g e r an d e m u e d e Pinyin-split FST C reference Pinyin-split sequence: Pinyin g e r uan d e m a d e FST D Chinglish Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 21 / 31

Phoneme-phrase-based model Intuition: model the substitution of longer sequences 2 . Viterbi alignment using Phoneme-based model: g r ae n d m ah dh er g e r uan d e m a d e Extract phoneme phrase pairs: g → g e g r → g e r ... r → r r ae n → r uan ... 2 (Koehn et al., 2003) Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 22 / 31

Word-based Model Chinese Construct < Eword,Pinyin > training pairs. MT Mapping schema: 1-to-[1,7]. Eword EM to learn parameters in wFST E, i.e. P (nai te | night ). FST A Viterbi alignments: Epron wake up wFST E wFST B wei ke a pu Pinyin-split Error happen due to sparsity: “tips” and “ti pu FST C si” only appear once. Pinyin accept tips a ke sha pu te ti pu si FST D Chinglish Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 23 / 31

Hybrid training Intuition: Combine two models during training Chinese phrase. MT Use phoneme-based model to help word-based model: Eword FST A Epron wFST B wFST E Pinyin-split FST C Pinyin Errors are fixed: FST D accept tips a ke sha pu te ti pu si Chinglish Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 24 / 31

Hybrid decoding Intuition: Combine two models during decoding phrase. Chinese MT Eword seen unseen FST A word word Epron wFST B wFST E Pinyin-split FST C Pinyin FST D Chinglish Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 25 / 31

Experiments: Sample system output 等等我 Chinese Reference English wait for me 唯特佛密 (wei te fo mi) Reference Chinglish 位忒佛密 (wei te fo mi) Hybrid Chinglish Human-dictated English wait for me ASR English wait for me 年夜饭都要吃些什么 Chinese Reference English what do you have for the Reunion dinner 沃特杜又海夫佛则锐又尼恩低呢 Reference Chinglish 我忒度优嗨佛佛得瑞优你恩低呢 Hybrid Chinglish Human-dictated English what do you have for the reunion dinner ASR English what do you high for 43 Union Cena Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 26 / 31

Experiments: English-to-Pinyin decoding accuracy Error Rate Model Coverage Error Rate on covered text Word based 29/65 0.042 0.664 Word-based hybrid training 29/65 0.029 0.659 Phoneme based 63/65 0.583 0.611 Phoneme-phrase based 63/65 0.136 0.194 Hybrid training/decoding 63/65 0.115 0.175 Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 27 / 31

Experiments: Human Dictation Accuracy Error Rate Model vs. reference English Dictation from Reference Chinglish 0.477 Phoneme based 0.696 Hybrid training and decoding 0.496 Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 28 / 31

Experiments: No Human in the Loop Error Rate Model vs. reference English Word based 0.925 Word-based hybrid training 0.925 Phoneme based 0.937 Phoneme-phrase based 0.896 Hybrid training and decoding 0.898 Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 29 / 31

Conclusion & Future work Conclusion Goal: Help people speak foreign languages Provide native phonetic spellings that approximate the sounds of foreign phrases Use a cascade of FSTs Improve the model by adding phrases and combining models in both training and decoding phase For future: More Language Pairs Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 30 / 31

Thank you! & QA Demo: http:\\cage.isi.edu:8080 Shi, X., Knight, K. and Ji, H. (USC & RPI) How to Speak a Language June 24, 2014 31 / 31

How to Speak a Language without Knowing It Xing Shi, Kevin Knight 1 - PowerPoint PPT Presentation

How to Speak a Language without Knowing It Xing Shi, Kevin Knight 1 Heng Ji 2 1 Information Sciences Institute Computer Science Department University of Southern California { xingshi, knight } @isi.edu 2 Computer Science Department Rensselaer

Advocacy 1 101 Key Elements Knowing what you want Knowing who youre talking to Knowing

Making Better Decisions as a HR Leader MTSHRM-September 2017 Retained Executive Search

New Sales Speak The 9 Biggest Sales Presentation Mistakes And How To Avoid Them New Sales Speak

Speak Up For Your Business Presentation Secrets for Entrepreneurs Ready to Tell, Speak Up For Your

Your Perfect Presentation: Speak in Front of Any Your Perfect Presentation: Speak in Front of Any

Your Perfect Presentation: Speak in Front of Any Audience Anytime Your Perfect Presentation: Speak

Your Perfect Presentation: Speak in Front of Any Your Perfect Presentation: Speak in Front of Any

Knowing Me Knowing You Funded by Young Roots Heritage Lottery, BCC and Hengrove Community Arts

Knowing me, knowing youthere is something we can do What is it? #Knowvember18 National

Anomaly Detection Fault Tolerance Anticipation Patterns John Allspaw SVP, Tech Ops Qcon

Outward Church: Knowing God Acts 4:23-31 Our doing for God must come from a deep knowing of

Without sustaining injury Without sustaining injury Without sustaining injury Without sustaining

SPEAK SPEAK YOUR L R LAN ANGUAGE E CAM AMPAIG PAIGN A Community & School Partnership

SPEAK YOUR LANGUAGE CAMPAIGN Celebrating the Power of Bilingualism Nimco Bulale, MPA Education

2015-05-07 TO SPEAK OR NOT TO SPEAK: THE ART OF PROMPTING Glenda Eberlein,

Outline Language learning Computers Computers Computers Topic 6: CALL Topic 6: CALL Topic 6:

Generative Models for Rapid Propagation of Information Propagation of Information Kirill

Root initiated routing state in RPL draft-ietf-roll-dao-projection Pascal Thubert IETF 105

0 to 1 : An Infinite Space? Shawn Lawson Ryan Ross Smith Rensselaer Polytechnic Institute Monash

64 bit Bare Metal Tristan Gingold Programming on RPI-3 gingold@adacore.com What is Bare Metal ?

SCIENTIFIC COMPUTING Samarth Shah (shah.samarth.p@gmail.com) 29/12/2012 About Me 2 B.Tech

Control-Flow Integrity Zhi Wang, Xuxian Jiang North Carolina State University Outline

Ontology Aided Smart Contract Execution for Unexpected Situations Farhad Mohsin, Xingjian Zhao,

Data Mining and Analysis: Fundamental Concepts and Algorithms dataminingbook.info Mohammed J.

Sambuz

Useful Links

Newsletter

Mail Us

How to Speak a Language without Knowing It Xing Shi, Kevin Knight 1 - PowerPoint PPT Presentation

How to Speak a Language without Knowing It Xing Shi, Kevin Knight 1 Heng Ji 2 1 Information Sciences Institute Computer Science Department University of Southern California { xingshi, knight } @isi.edu 2 Computer Science Department Rensselaer

Advocacy 1 101 Key Elements Knowing what you want Knowing who youre talking to Knowing

Making Better Decisions as a HR Leader MTSHRM-September 2017 Retained Executive Search

New Sales Speak The 9 Biggest Sales Presentation Mistakes And How To Avoid Them New Sales Speak

Speak Up For Your Business Presentation Secrets for Entrepreneurs Ready to Tell, Speak Up For Your

Your Perfect Presentation: Speak in Front of Any Your Perfect Presentation: Speak in Front of Any

Your Perfect Presentation: Speak in Front of Any Audience Anytime Your Perfect Presentation: Speak

Your Perfect Presentation: Speak in Front of Any Your Perfect Presentation: Speak in Front of Any

Knowing Me Knowing You Funded by Young Roots Heritage Lottery, BCC and Hengrove Community Arts

Knowing me, knowing youthere is something we can do What is it? #Knowvember18 National

Anomaly Detection Fault Tolerance Anticipation Patterns John Allspaw SVP, Tech Ops Qcon

Outward Church: Knowing God Acts 4:23-31 Our doing for God must come from a deep knowing of

Without sustaining injury Without sustaining injury Without sustaining injury Without sustaining

SPEAK SPEAK YOUR L R LAN ANGUAGE E CAM AMPAIG PAIGN A Community &amp; School Partnership

SPEAK YOUR LANGUAGE CAMPAIGN Celebrating the Power of Bilingualism Nimco Bulale, MPA Education

2015-05-07 TO SPEAK OR NOT TO SPEAK: THE ART OF PROMPTING Glenda Eberlein,

Outline Language learning Computers Computers Computers Topic 6: CALL Topic 6: CALL Topic 6:

Generative Models for Rapid Propagation of Information Propagation of Information Kirill

Root initiated routing state in RPL draft-ietf-roll-dao-projection Pascal Thubert IETF 105

0 to 1 : An Infinite Space? Shawn Lawson Ryan Ross Smith Rensselaer Polytechnic Institute Monash

64 bit Bare Metal Tristan Gingold Programming on RPI-3 gingold@adacore.com What is Bare Metal ?

SCIENTIFIC COMPUTING Samarth Shah (shah.samarth.p@gmail.com) 29/12/2012 About Me 2 B.Tech

Control-Flow Integrity Zhi Wang, Xuxian Jiang North Carolina State University Outline

Ontology Aided Smart Contract Execution for Unexpected Situations Farhad Mohsin, Xingjian Zhao,

Data Mining and Analysis: Fundamental Concepts and Algorithms dataminingbook.info Mohammed J.

Sambuz

Useful Links

Newsletter

Mail Us

SPEAK SPEAK YOUR L R LAN ANGUAGE E CAM AMPAIG PAIGN A Community & School Partnership