How do you pronounce your name? Improving G2P with transliterations - PowerPoint PPT Presentation

How do you pronounce your name? Improving G2P with transliterations Aditya Bhargava and Grzegorz Kondrak University of Alberta ACL-HLT 2011

Introduction ● Name pronunciations can be fickle – Speech synthesis systems must handle them – Best G2P system can't account for how I decide my name is pronounced ● Existing transliterations encode this info – Ample data that can be easily mined from the Web

Objective: apply transliterations ͡ ʒʌɹʃwɪn/@ ɪn/@ Gershwin / d w n/? / ɡʌɹʃwɪn/@ ɪn/@ w n/? ...? ガーシュウィン Гершвин

Applying transliterations ● Assume existing G2P base systems – Produce n-best output lists ● Assume available transliteration ● Pick candidate output that is “most similar” to transliteration

Data ● G2P: Combilex – Provides “name” annotations ● Transliterations: NEWS Shared Task 2010 English-to-Hindi data ● Intersect data

Base systems ● Festival (Black et al., 1998) – CARTs – Popular end-to-end speech synthesis ● Sequitur (Bisani and Ney, 2008) – Generative joint n-grams – G2P only ● DirecTL+ (Jiampojamarn et al., 2008) – Discriminative phrasal decoding – G2P only

Similarity ● Similarity measures: – ALINE phoneme-to-phoneme aligner score ● Rule-based G2P converter for Hindi – M2M-Aligner alignment system score ● Extension of learned edit distance algorithm ● Two overall approaches: – Use highest similarity score – Combine similarity score with system score

Similarity: results 80 70 60 50 Word accuracy Base ALINE 40 M2M ALINE+Base M2M+Base 30 20 10 0 Festival Sequitur DirecTL+

Similarity: post mortem ● Difficult to do! ● Can't follow transliterations exactly – Differences in scripts – Differences in languages (phonologies) – Noisy data ● Need to smooth out this volatility ● Limited to one language

SVM re-ranking ● Many features – Similarity scores (M2M-Aligner) – Score differences – N-grams based on alignments between transcriptions and transliterations ● Similar to features used in DirecTL+

SVM re-ranking ● Many features – Similarity scores (M2M-Aligner) – Score differences – N-grams based on alignments between transcriptions and transliterations ● Similar to features used in ガ | ー | シュ | イ | DirecTL+ ン ɡ | | | w | n ɜ� ʃwɪn/@ ɪn/@

SVM re-ranking ● Allows many languages – English-to-{Bengali, Chinese, Hindi, Thai, Japanese, Kannada, Korean, Russian, Tamil} – Features repeated for each transliteration

SVM re-ranking

SVM re-ranking 80 75 70 Word accuracy Base SVM-score 65 SVM-ngram SVM-all 60 55 50 Festival Sequitur DirecTL+

Analysis ● SVM re-ranking gives significant improvements ● Festival and Sequitur get higher improvement – The better the base system, the harder it is to re-rank – n -gram features styled after DirecTL+ ● This benefits Festival and Sequitur ● Similar features in a novel direction can lead to improved performance

Analysis ● N-gram features most useful – Granular features – Includes unable-to-align feature k क カ X к ʧ Bacchus

Multiple languages 4 3.5 Absolute improvement in word accuracy 3 2.5 2 1.5 1 0.5 0 0 ≤1 ≤2 ≤3 ≤4 ≤5 ≤6 ≤7 ≤8 ≤9 Number of available transliterations

Future work ● Apply same re-ranking approach to different tasks (e.g. transliteration) and different data (e.g. transcriptions) – Very successful results so far ● Leverage noisy web transcriptions ● Incorporate supplemental information directly in system

Conclusion ● First use of transliterations for G2P ● Basic similarity-based methods don't work ● SVM re-ranking improves all tested base systems ● Multiple languages are vital ● Relevant scripts, etc. are online

How do you pronounce your name? Improving G2P with transliterations - PowerPoint PPT Presentation

How do you pronounce your name? Improving G2P with transliterations Aditya Bhargava and Grzegorz Kondrak University of Alberta ACL-HLT 2011 Introduction Name pronunciations can be fickle Speech synthesis systems must handle them

Namaste! ka ka ka Pronounce as in English Pink, Check, neck, pick, Kick, back, hack,

Learning attention for historical text normalization by learning to pronounce Marcel Bollmann 1

Robin Reeves, Regional Manager, International Office #LboroFamily One University Two Inspiring

for innovation improving for innovation improving Design Thinking for innovation improving New

Services in Portsmouth date Improving health services Improving health services Improving

Improving Improving Finances, Finances, Improving Improving Lives Lives www.jeanchatzky.com

Pennine Acute Hospitals NHS Trust: Improvement Journey 1 Pennine Improvement Plan Improving

Duke iGEM 2014 Methodology Scaling up Synthetic Biology Improving Improving Improving CRISPR

for Microbiology Improving the stability of microbiology Improving the stability of microbiology

Improving QOS in IP Networks Principles for QOS Guarantees Improving QOS in IP Networks

Improving Memory in Improving Memory in Children with Down syndrome Children with Down syndrome Dr

Improving Vehicle Efficiency with Aerodynamic Modifications by Phil Knox Improving aerodynamics

Improving the Consumer Improving the Consumer Expenditure Survey: A View from the Research

CSE 504: Project Proposal Jennifer Niederlnder 01/13/2016 Improving Security Testing

Improving availability of ASF Challenges and Opportunities Khieu Borin Director General for

Annual General Meeting Wednesday 2 nd October 2013 Improving health services Improving health

(14021601-3 ) A Welcome Lecture ! What Motivated This Course? Improving students

Improving the Arborway DCR Public Meeting Wednesday, October 14 th , 2015 - 6:30 p.m. 8:30

NCUK Marketing Activity MEET THE TEAM 4 KEY INITIATIVES IN 2018 Improving Improving our

WELCOME Improving health services Learning Lessons from Serious Case Reviews Improving

AUHUK & NHS England Improving Value Workshop Slides and Notes from Group Work Improving

Progress with improving electricity Progress with improving electricity industry performance 1

IMPROVING IMPROVING EFFECTIVENESS, EFFECTIVENESS, ACADEMIC GROWTH AND ACADEMIC GROWTH AND

Plymouth Larissa Milden, Active for All Service Manager What is Im Improving Lives Plymouth?

How do you pronounce your name? Improving G2P with transliterations - PowerPoint PPT Presentation

How do you pronounce your name? Improving G2P with transliterations Aditya Bhargava and Grzegorz Kondrak University of Alberta ACL-HLT 2011 Introduction Name pronunciations can be fickle Speech synthesis systems must handle them

Namaste! ka ka ka Pronounce as in English Pink, Check, neck, pick, Kick, back, hack,

Learning attention for historical text normalization by learning to pronounce Marcel Bollmann 1

Robin Reeves, Regional Manager, International Office #LboroFamily One University Two Inspiring

for innovation improving for innovation improving Design Thinking for innovation improving New

Services in Portsmouth date Improving health services Improving health services Improving

Improving Improving Finances, Finances, Improving Improving Lives Lives www.jeanchatzky.com

Pennine Acute Hospitals NHS Trust: Improvement Journey 1 Pennine Improvement Plan Improving

Duke iGEM 2014 Methodology Scaling up Synthetic Biology Improving Improving Improving CRISPR

for Microbiology Improving the stability of microbiology Improving the stability of microbiology

Improving QOS in IP Networks Principles for QOS Guarantees Improving QOS in IP Networks

Improving Memory in Improving Memory in Children with Down syndrome Children with Down syndrome Dr

Improving Vehicle Efficiency with Aerodynamic Modifications by Phil Knox Improving aerodynamics

Improving the Consumer Improving the Consumer Expenditure Survey: A View from the Research

CSE 504: Project Proposal Jennifer Niederlnder 01/13/2016 Improving Security Testing

Improving availability of ASF Challenges and Opportunities Khieu Borin Director General for

Annual General Meeting Wednesday 2 nd October 2013 Improving health services Improving health

(14021601-3 ) A Welcome Lecture ! What Motivated This Course? Improving students

Improving the Arborway DCR Public Meeting Wednesday, October 14 th , 2015 - 6:30 p.m. 8:30

NCUK Marketing Activity MEET THE TEAM 4 KEY INITIATIVES IN 2018 Improving Improving our

WELCOME Improving health services Learning Lessons from Serious Case Reviews Improving

AUHUK &amp; NHS England Improving Value Workshop Slides and Notes from Group Work Improving

Progress with improving electricity Progress with improving electricity industry performance 1

IMPROVING IMPROVING EFFECTIVENESS, EFFECTIVENESS, ACADEMIC GROWTH AND ACADEMIC GROWTH AND

Plymouth Larissa Milden, Active for All Service Manager What is Im Improving Lives Plymouth?

AUHUK & NHS England Improving Value Workshop Slides and Notes from Group Work Improving