Accent reclassification and speech recognition of Afrikaans, Black - PowerPoint PPT Presentation

Accent reclassification and speech recognition of Afrikaans, Black and White South African English Herman Kamper and Thomas Niesler Digital Signal Processing Laboratory Department of Electrical and Electronic Engineering Stellenbosch University UNIVERSITEIT • STELLENBOSCH • UNIVERSITY • your knowledge partner jou kennisvennoot

Introduction Accented English is highly prevalent in South Africa We consider three accents of South African English: ◮ Afrikaans English (AE) ◮ Black South African English (BE) ◮ White South African English (EE) For multi-accent speech recognition, accent labels must be assigned to training set utterances These are assigned by human annotators based on a speaker’s mother-tongue or ethnicity and might not necessarily be optimal for modelling purposes We consider the unsupervised reclassification of training set accent labels H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 2 / 14

Oracle and parallel recognition of AE and EE H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 3 / 14

Oracle and parallel recognition of AE and EE Oracle: Separate accent-specific recognisers for each accent AE AE Hypothesised speech recogniser transcription EE EE Hypothesised speech recogniser transcription H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 3 / 14

Oracle and parallel recognition of AE and EE Oracle: Separate accent-specific recognisers for each accent AE AE Hypothesised speech recogniser transcription EE EE Hypothesised speech recogniser transcription Parallel: Two accent-specific recognisers operating in parallel Select output with highest likelihood AE recogniser AE & EE Hypothesised speech transcription EE recogniser H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 3 / 14

Accent misclassifications Select output with highest likelihood AE recogniser AE Hypothesised speech transcription EE recogniser Correctly identified: The matching recogniser is selected H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 4 / 14

Accent misclassifications Select output with highest likelihood AE recogniser AE Hypothesised speech transcription EE recogniser Misclassification: A recogniser from another accent is selected H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 4 / 14

Oracle and parallel recognition of AE and EE Oracle: Separate accent-specific recognisers for each accent AE AE Hypothesised speech recogniser transcription EE EE Hypothesised speech recogniser transcription Parallel: Two accent-specific recognisers operating in parallel Select output with highest likelihood AE recogniser AE & EE Hypothesised speech transcription EE recogniser H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 5 / 14

Oracle and parallel recognition of AE and EE Oracle: Separate accent-specific recognisers for each accent AE AE Hypothesised speech recogniser transcription EE EE Hypothesised speech recogniser transcription Parallel: Two accent-specific recognisers operating in parallel Select output with highest likelihood AE recogniser AE & EE Hypothesised speech transcription EE recogniser Small improvements of parallel over oracle for AE+EE H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 5 / 14

Accent reclassification Conclusions from oracle vs. parallel recognition Misclassifications do not always lead to deteriorated accuracies The accent labels assigned to training/test utterances might not be the most appropriate H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 6 / 14

Accent reclassification Conclusions from oracle vs. parallel recognition Misclassifications do not always lead to deteriorated accuracies The accent labels assigned to training/test utterances might not be the most appropriate Propose accent reclassification Use first-pass acoustic models trained on the originally labelled data to reclassify the accent of training set utterances and then retrain the acoustic models H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 6 / 14

Accent reclassification Conclusions from oracle vs. parallel recognition Misclassifications do not always lead to deteriorated accuracies The accent labels assigned to training/test utterances might not be the most appropriate Propose accent reclassification Use first-pass acoustic models trained on the originally labelled data to reclassify the accent of training set utterances and then retrain the acoustic models: AE+EE: relatively similar accents BE+EE: relatively dissimilar accents H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 6 / 14

Accent reclassification Transcriptions with original accent labels Train Create transcrip- accent-specific tions with new HMMs accent labels Reclassified accent labels Use HMMs to No Last iteration? reclassify training set Yes Reclassified Multi-accent speech accent-specific recognition HMMs H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 7 / 14

Speech databases African Speech Technology (AST) databases: ◮ Afrikaans English (AE) database ◮ Black South African English (BE) database ◮ White South African English (EE) database Training set: approximately 6 hours of speech in each accent Test set: approximately 24 minutes of speech from 20 speakers in each accent Development set: used to optimise recognition parameters H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 8 / 14

Experimental setup Setup of systems Word recognition of continuous telephone speech Trained 8-mixture cross-word triphone HMMs Parameterisation: MFCCs, 1 st and 2 nd order derivatives, per-utterance CMN Accent-independent language models and pronunciation dictionaries H. Kamper (Stellenbosch University) Reclassification of SAE accents PRASA 2011 9 / 14

Accent reclassification and speech recognition of Afrikaans, Black - PowerPoint PPT Presentation

Accent reclassification and speech recognition of Afrikaans, Black and White South African English Herman Kamper and Thomas Niesler Digital Signal Processing Laboratory Department of Electrical and Electronic Engineering Stellenbosch University

Review reclassification criteria changes in SY 2018-19 Share preliminary reclassification

8-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches

Speech Processing Speech Processing Using Speech with Computers Overview Overview Speech vs

Katrin Lichterfeld Balancing accent, identity and culture 1. Linguistic diversity 2. Accent and

HMMS and Speech HMMS and Speech HMMS and Speech Recognition Recognition Recognition Presented

EECS E6870 converting speech to text Speech Recognition automatic speech recognition

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 25: Speech

FAIC Foreign Accent Imitation Corpus Sara Neuhauser University of Jena, Germany IAFPA 2011

CORRIDOR FINISHES ACCENT CARPET | Outlier in Brink, Bentley Mills ACCENT VINYL WALL COVERING |

Overview Focus Projection Focus Projection Focus to Accent Focus to Accent Restricted View of

Speech recognition Brief history Technology Computer Literacy 1 Lecture 22 How does

6-Text To Speech (TTS) Speech Synthesis Speech Synthesis Concept Speech Naturalness Phone

Speech Processing 11-492/18-492 Speech Processing 11-492/18-492 Speech Recognition Acoustic

Speech Processing 15-492/18-492 Speech Recognition Template matching Speech Recognition by

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 23: Speech

GPU-Accelerated GPU-Accelerated Large Vocabulary Continuous Speech Recognition Large

Accent Location 53.5

Motivating Dramatic Interactions Stefan Rank 1 Paolo Petta 1 , 2 1 Austrian Research Institute for

Career Development Workshops for Women in Physics Shobhana Narasimhan Jawaharlal Nehru Centre

10-701 Machine Learning Decision trees Optional additional reading: Mitchell Chapter 3 Types of

Challenges in Digital Sanskrit Lexicography The Example of Vedic Accent Felix Rau (University of

Prosody in Dialog Predicting Accent based on Information Structure and context Defining

Testing 221 238 197 223 171 213 Manoj Nambiar, Tata Blue 50% Tata Blue 25% Purple 50 %

Information Structure Annotation and Secondary Accents Arndt Riester 1 Stefan Baumann 2 1

Sambuz

Useful Links

Newsletter

Mail Us