Lisa van Staden Removing Nuisance Variables from Acoustic Word Embeddings
Obtaining transcriptions is expensive and not always possible. Popular methods for speech processing rely on transcribed speech. 1 Low-Resource Speech and Language Processing
• Query-by-Example Search: search speech using speech. We don’t always need to predict text labels: • Tasks need speech segments to be compared. 2 Tasks in LSL Processing
3 We want to map speech to these representation without using labels. Acoustic Word Embeddings
We want embeddings to be robust. Acoustic properties of speech from different speakers/sexes differ. 4 Nuisance Variables: Speaker and Sex
5 Current Models
• Improved models: Disentanglement with adverserial training. • Using embeddings in downstream tasks. • Investigate the phonetic information in embeddings. • Links to language acquisition. 6 What’s Next
Recommend
More recommend