chive
play

CHiVE Varying Prosody in Speech Synthesis with a Linguistically - PowerPoint PPT Presentation

CHiVE Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network Vincent Wan, Chun-an Chan, Tom Kenter, Jakub Vit, Rob Clark Modelling intonation in prosody A conditional variational


  1. CHiVE Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network Vincent Wan, Chun-an Chan, Tom Kenter, Jakub Vit, Rob Clark

  2. Modelling intonation in prosody

  3. A conditional variational autoencoder captures the difgerent intonations

  4. Language has a hierarchical linguistic structure Sentence Words sil hello sil Syllables sil h+e l+ou sil Phonemes sil h e l ou sil Frames

  5. Add linguistic knowledge to the network

  6. The structured model is betuer Baseline (30.7%) CHiVE (46.1%) No preference (23.2%)

More recommend