temporal code temporal code temporal code
play

Temporal Code Temporal Code Temporal Code (Acoustic Front-end) - PowerPoint PPT Presentation

Temporal Code Temporal Code Temporal Code (Acoustic Front-end) Human Recognition Machine Recognition RECOGNIZED UTTERANCE LANGUAGE MODELING (Back-end) HYPOTHESIZED UTTERANCES STATISTICAL SEQUENCE RECOGNITION ACOUSTIC REPRESENTATION


  1. Temporal Code

  2. Temporal Code

  3. Temporal Code (Acoustic Front-end)

  4. Human Recognition

  5. Machine Recognition RECOGNIZED UTTERANCE LANGUAGE MODELING (Back-end) � HYPOTHESIZED UTTERANCES STATISTICAL SEQUENCE RECOGNITION ACOUSTIC REPRESENTATION ACOUSTIC FRONT END SPEECH WAVEFORM

  6. Human vs. Machine

  7. “Top-down” Processing

  8. Machine Training • Aurora-4 Speech Database � • Wall Street Journal (WSJO) Corpus � • Large Vocabulary Continuous Speech Recognition � • 7,138 clean speech utterances, 16kHz

  9. Human Training • Wernicke’s Area: Speech Understanding � • Broca’s Area: Speech Production

  10. Acoustic Model Hidden Markov Model (HMM) • Each triphone characterized by HMM consisting of 3 states, 8 Gaussian mixtures per state Transition Probability Emission Probability Density

  11. Acoustic Model • Maximum likelihood (ML) training applied to estimate a set of context-dependent triphone acoustic models

  12. Language Model • Standard 5k lexicon (CMU pronouncing Dictionary) • Tri-gram language model

  13. Decoder • Single-pass Viterbi beam search-based decoder

  14. Human Recognition � Noise-Vocoder � Tone-Vocoder

  15. CI Recognition

  16. Normal Hearing vs. CI • Cochlear Implant range (hatched area) compared with average normal hearing scores (filled squares)

  17. CI vs. Machine Recognition • ASR provided most accurate simulation ever!

  18. Machine Recognition • ASR derived by world’s best auditory scientists

  19. Effects of Training

  20. Effects of Training

  21. Effects of Training

  22. Clinical Implications • Alter Frequency Allocation � • Deactivate Interfering Electrodes � • Alter Compression Curve � • Modify Electric Pulse Width

  23. Summary

  24. Information Technology • 2014, HMM can now improve Hearing Science

  25. Future Work • Design improved signal processing to mimic: � • 1) Place code of neurons � • 2) Neural Firing Rates

  26. FAME Strategy • Frequency Amplitude Modulation Encoder

  27. SOUND S pectral Or � U ndertone N ormalization D ecomposition

Recommend


More recommend