
Effective Open Source Speech Recognition in Your Application

Peter Grasch (peter@grasch.net), #kde-speech


  1. Effective Open Source Speech Recognition in Your Application #kde-speech Peter Grasch peter@grasch.net

  2. The Basics
     Decoder
     Speech model: Acoustic model, Language model
     Acoustic model
     ● Sounds
     Language model
     ● Vocabulary
     ● Grammar
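
The split on this slide can be illustrated with a toy sketch. It assumes the sounds have already been turned into phoneme symbols, which is the hard part a real acoustic model handles statistically. The vocabulary entries, the grammar, and the function names `pronounce`, `mismatch`, and `decode` are all hypothetical and not taken from any of the toolkits in this talk.

```python
# Hypothetical toy data, not taken from any real speech model.
# Vocabulary: each word with its pronunciation as a list of phoneme symbols.
VOCABULARY = {
    "play": ["p", "l", "ey"],
    "stop": ["s", "t", "aa", "p"],
    "music": ["m", "y", "uw", "z", "ih", "k"],
}

# Grammar: the word sequences the language model accepts.
GRAMMAR = {("play", "music"), ("stop", "music"), ("stop",)}


def pronounce(sentence):
    """Concatenate the phonemes of every word in the sentence."""
    phonemes = []
    for word in sentence:
        phonemes.extend(VOCABULARY[word])
    return phonemes


def mismatch(heard, expected):
    """Count differing positions, plus the difference in length."""
    differing = sum(1 for a, b in zip(heard, expected) if a != b)
    return differing + abs(len(heard) - len(expected))


def decode(heard):
    """Return the grammar sentence whose pronunciation is closest to the input."""
    return min(GRAMMAR, key=lambda sentence: mismatch(heard, pronounce(sentence)))
```

The decoder here only ever returns sentences the grammar allows, so a slightly misheard phoneme still maps to a valid sentence. Real decoders use probabilistic scores instead of this simple position count.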

  3. Open Source Speech Recognition

     Toolkit                                  Decoder  Trainer  UI
     CMU SPHINX (PocketSphinx, SphinxTrain)   ✓        ✓
     Julius                                   ✓
     KALDI                                    ✓        ✓
     Simon                                    ✓        ✓        ✓

  4. Standard Architecture [diagram: Commands, Simond, Simon, Your application (?), Acoustic model, Language model]

  5. Standard Architecture [diagram: Commands, Simond, Simon, Scenario (×3), Your application, Acoustic model, Language model]

  6. Headless Architecture [diagram: Commands, Simond, Simon, Your application, Acoustic model, Language model]

  7. Embedded Architecture [diagram: Commands, Simond, Simon, Your application, Acoustic model, Language model, Decoder]

  8. Standard Architecture [diagram: Commands, Simond, Simon, Scenario (×3), Your application, Acoustic model, Language model]

  9. Writing your Scenario
     ● Lay out the commands you want to support
     ● Create:
       – Vocabulary
       – Grammar
       – Commands
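
The three parts of a scenario listed above can be sketched as plain data, together with a consistency check. This is an illustration only and does not use Simon's scenario file format. It assumes the grammar is expressed over word categories, and the words, transcriptions, action names, and the helper `check_scenario` are all hypothetical.

```python
# Hypothetical scenario for a media player; every name here is illustrative.

# Vocabulary: word -> (category, phonetic transcription).
vocabulary = {
    "play": ("Verb", "p l ey"),
    "pause": ("Verb", "p ao z"),
    "next": ("Modifier", "n eh k s t"),
    "track": ("Noun", "t r ae k"),
}

# Grammar: allowed sentence structures, as sequences of categories.
grammar = [
    ("Verb",),
    ("Modifier", "Noun"),
]

# Commands: spoken sentence -> action the application should perform.
commands = {
    "play": "player.play",
    "pause": "player.pause",
    "next track": "player.next",
}


def check_scenario(vocabulary, grammar, commands):
    """Return a list of problems: unknown words or sentences the grammar rejects."""
    problems = []
    for sentence in commands:
        words = sentence.split()
        unknown = [w for w in words if w not in vocabulary]
        if unknown:
            problems.append(f"{sentence!r}: unknown words {unknown}")
            continue
        # Map each word to its category and compare against the grammar.
        structure = tuple(vocabulary[w][0] for w in words)
        if structure not in grammar:
            problems.append(f"{sentence!r}: structure {structure} not in grammar")
    return problems
```

Laying out the commands first, as the slide suggests, makes this kind of check straightforward: every command sentence has to be built from vocabulary words and match one of the grammar structures.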

  10. Writing your Scenario Demonstration

  11. Tighter Integration: Write a Custom Command Plug-In
      ● Full, programmatic control of the scenario
      ● Meta information of recognition results:
        – Phonetic transcriptions
        – Confidence scores*
        – Alternative results*
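
How an application might use that meta information can be sketched as follows. This is not Simon's plug-in API: the `RecognitionResult` record, its field names, the `pick_result` helper, and the 0.7 threshold are all assumptions made for illustration, and confidence is assumed to lie between 0 and 1.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass
class RecognitionResult:
    """Hypothetical result record; field names are illustrative, not Simon's API."""
    sentence: str
    phonetic: str
    confidence: float  # assumed to lie in [0.0, 1.0]


def pick_result(
    results: List[RecognitionResult],
    known_commands: Set[str],
    threshold: float = 0.7,
) -> Optional[RecognitionResult]:
    """Return the most confident result that is a known command and meets the threshold."""
    candidates = [
        r for r in results
        if r.sentence in known_commands and r.confidence >= threshold
    ]
    if not candidates:
        return None  # nothing trustworthy enough; the caller may ignore the input
    return max(candidates, key=lambda r: r.confidence)
```

Looking at alternative results, and not only the top one, lets the application fall back to a slightly less likely sentence when the best guess is not a command it understands.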

  12. Tighter Integration: Write a Custom Command Plug-In Demonstration

  13. Q & A #kde-speech Peter Grasch peter@grasch.net

  14. Thank you for your attention
