Speech recognition frontend on Cell BE Pavel Bazika - PowerPoint PPT Presentation

Dec 18, 2022 •386 likes •453 views

IBM - CVUT Student Research Projects Speech recognition frontend on Cell BE Pavel Bazika (bazikp1@fel.cvut.cz) Speech recognizer Input speech is represented by samples Inner format is 25ms length frames FRONTEND speech comparison

IBM - CVUT Student Research Projects Speech recognition frontend on Cell BE Pavel Bazika (bazikp1@fel.cvut.cz)
Speech recognizer • Input speech is represented by samples • Inner format is 25ms length frames FRONTEND speech comparison vocabulary •preprocessing •feature extraction word probability IBM - CVUT Student Research Projects 2
Algorithms needed for speech recognition • Mean value subtraction • Preemphasis • Hamming window selection } cepstrum • FFT • Logarithm • Triangular filters • DCT IBM - CVUT Student Research Projects 3
Speed of our algorithm • Four frames are computed at once • Cepstrum calculation of 25 ms length frame for input sampling frequency 8 kHz takes 3,7 μs • One SPU can process 2700 speeches in realtime IBM - CVUT Student Research Projects 4
Cepstrum calculation comparison with Pentium 4 30000 25000 20000 Time [ns] SPU F4S 15000 Pentium 4 10000 5000 0 0 200 400 600 800 1000 1200 Frame size IBM - CVUT Student Research Projects 5
Highlights • Optimized algorithms for SPU, dual-issue used when possible • FFT for four streams of data implemented • Pentium 4 is slower in every algorithm • Faster FFT than FFTW with SSE2 enabled • Input samples are converted to inner format in parallel with mean value computation IBM - CVUT Student Research Projects 6

Recommend

8-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches

8-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches Recognition Theories Bayse Rule Simple Language Model P(A|W) Network Types 1 7-Speech Recognition (Cont d) HMM Calculating Approaches

1.08k views • 74 slides

Speech Processing Speech Processing Using Speech with Computers Overview Overview Speech vs

Speech Processing Speech Processing Using Speech with Computers Overview Overview Speech vs Text Speech vs Text Same but different Same but different Core Speech Technologies Core Speech Technologies Speech Recognition Speech

707 views • 38 slides

EECS E6870 converting speech to text Speech Recognition automatic speech recognition

What Is Speech Recognition? EECS E6870 converting speech to text Speech Recognition automatic speech recognition (ASR), speech-to-text (STT) what its not Michael Picheny,

346 views • 22 slides

HMMS and Speech HMMS and Speech HMMS and Speech Recognition Recognition Recognition Presented

HMMS and Speech HMMS and Speech HMMS and Speech Recognition Recognition Recognition Presented by Jen-Wei Kuo Reference 1. X. Huang et. al., Spoken Language Processing, Chapter 8 2. Daniel Jurafsky and James H. Martin, Speech and Language

1.06k views • 65 slides

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 25: Speech

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 25: Speech synthesis (Concluding lecture) Instructor: Preethi Jyothi Nov 6, 2017 Recall: SPSS framework O Speech Speech Train Parameter

277 views • 26 slides

L9: Frontend Abstractions Web Engineering 188.951 2VU SS20 Jrgen Cito L9: Frontend

L9: Frontend Abstractions Web Engineering 188.951 2VU SS20 Jrgen Cito L9: Frontend Abstractions Overview of abstractions that support building declarative and reactive frontend applications Case study demonstrating these

404 views • 22 slides

Speech recognition Brief history Technology Computer Literacy 1 Lecture 22 How does

Topics Definition of speech recognition Speech recognition Brief history Technology Computer Literacy 1 Lecture 22 How does speech recognition work 10/11/2008 Speaker recognition Problems of speech and speaker recognition

327 views • 6 slides

6-Text To Speech (TTS) Speech Synthesis Speech Synthesis Concept Speech Naturalness Phone

6-Text To Speech (TTS) Speech Synthesis Speech Synthesis Concept Speech Naturalness Phone Sequence To Speech Articulatory Approaches Concatenative Approaches HMM-based Approaches Rule-Based Approaches 1 Speech Synthesis Concept

751 views • 57 slides

Bacteria Without a Cell Wall L-forms Pros & Cons of Cell Wall Cell membrane Cell wall DNA

A New Chassis for Synthetic Biology: Bacteria Without a Cell Wall L-forms Pros & Cons of Cell Wall Cell membrane Cell wall DNA Cell membrane ribosomes RNA metabolites Bacterium with Bacterium cell wall without cell wall Previous

680 views • 35 slides

Speech Processing 11-492/18-492 Speech Processing 11-492/18-492 Speech Recognition Acoustic

Speech Processing 11-492/18-492 Speech Processing 11-492/18-492 Speech Recognition Acoustic modeling Pronunciation dictionary Acoustic Modeling Acoustic Modeling Speech and Signal Variability Speech and Signal Variability Measuring

625 views • 27 slides

Speech Processing 15-492/18-492 Speech Recognition Template matching Speech Recognition by

Speech Processing 15-492/18-492 Speech Recognition Template matching Speech Recognition by Templates A little history A little history Matching Templates Matching Templates DTW (Dynamic Time Warping) DTW (Dynamic

381 views • 24 slides

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 23: Speech

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 23: Speech Synthesis (Part I) Instructor: Preethi Jyothi Oct 30, 2017 T ext- T o- S peech Systems Storied History Von Kempelens speaking machine (1791)

290 views • 8 slides

GPU-Accelerated GPU-Accelerated Large Vocabulary Continuous Speech Recognition Large

GPU-Accelerated GPU-Accelerated Large Vocabulary Continuous Speech Recognition Large Vocabulary Continuous Speech Recognition for Scalable Distributed Speech Recognition for Scalable Distributed Speech Recognition Jungsuk Kim

603 views • 34 slides

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 1: Introduction

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 1: Introduction to Statistical Speech Recognition Instructor: Preethi Jyothi Lecture 1 Course Specifics About the course (I) Main Topics: Introduction to

525 views • 36 slides

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 1: Introduction

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 1: Introduction to Statistical Speech Recognition Instructor: Preethi Jyothi July 24, 2017 Course Specifics Pre-requisites Ideal Background: Completed one of

739 views • 44 slides

Cell Communication and Cell Signaling Why is cell signaling important? Why is cell signaling

Cell Communication and Cell Signaling Why is cell signaling important? Why is cell signaling important? Allows cells to communicate and coordinate functions/activities of the organism Usually involves the cell membrane Cell

412 views • 36 slides

Modeling And Visualizing Fire Without Getting Burned MCSD Seminar June 29, 2005 Glenn P. Forney

Modeling And Visualizing Fire Without Getting Burned MCSD Seminar June 29, 2005 Glenn P. Forney Overview Fire Models Fire modeling applications Gaining insight through visualization Smokeview Visualization Team FDS

947 views • 65 slides

measurements. ENRIS2019, 16-18 June 2019 1 Acknowledgements Dorien van der AA, project manager

6/11/2019 Setting up a very low noise continuous vibration monitoring system and evaluating the measurements. ENRIS2019, 16-18 June 2019 1 Acknowledgements Dorien van der AA, project manager 2015-2018 H.L. Offerhaus, chairholder E.H.

740 views • 48 slides

CHAPTER II I CHAPTER I Recurrent Neural Networks Recurrent Neural Networks CHAPTER II : I :

Ugur HALICI - METU EEE - ANKARA 11/18/2004 CHAPTER II I CHAPTER I Recurrent Neural Networks Recurrent Neural Networks CHAPTER II : I : Recurrent Neural Networks CHAPTER I Recurrent Neural Networks Introduction In this chapter first the

408 views • 27 slides

Interactive FIJI/ImageJ2 Plugin for Biological Image Segmentation Case Study: Wound Healing

Senior Project, Spring 2019 Interactive FIJI/ImageJ2 Plugin for Biological Image Segmentation Case Study: Wound Healing Analysis Nurzhan Sakenov, Bekzhan Kaspakov, Madiyar Katranov Adviser: Martin Lukac Wound Healing Analysis Cell migration

1.64k views • 11 slides

HD GP-GPU Systems for HPC Applications: Engines | SAR | RF Amps Sergio Tafur , &

Introduction Implementation HD GP-GPU Systems for HPC Applications: Engines | SAR | RF Amps Sergio Tafur , & Christopher Kung Center for Computational Science | Section Head (Acting) Code 5594 Productivity Enhancement,

798 views • 77 slides

The role of migration on family formation trajectories Evidence from the United States Andrs

The role of migration on family formation trajectories Evidence from the United States Andrs Felipe Castro Torres 1 Ph.D. candidate in Demography & Sociology University of Pennsylvania 1 Introduction Understanding differences in

778 views • 50 slides

Mining Markov Network Surrogates for Value-Added Optimisation Alexander Brownlee

Mining Markov Network Surrogates for Value-Added Optimisation Alexander Brownlee www.cs.stir.ac.uk/~sbr sbr@cs.stir.ac.uk Outline Value-added optimisation Markov network fitness model Mining the model Examples with benchmarks

532 views • 32 slides

The role of low-carbon technologies in climate mitigation Perspectives on feasibility of low

The role of low-carbon technologies in climate mitigation Perspectives on feasibility of low climate targets, sector-specific action and mitigation costs based on the EMF27 model inter-comparison project Volker Krey krey@iiasa.ac.at The

384 views • 27 slides

Download document

More recommend

Explore More Topics

Stay informed with curated content and fresh updates.

animals pets art culture automotive transportation business finance computer internet construction architecture education-career electronics communication

Speech recognition frontend on Cell BE Pavel Bazika - PowerPoint PPT Presentation

IBM - CVUT Student Research Projects Speech recognition frontend on Cell BE Pavel Bazika (bazikp1@fel.cvut.cz) Speech recognizer Input speech is represented by samples Inner format is 25ms length frames FRONTEND speech comparison

8-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches

Speech Processing Speech Processing Using Speech with Computers Overview Overview Speech vs

EECS E6870 converting speech to text Speech Recognition automatic speech recognition

HMMS and Speech HMMS and Speech HMMS and Speech Recognition Recognition Recognition Presented

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 25: Speech

L9: Frontend Abstractions Web Engineering 188.951 2VU SS20 Jrgen Cito L9: Frontend

Speech recognition Brief history Technology Computer Literacy 1 Lecture 22 How does

6-Text To Speech (TTS) Speech Synthesis Speech Synthesis Concept Speech Naturalness Phone

Bacteria Without a Cell Wall L-forms Pros & Cons of Cell Wall Cell membrane Cell wall DNA

Speech Processing 11-492/18-492 Speech Processing 11-492/18-492 Speech Recognition Acoustic

Speech Processing 15-492/18-492 Speech Recognition Template matching Speech Recognition by

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 23: Speech

GPU-Accelerated GPU-Accelerated Large Vocabulary Continuous Speech Recognition Large

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 1: Introduction

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 1: Introduction

Cell Communication and Cell Signaling Why is cell signaling important? Why is cell signaling

Modeling And Visualizing Fire Without Getting Burned MCSD Seminar June 29, 2005 Glenn P. Forney

measurements. ENRIS2019, 16-18 June 2019 1 Acknowledgements Dorien van der AA, project manager

CHAPTER II I CHAPTER I Recurrent Neural Networks Recurrent Neural Networks CHAPTER II : I :

Interactive FIJI/ImageJ2 Plugin for Biological Image Segmentation Case Study: Wound Healing

HD GP-GPU Systems for HPC Applications: Engines | SAR | RF Amps Sergio Tafur , &

The role of migration on family formation trajectories Evidence from the United States Andrs

Mining Markov Network Surrogates for Value-Added Optimisation Alexander Brownlee

The role of low-carbon technologies in climate mitigation Perspectives on feasibility of low

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Speech recognition frontend on Cell BE Pavel Bazika - PowerPoint PPT Presentation

IBM - CVUT Student Research Projects Speech recognition frontend on Cell BE Pavel Bazika (bazikp1@fel.cvut.cz) Speech recognizer Input speech is represented by samples Inner format is 25ms length frames FRONTEND speech comparison

8-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches

Speech Processing Speech Processing Using Speech with Computers Overview Overview Speech vs

EECS E6870 converting speech to text Speech Recognition automatic speech recognition

HMMS and Speech HMMS and Speech HMMS and Speech Recognition Recognition Recognition Presented

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 25: Speech

L9: Frontend Abstractions Web Engineering 188.951 2VU SS20 Jrgen Cito L9: Frontend

Speech recognition Brief history Technology Computer Literacy 1 Lecture 22 How does

6-Text To Speech (TTS) Speech Synthesis Speech Synthesis Concept Speech Naturalness Phone

Bacteria Without a Cell Wall L-forms Pros &amp; Cons of Cell Wall Cell membrane Cell wall DNA

Speech Processing 11-492/18-492 Speech Processing 11-492/18-492 Speech Recognition Acoustic

Speech Processing 15-492/18-492 Speech Recognition Template matching Speech Recognition by

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 23: Speech

GPU-Accelerated GPU-Accelerated Large Vocabulary Continuous Speech Recognition Large

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 1: Introduction

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 1: Introduction

Cell Communication and Cell Signaling Why is cell signaling important? Why is cell signaling

Modeling And Visualizing Fire Without Getting Burned MCSD Seminar June 29, 2005 Glenn P. Forney

measurements. ENRIS2019, 16-18 June 2019 1 Acknowledgements Dorien van der AA, project manager

CHAPTER II I CHAPTER I Recurrent Neural Networks Recurrent Neural Networks CHAPTER II : I :

Interactive FIJI/ImageJ2 Plugin for Biological Image Segmentation Case Study: Wound Healing

HD GP-GPU Systems for HPC Applications: Engines | SAR | RF Amps Sergio Tafur , &amp;

The role of migration on family formation trajectories Evidence from the United States Andrs

Mining Markov Network Surrogates for Value-Added Optimisation Alexander Brownlee

The role of low-carbon technologies in climate mitigation Perspectives on feasibility of low

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Bacteria Without a Cell Wall L-forms Pros & Cons of Cell Wall Cell membrane Cell wall DNA

HD GP-GPU Systems for HPC Applications: Engines | SAR | RF Amps Sergio Tafur , &