Cepstral analysis in speech processing

Nov 01, 2022

Lecture-oct4-a
03 October 2010, 11:20

From the speech production model, we have:

s[n] = (p[n] * g[n] + u[n]) * v[n] * r[n]

where * denotes convolution and
p[n] => periodic impulse train
u[n] => random white noise
g[n] => glottal filter impulse response
v[n] => vocal tract impulse response
r[n] => lip radiation system impulse response

Consider voiced speech:

s[n] = p[n] * g[n] * v[n] * r[n] => S(z) = P(z)H(z), where H(z) = G(z)V(z)R(z)

The convolved components p[n] and h[n] are additive in the complex cepstrum.

H(z) will give a complex cepstrum
• which is non-zero for both positive and negative time
• which decays rapidly for large n

P(z) gives a complex cepstrum consisting of decaying impulses at multiples of the pitch period.

The real cepstrum is the even part of the complex cepstrum.
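The additivity property can be checked numerically. The following is a minimal sketch that uses the real cepstrum rather than the complex cepstrum, which avoids phase unwrapping. The two short sequences a and b, the FFT length, and the helper name real_cepstrum are illustrative assumptions, not part of the lecture.

```python
import numpy as np

def real_cepstrum(x, nfft):
    """Inverse DFT of the log magnitude spectrum of x (zero-padded to nfft)."""
    log_mag = np.log(np.abs(np.fft.fft(x, nfft)))
    return np.fft.ifft(log_mag).real

# Two short illustrative sequences with no zeros on the unit circle,
# so the log magnitude is finite at every DFT bin.
a = np.array([1.0, 0.5])
b = np.array([1.0, -0.3, 0.2])
x = np.convolve(a, b)      # convolution in the time domain

nfft = 64                  # >= len(x), so DFT(x) = DFT(a) * DFT(b) exactly
c_x = real_cepstrum(x, nfft)
c_sum = real_cepstrum(a, nfft) + real_cepstrum(b, nfft)

# Convolution in time becomes addition in the cepstrum.
additive = np.allclose(c_x, c_sum)
# The real cepstrum is even: c[n] == c[-n], with indices taken modulo nfft.
even = np.allclose(c_x[1:], c_x[1:][::-1])
print(additive, even)
```

Both checks should hold up to floating-point rounding, because log|A(z)B(z)| = log|A(z)| + log|B(z)| on the unit circle.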
[Figure: screen clipping from cepstrum*murphy.pdf]

Example of some real cepstra:

[Figure: screen clipping from Oppenheim and Schafer, Discrete-time Signal Processing, PHI, 1989]

The example suggests that a window applied to the cepstrum can separate the two components.
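A window applied to the cepstrum is often called a lifter. The following is a minimal sketch of the idea on a synthetic voiced frame. The sampling rate, pitch period, resonance, and cutoff are illustrative values chosen for this example, and real_cepstrum and lifter_split are hypothetical helper names.

```python
import numpy as np

def real_cepstrum(x, nfft):
    """Inverse DFT of the log magnitude spectrum (small offset avoids log(0))."""
    log_mag = np.log(np.abs(np.fft.fft(x, nfft)) + 1e-12)
    return np.fft.ifft(log_mag).real

def lifter_split(c, cutoff):
    """Split a cepstrum into low- and high-quefrency parts.

    The low part keeps |n| < cutoff (negative n wraps to the end of the array),
    so it stays symmetric; the high part is everything else.
    """
    n = np.arange(len(c))
    keep_low = (n < cutoff) | (n > len(c) - cutoff)
    low = np.where(keep_low, c, 0.0)
    return low, c - low

# Synthetic voiced frame (assumed values): impulse train convolved with a
# single decaying resonance, then Hamming-windowed.
frame_len, period = 400, 80
k = np.arange(frame_len)
p = np.zeros(frame_len)
p[::period] = 1.0                                  # periodic excitation
h = 0.95 ** k * np.cos(2 * np.pi * 0.1 * k)        # stand-in for g * v * r
s = np.convolve(p, h)[:frame_len] * np.hamming(frame_len)

c = real_cepstrum(s, nfft=1024)
low, high = lifter_split(c, cutoff=40)             # cutoff below the pitch period

# Transforming the low-quefrency part back gives a smoothed log spectrum
# (the spectral envelope); the high-quefrency part holds the pitch structure.
smooth_log_spectrum = np.fft.fft(low).real
```

The cutoff has to sit below the shortest pitch period expected, otherwise the first excitation peak leaks into the low-quefrency part.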
Lecture-oct4-c
03 October 2010, 12:43

Speech parameter estimation

Short-time analysis is needed for:
• Formant estimation
• Pitch and voicing detection

The low-quefrency part of the cepstrum corresponds primarily to the vocal tract, glottal shaping and radiation. The high-quefrency part is due primarily to the excitation.

[Figure: part of "chase"; y-axis: increasing time. From Oppenheim and Schafer, Discrete-time Signal Processing, PHI, 1989]
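Because the excitation shows up in the high-quefrency part, one common way to estimate pitch is to pick the largest cepstral peak within a plausible range of pitch periods. The following is a minimal sketch under stated assumptions: the function name cepstral_pitch, the 50 to 400 Hz search range, the FFT length, and the synthetic test frame are all illustrative choices, not taken from the lecture.

```python
import numpy as np

def cepstral_pitch(frame, fs, nfft=1024, fmin=50.0, fmax=400.0):
    """Estimate pitch from the largest real-cepstrum peak in the quefrency
    range that corresponds to [fmin, fmax] Hz.

    Returns (f0 in Hz, period in samples, peak height).
    """
    windowed = frame * np.hamming(len(frame))      # short-time analysis window
    log_mag = np.log(np.abs(np.fft.fft(windowed, nfft)) + 1e-12)
    c = np.fft.ifft(log_mag).real
    q_lo = int(fs / fmax)                          # shortest period searched
    q_hi = int(fs / fmin)                          # longest period searched
    period = q_lo + int(np.argmax(c[q_lo:q_hi + 1]))
    return fs / period, period, c[period]

# Synthetic voiced frame (assumed values): 100 Hz impulse train at fs = 8 kHz
# convolved with a single decaying resonance.
fs, frame_len, true_period = 8000, 400, 80
k = np.arange(frame_len)
p = np.zeros(frame_len)
p[::true_period] = 1.0
h = 0.95 ** k * np.cos(2 * np.pi * 0.1 * k)
frame = np.convolve(p, h)[:frame_len]

f0_est, period_est, peak = cepstral_pitch(frame, fs)
print(f0_est, period_est, peak)
```

For this synthetic frame the estimate should fall at or very near the true period of 80 samples (100 Hz). A voicing decision could compare the peak height against a threshold, but any such threshold needs tuning on real data and is not specified in the lecture.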
