Fundamental frequency (F0) Tần số cơ bản và ôn tập Three ways to find F0 ( tần số cơ bản, độ cao ): • From waveform; Cơ sở âm vị học và ngữ âm học • From (wide-band) spectrogram; • From spectrum. Lecture 16 Fundamental frequency (F0) Fundamental frequency (F0) Three ways to find F0 ( tần số cơ bản, độ cao ): Three ways to find F0 ( tần số cơ bản, độ cao ): • From waveform; • From waveform; • From (wide-band) spectrogram; • From (wide-band) spectrogram; • From spectrum. • From spectrum.
F0 from waveform F0 from waveform • F0 = 1000 / duration of a single period (in ms) • F0 = 1000 / duration of a single period (in ms) 0.2 0.2 Intensity Intensity 0 0 -0.2 -0.2 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2 0.22 0.24 0.26 0.28 0.3 0.32 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2 0.22 0.24 0.26 0.28 0.3 0.32 Time Time • Chu kỳ lặp lại bao nhiều lần trên một giây? • Chu kỳ lặp lại bao nhiều lần trên một giây? F0 from wide-band spectrogram F0 from wide-band spectrogram • F0 = 1000 / duration of interval between striations (in ms) • F0 = 1000 / duration of interval between striations (in ms) 5000 5000 4500 4500 4000 4000 Frequency (Hz) Frequency (Hz) 3500 3500 3000 3000 2500 2500 2000 2000 1500 1500 1000 1000 500 500 0 0 0 0.05 0.1 0.15 0.2 0.25 0.3 0 0.05 0.1 0.15 0.2 0.25 0.3 Time (s) Time (s) • Một đường = một mạch dây thanh • Một đường = một mạch dây thanh
F0 from spectrum F0 from spectrum • ...or H10 (H10/10 = F0) • Locate first harmonic ( tần số cộng hưởng thứ nhất =F0)... • Harmonics ( cộng hưởng ) are always multiples of F0 ( bội số của F0 ) Sound pressure level (dB / Hz) 40 Sound pressure level (dB / Hz) 40 20 20 0 0 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5000 0 Frequency (Hz) 0 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5000 Frequency (Hz) F0 from spectrum How do languages use F0? • ...or H10 (H10/10 = F0) • Harmonics ( cộng hưởng ) are always multiples of F0 ( bội số của F0 ) • At the lexical level, as tone ( thanh điệu ) • At the utterance level, as prosody ( điệu tính ) and Sound pressure level (dB / Hz) intonation ( ngữ điệu ) 40 • All three uses can occur simultaneously and 20 (semi-)independently. 0 0 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 5000 Frequency (Hz)
How do languages use F0? How do languages use F0? • At the lexical level, as tone ( thanh điệu ) • At the lexical level, as tone ( thanh điệu ) • At the utterance level, as prosody ( điệu tính ) and • At the utterance level, as prosody ( điệu tính ) and intonation ( ngữ điệu ) intonation ( ngữ điệu ) • All three uses can occur simultaneously and • All three uses can occur simultaneously and (semi-)independently. (semi-)independently. Lexical tone (Vietnamese) Lexical tone (Matbat)
Notes on F0 Notes on F0 • Most F0-extraction software (like Praat) essentially uses • Most F0-extraction software (like Praat) essentially uses the last method the last method • But... software can make mistakes, e.g. octave jump • But... software can make mistakes, e.g. octave jump (thinking H2 is H1) (thinking H2 is H1) • F0 trace is unreliable when speech is unvoiced/partially • F0 trace is unreliable when speech is unvoiced/partially devoiced or creaky (dấu nặng, dấu ngã...in Praat?) devoiced or creaky (dấu nặng, dấu ngã...in Praat?) Notes on F0 Prosodic structure • In English, pitch accents (phrasal stress, trọng âm ngữ đọan ) are (partly) motivated by information structure ( cấu trúc thông tin ) • Most F0-extraction software (like Praat) essentially uses the last method 350 350 350 300 300 HARRY’s 250 250 • But... software can make mistakes, e.g. octave jump Pitch (Hz) 200 200 going to Hawaii 150 150 (thinking H2 is H1) 100 100 50 50 0 0 0 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2 1.32816 1.3 • F0 trace is unreliable when speech is unvoiced/partially Time (s) devoiced or creaky (dấu nặng, dấu ngã...in Praat?) 350 350 300 250 HAWAII Harry’s Pitch (Hz) to 200 going 150 100 50 0 0 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2 1.23816 Time (s)
Prosodic structure Revision • In tone languages, such effects are (often) achieved • What acoustic properties of speech of the visual through pitch range expansion representations we have covered show? – Waveform – Spectrum – Spectrogram • What does each axis ( trục ) represent? • Which distinctive features of speech sounds can/can’t we measure in each representation? – Why are certain representations better/worse for different classes of speech sounds? From Remijsen, 2007 Revision Revision • What acoustic properties of speech of the visual • What acoustic properties of speech of the visual representations we have covered show? representations we have covered show? – Waveform – Waveform – Spectrum – Spectrum – Spectrogram – Spectrogram • What does each axis ( trục ) represent? • What does each axis ( trục ) represent? • Which distinctive features of speech sounds can/can’t • Which distinctive features of speech sounds can/can’t we measure in each representation? we measure in each representation? – Why are certain representations better/worse for different – Why are certain representations better/worse for different classes of speech sounds? classes of speech sounds?
Formant � = harmonic A formant is not the same • Formants are properties of the vocal tract ( đường dẫn âm ) - they are independent of pitch • Harmonics are multiples of F0 ( tâôn số cơ bản ) - if F0 as a changes, the harmonics change too harmonic! Formant � = harmonic Spectrogram reading • Be able to identify broad classes of sounds • Formants are properties of the vocal tract ( đường dẫn – nguyên âm ) - they are independent of pitch âm, bán nguyên âm, âm xát, âm tắc... • Vô thanh? Hữu thanh? • Harmonics are multiples of F0 ( tâôn số cơ bản ) - if F0 changes, the harmonics change too – clues: striations ( dải ) and voicing bar
Spectrogram reading Spectrogram reading: stops • Be able to identify broad classes of sounds – nguyên âm, bán nguyên âm, âm xát, âm tắc... • Vô thanh? Hữu thanh? – clues: striations ( dải ) and voicing bar http://www.cns.nyu.edu/ david/courses/perception/lecturenotes/speech/speech.html Spectrogram reading: vowels Spectrogram reading: vowels 200 200 300 300 i i u u 400 400 ! ! % % & & 500 500 $ $ " " 600 600 700 700 & & 800 800 # # æ æ 900 900 1000 1000 3000 2500 2000 1500 1000 3000 2500 2000 1500 1000 F2 (Hertz) F2 (Hertz) • Given F1 and F2, can you guess which vowel? • Given F1 and F2, can you guess which vowel? • Given a vowel, can you say if F1/F2 are high or low? • Given a vowel, can you say if F1/F2 are high or low?
First half: general First half: general 1. The spectrum shows: 1. The spectrum shows: a) amplitude over time a) amplitude over time b) amplitude over intensity b) amplitude over intensity c) frequency over time c) frequency over time d) amplitude over frequency d) amplitude over frequency 2. If a complex wave has three component waves with 2. If a complex wave has three component waves with fundamental frequencies (F0) of 60 Hz, 90 Hz, and 120 fundamental frequencies (F0) of 60 Hz, 90 Hz, and 120 Hz, what is the fundamental frequency of the complex Hz, what is the fundamental frequency of the complex wave? wave? First half: general First half: general 3. True or false: the vowel [a] is always pronounced with 3. True or false: the vowel [a] is always pronounced with the same pitch (F0). the same pitch (F0). 4. Which statement is correct? 4. Which statement is correct? a) In a waveform printout of an utterance, you can easily tell a) In a waveform printout of an utterance, you can easily tell whether a vowel is an /o/ or /e/. whether a vowel is an /o/ or /e/. b) In a waveform printout of an utterance, you can easily tell b) In a waveform printout of an utterance, you can easily tell whether a sound is an /o/ or /f/. whether a sound is an /o/ or /f/. c) Both a) and b) are correct. c) Both a) and b) are correct. d) Neither a) nor b) are correct. d) Neither a) nor b) are correct.
Second half: Spectrogram reading Second half: Spectrogram reading 1. Which segment is located in region 1? 1. Which segment is located in region 1? a) [r] a) [r] b) [l] b) [l] c) [ N ] c) [ N ] d) [p] d) [p] 2. Which segment is located in region 2? 2. Which segment is located in region 2? a) [v] a) [v] b) [k] b) [k] c) [ i ] c) [ i ] d) [ d ] d) [ d ] Âm vị học • Trình bầy giống như kỳ thi giữa nhưng cũng có mốt vấn đề về biểu diễn tầng sâu....
Recommend
More recommend