Burst Spectrum as a Cue to Stop Consonant Voicing English Production - PowerPoint PPT Presentation

Burst Spectrum as a Cue to Stop Consonant Voicing English Production and Perception Results Eleanor Chodroff and Colin Wilson Johns Hopkins University

Summerfield and Haggard (1977), Lisker (1978), Repp (1979), Lisker (1986) voice onset time F1 onset F1 transition F 0 contour relative amplitude of aspiration following vowel duration spectral shape of the burst: lower frequencies for voiced stops Cues to stop consonant voicing

“Since most of our lax [voiced] stops were pronounced with vocal-cord vibration, their spectra contained a strong low-frequency component … ¡ The lax stops also show a significant drop in level in the high frequencies. This high-frequency loss is a consequence of the lower pressure associated with the production of lax stops and is therefore a crucial cue for this class of stops.” ¡ Halle, Hughes, and Radley (1957) Background: Production

coronals labials dorsals /t/ /t/ /d/ /d/ Δ /k/ /k/ /g/ /g/ Δ /p/ /p/ /b/ /b/ Δ 3600 3300 300 + 1940 1910 30 + 1910 1163 747 v 5649 5225 424 v 2261 2268 -7 v 4900 4400 500 w Hz Hz Hz ¡ + = Zue (1976) using peak frequency v = Parikh and Loizou (2005) using peak frequency w = Sundara (2005) using mean frequency (CoG) see also Van Alphen and Smits (2004), Vicenik (2010), Kirkham (2011) Background: Production

production study laboratory and TIMIT experiments

methods adapted from Forrest et al. (1988), Jongman et al. (2000), Sundara (2005) /p,t,k,b,d,g/ x /i, ɪ ,e, ɛ ,æ, ʌ , ɑ , ɔ ,o,u/ x /t/ ¡ N=18 (4 male) resampled at 16kHz pre-emphasized above 1000Hz high-pass filtered at 200Hz segmented from transient to voicing Laboratory Production: Methods

analysis as in Forrest et al. (1988), Hanson and Stevens (2003), Flemming (2007) § Computed 64-point FFT for 7 consecutive 3ms Hamming windows, shifted by 1ms § 7 PSDs averaged to give a smoothed spectrum § Center of Gravity (CoG) calculated from smoothed spectrum: amplitude-weighted mean frequency CoG = f 1 p(1) + … + f 32 p(32) Laboratory Production: Measurement

lab cor dor * ¡ 4967 5000 4664 4000 3521 * ¡ 3450 CoG (Hz) 3318 2833 3000 2000 1000 vcl vcd vcl vcd vcl vcd voicing Laboratory Production: Results

Mixed-effects linear regression Fixed effects sum-coded and maximal random effect structure voice β voice = 122, p < .01 × place β labial = -633, p < .001; β coronal = 916, p < .001 × gender β gender = 86, p < .01 Significant interactions examined with post-hoc comparisons ¡ labial coronal dorsal β voice = 224 β voice = 224 male n.s. p < .001 p < .05 female β voice = 253 n.s. n.s. p < .001 Crucially, the pattern of significance remains the same when tokens with glottal pulses near the release are excluded. ¡ Laboratory Production: Analysis

Byrd (1993), Keating et al. (1993) 630 different AE speakers Word-initial, pre-vocalic /p, t, k, b, d, g/ Words with high token freq. removed ( too, to, do, carry, dark ) ¡ Phoneme Tokens Phoneme Tokens /p/ 661 /b/ 668 /t/ 579 /d/ 547 /k/ 1179 /g/ 415 TIMIT: Methods

lab cor dor * ¡ 5000 4550 * ¡ 4000 3743 3704 (*) ¡ CoG (Hz) 3155 2941 3000 2672 2000 1000 vcl vcd vcl vcd vcl vcd voicing TIMIT: Results

Mixed-effects linear regression Fixed effects sum-coded and maximal random effect structure voice β voice = 320, p < .001 × place β labial = -314, p < .001; β coronal = 762, p < .001 × gender β gender = 205, p < .001 Significant interactions examined with post-hoc comparisons ¡ labial coronal dorsal β voice = 555 β voice = 460 ( β voice = 112 male p < .001 p < .001 p < .001) female β voice = 396 β voice = 280 ( β voice = 113 p < .001 p < .001 p < .05) Crucially, the pattern of significance remains the same, except for the dorsals, when tokens with glottal pulses near the release are excluded. ¡ TIMIT: Analysis

perception study laboratory and Mechanical Turk experiments

/t/-burst VOT continuum /d/-burst VOT continuum Trading relation between burst and VOT Keating (1979) Nittrouer (1999) Caldwell and Nittrouer (2013) ¡ Background: Perception

Keating (1979), Ganong (1980), Andruski et al. (1994) Labial Continua /bæt/-/pæt/ VOT (ms) 10 p 17 24 CoG: 3494Hz Dur: 10ms 31 b 38 45 CoG: 1513Hz Dur: 10ms 52 Laboratory Perception: Stimuli

Keating (1979), Ganong (1980), Andruski et al. (1994) Coronal Continua /dat/-/tat/ VOT (ms) 10 t 17 24 CoG: 5424Hz Dur: 10ms 31 d 38 45 CoG: 3601Hz Dur: 10ms 52 Laboratory Perception: Stimuli

Massaro and Cohen (1983), Hallé and Best (2007) Two-alternative forced choice Goodness rating identification Differences verified with logistic mixed- Differences verified with linear mixed- effects analysis with maximal random effect effects analysis with maximal random structures effect structures Order of labial and coronal conditions counterbalanced Within condition: 8 blocks of 14 stimuli in random order Laboratory Perception: Methods and analysis

1.00 ● ● ● ● labials 0.75 ● Proportion /p/ Response burst ● p 0.50 b ● 0.25 β burst = .54 ● 0.00 p<.001 10 20 30 40 50 VOT (ms) N=16 Laboratory Perception: Results

B P labials 3 2 1 standardized rating 0 burst p b − 1 − 2 − 3 − 4 10 17 24 31 38 45 52 10 17 24 31 38 45 52 VOT (ms) N=16 Laboratory Perception: Results

coronals 1.00 ● ● ● 0.75 ● Proportion /t/ Response burst ● t 0.50 d 0.25 ● β burst = .85 ● 0.00 ● p<.001 10 20 30 40 50 VOT (ms) N=16 Laboratory Perception: Results

D T coronals 3 2 1 standardized rating 0 burst t d − 1 − 2 − 3 − 4 10 17 24 31 38 45 52 10 17 24 31 38 45 52 VOT (ms) N=16 Laboratory Perception: Results

Kleinschmidt and Jaeger (2012), Eskanazi et al. (2013) Crowdsourcing service increasingly used in psycholinguistics and phonetic studies Greater diversity in participant population and listening conditions (noise!) Labials Coronals 12 headphones 9 headphones 3 external speakers 4 external speakers 1 internal speakers 3 internal speakers ¡ ¡ Mechanical Turk: Methods

labials 1.00 ● ● ● ● 0.75 Proportion /p/ Response burst ● p 0.50 b ● 0.25 ● β burst = .46 ● 0.00 p<.001 10 20 30 40 50 VOT (ms) N=16 Mechanical Turk: Results

1.00 coronals ● ● 0.75 Proportion /t/ Response ● burst ● t 0.50 d ● 0.25 ● β burst = .60 ● 0.00 ● p<.001 10 20 30 40 50 VOT (ms) N=16 Mechanical Turk: Results

Spectral shape of the burst is a cue to anterior stop consonant voicing Higher CoG for voiceless labials and coronals Spectral shape influences voicing identification Summary and Implications

Repp (1978), Allopenna et al. (1998), Benkí (2001), Stevens (2002), McMurray et al. (2008a) Place and voice perception are interdependent Cues to phonetic distinctions at burst landmark Early cue to voicing and incremental perception Summary and Implications

Thank you!

TIMIT lab cor dor 6000 ê 5000 lab cor dor 4000 6000 CoG (Hz) 3000 5000 2000 4000 CoG (Hz) 1000 3000 0 2000 female male female male female male é 1000 laboratory 0 female male female male female male Production: Results by Gender

B P labials 3 2 1 standardized rating 0 burst p b − 1 − 2 − 3 − 4 10 17 24 31 38 45 52 10 17 24 31 38 45 52 VOT (ms) N=16 Mechanical Turk: Results

D T coronals 3 2 1 standardized rating 0 burst t d − 1 − 2 − 3 − 4 10 17 24 31 38 45 52 10 17 24 31 38 45 52 VOT (ms) N=16 Mechanical Turk: Results

Study /p/ /b/ /t/ /d/ /k/ /g/ Language easure Study /p/ /b/ /t/ /d/ /k/ /g/ La Mea Zue 1976 Am. English Peak -- -- 3600 3300 1940 1910 Parikh and Loizou 2005 Am. English Peak 1910 1163 5649 5225 2261 2268 Sundara 2005 Ca. English CoG -- -- 4900 4400 -- -- Kirkham 2011 Br. English CoG -- -- 5220 4888 -- -- Van Alphen and Smits 2004 Dutch CoG 1160 830 3540 2140 -- -- Sundara 2005 Ca. French CoG -- -- 3800 3000 -- -- Vicenik 2010 Georgian CoG 4000 3200 5300 4600 3100 3100 CoG = Center of Gravity (mean frequency) Background: Production

Burst Spectrum as a Cue to Stop Consonant Voicing English Production - PowerPoint PPT Presentation

Burst Spectrum as a Cue to Stop Consonant Voicing English Production and Perception Results Eleanor Chodroff and Colin Wilson Johns Hopkins University Summerfield and Haggard (1977), Lisker (1978), Repp (1979), Lisker (1986) voice onset time

Cue validity Cue validity - predictiveness of a cue for a given category Central

Status of GEO burst analysis efforts Ik Siong Heng for the GEO burst group Outline

Depth Perception Deep Blue See April 5, 2020 PSYCH 4041 / 6014 Overview Cue Theory

CUE Library Information Everything you need to know about CUE Library. We're here to help!

Breaking the TV Habit Valerie Lanard @valer @gigabody Duhiggs Habit Cycle Routine Cue

Cue Based Feeding in the NICU ANNA ELSENBROCK, MS, OTR/L, CPST, CNT LAURA LUCAS, MS, RD, CSP, LD

Observation of THz CSR Observation of THz CSR Burst at UVSOR- -II II Burst at UVSOR 1 Miho

Gamma- -Ray Burst observation with GLAST Ray Burst observation with GLAST Gamma F. Piron F.

Spectrum Sharing in Cognitive Radio Networks By: H.Feizresan Summer 2009 1 Spectrum sharing in

Optical flow Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys Slides from S.

Spectrum The Electromagnetic Spectrum The EM spectrum is the ENTIRE range of EM waves in order

National Advanced Spectrum and Communications Test Network (NASCTN) T rusted Spectrum Testing 5G

DATA AT SWEDEN'S TELEVISION Ismail Elouafiq A wide spectrum of Apps A wide spectrum of Apps

Electromagnetic Spectrum Effects of Radiation Solar Output Solar Spectrum Visible Spectrum

Spread Spectrum Concept Frequency Hopping Spread Spectrum Direct Sequence Spread

Normal A Spectrum of Engineering Design Normal Radical A Spectrum of Engineering Design Normal

Statistical NLP Spring 2011 Lecture 4: Speech Recognition Dan Klein UC Berkeley Speech in a

Alkaline Activation as a procedure for the transformation of fly ashes into cementitious

Counting d.o.f.s in periodic frameworks Louis Theran (Aalto University / AScI, CS) Frameworks

Prtr rss

Voice Capture and Analysis Cody Narber Computer and Information Science Department Kansas State

Pitch (in speech) MATLAB tutorial series (Part 2.2) Pouyan Ebrahimbabaie Laboratory for Signal

The ABCs of MLT Aptitude vs. Achievement Who has music aptitude? An Introduction to Gordons

LCS 11: Cognitive Science Results of evaluations Perception in language acquisition Language

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us