메뉴 건너뛰기




Volumn 134, Issue 2, 2013, Pages 1295-1313

Formant frequency estimation of high-pitched vowels using weighted linear prediction

Author keywords

[No Author keywords available]

Indexed keywords

ALL-POLE MODELING; FILTER COEFFICIENTS; FORMANT ESTIMATION; FORMANT FREQUENCY; FORMANT FREQUENCY ESTIMATION; FUNDAMENTAL FREQUENCIES; PREDICTION ERRORS; WEIGHTING FUNCTIONS;

EID: 84882383984     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.4812756     Document Type: Article
Times cited : (82)

References (51)
  • 1
    • 33747109882 scopus 로고    scopus 로고
    • An amplitude quotient based method to analyze changes in the shape of the glottal pulse in the regulation of vocal intensity
    • 10.1121/1.2211589
    • Alku, P., Airas, M., Björkner, E., and Sundberg, J. (2006). " An amplitude quotient based method to analyze changes in the shape of the glottal pulse in the regulation of vocal intensity.," J. Acoust. Soc. Am. 120, 1052-1062. 10.1121/1.2211589
    • (2006) J. Acoust. Soc. Am. , vol.120 , pp. 1052-1062
    • Alku, P.1    Airas, M.2    Björkner, E.3    Sundberg, J.4
  • 2
    • 65549101092 scopus 로고    scopus 로고
    • Closed phase covariance analysis based on constrained linear prediction for glottal inverse filtering
    • Alku, P., Magi, C., Yrttiaho, S., Bäckström, T., and Story, B. (2009). " Closed phase covariance analysis based on constrained linear prediction for glottal inverse filtering.," J. Acoust. Soc. Am. 120, 3289-3305.
    • (2009) J. Acoust. Soc. Am. , vol.120 , pp. 3289-3305
    • Alku, P.1    Magi, C.2    Yrttiaho, S.3    Bäckström, T.4    Story, B.5
  • 3
    • 0029684340 scopus 로고    scopus 로고
    • A comparison of glottal voice source quantification parameters in breathy, normal, and pressed phonation of female and male speakers
    • 10.1159/000266415
    • Alku, P., and Vilkman, E. (1996). " A comparison of glottal voice source quantification parameters in breathy, normal, and pressed phonation of female and male speakers.," Folia Phoniatr. Logop. 48, 240-254. 10.1159/000266415
    • (1996) Folia Phoniatr. Logop. , vol.48 , pp. 240-254
    • Alku, P.1    Vilkman, E.2
  • 4
    • 54049133744 scopus 로고    scopus 로고
    • Mixed-effects modeling with crossed random effects for subjects and items
    • 10.1016/j.jml.2007.12.005
    • Baayen, R., Davidson, D., and Bates, D. (2008). " Mixed-effects modeling with crossed random effects for subjects and items.," J. Mem. Lang. 59, 390-412. 10.1016/j.jml.2007.12.005
    • (2008) J. Mem. Lang. , vol.59 , pp. 390-412
    • Baayen, R.1    Davidson, D.2    Bates, D.3
  • 5
    • 34547517867 scopus 로고    scopus 로고
    • Adaptive Kalman filtering and smoothing for tracking vocal tract resonances using a continuous-valued hidden dynamic model
    • 10.1109/TASL.2006.876724
    • Deng, L., Lee, L., Attias, H., and Acero, A. (2007). " Adaptive Kalman filtering and smoothing for tracking vocal tract resonances using a continuous-valued hidden dynamic model.," IEEE Trans. Audio Speech Lang. Process. 15, 13-23. 10.1109/TASL.2006.876724
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , pp. 13-23
    • Deng, L.1    Lee, L.2    Attias, H.3    Acero, A.4
  • 6
    • 0026106454 scopus 로고
    • Discrete all-pole modeling
    • 10.1109/78.80824
    • El-Jaroudi, A., and Makhoul, J. (1991). " Discrete all-pole modeling.," IEEE Trans. Signal Process. 39, 411-423. 10.1109/78.80824
    • (1991) IEEE Trans. Signal Process. , vol.39 , pp. 411-423
    • El-Jaroudi, A.1    Makhoul, J.2
  • 8
    • 33947684811 scopus 로고
    • A four-parameter model of glottal flow
    • (Speech, Music and Hearing, Royal Institute of Technology, Stockholm, Sweden)
    • Fant, G., Liljencrants, J., and Lin, Q. (1985). " A four-parameter model of glottal flow.," STL-QPSR 4 (Speech, Music and Hearing, Royal Institute of Technology, Stockholm, Sweden), pp. 1-13.
    • (1985) STL-QPSR 4 , pp. 1-13
    • Fant, G.1    Liljencrants, J.2    Lin, Q.3
  • 9
    • 0034945901 scopus 로고    scopus 로고
    • SIM - Simultaneous inverse filtering and matching of a glottal flow model for acoustic speech signals
    • 10.1121/1.1379076
    • Fröhlich, M., Michaelis, D., and Strube, H. (2001). " SIM-Simultaneous inverse filtering and matching of a glottal flow model for acoustic speech signals.," J. Acoust. Soc. Am. 110, 479-488. 10.1121/1.1379076
    • (2001) J. Acoust. Soc. Am. , vol.110 , pp. 479-488
    • Fröhlich, M.1    Michaelis, D.2    Strube, H.3
  • 10
    • 84867695437 scopus 로고
    • A preliminary study of acoustic voice quality correlates
    • (Speech, Music and Hearing, Royal Institute of Technology, Stockholm, Sweden)
    • Gobl, C. (1989). " A preliminary study of acoustic voice quality correlates.," STL-QPSR 4 (Speech, Music and Hearing, Royal Institute of Technology, Stockholm, Sweden), pp. 9-22.
    • (1989) STL-QPSR 4 , pp. 9-22
    • Gobl, C.1
  • 11
    • 0345416476 scopus 로고
    • Analysis of digital and analog formant synthesizers
    • 10.1109/TAU.1968.1161954
    • Gold, B., and Rabiner, L. (1968). " Analysis of digital and analog formant synthesizers.," IEEE Trans. Audio Electroacoust. 16, 81-94. 10.1109/TAU.1968.1161954
    • (1968) IEEE Trans. Audio Electroacoust. , vol.16 , pp. 81-94
    • Gold, B.1    Rabiner, L.2
  • 12
    • 0004236492 scopus 로고
    • (Johns Hopkins University Press, Baltimore, MD)
    • Golub, G., and Van Loan, C. (1983). Matrix Computation (Johns Hopkins University Press, Baltimore, MD), p. 55.
    • (1983) Matrix Computation , pp. 55
    • Golub, G.1    Van Loan, C.2
  • 13
    • 0030764571 scopus 로고    scopus 로고
    • Dialect variation and formant frequency: The American English vowels revisited
    • 10.1121/1.419712
    • Hagiwara, R. (1997). " Dialect variation and formant frequency: The American English vowels revisited.," J. Acoust. Soc. Am. 102, 655-658. 10.1121/1.419712
    • (1997) J. Acoust. Soc. Am. , vol.102 , pp. 655-658
    • Hagiwara, R.1
  • 15
    • 0029060033 scopus 로고
    • Acoustic characteristics of American English vowels
    • 10.1121/1.411872
    • Hillenbrand, J., Getty, L., Clark, M., and Wheeler, K. (1995). " Acoustic characteristics of American English vowels.," J. Acoust. Soc. Am. 97, 3099-3111. 10.1121/1.411872
    • (1995) J. Acoust. Soc. Am. , vol.97 , pp. 3099-3111
    • Hillenbrand, J.1    Getty, L.2    Clark, M.3    Wheeler, K.4
  • 16
    • 0023687414 scopus 로고
    • Glottal airflow and transglottal air pressure measurements for male and female speakers in soft, normal, and loud voice
    • 10.1121/1.396829
    • Holmberg, E., Hillman, R., and Perkell, J. (1988). " Glottal airflow and transglottal air pressure measurements for male and female speakers in soft, normal, and loud voice.," J. Acoust. Soc. Am. 84, 511-529. 10.1121/1.396829
    • (1988) J. Acoust. Soc. Am. , vol.84 , pp. 511-529
    • Holmberg, E.1    Hillman, R.2    Perkell, J.3
  • 18
    • 0024017495 scopus 로고
    • On robust linear prediction of speech
    • 10.1109/29.1574
    • Lee, C.-H. (1988). " On robust linear prediction of speech.," IEEE Trans. Acoust. Speech Signal Process. 36, 642-650. 10.1109/29.1574
    • (1988) IEEE Trans. Acoust. Speech Signal Process. , vol.36 , pp. 642-650
    • Lee, C.-H.1
  • 19
    • 0003656289 scopus 로고
    • Stockholm, Sweden DS Dissertation, Dep. of Speech Comm. and Music Acoustics, Royal Inst. of Technol
    • Liljencrants, J. (1985). " Speech synthesis with a reflection-type line analog.," DS dissertation, Dep. of Speech Comm. and Music Acoustics, Royal Inst. of Technol., Stockholm, Sweden, pp. 1-125.
    • (1985) Speech Synthesis with a Reflection-Type Line Analog , pp. 1-125
    • Liljencrants, J.1
  • 20
    • 0027560122 scopus 로고
    • Robust signal selection for linear prediction analysis of voice speech
    • 10.1016/0167-6393(93)90019-H
    • Ma, C., Kamp, Y., and Willems, L. (1993). " Robust signal selection for linear prediction analysis of voice speech.," Speech Commun. 12, 69-81. 10.1016/0167-6393(93)90019-H
    • (1993) Speech Commun. , vol.12 , pp. 69-81
    • Ma, C.1    Kamp, Y.2    Willems, L.3
  • 21
    • 61849149377 scopus 로고    scopus 로고
    • Stabilised weighted linear prediction
    • 10.1016/j.specom.2008.12.005
    • Magi, C., Pohjalainen, J., Bäckström, T., and Alku, P. (2009). " Stabilised weighted linear prediction.," Speech Commun. 51, 401-411. 10.1016/j.specom.2008.12.005
    • (2009) Speech Commun. , vol.51 , pp. 401-411
    • Magi, C.1    Pohjalainen, J.2    Bäckström, T.3    Alku, P.4
  • 22
    • 0016495091 scopus 로고
    • Linear prediction: A tutorial review
    • 10.1109/PROC.1975.9792
    • Makhoul, J. (1975a). " Linear prediction: A tutorial review.," Proc. IEEE 63, 561-580. 10.1109/PROC.1975.9792
    • (1975) Proc. IEEE , vol.63 , pp. 561-580
    • Makhoul, J.1
  • 23
    • 0016519041 scopus 로고
    • Spectral linear prediction: Properties and applications
    • 10.1109/TASSP.1975.1162685
    • Makhoul, J. (1975b). " Spectral linear prediction: Properties and applications.," IEEE Trans. Acoust. Speech Signal Process. 23, 283-296. 10.1109/TASSP.1975.1162685
    • (1975) IEEE Trans. Acoust. Speech Signal Process. , vol.23 , pp. 283-296
    • Makhoul, J.1
  • 25
    • 0023422057 scopus 로고
    • Analysis of speech signals of short pitch period by a sample-selective linear prediction
    • 10.1109/TASSP.1987.1165282
    • Miyoshi, Y., Yamato, K., Mizoguchi, R., Yanagida, M., and Kakusho, O. (1987). " Analysis of speech signals of short pitch period by a sample-selective linear prediction.," IEEE Trans. Acoust. Speech Signal Process. 35, 1233-1240. 10.1109/TASSP.1987.1165282
    • (1987) IEEE Trans. Acoust. Speech Signal Process. , vol.35 , pp. 1233-1240
    • Miyoshi, Y.1    Yamato, K.2    Mizoguchi, R.3    Yanagida, M.4    Kakusho, O.5
  • 26
    • 0000668614 scopus 로고    scopus 로고
    • Robustness of group-delay-based method for extraction of significant instants of excitation from speech signals
    • 10.1109/89.799686
    • Murthy, P. S., and Yegnanarayana, B. (1999). " Robustness of group-delay-based method for extraction of significant instants of excitation from speech signals.," IEEE Trans. Speech Audio Process. 7, 609-619. 10.1109/89.799686
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , pp. 609-619
    • Murthy, P.S.1    Yegnanarayana, B.2
  • 27
    • 41049089736 scopus 로고    scopus 로고
    • Estimation of glottal closure instants in voiced speech using the DYPSA algorithm
    • Naylor, P., Kounoudes, A., Gudnason, J., and Brookes, M. (2007). " Estimation of glottal closure instants in voiced speech using the DYPSA algorithm.," IEEE Trans. Speech Audio Process. 15, 34-43.
    • (2007) IEEE Trans. Speech Audio Process. , vol.15 , pp. 34-43
    • Naylor, P.1    Kounoudes, A.2    Gudnason, J.3    Brookes, M.4
  • 28
    • 0015100168 scopus 로고
    • Automatic formant tracking by a Newton-Raphson technique
    • 10.1121/1.1912681
    • Olive, J. (1971). " Automatic formant tracking by a Newton-Raphson technique.," J. Acoust. Soc. Am. 50, 661-670. 10.1121/1.1912681
    • (1971) J. Acoust. Soc. Am. , vol.50 , pp. 661-670
    • Olive, J.1
  • 30
    • 0032595183 scopus 로고    scopus 로고
    • Modeling of the glottal flow derivative waveform with application to speaker identification
    • 10.1109/89.784109
    • Plumpe, M., Quatieri, T., and Reynolds, D. (1999). " Modeling of the glottal flow derivative waveform with application to speaker identification.," IEEE Trans. Speech Audio Process. 7, 569-586. 10.1109/89.784109
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , pp. 569-586
    • Plumpe, M.1    Quatieri, T.2    Reynolds, D.3
  • 31
    • 0030008906 scopus 로고    scopus 로고
    • Speech formant frequency and bandwidth tracking using multiband energy demodulation
    • 10.1121/1.414997
    • Potamianos, A., and Maragos, P. (1996). " Speech formant frequency and bandwidth tracking using multiband energy demodulation.," J. Acoust. Soc. Am. 99, 3795-3806. 10.1121/1.414997
    • (1996) J. Acoust. Soc. Am. , vol.99 , pp. 3795-3806
    • Potamianos, A.1    Maragos, P.2
  • 34
    • 84855199947 scopus 로고    scopus 로고
    • R Development Core Team. (R Foundation for Statistical Computing, Vienna, Austria)
    • R Development Core Team (2009). A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, Vienna, Austria), pp. 1-409.
    • (2009) A Language and Environment for Statistical Computing , pp. 1-409
  • 35
    • 79953288197 scopus 로고    scopus 로고
    • Time-varying autoregressions in speech: Detection theory and applications
    • 10.1109/TASL.2010.2073704
    • Rudoy, D., Quatieri, T., and Wolfe, P. (2011). " Time-varying autoregressions in speech: Detection theory and applications.," IEEE Trans. Audio Speech Lang. Process. 19, 977-989. 10.1109/TASL.2010.2073704
    • (2011) IEEE Trans. Audio Speech Lang. Process. , vol.19 , pp. 977-989
    • Rudoy, D.1    Quatieri, T.2    Wolfe, P.3
  • 36
    • 77952192470 scopus 로고    scopus 로고
    • Temporally weighted linear prediction features for tackling additive noise in speaker verification
    • 10.1109/LSP.2010.2048649
    • Saeidi, R., Pohjalainen, J., Kinnunen, T., and Alku, P. (2010). " Temporally weighted linear prediction features for tackling additive noise in speaker verification.," IEEE Signal Process. Lett. 17, 599-602. 10.1109/LSP.2010.2048649
    • (2010) IEEE Signal Process. Lett. , vol.17 , pp. 599-602
    • Saeidi, R.1    Pohjalainen, J.2    Kinnunen, T.3    Alku, P.4
  • 38
    • 85009098540 scopus 로고    scopus 로고
    • Estimating the spectral envelope of voiced speech using multi-frame analysis
    • Geneva, Switzerland
    • Shiga, Y., and King, S. (2003). " Estimating the spectral envelope of voiced speech using multi-frame analysis.," in Proceedings of Interspeech, Geneva, Switzerland, pp. 1737-1740.
    • (2003) Proceedings of Interspeech , pp. 1737-1740
    • Shiga, Y.1    King, S.2
  • 40
    • 29244463002 scopus 로고    scopus 로고
    • Synergistic modes of vocal tract articulation for American English vowels
    • 10.1121/1.2118367
    • Story, B. (2005). " Synergistic modes of vocal tract articulation for American English vowels.," J. Acoust. Soc. Am. 118, 3834-3859. 10.1121/1.2118367
    • (2005) J. Acoust. Soc. Am. , vol.118 , pp. 3834-3859
    • Story, B.1
  • 41
    • 31744450738 scopus 로고    scopus 로고
    • A technique for 'tuning' vocal tract area functions based on acoustic sensitivity functions
    • 10.1121/1.2151802
    • Story, B. (2006). " A technique for 'tuning' vocal tract area functions based on acoustic sensitivity functions.," J. Acoust. Soc. Am. 119, 715-718. 10.1121/1.2151802
    • (2006) J. Acoust. Soc. Am. , vol.119 , pp. 715-718
    • Story, B.1
  • 42
    • 37849046598 scopus 로고    scopus 로고
    • Comparison of magnetic resonance imaging-based vocal tract area functions obtained from the same speaker in 1994 and 2002
    • 10.1121/1.2805683
    • Story, B. (2008). " Comparison of magnetic resonance imaging-based vocal tract area functions obtained from the same speaker in 1994 and 2002.," J. Acoust. Soc. Am. 123, 327-335. 10.1121/1.2805683
    • (2008) J. Acoust. Soc. Am. , vol.123 , pp. 327-335
    • Story, B.1
  • 43
    • 84875935510 scopus 로고    scopus 로고
    • Phrase-level speech simulation with an airway modulation model of speech production
    • 10.1016/j.csl.2012.10.005
    • Story, B. (2013). " Phrase-level speech simulation with an airway modulation model of speech production.," Comp. Speech Lang. 27, 989-1010. 10.1016/j.csl.2012.10.005
    • (2013) Comp. Speech Lang. , vol.27 , pp. 989-1010
    • Story, B.1
  • 44
    • 0016129045 scopus 로고
    • Determination of the instant of glottal closure from the speech wave
    • 10.1121/1.1903487
    • Strube, H. (1974). " Determination of the instant of glottal closure from the speech wave.," J. Acoust. Soc. Am. 56, 1625-1629. 10.1121/1.1903487
    • (1974) J. Acoust. Soc. Am. , vol.56 , pp. 1625-1629
    • Strube, H.1
  • 45
    • 0021377343 scopus 로고
    • Parameterization of the glottal area, glottal flow, and vocal fold contact area
    • 10.1121/1.390530
    • Titze, I. (1984). " Parameterization of the glottal area, glottal flow, and vocal fold contact area.," J. Acoust. Soc. Am. 75, 570-580. 10.1121/1.390530
    • (1984) J. Acoust. Soc. Am. , vol.75 , pp. 570-580
    • Titze, I.1
  • 46
    • 0036150214 scopus 로고    scopus 로고
    • Regulating glottal airflow in phonation: Application of the maximum power transfer theorem to a low dimensional phonation model
    • 10.1121/1.1417526
    • Titze, I. (2002). " Regulating glottal airflow in phonation: Application of the maximum power transfer theorem to a low dimensional phonation model.," J. Acoust. Soc. Am. 111, 367-376. 10.1121/1.1417526
    • (2002) J. Acoust. Soc. Am. , vol.111 , pp. 367-376
    • Titze, I.1
  • 48
    • 0036723036 scopus 로고    scopus 로고
    • Systematic errors in the formant analysis of steady-state vowels
    • 10.1016/S0167-6393(01)00049-8
    • Vallabha, G., and Tuller, B. (2002). " Systematic errors in the formant analysis of steady-state vowels.," Speech Commun. 38, 141-160. 10.1016/S0167-6393(01)00049-8
    • (2002) Speech Commun. , vol.38 , pp. 141-160
    • Vallabha, G.1    Tuller, B.2
  • 49
    • 85008561872 scopus 로고    scopus 로고
    • High-pitch formant estimation by exploiting temporal change of pitch
    • 10.1109/TASL.2009.2024732
    • Wang, T., and Quatieri, T. (2010). " High-pitch formant estimation by exploiting temporal change of pitch.," IEEE Trans. Audio, Speech, Lang. Process. 18, 171-186. 10.1109/TASL.2009.2024732
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , pp. 171-186
    • Wang, T.1    Quatieri, T.2
  • 50
    • 0018653975 scopus 로고
    • Least squares glottal inverse filtering from the acoustic speech waveform
    • 10.1109/TASSP.1979.1163260
    • Wong, D., Markel, J., and Gray, A., Jr. (1979). " Least squares glottal inverse filtering from the acoustic speech waveform.," IEEE Trans. Acoust. Speech Signal Process. 27, 350-355. 10.1109/TASSP.1979.1163260
    • (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , pp. 350-355
    • Wong, D.1    Markel, J.2    Gray Jr., A.3
  • 51
    • 0032121729 scopus 로고    scopus 로고
    • Extraction of vocal-tract system characteristics from speech signals
    • 10.1109/89.701359
    • Yegnanarayana, B., and Veldhuis, N. (1998). " Extraction of vocal-tract system characteristics from speech signals.," IEEE Trans. Speech Audio Process. 6, 313-327. 10.1109/89.701359
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , pp. 313-327
    • Yegnanarayana, B.1    Veldhuis, N.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.