메뉴 건너뛰기




Volumn 14, Issue 2, 2006, Pages 435-444

Robust formant tracking for continuous speech with speaker variability

Author keywords

Formant estimation; Hearing aids; Speech analysis; Speech enhancement

Indexed keywords

ACOUSTIC NOISE; ALGORITHMS; FEATURE EXTRACTION; HEARING AIDS; NATURAL FREQUENCIES; SIGNAL DETECTION; SIGNAL TO NOISE RATIO; SPEECH ANALYSIS; SPEECH ENHANCEMENT;

EID: 33947155741     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.855840     Document Type: Article
Times cited : (83)

References (47)
  • 1
    • 84941328385 scopus 로고
    • Control methods used in a study of the vowels
    • Mar
    • G.E. Peterson and H. L. Barney, "Control methods used in a study of the vowels," J. Acoust. Soc. Amer., vol. 24, no. 2, pp. 175-184, Mar. 1952.
    • (1952) J. Acoust. Soc. Amer , vol.24 , Issue.2 , pp. 175-184
    • Peterson, G.E.1    Barney, H.L.2
  • 2
    • 0014557625 scopus 로고
    • Perceptual and physical space of vowel sounds
    • Aug
    • L. C. Pols, L. J. van der Kamp, and R. Plomp, "Perceptual and physical space of vowel sounds," J. Acoust. Soc. Amer., vol. 46, no. 2, pp. 458-467, Aug. 1969.
    • (1969) J. Acoust. Soc. Amer , vol.46 , Issue.2 , pp. 458-467
    • Pols, L.C.1    van der Kamp, L.J.2    Plomp, R.3
  • 3
    • 0028820976 scopus 로고
    • The role of formant transitions in the perception of concurrent vowels
    • Jan
    • P. F. Assmann, "The role of formant transitions in the perception of concurrent vowels," J. Acoust. Soc. Amer., vol. 97, no. 1, pp. 575-584, Jan. 1995.
    • (1995) J. Acoust. Soc. Amer , vol.97 , Issue.1 , pp. 575-584
    • Assmann, P.F.1
  • 4
    • 0029809872 scopus 로고    scopus 로고
    • _, Modeling the perception of concurrent vowels: Role of formant transitions, J. Acoust. Soc. Amer., pt. 1, 100, no. 2, pp. 1141-1152, Aug. 1996.
    • _, "Modeling the perception of concurrent vowels: Role of formant transitions," J. Acoust. Soc. Amer., pt. 1, vol. 100, no. 2, pp. 1141-1152, Aug. 1996.
  • 5
    • 0028129916 scopus 로고    scopus 로고
    • R. N. Ohde, The development of the perception of cues to the [m] [n]distinction in CV syllables, J. Acoust. Soc. Amer., pt. 1, 96, no. 2, pp. 675-686, Aug. 1994.
    • R. N. Ohde, "The development of the perception of cues to the [m] [n]distinction in CV syllables," J. Acoust. Soc. Amer., pt. 1, vol. 96, no. 2, pp. 675-686, Aug. 1994.
  • 6
    • 0028241671 scopus 로고    scopus 로고
    • A. K. Nábělek, Z. Czyzewski, and H. Crowley, Cues for perception of the diphthong /ai/in either noise or reverberation. Part I. Duration of the transition, J. Acoust. Soc. Amer., pt. 1, 95, no. 5, pp. 2681-2693, May 1994.
    • A. K. Nábělek, Z. Czyzewski, and H. Crowley, "Cues for perception of the diphthong /ai/in either noise or reverberation. Part I. Duration of the transition," J. Acoust. Soc. Amer., pt. 1, vol. 95, no. 5, pp. 2681-2693, May 1994.
  • 7
    • 0001877436 scopus 로고
    • The role of consonant-vowel transitions in the perception of the stop and nasal consonants
    • A. M. Liberman, P. C. Delattre, F. S. Cooper, and L. J. Gerstman, "The role of consonant-vowel transitions in the perception of the stop and nasal consonants," Psychol. Monographs, vol. 68, pp. 1-13, 1954.
    • (1954) Psychol. Monographs , vol.68 , pp. 1-13
    • Liberman, A.M.1    Delattre, P.C.2    Cooper, F.S.3    Gerstman, L.J.4
  • 8
    • 0031006065 scopus 로고    scopus 로고
    • Effects of acoustic trauma on the representation of the vowel /5/in cat auditory nerve fibers
    • Jun
    • R. L. Miller, J. R. Schilling, K. R. Franck, and E. D. Young, "Effects of acoustic trauma on the representation of the vowel /5/in cat auditory nerve fibers," J. Acoust. Soc. Amer., vol. 101, no. 6, pp. 3602-3616, Jun. 1997.
    • (1997) J. Acoust. Soc. Amer , vol.101 , Issue.6 , pp. 3602-3616
    • Miller, R.L.1    Schilling, J.R.2    Franck, K.R.3    Young, E.D.4
  • 9
    • 0036010525 scopus 로고    scopus 로고
    • Biological basis of hearing-aid design
    • Feb
    • M. B. Sachs, I. C. Bruce, R. L. Miller, and E. D. Young,"Biological basis of hearing-aid design," Ann. Biomed. Eng., vol. 30, no. 2, pp. 157-168, Feb. 2002.
    • (2002) Ann. Biomed. Eng , vol.30 , Issue.2 , pp. 157-168
    • Sachs, M.B.1    Bruce, I.C.2    Miller, R.L.3    Young, E.D.4
  • 10
    • 0031945435 scopus 로고    scopus 로고
    • Frequency-shaped amplification changes the neural representation of speech with noise-induced hearing loss
    • J. R. Schilling, R. L. Miller, M. B. Sachs, and E. D. Young, "Frequency-shaped amplification changes the neural representation of speech with noise-induced hearing loss," Hearing Res., vol. 117, pp. 57-70, 1998.
    • (1998) Hearing Res , vol.117 , pp. 57-70
    • Schilling, J.R.1    Miller, R.L.2    Sachs, M.B.3    Young, E.D.4
  • 11
    • 0032716549 scopus 로고    scopus 로고
    • Contrast enhancement improves the representation of /ε/-like vowels in the hearing-impaired auditory nerve
    • R. L. Miller, B. M. Calhoun, and E. D. Young, "Contrast enhancement improves the representation of /ε/-like vowels in the hearing-impaired auditory nerve," J. Acousl. Soc. Amer., vol. 106, no. 5, pp. 2693-2708, 1999.
    • (1999) J. Acousl. Soc. Amer , vol.106 , Issue.5 , pp. 2693-2708
    • Miller, R.L.1    Calhoun, B.M.2    Young, E.D.3
  • 12
    • 4344609571 scopus 로고    scopus 로고
    • Physiological assessment of contrast-enhancing frequency shaping and multiband compression in hearing aids
    • Aug
    • I. C. Bruce, "Physiological assessment of contrast-enhancing frequency shaping and multiband compression in hearing aids," Physiol. Meas., vol. 25, no. 4, pp. 945-956, Aug. 2004.
    • (2004) Physiol. Meas , vol.25 , Issue.4 , pp. 945-956
    • Bruce, I.C.1
  • 13
    • 0009625192 scopus 로고
    • Automatic extraction of formant frequencies from continuous speech
    • J. L. Flanagan, "Automatic extraction of formant frequencies from continuous speech," J. Acousl. Soc. Amer., vol. 28, pp. 110-118, 1956.
    • (1956) J. Acousl. Soc. Amer , vol.28 , pp. 110-118
    • Flanagan, J.L.1
  • 14
    • 0014730929 scopus 로고
    • System for automatic formant analysis of voiced speech
    • Feb
    • R. W. Schafer and L. R. Rabiner, "System for automatic formant analysis of voiced speech," J. Acoust. Soc. Amer., vol. 47, no. 2, pp. 634-648, Feb. 1970.
    • (1970) J. Acoust. Soc. Amer , vol.47 , Issue.2 , pp. 634-648
    • Schafer, R.W.1    Rabiner, L.R.2
  • 15
    • 0015112070 scopus 로고
    • Speech analysis and synthesis by linear prediction of the speech wave
    • Aug
    • B. S. Atal and S. L. Hanauer, "Speech analysis and synthesis by linear prediction of the speech wave," J. Acoust. Soc. Amer., vol. 50, no. 2, pp. 637-655, Aug. 1971.
    • (1971) J. Acoust. Soc. Amer , vol.50 , Issue.2 , pp. 637-655
    • Atal, B.S.1    Hanauer, S.L.2
  • 16
    • 0016049328 scopus 로고
    • An algorithm for automatic formant extraction using linear prediction spectra
    • S. McCandless, "An algorithm for automatic formant extraction using linear prediction spectra," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-22, pp. 135-141, 1974.
    • (1974) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-22 , pp. 135-141
    • McCandless, S.1
  • 18
    • 33947106218 scopus 로고    scopus 로고
    • Robust Formant Tracking for Continuous Speech With Speaker Variability,
    • M.S. thesis, McMaster Univ, Hamilton, ON, Canada
    • K. Mustafa, "Robust Formant Tracking for Continuous Speech With Speaker Variability," M.S. thesis, McMaster Univ., Hamilton, ON, Canada, 2003.
    • (2003)
    • Mustafa, K.1
  • 19
    • 0000330384 scopus 로고    scopus 로고
    • On decomposing speech into modulated components
    • May
    • A. Rao and R. Kumaresan, "On decomposing speech into modulated components," IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 240-254, May 2000.
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.3 , pp. 240-254
    • Rao, A.1    Kumaresan, R.2
  • 23
    • 0016030463 scopus 로고
    • On the behavior of minimax FIR digital Hubert transformers
    • L. R. Rabiner and R. W. Schafer, "On the behavior of minimax FIR digital Hubert transformers," Bell Syst. Tech. J., vol. 53, no. 2, pp. 363-390, 1974.
    • (1974) Bell Syst. Tech. J , vol.53 , Issue.2 , pp. 363-390
    • Rabiner, L.R.1    Schafer, R.W.2
  • 24
    • 0024069934 scopus 로고
    • Spectrum estimation using an analytic sisnal representation
    • Sep
    • J. Picone, D. P. Prezas, W. T. Hartwell, and J. L. Locicero, "Spectrum estimation using an analytic sisnal representation," Signal Process., vol. 15, no. 2, pp. 169-182, Sep. 1988.
    • (1988) Signal Process , vol.15 , Issue.2 , pp. 169-182
    • Picone, J.1    Prezas, D.P.2    Hartwell, W.T.3    Locicero, J.L.4
  • 26
    • 0016495091 scopus 로고
    • Linear prediction: A tutorial review
    • Apr
    • J. Makhoul, "Linear prediction: A tutorial review," Proc. IEEE, vol. 63, no. 4, pp. 561-580, Apr. 1975.
    • (1975) Proc. IEEE , vol.63 , Issue.4 , pp. 561-580
    • Makhoul, J.1
  • 28
    • 0000618817 scopus 로고
    • New methods of pitch extraction
    • M. M. Sondhi, "New methods of pitch extraction," IEEE Trans. Audio Electroacoust., vol. AU-16, no. 2, pp. 262-266, 1968.
    • (1968) IEEE Trans. Audio Electroacoust , vol.AU-16 , Issue.2 , pp. 262-266
    • Sondhi, M.M.1
  • 29
    • 0033623527 scopus 로고    scopus 로고
    • Spontaneous speech recognition using a statistical coarticulatory model for the vocal-tract-resonance dynamics
    • L. Deng and J. Ma, "Spontaneous speech recognition using a statistical coarticulatory model for the vocal-tract-resonance dynamics," J. Acoust. Soc. Amer., vol. 108, no. 6, pp. 3036-3048, 2000.
    • (2000) J. Acoust. Soc. Amer , vol.108 , Issue.6 , pp. 3036-3048
    • Deng, L.1    Ma, J.2
  • 30
    • 85009211881 scopus 로고    scopus 로고
    • Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint
    • L. Deng, I. Bazzi, and A. Acero, 'Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint," in Proc. EUROSPEECH, 2003, pp. 73-76.
    • (2003) Proc. EUROSPEECH , pp. 73-76
    • Deng, L.1    Bazzi, I.2    Acero, A.3
  • 31
    • 4544323815 scopus 로고    scopus 로고
    • A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances
    • L. Deng, L. Lee, H. Attias, and A. Acero, "A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances," in Proc. ICASSP, vol. 1, 2004, pp. 557-560.
    • (2004) Proc. ICASSP , vol.1 , pp. 557-560
    • Deng, L.1    Lee, L.2    Attias, H.3    Acero, A.4
  • 32
    • 84876465692 scopus 로고    scopus 로고
    • A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech
    • L. Deng, D. Yu, and A. Acero, "A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech," in Proc. InterSpeech-ICSLP, 2004, pp. 981-984.
    • (2004) Proc. InterSpeech-ICSLP , pp. 981-984
    • Deng, L.1    Yu, D.2    Acero, A.3
  • 33
    • 4544278205 scopus 로고    scopus 로고
    • Formant tracking by mixture state particle filter
    • Y. Zheng and M. Hasegawa-Johnson, "Formant tracking by mixture state particle filter," in Proc. ICASSP, vol. 1, 2004, pp. 565-568.
    • (2004) Proc. ICASSP , vol.1 , pp. 565-568
    • Zheng, Y.1    Hasegawa-Johnson, M.2
  • 34
    • 33947130944 scopus 로고    scopus 로고
    • Improved differential phase spectrum processing for formant tracking
    • B. Bozkurt, T. Dutoit, B. Doval, and C. D'Alessandro, "Improved differential phase spectrum processing for formant tracking," in Proc. InterSpeech-ICSLP, 2004, pp. 2421-2424.
    • (2004) Proc. InterSpeech-ICSLP , pp. 2421-2424
    • Bozkurt, B.1    Dutoit, T.2    Doval, B.3    D'Alessandro, C.4
  • 35
    • 33947190185 scopus 로고    scopus 로고
    • A concurrent curve strategy for formant tracking
    • Y. Laprie, "A concurrent curve strategy for formant tracking," in Proc. InterSpeech-ICSLP, 2004, pp. 2405-2408.
    • (2004) Proc. InterSpeech-ICSLP , pp. 2405-2408
    • Laprie, Y.1
  • 36
    • 0031632620 scopus 로고    scopus 로고
    • On the robust incorporation of formant features into hidden Markov models for automatic speech recognition
    • 36
    • |36] P. N. Gamer and W. J. Holmes, "On the robust incorporation of formant features into hidden Markov models for automatic speech recognition," in Proc. ICASSP, vol 1, 1998, pp. 1-4.
    • (1998) Proc. ICASSP , vol.1 , pp. 1-4
    • Gamer, P.N.1    Holmes, W.J.2
  • 37
    • 0141814630 scopus 로고    scopus 로고
    • An expectation maximization approach for formant tracking using a parameter-free nonlinear predictor
    • I. Bazzi, A. Acero, and L. Deng, "An expectation maximization approach for formant tracking using a parameter-free nonlinear predictor," in Proc. ICASSP, vol. 1, 2003, pp. 464-467.
    • (2003) Proc. ICASSP , vol.1 , pp. 464-467
    • Bazzi, I.1    Acero, A.2    Deng, L.3
  • 38
    • 85135264071 scopus 로고    scopus 로고
    • Formant analysis and synthesis using hidden Markov models
    • A. Acero, "Formant analysis and synthesis using hidden Markov models," in Proc. EUROSPEECH, 1999, pp. 1047-1050.
    • (1999) Proc. EUROSPEECH , pp. 1047-1050
    • Acero, A.1
  • 40
    • 0141479037 scopus 로고    scopus 로고
    • Evaluation of methods for parameteric formant transformation in voice conversion
    • E. Turajlic, D. Rentzos, S. Vaseghi, and C.-H. Ho, "Evaluation of methods for parameteric formant transformation in voice conversion," in Proc. ICASSP, vol. 1, 2003, pp. 724-727.
    • (2003) Proc. ICASSP , vol.1 , pp. 724-727
    • Turajlic, E.1    Rentzos, D.2    Vaseghi, S.3    Ho, C.-H.4
  • 42
    • 4444238905 scopus 로고    scopus 로고
    • Evaluation of formant-like features on an automatic vowel classification task
    • F. de Wet, K. Weber, L. Boves, B. Cranen, S. Bengio, and H. Bourlard, "Evaluation of formant-like features on an automatic vowel classification task," J. Acoust. Soc. Amer., vol. 116, no. 3, pp. 1781-1792, 2004.
    • (2004) J. Acoust. Soc. Amer , vol.116 , Issue.3 , pp. 1781-1792
    • de Wet, F.1    Weber, K.2    Boves, L.3    Cranen, B.4    Bengio, S.5    Bourlard, H.6
  • 43
    • 0023516708 scopus 로고
    • A composite auditory model for processing speech sounds
    • Dec
    • L. Deng and C. D. Geisler, "A composite auditory model for processing speech sounds," J. Acoust. Soc. Amer., vol. 82, no. 6, pp. 2001-2012, Dec. 1987.
    • (1987) J. Acoust. Soc. Amer , vol.82 , Issue.6 , pp. 2001-2012
    • Deng, L.1    Geisler, C.D.2
  • 44
    • 85041486134 scopus 로고    scopus 로고
    • Optimising unit selection with voice source and formants in the CHATR speech synthesis system
    • W. Ding and N. Campbell, "Optimising unit selection with voice source and formants in the CHATR speech synthesis system," in Proc. EUROSPEECH, 1997, pp. 537-540.
    • (1997) Proc. EUROSPEECH , pp. 537-540
    • Ding, W.1    Campbell, N.2
  • 45
    • 33745192738 scopus 로고    scopus 로고
    • A fast method of speaker normalization using formant estimation
    • M. Lincoln, S. Cox, and S. Ringland, "A fast method of speaker normalization using formant estimation," in Proc. EUROSPEECH, 1997, pp. 2095-2098.
    • (1997) Proc. EUROSPEECH , pp. 2095-2098
    • Lincoln, M.1    Cox, S.2    Ringland, S.3
  • 46
    • 4544290146 scopus 로고    scopus 로고
    • ASR dependent techniques for speaker identification
    • A. Park and T. J. Hazen, "ASR dependent techniques for speaker identification," in Proc. InterSpeech-lCSLP, 2002, pp. 478-479.
    • (2002) Proc. InterSpeech-lCSLP , pp. 478-479
    • Park, A.1    Hazen, T.J.2
  • 47
    • 33645587401 scopus 로고    scopus 로고
    • Data driven formant synthesis
    • J. Högberg, "Data driven formant synthesis," in Proc. EUROSPEECH, 1997, pp. 565-568.
    • (1997) Proc. EUROSPEECH , pp. 565-568
    • Högberg, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.