SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 14, Issue 2, 2006, Pages 435-444

Robust formant tracking for continuous speech with speaker variability

(2) Mustafa, Kamran b,c Bruce, Ian C a,b

a IEEE

b MCMASTER UNIVERSITY (Canada)

c Siemens Canada Limited (Canada)

Author keywords

Formant estimation; Hearing aids; Speech analysis; Speech enhancement

Indexed keywords

ACOUSTIC NOISE; ALGORITHMS; FEATURE EXTRACTION; HEARING AIDS; NATURAL FREQUENCIES; SIGNAL DETECTION; SIGNAL TO NOISE RATIO; SPEECH ANALYSIS; SPEECH ENHANCEMENT;

FORMANT TRACKING; GENDER DETECTORS; INTERBANKS; SPEAKER VARIABILITY;

CONTINUOUS SPEECH RECOGNITION;

EID: 33947155741 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.855840 Document Type: Article

Times cited : (83)

References (47)

1
- 84941328385
- Control methods used in a study of the vowels
- Mar
- G.E. Peterson and H. L. Barney, "Control methods used in a study of the vowels," J. Acoust. Soc. Amer., vol. 24, no. 2, pp. 175-184, Mar. 1952.
- (1952) J. Acoust. Soc. Amer , vol.24 , Issue.2 , pp. 175-184
- Peterson, G.E.¹ Barney, H.L.²

2
- 0014557625
- Perceptual and physical space of vowel sounds
- Aug
- L. C. Pols, L. J. van der Kamp, and R. Plomp, "Perceptual and physical space of vowel sounds," J. Acoust. Soc. Amer., vol. 46, no. 2, pp. 458-467, Aug. 1969.
- (1969) J. Acoust. Soc. Amer , vol.46 , Issue.2 , pp. 458-467
- Pols, L.C.¹ van der Kamp, L.J.² Plomp, R.³

3
- 0028820976
- The role of formant transitions in the perception of concurrent vowels
- Jan
- P. F. Assmann, "The role of formant transitions in the perception of concurrent vowels," J. Acoust. Soc. Amer., vol. 97, no. 1, pp. 575-584, Jan. 1995.
- (1995) J. Acoust. Soc. Amer , vol.97 , Issue.1 , pp. 575-584
- Assmann, P.F.¹

4
- 0029809872
- _, Modeling the perception of concurrent vowels: Role of formant transitions, J. Acoust. Soc. Amer., pt. 1, 100, no. 2, pp. 1141-1152, Aug. 1996.
- _, "Modeling the perception of concurrent vowels: Role of formant transitions," J. Acoust. Soc. Amer., pt. 1, vol. 100, no. 2, pp. 1141-1152, Aug. 1996.

5
- 0028129916
- R. N. Ohde, The development of the perception of cues to the [m] [n]distinction in CV syllables, J. Acoust. Soc. Amer., pt. 1, 96, no. 2, pp. 675-686, Aug. 1994.
- R. N. Ohde, "The development of the perception of cues to the [m] [n]distinction in CV syllables," J. Acoust. Soc. Amer., pt. 1, vol. 96, no. 2, pp. 675-686, Aug. 1994.

6
- 0028241671
- A. K. Nábělek, Z. Czyzewski, and H. Crowley, Cues for perception of the diphthong /ai/in either noise or reverberation. Part I. Duration of the transition, J. Acoust. Soc. Amer., pt. 1, 95, no. 5, pp. 2681-2693, May 1994.
- A. K. Nábělek, Z. Czyzewski, and H. Crowley, "Cues for perception of the diphthong /ai/in either noise or reverberation. Part I. Duration of the transition," J. Acoust. Soc. Amer., pt. 1, vol. 95, no. 5, pp. 2681-2693, May 1994.

7
- 0001877436
- The role of consonant-vowel transitions in the perception of the stop and nasal consonants
- A. M. Liberman, P. C. Delattre, F. S. Cooper, and L. J. Gerstman, "The role of consonant-vowel transitions in the perception of the stop and nasal consonants," Psychol. Monographs, vol. 68, pp. 1-13, 1954.
- (1954) Psychol. Monographs , vol.68 , pp. 1-13
- Liberman, A.M.¹ Delattre, P.C.² Cooper, F.S.³ Gerstman, L.J.⁴

8
- 0031006065
- Effects of acoustic trauma on the representation of the vowel /5/in cat auditory nerve fibers
- Jun
- R. L. Miller, J. R. Schilling, K. R. Franck, and E. D. Young, "Effects of acoustic trauma on the representation of the vowel /5/in cat auditory nerve fibers," J. Acoust. Soc. Amer., vol. 101, no. 6, pp. 3602-3616, Jun. 1997.
- (1997) J. Acoust. Soc. Amer , vol.101 , Issue.6 , pp. 3602-3616
- Miller, R.L.¹ Schilling, J.R.² Franck, K.R.³ Young, E.D.⁴

9
- 0036010525
- Biological basis of hearing-aid design
- Feb
- M. B. Sachs, I. C. Bruce, R. L. Miller, and E. D. Young,"Biological basis of hearing-aid design," Ann. Biomed. Eng., vol. 30, no. 2, pp. 157-168, Feb. 2002.
- (2002) Ann. Biomed. Eng , vol.30 , Issue.2 , pp. 157-168
- Sachs, M.B.¹ Bruce, I.C.² Miller, R.L.³ Young, E.D.⁴

10
- 0031945435
- Frequency-shaped amplification changes the neural representation of speech with noise-induced hearing loss
- J. R. Schilling, R. L. Miller, M. B. Sachs, and E. D. Young, "Frequency-shaped amplification changes the neural representation of speech with noise-induced hearing loss," Hearing Res., vol. 117, pp. 57-70, 1998.
- (1998) Hearing Res , vol.117 , pp. 57-70
- Schilling, J.R.¹ Miller, R.L.² Sachs, M.B.³ Young, E.D.⁴

11
- 0032716549
- Contrast enhancement improves the representation of /ε/-like vowels in the hearing-impaired auditory nerve
- R. L. Miller, B. M. Calhoun, and E. D. Young, "Contrast enhancement improves the representation of /ε/-like vowels in the hearing-impaired auditory nerve," J. Acousl. Soc. Amer., vol. 106, no. 5, pp. 2693-2708, 1999.
- (1999) J. Acousl. Soc. Amer , vol.106 , Issue.5 , pp. 2693-2708
- Miller, R.L.¹ Calhoun, B.M.² Young, E.D.³

12
- 4344609571
- Physiological assessment of contrast-enhancing frequency shaping and multiband compression in hearing aids
- Aug
- I. C. Bruce, "Physiological assessment of contrast-enhancing frequency shaping and multiband compression in hearing aids," Physiol. Meas., vol. 25, no. 4, pp. 945-956, Aug. 2004.
- (2004) Physiol. Meas , vol.25 , Issue.4 , pp. 945-956
- Bruce, I.C.¹

13
- 0009625192
- Automatic extraction of formant frequencies from continuous speech
- J. L. Flanagan, "Automatic extraction of formant frequencies from continuous speech," J. Acousl. Soc. Amer., vol. 28, pp. 110-118, 1956.
- (1956) J. Acousl. Soc. Amer , vol.28 , pp. 110-118
- Flanagan, J.L.¹

14
- 0014730929
- System for automatic formant analysis of voiced speech
- Feb
- R. W. Schafer and L. R. Rabiner, "System for automatic formant analysis of voiced speech," J. Acoust. Soc. Amer., vol. 47, no. 2, pp. 634-648, Feb. 1970.
- (1970) J. Acoust. Soc. Amer , vol.47 , Issue.2 , pp. 634-648
- Schafer, R.W.¹ Rabiner, L.R.²

15
- 0015112070
- Speech analysis and synthesis by linear prediction of the speech wave
- Aug
- B. S. Atal and S. L. Hanauer, "Speech analysis and synthesis by linear prediction of the speech wave," J. Acoust. Soc. Amer., vol. 50, no. 2, pp. 637-655, Aug. 1971.
- (1971) J. Acoust. Soc. Amer , vol.50 , Issue.2 , pp. 637-655
- Atal, B.S.¹ Hanauer, S.L.²

16
- 0016049328
- An algorithm for automatic formant extraction using linear prediction spectra
- S. McCandless, "An algorithm for automatic formant extraction using linear prediction spectra," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-22, pp. 135-141, 1974.
- (1974) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-22 , pp. 135-141
- McCandless, S.¹

17
- 0026363469
- Auditory modeling applied to formant tracking of noise-corrupted speech
- S. W. Metz, J. A. Heinen, R. J. Niederjohn, and T. V. Sreenivas, "Auditory modeling applied to formant tracking of noise-corrupted speech," in Proc. Int. Conf. Industrial Electronics, Control and Instrumentation, vol.3, 1991, pp. 2120-2124.
- (1991) Proc. Int. Conf. Industrial Electronics, Control and Instrumentation , vol.3 , pp. 2120-2124
- Metz, S.W.¹ Heinen, J.A.² Niederjohn, R.J.³ Sreenivas, T.V.⁴

18
- 33947106218
- Robust Formant Tracking for Continuous Speech With Speaker Variability,
- M.S. thesis, McMaster Univ, Hamilton, ON, Canada
- K. Mustafa, "Robust Formant Tracking for Continuous Speech With Speaker Variability," M.S. thesis, McMaster Univ., Hamilton, ON, Canada, 2003.
- (2003)
- Mustafa, K.¹

19
- 0000330384
- On decomposing speech into modulated components
- May
- A. Rao and R. Kumaresan, "On decomposing speech into modulated components," IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 240-254, May 2000.
- (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.3 , pp. 240-254
- Rao, A.¹ Kumaresan, R.²

20
- 17344378368
- Robust formant tracking in noise
- I. C. Bruce, N. V. Karkhanis, E. D. Young, and M. B. Sachs, "Robust formant tracking in noise," in Proc. ICASSP, vol. 1, 2002, pp. 281-284.
- (2002) Proc. ICASSP , vol.1 , pp. 281-284
- Bruce, I.C.¹ Karkhanis, N.V.² Young, E.D.³ Sachs, M.B.⁴

21
- 33947097643
- Robust formant tracking for continuous speech with speaker variability
- K. Mustafa and I. C. Bruce, "Robust formant tracking for continuous speech with speaker variability," in Proc. 7th Int. Symp. Signal Processing and Its Applications (ISSPA), vol. 2, 2003, pp. 623-624.
- (2003) Proc. 7th Int. Symp. Signal Processing and Its Applications (ISSPA) , vol.2 , pp. 623-624
- Mustafa, K.¹ Bruce, I.C.²

22
- 0003424145
- New York: Macmillan
- J. R. Deller, J. G. Proakis, and J. H. L. Hansen, Discrete-Time Processing of Speech Signals. New York: Macmillan, 1993.
- (1993) Discrete-Time Processing of Speech Signals
- Deller, J.R.¹ Proakis, J.G.² Hansen, J.H.L.³

23
- 0016030463
- On the behavior of minimax FIR digital Hubert transformers
- L. R. Rabiner and R. W. Schafer, "On the behavior of minimax FIR digital Hubert transformers," Bell Syst. Tech. J., vol. 53, no. 2, pp. 363-390, 1974.
- (1974) Bell Syst. Tech. J , vol.53 , Issue.2 , pp. 363-390
- Rabiner, L.R.¹ Schafer, R.W.²

24
- 0024069934
- Spectrum estimation using an analytic sisnal representation
- Sep
- J. Picone, D. P. Prezas, W. T. Hartwell, and J. L. Locicero, "Spectrum estimation using an analytic sisnal representation," Signal Process., vol. 15, no. 2, pp. 169-182, Sep. 1988.
- (1988) Signal Process , vol.15 , Issue.2 , pp. 169-182
- Picone, J.¹ Prezas, D.P.² Hartwell, W.T.³ Locicero, J.L.⁴

25
- 0003927842
- Upper Saddle River, NJ: Prentice-Hall
- T. F. Quatieri, Discrete-Time Speech Signal Processing. Upper Saddle River, NJ: Prentice-Hall, 2002.
- (2002) Discrete-Time Speech Signal Processing
- Quatieri, T.F.¹

26
- 0016495091
- Linear prediction: A tutorial review
- Apr
- J. Makhoul, "Linear prediction: A tutorial review," Proc. IEEE, vol. 63, no. 4, pp. 561-580, Apr. 1975.
- (1975) Proc. IEEE , vol.63 , Issue.4 , pp. 561-580
- Makhoul, J.¹

27
- 0003425258
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and R. W. Schafer, Digital Processing of Speech Signals. Englewood Cliffs, NJ: Prentice-Hall, 1978.
- (1978) Digital Processing of Speech Signals
- Rabiner, L.R.¹ Schafer, R.W.²

28
- 0000618817
- New methods of pitch extraction
- M. M. Sondhi, "New methods of pitch extraction," IEEE Trans. Audio Electroacoust., vol. AU-16, no. 2, pp. 262-266, 1968.
- (1968) IEEE Trans. Audio Electroacoust , vol.AU-16 , Issue.2 , pp. 262-266
- Sondhi, M.M.¹

29
- 0033623527
- Spontaneous speech recognition using a statistical coarticulatory model for the vocal-tract-resonance dynamics
- L. Deng and J. Ma, "Spontaneous speech recognition using a statistical coarticulatory model for the vocal-tract-resonance dynamics," J. Acoust. Soc. Amer., vol. 108, no. 6, pp. 3036-3048, 2000.
- (2000) J. Acoust. Soc. Amer , vol.108 , Issue.6 , pp. 3036-3048
- Deng, L.¹ Ma, J.²

30
- 85009211881
- Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint
- L. Deng, I. Bazzi, and A. Acero, 'Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint," in Proc. EUROSPEECH, 2003, pp. 73-76.
- (2003) Proc. EUROSPEECH , pp. 73-76
- Deng, L.¹ Bazzi, I.² Acero, A.³

31
- 4544323815
- A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances
- L. Deng, L. Lee, H. Attias, and A. Acero, "A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances," in Proc. ICASSP, vol. 1, 2004, pp. 557-560.
- (2004) Proc. ICASSP , vol.1 , pp. 557-560
- Deng, L.¹ Lee, L.² Attias, H.³ Acero, A.⁴

32
- 84876465692
- A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech
- L. Deng, D. Yu, and A. Acero, "A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech," in Proc. InterSpeech-ICSLP, 2004, pp. 981-984.
- (2004) Proc. InterSpeech-ICSLP , pp. 981-984
- Deng, L.¹ Yu, D.² Acero, A.³

33
- 4544278205
- Formant tracking by mixture state particle filter
- Y. Zheng and M. Hasegawa-Johnson, "Formant tracking by mixture state particle filter," in Proc. ICASSP, vol. 1, 2004, pp. 565-568.
- (2004) Proc. ICASSP , vol.1 , pp. 565-568
- Zheng, Y.¹ Hasegawa-Johnson, M.²

34
- 33947130944
- Improved differential phase spectrum processing for formant tracking
- B. Bozkurt, T. Dutoit, B. Doval, and C. D'Alessandro, "Improved differential phase spectrum processing for formant tracking," in Proc. InterSpeech-ICSLP, 2004, pp. 2421-2424.
- (2004) Proc. InterSpeech-ICSLP , pp. 2421-2424
- Bozkurt, B.¹ Dutoit, T.² Doval, B.³ D'Alessandro, C.⁴

35
- 33947190185
- A concurrent curve strategy for formant tracking
- Y. Laprie, "A concurrent curve strategy for formant tracking," in Proc. InterSpeech-ICSLP, 2004, pp. 2405-2408.
- (2004) Proc. InterSpeech-ICSLP , pp. 2405-2408
- Laprie, Y.¹

36
- 0031632620
- On the robust incorporation of formant features into hidden Markov models for automatic speech recognition
- 36
- |36] P. N. Gamer and W. J. Holmes, "On the robust incorporation of formant features into hidden Markov models for automatic speech recognition," in Proc. ICASSP, vol 1, 1998, pp. 1-4.
- (1998) Proc. ICASSP , vol.1 , pp. 1-4
- Gamer, P.N.¹ Holmes, W.J.²

37
- 0141814630
- An expectation maximization approach for formant tracking using a parameter-free nonlinear predictor
- I. Bazzi, A. Acero, and L. Deng, "An expectation maximization approach for formant tracking using a parameter-free nonlinear predictor," in Proc. ICASSP, vol. 1, 2003, pp. 464-467.
- (2003) Proc. ICASSP , vol.1 , pp. 464-467
- Bazzi, I.¹ Acero, A.² Deng, L.³

38
- 85135264071
- Formant analysis and synthesis using hidden Markov models
- A. Acero, "Formant analysis and synthesis using hidden Markov models," in Proc. EUROSPEECH, 1999, pp. 1047-1050.
- (1999) Proc. EUROSPEECH , pp. 1047-1050
- Acero, A.¹

39
- 85009247886
- Evaluation of formant-like features for ASR
- K. Weber, F. de Wet, B. Cranen, L. Boves, S. Bengio, and H. Bourlard, "Evaluation of formant-like features for ASR," in Proc. InterSpeech-ICSLP, 2002, pp. 2101-2104.
- (2002) Proc. InterSpeech-ICSLP , pp. 2101-2104
- Weber, K.¹ de Wet, F.² Cranen, B.³ Boves, L.⁴ Bengio, S.⁵ Bourlard, H.⁶

40
- 0141479037
- Evaluation of methods for parameteric formant transformation in voice conversion
- E. Turajlic, D. Rentzos, S. Vaseghi, and C.-H. Ho, "Evaluation of methods for parameteric formant transformation in voice conversion," in Proc. ICASSP, vol. 1, 2003, pp. 724-727.
- (2003) Proc. ICASSP , vol.1 , pp. 724-727
- Turajlic, E.¹ Rentzos, D.² Vaseghi, S.³ Ho, C.-H.⁴

41
- 33947125734
- A formant tracking LP model for speech processing
- Q. Yan, E. Zavarehei, S. Vaseghi, and D. Rentzos, "A formant tracking LP model for speech processing," in Proc. Inter Speech-ICSLP, 2004, pp. 2409-2412.
- (2004) Proc. Inter Speech-ICSLP , pp. 2409-2412
- Yan, Q.¹ Zavarehei, E.² Vaseghi, S.³ Rentzos, D.⁴

42
- 4444238905
- Evaluation of formant-like features on an automatic vowel classification task
- F. de Wet, K. Weber, L. Boves, B. Cranen, S. Bengio, and H. Bourlard, "Evaluation of formant-like features on an automatic vowel classification task," J. Acoust. Soc. Amer., vol. 116, no. 3, pp. 1781-1792, 2004.
- (2004) J. Acoust. Soc. Amer , vol.116 , Issue.3 , pp. 1781-1792
- de Wet, F.¹ Weber, K.² Boves, L.³ Cranen, B.⁴ Bengio, S.⁵ Bourlard, H.⁶

43
- 0023516708
- A composite auditory model for processing speech sounds
- Dec
- L. Deng and C. D. Geisler, "A composite auditory model for processing speech sounds," J. Acoust. Soc. Amer., vol. 82, no. 6, pp. 2001-2012, Dec. 1987.
- (1987) J. Acoust. Soc. Amer , vol.82 , Issue.6 , pp. 2001-2012
- Deng, L.¹ Geisler, C.D.²

44
- 85041486134
- Optimising unit selection with voice source and formants in the CHATR speech synthesis system
- W. Ding and N. Campbell, "Optimising unit selection with voice source and formants in the CHATR speech synthesis system," in Proc. EUROSPEECH, 1997, pp. 537-540.
- (1997) Proc. EUROSPEECH , pp. 537-540
- Ding, W.¹ Campbell, N.²

45
- 33745192738
- A fast method of speaker normalization using formant estimation
- M. Lincoln, S. Cox, and S. Ringland, "A fast method of speaker normalization using formant estimation," in Proc. EUROSPEECH, 1997, pp. 2095-2098.
- (1997) Proc. EUROSPEECH , pp. 2095-2098
- Lincoln, M.¹ Cox, S.² Ringland, S.³

46
- 4544290146
- ASR dependent techniques for speaker identification
- A. Park and T. J. Hazen, "ASR dependent techniques for speaker identification," in Proc. InterSpeech-lCSLP, 2002, pp. 478-479.
- (2002) Proc. InterSpeech-lCSLP , pp. 478-479
- Park, A.¹ Hazen, T.J.²

47
- 33645587401
- Data driven formant synthesis
- J. Högberg, "Data driven formant synthesis," in Proc. EUROSPEECH, 1997, pp. 565-568.
- (1997) Proc. EUROSPEECH , pp. 565-568
- Högberg, J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.