SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 15, Issue 8, 2007, Pages 2418-2430

A study of filter bank smoothing in MFCC features for recognition of children's speech

(2) Umesh, S a Sinha, Rohit b

a INDIAN INSTITUTE OF TECHNOLOGY KANPUR (India)

b INDIAN INSTITUTE OF TECHNOLOGY GUWAHATI (India)

Author keywords

Children's speech recognition; Mel filter bank; Vocal tract length normalization

Indexed keywords

BANDWIDTH FILTERS; CENTER FREQUENCIES; CEPSTRAL COEFFICIENTS; CHILDREN'S SPEECH RECOGNITION; DIGIT RECOGNITION; FILTER BANDWIDTHS; HIGHER FREQUENCIES; MEL FILTER BANK; PERFORMANCE DEGRADATIONS; RECOGNITION PERFORMANCE; SPECTRAL SMOOTHING; VOCAL-TRACT LENGTH NORMALIZATION;

BANDWIDTH; DEGRADATION; PHOTODEGRADATION; SPEECH ANALYSIS; SPEECH RECOGNITION; TELECOMMUNICATION SYSTEMS; TELEPHONE; TELEPHONE SETS;

FILTER BANKS;

EID: 63049121231 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2007.906194 Document Type: Article

Times cited : (30)

References (26)

1
- 0141629128
- Experiments in vocal tract normalization
- A. Andreou, T. Kamm, and J. Cohen, "Experiments in vocal tract normalization," in Proc. CAIPWorkshop: Frontiers in Speech Recognition II, 1994.
- (1994) Proc. CAIPWorkshop: Frontiers in Speech Recognition II
- Andreou, A.¹ Kamm, T.² Cohen, J.³

2
- 0031647824
- Frequency warping approach to speaker normalization
- Jan
- L. Lee and R. Rose, "Frequency warping approach to speaker normalization," IEEE Trans. Speech, Audio Process., vol. 6, no. 1, pp. 49-59, Jan. 1998.
- (1998) IEEE Trans. Speech, Audio Process , vol.6 , Issue.1 , pp. 49-59
- Lee, L.¹ Rose, R.²

3
- 0036475971
- Creating conversational interfaces for children
- Jan
- A. Potamianos and S. Narayanan, "Creating conversational interfaces for children," IEEE Trans. Speech, Audio Process., vol. 10, no. 1, pp. 65-78, Jan. 2002.
- (2002) IEEE Trans. Speech, Audio Process , vol.10 , Issue.1 , pp. 65-78
- Potamianos, A.¹ Narayanan, S.²

4
- 0347338002
- Robust recognition of children's speech
- Nov
- A. Potamianos and S. Narayanan, "Robust recognition of children's speech," IEEE Trans. Speech, Audio Process., vol. 11, no. 6, pp. 603-616, Nov. 2003.
- (2003) IEEE Trans. Speech, Audio Process , vol.11 , Issue.6 , pp. 603-616
- Potamianos, A.¹ Narayanan, S.²

5
- 0141702066
- Investigating recognition of children's speech
- Hong Kong, Apr
- D. Giuliani and M. Geroso, "Investigating recognition of children's speech," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Hong Kong, Apr. 2003, pp. 137-140.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , pp. 137-140
- Giuliani, D.¹ Geroso, M.²

6
- 33947653242
- Comparing speech recognition for adults and children
- Stockholm, Sweden
- M. Blomberg and D. Elenius, "Comparing speech recognition for adults and children," in Proc. Fonetik, Stockholm, Sweden, 2004, pp. 156-159.
- (2004) Proc. Fonetik , pp. 156-159
- Blomberg, M.¹ Elenius, D.²

7
- 0029747582
- A study of speech recognition for children and the elderly
- May
- J. G. Wilpon and C. N. Jacobsen, "A study of speech recognition for children and the elderly," in Proc. IEEE Int. Conf. Acoustic, Speech, Signal Process., May 1996, vol. 1, pp. 349-352.
- (1996) Proc. IEEE Int. Conf. Acoustic, Speech, Signal Process , vol.1 , pp. 349-352
- Wilpon, J.G.¹ Jacobsen, C.N.²

8
- 0030375265
- Rapid unsupervised adaptation to children's speech on a connected-digit task
- Philadelphia, PA
- D. C. Burnett and M. Fanty, "Rapid unsupervised adaptation to children's speech on a connected-digit task," in Proc. Int. Conf. Spoken Lang. Process., Philadelphia, PA, 1996, pp. 1145-1148.
- (1996) Proc. Int. Conf. Spoken Lang. Process , pp. 1145-1148
- Burnett, D.C.¹ Fanty, M.²

9
- 0031644298
- Improvements in children's speech recognition performance
- Seattle, WA, May
- S. Das, D. Nix, and M. Picheny, "Improvements in children's speech recognition performance," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Seattle, WA, May 1998, pp. 433-436.
- (1998) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , pp. 433-436
- Das, S.¹ Nix, D.² Picheny, M.³

10
- 0032761999
- Scale transform in speech analysis
- Jan
- S. Umesh, L. Cohen, N. Marinovic, and D. Nelson, "Scale transform in speech analysis," IEEE Trans. Speech, Audio Process., vol. 1, no. 1, pp. 40-45, Jan. 1999.
- (1999) IEEE Trans. Speech, Audio Process , vol.1 , Issue.1 , pp. 40-45
- Umesh, S.¹ Cohen, L.² Marinovic, N.³ Nelson, D.⁴

11
- 0036293694
- Non-uniform scaling based speaker normalization
- Orlando, FL, May
- R. Sinha and S. Umesh, "Non-uniform scaling based speaker normalization," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Orlando, FL, May 2002, vol. 1, pp. 589-592.
- (2002) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 589-592
- Sinha, R.¹ Umesh, S.²

12
- 4544318501
- An investigation into front-end signal processing for speaker normalization
- Montreal, QC, Canada
- S. Umesh, R. Sinha, and S. V. B. Kumar, "An investigation into front-end signal processing for speaker normalization," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, 2004, pp. 345-348.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , pp. 345-348
- Umesh, S.¹ Sinha, R.² Kumar, S.V.B.³

13
- 0032969462
- Acoustic of children's speech: Developmental changes of temporal and spectral parameters
- S. Lee, A. Potamianos, and S. Narayanan, "Acoustic of children's speech: Developmental changes of temporal and spectral parameters," J. Acoust. Soc. Amer., vol. 105, no. 3, pp. 1455-1468, 1999.
- (1999) J. Acoust. Soc. Amer , vol.105 , Issue.3 , pp. 1455-1468
- Lee, S.¹ Potamianos, A.² Narayanan, S.³

14
- 75149166713
- A non-uniform vowel normalization
- G. Fant, "A non-uniform vowel normalization," STL-QPSR, no. 2-3, pp. 1-19, 1975.
- (1975) STL-QPSR , Issue.2-3 , pp. 1-19
- Fant, G.¹

15
- 0038676741
- Methods of measuring vowel formant bandwidths
- H. K. Dunn, "Methods of measuring vowel formant bandwidths," J. Acoust. Soc. Amer., vol. 33, no. 12, pp. 1737-1746, 1961.
- (1961) J. Acoust. Soc. Amer , vol.33 , Issue.12 , pp. 1737-1746
- Dunn, H.K.¹

16
- 33646610836
- Formant bandwidth data
- G. Fant, "Formant bandwidth data," STL-QPSR, no. 1, pp. 1-2, 1962.
- (1962) STL-QPSR , Issue.1 , pp. 1-2
- Fant, G.¹

17
- 30244446534
- Vocal tract wall effects, losses, and resonance bandwidths
- G. Fant, "Vocal tract wall effects, losses, and resonance bandwidths," STL-QPSR, no. 2-3, pp. 28-58, 1972.
- (1972) STL-QPSR , Issue.2-3 , pp. 28-58
- Fant, G.¹

18
- 0015008011
- Sweep-tone measurements of vocaltract characteristics
- O. Fujimura and J. Lindqvist, "Sweep-tone measurements of vocaltract characteristics," J. Acoust. Soc. Amer., vol. 49, pp. 541-558, 1971.
- (1971) J. Acoust. Soc. Amer , vol.49 , pp. 541-558
- Fujimura, O.¹ Lindqvist, J.²

19
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuous spoken sentences
- Aug
- S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuous spoken sentences," IEEE Trans. Acoust., Speech Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
- (1980) IEEE Trans. Acoust., Speech Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

20
- 0020186727
- Spectral estimation using combined time and lag weighting
- Sep
- A. H. Nuttall and G. C. Carter, "Spectral estimation using combined time and lag weighting," Proc. IEEE, vol. 70, no. 9, pp. 1115-1125, Sep. 1982.
- (1982) Proc. IEEE , vol.70 , Issue.9 , pp. 1115-1125
- Nuttall, A.H.¹ Carter, G.C.²

21
- 33745196139
- Front-end signal processing for speaker-normalization,
- Ph.D. dissertation, Indian Inst. Technol, Kanpur, India
- R. Sinha, "Front-end signal processing for speaker-normalization, " Ph.D. dissertation, Indian Inst. Technol., Kanpur, India, 2004.
- (2004)
- Sinha, R.¹

22
- 0036753897
- Speaker adaptive modeling by vocal tract normalization
- Sep
- L. Welling, H. Ney, and S. Kanthak, "Speaker adaptive modeling by vocal tract normalization," IEEE Trans. Speech, Audio Process., vol. 10, no. 6, pp. 415-426, Sep. 2002.
- (2002) IEEE Trans. Speech, Audio Process , vol.10 , Issue.6 , pp. 415-426
- Welling, L.¹ Ney, H.² Kanthak, S.³

23
- 0028934605
- Formant frequency values of vowels produced by preadolescent boys and girls
- P. A. Busby and G. L. Plant, "Formant frequency values of vowels produced by preadolescent boys and girls," J. Acoust. Soc. Amer., vol. 97, no. 4, pp. 2603-2606, 1995.
- (1995) J. Acoust. Soc. Amer , vol.97 , Issue.4 , pp. 2603-2606
- Busby, P.A.¹ Plant, G.L.²

24
- 4544247329
- An improved correction formula for the estimation of harmonic magnitudes and its application to open quotient estimation
- Montreal, QC, Canada, May
- M. Iseli and A. Alwan, "An improved correction formula for the estimation of harmonic magnitudes and its application to open quotient estimation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, May 2004, pp. 669-672.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , pp. 669-672
- Iseli, M.¹ Alwan, A.²

25
- 4544357742
- Formant diphone parameter extraction utilizing a labeled single-speaker database
- Sydney, Australia
- R. H. Mannell, "Formant diphone parameter extraction utilizing a labeled single-speaker database," in Proc. Int. Conf. Spoken Lang. Process., Sydney, Australia, 1998, pp. 2003-2006.
- (1998) Proc. Int. Conf. Spoken Lang. Process , pp. 2003-2006
- Mannell, R.H.¹

26
- 0034944595
- Sex-specific fundamental and formant frequency patterns in a cross-sectional study
- S. P. Whiteside, "Sex-specific fundamental and formant frequency patterns in a cross-sectional study," J. Acoust. Soc. Amer., vol. 110, no. 1, pp. 464-478, 2001.
- (2001) J. Acoust. Soc. Amer , vol.110 , Issue.1 , pp. 464-478
- Whiteside, S.P.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.