



Volume 15, Issue 8, 2007, Pages 2418-2430

A study of filter bank smoothing in MFCC features for recognition of children's speech

Author keywords

Children's speech recognition; Mel filter bank; Vocal tract length normalization

Indexed keywords

BANDWIDTH FILTERS; CENTER FREQUENCIES; CEPSTRAL COEFFICIENTS; CHILDREN'S SPEECH RECOGNITION; DIGIT RECOGNITION; FILTER BANDWIDTHS; HIGHER FREQUENCIES; MEL FILTER BANK; PERFORMANCE DEGRADATIONS; RECOGNITION PERFORMANCE; SPECTRAL SMOOTHING; VOCAL-TRACT LENGTH NORMALIZATION

EID: 63049121231     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2007.906194     Document Type: Article
Times cited : (30)

References (26)
  • 2
    • L. Lee and R. Rose, "Frequency warping approach to speaker normalization," IEEE Trans. Speech, Audio Process., vol. 6, no. 1, pp. 49-59, Jan. 1998.
    • EID: 0031647824
  • 3
    • A. Potamianos and S. Narayanan, "Creating conversational interfaces for children," IEEE Trans. Speech, Audio Process., vol. 10, no. 1, pp. 65-78, Jan. 2002.
    • EID: 0036475971
  • 4
    • A. Potamianos and S. Narayanan, "Robust recognition of children's speech," IEEE Trans. Speech, Audio Process., vol. 11, no. 6, pp. 603-616, Nov. 2003.
    • EID: 0347338002
  • 6
    • M. Blomberg and D. Elenius, "Comparing speech recognition for adults and children," in Proc. Fonetik, Stockholm, Sweden, 2004, pp. 156-159.
    • EID: 33947653242
  • 8
    • D. C. Burnett and M. Fanty, "Rapid unsupervised adaptation to children's speech on a connected-digit task," in Proc. Int. Conf. Spoken Lang. Process., Philadelphia, PA, 1996, pp. 1145-1148.
    • EID: 0030375265
  • 12
    • S. Umesh, R. Sinha, and S. V. B. Kumar, "An investigation into front-end signal processing for speaker normalization," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, 2004, pp. 345-348.
    • EID: 4544318501
  • 13
    • S. Lee, A. Potamianos, and S. Narayanan, "Acoustics of children's speech: Developmental changes of temporal and spectral parameters," J. Acoust. Soc. Amer., vol. 105, no. 3, pp. 1455-1468, 1999.
    • EID: 0032969462
  • 14
    • G. Fant, "A non-uniform vowel normalization," STL-QPSR, no. 2-3, pp. 1-19, 1975.
    • EID: 75149166713
  • 15
    • H. K. Dunn, "Methods of measuring vowel formant bandwidths," J. Acoust. Soc. Amer., vol. 33, no. 12, pp. 1737-1746, 1961.
    • EID: 0038676741
  • 16
    • G. Fant, "Formant bandwidth data," STL-QPSR, no. 1, pp. 1-2, 1962.
    • EID: 33646610836
  • 17
    • G. Fant, "Vocal tract wall effects, losses, and resonance bandwidths," STL-QPSR, no. 2-3, pp. 28-58, 1972.
    • EID: 30244446534
  • 18
    • O. Fujimura and J. Lindqvist, "Sweep-tone measurements of vocal-tract characteristics," J. Acoust. Soc. Amer., vol. 49, pp. 541-558, 1971.
    • EID: 0015008011
  • 19
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuous spoken sentences," IEEE Trans. Acoust., Speech Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
    • EID: 0019053271
  • 20
    • A. H. Nuttall and G. C. Carter, "Spectral estimation using combined time and lag weighting," Proc. IEEE, vol. 70, no. 9, pp. 1115-1125, Sep. 1982.
    • EID: 0020186727
  • 21
    • R. Sinha, "Front-end signal processing for speaker-normalization," Ph.D. dissertation, Indian Inst. Technol., Kanpur, India, 2004.
    • EID: 33745196139
  • 22
    • L. Welling, H. Ney, and S. Kanthak, "Speaker adaptive modeling by vocal tract normalization," IEEE Trans. Speech, Audio Process., vol. 10, no. 6, pp. 415-426, Sep. 2002.
    • EID: 0036753897
  • 23
    • P. A. Busby and G. L. Plant, "Formant frequency values of vowels produced by preadolescent boys and girls," J. Acoust. Soc. Amer., vol. 97, no. 4, pp. 2603-2606, 1995.
    • EID: 0028934605
  • 24
    • M. Iseli and A. Alwan, "An improved correction formula for the estimation of harmonic magnitudes and its application to open quotient estimation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, May 2004, pp. 669-672.
    • EID: 4544247329
  • 25
    • R. H. Mannell, "Formant diphone parameter extraction utilizing a labeled single-speaker database," in Proc. Int. Conf. Spoken Lang. Process., Sydney, Australia, 1998, pp. 2003-2006.
    • EID: 4544357742
  • 26
    • S. P. Whiteside, "Sex-specific fundamental and formant frequency patterns in a cross-sectional study," J. Acoust. Soc. Amer., vol. 110, no. 1, pp. 464-478, 2001.
    • EID: 0034944595


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.