메뉴 건너뛰기




Volumn , Issue , 2009, Pages 2987-2990

Auditory model based optimization of MFCCs improves automatic speech recognition performance

Author keywords

ASR; Auditory model; MFCC

Indexed keywords

AUDITORY MODELS; AUTOMATIC SPEECH RECOGNITION; ENVIRONMENTAL CONDITIONS; FEATURE DOMAIN; HUMAN AUDITORY SYSTEM; LOCAL GEOMETRY; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; RECOGNITION PERFORMANCE; SPEECH RECOGNITION SYSTEMS;

EID: 70450221097     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (9)

References (18)
  • 1
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug
    • S.B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Proc., vol. 28, No. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Proc , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 2
    • 0024392496 scopus 로고
    • Application of an auditory model to speech recognition
    • June
    • J.R. Cohen, "Application of an auditory model to speech recognition," J. Acoust. Soc. Amer., pp. 2623-2629, Vol. 85 (6), June 1989.
    • (1989) J. Acoust. Soc. Amer , vol.85 , Issue.6 , pp. 2623-2629
    • Cohen, J.R.1
  • 3
    • 0028312802 scopus 로고
    • Auditory models and human performance in tasks related to speech coding and speech recognition
    • Jan
    • O. Ghitza, "Auditory models and human performance in tasks related to speech coding and speech recognition," IEEE Trans. Speech, Audio Proc., vol. 2, No. 1, pp. 115-132, Jan 1994.
    • (1994) IEEE Trans. Speech, Audio Proc , vol.2 , Issue.1 , pp. 115-132
    • Ghitza, O.1
  • 4
    • 0031238095 scopus 로고    scopus 로고
    • A model of dynamic auditory perception and its application to robust word recognition
    • Sept
    • B. Strope and A. Alwan, "A model of dynamic auditory perception and its application to robust word recognition," IEEE Trans. Speech, Audio Proc., vol. 5, No. 5, pp. 451-464, Sept. 1997.
    • (1997) IEEE Trans. Speech, Audio Proc , vol.5 , Issue.5 , pp. 451-464
    • Strope, B.1    Alwan, A.2
  • 5
    • 0032785783 scopus 로고    scopus 로고
    • Auditory processing of speech signals for robust speech recognition in real-world noisy environments
    • Jan
    • D.S. Kim, S.Y. Lee and R.M. Kil, "Auditory processing of speech signals for robust speech recognition in real-world noisy environments," IEEE Trans. Speech, Audio Proc., vol. 7, No. 1, pp. 55-69, Jan 1999.
    • (1999) IEEE Trans. Speech, Audio Proc , vol.7 , Issue.1 , pp. 55-69
    • Kim, D.S.1    Lee, S.Y.2    Kil, R.M.3
  • 6
    • 0032828464 scopus 로고    scopus 로고
    • A model of auditory perception as front end for automatic speech recognition
    • Oct
    • J. Tchorz and B. Kollmeier, "A model of auditory perception as front end for automatic speech recognition," J. Acoust. Soc. Amer., pp. 2040-2050, Vol. 106 (4), Oct. 1999.
    • (1999) J. Acoust. Soc. Amer , vol.106 , Issue.4 , pp. 2040-2050
    • Tchorz, J.1    Kollmeier, B.2
  • 7
    • 33744994972 scopus 로고    scopus 로고
    • Automatic speech recognition with an adaptation model motivated by auditory processing
    • Jan
    • M. Holmberg, D. Gelbart and W. Hemmert, "Automatic speech recognition with an adaptation model motivated by auditory processing," IEEE Trans. Speech, Audio Proc., vol. 14, No. 1, pp. 43-49, Jan 2006.
    • (2006) IEEE Trans. Speech, Audio Proc , vol.14 , Issue.1 , pp. 43-49
    • Holmberg, M.1    Gelbart, D.2    Hemmert, W.3
  • 8
    • 84928837806 scopus 로고
    • A joint synchrony/mean-rate model of auditory processing
    • Jan
    • S. Seneff, "A joint synchrony/mean-rate model of auditory processing," J. Phonet., pp. 55-76, Vol. 85 (1), Jan 1988.
    • (1988) J. Phonet , vol.85 , Issue.1 , pp. 55-76
    • Seneff, S.1
  • 9
    • 0022624057 scopus 로고
    • Simulation of mechanical to neural transduction in the auditory receptor
    • March
    • R. Meddis, "Simulation of mechanical to neural transduction in the auditory receptor," J. Acoust. Soc. Amer., pp. 702-711, Vol. 79 (3), March 1988.
    • (1988) J. Acoust. Soc. Amer , vol.79 , Issue.3 , pp. 702-711
    • Meddis, R.1
  • 10
    • 0029378047 scopus 로고
    • Two-tone suppression in a cochlear model
    • Sept
    • J.M. Kates, "Two-tone suppression in a cochlear model," IEEE Trans. Speech, Audio Proc., vol. 3, No. 5, pp. 396-406, Sept. 1995.
    • (1995) IEEE Trans. Speech, Audio Proc , vol.3 , Issue.5 , pp. 396-406
    • Kates, J.M.1
  • 11
    • 0029952425 scopus 로고    scopus 로고
    • A quantitative model of the effective signal processing in the auditory system. I. Model structure
    • Jun
    • T. Dau, D. Puschel, and A. Kohlrausch, "A quantitative model of the effective signal processing in the auditory system. I. Model structure," J. Acoust. Soc. Amer., pp. 3615-3622, Vol. 99 (6), Jun 1996.
    • (1996) J. Acoust. Soc. Amer , vol.99 , Issue.6 , pp. 3615-3622
    • Dau, T.1    Puschel, D.2    Kohlrausch, A.3
  • 12
    • 0035125936 scopus 로고    scopus 로고
    • Forward masking: Adaptation or integration?
    • Feb
    • A.J. Oxenham, "Forward masking: Adaptation or integration?," J. Acoust. Soc. Amer., pp. 732-741, Vol. 109 (2), Feb 2001.
    • (2001) J. Acoust. Soc. Amer , vol.109 , Issue.2 , pp. 732-741
    • Oxenham, A.J.1
  • 14
    • 0029375948 scopus 로고
    • Theoretical analysis of the high-rate vector quantization of LPC parameters
    • Sept
    • W.R. Gardner and B.D. Rao, "Theoretical analysis of the high-rate vector quantization of LPC parameters," IEEE Trans. Speech and Audio Proc., vol. 3, No.5, pp. 367-381, Sept 1995.
    • (1995) IEEE Trans. Speech and Audio Proc , vol.3 , Issue.5 , pp. 367-381
    • Gardner, W.R.1    Rao, B.D.2
  • 15
    • 47649083103 scopus 로고    scopus 로고
    • The sensitivity matrix: Using advanced auditory models in speech and audio processing
    • Jan
    • J.H. Plasberg and W.B. Kleijn, "The sensitivity matrix: using advanced auditory models in speech and audio processing," IEEE Trans. Audio, Speech, Language Proc., vol. 15, No. 1, pp. 310-319, Jan 2007.
    • (2007) IEEE Trans. Audio, Speech, Language Proc , vol.15 , Issue.1 , pp. 310-319
    • Plasberg, J.H.1    Kleijn, W.B.2
  • 16
    • 0027659197 scopus 로고
    • Signal modeling techniques in speech recognition
    • Sept
    • J.W. Picone, "Signal modeling techniques in speech recognition," Proc. IEEE, pp. 1215-1247, Vol. 81, No. 9, Sept. 1993.
    • (1993) Proc. IEEE , vol.81 , Issue.9 , pp. 1215-1247
    • Picone, J.W.1
  • 17
    • 0024768209 scopus 로고
    • Speaker-independent phone recognition using hidden Markov models
    • Nov
    • K.F. Lee and H.W. Hon, "Speaker-independent phone recognition using hidden Markov models," IEEE Trans. Acoust., Speech, Signal Proc., vol. 37, No. 11, pp. 1641-1648, Nov. 1989.
    • (1989) IEEE Trans. Acoust., Speech, Signal Proc , vol.37 , Issue.11 , pp. 1641-1648
    • Lee, K.F.1    Hon, H.W.2
  • 18


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.