메뉴 건너뛰기




Volumn 51, Issue 1, 2009, Pages 58-75

Perceptual features for automatic speech recognition in noisy environments

Author keywords

Auditory system; Automatic speech recognition; Hidden Markov model; Perceptual features; Synaptic adaptation; Two tone suppression

Indexed keywords

BANDPASS FILTERS; CROSSINGS (PIPE AND CABLE); DEGRADATION; GAUSSIAN NOISE (ELECTRONIC); HIDDEN MARKOV MODELS; HIGH PASS FILTERS; IIR FILTERS; IMPULSE RESPONSE; MARKOV PROCESSES; POLARIZATION; SENSOR NETWORKS; SPEECH; SPEECH ANALYSIS; TRELLIS CODES; WAVE FILTERS; WHITE NOISE;

EID: 55049112969     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2008.06.002     Document Type: Article
Times cited : (32)

References (38)
  • 1
    • 0032716023 scopus 로고    scopus 로고
    • Abdelatty, A.M., Spiegel, J.V., Mueller, P., Haentjens, G., Berman, J., 1999. An acoustic-phonetic feature-based system for automatic phoneme recognition in continuous speech. In: Proc. IEEE Intertnat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 1999).
    • Abdelatty, A.M., Spiegel, J.V., Mueller, P., Haentjens, G., Berman, J., 1999. An acoustic-phonetic feature-based system for automatic phoneme recognition in continuous speech. In: Proc. IEEE Intertnat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 1999).
  • 2
    • 55049087283 scopus 로고    scopus 로고
    • Bloomberg, M., Carlson, R., Elenius, K., Granstrom, B., 1984. Auditory models and isolated word recognition. Q Prog. Statist. Rep., Speech Transmiss. Lab., Royal Institute of Technology, Stockholm, pp. 1-15.
    • Bloomberg, M., Carlson, R., Elenius, K., Granstrom, B., 1984. Auditory models and isolated word recognition. Q Prog. Statist. Rep., Speech Transmiss. Lab., Royal Institute of Technology, Stockholm, pp. 1-15.
  • 3
    • 0024392496 scopus 로고
    • Application of an auditory model to speech recognition
    • Cohen J.R. Application of an auditory model to speech recognition. J. Acoust. Soc. Amer. 85 6 (1989) 2623-2629
    • (1989) J. Acoust. Soc. Amer. , vol.85 , Issue.6 , pp. 2623-2629
    • Cohen, J.R.1
  • 4
    • 0004399776 scopus 로고
    • An audio noise reduction system
    • Dolby R. An audio noise reduction system. J. Audio Eng. Soc. 15 4 (1967)
    • (1967) J. Audio Eng. Soc. , vol.15 , Issue.4
    • Dolby, R.1
  • 6
    • 0141703354 scopus 로고    scopus 로고
    • Gajić, B., Paliwal, K.K., 2003. Robust speech recognition using features based on zero-crossings with peak amplitudes. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 2003).
    • Gajić, B., Paliwal, K.K., 2003. Robust speech recognition using features based on zero-crossings with peak amplitudes. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 2003).
  • 7
    • 0024909979 scopus 로고    scopus 로고
    • Gillick, L., Cox, S.J., 1989. Some statistical issues in the comparison of speech recognition algorithms. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 1989).
    • Gillick, L., Cox, S.J., 1989. Some statistical issues in the comparison of speech recognition algorithms. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 1989).
  • 8
    • 0028312802 scopus 로고
    • Auditory models and human performance in tasks related to speech coding and speech recognition
    • Ghitza O. Auditory models and human performance in tasks related to speech coding and speech recognition. IEEE Trans. Speech Audio Process. 2 1 (1988) 115-132
    • (1988) IEEE Trans. Speech Audio Process. , vol.2 , Issue.1 , pp. 115-132
    • Ghitza, O.1
  • 9
    • 33646758174 scopus 로고    scopus 로고
    • Ghulam, M., Fukuda, T., Horikawa, J., Nitta, T., 2005. Pitch-synchronous ZCPA (PS-ZCPA)-based feature extraction with auditory masking. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 2005).
    • Ghulam, M., Fukuda, T., Horikawa, J., Nitta, T., 2005. Pitch-synchronous ZCPA (PS-ZCPA)-based feature extraction with auditory masking. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 2005).
  • 10
    • 28844443495 scopus 로고    scopus 로고
    • Grayden, D.B., Burkitt, A.N., Kenny, O.P., Clarey, J.N., Paolini, A.G., Clark, G.M., 2004. A cochlear implant speech processing strategy based on an auditory model. In: Internat. Conf. on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP 2004), Melbourne, pp. 491-496.
    • Grayden, D.B., Burkitt, A.N., Kenny, O.P., Clarey, J.N., Paolini, A.G., Clark, G.M., 2004. A cochlear implant speech processing strategy based on an auditory model. In: Internat. Conf. on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP 2004), Melbourne, pp. 491-496.
  • 11
    • 34547543044 scopus 로고    scopus 로고
    • Haque, S., Togneri, R., Zaknich, A., 2007. A temporal auditory model with adaptation for automatic speech recognition. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 2007).
    • Haque, S., Togneri, R., Zaknich, A., 2007. A temporal auditory model with adaptation for automatic speech recognition. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 2007).
  • 14
    • 55049104587 scopus 로고    scopus 로고
    • Hermansky, H., 1994. Speech beyond 10 ms (temporal filtering in feature domain). In: Proc. Internat. Workshop on Human Interface Technology, Aizu, Japan.
    • Hermansky, H., 1994. Speech beyond 10 ms (temporal filtering in feature domain). In: Proc. Internat. Workshop on Human Interface Technology, Aizu, Japan.
  • 15
    • 33744994972 scopus 로고    scopus 로고
    • Automatic speech recognition with an adaptation model motivated by auditory processing
    • Holmberg M., Gelbart D., and Hemmert W. Automatic speech recognition with an adaptation model motivated by auditory processing. IEEE Trans. Audio, Speech, Lang. Process. 14 1 (2006) 44-49
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.1 , pp. 44-49
    • Holmberg, M.1    Gelbart, D.2    Hemmert, W.3
  • 17
    • 0029378047 scopus 로고
    • Two-tone suppression in a cochlear model
    • Kates J.M. Two-tone suppression in a cochlear model. IEEE Trans. Speech Audio Process. 3 5 (1995) 396-406
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 396-406
    • Kates, J.M.1
  • 18
    • 0022806994 scopus 로고
    • Spectral analysis and discrimination by zero-crossings
    • Kedem B. Spectral analysis and discrimination by zero-crossings. Proc. IEEE 74 11 (1986) 1477-1493
    • (1986) Proc. IEEE , vol.74 , Issue.11 , pp. 1477-1493
    • Kedem, B.1
  • 19
    • 0032785783 scopus 로고    scopus 로고
    • Auditory processing of speech signals for robust speech recognition in real world noisy environments
    • Kim D.S., Lee S.Y., and Kil R.M. Auditory processing of speech signals for robust speech recognition in real world noisy environments. IEEE Trans. Speech and Audio Process. 7 1 (1999) 55-69
    • (1999) IEEE Trans. Speech and Audio Process. , vol.7 , Issue.1 , pp. 55-69
    • Kim, D.S.1    Lee, S.Y.2    Kil, R.M.3
  • 21
    • 0030701415 scopus 로고    scopus 로고
    • Loughlin, P., Groutage, D., Rohrbaugh, R., 1997. Time-frequency analysis of acoustic transients. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 1997).
    • Loughlin, P., Groutage, D., Rohrbaugh, R., 1997. Time-frequency analysis of acoustic transients. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 1997).
  • 23
    • 0022624057 scopus 로고
    • Simulation of mechanical to neural transduction in the auditory receptor
    • Meddis R. Simulation of mechanical to neural transduction in the auditory receptor. J. Acoust. Soc. Amer. 79 3 (1988) 702-711
    • (1988) J. Acoust. Soc. Amer. , vol.79 , Issue.3 , pp. 702-711
    • Meddis, R.1
  • 24
    • 0020816083 scopus 로고
    • Suggested formulae for calculating auditory-filter bandwidths and excitation patterns
    • Moore B.C.J., and Glasberg B.R. Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. J. Acoust. Soc. Amer. 74 3 (1983) 750-753
    • (1983) J. Acoust. Soc. Amer. , vol.74 , Issue.3 , pp. 750-753
    • Moore, B.C.J.1    Glasberg, B.R.2
  • 25
    • 0035125936 scopus 로고    scopus 로고
    • Forward masking: adaptation or integration?
    • Oxenham A.J. Forward masking: adaptation or integration?. J. Acoust. Soc. Amer. 109 (2001) 732-741
    • (2001) J. Acoust. Soc. Amer. , vol.109 , pp. 732-741
    • Oxenham, A.J.1
  • 26
    • 0030244273 scopus 로고    scopus 로고
    • Time-frequency analysis and auditory modeling for automatic recognition of speech
    • Pitton J.W., Wang K., and Juang B. Time-frequency analysis and auditory modeling for automatic recognition of speech. Proc. IEEE 84 9 (1996) 1199-1215
    • (1996) Proc. IEEE , vol.84 , Issue.9 , pp. 1199-1215
    • Pitton, J.W.1    Wang, K.2    Juang, B.3
  • 27
    • 55049120604 scopus 로고    scopus 로고
    • Time-interval information in the auditory representation of speech sounds
    • Patterson R.D. Time-interval information in the auditory representation of speech sounds. J. Acoust. Soc. Amer. 105 2 (1999) 1305
    • (1999) J. Acoust. Soc. Amer. , vol.105 , Issue.2 , pp. 1305
    • Patterson, R.D.1
  • 28
    • 0018076531 scopus 로고
    • Some observations on cochlear mechanics
    • Rhode W.S. Some observations on cochlear mechanics. J. Acoust. Soc. Amer. 64 1 (1978) 158-176
    • (1978) J. Acoust. Soc. Amer. , vol.64 , Issue.1 , pp. 158-176
    • Rhode, W.S.1
  • 29
    • 0034710868 scopus 로고    scopus 로고
    • Mechanical bases of frequency tuning and neural excitation at the base of the cochlea
    • Ruggero M.A., Narayan S.S., Temchin A.N., and Recio A. Mechanical bases of frequency tuning and neural excitation at the base of the cochlea. Proc. Natl. Acad. Sci. USA 97 (2000) 11744-11750
    • (2000) Proc. Natl. Acad. Sci. USA , vol.97 , pp. 11744-11750
    • Ruggero, M.A.1    Narayan, S.S.2    Temchin, A.N.3    Recio, A.4
  • 30
    • 0020579183 scopus 로고
    • Auditory nerve representation of vowels in background noise
    • Sachs M.B., Voigt H.F., and Young E.D. Auditory nerve representation of vowels in background noise. J. Neurophysiol. 50 1 (1983)
    • (1983) J. Neurophysiol. , vol.50 , Issue.1
    • Sachs, M.B.1    Voigt, H.F.2    Young, E.D.3
  • 31
    • 84928837806 scopus 로고
    • A joint synchrony/mean-rate model of auditory processing
    • Seneff S. A joint synchrony/mean-rate model of auditory processing. J. Phonet. 85 1 (1988) 55-76
    • (1988) J. Phonet. , vol.85 , Issue.1 , pp. 55-76
    • Seneff, S.1
  • 32
    • 0016439358 scopus 로고
    • Short-term adaptation and incremental responses of single auditory-nerve fibres
    • Smith R., and Zwislocki J.J. Short-term adaptation and incremental responses of single auditory-nerve fibres. Biol. Cyber. 17 (1975) 169-182
    • (1975) Biol. Cyber. , vol.17 , pp. 169-182
    • Smith, R.1    Zwislocki, J.J.2
  • 33
    • 55049129276 scopus 로고    scopus 로고
    • Spoor, A., Eggermont, J.J., Odenthal, D.W., 1976. Comparison of human and animal data concerning adaptation and masking of eighth nerve compound action potential. In: Ruber, J., Elberling, C., Solomon, G. (Eds.), Electrocochleography, Baltimore, University Park, MD, pp. 183-198.
    • Spoor, A., Eggermont, J.J., Odenthal, D.W., 1976. Comparison of human and animal data concerning adaptation and masking of eighth nerve compound action potential. In: Ruber, J., Elberling, C., Solomon, G. (Eds.), Electrocochleography, Baltimore, University Park, MD, pp. 183-198.
  • 34
    • 0031238095 scopus 로고    scopus 로고
    • A model of dynamic auditory perception and its application to robust word recognition
    • Strope B., and Alwan A. A model of dynamic auditory perception and its application to robust word recognition. IEEE Trans. Speech Audio Process. 95 5 (1997) 451-464
    • (1997) IEEE Trans. Speech Audio Process. , vol.95 , Issue.5 , pp. 451-464
    • Strope, B.1    Alwan, A.2
  • 36
    • 0032828464 scopus 로고    scopus 로고
    • A model of auditory perception as front-end for automatic speech recognition
    • Tchorz J., and Kollmeier B. A model of auditory perception as front-end for automatic speech recognition. J. Acoust. Soc. Amer. 106 4 (1999)
    • (1999) J. Acoust. Soc. Amer. , vol.106 , Issue.4
    • Tchorz, J.1    Kollmeier, B.2
  • 37
    • 14644431758 scopus 로고    scopus 로고
    • A bio-inspired companding strategy for spectral enhancement
    • Turicchia L., and Sarpeshkar R. A bio-inspired companding strategy for spectral enhancement. IEEE Trans. Speech Audio Process. 13 2 (2005) 243-253
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.2 , pp. 243-253
    • Turicchia, L.1    Sarpeshkar, R.2
  • 38
    • 0021207990 scopus 로고
    • Rapid and short-term adaptation in auditory nerve responses
    • Westerman L., and Smith R.L. Rapid and short-term adaptation in auditory nerve responses. Hearing Res. 15 (1984) 249-260
    • (1984) Hearing Res. , vol.15 , pp. 249-260
    • Westerman, L.1    Smith, R.L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.