메뉴 건너뛰기




Volumn 39, Issue 1-2, 2003, Pages 47-63

Sub-band SNR estimation using auditory feature processing

Author keywords

Auditory front end; Neural networks; Sigma pi cells; Situation classification; Sub band SNR estimation

Indexed keywords

ACOUSTIC NOISE; ALGORITHMS; AUDITION; ERROR ANALYSIS; HEARING AIDS; MODULATION; NEURAL NETWORKS; SIGNAL TO NOISE RATIO; SPEECH RECOGNITION;

EID: 0037211087     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-6393(02)00058-4     Document Type: Article
Times cited : (21)

References (32)
  • 1
    • 0028516073 scopus 로고
    • How do humans process and recognize speech
    • Allen J.B. How do humans process and recognize speech. IEEE Trans. Speech Audio Process. 2(4):1994;567-576.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 567-576
    • Allen, J.B.1
  • 2
    • 0029728607 scopus 로고    scopus 로고
    • Adaptive speech enhancement using frequency-specific SNR estimates
    • Basking Ridge, N.J.
    • Avendano C., Hermansky H., Vis M., Bayya A. Adaptive speech enhancement using frequency-specific SNR estimates. Proc. IEEE IVTTA'96, Basking Ridge, N.J. 1996;65-68.
    • (1996) Proc. IEEE IVTTA'96 , pp. 65-68
    • Avendano, C.1    Hermansky, H.2    Vis, M.3    Bayya, A.4
  • 4
    • 0040290402 scopus 로고    scopus 로고
    • Spectro-temporal modulation transfer functions and speech intelligibility
    • Chi T., Gao Y., Guyton M.C., Ru P., Shamma S. Spectro-temporal modulation transfer functions and speech intelligibility. J. Acoust. Soc. Amer. 106(5):1999;2719-2732.
    • (1999) J. Acoust. Soc. Amer. , vol.106 , Issue.5 , pp. 2719-2732
    • Chi, T.1    Gao, Y.2    Guyton, M.C.3    Ru, P.4    Shamma, S.5
  • 5
    • 0029952425 scopus 로고    scopus 로고
    • A quantitative model of the "effective" signal processing in the auditory system: I. Model structure
    • Dau T., Püschel D., Kohlrausch A. A quantitative model of the "effective" signal processing in the auditory system: I. Model structure. J. Acoust. Soc. Amer. 99:1996;3615-3622.
    • (1996) J. Acoust. Soc. Amer. , vol.99 , pp. 3615-3622
    • Dau, T.1    Püschel, D.2    Kohlrausch, A.3
  • 6
    • 0030691985 scopus 로고    scopus 로고
    • Modeling auditory processing of amplitude modulation: I. Modulation detection and masking with narrowband carriers
    • Dau T., Kollmeier B., Kohlrausch A. Modeling auditory processing of amplitude modulation: I. Modulation detection and masking with narrowband carriers. J. Acoust. Soc. Amer. 102(2):1997;2892-2905.
    • (1997) J. Acoust. Soc. Amer. , vol.102 , Issue.2 , pp. 2892-2905
    • Dau, T.1    Kollmeier, B.2    Kohlrausch, A.3
  • 7
    • 0032577379 scopus 로고    scopus 로고
    • Optimizing sound features for cortical neurons
    • deCharms R.C., Blake D.T., Merzenich M.M. Optimizing sound features for cortical neurons. Science. 280:1998;1439-1443.
    • (1998) Science , vol.280 , pp. 1439-1443
    • DeCharms, R.C.1    Blake, D.T.2    Merzenich, M.M.3
  • 9
    • 0026368274 scopus 로고
    • Fast algorithms to find invariant features for a word recognizing neural net
    • Bournemouth
    • Gramß T. Fast algorithms to find invariant features for a word recognizing neural net. IEEE 2nd Internat. Conf. on Artificial Neural Networks, Bournemouth. 1991;180-184.
    • (1991) IEEE 2nd Internat. Conf. on Artificial Neural Networks , pp. 180-184
    • Gramß, T.1
  • 10
    • 0025383284 scopus 로고
    • Recognition of isolated words based on psychoacoustics and neurobiology
    • Gramß T., Strube H.W. Recognition of isolated words based on psychoacoustics and neurobiology. Speech Communication. 9:1990;35-40.
    • (1990) Speech Communication , vol.9 , pp. 35-40
    • Gramß, T.1    Strube, H.W.2
  • 11
    • 0028543366 scopus 로고
    • Training feedforward networks with the Marquardt algorithm
    • Hagan M.T., Menhaj M. Training feedforward networks with the Marquardt algorithm. IEEE Trans. Neural Networks. 5(6):1994;989-993.
    • (1994) IEEE Trans. Neural Networks , vol.5 , Issue.6 , pp. 989-993
    • Hagan, M.T.1    Menhaj, M.2
  • 12
    • 0033729018 scopus 로고    scopus 로고
    • Objective modeling of speech quality with a psychoacoustically validated auditory model
    • Hansen M., Kollmeier B. Objective modeling of speech quality with a psychoacoustically validated auditory model. J. Audio Eng. Soc. 48(5):2000;395-409.
    • (2000) J. Audio Eng. Soc. , vol.48 , Issue.5 , pp. 395-409
    • Hansen, M.1    Kollmeier, B.2
  • 14
    • 85009254284 scopus 로고    scopus 로고
    • TRAPS - Classifiers of temporal patterns
    • Hermansky H., Sharma S. TRAPS - Classifiers of temporal patterns. Proc. ICSLP'98. Vol. 3:1998;1003-1006.
    • (1998) Proc. ICSLP'98 , vol.3 , pp. 1003-1006
    • Hermansky, H.1    Sharma, S.2
  • 17
    • 0011729005 scopus 로고    scopus 로고
    • Frequency analysis and synthesis using a gammatone filterbank
    • May/June
    • Hohmann, V., 2002. Frequency analysis and synthesis using a gammatone filterbank. Acta Acustica united with Acustica, no. 3, May/June, pp. 433-442.
    • (2002) Acta Acustica united with Acustica , vol.3 , pp. 433-442
    • Hohmann, V.1
  • 19
    • 0032136330 scopus 로고    scopus 로고
    • Robust speech recognition using the modulation spectrogram
    • Kingsbury B., Morgan N., Greenberg S. Robust speech recognition using the modulation spectrogram. Speech Communication. 25(1):1998;117-132.
    • (1998) Speech Communication , vol.25 , Issue.1 , pp. 117-132
    • Kingsbury, B.1    Morgan, N.2    Greenberg, S.3
  • 20
    • 0011791859 scopus 로고    scopus 로고
    • Perzeptive vorverarbeitung und automatische selektion sekundärer merkmale zur robusten spracherkennung
    • Oldenburg: DEGA
    • Kleinschmidt M., Hohmann V. Perzeptive Vorverarbeitung und automatische Selektion sekundärer Merkmale zur robusten Spracherkennung. Fortschritte der Akustik - DAGA 2000. 2000;382-383 DEGA, Oldenburg.
    • (2000) Fortschritte der Akustik - DAGA 2000 , pp. 382-383
    • Kleinschmidt, M.1    Hohmann, V.2
  • 21
    • 0034824912 scopus 로고    scopus 로고
    • Combining speech enhancement and auditory feature extraction for robust speech recognition
    • special issue on Robust ASR
    • Kleinschmidt M., Tchorz J., Kollmeier B. Combining speech enhancement and auditory feature extraction for robust speech recognition. Speech Communication. 34:2001;75-91. (special issue on Robust ASR).
    • (2001) Speech Communication , vol.34 , pp. 75-91
    • Kleinschmidt, M.1    Tchorz, J.2    Kollmeier, B.3
  • 23
    • 0028297185 scopus 로고
    • Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction
    • Kollmeier B., Koch R. Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction. J. Acoust. Soc. Amer. 95(3):1994;1593-1602.
    • (1994) J. Acoust. Soc. Amer. , vol.95 , Issue.3 , pp. 1593-1602
    • Kollmeier, B.1    Koch, R.2
  • 24
    • 85135379452 scopus 로고
    • An efficient algorithm to estimate the instantaneous SNR of speech signals
    • ESCA
    • Martin R. An efficient algorithm to estimate the instantaneous SNR of speech signals. Proc. Eurospeech. 1993;1093-1096 ESCA.
    • (1993) Proc. Eurospeech , pp. 1093-1096
    • Martin, R.1
  • 25
    • 0020816083 scopus 로고
    • Suggested formulae for calculating auditory-filter bandwidths and excitation patterns
    • Moore B.C.J., Glasberg B.R. Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. J. Acoust. Soc. Amer. 74:1983;750-753.
    • (1983) J. Acoust. Soc. Amer. , vol.74 , pp. 750-753
    • Moore, B.C.J.1    Glasberg, B.R.2
  • 28
    • 0034832359 scopus 로고    scopus 로고
    • Assessing local noise level estimation methods: Application to noise robust ASR
    • Ris C., Dupont S. Assessing local noise level estimation methods: application to noise robust ASR. Speech Communication. 34:2001;141-158.
    • (2001) Speech Communication , vol.34 , pp. 141-158
    • Ris, C.1    Dupont, S.2
  • 29
    • 0032828464 scopus 로고    scopus 로고
    • A model of the auditory perception as front end for automatic speech recognition
    • Tchorz J., Kollmeier B. A model of the auditory perception as front end for automatic speech recognition. J. Acoust. Soc. Amer. 106(4):1999a;2040-2050.
    • (1999) J. Acoust. Soc. Amer. , vol.106 , Issue.4 , pp. 2040-2050
    • Tchorz, J.1    Kollmeier, B.2
  • 30
    • 0011765108 scopus 로고    scopus 로고
    • Speech detection and SNR prediction basing on amplitude modulation pattern recognition
    • Budapest, Hungary: ISCA
    • Tchorz J., Kollmeier B. Speech detection and SNR prediction basing on amplitude modulation pattern recognition. Proc. Eurospeech. 1999b;2399-2404 ISCA, Budapest, Hungary.
    • (1999) Proc. Eurospeech , pp. 2399-2404
    • Tchorz, J.1    Kollmeier, B.2
  • 31
    • 0036722886 scopus 로고    scopus 로고
    • Estimation of the signal-to-noise ratio with amplitude modulation spectrograms
    • Tchorz, J., Kollmeier, B., 2002. Estimation of the signal-to-noise ratio with amplitude modulation spectrograms. Speech Communication 38, 1-17.
    • (2002) Speech Communication , vol.38 , pp. 1-17
    • Tchorz, J.1    Kollmeier, B.2
  • 32
    • 84898984996 scopus 로고    scopus 로고
    • Noise suppression based on neurophysiologically-motivated SNR estimation for robust speech recognition
    • Leen, T.K., Dietterich, T.G., Tresp, V. MIT Press
    • Tchorz J., Kleinschmidt M., Kollmeier B. Noise suppression based on neurophysiologically-motivated SNR estimation for robust speech recognition. Leen T.K., Dietterich T.G., Tresp V. Advances in Neural Information Processing Systems 13 - NIPS 2000. 2001;821-827 MIT Press.
    • (2001) Advances in Neural Information Processing Systems 13 - NIPS 2000 , pp. 821-827
    • Tchorz, J.1    Kleinschmidt, M.2    Kollmeier, B.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.