SCOPUS Information Retrieval Platform

Speech Communication

Volume 39, Issue 1-2, 2003, Pages 47-63

Sub-band SNR estimation using auditory feature processing

(2) Kleinschmidt, Michael (a); Hohmann, Volker (a)

a UNIVERSITY OF OLDENBURG (Germany)

Author keywords

Auditory front end; Neural networks; Sigma pi cells; Situation classification; Sub band SNR estimation

Indexed keywords

ACOUSTIC NOISE; ALGORITHMS; AUDITION; ERROR ANALYSIS; HEARING AIDS; MODULATION; NEURAL NETWORKS; SIGNAL TO NOISE RATIO; SPEECH RECOGNITION;

AUTOMATIC SPEECH RECOGNITION (ASR);

ACOUSTICS;

EID: 0037211087 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/S0167-6393(02)00058-4 Document Type: Article

Times cited: (21)

References (32)

1
- 0028516073
- How do humans process and recognize speech
- Allen J.B. How do humans process and recognize speech. IEEE Trans. Speech Audio Process. 2(4):1994;567-576.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 567-576
- Allen, J.B.¹

2
- 0029728607
- Adaptive speech enhancement using frequency-specific SNR estimates
- Basking Ridge, N.J.
- Avendano C., Hermansky H., Vis M., Bayya A. Adaptive speech enhancement using frequency-specific SNR estimates. Proc. IEEE IVTTA'96, Basking Ridge, N.J. 1996;65-68.
- (1996) Proc. IEEE IVTTA'96 , pp. 65-68
- Avendano, C.¹ Hermansky, H.² Vis, M.³ Bayya, A.⁴

3
- 0003039657
- Towards sub-band-based speech recognition
- Trieste
- Bourlard H., Dupont S., Hermansky H., Morgan N. Towards sub-band-based speech recognition. European Signal Proc. Conf., Trieste. 1996;1579-1582.
- (1996) European Signal Proc. Conf. , pp. 1579-1582
- Bourlard, H.¹ Dupont, S.² Hermansky, H.³ Morgan, N.⁴

4
- 0040290402
- Spectro-temporal modulation transfer functions and speech intelligibility
- Chi T., Gao Y., Guyton M.C., Ru P., Shamma S. Spectro-temporal modulation transfer functions and speech intelligibility. J. Acoust. Soc. Amer. 106(5):1999;2719-2732.
- (1999) J. Acoust. Soc. Amer. , vol.106 , Issue.5 , pp. 2719-2732
- Chi, T.¹ Gao, Y.² Guyton, M.C.³ Ru, P.⁴ Shamma, S.⁵

5
- 0029952425
- A quantitative model of the "effective" signal processing in the auditory system: I. Model structure
- Dau T., Püschel D., Kohlrausch A. A quantitative model of the "effective" signal processing in the auditory system: I. Model structure. J. Acoust. Soc. Amer. 99:1996;3615-3622.
- (1996) J. Acoust. Soc. Amer. , vol.99 , pp. 3615-3622
- Dau, T.¹ Püschel, D.² Kohlrausch, A.³

6
- 0030691985
- Modeling auditory processing of amplitude modulation: I. Modulation detection and masking with narrowband carriers
- Dau T., Kollmeier B., Kohlrausch A. Modeling auditory processing of amplitude modulation: I. Modulation detection and masking with narrowband carriers. J. Acoust. Soc. Amer. 102(2):1997;2892-2905.
- (1997) J. Acoust. Soc. Amer. , vol.102 , Issue.2 , pp. 2892-2905
- Dau, T.¹ Kollmeier, B.² Kohlrausch, A.³

7
- 0032577379
- Optimizing sound features for cortical neurons
- deCharms R.C., Blake D.T., Merzenich M.M. Optimizing sound features for cortical neurons. Science. 280:1998;1439-1443.
- (1998) Science , vol.280 , pp. 1439-1443
- DeCharms, R.C.¹ Blake, D.T.² Merzenich, M.M.³

8
- 0002768123
- Assessing local noise level estimation methods
- Tampere, Finland
- Dupont S., Ris C. Assessing local noise level estimation methods. Proc. Workshop on Robust Methods for Speech Recognition in Adverse Environments, Tampere, Finland. 1999;115-118.
- (1999) Proc. Workshop on Robust Methods for Speech Recognition in Adverse Environments , pp. 115-118
- Dupont, S.¹ Ris, C.²

9
- 0026368274
- Fast algorithms to find invariant features for a word recognizing neural net
- Bournemouth
- Gramß T. Fast algorithms to find invariant features for a word recognizing neural net. IEEE 2nd Internat. Conf. on Artificial Neural Networks, Bournemouth. 1991;180-184.
- (1991) IEEE 2nd Internat. Conf. on Artificial Neural Networks , pp. 180-184
- Gramß, T.¹

10
- 0025383284
- Recognition of isolated words based on psychoacoustics and neurobiology
- Gramß T., Strube H.W. Recognition of isolated words based on psychoacoustics and neurobiology. Speech Communication. 9:1990;35-40.
- (1990) Speech Communication , vol.9 , pp. 35-40
- Gramß, T.¹ Strube, H.W.²

11
- 0028543366
- Training feedforward networks with the Marquardt algorithm
- Hagan M.T., Menhaj M. Training feedforward networks with the Marquardt algorithm. IEEE Trans. Neural Networks. 5(6):1994;989-993.
- (1994) IEEE Trans. Neural Networks , vol.5 , Issue.6 , pp. 989-993
- Hagan, M.T.¹ Menhaj, M.²

12
- 0033729018
- Objective modeling of speech quality with a psychoacoustically validated auditory model
- Hansen M., Kollmeier B. Objective modeling of speech quality with a psychoacoustically validated auditory model. J. Audio Eng. Soc. 48(5):2000;395-409.
- (2000) J. Audio Eng. Soc. , vol.48 , Issue.5 , pp. 395-409
- Hansen, M.¹ Kollmeier, B.²

13
- 0028517164
- RASTA processing of speech
- Hermansky H., Morgan N. RASTA processing of speech. IEEE Trans. Speech Audio Process. 2(4):1994;578-589.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

14
- 85009254284
- TRAPS - Classifiers of temporal patterns
- Hermansky H., Sharma S. TRAPS - Classifiers of temporal patterns. Proc. ICSLP'98. Vol. 3:1998;1003-1006.
- (1998) Proc. ICSLP'98 , vol.3 , pp. 1003-1006
- Hermansky, H.¹ Sharma, S.²

15
- 0004055099
- Technical Report TR-93-012, International Computer Science Institute, Berkeley, California, USA
- Hirsch, H.G., 1993. Estimation of noise spectrum and its applications to SNR-estimation and speech enhancement. Technical Report TR-93-012, International Computer Science Institute, Berkeley, California, USA.
- (1993) Estimation of noise spectrum and its applications to SNR-estimation and speech enhancement
- Hirsch, H.G.¹

16
- 0028996871
- Noise estimation techniques for robust speech recognition
- IEEE
- Hirsch H.G., Ehrlicher C. Noise estimation techniques for robust speech recognition. Proc. Internat. Conf. on Acoust., Speech and Signal Process. (ICASSP). 1995;153-156 IEEE.
- (1995) Proc. Internat. Conf. on Acoust., Speech and Signal Process. (ICASSP) , pp. 153-156
- Hirsch, H.G.¹ Ehrlicher, C.²

17
- 0011729005
- Frequency analysis and synthesis using a gammatone filterbank
- May/June
- Hohmann, V., 2002. Frequency analysis and synthesis using a gammatone filterbank. Acta Acustica united with Acustica, no. 3, May/June, pp. 433-442.
- (2002) Acta Acustica united with Acustica , Issue.3 , pp. 433-442
- Hohmann, V.¹

18
- 0002715745
- Early auditory feature coding
- BIS, Universität Oldenburg
- Kaernbach, C., 2000. Early auditory feature coding. In: Contributions to psychological acoustics: Results of the 8th Oldenburg Symposium on Psychological Acoustics. BIS, Universität Oldenburg, pp. 295-307.
- (2000) Contributions to psychological acoustics: Results of the 8th Oldenburg Symposium on Psychological Acoustics , pp. 295-307
- Kaernbach, C.¹

19
- 0032136330
- Robust speech recognition using the modulation spectrogram
- Kingsbury B., Morgan N., Greenberg S. Robust speech recognition using the modulation spectrogram. Speech Communication. 25(1):1998;117-132.
- (1998) Speech Communication , vol.25 , Issue.1 , pp. 117-132
- Kingsbury, B.¹ Morgan, N.² Greenberg, S.³

20
- 0011791859
- Perzeptive Vorverarbeitung und automatische Selektion sekundärer Merkmale zur robusten Spracherkennung
- Oldenburg: DEGA
- Kleinschmidt M., Hohmann V. Perzeptive Vorverarbeitung und automatische Selektion sekundärer Merkmale zur robusten Spracherkennung. Fortschritte der Akustik - DAGA 2000. 2000;382-383 DEGA, Oldenburg.
- (2000) Fortschritte der Akustik - DAGA 2000 , pp. 382-383
- Kleinschmidt, M.¹ Hohmann, V.²

21
- 0034824912
- Combining speech enhancement and auditory feature extraction for robust speech recognition
- special issue on Robust ASR
- Kleinschmidt M., Tchorz J., Kollmeier B. Combining speech enhancement and auditory feature extraction for robust speech recognition. Speech Communication. 34:2001;75-91. (special issue on Robust ASR).
- (2001) Speech Communication , vol.34 , pp. 75-91
- Kleinschmidt, M.¹ Tchorz, J.² Kollmeier, B.³

22
- 0004086101
- Tech. rep., Verbmobil-Technischer Report
- Kohler, K., Lex, G., Pätzold, M., Scheffers, M., Simpson, A., Thon, W., 1994. Handbuch zur Datenaufnahme und Transliteration in TP14 von VERBMOBIL-3.0. Tech. rep., Verbmobil-Technischer Report.
- (1994) Handbuch zur Datenaufnahme und Transliteration in TP14 von VERBMOBIL-3.0
- Kohler, K.¹ Lex, G.² Pätzold, M.³ Scheffers, M.⁴ Simpson, A.⁵ Thon, W.⁶

23
- 0028297185
- Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction
- Kollmeier B., Koch R. Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction. J. Acoust. Soc. Amer. 95(3):1994;1593-1602.
- (1994) J. Acoust. Soc. Amer. , vol.95 , Issue.3 , pp. 1593-1602
- Kollmeier, B.¹ Koch, R.²

24
- 85135379452
- An efficient algorithm to estimate the instantaneous SNR of speech signals
- ESCA
- Martin R. An efficient algorithm to estimate the instantaneous SNR of speech signals. Proc. Eurospeech. 1993;1093-1096 ESCA.
- (1993) Proc. Eurospeech , pp. 1093-1096
- Martin, R.¹

25
- 0020816083
- Suggested formulae for calculating auditory-filter bandwidths and excitation patterns
- Moore B.C.J., Glasberg B.R. Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. J. Acoust. Soc. Amer. 74:1983;750-753.
- (1983) J. Acoust. Soc. Amer. , vol.74 , pp. 750-753
- Moore, B.C.J.¹ Glasberg, B.R.²

26
- 0003306591
- An efficient auditory filterbank based on the gammatone function
- Patterson, R.D., Nimmo-Smith, J., Holdsworth, J., Rice, P., 1987. An efficient auditory filterbank based on the gammatone function. Paper presented at a meeting of the IOC Speech Group on Auditory Modeling at RSRE.
- (1987) Meeting of the IOC Speech Group on Auditory Modeling at RSRE
- Patterson, R.D.¹ Nimmo-Smith, J.² Holdsworth, J.³ Rice, P.⁴

27
- 0003748962
- Doctoral thesis, Universität Göttingen
- Püschel, D., 1988. Prinzipien der zeitlichen Analyse beim Hören. Doctoral thesis, Universität Göttingen.
- (1988) Prinzipien der zeitlichen Analyse beim Hören
- Püschel, D.¹

28
- 0034832359
- Assessing local noise level estimation methods: Application to noise robust ASR
- Ris C., Dupont S. Assessing local noise level estimation methods: application to noise robust ASR. Speech Communication. 34:2001;141-158.
- (2001) Speech Communication , vol.34 , pp. 141-158
- Ris, C.¹ Dupont, S.²

29
- 0032828464
- A model of the auditory perception as front end for automatic speech recognition
- Tchorz J., Kollmeier B. A model of the auditory perception as front end for automatic speech recognition. J. Acoust. Soc. Amer. 106(4):1999a;2040-2050.
- (1999) J. Acoust. Soc. Amer. , vol.106 , Issue.4 , pp. 2040-2050
- Tchorz, J.¹ Kollmeier, B.²

30
- 0011765108
- Speech detection and SNR prediction basing on amplitude modulation pattern recognition
- Budapest, Hungary: ISCA
- Tchorz J., Kollmeier B. Speech detection and SNR prediction basing on amplitude modulation pattern recognition. Proc. Eurospeech. 1999b;2399-2404 ISCA, Budapest, Hungary.
- (1999) Proc. Eurospeech , pp. 2399-2404
- Tchorz, J.¹ Kollmeier, B.²

31
- 0036722886
- Estimation of the signal-to-noise ratio with amplitude modulation spectrograms
- Tchorz, J., Kollmeier, B., 2002. Estimation of the signal-to-noise ratio with amplitude modulation spectrograms. Speech Communication 38, 1-17.
- (2002) Speech Communication , vol.38 , pp. 1-17
- Tchorz, J.¹ Kollmeier, B.²

32
- 84898984996
- Noise suppression based on neurophysiologically-motivated SNR estimation for robust speech recognition
- Leen, T.K., Dietterich, T.G., Tresp, V. MIT Press
- Tchorz J., Kleinschmidt M., Kollmeier B. Noise suppression based on neurophysiologically-motivated SNR estimation for robust speech recognition. Leen T.K., Dietterich T.G., Tresp V. Advances in Neural Information Processing Systems 13 - NIPS 2000. 2001;821-827 MIT Press.
- (2001) Advances in Neural Information Processing Systems 13 - NIPS 2000 , pp. 821-827
- Tchorz, J.¹ Kleinschmidt, M.² Kollmeier, B.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.