SCOPUS 정보 검색 플랫폼

Volumn 51, Issue 1, 2009, Pages 58-75

Perceptual features for automatic speech recognition in noisy environments

(3) Haque, Serajul a Togneri, Roberto a Zaknich, Anthony a

a UNIVERSITY OF WESTERN AUSTRALIA (Australia)

Author keywords

Auditory system; Automatic speech recognition; Hidden Markov model; Perceptual features; Synaptic adaptation; Two tone suppression

Indexed keywords

BANDPASS FILTERS; CROSSINGS (PIPE AND CABLE); DEGRADATION; GAUSSIAN NOISE (ELECTRONIC); HIDDEN MARKOV MODELS; HIGH PASS FILTERS; IIR FILTERS; IMPULSE RESPONSE; MARKOV PROCESSES; POLARIZATION; SENSOR NETWORKS; SPEECH; SPEECH ANALYSIS; TRELLIS CODES; WAVE FILTERS; WHITE NOISE;

AUDITORY SYSTEM; AUTOMATIC SPEECH RECOGNITION; PERCEPTUAL FEATURES; SYNAPTIC ADAPTATION; TWO-TONE SUPPRESSION;

SPEECH RECOGNITION;

EID: 55049112969 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2008.06.002 Document Type: Article

Times cited : (32)

References (38)

1
- 0032716023
- Abdelatty, A.M., Spiegel, J.V., Mueller, P., Haentjens, G., Berman, J., 1999. An acoustic-phonetic feature-based system for automatic phoneme recognition in continuous speech. In: Proc. IEEE Intertnat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 1999).
- Abdelatty, A.M., Spiegel, J.V., Mueller, P., Haentjens, G., Berman, J., 1999. An acoustic-phonetic feature-based system for automatic phoneme recognition in continuous speech. In: Proc. IEEE Intertnat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 1999).

2
- 55049087283
- Bloomberg, M., Carlson, R., Elenius, K., Granstrom, B., 1984. Auditory models and isolated word recognition. Q Prog. Statist. Rep., Speech Transmiss. Lab., Royal Institute of Technology, Stockholm, pp. 1-15.
- Bloomberg, M., Carlson, R., Elenius, K., Granstrom, B., 1984. Auditory models and isolated word recognition. Q Prog. Statist. Rep., Speech Transmiss. Lab., Royal Institute of Technology, Stockholm, pp. 1-15.

3
- 0024392496
- Application of an auditory model to speech recognition
- Cohen J.R. Application of an auditory model to speech recognition. J. Acoust. Soc. Amer. 85 6 (1989) 2623-2629
- (1989) J. Acoust. Soc. Amer. , vol.85 , Issue.6 , pp. 2623-2629
- Cohen, J.R.¹

4
- 0004399776
- An audio noise reduction system
- Dolby R. An audio noise reduction system. J. Audio Eng. Soc. 15 4 (1967)
- (1967) J. Audio Eng. Soc. , vol.15 , Issue.4
- Dolby, R.¹

5
- 0035300277
- Syllabic-companding log domain filters
- Frey D., Tsividis Y., Efthivoulidis G., and Krishnapura N. Syllabic-companding log domain filters. IEEE Trans. Circ. Systems II 48 4 (2001) 329-339
- (2001) IEEE Trans. Circ. Systems II , vol.48 , Issue.4 , pp. 329-339
- Frey, D.¹ Tsividis, Y.² Efthivoulidis, G.³ Krishnapura, N.⁴

6
- 0141703354
- Gajić, B., Paliwal, K.K., 2003. Robust speech recognition using features based on zero-crossings with peak amplitudes. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 2003).
- Gajić, B., Paliwal, K.K., 2003. Robust speech recognition using features based on zero-crossings with peak amplitudes. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 2003).

7
- 0024909979
- Gillick, L., Cox, S.J., 1989. Some statistical issues in the comparison of speech recognition algorithms. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 1989).
- Gillick, L., Cox, S.J., 1989. Some statistical issues in the comparison of speech recognition algorithms. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 1989).

8
- 0028312802
- Auditory models and human performance in tasks related to speech coding and speech recognition
- Ghitza O. Auditory models and human performance in tasks related to speech coding and speech recognition. IEEE Trans. Speech Audio Process. 2 1 (1988) 115-132
- (1988) IEEE Trans. Speech Audio Process. , vol.2 , Issue.1 , pp. 115-132
- Ghitza, O.¹

9
- 33646758174
- Ghulam, M., Fukuda, T., Horikawa, J., Nitta, T., 2005. Pitch-synchronous ZCPA (PS-ZCPA)-based feature extraction with auditory masking. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 2005).
- Ghulam, M., Fukuda, T., Horikawa, J., Nitta, T., 2005. Pitch-synchronous ZCPA (PS-ZCPA)-based feature extraction with auditory masking. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 2005).

10
- 28844443495
- Grayden, D.B., Burkitt, A.N., Kenny, O.P., Clarey, J.N., Paolini, A.G., Clark, G.M., 2004. A cochlear implant speech processing strategy based on an auditory model. In: Internat. Conf. on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP 2004), Melbourne, pp. 491-496.
- Grayden, D.B., Burkitt, A.N., Kenny, O.P., Clarey, J.N., Paolini, A.G., Clark, G.M., 2004. A cochlear implant speech processing strategy based on an auditory model. In: Internat. Conf. on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP 2004), Melbourne, pp. 491-496.

11
- 34547543044
- Haque, S., Togneri, R., Zaknich, A., 2007. A temporal auditory model with adaptation for automatic speech recognition. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 2007).
- Haque, S., Togneri, R., Zaknich, A., 2007. A temporal auditory model with adaptation for automatic speech recognition. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 2007).

12
- 0003837292
- John Wiley and Sons, Inc., New Jersey
- Hayes M.H. Statistical Digital Signal Processing and Modeling (1996), John Wiley and Sons, Inc., New Jersey 72-73
- (1996) Statistical Digital Signal Processing and Modeling , pp. 72-73
- Hayes, M.H.¹

13
- 0028517164
- RASTA processing of speech
- Hermansky H., and Morgan N. RASTA processing of speech. IEEE Trans. Speech Audio Process. 2 (1994) 587-589
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 587-589
- Hermansky, H.¹ Morgan, N.²

14
- 55049104587
- Hermansky, H., 1994. Speech beyond 10 ms (temporal filtering in feature domain). In: Proc. Internat. Workshop on Human Interface Technology, Aizu, Japan.
- Hermansky, H., 1994. Speech beyond 10 ms (temporal filtering in feature domain). In: Proc. Internat. Workshop on Human Interface Technology, Aizu, Japan.

15
- 33744994972
- Automatic speech recognition with an adaptation model motivated by auditory processing
- Holmberg M., Gelbart D., and Hemmert W. Automatic speech recognition with an adaptation model motivated by auditory processing. IEEE Trans. Audio, Speech, Lang. Process. 14 1 (2006) 44-49
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.1 , pp. 44-49
- Holmberg, M.¹ Gelbart, D.² Hemmert, W.³

16
- 0029345416
- A comparison of signal processing front ends for automatic word recognition
- Jankowski Jr. C.R., Vo H.H., and Lippman R.P. A comparison of signal processing front ends for automatic word recognition. IEEE Trans. Speech Audio Process. 3 (1995) 286-293
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , pp. 286-293
- Jankowski Jr., C.R.¹ Vo, H.H.² Lippman, R.P.³

17
- 0029378047
- Two-tone suppression in a cochlear model
- Kates J.M. Two-tone suppression in a cochlear model. IEEE Trans. Speech Audio Process. 3 5 (1995) 396-406
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 396-406
- Kates, J.M.¹

18
- 0022806994
- Spectral analysis and discrimination by zero-crossings
- Kedem B. Spectral analysis and discrimination by zero-crossings. Proc. IEEE 74 11 (1986) 1477-1493
- (1986) Proc. IEEE , vol.74 , Issue.11 , pp. 1477-1493
- Kedem, B.¹

19
- 0032785783
- Auditory processing of speech signals for robust speech recognition in real world noisy environments
- Kim D.S., Lee S.Y., and Kil R.M. Auditory processing of speech signals for robust speech recognition in real world noisy environments. IEEE Trans. Speech and Audio Process. 7 1 (1999) 55-69
- (1999) IEEE Trans. Speech and Audio Process. , vol.7 , Issue.1 , pp. 55-69
- Kim, D.S.¹ Lee, S.Y.² Kil, R.M.³

20
- 0003398180
- Academic Press, New York
- Koopmans L.H. The Spectral Analysis of Time Series (1974), Academic Press, New York
- (1974) The Spectral Analysis of Time Series
- Koopmans, L.H.¹

21
- 0030701415
- Loughlin, P., Groutage, D., Rohrbaugh, R., 1997. Time-frequency analysis of acoustic transients. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 1997).
- Loughlin, P., Groutage, D., Rohrbaugh, R., 1997. Time-frequency analysis of acoustic transients. In: Proc. IEEE Internat. Conf. on Acoust. Speech, and Signal Processing (ICASSP 1997).

22
- 0024048578
- An analog electronic cochlea
- Lyon R.F., and Mead C. An analog electronic cochlea. IEEE Trans. Acoust. Speech Signal Process. 36 7 (1988) 1119-1134
- (1988) IEEE Trans. Acoust. Speech Signal Process. , vol.36 , Issue.7 , pp. 1119-1134
- Lyon, R.F.¹ Mead, C.²

23
- 0022624057
- Simulation of mechanical to neural transduction in the auditory receptor
- Meddis R. Simulation of mechanical to neural transduction in the auditory receptor. J. Acoust. Soc. Amer. 79 3 (1988) 702-711
- (1988) J. Acoust. Soc. Amer. , vol.79 , Issue.3 , pp. 702-711
- Meddis, R.¹

24
- 0020816083
- Suggested formulae for calculating auditory-filter bandwidths and excitation patterns
- Moore B.C.J., and Glasberg B.R. Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. J. Acoust. Soc. Amer. 74 3 (1983) 750-753
- (1983) J. Acoust. Soc. Amer. , vol.74 , Issue.3 , pp. 750-753
- Moore, B.C.J.¹ Glasberg, B.R.²

25
- 0035125936
- Forward masking: adaptation or integration?
- Oxenham A.J. Forward masking: adaptation or integration?. J. Acoust. Soc. Amer. 109 (2001) 732-741
- (2001) J. Acoust. Soc. Amer. , vol.109 , pp. 732-741
- Oxenham, A.J.¹

26
- 0030244273
- Time-frequency analysis and auditory modeling for automatic recognition of speech
- Pitton J.W., Wang K., and Juang B. Time-frequency analysis and auditory modeling for automatic recognition of speech. Proc. IEEE 84 9 (1996) 1199-1215
- (1996) Proc. IEEE , vol.84 , Issue.9 , pp. 1199-1215
- Pitton, J.W.¹ Wang, K.² Juang, B.³

27
- 55049120604
- Time-interval information in the auditory representation of speech sounds
- Patterson R.D. Time-interval information in the auditory representation of speech sounds. J. Acoust. Soc. Amer. 105 2 (1999) 1305
- (1999) J. Acoust. Soc. Amer. , vol.105 , Issue.2 , pp. 1305
- Patterson, R.D.¹

28
- 0018076531
- Some observations on cochlear mechanics
- Rhode W.S. Some observations on cochlear mechanics. J. Acoust. Soc. Amer. 64 1 (1978) 158-176
- (1978) J. Acoust. Soc. Amer. , vol.64 , Issue.1 , pp. 158-176
- Rhode, W.S.¹

29
- 0034710868
- Mechanical bases of frequency tuning and neural excitation at the base of the cochlea
- Ruggero M.A., Narayan S.S., Temchin A.N., and Recio A. Mechanical bases of frequency tuning and neural excitation at the base of the cochlea. Proc. Natl. Acad. Sci. USA 97 (2000) 11744-11750
- (2000) Proc. Natl. Acad. Sci. USA , vol.97 , pp. 11744-11750
- Ruggero, M.A.¹ Narayan, S.S.² Temchin, A.N.³ Recio, A.⁴

30
- 0020579183
- Auditory nerve representation of vowels in background noise
- Sachs M.B., Voigt H.F., and Young E.D. Auditory nerve representation of vowels in background noise. J. Neurophysiol. 50 1 (1983)
- (1983) J. Neurophysiol. , vol.50 , Issue.1
- Sachs, M.B.¹ Voigt, H.F.² Young, E.D.³

31
- 84928837806
- A joint synchrony/mean-rate model of auditory processing
- Seneff S. A joint synchrony/mean-rate model of auditory processing. J. Phonet. 85 1 (1988) 55-76
- (1988) J. Phonet. , vol.85 , Issue.1 , pp. 55-76
- Seneff, S.¹

32
- 0016439358
- Short-term adaptation and incremental responses of single auditory-nerve fibres
- Smith R., and Zwislocki J.J. Short-term adaptation and incremental responses of single auditory-nerve fibres. Biol. Cyber. 17 (1975) 169-182
- (1975) Biol. Cyber. , vol.17 , pp. 169-182
- Smith, R.¹ Zwislocki, J.J.²

33
- 55049129276
- Spoor, A., Eggermont, J.J., Odenthal, D.W., 1976. Comparison of human and animal data concerning adaptation and masking of eighth nerve compound action potential. In: Ruber, J., Elberling, C., Solomon, G. (Eds.), Electrocochleography, Baltimore, University Park, MD, pp. 183-198.
- Spoor, A., Eggermont, J.J., Odenthal, D.W., 1976. Comparison of human and animal data concerning adaptation and masking of eighth nerve compound action potential. In: Ruber, J., Elberling, C., Solomon, G. (Eds.), Electrocochleography, Baltimore, University Park, MD, pp. 183-198.

34
- 0031238095
- A model of dynamic auditory perception and its application to robust word recognition
- Strope B., and Alwan A. A model of dynamic auditory perception and its application to robust word recognition. IEEE Trans. Speech Audio Process. 95 5 (1997) 451-464
- (1997) IEEE Trans. Speech Audio Process. , vol.95 , Issue.5 , pp. 451-464
- Strope, B.¹ Alwan, A.²

35
- 85013709368
- Elsevier, San Diego, CA
- Theodoridis S., and Koutroumbas K. Pattern Recognition (1999), Elsevier, San Diego, CA
- (1999) Pattern Recognition
- Theodoridis, S.¹ Koutroumbas, K.²

36
- 0032828464
- A model of auditory perception as front-end for automatic speech recognition
- Tchorz J., and Kollmeier B. A model of auditory perception as front-end for automatic speech recognition. J. Acoust. Soc. Amer. 106 4 (1999)
- (1999) J. Acoust. Soc. Amer. , vol.106 , Issue.4
- Tchorz, J.¹ Kollmeier, B.²

37
- 14644431758
- A bio-inspired companding strategy for spectral enhancement
- Turicchia L., and Sarpeshkar R. A bio-inspired companding strategy for spectral enhancement. IEEE Trans. Speech Audio Process. 13 2 (2005) 243-253
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.2 , pp. 243-253
- Turicchia, L.¹ Sarpeshkar, R.²

38
- 0021207990
- Rapid and short-term adaptation in auditory nerve responses
- Westerman L., and Smith R.L. Rapid and short-term adaptation in auditory nerve responses. Hearing Res. 15 (1984) 249-260
- (1984) Hearing Res. , vol.15 , pp. 249-260
- Westerman, L.¹ Smith, R.L.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.