Volume , Issue , 2011, Pages 5492-5495

Amplitude modulation spectrogram based features for robust speech recognition in noisy and reverberant environments

Author keywords

Amplitude Modulation Spectrogram (AMS); Automatic Speech Recognition (ASR); Feature Extraction; Phase; Reverberation

Indexed keywords

ACOUSTIC SPECTRA; AMPLITUDE FLUCTUATIONS; AUTOMATIC SPEECH RECOGNITION; BASIS FUNCTIONS; CEPSTRAL; DYNAMIC FEATURES; EARLY-TO-LATE; ENERGY RATIO; FEATURE EXTRACTION METHODS; MODULATION SPECTRUM; PHASE; PHASE INFORMATION; REVERBERANT ENVIRONMENT; ROBUST SPEECH RECOGNITION; SPECTROGRAMS; SUB-BANDS; TEMPORAL FILTERS;

EID: 80051627812     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2011.5947602     Document Type: Conference Paper
Times cited : (33)

References (14)
  • 1
    • 0027957839
    • Effect of temporal envelope smearing on speech reception
    • R. Drullman, J.M. Festen, and R. Plomp, "Effect of temporal envelope smearing on speech reception," J. Acoust. Soc. Am. 95, pp. 1053-1064, 1994a.
    • (1994) J. Acoust. Soc. Am. , vol.95 , pp. 1053-1064
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 2
    • 0028287770
    • Effect of reducing slow temporal modulations on speech reception
    • R. Drullman, J.M. Festen, and R. Plomp, "Effect of reducing slow temporal modulations on speech reception," J. Acoust. Soc. Am. 95, pp. 2670-2680, 1994b.
    • (1994) J. Acoust. Soc. Am. , vol.95 , pp. 2670-2680
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 3
    • 0030369532
    • Intelligibility of speech with filtered time trajectories of spectral envelopes
    • T. Arai, M. Pavel, H. Hermansky, and C. Avendano, "Intelligibility of speech with filtered time trajectories of spectral envelopes," Proc. ICSLP 96, 1996.
    • (1996) Proc. ICSLP 96
    • Arai, T.1    Pavel, M.2    Hermansky, H.3    Avendano, C.4
  • 4
    • 0032676337
    • On the relative importance of various components of the modulation spectrum for automatic speech recognition
    • N. Kanedera, T. Arai, H. Hermansky, and M. Pavel, "On the relative importance of various components of the modulation spectrum for automatic speech recognition," Speech Communication 28, pp. 43-55, 1999.
    • (1999) Speech Communication , vol.28 , pp. 43-55
    • Kanedera, N.1    Arai, T.2    Hermansky, H.3    Pavel, M.4
  • 5
    • 0022667694
    • Speaker-independent isolated word recognition using dynamic features of speech spectrum
    • S. Furui, "Speaker-independent isolated word recognition using dynamic features of speech spectrum," IEEE Trans. Acoust. Speech Signal Process. 34(1), pp. 52-59, 1986.
    • (1986) IEEE Trans. Acoust. Speech Signal Process. , vol.34 , Issue.1 , pp. 52-59
    • Furui, S.1
  • 7
    • 78049370506
    • Comparison of modulation features for phoneme recognition
    • S. Ganapathy, S. Thomas, and H. Hermansky, "Comparison of modulation features for phoneme recognition," Proc. ICASSP 2010, pp. 5038-5041, 2010.
    • (2010) Proc. ICASSP 2010 , pp. 5038-5041
    • Ganapathy, S.1    Thomas, S.2    Hermansky, H.3
  • 8
    • 70450179487
    • Spectral and temporal modulation features for phonetic recognition
    • S.A. Zahorian, H. Hu, Z. Chen, and J. Wu, "Spectral and temporal modulation features for phonetic recognition," Proc. Interspeech 2009, pp. 1071-1074, 2009.
    • (2009) Proc. Interspeech 2009 , pp. 1071-1074
    • Zahorian, S.A.1    Hu, H.2    Chen, Z.3    Wu, J.4
  • 9
    • 0028297185
    • Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction
    • B. Kollmeier, and R. Koch, "Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction," J. Acoust. Soc. Am. 95(3), pp. 1593-1602, 1994.
    • (1994) J. Acoust. Soc. Am. , vol.95 , Issue.3 , pp. 1593-1602
    • Kollmeier, B.1    Koch, R.2
  • 10
    • 0024241221
    • Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms
    • G. Langner, and C.E. Schreiner, "Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms," J. of Neurophysiology 60, pp. 1799-1822, 1988.
    • (1988) J. of Neurophysiology , vol.60 , pp. 1799-1822
    • Langner, G.1    Schreiner, C.E.2
  • 11
    • 0030691985
    • Modeling auditory processing of amplitude modulation. I. Detection and masking with narrowband carriers
    • T. Dau, and B. Kollmeier, "Modeling auditory processing of amplitude modulation. I. Detection and masking with narrowband carriers," J. Acoust. Soc. Am. 102(5), pp. 2892-2905, 1997.
    • (1997) J. Acoust. Soc. Am. , vol.102 , Issue.5 , pp. 2892-2905
    • Dau, T.1    Kollmeier, B.2
  • 12
    • 0002787767
    • The aurora experimental framework for the performance evaluations of speech recognition systems under noisy conditions
    • H.G. Hirsch, and D. Pearce, "The aurora experimental framework for the performance evaluations of speech recognition systems under noisy conditions," In: ISCA ITRW ASR, 2000.
    • (2000) ISCA ITRW ASR
    • Hirsch, H.G.1    Pearce, D.2
  • 13
    • 84856269531
    • On properties of modulation spectrum for robust automatic speech recognition
    • N. Kanedera, H. Hermansky, and T. Arai, "On properties of modulation spectrum for robust automatic speech recognition," Proc. ICASSP 1998, pp. 613-616, 1998.
    • (1998) Proc. ICASSP 1998 , pp. 613-616
    • Kanedera, N.1    Hermansky, H.2    Arai, T.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.