메뉴 건너뛰기




Volumn , Issue , 2012, Pages 4117-4120

Normalized amplitude modulation features for large vocabulary noise-robust speech recognition

Author keywords

Large Vocabulary Speech Recognition; Modulation Features; Noise Robust Speech Recognition

Indexed keywords

AUTOMATED SYSTEMS; AUTOMATIC SPEECH RECOGNITION SYSTEM; BACKGROUND NOISE; CEPSTRAL COEFFICIENTS; CHANNEL DEGRADATIONS; DIGIT RECOGNITION; ENERGY OPERATORS; FEATURE SETS; HUMAN AUDITORY SYSTEM; HUMAN SPEECH; LARGE VOCABULARY; LARGE VOCABULARY SPEECH RECOGNITION; NOISE ROBUST SPEECH RECOGNITION; NOISE ROBUSTNESS; SPEECH RECOGNITION SYSTEMS; WALL STREET JOURNAL;

EID: 84867589420     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2012.6288824     Document Type: Conference Paper
Times cited : (92)

References (19)
  • 1
    • 0033097443 scopus 로고    scopus 로고
    • Single channel speech enhancement based on masking properties of the human auditory system
    • N. Virag, "Single channel speech enhancement based on masking properties of the human auditory system", IEEE Trans. Speech Audio Process. vol.7, no.2, pp. 126-137, 1999.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.2 , pp. 126-137
    • Virag, N.1
  • 2
    • 56249136428 scopus 로고    scopus 로고
    • Transforming binary uncertainties for robust speech recognition
    • S. Srinivasan and D. L. Wang, "Transforming binary uncertainties for robust speech recognition", IEEE Trans Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 2130-2140, 2007.
    • (2007) IEEE Trans Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 2130-2140
    • Srinivasan, S.1    Wang, D.L.2
  • 4
    • 78049398950 scopus 로고    scopus 로고
    • Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring
    • C. Kim and R. M. Stern, "Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring", in Proc. ICASSP, pp. 4574-4577, 2010.
    • (2010) Proc. ICASSP , pp. 4574-4577
    • Kim, C.1    Stern, R.M.2
  • 5
    • 84867613224 scopus 로고    scopus 로고
    • Fepstrum features: Design and application to conversational speech recognition
    • 11009
    • V. Tyagi, Fepstrum features: Design and application to conversational speech recognition, IBM Research Report, 11009, 2011.
    • (2011) IBM Research Report
    • Tyagi, V.1
  • 6
    • 37649022051 scopus 로고    scopus 로고
    • A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition
    • U. H. Yapanel and J. H. L. Hansen, "A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition", Speech Comm., vol.50, iss. 2, pp. 142-152, 2008.
    • (2008) Speech Comm. , vol.50 , Issue.2 , pp. 142-152
    • Yapanel, U.H.1    Hansen, J.H.L.2
  • 7
    • 33745225159 scopus 로고    scopus 로고
    • Auditory Teager energy cepstrum coefficients for robust speech recognition
    • D. Dimitriadis, P. Maragos, and A. Potamianos, "Auditory Teager energy cepstrum coefficients for robust speech recognition", in Proc of Interspeech, pp. 3013-3016, 2005.
    • (2005) Proc of Interspeech , pp. 3013-3016
    • Dimitriadis, D.1    Maragos, P.2    Potamianos, A.3
  • 8
    • 0028287770 scopus 로고
    • Effect of reducing slow temporal modulations on speech reception
    • R. Drullman, J. M. Festen, and R. Plomp, "Effect of reducing slow temporal modulations on speech reception", J. Acoust. Soc. of Am., vol. 95, no. 5, pp. 2670-2680, 1994.
    • (1994) J. Acoust. Soc. of Am. , vol.95 , Issue.5 , pp. 2670-2680
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 9
    • 0034844903 scopus 로고    scopus 로고
    • On the upper cutoff frequency of auditory critical-band envelope detectors in the context of speech perception
    • O. Ghitza, "On the upper cutoff frequency of auditory critical-band envelope detectors in the context of speech perception", J. Acoust. Soc. of America, vol. 110, no. 3, pp. 1628-1640, 2001.
    • (2001) J. Acoust. Soc. of America , vol.110 , Issue.3 , pp. 1628-1640
    • Ghitza, O.1
  • 10
    • 0035278964 scopus 로고    scopus 로고
    • Time-frequency distributions for automatic speech recognition
    • A. Potamianos and P. Maragos, "Time-frequency distributions for automatic speech recognition", IEEE Trans. Speech & Audio Proc., vol. 9, no. 3, pp. 196-200, 2001.
    • (2001) IEEE Trans. Speech & Audio Proc. , vol.9 , Issue.3 , pp. 196-200
    • Potamianos, A.1    Maragos, P.2
  • 11
    • 0027676955 scopus 로고
    • Energy separation in signal modulations with application to speech analysis
    • P. Maragos, J. Kaiser, and T. Quatieri, "Energy separation in signal modulations with application to speech analysis", IEEE Trans. Signal Processing, vol.41, pp. 3024-3051, 1993.
    • (1993) IEEE Trans. Signal Processing , vol.41 , pp. 3024-3051
    • Maragos, P.1    Kaiser, J.2    Quatieri, T.3
  • 12
    • 0033328948 scopus 로고    scopus 로고
    • Teager energy based feature parameters for speech recognition in car noise
    • F. Jabloun, A. E. Cetin, and E. Erzin, "Teager energy based feature parameters for speech recognition in car noise", IEEE Sig. Proc. Letters, vol. 6, no. 10, pp. 259-261, 1999.
    • (1999) IEEE Sig. Proc. Letters , vol.6 , Issue.10 , pp. 259-261
    • Jabloun, F.1    Cetin, A.E.2    Erzin, E.3
  • 13
    • 0025110885 scopus 로고
    • Derivation of auditory filter shapes from notched-noise data
    • B.R. Glasberg and B.C.J. Moore, "Derivation of auditory filter shapes from notched-noise data", Hearing Research, vol. 47, pp.103-138, 1990.
    • (1990) Hearing Research , vol.47 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 14
    • 84867613230 scopus 로고    scopus 로고
    • http://labrosa.ee.columbia.edu/projects/renoiser/create-wsj.html
  • 15
    • 0019075685 scopus 로고
    • Some observations on oral air flow during phonation
    • H. Teager, "Some observations on oral air flow during phonation", IEEE Trans. ASSP, pp. 599-601, 1980.
    • (1980) IEEE Trans. ASSP , pp. 599-601
    • Teager, H.1
  • 16
    • 0032030556 scopus 로고    scopus 로고
    • A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment
    • J.H.L. Hansen, L. Gavidia-Ceballos, and J.F. Kaiser, "A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment", IEEE Trans. Biomedical Engineering, vol. 45, no. 3, pp. 300-313, 1998.
    • (1998) IEEE Trans. Biomedical Engineering , vol.45 , Issue.3 , pp. 300-313
    • Hansen, J.H.L.1    Gavidia-Ceballos, L.2    Kaiser, J.F.3
  • 17
    • 84987702417 scopus 로고    scopus 로고
    • The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • D. Pearce and H.G. Hirsch, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions", in Proc. ICSLP, Beijing, China, 2000.
    • Proc. ICSLP, Beijing, China, 2000
    • Pearce, D.1    Hirsch, H.G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.