메뉴 건너뛰기




Volumn , Issue , 2014, Pages 1749-1753

Medium-duration modulation cepstral feature for robust speech recognition

Author keywords

large vocabulary continuous speech recognition; modulation features; noise robust speech recognition

Indexed keywords

CONTINUOUS SPEECH RECOGNITION; MODULATION; SPEECH; SPEECH TRANSMISSION; VOCABULARY CONTROL;

EID: 84905269267     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6853898     Document Type: Conference Paper
Times cited : (35)

References (24)
  • 1
    • 33745225159 scopus 로고    scopus 로고
    • Auditory Teager energy cepstrum coefficients for robust speech recognition
    • D. Dimitriadis, P. Maragos, and A. Potamianos, "Auditory Teager energy cepstrum coefficients for robust speech recognition", in Proc. of Interspeech, pp. 3013-3016, 2005.
    • (2005) Proc. of Interspeech , pp. 3013-3016
    • Dimitriadis, D.1    Maragos, P.2    Potamianos, A.3
  • 2
    • 0033097443 scopus 로고    scopus 로고
    • Single channel speech enhancement based on masking properties of the human auditory system
    • N. Virag, "Single channel speech enhancement based on masking properties of the human auditory system", IEEE Trans. Speech Audio Process., 7(2), pp. 126-137, 1999.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.2 , pp. 126-137
    • Virag, N.1
  • 3
    • 56249136428 scopus 로고    scopus 로고
    • Transforming binary uncertainties for robust speech recognition
    • S. Srinivasan and D. L. Wang, "Transforming binary uncertainties for robust speech recognition", IEEE Trans Audio, Speech, Lang. Process., 15(7), pp. 2130-2140, 2007.
    • (2007) IEEE Trans Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 2130-2140
    • Srinivasan, S.1    Wang, D.L.2
  • 5
    • 78049398950 scopus 로고    scopus 로고
    • Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring
    • C. Kim and R. M. Stern, "Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring", in Proc. ICASSP, pp. 4574-4577, 2010.
    • (2010) Proc. ICASSP , pp. 4574-4577
    • Kim, C.1    Stern, R.M.2
  • 6
    • 84867613224 scopus 로고    scopus 로고
    • Fepstrum features: Design and application to conversational speech recognition
    • V. Tyagi, "Fepstrum features: Design and application to conversational speech recognition", IBM Research Report, 11009, 2011.
    • (2011) IBM Research Report , pp. 11009
    • Tyagi, V.1
  • 7
    • 84867589420 scopus 로고    scopus 로고
    • Normalized amplitude modulation features for large vocabulary noise-robust speech recognition
    • Japan
    • V. Mitra, H. Franco, M. Graciarena and A. Mandal, "Normalized amplitude modulation features for large vocabulary noise-robust speech recognition", in Proc. of ICASSP, pp. 4117-4120, Japan, 2012.
    • (2012) Proc. of ICASSP , pp. 4117-4120
    • Mitra, V.1    Franco, H.2    Graciarena, M.3    Mandal, A.4
  • 8
    • 0028287770 scopus 로고
    • Effect of reducing slow temporal modulations on speech reception
    • R. Drullman, J. M. Festen, and R. Plomp, "Effect of reducing slow temporal modulations on speech reception", J. Acoust. Soc. of Am., 95(5), pp. 2670-2680, 1994.
    • (1994) J. Acoust. Soc. of Am. , vol.95 , Issue.5 , pp. 2670-2680
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 9
    • 0034844903 scopus 로고    scopus 로고
    • On the upper cutoff frequency of auditory criticalband envelope detectors in the context of speech perception
    • O. Ghitza, "On the upper cutoff frequency of auditory criticalband envelope detectors in the context of speech perception", J. Acoust. Soc. of Am., 110(3), pp. 1628-1640, 2001.
    • (2001) J. Acoust. Soc. of Am. , vol.110 , Issue.3 , pp. 1628-1640
    • Ghitza, O.1
  • 10
    • 33745225159 scopus 로고    scopus 로고
    • Auditory Teager energy cepstrum coefficients for robust speech recognition
    • D. Dimitriadis, P. Maragos, and A. Potamianos, "Auditory Teager energy cepstrum coefficients for robust speech recognition", in Proc of Interspeech, pp. 3013-3016, 2005.
    • (2005) Proc of Interspeech , pp. 3013-3016
    • Dimitriadis, D.1    Maragos, P.2    Potamianos, A.3
  • 11
    • 0035278964 scopus 로고    scopus 로고
    • Time-frequency distributions for automatic speech recognition
    • A. Potamianos and P. Maragos, "Time-frequency distributions for automatic speech recognition", IEEE Trans. Speech & Audio Proc., 9(3), pp. 196-200, 2001.
    • (2001) IEEE Trans. Speech & Audio Proc. , vol.9 , Issue.3 , pp. 196-200
    • Potamianos, A.1    Maragos, P.2
  • 12
    • 0033328948 scopus 로고    scopus 로고
    • Teager energy based feature parameters for speech recognition in car noise
    • F. Jabloun, A. E. Cetin, and E. Erzin, "Teager energy based feature parameters for speech recognition in car noise", IEEE Sig. Proc. Letters, 6(10), pp. 259-261, 1999.
    • (1999) IEEE Sig. Proc. Letters , vol.6 , Issue.10 , pp. 259-261
    • Jabloun, F.1    Cetin, A.E.2    Erzin, E.3
  • 13
    • 0032030556 scopus 로고    scopus 로고
    • A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment
    • J.H.L. Hansen, L. Gavidia-Ceballos, and J.F. Kaiser, "A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment", IEEE Trans. Biomedical Engineering, 45(3), pp. 300-313, 1998.
    • (1998) IEEE Trans. Biomedical Engineering , vol.45 , Issue.3 , pp. 300-313
    • Hansen, J.H.L.1    Gavidia-Ceballos, L.2    Kaiser, J.F.3
  • 14
    • 0025110885 scopus 로고
    • Derivation of auditory filter shapes from notched-noise data
    • B.R. Glasberg and B.C.J. Moore, "Derivation of auditory filter shapes from notched-noise data", Hearing Research, 47, pp.103-138, 1990.
    • (1990) Hearing Research , vol.47 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 15
    • 0019075685 scopus 로고
    • Some observations on oral air flow during phonation
    • H. Teager, "Some observations on oral air flow during phonation", IEEE Trans. ASSP, pp. 599-601, 1980.
    • (1980) IEEE Trans. ASSP , pp. 599-601
    • Teager, H.1
  • 16
    • 0027210171 scopus 로고
    • Some useful properties of the Teager's energy operator
    • J.F. Kaiser, "Some useful properties of the Teager's energy operator", in Proc of IEEE, Iss. III, pp. 149-152, 1993.
    • (1993) Proc of IEEE, Iss. , vol.3 , pp. 149-152
    • Kaiser, J.F.1
  • 17
    • 0027676955 scopus 로고
    • Energy separation in signal modulations with application to speech analysis
    • P. Maragos, J. Kaiser, and T. Quatieri, "Energy separation in signal modulations with application to speech analysis", IEEE Trans. Signal Processing, 41, pp. 3024-3051, 1993.
    • (1993) IEEE Trans. Signal Processing , vol.41 , pp. 3024-3051
    • Maragos, P.1    Kaiser, J.2    Quatieri, T.3
  • 18
    • 0032030556 scopus 로고    scopus 로고
    • A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment
    • J.H.L. Hansen, L. Gavidia-Ceballos, and J.F. Kaiser, "A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment", IEEE Trans. Biomedical Engineering, 45(3), pp. 300-313, 1998.
    • (1998) IEEE Trans. Biomedical Engineering , vol.45 , Issue.3 , pp. 300-313
    • Hansen, J.H.L.1    Gavidia-Ceballos, L.2    Kaiser, J.F.3
  • 19
    • 33646677283 scopus 로고    scopus 로고
    • Experimental framework for the performance evaluation of speech recognition front-ends on a large vocabulary task
    • June 4
    • G. Hirsch, "Experimental framework for the performance evaluation of speech recognition front-ends on a large vocabulary task", ETSI STQ-Aurora DSR Working Group, June 4, 2001.
    • (2001) ETSI STQ-Aurora DSR Working Group
    • Hirsch, G.1
  • 20
    • 84873310339 scopus 로고    scopus 로고
    • The RATS radio traffic collection system
    • Odyssey
    • K.Walker and S. Strassel, "The RATS radio traffic collection system," in Proc. of ISCA, Odyssey, 2012.
    • (2012) Proc. of ISCA
    • Walker, K.1    Strassel, S.2
  • 21
    • 0025110885 scopus 로고
    • Derivation of auditory filter shapes from notched-noise data
    • B.R. Glasberg and B.C.J. Moore, "Derivation of auditory filter shapes from notched-noise data", Hearing Research, 47, pp.103-138, 1990.
    • (1990) Hearing Research , vol.47 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 22


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.