메뉴 건너뛰기




Volumn , Issue , 2013, Pages 7097-7101

A robust frontend for ASR: Combining denoising, noise masking and feature normalization

Author keywords

noise robust feature extraction; speech enhancement; speech recognition

Indexed keywords

AUTOMATIC SPEECH RECOGNITION SYSTEM; BACKGROUND NOISE; COMPUTATIONAL COSTS; DENOISING METHODS; FEATURE NORMALIZATION; NOISE COMPENSATION; NOISE ROBUST; STATE OF THE ART;

EID: 84890541926     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6639039     Document Type: Conference Paper
Times cited : (3)

References (30)
  • 1
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr
    • S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 27, no. 2, pp. 113-120, Apr. 1979
    • (1979) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.27 , Issue.2 , pp. 113-120
    • Boll, S.1
  • 5
    • 0027622158 scopus 로고
    • Root cepstral analysis: A unified view. Application to speech processing in car noise environments
    • July
    • P. Alexandre and P. Lockwood, "Root cepstral analysis: A unified view. Application to speech processing in car noise environments," Speech Communication, vol. 12, no. 3, pp. 277-288, July 1993
    • (1993) Speech Communication , vol.12 , Issue.3 , pp. 277-288
    • Alexandre, P.1    Lockwood, P.2
  • 7
    • 70450205161 scopus 로고    scopus 로고
    • Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction
    • Sept
    • C. Kim and R. M. Stern, "Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction," in Proc. Interspeech, Sept. 2009
    • (2009) Proc. Interspeech
    • Kim, C.1    Stern, R.M.2
  • 8
    • 0025681008 scopus 로고
    • Hidden Markov model decomposition of speech and noise
    • NM, U.S.A., Apr
    • A.P. Varga and R.K. Moore, "Hidden Markov model decomposition of speech and noise," Albuquerque, NM, U.S.A., Apr. 1990, pp. 845-848
    • (1990) Albuquerque , pp. 845-848
    • Varga, A.P.1    Moore, R.K.2
  • 10
    • 0036291376 scopus 로고    scopus 로고
    • Uncertainty decoding with splice for noise robust speech recognition
    • Orlando, Florida, U.S.A., May
    • J. Droppo, A. Acero, and L. Deng, "Uncertainty decoding with splice for noise robust speech recognition," in Proc. ICASSP, Orlando, Florida, U.S.A., May 2002, pp. 57-60
    • (2002) Proc. ICASSP , pp. 57-60
    • Droppo, J.1    Acero, A.2    Deng, L.3
  • 11
  • 12
    • 84890520795 scopus 로고    scopus 로고
    • Power-normalized coefficients (pncc) for robust speech recognition
    • C. Kim and R. M. R. M. Stern, "Power-normalized coefficients (pncc) for robust speech recognition," in Proc. ICASSP, 2012
    • (2012) Proc. ICASSP
    • Kim, C.1    Stern, R.M.R.M.2
  • 13
    • 84890447859 scopus 로고    scopus 로고
    • Spectro-temporal gabor features as a front end for ASR
    • Kleinschmidt M., "Spectro-temporal gabor features as a front end for ASR," in Proc. Forum Acusticum Sevilla, 2002
    • (2002) Proc. Forum Acusticum Sevilla
    • Kleinschmidt, M.1
  • 14
    • 34547499683 scopus 로고    scopus 로고
    • Incorporating auditory feature uncertainties in robust speaker identification
    • Y. Shao, S. Srinivasan, and D.L. Wang, "Incorporating auditory feature uncertainties in robust speaker identification," in Proc. ICASSP, 2002, pp. 277-280
    • (2002) Proc. ICASSP , pp. 277-280
    • Shao, Y.1    Srinivasan, S.2    Wang, D.L.3
  • 20
    • 4544315110 scopus 로고    scopus 로고
    • Robust speech recognition using cepstral domain missing data techniques and noisy masks
    • Canada, May
    • H. Van hamme, "Robust speech recognition using cepstral domain missing data techniques and noisy masks," in Proc. ICASSP, Montreal, Canada, May 2004, pp. 213-216
    • (2004) Proc. ICASSP, Montreal , pp. 213-216
    • Van Hamme, H.1
  • 21
    • 0035396555 scopus 로고    scopus 로고
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • July
    • R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," in IEEE Transactions on Speech and Audio Processing, July 2001, vol. 9, pp. 504-512
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , pp. 504-512
    • Martin, R.1
  • 22
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognitions in continuously spoken sentences
    • Aug
    • S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognitions in continuously spoken sentences," IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 28, no. 4, pp. 357-366, Aug. 1980
    • (1980) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 23
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • Apr
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," Journal of the Acoustical Society of America, vol. 87, no. 4, pp. 1738-1752, Apr. 1990
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 24
    • 84865769808 scopus 로고    scopus 로고
    • Comparing different flavors of spectro-temporal features for ASR
    • B. Meyer, S. Ravuri, M.R. Schadler, and N. Morgan, "Comparing different flavors of spectro-temporal features for ASR," in Proc. Interspeech, 2011, pp. 1269-1272
    • (2011) Proc. Interspeech , pp. 1269-1272
    • Meyer, B.1    Ravuri, S.2    Schadler, M.R.3    Morgan, N.4
  • 25
    • 0032050110 scopus 로고    scopus 로고
    • Maximum-likelihood linear transforms for HMM-based speech recognition
    • M. J. F. Gales, "Maximum-likelihood linear transforms for HMM-based speech recognition," Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 26
    • 84865769808 scopus 로고    scopus 로고
    • Comparing different flavors of spectro-temporal features for asr
    • B. T. Meyer, S. V. Ravuri, M. R. Schadler, and N. Morgan, "Comparing different flavors of spectro-temporal features for asr," in Proc. Interspeech, 2011, pp. 1269-1272
    • (2011) Proc. Interspeech , pp. 1269-1272
    • Meyer, B.T.1    Ravuri, S.V.2    Schadler, M.R.3    Morgan, N.4
  • 27
    • 84878395103 scopus 로고    scopus 로고
    • Longer features: They do a speech detector good
    • T.J. Tsai and N. Morgan, "Longer features: They do a speech detector good," in Proc. Interspeech, 2012
    • (2012) Proc. Interspeech
    • Tsai, T.J.1    Morgan, N.2
  • 29
    • 85009088984 scopus 로고    scopus 로고
    • Robust digit recognition in noisy environments: The ibm aurora-2 system
    • G. Saon, J. M. Huerta, and E.E. Jan, "Robust digit recognition in noisy environments: The ibm aurora-2 system," in Proc. Interspeech, 2001, pp. 629-632
    • (2001) Proc. Interspeech , pp. 629-632
    • Saon, G.1    Huerta, J.M.2    Jan, E.E.3
  • 30
    • 0030369274 scopus 로고    scopus 로고
    • Inclusion of temporal information into features for speech recognition
    • B. Milner, "Inclusion of temporal information into features for speech recognition," in Proc. ICSLP, 1996, pp. 256-259.
    • (1996) Proc. ICSLP , pp. 256-259
    • Milner, B.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.