SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2013, Pages 7097-7101

A robust frontend for ASR: Combining denoising, noise masking and feature normalization

(2) Van Segbroeck, Maarten a Narayanan, Shrikanth S a

a UNIVERSITY OF SOUTHERN CALIFORNIA (United States)

Author keywords

noise robust feature extraction; speech enhancement; speech recognition

Indexed keywords

AUTOMATIC SPEECH RECOGNITION SYSTEM; BACKGROUND NOISE; COMPUTATIONAL COSTS; DENOISING METHODS; FEATURE NORMALIZATION; NOISE COMPENSATION; NOISE ROBUST; STATE OF THE ART;

FEATURE EXTRACTION; SIGNAL PROCESSING; SPEECH ENHANCEMENT; SPEECH RECOGNITION;

ACOUSTIC NOISE;

EID: 84890541926 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2013.6639039 Document Type: Conference Paper

Times cited : (3)

References (30)

1
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Apr
- S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 27, no. 2, pp. 113-120, Apr. 1979
- (1979) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.27 , Issue.2 , pp. 113-120
- Boll, S.¹

2
- 85135369853
- S. V. Vaseghi and B. P. Milner, "Noise-adaptive hidden markov models based on wiener filters," 1993, pp. 1023-1026
- (1993) Noise-adaptive Hidden Markov Models Based on Wiener Filters , pp. 1023-1026
- Vaseghi, S.V.¹ Milner, B.P.²

3
- 0021645331
- Dec
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," vol. 32, no. 6, pp. 1109-1121, Dec. 1984
- (1984) Speech Enhancement Using A Minimum Mean-square Error Short-time Spectral Amplitude Estimator , vol.32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

4
- 85135377175
- Genua, Italy, Sept
- H. Hermansky, N. Morgan, A. Bayya, and Ph. Kohn, "Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)," Genua, Italy, Sept. 1991, pp. 1367-1370
- (1991) Compensation for the Effect of the Communication Channel in Auditory-like Analysis of Speech (RASTA-PLP) , pp. 1367-1370
- Hermansky, H.¹ Morgan, N.² Bayya, A.³ Kohn, Ph.⁴

5
- 0027622158
- Root cepstral analysis: A unified view. Application to speech processing in car noise environments
- July
- P. Alexandre and P. Lockwood, "Root cepstral analysis: A unified view. Application to speech processing in car noise environments," Speech Communication, vol. 12, no. 3, pp. 277-288, July 1993
- (1993) Speech Communication , vol.12 , Issue.3 , pp. 277-288
- Alexandre, P.¹ Lockwood, P.²

6
- 0030711174
- B. Kingsbury and S. Greenberg, "The modulation spectrogram: in pursuit of an invariant representation of speech," 1997, pp. 1647-1650
- (1997) The Modulation Spectrogram: In Pursuit of An Invariant Representation of Speech , pp. 1647-1650
- Kingsbury, B.¹ Greenberg, S.²

7
- 70450205161
- Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction
- Sept
- C. Kim and R. M. Stern, "Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction," in Proc. Interspeech, Sept. 2009
- (2009) Proc. Interspeech
- Kim, C.¹ Stern, R.M.²

8
- 0025681008
- Hidden Markov model decomposition of speech and noise
- NM, U.S.A., Apr
- A.P. Varga and R.K. Moore, "Hidden Markov model decomposition of speech and noise," Albuquerque, NM, U.S.A., Apr. 1990, pp. 845-848
- (1990) Albuquerque , pp. 845-848
- Varga, A.P.¹ Moore, R.K.²

9
- 0003671941
- Ph.D. thesis, University of Cambridge, UK, Sept
- M.F.J. Gales, Model-Based Techniques for Noise Robust Speech Recognition, Ph.D. thesis, University of Cambridge, UK, Sept. 1995
- (1995) Model-Based Techniques for Noise Robust Speech Recognition
- Gales, M.F.J.¹

10
- 0036291376
- Uncertainty decoding with splice for noise robust speech recognition
- Orlando, Florida, U.S.A., May
- J. Droppo, A. Acero, and L. Deng, "Uncertainty decoding with splice for noise robust speech recognition," in Proc. ICASSP, Orlando, Florida, U.S.A., May 2002, pp. 57-60
- (2002) Proc. ICASSP , pp. 57-60
- Droppo, J.¹ Acero, A.² Deng, L.³

11
- 34547548235
- Probabilistic and bottle-neck features for lvcsr of meetings
- F. Grezl, M. Karafiat, S. Kontar, and J. Cernocky, "Probabilistic and bottle-neck features for lvcsr of meetings," in Proc. ICASSP, 2007
- (2007) Proc. ICASSP
- Grezl, F.¹ Karafiat, M.² Kontar, S.³ Cernocky, J.⁴

12
- 84890520795
- Power-normalized coefficients (pncc) for robust speech recognition
- C. Kim and R. M. R. M. Stern, "Power-normalized coefficients (pncc) for robust speech recognition," in Proc. ICASSP, 2012
- (2012) Proc. ICASSP
- Kim, C.¹ Stern, R.M.R.M.²

13
- 84890447859
- Spectro-temporal gabor features as a front end for ASR
- Kleinschmidt M., "Spectro-temporal gabor features as a front end for ASR," in Proc. Forum Acusticum Sevilla, 2002
- (2002) Proc. Forum Acusticum Sevilla
- Kleinschmidt, M.¹

14
- 34547499683
- Incorporating auditory feature uncertainties in robust speaker identification
- Y. Shao, S. Srinivasan, and D.L. Wang, "Incorporating auditory feature uncertainties in robust speaker identification," in Proc. ICASSP, 2002, pp. 277-280
- (2002) Proc. ICASSP , pp. 277-280
- Shao, Y.¹ Srinivasan, S.² Wang, D.L.³

15
- 79959822392
- Feature versus model based noise robustness
- K. Demuynck, X. Zhang, and H. Van Compernolle, Van hamme, "Feature versus model based noise robustness," in Proc. Interspeech, 2010
- (2010) Proc. Interspeech
- Demuynck, K.¹ Zhang, X.² Van Compernolle, H.³ Van Hamme⁴

16
- 50449097354
- Ph.D. thesis, K.U.Leuven, ESAT, Sept
- V. Stouten, Robust automatic speech recognition in timevarying environments, Ph.D. thesis, K.U.Leuven, ESAT, Sept. 2006
- (2006) Robust Automatic Speech Recognition in Timevarying Environments
- Stouten, V.¹

17
- 78049530924
- Ph.D. thesis, K.U.Leuven, ESAT, Jan
- M. Van Segbroeck, Robust Large Vocabulary Continuous Speech Recognition using Missing Data Techniques, Ph.D. thesis, K.U.Leuven, ESAT, Jan. 2010
- (2010) Robust Large Vocabulary Continuous Speech Recognition Using Missing Data Techniques
- Van Segbroeck, M.¹

18
- 0003571972
- Entropic
- S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK book-version2.2, Entropic, 1999
- (1999) The HTK book-version2.2
- Young, S.¹ Kershaw, D.² Odell, J.³ Ollason, D.⁴ Valtchev, V.⁵ Woodland, P.⁶

19
- 79951796005
- The ibm attila speech recognition toolkit
- H. Soltau, G. Saon, and B. Kingsbury, "The ibm attila speech recognition toolkit," in Spoken Language Technology Workshop (SLT), 2010
- (2010) Spoken Language Technology Workshop (SLT)
- Soltau, H.¹ Saon, G.² Kingsbury, B.³

20
- 4544315110
- Robust speech recognition using cepstral domain missing data techniques and noisy masks
- Canada, May
- H. Van hamme, "Robust speech recognition using cepstral domain missing data techniques and noisy masks," in Proc. ICASSP, Montreal, Canada, May 2004, pp. 213-216
- (2004) Proc. ICASSP, Montreal , pp. 213-216
- Van Hamme, H.¹

21
- 0035396555
- Noise power spectral density estimation based on optimal smoothing and minimum statistics
- July
- R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," in IEEE Transactions on Speech and Audio Processing, July 2001, vol. 9, pp. 504-512
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , pp. 504-512
- Martin, R.¹

22
- 0019053271
- Comparison of parametric representations for monosyllabic word recognitions in continuously spoken sentences
- Aug
- S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognitions in continuously spoken sentences," IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 28, no. 4, pp. 357-366, Aug. 1980
- (1980) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.28 , Issue.4 , pp. 357-366
- Davis, S.¹ Mermelstein, P.²

23
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Apr
- H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," Journal of the Acoustical Society of America, vol. 87, no. 4, pp. 1738-1752, Apr. 1990
- (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

24
- 84865769808
- Comparing different flavors of spectro-temporal features for ASR
- B. Meyer, S. Ravuri, M.R. Schadler, and N. Morgan, "Comparing different flavors of spectro-temporal features for ASR," in Proc. Interspeech, 2011, pp. 1269-1272
- (2011) Proc. Interspeech , pp. 1269-1272
- Meyer, B.¹ Ravuri, S.² Schadler, M.R.³ Morgan, N.⁴

25
- 0032050110
- Maximum-likelihood linear transforms for HMM-based speech recognition
- M. J. F. Gales, "Maximum-likelihood linear transforms for HMM-based speech recognition," Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998
- (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.F.¹

26
- 84865769808
- Comparing different flavors of spectro-temporal features for asr
- B. T. Meyer, S. V. Ravuri, M. R. Schadler, and N. Morgan, "Comparing different flavors of spectro-temporal features for asr," in Proc. Interspeech, 2011, pp. 1269-1272
- (2011) Proc. Interspeech , pp. 1269-1272
- Meyer, B.T.¹ Ravuri, S.V.² Schadler, M.R.³ Morgan, N.⁴

27
- 84878395103
- Longer features: They do a speech detector good
- T.J. Tsai and N. Morgan, "Longer features: They do a speech detector good," in Proc. Interspeech, 2012
- (2012) Proc. Interspeech
- Tsai, T.J.¹ Morgan, N.²

28
- 80051648777
- Tech. Rep. version 1.1, Cambridge University Engineering Department
- H.G. Hirsch and D. Pearce, "Applying the advanced etsi frontend to the aurora-2 task," Tech. Rep. version 1.1, Cambridge University Engineering Department, 2006
- (2006) Applying the Advanced Etsi Frontend to the aurora-2 Task
- Hirsch, H.G.¹ Pearce, D.²

29
- 85009088984
- Robust digit recognition in noisy environments: The ibm aurora-2 system
- G. Saon, J. M. Huerta, and E.E. Jan, "Robust digit recognition in noisy environments: The ibm aurora-2 system," in Proc. Interspeech, 2001, pp. 629-632
- (2001) Proc. Interspeech , pp. 629-632
- Saon, G.¹ Huerta, J.M.² Jan, E.E.³

30
- 0030369274
- Inclusion of temporal information into features for speech recognition
- B. Milner, "Inclusion of temporal information into features for speech recognition," in Proc. ICSLP, 1996, pp. 256-259.
- (1996) Proc. ICSLP , pp. 256-259
- Milner, B.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.