SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 19, Issue 1, 2011, Pages 123-137

Advances in missing feature techniques for robust large-vocabulary continuous speech recognition

(2) Van Segbroeck, Maarten a Van Hamme, Hugo a

a UNIVERSITY OF LEUVEN (Belgium)

Author keywords

Automatic speech recognition (ASR); channel compensation; missing data techniques; noise robustness

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; BINARY MASKS; CEPSTRAL DOMAIN; CHANNEL COMPENSATION; FEATURE DOMAIN; GAUSSIANS; HARD DECISIONS; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; LINEAR TRANSFORM; LOG-SPECTRAL DOMAIN; MISSING DATA TECHNIQUES; MISSING FEATURE THEORIES; NOISE ROBUSTNESS; NOISY DATA; RECOGNITION PERFORMANCE; RECOGNITION PROCESS; SOFT DECISION; STATIC AND DYNAMIC; STRUCTURED COVARIANCE;

ACOUSTIC NOISE; CONTINUOUS SPEECH RECOGNITION; CONVOLUTION; COVARIANCE MATRIX; MAXIMUM LIKELIHOOD ESTIMATION; STRAIN MEASUREMENT;

FEATURE EXTRACTION;

EID: 77957739976 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2010.2045235 Document Type: Article

Times cited : (28)

References (42)

1
- 0012404388
- Noise reduction in speech applications
- ser, Boca Raton, FL: CRC
- G. Davis, Noise Reduction in Speech Applications, ser. The Electrical Engineering and Applied Signal Processing Series. Boca Raton, FL: CRC, 2002.
- (2002) The Electrical Engineering and Applied Signal Processing Series
- Davis, G.¹

2
- 33947611996
- Ph. D. dissertation, Univ. of Sheffield, Sheffield, U. K.
- L. Josifovski, "Robust automatic speech recognition with missing and unreliable data", Ph. D. dissertation, Univ. of Sheffield, Sheffield, U. K., 2002.
- (2002) Robust Automatic Speech Recognition with Missing and Unreliable Data
- Josifovski, L.¹

3
- 85032752225
- Robust automatic speech recognition with missing and unreliable acoustic data
- B. Raj and R. Stern, "Robust automatic speech recognition with missing and unreliable acoustic data", Signal Process. Mag., vol. 22, no. 2, pp. 101-116, 2005.
- (2005) Signal Process. Mag. , vol.22 , Issue.2 , pp. 101-116
- Raj, B.¹ Stern, R.²

4
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Apr
- H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech", J. Acoust. Soc. Amer., vol. 87, no. 4, pp. 1738-1752, Apr. 1990.
- (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

5
- 85135377175
- Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)
- Genua, Italy, Sep
- H. Hermansky, N. Morgan, A. Bayya, and P. Kohn, "Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)", in Proc. Eurospeech, Genua, Italy, Sep. 1991, pp. 1367-1370.
- (1991) Proc. Eurospeech , pp. 1367-1370
- Hermansky, H.¹ Morgan, N.² Bayya, A.³ Kohn, P.⁴

6
- 0027622158
- Root cepstral analysis: A unified view. Application to speech processing in car noise environments
- Jul
- P. Alexandre and P. Lockwood, "Root cepstral analysis: A unified view. Application to speech processing in car noise environments", Speech Commun., vol. 12, no. 3, pp. 277-288, Jul. 1993.
- (1993) Speech Commun. , vol.12 , Issue.3 , pp. 277-288
- Alexandre, P.¹ Lockwood, P.²

7
- 0032136330
- Robust speech recognition using the modulation spectrogram
- Aug
- B. Kingsbury, N. Morgan, and S. Greenberg, "Robust speech recognition using the modulation spectrogram", Speech Commun., vol. 25, pp. 117-132, Aug. 1998.
- (1998) Speech Commun. , vol.25 , pp. 117-132
- Kingsbury, B.¹ Morgan, N.² Greenberg, S.³

8
- 0005451715
- Modelling the recognition of spectrally reduced speech
- J. Barker and M. Cooke, "Modelling the recognition of spectrally reduced speech", in Proc. Eurospeech, 1997, pp. 2127-2130.
- (1997) Proc. Eurospeech , pp. 2127-2130
- Barker, J.¹ Cooke, M.²

9
- 0019053271
- Comparison of parametric representations for monosyllabic word recognitions in continuously spoken sentences
- Aug
- S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognitions in continuously spoken sentences", IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis, S.¹ Mermelstein, P.²

10
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- S. Boll, "Suppression of acoustic noise in speech using spectral subtraction", IEEE Trans. Acoust., Speech, Signal Process., vol. 27, no. 2, pp. 113-120, Apr. 1979. (Pubitemid 9467471)
- (1979) IEEE Trans Acoust Speech Signal Process , vol.ASSP-27 , Issue.2 , pp. 113-120
- Boll Steven, F.¹

11
- 85135369853
- Noise-adaptive hidden Markov models based on Wiener filters
- S. V. Vaseghi and B. P. Milner, "Noise-adaptive hidden Markov models based on Wiener filters", in Proc. Eurospeech, 1993, pp. 1023-1026.
- (1993) Proc. Eurospeech , pp. 1023-1026
- Vaseghi, S.V.¹ Milner, B.P.²

12
- 0021645331
- Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
- Dec
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator", IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
- (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

13
- 50449097354
- Ph. D. dissertation, K. U. Leuven, Leuven, Belgium, Sep
- V. Stouten, "Robust Automatic Speech Recognition In Time-Varying Environments", Ph. D. dissertation, K. U. Leuven, Leuven, Belgium, Sep. 2006.
- (2006) Robust Automatic Speech Recognition in Time-Varying Environments
- Stouten, V.¹

14
- 0029345417
- A signal subspace approach for speech enhancement
- Jul
- Y. Ephraim and H. Van Trees, "A signal subspace approach for speech enhancement", IEEE Trans. Speech Audio Process., vol. 3, no. 4, pp. 251-266, Jul. 1995.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.4 , pp. 251-266
- Ephraim, Y.¹ Van Trees, H.²

15
- 33846186879
- A review ofsignal subspace speech enhancement and its application to noise robust speech recognition
- K. Hermus, P. Wambacq, and H. Van Hamme, "A review ofsignal subspace speech enhancement and its application to noise robust speech recognition", EURASIP J. Appl. Signal Process. Special Iss. Adv. in Subspace-Based Tech. for Signal Process. Commun., vol. 2007, no. 1, pp. 195-204, 2007.
- (2007) EURASIP J. Appl. Signal Process. Special Iss. Adv. in Subspace-Based Tech. for Signal Process. Commun. , vol.2007 , Issue.1 , pp. 195-204
- Hermus, K.¹ Wambacq, P.² Van Hamme, H.³

16
- 0029725301
- A vector Taylor series approach for environment-independent speech recognition
- Atlanta, GA, May
- P. Moreno, B. Raj, and R. Stern, "A vector Taylor series approach for environment-independent speech recognition", in Proc. ICASSP, Atlanta, GA, May 1996, pp. 733-736.
- (1996) Proc. ICASSP , pp. 733-736
- Moreno, P.¹ Raj, B.² Stern, R.³

17
- 0025681008
- Hidden Markov model decomposition of speech and noise
- Albuquerque, NM, Apr
- A. Varga and R. Moore, "Hidden Markov model decomposition of speech and noise", in Proc. ICASSP, Albuquerque, NM, Apr. 1990, pp. 845-848.
- (1990) Proc. ICASSP , pp. 845-848
- Varga, A.¹ Moore, R.²

18
- 0003671941
- Ph. D. dissertation, Univ. of Cambridge, Cambridge, U. K., Sep
- M. Gales, "Model-based techniques for noise robust speech recognition", Ph. D. dissertation, Univ. of Cambridge, Cambridge, U. K., Sep. 1995.
- (1995) Model-based Techniques for Noise Robust Speech Recognition
- Gales, M.¹

19
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- Apr
- C. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models", Comput. Speech Lang., vol. 9, no. 2, pp. 171-185, Apr. 1995.
- (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
- Leggetter, C.¹ Woodland, P.²

20
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr
- J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains", IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

21
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data", Speech Commun., vol. 34, pp. 267-285, 2001.
- (2001) Speech Commun. , vol.34 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

22
- 11144316019
- Decoding speech in the presence of other sources
- J. Barker, M. Cooke, and D. Ellis, "Decoding speech in the presence of other sources", Speech Commun., vol. 45, no. 1, pp. 5-25, 2005.
- (2005) Speech Commun. , vol.45 , Issue.1 , pp. 5-25
- Barker, J.¹ Cooke, M.² Ellis, D.³

23
- 0037841203
- State based imputation of missing data for robust speech recognition and speech enhancement
- Budapest, Hungary
- L. Josifovski, M. Cooke, P. Green, and A. Vizinho, "State based imputation of missing data for robust speech recognition and speech enhancement", in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 2837-2840.
- (1999) Proc. Eurospeech , pp. 2837-2840
- Josifovski, L.¹ Cooke, M.² Green, P.³ Vizinho, A.⁴

24
- 4644336054
- Reconstruction of missing features for robust speech recognition
- B. Raj, M. L. Seltzer, and R. Stern, "Reconstruction of missing features for robust speech recognition", Speech Commun., vol. 43, no. 4, pp. 275-296, 2004.
- (2004) Speech Commun. , vol.43 , Issue.4 , pp. 275-296
- Raj, B.¹ Seltzer, M.L.² Stern, R.³

25
- 85009212472
- Robust speech recognition using missing feature theory in the cepstral or LDA domain
- Geneva, Switzerland, Sep
- H. Van Hamme, "Robust speech recognition using missing feature theory in the cepstral or LDA domain", in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 3089-3092.
- (2003) Proc. Eurospeech , pp. 3089-3092
- Van Hamme, H.¹

26
- 85009128803
- PROSPECT features and their application to missing data techniques for robust speech recognition
- Jeju Island, Korea
- H. Van Hamme, "PROSPECT features and their application to missing data techniques for robust speech recognition", in Proc. Interspeech, Jeju Island, Korea, 2004, pp. 101-104.
- (2004) Proc. Interspeech , pp. 101-104
- Van Hamme, H.¹

27
- 0000540156
- Soft decisions in missing data techniques for robust automatic speech recognition
- Beijing, China, Sep
- J. Barker, L. Josifovski, M. Cooke, and P. Green, "Soft decisions in missing data techniques for robust automatic speech recognition", in Proc. Interspeech, Beijing, China, Sep. 2000, pp. 373-376.
- (2000) Proc. Interspeech , pp. 373-376
- Barker, J.¹ Josifovski, L.² Cooke, M.³ Green, P.⁴

28
- 18744390181
- From missing data to maybe useful data: Soft data modelling for noise robust ASR
- Stratford-upon-Avon, U. K., Apr
- A. Morris, J. Barker, and H. Bourlard, "From missing data to maybe useful data: Soft data modelling for noise robust ASR", in Proc. WISP-01, Stratford-upon-Avon, U. K., Apr. 2001, pp. 153-164.
- (2001) Proc. WISP-01 , pp. 153-164
- Morris, A.¹ Barker, J.² Bourlard, H.³

29
- 70349226857
- Bounded conditional mean imputation with Gaussian mixture models: A reconstruction approach to partly occluded features
- Taipei, Taiwan, Sep
- F. Faubel, J. McDonough, and D. Klakow, "Bounded conditional mean imputation with Gaussian mixture models: A reconstruction approach to partly occluded features", in Proc. ICASSP, Taipei, Taiwan, Sep. 2009, pp. 3869-3872.
- (2009) Proc. ICASSP , pp. 3869-3872
- Faubel, F.¹ McDonough, J.² Klakow, D.³

30
- 51449106172
- Robust speech recognition using missing data techniques in the prospect domain and fuzzy masks
- Las Vegas, NV, Apr
- M. Van Segbroeck and H. Van Hamme, "Robust speech recognition using missing data techniques in the prospect domain and fuzzy masks", in Proc. ICASSP, Las Vegas, NV, Apr. 2008, pp. 4393-4396.
- (2008) Proc. ICASSP , pp. 4393-4396
- Van Segbroeck, M.¹ Van Hamme, H.²

31
- 44949096514
- Handling convolutional noise in missing data automatic speech recognition
- Pittsburgh, PA, Sep
- M. Van Segbroeck and H. Van Hamme, "Handling convolutional noise in missing data automatic speech recognition", in Proc. Interspeech, Pittsburgh, PA, Sep. 2006, pp. 2526-2565.
- (2006) Proc. Interspeech , pp. 2526-2565
- Van Segbroeck, M.¹ Van Hamme, H.²

32
- 13344250769
- Missing feature theory and probabilistic estimation of clean speech components for robust speech recognition
- Budapest, Hungary
- P. Reneveyand A. Drygajlo, "Missing feature theory and probabilistic estimation of clean speech components for robust speech recognition", in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 2627-2630.
- (1999) Proc. Eurospeech , pp. 2627-2630
- Reneveyand, P.¹ Drygajlo, A.²

33
- 4644317224
- A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
- M. L. Seltzer, B. Raj, and R. Stern, "A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition", Speech Commun., vol. 43, no. 4, pp. 379-393, 2004.
- (2004) Speech Commun. , vol.43 , Issue.4 , pp. 379-393
- Seltzer, M.L.¹ Raj, B.² Stern, R.³

34
- 33947622695
- Handling time-derivative features in a missing data framework for robust automatic speech recognition
- Toulouse, France, May
- H. Van Hamme, "Handling time-derivative features in a missing data framework for robust automatic speech recognition", in Proc. ICASSP, Toulouse, France, May 2006, pp. 293-296.
- (2006) Proc. ICASSP , pp. 293-296
- Van Hamme, H.¹

35
- 70450167189
- Vector-quantization based mask estimation for missing data automatic speech recognition
- Antwerp, Belgium, Aug
- M. Van Segbroeck and H. Van Hamme, "Vector-Quantization based mask estimation for missing data automatic speech recognition", in Proc. Interspeech, Antwerp, Belgium, Aug. 2007, pp. 910-913.
- (2007) Proc. Interspeech , pp. 910-913
- Van Segbroeck, M.¹ Van Hamme, H.²

36
- 85009074922
- Harmonic tunneling: Tracking nonstationary noises during speech
- Aalborg, Denmark, Sep
- D. Ealey, H. Kelleher, and D. Pearce, "Harmonic tunneling: Tracking nonstationary noises during speech", in Proc. Eurospeech, Aalborg, Denmark, Sep. 1999, pp. 437-410.
- (1999) Proc. Eurospeech , pp. 437-410
- Ealey, D.¹ Kelleher, H.² Pearce, D.³

37
- 33847629729
- On noise masking for automatic missing data speech recognition: Asurveyand discussion
- Jul
- C. Cerisara, S. Demange, and J.-P. Haton, "On noise masking for automatic missing data speech recognition: Asurveyand discussion", Computer, Speech, Lang., vol. 21, no. 3, pp. 443-457, Jul. 2007.
- (2007) Computer, Speech, Lang. , vol.21 , Issue.3 , pp. 443-457
- Cerisara, C.¹ Demange, S.² Haton, J.-P.³

38
- 2942539074
- Techniques for handling convolutional distortion with 'missing data' automatic speech recognition
- K. Palomäki, G. Brown, and J. Barker, "Techniques for handling convolutional distortion with 'missing data' automatic speech recognition", Speech Commun., vol. 43, no. 1-2, pp. 123-142, 2004.
- (2004) Speech Commun. , vol.43 , Issue.1-2 , pp. 123-142
- Palomäki, K.¹ Brown, G.² Barker, J.³

39
- 0003822743
- Entropic
- S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book-Ver. 2.2. 1999, Entropic.
- (1999) The HTK Book-Ver. 2.2
- Young, S.¹ Kershaw, D.² Odell, J.³ Ollason, D.⁴ Valtchev, V.⁵ Woodland, P.⁶

40
- 85009227702
- Analysis of the aurora large vocabulary evaluations
- Geneva, Switzerland, Sep
- N. Parihar and J. Picone, "Analysis of the aurora large vocabulary evaluations", in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003, pp. 337-340.
- (2003) Proc. Eurospeech , pp. 337-340
- Parihar, N.¹ Picone, J.²

41
- 77957726993
- Group Online. Available
- "ESAT-PSI Speech", Group [Online]. Available: http://www.esat. kuleuven. be/psi/spraak
- ESAT-PSI Speech

42
- 77957744561
- Online. Available
- "SPRAAK: Speech processing, recognition and automatic annotation kit", [Online]. Available: http://www.spraak.org
- SPRAAK: Speech Processing, Recognition and Automatic Annotation Kit

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.