메뉴 건너뛰기




Volumn 48, Issue 11, 2006, Pages 1502-1514

Model-based feature enhancement with uncertainty decoding for noise robust ASR

Author keywords

Additive noise; Convolutional noise; Model based feature enhancement; Noise robust speech recognition; Uncertainty decoding

Indexed keywords

ACOUSTIC NOISE; DECODING; FEATURE EXTRACTION; LEARNING SYSTEMS; MAXIMUM LIKELIHOOD ESTIMATION; PROBABILITY; PROBABILITY DENSITY FUNCTION; ROBUSTNESS (CONTROL SYSTEMS);

EID: 33750376174     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2005.12.006     Document Type: Article
Times cited : (39)

References (28)
  • 1
    • 85009067687 scopus 로고    scopus 로고
    • Arrowood, J., Clements, M., 2002. Using observation uncertainty in HMM decoding. In: Proc. ICSLP, Denver, Colorado, pp. 1561-1564.
  • 2
    • 84977901887 scopus 로고    scopus 로고
    • Attias, H., Deng, L., Acero, A., Platt, J., 2001. A new method for speech denoising and robust speech recognition using probabilistic models for clean speech and for noise. In: Proc. EUROSPEECH, Aalborg, Denmark, pp. 1903-1906.
  • 3
    • 85009154399 scopus 로고    scopus 로고
    • Benitez, M., Segura, J., dela Torre, A., Ramirez, J., Rubio, A., 2004. Including uncertainty of speech observations in robust speech recognition. In: Proc. ICSLP, Jeju Island, Korea, pp. 137-140.
  • 4
    • 4544310318 scopus 로고    scopus 로고
    • Bernard, A., Gong, Y., Cui, X., 2004. Can back-ends be more robust than font-ends? Investigation over the Aurora-2 database. In: Proc. ICASSP, Montreal, Canada, pp. 1025-1028.
  • 5
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • Cooke M., Green P., Josifovski L., and Vizinho A. Robust automatic speech recognition with missing and unreliable acoustic data. Speech Comm. 34 3 (2001) 267-285
    • (2001) Speech Comm. , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 6
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM-algorithm
    • Dempster A.P., Laird N.M., and Rubin D.B. Maximum likelihood from incomplete data via the EM-algorithm. J. Roy. Statist. Soc. B 39 (1977) 1-38
    • (1977) J. Roy. Statist. Soc. B , vol.39 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 7
    • 0033889293 scopus 로고    scopus 로고
    • An efficient search space representation for large vocabulary continuous speech recognition
    • Demuynck K., Duchateau J., Van Compernolle D., and Wambacq P. An efficient search space representation for large vocabulary continuous speech recognition. Speech Comm. 30 1 (2000) 37-53
    • (2000) Speech Comm. , vol.30 , Issue.1 , pp. 37-53
    • Demuynck, K.1    Duchateau, J.2    Van Compernolle, D.3    Wambacq, P.4
  • 8
    • 85009275141 scopus 로고    scopus 로고
    • Deng, L., Droppo, J., Acero, A., 2002. Exploiting variances in robust feature extraction based on a parametric model of speech distortion. In: Proc. ICSLP, Denver, Colorado, pp. 2449-2452.
  • 9
    • 85006734596 scopus 로고    scopus 로고
    • Droppo, J., Deng, L., Acero, A., 2001. Evaluation of the SPLICE algorithm on the Aurora2 database. In: Proc. EUROSPEECH, Aalborg, Denmark, pp. 217-220.
  • 10
    • 0032045533 scopus 로고    scopus 로고
    • Fast and accurate acoustic modelling with semi-continuous HMMs
    • Duchateau J., Demuynck K., and Van Compernolle D. Fast and accurate acoustic modelling with semi-continuous HMMs. Speech Comm. 24 1 (1998) 5-17
    • (1998) Speech Comm. , vol.24 , Issue.1 , pp. 5-17
    • Duchateau, J.1    Demuynck, K.2    Van Compernolle, D.3
  • 11
    • 84893656625 scopus 로고    scopus 로고
    • Duchateau, J., Demuynck, K., Van Compernolle, D., Wambacq, P., 2001. Class definition in discriminant feature analysis. In: Proc. EUROSPEECH, Vol. III, Aalborg, Denmark, pp. 1621-1624.
  • 12
    • 0025587084 scopus 로고    scopus 로고
    • Ephraim, Y., 1990. A minimum mean square error approach for speech enhancement. In: Proc. ICASSP, New Mexico, USA, pp. 829-832.
  • 13
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    • Ephraim Y., and Malah D. Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator. IEEE Trans. ASSP 32 6 (1984) 1109-1121
    • (1984) IEEE Trans. ASSP , vol.32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 14
    • 33750356594 scopus 로고    scopus 로고
    • ETSI ES 202 050 v1.1.1, 2002. Speech processing, transmission and quality aspects (STQ); distributed speech recognition; advanced front-end feature extraction algorithm; compression algorithm.
  • 15
    • 33750354442 scopus 로고    scopus 로고
    • Gales, M., 1995. Model-based techniques for noise robust speech recognition. Ph.D. thesis, University of Cambridge.
  • 16
    • 0029288202 scopus 로고
    • Speech recognition in noisy environments: a survey
    • Gong Y. Speech recognition in noisy environments: a survey. Speech Comm. 16 3 (1995) 261-291
    • (1995) Speech Comm. , vol.16 , Issue.3 , pp. 261-291
    • Gong, Y.1
  • 17
    • 33750327909 scopus 로고    scopus 로고
    • Holmes, J., Holmes, W., Garner, P., 1997. Using formant frequencies in speech recognition. In: Proc. EUROSPEECH, Rhodes, Greece, pp. 2083-2086.
  • 18
    • 33750285584 scopus 로고    scopus 로고
    • HTK homepage: .
  • 19
    • 0036293930 scopus 로고    scopus 로고
    • Kristjansson, T., Frey, B., 2002. Accounting for uncertainty in observations: a new paradigm for robust automatic speech recognition. In: Proc. ICASSP, Orlando, Florida, pp. 61-64.
  • 20
    • 84962786176 scopus 로고    scopus 로고
    • Kristjansson, T., Frey, B., Deng, L., 2001. Joint estimation of noise and channel distortion in a generalized EM framework. In: Proc. ASRU, Madonna di Campiglio, Italy.
  • 21
    • 85009242725 scopus 로고    scopus 로고
    • Macho, D., Mauuary, L., Noé, B., Cheng, Y., Ealey, D., Jouvet, D., Kelleher, H., Pearce, D., Saadoun, F., 2002. Evaluation of a noise-robust DSR front-end on Aurora databases. In: Proc. ICSLP, Denver, Colorado, USA, pp. 17-20.
  • 22
    • 0032166087 scopus 로고    scopus 로고
    • HMM-based strategies for enhancement of speech signals embedded in non-stationary noise
    • Sameti H., Sheikhzadeh H., Deng L., and Brennan R. HMM-based strategies for enhancement of speech signals embedded in non-stationary noise. IEEE Trans. SAP 6 5 (1998) 445-455
    • (1998) IEEE Trans. SAP , vol.6 , Issue.5 , pp. 445-455
    • Sameti, H.1    Sheikhzadeh, H.2    Deng, L.3    Brennan, R.4
  • 23
    • 85009228863 scopus 로고    scopus 로고
    • Stouten, V., Van hamme, H., Demuynck, K., Wambacq, P., 2003. Robust speech recognition using model-based feature enhancement. In: Proc. EUROSPEECH, Geneva, Switzerland, pp. 17-20.
  • 24
    • 85009154856 scopus 로고    scopus 로고
    • Stouten, V., Van hamme, H., Wambacq, P., 2004a. Accounting for the uncertainty of speech estimates in the context of model-based feature enhancement. In: Proc. ICSLP, Vol. I, Jeju Island, Korea, pp. 105-108.
  • 25
    • 4544288024 scopus 로고    scopus 로고
    • Stouten, V., Van hamme, H., Wambacq, P., 2004b. Joint removal of additive and convolutional noise with model-based feature enhancement. In: Proc. ICASSP, Vol. I, Montreal, Canada, pp. 949-952.
  • 26
    • 0023206182 scopus 로고    scopus 로고
    • Van Compernolle, D., 1987. Increased noise immunity in large vocabulary speech recognition with the aid of spectral subtraction. In: Proc. International Conference on Acoustics, Speech and Signal Processing, Dallas, TX, USA, pp. 1143-1146.
  • 27
    • 0025681008 scopus 로고    scopus 로고
    • Varga, A., Moore, R., 1990. Hidden Markov model decomposition of speech and noise. In: Proc. ICASSP, Albuquerque, USA, pp. 845-848.
  • 28
    • 33750369318 scopus 로고    scopus 로고
    • Yamaguchi, Y., Takahashi, S., Sagayama, S., 1997. Fast adaptation of acoustic models to environmental noise using Jacobian adaptation algorithm. In: Proc. EUROSPEECH, Rhodes, Greece, pp. 2051-2054.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.