Volume 15, Issue 3, 2007, Pages 882-892

HMM-based gain modeling for enhancement of speech in noise

Author keywords

Gain modeling; Hidden Markov modeling (HMM); Noise suppression; Speech enhancement

Indexed keywords

ACCURATE MODELING; BAYESIAN; DATA-DRIVEN; ENERGY VARIATIONS; EXPECTATION-MAXIMIZATION ALGORITHMS; GAIN MODELING; GAIN MODELS; HIDDEN MARKOV MODELING (HMM); MODELING TECHNIQUES; NOISE GAINS; NOISE SUPPRESSION; NON-STATIONARY NOISE; OFFLINE; RECURSIVE EM; SPEECH ENHANCEMENT METHODS; SUBJECTIVE TESTS; TIME-INVARIANT MODELS; TIME-VARYING; TIME-VARYING MODEL PARAMETERS; UNIFIED FRAMEWORKS;

EID: 51449116166     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.885256     Document Type: Article
Times cited : (102)

References (37)
  • 1
    • 0026843273
    • A Bayesian estimation approach for speech enhancement using hidden Markov models
    • Y. Ephraim, "A Bayesian estimation approach for speech enhancement using hidden Markov models," IEEE Trans. Signal Process., vol. 40, no. 4, pp. 725-735, Apr. 1992.
    • (1992) IEEE Trans. Signal Process , vol.40 , Issue.4 , pp. 725-735
    • Ephraim, Y.1
  • 2
    • 0018455310
    • Suppression of acoustic noise in speech using spectral subtraction
    • S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-27 , pp. 113-120
    • Boll, S.1
  • 3
    • 64149119183
    • Enhanced variable rate codec, speech service option 3 for wideband spread spectrum digital systems
    • "Enhanced variable rate codec, speech service option 3 for wideband spread spectrum digital systems," 1996, TIA/EIA/IS-127.
  • 4
    • 0036296949
    • Speech enhancement using MMSE short time spectral estimation with gamma distributed speech priors
    • R. Martin, "Speech enhancement using MMSE short time spectral estimation with gamma distributed speech priors," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2002, vol. 1, pp. 253-256.
    • (2002) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 253-256
    • Martin, R.1
  • 5
    • 0041360463
    • Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
    • I. Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging," IEEE Trans. Speech Audio Process., vol. 11, no. 5, pp. 466-475, Sep. 2003.
    • (2003) IEEE Trans. Speech Audio Process , vol.11 , Issue.5 , pp. 466-475
    • Cohen, I.1
  • 6
    • 0035396555
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504-512, Jul. 2001.
    • (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.5 , pp. 504-512
    • Martin, R.1
  • 8
    • 0026881830
    • Gain-adapted hidden Markov models for recognition of clean and noisy speech
    • Y. Ephraim, "Gain-adapted hidden Markov models for recognition of clean and noisy speech," IEEE Trans. Signal Process., vol. 40, no. 6, pp. 1303-1316, Jun. 1992.
    • (1992) IEEE Trans. Signal Process , vol.40 , Issue.6 , pp. 1303-1316
    • Ephraim, Y.1
  • 9
    • 0028420014
    • Integrated models of signal and background with application to speaker identification in noise
    • R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 245-257, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.2 , pp. 245-257
    • Rose, R.C.1    Hofstetter, E.M.2    Reynolds, D.A.3
  • 10
    • 0001459635
    • Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises
    • Y. Zhao, "Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises," IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 255-266, May 2000.
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.3 , pp. 255-266
    • Zhao, Y.1
  • 11
    • 0035400320
    • Adaptive model-based speech enhancement
    • B. Logan and T. Robinson, "Adaptive model-based speech enhancement," Speech Commun., vol. 34, no. 4, pp. 351-368, Jul. 2001.
    • (2001) Speech Commun , vol.34 , Issue.4 , pp. 351-368
    • Logan, B.1    Robinson, T.2
  • 12
    • 0347968277
    • Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition
    • L. Deng, J. Droppo, and A. Acero, "Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition," IEEE Trans. Speech Audio Process., vol. 11, no. 6, pp. 568-580, Nov. 2003.
    • (2003) IEEE Trans. Speech Audio Process , vol.11 , Issue.6 , pp. 568-580
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 13
    • 0032166087
    • HMM-based strategies for enhancement of speech signals embedded in nonstationary noise
    • H. Sameti, H. Sheikhzadeh, L. Deng, and R. L. Brennan, "HMM-based strategies for enhancement of speech signals embedded in nonstationary noise," IEEE Trans. Speech Audio Process., vol. 6, no. 5, pp. 445-455, Sep. 1998.
    • (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.5 , pp. 445-455
    • Sameti, H.1    Sheikhzadeh, H.2    Deng, L.3    Brennan, R.L.4
  • 14
    • 0034842353
    • Estimation of the excitation variances of speech and noise AR-models for enhanced speech coding
    • M. Kuropatwinski and W. B. Kleijn, "Estimation of the excitation variances of speech and noise AR-models for enhanced speech coding," in Proc. IEEE Int. Conf. Acoust., Speech Signal Process., May 2001, vol. 1, pp. 669-672.
    • (2001) Proc. IEEE Int. Conf. Acoust., Speech Signal Process , vol.1 , pp. 669-672
    • Kuropatwinski, M.1    Kleijn, W.B.2
  • 15
    • 33744970011
    • Codebook driven short-term predictor parameter estimation for speech enhancement
    • S. Srinivasan, J. Samuelsson, and W. B. Kleijn, "Codebook driven short-term predictor parameter estimation for speech enhancement," IEEE Trans. Speech Audio Process., vol. 14, no. 1, pp. 163-176, Jan. 2006.
    • (2006) IEEE Trans. Speech Audio Process , vol.14 , Issue.1 , pp. 163-176
    • Srinivasan, S.1    Samuelsson, J.2    Kleijn, W.B.3
  • 16
    • 33645821784
    • Codebook-based Bayesian speech enhancement
    • S. Srinivasan, J. Samuelsson, and W. B. Kleijn, "Codebook-based Bayesian speech enhancement," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2005, vol. 1, pp. 1077-1080.
    • (2005) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 1077-1080
  • 17
    • 0036471883
    • Nonstationary-state hidden Markov model representation of speech signals for speech enhancement
    • H. Sameti and L. Deng, "Nonstationary-state hidden Markov model representation of speech signals for speech enhancement," Elsevier Signal Process. J., vol. 82, no. 2, pp. 205-227, Feb. 2002.
    • (2002) Elsevier Signal Process. J , vol.82 , Issue.2 , pp. 205-227
    • Sameti, H.1    Deng, L.2
  • 18
    • 0028195651
    • Waveform-based speech recognition using hidden filter models: Parameter selection and sensitivity to power normalization
    • H. Sheikhzadeh and L. Deng, "Waveform-based speech recognition using hidden filter models: Parameter selection and sensitivity to power normalization," IEEE Trans. Speech Audio Process., vol. 2, no. 1, pp. 80-89, Jan. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.1 , pp. 80-89
    • Sheikhzadeh, H.1    Deng, L.2
  • 19
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc. B, vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. B , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.2    Rubin, D.B.3
  • 20
    • 0001593436
    • Recursive parameter estimation using incomplete data
    • D. M. Titterington, "Recursive parameter estimation using incomplete data," J. R. Statist. Soc. B, vol. 46, no. 2, pp. 257-267, 1984.
    • (1984) J. R. Statist. Soc. B , vol.46 , Issue.2 , pp. 257-267
    • Titterington, D.M.1
  • 21
    • 33745221416
    • On noise gain estimation for HMM-based speech enhancement
    • D. Zhao and W. B. Kleijn, "On noise gain estimation for HMM-based speech enhancement," in Proc. Interspeech, Sep. 2005, pp. 2113-2116.
    • (2005) Proc. Interspeech , pp. 2113-2116
    • Zhao, D.1    Kleijn, W.B.2
  • 23
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • L. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.1
  • 24
    • 0022270364
    • Mixture autoregressive hidden Markov models for speech signals
    • B.-H. Juang and L. R. Rabiner, "Mixture autoregressive hidden Markov models for speech signals," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 6, pp. 1404-1413, Dec. 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-33 , Issue.6 , pp. 1404-1413
    • Juang, B.-H.1    Rabiner, L.R.2
  • 25
    • 0027797470
    • Online estimation of hidden Markov model parameters based on the Kullback-Leibler information measure
    • V. Krishnamurthy and J. Moore, "Online estimation of hidden Markov model parameters based on the Kullback-Leibler information measure," IEEE Trans. Signal Process., vol. 41, no. 8, pp. 2557-2573, Aug. 1993.
    • (1993) IEEE Trans. Signal Process , vol.41 , Issue.8 , pp. 2557-2573
    • Krishnamurthy, V.1    Moore, J.2
  • 27
    • 0004117648
    • Toeplitz and circulant matrices: A review
    • R. Gray, "Toeplitz and circulant matrices: A review," Inform. Sys. Lab., Stanford Univ., Stanford, CA, 2005, Tech. Rep. No. 6504-1, revised.
    • (2005) Toeplitz and circulant matrices: A review
    • Gray, R.1
  • 29
    • 0002077742
    • Quantization of LPC parameters
    • K. K. Paliwal and W. B. Kleijn, "Quantization of LPC parameters," in Speech Coding and Synthesis, W. B. Kleijn and K. K. Paliwal, Eds. New York: Elsevier, 1995, ch. 12, pp. 433-466.
    • (1995) Speech Coding and Synthesis , pp. 433-466
    • Paliwal, K.K.1    Kleijn, W.B.2
  • 30
    • 64149127893
    • Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs
    • "Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs," 2001, ITU-T Rec. P.862.
  • 32
    • 0024035451
    • A unified approach for encoding clean and noisy sources by means of waveform and autoregressive model vector quantization
    • Y. Ephraim and R. M. Gray, "A unified approach for encoding clean and noisy sources by means of waveform and autoregressive model vector quantization," IEEE Trans. Inform. Theory, vol. 34, no. 4, pp. 826-834, Jul. 1988.
    • (1988) IEEE Trans. Inform. Theory , vol.34 , Issue.4 , pp. 826-834
    • Ephraim, Y.1    Gray, R.M.2
  • 34
    • 84961779105
    • Enhancement of coded speech by constrained optimization
    • W. B. Kleijn, "Enhancement of coded speech by constrained optimization," in Proc. IEEE Workshop on Speech Coding, Oct. 2002, pp. 163-165.
    • (2002) Proc. IEEE Workshop on Speech Coding , pp. 163-165
    • Kleijn, W.B.1
  • 35
    • 64149104224
    • Methods for subjective determination of transmission quality
    • "Methods for subjective determination of transmission quality," 1996, ITU-T Rec. P.800.


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.