메뉴 건너뛰기




Volumn 17, Issue 2, 2009, Pages 231-246

Integrated speech enhancement method using noise suppression and dereverberation

Author keywords

Dereverberation; Maximum likelihood (ML) estimation; Minimum mean square error (MMSE) estimation; Noise suppression; Speech enhancement

Indexed keywords

AUTOREGRESSIVE SYSTEMS; DEREVERBERATION; ESTIMATED PARAMETER; GAUSSIAN RANDOM VARIABLE; MAXIMUM LIKELIHOOD ESTIMATION METHOD; MAXIMUM-LIKELIHOOD (ML) ESTIMATION; MINIMUM MEAN SQUARE ERROR (MMSE) ESTIMATION; MINIMUM MEAN SQUARE ERROR ESTIMATE; NOISE SUPPRESSION; REVERBERATION TIME; SPECTRAL COMPONENTS; SPEECH ENHANCEMENT METHODS; SPEECH MODELS; SPEECH SIGNALS; SPEECH SPECTRA; STATIONARY NOISE; TIME INVARIANTS;

EID: 70350435249     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.2008042     Document Type: Article
Times cited : (75)

References (37)
  • 3
    • 0032123744 scopus 로고    scopus 로고
    • Iterative and sequential kalman filter-based speech enhancement algorithms
    • Jul.
    • S. Gannot, D. Burshtein, and E. Weinstein, "Iterative and sequential Kalman filter-based speech enhancement algorithms, " IEEE Trans. Speech Audio Process., vol. 6, no. 4, pp. 373-385, Jul. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.4 , pp. 373-385
    • Gannot, S.1    Burshtein, D.2    Weinstein, E.3
  • 4
    • 51449109652 scopus 로고    scopus 로고
    • Codebook-based bayesian speech enhancement for nonstationary environments
    • Feb.
    • S. Srinivasan, J. Samuelsson, and W. B. Kleijn, "Codebook-based Bayesian speech enhancement for nonstationary environments, " IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 2, pp. 441-452, Feb. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.2 , pp. 441-452
    • Srinivasan, S.1    Samuelsson, J.2    Kleijn, W.B.3
  • 5
    • 0029185029 scopus 로고
    • EVAM: An eigenvector-based algorithm for multichannel blind deconvolution of input colored signals
    • Jan.
    • M. I. Gurelli and C. L. Nikias, "EVAM: An eigenvector-based algorithm for multichannel blind deconvolution of input colored signals, " IEEE Trans. Signal Process., vol. 43, no. 1, pp. 134-149, Jan. 1995.
    • (1995) IEEE Trans. Signal Process. , vol.43 , Issue.1 , pp. 134-149
    • Gurelli, M.I.1    Nikias, C.L.2
  • 6
    • 0242271432 scopus 로고    scopus 로고
    • Subspace methods for multimicrophone speech dereverberation
    • S. Gannot and M. Moonen, "Subspace methods for multimicrophone speech dereverberation, " EURASIP J. Appl. Signal Process., vol. 2003, no. 11, pp. 1074-1090, 2003.
    • (2003) EURASIP J. Appl. Signal Process. , vol.2003 , Issue.11 , pp. 1074-1090
    • Gannot, S.1    Moonen, M.2
  • 7
    • 34247241719 scopus 로고    scopus 로고
    • Inverse filtering for speech dereverberation less sensitive to noise and room transfer function fluctuations
    • 10.1155/2007/34013, article ID 34013
    • T. Hikichi, M. Delcroix, and M. Miyoshi, "Inverse filtering for speech dereverberation less sensitive to noise and room transfer function fluctuations, " EURASIP J. Adv. Signal Process., vol. 2007, 2007, 10.1155/2007/34013, article ID 34013..
    • (2007) EURASIP J. Adv. Signal Process. , vol.2007
    • Hikichi, T.1    Delcroix, M.2    Miyoshi, M.3
  • 8
    • 70350478055 scopus 로고    scopus 로고
    • An experimental study of the eigendecomposition methods for blind simo system identification in the presence of noise
    • CD-ROM Proc
    • S. Javidi, N. D. Gaubitch, and P. A. Naylor, "An experimental study of the eigendecomposition methods for blind SIMO system identification in the presence of noise, " in Proc. Int. Worksh. Acoust. Echo, Noise Contr., 2006, CD-ROM Proc..
    • (2006) Proc. Int. Worksh. Acoust. Echo, Noise Contr.
    • Javidi, S.1    Gaubitch, N.D.2    Naylor, P.A.3
  • 9
    • 0001379957 scopus 로고    scopus 로고
    • Enhancement of reverberant speech using LP residual signal
    • May
    • B. Yegnanarayana and P. S. Murthy, "Enhancement of reverberant speech using LP residual signal, " IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 267-281, May 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.3 , pp. 267-281
    • Yegnanarayana, B.1    Murthy, P.S.2
  • 12
    • 34548571735 scopus 로고    scopus 로고
    • Harmonicity-based dereverberation for single-channel speech signals
    • Jan.
    • T. Nakatani, K. Kinoshita, and M. Miyoshi, "Harmonicity-based dereverberation for single-channel speech signals, " IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 80-95, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 80-95
    • Nakatani, T.1    Kinoshita, K.2    Miyoshi, M.3
  • 13
    • 33947694356 scopus 로고    scopus 로고
    • Spectral subtraction steered by multi-step forward linear prediction for single channel speech dereverberation
    • K. Kinoshita, T. Nakatani, and M. Miyoshi, "Spectral subtraction steered by multi-step forward linear prediction for single channel speech dereverberation, " in Proc. Int. Conf. Acoust., Speech, Signal Process., 2006, vol. I, pp. 817-820.
    • (2006) Proc. Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 817-820
    • Kinoshita, K.1    Nakatani, T.2    Miyoshi, M.3
  • 14
    • 0042362199 scopus 로고    scopus 로고
    • Blind single channel deconvolution using nonstationary signal processing
    • J. R. Hopgood and P. J. W. Rayner, "Blind single channel deconvolution using nonstationary signal processing, " IEEE Trans. Speech, Audio Process., vol. 11, no. 5, pp. 476-488, 2003.
    • (2003) IEEE Trans. Speech, Audio Process. , vol.11 , Issue.5 , pp. 476-488
    • Hopgood, J.R.1    Rayner, P.J.W.2
  • 15
    • 34548569780 scopus 로고    scopus 로고
    • Precise dereverberation using multichannel linear prediction
    • Feb.
    • M. Delcroix, T. Hikichi, and M. Miyoshi, "Precise dereverberation using multichannel linear prediction, " IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 2, pp. 430-440, Feb. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.2 , pp. 430-440
    • Delcroix, M.1    Hikichi, T.2    Miyoshi, M.3
  • 16
    • 34548555989 scopus 로고    scopus 로고
    • Dereverberation by using time-variant nature of speech production system
    • 10.1155/2007/65698, article ID 65698
    • T. Yoshioka, T. Hikichi, and M. Miyoshi, "Dereverberation by using time-variant nature of speech production system, " EURASIP J. Adv. Signal Process., vol. 2007, 2007, 10.1155/2007/65698, article ID 65698.
    • (2007) EURASIP J. Adv. Signal Process. , vol.2007
    • Yoshioka, T.1    Hikichi, T.2    Miyoshi, M.3
  • 21
    • 0009589653 scopus 로고    scopus 로고
    • Speech denoising and dereverberation using probabilistic models
    • H. Attias, J. C. Platt, A. Acero, and L. Deng, "Speech denoising and dereverberation using probabilistic models, " Adv. Neural Inf. Process. Syst., vol. 13, pp. 758-764, 2000.
    • (2000) Adv. Neural Inf. Process. Syst. , vol.13 , pp. 758-764
    • Attias, H.1    Platt, J.C.2    Acero, A.3    Deng, L.4
  • 22
    • 51449085414 scopus 로고    scopus 로고
    • Multi-step linear prediction based speech dereverberation in noisy reverberant environment
    • K. Kinoshita, T. Nakatani, M. Delcroix, and M. Miyoshi, "Multi-step linear prediction based speech dereverberation in noisy reverberant environment, " in Proc. Interspeech, 2007, pp. 854-857.
    • (2007) Proc. Interspeech , pp. 854-857
    • Kinoshita, K.1    Nakatani, T.2    Delcroix, M.3    Miyoshi, M.4
  • 23
    • 51449104008 scopus 로고    scopus 로고
    • Dereverberation and denoising using multichannel linear prediction
    • Aug.
    • M. Delcroix, T. Hikichi, and M. Miyoshi, "Dereverberation and denoising using multichannel linear prediction, " IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1791-1801, Aug. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.6 , pp. 1791-1801
    • Delcroix, M.1    Hikichi, T.2    Miyoshi, M.3
  • 24
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr.
    • S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction, " IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 25
    • 0035396555 scopus 로고    scopus 로고
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • Jul.
    • R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics, " IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504-512, Jul. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 504-512
    • Martin, R.1
  • 27
    • 0003023502 scopus 로고
    • Dereverberation of speech signals based on sub-band envelope estimation
    • H. Wang and F. Itakura, "Dereverberation of speech signals based on sub-band envelope estimation, " IEICE Trans. Fund., vol. E74-A, no. 11, pp. 3576-3583, 1991.
    • (1991) IEICE Trans. Fund. , vol.E74-A , Issue.11 , pp. 3576-3583
    • Wang, H.1    Itakura, F.2
  • 28
    • 50449087796 scopus 로고    scopus 로고
    • System identification in the short-time fourier transform domain with crossband filtering
    • May
    • Y. Avargel and I. Cohen, "System identification in the short-time Fourier transform domain with crossband filtering, " IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1305-1319, May 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1305-1319
    • Avargel, Y.1    Cohen, I.2
  • 29
    • 0036289676 scopus 로고    scopus 로고
    • Acoustic diversity for improved speech recognition in reverberant environments
    • B. W. Gillespie and L. E. Atlas, "Acoustic diversity for improved speech recognition in reverberant environments, " in Proc. Int. Conf. Acoust., Speech, Signal Process., 2002, vol. I, pp. 557-560.
    • (2002) Proc. Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 557-560
    • Gillespie, B.W.1    Atlas, L.E.2
  • 32
    • 0029274575 scopus 로고
    • The multivariate complex normal distribution-a generalization
    • Mar.
    • A. van den Bos, "The multivariate complex normal distribution-A generalization, " IEEE Trans. Inf. Theory, vol. 41, no. 2, pp. 537-539, Mar. 1995.
    • (1995) IEEE Trans. Inf. Theory , vol.41 , Issue.2 , pp. 537-539
    • Van Den Bos, A.1
  • 33
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean square error short-time spectral amplitude estimator
    • Dec
    • Y. Ephraim, "Speech enhancement using a minimum mean square error short-time spectral amplitude estimator, " IEEE Trans. Acoust. Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE Trans. Acoust. Speech, Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1
  • 34
    • 0003807773 scopus 로고
    • 3rd ed. Englewood Cliffs, NJ: Prentice-Hall
    • S. Haykin, Adaptive Filter Theory, 3rd ed. Englewood Cliffs, NJ: Prentice-Hall, 1995.
    • (1995) Adaptive Filter Theory
    • Haykin, S.1
  • 35
    • 0000251971 scopus 로고
    • Maximum likelihood estimation via the ecm algorithm: A general framework
    • X. L. Meng and D. B. Rubin, "Maximum likelihood estimation via the ECM algorithm: A general framework, " Biometrika, vol. 80, no. 2, pp. 267-278, 1993.
    • (1993) Biometrika , vol.80 , Issue.2 , pp. 267-278
    • Meng, X.L.1    Rubin, D.B.2
  • 36
    • 0016962212 scopus 로고
    • Implementation of the digital phase vocoder using the fast fourier transform
    • Jun.
    • M. R. Portnoff, "Implementation of the digital phase vocoder using the fast Fourier transform, " IEEE Trans. Acoust. Speech, Signal Process., vol. ASSP-24, no. 3, pp. 243-248, Jun. 1976.
    • (1976) IEEE Trans. Acoust. Speech, Signal Process. , vol.ASSP-24 , Issue.3 , pp. 243-248
    • Portnoff, M.R.1
  • 37
    • 34548599475 scopus 로고    scopus 로고
    • Acoustical Society of Japan [Online]. Available
    • ASJ Continuous Speech Corpus, Acoustical Society of Japan [Online]. Available: http://www.milab.is.tsukuba.ac.jp/jnas/instruct.html.
    • ASJ Continuous Speech Corpus


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.