SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 17, Issue 2, 2009, Pages 231-246

Integrated speech enhancement method using noise suppression and dereverberation

(3) Yoshioka, Takuya a Nakatani, Tomohiro a Miyoshi, Masato a

a NTT Communication Science Laboratories (Japan)

Author keywords

Dereverberation; Maximum likelihood (ML) estimation; Minimum mean square error (MMSE) estimation; Noise suppression; Speech enhancement

Indexed keywords

AUTOREGRESSIVE SYSTEMS; DEREVERBERATION; ESTIMATED PARAMETER; GAUSSIAN RANDOM VARIABLE; MAXIMUM LIKELIHOOD ESTIMATION METHOD; MAXIMUM-LIKELIHOOD (ML) ESTIMATION; MINIMUM MEAN SQUARE ERROR (MMSE) ESTIMATION; MINIMUM MEAN SQUARE ERROR ESTIMATE; NOISE SUPPRESSION; REVERBERATION TIME; SPECTRAL COMPONENTS; SPEECH ENHANCEMENT METHODS; SPEECH MODELS; SPEECH SIGNALS; SPEECH SPECTRA; STATIONARY NOISE; TIME INVARIANTS;

ARCHITECTURAL ACOUSTICS; BLOCK CODES; FREQUENCY BANDS; MEAN SQUARE ERROR; PARAMETER ESTIMATION; POLES; POWER SPECTRAL DENSITY; POWER SPECTRUM; PROBABILITY DISTRIBUTIONS; RANDOM PROCESSES; RANDOM VARIABLES; REVERBERATION; SIGNAL TO NOISE RATIO; SPEECH ENHANCEMENT;

MAXIMUM LIKELIHOOD ESTIMATION;

EID: 70350435249 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2008.2008042 Document Type: Article

Times cited : (75)

References (37)

1
- 0003927842
- Upper Saddle River, NJ: Prentice-Hall
- T. Quatieri, Discrete-Time Speech Signal Processing. Upper Saddle River, NJ: Prentice-Hall, 2002.
- (2002) Discrete-Time Speech Signal Processing
- Quatieri, T.¹

2
- 0017980972
- All-pole modeling of degraded speech
- Jun.
- J. S. Lim and A. V. Oppenheim, "All-pole modeling of degraded speech, " IEEE Trans. Acoust. Speech, Signal Process., vol. ASSP-26, no. 3, pp. 197-210, Jun. 1978.
- (1978) IEEE Trans. Acoust. Speech, Signal Process. , vol.ASSP-26 , Issue.3 , pp. 197-210
- Lim, J.S.¹ Oppenheim, A.V.²

3
- 0032123744
- Iterative and sequential kalman filter-based speech enhancement algorithms
- Jul.
- S. Gannot, D. Burshtein, and E. Weinstein, "Iterative and sequential Kalman filter-based speech enhancement algorithms, " IEEE Trans. Speech Audio Process., vol. 6, no. 4, pp. 373-385, Jul. 1998.
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.4 , pp. 373-385
- Gannot, S.¹ Burshtein, D.² Weinstein, E.³

4
- 51449109652
- Codebook-based bayesian speech enhancement for nonstationary environments
- Feb.
- S. Srinivasan, J. Samuelsson, and W. B. Kleijn, "Codebook-based Bayesian speech enhancement for nonstationary environments, " IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 2, pp. 441-452, Feb. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.2 , pp. 441-452
- Srinivasan, S.¹ Samuelsson, J.² Kleijn, W.B.³

5
- 0029185029
- EVAM: An eigenvector-based algorithm for multichannel blind deconvolution of input colored signals
- Jan.
- M. I. Gurelli and C. L. Nikias, "EVAM: An eigenvector-based algorithm for multichannel blind deconvolution of input colored signals, " IEEE Trans. Signal Process., vol. 43, no. 1, pp. 134-149, Jan. 1995.
- (1995) IEEE Trans. Signal Process. , vol.43 , Issue.1 , pp. 134-149
- Gurelli, M.I.¹ Nikias, C.L.²

6
- 0242271432
- Subspace methods for multimicrophone speech dereverberation
- S. Gannot and M. Moonen, "Subspace methods for multimicrophone speech dereverberation, " EURASIP J. Appl. Signal Process., vol. 2003, no. 11, pp. 1074-1090, 2003.
- (2003) EURASIP J. Appl. Signal Process. , vol.2003 , Issue.11 , pp. 1074-1090
- Gannot, S.¹ Moonen, M.²

7
- 34247241719
- Inverse filtering for speech dereverberation less sensitive to noise and room transfer function fluctuations
- 10.1155/2007/34013, article ID 34013
- T. Hikichi, M. Delcroix, and M. Miyoshi, "Inverse filtering for speech dereverberation less sensitive to noise and room transfer function fluctuations, " EURASIP J. Adv. Signal Process., vol. 2007, 2007, 10.1155/2007/34013, article ID 34013..
- (2007) EURASIP J. Adv. Signal Process. , vol.2007
- Hikichi, T.¹ Delcroix, M.² Miyoshi, M.³

8
- 70350478055
- An experimental study of the eigendecomposition methods for blind simo system identification in the presence of noise
- CD-ROM Proc
- S. Javidi, N. D. Gaubitch, and P. A. Naylor, "An experimental study of the eigendecomposition methods for blind SIMO system identification in the presence of noise, " in Proc. Int. Worksh. Acoust. Echo, Noise Contr., 2006, CD-ROM Proc..
- (2006) Proc. Int. Worksh. Acoust. Echo, Noise Contr.
- Javidi, S.¹ Gaubitch, N.D.² Naylor, P.A.³

9
- 0001379957
- Enhancement of reverberant speech using LP residual signal
- May
- B. Yegnanarayana and P. S. Murthy, "Enhancement of reverberant speech using LP residual signal, " IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 267-281, May 2000.
- (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.3 , pp. 267-281
- Yegnanarayana, B.¹ Murthy, P.S.²

10
- 0034857681
- Speech dereverberation via maximum-kurtosis subband adaptive filtering
- B. W. Gillespie, H. S. Malvar, and D. A. F. Florêncio, "Speech dereverberation via maximum-kurtosis subband adaptive filtering, " in Proc. Int. Conf. Acoust., Speech, Signal Process., 2001, vol. VI, pp. 3701-3704.
- (2001) Proc. Int. Conf. Acoust., Speech, Signal Process. , vol.6 , pp. 3701-3704
- Gillespie, B.W.¹ Malvar, H.S.² Florêncio, D.A.F.³

11
- 33845361792
- On the use of linear prediction for dereverberation of speech
- N. D. Gaubitch, P. A. Naylor, and D. B. Ward, "On the use of linear prediction for dereverberation of speech, " in Proc. Int. Worksh. Acoust. Echo, Noise Contr., 2003, pp. 99-102.
- (2003) Proc. Int. Worksh. Acoust. Echo, Noise Contr. , pp. 99-102
- Gaubitch, N.D.¹ Naylor, P.A.² Ward, D.B.³

12
- 34548571735
- Harmonicity-based dereverberation for single-channel speech signals
- Jan.
- T. Nakatani, K. Kinoshita, and M. Miyoshi, "Harmonicity-based dereverberation for single-channel speech signals, " IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 80-95, Jan. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 80-95
- Nakatani, T.¹ Kinoshita, K.² Miyoshi, M.³

13
- 33947694356
- Spectral subtraction steered by multi-step forward linear prediction for single channel speech dereverberation
- K. Kinoshita, T. Nakatani, and M. Miyoshi, "Spectral subtraction steered by multi-step forward linear prediction for single channel speech dereverberation, " in Proc. Int. Conf. Acoust., Speech, Signal Process., 2006, vol. I, pp. 817-820.
- (2006) Proc. Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 817-820
- Kinoshita, K.¹ Nakatani, T.² Miyoshi, M.³

14
- 0042362199
- Blind single channel deconvolution using nonstationary signal processing
- J. R. Hopgood and P. J. W. Rayner, "Blind single channel deconvolution using nonstationary signal processing, " IEEE Trans. Speech, Audio Process., vol. 11, no. 5, pp. 476-488, 2003.
- (2003) IEEE Trans. Speech, Audio Process. , vol.11 , Issue.5 , pp. 476-488
- Hopgood, J.R.¹ Rayner, P.J.W.²

15
- 34548569780
- Precise dereverberation using multichannel linear prediction
- Feb.
- M. Delcroix, T. Hikichi, and M. Miyoshi, "Precise dereverberation using multichannel linear prediction, " IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 2, pp. 430-440, Feb. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.2 , pp. 430-440
- Delcroix, M.¹ Hikichi, T.² Miyoshi, M.³

16
- 34548555989
- Dereverberation by using time-variant nature of speech production system
- 10.1155/2007/65698, article ID 65698
- T. Yoshioka, T. Hikichi, and M. Miyoshi, "Dereverberation by using time-variant nature of speech production system, " EURASIP J. Adv. Signal Process., vol. 2007, 2007, 10.1155/2007/65698, article ID 65698.
- (2007) EURASIP J. Adv. Signal Process. , vol.2007
- Yoshioka, T.¹ Hikichi, T.² Miyoshi, M.³

17
- 50249160056
- Overfittingresistant speech dereverberation
- T. Yoshioka, T. Nakatani, T. Hikichi, and M. Miyoshi, "Overfittingresistant speech dereverberation, " in Proc. IEEE Worksh. Appl. Signal Process. Audio, Acoust., 2007, pp. 163-166.
- (2007) Proc. IEEE Worksh. Appl. Signal Process. Audio, Acoust. , pp. 163-166
- Yoshioka, T.¹ Nakatani, T.² Hikichi, T.³ Miyoshi, M.⁴

18
- 34547538582
- Study on speech dereverberation with autocorrelation codebook
- T. Nakatani, B.-H. Juang, T. Hikichi, T. Yoshioka, K. Kinoshita, M. Delcroix, and M. Miyoshi, "Study on speech dereverberation with autocorrelation codebook, " in Proc. Int. Conf. Acoust. Speech, Signal Process., 2007, vol. I, pp. 193-196.
- (2007) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 193-196
- Nakatani, T.¹ Juang, B.-H.² Hikichi, T.³ Yoshioka, T.⁴ Kinoshita, K.⁵ Delcroix, M.⁶ Miyoshi, M.⁷

19
- 50249151398
- Importance of energy and spectral features in gaussian source model for speech dereverberation
- T. Nakatani, B. H. Juang, T. Yoshioka, K. Kinoshita, and M. Miyoshi, "Importance of energy and spectral features in Gaussian source model for speech dereverberation, " in Proc. IEEE Workshop Appl. Signal Process. Audio, Acoust., 2007, pp. 299-302.
- (2007) Proc. IEEE Workshop Appl. Signal Process. Audio, Acoust. , pp. 299-302
- Nakatani, T.¹ Juang, B.H.² Yoshioka, T.³ Kinoshita, K.⁴ Miyoshi, M.⁵

20
- 51449121832
- Blind speech dereverberation with multi-channel linear prediction based on short time fourier transform representation
- T. Nakatani, T. Yoshioka, K. Kinoshita, M. Miyoshi, and B.-H. Juang, "Blind speech dereverberation with multi-channel linear prediction based on short time Fourier transform representation, " in Proc. Int. Conf. Acoust. Speech, Signal Process., 2008, pp. 85-88.
- (2008) Proc. Int. Conf. Acoust. Speech, Signal Process. , pp. 85-88
- Nakatani, T.¹ Yoshioka, T.² Kinoshita, K.³ Miyoshi, M.⁴ Juang, B.-H.⁵

21
- 0009589653
- Speech denoising and dereverberation using probabilistic models
- H. Attias, J. C. Platt, A. Acero, and L. Deng, "Speech denoising and dereverberation using probabilistic models, " Adv. Neural Inf. Process. Syst., vol. 13, pp. 758-764, 2000.
- (2000) Adv. Neural Inf. Process. Syst. , vol.13 , pp. 758-764
- Attias, H.¹ Platt, J.C.² Acero, A.³ Deng, L.⁴

22
- 51449085414
- Multi-step linear prediction based speech dereverberation in noisy reverberant environment
- K. Kinoshita, T. Nakatani, M. Delcroix, and M. Miyoshi, "Multi-step linear prediction based speech dereverberation in noisy reverberant environment, " in Proc. Interspeech, 2007, pp. 854-857.
- (2007) Proc. Interspeech , pp. 854-857
- Kinoshita, K.¹ Nakatani, T.² Delcroix, M.³ Miyoshi, M.⁴

23
- 51449104008
- Dereverberation and denoising using multichannel linear prediction
- Aug.
- M. Delcroix, T. Hikichi, and M. Miyoshi, "Dereverberation and denoising using multichannel linear prediction, " IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1791-1801, Aug. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.6 , pp. 1791-1801
- Delcroix, M.¹ Hikichi, T.² Miyoshi, M.³

24
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Apr.
- S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction, " IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
- Boll, S.F.¹

25
- 0035396555
- Noise power spectral density estimation based on optimal smoothing and minimum statistics
- Jul.
- R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics, " IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504-512, Jul. 2001.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 504-512
- Martin, R.¹

26
- 0023961145
- Inverse filtering of room acoustics
- Feb.
- M. Miyoshi and Y. Kaneda, "Inverse filtering of room acoustics, " IEEE Trans. Acoust., Speech, Signal Process., vol. 36, no. 2, pp. 145-152, Feb. 1988.
- (1988) IEEE Trans. Acoust., Speech, Signal Process. , vol.36 , Issue.2 , pp. 145-152
- Miyoshi, M.¹ Kaneda, Y.²

27
- 0003023502
- Dereverberation of speech signals based on sub-band envelope estimation
- H. Wang and F. Itakura, "Dereverberation of speech signals based on sub-band envelope estimation, " IEICE Trans. Fund., vol. E74-A, no. 11, pp. 3576-3583, 1991.
- (1991) IEICE Trans. Fund. , vol.E74-A , Issue.11 , pp. 3576-3583
- Wang, H.¹ Itakura, F.²

28
- 50449087796
- System identification in the short-time fourier transform domain with crossband filtering
- May
- Y. Avargel and I. Cohen, "System identification in the short-time Fourier transform domain with crossband filtering, " IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1305-1319, May 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1305-1319
- Avargel, Y.¹ Cohen, I.²

29
- 0036289676
- Acoustic diversity for improved speech recognition in reverberant environments
- B. W. Gillespie and L. E. Atlas, "Acoustic diversity for improved speech recognition in reverberant environments, " in Proc. Int. Conf. Acoust., Speech, Signal Process., 2002, vol. I, pp. 557-560.
- (2002) Proc. Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 557-560
- Gillespie, B.W.¹ Atlas, L.E.²

30
- 50449108163
- Speech dereverberation in short time fourier transform domain with crossband effect compensation
- T. Nakatani, T. Yoshioka, K. Kinoshita, M. Miyoshi, and B.-H. Juang, "Speech dereverberation in short time Fourier transform domain with crossband effect compensation, " in Hands-Free Speech Commun, Mic. Arrays, 2008, pp. 220-223.
- (2008) Hands-Free Speech Commun, Mic. Arrays , pp. 220-223
- Nakatani, T.¹ Yoshioka, T.² Kinoshita, K.³ Miyoshi, M.⁴ Juang, B.-H.⁵

31
- 0003425258
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and R. W. Schafer, Digital Processing of Speech Signals. Englewood Cliffs, NJ: Prentice-Hall, 1983.
- (1983) Digital Processing of Speech Signals
- Rabiner, L.R.¹ Schafer, R.W.²

32
- 0029274575
- The multivariate complex normal distribution-a generalization
- Mar.
- A. van den Bos, "The multivariate complex normal distribution-A generalization, " IEEE Trans. Inf. Theory, vol. 41, no. 2, pp. 537-539, Mar. 1995.
- (1995) IEEE Trans. Inf. Theory , vol.41 , Issue.2 , pp. 537-539
- Van Den Bos, A.¹

33
- 0021645331
- Speech enhancement using a minimum mean square error short-time spectral amplitude estimator
- Dec
- Y. Ephraim, "Speech enhancement using a minimum mean square error short-time spectral amplitude estimator, " IEEE Trans. Acoust. Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
- (1984) IEEE Trans. Acoust. Speech, Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹

34
- 0003807773
- 3rd ed. Englewood Cliffs, NJ: Prentice-Hall
- S. Haykin, Adaptive Filter Theory, 3rd ed. Englewood Cliffs, NJ: Prentice-Hall, 1995.
- (1995) Adaptive Filter Theory
- Haykin, S.¹

35
- 0000251971
- Maximum likelihood estimation via the ecm algorithm: A general framework
- X. L. Meng and D. B. Rubin, "Maximum likelihood estimation via the ECM algorithm: A general framework, " Biometrika, vol. 80, no. 2, pp. 267-278, 1993.
- (1993) Biometrika , vol.80 , Issue.2 , pp. 267-278
- Meng, X.L.¹ Rubin, D.B.²

36
- 0016962212
- Implementation of the digital phase vocoder using the fast fourier transform
- Jun.
- M. R. Portnoff, "Implementation of the digital phase vocoder using the fast Fourier transform, " IEEE Trans. Acoust. Speech, Signal Process., vol. ASSP-24, no. 3, pp. 243-248, Jun. 1976.
- (1976) IEEE Trans. Acoust. Speech, Signal Process. , vol.ASSP-24 , Issue.3 , pp. 243-248
- Portnoff, M.R.¹

37
- 34548599475
- Acoustical Society of Japan [Online]. Available
- ASJ Continuous Speech Corpus, Acoustical Society of Japan [Online]. Available: http://www.milab.is.tsukuba.ac.jp/jnas/instruct.html.
- ASJ Continuous Speech Corpus

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.