SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 19, Issue 5, 2011, Pages 1123-1137

Estimators of the Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty

The University of Texas at Dallas (United States)

Author keywords

Binary mask; maximum a posteriori (MAP) estimators; minimum mean square error (MMSE) estimators; soft mask; speech enhancement

Indexed keywords

EID: 85008013225
PISSN: 15587916
EISSN: 15587924
Source Type: Journal
DOI: 10.1109/TASL.2010.2082531
Document Type: Article

Times cited: 98

References (44)

1
- 34447100796
- 1st ed. Boca Raton, FL: CRC Taylor & Francis
- P. Loizou, Speech Enhancement: Theory and Practice, 1st ed. Boca Raton, FL: CRC Taylor & Francis, 2007.
- (2007) Speech Enhancement: Theory and Practice
- Loizou, P.¹

2
- 0021645331
- Speech enhancement using a minimum mean square error short-time spectral amplitude estimator
- Dec.
- Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean square error short-time spectral amplitude estimator,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109–1121, Dec. 1984.
- (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

3
- 0021892216
- Speech enhancement using a minimum mean square error log-spectral amplitude estimator
- Apr.
- Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean square error log-spectral amplitude estimator,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 2, pp. 443–445, Apr. 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-33 , Issue.2 , pp. 443-445
- Ephraim, Y.¹ Malah, D.²

4
- 34447092407
- Subjective evaluation and comparison of speech enhancement algorithms
- Y. Hu and P. Loizou, “Subjective evaluation and comparison of speech enhancement algorithms,” Speech Commun., vol. 49, pp. 588–601, 2007.
- (2007) Speech Commun. , vol.49 , pp. 588-601
- Hu, Y.¹ Loizou, P.²

5
- 2942524164
- Suppression of additive noise using a power spectral density MMSE estimator
- Jun.
- G. H. Ding, T. Huang, and B. Xu, “Suppression of additive noise using a power spectral density MMSE estimator,” IEEE Signal Process. Lett., vol. 11, no. 6, pp. 585–588, Jun. 2004.
- (2004) IEEE Signal Process. Lett. , vol.11 , Issue.6 , pp. 585-588
- Ding, G.H.¹ Huang, T.² Xu, B.³

6
- 0032672098
- A modular approach to speech enhancement with an application to speech coding
- Phoenix, AZ, May
- A. Accardi and R. Cox, “A modular approach to speech enhancement with an application to speech coding,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (ICASSP'99), Phoenix, AZ, May 1999, pp. 201–204.
- (1999) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (ICASSP'99) , pp. 201-204
- Accardi, A.¹ Cox, R.²

7
- 0141957802
- Efficient alternatives to Ephraim and Malah suppression rule for audio signal enhancement
- P. J. Wolfe and S. J. Godsill, “Efficient alternatives to Ephraim and Malah suppression rule for audio signal enhancement,” EURASIP J. Appl. Signal Process., vol. 2003, no. 10, pp. 1043–1051, 2003.
- (2003) EURASIP J. Appl. Signal Process. , vol.2003 , Issue.10 , pp. 1043-1051
- Wolfe, P.J.¹ Godsill, S.J.²

8
- 22544465033
- β-order MMSE spectral amplitude estimation for speech enhancement
- Jul.
- C. H. You, S. N. Koh, and S. Rahardja, “β-order MMSE spectral amplitude estimation for speech enhancement,” IEEE Trans. Speech Audio Process., vol. 13, no. 4, pp. 475–486, Jul. 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.4 , pp. 475-486
- You, C.H.¹ Koh, S.N.² Rahardja, S.³

9
- 34447099536
- A data-driven approach to optimizing spectral speech enhancement methods for various error criteria
- J. Erkelens, J. Jensen, and R. Heusdens, “A data-driven approach to optimizing spectral speech enhancement methods for various error criteria,” Speech Commun., vol. 49, no. 7–8, pp. 530–541, 2007.
- (2007) Speech Commun. , vol.49 , Issue.7-8 , pp. 530-541
- Erkelens, J.¹ Jensen, J.² Heusdens, R.³

10
- 27644563039
- Relaxed statistical model for speech enhancement and a priori SNR estimation
- Sep.
- I. Cohen, “Relaxed statistical model for speech enhancement and a priori SNR estimation,” IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 870–881, Sep. 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 870-881
- Cohen, I.¹

11
- 0035556258
- Simple alternatives to the Ephraim and Malah suppression rule for speech enhancement
- Aug.
- P. J. Wolfe and S. J. Godsill, “Simple alternatives to the Ephraim and Malah suppression rule for speech enhancement,” in Proc. 11th IEEE Signal Process. Workshop Statist. Signal Process., Aug. 2001, pp. 496–499.
- (2001) Proc. 11th IEEE Signal Process. Workshop Statist. Signal Process. , pp. 496-499
- Wolfe, P.J.¹ Godsill, S.J.²

12
- 22944477796
- Noise reduction by maximum a posteriori spectral amplitude estimation with super Gaussian speech modeling
- Kyoto, Japan, Sep.
- T. Lotter and P. Vary, “Noise reduction by maximum a posteriori spectral amplitude estimation with super Gaussian speech modeling,” in Proc. Int. Workshop Acoust. Echo Noise Control (IWAENC'03), Kyoto, Japan, Sep. 2003, pp. 83–86.
- (2003) Proc. Int. Workshop Acoust. Echo Noise Control (IWAENC'03) , pp. 83-86
- Lotter, T.¹ Vary, P.²

13
- 48849113127
- Noise reduction by joint maximum a posteriori spectral amplitude and phase estimation with super Gaussian speech modeling
- Vienna, Austria, Sep.
- T. Lotter and P. Vary, “Noise reduction by joint maximum a posteriori spectral amplitude and phase estimation with super Gaussian speech modeling,” in Proc. EUSIPCO, Vienna, Austria, Sep. 2004, pp. 1457–1460.
- (2004) Proc. EUSIPCO , pp. 1457-1460
- Lotter, T.¹ Vary, P.²

14
- 22944438092
- Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model
- T. Lotter and P. Vary, “Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model,” EURASIP J. Appl. Signal Process., vol. 2005, no. 1, pp. 1110–1126, 2005.
- (2005) EURASIP J. Appl. Signal Process. , vol.2005 , Issue.1 , pp. 1110-1126
- Lotter, T.¹ Vary, P.²

15
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Apr.
- S. F. Boll, “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113–120, Apr. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
- Boll, S.F.¹

16
- 0018320733
- Enhancement of speech corrupted by acoustic noise
- M. Berouti, M. Schwartz, and J. Makhoul, “Enhancement of speech corrupted by acoustic noise,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1979, pp. 208–211.
- (1979) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 208-211
- Berouti, M.¹ Schwartz, M.² Makhoul, J.³

17
- 0028426335
- Noise reduction by noise-adaptive spectral magnitude expansion
- May
- W. Etter and G. S. Moschytz, “Noise reduction by noise-adaptive spectral magnitude expansion,” J. Audio Eng. Soc., vol. 42, pp. 341–349, May 1994.
- (1994) J. Audio Eng. Soc. , vol.42 , pp. 341-349
- Etter, W.¹ Moschytz, G.S.²

18
- 0032123832
- A parametric formulation of the generalized spectral subtraction method
- Jul.
- B. L. Sim, Y. C. Tong, J. S. Chang, and C. T. Tan, “A parametric formulation of the generalized spectral subtraction method,” IEEE Trans. Speech Audio Process., vol. 6, no. 4, pp. 328–337, Jul. 1998.
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.4 , pp. 328-337
- Sim, B.L.¹ Tong, Y.C.² Chang, J.S.³ Tan, C.T.⁴

19
- 0242327016
- Subband noise reduction methods for speech enhancement
- S. L. Gay and J. Benesty, Eds. Norwell, MA: Kluwer
- E. J. Diethorn, “Subband noise reduction methods for speech enhancement,” in Acoustic Signal Processing for Telecommunication, S. L. Gay and J. Benesty, Eds. Norwell, MA: Kluwer, 2000, pp. 155–178.
- (2000) Acoustic Signal Processing for Telecommunication , pp. 155-178
- Diethorn, E.J.¹

20
- 27644504471
- Suppressing acoustic echo in a spectral envelope space
- Sep.
- C. Faller and J. Chen, “Suppressing acoustic echo in a spectral envelope space,” IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 1048–1062, Sep. 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 1048-1062
- Faller, C.¹ Chen, J.²

21
- 44149115462
- A geometric approach to spectral subtraction
- Jun.
- Y. Lu and P. Loizou, “A geometric approach to spectral subtraction,” Speech Commun., vol. 50, no. 6, pp. 453–466, Jun. 2008.
- (2008) Speech Commun. , vol.50 , Issue.6 , pp. 453-466
- Lu, Y.¹ Loizou, P.²

22
- 82255178542
- Piscataway, NJ: Wiley/IEEE Press
- Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, D. Wang and G. Brown, Eds. Piscataway, NJ: Wiley/IEEE Press, 2006.
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications
- Wang, D.¹ Brown, G.²

23
- 33845354768
- Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
- D. S. Brungart, P. S. Chang, B. D. Simpson, and D. Wang, “Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation,” J. Acoust. Soc. Amer., vol. 120, no. 6, pp. 4007–4018, 2006.
- (2006) J. Acoust. Soc. Amer. , vol.120 , Issue.6 , pp. 4007-4018
- Brungart, D.S.¹ Chang, P.S.² Simpson, B.D.³ Wang, D.⁴

24
- 40749125179
- Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
- N. Li and P. Loizou, “Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction,” J. Acoust. Soc. Amer., vol. 123, no. 3, pp. 1673–1682, 2008.
- (2008) J. Acoust. Soc. Amer. , vol.123 , Issue.3 , pp. 1673-1682
- Li, N.¹ Loizou, P.²

25
- 58149196390
- On the optimality of ideal binary time-frequency masks
- Mar.
- Y. Li and D. Wang, “On the optimality of ideal binary time-frequency masks,” Speech Commun., vol. 51, pp. 230–239, Mar. 2009.
- (2009) Speech Commun. , vol.51 , pp. 230-239
- Li, Y.¹ Wang, D.²

26
- 84892233308
- On ideal binary mask as the computational goal of auditory scene analysis
- P. Divenyi, Ed. Norwell, MA: Kluwer
- D. Wang, “On ideal binary mask as the computational goal of auditory scene analysis,” in Speech Separation by Humans and Machines, P. Divenyi, Ed. Norwell, MA: Kluwer, 2005, pp. 181–197.
- (2005) Speech Separation by Humans and Machines , pp. 181-197
- Wang, D.¹

27
- 0029307534
- De-noising by soft-thresholding
- May
- D. L. Donoho, “De-noising by soft-thresholding,” IEEE Trans. Inf. Theory, vol. 41, no. 3, pp. 613–627, May 1995.
- (1995) IEEE Trans. Inf. Theory , vol.41 , Issue.3 , pp. 613-627
- Donoho, D.L.¹

28
- 84950459514
- Adapting to unknown smoothness via wavelet shrinkage
- D. L. Donoho and I. M. Johnstone, “Adapting to unknown smoothness via wavelet shrinkage,” J. Amer. Statist. Assoc., vol. 90, no. 432, pp. 1200–1224, 1995.
- (1995) J. Amer. Statist. Assoc. , vol.90 , Issue.432 , pp. 1200-1224
- Donoho, D.L.¹ Johnstone, I.M.²

29
- 0003794165
- ser. Lecture Notes in Statistics. Berlin, Germany: Springer-Verlag
- M. Jansen, Noise Reduction by Wavelet Thresholding, ser. Lecture Notes in Statistics. Berlin, Germany: Springer-Verlag, 2001, vol. 161.
- (2001) Noise Reduction by Wavelet Thresholding , vol.161
- Jansen, M.¹

30
- 34447095085
- A study of the distribution of time-domain speech samples and discrete Fourier coefficients
- J. Jensen, I. Batina, R. C. Hendriks, and R. Heusdens, “A study of the distribution of time-domain speech samples and discrete Fourier coefficients,” Proc. SPS-DARTS, vol. 1, pp. 155–158, 2005.
- (2005) Proc. SPS-DARTS , vol.1 , pp. 155-158
- Jensen, J.¹ Batina, I.² Hendriks, R.C.³ Heusdens, R.⁴

31
- 85008040661
- 4th ed. New York: McGraw-Hill
- A. Papoulis and S. U. Pillai, Probability, Random Variables and Stochastic Processes, 4th ed. New York: McGraw-Hill, 2002.
- (2002) Probability, Random Variables and Stochastic Processes
- Papoulis, A.¹ Pillai, S.U.²

32
- 0041958932
- Ideal spatial adaptation by wavelet shrinkage
- D. L. Donoho and I. M. Johnstone, “Ideal spatial adaptation by wavelet shrinkage,” Biometrika, vol. 81, no. 3, pp. 425–455, 1994.
- (1994) Biometrika , vol.81 , Issue.3 , pp. 425-455
- Donoho, D.L.¹ Johnstone, I.M.²

33
- 0003456805
- San Diego, CA: Academic
- S. Mallat, A Wavelet Tour of Signal Processing. San Diego, CA: Academic, 1999.
- (1999) A Wavelet Tour of Signal Processing
- Mallat, S.¹

34
- 64349110818
- Audio denoising by time-frequency block thresholding
- May
- G. Yu, S. Mallat, and E. Bacry, “Audio denoising by time-frequency block thresholding,” IEEE Trans. Signal Process., vol. 56, no. 5, pp. 1830–1839, May 2008.
- (2008) IEEE Trans. Signal Process. , vol.56 , Issue.5 , pp. 1830-1839
- Yu, G.¹ Mallat, S.² Bacry, E.³

35
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr.
- J.-L. Gauvain and C.-H. Lee, “Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains,” IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291–299, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-299
- Gauvain, J.-L.¹ Lee, C.-H.²

36
- 70349093614
- An algorithm that improves speech intelligibility in noise for normal-hearing listeners
- Sep.
- G. Kim, Y. Lu, Y. Hu, and P. C. Loizou, “An algorithm that improves speech intelligibility in noise for normal-hearing listeners,” J. Acoust. Soc. Amer., vol. 126, no. 3, pp. 1486–1494, Sep. 2009.
- (2009) J. Acoust. Soc. Amer. , vol.126 , Issue.3 , pp. 1486-1494
- Kim, G.¹ Lu, Y.² Hu, Y.³ Loizou, P.C.⁴

37
- 77956547397
- Improving speech intelligibility in noise using environment-optimized algorithms
- Sep.
- G. Kim and P. C. Loizou, “Improving speech intelligibility in noise using environment-optimized algorithms,” IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 8, pp. 2080–2090, Sep. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.8 , pp. 2080-2090
- Kim, G.¹ Loizou, P.C.²

38
- 0035396555
- Noise power spectral density estimation based on optimal smoothing and minimum statistics
- Jul.
- R. Martin, “Noise power spectral density estimation based on optimal smoothing and minimum statistics,” IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504–512, Jul. 2001.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 504-512
- Martin, R.¹

39
- 0036226165
- Noise estimation by minima controlled recursive averaging for robust speech enhancement
- Jan.
- I. Cohen and B. Berdugo, “Noise estimation by minima controlled recursive averaging for robust speech enhancement,” IEEE Signal Process. Lett., vol. 9, no. 1, pp. 12–15, Jan. 2002.
- (2002) IEEE Signal Process. Lett. , vol.9 , Issue.1 , pp. 12-15
- Cohen, I.¹ Berdugo, B.²

40
- 0019009880
- Speech enhancement using a soft-decision noise suppression filter
- Apr.
- R. McAulay and M. Malpass, “Speech enhancement using a soft-decision noise suppression filter,” IEEE Trans. Acoust., Speech, Signal Process., vol. 28, no. 2, pp. 137–145, Apr. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.28 , Issue.2 , pp. 137-145
- McAulay, R.¹ Malpass, M.²

41
- 59849095077
- ITU-T Rec.
- ITU-T Rec. P.862, “Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs,” 2000.
- (2000) Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs , Rec. P.862

42
- 44149106061
- Evaluation of objective quality measures for speech enhancement
- Jan.
- Y. Hu and P. Loizou, “Evaluation of objective quality measures for speech enhancement,” IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 1, pp. 229–238, Jan. 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.1 , pp. 229-238
- Hu, Y.¹ Loizou, P.²

43
- 33750311718
- Binary and ratio time-frequency masks for robust speech recognition
- Nov.
- S. Srinivasan, N. Roman, and D. Wang, “Binary and ratio time-frequency masks for robust speech recognition,” Speech Commun., vol. 48, pp. 1486–1501, Nov. 2006.
- (2006) Speech Commun. , vol.48 , pp. 1486-1501
- Srinivasan, S.¹ Roman, N.² Wang, D.³

44
- 0036543522
- Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator
- Apr.
- I. Cohen, “Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator,” IEEE Signal Process. Lett., vol. 9, no. 4, pp. 113–116, Apr. 2002.
- (2002) IEEE Signal Process. Lett. , vol.9 , Issue.4 , pp. 113-116
- Cohen, I.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.