메뉴 건너뛰기




Volumn 19, Issue 5, 2011, Pages 1123-1137

Estimators of the Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty

Author keywords

Binary mask; maximum a posterior (MAP) estimators; minimum mean square error (MMSE) estimators; soft mask; speech enhancement

Indexed keywords


EID: 85008013225     PISSN: 15587916     EISSN: 15587924     Source Type: Journal    
DOI: 10.1109/TASL.2010.2082531     Document Type: Article
Times cited : (98)

References (44)
  • 2
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean square error short-time spectral amplitude estimator
    • Dec.
    • Y. Ephraim and D. Malah “Speech enhancement using a minimum mean square error short-time spectral amplitude estimator,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109–1121, Dec. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 3
    • 0021892216 scopus 로고
    • Speech enhancement using a minimum mean square error log-spectral amplitude estimator
    • Apr.
    • Y. Ephraim and D. Malah “Speech enhancement using a minimum mean square error log-spectral amplitude estimator,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 2, pp. 443–445, Apr. 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 4
    • 34447092407 scopus 로고    scopus 로고
    • Subjective evaluation and comparison of speech enhancement algorithms
    • Y. Hu and P. Loizou “Subjective evaluation and comparison of speech enhancement algorithms,” Speech Commun., vol. 49, pp. 588–601, 2007.
    • (2007) Speech Commun. , vol.49 , pp. 588-601
    • Hu, Y.1    Loizou, P.2
  • 5
    • 2942524164 scopus 로고    scopus 로고
    • Suppression of additive noise using a power spectral density MMSE estimator
    • Jun.
    • G. H. Ding, T. Huang, and B. Xu, “Suppression of additive noise using a power spectral density MMSE estimator,” IEEE Signal Process. Lett., vol. 11, no. 6, pp. 585–588, Jun. 2004.
    • (2004) IEEE Signal Process. Lett. , vol.11 , Issue.6 , pp. 585-588
    • Ding, G.H.1    Huang, T.2    Xu, B.3
  • 6
  • 7
    • 0141957802 scopus 로고    scopus 로고
    • Efficient alternatives to Ephraim and Malah suppression rule for audio signal enhancement
    • P. J. Wolfe and S. J. Godsill “Efficient alternatives to Ephraim and Malah suppression rule for audio signal enhancement,” EURASIP J. Appl. Signal Process., vol. 2003, no. 10, pp. 1043–1051, 2003.
    • (2003) EURASIP J. Appl. Signal Process. , vol.2003 , Issue.10 , pp. 1043-1051
    • Wolfe, P.J.1    Godsill, S.J.2
  • 8
    • 22544465033 scopus 로고    scopus 로고
    • β-order MMSE spectral amplitude estimation for speech enhancement
    • Jul.
    • C. H. You, S. N. Koh, and S. Rahardja “β-order MMSE spectral amplitude estimation for speech enhancement,” IEEE Trans. Speech Audio Process., vol. 13, no. 4, pp. 475–486, Jul. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.4 , pp. 475-486
    • You, C.H.1    Koh, S.N.2    Rahardja, S.3
  • 9
    • 34447099536 scopus 로고    scopus 로고
    • A data-driven approach to optimizing spectral speech enhancement methods for various error criteria
    • 8
    • J. Erkelens, J. Jensen, and R. Heusdens “A data-driven approach to optimizing spectral speech enhancement methods for various error criteria,” Speech Commun., vol. 49, no. 7–8, pp. 530–541, 2007.
    • (2007) Speech Commun. , vol.49 , Issue.7 , pp. 530-541
    • Erkelens, J.1    Jensen, J.2    Heusdens, R.3
  • 10
    • 27644563039 scopus 로고    scopus 로고
    • Relaxed statistical model for speech enhancement and a priori SNR estimation
    • Sep.
    • I. Cohen “Relaxed statistical model for speech enhancement and a priori SNR estimation,” IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 870–881, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 870-881
    • Cohen, I.1
  • 12
    • 22944477796 scopus 로고    scopus 로고
    • Noise reduction by maximum a posteriori spectral amplitude estimation with super Gaussian speech modeling
    • Kyoto, Japan, Sep.
    • T. Lotter and P. Vary, “Noise reduction by maximum a posteriori spectral amplitude estimation with super Gaussian speech modeling,” in Proc. Int. Workshop Acoust. Echo Noise Control (IWAENC'03), Kyoto, Japan, Sep. 2003, pp. 83–86.
    • (2003) Proc. Int. Workshop Acoust. Echo Noise Control (IWAENC'03) , pp. 83-86
    • Lotter, T.1    Vary, P.2
  • 13
    • 48849113127 scopus 로고    scopus 로고
    • Noise reduction by joint maximum a posteriori spectral amplitude and phase estimation with super Gaussian speech modeling
    • Vienna, Austria, Sep.
    • T. Lotter and P. Vary, “Noise reduction by joint maximum a posteriori spectral amplitude and phase estimation with super Gaussian speech modeling,” in Proc. EUSIPCO, Vienna, Austria, Sep. 2004, pp. 1457–1460.
    • (2004) Proc. EUSIPCO , pp. 1457-1460
    • Lotter, T.1    Vary, P.2
  • 14
    • 22944438092 scopus 로고    scopus 로고
    • Speech enhancement by map spectral amplitude estimation using a super-Gaussian speech model
    • T. Lotter and P. Vary “Speech enhancement by map spectral amplitude estimation using a super-Gaussian speech model,” EURASIP J. Appl. Signal Process., vol. 2005, no. 1, pp. 1110–1126, 2005.
    • (2005) EURASIP J. Appl. Signal Process. , vol.2005 , Issue.1 , pp. 1110-1126
    • Lotter, T.1    Vary, P.2
  • 15
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr.
    • S. F. Boll “Suppression of acoustic noise in speech using spectral subtraction,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113–120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 17
    • 0028426335 scopus 로고
    • Noise reduction by noise-adaptive spectral magnitude expansion
    • May
    • W. Etter and G. S. Moschytz “Noise reduction by noise-adaptive spectral magnitude expansion,” J. Audio Eng. Soc., vol. 42, pp. 341–349, May 1994.
    • (1994) J. Audio Eng. Soc. , vol.42 , pp. 341-349
    • Etter, W.1    Moschytz, G.S.2
  • 18
    • 0032123832 scopus 로고    scopus 로고
    • A parametric formulation of the generalized spectral subtraction method
    • Jul.
    • B. L. Sim, Y. C. Tong, J. S. Chang, and C. T. Tan “A parametric formulation of the generalized spectral subtraction method,” IEEE Trans. Speech Audio Process., vol. 6, no. 4, pp. 328–337, Jul. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.4 , pp. 328-337
    • Sim, B.L.1    Tong, Y.C.2    Chang, J.S.3    Tan, C.T.4
  • 19
    • 0242327016 scopus 로고    scopus 로고
    • Subband noise reduction methods for speech enhancement
    • S. L. Gay and J. Benesty, Eds. Norwell, MA: Kluwer
    • E. J. Diethorn, “Subband noise reduction methods for speech enhancement,” in Acoustic Signal Processing for Telecommunication, S. L. Gay and J. Benesty, Eds. Norwell, MA: Kluwer, 2000, pp. 155–178.
    • (2000) Acoustic Signal Processing for Telecommunication , pp. 155-178
    • Diethorn, E.J.1
  • 20
    • 27644504471 scopus 로고    scopus 로고
    • Suppressing acoustic echo in a spectral envelope space
    • Sep.
    • C. Faller and J. Chen “Suppressing acoustic echo in a spectral envelope space,” IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 1048–1062, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 1048-1062
    • Faller, C.1    Chen, J.2
  • 21
    • 44149115462 scopus 로고    scopus 로고
    • A geometric approach to spectral subtraction
    • Jun.
    • Y. Lu and P. Loizou “A geometric approach to spectral subtraction,” Speech Commun., vol. 50, no. 6, pp. 453–466, Jun. 2008.
    • (2008) Speech Commun. , vol.50 , Issue.6 , pp. 453-466
    • Lu, Y.1    Loizou, P.2
  • 23
    • 33845354768 scopus 로고    scopus 로고
    • Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
    • D. S. Brungart, P. S. Chang, B. D. Simpson, and D. Wang “Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation,” J. Acoust. Soc. Amer., vol. 120, no. 6, pp. 4007–4018, 2006.
    • (2006) J. Acoust. Soc. Amer. , vol.120 , Issue.6 , pp. 4007-4018
    • Brungart, D.S.1    Chang, P.S.2    Simpson, B.D.3    Wang, D.4
  • 24
    • 40749125179 scopus 로고    scopus 로고
    • Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
    • N. Li and P. Loizou “Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction,” J. Acoust. Soc. Amer., vol. 123, no. 3, pp. 1673–1682, 2008.
    • (2008) J. Acoust. Soc. Amer. , vol.123 , Issue.3 , pp. 1673-1682
    • Li, N.1    Loizou, P.2
  • 25
    • 58149196390 scopus 로고    scopus 로고
    • On the optimality of ideal binary time-frequency masks
    • Mar.
    • Y. Li and D. Wang “On the optimality of ideal binary time-frequency masks,” Speech Commun., vol. 51, pp. 230–239, Mar. 2009.
    • (2009) Speech Commun. , vol.51 , pp. 230-239
    • Li, Y.1    Wang, D.2
  • 26
    • 84892233308 scopus 로고    scopus 로고
    • On ideal binary mask as the computational goal of auditory scene analysis
    • P. Divenyi, Ed. Norwell, MA: Kluwer
    • D. Wang, “On ideal binary mask as the computational goal of auditory scene analysis,” in Speech Separation by Humans and Machines, P. Divenyi, Ed. Norwell, MA: Kluwer, 2005, pp. 181–197.
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
    • Wang, D.1
  • 27
    • 0029307534 scopus 로고
    • De-noising by soft-thresholding
    • May
    • D. L. Donoho “De-noising by soft-thresholding,” IEEE Trans. Inf. Theory, vol. 41, no. 3, pp. 613–627, May 1995.
    • (1995) IEEE Trans. Inf. Theory , vol.41 , Issue.3 , pp. 613-627
    • Donoho, D.L.1
  • 28
    • 84950459514 scopus 로고
    • Adapting to unknown smoothness via wavelet shrinkage
    • D. L. Donoho and I. M. Johnstone “Adapting to unknown smoothness via wavelet shrinkage,” J. Amer. Statist. Assoc., vol. 90, no. 432, pp. 1200–1224, 1995.
    • (1995) J. Amer. Statist. Assoc. , vol.90 , Issue.432 , pp. 1200-1224
    • Donoho, D.L.1    Johnstone, I.M.2
  • 29
    • 0003794165 scopus 로고    scopus 로고
    • ser. Lecture notes in Statistics. Berlin, Germany: Springer-Verlag
    • M. Jansen, Noise Reduction by Wavelet Thresholding, ser. Lecture notes in Statistics. Berlin, Germany: Springer-Verlag, 2001, vol. 161.
    • (2001) Noise Reduction by Wavelet Thresholding , vol.161
    • Jansen, M.1
  • 30
    • 34447095085 scopus 로고    scopus 로고
    • A study of the distribution of time-domain speech samples and discrete Fourier coefficients
    • J. Jensen, I. Batina, R. C. Hendriks, and R. Heusdens, “A study of the distribution of time-domain speech samples and discrete Fourier coefficients,” Proc. SPS-DARTS, vol. 1, pp. 155–158, 2005.
    • (2005) Proc. SPS-DARTS , vol.1 , pp. 155-158
    • Jensen, J.1    Batina, I.2    Hendriks, R.C.3    Heusdens, R.4
  • 32
    • 0041958932 scopus 로고
    • Ideal spatial adaptation by wavelet shrinkage
    • D. L. Donoho and I. M. Johnstone “Ideal spatial adaptation by wavelet shrinkage,” Biometrika, vol. 81, no. 3, pp. 425–455, 1994.
    • (1994) Biometrika , vol.81 , Issue.3 , pp. 425-455
    • Donoho, D.L.1    Johnstone, I.M.2
  • 34
    • 64349110818 scopus 로고    scopus 로고
    • Audio denoising by time-frequency block thresholding
    • May
    • G. Yu, S. Mallat, and E. Bacry “Audio denoising by time-frequency block thresholding,” IEEE Trans. Signal Process., vol. 56, no. 5, pp. 1830–1839, May 2008.
    • (2008) IEEE Trans. Signal Process. , vol.56 , Issue.5 , pp. 1830-1839
    • Yu, G.1    Mallat, S.2    Bacry, E.3
  • 35
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr.
    • J.-L. Gauvain and C.-H. Lee “Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains,” IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291–299, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-299
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 36
    • 70349093614 scopus 로고    scopus 로고
    • An algorithm that improves speech intelligibility in noise for normal-hearing listeners
    • Sep.
    • G. Kim, Y. Lu, Y. Hu, and P. C. Loizou “An algorithm that improves speech intelligibility in noise for normal-hearing listeners,” J. Acoust. Soc. Amer., vol. 126, no. 3, pp. 1486–1494, Sep. 2009.
    • (2009) J. Acoust. Soc. Amer. , vol.126 , Issue.3 , pp. 1486-1494
    • Kim, G.1    Lu, Y.2    Hu, Y.3    Loizou, P.C.4
  • 37
    • 77956547397 scopus 로고    scopus 로고
    • Improving speech intelligibility in noise using environment-optimized algorithms
    • Sep.
    • G. Kim and P. C. Loizou “Improving speech intelligibility in noise using environment-optimized algorithms,” IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 8, pp. 2080–2090, Sep. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.8 , pp. 2080-2090
    • Kim, G.1    Loizou, P.C.2
  • 38
    • 0035396555 scopus 로고    scopus 로고
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • Jul.
    • R. Martin “Noise power spectral density estimation based on optimal smoothing and minimum statistics,” IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504–512, Jul. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 504-512
    • Martin, R.1
  • 39
    • 0036226165 scopus 로고    scopus 로고
    • Noise estimation by minima controlled recursive averaging for robust speech enhancement
    • Jan.
    • I. Cohen and B. Berdugo “Noise estimation by minima controlled recursive averaging for robust speech enhancement,” IEEE Signal Process. Lett., vol. 9, no. 1, pp. 12–15, Jan. 2002.
    • (2002) IEEE Signal Process. Lett. , vol.9 , Issue.1 , pp. 12-15
    • Cohen, I.1    Berdugo, B.2
  • 40
    • 0019009880 scopus 로고
    • Speech enhancement using a soft-decision noise suppression filter
    • Apr.
    • R. McAulay and M. Malpass “Speech enhancement using a soft-decision noise suppression filter,” IEEE Trans. Acoust., Speech Signal Process., vol. 28, no. 2, pp. 137–145, Apr. 1980.
    • (1980) IEEE Trans. Acoust., Speech Signal Process. , vol.28 , Issue.2 , pp. 137-145
    • McAulay, R.1    Malpass, M.2
  • 42
    • 44149106061 scopus 로고    scopus 로고
    • Evaluation of objective quality measures for speech enhancement
    • Jan.
    • Y. Hu and P. Loizou “Evaluation of objective quality measures for speech enhancement.,” IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 1, pp. 229–238, Jan. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.1 , pp. 229-238
    • Hu, Y.1    Loizou, P.2
  • 43
    • 33750311718 scopus 로고    scopus 로고
    • Binary and ratio time-frequency masks for robust speech recognition
    • Nov.
    • S. Srinivasan, N. Roman, and D. Wang “Binary and ratio time-frequency masks for robust speech recognition,” Speech Commun., vol. 48, pp. 1486–1501, Nov. 2006.
    • (2006) Speech Commun. , vol.48 , pp. 1486-1501
    • Srinivasan, S.1    Roman, N.2    Wang, D.3
  • 44
    • 0036543522 scopus 로고    scopus 로고
    • Optimal speech enhancement under signal presence uncertainty using log-spectra amplitude estimator
    • Apr.
    • I. Cohen “Optimal speech enhancement under signal presence uncertainty using log-spectra amplitude estimator,” IEEE Signal Process. Lett., vol. 9, no. 4, pp. 113–116, Apr. 2002.
    • (2002) IEEE Signal Process. Lett. , vol.9 , Issue.4 , pp. 113-116
    • Cohen, I.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.