메뉴 건너뛰기




Volumn 56, Issue 5, 2008, Pages 1830-1839

Audio denoising by time-frequency block thresholding

Author keywords

Audio denoising; Block thresholding; Ephraim and Malah; Power spectrum; Power subtraction; Thresholding

Indexed keywords

AUDIO DENOISING; BLOCK THRESHOLDING; EPHRAIM AND MALAH; POWER SUBTRACTION; THRESHOLDING;

EID: 64349110818     PISSN: 1053587X     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSP.2007.912893     Document Type: Article
Times cited : (153)

References (60)
  • 1
    • 34548186349 scopus 로고    scopus 로고
    • Wavelet speech enhancement based on timescale adaptation
    • Dec
    • M. Bahoura and J. Rouat, "Wavelet speech enhancement based on timescale adaptation," Speech Commun., vol. 48, no. 12, pp. 1620-1637, Dec. 2006.
    • (2006) Speech Commun , vol.48 , Issue.12 , pp. 1620-1637
    • Bahoura, M.1    Rouat, J.2
  • 3
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr
    • S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoustics, Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoustics, Speech, Signal Process , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll, S.1
  • 4
    • 0033248624 scopus 로고    scopus 로고
    • Adaptive wavelet estimation: A block thresholding and oracle inequality approach
    • T. Cai, "Adaptive wavelet estimation: A block thresholding and oracle inequality approach," Ann. Statist., vol. 27, pp. 898-924, 1999.
    • (1999) Ann. Statist , vol.27 , pp. 898-924
    • Cai, T.1
  • 5
    • 0001298787 scopus 로고    scopus 로고
    • Incorporation information on neighboring coefficients into wavelet estimation
    • T. Cai and B. W. Silverman, "Incorporation information on neighboring coefficients into wavelet estimation," Sankhya, vol. 63, pp. 127-148, 2001.
    • (2001) Sankhya , vol.63 , pp. 127-148
    • Cai, T.1    Silverman, B.W.2
  • 7
    • 0028413241 scopus 로고
    • Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor
    • Apr
    • O. Cappé, "Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor," IEEE Trans. Speech, Audio Process., vol. 2, pp. 345-349, Apr. 1994.
    • (1994) IEEE Trans. Speech, Audio Process , vol.2 , pp. 345-349
    • Cappé, O.1
  • 9
    • 1542783190 scopus 로고    scopus 로고
    • Speech enhancement using perceptual wavelet packet decomposition and Teager energy operator
    • Feb
    • S. H. Chen and J. F. Wang, "Speech enhancement using perceptual wavelet packet decomposition and Teager energy operator," J. VLSI Signal Process., vol. 36, no. 2-3, pp. 125-139(15), Feb. 2004.
    • (2004) J. VLSI Signal Process , vol.36 , Issue.2-15 -3 , pp. 125-139
    • Chen, S.H.1    Wang, J.F.2
  • 10
    • 4444301747 scopus 로고    scopus 로고
    • Speech enhancement using a noncausal a priori SNR estimator
    • Sep
    • I. Cohen, "Speech enhancement using a noncausal a priori SNR estimator," IEEE Signal Process. Lett., vol. 11, no. 9, pp. 725-728, Sep. 2004.
    • (2004) IEEE Signal Process. Lett , vol.11 , Issue.9 , pp. 725-728
    • Cohen, I.1
  • 11
    • 85009115414 scopus 로고    scopus 로고
    • Enhancement of speech using bark-scaled wavelet packet decomposition
    • Scandinavia
    • Enhancement of speech using bark-scaled wavelet packet decomposition," in Eurospeech, Scandinavia, 2001.
    • (2001) Eurospeech
    • Cohen, I.1
  • 12
    • 26444569329 scopus 로고    scopus 로고
    • Speech enhancement using supergaussian speech models and noncausal a priori SNR estimation
    • Nov
    • I. Cohen, "Speech enhancement using supergaussian speech models and noncausal a priori SNR estimation," Speech Commun., vol. 47, no. 3, pp. 336-350, Nov. 2005.
    • (2005) Speech Commun , vol.47 , Issue.3 , pp. 336-350
    • Cohen, I.1
  • 13
    • 27644563039 scopus 로고    scopus 로고
    • ΛRelaxed statistical model for speech enhancement and a priori SNR estimation
    • Sep
    • I. Cohen, ΛRelaxed statistical model for speech enhancement and a priori SNR estimation," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 870-881, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.5 , pp. 870-881
    • Cohen, I.1
  • 14
    • 32644447834 scopus 로고    scopus 로고
    • Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models
    • Apr
    • I. Cohen, "Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models," Signal Process., vol. 86, no. 4, pp. 698-709, Apr. 2006.
    • (2006) Signal Process , vol.86 , Issue.4 , pp. 698-709
    • Cohen, I.1
  • 15
    • 0036543522 scopus 로고    scopus 로고
    • Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator
    • Apr
    • I. Cohen, "Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator," IEEE Signal Process. Lett. vol. 9, no. 4, pp. 113-116, Apr. 2002.
    • (2002) IEEE Signal Process. Lett , vol.9 , Issue.4 , pp. 113-116
    • Cohen, I.1
  • 16
    • 0041360463 scopus 로고    scopus 로고
    • Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
    • Sep
    • I. Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging," IEEE Trans. Speech Audio Process., vol. 11, no. 5, pp. 466-475, Sep. 2003.
    • (2003) IEEE Trans. Speech Audio Process , vol.11 , Issue.5 , pp. 466-475
    • Cohen, I.1
  • 17
    • 0035500783 scopus 로고    scopus 로고
    • Speech enhancement for non-stationary noise environments
    • Nov
    • I. Cohen and B. Berdugo, "Speech enhancement for non-stationary noise environments," Signal Process., vol. 81, no. 11, pp. 2403-2418, Nov. 2001.
    • (2001) Signal Process , vol.81 , Issue.11 , pp. 2403-2418
    • Cohen, I.1    Berdugo, B.2
  • 18
    • 0002001578 scopus 로고
    • Translation-Invariant De-Noising
    • A. Antoniadis and G. Oppenheim, Eds. Berlin, Germany: Springer-Verlag
    • R. R. Coifman and D. L. Donoho, "Translation-Invariant De-Noising," in Lecture Notes in Statistics: Wavelets and Statistics, A. Antoniadis and G. Oppenheim, Eds. Berlin, Germany: Springer-Verlag, 1995.
    • (1995) Lecture Notes in Statistics: Wavelets and Statistics
    • Coifman, R.R.1    Donoho, D.L.2
  • 19
    • 36549090598 scopus 로고
    • Painless nonorthogonal expansions
    • I. Daubechies, A. Grossmann, and Y. Meyer, "Painless nonorthogonal expansions," J. Math. Phys., vol. 27, no. 5, pp. 1271-1283, 1986.
    • (1986) J. Math. Phys , vol.27 , Issue.5 , pp. 1271-1283
    • Daubechies, I.1    Grossmann, A.2    Meyer, Y.3
  • 20
    • 0041958932 scopus 로고
    • Idea spatial adaptation via wavelet shrinkage
    • D. Donoho and I. Johnstone, "Idea spatial adaptation via wavelet shrinkage," Biometrika, vol. 81, pp. 425-455, 1994.
    • (1994) Biometrika , vol.81 , pp. 425-455
    • Donoho, D.1    Johnstone, I.2
  • 21
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean square error short-time spectral amplitude estimator
    • Dec
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean square error short-time spectral amplitude estimator," IEEE. Trans. Acoust., Speech, Signal Process., vol. 32, no. 6, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE. Trans. Acoust., Speech, Signal Process , vol.32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 22
    • 0021892216 scopus 로고
    • Speech enhancement using a minimum mean square error log-spectral amplitude estimator
    • Apr
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean square error log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 2, pp. 443-445, Apr. 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 23
    • 0029345417 scopus 로고
    • A signal subspace approach for speech enhancement
    • Jul
    • Y. Ephraim and H. L. V. Trees, "A signal subspace approach for speech enhancement," IEEE Trans. Speech Signal Process., vol. 3, no. 4, pp. 251-266, Jul. 1995.
    • (1995) IEEE Trans. Speech Signal Process , vol.3 , Issue.4 , pp. 251-266
    • Ephraim, Y.1    Trees, H.L.V.2
  • 24
    • 51449123884 scopus 로고    scopus 로고
    • Recent advancements in speech enhancement
    • R. C. Dorf, Ed. Boca Raton, FL: CRC Press, ch. 15, pp
    • Y. Ephraim and I. Cohen, "Recent advancements in speech enhancement," in The Electrical Engineering Handbook, R. C. Dorf, Ed. Boca Raton, FL: CRC Press, 2005, ch. 15, pp. 15-12.
    • (2005) The Electrical Engineering Handbook , pp. 15-12
    • Ephraim, Y.1    Cohen, I.2
  • 26
    • 0037426983 scopus 로고    scopus 로고
    • Wavelet-based denoising for robust feature extraction for speech recognition
    • Jan
    • O. Farooq and S. Datta, "Wavelet-based denoising for robust feature extraction for speech recognition," Electron. Lett., vol. 39, no. 1, pp. 163-165, Jan. 2003.
    • (2003) Electron. Lett , vol.39 , Issue.1 , pp. 163-165
    • Farooq, O.1    Datta, S.2
  • 28
    • 57649123263 scopus 로고    scopus 로고
    • Improved wavelet denoising via empiricalWiener filtering
    • San Diego, Jul
    • S. Ghael, A. Sayeed, and R. Baraniuk, "Improved wavelet denoising via empiricalWiener filtering," in Proc. SPIE, Math. Imag., San Diego, Jul. 1997.
    • (1997) Proc. SPIE, Math. Imag
    • Ghael, S.1    Sayeed, A.2    Baraniuk, R.3
  • 29
    • 33745387383 scopus 로고    scopus 로고
    • A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets
    • Y. Ghanbari and M. R. Karami-Mollaei, "A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets," Speech Commun., vol. 48, no. 8, pp. 927-940, 2006.
    • (2006) Speech Commun , vol.48 , Issue.8 , pp. 927-940
    • Ghanbari, Y.1    Karami-Mollaei, M.R.2
  • 30
    • 0442311161 scopus 로고    scopus 로고
    • Incorporating a psychoacoustical model in frequency domain speech enhancement
    • Feb
    • Y. Hu and P. C. Loizou, "Incorporating a psychoacoustical model in frequency domain speech enhancement," IEEE Signal Process. Lett., vol. 11, no. 2, pp. 270-273, Feb. 2004.
    • (2004) IEEE Signal Process. Lett , vol.11 , Issue.2 , pp. 270-273
    • Hu, Y.1    Loizou, P.C.2
  • 32
    • 0032360314 scopus 로고    scopus 로고
    • Block threshold rules for curve estimation using kernel and wavelet methods
    • P. Hall, G. Kerkyacharian, and D. Picard, "Block threshold rules for curve estimation using kernel and wavelet methods," Ann. Statist. vol. 26, pp. 922-942, 1998.
    • (1998) Ann. Statist , vol.26 , pp. 922-942
    • Hall, P.1    Kerkyacharian, G.2    Picard, D.3
  • 33
    • 0347181848 scopus 로고    scopus 로고
    • On the minimax optimality of block thresholded wavelet estimators
    • P. Hall, G. Kerkyacharian, and D. Picard, "On the minimax optimality of block thresholded wavelet estimators," Statistica Sinica, vol. 9, pp. 33-50, 1999.
    • (1999) Statistica Sinica , vol.9 , pp. 33-50
    • Hall, P.1    Kerkyacharian, G.2    Picard, D.3
  • 34
    • 33846967785 scopus 로고    scopus 로고
    • Speech signal enhancement through adaptive wavelet thresholding
    • Feb
    • M. Johnson, X.Yuan, and Y. Ren, "Speech signal enhancement through adaptive wavelet thresholding," Speech Commun., vol. 49, no. 2, Feb. 2007.
    • (2007) Speech Commun , vol.49 , Issue.2
    • Johnson, M.1    Yuan, X.2    Ren, Y.3
  • 35
    • 85008053840 scopus 로고    scopus 로고
    • Spectral enhancement based on global soft decision
    • May
    • N. S. Kim and J. H. Chang, "Spectral enhancement based on global soft decision," IEEE Signal Process. Lett., vol. 7, no. 5, pp. 108-110, May 2000.
    • (2000) IEEE Signal Process. Lett , vol.7 , Issue.5 , pp. 108-110
    • Kim, N.S.1    Chang, J.H.2
  • 37
    • 0034892786 scopus 로고    scopus 로고
    • Perceptual time-frequency subtraction algorithm for noise reduction in hearing aids
    • Sep
    • M. Li, H. G. McAllister, N. D. Black, and D. T. A. Perez, "Perceptual time-frequency subtraction algorithm for noise reduction in hearing aids," IEEE Trans. Biomed. Eng., vol. 48, no. 9, pp. 979-988, Sep. 2001.
    • (2001) IEEE Trans. Biomed. Eng , vol.48 , Issue.9 , pp. 979-988
    • Li, M.1    McAllister, H.G.2    Black, N.D.3    Perez, D.T.A.4
  • 38
    • 0018642851 scopus 로고
    • Enhancement and bandwidth compression of noisy speech
    • Dec
    • J. S. Lim and A. V. Oppenheim, "Enhancement and bandwidth compression of noisy speech," Proc. IEEE, vol. 67, Dec. 1979.
    • (1979) Proc. IEEE , vol.67
    • Lim, J.S.1    Oppenheim, A.V.2
  • 39
    • 33847209982 scopus 로고    scopus 로고
    • Speech enhancement for nonstationary noises by wavelet packet transform and adaptive noise estimation
    • Dec
    • S. F. Lei and Y. K. Tung, "Speech enhancement for nonstationary noises by wavelet packet transform and adaptive noise estimation," in Proc. Int. Symp. Intelligent Signal Processing Communication Systems, Dec. 2005, pp. 41-44.
    • (2005) Proc. Int. Symp. Intelligent Signal Processing Communication Systems , pp. 41-44
    • Lei, S.F.1    Tung, Y.K.2
  • 40
    • 0038373390 scopus 로고    scopus 로고
    • Enhancement of single channel speech based on masking property andwavelet transform
    • Oct
    • C. T. Lu and H. C.Wang, "Enhancement of single channel speech based on masking property andwavelet transform," Speech Commun., vol. 41, no. 2, pp. 409-427(19), Oct. 2003.
    • (2003) Speech Commun , vol.41 , Issue.2-19 , pp. 409-427
    • Lu, C.T.1    Wang, H.C.2
  • 41
    • 1842865648 scopus 로고    scopus 로고
    • Speech enhancement using perceptually-constrained gain factors in critical-band-wavelet-packet transform
    • Mar
    • C. T. Lu and H. C. Wang, "Speech enhancement using perceptually-constrained gain factors in critical-band-wavelet-packet transform," Electron. Lett., vol. 40, no. 6, pp. 394-396, Mar. 2004.
    • (2004) Electron. Lett , vol.40 , Issue.6 , pp. 394-396
    • Lu, C.T.1    Wang, H.C.2
  • 42
    • 0032667465 scopus 로고    scopus 로고
    • Tracking speech-presence uncertainty to improve speech enhancement in mon-stationary noise environments
    • presented at the, ICASSP, Phoenix, AZ, Mar
    • D. Malah, R. V. Cox, and A. J. Accardi, "Tracking speech-presence uncertainty to improve speech enhancement in mon-stationary noise environments," presented at the IEEE Int. Conf. Acoust., Speech, Signal Processing (ICASSP), Phoenix, AZ, Mar. 1999.
    • (1999) IEEE Int. Conf. Acoust., Speech, Signal Processing
    • Malah, D.1    Cox, R.V.2    Accardi, A.J.3
  • 44
    • 0036296949 scopus 로고    scopus 로고
    • Speech enhancement using MMSE short-time spectral estimation with gamma speech prior
    • Orlando, FL
    • R. Martin, "Speech enhancement using MMSE short-time spectral estimation with gamma speech prior," in Proc. Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), Orlando, FL, 2002, pp. I-253.
    • (2002) Proc. Int. Conf. Acoustics, Speech, Signal Processing (ICASSP)
    • Martin, R.1
  • 45
    • 0035396555 scopus 로고    scopus 로고
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process. vol. 9, no. 5, pp. 504-512, 2001.
    • (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.5 , pp. 504-512
    • Martin, R.1
  • 47
    • 0034847164 scopus 로고    scopus 로고
    • Signal-adaptive robust timevarying Wiener filters: Best subspace selection and statistical analysis
    • Salt Lake City, UT, May
    • G. Matz, F. Hlawatsch, and A. Raidl, "Signal-adaptive robust timevarying Wiener filters: Best subspace selection and statistical analysis," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), Salt Lake City, UT, May 2001, pp. 3945-3948.
    • (2001) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP) , pp. 3945-3948
    • Matz, G.1    Hlawatsch, F.2    Raidl, A.3
  • 48
    • 0019009880 scopus 로고
    • Speech enhancement using soft decision noise suppression filter
    • Apr
    • R. J. McAulay and M. L. Malpass, "Speech enhancement using soft decision noise suppression filter," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 2, pp. 137-145, Apr. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.2 , pp. 137-145
    • McAulay, R.J.1    Malpass, M.L.2
  • 51
    • 34547115461 scopus 로고    scopus 로고
    • A generalized time-frequency subtraction method for robust speech enhancement based on wavelet filter bank modeling of human auditory system
    • Aug
    • Y. Shao and C. H. Chang, "A generalized time-frequency subtraction method for robust speech enhancement based on wavelet filter bank modeling of human auditory system," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 37, no. 4, pp. 877-889, Aug. 2007.
    • (2007) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.37 , Issue.4 , pp. 877-889
    • Shao, Y.1    Chang, C.H.2
  • 52
    • 85009074815 scopus 로고    scopus 로고
    • An improved wavelet-based speech enhancement system
    • H. Sheikhzadeh and H. R. Abutalebi, "An improved wavelet-based speech enhancement system," EUROSPEECH, pp. 1855-1858, 2001.
    • (2001) EUROSPEECH , pp. 1855-1858
    • Sheikhzadeh, H.1    Abutalebi, H.R.2
  • 53
    • 33751294092 scopus 로고    scopus 로고
    • Speech enhancement with natural sounding residual noise based on connected time-frequency speech presence regions
    • K. V. Sorensen and S. V. Andersen, "Speech enhancement with natural sounding residual noise based on connected time-frequency speech presence regions," EURASIP J. Appl. Signal Process., vol. 18, no. 18, pp. 2954-2964, 2005.
    • (2005) EURASIP J. Appl. Signal Process , vol.18 , Issue.18 , pp. 2954-2964
    • Sorensen, K.V.1    Andersen, S.V.2
  • 55
    • 0000169918 scopus 로고
    • Estimation of the mean of a multivariate normal distribution
    • C. Stein, "Estimation of the mean of a multivariate normal distribution," Ann. Statist., vol. 9, pp. 1135-1151, 1980.
    • (1980) Ann. Statist , vol.9 , pp. 1135-1151
    • Stein, C.1
  • 56
    • 64349124411 scopus 로고    scopus 로고
    • A time-space adapted wavelet de-noising algorithm for robust automatic speech recognition in low-SNR environments
    • H. Tolba, "A time-space adapted wavelet de-noising algorithm for robust automatic speech recognition in low-SNR environments," in Proce. 46th IEEE Int. Midwest Symp. Circuits Systems, 2003, vol. 1, pp. 311-314.
    • (2003) Proce. 46th IEEE Int. Midwest Symp. Circuits Systems , vol.1 , pp. 311-314
    • Tolba, H.1
  • 57
    • 84883367674 scopus 로고    scopus 로고
    • Denoising Gabor Transforms [Online]. Available: http://www.uwec.edu/walkerjs/media/DGT.pdf
    • preprint
    • J. S. Walker and Y.-J. Chen, Denoising Gabor Transforms [Online]. Available: http://www.uwec.edu/walkerjs/media/DGT.pdf, preprint
    • Walker, J.S.1    Chen, Y.-J.2
  • 58
    • 0035556258 scopus 로고    scopus 로고
    • Simple alternatives to the Ephraim and Malah suppression rule for speech enhancement
    • Aug
    • P. J. Wolfe and S. J. Godsill, "Simple alternatives to the Ephraim and Malah suppression rule for speech enhancement," in Proc. IEEE Workshop Statistical Signal Processing, Aug. 2001, pp. 496-499.
    • (2001) Proc. IEEE Workshop Statistical Signal Processing , pp. 496-499
    • Wolfe, P.J.1    Godsill, S.J.2
  • 59
    • 0027239228 scopus 로고
    • Frequency domain noise suppression approaches in mobile telephone systems
    • J. Yang, "Frequency domain noise suppression approaches in mobile telephone systems," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 1993, vol. 2, pp. 363-366.
    • (1993) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 363-366
    • Yang, J.1
  • 60
    • 34547497825 scopus 로고    scopus 로고
    • G. Yu, E. Bacry, and S. Mallat, Audio signal denoising with complex wavelets and adaptive block attenuation, in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), Apr. 2007, 3, pp. III-869-III-872.
    • G. Yu, E. Bacry, and S. Mallat, "Audio signal denoising with complex wavelets and adaptive block attenuation," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), Apr. 2007, vol. 3, pp. III-869-III-872.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.