Volume 37, Issue 4, 2007, Pages 877-889

A generalized time-frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system

Author keywords

Auditory masking; Noise reduction; Speech enhancement; Wavelet

Indexed keywords

ALGORITHMS; MATHEMATICAL MODELS; MICROPHONES; NOISE ABATEMENT; NUMERICAL METHODS; SIGNAL TO NOISE RATIO; WAVELET TRANSFORMS;

EID: 34547115461     PISSN: 10834419     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSMCB.2007.895365     Document Type: Article
Times cited: 46

References (38)
  • 1
    • 3442876970
    • Phase-based dual-microphone robust speech enhancement
    • Aug
    • P. Aarabi and G. Shi, "Phase-based dual-microphone robust speech enhancement," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 34, no. 4, pp. 1763-1773, Aug. 2004.
    • (2004) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.34 , Issue.4 , pp. 1763-1773
    • Aarabi, P.1    Shi, G.2
  • 3
    • 0018320733
    • Enhancement of speech corrupted by acoustic noise
    • Apr
    • M. Berouti, R. Schwartz, and J. Makhoul, "Enhancement of speech corrupted by acoustic noise," in Proc. IEEE ICASSP, Apr. 1979, vol. 4, pp. 208-211.
    • (1979) Proc. IEEE ICASSP , vol.4 , pp. 208-211
    • Berouti, M.1    Schwartz, R.2    Makhoul, J.3
  • 4
    • 0018455310
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr
    • S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll, S.1
  • 5
    • 0035125193
    • Wavelet speech enhancement based on the Teager energy operator
    • Jan
    • M. Bahoura and J. Rouat, "Wavelet speech enhancement based on the Teager energy operator," IEEE Signal Process. Lett., vol. 8, no. 1, pp. 10-12, Jan. 2001.
    • (2001) IEEE Signal Process. Lett , vol.8 , Issue.1 , pp. 10-12
    • Bahoura, M.1    Rouat, J.2
  • 6
    • 23944498183
    • On the use of different speech representations for speaker modeling
    • Aug
    • K. Chen, "On the use of different speech representations for speaker modeling," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 35, no. 3, pp. 301-314, Aug. 2005.
    • (2005) IEEE Trans. Syst., Man, Cybern. C, Appl. Rev , vol.35 , Issue.3 , pp. 301-314
    • Chen, K.1
  • 8
    • 0029307534
    • De-noising by soft-thresholding
    • May
    • D. L. Donoho, "De-noising by soft-thresholding," IEEE Trans. Inf. Theory, vol. 41, no. 3, pp. 613-627, May 1995.
    • (1995) IEEE Trans. Inf. Theory , vol.41 , Issue.3 , pp. 613-627
    • Donoho, D.L.1
  • 9
    • 0021645331
    • Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    • Dec
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 10
    • 84948598244
    • Statistical-model-based speech enhancement systems
    • Oct
    • Y. Ephraim, "Statistical-model-based speech enhancement systems," Proc. IEEE, vol. 80, no. 10, pp. 1526-1555, Oct. 1992.
    • (1992) Proc. IEEE , vol.80 , Issue.10 , pp. 1526-1555
    • Ephraim, Y.1
  • 11
    • 1942488383
    • A modified a priori SNR for speech enhancement using spectral subtraction rules
    • Apr
    • M. K. Hasan, S. Salahuddin, and M. R. Khan, "A modified a priori SNR for speech enhancement using spectral subtraction rules," IEEE Signal Process. Lett., vol. 11, no. 4, pp. 450-453, Apr. 2004.
    • (2004) IEEE Signal Process. Lett , vol.11 , Issue.4 , pp. 450-453
    • Hasan, M.K.1    Salahuddin, S.2    Khan, M.R.3
  • 12
    • 0442311161
    • Incorporating a psychoacoustical model in frequency domain speech enhancement
    • Feb
    • Y. Hu and P. C. Loizou, "Incorporating a psychoacoustical model in frequency domain speech enhancement," IEEE Signal Process. Lett., vol. 11, no. 2, pp. 270-273, Feb. 2004.
    • (2004) IEEE Signal Process. Lett , vol.11 , Issue.2 , pp. 270-273
    • Hu, Y.1    Loizou, P.C.2
  • 13
    • 0036293748
    • S. Kamath and P. C. Loizou, "A multi-band spectral subtraction method for enhancing speech corrupted by colored noise," in Proc. IEEE ICASSP, May 13-17, 2002, vol. 4, p. IV-4164.
  • 14
    • 0034892786
    • Perceptual time-frequency subtraction algorithm for noise reduction in hearing aids
    • Sep
    • M. Li, H. G. McAllister, N. D. Black, and T. A. De Perez, "Perceptual time-frequency subtraction algorithm for noise reduction in hearing aids," IEEE Trans. Biomed. Eng., vol. 48, no. 9, pp. 979-988, Sep. 2001.
    • (2001) IEEE Trans. Biomed. Eng , vol.48 , Issue.9 , pp. 979-988
    • Li, M.1    McAllister, H.G.2    Black, N.D.3    De Perez, T.A.4
  • 15
    • 0142227717
    • Single-channel speech enhancement in variable noise-level environment
    • Jan
    • C. T. Lin, "Single-channel speech enhancement in variable noise-level environment," IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 33, no. 1, pp. 137-143, Jan. 2003.
    • (2003) IEEE Trans. Syst., Man, Cybern. A, Syst., Humans , vol.33 , Issue.1 , pp. 137-143
    • Lin, C.T.1
  • 16
    • 1842865648
    • Speech enhancement using perceptually-constrained gain factors in critical-band-wavelet-packet transform
    • Mar. 18
    • C.-T. Lu and H.-C. Wang, "Speech enhancement using perceptually-constrained gain factors in critical-band-wavelet-packet transform," Electron. Lett., vol. 40, no. 6, pp. 394-396, Mar. 18, 2004.
    • (2004) Electron. Lett , vol.40 , Issue.6 , pp. 394-396
    • Lu, C.-T.1    Wang, H.-C.2
  • 17
    • 0023963510
    • Transform coding of audio signals using perceptual noise criteria
    • Feb
    • J. D. Johnston, "Transform coding of audio signals using perceptual noise criteria," IEEE J. Sel. Areas Commun., vol. 6, no. 2, pp. 314-323, Feb. 1988.
    • (1988) IEEE J. Sel. Areas Commun , vol.6 , Issue.2 , pp. 314-323
    • Johnston, J.D.1
  • 18
  • 20
    • 0036476655
    • Speech pause detection for noise spectrum estimation by tracking power envelope dynamics
    • Feb
    • M. Marzinzik and B. Kollmeier, "Speech pause detection for noise spectrum estimation by tracking power envelope dynamics," IEEE Trans. Speech Audio Process., vol. 10, no. 2, pp. 109-118, Feb. 2002.
    • (2002) IEEE Trans. Speech Audio Process , vol.10 , Issue.2 , pp. 109-118
    • Marzinzik, M.1    Kollmeier, B.2
  • 21
    • 0019009880
    • Speech enhancement using a soft-decision noise suppression filter
    • Apr
    • R. McAulay and M. Malpass, "Speech enhancement using a soft-decision noise suppression filter," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 2, pp. 137-145, Apr. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.2 , pp. 137-145
    • McAulay, R.1    Malpass, M.2
  • 22
    • 0022227187
    • Comparative study of several distortion measures for speech recognition
    • Apr
    • N. Nocerino, F. Soong, L. Rabiner, and D. Klatt, "Comparative study of several distortion measures for speech recognition," in Proc. IEEE ICASSP, Apr. 1985, vol. 10, pp. 25-28.
    • (1985) Proc. IEEE ICASSP , vol.10 , pp. 25-28
    • Nocerino, N.1    Soong, F.2    Rabiner, L.3    Klatt, D.4
  • 23
    • 27644487859
    • Speech reinforcement system for car cabin communications
    • Sep
    • A. Ortega, E. Lleida, and E. Masgrau, "Speech reinforcement system for car cabin communications," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pt. 2, pp. 917-929, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.5 PART. 2 , pp. 917-929
    • Ortega, A.1    Lleida, E.2    Masgrau, E.3
  • 24
    • 0036490786
    • Integrated echo and noise canceler for hands-free applications
    • Mar
    • S. J. Park, C. G. Cho, C. Lee, and D. H. Youn, "Integrated echo and noise canceler for hands-free applications," IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 49, no. 3, pp. 188-195, Mar. 2002.
    • (2002) IEEE Trans. Circuits Syst. II, Exp. Briefs , vol.49 , Issue.3 , pp. 188-195
    • Park, S.J.1    Cho, C.G.2    Lee, C.3    Youn, D.H.4
  • 25
    • 0027842082
    • Low bit rate transparent audio compression using adapted wavelets
    • Dec
    • D. Sinha and A. H. Tewfik, "Low bit rate transparent audio compression using adapted wavelets," IEEE Trans. Signal Process., vol. 41, no. 12, pp. 3463-3479, Dec. 1993.
    • (1993) IEEE Trans. Signal Process , vol.41 , Issue.12 , pp. 3463-3479
    • Sinha, D.1    Tewfik, A.H.2
  • 26
    • 0032123832
    • A parametric formulation of the generalized spectral subtraction method
    • Jul
    • B. L. Sim, Y. C. Tong, J. S. Chang, and C. T. Tan, "A parametric formulation of the generalized spectral subtraction method," IEEE Trans. Speech Audio Process., vol. 6, no. 4, pp. 328-337, Jul. 1998.
    • (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.4 , pp. 328-337
    • Sim, B.L.1    Tong, Y.C.2    Chang, J.S.3    Tan, C.T.4
  • 27
    • 84866492988
    • Optimizing digital speech coders by exploiting masking properties of the human ear
    • Dec
    • M. R. Schroeder, B. S. Atal, and J. L. Hall, "Optimizing digital speech coders by exploiting masking properties of the human ear," J. Acoust. Soc. Amer., vol. 66, no. 6, pp. 1647-1652, Dec. 1979.
    • (1979) J. Acoust. Soc. Amer , vol.66 , Issue.6 , pp. 1647-1652
    • Schroeder, M.R.1    Atal, B.S.2    Hall, J.L.3
  • 29
    • 0000389611
    • High-quality audio compression using an adaptive wavelet packet decomposition and psychoacoustic modeling
    • Apr
    • P. Srinivasan and L. H. Jamieson, "High-quality audio compression using an adaptive wavelet packet decomposition and psychoacoustic modeling," IEEE Trans. Signal Process., vol. 46, no. 4, pp. 1085-1093, Apr. 1998.
    • (1998) IEEE Trans. Signal Process , vol.46 , Issue.4 , pp. 1085-1093
    • Srinivasan, P.1    Jamieson, L.H.2
  • 30
    • 34547102623
    • A generalized perceptual time-frequency subtraction method for speech enhancement
    • Kos Island, Greece, May 20-23
    • Y. Shao and C. H. Chang, "A generalized perceptual time-frequency subtraction method for speech enhancement," in Proc. IEEE ISCAS, Kos Island, Greece, May 20-23, 2006, pp. 2537-2540.
    • (2006) Proc. IEEE ISCAS , pp. 2537-2540
    • Shao, Y.1    Chang, C.H.2
  • 31
    • 34547117752
    • A versatile speech enhancement system based on perceptual wavelet denoising
    • Kobe, Japan, May 23-26
    • Y. Shao and C. H. Chang, "A versatile speech enhancement system based on perceptual wavelet denoising," in Proc. IEEE ISCAS, Kobe, Japan, May 23-26, 2005, pp. 864-867.
    • (2005) Proc. IEEE ISCAS , pp. 864-867
    • Shao, Y.1    Chang, C.H.2
  • 32
    • 0037358681
    • A wavelet transform approach to blind adaptive filtering of speech from unknown noises
    • Mar
    • D. Veselinovic and D. Graupe, "A wavelet transform approach to blind adaptive filtering of speech from unknown noises," IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process., vol. 50, no. 3, pp. 150-154, Mar. 2003.
    • (2003) IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process , vol.50 , Issue.3 , pp. 150-154
    • Veselinovic, D.1    Graupe, D.2
  • 33
    • 0033097443
    • Single channel speech enhancement based on masking properties of the human auditory system
    • Mar
    • N. Virag, "Single channel speech enhancement based on masking properties of the human auditory system," IEEE Trans. Speech Audio Process., vol. 7, no. 2, pp. 126-137, Mar. 1999.
    • (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.2 , pp. 126-137
    • Virag, N.1
  • 35
    • 0035248382
    • A recurrent neural fuzzy network for word boundary detection in variable noise-level environments
    • Feb
    • G. D. Wu and C. T. Lin, "A recurrent neural fuzzy network for word boundary detection in variable noise-level environments," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 31, no. 1, pp. 84-97, Feb. 2001.
    • (2001) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.31 , Issue.1 , pp. 84-97
    • Wu, G.D.1    Lin, C.T.2
  • 37
    • 34547097658
    • Speech and noise data base
    • "Speech and noise data base," NATO AC243-panel 3/RSG.10, 1992. NOISEX-92.
    • (1992) NATO AC243-panel 3/RSG.10 , Issue.NOISEX-92


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.