SCOPUS 정보 검색 플랫폼

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

Volumn 37, Issue 4, 2007, Pages 877-889

A generalized time-frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system

(2) Shao, Yu a Chang, Chip Hong a

a NANYANG TECHNOLOGICAL UNIVERSITY (Singapore)

Author keywords

Auditory masking; Noise reduction; Speech enhancement; Wavelet

Indexed keywords

ALGORITHMS; MATHEMATICAL MODELS; MICROPHONES; NOISE ABATEMENT; NUMERICAL METHODS; SIGNAL TO NOISE RATIO; WAVELET TRANSFORMS;

AUDITORY MASKING; HUMAN AUDITORY SYSTEM; ROBUST SPEECH ENHANCEMENT; SINGLE-MICROPHONE SYSTEM; TIME-FREQUENCY SUBSTRACTION METHOD; WAVELET FILTER BANKS;

SPEECH ENHANCEMENT;

ALGORITHM; ARTICLE; ARTIFICIAL INTELLIGENCE; AUTOMATED PATTERN RECOGNITION; AUTOMATIC SPEECH RECOGNITION; BIOLOGICAL MODEL; BIOMIMETICS; COMPUTER SIMULATION; HEARING; HUMAN; METHODOLOGY; PHYSIOLOGY; SIGNAL PROCESSING; SOUND DETECTION;

ALGORITHMS; ARTIFICIAL INTELLIGENCE; AUDITORY PERCEPTION; BIOMIMETICS; COMPUTER SIMULATION; HUMANS; MODELS, BIOLOGICAL; PATTERN RECOGNITION, AUTOMATED; SIGNAL PROCESSING, COMPUTER-ASSISTED; SOUND SPECTROGRAPHY; SPEECH RECOGNITION SOFTWARE;

EID: 34547115461 PISSN: 10834419 EISSN: None Source Type: Journal
DOI: 10.1109/TSMCB.2007.895365 Document Type: Article

Times cited : (46)

References (38)

1
- 3442876970
- Phase-based dual-microphone robust speech enhancement
- Aug
- P. Aarabi and G. Shi, "Phase-based dual-microphone robust speech enhancement," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 34, no. 4, pp. 1763-1773, Aug. 2004.
- (2004) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.34 , Issue.4 , pp. 1763-1773
- Aarabi, P.¹ Shi, G.²

2
- 14544272307
- Wavelet based speech enhancement using a new thresholding algorithm
- Hong Kong, Oct. 20-22
- S. Ayat, M. T. Manzuri, and R. Dianat, "Wavelet based speech enhancement using a new thresholding algorithm," in Proc. Int. Symp. Intell. Multimedia, Video and Speech Process., Hong Kong, Oct. 20-22, 2004, pp. 238-241.
- (2004) Proc. Int. Symp. Intell. Multimedia, Video and Speech Process , pp. 238-241
- Ayat, S.¹ Manzuri, M.T.² Dianat, R.³

3
- 0018320733
- Enhancement of speech corrupted by acoustic noise
- Apr
- M. Berouti, R. Schwartz, and J. Makhoul, "Enhancement of speech corrupted by acoustic noise," in Proc. IEEE ICASSP, Apr. 1979, vol. 4, pp. 208-211.
- (1979) Proc. IEEE ICASSP , vol.4 , pp. 208-211
- Berouti, M.¹ Schwartz, R.² Makhoul, J.³

4
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Apr
- S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-27 , Issue.2 , pp. 113-120
- Boll, S.¹

5
- 0035125193
- Wavelet speech enhancement based on the teager energy operator
- Jan
- M. Bahoura and J. Rouat, "Wavelet speech enhancement based on the teager energy operator," IEEE Signal Process. Lett., vol. 8, no. 1, pp. 10-12, Jan. 2001.
- (2001) IEEE Signal Process. Lett , vol.8 , Issue.1 , pp. 10-12
- Bahoura, M.¹ Rouat, J.²

6
- 23944498183
- On the use of different speech representations for speaker modeling
- Aug
- K. Chen, "On the use of different speech representations for speaker modeling," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 35, no. 3, pp. 301-314, Aug. 2005.
- (2005) IEEE Trans. Syst., Man, Cybern. C, Appl. Rev , vol.35 , Issue.3 , pp. 301-314
- Chen, K.¹

7
- 0003424145
- Englewood Cliffs, NJ: Prentice-Hall
- J. Deller, J. Proakis, and J. Hansen, Discrete-Time Processing of Speech Signals. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Discrete-Time Processing of Speech Signals
- Deller, J.¹ Proakis, J.² Hansen, J.³

8
- 0029307534
- De-noising by soft-thresholding
- May
- D. L. Donoho, "De-noising by soft-thresholding," IEEE Trans. Inf. Theory, vol. 41, no. 3, pp. 613-627, May 1995.
- (1995) IEEE Trans. Inf. Theory , vol.41 , Issue.3 , pp. 613-627
- Donoho, D.L.¹

9
- 0021645331
- Speech enhancement using a minimummean square error short-time spectral amplitude estimator
- Dec
- Y. Ephraim and D. Malah, "Speech enhancement using a minimummean square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
- (1984) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

10
- 84948598244
- Statistical-model-based speech enhancement systems
- Oct
- Y. Ephraim, "Statistical-model-based speech enhancement systems," Proc. IEEE, vol. 80, no. 10, pp. 1526-1555, Oct. 1992.
- (1992) Proc. IEEE , vol.80 , Issue.10 , pp. 1526-1555
- Ephraim, Y.¹

11
- 1942488383
- A modified a priori SNR for speech enhancement using spectral subtraction rules
- Apr
- M. K. Hasan, S. Salahuddin, and M. R. Khan, "A modified a priori SNR for speech enhancement using spectral subtraction rules," IEEE Signal Process. Lett., vol. 11, no. 4, pp. 450-453, Apr. 2004.
- (2004) IEEE Signal Process. Lett , vol.11 , Issue.4 , pp. 450-453
- Hasan, M.K.¹ Salahuddin, S.² Khan, M.R.³

12
- 0442311161
- Incorporating a psychoacoustical model in frequency domain speech enhancement
- Feb
- Y. Hu and P. C. Loizou, "Incorporating a psychoacoustical model in frequency domain speech enhancement," IEEE Signal Process. Lett., vol. 11, no. 2, pp. 270-273, Feb. 2004.
- (2004) IEEE Signal Process. Lett , vol.11 , Issue.2 , pp. 270-273
- Hu, Y.¹ Loizou, P.C.²

13
- 0036293748
- S. Kamath and P. C. Loizou, A multi-band spectral subtraction method for enhancing speech corrupted by colored noise, in Proc. IEEE ICASSP, May 13-17, 2002, 4, p. IV-4164.
- S. Kamath and P. C. Loizou, "A multi-band spectral subtraction method for enhancing speech corrupted by colored noise," in Proc. IEEE ICASSP, May 13-17, 2002, vol. 4, p. IV-4164.

14
- 0034892786
- Perceptual time-frequency subtraction algorithm for noise reduction in hearing aids
- Sep
- M. Li, H. G. McAllister, N. D. Black, and T. A. De Perez, "Perceptual time-frequency subtraction algorithm for noise reduction in hearing aids," IEEE Trans. Biomed. Eng., vol. 48, no. 9, pp. 979-988, Sep. 2001.
- (2001) IEEE Trans. Biomed. Eng , vol.48 , Issue.9 , pp. 979-988
- Li, M.¹ McAllister, H.G.² Black, N.D.³ De Perez, T.A.⁴

15
- 0142227717
- Single-channel speech enhancement in variable noise-level environment
- Jan
- C. T. Lin, "Single-channel speech enhancement in variable noise-level environment," IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 33, no. 1, pp. 137-143, Jan. 2003.
- (2003) IEEE Trans. Syst., Man, Cybern. A, Syst., Humans , vol.33 , Issue.1 , pp. 137-143
- Lin, C.T.¹

16
- 1842865648
- Speech enhancement using perceptually-constrained gain factors in critical-band-wavelet-packet transform
- Mar. 18
- C.-T. Lu and H.-C. Wang, "Speech enhancement using perceptually-constrained gain factors in critical-band-wavelet-packet transform," Electron. Lett., vol. 40, no. 6, pp. 394-396, Mar. 18, 2004.
- (2004) Electron. Lett , vol.40 , Issue.6 , pp. 394-396
- Lu, C.-T.¹ Wang, H.-C.²

17
- 0023963510
- Transform coding of audio signals using perceptual noise criteria
- Feb
- J. D. Johnston, "Transform coding of audio signals using perceptual noise criteria," IEEE J. Sel. Areas Commun., vol. 6, no. 2, pp. 314-323, Feb. 1988.
- (1988) IEEE J. Sel. Areas Commun , vol.6 , Issue.2 , pp. 314-323
- Johnston, J.D.¹

18
- 0035509971
- Combined noise and echo reduction in hands-free systems: A survey
- Nov
- W. L. B. Jeannes, P. Scalart, G. Faucon, and C. Beaugeant, "Combined noise and echo reduction in hands-free systems: A survey," IEEE Trans. Speech Audio Process., vol. 9, no. 8, pp. 808-820, Nov. 2001.
- (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.8 , pp. 808-820
- Jeannes, W.L.B.¹ Scalart, P.² Faucon, G.³ Beaugeant, C.⁴

19
- 0003659270
- Chichester, U.K, Wiley
- A. Mertins, Signal Analysis: Wavelet, Filter Banks, Time-Frequency Transforms and Applications. Chichester, U.K.: Wiley, 1999.
- (1999) Signal Analysis: Wavelet, Filter Banks, Time-Frequency Transforms and Applications
- Mertins, A.¹

20
- 0036476655
- Speech pause detection for noise spectrum estimation by tracking power envelope dynamics
- Feb
- M. Marzinzik and B. Kollmeier, "Speech pause detection for noise spectrum estimation by tracking power envelope dynamics," IEEE Trans. Speech Audio Process., vol. 10, no. 2, pp. 109-118, Feb. 2002.
- (2002) IEEE Trans. Speech Audio Process , vol.10 , Issue.2 , pp. 109-118
- Marzinzik, M.¹ Kollmeier, B.²

21
- 0019009880
- Speech enhancement using a soft-decision noise suppression filter
- Apr
- R. McAulay and M. Malpass, "Speech enhancement using a soft-decision noise suppression filter," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 2, pp. 137-145, Apr. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.2 , pp. 137-145
- McAulay, R.¹ Malpass, M.²

22
- 0022227187
- Comparative study of several distortion measures for speech recognition
- Apr
- N. Nocerino, F. Soong, L. Rabiner, and D. Klatt, "Comparative study of several distortion measures for speech recognition," in Proc. IEEE ICASSP, Apr. 1985, vol. 10, pp. 25-28.
- (1985) Proc. IEEE ICASSP , vol.10 , pp. 25-28
- Nocerino, N.¹ Soong, F.² Rabiner, L.³ Klatt, D.⁴

23
- 27644487859
- Speech reinforcement system for car cabin communications
- Sep
- A. Ortega, E. Lleida, and E. Masgrau, "Speech reinforcement system for car cabin communications," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pt. 2, pp. 917-929, Sep. 2005.
- (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.5 PART. 2 , pp. 917-929
- Ortega, A.¹ Lleida, E.² Masgrau, E.³

24
- 0036490786
- Integrated echo and noise canceler for hands-free applications
- Mar
- S. J. Park, C. G. Cho, C. Lee, and D. H. Youn, "Integrated echo and noise canceler for hands-free applications," IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 49, no. 3, pp. 188-195, Mar. 2002.
- (2002) IEEE Trans. Circuits Syst. II, Exp. Briefs , vol.49 , Issue.3 , pp. 188-195
- Park, S.J.¹ Cho, C.G.² Lee, C.³ Youn, D.H.⁴

25
- 0027842082
- Low bit rate transparent audio compression using adapted wavelets
- Dec
- D. Sinha and A. H. Tewfik, "Low bit rate transparent audio compression using adapted wavelets," IEEE Trans. Signal Process., vol. 1, no. 12, pp. 3463-3479, Dec. 1993.
- (1993) IEEE Trans. Signal Process , vol.1 , Issue.12 , pp. 3463-3479
- Sinha, D.¹ Tewfik, A.H.²

26
- 0032123832
- A parametric formulation of the generalized spectral subtraction method
- Jul
- B. L. Sim, Y. C. Tong, J. S. Chang, and C. T. Tan, "A parametric formulation of the generalized spectral subtraction method," IEEE Trans. Speech Audio Process., vol. 6, no. 4, pp. 328-337, Jul. 1998.
- (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.4 , pp. 328-337
- Sim, B.L.¹ Tong, Y.C.² Chang, J.S.³ Tan, C.T.⁴

27
- 84866492988
- Optimizing digital speech coders by exploiting masking properties of the human ear
- Dec
- M. R. Schroeder, B. S. Atal, and J. L. Hall, "Optimizing digital speech coders by exploiting masking properties of the human ear," J. Acoust. Soc. Amer., vol. 66, no. 6, pp. 1647-1652, Dec. 1979.
- (1979) J. Acoust. Soc. Amer , vol.66 , Issue.6 , pp. 1647-1652
- Schroeder, M.R.¹ Atal, B.S.² Hall, J.L.³

28
- 0004206760
- Cambridge, MA: Wellesley-Cambridge Press
- G. Strang and T. Nguyen, Wavelets and Filter Banks. Cambridge, MA: Wellesley-Cambridge Press, 1996.
- (1996) Wavelets and Filter Banks
- Strang, G.¹ Nguyen, T.²

29
- 0000389611
- High-quality audio compression using an adaptive wavelet packet decomposition and psychoacoustic modeling
- Apr
- P. Srinivasan and L. H. Jamieson, "High-quality audio compression using an adaptive wavelet packet decomposition and psychoacoustic modeling," IEEE Trans. Signal Process., vol. 46, no. 4, pp. 1085-1093, Apr. 1998.
- (1998) IEEE Trans. Signal Process , vol.46 , Issue.4 , pp. 1085-1093
- Srinivasan, P.¹ Jamieson, L.H.²

30
- 34547102623
- A generalized perceptual time-frequency subtraction method for speech enhancement
- Kos Island, Greece, May 20-23
- Y. Shao and C. H. Chang, "A generalized perceptual time-frequency subtraction method for speech enhancement," in Proc. IEEE ISCAS, Kos Island, Greece, May 20-23, 2006, pp. 2537-2540.
- (2006) Proc. IEEE ISCAS , pp. 2537-2540
- Shao, Y.¹ Chang, C.H.²

31
- 34547117752
- A versatile speech enhancement system based on perceptual wavelet denoising
- Kobe, Japan, May 23-26
- Y. Shao and C. H. Chang, "A versatile speech enhancement system based on perceptual wavelet denoising," in Proc IEEE ISCAS, Kobe, Japan, May 23-26, 2005, pp. 864-867.
- (2005) Proc IEEE ISCAS , pp. 864-867
- Shao, Y.¹ Chang, C.H.²

32
- 0037358681
- A wavelet transform approach to blind adaptive filtering of speech from unknown noises
- Mar
- D. Veselinovic and D. Graupe, "A wavelet transform approach to blind adaptive filtering of speech from unknown noises," IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process., vol. 50, no. 3, pp. 150-154, Mar. 2003.
- (2003) IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process , vol.50 , Issue.3 , pp. 150-154
- Veselinovic, D.¹ Graupe, D.²

33
- 0033097443
- Single channel speech enhancement based on masking properties of the human auditory system
- Mar
- N. Virag, "Single channel speech enhancement based on masking properties of the human auditory system," IEEE Trans. Speech Audio Process., vol. 7, no. 2, pp. 126-137, Mar. 1999.
- (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.2 , pp. 126-137
- Virag, N.¹

34
- 84889779628
- 2nd ed. Chichester, U.K, Wiley
- S. V. Vaseghi, Advanced Digital Signal Processing and Noise Reduction 2nd ed. Chichester, U.K.: Wiley, 2000.
- (2000) Advanced Digital Signal Processing and Noise Reduction
- Vaseghi, S.V.¹

35
- 0035248382
- A recurrent neural fuzzy network for word boundary detection in variable noise-level environments
- Feb
- G. D. Wu and C. T. Lin, "A recurrent neural fuzzy network for word boundary detection in variable noise-level environments," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 31, no. 1, pp. 84-97, Feb. 2000.
- (2000) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.31 , Issue.1 , pp. 84-97
- Wu, G.D.¹ Lin, C.T.²

36
- 0004236521
- Berlin, Germany: Springer-Verlag
- E. Zwicker and H. Fastl, Psychoacoustics: Facts and Models. Berlin, Germany: Springer-Verlag, 1990.
- (1990) Psychoacoustics: Facts and Models
- Zwicker, E.¹ Fastl, H.²

37
- 34547097658
- Speech and noise data base
- "Speech and noise data base," NATO AC243-panel 3/RSG.10, 1992. NOISEX-92.
- (1992) NATO AC243-panel 3/RSG.10 , Issue.NOISEX-92

38
- 0003639435
- Feb, ITU-T Recommend
- Perceptual Evaluation of Speech Quality (PESQ), An Objective Method for End-to-end Speech Quality Assessment of Narrow-band Telephone Networks and Speech Codecs, p. 862, Feb. 2001. ITU-T Recommend.
- (2001) Perceptual Evaluation of Speech Quality (PESQ), An Objective Method for End-to-end Speech Quality Assessment of Narrow-band Telephone Networks and Speech Codecs , pp. 862

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.