메뉴 건너뛰기




Volumn , Issue , 2008, Pages 873-902

Spectral Enhancement Methods

Author keywords

Babble Noise; Musical Noise; Speech Enhancement; Speech Signal; Voice Activity Detector

Indexed keywords


EID: 85075926376     PISSN: 25228692     EISSN: 25228706     Source Type: Book Series    
DOI: 10.1007/978-3-540-49127-9_44     Document Type: Chapter
Times cited : (51)

References (76)
  • 1
    • 33745146930 scopus 로고    scopus 로고
    • J. Benesty, S. Makino, J. Chen (Eds.), Springer, Berlin, Heidelberg
    • J. Benesty, S. Makino, J. Chen (Eds.): Speech Enhancement (Springer, Berlin, Heidelberg 2005)
    • (2005) Speech Enhancement
  • 4
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • S.F. Boll: Suppression of acoustic noise in speech using spectral subtraction, IEEE Trans. Acoust. Speech Signal Process. 27(2), 113–120 (1979)
    • (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 5
    • 0018642851 scopus 로고
    • Enhancement and bandwidth compression of noisy speech
    • J.S. Lim, A.V. Oppenheim: Enhancement and bandwidth compression of noisy speech, Proc. IEEE 67(12), 1586–1604 (1979)
    • (1979) Proc. IEEE , vol.67 , Issue.12 , pp. 1586-1604
    • Lim, J.S.1    Oppenheim, A.V.2
  • 6
    • 0018320733 scopus 로고
    • Enhancement of speech corrupted by acoustic noise
    • M. Berouti, R. Schwartz, J. Makhoul: Enhancement of speech corrupted by acoustic noise, Proc. 4th ICASSP 79, 208–211 (1979)
    • (1979) Proc. 4Th ICASSP , vol.79 , pp. 208-211
    • Berouti, M.1    Schwartz, R.2    Makhoul, J.3
  • 7
    • 0032073763 scopus 로고    scopus 로고
    • Postprocessing method for suppressing musical noise generated by spectral subtraction
    • Z. Goh, K.-C. Tan, T.G. Tan: Postprocessing method for suppressing musical noise generated by spectral subtraction, IEEE Trans. Speech Audio Process. 6(3), 287–292 (1998)
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.3 , pp. 287-292
    • Goh, Z.1    Tan, K.-C.2    Tan, T.G.3
  • 8
    • 0032123832 scopus 로고    scopus 로고
    • A parametric formulation of the generalized spectral subtraction method
    • B.L. Sim, Y.C. Tong, J.S. Chang, C.T. Tan: A parametric formulation of the generalized spectral subtraction method, IEEE Trans. Speech Audio Process. 6(4), 328–337 (1998)
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.4 , pp. 328-337
    • Sim, B.L.1    Tong, Y.C.2    Chang, J.S.3    Tan, C.T.4
  • 9
    • 0035510532 scopus 로고    scopus 로고
    • Spectral subtraction using reduced delay convolution and adaptive averaging
    • H. Gustafsson, S.E. Nordholm, I. Claesson: Spectral subtraction using reduced delay convolution and adaptive averaging, IEEE Trans. Speech Audio Process. 9(8), 799–807 (2001)
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.8 , pp. 799-807
    • Gustafsson, H.1    Nordholm, S.E.2    Claesson, I.3
  • 11
    • 0033097443 scopus 로고    scopus 로고
    • Single channel speech enhancement based on masking properties of the human auditory system
    • N. Virag: Single channel speech enhancement based on masking properties of the human auditory system, IEEE Trans. Speech Audio Process. 7(2), 126–137 (1999)
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.2 , pp. 126-137
    • Virag, N.1
  • 12
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    • Y. Ephraim, D. Malah: Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator, IEEE Trans. Acoust. Speech Signal Process. ASSP-32(6), 1109–1121 (1984)
    • (1984) IEEE Trans. Acoust. Speech Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 13
    • 0036754453 scopus 로고    scopus 로고
    • Speech enhancement using a mixture-maximum model
    • D. Burshtein, S. Gannot: Speech enhancement using a mixture-maximum model, IEEE Trans. Speech Audio Process. 10(6), 341–351 (2002)
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.6 , pp. 341-351
    • Burshtein, D.1    Gannot, S.2
  • 14
    • 27644556974 scopus 로고    scopus 로고
    • Speech enhancement based on minimum mean-square error estimation and supergaussian priors
    • R. Martin: Speech enhancement based on minimum mean-square error estimation and supergaussian priors, IEEE Trans. Speech Audio Process. 13(5), 845–856 (2005)
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 845-856
    • Martin, R.1
  • 15
    • 27644563039 scopus 로고    scopus 로고
    • Relaxed statistical model for speech enhancement and a priori SNR estimation
    • I. Cohen: Relaxed statistical model for speech enhancement and a priori SNR estimation, IEEE Trans. Speech Audio Process. 13(5), 870–881 (2005)
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 870-881
    • Cohen, I.1
  • 16
    • 32644447834 scopus 로고    scopus 로고
    • Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models
    • I. Cohen: Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models, Signal Process. 86(4), 698–709 (2006)
    • (2006) Signal Process , vol.86 , Issue.4 , pp. 698-709
    • Cohen, I.1
  • 17
    • 0021892216 scopus 로고
    • Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
    • Y. Ephraim, D. Malah: Speech enhancement using a minimum mean-square error log-spectral amplitude estimator, IEEE Trans. Acoust. Speech Signal Process. ASSP-33(2), 443–445 (1985)
    • (1985) IEEE Trans. Acoust. Speech Signal Process. , vol.ASSP-33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 18
    • 0141957802 scopus 로고    scopus 로고
    • Efficient alternatives to the Ephraim and Malah suppression rule for audio signal enhancement
    • P.J. Wolfe, S.J. Godsill: Efficient alternatives to the Ephraim and Malah suppression rule for audio signal enhancement, Special Issue EURASIP JASP Digital Audio Multim. Commun. 2003(10), 1043–1051 (2003)
    • (2003) Special Issue EURASIP JASP Digital Audio Multim. Commun. , vol.2003 , Issue.10 , pp. 1043-1051
    • Wolfe, P.J.1    Godsill, S.J.2
  • 19
    • 27644515429 scopus 로고    scopus 로고
    • Speech enhancement based on perceptually motivated bayesian estimators of the magnitude spectrum
    • P.C. Loizou: Speech enhancement based on perceptually motivated bayesian estimators of the magnitude spectrum, IEEE Trans. Speech Audio Process. 13(5), 857–869 (2005)
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 857-869
    • Loizou, P.C.1
  • 21
    • 84948598244 scopus 로고
    • Statistical-model-based speech enhancement systems
    • Y. Ephraim: Statistical-model-based speech enhancement systems, Proc. IEEE 80(10), 1526–1555 (1992)
    • (1992) Proc. IEEE , vol.80 , Issue.10 , pp. 1526-1555
    • Ephraim, Y.1
  • 22
    • 0028195651 scopus 로고
    • Waveform-based speech recognition using hidden filter models: Parameter selection and sensitivity to power normalization
    • H. Sheikhzadeh, L. Deng: Waveform-based speech recognition using hidden filter models: Parameter selection and sensitivity to power normalization, IEEE Trans. Speech Audio Process. 2, 80–91 (1994)
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 80-91
    • Sheikhzadeh, H.1    Deng, L.2
  • 24
    • 0023211846 scopus 로고
    • Explicit time correlations in hidden Markov models for speech recognition
    • C.J. Wellekens: Explicit time correlations in hidden Markov models for speech recognition, Proc. 12th ICASSP 87, 384–386 (1987)
    • (1987) Proc. 12Th ICASSP , vol.87 , pp. 384-386
    • Wellekens, C.J.1
  • 25
    • 0032166087 scopus 로고    scopus 로고
    • HMM-based strategies for enhancement of speech signals embedded in nonstationary noise
    • H. Sameti, H. Sheikhzadeh, L. Deng, R.L. Brennan: HMM-based strategies for enhancement of speech signals embedded in nonstationary noise, IEEE Trans. Speech Audio Process. 6(5), 445–455 (1998)
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.5 , pp. 445-455
    • Sameti, H.1    Sheikhzadeh, H.2    Deng, L.3    Brennan, R.L.4
  • 28
    • 0029345417 scopus 로고
    • A signal subspace approach for speech enhancement
    • Y. Ephraim, H.L.V. Trees: A signal subspace approach for speech enhancement, IEEE Trans. Speech Audio Process. 3(4), 251–266 (1995)
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.4 , pp. 251-266
    • Ephraim, Y.1    Trees, H.L.V.2
  • 30
    • 0033892199 scopus 로고    scopus 로고
    • Signal/noise KLT based approach for enhancing speech degraded by colored noise
    • U. Mittal, N. Phamdo: Signal/noise KLT based approach for enhancing speech degraded by colored noise, IEEE Trans. Speech Audio Process. 8(2), 159– 167 (2000)
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.2 , pp. 159-167
    • Mittal, U.1    Phamdo, N.2
  • 31
    • 0041591273 scopus 로고    scopus 로고
    • A generalized subspace approach for enhancing speech corrupted by colored noise
    • Y. Hu, P.C. Loizou: A generalized subspace approach for enhancing speech corrupted by colored noise, IEEE Trans. Speech Audio Process. 11(4), 334– 341 (2003)
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.4 , pp. 334-341
    • Hu, Y.1    Loizou, P.C.2
  • 33
    • 0036725739 scopus 로고    scopus 로고
    • GSVD-based optimal filtering for single and multimicrophone speech enhancement
    • S. Doclo, M. Moonen: GSVD-based optimal filtering for single and multimicrophone speech enhancement, IEEE Trans. Signal Process. 50(9), 2230–2244 (2002)
    • (2002) IEEE Trans. Signal Process. , vol.50 , Issue.9 , pp. 2230-2244
    • Doclo, S.1    Moonen, M.2
  • 34
    • 0347337999 scopus 로고    scopus 로고
    • Incorporating the human hearing properties in the signal subspace approach for speech enhancement
    • F. Jabloun, B. Champagne: Incorporating the human hearing properties in the signal subspace approach for speech enhancement, IEEE Trans. Speech Audio Process. 11(6), 700–708 (2003)
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.6 , pp. 700-708
    • Jabloun, F.1    Champagne, B.2
  • 35
    • 0042362201 scopus 로고    scopus 로고
    • A perceptually motivated approach for speech enhancement
    • Y. Hu, P.C. Loizou: A perceptually motivated approach for speech enhancement, IEEE Trans. Speech Audio Process. 11(5), 457–465 (2003)
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.5 , pp. 457-465
    • Hu, Y.1    Loizou, P.C.2
  • 36
    • 0025519408 scopus 로고
    • Discrete Gabor expansions
    • J. Wexler, S. Raz: Discrete Gabor expansions, Speech Process. 21(3), 207–220 (1990)
    • (1990) Speech Process , vol.21 , Issue.3 , pp. 207-220
    • Wexler, J.1    Raz, S.2
  • 40
    • 0036296949 scopus 로고    scopus 로고
    • Speech enhancement using MMSE short time spectral estimation with Gamma distributed speech priors
    • R. Martin: Speech enhancement using MMSE short time spectral estimation with Gamma distributed speech priors, Proc. 27th ICASSP 02, 253–256 (2002)
    • (2002) Proc. 27Th ICASSP 02 , pp. 253-256
    • Martin, R.1
  • 41
    • 6344245783 scopus 로고    scopus 로고
    • Modeling speech signals in the time-frequency domain using GARCH
    • I. Cohen: Modeling speech signals in the time-frequency domain using GARCH, Signal Process. 84(12), 2453–2459 (2004)
    • (2004) Signal Process , vol.84 , Issue.12 , pp. 2453-2459
    • Cohen, I.1
  • 42
    • 0035500783 scopus 로고    scopus 로고
    • Speech enhancement for non-stationary noise environments
    • I. Cohen, B. Berdugo: Speech enhancement for non-stationary noise environments, Signal Process. 81(11), 2403–2418 (2001)
    • (2001) Signal Process. , vol.81 , Issue.11 , pp. 2403-2418
    • Cohen, I.1    Berdugo, B.2
  • 43
    • 26444569329 scopus 로고    scopus 로고
    • Speech enhancement using supergaussian speech models and noncausal a priori SNR estimation
    • I. Cohen: Speech enhancement using supergaussian speech models and noncausal a priori SNR estimation, Speech Commun. 47(3), 336–350 (2005)
    • (2005) Speech Commun , vol.47 , Issue.3 , pp. 336-350
    • Cohen, I.1
  • 46
    • 0021158675 scopus 로고
    • Optimal estimators for spectral restoration of noisy speech
    • J. Porter, S. Boll: Optimal estimators for spectral restoration of noisy speech, Proc. ICASSP 84, 18A.2.1–18A.2.4 (1984)
    • (1984) Proc. ICASSP , vol.84
    • Porter, J.1    Boll, S.2
  • 47
    • 0028413241 scopus 로고
    • Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor
    • O. Cappé: Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor, IEEE Trans. Acoust. Speech Signal Process. 2(2), 345–349 (1994)
    • (1994) IEEE Trans. Acoust. Speech Signal Process. , vol.2 , Issue.2 , pp. 345-349
    • Cappé, O.1
  • 48
    • 0029726517 scopus 로고    scopus 로고
    • Speech enhancement based on a priori signal to noise estimation
    • P. Scalart, J. Vieira-Filho: Speech enhancement based on a priori signal to noise estimation, Proc. 21th ICASSP 96, 629–632 (1996)
    • (1996) Proc. 21Th ICASSP 96 , pp. 629-632
    • Scalart, P.1    Vieira-Filho, J.2
  • 49
    • 0032667465 scopus 로고    scopus 로고
    • Tracking speech-presence uncertainty to improve speech enhancement in non-stationary noise environments
    • D. Malah, R.V. Cox, A.J. Accardi: Tracking speech-presence uncertainty to improve speech enhancement in non-stationary noise environments, Proc. 24th ICASSP 99, 789–792 (1999)
    • (1999) Proc. 24Th ICASSP , vol.99 , pp. 789-792
    • Malah, D.1    Cox, R.V.2    Accardi, A.J.3
  • 50
    • 85075919056 scopus 로고    scopus 로고
    • On speech enhancement under signal presence uncertainty
    • I. Cohen: On speech enhancement under signal presence uncertainty, Proc. 26th ICASSP 2001, 167– 170 (2001)
    • (2001) Proc. 26Th ICASSP , pp. 167-170
    • Cohen, I.1
  • 51
    • 0032681259 scopus 로고    scopus 로고
    • Improved noise suppression filter using self-adaptive estimator of probability of speech absence
    • I.Y. Soon, S.N. Koh, C.K. Yeo: Improved noise suppression filter using self-adaptive estimator of probability of speech absence, Signal Process. 75(2), 151–159 (1999)
    • (1999) Signal Process , vol.75 , Issue.2 , pp. 151-159
    • Soon, I.Y.1    Koh, S.N.2    Yeo, C.K.3
  • 53
    • 4444301747 scopus 로고    scopus 로고
    • Speech enhancement using a noncausal a priori SNR estimator
    • I. Cohen: Speech enhancement using a noncausal a priori SNR estimator, IEEE Signal Process. Lett. 11(9), 725–728 (2004)
    • (2004) IEEE Signal Process. Lett. , vol.11 , Issue.9 , pp. 725-728
    • Cohen, I.1
  • 54
    • 0041360463 scopus 로고    scopus 로고
    • Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
    • I. Cohen: Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging, IEEE Trans. Speech Audio Process. 11(5), 466–475 (2003)
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.5 , pp. 466-475
    • Cohen, I.1
  • 55
    • 0034832359 scopus 로고    scopus 로고
    • Assessing local noise level estimation methods: Application to noise robust ASR
    • C. Ris, S. Dupont: Assessing local noise level estimation methods: Application to noise robust ASR, Speech Commun. 34(1–2), 141–158 (2001)
    • (2001) Speech Commun , vol.34 , Issue.1-2 , pp. 141-158
    • Ris, C.1    Dupont, S.2
  • 56
    • 0035396555 scopus 로고    scopus 로고
    • Noise power spectral density estimation based on optimal smoothing and minimum statis-tics
    • R. Martin: Noise power spectral density estimation based on optimal smoothing and minimum statis-tics, IEEE Trans. Speech Audio Process. 9(5), 504–512 (2001)
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 504-512
    • Martin, R.1
  • 57
    • 85097044692 scopus 로고
    • Computationally efficient speech enhancement by spectral minima tracking in sub-bands
    • G. Doblinger: Computationally efficient speech enhancement by spectral minima tracking in sub-bands, Proc. 4th Eurospeech 95, 1513–1516 (1995)
    • (1995) Proc. 4Th Eurospeech , vol.95 , pp. 1513-1516
    • Doblinger, G.1
  • 58
    • 0027629367 scopus 로고
    • Discrete Gabor transform
    • S. Qian, D. Chen: Discrete Gabor transform, IEEE Trans. Signal Process. 41(7), 2429–2438 (1993)
    • (1993) IEEE Trans. Signal Process. , vol.41 , Issue.7 , pp. 2429-2438
    • Qian, S.1    Chen, D.2
  • 59
    • 0003089362 scopus 로고
    • Spectral subtraction based on minimum statistics
    • R. Martin: Spectral subtraction based on minimum statistics, Proc. 7th EUSIPCO 94, 1182–1185 (1994)
    • (1994) Proc. 7Th EUSIPCO , vol.94 , pp. 1182-1185
    • Martin, R.1
  • 60
    • 0027623210 scopus 로고
    • Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • A. Varga, H.J.M. Steeneken: Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems, Speech Commun. 12(3), 247–251 (1993)
    • (1993) Speech Commun , vol.12 , Issue.3 , pp. 247-251
    • Varga, A.1    Steeneken, H.J.M.2
  • 64
    • 0032672098 scopus 로고    scopus 로고
    • A modular approach to speech enhancement with an application to speech coding
    • A.J. Accardi, R.V. Cox: A modular approach to speech enhancement with an application to speech coding, Proc. 24th ICASSP 99, 201–204 (1999)
    • (1999) Proc. 24Th ICASSP 99 , pp. 201-204
    • Accardi, A.J.1    Cox, R.V.2
  • 65
    • 0032762471 scopus 로고    scopus 로고
    • A statistical model-based voice activity detector
    • J. Sohn, N.S. Kim, W. Sung: A statistical model-based voice activity detector, IEEE Signal Process. Lett. 6(1), 1–3 (1999)
    • (1999) IEEE Signal Process. Lett , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 66
    • 85075912199 scopus 로고    scopus 로고
    • Multichannel speech enhancement using bayesian spectral amplitude estimation
    • T. Lotter, C. Benien, P. Vary: Multichannel speech enhancement using bayesian spectral amplitude estimation, Proc. 28th ICASSP 03, 832–835 (2003)
    • (2003) Proc. 28Th ICASSP , vol.3 , pp. 832-835
    • Lotter, T.1    Benien, C.2    Vary, P.3
  • 68
    • 0141702133 scopus 로고    scopus 로고
    • MMSE estimation of magnitude-squared DFT coefficients with supergaussian priors
    • C. Breithaupt, R. Martin: MMSE estimation of magnitude-squared DFT coefficients with supergaussian priors, Proc. 28th ICASSP 03, 896–899 (2003)
    • (2003) Proc. 28Th ICASSP 03 , pp. 896-899
    • Breithaupt, C.1    Martin, R.2
  • 69
    • 22944477796 scopus 로고    scopus 로고
    • Noise reduction by maximum a posteriori spectral amplitude estimation with supergaussian speech modeling
    • pp
    • T. Lotter, P. Vary: Noise reduction by maximum a posteriori spectral amplitude estimation with supergaussian speech modeling. In: Proc. 8th In-ternat. Workshop on Acoustic Echo and Noise Control (2003) pp. 83–86
    • (2003) Proc. 8Th In-Ternat. Workshop on Acoustic Echo and Noise Control , pp. 83-86
    • Lotter, T.1    Vary, P.2
  • 71
    • 0001228536 scopus 로고    scopus 로고
    • Comparison of one-and two-channel noise-estimation techniques
    • J. Meyer, K.U. Simmer, K.D. Kammeyer: Comparison of one-and two-channel noise-estimation techniques, Proc. 5th IWAENC 97, 137–145 (1997)
    • (1997) Proc. 5Th IWAENC 97 , pp. 137-145
    • Meyer, J.1    Simmer, K.U.2    Kammeyer, K.D.3
  • 74
    • 0028996871 scopus 로고
    • Noise estimation techniques for robust speech recognition
    • H.G. Hirsch, C. Ehrlicher: Noise estimation techniques for robust speech recognition, Proc. 20th ICASSP 95, 153–156 (1995)
    • (1995) Proc. 20Th ICASSP , vol.95 , pp. 153-156
    • Hirsch, H.G.1    Ehrlicher, C.2
  • 75
    • 0035500783 scopus 로고    scopus 로고
    • Speech enhancement for non-stationary noise environments
    • I. Cohen, B. Berdugo: Speech enhancement for non-stationary noise environments, Signal Process. 81(11), 2403–2418 (2001)
    • (2001) Signal Process , vol.81 , Issue.11 , pp. 2403-2418
    • Cohen, I.1    Berdugo, B.2
  • 76
    • 0033693215 scopus 로고    scopus 로고
    • Quantile based noise estimation for spectral subtraction and Wiener filtering
    • V. Stahl, A. Fischer, R. Bippus: Quantile based noise estimation for spectral subtraction and Wiener filtering, Proc. 25th ICASSP 2000, 1875–1878 (2000)
    • (2000) Proc. 25Th ICASSP 2000 , pp. 1875-1878
    • Stahl, V.1    Fischer, A.2    Bippus, R.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.