메뉴 건너뛰기




Volumn 53, Issue 4, 2011, Pages 465-494

The importance of phase in speech enhancement

Author keywords

Analysis window; Analysis modification synthesis (AMS); Magnitude spectrum; Minimum mean square error (MMSE) short time spectral amplitude (STSA) estimator; MMSE PSC; Phase spectrum; Phase spectrum compensation (PSC); Short time Fourier analysis; Speech enhancement

Indexed keywords

ANALYSIS WINDOWS; ANALYSIS-MODIFICATION-SYNTHESIS (AMS); MAGNITUDE SPECTRUM; MINIMUM MEAN-SQUARE ERROR (MMSE) SHORT-TIME SPECTRAL AMPLITUDE (STSA) ESTIMATOR; MMSE PSC; PHASE SPECTRUM; PHASE SPECTRUM COMPENSATION (PSC); SHORT-TIME FOURIER ANALYSIS;

EID: 79952363352     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2010.12.003     Document Type: Article
Times cited : (429)

References (54)
  • 3
    • 33646255447 scopus 로고    scopus 로고
    • Further intelligibility results from human listening tests using the short-time phase spectrum
    • L. Alsteris, and K. Paliwal Further intelligibility results from human listening tests using the short-time phase spectrum Speech Commun. 48 6 2006 727 736
    • (2006) Speech Commun. , vol.48 , Issue.6 , pp. 727-736
    • Alsteris, L.1    Paliwal, K.2
  • 4
    • 33749546023 scopus 로고    scopus 로고
    • Iterative reconstruction of speech from short-time Fourier transform phase and magnitude spectra
    • DOI 10.1016/j.csl.2006.03.001, PII S0885230806000064
    • L. Alsteris, and K. Paliwal Iterative reconstruction of speech from short-time Fourier transform phase and magnitude spectra Comput. Speech Lang. 21 1 2007 174 186 (Pubitemid 44537648)
    • (2007) Computer Speech and Language , vol.21 , Issue.1 , pp. 174-186
    • Alsteris, L.D.1    Paliwal, K.K.2
  • 6
    • 0018455310 scopus 로고
    • SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION
    • S. Boll Suppression of acoustic noise in speech using spectral subtraction IEEE Trans. Acoust. Speech Signal Process. ASSP-27 2 1979 113 120 (Pubitemid 9467471)
    • (1979) IEEE Trans Acoust Speech Signal Process , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll Steven, F.1
  • 7
    • 0019053271 scopus 로고
    • COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES
    • S. Davis, and P. Mermelstein Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences IEEE Trans. Acoust. Speech Signal Process. ASSP-28 4 1980 357 366 (Pubitemid 11464930)
    • (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis Steven, B.1    Mermelstein Paul2
  • 8
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
    • Y. Ephraim, and D. Malah Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator IEEE Trans. Acoust. Speech Signal Process. ASSP-32 6 1984 1109 1121
    • (1984) IEEE Trans. Acoust. Speech Signal Process. , vol.32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 9
    • 0021892216 scopus 로고
    • Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
    • Y. Ephraim, and D. Malah Speech enhancement using a minimum mean-square error log-spectral amplitude estimator IEEE Trans. Acoust. Speech Signal Process. ASSP-33 2 1985 443 445
    • (1985) IEEE Trans. Acoust. Speech Signal Process. , vol.33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 11
    • 0034844903 scopus 로고    scopus 로고
    • On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception
    • O. Ghitza On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception J. Acoust. Soc. Am. 110 3 2001 1628 1640
    • (2001) J. Acoust. Soc. Am. , vol.110 , Issue.3 , pp. 1628-1640
    • Ghitza, O.1
  • 13
    • 0017851927 scopus 로고
    • On the use of windows for harmonic analysis with the discrete Fourier transform
    • F. Harris On the use of windows for harmonic analysis with the discrete Fourier transform Proc. IEEE 66 1 1978 51 83 (Pubitemid 8338069)
    • (1978) Proceedings of the IEEE , vol.66 , Issue.1 , pp. 51-83
    • Harris, F.J.1
  • 16
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • DOI 10.1121/1.399423
    • H. Hermansky Perceptual linear predictive (PLP) analysis of speech J. Acoust. Soc. Am. 87 4 1990 1738 1752 (Pubitemid 20256470)
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 17
    • 34447092407 scopus 로고    scopus 로고
    • Subjective comparison and evaluation of speech enhancement algorithms
    • DOI 10.1016/j.specom.2006.12.006, PII S0167639306001920
    • Y. Hu, and P.C. Loizou Subjective comparison and evaluation of speech enhancement algorithms Speech Commun. 49 7-8 2007 588 601 (Pubitemid 47031352)
    • (2007) Speech Communication , vol.49 , Issue.7-8 , pp. 588-601
    • Hu, Y.1    Loizou, P.C.2
  • 19
    • 0043095309 scopus 로고    scopus 로고
    • Perceptual phase quantization of speech
    • D. Kim Perceptual phase quantization of speech IEEE Trans. Speech Audio Process. 11 4 2003 355 364
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.4 , pp. 355-364
    • Kim, D.1
  • 20
    • 0018642851 scopus 로고
    • Enhancement and bandwidth compression of noisy speech
    • J. Lim, and A. Oppenheim Enhancement and bandwidth compression of noisy speech Proc. IEEE 67 12 1979 1586 1604 (Pubitemid 10179553)
    • (1979) Proceedings of the IEEE , vol.67 , Issue.12 , pp. 1586-1604
    • Lim, J.S.1    Oppenheim, A.V.2
  • 21
    • 0031220487 scopus 로고    scopus 로고
    • Effects of phase on the perception of intervocalic stop consonants
    • PII S016763939700054X
    • L. Liu, J. He, and G. Palm Effects of phase on the perception of intervocalic stop consonants Speech Commun. 22 4 1997 403 417 (Pubitemid 127433607)
    • (1997) Speech Communication , vol.22 , Issue.4 , pp. 403-417
    • Liu, L.1    He, J.2    Palm, G.3
  • 23
    • 77953864796 scopus 로고    scopus 로고
    • Objective evaluation of magnitude and phase only spectrum-based reconstruction of the speech signal
    • Limassol, Cyprus
    • Loveimi, E., Ahadi, S., 2010. Objective evaluation of magnitude and phase only spectrum-based reconstruction of the speech signal. In: Proc. Int. Sympos. Commun. Control Signal Process (ISCCSP). Limassol, Cyprus, pp. 1-4.
    • (2010) Proc. Int. Sympos. Commun. Control Signal Process (ISCCSP) , pp. 1-4
    • Loveimi, E.1    Ahadi, S.2
  • 24
    • 44149115462 scopus 로고    scopus 로고
    • A geometric approach to spectral subtraction
    • Y. Lu, and P. Loizou A geometric approach to spectral subtraction Speech Commun. 50 6 2008 453 466
    • (2008) Speech Commun. , vol.50 , Issue.6 , pp. 453-466
    • Lu, Y.1    Loizou, P.2
  • 25
    • 0003089362 scopus 로고
    • Spectral subtraction based on minimum statistics
    • Edinburgh, Scotland, UK
    • Martin, R., 1994. Spectral subtraction based on minimum statistics. In: Proc. EURASIP European Signal Process. Conf. (EUSIPCO). Edinburgh, Scotland, UK, pp. 1182-1185.
    • (1994) Proc. EURASIP European Signal Process. Conf. (EUSIPCO) , pp. 1182-1185
    • Martin, R.1
  • 29
    • 0019569248 scopus 로고
    • The importance of phase in signals
    • A.V. Oppenheim, and J.S. Lim The importance of phase in signals Proc. IEEE 69 5 1981 529 541
    • (1981) Proc. IEEE , vol.69 , Issue.5 , pp. 529-541
    • Oppenheim, A.V.1    Lim, J.S.2
  • 33
    • 13544259544 scopus 로고    scopus 로고
    • On the usefulness of STFT phase spectrum in human listening tests
    • DOI 10.1016/j.specom.2004.08.001, PII S0167639304000950
    • K. Paliwal, and L. Alsteris On the usefulness of STFT phase spectrum in human listening tests Speech Commun. 45 2 2005 153 170 (Pubitemid 40220191)
    • (2005) Speech Communication , vol.45 , Issue.2 , pp. 153-170
    • Paliwal, K.K.1    Alsteris, L.D.2
  • 35
    • 0027659197 scopus 로고
    • Signal modeling techniques in speech recognition
    • J. Picone Signal modeling techniques in speech recognition Proc. IEEE 81 9 1993 1215 1247
    • (1993) Proc. IEEE , vol.81 , Issue.9 , pp. 1215-1247
    • Picone, J.1
  • 42
    • 0032123832 scopus 로고    scopus 로고
    • A parametric formulation of the generalized spectral subtraction method
    • B.L. Sim, Y.C. Tong, J. Chang, and C.T. Tan A parametric formulation of the generalized spectral subtraction method IEEE Trans. Speech Audio Process. 6 4 1998 328 337
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.4 , pp. 328-337
    • Sim, B.L.1    Tong, Y.C.2    Chang, J.3    Tan, C.T.4
  • 44
    • 84867217172 scopus 로고    scopus 로고
    • Noise-driven short-time phase spectrum compensation procedure for speech enhancement
    • Brisbane, QLD, Australia
    • Stark, A., Wójcicki, K., Lyons, J., Paliwal, K., 2008. Noise-driven short-time phase spectrum compensation procedure for speech enhancement. In: Proc. ISCA Conf. Int. Speech Commun. Assoc. (INTERSPEECH). Brisbane, QLD, Australia, pp. 549-552.
    • (2008) Proc. ISCA Conf. Int. Speech Commun. Assoc. (INTERSPEECH) , pp. 549-552
    • Stark, A.1
  • 45
    • 0022093620 scopus 로고
    • NOISE SUPPRESSION BY SPECTRAL MAGNITUDE ESTIMATION - MECHANISM AND THEORETICAL LIMITS
    • DOI 10.1016/0165-1684(85)90002-7
    • P. Vary Noise suppression by spectral magnitude estimation - mechanism and theoretical limits Signal Process. 8 4 1985 387 400 (Pubitemid 15526064)
    • (1985) Signal Processing , vol.8 , Issue.4 , pp. 387-400
    • Vary Peter1
  • 46
    • 0033097443 scopus 로고    scopus 로고
    • Single channel speech enhancement based on masking properties of the human auditory system
    • N. Virag Single channel speech enhancement based on masking properties of the human auditory system IEEE Trans. Speech Audio Process. 7 2 1999 126 137
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.2 , pp. 126-137
    • Virag, N.1
  • 48
    • 0020167383 scopus 로고
    • The unimportance of phase in speech enhancement
    • D. Wang, and J. Lim The unimportance of phase in speech enhancement IEEE Trans. Acoust. Speech Signal Process. ASSP-30 4 1982 679 681
    • (1982) IEEE Trans. Acoust. Speech Signal Process. , vol.30 , Issue.4 , pp. 679-681
    • Wang, D.1    Lim, J.2
  • 49
    • 70349485603 scopus 로고    scopus 로고
    • High improvement of speaker identification and verification by combining MFCC and phase information
    • Taipei, Taiwan
    • Wang, L., Ohtsuka, S., Nakagawa, S., 2009. High improvement of speaker identification and verification by combining MFCC and phase information. In: Proc. IEEE Int. Conf. Acoustics Speech and Signal Process. (ICASSP). Taipei, Taiwan, pp. 4529-4532.
    • (2009) Proc. IEEE Int. Conf. Acoustics Speech and Signal Process. (ICASSP) , pp. 4529-4532
    • Wang, L.1    Ohtsuka, S.2    Nakagawa, S.3
  • 51
    • 67650180126 scopus 로고    scopus 로고
    • Exploiting conjugate symmetry of the short-time Fourier spectrum for speech enhancement
    • K. Wójcicki, M. Milacic, A. Stark, J. Lyons, and K. Paliwal Exploiting conjugate symmetry of the short-time Fourier spectrum for speech enhancement IEEE Signal Process. Lett. 15 2008 461 464
    • (2008) IEEE Signal Process. Lett. , vol.15 , pp. 461-464
    • Wójcicki, K.1    Milacic, M.2    Stark, A.3    Lyons, J.4    Paliwal, K.5
  • 52
    • 34547500071 scopus 로고    scopus 로고
    • Importance of the dynamic range of an analysis window function for phase-only and magnitude-only reconstruction of speech
    • Honolulu, HI, USA
    • Wójcicki, K., Paliwal, K., 2007. Importance of the dynamic range of an analysis window function for phase-only and magnitude-only reconstruction of speech. In: Proc. IEEE Int. Conf. Acoustics Speech and Signal Process (ICASSP), vol. IV. Honolulu, HI, USA, pp. 729-733.
    • (2007) Proc. IEEE Int. Conf. Acoustics Speech and Signal Process (ICASSP) , vol.4 , pp. 729-733
    • Wójcicki, K.1
  • 53
    • 79952363969 scopus 로고    scopus 로고
    • On the relative importance of the short-time magnitude and phase spectra towards speaker dependent information
    • Aalborg, Denmark
    • Wójcicki, K., Paliwal, K., 2008. On the relative importance of the short-time magnitude and phase spectra towards speaker dependent information. In: Proc. ISCA Tutorial and Research Workshop (ITRW). Aalborg, Denmark.
    • (2008) Proc. ISCA Tutorial and Research Workshop (ITRW)
    • Wójcicki, K.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.