메뉴 건너뛰기




Volumn 22, Issue 12, 2014, Pages 1931-1940

STFT phase reconstruction in voiced speech for an improved single-channel speech enhancement

Author keywords

Noise reduction; Phase estimation; Signal reconstruction; Speech enhancement

Indexed keywords

DISCRETE FOURIER TRANSFORMS; NOISE ABATEMENT; SIGNAL RECONSTRUCTION; SIGNAL TO NOISE RATIO; SPEECH; SPEECH ANALYSIS;

EID: 84921800494     PISSN: 23299290     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASLP.2014.2354236     Document Type: Article
Times cited : (217)

References (38)
  • 1
    • 0021407831 scopus 로고
    • Signal estimation from modified shorttime fourier transform
    • Apr
    • D. W. Griffin and J. S. Lim, "Signal estimation from modified shorttime Fourier transform," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 2, pp. 236-243, Apr. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-32 , Issue.2 , pp. 236-243
    • Griffin, D.W.1    Lim, J.S.2
  • 2
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    • Dec
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, pp.1109-1121, Dec. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 3
    • 0021892216 scopus 로고
    • Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
    • Apr
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Trans.Acoust., Speech, Signal Process., vol. ASSP-33, no. 2, pp. 443-445, Apr. 1985.
    • (1985) IEEE Trans.Acoust., Speech, Signal Process. , vol.ASSP-33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 4
    • 79952363352 scopus 로고    scopus 로고
    • The importance of phase in speech enhancement
    • Apr
    • K. Paliwal, K. Wójcicki, and B. Shannon, "The importance of phase in speech enhancement," ELSEVIER Speech Commun., vol. 53, no. 4, pp. 465-494, Apr. 2011.
    • (2011) ELSEVIER Speech Commun , vol.53 , Issue.4 , pp. 465-494
    • Paliwal, K.1    Wójcicki, K.2    Shannon, B.3
  • 5
    • 77950415416 scopus 로고    scopus 로고
    • On the significance of phase in the short term fourier spectrum for speech intelligibility
    • Mar
    • M. Kazama, S. Gotoh, M. Tohyama, and T. Houtgast, "On the significance of phase in the short term Fourier spectrum for speech intelligibility," J. Acoust. Soc. Amer., vol. 127, no. 3, pp. 1432-1439, Mar.2010.
    • (2010) J. Acoust. Soc. Amer , vol.127 , Issue.3 , pp. 1432-1439
    • Kazama, M.1    Gotoh, S.2    Tohyama, M.3    Houtgast, T.4
  • 6
    • 84890503044 scopus 로고    scopus 로고
    • Phase randomization - A new paradigm for single-channel signal enhancement
    • Vancouver, BC, Canada, May
    • A. Sugiyama and R. Miyahara, "Phase randomization - a new paradigm for single-channel signal enhancement," in Proc. IEEE Int. Conf.Acoust., Speech, Signal Process. (ICASSP), Vancouver, BC, Canada, May 2013, pp. 7487-7491.
    • (2013) Proc. IEEE Int. Conf.Acoust., Speech, Signal Process. (ICASSP) , pp. 7487-7491
    • Sugiyama, A.1    Miyahara, R.2
  • 7
    • 84872720036 scopus 로고    scopus 로고
    • Signal reconstruction from stft magnitude: A state of the art
    • Paris, France, Sep
    • N. Sturmel and L. Daudet, "Signal reconstruction from STFT magnitude: A state of the art," in Proc. Int. Conf. Digital Audio Effects (DAFx), Paris, France, Sep. 2011, pp. 375-386.
    • (2011) Proc. Int. Conf. Digital Audio Effects (DAFx) , pp. 375-386
    • Sturmel, N.1    Daudet, L.2
  • 8
    • 84873346243 scopus 로고    scopus 로고
    • Consistent wiener filtering for audio source separation
    • Mar
    • J. Le Roux and E. Vincent, "Consistent Wiener filtering for audio source separation," IEEE Signal Process. Lett., vol. 20, no. 3, pp.217-220, Mar. 2013.
    • (2013) IEEE Signal Process. Lett , vol.20 , Issue.3 , pp. 217-220
    • Le Roux, J.1    Vincent, E.2
  • 9
    • 77949635098 scopus 로고    scopus 로고
    • Iterative phase estimation for the synthesis of separated sources from single-channel mixtures
    • May
    • D. Gunawan and D. Sen, "Iterative phase estimation for the synthesis of separated sources from single-channel mixtures," IEEE Signal Process. Lett., vol. 17, no. 5, pp. 421-424, May 2010.
    • (2010) IEEE Signal Process. Lett , vol.17 , Issue.5 , pp. 421-424
    • Gunawan, D.1    Sen, D.2
  • 10
    • 84878414736 scopus 로고    scopus 로고
    • Phase estimation for signal reconstruction in single-channel speech separation
    • Portland, OR, USA, Sep
    • P. Mowlaee, R. Saeidi, and R. Martin, "Phase estimation for signal reconstruction in single-channel speech separation," in Proc. ISCA Interspeech, Portland, OR, USA, Sep. 2012.
    • (2012) Proc. ISCA Interspeech
    • Mowlaee, P.1    Saeidi, R.2    Martin, R.3
  • 11
    • 84871960544 scopus 로고    scopus 로고
    • Phase estimation in speech enhancement - Unimportant, important, or impossible?
    • Eilat, Israel, Nov
    • T. Gerkmann, M. Krawczyk, and R. Rehr, "Phase estimation in speech enhancement - unimportant, important, or impossible?," in Proc.IEEE Conv. Elect. Electron. Eng. Israel, Eilat, Israel, Nov. 2012.
    • (2012) Proc. IEEE Conv. Elect. Electron. Eng. Israel
    • Gerkmann, T.1    Krawczyk, M.2    Rehr, R.3
  • 12
    • 84871802498 scopus 로고    scopus 로고
    • MMSE-optimal spectral amplitude estimation given the stft-phase
    • Feb
    • T. Gerkmann and M. Krawczyk, "MMSE-optimal spectral amplitude estimation given the STFT-phase," IEEE Signal Process. Lett., vol. 20, no. 2, pp. 129-132, Feb. 2013.
    • (2013) IEEE Signal Process. Lett , vol.20 , Issue.2 , pp. 129-132
    • Gerkmann, T.1    Krawczyk, M.2
  • 13
    • 84901345301 scopus 로고    scopus 로고
    • Phase-sensitive real-time capable speech enhancement under voiced-unvoiced uncertainty
    • Marrakech, Morocco, Sep
    • M. Krawczyk, R. Rehr, and T. Gerkmann, "Phase-sensitive real-time capable speech enhancement under voiced-unvoiced uncertainty," in Proc. EURASIP Eur. Signal Process. Conf. (EUSIPCO), Marrakech, Morocco, Sep. 2013.
    • (2013) Proc. EURASIP Eur. Signal Process. Conf. (EUSIPCO)
    • Krawczyk, M.1    Rehr, R.2    Gerkmann, T.3
  • 14
    • 84905015144 scopus 로고    scopus 로고
    • Bayesian estimation of clean speech spectral coefficients given a priori knowledge of the phase
    • Aug
    • T. Gerkmann, "Bayesian estimation of clean speech spectral coefficients given a priori knowledge of the phase," IEEE Trans. Signal Process., vol. 62, no. 16, pp. 4199-4208, Aug 2014.
    • (2014) IEEE Trans. Signal Process , vol.62 , Issue.16 , pp. 4199-4208
    • Gerkmann, T.1
  • 15
    • 64149106876 scopus 로고
    • Speech synthesis from short-time fourier transform magnitude and its application to speech processing
    • Mar
    • D. Griffin, D. Deadrick, and J. Lim, "Speech synthesis from short-time Fourier transform magnitude and its application to speech processing," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Mar. 1984, vol. 9, pp. 61-64.
    • (1984) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.9 , pp. 61-64
    • Griffin, D.1    Deadrick, D.2    Lim, J.3
  • 17
    • 84878992874 scopus 로고    scopus 로고
    • Speech enhancement by maintaining phase continuity
    • Nov
    • E. Mehmetcik and T. Çiloʇlu, "Speech enhancement by maintaining phase continuity," in Proc. Meetings Acoust. Soc. Amer., Nov. 2012, vol. 18, no. 055002.
    • (2012) Proc. Meetings Acoust. Soc. Amer , vol.18
    • Mehmetcik, E.1    Çiloʇlu, T.2
  • 18
    • 84867229000 scopus 로고    scopus 로고
    • Speech analysis using instantaneous frequency deviation
    • Brisbane, Australia, Sep
    • A. P. Stark and K. K. Paliwal, "Speech analysis using instantaneous frequency deviation," in Proc. ISCA Interspeech, Brisbane, Australia, Sep. 2008, vol. 9, pp. 2602-2605.
    • (2008) Proc. ISCA Interspeech , vol.9 , pp. 2602-2605
    • Stark, A.P.1    Paliwal, K.K.2
  • 19
    • 70450163916 scopus 로고    scopus 로고
    • Group-delay-deviation based spectral analysis of speech
    • Brighton, U.K., Sep
    • A. P. Stark and K. K. Paliwal, "Group-delay-deviation based spectral analysis of speech," in Proc. ISCA Interspeech, Brighton, U.K., Sep.2009, vol. 10, pp. 1083-1086.
    • (2009) Proc. ISCA Interspeech , vol.10 , pp. 1083-1086
    • Stark, A.P.1    Paliwal, K.K.2
  • 20
    • 0022093620 scopus 로고
    • Noise suppression by spectral magnitude estimation-mechanism and theoretical limits
    • May
    • P. Vary, "Noise suppression by spectral magnitude estimation-mechanism and theoretical limits," ELSEVIER Signal Process., vol. 8, pp.387-400, May 1985.
    • (1985) ELSEVIER Signal Process , vol.8 , pp. 387-400
    • Vary, P.1
  • 23
    • 0031214234 scopus 로고    scopus 로고
    • Speech enhancement using statebased estimation and sinusoidal modeling
    • M. E. Deisher and A. S. Spanias, "Speech enhancement using statebased estimation and sinusoidal modeling," J. Acoust. Soc. Amer., vol.102, no. 2, pp. 1141-1148, 1997.
    • (1997) J. Acoust. Soc. Amer , vol.102 , Issue.2 , pp. 1141-1148
    • Deisher, M.E.1    Spanias, A.S.2
  • 24
    • 0035472866 scopus 로고    scopus 로고
    • Speech enhancement using a constrained iterative sinusoidal model
    • Oct
    • J. Jensen and J. H. Hansen, "Speech enhancement using a constrained iterative sinusoidal model," IEEE Trans. Speech Audio Process., vol.9, no. 7, pp. 731-740, Oct. 2001.
    • (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.7 , pp. 731-740
    • Jensen, J.1    Hansen, J.H.2
  • 25
    • 84875850302 scopus 로고    scopus 로고
    • Stochastic-deterministic mmse stft speech enhancement with general a priori information
    • Jul
    • M. McCallum and B. Guillemin, "Stochastic-deterministic MMSE STFT speech enhancement with general a priori information," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 7, pp. 1445-1457, Jul. 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process , vol.21 , Issue.7 , pp. 1445-1457
    • McCallum, M.1    Guillemin, B.2
  • 26
    • 84863772450 scopus 로고
    • Speech analysis/synthesis based on a sinusoidal representation
    • Aug
    • R. McAulay and T. Quatieri, "Speech analysis/synthesis based on a sinusoidal representation," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, no. 4, pp. 744-754, Aug 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-34 , Issue.4 , pp. 744-754
    • McAulay, R.1    Quatieri, T.2
  • 27
    • 0029763793 scopus 로고    scopus 로고
    • Low bit rate high quality audio coding with combined harmonic and wavelet representations
    • May
    • K. Hamdy, M. Ali, and A. Tewfik, "Low bit rate high quality audio coding with combined harmonic and wavelet representations," in Proc.IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP),May 1996, vol. 2, pp. 1045-1048.
    • (1996) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.2 , pp. 1045-1048
    • Hamdy, K.1    Ali, M.2    Tewfik, A.3
  • 29
    • 84897944294 scopus 로고    scopus 로고
    • PEFAC - A pitch estimation algorithm robust to high levels of noise
    • Feb
    • S. Gonzalez and M. Brookes, "PEFAC-A pitch estimation algorithm robust to high levels of noise," IEEE/ACM Trans. Audio, Speech, Lang.Process., vol. 22, no. 2, pp. 518-530, Feb. 2014.
    • (2014) IEEE/ACM Trans. Audio, Speech, Lang.Process , vol.22 , Issue.2 , pp. 518-530
    • Gonzalez, S.1    Brookes, M.2
  • 30
    • 84861121219 scopus 로고    scopus 로고
    • Joint fundamental frequency and order estimation using optimal filtering
    • M. Christensen, J. Hojvang, A. Jakobsson, and S. Jensen, "Joint fundamental frequency and order estimation using optimal filtering," EURASIP J. Adv. Signal Process., vol. 2011, no. 1, p. 13, 2011.
    • (2011) EURASIP J. Adv. Signal Process , vol.2011 , Issue.1 , pp. 13
    • Christensen, M.1    Hojvang, J.2    Jakobsson, A.3    Jensen, S.4
  • 36
    • 44149106061 scopus 로고    scopus 로고
    • Evaluation of objective quality measures for speech enhancement
    • Jan
    • Y. Hu and P. Loizou, "Evaluation of objective quality measures for speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol.16, no. 1, pp. 229-238, Jan. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process , vol.16 , Issue.1 , pp. 229-238
    • Hu, Y.1    Loizou, P.2
  • 37
    • 84857498666 scopus 로고    scopus 로고
    • Unbiased mmse-based noise power estimation with low complexity and low tracking delay
    • May
    • T. Gerkmann and R. C. Hendriks, "Unbiased MMSE-based noise power estimation with low complexity and low tracking delay," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 4, pp. 1383-1393, May 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.4 , pp. 1383-1393
    • Gerkmann, T.1    Hendriks, R.C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.