SCOPUS Information Search Platform

IEEE/ACM Transactions on Audio Speech and Language Processing

Volume 22, Issue 12, 2014, Pages 1931-1940

STFT phase reconstruction in voiced speech for an improved single-channel speech enhancement

(2) Krawczyk, Martin a Gerkmann, Timo a

a UNIVERSITY OF OLDENBURG (Germany)

Author keywords

Noise reduction; Phase estimation; Signal reconstruction; Speech enhancement

Indexed keywords

DISCRETE FOURIER TRANSFORMS; NOISE ABATEMENT; SIGNAL RECONSTRUCTION; SIGNAL TO NOISE RATIO; SPEECH; SPEECH ANALYSIS;

AMPLITUDE ENHANCEMENTS; AMPLITUDE ESTIMATION; FUNDAMENTAL FREQUENCIES; MICROPHONE SIGNALS; NOISY OBSERVATIONS; PHASE ESTIMATION; PHASE RECONSTRUCTION; SPECTRAL AMPLITUDE;

SPEECH ENHANCEMENT;

EID: 84921800494 PISSN: 23299290 EISSN: None Source Type: Journal
DOI: 10.1109/TASLP.2014.2354236 Document Type: Article

Times cited : (217)

References (38)

1
- 0021407831
- Signal estimation from modified short-time Fourier transform
- Apr
- D. W. Griffin and J. S. Lim, "Signal estimation from modified short-time Fourier transform," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 2, pp. 236-243, Apr. 1984.
- (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-32 , Issue.2 , pp. 236-243
- Griffin, D.W.¹ Lim, J.S.²

2
- 0021645331
- Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
- Dec
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
- (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

3
- 0021892216
- Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
- Apr
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 2, pp. 443-445, Apr. 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-33 , Issue.2 , pp. 443-445
- Ephraim, Y.¹ Malah, D.²

4
- 79952363352
- The importance of phase in speech enhancement
- Apr
- K. Paliwal, K. Wójcicki, and B. Shannon, "The importance of phase in speech enhancement," ELSEVIER Speech Commun., vol. 53, no. 4, pp. 465-494, Apr. 2011.
- (2011) ELSEVIER Speech Commun , vol.53 , Issue.4 , pp. 465-494
- Paliwal, K.¹ Wójcicki, K.² Shannon, B.³

5
- 77950415416
- On the significance of phase in the short term Fourier spectrum for speech intelligibility
- Mar
- M. Kazama, S. Gotoh, M. Tohyama, and T. Houtgast, "On the significance of phase in the short term Fourier spectrum for speech intelligibility," J. Acoust. Soc. Amer., vol. 127, no. 3, pp. 1432-1439, Mar. 2010.
- (2010) J. Acoust. Soc. Amer , vol.127 , Issue.3 , pp. 1432-1439
- Kazama, M.¹ Gotoh, S.² Tohyama, M.³ Houtgast, T.⁴

6
- 84890503044
- Phase randomization - A new paradigm for single-channel signal enhancement
- Vancouver, BC, Canada, May
- A. Sugiyama and R. Miyahara, "Phase randomization - a new paradigm for single-channel signal enhancement," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Vancouver, BC, Canada, May 2013, pp. 7487-7491.
- (2013) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 7487-7491
- Sugiyama, A.¹ Miyahara, R.²

7
- 84872720036
- Signal reconstruction from STFT magnitude: A state of the art
- Paris, France, Sep
- N. Sturmel and L. Daudet, "Signal reconstruction from STFT magnitude: A state of the art," in Proc. Int. Conf. Digital Audio Effects (DAFx), Paris, France, Sep. 2011, pp. 375-386.
- (2011) Proc. Int. Conf. Digital Audio Effects (DAFx) , pp. 375-386
- Sturmel, N.¹ Daudet, L.²

8
- 84873346243
- Consistent Wiener filtering for audio source separation
- Mar
- J. Le Roux and E. Vincent, "Consistent Wiener filtering for audio source separation," IEEE Signal Process. Lett., vol. 20, no. 3, pp. 217-220, Mar. 2013.
- (2013) IEEE Signal Process. Lett , vol.20 , Issue.3 , pp. 217-220
- Le Roux, J.¹ Vincent, E.²

9
- 77949635098
- Iterative phase estimation for the synthesis of separated sources from single-channel mixtures
- May
- D. Gunawan and D. Sen, "Iterative phase estimation for the synthesis of separated sources from single-channel mixtures," IEEE Signal Process. Lett., vol. 17, no. 5, pp. 421-424, May 2010.
- (2010) IEEE Signal Process. Lett , vol.17 , Issue.5 , pp. 421-424
- Gunawan, D.¹ Sen, D.²

10
- 84878414736
- Phase estimation for signal reconstruction in single-channel speech separation
- Portland, OR, USA, Sep
- P. Mowlaee, R. Saeidi, and R. Martin, "Phase estimation for signal reconstruction in single-channel speech separation," in Proc. ISCA Interspeech, Portland, OR, USA, Sep. 2012.
- (2012) Proc. ISCA Interspeech
- Mowlaee, P.¹ Saeidi, R.² Martin, R.³

11
- 84871960544
- Phase estimation in speech enhancement - Unimportant, important, or impossible?
- Eilat, Israel, Nov
- T. Gerkmann, M. Krawczyk, and R. Rehr, "Phase estimation in speech enhancement - unimportant, important, or impossible?," in Proc. IEEE Conv. Elect. Electron. Eng. Israel, Eilat, Israel, Nov. 2012.
- (2012) Proc. IEEE Conv. Elect. Electron. Eng. Israel
- Gerkmann, T.¹ Krawczyk, M.² Rehr, R.³

12
- 84871802498
- MMSE-optimal spectral amplitude estimation given the STFT-phase
- Feb
- T. Gerkmann and M. Krawczyk, "MMSE-optimal spectral amplitude estimation given the STFT-phase," IEEE Signal Process. Lett., vol. 20, no. 2, pp. 129-132, Feb. 2013.
- (2013) IEEE Signal Process. Lett , vol.20 , Issue.2 , pp. 129-132
- Gerkmann, T.¹ Krawczyk, M.²

13
- 84901345301
- Phase-sensitive real-time capable speech enhancement under voiced-unvoiced uncertainty
- Marrakech, Morocco, Sep
- M. Krawczyk, R. Rehr, and T. Gerkmann, "Phase-sensitive real-time capable speech enhancement under voiced-unvoiced uncertainty," in Proc. EURASIP Eur. Signal Process. Conf. (EUSIPCO), Marrakech, Morocco, Sep. 2013.
- (2013) Proc. EURASIP Eur. Signal Process. Conf. (EUSIPCO)
- Krawczyk, M.¹ Rehr, R.² Gerkmann, T.³

14
- 84905015144
- Bayesian estimation of clean speech spectral coefficients given a priori knowledge of the phase
- Aug
- T. Gerkmann, "Bayesian estimation of clean speech spectral coefficients given a priori knowledge of the phase," IEEE Trans. Signal Process., vol. 62, no. 16, pp. 4199-4208, Aug. 2014.
- (2014) IEEE Trans. Signal Process , vol.62 , Issue.16 , pp. 4199-4208
- Gerkmann, T.¹

15
- 64149106876
- Speech synthesis from short-time Fourier transform magnitude and its application to speech processing
- Mar
- D. Griffin, D. Deadrick, and J. Lim, "Speech synthesis from short-time Fourier transform magnitude and its application to speech processing," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Mar. 1984, vol. 9, pp. 61-64.
- (1984) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.9 , pp. 61-64
- Griffin, D.¹ Deadrick, D.² Lim, J.³

16
- 84957716236
- STFT phase improvement for single channel speech enhancement
- Aachen, Germany, Sep
- M. Krawczyk and T. Gerkmann, "STFT phase improvement for single channel speech enhancement," in Proc. Int. Workshop Acoust. Echo, Noise Control (IWAENC), Aachen, Germany, Sep. 2012.
- (2012) Proc. Int. Workshop Acoust. Echo, Noise Control (IWAENC)
- Krawczyk, M.¹ Gerkmann, T.²

17
- 84878992874
- Speech enhancement by maintaining phase continuity
- Nov
- E. Mehmetcik and T. Çiloğlu, "Speech enhancement by maintaining phase continuity," in Proc. Meetings Acoust. Soc. Amer., Nov. 2012, vol. 18, no. 055002.
- (2012) Proc. Meetings Acoust. Soc. Amer , vol.18
- Mehmetcik, E.¹ Çiloğlu, T.²

18
- 84867229000
- Speech analysis using instantaneous frequency deviation
- Brisbane, Australia, Sep
- A. P. Stark and K. K. Paliwal, "Speech analysis using instantaneous frequency deviation," in Proc. ISCA Interspeech, Brisbane, Australia, Sep. 2008, vol. 9, pp. 2602-2605.
- (2008) Proc. ISCA Interspeech , vol.9 , pp. 2602-2605
- Stark, A.P.¹ Paliwal, K.K.²

19
- 70450163916
- Group-delay-deviation based spectral analysis of speech
- Brighton, U.K., Sep
- A. P. Stark and K. K. Paliwal, "Group-delay-deviation based spectral analysis of speech," in Proc. ISCA Interspeech, Brighton, U.K., Sep. 2009, vol. 10, pp. 1083-1086.
- (2009) Proc. ISCA Interspeech , vol.10 , pp. 1083-1086
- Stark, A.P.¹ Paliwal, K.K.²

20
- 0022093620
- Noise suppression by spectral magnitude estimation-mechanism and theoretical limits
- May
- P. Vary, "Noise suppression by spectral magnitude estimation-mechanism and theoretical limits," ELSEVIER Signal Process., vol. 8, pp. 387-400, May 1985.
- (1985) ELSEVIER Signal Process , vol.8 , pp. 387-400
- Vary, P.¹

21
- 0022907822
- Pitch detection using the short-term phase spectrum
- Tokyo, Japan, Apr
- F. J. Charpentier, "Pitch detection using the short-term phase spectrum," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Tokyo, Japan, Apr. 1986, pp. 113-116.
- (1986) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 113-116
- Charpentier, F.J.¹

22
- 0025659239
- Noise reduction using a soft-decision sine-wave vector quantizer
- Apr
- T. Quatieri and R. McAulay, "Noise reduction using a soft-decision sine-wave vector quantizer," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Apr. 1990, vol. 2, pp. 821-824.
- (1990) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.2 , pp. 821-824
- Quatieri, T.¹ McAulay, R.²

23
- 0031214234
- Speech enhancement using state-based estimation and sinusoidal modeling
- M. E. Deisher and A. S. Spanias, "Speech enhancement using state-based estimation and sinusoidal modeling," J. Acoust. Soc. Amer., vol. 102, no. 2, pp. 1141-1148, 1997.
- (1997) J. Acoust. Soc. Amer , vol.102 , Issue.2 , pp. 1141-1148
- Deisher, M.E.¹ Spanias, A.S.²

24
- 0035472866
- Speech enhancement using a constrained iterative sinusoidal model
- Oct
- J. Jensen and J. H. Hansen, "Speech enhancement using a constrained iterative sinusoidal model," IEEE Trans. Speech Audio Process., vol. 9, no. 7, pp. 731-740, Oct. 2001.
- (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.7 , pp. 731-740
- Jensen, J.¹ Hansen, J.H.²

25
- 84875850302
- Stochastic-deterministic MMSE STFT speech enhancement with general a priori information
- Jul
- M. McCallum and B. Guillemin, "Stochastic-deterministic MMSE STFT speech enhancement with general a priori information," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 7, pp. 1445-1457, Jul. 2013.
- (2013) IEEE Trans. Audio, Speech, Lang. Process , vol.21 , Issue.7 , pp. 1445-1457
- McCallum, M.¹ Guillemin, B.²

26
- 84863772450
- Speech analysis/synthesis based on a sinusoidal representation
- Aug
- R. McAulay and T. Quatieri, "Speech analysis/synthesis based on a sinusoidal representation," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, no. 4, pp. 744-754, Aug. 1986.
- (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-34 , Issue.4 , pp. 744-754
- McAulay, R.¹ Quatieri, T.²

27
- 0029763793
- Low bit rate high quality audio coding with combined harmonic and wavelet representations
- May
- K. Hamdy, M. Ali, and A. Tewfik, "Low bit rate high quality audio coding with combined harmonic and wavelet representations," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), May 1996, vol. 2, pp. 1045-1048.
- (1996) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.2 , pp. 1045-1048
- Hamdy, K.¹ Ali, M.² Tewfik, A.³

28
- 84889302357
- Chichester, West Sussex, U.K.: Wiley
- P. Vary and R. Martin, Digital Speech Transmission: Enhancement, Coding and Error Concealment. Chichester, West Sussex, U.K.: Wiley, 2006.
- (2006) Digital Speech Transmission: Enhancement, Coding and Error Concealment
- Vary, P.¹ Martin, R.²

29
- 84897944294
- PEFAC - A pitch estimation algorithm robust to high levels of noise
- Feb
- S. Gonzalez and M. Brookes, "PEFAC - A pitch estimation algorithm robust to high levels of noise," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 22, no. 2, pp. 518-530, Feb. 2014.
- (2014) IEEE/ACM Trans. Audio, Speech, Lang. Process , vol.22 , Issue.2 , pp. 518-530
- Gonzalez, S.¹ Brookes, M.²

30
- 84861121219
- Joint fundamental frequency and order estimation using optimal filtering
- M. Christensen, J. Hojvang, A. Jakobsson, and S. Jensen, "Joint fundamental frequency and order estimation using optimal filtering," EURASIP J. Adv. Signal Process., vol. 2011, no. 1, p. 13, 2011.
- (2011) EURASIP J. Adv. Signal Process , vol.2011 , Issue.1 , pp. 13
- Christensen, M.¹ Hojvang, J.² Jakobsson, A.³ Jensen, S.⁴

31
- 84921800871
- [Online], Feb
- S. Gonzalez, "Pitch of the core TIMIT database set," [Online]. Available: http://www.ee.ic.ac.uk/hp/staff/dmb/data/TIMITfxv.zip, Feb. 2014.
- (2014) Pitch of the Core TIMIT Database Set
- Gonzalez, S.¹

32
- 51449124098
- DARPA TIMIT acoustic-phonetic speech database
- J. S. Garofolo, "DARPA TIMIT acoustic-phonetic speech database," Nat. Insti. of Stand. and Technol. (NIST) 1988.
- (1988) Nat. Insti. of Stand. and Technol. (NIST)
- Garofolo, J.S.¹

33
- 0003639435
- ITU-T, ITU-T Rec. P.862
- ITU-T, "Perceptual evaluation of speech quality (PESQ)," ITU-T Rec. P.862, 2001.
- (2001) Perceptual Evaluation of Speech Quality (PESQ)

34
- 0017787719
- A study of complexity and quality of speech waveform coders
- Apr
- J. Tribolet, P. Noll, B. McDermott, and R. Crochiere, "A study of complexity and quality of speech waveform coders," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Apr. 1978, vol. 3, pp. 586-590.
- (1978) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.3 , pp. 586-590
- Tribolet, J.¹ Noll, P.² McDermott, B.³ Crochiere, R.⁴

35
- 48349113750
- [Online]
- M. Brookes, "VOICEBOX: A speech processing toolbox for MATLAB," [Online]. Available: http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html
- VOICEBOX: A Speech Processing Toolbox for MATLAB
- Brookes, M.¹

36
- 44149106061
- Evaluation of objective quality measures for speech enhancement
- Jan
- Y. Hu and P. Loizou, "Evaluation of objective quality measures for speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 1, pp. 229-238, Jan. 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process , vol.16 , Issue.1 , pp. 229-238
- Hu, Y.¹ Loizou, P.²

37
- 84857498666
- Unbiased MMSE-based noise power estimation with low complexity and low tracking delay
- May
- T. Gerkmann and R. C. Hendriks, "Unbiased MMSE-based noise power estimation with low complexity and low tracking delay," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 4, pp. 1383-1393, May 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.4 , pp. 1383-1393
- Gerkmann, T.¹ Hendriks, R.C.²

38
- 84921819178
- [Online]
- M. Krawczyk and T. Gerkmann, "STFT phase reconstruction based on a harmonic model: Listening examples and code," [Online]. Available:
- STFT Phase Reconstruction Based on A Harmonic Model: Listening Examples and Code
- Krawczyk, M.¹ Gerkmann, T.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.