메뉴 건너뛰기




Volumn 21, Issue 10, 2013, Pages 2140-2151

Supervised and unsupervised speech enhancement using nonnegative matrix factorization

Author keywords

Bayesian inference; HMM; nonnegative matrix factorization (NMF); PLCA; speech enhancement

Indexed keywords

BAYESIAN INFERENCE; HMM; MINIMUM MEAN-SQUARE ERROR ESTIMATORS; NOISY SPEECH SIGNALS; NONNEGATIVE MATRIX FACTORIZATION; PLCA; SPEECH ENHANCEMENT METHODS; SPEECH ENHANCEMENT SYSTEM;

EID: 84881053943     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2013.2270369     Document Type: Article
Times cited : (427)

References (51)
  • 1
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979. (Pubitemid 9467471)
    • (1979) IEEE Trans Acoust Speech Signal Process , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll Steven, F.1
  • 2
    • 0018642851 scopus 로고
    • Enhancement and bandwidth compression of noisy speech
    • J. S. Lim and V. O. Alan, "Enhancement and bandwidth compression of noisy speech," Proc. IEEE, vol. 67, no. 12, pp. 1586-1604, Dec. 1979. (Pubitemid 10179553)
    • (1979) Proceedings of the IEEE , vol.67 , Issue.12 , pp. 1586-1604
    • Lim, J.S.1    Oppenheim, A.V.2
  • 4
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean square error short-time spectral amplitude estimator
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean square error short-time spectral amplitude estimator," IEEE Trans. Audio, Speech, Lang. Process., vol. ASSP-32, no. 6, pp. 1109-1121, 1984.
    • (1984) IEEE Trans. Audio, Speech, Lang. Process., Vol. ASSP-32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 5
    • 27644556974 scopus 로고    scopus 로고
    • Speech enhancement based on minimum mean-square error estimation and supergaussian priors
    • DOI 10.1109/TSA.2005.851927
    • R. Martin, "Speech enhancement based on minimum mean-square error estimation and supergaussian priors," IEEE Trans. Audio, Speech, Lang. Process., vol. 13, no. 5, pp. 845-856, Sep. 2005. (Pubitemid 41558900)
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.5 , pp. 845-856
    • Martin, R.1
  • 6
    • 32644447834 scopus 로고    scopus 로고
    • Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models
    • DOI 10.1016/j.sigpro.2005.06.005, PII S0165168405001982
    • I. Cohen, "Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models," Signal Process., vol. 86, no. 4, pp. 698-709, Apr. 2006. (Pubitemid 43242967)
    • (2006) Signal Processing , vol.86 , Issue.4 , pp. 698-709
    • Cohen, I.1
  • 7
    • 51449104842 scopus 로고    scopus 로고
    • Minimum Mean-Square Error estimation of discrete Fourier coefficients with generalized Gamma priors
    • Aug
    • J. S. Erkelens, R. C. Hendriks, R. Heusdens, and J. Jensen, "Minimum Mean-Square Error estimation of discrete Fourier coefficients with generalized Gamma priors," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1741-1752, Aug. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.6 , pp. 1741-1752
    • Erkelens, J.S.1    Hendriks, R.C.2    Heusdens, R.3    Jensen, J.4
  • 8
    • 33846907750 scopus 로고    scopus 로고
    • A Laplacian-based MMSE estimator for speech enhancement
    • DOI 10.1016/j.specom.2006.12.005, PII S0167639306001956
    • B. Chen and P. C. Loizou, "A Laplacian-based MMSE estimator for speech enhancement," Speech Commun., vol. 49, no. 2, pp. 134-143, Feb. 2007. (Pubitemid 46241513)
    • (2007) Speech Communication , vol.49 , Issue.2 , pp. 134-143
    • Chen, B.1    Loizou, P.C.2
  • 10
    • 0035396555 scopus 로고    scopus 로고
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • DOI 10.1109/89.928915, PII S106366760104980X
    • R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504-512, Jul. 2001. (Pubitemid 32631178)
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.5 , pp. 504-512
    • Martin, R.1
  • 11
    • 0041360463 scopus 로고    scopus 로고
    • Noise spectrum estimation in adverse environments : Improved minima controlled recursive averaging
    • Sep
    • I. Cohen, "Noise spectrum estimation in adverse environments : Improved minima controlled recursive averaging," IEEE Trans. Speech Audio Process., vol. 11, no. 5, pp. 466-475, Sep. 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.5 , pp. 466-475
    • Cohen, I.1
  • 13
    • 0030247604 scopus 로고    scopus 로고
    • Codebook constrained wiener filtering for speech enhancement
    • PII S1063667696067156
    • T. Sreenivas and P.Kirnapure, "Codebook constrainedWiener filtering for speech enhancement," IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 383-389, Sep. 1996. (Pubitemid 126753026)
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.5 , pp. 383-389
    • Sreenivas, T.V.1    Kirnapure, P.2
  • 14
    • 33744970011 scopus 로고    scopus 로고
    • Codebook driven short-term predictor parameter estimation for speech enhancement
    • DOI 10.1109/TSA.2005.854113
    • S. Srinivasan, J. Samuelsson, and W. Kleijn, "Codebook driven shortterm predictor parameter estimation for speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 163-176, Jan. 2006. (Pubitemid 43863463)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.1 , pp. 163-176
    • Srinivasan, S.1    Samuelsson, J.2    Kleijn, W.B.3
  • 15
    • 0026843273 scopus 로고
    • A Bayesian estimation approach for speech enhancement using hidden Markov models
    • Apr
    • Y. Ephraim, "A Bayesian estimation approach for speech enhancement using hidden Markov models," IEEE Trans. Signal Process., vol. 40, no. 4, pp. 725-735, Apr. 1992.
    • (1992) IEEE Trans. Signal Process. , vol.40 , Issue.4 , pp. 725-735
    • Ephraim, Y.1
  • 16
    • 0032166087 scopus 로고    scopus 로고
    • HMM-based strategies for enhancement of speech signals embedded in nonstationary noise
    • Sep
    • H. Sameti, H. Sheikhzadeh, L. Deng, and R. Brennan, "HMM-based strategies for enhancement of speech signals embedded in nonstationary noise," IEEE Trans. Speech Audio Process., vol. 6, no. 5, pp. 445-455, Sep. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.5 , pp. 445-455
    • Sameti, H.1    Sheikhzadeh, H.2    Deng, L.3    Brennan, R.4
  • 17
    • 51449116166 scopus 로고    scopus 로고
    • HMM-based gainmodeling for enhancement of speech in noise
    • Mar
    • D.Y.Zhao andW. B. Kleijn, "HMM-based gainmodeling for enhancement of speech in noise," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 882-892, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 882-892
    • Zhao, D.Y.1    Kleijn, W.B.2
  • 18
    • 84873620144 scopus 로고    scopus 로고
    • Spectral domain speech enhancement using HMM state-dependent super-Gaussian priors
    • Mar
    • N. Mohammadiha, R. Martin, and A. Leijon, "Spectral domain speech enhancement using HMM state-dependent super-Gaussian priors," IEEE Signal Process. Lett., vol. 20, no. 3, pp. 253-256, Mar. 2013.
    • (2013) IEEE Signal Process. Lett. , vol.20 , Issue.3 , pp. 253-256
    • Mohammadiha, N.1    Martin, R.2    Leijon, A.3
  • 19
    • 84870253774 scopus 로고    scopus 로고
    • Speech enhancement using hidden Markov models in Mel-frequency domain
    • Feb
    • H. Veisi and H. Sameti, "Speech enhancement using hidden Markov models in Mel-frequency domain," Speech Commun., vol. 55, no. 2, pp. 205-220, Feb. 2013.
    • (2013) Speech Commun. , vol.55 , Issue.2 , pp. 205-220
    • Veisi, H.1    Sameti, H.2
  • 23
    • 38049021850 scopus 로고    scopus 로고
    • Convolutive speech bases and their application to supervised speech separation
    • Jan
    • P. Smaragdis, "Convolutive speech bases and their application to supervised speech separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 1-12, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 1-12
    • Smaragdis, P.1
  • 24
    • 50249152311 scopus 로고    scopus 로고
    • Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria
    • Mar
    • T. Virtanen, "Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 1066-1074
    • Virtanen, T.1
  • 25
    • 63249085556 scopus 로고    scopus 로고
    • Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis
    • C. Févotte, N. Bertin, and J. L. Durrieu, "Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis," Neural Comput., vol. 21, pp. 793-830, 2009.
    • (2009) Neural Comput. , vol.21 , pp. 793-830
    • Févotte, C.1    Bertin, N.2    Durrieu, J.L.3
  • 26
    • 84873897366 scopus 로고    scopus 로고
    • Nonnegative HMM for babble noise derived from speech HMM: Application to speech enhancement
    • May
    • N. Mohammadiha and A. Leijon, "Nonnegative HMM for babble noise derived from speech HMM: Application to speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 5, pp. 998-1011, May 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , Issue.5 , pp. 998-1011
    • Mohammadiha, N.1    Leijon, A.2
  • 27
    • 84867198451 scopus 로고    scopus 로고
    • Regularized non-negative matrix factorization with temporal dependencies for speech denoising
    • K. W. Wilson, B. Raj, and P. Smaragdis, "Regularized non-negative matrix factorization with temporal dependencies for speech denoising," in Proc. Int. Conf. Spoken Lang. Process. (Interspeech), 2008, pp. 411-414.
    • (2008) Proc. Int. Conf. Spoken Lang. Process. (Interspeech) , pp. 411-414
    • Wilson, K.W.1    Raj, B.2    Smaragdis, P.3
  • 30
    • 80051625972 scopus 로고    scopus 로고
    • A non-negative approach to semisupervised separation of speech from noise with the use of temporal dynamics
    • May
    • G. J.Mysore and P. Smaragdis, "A non-negative approach to semisupervised separation of speech from noise with the use of temporal dynamics," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process. (ICASSP), May 2011, pp. 17-20.
    • (2011) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process. (ICASSP) , pp. 17-20
    • Mysore, G.J.1    Smaragdis, P.2
  • 32
    • 76749107542 scopus 로고    scopus 로고
    • Online learning for matrix factorization and sparse coding
    • J.Mairal, F. Bach, J. Ponce, andG. Sapiro, "Online learning for matrix factorization and sparse coding," J. Mach. Learn. Res., vol. 11, pp. 19-60, 2010.
    • (2010) J. Mach. Learn. Res. , vol.11 , pp. 19-60
    • Mairal, J.1    Bach, F.2    Ponce, J.3    Sapiro, G.4
  • 34
    • 67650927380 scopus 로고    scopus 로고
    • Bayesian inference for nonnegativematrix factorisation models
    • Article ID 785152 2009
    • A. T. Cemgil, "Bayesian inference for nonnegativematrix factorisation models," Computat. Intell. Neurosci., vol. 2009, no. Article ID 785152, p. 17 pages, 2009.
    • (2009) Computat. Intell. Neurosci. , pp. 17
    • Cemgil, A.T.1
  • 36
    • 84900510076 scopus 로고    scopus 로고
    • Non-negative matrix factorization with sparseness constraints
    • P. O. Hoyer, "Non-negative matrix factorization with sparseness constraints," J. Mach. Learn. Res., vol. 5, pp. 1457-1469, 2004.
    • (2004) J. Mach. Learn. Res , vol.5 , pp. 1457-1469
    • Hoyer, P.O.1
  • 41
    • 0003857778 scopus 로고    scopus 로고
    • A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models
    • Tech. Rep. ICSI-TR-97-021
    • J. A. Bilmes, "A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models," Univ. of California, Berkeley, 1997, Tech. Rep. ICSI-TR-97- 021.
    • (1997) Univ. Of California Berkeley
    • Bilmes, J.A.1
  • 43
    • 59849095077 scopus 로고    scopus 로고
    • Perceptual evaluation of speech quality (PESQ), and objectivemethod for end-to-end speech quality assessment of narrowband telephone networks and speech codecs
    • I.-T. P.862
    • "Perceptual evaluation of speech quality (PESQ), and objectivemethod for end-to-end speech quality assessment of narrowband telephone networks and speech codecs," Tech. Rep. 2000, I.-T. P.862.
    • (2000) Tech. Rep.
  • 44
    • 84867201503 scopus 로고    scopus 로고
    • Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis
    • C. Kim and R. M. Stern, "Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis," in Proc. Int. Conf. Spoken Lang. Process. (Interspeech), 2008, pp. 2598-2601.
    • (2008) Proc. Int. Conf. Spoken Lang. Process. (Interspeech) , pp. 2598-2601
    • Kim, C.1    Stern, R.M.2
  • 46
    • 84881068716 scopus 로고    scopus 로고
    • Model order selection for nonnegative matrix factorization with application to speech enhancement
    • N. Mohammadiha and A. Leijon, "Model order selection for nonnegative matrix factorization with application to speech enhancement," Tech. Rep. KTH Royal Inst. of Technol., 2011.
    • Tech. Rep. KTH Royal Inst. of Technol. , vol.2011
    • Mohammadiha, N.1    Leijon, A.2
  • 47
    • 0027623210 scopus 로고
    • Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • Jul
    • A. Varga and H. J. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Commun., vol. 12, no. 3, pp. 247-251, Jul. 1993.
    • (1993) Speech Commun. , vol.12 , Issue.3 , pp. 247-251
    • Varga, A.1    Steeneken, H.J.2
  • 50
    • 51449123884 scopus 로고    scopus 로고
    • Recent advancements in speech enhancement
    • Boca Raton, FL, USA: CRC
    • Y. Ephraim and I. Cohen, "Recent advancements in speech enhancement," in The Electrical Engineering Handbook. Boca Raton, FL, USA: CRC, 2005.
    • (2005) The Electrical Engineering Handbook
    • Ephraim, Y.1    Cohen, I.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.