메뉴 건너뛰기




Volumn 21, Issue 5, 2013, Pages 998-1011

Nonnegative HMM for babble noise derived from speech HMM: Application to speech enhancement

Author keywords

Babble noise; hidden Markov model; nonnegative matrix factorization; speech enhancement

Indexed keywords

BABBLE NOISE; BASIS MATRIX; BASIS VECTOR; COCKTAIL PARTY; CONVENTIONAL METHODS; EXPECTATION-MAXIMIZATION ALGORITHMS; GAIN PARAMETER; MULTI-TALKER BABBLE; NOISE REDUCTION ALGORITHMS; NON NEGATIVES; NONNEGATIVE MATRIX FACTORIZATION; POWER-SPECTRA; PROCESSING ALGORITHMS; RECURSIVE EM; SPARSE NON-NEGATIVE MATRIX FACTORIZATIONS; SPARSITY CONSTRAINTS; SPEECH SIGNALS; SPEECH WAVEFORMS; STATIONARY MODELS; TIME VARYING PARAMETER; WAVE FORMS;

EID: 84873897366     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2013.2243435     Document Type: Article
Times cited : (33)

References (46)
  • 1
    • 80052339383 scopus 로고
    • Some experiments on the recognition of speech, with one and two ears
    • E. Cherry, "Some experiments on the recognition of speech, with one and two ears," J. Acoust. Soc. Amer. (JASA), vol. 25, pp. 975-979, 1953.
    • (1953) J. Acoust. Soc. Amer. (JASA) , vol.25 , pp. 975-979
    • Cherry, E.1
  • 2
    • 0037668478 scopus 로고
    • A review of the cocktail party effect
    • B. Arons, "A review of the cocktail party effect," J. Acoust. Soc. Amer. (JASA), vol. 12, pp. 35-50, 1992.
    • (1992) J. Acoust. Soc. Amer. (JASA) , vol.12 , pp. 35-50
    • Arons, B.1
  • 3
    • 22944480530 scopus 로고    scopus 로고
    • The cocktail party problem
    • S. Haykin and Z. Chen, "The cocktail party problem," Neural Comput., vol. 17, pp. 1875-1902, 2005.
    • (2005) Neural Comput. , vol.17 , pp. 1875-1902
    • Haykin, S.1    Chen, Z.2
  • 4
    • 27744596913 scopus 로고    scopus 로고
    • Consonant identification in N-talker babble is a nonmonotonic function of N
    • S. A. Simpson and M. Cooke, "Consonant identification in N-talker babble is a nonmonotonic function of N," J. Acoust. Soc. Amer. (JASA), vol. 118, no. 5, pp. 2775-2778, 2005.
    • (2005) J. Acoust. Soc. Amer. (JASA) , vol.118 , Issue.5 , pp. 2775-2778
    • Simpson, S.A.1    Cooke, M.2
  • 5
    • 85008053933 scopus 로고    scopus 로고
    • Babble noise: Modeling, analysis, and applications
    • Se
    • N. Krishnamurthy and J. Hansen, "Babble noise: Modeling, analysis, and applications," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 7, pp. 1394-1407, Sep. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.7 , pp. 1394-1407
    • Krishnamurthy, N.1    Hansen, J.2
  • 6
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
    • Dec.
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator," IEEE Trans. Audio, Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE Trans. Audio, Speech, Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 7
    • 0032654277 scopus 로고    scopus 로고
    • A dynamic system approach to speech enhancement using the filtering algorithm
    • Jul
    • X. Shen and L. Deng, "A dynamic system approach to speech enhancement using the filtering algorithm," IEEE Trans. Speech Audio Process., vol. 7, no. 4, pp. 391-399, Jul. 1999.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.4 , pp. 391-399
    • Shen, X.1    Deng, L.2
  • 8
    • 0036508204 scopus 로고    scopus 로고
    • Particle Methods for Bayesian Modeling and Enhancement of Speech Signals
    • Mar
    • J. Vermaak, C. Andrieu, A. Doucet, and S. Godsill, "Particle Methods for Bayesian Modeling and Enhancement of Speech Signals," IEEE Trans. Speech Audio Process., vol. 10, no. 3, pp. 173-185, Mar. 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.3 , pp. 173-185
    • Vermaak, J.1    Andrieu, C.2    Doucet, A.3    Godsill, S.4
  • 9
    • 27644556974 scopus 로고    scopus 로고
    • Speech enhancement based on minimum mean-square error estimation and supergaussian priors
    • Se
    • R. Martin, "Speech enhancement based on minimum mean-square error estimation and supergaussian priors," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 845-856, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 845-856
    • Martin, R.1
  • 11
    • 51449104842 scopus 로고    scopus 로고
    • Minimum Mean-Square Error estimation of discrete Fourier coefficients with generalized Gamma priors
    • Aug
    • J. S. Erkelens, R. C. Hendriks, R. Heusdens, and J. Jensen, "Minimum Mean-Square Error estimation of discrete Fourier coefficients with generalized Gamma priors," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1741-1752, Aug. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.6 , pp. 1741-1752
    • Erkelens, J.S.1    Hendriks, R.C.2    Heusdens, R.3    Jensen, J.4
  • 12
    • 0035708733 scopus 로고    scopus 로고
    • Noise reduction in hearing aids: An overview
    • H. Levitt, "Noise reduction in hearing aids: An overview," J. Rehab. Res. Develop., vol. 38, pp. 111-121, 2001.
    • (2001) J. Rehab. Res. Develop. , vol.38 , pp. 111-121
    • Levitt, H.1
  • 13
    • 33744970011 scopus 로고    scopus 로고
    • Codebook driven short-term predictor parameter estimation for speech enhancement
    • Jan
    • S. Srinivasan, J. Samuelsson, and W. Kleijn, "Codebook driven short-term predictor parameter estimation for speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 163-176, Jan. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.1 , pp. 163-176
    • Srinivasan, S.1    Samuelsson, J.2    Kleijn, W.3
  • 14
    • 0026843273 scopus 로고
    • A Bayesian estimation approach for speech enhancement using hidden Markov models
    • Ar
    • Y. Ephraim, "A Bayesian estimation approach for speech enhancement using hidden Markov models," IEEE Trans. Signal Process., vol. 40, no. 4, pp. 725-735, Apr. 1992.
    • (1992) IEEE Trans. Signal Process. , vol.40 , Issue.4 , pp. 725-735
    • Ephraim, Y.1
  • 15
    • 0032166087 scopus 로고    scopus 로고
    • HMM-based strategies for enhancement of speech signals embedded in nonsta-tionary noise
    • Se
    • H. Sameti, H. Sheikhzadeh, L. Deng, and R. Brennan, "HMM-based strategies for enhancement of speech signals embedded in nonsta-tionary noise," IEEE Trans. Speech Audio Process., vol. 6, no. 5, pp. 445-455, Sep. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.5 , pp. 445-455
    • Sameti, H.1    Sheikhzadeh, H.2    Deng, L.3    Brennan, R.4
  • 16
    • 51449116166 scopus 로고    scopus 로고
    • HMM-based gain modeling for enhancement of speech in noise
    • Mar
    • D. Y. Zhao and W. B. Kleijn, "HMM-based gain modeling for enhancement of speech in noise," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 882-892, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 882-892
    • Zhao, D.Y.1    Kleijn, W.B.2
  • 17
    • 0033592606 scopus 로고    scopus 로고
    • Learning the parts of objects by non-negative matrix factorization
    • D. D. Lee and H. S. Seung, "Learning the parts of objects by non-negative matrix factorization," Nature, vol. 401, no. 6755, pp. 788-791, 1999.
    • (1999) Nature , vol.401 , Issue.6755 , pp. 788-791
    • Lee, D.D.1    Seung, H.S.2
  • 18
    • 38049021850 scopus 로고    scopus 로고
    • Convolutive speech bases and their application to supervised speech separation
    • Jan
    • P. Smaragdis, "Convolutive speech bases and their application to supervised speech separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 1-12, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 1-12
    • Smaragdis, P.1
  • 19
    • 63249085556 scopus 로고    scopus 로고
    • Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis
    • C. Févotte, N. Bertin, and J. L. Durrieu, "Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis," Neural Comput., vol. 21, pp. 793-830, 2009.
    • (2009) Neural Comput. , vol.21 , pp. 793-830
    • Févotte, C.1    Bertin, N.2    Durrieu, J.L.3
  • 20
    • 76949094445 scopus 로고    scopus 로고
    • Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation
    • Mar
    • A. Ozerov and C. Févotte, "Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 550-563, Mar. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 550-563
    • Ozerov, A.1    Févotte, C.2
  • 22
    • 80051625972 scopus 로고    scopus 로고
    • A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics
    • May
    • G. J. Mysore and P. Smaragdis, "A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process. (ICASSP), May 2011, pp. 17-20.
    • (2011) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process. (ICASSP) , pp. 17-20
    • Mysore, G.J.1    Smaragdis, P.2
  • 27
    • 0027797470 scopus 로고
    • On-line estimation of hidden Markov model parameters based on the Kullback-Leibler information measure
    • Aug
    • V. Krishnamurthy and J. Moore, "On-line estimation of hidden Markov model parameters based on the Kullback-Leibler information measure," IEEE Trans. Signal Process., vol. 41, no. 8, pp. 2557-2573, Aug. 1993.
    • (1993) IEEE Trans. Signal Process. , vol.41 , Issue.8 , pp. 2557-2573
    • Krishnamurthy, V.1    Moore, J.2
  • 28
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb
    • L. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.1
  • 30
    • 84873620144 scopus 로고    scopus 로고
    • Spectral domain speech enhancement using HMM state-dependent super-Gaussian priors
    • Mar
    • N. Mohammadiha, R. Martin, and A. Leijon, "Spectral domain speech enhancement using HMM state-dependent super-Gaussian priors," IEEE Signal Process. Lett., vol. 20, no. 3, pp. 253-256, Mar. 2013.
    • (2013) IEEE Signal Process. Lett. , vol.20 , Issue.3 , pp. 253-256
    • Mohammadiha, N.1    Martin, R.2    Leijon, A.3
  • 31
    • 32644447834 scopus 로고    scopus 로고
    • Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models
    • Apr.
    • I. Cohen, "Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models," Signal Process., vol. 86, no. 4, pp. 698-709, Apr. 2006.
    • (2006) Signal Process. , vol.86 , Issue.4 , pp. 698-709
    • Cohen, I.1
  • 34
    • 0022685753 scopus 로고
    • Continuously variable duration hidden Markov models for automatic speech recognition
    • S. E. Levinson, "Continuously variable duration hidden Markov models for automatic speech recognition," Comput. Speech Lang., vol. 1, pp. 29-45, 1986.
    • (1986) Comput. Speech Lang. , vol.1 , pp. 29-45
    • Levinson, S.E.1
  • 35
    • 0037686659 scopus 로고    scopus 로고
    • The concave-convex procedure
    • A. L. Yuille and A. Rangarajan, "The concave-convex procedure," Neural Comput., vol. 15, pp. 915-936, 2003.
    • (2003) Neural Comput. , vol.15 , pp. 915-936
    • Yuille, A.L.1    Rangarajan, A.2
  • 38
    • 0025494624 scopus 로고
    • Sequential algorithms for parameter estimation based on the Kullback-Leibler information measure
    • Se
    • E. Weinstein, M. Feder, and A. Oppenheim, "Sequential algorithms for parameter estimation based on the Kullback-Leibler information measure," IEEE Trans. Acoust., Speech, Signal Process., vol. 38, no. 9, pp. 1652-1654, Sep. 1990.
    • (1990) IEEE Trans. Acoust., Speech, Signal Process. , vol.38 , Issue.9 , pp. 1652-1654
    • Weinstein, E.1    Feder, M.2    Oppenheim, A.3
  • 42
    • 0002077742 scopus 로고
    • Quantization of LPC parameters
    • W. Kleijn and K. Paliwal, Eds. New York, NY, USA: Elsevier ch. 12
    • K. K. Paliwal and W. B. Kleijn, "Quantization of LPC parameters," in Speech Coding Synth., W. Kleijn and K. Paliwal, Eds. New York, NY, USA: Elsevier, 1995, ch. 12, pp. 443-466.
    • (1995) Speech Coding Synth , pp. 443-466
    • Paliwal, K.K.1    Kleijn, W.B.2
  • 44
    • 22944438092 scopus 로고    scopus 로고
    • Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model
    • T. Lotter and P. Vary, "Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model," EURASIP J. Appl. Signal Process., vol. 2005, pp. 1110-1126, 2005.
    • (2005) EURASIP J. Appl. Signal Process. , vol.2005 , pp. 1110-1126
    • Lotter, T.1    Vary, P.2
  • 46
    • 3042630167 scopus 로고    scopus 로고
    • Characterizations of the distributions of power inverse Gaussian and others based on the entropy maximization principle
    • T. Kawamura and K. Iwase, "Characterizations of the distributions of power inverse Gaussian and others based on the entropy maximization principle," J. Jpn. Statist. Soc., vol. 33, no. 1, pp. 95-104, 2003.
    • (2003) J. Jpn. Statist. Soc. , vol.33 , Issue.1 , pp. 95-104
    • Kawamura, T.1    Iwase, K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.