SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 21, Issue 5, 2013, Pages 998-1011

Nonnegative HMM for babble noise derived from speech HMM: Application to speech enhancement

(2) Mohammadiha, Nasser a Leijon, Arne a

a ROYAL INSTITUTE OF TECHNOLOGY (Sweden)

Author keywords

Babble noise; hidden Markov model; nonnegative matrix factorization; speech enhancement

Indexed keywords

BABBLE NOISE; BASIS MATRIX; BASIS VECTOR; COCKTAIL PARTY; CONVENTIONAL METHODS; EXPECTATION-MAXIMIZATION ALGORITHMS; GAIN PARAMETER; MULTI-TALKER BABBLE; NOISE REDUCTION ALGORITHMS; NON NEGATIVES; NONNEGATIVE MATRIX FACTORIZATION; POWER-SPECTRA; PROCESSING ALGORITHMS; RECURSIVE EM; SPARSE NON-NEGATIVE MATRIX FACTORIZATIONS; SPARSITY CONSTRAINTS; SPEECH SIGNALS; SPEECH WAVEFORMS; STATIONARY MODELS; TIME VARYING PARAMETER; WAVE FORMS;

HIDDEN MARKOV MODELS; MATRIX ALGEBRA; SPEECH; SPEECH ENHANCEMENT;

ALGORITHMS;

EID: 84873897366 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2013.2243435 Document Type: Article

Times cited : (33)

References (46)

1
- 80052339383
- Some experiments on the recognition of speech, with one and two ears
- E. Cherry, "Some experiments on the recognition of speech, with one and two ears," J. Acoust. Soc. Amer. (JASA), vol. 25, pp. 975-979, 1953.
- (1953) J. Acoust. Soc. Amer. (JASA) , vol.25 , pp. 975-979
- Cherry, E.¹

2
- 0037668478
- A review of the cocktail party effect
- B. Arons, "A review of the cocktail party effect," J. Acoust. Soc. Amer. (JASA), vol. 12, pp. 35-50, 1992.
- (1992) J. Acoust. Soc. Amer. (JASA) , vol.12 , pp. 35-50
- Arons, B.¹

3
- 22944480530
- The cocktail party problem
- S. Haykin and Z. Chen, "The cocktail party problem," Neural Comput., vol. 17, pp. 1875-1902, 2005.
- (2005) Neural Comput. , vol.17 , pp. 1875-1902
- Haykin, S.¹ Chen, Z.²

4
- 27744596913
- Consonant identification in N-talker babble is a nonmonotonic function of N
- S. A. Simpson and M. Cooke, "Consonant identification in N-talker babble is a nonmonotonic function of N," J. Acoust. Soc. Amer. (JASA), vol. 118, no. 5, pp. 2775-2778, 2005.
- (2005) J. Acoust. Soc. Amer. (JASA) , vol.118 , Issue.5 , pp. 2775-2778
- Simpson, S.A.¹ Cooke, M.²

5
- 85008053933
- Babble noise: Modeling, analysis, and applications
- Se
- N. Krishnamurthy and J. Hansen, "Babble noise: Modeling, analysis, and applications," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 7, pp. 1394-1407, Sep. 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.7 , pp. 1394-1407
- Krishnamurthy, N.¹ Hansen, J.²

6
- 0021645331
- Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
- Dec.
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator," IEEE Trans. Audio, Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
- (1984) IEEE Trans. Audio, Speech, Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

7
- 0032654277
- A dynamic system approach to speech enhancement using the filtering algorithm
- Jul
- X. Shen and L. Deng, "A dynamic system approach to speech enhancement using the filtering algorithm," IEEE Trans. Speech Audio Process., vol. 7, no. 4, pp. 391-399, Jul. 1999.
- (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.4 , pp. 391-399
- Shen, X.¹ Deng, L.²

8
- 0036508204
- Particle Methods for Bayesian Modeling and Enhancement of Speech Signals
- Mar
- J. Vermaak, C. Andrieu, A. Doucet, and S. Godsill, "Particle Methods for Bayesian Modeling and Enhancement of Speech Signals," IEEE Trans. Speech Audio Process., vol. 10, no. 3, pp. 173-185, Mar. 2002.
- (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.3 , pp. 173-185
- Vermaak, J.¹ Andrieu, C.² Doucet, A.³ Godsill, S.⁴

9
- 27644556974
- Speech enhancement based on minimum mean-square error estimation and supergaussian priors
- Se
- R. Martin, "Speech enhancement based on minimum mean-square error estimation and supergaussian priors," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 845-856, Sep. 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 845-856
- Martin, R.¹

10
- 34047265321
- On causal algorithms for speech enhancement
- May
- V. Grancharov and J. S. B. Kleijn, "On causal algorithms for speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 3, pp. 764-773, May 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.3 , pp. 764-773
- Grancharov, V.¹ Kleijn, J.S.B.²

11
- 51449104842
- Minimum Mean-Square Error estimation of discrete Fourier coefficients with generalized Gamma priors
- Aug
- J. S. Erkelens, R. C. Hendriks, R. Heusdens, and J. Jensen, "Minimum Mean-Square Error estimation of discrete Fourier coefficients with generalized Gamma priors," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1741-1752, Aug. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.6 , pp. 1741-1752
- Erkelens, J.S.¹ Hendriks, R.C.² Heusdens, R.³ Jensen, J.⁴

12
- 0035708733
- Noise reduction in hearing aids: An overview
- H. Levitt, "Noise reduction in hearing aids: An overview," J. Rehab. Res. Develop., vol. 38, pp. 111-121, 2001.
- (2001) J. Rehab. Res. Develop. , vol.38 , pp. 111-121
- Levitt, H.¹

13
- 33744970011
- Codebook driven short-term predictor parameter estimation for speech enhancement
- Jan
- S. Srinivasan, J. Samuelsson, and W. Kleijn, "Codebook driven short-term predictor parameter estimation for speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 163-176, Jan. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.1 , pp. 163-176
- Srinivasan, S.¹ Samuelsson, J.² Kleijn, W.³

14
- 0026843273
- A Bayesian estimation approach for speech enhancement using hidden Markov models
- Ar
- Y. Ephraim, "A Bayesian estimation approach for speech enhancement using hidden Markov models," IEEE Trans. Signal Process., vol. 40, no. 4, pp. 725-735, Apr. 1992.
- (1992) IEEE Trans. Signal Process. , vol.40 , Issue.4 , pp. 725-735
- Ephraim, Y.¹

15
- 0032166087
- HMM-based strategies for enhancement of speech signals embedded in nonsta-tionary noise
- Se
- H. Sameti, H. Sheikhzadeh, L. Deng, and R. Brennan, "HMM-based strategies for enhancement of speech signals embedded in nonsta-tionary noise," IEEE Trans. Speech Audio Process., vol. 6, no. 5, pp. 445-455, Sep. 1998.
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.5 , pp. 445-455
- Sameti, H.¹ Sheikhzadeh, H.² Deng, L.³ Brennan, R.⁴

16
- 51449116166
- HMM-based gain modeling for enhancement of speech in noise
- Mar
- D. Y. Zhao and W. B. Kleijn, "HMM-based gain modeling for enhancement of speech in noise," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 882-892, Mar. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 882-892
- Zhao, D.Y.¹ Kleijn, W.B.²

17
- 0033592606
- Learning the parts of objects by non-negative matrix factorization
- D. D. Lee and H. S. Seung, "Learning the parts of objects by non-negative matrix factorization," Nature, vol. 401, no. 6755, pp. 788-791, 1999.
- (1999) Nature , vol.401 , Issue.6755 , pp. 788-791
- Lee, D.D.¹ Seung, H.S.²

18
- 38049021850
- Convolutive speech bases and their application to supervised speech separation
- Jan
- P. Smaragdis, "Convolutive speech bases and their application to supervised speech separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 1-12, Jan. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 1-12
- Smaragdis, P.¹

19
- 63249085556
- Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis
- C. Févotte, N. Bertin, and J. L. Durrieu, "Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis," Neural Comput., vol. 21, pp. 793-830, 2009.
- (2009) Neural Comput. , vol.21 , pp. 793-830
- Févotte, C.¹ Bertin, N.² Durrieu, J.L.³

20
- 76949094445
- Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation
- Mar
- A. Ozerov and C. Févotte, "Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 550-563, Mar. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 550-563
- Ozerov, A.¹ Févotte, C.²

21
- 51449092704
- Speech de-noising using nonnegative matrix factorization with priors
- K. W. Wilson, B. Raj, P. Smaragdis, and A. Divakaran, "Speech de-noising using nonnegative matrix factorization with priors," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2008, pp. 4029-4032.
- (2008) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 4029-4032
- Wilson, K.W.¹ Raj, B.² Smaragdis, P.³ Divakaran, A.⁴

22
- 80051625972
- A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics
- May
- G. J. Mysore and P. Smaragdis, "A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process. (ICASSP), May 2011, pp. 17-20.
- (2011) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process. (ICASSP) , pp. 17-20
- Mysore, G.J.¹ Smaragdis, P.²

23
- 84857244217
- A new approach for speech enhancement based on a constrained nonnegative matrix factorization
- N. Mohammadiha, T. Gerkmann, and A. Leijon, "A new approach for speech enhancement based on a constrained nonnegative matrix factorization," in Proc. IEEE Int. Symp. Intell. Signal Process. and Commun. Syst. (ISPACS), 2011, pp. 1-5.
- (2011) Proc. IEEE Int. Symp. Intell. Signal Process. and Commun. Syst. (ISPACS) , pp. 1-5
- Mohammadiha, N.¹ Gerkmann, T.² Leijon, A.³

24
- 83455182002
- A new linear MMSE filter for single channel speech enhancement based on nonnegative matrix factorization
- N. Mohammadiha, T. Gerkmann, and A. Leijon, "A new linear MMSE filter for single channel speech enhancement based on nonnegative matrix factorization," in Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA), 2011, pp. 45-48.
- (2011) Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA) , pp. 45-48
- Mohammadiha, N.¹ Gerkmann, T.² Leijon, A.³

25
- 84867609546
- Single channel speech enhancement using Bayesian NMF with recursive temporal updates of prior distributions
- N. Mohammadiha, J. Taghia, and A. Leijon, "Single channel speech enhancement using Bayesian NMF with recursive temporal updates of prior distributions," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2012, pp. 4561-4564.
- (2012) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 4561-4564
- Mohammadiha, N.¹ Taghia, J.² Leijon, A.³

26
- 0001593436
- Recursive parameter estimation using incomplete data
- D. M. Titterington, "Recursive parameter estimation using incomplete data," J. R. Statist. Soc. Ser. B (Methodological), vol. 46, pp. 257-267, 1984.
- (1984) J. R. Statist. Soc. Ser. B (Methodological) , vol.46 , pp. 257-267
- Titterington, D.M.¹

27
- 0027797470
- On-line estimation of hidden Markov model parameters based on the Kullback-Leibler information measure
- Aug
- V. Krishnamurthy and J. Moore, "On-line estimation of hidden Markov model parameters based on the Kullback-Leibler information measure," IEEE Trans. Signal Process., vol. 41, no. 8, pp. 2557-2573, Aug. 1993.
- (1993) IEEE Trans. Signal Process. , vol.41 , Issue.8 , pp. 2557-2573
- Krishnamurthy, V.¹ Moore, J.²

28
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Feb
- L. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.¹

29
- 13844251687
- New York, NY, USA: Springer
- O. Cappé, E. Moulines, and T. Ryden, Inference in Hidden Markov Models, ser. Springer Series in Statistics. New York, NY, USA: Springer, 2005.
- (2005) Inference in Hidden Markov Models, Ser. Springer Series in Statistics
- Cappé, O.¹ Moulines, E.² Ryden, T.³

30
- 84873620144
- Spectral domain speech enhancement using HMM state-dependent super-Gaussian priors
- Mar
- N. Mohammadiha, R. Martin, and A. Leijon, "Spectral domain speech enhancement using HMM state-dependent super-Gaussian priors," IEEE Signal Process. Lett., vol. 20, no. 3, pp. 253-256, Mar. 2013.
- (2013) IEEE Signal Process. Lett. , vol.20 , Issue.3 , pp. 253-256
- Mohammadiha, N.¹ Martin, R.² Leijon, A.³

31
- 32644447834
- Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models
- Apr.
- I. Cohen, "Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models," Signal Process., vol. 86, no. 4, pp. 698-709, Apr. 2006.
- (2006) Signal Process. , vol.86 , Issue.4 , pp. 698-709
- Cohen, I.¹

32
- 33846516584
- New York, NY USA: Springer-Verlag
- C. M. Bishop, Pattern Recognition and Machine Learning. New York, NY, USA: Springer-Verlag, 2006.
- (2006) Pattern Recognition and Machine Learning
- Bishop, C.M.¹

33
- 0003857778
- Univ. of California, Berkeley, CA, USA Tech. Rep. ICSI-TR-97-021
- J. A. Bilmes, "A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models," Univ. of California, Berkeley, CA, USA, 1997, Tech. Rep. ICSI-TR-97-021.
- (1997) A Gentle Tutorial of the em Algorithm and Its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models
- Bilmes, J.A.¹

34
- 0022685753
- Continuously variable duration hidden Markov models for automatic speech recognition
- S. E. Levinson, "Continuously variable duration hidden Markov models for automatic speech recognition," Comput. Speech Lang., vol. 1, pp. 29-45, 1986.
- (1986) Comput. Speech Lang. , vol.1 , pp. 29-45
- Levinson, S.E.¹

35
- 0037686659
- The concave-convex procedure
- A. L. Yuille and A. Rangarajan, "The concave-convex procedure," Neural Comput., vol. 15, pp. 915-936, 2003.
- (2003) Neural Comput. , vol.15 , pp. 915-936
- Yuille, A.L.¹ Rangarajan, A.²

36
- 78149477774
- On the convergence of the concave-convex procedure
- B. K. Sriperumbudur and G. R. G. Lanckriet, "On the convergence of the concave-convex procedure," in Proc. Adv. in Neural Inf. Process. Syst., 2009.
- (2009) Proc. Adv. in Neural Inf. Process. Syst.
- Sriperumbudur, B.K.¹ Lanckriet, G.R.G.²

37
- 0004055894
- Cambridge U.K.: Cambridge Univ. Press
- S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge, U.K.: Cambridge Univ. Press, 2004.
- (2004) Convex Optimization
- Boyd, S.¹ Vandenberghe, L.²

38
- 0025494624
- Sequential algorithms for parameter estimation based on the Kullback-Leibler information measure
- Se
- E. Weinstein, M. Feder, and A. Oppenheim, "Sequential algorithms for parameter estimation based on the Kullback-Leibler information measure," IEEE Trans. Acoust., Speech, Signal Process., vol. 38, no. 9, pp. 1652-1654, Sep. 1990.
- (1990) IEEE Trans. Acoust., Speech, Signal Process. , vol.38 , Issue.9 , pp. 1652-1654
- Weinstein, E.¹ Feder, M.² Oppenheim, A.³

39
- 0442317754
- Tech. Rep. ETSI ES 202 050 V1.1.5
- "Speech processing, transmission and quality aspects (STQ), distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms," 2007, Tech. Rep. ETSI ES 202 050 V1.1.5.
- (2007) Speech Processing, Transmission and Quality Aspects (STQ), Distributed Speech Recognition; Advanced Front-end Feature Extraction Algorithm; Compression Algorithms

40
- 33744975847
- Performance measurement in blind audio source separation
- Ar
- E. Vincent, R. Gribonval, and C. Févotte, "Performance measurement in blind audio source separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 4, pp. 1462-1469, Apr. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.4 , pp. 1462-1469
- Vincent, E.¹ Gribonval, R.² Févotte, C.³

41
- 34447100796
- Boca Raton, FL, USA: CRC
- P. C. Loizou, Speech Enhancement: Theory and Practice. Boca Raton, FL, USA: CRC, 2007.
- (2007) Speech Enhancement: Theory and Practice
- Loizou, P.C.¹

42
- 0002077742
- Quantization of LPC parameters
- W. Kleijn and K. Paliwal, Eds. New York, NY, USA: Elsevier ch. 12
- K. K. Paliwal and W. B. Kleijn, "Quantization of LPC parameters," in Speech Coding Synth., W. Kleijn and K. Paliwal, Eds. New York, NY, USA: Elsevier, 1995, ch. 12, pp. 443-466.
- (1995) Speech Coding Synth , pp. 443-466
- Paliwal, K.K.¹ Kleijn, W.B.²

43
- 59849095077
- Tech. Rep
- "Perceptual evaluation of speech quality (PESQ), and objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs," 2000, Tech. Rep.
- (2000) Perceptual Evaluation of Speech Quality (PESQ), and Objective Method for End-to-end Speech Quality Assessment of Narrowband Telephone Networks and Speech Codecs

44
- 22944438092
- Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model
- T. Lotter and P. Vary, "Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model," EURASIP J. Appl. Signal Process., vol. 2005, pp. 1110-1126, 2005.
- (2005) EURASIP J. Appl. Signal Process. , vol.2005 , pp. 1110-1126
- Lotter, T.¹ Vary, P.²

45
- 13344250603
- ITU-R Rec. BS.1534-1 Std, 2001-2003
- "Method for the subjective assessment of intermediate quality level of coding systems," ITU-R Rec. BS.1534-1 Std, 2001-2003 [Online]. Available: http://www.itu.int
- Method for the Subjective Assessment of Intermediate Quality Level of Coding Systems

46
- 3042630167
- Characterizations of the distributions of power inverse Gaussian and others based on the entropy maximization principle
- T. Kawamura and K. Iwase, "Characterizations of the distributions of power inverse Gaussian and others based on the entropy maximization principle," J. Jpn. Statist. Soc., vol. 33, no. 1, pp. 95-104, 2003.
- (2003) J. Jpn. Statist. Soc. , vol.33 , Issue.1 , pp. 95-104
- Kawamura, T.¹ Iwase, K.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.