Volume 4, Issue 5, 2010, Pages 808-815

Improved speech presence probabilities using HMM-based inference, with applications to speech enhancement and ASR

Author keywords

Automatic speech recognition (ASR); hidden Markov models (HMMs); noise suppression; soft-decision speech enhancement; speech presence probabilities (SPPs)

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; FORWARD BACKWARD ALGORITHMS; INFERENCE TECHNIQUES; KULLBACK-LEIBLER DISTANCE; LOW COMPLEXITY; MARKOV MODEL; NOISE SUPPRESSION; NONSTATIONARY NOISE; PARAMETERIZED; RECOGNITION PERFORMANCE; RESOURCE-CONSTRAINED; SIGNAL MODELS; SOFT-DECISION SPEECH ENHANCEMENT; SPEECH DATA; SPEECH DISTORTION; STATE-OF-THE-ART METHODS; STATISTICAL INFERENCE; TEMPORAL CORRELATIONS; TWO-STATE;

EID: 77956740715     PISSN: 19324553     EISSN: None     Source Type: Journal    
DOI: 10.1109/JSTSP.2010.2048605     Document Type: Article
Times cited : (5)

References (23)
  • 1
    • 0019009880
    • Speech enhancement using a soft-decision noise suppression filter
    • Apr
    • R. J. McAulay and M. L. Malpass, "Speech enhancement using a soft-decision noise suppression filter," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 2, pp. 137-145, Apr. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-28 , Issue.2 , pp. 137-145
    • McAulay, R.J.1    Malpass, M.L.2
  • 3
    • 0021892216
    • Speech enhancement using a minimum mean-square log-spectral amplitude estimator
    • Apr
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-33, no.2, pp. 443-445, Apr. 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 4
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol.77, no.2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 6
    • 0032762471
    • A statistical model-based voice activity detection
    • Jan
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., vol.6, no.1, pp. 1-3, Jan. 1999.
    • (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 7
    • 0032667465
    • Tracking speech presence uncertainty to improve speech enhancement in non-stationary noise environments
    • D. Malah, R. V. Cox, and A. J. Accardi, "Tracking speech presence uncertainty to improve speech enhancement in non-stationary noise environments," in Proc. ICASSP, 1999, vol.2, pp. 789-792.
    • (1999) Proc. ICASSP , vol.2 , pp. 789-792
    • Malah, D.1    Cox, R.V.2    Accardi, A.J.3
  • 8
    • 19944382585
    • Enabling new speech driven services for mobile devices: An overview of the ETSI standards activities for distributed speech recognition front-ends
    • May
    • D. Pearce, "Enabling new speech driven services for mobile devices: An overview of the ETSI standards activities for distributed speech recognition front-ends," in Proc. AVIOS 2000: Speech Appl. Conf., May 2000, vol. 5, pp. 1-6.
    • (2000) Proc. AVIOS 2000: Speech Appl. Conf. , vol.5 , pp. 1-6
    • Pearce, D.1
  • 9
    • 0038669544
    • The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • H.-G. Hirsch and D. Pearce, "The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions," ASR2000-Autom. Speech Recognition: Challenges New Millenn., 2000.
    • (2000) ASR2000-Autom. Speech Recognition: Challenges New Millenn
    • Hirsch, H.-G.1    Pearce, D.2
  • 10
    • 85009078216
    • Entropy based voice activity detection in very noisy conditions
    • P. Renevey and A. Drygajlo, "Entropy based voice activity detection in very noisy conditions," in Proc. Eurospeech, 2001, pp. 1887-1890.
    • (2001) Proc. Eurospeech , pp. 1887-1890
    • Renevey, P.1    Drygajlo, A.2
  • 11
    • 0035396555
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • Jul.
    • R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process., vol.9, no.5, pp. 504-512, Jul. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 504-512
    • Martin, R.1
  • 12
    • 0035500783
    • Speech enhancement for non-stationary noise environments
    • I. Cohen and B. Berdugo, "Speech enhancement for non-stationary noise environments," Signal Process., vol.81, no.11, pp. 2403-2418, 2001.
    • (2001) Signal Process , vol.81 , Issue.11 , pp. 2403-2418
    • Cohen, I.1    Berdugo, B.2
  • 14
    • 0036543522
    • Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator
    • Apr
    • I. Cohen, "Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator," IEEE Signal Process. Lett., vol.9, no.4, pp. 113-116, Apr. 2002.
    • (2002) IEEE Signal Process. Lett. , vol.9 , Issue.4 , pp. 113-116
    • Cohen, I.1
  • 15
    • 0041360463
    • Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
    • Sep
    • I. Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging," IEEE Trans. Speech Audio Process., vol.11, no.5, pp. 466-475, Sep. 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.5 , pp. 466-475
    • Cohen, I.1
  • 17
    • 1842476689
    • Efficient voice activity detection algorithms using long-term speech information
    • J. Ramirez, J. C. Segura, C. Benitez, A. de la Torre, and A. Rubio, "Efficient voice activity detection algorithms using long-term speech information," Speech Commun., vol. 42, no. 3-4, pp. 271-287, 2004.
    • (2004) Speech Commun , vol.42 , Issue.3 , pp. 271-287
    • Ramirez, J.1    Segura, J.C.2    Benitez, C.3    De La Torre, A.4    Rubio, A.5
  • 20
    • 33846907750
    • A Laplacian-based MMSE estimator for speech enhancement
    • B. Chen and P. Loizou, "A Laplacian-based MMSE estimator for speech enhancement," Speech Commun., vol. 49, no. 2, pp. 134-143, 2007.
    • (2007) Speech Commun , vol.49 , Issue.2 , pp. 134-143
    • Chen, B.1    Loizou, P.2
  • 21
    • 77956738115
    • Speech Processing, Transmission, and Quality Aspects (STQ); Distributed Speech Recognition; Front-end Feature Extraction Algorithms; Compression Algorithms, ETSI ES 202 050 v1.1.1 (2007-10), ETSI Standard Doc.
  • 22
    • 66149120230
    • Improved a posteriori speech presence probability estimation based on a likelihood ratio with fixed priors
    • Jul.
    • T. Gerkmann, C. Breithaupt, and R. Martin, "Improved a posteriori speech presence probability estimation based on a likelihood ratio with fixed priors," IEEE Trans. Audio, Speech, Lang. Process., vol.16, no.5, pp. 910-919, Jul. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.5 , pp. 910-919
    • Gerkmann, T.1    Breithaupt, C.2    Martin, R.3
  • 23
    • 77956759225
    • [Online]. Available: http://webee.technion.ac.il/Sites/People/Israel- Cohen/


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.