



Volume 18, Issue 8, 2010, Pages 2080-2090

Improving speech intelligibility in noise using environment-optimized algorithms

Author keywords

Environment optimized algorithms; Speech enhancement; Speech intelligibility

Indexed keywords

ACOUSTIC ENVIRONMENT; BAYESIAN CLASSIFIER; BINARY DECISION; INCREMENTAL APPROACH; INPUT SIGNAL; MODEL PARAMETERS; OPTIMIZED ALGORITHMS; SPEECH ENHANCEMENT ALGORITHM; SPEECH QUALITY; TARGET SIGNALS; TIME FREQUENCY;

EID: 77956547397     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2041116     Document Type: Article
Times cited : (52)

References (36)
  • 2
    • 35248891610
    • A comparative intelligibility study of single-microphone noise reduction algorithms
    • DOI 10.1121/1.2766778
    • Y. Hu and P.C. Loizou, "A comparative intelligibility study of single-microphone noise reduction algorithms," J. Acoust. Soc. Amer., vol. 122, pp. 1777-1786, 2007. (Pubitemid 47560539)
    • (2007) Journal of the Acoustical Society of America , vol.122 , Issue.3 , pp. 1777-1786
    • Hu, Y.1    Loizou, P.C.2
  • 3
    • 0018027039
    • Evaluation of a correlation subtraction method for enhancing speech degraded by additive white noise
    • Oct.
    • J.S. Lim, "Evaluation of a correlation subtraction method for enhancing speech degraded by additive white noise," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-26, no. 5, pp. 471-472, Oct. 1978.
    • (1978) IEEE Trans. Acoust. Speech Signal Process. , vol.26 ASSP , Issue.5 , pp. 471-472
    • Lim, J.S.1
  • 4
    • 35848945907
    • The design and evaluation of a hearing aid with trainable amplification parameters
    • J.A. Zakis, H. Dillon, and H.J. McDermott, "The design and evaluation of a hearing aid with trainable amplification parameters," Ear Hear., vol. 28, no. 6, pp. 812-830, 2007.
    • (2007) Ear Hear. , vol.28 , Issue.6 , pp. 812-830
    • Zakis, J.A.1    Dillon, H.2    McDermott, H.J.3
  • 6
    • 33744970011
    • Codebook driven short-term predictor parameter estimation for speech enhancement
    • Jan.
    • S. Srinivasan, J. Samuelsson, and W.B. Kleijn, "Codebook driven short-term predictor parameter estimation for speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 163-176, Jan. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.1 , pp. 163-176
    • Srinivasan, S.1    Samuelsson, J.2    Kleijn, W.B.3
  • 7
    • 84862603071
    • A general optimization procedure for spectral speech enhancement methods
    • Florence, Italy Sep.
    • J. Erkelens, J. Jensen, and R. Heusdens, "A general optimization procedure for spectral speech enhancement methods," in Proc. Eur. Signal Proc. Conf., Florence, Italy, Sep. 2006.
    • (2006) Proc. Eur. Signal Proc. Conf.
    • Erkelens, J.1    Jensen, J.2    Heusdens, R.3
  • 8
    • 34447099536
    • A data-driven approach to optimizing spectral speech enhancement methods for various error criteria
    • J. Erkelens, J. Jensen, and R. Heusdens, "A data-driven approach to optimizing spectral speech enhancement methods for various error criteria," Speech Commun., vol. 49, pp. 530-541, 2007.
    • (2007) Speech Commun. , vol.49 , pp. 530-541
    • Erkelens, J.1    Jensen, J.2    Heusdens, R.3
  • 11
    • 0021645331
    • Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    • Dec.
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.32 ASSP , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 12
    • 22944438092
    • Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model
    • T. Lotter and P. Vary, "Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model," EURASIP J. Appl. Signal Process., vol. 7, pp. 1110-1126, 2005.
    • (2005) EURASIP J. Appl. Signal Process. , vol.7 , pp. 1110-1126
    • Lotter, T.1    Vary, P.2
  • 13
    • 33846907750
    • A Laplacian-based MMSE estimator for speech enhancement
    • C. Bin and P.C. Loizou, "A Laplacian-based MMSE estimator for speech enhancement," Speech Commun., pp. 134-143, 2007.
    • (2007) Speech Commun. , pp. 134-143
    • Bin, C.1    Loizou, P.C.2
  • 14
    • 27644515429
    • Speech enhancement based on perceptually motivated Bayesian estimators of the magnitude spectrum
    • Sep.
    • P.C. Loizou, "Speech enhancement based on perceptually motivated Bayesian estimators of the magnitude spectrum," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 857-869, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 857-869
    • Loizou, P.C.1
  • 15
    • 33845354768
    • Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
    • D. Brungart, P. Chang, B. Simpson, and D. Wang, "Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation," J. Acoust. Soc. Amer., vol. 120, pp. 4007-4018, 2006.
    • (2006) J. Acoust. Soc. Amer. , vol.120 , pp. 4007-4018
    • Brungart, D.1    Chang, P.2    Simpson, B.3    Wang, D.4
  • 16
    • 40749125179
    • Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
    • N. Li and P.C. Loizou, "Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction," J. Acoust. Soc. Amer., vol. 123, no. 3, pp. 1673-1682, 2008.
    • (2008) J. Acoust. Soc. Amer. , vol.123 , Issue.3 , pp. 1673-1682
    • Li, N.1    Loizou, P.C.2
  • 17
    • 41849093721
    • Effect of spectral resolution on the intelligibility of ideal binary masked speech
    • N. Li and P.C. Loizou, "Effect of spectral resolution on the intelligibility of ideal binary masked speech," J. Acoust. Soc. Amer., vol. 123, no. 4, pp. 59-64, 2008.
    • (2008) J. Acoust. Soc. Amer. , vol.123 , Issue.4 , pp. 59-64
    • Li, N.1    Loizou, P.C.2
  • 18
    • 0035342414
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, pp. 267-285, 2001.
    • (2001) Speech Commun. , vol.34 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 19
    • 70349093614
    • An algorithm that improves speech intelligibility in noise for normal-hearing listeners
    • G. Kim, Y. Lu, Y. Hu, and P.C. Loizou, "An algorithm that improves speech intelligibility in noise for normal-hearing listeners," J. Acoust. Soc. Amer., vol. 126, no. 3, pp. 1486-1494, 2009.
    • (2009) J. Acoust. Soc. Amer. , vol.126 , Issue.3 , pp. 1486-1494
    • Kim, G.1    Lu, Y.2    Hu, Y.3    Loizou, P.C.4
  • 20
    • 0028297185
    • Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction
    • B. Kollmeier and R. Koch, "Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction," J. Acoust. Soc. Amer., vol. 95, no. 3, pp. 1593-1602, 1994.
    • (1994) J. Acoust. Soc. Amer. , vol.95 , Issue.3 , pp. 1593-1602
    • Kollmeier, B.1    Koch, R.2
  • 21
    • 4644317224
    • A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
    • M. Seltzer, B. Raj, and R. Stern, "A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition," Speech Commun., vol. 43, pp. 379-393, 2004.
    • (2004) Speech Commun. , vol.43 , pp. 379-393
    • Seltzer, M.1    Raj, B.2    Stern, R.3
  • 22
    • 0024241221
    • Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms
    • G. Langner and C. Schreiner, "Periodicity coding in the inferior colliculus of the cat. I: Neuronal mechanisms," J. Neurophysiol., vol. 60, no. 6, pp. 1799-1822, 1988. (Pubitemid 19017451)
    • (1988) Journal of Neurophysiology , vol.60 , Issue.6 , pp. 1799-1822
    • Langner, G.1    Schreiner, C.E.2
  • 23
    • 0038712550
    • SNR estimation based on amplitude modulation analysis with applications to noise suppression
    • May
    • J. Tchorz and B. Kollmeier, "SNR estimation based on amplitude modulation analysis with applications to noise suppression," IEEE Trans. Speech Audio Process., vol. 11, no. 3, pp. 184-192, May 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.3 , pp. 184-192
    • Tchorz, J.1    Kollmeier, B.2
  • 25
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr.
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 26
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C.J. Leggetter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 27
    • 0030105005
    • On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition
    • Q. Huo, C. Chan, and C.-H. Lee, "On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 2, pp. 141-144, 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.2 , pp. 141-144
    • Huo, Q.1    Chan, C.2    Lee, C.-H.3
  • 28
    • 0031103160
    • On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate
    • Mar.
    • Q. Huo and C.-H. Lee, "On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate," IEEE Trans. Speech Audio Process., vol. 5, no. 2, pp. 161-172, Mar. 1997.
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.2 , pp. 161-172
    • Huo, Q.1    Lee, C.-H.2
  • 29
    • 0014568991
    • IEEE recommended practice for speech quality measurements
    • Sep.
    • "IEEE recommended practice for speech quality measurements," IEEE Trans. Audio Electroacoust., vol. 19, no. 3, pp. 225-246, Sep. 1969.
    • (1969) IEEE Trans. Audio Electroacoust. , vol.19 , Issue.3 , pp. 225-246
  • 30
    • 0027623210
    • Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • A. Varga and H.J.M. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Commun., vol. 12, pp. 247-251, 1993.
    • (1993) Speech Commun. , vol.12 , pp. 247-251
    • Varga, A.1    Steeneken, H.J.M.2
  • 32
    • 4644265990
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • Sep.
    • G. Hu and D.L. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
    • (2004) IEEE Trans. Neural Netw. , vol.15 , Issue.5 , pp. 1135-1150
    • Hu, G.1    Wang, D.L.2
  • 33
    • 49249107353
    • Segregation of unvoiced speech from nonspeech interference
    • G. Hu and D.L. Wang, "Segregation of unvoiced speech from nonspeech interference," J. Acoust. Soc. Amer., vol. 124, pp. 1306-1319, 2008.
    • (2008) J. Acoust. Soc. Amer. , vol.124 , pp. 1306-1319
    • Hu, G.1    Wang, D.L.2
  • 35
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug.
    • S.B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 ASSP , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.