메뉴 건너뛰기




Volumn 126, Issue 3, 2009, Pages 1486-1494

An algorithm that improves speech intelligibility in noise for normal-hearing listeners

Author keywords

[No Author keywords available]

Indexed keywords

BAYESIAN CLASSIFIER; BINARY DECISION; HUMAN LISTENERS; IDEAL BINARY MASK; INPUT SIGNAL; LOW SIGNAL-TO-NOISE RATIO; NORMAL-HEARING LISTENERS; SPEECH QUALITY; SUPPRESSION ALGORITHM; TIME FREQUENCY;

EID: 70349093614     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.3184603     Document Type: Article
Times cited : (306)

References (35)
  • 2
    • 33845354768 scopus 로고    scopus 로고
    • Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
    • DOI 10.1121/1.2363929
    • Brungart, D., Chang, P., Simpson, B., and Wang, D. (2006). " Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation.," J. Acoust. Soc. Am. 120, 4007-4018. 10.1121/1.2363929 (Pubitemid 44888096)
    • (2006) Journal of the Acoustical Society of America , vol.120 , Issue.6 , pp. 4007-4018
    • Brungart, D.S.1    Chang, P.S.2    Simpson, B.D.3    Wang, D.4
  • 3
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • DOI 10.1016/S0167-6393(00)00034-0, PII S0167639300000340
    • Cooke, M., Green, P., Josifovski, L., and Vizinho, A. (2001). " Robust automatic speech recognition with missing and unreliable acoustic data.," Speech Commun. 34, 267-285. 10.1016/S0167-6393(00)00034-0 (Pubitemid 32284867)
    • (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 4
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • 10.1109/TASSP.1980.1163420
    • Davis, S. B., and Mermelstein, P. (1980). " Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences.," IEEE Trans. Acoust., Speech, Signal Process. ASSP-28, 357-336. 10.1109/TASSP.1980.1163420
    • (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.28 , pp. 357-336
    • Davis, S.B.1    Mermelstein, P.2
  • 6
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    • DOI 10.1109/TASSP.1984.1164453
    • Ephraim, Y., and Malah, D. (1984). " Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator.," IEEE Trans. Acoust., Speech, Signal Process. ASSP-32, 1109-1121. 10.1109/TASSP.1984. 1164453 (Pubitemid 15159457)
    • (1984) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 7
    • 0022667694 scopus 로고
    • Speaker independent isolated word recognition using dynamic features of speech spectrum
    • 10.1109/TASSP.1986.1164788
    • Furui, S. (1986). " Speaker independent isolated word recognition using dynamic features of speech spectrum.," IEEE Trans. Acoust., Speech, Signal Process. ASSP-34, 52-59. 10.1109/TASSP.1986.1164788
    • (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.34 , pp. 52-59
    • Furui, S.1
  • 8
    • 4644265990 scopus 로고    scopus 로고
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • 10.1109/TNN.2004.832812
    • Hu, G., and Wang, D. L. (2004). " Monaural speech segregation based on pitch tracking and amplitude modulation.," IEEE Trans. Neural Netw. 15, 1135-1150. 10.1109/TNN.2004.832812
    • (2004) IEEE Trans. Neural Netw. , vol.15 , pp. 1135-1150
    • Hu, G.1    Wang, D.L.2
  • 9
    • 49249107353 scopus 로고    scopus 로고
    • Segregation of unvoiced speech from nonspeech interference
    • 10.1121/1.2939132
    • Hu, G., and Wang, D. L. (2008). " Segregation of unvoiced speech from nonspeech interference.," J. Acoust. Soc. Am. 124, 1306-1319. 10.1121/1.2939132
    • (2008) J. Acoust. Soc. Am. , vol.124 , pp. 1306-1319
    • Hu, G.1    Wang, D.L.2
  • 10
    • 35248891610 scopus 로고    scopus 로고
    • A comparative intelligibility study of single-microphone noise reduction algorithms
    • DOI 10.1121/1.2766778
    • Hu, Y., and Loizou, P. C. (2007a). " A comparative intelligibility study of single-microphone noise reduction algorithms.," J. Acoust. Soc. Am. 122, 1777-1786. 10.1121/1.2766778 (Pubitemid 47560539)
    • (2007) Journal of the Acoustical Society of America , vol.122 , Issue.3 , pp. 1777-1786
    • Hu, Y.1    Loizou, P.C.2
  • 11
    • 34447092407 scopus 로고    scopus 로고
    • Subjective comparison and evaluation of speech enhancement algorithms
    • DOI 10.1016/j.specom.2006.12.006, PII S0167639306001920
    • Hu, Y., and Loizou, P. C. (2007b). " Subjective evaluation and comparison of speech enhancement algorithms.," Speech Commun. 49, 588-601. 10.1016/j.specom.2006.12.006 (Pubitemid 47031352)
    • (2007) Speech Communication , vol.49 , Issue.7-8 , pp. 588-601
    • Hu, Y.1    Loizou, P.C.2
  • 12
    • 77956534281 scopus 로고    scopus 로고
    • in The 11th International Workshoon Acoustic Echo and Noise Control, Seattle, WA
    • Hu, Y., and Loizou, P. C. (2008). " Techniques for estimating the ideal binary mask.," in The 11th International Workshop on Acoustic Echo and Noise Control, Seattle, WA
    • (2008) Techniques for Estimating the Ideal Binary Mask
    • Hu, Y.1    Loizou, P.C.2
  • 13
    • 0014568991 scopus 로고
    • IEEE recommended practice for speech quality measurements
    • IEEE. ",",. 10.1109/TAU.1969.1162058
    • IEEE (1969). " IEEE recommended practice for speech quality measurements.," IEEE Trans. Audio Electroacoust. 17, 225-246. 10.1109/TAU.1969.1162058
    • (1969) IEEE Trans. Audio Electroacoust. , vol.17 , pp. 225-246
  • 14
    • 0028297185 scopus 로고
    • Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction
    • 10.1121/1.408546
    • Kollmeier, B., and Koch, R. (1994). " Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction.," J. Acoust. Soc. Am. 95, 1593-1602. 10.1121/1.408546
    • (1994) J. Acoust. Soc. Am. , vol.95 , pp. 1593-1602
    • Kollmeier, B.1    Koch, R.2
  • 15
    • 0024241221 scopus 로고
    • Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms
    • Langner, G., and Schreiner, C. (1988). " Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms.," J. Neurophysiol. 60, 1799-1822. (Pubitemid 19017451)
    • (1988) Journal of Neurophysiology , vol.60 , Issue.6 , pp. 1799-1822
    • Langner, G.1    Schreiner, C.E.2
  • 16
    • 41849093721 scopus 로고    scopus 로고
    • Effect of spectral resolution on the intelligibility of ideal binary masked speech
    • 10.1121/1.2884086
    • Li, N., and Loizou, P. C. (2008a). " Effect of spectral resolution on the intelligibility of ideal binary masked speech.," J. Acoust. Soc. Am. 123, EL59-EL64. 10.1121/1.2884086
    • (2008) J. Acoust. Soc. Am. , vol.123
    • Li, N.1    Loizou, P.C.2
  • 17
    • 40749125179 scopus 로고    scopus 로고
    • Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
    • 10.1121/1.2832617
    • Li, N., and Loizou, P. C. (2008b). " Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction.," J. Acoust. Soc. Am. 123, 1673-1682. 10.1121/1.2832617
    • (2008) J. Acoust. Soc. Am. , vol.123 , pp. 1673-1682
    • Li, N.1    Loizou, P.C.2
  • 18
    • 0018027039 scopus 로고
    • Evaluation of a correlation subtraction method for enhancing speech degraded by additive white noise
    • 10.1109/TASSP.1978.1163129
    • Lim, J. S. (1978). " Evaluation of a correlation subtraction method for enhancing speech degraded by additive white noise.," IEEE Trans. Acoust., Speech, Signal Process. 26, 471-472. 10.1109/TASSP.1978.1163129
    • (1978) IEEE Trans. Acoust., Speech, Signal Process. , vol.26 , pp. 471-472
    • Lim, J.S.1
  • 19
    • 0031187171 scopus 로고    scopus 로고
    • Speech recognition by machines and humans
    • 10.1016/S0167-6393(97)00021-6
    • Lippmann, R. P. (1997). " Speech recognition by machines and humans.," Speech Commun. 22, 1-15. 10.1016/S0167-6393(97)00021-6
    • (1997) Speech Commun. , vol.22 , pp. 1-15
    • Lippmann, R.P.1
  • 23
    • 2942665634 scopus 로고    scopus 로고
    • An efficient robust sound classification algorithm for hearing aids
    • DOI 10.1121/1.1710877
    • Nordqvist, P., and Leijon, A. (2004). " An efficient robust sound classification algorithm for hearing aids.," J. Acoust. Soc. Am. 115, 3033-3041. 10.1121/1.1710877 (Pubitemid 38781236)
    • (2004) Journal of the Acoustical Society of America , vol.115 , Issue.6 , pp. 3033-3041
    • Nordqvist, P.1    Leijon, A.2
  • 24
    • 0141595299 scopus 로고    scopus 로고
    • The power of speech
    • DOI 10.1126/science.1088904
    • Rabiner, L. (2003). " The power of speech.," Science 301, 1494-1495. 10.1126/science.1088904 (Pubitemid 37128532)
    • (2003) Science , vol.301 , Issue.5639 , pp. 1494-1495
    • Rabiner, L.1
  • 25
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • 10.1109/89.365379
    • Reynolds, D., and Rose, R. (1995). " Robust text-independent speaker identification using Gaussian mixture speaker models.," IEEE Trans. Speech Audio Process. 3, 72-83. 10.1109/89.365379
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , pp. 72-83
    • Reynolds, D.1    Rose, R.2
  • 26
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted Gaussian mixture models
    • DOI 10.1006/dspr.1999.0361
    • Reynolds, D., Quatieri, T., and Dunn, R. (2000). " Speaker verification using adapted Gaussian mixture models.," Digit. Signal Process. 10, 19-41. 10.1006/dspr.1999.0361 (Pubitemid 30592166)
    • (2000) Digital Signal Processing: A Review Journal , vol.10 , Issue.1 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 28
    • 34247580087 scopus 로고    scopus 로고
    • Reaching over the gap: A review of efforts to link human and automatic speech recognition research
    • DOI 10.1016/j.specom.2007.01.009, PII S0167639307000106, Bridging the Gap between Human and Automatic Speech Recognition
    • Scharenborg, O. (2007). " Reaching over the gap: A review of efforts to link human and automatic speech recognition research.," Speech Commun. 49, 336-347. 10.1016/j.specom.2007.01.009 (Pubitemid 46670364)
    • (2007) Speech Communication , vol.49 , Issue.5 , pp. 336-347
    • Scharenborg, O.1
  • 29
    • 4644317224 scopus 로고    scopus 로고
    • A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
    • ",. 10.1016/j.specom.2004.03.006
    • Seltzer, M., Raj, B., and Stern, R. (2004). " A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition.," Speech Commun. 43, 379-393. 10.1016/j.specom.2004.03.006
    • (2004) Speech Commun. , vol.43 , pp. 379-393
    • Seltzer, M.1    Raj, B.2    Stern, R.3
  • 30
    • 15844428932 scopus 로고    scopus 로고
    • Human and machine consonant recognition
    • DOI 10.1016/j.specom.2004.11.009, PII S0167639304001499
    • Sroka, J. J., and Braida, L. D. (2005). " Human and machine consonant recognition.," Speech Commun. 45, 401-423. 10.1016/j.specom.2004. 11.009 (Pubitemid 40423287)
    • (2005) Speech Communication , vol.45 , Issue.4 , pp. 401-423
    • Sroka, J.J.1    Braida, L.D.2
  • 31
    • 0038712550 scopus 로고    scopus 로고
    • SNR estimation based on amplitude modulation analysis with applications to noise suppression
    • 10.1109/TSA.2003.811542
    • Tchorz, J., and Kollmeier, B. (2003). " SNR estimation based on amplitude modulation analysis with applications to noise suppression.," IEEE Trans. Speech Audio Process. 11, 184-192. 10.1109/TSA.2003.811542
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , pp. 184-192
    • Tchorz, J.1    Kollmeier, B.2
  • 32
    • 0027623210 scopus 로고
    • Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • 10.1016/0167-6393(93)90095-3
    • Varga, A., and Steeneken, H. J. M. (1993). " Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems.," Speech Commun. 12, 247-251. 10.1016/0167-6393(93)90095-3
    • (1993) Speech Commun. , vol.12 , pp. 247-251
    • Varga, A.1    Steeneken, H.J.M.2
  • 34
    • 35848945907 scopus 로고    scopus 로고
    • The design and evaluation of a hearing aid with trainable amplification parameters
    • DOI 10.1097/AUD.0b013e3181576738, PII 0000344620071200000010
    • Zakis, J. A., Dillon, H., and McDermott, H. J. (2007). " The design and evaluation of a hearing aid with trainable amplification parameters.," Ear Hear. 28, 812-830. 10.1097/AUD.0b013e3181576738 (Pubitemid 350059322)
    • (2007) Ear and Hearing , vol.28 , Issue.6 , pp. 812-830
    • Zakis, J.A.1    Dillon, H.2    McDermott, H.J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.