SCOPUS 정보 검색 플랫폼

Journal of the Acoustical Society of America

Volumn 126, Issue 3, 2009, Pages 1486-1494

An algorithm that improves speech intelligibility in noise for normal-hearing listeners

(4) Kim, Gibak a Lu, Yang a Hu, Yi a Loizou, Philipos C a

a The University of Texas at Dallas (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BAYESIAN CLASSIFIER; BINARY DECISION; HUMAN LISTENERS; IDEAL BINARY MASK; INPUT SIGNAL; LOW SIGNAL-TO-NOISE RATIO; NORMAL-HEARING LISTENERS; SPEECH QUALITY; SUPPRESSION ALGORITHM; TIME FREQUENCY;

ACOUSTIC INTENSITY; ALGORITHMS; AUDITION; SIGNAL TO NOISE RATIO;

SPEECH INTELLIGIBILITY;

ALGORITHM; ARTICLE; AUDITORY STIMULATION; BAYES THEOREM; FEMALE; HEARING; HUMAN; MALE; NOISE REDUCTION; PRIORITY JOURNAL; SIGNAL NOISE RATIO; SPEECH INTELLIGIBILITY;

ACOUSTIC STIMULATION; ALGORITHMS; ARTIFICIAL INTELLIGENCE; BAYES THEOREM; DATABASES AS TOPIC; FEMALE; HUMANS; MALE; NOISE; PATTERN RECOGNITION, PHYSIOLOGICAL; PERCEPTUAL MASKING; PSYCHOACOUSTICS; SOUND SPECTROGRAPHY; SPEECH; SPEECH INTELLIGIBILITY; SPEECH PERCEPTION;

EID: 70349093614 PISSN: 00014966 EISSN: None Source Type: Journal
DOI: 10.1121/1.3184603 Document Type: Article

Times cited : (306)

References (35)

1
- 0003684441
- (MIT, Cambridge, MA).
- Bregman, A. S. (1990). Auditory Scene Analysis (MIT, Cambridge, MA).
- (1990) Auditory Scene Analysis
- Bregman, A.S.¹

2
- 33845354768
- Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
- DOI 10.1121/1.2363929
- Brungart, D., Chang, P., Simpson, B., and Wang, D. (2006). " Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation.," J. Acoust. Soc. Am. 120, 4007-4018. 10.1121/1.2363929 (Pubitemid 44888096)
- (2006) Journal of the Acoustical Society of America , vol.120 , Issue.6 , pp. 4007-4018
- Brungart, D.S.¹ Chang, P.S.² Simpson, B.D.³ Wang, D.⁴

3
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- DOI 10.1016/S0167-6393(00)00034-0, PII S0167639300000340
- Cooke, M., Green, P., Josifovski, L., and Vizinho, A. (2001). " Robust automatic speech recognition with missing and unreliable acoustic data.," Speech Commun. 34, 267-285. 10.1016/S0167-6393(00)00034-0 (Pubitemid 32284867)
- (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

4
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- 10.1109/TASSP.1980.1163420
- Davis, S. B., and Mermelstein, P. (1980). " Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences.," IEEE Trans. Acoust., Speech, Signal Process. ASSP-28, 357-336. 10.1109/TASSP.1980.1163420
- (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.28 , pp. 357-336
- Davis, S.B.¹ Mermelstein, P.²

5
- 0002629270
- Maximum likelihood from incomplete data via the em algorithm
- Dempster, A. P., Laird, N. M., and Rubin, D. B. (1977). " Maximum likelihood from incomplete data via the EM algorithm.," J. R. Stat. Soc. Ser. B (Methodol.) 39, 1-38.
- (1977) J. R. Stat. Soc. Ser. B (Methodol.) , vol.39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

6
- 0021645331
- Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
- DOI 10.1109/TASSP.1984.1164453
- Ephraim, Y., and Malah, D. (1984). " Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator.," IEEE Trans. Acoust., Speech, Signal Process. ASSP-32, 1109-1121. 10.1109/TASSP.1984. 1164453 (Pubitemid 15159457)
- (1984) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

7
- 0022667694
- Speaker independent isolated word recognition using dynamic features of speech spectrum
- 10.1109/TASSP.1986.1164788
- Furui, S. (1986). " Speaker independent isolated word recognition using dynamic features of speech spectrum.," IEEE Trans. Acoust., Speech, Signal Process. ASSP-34, 52-59. 10.1109/TASSP.1986.1164788
- (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.34 , pp. 52-59
- Furui, S.¹

8
- 4644265990
- Monaural speech segregation based on pitch tracking and amplitude modulation
- 10.1109/TNN.2004.832812
- Hu, G., and Wang, D. L. (2004). " Monaural speech segregation based on pitch tracking and amplitude modulation.," IEEE Trans. Neural Netw. 15, 1135-1150. 10.1109/TNN.2004.832812
- (2004) IEEE Trans. Neural Netw. , vol.15 , pp. 1135-1150
- Hu, G.¹ Wang, D.L.²

9
- 49249107353
- Segregation of unvoiced speech from nonspeech interference
- 10.1121/1.2939132
- Hu, G., and Wang, D. L. (2008). " Segregation of unvoiced speech from nonspeech interference.," J. Acoust. Soc. Am. 124, 1306-1319. 10.1121/1.2939132
- (2008) J. Acoust. Soc. Am. , vol.124 , pp. 1306-1319
- Hu, G.¹ Wang, D.L.²

10
- 35248891610
- A comparative intelligibility study of single-microphone noise reduction algorithms
- DOI 10.1121/1.2766778
- Hu, Y., and Loizou, P. C. (2007a). " A comparative intelligibility study of single-microphone noise reduction algorithms.," J. Acoust. Soc. Am. 122, 1777-1786. 10.1121/1.2766778 (Pubitemid 47560539)
- (2007) Journal of the Acoustical Society of America , vol.122 , Issue.3 , pp. 1777-1786
- Hu, Y.¹ Loizou, P.C.²

11
- 34447092407
- Subjective comparison and evaluation of speech enhancement algorithms
- DOI 10.1016/j.specom.2006.12.006, PII S0167639306001920
- Hu, Y., and Loizou, P. C. (2007b). " Subjective evaluation and comparison of speech enhancement algorithms.," Speech Commun. 49, 588-601. 10.1016/j.specom.2006.12.006 (Pubitemid 47031352)
- (2007) Speech Communication , vol.49 , Issue.7-8 , pp. 588-601
- Hu, Y.¹ Loizou, P.C.²

12
- 77956534281
- in The 11th International Workshoon Acoustic Echo and Noise Control, Seattle, WA
- Hu, Y., and Loizou, P. C. (2008). " Techniques for estimating the ideal binary mask.," in The 11th International Workshop on Acoustic Echo and Noise Control, Seattle, WA
- (2008) Techniques for Estimating the Ideal Binary Mask
- Hu, Y.¹ Loizou, P.C.²

13
- 0014568991
- IEEE recommended practice for speech quality measurements
- IEEE. ",",. 10.1109/TAU.1969.1162058
- IEEE (1969). " IEEE recommended practice for speech quality measurements.," IEEE Trans. Audio Electroacoust. 17, 225-246. 10.1109/TAU.1969.1162058
- (1969) IEEE Trans. Audio Electroacoust. , vol.17 , pp. 225-246

14
- 0028297185
- Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction
- 10.1121/1.408546
- Kollmeier, B., and Koch, R. (1994). " Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction.," J. Acoust. Soc. Am. 95, 1593-1602. 10.1121/1.408546
- (1994) J. Acoust. Soc. Am. , vol.95 , pp. 1593-1602
- Kollmeier, B.¹ Koch, R.²

15
- 0024241221
- Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms
- Langner, G., and Schreiner, C. (1988). " Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms.," J. Neurophysiol. 60, 1799-1822. (Pubitemid 19017451)
- (1988) Journal of Neurophysiology , vol.60 , Issue.6 , pp. 1799-1822
- Langner, G.¹ Schreiner, C.E.²

16
- 41849093721
- Effect of spectral resolution on the intelligibility of ideal binary masked speech
- 10.1121/1.2884086
- Li, N., and Loizou, P. C. (2008a). " Effect of spectral resolution on the intelligibility of ideal binary masked speech.," J. Acoust. Soc. Am. 123, EL59-EL64. 10.1121/1.2884086
- (2008) J. Acoust. Soc. Am. , vol.123
- Li, N.¹ Loizou, P.C.²

17
- 40749125179
- Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
- 10.1121/1.2832617
- Li, N., and Loizou, P. C. (2008b). " Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction.," J. Acoust. Soc. Am. 123, 1673-1682. 10.1121/1.2832617
- (2008) J. Acoust. Soc. Am. , vol.123 , pp. 1673-1682
- Li, N.¹ Loizou, P.C.²

18
- 0018027039
- Evaluation of a correlation subtraction method for enhancing speech degraded by additive white noise
- 10.1109/TASSP.1978.1163129
- Lim, J. S. (1978). " Evaluation of a correlation subtraction method for enhancing speech degraded by additive white noise.," IEEE Trans. Acoust., Speech, Signal Process. 26, 471-472. 10.1109/TASSP.1978.1163129
- (1978) IEEE Trans. Acoust., Speech, Signal Process. , vol.26 , pp. 471-472
- Lim, J.S.¹

19
- 0031187171
- Speech recognition by machines and humans
- 10.1016/S0167-6393(97)00021-6
- Lippmann, R. P. (1997). " Speech recognition by machines and humans.," Speech Commun. 22, 1-15. 10.1016/S0167-6393(97)00021-6
- (1997) Speech Commun. , vol.22 , pp. 1-15
- Lippmann, R.P.¹

20
- 34447100796
- (CRC, Boca Raton, FL).
- Loizou, P. C. (2007). Speech Enhancement: Theory and Practice (CRC, Boca Raton, FL).
- (2007) Speech Enhancement: Theory and Practice
- Loizou, P.C.¹

21
- 22144497277
- (Lawrence Erlbaum Associates, New York).
- Macmillan, N., and Creelman, D. (2005). Detection Theory: A User's Guide (Lawrence Erlbaum Associates, New York).
- (2005) Detection Theory: A User's Guide
- MacMillan, N.¹ Creelman, D.²

22
- 0003789815
- (Academic, London).
- Moore, B. (2003). An Introduction to the Psychology of Hearing (Academic, London).
- (2003) An Introduction to the Psychology of Hearing
- Moore, B.¹

23
- 2942665634
- An efficient robust sound classification algorithm for hearing aids
- DOI 10.1121/1.1710877
- Nordqvist, P., and Leijon, A. (2004). " An efficient robust sound classification algorithm for hearing aids.," J. Acoust. Soc. Am. 115, 3033-3041. 10.1121/1.1710877 (Pubitemid 38781236)
- (2004) Journal of the Acoustical Society of America , vol.115 , Issue.6 , pp. 3033-3041
- Nordqvist, P.¹ Leijon, A.²

24
- 0141595299
- The power of speech
- DOI 10.1126/science.1088904
- Rabiner, L. (2003). " The power of speech.," Science 301, 1494-1495. 10.1126/science.1088904 (Pubitemid 37128532)
- (2003) Science , vol.301 , Issue.5639 , pp. 1494-1495
- Rabiner, L.¹

25
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- 10.1109/89.365379
- Reynolds, D., and Rose, R. (1995). " Robust text-independent speaker identification using Gaussian mixture speaker models.," IEEE Trans. Speech Audio Process. 3, 72-83. 10.1109/89.365379
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , pp. 72-83
- Reynolds, D.¹ Rose, R.²

26
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- DOI 10.1006/dspr.1999.0361
- Reynolds, D., Quatieri, T., and Dunn, R. (2000). " Speaker verification using adapted Gaussian mixture models.," Digit. Signal Process. 10, 19-41. 10.1006/dspr.1999.0361 (Pubitemid 30592166)
- (2000) Digital Signal Processing: A Review Journal , vol.10 , Issue.1 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

27
- 0029726517
- Speech enhancement based on a priori signal to noise estimation
- in
- Scalart, P., and Filho, J. (1996). " Speech enhancement based on a priori signal to noise estimation.," in Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing, pp. 629-632.
- (1996) Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing , pp. 629-632
- Scalart, P.¹ Filho, J.²

28
- 34247580087
- Reaching over the gap: A review of efforts to link human and automatic speech recognition research
- DOI 10.1016/j.specom.2007.01.009, PII S0167639307000106, Bridging the Gap between Human and Automatic Speech Recognition
- Scharenborg, O. (2007). " Reaching over the gap: A review of efforts to link human and automatic speech recognition research.," Speech Commun. 49, 336-347. 10.1016/j.specom.2007.01.009 (Pubitemid 46670364)
- (2007) Speech Communication , vol.49 , Issue.5 , pp. 336-347
- Scharenborg, O.¹

29
- 4644317224
- A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
- ",. 10.1016/j.specom.2004.03.006
- Seltzer, M., Raj, B., and Stern, R. (2004). " A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition.," Speech Commun. 43, 379-393. 10.1016/j.specom.2004.03.006
- (2004) Speech Commun. , vol.43 , pp. 379-393
- Seltzer, M.¹ Raj, B.² Stern, R.³

30
- 15844428932
- Human and machine consonant recognition
- DOI 10.1016/j.specom.2004.11.009, PII S0167639304001499
- Sroka, J. J., and Braida, L. D. (2005). " Human and machine consonant recognition.," Speech Commun. 45, 401-423. 10.1016/j.specom.2004. 11.009 (Pubitemid 40423287)
- (2005) Speech Communication , vol.45 , Issue.4 , pp. 401-423
- Sroka, J.J.¹ Braida, L.D.²

31
- 0038712550
- SNR estimation based on amplitude modulation analysis with applications to noise suppression
- 10.1109/TSA.2003.811542
- Tchorz, J., and Kollmeier, B. (2003). " SNR estimation based on amplitude modulation analysis with applications to noise suppression.," IEEE Trans. Speech Audio Process. 11, 184-192. 10.1109/TSA.2003.811542
- (2003) IEEE Trans. Speech Audio Process. , vol.11 , pp. 184-192
- Tchorz, J.¹ Kollmeier, B.²

32
- 0027623210
- Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
- 10.1016/0167-6393(93)90095-3
- Varga, A., and Steeneken, H. J. M. (1993). " Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems.," Speech Commun. 12, 247-251. 10.1016/0167-6393(93)90095-3
- (1993) Speech Commun. , vol.12 , pp. 247-251
- Varga, A.¹ Steeneken, H.J.M.²

33
- 82255178542
- (Wiley, Hoboken, NJ).
- Wang, D. L., and Brown, G. J. (2006). Computational Auditory Scene Analysis: Principles, Algorithms, and Applications (Wiley, Hoboken, NJ).
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications
- Wang, D.L.¹ Brown, G.J.²

34
- 35848945907
- The design and evaluation of a hearing aid with trainable amplification parameters
- DOI 10.1097/AUD.0b013e3181576738, PII 0000344620071200000010
- Zakis, J. A., Dillon, H., and McDermott, H. J. (2007). " The design and evaluation of a hearing aid with trainable amplification parameters.," Ear Hear. 28, 812-830. 10.1097/AUD.0b013e3181576738 (Pubitemid 350059322)
- (2007) Ear and Hearing , vol.28 , Issue.6 , pp. 812-830
- Zakis, J.A.¹ Dillon, H.² McDermott, H.J.³

35
- 14044252930
- Speech recognition with amplitude and frequency modulations
- DOI 10.1073/pnas.0406460102
- Zeng, F.-G., Nie, K., Stickney, G. S., Kong, Y.-Y., Vongphoe, M., Bhargave, A., Wei, C., and Cao, K. (2005). " Speech recognition with amplitude and frequency modulations.," Proc. Natl. Acad. Sci. U.S.A. 102, 2293-2298. 10.1073/pnas.0406460102 (Pubitemid 40279369)
- (2005) Proceedings of the National Academy of Sciences of the United States of America , vol.102 , Issue.7 , pp. 2293-2298
- Zeng, F.-G.¹ Nie, K.² Stickney, G.S.³ Kong, Y.-Y.⁴ Vongphoe, M.⁵ Bhargave, A.⁶ Wei, C.⁷ Cao, K.⁸

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.