메뉴 건너뛰기




Volumn 130, Issue 4, 2011, Pages 2153-2161

Intelligibility of reverberant noisy speech with ideal binary masking

Author keywords

[No Author keywords available]

Indexed keywords

DESIRED SIGNAL; DIRECT PATHS; IDEAL BINARY MASK; NOISE ENERGY; NOISY SPEECH; NORMAL-HEARING LISTENERS; REVERBERANT CONDITION; REVERBERATION TIME; SPEECH RECEPTION THRESHOLD; SPEECH-SHAPED NOISE; SUBSTANTIAL REDUCTION; TARGET SPEECH; TIME FREQUENCY;

EID: 82255167374     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.3631668     Document Type: Article
Times cited : (38)

References (42)
  • 1
    • 0018455820 scopus 로고
    • Image method for efficiently simulating small-room acoustics
    • 10.1121/1.382599
    • Allen, J. B., and Berkley, D. A. (1979). Image method for efficiently simulating small-room acoustics, J. Acoust. Soc. Am. 65, 943-950. 10.1121/1.382599
    • (1979) J. Acoust. Soc. Am. , vol.65 , pp. 943-950
    • Allen, J.B.1    Berkley, D.A.2
  • 2
    • 33748523481 scopus 로고    scopus 로고
    • Determination of the potential benefit of time-frequency gain manipulation
    • DOI 10.1097/01.aud.0000233891.86809.df, PII 0000344620061000000004
    • Anzalone, M. C., Calandruccio, L., Doherty, K. A., and Carney, L. H. (2006). Determination of the potential benefit of time-frequency gain manipulation, Ear Hear. 27, 480-492. 10.1097/01.aud.0000233891.86809.df (Pubitemid 44371244)
    • (2006) Ear and Hearing , vol.27 , Issue.5 , pp. 480-492
    • Anzalone, M.C.1    Calandruccio, L.2    Doherty, K.A.3    Carney, L.H.4
  • 3
    • 2142812604 scopus 로고    scopus 로고
    • The perception of speech under adverse conditions
    • edited by S. Greenberg, W. A. Ainsworth, A. N. Popper, and R. R. Fay (Springer, New York)
    • Assmann P., and Summerfield, A. Q. (2004). The perception of speech under adverse conditions, in Speech Processing in the Auditory System, edited by, S. Greenberg, W. A. Ainsworth, A. N. Popper, and, R. R. Fay, (Springer, New York), pp. 231-308.
    • (2004) Speech Processing in the Auditory System , pp. 231-308
    • Assmann, P.1    Summerfield, A.Q.2
  • 4
    • 0037945504 scopus 로고    scopus 로고
    • On the importance of early reflections for speech in rooms
    • DOI 10.1121/1.1570439
    • Bradley, J. S., Sato, H., and Picard, M. (2003). On the importance of early reflections for speech in rooms, J. Acoust. Soc. Am. 113 (6), 3233-3244. 10.1121/1.1570439 (Pubitemid 36676517)
    • (2003) Journal of the Acoustical Society of America , vol.113 , Issue.6 , pp. 3233-3244
    • Bradley, J.S.1    Sato, H.2    Picard, M.3
  • 6
    • 0039334758 scopus 로고    scopus 로고
    • The cocktail party phenomenon: A review of research on speech intelligibility in multiple-talker conditions
    • Bronkhorst, A. W. (2000). The cocktail party phenomenon: A review of research on speech intelligibility in multiple-talker conditions, Acustica, 86, 117-128. (Pubitemid 34103984)
    • (2000) Acta Acustica united with Acustica , vol.86 , Issue.1 , pp. 117-128
    • Bronkhorst, A.W.1
  • 7
    • 0035106984 scopus 로고    scopus 로고
    • Informational and energetic masking effects in the perception of two simultaneous talkers
    • DOI 10.1121/1.1345696
    • Brungart, D. S. (2001). Information and energetic masking effects in the perception of two simultaneous talkers, J. Acoust. Soc. Am. 109, 1101-1109. 10.1121/1.1345696 (Pubitemid 32215916)
    • (2001) Journal of the Acoustical Society of America , vol.109 , Issue.3 , pp. 1101-1109
    • Brungart, D.S.1
  • 8
    • 33845354768 scopus 로고    scopus 로고
    • Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
    • DOI 10.1121/1.2363929
    • Brungart, D., Chang, P. S., Simpson B. D., and Wang, D. L. (2006). Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation, J. Acoust. Soc. Am. 120, 4007-4018. 10.1121/1.2363929 (Pubitemid 44888096)
    • (2006) Journal of the Acoustical Society of America , vol.120 , Issue.6 , pp. 4007-4018
    • Brungart, D.S.1    Chang, P.S.2    Simpson, B.D.3    Wang, D.4
  • 9
    • 67649095324 scopus 로고    scopus 로고
    • Multitalker speech perception with ideal time-frequency segregation: Effects of voice characteristics and number of talkers
    • 10.1121/1.3117686
    • Brungart, D., Chang, P. S., Simpson B. D., and Wang, D. L. (2009). Multitalker speech perception with ideal time-frequency segregation: Effects of voice characteristics and number of talkers, J. Acoust. Soc. Am. 125 (6), 4006-4022. 10.1121/1.3117686
    • (2009) J. Acoust. Soc. Am. , vol.125 , Issue.6 , pp. 4006-4022
    • Brungart, D.1    Chang, P.S.2    Simpson, B.D.3    Wang, D.L.4
  • 10
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • DOI 10.1016/S0167-6393(00)00034-0, PII S0167639300000340
    • Cooke, M. P., Green, P., Josifovski, L., and Vizinho, A. (2001). Robust automatic speech recognition with missing and unreliable acoustic data, Speech Commun. 34, 267-285. 10.1016/S0167-6393(00)00034-0 (Pubitemid 32284867)
    • (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 11
    • 0242440783 scopus 로고    scopus 로고
    • Effects of reverberation on perceptual segregation of competing voices
    • DOI 10.1121/1.1616922
    • Culling, J. F., Hodder, K. I., and Toh, C. Y. (2003). Effects of reverberation on perceptual segregation of competing voices, J. Acoust. Soc. Am. 114 (5), 2871-2876. 10.1121/1.1616922 (Pubitemid 37371643)
    • (2003) Journal of the Acoustical Society of America , vol.114 , Issue.5 , pp. 2871-2876
    • Culling, J.F.1    Hodder, K.I.2    Toh, C.Y.3
  • 12
    • 37849026992 scopus 로고    scopus 로고
    • Speech recognition with varying numbers and types of competing talkers by normal-hearing, cochlear-implant, and implant simulation subjects
    • 10.1121/1.2805617
    • Cullington, H. E., and Zeng, F. G. (2008). Speech recognition with varying numbers and types of competing talkers by normal-hearing, cochlear-implant, and implant simulation subjects, J. Acoust. Soc. Am. 123 (1), 450-461. 10.1121/1.2805617
    • (2008) J. Acoust. Soc. Am. , vol.123 , Issue.1 , pp. 450-461
    • Cullington, H.E.1    Zeng, F.G.2
  • 13
    • 0029165344 scopus 로고
    • Speech intelligibility in noise: Relative contribution of speech elements above and below the noise level
    • 10.1121/1.413378
    • Drullman, R. (1995). Speech intelligibility in noise: Relative contribution of speech elements above and below the noise level, J. Acoust. Soc. Am. 98, 1796-1798. 10.1121/1.413378
    • (1995) J. Acoust. Soc. Am. , vol.98 , pp. 1796-1798
    • Drullman, R.1
  • 14
    • 0019195665 scopus 로고
    • Effect of reverberation and noise on the intelligibility of sentences in cases of presbyacusis
    • DOI 10.1121/1.384767
    • Duquesnoy A. J., and Plomp, R. (1980). Effect of reverberation and noise on the intelligibility of sentences in cases of presbyacusis, J. Acoust. Soc. Am. 68 (2), 537-544. 10.1121/1.384767 (Pubitemid 11227341)
    • (1980) Journal of the Acoustical Society of America , vol.68 , Issue.2 , pp. 537-544
    • Duquesnoy, A.J.1    Plomp, R.2
  • 15
    • 0025259936 scopus 로고
    • Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing
    • 10.1121/1.400247
    • Festen, J. M., and Plomp, R. (1990). Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing, J. Acoust. Soc. Am. 88 (4), 1725-1736. 10.1121/1.400247
    • (1990) J. Acoust. Soc. Am. , vol.88 , Issue.4 , pp. 1725-1736
    • Festen, J.M.1    Plomp, R.2
  • 16
    • 49249085785 scopus 로고    scopus 로고
    • The combined effects of reverberation and nonstationary noise on sentence intelligibility
    • 10.1121/1.2945153
    • George, E. L. J., Festen, J. M., and Houtgast, T. (2008). The combined effects of reverberation and nonstationary noise on sentence intelligibility, J. Acoust. Soc. Am. 124 (2), 1269-1277. 10.1121/1.2945153
    • (2008) J. Acoust. Soc. Am. , vol.124 , Issue.2 , pp. 1269-1277
    • George, E.L.J.1    Festen, J.M.2    Houtgast, T.3
  • 17
    • 78650501915 scopus 로고    scopus 로고
    • Measuring the effects of reverberation and noise on sentence intelligibility for hearing-imapired listeners
    • 10.1044/1092-4388(2010/09-0197)
    • George, E. L. J., Goverts, S. T., Festen J. M., and Houtgast, T. (2010). Measuring the effects of reverberation and noise on sentence intelligibility for hearing-imapired listeners, J. Speech, Lang. and Hear. Res. 53, 1429-1439. 10.1044/1092-4388(2010/09-0197)
    • (2010) J. Speech, Lang. and Hear. Res. , vol.53 , pp. 1429-1439
    • George, E.L.J.1    Goverts, S.T.2    Festen, J.M.3    Houtgast, T.4
  • 18
    • 24444458549 scopus 로고    scopus 로고
    • Importance of early and late reflections for automatic speech recognition in reverberant environments
    • Glzer, H., and Kleinschmidt, M. (2003). Importance of early and late reflections for automatic speech recognition in reverberant environments, Proc. Elektronische Sprachsignalverarbeitung (ESSV).
    • (2003) Proc. Elektronische Sprachsignalverarbeitung (ESSV).
    • Glzer, H.1    Kleinschmidt, M.2
  • 19
    • 34547450787 scopus 로고    scopus 로고
    • A new definition of boundary point between early reflections and late reverberation in room impulse responses
    • DOI 10.1121/1.2743161
    • Hidaka, T., Yamada, Y, and Nakagawa, T. (2007). A new definition of boundary point between early reflections and late reverberation in room impulse responses, J. Acoust. Soc. Am. 122 (1), 326-332. 10.1121/1.2743161 (Pubitemid 47365943)
    • (2007) Journal of the Acoustical Society of America , vol.122 , Issue.1 , pp. 326-332
    • Hidaka, T.1    Yamada, Y.2    Nakagawa, T.3
  • 20
    • 0019060580 scopus 로고
    • Predicting speech intelligibility in rooms from the modulation transfer function. I. General room acoustics
    • Houtgast, T., Steeneken, H. J. M., and Plomp, R. (1980). Predicting speech intelligibility in rooms from the modulation transfer function. I. General room acoustics, Acustica 46, 59-72.
    • (1980) Acustica , vol.46 , pp. 59-72
    • Houtgast, T.1    Steeneken, H.J.M.2    Plomp, R.3
  • 21
    • 70349161218 scopus 로고    scopus 로고
    • Role of mask pattern in intelligibility of ideal binary-masked noisy speech
    • 10.1121/1.3179673
    • Kjems, U., Boldt, J. P., Pedersen, M. S., Lunner, T., and Wang, D. L. (2010). Role of mask pattern in intelligibility of ideal binary-masked noisy speech, J. Acoust. Soc. Am. 126, 1415-1426. 10.1121/1.3179673
    • (2010) J. Acoust. Soc. Am. , vol.126 , pp. 1415-1426
    • Kjems, U.1    Boldt, J.P.2    Pedersen, M.S.3    Lunner, T.4    Wang, D.L.5
  • 22
    • 40749125179 scopus 로고    scopus 로고
    • Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
    • DOI 10.1121/1.2832617
    • Li, N., and Loizou, P. C. (2008). Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction, J. Acoust. Soc. Am. 123, 1673-1682. 10.1121/1.2832617 (Pubitemid 351379593)
    • (2008) Journal of the Acoustical Society of America , vol.123 , Issue.3 , pp. 1673-1682
    • Li, N.1    Loizou, P.C.2
  • 23
    • 58149196390 scopus 로고    scopus 로고
    • On the optimality of ideal binary time-frequency masks
    • 10.1016/j.specom.2008.09.001
    • Li, Y., and Wang, D. L. (2009). On the optimality of ideal binary time-frequency masks, Speech Commun. 51, 230-239. 10.1016/j.specom.2008.09.001
    • (2009) Speech Commun. , vol.51 , pp. 230-239
    • Li, Y.1    Wang, D.L.2
  • 24
    • 9644276874 scopus 로고    scopus 로고
    • The effect of overlap-masking on binaural reverberant word intelligibility
    • DOI 10.1121/1.1781621
    • Libbey, B., and Rogers, P. H. (2004). The effect of overlap-masking on binaural reverberant word intelligibility, J. Acoust. Soc. Am. 116 (5), 3141-3151. 10.1121/1.1781621 (Pubitemid 39575559)
    • (2004) Journal of the Acoustical Society of America , vol.116 , Issue.5 , pp. 3141-3151
    • Libbey, B.1    Rogers, P.H.2
  • 25
    • 50549201067 scopus 로고
    • The influence of reflections on auditorium acoustics
    • 10.1016/0022-460X(64)90057-4
    • Lochner, J. P. A., and J. F. Burger, J. F. (1964). The influence of reflections on auditorium acoustics, J. Sound Vib. 1, 426-454. 10.1016/0022-460X(64)90057-4
    • (1964) J. Sound Vib. , vol.1 , pp. 426-454
    • Lochner, J.P.A.1    Burger, J.F.2    Burger, J.F.3
  • 26
    • 85008544097 scopus 로고    scopus 로고
    • Model-based expectation maximization source separation and localization
    • 10.1109/TASL.2009.2029711
    • Mandel, M. I., Weiss, R. J., and Ellis, D. P. W. (2010). Model-based expectation maximization source separation and localization, IEEE Trans. Audio, Speech, Lang. Process. 18, 382-394. 10.1109/TASL.2009.2029711
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , pp. 382-394
    • Mandel, M.I.1    Weiss, R.J.2    Ellis, D.P.W.3
  • 28
    • 0020325263 scopus 로고
    • Monaural and binaural speech perception in reverberation for listeners of various ages
    • DOI 10.1121/1.387773
    • Nabelek, A. K., and Robinson, P. K. (1982). Monaural and binaural speech perception in reverberation for listeners of various ages, J. Acoust. Soc. Am. 71 (5), 1242-1248. 10.1121/1.387773 (Pubitemid 12058777)
    • (1982) Journal of the Acoustical Society of America , vol.71 , Issue.5 , pp. 1242-1248
    • Nabelek, A.K.1    Robinson, P.K.2
  • 29
    • 0028012490 scopus 로고
    • Development of the hearing in noise test for the measurement of speech reception thresholds in quiet and in noise
    • Nilsson, M., Soli, S., and Sullivan, J., (1994). Development of the Hearing In Noise Test for the measurement of speech reception thresholds in quiet and in noise, J. Acoust. Soc. Am. 95, 1085-1099. 10.1121/1.408469 (Pubitemid 24056373)
    • (1994) Journal of the Acoustical Society of America , vol.95 , Issue.2 , pp. 1085-1099
    • Nilsson, M.1    Soli, S.D.2    Sullivan, J.A.3
  • 30
    • 2942539074 scopus 로고    scopus 로고
    • Techniques for handling convolutional distortion with missing data' automatic speech recognition
    • 10.1016/j.specom.2004.02.005
    • Palomki, K. J., Brown, G. J., and Barker, J. P. (2004). Techniques for handling convolutional distortion with missing data' automatic speech recognition, Speech Commun. 43, 123-142. 10.1016/j.specom.2004.02.005
    • (2004) Speech Commun. , vol.43 , pp. 123-142
    • Palomki, K.J.1    Brown, G.J.2    Barker, J.P.3
  • 31
    • 40949096929 scopus 로고    scopus 로고
    • Two-microphone separation of speech mixtures
    • DOI 10.1109/TNN.2007.911740
    • Pedersen, M. S., Wang, D. L., Larsen, J., and Kjems, U. (2008). Two-microphone separation of speech mixtures, IEEE Trans. Neural Networks 19, 475-492. 10.1109/TNN.2007.911740 (Pubitemid 351411571)
    • (2008) IEEE Transactions on Neural Networks , vol.19 , Issue.3 , pp. 475-492
    • Pedersen, M.S.1    Wang, D.L.2    Larsen, J.3    Kjems, U.4
  • 32
    • 67149105535 scopus 로고    scopus 로고
    • Limitations of the spectrum masking technique for blind source separation
    • Rodrigues, G. F., and Yehia, H. C. (2009). Limitations of the spectrum masking technique for blind source separation, Proc. ICA, 621-628.
    • (2009) Proc. ICA , pp. 621-628
    • Rodrigues, G.F.1    Yehia, H.C.2
  • 33
    • 0142026377 scopus 로고    scopus 로고
    • Speech segregation based on sound localization
    • DOI 10.1121/1.1610463
    • Roman, N., Wang, D. L., and Brown, G. J. (2003). Speech segregation based on sound localization, J. Acoust. Soc. Am. 114, 2236-2252. 10.1121/1.1610463 (Pubitemid 37266649)
    • (2003) Journal of the Acoustical Society of America , vol.114 , Issue.4 , pp. 2236-2252
    • Roman, N.1    Wang, D.2    Brown, G.J.3
  • 34
    • 33845361885 scopus 로고    scopus 로고
    • Binaural segregation in multisource reverberant environments
    • DOI 10.1121/1.2355480
    • Roman, N., Srinivasan, S., and Wang, D. L. (2006). Binaural segregation in multisource reverberant environments, J. Acoust. Soc. Am. 120, 4040-4051. 10.1121/1.2355480 (Pubitemid 44888099)
    • (2006) Journal of the Acoustical Society of America , vol.120 , Issue.6 , pp. 4040-4051
    • Roman, N.1    Srinivasan, S.2    Wang, D.3
  • 35
    • 33750311718 scopus 로고    scopus 로고
    • Binary and ratio time-frequency masks for robust speech recognition
    • DOI 10.1016/j.specom.2006.09.003, PII S0167639306001129
    • Srinivasan, S., Roman, N., and Wang, D. L. (2006). Binary and ratio time- frequency masks for robust speech recognition, Speech Commun. 48, 1486-1501. 10.1016/j.specom.2006.09.003 (Pubitemid 44634774)
    • (2006) Speech Communication , vol.48 , Issue.11 , pp. 1486-1501
    • Srinivasan, S.1    Roman, N.2    Wang, D.3
  • 36
    • 70350038037 scopus 로고    scopus 로고
    • Robust speech recognition by integrating speech separation and hypothesis testing
    • 10.1016/j.specom.2009.08.008
    • Srinivasan, S., and Wang, D. L. (2010) Robust speech recognition by integrating speech separation and hypothesis testing, Speech Commun. 52, pp. 72-81. 10.1016/j.specom.2009.08.008
    • (2010) Speech Commun. , vol.52 , pp. 72-81
    • Srinivasan, S.1    Wang, D.L.2
  • 37
    • 84924843200 scopus 로고
    • The precedence effect in sound localization
    • 10.2307/1418275
    • Wallach, H., Newman, E. B., and Rosenzweig, M. R. (1949) The precedence effect in sound localization, Am. J. Psychol. 52, 315-336. 10.2307/1418275
    • (1949) Am. J. Psychol. , vol.52 , pp. 315-336
    • Wallach, H.1    Newman, E.B.2    Rosenzweig, M.R.3
  • 38
    • 84892233308 scopus 로고    scopus 로고
    • On ideal binary mask as the computational goal of auditory scene analysis
    • edited by P. Divenyi (Kluwer Academic, Norwell, MA)
    • Wang, D. L. (2005). On ideal binary mask as the computational goal of auditory scene analysis, in Speech Separation by Humans and Machines, edited by, P. Divenyi, (Kluwer Academic, Norwell, MA), pp. 181-197.
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
    • Wang, D.L.1
  • 39
    • 56249144201 scopus 로고    scopus 로고
    • Time-frequency masking for speech separation and its potential for hearing aid design
    • 10.1177/1084713808326455
    • Wang, D. L. (2008). Time-frequency masking for speech separation and its potential for hearing aid design, Trends Amplif. 12, 332-353. 10.1177/1084713808326455
    • (2008) Trends Amplif. , vol.12 , pp. 332-353
    • Wang, D.L.1
  • 41
    • 64649103540 scopus 로고    scopus 로고
    • Speech intelligibility background noise with ideal binary time-frequency masking
    • 10.1121/1.3083233
    • Wang, D. L., Kjems, U., Pedersen, M. S. Boldt, J. B., and Lunner, T. (2009). Speech intelligibility background noise with ideal binary time-frequency masking, J. Acoust. Soc. Am. 125, 2336-2347. 10.1121/1.3083233
    • (2009) J. Acoust. Soc. Am. , vol.125 , pp. 2336-2347
    • Wang, D.L.1    Kjems, U.2    Pedersen, M.S.3    Boldt, J.B.4    Lunner, T.5
  • 42
    • 77955697785 scopus 로고    scopus 로고
    • Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization
    • 10.1109/TASL.2010.2050087
    • Woodruff, J., and Wang, D. L. (2010). Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization, IEEE Trans. Audio, Speech, Lang. Process. 18, 1856-1866. 10.1109/TASL.2010.2050087
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , pp. 1856-1866
    • Woodruff, J.1    Wang, D.L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.