Volume 125, Issue 5, 2009, Pages 3387-3405

Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions

Author keywords

[No Author keywords available]

Indexed keywords

ARTICULATION INDICES; IMPORTANCE FUNCTIONS; INTELLIGIBILITY SCORES; NOISE CONDITIONS; NORMAL-HEARING LISTENERS; OBJECTIVE MEASURES; WEIGHTING FUNCTIONS;

EID: 65549157071     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.3097493     Document Type: Article
Times cited: 403

References (61)
  • 1
    • 0028516073
    • How do humans process and recognize speech
    • Allen, J. B. (1994). "How do humans process and recognize speech," IEEE Trans. Speech Audio Process. 2, 567-577.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 567-577
    • Allen, J.B.1
  • 2
    • 84883672556
    • English verification of STI method for estimating speech intelligibility of a communications channel
    • Anderson, W. B., and Kalb, J. T. (1987). "English verification of STI method for estimating speech intelligibility of a communications channel," J. Acoust. Soc. Am. 81, 1982-1985.
    • (1987) J. Acoust. Soc. Am. , vol.81 , pp. 1982-1985
    • Anderson, W.B.1    Kalb, J.T.2
  • 4
    • 0030369532
    • Intelligibility of speech with filtered time trajectories of spectral envelopes
    • Arai, T., Pavel, M., Hermansky, H., and Avendano, C. (1996). "Intelligibility of speech with filtered time trajectories of spectral envelopes," in Proceedings of the ICSLP, pp. 2490-2493.
    • (1996) Proceedings of the ICSLP , pp. 2490-2493
    • Arai, T.1    Pavel, M.2    Hermansky, H.3    Avendano, C.4
  • 5
    • 34547645591
    • Effects of noise and distortion on speech quality judgments in normal-hearing and hearing-impaired listeners
    • DOI 10.1121/1.2754061
    • Arehart, K., Kates, J., Anderson, M., and Harvey, L. (2007). "Effects of noise and distortion on speech quality judgments in normal-hearing and hearing-impaired listeners," J. Acoust. Soc. Am. 122, 1150-1164. (Pubitemid 47205512)
    • (2007) Journal of the Acoustical Society of America , vol.122 , Issue.2 , pp. 1150-1164
    • Arehart, K.H.1    Kates, J.M.2    Anderson, M.C.3    Harvey, L.O.4
  • 7
    • 78649295196
    • Extension of ITU-T recommendation P.862 PESQ towards measuring speech intelligibility with vocoders
    • Beerends, J., van Wijngaarden, S., and van Buuren, R. (2005). "Extension of ITU-T recommendation P.862 PESQ towards measuring speech intelligibility with vocoders," in New Directions for Improving Audio Effectiveness, Proceedings of the RTO-MP-HFM-123, Neuilly-sur-Seine, France, pp. 10-1-10.6.
    • (2005) New Directions for Improving Audio Effectiveness
    • Beerends, J.1    Van Wijngaarden, S.2    Van Buuren, R.3
  • 9
    • 0028589019
    • The hearing aid input: A phonemic approach to assessing the spectral distribution of speech
    • Boothroyd, A., Erickson, F. N., and Medwetsky, L. (1994). "The hearing aid input: A phonemic approach to assessing the spectral distribution of speech," Ear Hear. 15, 432-442.
    • (1994) Ear Hear. , vol.15 , Issue.6 , pp. 432-442
    • Boothroyd, A.1    Erickson, F.N.2    Medwetsky, L.3
  • 10
    • 10644227006
    • Estimation of logatom intelligibility with STI method for Polish speech transmitted via communication channels
    • Brachmanski, S. (2004). "Estimation of logatom intelligibility with STI method for Polish speech transmitted via communication channels," Arch. Acoust. 29, 555-562.
    • (2004) Arch. Acoust. , vol.29 , pp. 555-562
    • Brachmanski, S.1
  • 11
    • 0015652267
    • Estimation of the magnitude-squared coherence function via overlapped fast Fourier transform processing
    • Carter, C., Knapp, C., and Nuttall, A. (1973). "Estimation of the magnitude-squared coherence function via overlapped fast Fourier transform processing," IEEE Trans. Audio Electroacoust. AU-21, 337-344.
    • (1973) IEEE Trans. Audio Electroacoust. , vol.AU-21 , pp. 337-344
    • Carter, C.1    Knapp, C.2    Nuttall, A.3
  • 12
    • 0036226165
    • Noise estimation by minima controlled recursive averaging for robust speech enhancement
    • DOI 10.1109/97.988717, PII S1070990802024100
    • Cohen, I., and Berdugo, B. (2002). "Noise estimation by minima controlled recursive averaging for robust speech enhancement," IEEE Signal Process. Lett. 9, 12-15. (Pubitemid 34306628)
    • (2002) IEEE Signal Processing Letters , vol.9 , Issue.1 , pp. 12-15
    • Cohen, I.1    Berdugo, B.2
  • 15
    • 84955051307
    • Statistical measurements on conversational speech
    • Dunn, H., and White, S. (1940). "Statistical measurements on conversational speech," J. Acoust. Soc. Am. 11, 278-288.
    • (1940) J. Acoust. Soc. Am. , vol.11 , pp. 278-288
    • Dunn, H.1    White, S.2
  • 16
    • 0021892216
    • Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
    • DOI 10.1109/TASSP.1985.1164550
    • Ephraim, Y., and Malah, D. (1985). "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process. ASSP-33, 443-445. (Pubitemid 15109380)
    • (1985) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 17
    • 84955037166
    • The perception of speech and its relation to telephony
    • Fletcher, H., and Galt, R. H. (1950). "The perception of speech and its relation to telephony," J. Acoust. Soc. Am. 22, 89-151.
    • (1950) J. Acoust. Soc. Am. , vol.22 , pp. 89-151
    • Fletcher, H.1    Galt, R.H.2
  • 18
    • 84953657538
    • Factors governing the intelligibility of speech sounds
    • French, N. R., and Steinberg, J. C. (1947). "Factors governing the intelligibility of speech sounds," J. Acoust. Soc. Am. 19, 90-119.
    • (1947) J. Acoust. Soc. Am. , vol.19 , pp. 90-119
    • French, N.R.1    Steinberg, J.C.2
  • 19
    • 11144348189
    • Analysis of speech-based speech transmission index methods with implications for nonlinear operations
    • DOI 10.1121/1.1804628
    • Goldsworthy, R., and Greenberg, J. (2004). "Analysis of speech-based speech transmission index methods with implications for nonlinear operations," J. Acoust. Soc. Am. 116, 3679-3689. (Pubitemid 40029948)
    • (2004) Journal of the Acoustical Society of America , vol.116 , Issue.6 , pp. 3679-3689
    • Goldsworthy, R.L.1    Greenberg, J.E.2
  • 20
    • 0035510532
    • Spectral subtraction using reduced delay convolution and adaptive averaging
    • Gustafsson, H., Nordholm, S., and Claesson, I. (2001). "Spectral subtraction using reduced delay convolution and adaptive averaging," IEEE Trans. Speech Audio Process. 9, 799-807.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 799-807
    • Gustafsson, H.1    Nordholm, S.2    Claesson, I.3
  • 22
    • 0038669544
    • The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • Hirsch, H., and Pearce, D. (2000). "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in ISCA Tutorial and Research Workshop ASR2000, Paris, France.
    • (2000) ISCA Tutorial and Research Workshop ASR2000
    • Hirsch, H.1    Pearce, D.2
  • 23
    • 0028795358
    • The effect of multichannel dynamic compression on speech intelligibility
    • Hohmann, V., and Kollmeier, B. (1995). "The effect of multichannel dynamic compression on speech intelligibility," J. Acoust. Soc. Am. 97, 1191-1195.
    • (1995) J. Acoust. Soc. Am. , vol.97 , pp. 1191-1195
    • Hohmann, V.1    Kollmeier, B.2
  • 24
    • 0029816568
    • Speech intelligibility prediction in hearing-impaired listeners based on a psychoacoustically motivated perception model
    • Holube, I., and Kollmeier, B. (1996). "Speech intelligibility prediction in hearing-impaired listeners based on a psychoacoustically motivated perception model," J. Acoust. Soc. Am. 100, 1703-1716. (Pubitemid 26307559)
    • (1996) Journal of the Acoustical Society of America , vol.100 , Issue.3 , pp. 1703-1716
    • Holube, I.1    Kollmeier, B.2
  • 25
    • 85050418319
    • Evaluation of speech transmission channels by using artificial signals
    • Houtgast, T., and Steeneken, H. J. M. (1971). "Evaluation of speech transmission channels by using artificial signals," Acustica 25, 355-367.
    • (1971) Acustica , vol.25 , pp. 355-367
    • Houtgast, T.1    Steeneken, H.J.M.2
  • 26
    • 84873312246
    • A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria
    • Houtgast, T., and Steeneken, H. (1985). "A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria," J. Acoust. Soc. Am. 77, 1069-1077.
    • (1985) J. Acoust. Soc. Am. , vol.77 , pp. 1069-1077
    • Houtgast, T.1    Steeneken, H.2
  • 27
    • 0041591273
    • A generalized subspace approach for enhancing speech corrupted by colored noise
    • Hu, Y., and Loizou, P. C. (2003). "A generalized subspace approach for enhancing speech corrupted by colored noise," IEEE Trans. Speech Audio Process. 11, 334-341.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , pp. 334-341
    • Hu, Y.1    Loizou, P.C.2
  • 28
    • 0742307391
    • Speech enhancement based on wavelet thresholding the multitaper spectrum
    • Hu, Y., and Loizou, P. C. (2004). "Speech enhancement based on wavelet thresholding the multitaper spectrum," IEEE Trans. Speech Audio Process. 12, 59-67.
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , pp. 59-67
    • Hu, Y.1    Loizou, P.C.2
  • 29
    • 35248891610
    • A comparative intelligibility study of single-microphone noise reduction algorithms
    • DOI 10.1121/1.2766778
    • Hu, Y., and Loizou, P. C. (2007). "A comparative intelligibility study of single-microphone noise reduction algorithms," J. Acoust. Soc. Am. 122, 1777-1786. (Pubitemid 47560539)
    • (2007) Journal of the Acoustical Society of America , vol.122 , Issue.3 , pp. 1777-1786
    • Hu, Y.1    Loizou, P.C.2
  • 30
    • 44149106061
    • Evaluation of objective quality measures for speech enhancement
    • Hu, Y., and Loizou, P. C. (2008). "Evaluation of objective quality measures for speech enhancement," IEEE Trans. Audio, Speech, Lang. Process. 16, 229-238.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , pp. 229-238
    • Hu, Y.1    Loizou, P.C.2
  • 32
    • 0014568991
    • IEEE recommended practice for speech quality measurements
    • IEEE
    • IEEE (1969). "IEEE recommended practice for speech quality measurements," IEEE Trans. Audio Electroacoust. 17, 225-246.
    • (1969) IEEE Trans. Audio Electroacoust. , vol.17 , pp. 225-246
  • 34
    • 0347337999
    • Incorporating the human hearing properties in the signal subspace approach for speech enhancement
    • Jabloun, F., and Champagne, B. (2003). "Incorporating the human hearing properties in the signal subspace approach for speech enhancement," IEEE Trans. Speech Audio Process. 11, 700-708.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , pp. 700-708
    • Jabloun, F.1    Champagne, B.2
  • 37
    • 0026780278
    • On using coherence to measure distortion in hearing aids
    • Kates, J. (1992). "On using coherence to measure distortion in hearing aids," J. Acoust. Soc. Am. 91, 2236-2244.
    • (1992) J. Acoust. Soc. Am. , vol.91 , pp. 2236-2244
    • Kates, J.1
  • 39
    • 0023965454
    • Objective quality evaluation for low bit-rate speech coding systems
    • Kitawaki, N., Nagabuchi, H., and Itoh, K. (1988). "Objective quality evaluation for low bit-rate speech coding systems," IEEE J. Sel. Areas Commun. 6, 262-273.
    • (1988) IEEE J. Sel. Areas Commun. , vol.6 , pp. 262-273
    • Kitawaki, N.1    Nagabuchi, H.2    Itoh, K.3
  • 41
    • 84889381426
    • Methods for the calculation and use of the articulation index
    • Kryter, K. D. (1962a). "Methods for the calculation and use of the articulation index," J. Acoust. Soc. Am. 34, 1689-1697.
    • (1962) J. Acoust. Soc. Am. , vol.34 , pp. 1689-1697
    • Kryter, K.D.1
  • 42
    • 84953648831
    • Validation of the articulation index
    • Kryter, K. D. (1962b). "Validation of the articulation index," J. Acoust. Soc. Am. 34, 1698-1706.
    • (1962) J. Acoust. Soc. Am. , vol.34 , pp. 1698-1706
    • Kryter, K.D.1
  • 43
    • 31744448838
    • Experimental comparison between speech transmission index, rapid speech transmission index, and speech intelligibility index
    • DOI 10.1121/1.2146112
    • Larm, P., and Hongisto, V. (2006). "Experimental comparison between speech transmission index, rapid speech transmission index, and speech intelligibility index," J. Acoust. Soc. Am. 119, 1106-1117. (Pubitemid 43177488)
    • (2006) Journal of the Acoustical Society of America , vol.119 , Issue.2 , pp. 1106-1117
    • Larm, P.1    Hongisto, V.2
  • 44
    • 85044887127
    • The contribution of obstruent consonants and acoustic landmarks to speech recognition in noise
    • Li, N., and Loizou, P. (2008). "The contribution of obstruent consonants and acoustic landmarks to speech recognition in noise," J. Acoust. Soc. Am. 124, 498-509.
    • (2008) J. Acoust. Soc. Am. , vol.124 , pp. 498-509
    • Li, N.1    Loizou, P.2
  • 46
    • 0027868016
    • Evaluation of a noise reduction method - Comparison between observed scores and scores predicted from STI
    • Ludvigsen, C., Elberling, C., and Keidser, G. (1993). "Evaluation of a noise reduction method - Comparison between observed scores and scores predicted from STI," Scand. Audiol. Suppl. 38, 50-55. (Pubitemid 23362792)
    • (1993) Scandinavian Audiology, Supplement , vol.22 , Issue.38 , pp. 50-55
    • Ludvigsen, C.1    Elberling, C.2    Keidser, G.3
  • 49
    • 0020816083
    • Suggested formulas for calculating auditory-filter bandwidths and excitation patterns
    • Moore, B., and Glasberg, B. (1983). "Suggested formulas for calculating auditory-filter bandwidths and excitation patterns," J. Acoust. Soc. Am. 74, 750-753.
    • (1983) J. Acoust. Soc. Am. , vol.74 , pp. 750-753
    • Moore, B.1    Glasberg, B.2
  • 50
    • 0023390033
    • Derivation of primary parameters and procedures for use in speech intelligibility predictions
    • Pavlovic, C. V. (1987). "Derivation of primary parameters and procedures for use in speech intelligibility predictions," J. Acoust. Soc. Am. 82, 413-422.
    • (1987) J. Acoust. Soc. Am. , vol.82 , pp. 413-422
    • Pavlovic, C.V.1
  • 52
    • 17644371385
    • A Speech Intelligibility Index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners
    • DOI 10.1121/1.1861713
    • Rhebergen, K. S., and Versfeld, N. J. (2005). "A speech intelligibility index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners," J. Acoust. Soc. Am. 117, 2181-2192. (Pubitemid 40570476)
    • (2005) Journal of the Acoustical Society of America , vol.117 , Issue.1-4 , pp. 2181-2192
    • Rhebergen, K.S.1    Versfeld, N.J.2
  • 53
    • 33845367670
    • Extended speech intelligibility index for the prediction of the speech reception threshold in fluctuating noise
    • Rhebergen, K. S., Versfeld, N. J., and Dreschler, W. (2006). "Extended speech intelligibility index for the prediction of the speech reception threshold in fluctuating noise," J. Acoust. Soc. Am. 120, 3988-3997.
    • (2006) J. Acoust. Soc. Am. , vol.120 , pp. 3988-3997
    • Rhebergen, K.S.1    Versfeld, N.J.2    Dreschler, W.3
  • 57
    • 0020178903
    • Some applications of the speech transmission index (STI) in auditoria
    • Steeneken, H., and Houtgast, T. (1982). "Some applications of the speech transmission index (STI) in auditoria," Acustica 51, 229-234. (Pubitemid 13450728)
    • (1982) ACUSTICA , vol.51 , pp. 229-234
    • Steeneken, H.J.M.1    Houtgast, T.2
  • 58
    • 0036219864
    • Toward a model for lexical access based on acoustic landmarks and distinctive features
    • Stevens, K. (2002). "Toward a model for lexical access based on acoustic landmarks and distinctive features," J. Acoust. Soc. Am. 111, 1872-1891.
    • (2002) J. Acoust. Soc. Am. , vol.111 , pp. 1872-1891
    • Stevens, K.1
  • 59
    • 0036194118
    • Intensity-importance functions for bandlimited monosyllabic words
    • DOI 10.1121/1.1445788
    • Studebaker, G., and Sherbecoe, R. (2002). "Intensity-importance functions for bandlimited monosyllabic words," J. Acoust. Soc. Am. 111, 1422-1436. (Pubitemid 34214082)
    • (2002) Journal of the Acoustical Society of America , vol.111 , Issue.3 , pp. 1422-1436
    • Studebaker, G.A.1    Sherbecoe, R.L.2
  • 60
    • 0032918927
    • Compression and expansion of the temporal envelope: Evaluation of speech intelligibility and sound quality
    • DOI 10.1121/1.426943
    • van Buuren, R., Festen, J., and Houtgast, T. (1999). "Compression and expansion of the temporal envelope: Evaluation of speech intelligibility and sound quality," J. Acoust. Soc. Am. 105, 2903-2913. (Pubitemid 29218406)
    • (1999) Journal of the Acoustical Society of America , vol.105 , Issue.5 , pp. 2903-2913
    • Van Buuren, R.A.1    Festen, J.M.2    Houtgast, T.3
  • 61
    • 1542295297
    • Effect of talker and speaking style on the Speech Transmission Index (L)
    • DOI 10.1121/1.1635411
    • Van Wijngaarden, S., and Houtgast, T. (2004). "Effect of talker and speaking style on the speech transmission index," J. Acoust. Soc. Am. 115, 38L-41L. (Pubitemid 38112404)
    • (2004) Journal of the Acoustical Society of America , vol.115 , Issue.1 , pp. 38-41
    • Van Wijngaarden, S.J.1    Houtgast, T.2


* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.