Volume 19, Issue 7, 2011, Pages 2125-2136

An algorithm for intelligibility prediction of time-frequency weighted noisy speech

Author keywords

Noise reduction; objective measure; speech enhancement; speech intelligibility prediction

Indexed keywords

DEVELOPMENT PROCESS; GLOBAL STATISTICS; NOISY SPEECH; OBJECTIVE MEASURE; SEGMENT LENGTHS; TIME FREQUENCY; TIME SEGMENTS;

EID: 79960916745     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2011.2114881     Document Type: Article
Times cited: 2198

References (33)
  • 1
    • 84953657538
    • Factors governing the intelligibility of speech sounds
    • N. R. French and J. C. Steinberg, "Factors governing the intelligibility of speech sounds," J. Acoust. Soc. Amer., vol. 19, no. 1, pp. 90-119, 1947.
    • (1947) J. Acoust. Soc. Amer. , vol.19 , Issue.1 , pp. 90-119
    • French, N.R.1    Steinberg, J.C.2
  • 2
    • 84889381426
    • Methods for the calculation and use of the articulation index
    • K. D. Kryter, "Methods for the calculation and use of the articulation index," J. Acoust. Soc. Amer., vol. 34, no. 11, pp. 1689-1697, 1962.
    • (1962) J. Acoust. Soc. Amer. , vol.34 , Issue.11 , pp. 1689-1697
    • Kryter, K.D.1
  • 5
    • 17644371385
    • A Speech Intelligibility Index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners
    • 4 I, DOI 10.1121/1.1861713
    • K. S. Rhebergen and N. J. Versfeld, "A speech intelligibility index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners," J. Acoust. Soc. Amer., vol. 117, no. 4, pp. 2181-2192, 2005. (Pubitemid 40570476)
    • (2005) Journal of the Acoustical Society of America , vol.117 , pp. 2181-2192
    • Rhebergen, K.S.1    Versfeld, N.J.2
  • 6
    • 17644399140
    • Coherence and the speech intelligibility index
    • 4 I, DOI 10.1121/1.1862575
    • J. M. Kates and K. H. Arehart, "Coherence and the speech intelligibility index," J. Acoust. Soc. Amer., vol. 117, no. 4, pp. 2224-2237, 2005. (Pubitemid 40570480)
    • (2005) Journal of the Acoustical Society of America , vol.117 , pp. 2224-2237
    • Kates, J.M.1    Arehart, K.H.2
  • 7
    • 11144348189
    • Analysis of speech-based speech transmission index methods with implications for nonlinear operations
    • DOI 10.1121/1.1804628
    • R. L. Goldsworthy and J. E. Greenberg, "Analysis of speech-based speech transmission index methods with implications for nonlinear operations," J. Acoust. Soc. Amer., vol. 116, no. 6, pp. 3679-3689, 2004. (Pubitemid 40029948)
    • (2004) Journal of the Acoustical Society of America , vol.116 , Issue.6 , pp. 3679-3689
    • Goldsworthy, R.L.1    Greenberg, J.E.2
  • 9
    • 33845354768
    • Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
    • DOI 10.1121/1.2363929
    • D. S. Brungart, P. S. Chang, B. D. Simpson, and D. L. Wang, "Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation," J. Acoust. Soc. Amer., vol. 120, no. 6, pp. 4007-4018, 2006. (Pubitemid 44888096)
    • (2006) Journal of the Acoustical Society of America , vol.120 , Issue.6 , pp. 4007-4018
    • Brungart, D.S.1    Chang, P.S.2    Simpson, B.D.3    Wang, D.4
  • 10
    • 0027868016
    • Evaluation of a noise reduction method - Comparison between observed scores and scores predicted from STI
    • C. Ludvigsen, C. Elberling, and G. Keidser, "Evaluation of a noise reduction method-Comparison between observed scores and scores predicted from STI," Scand. Audiol. Supplement., vol. 38, pp. 50-55, 1993. (Pubitemid 23362792)
    • (1993) Scandinavian Audiology, Supplement , vol.22 , Issue.38 , pp. 50-55
    • Ludvigsen, C.1    Elberling, C.2    Keidser, G.3
  • 11
    • 60049084444
    • The concept of signal-to-noise ratio in the modulation domain and speech intelligibility
    • F. Dubbelboer and T. Houtgast, "The concept of signal-to-noise ratio in the modulation domain and speech intelligibility," J. Acoust. Soc. Amer., vol. 124, no. 6, pp. 3937-3946, 2008.
    • (2008) J. Acoust. Soc. Amer. , vol.124 , Issue.6 , pp. 3937-3946
    • Dubbelboer, F.1    Houtgast, T.2
  • 12
    • 35248891610
    • A comparative intelligibility study of single-microphone noise reduction algorithms
    • DOI 10.1121/1.2766778
    • Y. Hu and P. C. Loizou, "A comparative intelligibility study of single-microphone noise reduction algorithms," J. Acoust. Soc. Amer., vol. 122, no. 3, pp. 1777-1786, 2007. (Pubitemid 47560539)
    • (2007) Journal of the Acoustical Society of America , vol.122 , Issue.3 , pp. 1777-1786
    • Hu, Y.1    Loizou, P.C.2
  • 13
    • 70450161547
    • An evaluation of objective quality measures for speech intelligibility prediction
    • C. H. Taal, R. C. Hendriks, R. Heusdens, J. Jensen, and U. Kjems, "An evaluation of objective quality measures for speech intelligibility prediction," in Proc. Interspeech, 2009, pp. 1947-1950.
    • (2009) Proc. Interspeech , pp. 1947-1950
    • Taal, C.H.1    Hendriks, R.C.2    Heusdens, R.3    Jensen, J.4    Kjems, U.5
  • 14
    • 65549157071
    • Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions
    • J. Ma, Y. Hu, and P. Loizou, "Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions," J. Acoust. Soc. Amer., vol. 125, no. 5, pp. 3387-3405, 2009.
    • (2009) J. Acoust. Soc. Amer. , vol.125 , Issue.5 , pp. 3387-3405
    • Ma, J.1    Hu, Y.2    Loizou, P.3
  • 15
    • 79952871923
    • Prediction of speech intelligibility based on an auditory preprocessing model
    • C. Christiansen, M. S. Pedersen, and T. Dau, "Prediction of speech intelligibility based on an auditory preprocessing model," Speech Commun., vol. 52, pp. 678-692, 2010.
    • (2010) Speech Commun. , vol.52 , pp. 678-692
    • Christiansen, C.1    Pedersen, M.S.2    Dau, T.3
  • 16
    • 84863763285
    • A simple correlation-based model of intelligibility for nonlinear speech enhancement and separation
    • J. B. Boldt and D. P. W. Ellis, "A simple correlation-based model of intelligibility for nonlinear speech enhancement and separation," in Proc. EUSIPCO, 2009, pp. 1849-1853.
    • (2009) Proc. EUSIPCO , pp. 1849-1853
    • Boldt, J.B.1    Ellis, D.P.W.2
  • 17
    • 0028287770
    • Effect of reducing slow temporal modulations on speech reception
    • 5 I, DOI 10.1121/1.409836
    • R. Drullman, J. Festen, and R. Plomp, "Effect of reducing slow temporal modulations on speech reception," J. Acoust. Soc. Amer., vol. 95, no. 5, pp. 2670-2680, 1994. (Pubitemid 24152861)
    • (1994) Journal of the Acoustical Society of America , vol.95 , pp. 2670-2680
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 18
    • 0009628338
    • Detection of tone pulse of various durations in noise of various bandwidths
    • G. van den Brink, "Detection of tone pulse of various durations in noise of various bandwidths," J. Acoust. Soc. Amer., vol. 36, no. 6, pp. 1206-1211, 1964.
    • (1964) J. Acoust. Soc. Amer. , vol.36 , Issue.6 , pp. 1206-1211
    • Van Den Brink, G.1
  • 19
    • 70349161218
    • Role of mask pattern in intelligibility of ideal binary-masked noisy speech
    • U. Kjems, J. B. Boldt, M. S. Pedersen, T. Lunner, and D. Wang, "Role of mask pattern in intelligibility of ideal binary-masked noisy speech," J. Acoust. Soc. Amer., vol. 126, no. 3, pp. 1415-1426, 2009.
    • (2009) J. Acoust. Soc. Amer. , vol.126 , Issue.3 , pp. 1415-1426
    • Kjems, U.1    Boldt, J.B.2    Pedersen, M.S.3    Lunner, T.4    Wang, D.5
  • 20
    • 40749125179
    • Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
    • N. Li and P. C. Loizou, "Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction," J. Acoust. Soc. Amer., vol. 123, p. 1673, 2008.
    • (2008) J. Acoust. Soc. Amer. , vol.123 , pp. 1673
    • Li, N.1    Loizou, P.C.2
  • 21
    • 0037504237
    • Design, optimization and evaluation of a Danish sentence test in noise
    • K. Wagener, J. L. Josvassen, and R. Ardenkjaer, "Design, optimization and evaluation of a Danish sentence test in noise," Int. J. Audiol., vol. 42, no. 1, pp. 10-17, 2003. (Pubitemid 37372682)
    • (2003) International Journal of Audiology , vol.42 , Issue.1 , pp. 10-17
    • Wagener, K.1    Josvassen, J.L.2    Ardenkjaer, R.3
  • 22
    • 0021645331
    • Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    • Dec.
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 23
    • 51449104842
    • Minimum mean-square error estimation of discrete Fourier coefficients with generalized gamma priors
    • Aug.
    • J. S. Erkelens, R. C. Hendriks, R. Heusdens, and J. Jensen, "Minimum mean-square error estimation of discrete Fourier coefficients with generalized gamma priors," IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 6, pp. 1741-1752, Aug. 2007.
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.6 , pp. 1741-1752
    • Erkelens, J.S.1    Hendriks, R.C.2    Heusdens, R.3    Jensen, J.4
  • 24
    • 0035396555
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • DOI 10.1109/89.928915, PII S106366760104980X
    • R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504-512, Jul. 2001. (Pubitemid 32631178)
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.5 , pp. 504-512
    • Martin, R.1
  • 26
    • 44149106061
    • Evaluation of objective quality measures for speech enhancement
    • Jan.
    • Y. Hu and P. C. Loizou, "Evaluation of objective quality measures for speech enhancement," IEEE Trans. Audio Speech Lang. Process., vol. 16, no. 1, pp. 229-238, Jan. 2008.
    • (2008) IEEE Trans. Audio Speech Lang. Process. , vol.16 , Issue.1 , pp. 229-238
    • Hu, Y.1    Loizou, P.C.2
  • 28
    • 0029952425
    • A quantitative model of the "effective" signal processing in the auditory system. I. Model structure
    • T. Dau, D. Püschel, and A. Kohlrausch, "A quantitative model of the 'effective' signal processing in the auditory system. I. Model structure," J. Acoust. Soc. Amer., vol. 99, no. 6, pp. 3615-3622, 1996.
    • (1996) J. Acoust. Soc. Amer. , vol.99 , Issue.6 , pp. 3615-3622
    • Dau, T.1    Püschel, D.2    Kohlrausch, A.3
  • 29
    • 0029952451
    • A quantitative model of the "effective" signal processing in the auditory system. II. Simulations and measurements
    • T. Dau, D. Püschel, and A. Kohlrausch, "A quantitative model of the 'effective' signal processing in the auditory system. II. Simulations and measurements," J. Acoust. Soc. Amer., vol. 99, no. 6, pp. 3623-3631, 1996.
    • (1996) J. Acoust. Soc. Amer. , vol.99 , Issue.6 , pp. 3623-3631
    • Dau, T.1    Püschel, D.2    Kohlrausch, A.3
  • 31
    • 0032945152
    • Syllable intelligibility for temporally filtered LPC cepstral trajectories
    • DOI 10.1121/1.426895
    • T. Arai, M. Pavel, H. Hermansky, and C. Avendano, "Syllable intelligibility for temporally filtered LPC cepstral trajectories," J. Acoust. Soc. Amer., vol. 105, no. 5, pp. 2783-2791, 1999. (Pubitemid 29218397)
    • (1999) Journal of the Acoustical Society of America , vol.105 , Issue.5 , pp. 2783-2791
    • Arai, T.1    Pavel, M.2    Hermansky, H.3    Avendano, C.4
  • 33
    • 79959815343
    • The characterization of the relative information content by spectral features for the objective intelligibility assessment of nonlinearly processed speech
    • A. Schlesinger and M. M. Boone, "The characterization of the relative information content by spectral features for the objective intelligibility assessment of nonlinearly processed speech," in Proc. Interspeech, 2010, pp. 1309-1312.
    • (2010) Proc. Interspeech , pp. 1309-1312
    • Schlesinger, A.1    Boone, M.M.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.