메뉴 건너뛰기




Volumn 22, Issue 1, 2014, Pages 6-16

Objective intelligibility measures based on mutual information for speech subjected to speech enhancement processing

Author keywords

Mutual information; Objective measures; Speech intelligibility prediction

Indexed keywords

AUDITION; FORECASTING; SPEECH ENHANCEMENT; SPEECH RECOGNITION; HIGHER ORDER STATISTICS; INFORMATION THEORY; SIGNAL TO NOISE RATIO; SPEECH;

EID: 84897949130     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2013.2281574     Document Type: Article
Times cited : (44)

References (49)
  • 3
    • 0028516073 scopus 로고
    • How do humans process and recognize speech?
    • Oct.
    • J. B. Allen, "How do humans process and recognize speech?," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 567-577, Oct. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 567-577
    • Allen, J.B.1
  • 4
    • 84953657538 scopus 로고
    • Factors governing the intelligibility of speech sounds
    • N. French and J. Steinberg, "Factors governing the intelligibility of speech sounds," J. Acoust. Soc. Amer., vol. 19, no. 1, pp. 90-119, 1947.
    • (1947) J. Acoust. Soc. Amer. , vol.19 , Issue.1 , pp. 90-119
    • French, N.1    Steinberg, J.2
  • 6
    • 17644399140 scopus 로고    scopus 로고
    • Coherence and the speech intelligibility index
    • J. M. Kates and K. H. Arehart, "Coherence and the speech intelligibility index," J. Acoust. Soc. Amer., vol. 117, no. 4, pp. 2224-2237, 2005.
    • (2005) J. Acoust. Soc. Amer. , vol.117 , Issue.4 , pp. 2224-2237
    • Kates, J.M.1    Arehart, K.H.2
  • 7
    • 65549157071 scopus 로고    scopus 로고
    • Objectivemeasures for predicting speech intelligibility in noisy conditions based on new band-importance functions
    • J. Ma, Y. Hu, and P. Loizou, "Objectivemeasures for predicting speech intelligibility in noisy conditions based on new band-importance functions," J. Acoust. Soc. Amer., vol. 125, no. 5, pp. 3387-3405, 2009.
    • (2009) J. Acoust. Soc. Amer. , vol.125 , Issue.5 , pp. 3387-3405
    • Ma, J.1    Hu, Y.2    Loizou, P.3
  • 8
    • 79959815343 scopus 로고    scopus 로고
    • The characterization of the relative information content by spectral features for the objective intelligibility assessment of nonlinearly processed speech
    • A. Schlesinger and M. M. Boone, "The characterization of the relative information content by spectral features for the objective intelligibility assessment of nonlinearly processed speech," in Proc. Interspeech, 2010, pp. 1309-1312.
    • Proc. Interspeech, 2010 , pp. 1309-1312
    • Schlesinger, A.1    Boone, M.M.2
  • 9
    • 81355153924 scopus 로고    scopus 로고
    • An evaluation of objective measures for intelligibility prediction of time-frequency weighted noisy speech
    • C. H. Taal, R. C. Hendriks, and R. Heusdens, "An evaluation of objective measures for intelligibility prediction of time-frequency weighted noisy speech," J. Acoust. Soc. Amer., vol. 130, no. 5, pp. 3013-3027, 2011.
    • (2011) J. Acoust. Soc. Amer. , vol.130 , Issue.5 , pp. 3013-3027
    • Taal, C.H.1    Hendriks, R.C.2    Heusdens, R.3
  • 11
    • 79960916745 scopus 로고    scopus 로고
    • An algorithm for intelligibility prediction of time-frequency weighted noisy speech
    • Sep.
    • C. H. Taal, R. C. Hendriks, R. Heusdens, and J. Jensen, "An algorithm for intelligibility prediction of time-frequency weighted noisy speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 7, pp. 2125-2136, Sep. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.7 , pp. 2125-2136
    • Taal, C.H.1    Hendriks, R.C.2    Heusdens, R.3    Jensen, J.4
  • 13
    • 11144348189 scopus 로고    scopus 로고
    • Analysis of speech-based speech transmission index methods with implications for nonlinear operations
    • R. L. Goldsworthy and J. E. Greenberg, "Analysis of speech-based speech transmission index methods with implications for nonlinear operations," J. Acoust. Soc. Amer., vol. 116, no. 6, pp. 3679-3689, 2004.
    • (2004) J. Acoust. Soc. Amer. , vol.116 , Issue.6 , pp. 3679-3689
    • Goldsworthy, R.L.1    Greenberg, J.E.2
  • 14
    • 0027868016 scopus 로고
    • Evaluation of a noise reduction method - Comparison between observed scores and scores predicted from STI
    • C. Ludvigsen, C. Elberling, and G. Keidser, "Evaluation of a noise reduction method-comparison between observed scores and scores predicted from STI," Scand. Audiol. Suppl., vol. 38, pp. 50-55, 1993. (Pubitemid 23362792)
    • (1993) Scandinavian Audiology, Supplement , vol.22 , Issue.38 , pp. 50-55
    • Ludvigsen, C.1    Elberling, C.2    Keidser, G.3
  • 15
    • 80052546551 scopus 로고    scopus 로고
    • Predicting speech intelligibility based on the envelope power signal-to-noise ratio aftermodulation-frequency selective processing
    • S. Jørgensen and T. Dau, "Predicting speech intelligibility based on the envelope power signal-to-noise ratio aftermodulation-frequency selective processing," J. Acoust. Soc. Amer., vol. 130, no. 3, pp. 1475-1487, 2011.
    • (2011) J. Acoust. Soc. Amer. , vol.130 , Issue.3 , pp. 1475-1487
    • Jørgensen, S.1    Dau, T.2
  • 16
    • 0029952425 scopus 로고    scopus 로고
    • A quantitative model of the "effective" signal processing in the auditory system. I. Model structure
    • T. Dau, D. Püschel, and A. Kohlrausch, "A quantitative model of the "effective" signal processing in the auditory system. I. model structure," J. Acoust. Soc. Amer., vol. 99, no. 6, pp. 3615-3622, 1996.
    • (1996) J. Acoust. Soc. Amer. , vol.99 , Issue.6 , pp. 3615-3622
    • Dau, T.1    Püschel, D.2    Kohlrausch, A.3
  • 17
    • 79952871923 scopus 로고    scopus 로고
    • Prediction of speech intelligibility based on an auditory preprocessing model
    • C. Christiansen, M. S. Pedersen, and T. Dau, "Prediction of speech intelligibility based on an auditory preprocessing model," Speech Commun., no. 52, pp. 678-692, 2010.
    • (2010) Speech Commun. , Issue.52 , pp. 678-692
    • Christiansen, C.1    Pedersen, M.S.2    Dau, T.3
  • 18
    • 84897935512 scopus 로고    scopus 로고
    • A simple correlation-based model of intelligibility for nonlinear speech enhancement and separation
    • J. B. Boldt and D. P. W. Ellis, "A simple correlation-based model of intelligibility for nonlinear speech enhancement and separation," in Proc. EUSIPCO, Glasgow, U.K., Aug. 2009.
    • Proc. EUSIPCO, Glasgow, U.K., Aug. 2009
    • Boldt, J.B.1    Ellis, D.P.W.2
  • 21
    • 84865783312 scopus 로고    scopus 로고
    • Subjective and objective evaluation of speech intelligibility enhancement under constant energy and duration constraints
    • Y. Tang and M. Cooke, "Subjective and objective evaluation of speech intelligibility enhancement under constant energy and duration constraints," in Proc. Interspeech, Florence, Italy, Aug. 2011.
    • Proc. Interspeech, Florence, Italy, Aug. 2011
    • Tang, Y.1    Cooke, M.2
  • 23
    • 84897934183 scopus 로고    scopus 로고
    • Auditory perception of nonlinear distortion - Theory
    • presented at the unpublished
    • E. R. Geddes and L. W. Lee, "Auditory perception of nonlinear distortion - Theory," presented at the 115th Conv. Audio Eng. Soc., Oct. 2003, unpublished.
    • 115th Conv. Audio Eng. Soc., Oct. 2003
    • Geddes, E.R.1    Lee, L.W.2
  • 25
    • 84856043672 scopus 로고
    • A mathematical theory of communication
    • C. E. Shannon, "A mathematical theory of communication," Bell Syst. Tech. J., vol. 27, pp. 379-423, 1948.
    • (1948) Bell Syst. Tech. J. , vol.27 , pp. 379-423
    • Shannon, C.E.1
  • 26
    • 84897937155 scopus 로고    scopus 로고
    • M.S. thesis, Dept. of Math. and Statist., Queen's Univ., Kingston, ON, Canada, Sep.
    • S. Lu, "Measuring dependence via mutual information," M.S. thesis, Dept. of Math. and Statist., Queen's Univ., Kingston, ON, Canada, Sep. 2011.
    • (2011) Measuring Dependence Via Mutual Information
    • Lu, S.1
  • 27
    • 69049111920 scopus 로고    scopus 로고
    • Estimation of mutual information: A survey
    • J. Walters-Williams and Y. Li, "Estimation of mutual information: A survey," Lecture Notes in Comput. Sci., vol. 5589, pp. 389-396, 2009.
    • (2009) Lecture Notes in Comput. Sci. , vol.5589 , pp. 389-396
    • Walters-Williams, J.1    Li, Y.2
  • 29
    • 39749164774 scopus 로고    scopus 로고
    • Estimating mutual information
    • 066138
    • A. Kraskov, H. Stögbauer, and P. Grassberger, "Estimating mutual information," Phys. Rev. E, vol. 69, no. 6, pp. 1-16, 2004, 066138.
    • (2004) Phys. Rev. E , vol.69 , Issue.6 , pp. 1-16
    • Kraskov, A.1    Stögbauer, H.2    Grassberger, P.3
  • 31
    • 0023325560 scopus 로고
    • Sample estimate of entropy of a random vector
    • L. F. Kozachenko and N. N. Leonenko, "Sample estimate of entropy of a random vector," Probl. Inf. Transm., vol. 23, pp. 95-101, 1987.
    • (1987) Probl. Inf. Transm. , vol.23 , pp. 95-101
    • Kozachenko, L.F.1    Leonenko, N.N.2
  • 34
    • 0038785451 scopus 로고    scopus 로고
    • Ph.D. dissertation, Inst. für Nachrichtengeräte und Datenverarbeitung, Rheinisch-Westfälische Technische Hochschule Aachen, Aachen, Germany
    • P. Jax, "Enhancement of band limited speech signals: Algorithms and theoretical bounds," Ph.D. dissertation, Inst. für Nachrichtengeräte und Datenverarbeitung, Rheinisch-Westfälische Technische Hochschule Aachen, Aachen, Germany, 2002.
    • (2002) Enhancement of Band Limited Speech Signals: Algorithms and Theoretical Bounds
    • Jax, P.1
  • 35
    • 0035306657 scopus 로고    scopus 로고
    • Mutual information theory for adaptive mixture models
    • Apr.
    • Z. R. Yang and M. Zwolinski, "Mutual information theory for adaptive mixture models," IEEE Trans. Pattern Anal. Mach. Intell., vol. 23, no. 4, pp. 561-403, Apr. 2001.
    • (2001) IEEE Trans. Pattern Anal. Mach. Intell. , vol.23 , Issue.4 , pp. 561-1403
    • Yang, Z.R.1    Zwolinski, M.2
  • 36
    • 63649092788 scopus 로고    scopus 로고
    • The estimating optimal number of Gaussian mixtures based on incremental k-means for speaker identification
    • Y. Lee, K. Y. Lee, and J. Lee, "The estimating optimal number of Gaussian mixtures based on incremental k-means for speaker identification," Int. J. Inf. Technol., vol. 12, no. 7, pp. 13-21, 2006.
    • (2006) Int. J. Inf. Technol. , vol.12 , Issue.7 , pp. 13-21
    • Lee, Y.1    Lee, K.Y.2    Lee, J.3
  • 37
    • 84898934543 scopus 로고    scopus 로고
    • Variational inference for bayesian mixtures of factor analysers
    • MIT Press
    • Z. Ghahramani and M. J. Beal, "Variational inference for bayesian mixtures of factor analysers," Adv. Neural Inf. Process. Syst. MIT Press, vol. 12, pp. 449-455, 2000.
    • (2000) Adv. Neural Inf. Process. Syst. , vol.12 , pp. 449-455
    • Ghahramani, Z.1    Beal, M.J.2
  • 38
    • 70349161218 scopus 로고    scopus 로고
    • Role of mask pattern in intelligibility of ideal binary-masked noisy speech
    • U. Kjems, J. B. Boldt, M. S. Pedersen, T. Lunner, and D. Wang, "Role of mask pattern in intelligibility of ideal binary-masked noisy speech," J. Acoust. Soc. Amer., vol. 126, no. 3, pp. 1415-1426, 2009.
    • (2009) J. Acoust. Soc. Amer. , vol.126 , Issue.3 , pp. 1415-1426
    • Kjems, U.1    Boldt, J.B.2    Pedersen, M.S.3    Lunner, T.4    Wang, D.5
  • 40
    • 0037504237 scopus 로고    scopus 로고
    • Design, optimization and evaluation of a Danish sentence test in noise
    • K. Wagener, J. L. Josvassen, and R. Ardenkjaer, "Design, optimization and evaluation of a Danish sentence test in noise," Int. J. Audiol., vol. 42, no. 1, pp. 10-17, 2003. (Pubitemid 37372682)
    • (2003) International Journal of Audiology , vol.42 , Issue.1 , pp. 10-17
    • Wagener, K.1    Josvassen, J.L.2    Ardenkjaer, R.3
  • 41
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    • Dec.
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 42
    • 51449104842 scopus 로고    scopus 로고
    • Minimum mean-square error estimation of discrete Fourier coefficients with generalized gamma priors
    • Aug.
    • J. S. Erkelens, R. C. Hendriks, R. Heusdens, and J. Jensen, "Minimum mean-square error estimation of discrete Fourier coefficients with generalized gamma priors," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1741-1752, Aug. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.6 , pp. 1741-1752
    • Erkelens, J.S.1    Hendriks, R.C.2    Heusdens, R.3    Jensen, J.4
  • 44
    • 33645998440 scopus 로고    scopus 로고
    • Applied principles of clear and Lombard speech for automated intelligibility enhancement in noisy environments
    • M. Skowronski and J. Harris, "Applied principles of clear and Lombard speech for automated intelligibility enhancement in noisy environments," Speech Commun., vol. 48, no. 5, pp. 549-558, 2006.
    • (2006) Speech Commun. , vol.48 , Issue.5 , pp. 549-558
    • Skowronski, M.1    Harris, J.2
  • 47
    • 0002282074 scopus 로고
    • A new measure of rank correlation
    • M. G. Kendall, "A new measure of rank correlation," Biometrika, vol. 30, no. 1/2, pp. 81-93, 1938.
    • (1938) Biometrika , vol.30 , Issue.1-2 , pp. 81-93
    • Kendall, M.G.1
  • 48
    • 84155164651 scopus 로고    scopus 로고
    • Improving objective intelligibility prediction by combining correlation and coherence based methods with a measure based on the negative distortion ratio
    • M. Gómez, B. Schwerin, and K. Paliwal, "Improving objective intelligibility prediction by combining correlation and coherence based methods with a measure based on the negative distortion ratio," Speech Commun., vol. 54, pp. 503-515, 2012.
    • (2012) Speech Commun. , vol.54 , pp. 503-515
    • Gómez, M.1    Schwerin, B.2    Paliwal, K.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.