메뉴 건너뛰기




Volumn 18, Issue 7, 2010, Pages 1766-1774

A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech

Author keywords

Coloration; dereverberation; modulation spectrum; quality diagnosis; reverberation

Indexed keywords

COLORATION; DEREVERBERATION; ENERGY RATIO; MODULATION SPECTRUM; MULTIPLE DIMENSIONS; NON-INTRUSIVE; QUALITY DIAGNOSIS; QUALITY MEASUREMENTS; SPECTRAL REPRESENTATIONS; SPEECH SIGNALS; STANDARD ALGORITHMS; TEMPORAL ENVELOPES;

EID: 77955707186     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2052247     Document Type: Article
Times cited : (360)

References (46)
  • 1
    • 4243594972 scopus 로고
    • Normal listeners in typical rooms-Reverberation perception, simulation, and reduction
    • Baltimore, MD: University Park Press
    • D. Berkley, "Normal listeners in typical rooms-Reverberation perception, simulation, and reduction," in Acoustical Factors Affecting Hearing Aid Performance. Baltimore, MD: University Park Press, 1980, pp. 3-24.
    • (1980) Acoustical Factors Affecting Hearing Aid Performance , pp. 3-24
    • Berkley, D.1
  • 2
    • 77949422912 scopus 로고    scopus 로고
    • Sound coloration from (very) early reflections
    • Jun.
    • T. Halmrast, "Sound coloration from (very) early reflections," in Proc. Meeting Acoust. Soc. Amer., Jun. 2001.
    • (2001) Proc. Meeting Acoust. Soc. Amer.
    • Halmrast, T.1
  • 4
    • 78651441188 scopus 로고    scopus 로고
    • Speech enhancement: Dereverberation
    • New York: Springer
    • Y. Huang, J. Benesty, and J. Chen, "Speech enhancement: Dereverberation," in Handbook of Speech Processing. New York: Springer, 2008, pp. 929-943.
    • (2008) Handbook of Speech Processing , pp. 929-943
    • Huang, Y.1    Benesty, J.2    Chen, J.3
  • 5
    • 0003450846 scopus 로고    scopus 로고
    • Methods for subjective determination of transmission quality
    • ITU-T P.800
    • ITU-T P.800, "Methods for subjective determination of transmission quality," Int. Telecom. Union, 1996.
    • (1996) Int. Telecom. Union
  • 7
    • 4544265401 scopus 로고    scopus 로고
    • Perceptual evaluation of speech quality: An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs
    • ITU-T P.862
    • ITU-T P.862, "Perceptual evaluation of speech quality: An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs," Int. Telecom. Union, 2001.
    • (2001) Int. Telecom. Union
  • 8
    • 27844456199 scopus 로고    scopus 로고
    • Single-ended method for objective speech quality assessment in narrowband telephony applications
    • ITU-T P.563
    • ITU-T P.563, Single-ended method for objective speech quality assessment in narrowband telephony applications Int. Telecom. Union, 2004.
    • (2004) Int. Telecom. Union
  • 9
    • 77955678362 scopus 로고    scopus 로고
    • Auditory non-intrusive quality estimation plus (ANIQUE+): Perceptual model for non-intrusive estimation of narrowband speech quality Amer
    • ATIS-PP-0100005.2006
    • ATIS-PP-0100005.2006, Auditory non-intrusive quality estimation plus (ANIQUE+): Perceptual model for non-intrusive estimation of narrowband speech quality Amer. National Standards Inst., 2006.
    • (2006) National Standards Inst.
  • 10
    • 77955665981 scopus 로고    scopus 로고
    • Sound system equipment. Objective rating of speech intelligibility by speech transmission index
    • BS EN 60268-60316:2003
    • BS EN 60268-60316:2003, Sound system equipment. Objective rating of speech intelligibility by speech transmission index British Standards Inst., 2003.
    • (2003) British Standards Inst.
  • 11
    • 0000008694 scopus 로고
    • An objective measure for predicting subjective quality of speechcoders
    • Jun.
    • S.Wang, A. Sekey, A. Gersho, T. Syst, and C. Berkeley, "An objective measure for predicting subjective quality of speechcoders," IEEE J. Sel. Areas Commun., vol.10, no.5, pp. 819-829, Jun. 1992.
    • (1992) IEEE J. Sel. Areas Commun. , vol.10 , Issue.5 , pp. 819-829
    • Wang, S.1    Sekey, A.2    Gersho, A.3    Syst, T.4    Berkeley, C.5
  • 13
    • 0018906941 scopus 로고
    • A physical method for measuring speech-transmission quality
    • H. Steeneken and T. Houtgast, "A physical method for measuring speech-transmission quality," J. Acoust. Soc. Amer., vol.67, p. 318, 1980.
    • (1980) J. Acoust. Soc. Amer. , vol.67 , pp. 318
    • Steeneken, H.1    Houtgast, T.2
  • 14
    • 0028287770 scopus 로고
    • Effect of reducing slow temporal modulations on speech reception
    • DOI 10.1121/1.409836
    • R. Drullman, J. Festen, and R. Plomp, "Effect of reducing slow temporal modulations on speech reception," J. Acoust. Soc. Amer., vol.95, no.5, pp. 2670-2680, May 1994. (Pubitemid 24152861)
    • (1994) Journal of the Acoustical Society of America , vol.95 , Issue.5 , pp. 2670-2680
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 15
    • 0032784372 scopus 로고    scopus 로고
    • A method to determine the speech transmission index from speech waveforms
    • K. Payton and L. Braida, "A method to determine the speech transmission index from speech waveforms," J. Acoust. Soc. Amer., vol.106, p. 3637, 1999.
    • (1999) J. Acoust. Soc. Amer. , vol.106 , pp. 3637
    • Payton, K.1    Braida, L.2
  • 16
    • 11144348189 scopus 로고    scopus 로고
    • Analysis of speech-based speech transmission index methods with implications for nonlinear operations
    • R. Goldsworthy and J. Greenberg, "Analysis of speech-based speech transmission index methods with implications for nonlinear operations," J. Acoust. Soc. Amer., vol.116, p. 3679, 2004.
    • (2004) J. Acoust. Soc. Amer. , vol.116 , pp. 3679
    • Goldsworthy, R.1    Greenberg, J.2
  • 18
    • 50249138241 scopus 로고    scopus 로고
    • Semantic coloration space investigation: Controlled coloration in the bark-sone domain
    • J. Wen and P. Naylor, "Semantic coloration space investigation: Controlled coloration in the bark-sone domain," in Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust., 2007, pp. 311-314.
    • (2007) Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. , pp. 311-314
    • Wen, J.1    Naylor, P.2
  • 20
  • 21
    • 70449360175 scopus 로고    scopus 로고
    • Modulation spectral features for robust far-field speaker identification
    • Jan.
    • T. H. Falk and W.-Y. Chan, "Modulation spectral features for robust far-field speaker identification," IEEE Trans. Audio, Speech, Lang. Process., vol.18, no.1, pp. 90-100, Jan. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.1 , pp. 90-100
    • Falk, T.H.1    Chan, W.-Y.2
  • 22
    • 0003913694 scopus 로고
    • An Efficient implementation of the Patterson-Holdsworth auditory filterbank
    • Tech. Rep.
    • M. Slaney, "An Efficient implementation of the Patterson-Holdsworth auditory filterbank," Apple Computer, 1993, Tech. Rep..
    • (1993) Apple Computer
    • Slaney, M.1
  • 23
    • 0025110885 scopus 로고
    • Derivation of auditory filter shapes from notched-noise data
    • DOI 10.1016/0378-5955(90)90170-T
    • B. Glasberg and B. Moore, "Derivation of auditory filter shapes from notched-noise data," Hear. Res., vol.47, no.1, pp. 103-138, 1990. (Pubitemid 20244652)
    • (1990) Hearing Research , vol.47 , Issue.1-2 , pp. 103-138
    • Glasberg, B.R.1    Moore, B.C.J.2
  • 25
    • 0029952425 scopus 로고    scopus 로고
    • A quantitative model of the effective signal processing in the auditory system. I-Model structure
    • T. Dau, D. Puschel, and A. Kohlrausch, "A quantitative model of the effective signal processing in the auditory system. I-Model structure," J. Acoust. Soc. Amer., vol.99, no.6, pp. 3615-3622, 1996.
    • (1996) J. Acoust. Soc. Amer. , vol.99 , Issue.6 , pp. 3615-3622
    • Dau, T.1    Puschel, D.2    Kohlrausch, A.3
  • 26
    • 4744344338 scopus 로고    scopus 로고
    • A cue for objective speech quality estimation in temporal envelope representation
    • Oct.
    • D.-S. Kim, "A cue for objective speech quality estimation in temporal envelope representation," IEEE Signal Process. Lett., vol.11, no.10, pp. 849-852, Oct. 2004.
    • (2004) IEEE Signal Process. Lett. , vol.11 , Issue.10 , pp. 849-852
    • Kim, D.-S.1
  • 27
    • 84873312246 scopus 로고
    • A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria
    • Mar.
    • T. Houtgast and H. Steeneken, "A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria," J. Acoust. Soc. Amer., vol.77, no.3, pp. 1069-1077, Mar. 1985.
    • (1985) J. Acoust. Soc. Amer. , vol.77 , Issue.3 , pp. 1069-1077
    • Houtgast, T.1    Steeneken, H.2
  • 28
    • 0030369532 scopus 로고    scopus 로고
    • Intelligibility of speech with filtered time trajectories of spectral envelopes
    • Oct.
    • T. Arai, M. Pavel, H. Hermansky, and C. Avendano, "Intelligibility of speech with filtered time trajectories of spectral envelopes," in Proc. Int. Conf. Speech Lang. Process., Oct. 1996, pp. 2490-2493.
    • (1996) Proc. Int. Conf. Speech Lang. Process. , pp. 2490-2493
    • Arai, T.1    Pavel, M.2    Hermansky, H.3    Avendano, C.4
  • 29
    • 27644596289 scopus 로고    scopus 로고
    • ANIQUE: An auditory model for single-ended speech quality estimation
    • Sep.
    • D.-S. Kim, "ANIQUE: An auditory model for single-ended speech quality estimation," IEEE Trans. Speech Audio Process., vol.13, no.5, pp. 821-831, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 821-831
    • Kim, D.-S.1
  • 30
    • 0037034899 scopus 로고    scopus 로고
    • Chimaeric sounds reveal dichotomies in auditory perception
    • Mar.
    • Z. Smith, B. Delgutte, and A. Oxenham, "Chimaeric sounds reveal dichotomies in auditory perception," Lett. Nature, vol.416, pp. 87-90, Mar. 2002.
    • (2002) Lett. Nature , vol.416 , pp. 87-90
    • Smith, Z.1    Delgutte, B.2    Oxenham, A.3
  • 31
    • 77949423782 scopus 로고    scopus 로고
    • Temporal dynamics for blind measurement of room acoustical parameters
    • Apr.
    • T. H. Falk and W.-Y. Chan, "Temporal dynamics for blind measurement of room acoustical parameters," IEEE Trans. Instrum. Meas., vol.59, no.4, pp. 978-989, Apr. 2010.
    • (2010) IEEE Trans. Instrum. Meas. , vol.59 , Issue.4 , pp. 978-989
    • Falk, T.H.1    Chan, W.-Y.2
  • 32
    • 77955706337 scopus 로고    scopus 로고
    • Spatial distribution of early reflections and speech intelligibility
    • Y. Oh, D. Jeong, S. Doo, H. Lee, C. Choi, L. Kim, and I. Ko, "Spatial distribution of early reflections and speech intelligibility, " J. Acoust. Soc. Amer., vol.109, pp. 2313-2314, 2001.
    • (2001) J. Acoust. Soc. Amer. , vol.109 , pp. 2313-2314
    • Oh, Y.1    Jeong, D.2    Doo, S.3    Lee, H.4    Choi, C.5    Kim, L.6    Ko, I.7
  • 33
    • 77955678081 scopus 로고
    • Sounds like An audio glossary
    • Jul.
    • J. Holt, "Sounds like An audio glossary," Stereophile Mag., vol.16, no.7, pp. 1-16, Jul. 1993.
    • (1993) Stereophile Mag. , vol.16 , Issue.7 , pp. 1-16
    • Holt, J.1
  • 34
    • 84866868685 scopus 로고    scopus 로고
    • Subjective test methodology for evaluating speech communication systems that include noise suppression algorithms
    • ITU-T P.835
    • ITU-T P.835, "Subjective test methodology for evaluating speech communication systems that include noise suppression algorithms," Int. Telecom. Union, 2003.
    • (2003) Int. Telecom. Union
  • 35
    • 67650143408 scopus 로고    scopus 로고
    • Effect of analysis window duration on speech intelligibility
    • K. Paliwal, K. Wojcicki, and K. Wheeler, "Effect of analysis window duration on speech intelligibility," IEEE Signal Process. Lett., vol.15, pp. 785-788, 2008.
    • (2008) IEEE Signal Process. Lett. , vol.15 , pp. 785-788
    • Paliwal, K.1    Wojcicki, K.2    Wheeler, K.3
  • 37
    • 0028831004 scopus 로고
    • Temporal envelope and fine structure cues for speech intelligibility
    • R. Drullman, "Temporal envelope and fine structure cues for speech intelligibility," J. Acoust. Soc. Amer., vol.97, p. 585, 1995.
    • (1995) J. Acoust. Soc. Amer. , vol.97 , pp. 585
    • Drullman, R.1
  • 38
    • 0036836688 scopus 로고    scopus 로고
    • Validation of the revised STI method
    • H. Steeneken and T. Houtgast, "Validation of the revised STI method," Speech Commun., vol.38, no.3-4, pp. 413-425, 2002.
    • (2002) Speech Commun. , vol.38 , Issue.3-4 , pp. 413-425
    • Steeneken, H.1    Houtgast, T.2
  • 39
    • 33646529353 scopus 로고    scopus 로고
    • Reverberation times and speech transmission indices in classrooms
    • S. Tang and M. Yeung, "Reverberation times and speech transmission indices in classrooms," J. Sound Vibr., vol.294, no.3, pp. 596-607, 2006.
    • (2006) J. Sound Vibr. , vol.294 , Issue.3 , pp. 596-607
    • Tang, S.1    Yeung, M.2
  • 40
    • 33745836760 scopus 로고    scopus 로고
    • Wideband extension to Rec. P.862 for the assessment of wideband telephone networks and speech codecs
    • ITU-T P.862.2
    • ITU-T P.862.2, "Wideband extension to Rec. P.862 for the assessment of wideband telephone networks and speech codecs," Int. Telecom. Union, 2007.
    • (2007) Int. Telecom. Union
  • 41
    • 84866844246 scopus 로고    scopus 로고
    • Application guide for objective quality measurement based on recommendations P.862, P.862.1 and P.862.2
    • ITU-T P.862.3
    • ITU-T P.862.3, "Application guide for objective quality measurement based on recommendations P.862, P.862.1 and P.862.2," Int. Telecom. Union, 2005.
    • (2005) Int. Telecom. Union
  • 43
    • 39649083007 scopus 로고    scopus 로고
    • P.563-The ITU-T standard for single-ended speech quality assessment
    • Nov.
    • L. Malfait, J. Berger, and M. Kastner, "P.563-The ITU-T standard for single-ended speech quality assessment," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.6, pp. 1924-1934, Nov. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.6 , pp. 1924-1934
    • Malfait, L.1    Berger, J.2    Kastner, M.3
  • 44
    • 34748832326 scopus 로고    scopus 로고
    • Objective assessment of speech and audio quality-Technology and applications
    • Nov.
    • A. Rix, J. Beerends, D.-S. Kim, P. Kroon, and O. Ghitza, "Objective assessment of speech and audio quality-Technology and applications," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.6, pp. 1890-1901, Nov. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.6 , pp. 1890-1901
    • Rix, A.1    Beerends, J.2    Kim, D.-S.3    Kroon, P.4    Ghitza, O.5
  • 45
    • 34547286393 scopus 로고    scopus 로고
    • ANIQUE+: A new American national standard for non-intrusive estimation of narrowband speech quality
    • May
    • D.-S. Kim and A. Tarraf, "ANIQUE+: A new American national standard for non-intrusive estimation of narrowband speech quality," Bell Labs Tech. J., vol.12, no.1, pp. 221-236, May 2007.
    • (2007) Bell Labs Tech. J. , vol.12 , Issue.1 , pp. 221-236
    • Kim, D.-S.1    Tarraf, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.