메뉴 건너뛰기




Volumn 53, Issue 3, 2011, Pages 327-339

Role of modulation magnitude and phase spectrum towards speech intelligibility

Author keywords

Analysis frame duration; Analysis modification synthesis (AMS); Modulation domain; Modulation frame duration; Modulation magnitude spectrum; Modulation phase spectrum; Speech intelligibility; Speech transmission index (STI)

Indexed keywords

ACOUSTIC DOMAINS; ANALYSIS FRAME DURATION; ANALYSIS-MODIFICATION-SYNTHESIS (AMS); DOMAIN PROCESSING; MAGNITUDE SPECTRUM; PHASE INFORMATION; PHASE SPECTRA; RELATIVE CONTRIBUTION; SPECTRAL COMPONENTS; SPEECH TRANSMISSION INDEX (STI);

EID: 79551488220     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2010.10.004     Document Type: Article
Times cited : (25)

References (39)
  • 2
    • 0035765498 scopus 로고    scopus 로고
    • Modulation frequency and efficient audio coding
    • Atlas, L.; Vinton, M.; 2001. Modulation frequency and efficient audio coding. In: Proc. SPIE Internat. Soc. Opt. Eng.; Vol. 4474, pp. 1-8.
    • (2001) Proc. SPIE Internat. Soc. Opt. Eng. , vol.4474 , pp. 1-8
    • Atlas, L.1    Vinton, M.2
  • 3
    • 0028287770 scopus 로고
    • Effect of reducing slow temporal modulations on speech reception
    • R. Drullman, J. Festen, and R. Plomp Effect of reducing slow temporal modulations on speech reception J. Acoust. Soc. Amer. 95 5 1994 2670 2680
    • (1994) J. Acoust. Soc. Amer. , vol.95 , Issue.5 , pp. 2670-2680
    • Drullman, R.1    Festen, J.2    Plomp, R.3
  • 6
    • 70449360175 scopus 로고    scopus 로고
    • Modulation spectral features for robust far-field speaker identification
    • T.H. Falk, and W.-Y. Chan Modulation spectral features for robust far-field speaker identification IEEE Trans. Audio Speech Lang. Process. 18 1 2010 90 100
    • (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , Issue.1 , pp. 90-100
    • Falk, T.H.1    Chan, W.-Y.2
  • 7
    • 77955707186 scopus 로고    scopus 로고
    • A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech
    • T.H. Falk, C. Zheng, and W.-Y. Chan A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech IEEE Trans. Audio Speech Lang. Process. 18 7 2010 1766 1774
    • (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , Issue.7 , pp. 1766-1774
    • Falk, T.H.1    Zheng, C.2    Chan, W.-Y.3
  • 8
    • 11144348189 scopus 로고    scopus 로고
    • Analysis of speech-based speech transmission index methods with implications for nonlinear operations
    • R. Goldsworthy, and J. Greenberg Analysis of speech-based speech transmission index methods with implications for nonlinear operations J. Acoust. Soc. Amer. 116 6 2004 3679 3689
    • (2004) J. Acoust. Soc. Amer. , vol.116 , Issue.6 , pp. 3679-3689
    • Goldsworthy, R.1    Greenberg, J.2
  • 9
    • 85128367018 scopus 로고    scopus 로고
    • Speech intelligibility derived from exceedingly sparse spectral information
    • Sydney, Australia
    • Greenberg, S.; Arai, T.; Silipo, R.; 1998. Speech intelligibility derived from exceedingly sparse spectral information. In: Proc. Internat. Conf. Spoken Lang. Process. (ICSLP), Vol. 6, Sydney, Australia, pp. 2803-2806.
    • (1998) Proc. Internat. Conf. Spoken Lang. Process. (ICSLP) , vol.6 , pp. 2803-2806
    • Greenberg, S.1    Arai, T.2    Silipo, R.3
  • 11
    • 0021407831 scopus 로고
    • Signal estimation from modified short-time Fourier transform
    • D. Griffin, and J. Lim Signal estimation from modified short-time Fourier transform IEEE Trans. Acoust. Speech Signal Process. ASSP-32 2 1984 236 243
    • (1984) IEEE Trans. Acoust. Speech Signal Process. , vol.32 , Issue.2 , pp. 236-243
    • Griffin, D.1    Lim, J.2
  • 12
    • 0342728435 scopus 로고
    • Subband or cepstral domain filtering for recognition of lombard and channel-distorted speech
    • Minneapolis, MN, USA
    • Hanson, B.; Applebaum, T.; 1993. Subband or cepstral domain filtering for recognition of lombard and channel-distorted speech. In: Proc. IEEE Internat. Conf. Acoust. Speech Signal Process. (ICASSP), Vol. 2, Minneapolis, MN, USA, pp. 79-82.
    • (1993) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process. (ICASSP) , vol.2 , pp. 79-82
    • Hanson, B.1    Applebaum, T.2
  • 13
    • 84873312246 scopus 로고
    • A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria
    • T. Houtgast, and H. Steeneken A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria J. Acoust. Soc. Amer. 77 3 1985 1069 1077
    • (1985) J. Acoust. Soc. Amer. , vol.77 , Issue.3 , pp. 1069-1077
    • Houtgast, T.1    Steeneken, H.2
  • 14
    • 34447092407 scopus 로고    scopus 로고
    • Subjective comparison and evaluation of speech enhancement algorithms
    • Y. Hu, and P.C. Loizou Subjective comparison and evaluation of speech enhancement algorithms Speech Comm. 49 7-8 2007 588 601
    • (2007) Speech Comm. , vol.49 , Issue.78 , pp. 588-601
    • Hu, Y.1    Loizou, P.C.2
  • 17
    • 4744344338 scopus 로고    scopus 로고
    • A cue for objective speech quality estimation in temporal envelope representations
    • D. Kim A cue for objective speech quality estimation in temporal envelope representations IEEE Signal Process. Lett. 11 10 2004 849 852
    • (2004) IEEE Signal Process. Lett. , vol.11 , Issue.10 , pp. 849-852
    • Kim, D.1
  • 18
    • 27644596289 scopus 로고    scopus 로고
    • Anique: An auditory model for single-ended speech quality estimation
    • D. Kim Anique: an auditory model for single-ended speech quality estimation IEEE Trans. Speech Audio Process. 13 5 2005 821 831
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 821-831
    • Kim, D.1
  • 19
    • 0032136330 scopus 로고    scopus 로고
    • Robust speech recognition using the modulation spectrogram
    • B. Kingsbury, N. Morgan, and S. Greenberg Robust speech recognition using the modulation spectrogram Speech Comm. 25 1-3 1998 117 132
    • (1998) Speech Comm. , vol.25 , Issue.13 , pp. 117-132
    • Kingsbury, B.1    Morgan, N.2    Greenberg, S.3
  • 20
    • 0031220487 scopus 로고    scopus 로고
    • Effects of phase on the perception of intervocalic stop consonants
    • L. Liu, J. He, and G. Palm Effects of phase on the perception of intervocalic stop consonants Speech Comm. 22 4 1997 403 417
    • (1997) Speech Comm. , vol.22 , Issue.4 , pp. 403-417
    • Liu, L.1    He, J.2    Palm, G.3
  • 22
    • 84867218794 scopus 로고    scopus 로고
    • Effect of compressing the dynamic range of the power spectrum in modulation filtering based speech enhancement
    • Brisbane, Australia
    • Lyons, J.; Paliwal, K.; 2008. Effect of compressing the dynamic range of the power spectrum in modulation filtering based speech enhancement. In: Proc. ISCA Conf. Internat. Speech Comm. Assoc. (INTERSPEECH), Brisbane, Australia, pp. 387-390.
    • (2008) Proc. ISCA Conf. Internat. Speech Comm. Assoc. (INTERSPEECH) , pp. 387-390
    • Lyons, J.1    Paliwal, K.2
  • 23
    • 0019569248 scopus 로고
    • The importance of phase in signals
    • A.V. Oppenheim, and J.S. Lim The importance of phase in signals Proc. IEEE 69 5 1981 529 541
    • (1981) Proc. IEEE , vol.69 , Issue.5 , pp. 529-541
    • Oppenheim, A.V.1    Lim, J.S.2
  • 24
    • 13544259544 scopus 로고    scopus 로고
    • On the usefulness of STFT phase spectrum in human listening tests
    • K. Paliwal, and L. Alsteris On the usefulness of STFT phase spectrum in human listening tests Speech Comm. 45 2 2005 153 170
    • (2005) Speech Comm. , vol.45 , Issue.2 , pp. 153-170
    • Paliwal, K.1    Alsteris, L.2
  • 25
    • 67650143408 scopus 로고    scopus 로고
    • Effect of analysis window duration on speech intelligibility
    • K. Paliwal, and K. Wójcicki Effect of analysis window duration on speech intelligibility IEEE Signal Process. Lett. 15 2008 785 788
    • (2008) IEEE Signal Process. Lett. , vol.15 , pp. 785-788
    • Paliwal, K.1    Wójcicki, K.2
  • 27
    • 77949911656 scopus 로고    scopus 로고
    • Single-channel speech enhancement using spectral subtraction in the short-time modulation domain
    • K. Paliwal, K. Wójcicki, and B. Schwerin Single-channel speech enhancement using spectral subtraction in the short-time modulation domain Speech Comm. 52 5 2010 450 475
    • (2010) Speech Comm. , vol.52 , Issue.5 , pp. 450-475
    • Paliwal, K.1    Wójcicki, K.2    Schwerin, B.3
  • 28
    • 0032784372 scopus 로고    scopus 로고
    • A method to determine the speech transmission index from speech waveforms
    • K. Payton, and L. Braida A method to determine the speech transmission index from speech waveforms J. Acoust. Soc. Amer. 106 6 1999 3637 3648
    • (1999) J. Acoust. Soc. Amer. , vol.106 , Issue.6 , pp. 3637-3648
    • Payton, K.1    Braida, L.2
  • 29
    • 0027659197 scopus 로고
    • Signal modeling techniques in speech recognition
    • J. Picone Signal modeling techniques in speech recognition Proc. IEEE 81 9 1993 1215 1247
    • (1993) Proc. IEEE , vol.81 , Issue.9 , pp. 1215-1247
    • Picone, J.1
  • 32
    • 10444286831 scopus 로고    scopus 로고
    • Perceptual Evaluation of Speech Quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs
    • Rix, A.; Beerends, J.; Hollier, M.; Hekstra, A.; 2001. Perceptual Evaluation of Speech Quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs. ITU-T Recommendation, P. 862.
    • (2001) ITU-T Recommendation , pp. 862
    • Rix, A.1    Beerends, J.2    Hollier, M.3    Hekstra, A.4
  • 33
    • 0016555855 scopus 로고
    • Models of hearing
    • M. Schroeder Models of hearing Proc. IEEE 63 9 1975 1332 1350
    • (1975) Proc. IEEE , vol.63 , Issue.9 , pp. 1332-1350
    • Schroeder, M.1
  • 34
    • 0018906941 scopus 로고
    • A physical method for measuring speech-transmission quality
    • H. Steeneken, and T. Houtgast A physical method for measuring speech-transmission quality J. Acoust. Soc. Amer. 67 1 1980 318 326
    • (1980) J. Acoust. Soc. Amer. , vol.67 , Issue.1 , pp. 318-326
    • Steeneken, H.1    Houtgast, T.2
  • 35
    • 0141520589 scopus 로고    scopus 로고
    • A non-uniform modulation transform for audio coding with increased time resolution
    • Hong Kong
    • Thompson, J.; Atlas, L.; 2003. A non-uniform modulation transform for audio coding with increased time resolution. In: Proc. IEEE Internat. Conf. Acoust. Speech Signal Process. (ICASSP), Vol. 5, Hong Kong, pp. 397-400.
    • (2003) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process. (ICASSP) , vol.5 , pp. 397-400
    • Thompson, J.1    Atlas, L.2
  • 37
    • 0020167383 scopus 로고
    • The unimportance of phase in speech enhancement
    • D. Wang, and J. Lim The unimportance of phase in speech enhancement IEEE Trans. Acoust. Speech Signal Process. ASSP-30 4 1982 679 681
    • (1982) IEEE Trans. Acoust. Speech Signal Process. , vol.30 , Issue.4 , pp. 679-681
    • Wang, D.1    Lim, J.2
  • 38
    • 34547500071 scopus 로고    scopus 로고
    • Importance of the dynamic range of an analysis window function for phase-only and magnitude-only reconstruction of speech
    • Honolulu, Hawaii, USA
    • Wójcicki, K.; Paliwal, K.; 2007. Importance of the dynamic range of an analysis window function for phase-only and magnitude-only reconstruction of speech. In: Proc. IEEE Internat. Conf. Acoust. Speech Signal Process. (ICASSP), Vol. 4, Honolulu, Hawaii, USA, pp. 729-732.
    • (2007) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process. (ICASSP) , vol.4 , pp. 729-732
    • Wójcicki, K.1    Paliwal, K.2
  • 39
    • 70449580752 scopus 로고    scopus 로고
    • Automatic recognition of speech emotion using long-term spectro-temporal features
    • Wu, S.; Falk, T.; Chan, W.-Y.; 2009. Automatic recognition of speech emotion using long-term spectro-temporal features. In: Internat. Conf. Digital Signal Process.
    • (2009) Internat. Conf. Digital Signal Process
    • Wu, S.1    Falk, T.2    Chan W., .-Y.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.