메뉴 건너뛰기




Volumn 55, Issue 4, 2013, Pages 572-585

Evaluating the intelligibility benefit of speech modifications in known noise conditions

Author keywords

Speech intelligibility; Speech modification; Synthetic speech

Indexed keywords

HIGH NOISE LEVELS; INTENSITY CHANGE; MESSAGE RECEPTION; MODIFICATION METHODS; NATURAL SPEECH; NOISE CONDITIONS; SPEECH MASKERS; SYNTHETIC SPEECH;

EID: 84875231469     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2013.01.001     Document Type: Article
Times cited : (110)

References (56)
  • 2
    • 0027081884 scopus 로고
    • Frequency-important functions for words in high- and low context sentences
    • S.T. Bell, D.D. Dirks, and T.D. Trine Frequency-important functions for words in high- and low context sentences J. Speech Hear. Res. 35 1992 950 959
    • (1992) J. Speech Hear. Res. , vol.35 , pp. 950-959
    • Bell, S.T.1    Dirks, D.D.2    Trine, T.D.3
  • 3
    • 84937998754 scopus 로고
    • Audio dynamic range compression for minimum perceived distortion
    • B.A. Blesser Audio dynamic range compression for minimum perceived distortion IEEE Trans. Audio Electroacoust. 17 1969
    • (1969) IEEE Trans. Audio Electroacoust. , vol.17
    • Blesser, B.A.1
  • 4
    • 4444257069 scopus 로고    scopus 로고
    • Praat, a system for doing phonetics by computer
    • P. Boersma Praat, a system for doing phonetics by computer Glot Internat. 5 2001 341 435
    • (2001) Glot Internat. , vol.5 , pp. 341-435
    • Boersma, P.1
  • 5
    • 34047272820 scopus 로고    scopus 로고
    • Semantic and phonetic enhancements for speech-in-noise recognition by native and non-native listeners
    • A.R. Bradlow, and J.A. Alexander Semantic and phonetic enhancements for speech-in-noise recognition by native and non-native listeners J. Acoust. Soc. Amer. 121 2007 2339 2349
    • (2007) J. Acoust. Soc. Amer. , vol.121 , pp. 2339-2349
    • Bradlow, A.R.1    Alexander, J.A.2
  • 6
    • 79952871923 scopus 로고    scopus 로고
    • Prediction of speech intelligibility based on an auditory preprocessing model
    • C. Christiansen, M.S. Pedersen, and T. Dau Prediction of speech intelligibility based on an auditory preprocessing model Speech Comm. 52 2010 678 692
    • (2010) Speech Comm. , vol.52 , pp. 678-692
    • Christiansen, C.1    Pedersen, M.S.2    Dau, T.3
  • 7
    • 33644661135 scopus 로고    scopus 로고
    • A glimpsing model of speech perception in noise
    • M. Cooke A glimpsing model of speech perception in noise J. Acoust. Soc. Amer. 119 2006 1562 1573
    • (2006) J. Acoust. Soc. Amer. , vol.119 , pp. 1562-1573
    • Cooke, M.1
  • 8
    • 79952200604 scopus 로고    scopus 로고
    • Spectral and temporal changes to speech produced in the presence of energetic and informational maskers
    • M. Cooke, and Y. Lu Spectral and temporal changes to speech produced in the presence of energetic and informational maskers J. Acoust. Soc. Amer. 128 2010 2059 2069
    • (2010) J. Acoust. Soc. Amer. , vol.128 , pp. 2059-2069
    • Cooke, M.1    Lu, Y.2
  • 9
    • 0041538788 scopus 로고
    • Effects of ambient noise on speaker intelligibility for words and phrases
    • J. Dreher, and J. O'Neill Effects of ambient noise on speaker intelligibility for words and phrases J. Acoust. Soc. Amer. 29 1957 1320 1323
    • (1957) J. Acoust. Soc. Amer. , vol.29 , pp. 1320-1323
    • Dreher, J.1    O'Neill, J.2
  • 10
    • 0034920512 scopus 로고    scopus 로고
    • ICRA noises: Artificial noise signals with speech-like spectral and temporal properties for hearing aid assessment
    • W.A. Dreschler, H. Verschuure, C. Ludvigsen, and S. Westermann ICRA noises: Artificial noise signals with speech-like spectral and temporal properties for hearing aid assessment Audiology 40 2001 148 157
    • (2001) Audiology , vol.40 , pp. 148-157
    • Dreschler, W.A.1    Verschuure, H.2    Ludvigsen, C.3    Westermann, S.4
  • 11
    • 84878388202 scopus 로고    scopus 로고
    • Implementation of simple spectral techniques to enhance the intelligibility of speech using a harmonic model
    • Portland, USA
    • Erro, D., Stylianou, Y., Navas, E., Hernaez, I., 2012. Implementation of simple spectral techniques to enhance the intelligibility of speech using a harmonic model. In: Proc. Interspeech, Portland, USA.
    • (2012) Proc. Interspeech
    • Erro, D.1    Stylianou, Y.2    Navas, E.3    Hernaez, I.4
  • 12
    • 0025259936 scopus 로고
    • Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing
    • J. Festen, and R. Plomp Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing J. Acoust. Soc. Amer. 88 1990 1725 1736
    • (1990) J. Acoust. Soc. Amer. , vol.88 , pp. 1725-1736
    • Festen, J.1    Plomp, R.2
  • 13
    • 82255167329 scopus 로고    scopus 로고
    • Acoustic-phonetic characteristics of speech produced with communicative intent to counter adverse listening conditions
    • V. Hazan, and R. Baker Acoustic-phonetic characteristics of speech produced with communicative intent to counter adverse listening conditions J. Acoust. Soc. Amer. 130 2011 2139 2152
    • (2011) J. Acoust. Soc. Amer. , vol.130 , pp. 2139-2152
    • Hazan, V.1    Baker, R.2
  • 14
    • 84937272460 scopus 로고    scopus 로고
    • Cue-enhancement strategies for natural VCV and sentence materials presented in noise
    • V. Hazan, and A. Simpson Cue-enhancement strategies for natural VCV and sentence materials presented in noise Speech Hear. Lang. 9 1996 43 55
    • (1996) Speech Hear. Lang. , vol.9 , pp. 43-55
    • Hazan, V.1    Simpson, A.2
  • 16
    • 33646462919 scopus 로고    scopus 로고
    • Strength of British English accents in altered listening conditions
    • P. Howell, W. Barry, and D. Vinson Strength of British English accents in altered listening conditions Percept. Psychophys. 68 2006 139 153
    • (2006) Percept. Psychophys. , vol.68 , pp. 139-153
    • Howell, P.1    Barry, W.2    Vinson, D.3
  • 17
    • 84875216420 scopus 로고    scopus 로고
    • Lombard effect mimicking
    • Kyoto, Japan
    • Huang, D.Y., Rahardja, S., Ong, E.P., 2010. Lombard effect mimicking. In: Proc. SSW7, Kyoto, Japan, pp. 258-263.
    • (2010) Proc. SSW7 , pp. 258-263
    • Huang, D.Y.1    Rahardja, S.2    Ong, E.P.3
  • 18
    • 4143097845 scopus 로고    scopus 로고
    • Signal processing for hearing aids
    • M. Kahrs, K. Brandenberg, Kluwer Academic Publishers Boston
    • J.M. Kates Signal processing for hearing aids M. Kahrs, K. Brandenberg, Applications of Digital Signal Processing to Audio and Acoustics 1998 Kluwer Academic Publishers Boston 235 277
    • (1998) Applications of Digital Signal Processing to Audio and Acoustics , pp. 235-277
    • Kates, J.M.1
  • 19
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. Cheveigné Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds Speech Comm. 27 1999 187 207
    • (1999) Speech Comm. , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigné, A.3
  • 20
    • 33646819304 scopus 로고    scopus 로고
    • Improving the understandability of speech synthesis by modeling speech in noise
    • Langner, B., Black, A.W., 2005. Improving the understandability of speech synthesis by modeling speech in noise. In: Proc. ICASSP, pp. 265-268.
    • (2005) Proc. ICASSP , pp. 265-268
    • Langner, B.1    Black, A.W.2
  • 21
    • 0000665734 scopus 로고
    • Explaining phonetic variation: A sketch of the H&H theory
    • W.J. Hardcastle, A. Marchal, Kluwer Academic Publishers
    • B. Lindblom Explaining phonetic variation: A sketch of the H&H theory W.J. Hardcastle, A. Marchal, Speech Production and Speech Modelling 1990 Kluwer Academic Publishers 403 439
    • (1990) Speech Production and Speech Modelling , pp. 403-439
    • Lindblom, B.1
  • 22
    • 0000874053 scopus 로고
    • Le signe d'élévation de la voix (the sign of the elevation of the voice)
    • E. Lombard Le signe d'élévation de la voix (the sign of the elevation of the voice) Annales des maladies de l'oreille et du larynx 37 1911 101 119
    • (1911) Annales des Maladies de l'Oreille et du Larynx , vol.37 , pp. 101-119
    • Lombard, E.1
  • 23
    • 56749169816 scopus 로고    scopus 로고
    • Speech production modifications produced by competing talkers, babble and stationary noise
    • Y. Lu, and M. Cooke Speech production modifications produced by competing talkers, babble and stationary noise J. Acoust. Soc. Amer. 124 2008 3261 3275
    • (2008) J. Acoust. Soc. Amer. , vol.124 , pp. 3261-3275
    • Lu, Y.1    Cooke, M.2
  • 24
    • 0031356549 scopus 로고    scopus 로고
    • LSP-based speech modification for intelligibility enhancement
    • Santorini, Greece
    • McLoughlin, I.V., Chance, R.J., 1997. LSP-based speech modification for intelligibility enhancement. In: Proc. Digital Signal Processing, Santorini, Greece, pp. 591-594.
    • (1997) Proc. Digital Signal Processing , pp. 591-594
    • McLoughlin, I.V.1    Chance, R.J.2
  • 25
    • 84875226067 scopus 로고    scopus 로고
    • Reactive speech synthesis: Actively managing phonetic contrast along an H&H continuum
    • Moore, R.K., Nicolao, M., 2011. Reactive speech synthesis: Actively managing phonetic contrast along an H&H continuum. In: 17th Internat. Cong. on Phonetic Sciences, pp. 1422-1425.
    • (2011) 17th Internat. Cong. on Phonetic Sciences , pp. 1422-1425
    • Moore, R.K.1    Nicolao, M.2
  • 26
    • 0016990909 scopus 로고
    • The enhancement of speech intelligibility in high noise levels by high-pass filtering followed by rapid amplitude compression
    • R.J. Niederjohn, and J.H. Grotelueschen The enhancement of speech intelligibility in high noise levels by high-pass filtering followed by rapid amplitude compression IEEE Trans. Acoust. Speech Signal Process. 24 1976 277 282
    • (1976) IEEE Trans. Acoust. Speech Signal Process. , vol.24 , pp. 277-282
    • Niederjohn, R.J.1    Grotelueschen, J.H.2
  • 27
    • 38849146557 scopus 로고    scopus 로고
    • The influence of linguistic content on the Lombard effect
    • R. Patel, and K.W. Schell The influence of linguistic content on the Lombard effect J. Speech Lang. Hear. Res. 51 2008 209 220
    • (2008) J. Speech Lang. Hear. Res. , vol.51 , pp. 209-220
    • Patel, R.1    Schell, K.W.2
  • 28
    • 0022003919 scopus 로고
    • Speaking clearly for the hard of hearing I: Intelligibility differences between clear and conversational speech
    • M. Picheny, N. Durlach, and L. Braida Speaking clearly for the hard of hearing I: Intelligibility differences between clear and conversational speech J. Speech Hear. Res. 28 1985 96 103
    • (1985) J. Speech Hear. Res. , vol.28 , pp. 96-103
    • Picheny, M.1    Durlach, N.2    Braida, L.3
  • 30
    • 84865727922 scopus 로고    scopus 로고
    • Analysis of HMM-based Lombard speech synthesis
    • Florence, Italy
    • Raitio, T., Suni, A., Vainio, M., Alku, P., 2011. Analysis of HMM-based Lombard speech synthesis. In: Proc. Interspeech, Florence, Italy, pp. 2781-2784.
    • (2011) Proc. Interspeech , pp. 2781-2784
    • Raitio, T.1    Suni, A.2    Vainio, M.3    Alku, P.4
  • 31
    • 84867606532 scopus 로고    scopus 로고
    • On measuring the intelligibility of synthetic speech in noise do we need a realistic noise environment?
    • Raitio, T., Takanen, M., Santala, O., Sun, A., Vainio, M., Alku, P., 2012. On measuring the intelligibility of synthetic speech in noise do we need a realistic noise environment? In: Proc. ICASSP, pp. 4015-4028.
    • (2012) Proc. ICASSP , pp. 4015-4028
    • Raitio, T.1    Takanen, M.2    Santala, O.3    Sun, A.4    Vainio, M.5    Alku, P.6
  • 32
    • 0034847662 scopus 로고    scopus 로고
    • Perceptual evaluation of speech quality (PESQ)-A new method for speech quality assessment of telephone networks and codecs
    • Rix, A., Beerends, J., Hollier, M., Hekstra, A., 2001. Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs. In: Proc. ICASSP, pp. 749-752.
    • (2001) Proc. ICASSP , pp. 749-752
    • Rix, A.1    Beerends, J.2    Hollier, M.3    Hekstra, A.4
  • 34
    • 33947613677 scopus 로고    scopus 로고
    • Near end listening enhancement: Speech intelligibility improvement in noisy environments
    • Toulouse, France
    • Sauert, B., Vary, P., 2006. Near end listening enhancement: Speech intelligibility improvement in noisy environments. In: Proc. ICASSP, Toulouse, France, pp. 493-496.
    • (2006) Proc. ICASSP , pp. 493-496
    • Sauert, B.1    Vary, P.2
  • 35
    • 84869816755 scopus 로고    scopus 로고
    • Recursive closed-form optimization of spectral audio power allocation for near end listening enhancement
    • Bochum, Germany
    • Sauert, B., Vary, P., 2010. Recursive closed-form optimization of spectral audio power allocation for near end listening enhancement. In: Proc. ITG-Fachtagung Sprachkommunikation, Bochum, Germany.
    • (2010) Proc. ITG-Fachtagung Sprachkommunikation
    • Sauert, B.1    Vary, P.2
  • 36
    • 84875224029 scopus 로고    scopus 로고
    • Near end listening enhancement considering thermal limit of mobile phone loudspeakers
    • Aachen, Germany
    • Sauert, B., Vary, P., 2011. Near end listening enhancement considering thermal limit of mobile phone loudspeakers. In: Proc. Conf. on Elektronische Sprachsignalverarbeitung (ESSV), Aachen, Germany, pp. 333-340.
    • (2011) Proc. Conf. on Elektronische Sprachsignalverarbeitung (ESSV) , pp. 333-340
    • Sauert, B.1    Vary, P.2
  • 37
    • 33645998440 scopus 로고    scopus 로고
    • Applied principles of clear and Lombard speech for automated intelligibility enhancement in noisy environments
    • M.D. Skowronski, and J.G. Harris Applied principles of clear and Lombard speech for automated intelligibility enhancement in noisy environments Speech Comm. 48 2006 549 558
    • (2006) Speech Comm. , vol.48 , pp. 549-558
    • Skowronski, M.D.1    Harris, J.G.2
  • 38
    • 84875221710 scopus 로고    scopus 로고
    • SoX Available [Apr. 2012] from
    • SoX, 2012. SoX-Sound eXchange. Software. Available [Apr. 2012] from .
    • (2012) SoX-Sound EXchange. Software
  • 41
    • 84867593799 scopus 로고    scopus 로고
    • A speech preprocessing strategy for intelligibility improvement in noise based on a perceptual distortion measure
    • Taal, C.H., Hendriks, R.C., Heusdens, R., 2012. A speech preprocessing strategy for intelligibility improvement in noise based on a perceptual distortion measure. In: Proc. ICASSP, pp. 4061-4064.
    • (2012) Proc. ICASSP , pp. 4061-4064
    • Taal, C.H.1    Hendriks, R.C.2    Heusdens, R.3
  • 43
    • 79959812739 scopus 로고    scopus 로고
    • Energy reallocation strategies for speech enhancement in known noise conditions
    • Tang, Y., Cooke, M., 2010. Energy reallocation strategies for speech enhancement in known noise conditions. In: Proc. Interspeech, pp. 1636-1639.
    • (2010) Proc. Interspeech , pp. 1636-1639
    • Tang, Y.1    Cooke, M.2
  • 44
    • 84865783312 scopus 로고    scopus 로고
    • Subjective and objective evaluation of speech intelligibility enhancement under constant energy and duration constraints
    • Florence, Italy
    • Tang, Y., Cooke, M., 2011. Subjective and objective evaluation of speech intelligibility enhancement under constant energy and duration constraints. In: Proc. Interspeech, Florence, Italy, pp. 345-348.
    • (2011) Proc. Interspeech , pp. 345-348
    • Tang, Y.1    Cooke, M.2
  • 45
    • 84878411602 scopus 로고    scopus 로고
    • Optimised spectral weightings for noise-dependent speech intelligibility enhancement
    • Portland, USA
    • Tang, Y., Cooke, M., 2012. Optimised spectral weightings for noise-dependent speech intelligibility enhancement. In: Proc. Interspeech, Portland, USA.
    • (2012) Proc. Interspeech
    • Tang, Y.1    Cooke, M.2
  • 46
    • 38549096029 scopus 로고    scopus 로고
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda, and K. Tokuda A speech parameter generation algorithm considering global variance for HMM-based speech synthesis IEICE Trans. Inf. Syst. E90-D 2007 816 824
    • (2007) IEICE Trans. Inf. Syst. , vol.E90-D , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 47
    • 33845948493 scopus 로고    scopus 로고
    • Do you speak E-NG-L-I-SH? a comparison of foreigner- and infant-directed speech
    • M. Uther, M.A. Knoll, and D. Burnham Do you speak E-NG-L-I-SH? a comparison of foreigner- and infant-directed speech Speech Comm. 49 2007 2 7
    • (2007) Speech Comm. , vol.49 , pp. 2-7
    • Uther, M.1    Knoll, M.A.2    Burnham, D.3
  • 48
    • 84867624102 scopus 로고    scopus 로고
    • Cepstral analysis based on the Glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise
    • Kyoto, Japan
    • Valentini-Botinhao, C., Maia, R., Yamagishi, J., King, S., Zen, H., 2012. Cepstral analysis based on the Glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise. In: Proc. ICASSP Kyoto, Japan, pp. 3997-4000.
    • (2012) Proc. ICASSP , pp. 3997-4000
    • Valentini-Botinhao, C.1    Maia, R.2    Yamagishi, J.3    King, S.4    Zen, H.5
  • 49
    • 84878385645 scopus 로고    scopus 로고
    • Mel cepstral coefficient modification based on the glimpse proportion measure for improving the intelligibility of HMM-generated synthetic speech in noise
    • Portland, USA
    • Valentini-Botinhao, C., Yamagishi, J., King, S., 2012. Mel cepstral coefficient modification based on the glimpse proportion measure for improving the intelligibility of HMM-generated synthetic speech in noise. In: Proc. Interspeech, Portland, USA.
    • (2012) Proc. Interspeech
    • Valentini-Botinhao, C.1    Yamagishi, J.2    King, S.3
  • 50
    • 67650854725 scopus 로고    scopus 로고
    • Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
    • J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm IEEE Trans. Audio Speech Lang. Process. 17 2009 66 83
    • (2009) IEEE Trans. Audio Speech Lang. Process. , vol.17 , pp. 66-83
    • Yamagishi, J.1    Kobayashi, T.2    Nakano, Y.3    Ogata, K.4    Isogai, J.5
  • 52
    • 67650819492 scopus 로고    scopus 로고
    • Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge
    • Yamagishi, J., Zen, H., Wu, Y.J., Toda, T., Tokuda, K., 2008. Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge. In: Proc. Blizzard Challenge Workshop.
    • (2008) Proc. Blizzard Challenge Workshop
    • Yamagishi, J.1    Zen, H.2    Wu, Y.J.3    Toda, T.4    Tokuda, K.5
  • 54
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A.W. Black Statistical parametric speech synthesis Speech Comm. 51 2009 1039 1064
    • (2009) Speech Comm. , vol.51 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 55
    • 84878419232 scopus 로고    scopus 로고
    • Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression
    • Portland, USA
    • Zorila, T.C., Kandia, V., Stylianou, Y., 2012. Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression. In: Proc. Interspeech, Portland, USA.
    • (2012) Proc. Interspeech
    • Zorila, T.C.1    Kandia, V.2    Stylianou, Y.3
  • 56
    • 84953656445 scopus 로고
    • Subdivison of audible frequency range into critical bands
    • E. Zwicker Subdivison of audible frequency range into critical bands J. Acoust. Soc. Amer. 33 1961 248
    • (1961) J. Acoust. Soc. Amer. , vol.33 , pp. 248
    • Zwicker, E.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.