Volume 132, Issue 6, 2012, Pages 3990-4001

Signal-to-noise ratio adaptive post-filtering method for intelligibility enhancement of telephone speech

Author keywords

[No Author keywords available]

Indexed keywords

AMBIENT NOISE; BACKGROUND NOISE; FLAT FREQUENCY RESPONSE; HIGH-PASS; HIGHER FREQUENCIES; NOISE CONDITIONS; POST-FILTERING; POST-FILTERING ALGORITHM; POSTFILTERS; QUALITY OF SPEECH; SUBJECTIVE LISTENING TEST; TELEPHONE SPEECH; TRANSFER ENERGY

EID: 84870926927     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.4765074     Document Type: Article
Times cited : (15)

References (30)
  • 1
    • 84981309017
    • 3GPP, TS 26.090, 3rd Generation Partnership Project, Valbonne, France, version 8.0.0
    • 3GPP, TS 26.090 (2008). Adaptive multi-rate (AMR) speech codec: Transcoding functions, 3rd Generation Partnership Project, Valbonne, France, version 8.0.0.
    • (2008) Adaptive Multi-rate (AMR) Speech Codec: Transcoding Functions
  • 3
    • 34547643558
    • Companding to improve cochlear-implant speech recognition in speech-shaped noise
    • 10.1121/1.2749710
    • Bhattacharya, A., and Zeng, F.-G. (2007). Companding to improve cochlear-implant speech recognition in speech-shaped noise, J. Acoust. Soc. Am. 122, 1079-1089. 10.1121/1.2749710
    • (2007) J. Acoust. Soc. Am. , vol.122 , pp. 1079-1089
    • Bhattacharya, A.1    Zeng, F.-G.2
  • 4
    • 0029219433
    • Adaptive postfiltering for quality enhancement of coded speech
    • 10.1109/89.365380
    • Chen, J.-H., and Gersho, A. (1995). Adaptive postfiltering for quality enhancement of coded speech, IEEE Trans. Speech Audio Process. 3, 59-71. 10.1109/89.365380
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , pp. 59-71
    • Chen, J.-H.1    Gersho, A.2
  • 5
    • 0021892216
    • Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
    • 10.1109/TASSP.1985.1164550
    • Ephraim, Y., and Malah, D. (1985). Speech enhancement using a minimum mean-square error log-spectral amplitude estimator, IEEE Trans. Acoust., Speech, Signal Process. 33, 443-445. 10.1109/TASSP.1985.1164550
    • (1985) IEEE Trans. Acoust., Speech, Signal Process. , vol.33 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 7
    • 0035510532
    • Spectral subtraction using reduced delay convolution and adaptive averaging
    • 10.1109/89.966083
    • Gustafsson, H., Nordholm, S. E., and Claesson, I. (2001). Spectral subtraction using reduced delay convolution and adaptive averaging, IEEE Trans. Speech Audio Process. 9, 799-807. 10.1109/89.966083
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 799-807
    • Gustafsson, H.1    Nordholm, S.E.2    Claesson, I.3
  • 8
    • 75949113698
    • Intelligibility and listener preference of telephone speech in the presence of babble noise
    • 10.1121/1.3263603
    • Hall, J. L., and Flanagan, J. L. (2010). Intelligibility and listener preference of telephone speech in the presence of babble noise, J. Acoust. Soc. Am. 127, 280-285. 10.1121/1.3263603
    • (2010) J. Acoust. Soc. Am. , vol.127 , pp. 280-285
    • Hall, J.L.1    Flanagan, J.L.2
  • 9
    • 0032090027
    • The effect of cue-enhancement on the intelligibility of nonsense word and sentence materials presented in noise
    • 10.1016/S0167-6393(98)00011-9
    • Hazan, V., and Simpson, A. (1998). The effect of cue-enhancement on the intelligibility of nonsense word and sentence materials presented in noise, Speech Commun. 24, 211-226. 10.1016/S0167-6393(98)00011-9
    • (1998) Speech Commun. , vol.24 , pp. 211-226
    • Hazan, V.1    Simpson, A.2
  • 10
    • 0041591273
    • A generalized subspace approach for enhancing speech corrupted by colored noise
    • 10.1109/TSA.2003.814458
    • Hu, Y., and Loizou, P. C. (2003). A generalized subspace approach for enhancing speech corrupted by colored noise, IEEE Trans. Speech Audio Process. 11, 334-341. 10.1109/TSA.2003.814458
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , pp. 334-341
    • Hu, Y.1    Loizou, P.C.2
  • 11
    • 0742307391
    • Speech enhancement based on wavelet thresholding the multitaper spectrum
    • 10.1109/TSA.2003.819949
    • Hu, Y., and Loizou, P. C. (2004). Speech enhancement based on wavelet thresholding the multitaper spectrum, IEEE Trans. Speech Audio Process. 12, 59-67. 10.1109/TSA.2003.819949
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , pp. 59-67
    • Hu, Y.1    Loizou, P.C.2
  • 12
    • 35248891610
    • A comparative intelligibility study of single-microphone noise reduction algorithms
    • 10.1121/1.2766778
    • Hu, Y., and Loizou, P. C. (2007). A comparative intelligibility study of single-microphone noise reduction algorithms, J. Acoust. Soc. Am. 122, 1777-1786. 10.1121/1.2766778
    • (2007) J. Acoust. Soc. Am. , vol.122 , pp. 1777-1786
    • Hu, Y.1    Loizou, P.C.2
  • 15
    • 84870884465
    • ITU-T Users Group on Software Tools, International Telecommunications Union, Geneva, Switzerland
    • ITU-T Users Group on Software Tools (2005). ITU-T Software Tool Library 2005 Users Manual, International Telecommunications Union, Geneva, Switzerland.
    • (2005) ITU-T Software Tool Library 2005 Users Manual
  • 16
    • 80052532239
    • Gain-induced speech distortions and the absence of intelligibility benefit with existing noise-reduction algorithms
    • 10.1121/1.3619790
    • Kim, G., and Loizou, P. C. (2011). Gain-induced speech distortions and the absence of intelligibility benefit with existing noise-reduction algorithms, J. Acoust. Soc. Am. 130, 1581-1596. 10.1121/1.3619790
    • (2011) J. Acoust. Soc. Am. , vol.130 , pp. 1581-1596
    • Kim, G.1    Loizou, P.C.2
  • 18
    • 77957725494
    • Reasons why current speech-enhancement algorithms do not improve speech intelligibility and suggested solutions
    • 10.1109/TASL.2010.2045180
    • Loizou, P. C., and Kim, G. (2011). Reasons why current speech-enhancement algorithms do not improve speech intelligibility and suggested solutions, IEEE Trans. Audio, Speech, Lang. Process. 19, 47-56. 10.1109/TASL.2010.2045180
    • (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , pp. 47-56
    • Loizou, P.C.1    Kim, G.2
  • 19
    • 0036722860
    • Telephone speech quality prediction: Towards network planning and monitoring models for modern network scenarios
    • 10.1016/S0167-6393(01)00043-7
    • Möller, S., and Raake, A. (2002). Telephone speech quality prediction: Towards network planning and monitoring models for modern network scenarios, Speech Commun. 38, 47-75. 10.1016/S0167-6393(01)00043-7
    • (2002) Speech Commun. , vol.38 , pp. 47-75
    • Möller, S.1    Raake, A.2
  • 20
    • 0016990909
    • The enhancement of speech intelligibility in high noise levels by high-pass filtering followed by rapid amplitude compression
    • 10.1109/TASSP.1976.1162824
    • Niederjohn, R. J., and Grotelueschen, J. H. (1976). The enhancement of speech intelligibility in high noise levels by high-pass filtering followed by rapid amplitude compression, IEEE Trans. Acoust., Speech, Signal Process. 24, 277-282. 10.1109/TASSP.1976.1162824
    • (1976) IEEE Trans. Acoust., Speech, Signal Process , vol.24 , pp. 277-282
    • Niederjohn, R.J.1    Grotelueschen, J.H.2
  • 21
    • 34147095906
    • Evaluation of companding-based spectral enhancement using simulated cochlear-implant processing
    • 10.1121/1.2434757
    • Oxenham, A. J., Simonson, A. M., Turicchia, L., and Sarpeshkar, R. (2007). Evaluation of companding-based spectral enhancement using simulated cochlear-implant processing, J. Acoust. Soc. Am. 121, 1709-1716. 10.1121/1.2434757
    • (2007) J. Acoust. Soc. Am. , vol.121 , pp. 1709-1716
    • Oxenham, A.J.1    Simonson, A.M.2    Turicchia, L.3    Sarpeshkar, R.4
  • 23
    • 33645998440
    • Applied principles of clear and Lombard speech for automated intelligibility enhancement in noisy environments
    • 10.1016/j.specom.2005.09.003
    • Skowronski, M. D., and Harris, J. G. (2006). Applied principles of clear and Lombard speech for automated intelligibility enhancement in noisy environments, Speech Commun. 48, 549-558. 10.1016/j.specom.2005.09.003
    • (2006) Speech Commun. , vol.48 , pp. 549-558
    • Skowronski, M.D.1    Harris, J.G.2
  • 24
    • 0023786666
    • Effects of noise on speech production: Acoustic and perceptual analyses
    • 10.1121/1.396660
    • Summers, W. V., Pisoni, D. B., Bernacki, R. H., Pedlow, R. I., and Stokes, M. A. (1988). Effects of noise on speech production: Acoustic and perceptual analyses, J. Acoust. Soc. Am. 84, 917-928. 10.1121/1.396660
    • (1988) J. Acoust. Soc. Am. , vol.84 , pp. 917-928
    • Summers, W.V.1    Pisoni, D.B.2    Bernacki, R.H.3    Pedlow, R.I.4    Stokes, M.A.5
  • 25
    • 79959812739
    • Energy reallocation strategies for speech enhancement in known noise conditions
    • Tang, Y., and Cooke, M. (2010). Energy reallocation strategies for speech enhancement in known noise conditions, in Proceedings of Interspeech, pp. 1636-1639.
    • (2010) Proceedings of Interspeech , pp. 1636-1639
    • Tang, Y.1    Cooke, M.2
  • 26
    • 84865783312
    • Subjective and objective evaluation of speech intelligibility enhancement under constant energy and duration constraints
    • Tang, Y., and Cooke, M. (2011). Subjective and objective evaluation of speech intelligibility enhancement under constant energy and duration constraints, in Proceedings of Interspeech, pp. 345-348.
    • (2011) Proceedings of Interspeech , pp. 345-348
    • Tang, Y.1    Cooke, M.2
  • 27
    • 24944526637
    • Developing a speech intelligibility test based on measuring speech reception thresholds in noise for English and Finnish
    • 10.1121/1.1993129
    • Vainio, M., Suni, A., Järveläinen, H., Järvikivi, J., and Mattila, V.-V. (2005). Developing a speech intelligibility test based on measuring speech reception thresholds in noise for English and Finnish, J. Acoust. Soc. Am. 118, 1742-1750. 10.1121/1.1993129
    • (2005) J. Acoust. Soc. Am. , vol.118 , pp. 1742-1750
    • Vainio, M.1    Suni, A.2    Järveläinen, H.3    Järvikivi, J.4    Mattila, V.-V.5
  • 28
    • 0027623210
    • Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • 10.1016/0167-6393(93)90095-3
    • Varga, A., and Steeneken, H. J. (1993). Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems, Speech Commun. 12, 247-251. 10.1016/0167-6393(93)90095-3
    • (1993) Speech Commun. , vol.12 , pp. 247-251
    • Varga, A.1    Steeneken, H.J.2
  • 29
    • 78649530915
    • Quality dimensions of narrowband and wideband speech transmission
    • 10.3813/AAA.918370
    • Wältermann, M., Raake, A., and Möller, S. (2010). Quality dimensions of narrowband and wideband speech transmission, Acta Acust. Acust. 96, 1090-1103. 10.3813/AAA.918370
    • (2010) Acta Acust. Acust. , vol.96 , pp. 1090-1103
    • Wältermann, M.1    Raake, A.2    Möller, S.3
  • 30
    • 34547618996
    • Speech signal modification to increase intelligibility in noisy environments
    • 10.1121/1.2751257
    • Yoo, S. D., Boston, J. R., El-Jaroudi, A., and Li, C.-C. (2007). Speech signal modification to increase intelligibility in noisy environments, J. Acoust. Soc. Am. 122, 1138-1149. 10.1121/1.2751257
    • (2007) J. Acoust. Soc. Am. , vol.122 , pp. 1138-1149
    • Yoo, S.D.1    Boston, J.R.2    El-Jaroudi, A.3    Li, C.-C.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.