SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 18, Issue 8, 2010, Pages 2080-2090

Improving speech intelligibility in noise using environment-optimized algorithms

(2) Kim, Gibak a Loizou, Philipos C a

a The University of Texas at Dallas (United States)

Author keywords

Environment optimized algorithms; Speech enhancement; Speech intelligibility

Indexed keywords

ACOUSTIC ENVIRONMENT; BAYESIAN CLASSIFIER; BINARY DECISION; INCREMENTAL APPROACH; INPUT SIGNAL; MODEL PARAMETERS; OPTIMIZED ALGORITHMS; SPEECH ENHANCEMENT ALGORITHM; SPEECH QUALITY; TARGET SIGNALS; TIME FREQUENCY;

ALGORITHMS; BAYESIAN NETWORKS; CLASSIFIERS; OPTIMIZATION; SPEECH ENHANCEMENT;

SPEECH INTELLIGIBILITY;

EID: 77956547397 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2010.2041116 Document Type: Article

Times cited : (52)

References (36)

1
- 34447100796
- Boca Raton: CRC
- P.C. Loizou, Speech Enhancement: Theory and Practice. Boca Raton: CRC, 2007.
- (2007) Speech Enhancement: Theory and Practice
- Loizou, P.C.¹

2
- 35248891610
- A comparative intelligibility study of single-microphone noise reduction algorithms
- DOI 10.1121/1.2766778
- Y. Hu and P.C. Loizou, "A comparative intelligibility study of singlemicrophone noise reduction algorithms," J. Acoust. Soc. Amer., vol. 122, pp. 1777-1786, 2007. (Pubitemid 47560539)
- (2007) Journal of the Acoustical Society of America , vol.122 , Issue.3 , pp. 1777-1786
- Hu, Y.¹ Loizou, P.C.²

3
- 0018027039
- Evaluation of a correlation subtraction method for enhancing speech degraded by additive white noise
- Oct.
- J.S. Lim, "Evaluation of a correlation subtraction method for enhancing speech degraded by additive white noise," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-26, no. 5, pp. 471-472, Oct. 1978.
- (1978) IEEE Trans. Acoust. Speech Signal Process. , vol.26 ASSP , Issue.5 , pp. 471-472
- Lim, J.S.¹

4
- 35848945907
- The design and evaluation of a hearing aid with trainable amplification parameters
- J.A. Zakis, H. Dillon, and H.J. McDermott, "The design and evaluation of a hearing aid with trainable amplification parameters," Ear Hear., vol. 28, no. 6, pp. 812-830, 2007.
- (2007) Ear Hear. , vol.28 , Issue.6 , pp. 812-830
- Zakis, J.A.¹ Dillon, H.² McDermott, H.J.³

5
- 0021158675
- Optimal estimators for spectral restoration of noisy speech
- J.E. Porter and S.F. Boll, "Optimal estimators for spectral restoration of noisy speech," in Proc. Int. Conf. Acoust. Speech Signal Process., 1984, pp. 18A.2.1-18A.2.4.
- (1984) Proc. Int. Conf. Acoust. Speech Signal Process.
- Porter, J.E.¹ Boll, S.F.²

6
- 33744970011
- Codebook driven short-term predictor parameter estimation for speech enhancement
- Jan.
- S. Srinivasan, J. Samuelsson, and W.B. Kleijn, "Codebook driven short-term predictor parameter estimation for speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 163-176, Jan. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.1 , pp. 163-176
- Srinivasan, S.¹ Samuelsson, J.² Kleijn, W.B.³

7
- 84862603071
- A general optimization procedure for spectral speech enhancement methods
- Florence, Italy Sep.
- J. Erkelens, J. Jensen, and R. Heusdens, "A general optimization procedure for spectral speech enhancement methods," in Proc. Eur. Signal Proc. Conf., Florence, Italy, Sep. 2006.
- (2006) Proc. Eur. Signal Proc. Conf.
- Erkelens, J.¹ Jensen, J.² Heusdens, R.³

8
- 34447099536
- A data-driven approach to optimizing spectral speech enhancement methods for various error criteria
- J. Erkelens, J. Jensen, and R. Heusdens, "A data-driven approach to optimizing spectral speech enhancement methods for various error criteria," Speech Commun., vol. 49, pp. 530-541, 2007.
- (2007) Speech Commun. , vol.49 , pp. 530-541
- Erkelens, J.¹ Jensen, J.² Heusdens, R.³

9
- 44949225388
- Data-driven speech enhancement
- Kiel, Germany
- T. Fingscheidt and S. Suhadi, "Data-driven speech enhancement," in Proc. ITG-Fachtagung Sprachkommunikation, Kiel, Germany, 2006.
- (2006) Proc. ITG-Fachtagung Sprachkommunikation
- Fingscheidt, T.¹ Suhadi, S.²

10
- 64849116094
- Environment-optimized speech enhancement
- May
- T. Fingscheidt, S. Suhadi, and S. Stan, "Environment-optimized speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 4, pp. 825-834, May 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.4 , pp. 825-834
- Fingscheidt, T.¹ Suhadi, S.² Stan, S.³

11
- 0021645331
- Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
- Dec.
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
- (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.32 ASSP , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

12
- 22944438092
- Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model
- T. Lotter and P. Vary, "Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model," EURASIP J. Appl. Signal Process., vol. 7, pp. 1110-1126, 2005.
- (2005) EURASIP J. Appl. Signal Process. , vol.7 , pp. 1110-1126
- Lotter, T.¹ Vary, P.²

13
- 33846907750
- A Laplacian-based MMSE estimator for speech enhancement
- C. Bin and P.C. Loizou, "A Laplacian-based MMSE estimator for speech enhancement," Speech Commun., pp. 134-143, 2007.
- (2007) Speech Commun. , pp. 134-143
- Bin, C.¹ Loizou, P.C.²

14
- 27644515429
- Speech enhancement based on perceptually motivated Bayesian estimators of the magnitude spectrum
- Sep.
- P.C. Loizou, "Speech enhancement based on perceptually motivated Bayesian estimators of the magnitude spectrum," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 857-869, Sep. 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 857-869
- Loizou, P.C.¹

15
- 33845354768
- Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
- D. Brungart, P. Chang, B. Simpson, and D. Wang, "Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation," J. Acoust. Soc. Amer., vol. 120, pp. 4007-4018, 2006.
- (2006) J. Acoust. Soc. Amer. , vol.120 , pp. 4007-4018
- Brungart, D.¹ Chang, P.² Simpson, B.³ Wang, D.⁴

16
- 40749125179
- Factors influencing intelligibility of ideal binary- masked speech: Implications for noise reduction
- N. Li and P.C. Loizou, "Factors influencing intelligibility of ideal binary- masked speech: Implications for noise reduction," J. Acoust. Soc. Amer., vol. 123, no. 3, pp. 1673-1682, 2008.
- (2008) J. Acoust. Soc. Amer. , vol.123 , Issue.3 , pp. 1673-1682
- Li, N.¹ Loizou, P.C.²

17
- 41849093721
- Effect of spectral resolution on the intelligibility of ideal binary masked speech
- N. Li and P.C. Loizou, "Effect of spectral resolution on the intelligibility of ideal binary masked speech," J. Acoust. Soc. Amer., vol. 123, no. 4, pp. 59-64, 2008.
- (2008) J. Acoust. Soc. Amer. , vol.123 , Issue.4 , pp. 59-64
- Li, N.¹ Loizou, P.C.²

18
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, pp. 267-285, 2001.
- (2001) Speech Commun. , vol.34 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

19
- 70349093614
- An algorithm that improves speech intelligibility in noise for normal-hearing listeners
- G. Kim, Y. Lu, Y. Hu, and P.C. Loizou, "An algorithm that improves speech intelligibility in noise for normal-hearing listeners," J. Acoust. Soc. Amer., vol. 126, no. 3, pp. 1486-1494, 2009.
- (2009) J. Acoust. Soc. Amer. , vol.126 , Issue.3 , pp. 1486-1494
- Kim, G.¹ Lu, Y.² Hu, Y.³ Loizou, P.C.⁴

20
- 0028297185
- Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction
- B. Kollmeier and R. Koch, "Speech enhancement based on physiological and psychoacoustical models of modulation perception and binaural interaction," J. Acoust. Soc. Amer., vol. 95, no. 3, pp. 1593-1602, 1994.
- (1994) J. Acoust. Soc. Amer. , vol.95 , Issue.3 , pp. 1593-1602
- Kollmeier, B.¹ Koch, R.²

21
- 4644317224
- A Bayesian classifier for spectrographhic mask estimation for missing feature speech recognition
- M. Seltzer, B. Raj, and R. Stern, "A Bayesian classifier for spectrographhic mask estimation for missing feature speech recognition," Speech Commun., vol. 43, pp. 379-393, 2004.
- (2004) Speech Commun. , vol.43 , pp. 379-393
- Seltzer, M.¹ Raj, B.² Stern, R.³

22
- 0024241221
- Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms
- G. Langner and C. Schreiner, "Periodicity coding in the inferior colliculus of the cat. I: Neuronal mechanisms," J. Neurophysiol., vol. 60, no. 6, pp. 1799-1822, 1988. (Pubitemid 19017451)
- (1988) Journal of Neurophysiology , vol.60 , Issue.6 , pp. 1799-1822
- Langner, G.¹ Schreiner, C.E.²

23
- 0038712550
- SNR estimation based on amplitude modulation analysis with applications to noise suppression
- May
- J. Tchorz and B. Kollmeier, "SNR estimation based on amplitude modulation analysis with applications to noise suppression," IEEE Trans. Speech Audio Process., vol. 11, no. 3, pp. 184-192, May 2003.
- (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.3 , pp. 184-192
- Tchorz, J.¹ Kollmeier, B.²

24
- 82255178542
- Hoboken, NJ: Wiley and IEEE Press
- D.L.Wang and G.J. Brown, Computational Auditory Scene Analysis: Principles, Algorithms, and Applications. Hoboken, NJ: Wiley and IEEE Press, 2006.
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications
- Wang, D.L.¹ Brown, G.J.²

25
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr.
- J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

26
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C.J. Leggetter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
- (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

27
- 0030105005
- On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition
- Q. Huo, C. Chan, and C.-H. Lee, "On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 2, pp. 141-144, 1996.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.2 , pp. 141-144
- Huo, Q.¹ Chan, C.² Lee, C.-H.³

28
- 0031103160
- On-line adaptive learning of the continuous density hidden markov model based on approximate recursive Bayes estimate
- Mar.
- Q. Huo and C.-H. Lee, "On-line adaptive learning of the continuous density hidden markov model based on approximate recursive Bayes estimate," IEEE Trans. Speech Audio Process., vol. 5, no. 2, pp. 161-172, Mar. 1997.
- (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.2 , pp. 161-172
- Huo, Q.¹ Lee, C.-H.²

29
- 0014568991
- IEEE recommended practice for speech quality measurements
- Sep.
- "IEEE recommended practice for speech quality measurements," IEEE Trans. Audio Electroacoust., vol. 19, no. 3, pp. 225-246, Sep. 1969.
- (1969) IEEE Trans. Audio Electroacoust. , vol.19 , Issue.3 , pp. 225-246

30
- 0027623210
- Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
- A. Varga and H.J.M. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Commun., vol. 12, pp. 247-251, 1993.
- (1993) Speech Commun. , vol.12 , pp. 247-251
- Varga, A.¹ Steeneken, H.J.M.²

31
- 0003454484
- 3rd ed. Boston, MA: PWS-Kent
- L. Ott, An Introduction to Statistical Methods and Data Analysis, 3rd ed. Boston, MA: PWS-Kent, 1988.
- (1988) An Introduction to Statistical Methods and Data Analysis
- Ott, L.¹

32
- 4644265990
- Monaural speech segregation based on pitch tracking and amplitude modulation
- Sep.
- G. Hu and D.L. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
- (2004) IEEE Trans. Neural Netw. , vol.15 , Issue.5 , pp. 1135-1150
- Hu, G.¹ Wang, D.L.²

33
- 49249107353
- Segregation of unvoiced speech from nonspeech interference
- G. Hu and D.L. Wang, "Segregation of unvoiced speech from nonspeech interference," J. Acoust. Soc. Amer., vol. 124, pp. 1306-1319, 2008.
- (2008) J. Acoust. Soc. Amer. , vol.124 , pp. 1306-1319
- Hu, G.¹ Wang, D.L.²

34
- 77956534281
- Techniques for estimating the ideal binary mask
- Sep.
- Y. Hu and P.C. Loizou, "Techniques for estimating the ideal binary mask," in Proc. 11th Int. Workshop Acoust. Echo Noise Control, Sep. 2008.
- (2008) Proc. 11th Int. Workshop Acoust. Echo Noise Control
- Hu, Y.¹ Loizou, P.C.²

35
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Aug.
- S.B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-28, no. 4, pp. 357-336, Aug. 1980.
- (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 ASSP , Issue.4 , pp. 357-336
- Davis, S.B.¹ Mermelstein, P.²

36
- 14044252930
- Speech recognition with amplitude and frequency modulations
- DOI 10.1073/pnas.0406460102
- F.-G. Zeng, K. Nie, G.S. Stickney, Y.-Y.Kong, M.Vongphoe, A. Bhargave, C.Wei, and K. Cao, "Speech recognition with amplitude and frequency modulations," Proc. Nat. Acad. Sci. USA, vol. 102, no. 7, pp. 2293-2298, 2005. (Pubitemid 40279369)
- (2005) Proceedings of the National Academy of Sciences of the United States of America , vol.102 , Issue.7 , pp. 2293-2298
- Zeng, F.-G.¹ Nie, K.² Stickney, G.S.³ Kong, Y.-Y.⁴ Vongphoe, M.⁵ Bhargave, A.⁶ Wei, C.⁷ Cao, K.⁸

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.