SCOPUS 정보 검색 플랫폼

IEEE Signal Processing Letters

Volumn 21, Issue 1, 2014, Pages 65-68

An experimental study on speech enhancement based on deep neural networks

(4) Xu, Yong a Du, Jun a Dai, Li Rong a Lee, Chin Hui b

a UNIVERSITY OF SCIENCE AND TECHNOLOGY OF CHINA (China)

b GEORGIA INSTITUTE OF TECHNOLOGY (United States)

Author keywords

Deep neural networks; noise reduction; regression model; speech enhancement

Indexed keywords

CONVENTIONAL TECHNIQUES; DEEP NEURAL NETWORKS; GENERALIZATION CAPABILITY; MINIMUM MEAN SQUARE ERRORS; MULTI-CONDITION TRAININGS; OBJECTIVE QUALITY MEASURES; REGRESSION MODEL; SPEECH ENHANCEMENT ALGORITHM;

ALGORITHMS; NEURAL NETWORKS; NOISE ABATEMENT; REGRESSION ANALYSIS;

SPEECH ENHANCEMENT;

EID: 84889257121 PISSN: 10709908 EISSN: None Source Type: Journal
DOI: 10.1109/LSP.2013.2291240 Document Type: Article

Times cited : (965)

References (22)

1
- 34447100796
- 2nd ed. ed. Boca Raton, FL, USA: CRC
- P. C. Loizou, Speech Enhancement: Theory and Practice, 2nd ed. ed. Boca Raton, FL, USA: CRC, 2013.
- (2013) Speech Enhancement: Theory and Practice
- Loizou, P.C.¹

2
- 85075926376
- Spectral enhancement methods
- J. Benesty M. M. Sondhi, and Y. Huang, Eds. Berlin, Germany: Springer
- I. Cohen and S. Gannot, "Spectral enhancement methods," in Springer Handbook of Speech Processing, J. Benesty, M. M. Sondhi, and Y. Huang, Eds. Berlin, Germany: Springer, 2008, pp. 873-901.
- (2008) Springer Handbook of Speech Processing , pp. 873-901
- Cohen, I.¹ Gannot, S.²

3
- 0021892216
- Speech enhancement using minimu mean square log spectral amplitude estimator
- Y. Ephraim and D. Malah, "Speech enhancement using minimu mean square log spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. 33, no. 2, pp. 443-445, 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Process. , vol.33 , Issue.2 , pp. 443-445
- Ephraim, Y.¹ Malah, D.²

4
- 39149087385
- Nonlinear speech enhancement: An overview
- Berlin, Germany: Springer
- A. Hussain, M. Chetouani, S. Squartini, A. Bastari, and F. Piazza, "Nonlinear speech enhancement: An overview," in Progress in Nonlinear Speech Processing. Berlin, Germany: Springer, 2007, pp. 217-248.
- (2007) Progress in Nonlinear Speech Processing , pp. 217-248
- Hussain, A.¹ Chetouani, M.² Squartini, S.³ Bastari, A.⁴ Piazza, F.⁵

5
- 0024876950
- An analysis of a noise reduction neural network
- S. I. Tamura, "An analysis of a noise reduction neural network," in Proc. ICASSP, 1989, pp. 2001-2004.
- (1989) Proc. ICASSP , pp. 2001-2004
- Tamura, S.I.¹

6
- 85079214161
- A family of MLP based nonlinear spectral estimators for noise reduction
- F. Xie and D. V. Compernolle, "A family of MLP based nonlinear spectral estimators for noise reduction," in Proc. ICASSP, 1994, pp. 53-56.
- (1994) Proc. ICASSP , pp. 53-56
- Xie, F.¹ Compernolle, D.V.²

7
- 0009766947
- Networks for speech enhancement
- S. Katagiri, Ed. Norwell, MA, USA: Artech House
- E. A. Wan and A. T. Nelson, "Networks for speech enhancement," in Handbook of Neural Networks for Speech Processing, S. Katagiri, Ed. Norwell, MA, USA: Artech House, 1998.
- (1998) Handbook of Neural Networks for Speech Processing
- Wan, E.A.¹ Nelson, A.T.²

8
- 69349090197
- Learning deep architectures for AI
- Y. Bengio, "Learning deep architectures for AI," Found. Trends Mach. Learn., vol. 2, no. 1, pp. 1-127, 2009.
- (2009) Found. Trends Mach. Learn. , vol.2 , Issue.1 , pp. 1-127
- Bengio, Y.¹

9
- 33746600649
- Reducing the dimensionality of data with neural networks
- G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
- (2006) Science , vol.313 , Issue.5786 , pp. 504-507
- Hinton, G.E.¹ Salakhutdinov, R.R.²

10
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- G. E. Hinton, L. Deng, D. Yu, and G. E. Dahl, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82-97, 2012.
- (2012) IEEE Signal Process. Mag. , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.E.¹ Deng, L.² Yu, D.³ Dahl, G.E.⁴

11
- 84889263385
- Denoising deep neural networks based voice activity detection
- X. L. Zhang and J. Wu, "Denoising deep neural networks based voice activity detection," in Proc. ICASSP, 2013, pp. 853-857.
- (2013) Proc. ICASSP , pp. 853-857
- Zhang, X.L.¹ Wu, J.²

12
- 84867202951
- A speech enhancement approach using piecewise linear approximation of an explicit model of environmental distortions
- J. Du and Q. Huo, "A speech enhancement approach using piecewise linear approximation of an explicit model of environmental distortions," in Proc. Interspeech, 2008, pp. 569-572.
- (2008) Proc. Interspeech , pp. 569-572
- Du, J.¹ Huo, Q.²

13
- 84875678689
- Towards scaling up classification-based speech separation
- Y. X.Wang and D. L. Wang, "Towards scaling up classification-based speech separation," IEEE Trans. Audio, Speech Lang. Process., vol. 21, no. 7, pp. 1381-1390, 2013.
- (2013) IEEE Trans. Audio, Speech Lang. Process. , vol.21 , Issue.7 , pp. 1381-1390
- Wang, Y.X.¹ Wang, D.L.²

14
- 84890493989
- Ideal ratio mask estimation using deep neural networks for robust speech recognition
- A. Narayanan and D. L.Wang, "Ideal ratio mask estimation using deep neural networks for robust speech recognition," in Proc. ICASSP, 2013, pp. 1520-6149.
- (2013) Proc. ICASSP , pp. 1520-6149
- Narayanan, A.¹ Wang, D.L.²

15
- 0035500783
- Speech enhancement for non-stationary noise environments
- I. Cohen and B. Berdugo, "Speech enhancement for non-stationary noise environments," Signal Process., vol. 81, no. 11, pp. 2403-2418, 2001.
- (2001) Signal Process. , vol.81 , Issue.11 , pp. 2403-2418
- Cohen, I.¹ Berdugo, B.²

16
- 0041360463
- Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
- I. Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging," IEEE Trans. Speech Audio Process., vol. 11, no. 5, pp. 466-475, 2003.
- (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.5 , pp. 466-475
- Cohen, I.¹

17
- 79959842828
- Binary coding of speech spectrograms using a deep auto-encoder
- L. Deng, M. L. Seltzer, and D. Yu et al., "Binary coding of speech spectrograms using a deep auto-encoder," in Proc. Interspeech, 2010, pp. 1692-1695.
- Proc. Interspeech , vol.2010 , pp. 1692-1695
- Deng, L.¹ Seltzer, M.L.² Yu, D.³

18
- 0038669544
- The AURORA experimental framework for the preformance evaluations of speech recognition systems under noisy conditions
- H. G. Hirsch and D. Pearce, "The AURORA experimental framework for the preformance evaluations of speech recognition systems under noisy conditions," in Proc. ISCA ITRW ASR, 2000, pp. 181-188.
- (2000) Proc. ISCA ITRW ASR , pp. 181-188
- Hirsch, H.G.¹ Pearce, D.²

19
- 0003419545
- J. S. Garofolo, Getting started with the DARPA TIMIT CD-ROM: An acoustic phonetic continuous speech databaseNIST Tech Report, 1988.
- (1988) Getting Started with the DARPA TIMIT CD-ROM: An Acoustic Phonetic Continuous Speech DatabaseNIST Tech Report
- Garofolo, J.S.¹

20
- 0003639435
- Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs
- Int. Telecommun. Union-Telecommun. Stand. Sector
- ITU-T, Rec. P.862, "Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs," Int. Telecommun. Union-Telecommun. Stand. Sector 2001.
- (2001) ITU-T Rec. P.862

21
- 70349227623
- Efficient musical noise suppression for speech enhancement system
- T. Esch and P. Vary, "Efficient musical noise suppression for speech enhancement system," in Proc. ICASSP, 2009, pp. 4409-4412.
- (2009) Proc. ICASSP , pp. 4409-4412
- Esch, T.¹ Vary, P.²

22
- 0003425258
- Englewood Cliffs, NJ, USA: Prentice-Hall
- L. R. Rabiner and R. W. Schafer, Digital Processing of Speech Signals. Englewood Cliffs, NJ, USA: Prentice-Hall, 1978.
- (1978) Digital Processing of Speech Signals
- Rabiner, L.R.¹ Schafer, R.W.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.