SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2013, Pages 3002-3006

An investigation of spectral restoration algorithms for deep neural networks based noise robust speech recognition

(3) Li, Bo a Tsao, Yu b Sim, Khe Chai a

a NATIONAL UNIVERSITY OF SINGAPORE (Singapore)

b RESEARCH CENTER FOR INFORMATION TECHNOLOGY INNOVATION (Taiwan)

Author keywords

Deep neural networks; Spectral restoration; Speech enhancement

Indexed keywords

RESTORATION; SPEECH ENHANCEMENT; SPEECH RECOGNITION;

AUTOMATIC SPEECH RECOGNITION SYSTEM; DEEP NEURAL NETWORKS; GAUSSIAN MIXTURE MODEL; GENERALIZATION CAPABILITY; MINIMUM MEAN-SQUARE ERROR; MULTI-CONDITION TRAININGS; NOISE ROBUST SPEECH RECOGNITION; SPECTRAL AMPLITUDE ESTIMATORS;

ALGORITHMS;

EID: 84906272122 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (42)

References (25)

1
- 84055211743
- Acoustic modeling using deep belief networks
- A. Mohamed, G. E. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 14-22, 2012.
- (2012) Audio, Speech, and Language Processing, IEEE Transactions on , vol.20 , Issue.1 , pp. 14-22
- Mohamed, A.¹ Dahl, G.E.² Hinton, G.³

2
- 79959840616
- Investigation of full-sequence training of deep belief networks for speech recognition
- A. Mohamed, D. Yu, and L. Deng, "Investigation of full-sequence training of deep belief networks for speech recognition, " in Proc. Interspeech. ISCA, 2010, pp. 2846-2849.
- (2010) Proc. Interspeech. ISCA , pp. 2846-2849
- Mohamed, A.¹ Yu, D.² Deng, L.³

3
- 84055222005
- Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
- G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 30-42, 2012.
- (2012) Audio, Speech, and Language Processing, IEEE Transactions on , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.E.¹ Yu, D.² Deng, L.³ Acero, A.⁴

4
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks, " in Proc. Interspeech. ISCA, 2011, pp. 437-440.
- (2011) Proc. Interspeech. ISCA , pp. 437-440
- Seide, F.¹ Li, G.² Yu, D.³

5
- 84867626068
- Revisiting recurrent neural networks for robust ASR
- O. Vinyals, S. V. Ravuri, and D. Povey, "Revisiting recurrent neural networks for robust ASR, " in Proc. ICASSP. IEEE, 2012, pp. 4085-4088.
- (2012) Proc. ICASSP. IEEE , pp. 4085-4088
- Vinyals, O.¹ Ravuri, S.V.² Povey, D.³

6
- 84890497765
- The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- H. G. Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions, " in ASR2000-Automatic Speech Recognition: Challenges for the new Millenium ISCA Tutorial and Research Workshop (ITRW), 2000.
- (2000) ASR2000-Automatic Speech Recognition: Challenges for the New Millenium ISCA Tutorial and Research Workshop (ITRW)
- Hirsch, H.G.¹ Pearce, D.²

7
- 84878409063
- Recurrent neural networks for noise reduction in robust ASR
- ISCA
- A. L. Maas, Q. V. Le, T. M. ONeil, O. Vinyals, P. Nguyen, and A. Y. Ng, "Recurrent neural networks for noise reduction in robust ASR, " in Proc. Interspeech. ISCA, 2012.
- (2012) Proc. Interspeech.
- Maas, A.L.¹ Le, Q.V.² Oneil, T.M.³ Vinyals, O.⁴ Nguyen, P.⁵ Ng, A.Y.⁶

8
- 0034855352
- Highperformance robust speech recognition using stereo training data
- IEEE
- L. Deng, A. Acero, L. Jiang, J. Droppo, and X. Huang, "Highperformance robust speech recognition using stereo training data, " in Proc. ICASSP, vol. 1. IEEE, 2001, pp. 301-304.
- (2001) Proc. ICASSP , vol.1 , pp. 301-304
- Deng, L.¹ Acero, A.² Jiang, L.³ Droppo, J.⁴ Huang, X.⁵

9
- 77955815755
- Advanced front-end feature extraction algorithm
- ETSI
- ETSI, "Advanced front-end feature extraction algorithm, " in Technical Report. ETSI ES 202 050, 2007.
- (2007) Technical Report, ETSI es 202 050

10
- 84890532503
- Noise adaptive front-end normalization based on vector taylor series for deep neural networks in robust speech recognition
- IEEE
- B. Li and K. C. Sim, "Noise adaptive front-end normalization based on vector taylor series for deep neural networks in robust speech recognition, " in Proc. ICASSP. IEEE, 2013.
- (2013) Proc. ICASSP
- Li, B.¹ Sim, K.C.²

11
- 85083953021
- Feature learning in deep neural networks - A study on speech recognition tasks
- D. Yu, M. L. Seltzer, J. Li, and F. Seide, "Feature learning in deep neural networks - A study on speech recognition tasks, " in International Conference on Learning Representations, 2013.
- (2013) International Conference on Learning Representations
- Yu, D.¹ Seltzer, M.L.² Li, J.³ Seide, F.⁴

12
- 84883495336
- Springer
- J. Chen, J. Benesty, Y. Huang, and E. J. Diethorn, Fundamentals of Noise Reduction, ser. Springer Handbook of Speech Processing. Springer, 2008.
- (2008) Fundamentals of Noise Reduction, Ser. Springer Handbook of Speech Processing
- Chen, J.¹ Benesty, J.² Huang, Y.³ Diethorn, E.J.⁴

13
- 0029726517
- Speech enhancement based on a priori signal to noise estimation
- IEEE
- P. Scalart and J. Vieira Filho, "Speech enhancement based on a priori signal to noise estimation, " in Proc. ICASSP, vol. 2. IEEE, 1996, pp. 629-632.
- (1996) Proc. ICASSP , vol.2 , pp. 629-632
- Scalart, P.¹ Vieira Filho, J.²

14
- 0021645331
- Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator, " Acoustics, Speech and Signal Processing, IEEE Transactions on, vol. 32, no. 6, pp. 1109-1121, 1984.
- (1984) Acoustics, Speech and Signal Processing, IEEE Transactions on , vol.32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

15
- 27644556974
- Speech enhancement based on minimum mean-square error estimation and supergaussian priors
- R. Martin, "Speech enhancement based on minimum mean-square error estimation and supergaussian priors, " Speech and Audio Processing, IEEE Transactions on, vol. 13, no. 5, pp. 845-856, 2005.
- (2005) Speech and Audio Processing, IEEE Transactions on , vol.13 , Issue.5 , pp. 845-856
- Martin, R.¹

16
- 47949104834
- Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system
- J. H. Hansen, V. Radhakrishnan, and K. H. Arehart, "Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 14, no. 6, pp. 2049-2063, 2006.
- (2006) Audio, Speech, and Language Processing, IEEE Transactions on , vol.14 , Issue.6 , pp. 2049-2063
- Hansen, J.H.¹ Radhakrishnan, V.² Arehart, K.H.³

17
- 22944438092
- Speech enhancement by MAP spectral amplitude estimation using a super-gaussian speech model
- T. Lotter and P. Vary, "Speech enhancement by MAP spectral amplitude estimation using a super-gaussian speech model, " EURASIP Journal on Applied Signal Processing, vol. 2005, pp. 1110-1126, 2005.
- (2005) EURASIP Journal on Applied Signal Processing , vol.2005 , pp. 1110-1126
- Lotter, T.¹ Vary, P.²

18
- 77957737243
- A data-driven approach to a priori SNR estimation
- S. Suhadi, C. Last, and T. Fingscheidt, "A data-driven approach to a priori SNR estimation, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 1, pp. 186-195, 2011.
- (2011) Audio, Speech, and Language Processing, IEEE Transactions on , vol.19 , Issue.1 , pp. 186-195
- Suhadi, S.¹ Last, C.² Fingscheidt, T.³

19
- 0019009880
- Speech enhancement using a softdecision noise suppression filter
- R. McAulay and M. Malpass, "Speech enhancement using a softdecision noise suppression filter, " Acoustics, Speech and Signal Processing, IEEE Transactions on, vol. 28, no. 2, pp. 137-145, 1980.
- (1980) Acoustics, Speech and Signal Processing, IEEE Transactions on , vol.28 , Issue.2 , pp. 137-145
- McAulay, R.¹ Malpass, M.²

20
- 84890540088
- Maximum likelihood based noise covariance matrix estimation for multi-microphone speech enhancement
- U. Kjems and J. Jensen, "Maximum likelihood based noise covariance matrix estimation for multi-microphone speech enhancement, " Proc. EUSIPCO, 2012.
- (2012) Proc. EUSIPCO
- Kjems, U.¹ Jensen, J.²

21
- 84890461970
- Speech enhancement using generalized maximum a posteriori spectral amplitude estimator
- IEEE
- Y. C. Su, Y. Tsao, J. E. Wu, and F. R. Jean, "Speech enhancement using generalized maximum a posteriori spectral amplitude estimator, " in Proc. ICASSP. IEEE, 2013.
- (2013) Proc. ICASSP
- Su, Y.C.¹ Tsao, Y.² Wu, J.E.³ Jean, F.R.⁴

22
- 0036226165
- Noise estimation by minima controlled recursive averaging for robust speech enhancement
- IEEE
- I. Cohen and B. Berdugo, "Noise estimation by minima controlled recursive averaging for robust speech enhancement, " Signal Processing Letters, IEEE, vol. 9, no. 1, pp. 12-15, 2002.
- (2002) Signal Processing Letters , vol.9 , Issue.1 , pp. 12-15
- Cohen, I.¹ Berdugo, B.²

23
- 0041360463
- Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
- I. Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging, " Speech and Audio Processing, IEEE Transactions on, vol. 11, no. 5, pp. 466- 475, 2003.
- (2003) Speech and Audio Processing, IEEE Transactions on , vol.11 , Issue.5 , pp. 466-475
- Cohen, I.¹

24
- 0021892216
- Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator, " Acoustics, Speech and Signal Processing, IEEE Transactions on, vol. 33, no. 2, pp. 443-445, 1985.
- (1985) Acoustics, Speech and Signal Processing, IEEE Transactions on , vol.33 , Issue.2 , pp. 443-445
- Ephraim, Y.¹ Malah, D.²

25
- 57849152033
- Aurora 2.0 speech recognition in noise: Update 2
- D. Pierce and A. Gunawardana, "Aurora 2.0 speech recognition in noise: Update 2, " in Proc. ICSLP, 2002.
- (2002) Proc. ICSLP
- Pierce, D.¹ Gunawardana, A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.