SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2013, Pages 853-857

Denoising deep neural networks based voice activity detection

(2) Zhang, Xiao Lei a Wu, Ji a

a TSINGHUA UNIVERSITY (China)

Author keywords

Deep learning; denoising deep neural networks; voice activity detection

Indexed keywords

CLEAN SPEECH; CROSS ENTROPY; DEEP LEARNING; DEEP NEURAL NETWORKS; MULTIPLE FEATURES; NOISY SPEECH SIGNALS; STATE-OF-THE-ART PERFORMANCE; VOICE ACTIVITY DETECTION;

BACKPROPAGATION ALGORITHMS; SIGNAL PROCESSING; SPEECH RECOGNITION;

NEURAL NETWORKS;

EID: 84889263385 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2013.6637769 Document Type: Conference Paper

Times cited : (57)

References (26)

1
- 79959828814
- Deep-structured hidden conditional random fields for phonetic recognition
- D. Yu and L. Deng, "Deep-structured hidden conditional random fields for phonetic recognition," in Proc. IN-TERSPEECH, 2010, pp. 2986-2989.
- (2010) Proc. IN-TERSPEECH , pp. 2986-2989
- Yu, D.¹ Deng, L.²

2
- 84055222005
- Contextdependent pre-trained deep neural networks for large vocabulary speech recognition
- G. Dahl, D. Yu, L. Deng, and A. Acero, "Contextdependent pre-trained deep neural networks for large vocabulary speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 30-42, 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.¹ Yu, D.² Deng, L.³ Acero, A.⁴

3
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition
- G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, et al., "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Process. Mag., vol. 29, no. 11, pp. 2-17, 2012.
- (2012) IEEE Signal Process. Mag. , vol.29 , Issue.11 , pp. 2-17
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.¹⁰

4
- 84890460656
- A classification based approach to speech segregation
- K. Hana and D. L. Wang, "A classification based approach to speech segregation," The Journal of the Acoustical Society of America, vol. 99, pp. 1-34, 2012.
- (2012) The Journal of the Acoustical Society of America , vol.99 , pp. 1-34
- Hana, K.¹ Wang, D.L.²

5
- 28244470718
- The time dimension for scene analysis
- D. L. Wang, "The time dimension for scene analysis," IEEE Trans. Neural Netw., vol. 16, no. 6, pp. 1401-1426, 2005.
- (2005) IEEE Trans. Neural Netw. , vol.16 , Issue.6 , pp. 1401-1426
- Wang, D.L.¹

6
- 82255178542
- Wiley-IEEE Press
- D. L. Wang and G. J. Brown, Computational auditory scene analysis: principles, algorithms and applications, Wiley-IEEE Press, 2006.
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms and Applications
- Wang, D.L.¹ Brown, G.J.²

7
- 84877762231
- Exploring monaural features for classification-based speech segregation
- Y. X. Wang, K. Han, and D. L. Wang, "Exploring monaural features for classification-based speech segregation," IEEE Trans. Audio, Speech, Lang. Process., vol. 1, no. 99, pp. 1-10, 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.1 , Issue.99 , pp. 1-10
- Wang, Y.X.¹ Han, K.² Wang, D.L.³

8
- 84875681333
- Cocktail party processing via structured prediction
- Y. X.Wang and D. L.Wang, "Cocktail party processing via structured prediction," in Proc. Adv. Neural Inform. Process. Syst., 2012, pp. 1-8.
- (2012) Proc. Adv. Neural Inform. Process. Syst. , pp. 1-8
- Wang, Y.X.¹ Wang, D.L.²

9
- 84875678689
- Towards scaling up classification-based speech separation
- Y. X. Wang and D. L. Wang, "Towards scaling up classification-based speech separation," IEEE Trans. Audio, Speech, Lang. Process., vol. PP, no. 99, pp. 1-23, 2013.
- (2013) IEEE Trans. Audio, Speech, Lang. Process , vol.PP , Issue.99 , pp. 1-23
- Wang, Y.X.¹ Wang, D.L.²

10
- 67650137747
- Discriminative weight training for a statistical model-based voice activity detection
- S. I. Kang, Q. H. Jo, and J. H. Chang, "Discriminative weight training for a statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 15, pp. 170-173, 2008.
- (2008) IEEE Signal Process. Lett. , vol.15 , pp. 170-173
- Kang, S.I.¹ Jo, Q.H.² Chang, J.H.³

11
- 77950091897
- Voice activity detection based on statistical models and machine learning approaches
- J. W. Shin, J. H. Chang, and N. S. Kim, "Voice activity detection based on statistical models and machine learning approaches," Computer Speech & Language, vol. 24, no. 3, pp. 515-530, 2010.
- (2010) Computer Speech & Language , vol.24 , Issue.3 , pp. 515-530
- Shin, J.W.¹ Chang, J.H.² Kim, N.S.³

12
- 77956289831
- Discriminative training for multiple observation likelihood ratio based voice activity detection
- T. Yu and J. H. L. Hansen, "Discriminative training for multiple observation likelihood ratio based voice activity detection," IEEE Signal Process. Lett., vol. 17, no. 11, pp. 897-900, 2010.
- (2010) IEEE Signal Process. Lett. , vol.17 , Issue.11 , pp. 897-900
- Yu, T.¹ Hansen, J.H.L.²

13
- 79952611095
- Maximum margin clustering based statistical VAD with multiple observation compound feature
- J. Wu and X. L. Zhang, "Maximum margin clustering based statistical VAD with multiple observation compound feature," IEEE Signal Process. Lett., vol. 18, no. 5, pp. 283-286, 2011.
- (2011) IEEE Signal Process. Lett. , vol.18 , Issue.5 , pp. 283-286
- Wu, J.¹ Zhang, X.L.²

14
- 79959756010
- Efficient multiple kernel support vector machine based voice activity detection
- J. Wu and X. L. Zhang, "Efficient multiple kernel support vector machine based voice activity detection," IEEE Signal Process. Lett., vol. 18, no. 8, pp. 466-499, 2011.
- (2011) IEEE Signal Process. Lett. , vol.18 , Issue.8 , pp. 466-499
- Wu, J.¹ Zhang, X.L.²

15
- 84890504386
- Linearithmic time sparse and convex maximum margin clustering
- X. L. Zhang and J. Wu, "Linearithmic time sparse and convex maximum margin clustering," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 1, no. 99, pp. 1-24, 2012.
- (2012) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.1 , Issue.99 , pp. 1-24
- Zhang, X.L.¹ Wu, J.²

16
- 85008579584
- Multiple acoustic model-based discriminative likelihood ratio weighting for voice activity detection
- Y. Suh and H. Kim, "Multiple acoustic model-based discriminative likelihood ratio weighting for voice activity detection," IEEE Signal Process. Lett., vol. 19, no. 8, pp. 507-510, 2012.
- (2012) IEEE Signal Process. Lett. , vol.19 , Issue.8 , pp. 507-510
- Suh, Y.¹ Kim, H.²

17
- 84872300403
- Deep belief networks based voice activity detection
- X. L. Zhang and J. Wu, "Deep belief networks based voice activity detection," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 4, pp. 3371-3408, 2013.
- (2013) IEEE Trans. Audio, Speech, Lang. Process , vol.21 , Issue.4 , pp. 3371-3408
- Zhang, X.L.¹ Wu, J.²

18
- 33746600649
- Reducing the dimensionality of data with neural networks
- G.E. Hinton and R.R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
- (2006) Science , vol.313 , Issue.5786 , pp. 504-507
- Hinton, G.E.¹ Salakhutdinov, R.R.²

19
- 84862612564
- On contrastive divergence learning
- M. A. Carreira-Perpinan and G. E. Hinton, "On contrastive divergence learning," in Proc. Int. Conf. Artif. Intell. Stat., 2005, pp. 17-25.
- (2005) Proc. Int. Conf. Artif. Intell. Stat. , pp. 17-25
- Carreira-Perpinan, M.A.¹ Hinton, G.E.²

20
- 56449089103
- Extracting and composing robust features with denoising auto encoders
- P. Vincent, H. Larochelle, Y. Bengio, and P. A. Manzagol, "Extracting and composing robust features with denoising autoencoders," in Proc. 25th Int. Conf. Mach. Learn., 2008, pp. 1096-1103.
- (2008) Proc. 25th Int. Conf. Mach. Learn. , pp. 1096-1103
- Vincent, P.¹ Larochelle, H.² Bengio, Y.³ Manzagol, P.A.⁴

21
- 79551480483
- Stacked denoising auto encoders: Learning useful representations in a deep network with a local denoising criterion
- P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P. A. Manzagol, "Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion," J. Mach. Learn. Res., vol. 11, pp. 3371-3408, 2010.
- (2010) J. Mach. Learn. Res. , vol.11 , pp. 3371-3408
- Vincent, P.¹ Larochelle, H.² Lajoie, I.³ Bengio, Y.⁴ Manzagol, P.A.⁵

22
- 0021645331
- Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator," IEEE Trans. Acoustic, Speech, Signal Process., vol. 32, no. 6, pp. 1109-1121, 1984.
- (1984) IEEE Trans. Acoustic, Speech, Signal Process , vol.32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

23
- 0032762471
- A statistical model based voice activity detection
- J. Sohn, N. S. Kim, and W. Sung, "A statistical modelbased voice activity detection," IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, 1999.
- (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
- Sohn, J.¹ Kim, N.S.² Sung, W.³

24
- 0041360463
- Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
- Israel Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging," IEEE Trans. Speech, Audio Process., vol. 11, no. 5, pp. 466-475, 2003.
- (2003) IEEE Trans. Speech, Audio Process , vol.11 , Issue.5 , pp. 466-475
- Cohen, I.¹

25
- 23344452899
- Statistical voice activity detection using a multiple observation likelihood ratio test
- J. Ramírez, J. C. Segura, C. Benítez, L. García, and A. Rubio, "Statistical voice activity detection using a multiple observation likelihood ratio test," IEEE Signal Process. Lett., vol. 12, no. 10, pp. 689-692, 2005.
- (2005) IEEE Signal Process. Lett. , vol.12 , Issue.10 , pp. 689-692
- Ramírez, J.¹ Segura, J.C.² Benítez, C.³ García, L.⁴ Rubio, A.⁵

26
- 77956547440
- Simple and efficient multiple kernel learning by group lasso
- Z. Xu, R. Jin, H. Yang, I. King, and M. R. Lyu, "Simple and efficient multiple kernel learning by group lasso," in Proc. 27th Int. Conf. Mach. Learn., 2010, pp. 1175-1182.
- (2010) Proc. 27th Int. Conf. Mach. Learn. , pp. 1175-1182
- Xu, Z.¹ Jin, R.² Yang, H.³ King, I.⁴ Lyu, M.R.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.