Volume , Issue , 2013, Pages 853-857

Denoising deep neural networks based voice activity detection

Author keywords

Deep learning; denoising deep neural networks; voice activity detection

Indexed keywords

CLEAN SPEECH; CROSS ENTROPY; DEEP LEARNING; DEEP NEURAL NETWORKS; MULTIPLE FEATURES; NOISY SPEECH SIGNALS; STATE-OF-THE-ART PERFORMANCE; VOICE ACTIVITY DETECTION;

EID: 84889263385     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6637769     Document Type: Conference Paper
Times cited : (57)

References (26)
  • 1
    • 79959828814
    • Deep-structured hidden conditional random fields for phonetic recognition
    • D. Yu and L. Deng, "Deep-structured hidden conditional random fields for phonetic recognition," in Proc. INTERSPEECH, 2010, pp. 2986-2989.
    • (2010) Proc. INTERSPEECH , pp. 2986-2989
    • Yu, D.1    Deng, L.2
  • 2
    • 84055222005
    • Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
    • G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large vocabulary speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 5
    • 28244470718
    • The time dimension for scene analysis
    • D. L. Wang, "The time dimension for scene analysis," IEEE Trans. Neural Netw., vol. 16, no. 6, pp. 1401-1426, 2005.
    • (2005) IEEE Trans. Neural Netw. , vol.16 , Issue.6 , pp. 1401-1426
    • Wang, D.L.1
  • 7
    • 84877762231
    • Exploring monaural features for classification-based speech segregation
    • Y. X. Wang, K. Han, and D. L. Wang, "Exploring monaural features for classification-based speech segregation," IEEE Trans. Audio, Speech, Lang. Process., vol. 1, no. 99, pp. 1-10, 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.1 , Issue.99 , pp. 1-10
    • Wang, Y.X.1    Han, K.2    Wang, D.L.3
  • 9
    • 84875678689
    • Towards scaling up classification-based speech separation
    • Y. X. Wang and D. L. Wang, "Towards scaling up classification-based speech separation," IEEE Trans. Audio, Speech, Lang. Process., vol. PP, no. 99, pp. 1-23, 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process , vol.PP , Issue.99 , pp. 1-23
    • Wang, Y.X.1    Wang, D.L.2
  • 10
    • 67650137747
    • Discriminative weight training for a statistical model-based voice activity detection
    • S. I. Kang, Q. H. Jo, and J. H. Chang, "Discriminative weight training for a statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 15, pp. 170-173, 2008.
    • (2008) IEEE Signal Process. Lett. , vol.15 , pp. 170-173
    • Kang, S.I.1    Jo, Q.H.2    Chang, J.H.3
  • 11
    • 77950091897
    • Voice activity detection based on statistical models and machine learning approaches
    • J. W. Shin, J. H. Chang, and N. S. Kim, "Voice activity detection based on statistical models and machine learning approaches," Computer Speech & Language, vol. 24, no. 3, pp. 515-530, 2010.
    • (2010) Computer Speech & Language , vol.24 , Issue.3 , pp. 515-530
    • Shin, J.W.1    Chang, J.H.2    Kim, N.S.3
  • 12
    • 77956289831
    • Discriminative training for multiple observation likelihood ratio based voice activity detection
    • T. Yu and J. H. L. Hansen, "Discriminative training for multiple observation likelihood ratio based voice activity detection," IEEE Signal Process. Lett., vol. 17, no. 11, pp. 897-900, 2010.
    • (2010) IEEE Signal Process. Lett. , vol.17 , Issue.11 , pp. 897-900
    • Yu, T.1    Hansen, J.H.L.2
  • 13
    • 79952611095
    • Maximum margin clustering based statistical VAD with multiple observation compound feature
    • J. Wu and X. L. Zhang, "Maximum margin clustering based statistical VAD with multiple observation compound feature," IEEE Signal Process. Lett., vol. 18, no. 5, pp. 283-286, 2011.
    • (2011) IEEE Signal Process. Lett. , vol.18 , Issue.5 , pp. 283-286
    • Wu, J.1    Zhang, X.L.2
  • 14
    • 79959756010
    • Efficient multiple kernel support vector machine based voice activity detection
    • J. Wu and X. L. Zhang, "Efficient multiple kernel support vector machine based voice activity detection," IEEE Signal Process. Lett., vol. 18, no. 8, pp. 466-499, 2011.
    • (2011) IEEE Signal Process. Lett. , vol.18 , Issue.8 , pp. 466-499
    • Wu, J.1    Zhang, X.L.2
  • 15
    • 84890504386
    • Linearithmic time sparse and convex maximum margin clustering
    • X. L. Zhang and J. Wu, "Linearithmic time sparse and convex maximum margin clustering," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 1, no. 99, pp. 1-24, 2012.
    • (2012) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.1 , Issue.99 , pp. 1-24
    • Zhang, X.L.1    Wu, J.2
  • 16
    • 85008579584
    • Multiple acoustic model-based discriminative likelihood ratio weighting for voice activity detection
    • Y. Suh and H. Kim, "Multiple acoustic model-based discriminative likelihood ratio weighting for voice activity detection," IEEE Signal Process. Lett., vol. 19, no. 8, pp. 507-510, 2012.
    • (2012) IEEE Signal Process. Lett. , vol.19 , Issue.8 , pp. 507-510
    • Suh, Y.1    Kim, H.2
  • 17
    • 84872300403
    • Deep belief networks based voice activity detection
    • X. L. Zhang and J. Wu, "Deep belief networks based voice activity detection," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 4, pp. 3371-3408, 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process , vol.21 , Issue.4 , pp. 3371-3408
    • Zhang, X.L.1    Wu, J.2
  • 18
    • 33746600649
    • Reducing the dimensionality of data with neural networks
    • G.E. Hinton and R.R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.R.2
  • 21
    • 79551480483
    • Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion
    • P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P. A. Manzagol, "Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion," J. Mach. Learn. Res., vol. 11, pp. 3371-3408, 2010.
    • (2010) J. Mach. Learn. Res. , vol.11 , pp. 3371-3408
    • Vincent, P.1    Larochelle, H.2    Lajoie, I.3    Bengio, Y.4    Manzagol, P.A.5
  • 22
    • 0021645331
    • Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator," IEEE Trans. Acoustic, Speech, Signal Process., vol. 32, no. 6, pp. 1109-1121, 1984.
    • (1984) IEEE Trans. Acoustic, Speech, Signal Process , vol.32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 23
    • 0032762471
    • A statistical model-based voice activity detection
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, 1999.
    • (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 24
    • 0041360463
    • Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
    • Israel Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging," IEEE Trans. Speech, Audio Process., vol. 11, no. 5, pp. 466-475, 2003.
    • (2003) IEEE Trans. Speech, Audio Process , vol.11 , Issue.5 , pp. 466-475
    • Cohen, I.1
  • 25
    • 23344452899
    • Statistical voice activity detection using a multiple observation likelihood ratio test
    • J. Ramírez, J. C. Segura, C. Benítez, L. García, and A. Rubio, "Statistical voice activity detection using a multiple observation likelihood ratio test," IEEE Signal Process. Lett., vol. 12, no. 10, pp. 689-692, 2005.
    • (2005) IEEE Signal Process. Lett. , vol.12 , Issue.10 , pp. 689-692
    • Ramírez, J.1    Segura, J.C.2    Benítez, C.3    García, L.4    Rubio, A.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.