2014, Pages 1534-1538

Boosted deep neural networks and multi-resolution cochleagram features for voice activity detection

Author keywords

Boosting; Cochleagram; Deep neural network; MRCG; Voice activity detection

Indexed keywords

COMPUTATIONAL LINGUISTICS; FORECASTING; SPEECH COMMUNICATION; SPEECH PROCESSING;

EID: 84910097441     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding
DOI: None     Document Type: Conference Paper
Times cited: 61

References (25)
  • 1
    • EID: 79959828814
    • Deep-structured hidden conditional random fields for phonetic recognition
    • D. Yu and L. Deng, "Deep-structured hidden conditional random fields for phonetic recognition," in Proc. Interspeech, 2010, pp. 2986-2989.
    • (2010) Proc. Interspeech, pp. 2986-2989
    • Yu, D.; Deng, L.
  • 3
    • EID: 0032762471
    • A statistical model-based voice activity detection
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, 1999.
    • (1999) IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3
    • Sohn, J.; Kim, N.S.; Sung, W.
  • 5
    • EID: 77950091897
    • Voice activity detection based on statistical models and machine learning approaches
    • J. W. Shin, J. H. Chang, and N. S. Kim, "Voice activity detection based on statistical models and machine learning approaches," Computer Speech & Lang., vol. 24, no. 3, pp. 515-530, 2010.
    • (2010) Computer Speech & Lang., vol. 24, no. 3, pp. 515-530
    • Shin, J.W.; Chang, J.H.; Kim, N.S.
  • 6
    • EID: 79959838316
    • Voice activity detection based on conditional random fields using multiple features
    • A. Saito, Y. Nankaku, A. Lee, and K. Tokuda, "Voice activity detection based on conditional random fields using multiple features," in Proc. Interspeech, 2010, pp. 2086-2089.
    • (2010) Proc. Interspeech, pp. 2086-2089
    • Saito, A.; Nankaku, Y.; Lee, A.; Tokuda, K.
  • 7
    • EID: 84875828442
    • Voice activity detection via noise reducing using non-negative sparse coding
    • P. Teng and Y. Jia, "Voice activity detection via noise reducing using non-negative sparse coding," IEEE Signal Process. Lett., vol. 20, no. 5, pp. 475-478, 2013.
    • (2013) IEEE Signal Process. Lett., vol. 20, no. 5, pp. 475-478
    • Teng, P.; Jia, Y.
  • 8
    • EID: 84910100905
    • Voice activity detection in presence of transient noise using spectral clustering
    • S. Mousazadeh and I. Cohen, "Voice activity detection in presence of transient noise using spectral clustering," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 6, pp. 1261-1271, 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 6, pp. 1261-1271
    • Mousazadeh, S.; Cohen, I.
  • 9
    • EID: 77956289831
    • Discriminative training for multiple observation likelihood ratio based voice activity detection
    • T. Yu and J. H. L. Hansen, "Discriminative training for multiple observation likelihood ratio based voice activity detection," IEEE Signal Process. Lett., vol. 17, no. 11, pp. 897-900, 2010.
    • (2010) IEEE Signal Process. Lett., vol. 17, no. 11, pp. 897-900
    • Yu, T.; Hansen, J.H.L.
  • 10
    • EID: 80053614636
    • Voice activity detection based on an unsupervised learning framework
    • D. Ying, Y. Yan, J. Dang, and F. Soong, "Voice activity detection based on an unsupervised learning framework," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 8, pp. 2624-2644, 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 8, pp. 2624-2644
    • Ying, D.; Yan, Y.; Dang, J.; Soong, F.
  • 11
    • EID: 85008579584
    • Multiple acoustic model-based discriminative likelihood ratio weighting for voice activity detection
    • Y. Suh and H. Kim, "Multiple acoustic model-based discriminative likelihood ratio weighting for voice activity detection," IEEE Signal Process. Lett., vol. 19, no. 8, pp. 507-510, 2012.
    • (2012) IEEE Signal Process. Lett., vol. 19, no. 8, pp. 507-510
    • Suh, Y.; Kim, H.
  • 12
    • EID: 84890490765
    • Robust front-end processing for speaker identification over extremely degraded communication channels
    • S. O. Sadjadi and J. H. Hansen, "Robust front-end processing for speaker identification over extremely degraded communication channels," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2013, pp. 7214-7218.
    • (2013) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., pp. 7214-7218
    • Sadjadi, S.O.; Hansen, J.H.
  • 14
    • EID: 84872300403
    • Deep belief networks based voice activity detection
    • X.-L. Zhang and J. Wu, "Deep belief networks based voice activity detection," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 4, pp. 697-710, 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 4, pp. 697-710
    • Zhang, X.-L.; Wu, J.
  • 15
    • EID: 84906228076
    • Speech activity detection on YouTube using deep neural networks
    • N. Ryant, M. Liberman, and J. Yuan, "Speech activity detection on YouTube using deep neural networks," in Proc. Interspeech, 2013, pp. 728-731.
    • (2013) Proc. Interspeech, pp. 728-731
    • Ryant, N.; Liberman, M.; Yuan, J.
  • 16
    • EID: 84905233552
    • A feature study for classification-based speech separation at very low signal-to-noise ratio
    • J. Chen, Y. Wang, and D. L. Wang, "A feature study for classification-based speech separation at very low signal-to-noise ratio," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2014, in press.
    • (2014) Proc. Int. Conf. Acoust., Speech, Signal Process., in press
    • Chen, J.; Wang, Y.; Wang, D.L.
  • 17
    • EID: 84910032338
    • Aurora working group: DSR front end LVCSR evaluation AU/384/02
    • D. Pearce and J. Picone, "Aurora working group: DSR front end LVCSR evaluation AU/384/02," Inst. for Signal & Inform. Process., Mississippi State Univ., Tech. Rep., 2002.
    • (2002) Inst. for Signal & Inform. Process., Mississippi State Univ., Tech. Rep.
    • Pearce, D.; Picone, J.
  • 18
    • EID: 80053403826
    • Ensemble methods in machine learning
    • T. G. Dietterich, "Ensemble methods in machine learning," Multiple Classifier Sys., pp. 1-15, 2000.
    • (2000) Multiple Classifier Sys., pp. 1-15
    • Dietterich, T.G.
  • 22
    • EID: 38849102154
    • Auditory segmentation based on onset and offset analysis
    • G. Hu and D. L. Wang, "Auditory segmentation based on onset and offset analysis," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 2, pp. 396-405, 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 2, pp. 396-405
    • Hu, G.; Wang, D.L.
  • 23
    • EID: 84871829474
    • A multistream feature framework based on bandpass modulation filtering for robust speech recognition
    • S. K. Nemala, K. Patil, and M. Elhilali, "A multistream feature framework based on bandpass modulation filtering for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 2, pp. 416-426, 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 2, pp. 416-426
    • Nemala, S.K.; Patil, K.; Elhilali, M.
  • 24
    • EID: 71049180205
    • Computational Auditory Scene Analysis: Principles, Algorithms and Applications
    • D. L. Wang and G. J. Brown, Computational Auditory Scene Analysis: Principles, Algorithms and Applications. Wiley-IEEE Press, 2006.
    • (2006) Wiley-IEEE Press
    • Wang, D.L.; Brown, G.J.
  • 25
    • EID: 23344452899
    • Statistical voice activity detection using a multiple observation likelihood ratio test
    • J. Ramírez, J. C. Segura, C. Benítez, L. García, and A. Rubio, "Statistical voice activity detection using a multiple observation likelihood ratio test," IEEE Signal Process. Lett., vol. 12, no. 10, pp. 689-692, 2005.
    • (2005) IEEE Signal Process. Lett., vol. 12, no. 10, pp. 689-692
    • Ramírez, J.; Segura, J.C.; Benítez, C.; García, L.; Rubio, A.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.