메뉴 건너뛰기




Volumn , Issue , 2013, Pages 7378-7382

Recurrent neural networks for voice activity detection

Author keywords

endpointing; recurrent neural networks (RNNs); Voice activity detection (VAD)

Indexed keywords

END-POINTING; GAUSSIAN MIXTURE MODEL (GMMS); QUADRATIC POLYNOMIAL; RECURRENT NEURAL NETWORK (RNN); RECURRENT NEURAL NETWORK (RNNS); TEMPORAL CONTINUITY; TEMPORAL SMOOTHING; VOICE ACTIVITY DETECTION;

EID: 84890484287     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6639096     Document Type: Conference Paper
Times cited : (201)

References (15)
  • 1
    • 0032762471 scopus 로고    scopus 로고
    • A statistical modelbased voice activity detection
    • J. Sohn, N.S. Kim, and W. Sung, "A Statistical Modelbased Voice Activity Detection," Signal Processing Letters, IEEE, vol. 6, no. 1, pp. 1-3, 1999.
    • (1999) Signal Processing Letters, IEEE , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 2
    • 84878610785 scopus 로고    scopus 로고
    • Speech/nonspeech segmentation in web videos
    • Ananya Misra, "Speech/Nonspeech Segmentation in Web Videos," in Proceedings of InterSpeech 2012, 2012.
    • (2012) Proceedings of InterSpeech 2012
    • Misra, A.1
  • 3
    • 0000353178 scopus 로고
    • A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
    • L.E. Baum, T. Petrie, G. Soules, and N.Weiss, "A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains," The annals of mathematical statistics, pp. 164-171, 1970.
    • (1970) The Annals of Mathematical Statistics , pp. 164-171
    • Baum, L.E.1    Petrie, T.2    Soules, G.3    Weiss, N.4
  • 5
  • 7
    • 36348930730 scopus 로고    scopus 로고
    • Non-linear estimation of voice activity to improve automatic recognition of noisy speech
    • R. Gemello, F. Mana, and R. De Mori, "Non-linear estimation of voice activity to improve automatic recognition of noisy speech," in Proceedings of Interspeech 2005, 2005.
    • (2005) Proceedings of Interspeech 2005
    • Gemello, R.1    Mana, F.2    De Mori, R.3
  • 8
    • 0035248382 scopus 로고    scopus 로고
    • A recurrent neural fuzzy network for word boundary detection in variable noiselevel environments
    • G.D. Wu and C.T. Lin, "A recurrent neural fuzzy network for word boundary detection in variable noiselevel environments," Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on, vol. 31, no. 1, pp. 84-97, 2001.
    • (2001) Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on , vol.31 , Issue.1 , pp. 84-97
    • Wu, G.D.1    Lin, C.T.2
  • 11
    • 84865072849 scopus 로고    scopus 로고
    • Taming the reservoir: Feedforward training for recurrent neural networks
    • Oliver Obst and Martin Riedmiller, "Taming the Reservoir: Feedforward Training for Recurrent Neural Networks," in Accepted at IJCNN 2012, 2012.
    • (2012) Accepted at IJCNN 2012
    • Obst, O.1    Riedmiller, M.2
  • 12
    • 84890491840 scopus 로고    scopus 로고
    • Sameer Agarwal and Keir Mierle, Ceres Solver: Tutorial &Reference, Google Inc
    • Sameer Agarwal and Keir Mierle, Ceres Solver: Tutorial &Reference, Google Inc.
  • 15
    • 84878539964 scopus 로고    scopus 로고
    • Application of pretrained deep neural networks to large vocabulary speech recognition
    • Navdeep Jaitly, Patrick Nguyen, Andrew Senior, and Vincent Vanhoucke, "Application Of Pretrained Deep Neural Networks To Large Vocabulary Speech Recognition," in Proceedings of Interspeech 2012, 2012.
    • (2012) Proceedings of Interspeech 2012
    • Jaitly, N.1    Nguyen, P.2    Senior, A.3    Vanhoucke, V.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.