메뉴 건너뛰기




Volumn 2015-August, Issue , 2015, Pages 4989-4993

An analysis of convolutional neural networks for speech recognition

Author keywords

Convolutional neural networks; DNN; low footprint models; maxout units

Indexed keywords

AUDIO SIGNAL PROCESSING; CONVOLUTION; DEEP NEURAL NETWORKS; NEURAL NETWORKS; SPEECH; SPEECH COMMUNICATION;

EID: 84946086402     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178920     Document Type: Conference Paper
Times cited : (125)

References (24)
  • 3
    • 84055222005 scopus 로고    scopus 로고
    • Contextdependent pre-trained deep neural networks for largevocabulary speech recognition
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Contextdependent pre-trained deep neural networks for largevocabulary speech recognition," IEEE Trans. on Audio, Speech and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) IEEE Trans. on Audio, Speech and Language Processing , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 5
    • 84890491198 scopus 로고    scopus 로고
    • Recent advances in deep learning for speech research at Microsoft
    • L. Deng, J. Li, J.-T. Huang et al. "Recent advances in deep learning for speech research at Microsoft," in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Deng, L.1    Li, J.2    Huang, J.-T.3
  • 9
    • 84910028405 scopus 로고    scopus 로고
    • Improving language-universal feature extraction with deep maxout and convolutional neural networks
    • Y. Miao and F. Metze, "Improving language-universal feature extraction with deep maxout and convolutional neural networks," in Proc. Interspeech, 2014.
    • (2014) Proc. Interspeech
    • Miao, Y.1    Metze, F.2
  • 11
    • 85083953021 scopus 로고    scopus 로고
    • Feature learning in deep neural networks-studies on speech recognition tasks
    • D. Yu, M. Seltzer, J. Li, J-T. Huang, F. Seide, "Feature learning in deep neural networks-studies on speech recognition tasks", ICLR 2013.
    • (2013) ICLR
    • Yu, D.1    Seltzer, M.2    Li, J.3    Huang, J.-T.4    Seide, F.5
  • 14
    • 84906251664 scopus 로고    scopus 로고
    • Accurate and compact large vocabulary speech recognition on mobile devices
    • X. Lei, A. Senior, A., A. Gruenstein, and J. Sorensen, "Accurate and compact large vocabulary speech recognition on mobile devices," in Proc. Interspeech, 2013.
    • (2013) Proc. Interspeech
    • Lei, X.1    Senior, A.A.2    Gruenstein, A.3    Sorensen, J.4
  • 17
    • 84905270524 scopus 로고    scopus 로고
    • Investigation of maxout networks for speech recognition
    • P. Swietojanski, J. Li, and J.-T. Huang, "Investigation of maxout networks for speech recognition," in Proc. ICASSP, 2014
    • (2014) Proc. ICASSP
    • Swietojanski, P.1    Li, J.2    Huang, J.-T.3
  • 18
    • 67651044226 scopus 로고    scopus 로고
    • Spectro-temporal analysis of speech using 2-d Gabor filters
    • T. Ezzat, J. Bouvrie, and T. Poggio, "Spectro-temporal analysis of speech using 2-d Gabor filters," in Proc. Interspeech, 2007.
    • (2007) Proc. Interspeech
    • Ezzat, T.1    Bouvrie, J.2    Poggio, T.3
  • 19
    • 84910036228 scopus 로고    scopus 로고
    • Robust CNN-based speech recognition with Gabor filter kernels
    • S.-Y. Chang and N. Morgan, "Robust CNN-based speech recognition with Gabor filter kernels," in Proc. Interspeech, 2014.
    • (2014) Proc. Interspeech
    • Chang, S.-Y.1    Morgan, N.2
  • 22
    • 84910035297 scopus 로고    scopus 로고
    • Learning small-size DNN with output-distribution-based criteria
    • J. Li, R. Zhao, J.-T. Huang, and Y. Gong, "Learning small-size DNN with output-distribution-based criteria," in Proc. Interspeech, 2014.
    • (2014) Proc. Interspeech
    • Li, J.1    Zhao, R.2    Huang, J.-T.3    Gong, Y.4
  • 23
    • 84910069623 scopus 로고    scopus 로고
    • Convolutional deep maxout networks for phone recognition
    • L. Toth, "Convolutional deep maxout networks for phone recognition," in Proc. Interspeech, 2014.
    • (2014) Proc. Interspeech
    • Toth, L.1
  • 24
    • 84910046405 scopus 로고    scopus 로고
    • Long short-term memory recurrent neural network architectures for large scale acoustic modeling
    • H. Sak, A. Senior, and F. Beaufays, "Long short-term memory recurrent neural network architectures for large scale acoustic modeling," in Interspeech, 2014, pp. 338-342.
    • (2014) Interspeech , pp. 338-342
    • Sak, H.1    Senior, A.2    Beaufays, F.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.