Volume , Issue , 2014, Pages 1078-1082

Convolutional deep maxout networks for phone recognition

Author keywords

Convolutional neural networks; Deep neural networks; Maxout networks; TIMIT

Indexed keywords

CHEMICAL ACTIVATION; CONVOLUTION; ELECTRIC RECTIFIERS; NEURAL NETWORKS; NEURONS; SPEECH COMMUNICATION; TELEPHONE CIRCUITS; TELEPHONE SETS

EID: 84910069623     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 41

References (24)
  • 1
    • 84867605836
    • Applying convolutional neural network concepts to hybrid NN-HMM model for speech recognition
    • O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural network concepts to hybrid NN-HMM model for speech recognition," in Proc. ICASSP, 2012, pp. 4277-4280.
    • (2012) Proc. ICASSP , pp. 4277-4280
    • Abdel-Hamid, O.1    Mohamed, A.2    Jiang, H.3    Penn, G.4
  • 2
    • 84890545163
    • A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion
    • L. Deng, O. Abdel-Hamid, and D. Yu, "A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion," in Proc. ICASSP, 2013, pp. 6669-6673.
    • (2013) Proc. ICASSP , pp. 6669-6673
    • Deng, L.1    Abdel-Hamid, O.2    Yu, D.3
  • 3
    • 84906214784
    • Exploring convolutional neural network structures and optimization techniques for speech recognition
    • O. Abdel-Hamid, L. Deng, and D. Yu, "Exploring convolutional neural network structures and optimization techniques for speech recognition," in Proc. Interspeech, 2013, pp. 3366-3370.
    • (2013) Proc. Interspeech , pp. 3366-3370
    • Abdel-Hamid, O.1    Deng, L.2    Yu, D.3
  • 5
    • 84893654379
    • Improvements to deep convolutional neural networks for LVCSR
    • T. N. Sainath, B. Kingsbury, A. Mohamed, B. Ramabhadran, et al., "Improvements to deep convolutional neural networks for LVCSR," in Proc. ASRU, 2013, pp. 315-320.
    • (2013) Proc. ASRU , pp. 315-320
    • Sainath, T.N.1    Kingsbury, B.2    Mohamed, A.3    Ramabhadran, B.4
  • 6
    • 84905252069
    • Combining time- and frequency-domain convolution in convolutional neural network-based phone recognition
    • accepted, in print
    • L. Tóth, "Combining time- and frequency-domain convolution in convolutional neural network-based phone recognition," in Proc. ICASSP, 2014, accepted, in print.
    • (2014) Proc. ICASSP
    • Tóth, L.1
  • 7
    • 84858971297
    • Convolutive bottleneck network features for LVCSR
    • K. Veselý, M. Karafiát, and F. Grézl, "Convolutive bottleneck network features for LVCSR," in Proc. ASRU, 2011, pp. 42-47.
    • (2011) Proc. ASRU , pp. 42-47
    • Veselý, K.1    Karafiát, M.2    Grézl, F.3
  • 8
    • 84906276981
    • Convolutional deep rectifier neural nets for phone recognition
    • L. Tóth, "Convolutional deep rectifier neural nets for phone recognition," in Proc. Interspeech, 2013, pp. 1722-1726.
    • (2013) Proc. Interspeech , pp. 1722-1726
    • Tóth, L.1
  • 10
    • 84893651518
    • Deep maxout neural networks for speech recognition
    • M. Cai, Y. Shi, and J. Liu, "Deep maxout neural networks for speech recognition," in Proc. ASRU, 2013, pp. 291-296.
    • (2013) Proc. ASRU , pp. 291-296
    • Cai, M.1    Shi, Y.2    Liu, J.3
  • 11
    • 84893701756
    • Deep maxout networks for low-resource speech recognition
    • Y. Miao, F. Metze, and S. Rawat, "Deep maxout networks for low-resource speech recognition," in Proc. ASRU, 2013, pp. 398-403.
    • (2013) Proc. ASRU , pp. 398-403
    • Miao, Y.1    Metze, F.2    Rawat, S.3
  • 12
    • 84905270524
    • Investigation of maxout networks for speech recognition
    • accepted, in print
    • P. Swietojanski, J. Li, and J. T. Huang, "Investigation of maxout networks for speech recognition," in Proc. ICASSP, 2014, accepted, in print.
    • (2014) Proc. ICASSP
    • Swietojanski, P.1    Li, J.2    Huang, J.T.3
  • 13
    • 84905239342
    • Improving deep neural network acoustic models using generalized maxout networks
    • accepted, in print
    • X. Zhang, J. Trmal, D. Povey, and S. Khudanpur, "Improving deep neural network acoustic models using generalized maxout networks," in Proc. ICASSP, 2014, accepted, in print.
    • (2014) Proc. ICASSP
    • Zhang, X.1    Trmal, J.2    Povey, D.3    Khudanpur, S.4
  • 14
    • 84905252882
    • Stochastic pooling maxout networks for low-resource speech recognition
    • accepted, in print
    • M. Cai, Y. Shi, and J. Liu, "Stochastic pooling maxout networks for low-resource speech recognition," in Proc. ICASSP, 2014, accepted, in print.
    • (2014) Proc. ICASSP
    • Cai, M.1    Shi, Y.2    Liu, J.3
  • 15
    • 77955803591
    • Enhanced phone posteriors for improving speech recognition systems
    • H. Ketabdar and H. Bourlard, "Enhanced phone posteriors for improving speech recognition systems," IEEE Trans. ASLP, vol. 18, no. 6, pp. 1094-1106, 2010.
    • (2010) IEEE Trans. ASLP , vol.18 , Issue.6 , pp. 1094-1106
    • Ketabdar, H.1    Bourlard, H.2
  • 16
    • 78049251448
    • Analysis of MLP based hierarchical phoneme posterior probability estimator
    • J. Pinto et al., "Analysis of MLP based hierarchical phoneme posterior probability estimator," IEEE Trans. ASLP, vol. 19, no. 2, pp. 225-241, 2010.
    • (2010) IEEE Trans. ASLP , vol.19 , Issue.2 , pp. 225-241
    • Pinto, J.1
  • 18
    • 84890527827
    • Improving deep neural networks for LVCSR using rectified linear units and dropout
    • G. E. Dahl, T. N. Sainath, and G. E. Hinton, "Improving deep neural networks for LVCSR using rectified linear units and dropout," in Proc. ICASSP, 2013, pp. 8609-8613.
    • (2013) Proc. ICASSP , pp. 8609-8613
    • Dahl, G.E.1    Sainath, T.N.2    Hinton, G.E.3
  • 20
    • 84890451371
    • Phone recognition with deep sparse rectifier neural networks
    • L. Tóth, "Phone recognition with deep sparse rectifier neural networks," in Proc. ICASSP, 2013, pp. 6985-6989.
    • (2013) Proc. ICASSP , pp. 6985-6989
    • Tóth, L.1
  • 21
    • 84893676344
    • Rectifier nonlinearities improve neural network acoustic models
    • A. L. Maas, A. Y. Hannun, and A. Y. Ng, "Rectifier nonlinearities improve neural network acoustic models," in Proc. ICML, 2013.
    • (2013) Proc. ICML
    • Maas, A.L.1    Hannun, A.Y.2    Ng, A.Y.3
  • 22
    • 84055211743
    • Acoustic modeling using deep belief networks
    • A. Mohamed, G. E. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," IEEE Trans. ASLP, vol. 20, no. 1, pp. 14-22, 2012.
    • (2012) IEEE Trans. ASLP , vol.20 , Issue.1 , pp. 14-22
    • Mohamed, A.1    Dahl, G.E.2    Hinton, G.3
  • 23
    • 84858976070
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • F. Seide, G. Li, L. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU, 2011, pp. 24-29.
    • (2011) Proc. ASRU , pp. 24-29
    • Seide, F.1    Li, G.2    Chen, L.3    Yu, D.4
  • 24
    • 84890466217
    • Improving neural networks by preventing coadaptation of feature detectors
    • vol. abs/1207.0580
    • G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, "Improving neural networks by preventing coadaptation of feature detectors," CoRR, vol. abs/1207.0580, 2012.
    • (2012) CoRR
    • Hinton, G.E.1    Srivastava, N.2    Krizhevsky, A.3    Sutskever, I.4    Salakhutdinov, R.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.