메뉴 건너뛰기




Volumn 2016-May, Issue , 2016, Pages 4955-4959

Very deep multilingual convolutional neural networks for LVCSR

Author keywords

Acoustic Modeling; Convolutional Networks; Multilingual; Neural Networks; Speech Recognition

Indexed keywords


EID: 84973324686     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2016.7472620     Document Type: Conference Paper
Times cited : (198)

References (31)
  • 1
    • 0032203257 scopus 로고    scopus 로고
    • Gradientbased learning applied to document recognition
    • Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradientbased learning applied to document recognition, " Proc. of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
    • (1998) Proc. of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • LeCun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 2
    • 84876231242 scopus 로고    scopus 로고
    • Imagenet classification with deep convolutional neural networks
    • A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks, " in Proc. NIPS, 2012, pp. 1097-1105.
    • (2012) Proc. NIPS , pp. 1097-1105
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.E.3
  • 3
    • 85083953063 scopus 로고    scopus 로고
    • Very deep convolutional networks for large-scale image recognition
    • K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition, " Proc. ICLR, 2015.
    • (2015) Proc. ICLR
    • Simonyan, K.1    Zisserman, A.2
  • 4
    • 84887328988 scopus 로고    scopus 로고
    • Pedestrian detection with unsupervised multi-stage feature learning
    • P. Sermanet, K. Kavukcuoglu, S. Chintala, and Y. LeCun, "Pedestrian detection with unsupervised multi-stage feature learning, " in Proc. CVPR, 2013, pp. 3626-3633.
    • (2013) Proc. CVPR , pp. 3626-3633
    • Sermanet, P.1    Kavukcuoglu, K.2    Chintala, S.3    LeCun, Y.4
  • 5
    • 84911400494 scopus 로고    scopus 로고
    • Rich feature hierarchies for accurate object detection and semantic segmentation
    • R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation, " in Proc. CVPR, 2014, pp. 580-587.
    • (2014) Proc. CVPR , pp. 580-587
    • Girshick, R.1    Donahue, J.2    Darrell, T.3    Malik, J.4
  • 6
    • 85083951635 scopus 로고    scopus 로고
    • Overfeat: Integrated recognition, localization and detection using convolutional networks
    • P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun, "Overfeat: Integrated recognition, localization and detection using convolutional networks, " Proc. ICLR, 2014.
    • (2014) Proc. ICLR
    • Sermanet, P.1    Eigen, D.2    Zhang, X.3    Mathieu, M.4    Fergus, R.5    LeCun, Y.6
  • 9
    • 85077149209 scopus 로고
    • Experiments with time delay networks and dynamic time warping for speaker independent isolated digits recognition
    • L. Bottou, F. F. Soulié, P. Blanchet, and J. S. Liénard, "Experiments with time delay networks and dynamic time warping for speaker independent isolated digits recognition, " in Proc. Eurospeech, 1989.
    • (1989) Proc. Eurospeech
    • Bottou, L.1    Soulié, F.F.2    Blanchet, P.3    Liénard, J.S.4
  • 10
    • 0025209234 scopus 로고
    • Speaker-independent isolated digit recognition: Multilayer perceptrons vs. Dynamic time warping
    • L. Bottou, F. F. Soulié, P. Blanchet, and J. S. Liénard, "Speaker-independent isolated digit recognition: multilayer perceptrons vs. dynamic time warping, " Neural Networks, vol. 3, no. 4, pp. 453-465, 1990.
    • (1990) Neural Networks , vol.3 , Issue.4 , pp. 453-465
    • Bottou, L.1    Soulié, F.F.2    Blanchet, P.3    Liénard, J.S.4
  • 11
    • 0026835134 scopus 로고
    • Global optimization of a neural network-hidden Markov model hybrid
    • Y. Bengio, R. De Mori, G. Flammia, and R. Kompe, "Global optimization of a neural network-hidden Markov model hybrid, " IEEE Trans. on Neural Networks, vol. 3, no. 2, pp. 252-259, 1992.
    • (1992) IEEE Trans. on Neural Networks , vol.3 , Issue.2 , pp. 252-259
    • Bengio, Y.1    De Mori, R.2    Flammia, G.3    Kompe, R.4
  • 12
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks., " in Proc. Interspeech, 2011, pp. 437-440.
    • (2011) Proc. Interspeech , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 14
    • 70349213445 scopus 로고    scopus 로고
    • Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
    • Brian Kingsbury, "Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling, " in Proc. ICASSP. IEEE, 2009, pp. 3761-3764.
    • (2009) Proc. ICASSP. IEEE , pp. 3761-3764
    • Kingsbury, B.1
  • 15
    • 84867605836 scopus 로고    scopus 로고
    • Applying convolutional neural networks concepts to hybrid NNHMM model for speech recognition
    • O. Abdel-Hamid, A.-r. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural networks concepts to hybrid NNHMM model for speech recognition, " in Proc. ICASSP, 2012, pp. 4277-4280.
    • (2012) Proc. ICASSP , pp. 4277-4280
    • Abdel-Hamid, O.1    Mohamed, A.-R.2    Jiang, H.3    Penn, G.4
  • 17
    • 84905265980 scopus 로고    scopus 로고
    • Joint training of convolutional and non-convolutional neural networks
    • H. Soltau, G. Saon, and T. N. Sainath, "Joint training of convolutional and non-convolutional neural networks, " Proc. ICASSP, 2014.
    • (2014) Proc. ICASSP
    • Soltau, H.1    Saon, G.2    Sainath, T.N.3
  • 18
    • 84959129849 scopus 로고    scopus 로고
    • The IBM 2015 english conversational telephone speech recognition system
    • G. Saon, H.-K. Kuo, S. Rennie, and M. Picheny, "The IBM 2015 english conversational telephone speech recognition system, " Proc. Interspeech, 2015.
    • (2015) Proc. Interspeech
    • Saon, G.1    Kuo, H.-K.2    Rennie, S.3    Picheny, M.4
  • 19
    • 84959133563 scopus 로고    scopus 로고
    • Very deep convolutional neural networks for LVCSR
    • M. Bi, Y. Qian, and K. Yu, "Very deep convolutional neural networks for LVCSR, " in Proc. Interspeech, 2015.
    • (2015) Proc. Interspeech
    • Bi, M.1    Qian, Y.2    Yu, K.3
  • 21
    • 84867606552 scopus 로고    scopus 로고
    • Multilingual MLP features for low-resource LVCSR systems
    • S. Thomas, S. Ganapathy, and H. Hermansky, "Multilingual MLP features for low-resource LVCSR systems, " in Proc. ICASSP, 2012.
    • (2012) Proc. ICASSP
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 22
    • 84890474441 scopus 로고    scopus 로고
    • Investigation on cross-and multilingual MLP features under matched and mismatched acoustical conditions
    • Z. Tüske, J. Pinto, D. Willett, and R. Schlüter, "Investigation on cross-and multilingual MLP features under matched and mismatched acoustical conditions, " in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Tüske, Z.1    Pinto, J.2    Willett, D.3    Schlüter, R.4
  • 25
    • 84959205572 scopus 로고    scopus 로고
    • Fully convolutional networks for semantic segmentation
    • J. Long, E. Shelhamer, and T. Darrell, "Fully convolutional networks for semantic segmentation, " CVPR, 2015.
    • (2015) CVPR
    • Long, J.1    Shelhamer, E.2    Darrell, T.3
  • 26
    • 84937943470 scopus 로고    scopus 로고
    • Depth map prediction from a single image using a multi-scale deep network
    • D. Eigen, C. Puhrsch, and R. Fergus, "Depth map prediction from a single image using a multi-scale deep network, " in Proc. NIPS, 2014, pp. 2366-2374.
    • (2014) Proc. NIPS , pp. 2366-2374
    • Eigen, D.1    Puhrsch, C.2    Fergus, R.3
  • 27
    • 70450211380 scopus 로고    scopus 로고
    • Investigation into bottleneck features for meeting speech recognition
    • F. Grezl, M. Karafiát, and L. Burget, "Investigation into bottleneck features for meeting speech recognition., " in Proc. Interspeech, 2009, pp. 2947-2950.
    • (2009) Proc. Interspeech , pp. 2947-2950
    • Grezl, F.1    Karafiát, M.2    Burget, L.3
  • 28
    • 84946037134 scopus 로고    scopus 로고
    • Convolutional, long short-term memory, fully connected deep neural networks
    • T. N Sainath, O. Vinyals, A. Senior, and H. Sak, "Convolutional, long short-term memory, fully connected deep neural networks, " Proc. ICASSP, 2015.
    • (2015) Proc. ICASSP
    • Sainath, T.N.1    Vinyals, O.2    Senior, A.3    Sak, H.4
  • 31
    • 84862277874 scopus 로고    scopus 로고
    • Understanding the difficulty of training deep feedforward neural networks
    • X. Glorot and Y. Bengio, "Understanding the difficulty of training deep feedforward neural networks, " in Proc. AISTATS, 2010, pp. 249-256.
    • (2010) Proc. AISTATS , pp. 249-256
    • Glorot, X.1    Bengio, Y.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.