SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn 2016-May, Issue , 2016, Pages 4955-4959

Very deep multilingual convolutional neural networks for LVCSR

(4) Sercu, Tom a,b Puhrsch, Christian a Kingsbury, Brian b Lecun, Yann a

a NEW YORK UNIVERSITY (United States)

b IBM T J WATSON RESEARCH CENTER (United States)

Author keywords

Acoustic Modeling; Convolutional Networks; Multilingual; Neural Networks; Speech Recognition

Indexed keywords

EID: 84973324686 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2016.7472620 Document Type: Conference Paper

Times cited : (198)

References (31)

1
- 0032203257
- Gradientbased learning applied to document recognition
- Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradientbased learning applied to document recognition, " Proc. of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
- (1998) Proc. of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
- LeCun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

2
- 84876231242
- Imagenet classification with deep convolutional neural networks
- A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks, " in Proc. NIPS, 2012, pp. 1097-1105.
- (2012) Proc. NIPS , pp. 1097-1105
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

3
- 85083953063
- Very deep convolutional networks for large-scale image recognition
- K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition, " Proc. ICLR, 2015.
- (2015) Proc. ICLR
- Simonyan, K.¹ Zisserman, A.²

4
- 84887328988
- Pedestrian detection with unsupervised multi-stage feature learning
- P. Sermanet, K. Kavukcuoglu, S. Chintala, and Y. LeCun, "Pedestrian detection with unsupervised multi-stage feature learning, " in Proc. CVPR, 2013, pp. 3626-3633.
- (2013) Proc. CVPR , pp. 3626-3633
- Sermanet, P.¹ Kavukcuoglu, K.² Chintala, S.³ LeCun, Y.⁴

5
- 84911400494
- Rich feature hierarchies for accurate object detection and semantic segmentation
- R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation, " in Proc. CVPR, 2014, pp. 580-587.
- (2014) Proc. CVPR , pp. 580-587
- Girshick, R.¹ Donahue, J.² Darrell, T.³ Malik, J.⁴

6
- 85083951635
- Overfeat: Integrated recognition, localization and detection using convolutional networks
- P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun, "Overfeat: Integrated recognition, localization and detection using convolutional networks, " Proc. ICLR, 2014.
- (2014) Proc. ICLR
- Sermanet, P.¹ Eigen, D.² Zhang, X.³ Mathieu, M.⁴ Fergus, R.⁵ LeCun, Y.⁶

7
- 84876258641
- Learning hierarchical features for scene labeling
- C. Farabet, C. Couprie, L. Najman, and Y. LeCun, "Learning hierarchical features for scene labeling, " IEEE Trans. on Pattern Analysis and Machine Intelligence, 2013.
- (2013) IEEE Trans. on Pattern Analysis and Machine Intelligence
- Farabet, C.¹ Couprie, C.² Najman, L.³ LeCun, Y.⁴

8
- 0024634603
- Phoneme recognition using time-delay neural networks
- A. Waibel, T. Hanazawa, G. Hinton, K. Shikano, and K. J. Lang, "Phoneme recognition using time-delay neural networks, " IEEE Trans. on Acoustics, Speech and Signal Processing, vol. 37, no. 3, 1989.
- (1989) IEEE Trans. on Acoustics, Speech and Signal Processing , vol.37 , Issue.3
- Waibel, A.¹ Hanazawa, T.² Hinton, G.³ Shikano, K.⁴ Lang, K.J.⁵

9
- 85077149209
- Experiments with time delay networks and dynamic time warping for speaker independent isolated digits recognition
- L. Bottou, F. F. Soulié, P. Blanchet, and J. S. Liénard, "Experiments with time delay networks and dynamic time warping for speaker independent isolated digits recognition, " in Proc. Eurospeech, 1989.
- (1989) Proc. Eurospeech
- Bottou, L.¹ Soulié, F.F.² Blanchet, P.³ Liénard, J.S.⁴

10
- 0025209234
- Speaker-independent isolated digit recognition: Multilayer perceptrons vs. Dynamic time warping
- L. Bottou, F. F. Soulié, P. Blanchet, and J. S. Liénard, "Speaker-independent isolated digit recognition: multilayer perceptrons vs. dynamic time warping, " Neural Networks, vol. 3, no. 4, pp. 453-465, 1990.
- (1990) Neural Networks , vol.3 , Issue.4 , pp. 453-465
- Bottou, L.¹ Soulié, F.F.² Blanchet, P.³ Liénard, J.S.⁴

11
- 0026835134
- Global optimization of a neural network-hidden Markov model hybrid
- Y. Bengio, R. De Mori, G. Flammia, and R. Kompe, "Global optimization of a neural network-hidden Markov model hybrid, " IEEE Trans. on Neural Networks, vol. 3, no. 2, pp. 252-259, 1992.
- (1992) IEEE Trans. on Neural Networks , vol.3 , Issue.2 , pp. 252-259
- Bengio, Y.¹ De Mori, R.² Flammia, G.³ Kompe, R.⁴

12
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks., " in Proc. Interspeech, 2011, pp. 437-440.
- (2011) Proc. Interspeech , pp. 437-440
- Seide, F.¹ Li, G.² Yu, D.³

13
- 80051654263
- Deep belief networks using discriminative features for phone recognition
- A.-r. Mohamed, T. N Sainath, G. Dahl, B. Ramabhadran, G. E Hinton, and Michael A P., "Deep belief networks using discriminative features for phone recognition, " in Proc. ICASSP. IEEE, 2011, pp. 5060-5063.
- (2011) Proc. ICASSP. IEEE , pp. 5060-5063
- Mohamed, A.-R.¹ Sainath, T.N.² Dahl, G.³ Ramabhadran, B.⁴ Hinton, G.E.⁵ Michael, A.P.⁶

14
- 70349213445
- Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
- Brian Kingsbury, "Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling, " in Proc. ICASSP. IEEE, 2009, pp. 3761-3764.
- (2009) Proc. ICASSP. IEEE , pp. 3761-3764
- Kingsbury, B.¹

15
- 84867605836
- Applying convolutional neural networks concepts to hybrid NNHMM model for speech recognition
- O. Abdel-Hamid, A.-r. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural networks concepts to hybrid NNHMM model for speech recognition, " in Proc. ICASSP, 2012, pp. 4277-4280.
- (2012) Proc. ICASSP , pp. 4277-4280
- Abdel-Hamid, O.¹ Mohamed, A.-R.² Jiang, H.³ Penn, G.⁴

16
- 84890525984
- Deep convolutional neural networks for lvcsr
- T. N. Sainath, A.-r. Mohamed, B. Kingsbury, and B. Ramabhadran, "Deep convolutional neural networks for lvcsr, " in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Sainath, T.N.¹ Mohamed, A.-R.² Kingsbury, B.³ Ramabhadran, B.⁴

17
- 84905265980
- Joint training of convolutional and non-convolutional neural networks
- H. Soltau, G. Saon, and T. N. Sainath, "Joint training of convolutional and non-convolutional neural networks, " Proc. ICASSP, 2014.
- (2014) Proc. ICASSP
- Soltau, H.¹ Saon, G.² Sainath, T.N.³

18
- 84959129849
- The IBM 2015 english conversational telephone speech recognition system
- G. Saon, H.-K. Kuo, S. Rennie, and M. Picheny, "The IBM 2015 english conversational telephone speech recognition system, " Proc. Interspeech, 2015.
- (2015) Proc. Interspeech
- Saon, G.¹ Kuo, H.-K.² Rennie, S.³ Picheny, M.⁴

19
- 84959133563
- Very deep convolutional neural networks for LVCSR
- M. Bi, Y. Qian, and K. Yu, "Very deep convolutional neural networks for LVCSR, " in Proc. Interspeech, 2015.
- (2015) Proc. Interspeech
- Bi, M.¹ Qian, Y.² Yu, K.³

20
- 84867224965
- On the use of a multilingual neural network front-end
- S. Scanzio, P. Laface, L. Fissore, R. Gemello, and F. Mana, "On the use of a multilingual neural network front-end, " in Proc. Interspeech, 2008.
- (2008) Proc. Interspeech
- Scanzio, S.¹ Laface, P.² Fissore, L.³ Gemello, R.⁴ Mana, F.⁵

21
- 84867606552
- Multilingual MLP features for low-resource LVCSR systems
- S. Thomas, S. Ganapathy, and H. Hermansky, "Multilingual MLP features for low-resource LVCSR systems, " in Proc. ICASSP, 2012.
- (2012) Proc. ICASSP
- Thomas, S.¹ Ganapathy, S.² Hermansky, H.³

22
- 84890474441
- Investigation on cross-and multilingual MLP features under matched and mismatched acoustical conditions
- Z. Tüske, J. Pinto, D. Willett, and R. Schlüter, "Investigation on cross-and multilingual MLP features under matched and mismatched acoustical conditions, " in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Tüske, Z.¹ Pinto, J.² Willett, D.³ Schlüter, R.⁴

23
- 84893668957
- Investigation of multilingual deep neural networks for spoken term detection
- K. M. Knill, M. J. F. Gales, S. P. Rath, P. C. Woodland, C. Zhang, and S.-X. Zhang, "Investigation of multilingual deep neural networks for spoken term detection, " in Proc. ASRU, 2013.
- (2013) Proc. ASRU
- Knill, K.M.¹ Gales, M.J.F.² Rath, S.P.³ Woodland, P.C.⁴ Zhang, C.⁵ Zhang, S.-X.⁶

24
- 80054736963
- Traffic sign recognition with multi-scale convolutional networks
- P. Sermanet and Y. LeCun, "Traffic sign recognition with multi-scale convolutional networks, " in Neural Networks (IJCNN), The 2011 International Joint Conference on. IEEE, 2011, pp. 2809-2813.
- (2011) Neural Networks (IJCNN), the 2011 International Joint Conference On. IEEE , pp. 2809-2813
- Sermanet, P.¹ LeCun, Y.²

25
- 84959205572
- Fully convolutional networks for semantic segmentation
- J. Long, E. Shelhamer, and T. Darrell, "Fully convolutional networks for semantic segmentation, " CVPR, 2015.
- (2015) CVPR
- Long, J.¹ Shelhamer, E.² Darrell, T.³

26
- 84937943470
- Depth map prediction from a single image using a multi-scale deep network
- D. Eigen, C. Puhrsch, and R. Fergus, "Depth map prediction from a single image using a multi-scale deep network, " in Proc. NIPS, 2014, pp. 2366-2374.
- (2014) Proc. NIPS , pp. 2366-2374
- Eigen, D.¹ Puhrsch, C.² Fergus, R.³

27
- 70450211380
- Investigation into bottleneck features for meeting speech recognition
- F. Grezl, M. Karafiát, and L. Burget, "Investigation into bottleneck features for meeting speech recognition., " in Proc. Interspeech, 2009, pp. 2947-2950.
- (2009) Proc. Interspeech , pp. 2947-2950
- Grezl, F.¹ Karafiát, M.² Burget, L.³

28
- 84946037134
- Convolutional, long short-term memory, fully connected deep neural networks
- T. N Sainath, O. Vinyals, A. Senior, and H. Sak, "Convolutional, long short-term memory, fully connected deep neural networks, " Proc. ICASSP, 2015.
- (2015) Proc. ICASSP
- Sainath, T.N.¹ Vinyals, O.² Senior, A.³ Sak, H.⁴

29
- 84969736572
- Technical Report, arXiv: 1212. 5701
- M. D. Zeiler, "ADADELTA: An adaptive learning rate method, " Technical Report, arXiv: 1212. 5701, 2012.
- (2012) ADADELTA: An Adaptive Learning Rate Method
- Zeiler, M.D.¹

30
- 85083951076
- Adam: A method for stochastic optimization
- D. Kingma and J. Ba, "Adam: A method for stochastic optimization, " in Proc. International Conference on Learning Representations (ICLR), 2015.
- (2015) Proc. International Conference on Learning Representations (ICLR)
- Kingma, D.¹ Ba, J.²

31
- 84862277874
- Understanding the difficulty of training deep feedforward neural networks
- X. Glorot and Y. Bengio, "Understanding the difficulty of training deep feedforward neural networks, " in Proc. AISTATS, 2010, pp. 249-256.
- (2010) Proc. AISTATS , pp. 249-256
- Glorot, X.¹ Bengio, Y.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.