SCOPUS 정보 검색 플랫폼

Proceedings - International Conference on Pattern Recognition

Volumn 0, Issue , 2016, Pages 3542-3547

Faster training of very deep networks via p-norm gates

(4) Pham, Trang a Tran, Truyen a Phung, Dinh a Venkatesh, Svetha a

a DEAKIN UNIVERSITY (Australia)

Author keywords

[No Author keywords available]

Indexed keywords

DEEP NEURAL NETWORKS; NETWORK LAYERS; PATTERN RECOGNITION;

CONTRIBUTING FACTOR; CONTROLLABLE FLOW; FEEDFORWARD MODEL; HIGHWAY NETWORKS; LEARNING PROCESS; MACHINE TRANSLATIONS; RECURRENT MODELS; SENSORY INFORMATION;

LONG SHORT-TERM MEMORY;

EID: 85019136679 PISSN: 10514651 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICPR.2016.7900183 Document Type: Conference Paper

Times cited : (20)

References (21)

1
- 84973911419
- Delving deep into rectifiers: Surpassing human-level performance on imagenet classification
- K. He, X. Zhang, S. Ren, and J. Sun, "Delving deep into rectifiers: Surpassing human-level performance on imagenet classification, " in Proceedings of the IEEE International Conference on Computer Vision, 2015, pp. 1026-1034.
- (2015) Proceedings of the IEEE International Conference on Computer Vision , pp. 1026-1034
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

2
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- G. Hinton, L. Deng, D. Yu, G. E. Dahl, A.-r. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath et al., "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, " Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, 2012.
- (2012) Signal Processing Magazine, IEEE , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.E.⁴ Mohamed, A.-R.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.N.¹⁰

3
- 84928547704
- Sequence to sequence learning with neural networks
- I. Sutskever, O. Vinyals, and Q. V. Le, "Sequence to sequence learning with neural networks, " in Advances in Neural Information Processing Systems, 2014, pp. 3104-3112.
- (2014) Advances in Neural Information Processing Systems , pp. 3104-3112
- Sutskever, I.¹ Vinyals, O.² Le, Q.V.³

4
- 84978835300
- arXiv preprint arXiv:1506. 07285
- A. Kumar, O. Irsoy, J. Su, J. Bradbury, R. English, B. Pierce, P. Ondruska, I. Gulrajani, and R. Socher, "Ask me anything: Dynamic memory networks for natural language processing, " arXiv preprint arXiv:1506. 07285, 2015.
- (2015) Ask Me Anything: Dynamic Memory Networks for Natural Language Processing
- Kumar, A.¹ Irsoy, O.² Su, J.³ Bradbury, J.⁴ English, R.⁵ Pierce, B.⁶ Ondruska, P.⁷ Gulrajani, I.⁸ Socher, R.⁹

5
- 69349090197
- Learning deep architectures for AI
- Y. Bengio, "Learning deep architectures for AI, " Foundations and trendsR in Machine Learning, vol. 2, no. 1, pp. 1-127, 2009.
- (2009) Foundations and TrendsR in Machine Learning , vol.2 , Issue.1 , pp. 1-127
- Bengio, Y.¹

6
- 84954310140
- The loss surfaces of multilayer networks
- A. Choromanska, M. Henaff, M. Mathieu, G. B. Arous, and Y. LeCun, "The loss surfaces of multilayer networks, " in International Conference on Artificial Intelligence and Statistics, 2015, pp. 192-204.
- (2015) International Conference on Artificial Intelligence and Statistics , pp. 192-204
- Choromanska, A.¹ Henaff, M.² Mathieu, M.³ Arous, G.B.⁴ LeCun, Y.⁵

7
- 59449087310
- Exploring strategies for training deep neural networks
- H. Larochelle, Y. Bengio, J. Louradour, and P. Lamblin, "Exploring strategies for training deep neural networks, " The Journal of Machine Learning Research, vol. 10, pp. 1-40, 2009.
- (2009) The Journal of Machine Learning Research , vol.10 , pp. 1-40
- Larochelle, H.¹ Bengio, Y.² Louradour, J.³ Lamblin, P.⁴

8
- 0041914606
- Gradient flow in recurrent nets: The difficulty of learning long-term dependencies
- IEEE Press
- S. Hochreiter, Y. Bengio, P. Frasconi, and J. Schmidhuber, "Gradient flow in recurrent nets: the difficulty of learning long-term dependencies, " A field guide to dynamical recurrent neural networks. IEEE Press, 2001.
- (2001) A Field Guide to Dynamical Recurrent Neural Networks
- Hochreiter, S.¹ Bengio, Y.² Frasconi, P.³ Schmidhuber, J.⁴

9
- 84862294866
- Deep sparse rectifier networks
- X. Glorot, A. Bordes, and Y. Bengio, "Deep sparse rectifier networks, " in Proceedings of the 14th International Conference on Artificial Intelligence and Statistics. JMLR W&CP Volume, vol. 15, 2011, pp. 315-323.
- (2011) Proceedings of the 14th International Conference on Artificial Intelligence and Statistics. JMLR W&CP Volume , vol.15 , pp. 315-323
- Glorot, X.¹ Bordes, A.² Bengio, Y.³

10
- 84897543523
- Maxout networks
- I. Goodfellow, D. Warde-Farley, M. Mirza, A. Courville, and Y. Bengio, "Maxout networks, " in Proceedings of The 30th International Conference on Machine Learning, 2013, pp. 1319-1327.
- (2013) Proceedings of the 30th International Conference on Machine Learning , pp. 1319-1327
- Goodfellow, I.¹ Warde-Farley, D.² Mirza, M.³ Courville, A.⁴ Bengio, Y.⁵

11
- 0031573117
- Long short-term memory
- S. Hochreiter and J. Schmidhuber, "Long short-term memory, " Neural computation, vol. 9, no. 8, pp. 1735-1780, 1997.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

12
- 84906979661
- arXiv preprint arXiv:1308. 0850
- A. Graves, "Generating sequences with recurrent neural networks, " arXiv preprint arXiv:1308. 0850, 2013.
- (2013) Generating Sequences with Recurrent Neural Networks
- Graves, A.¹

13
- 84965164720
- Training very deep networks
- R. K. Srivastava, K. Greff, and J. Schmidhuber, "Training very deep networks, " in Advances in Neural Information Processing Systems, 2015, pp. 2368-2376.
- (2015) Advances in Neural Information Processing Systems , pp. 2368-2376
- Srivastava, R.K.¹ Greff, K.² Schmidhuber, J.³

14
- 84958589374
- arXiv preprint arXiv:1512. 03385
- K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition, " arXiv preprint arXiv:1512. 03385, 2015.
- (2015) Deep Residual Learning for Image Recognition
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

15
- 84943799837
- arXiv preprint arXiv:1409. 1259
- K. Cho, B. van Merriënboer, D. Bahdanau, and Y. Bengio, "On the properties of neural machine translation: Encoder-decoder approaches, " arXiv preprint arXiv:1409. 1259, 2014.
- (2014) On the Properties of Neural Machine Translation: Encoder-decoder Approaches
- Cho, K.¹ Van Merriënboer, B.² Bahdanau, D.³ Bengio, Y.⁴

16
- 84961291190
- Learning phrase representations using rnn encoder-decoder for statistical machine translation
- K. Cho, B. Van Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio, "Learning phrase representations using rnn encoder-decoder for statistical machine translation, " in EMNLP, 2014, pp. 1724-1734.
- (2014) EMNLP , pp. 1724-1734
- Cho, K.¹ Van Merriënboer, B.² Gulcehre, C.³ Bahdanau, D.⁴ Bougares, F.⁵ Schwenk, H.⁶ Bengio, Y.⁷

17
- 84939821078
- arXiv preprint arXiv:1412. 3555
- J. Chung, C. Gulcehre, K. Cho, and Y. Bengio, "Empirical evaluation of gated recurrent neural networks on sequence modeling, " arXiv preprint arXiv:1412. 3555, 2014.
- (2014) Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
- Chung, J.¹ Gulcehre, C.² Cho, K.³ Bengio, Y.⁴

18
- 84897527816
- T. Mikolov, I. Sutskever, A. Deoras, H.-S. Le, S. Kombrink, and J. Cernocky, "Subword language modeling with neural networks, " preprint (http://www. fit. vutbr. cz/imikolov/rnnlm/char. pdf), 2012.
- (2012) Subword Language Modeling with Neural Networks
- Mikolov, T.¹ Sutskever, I.² Deoras, A.³ Le, H.-S.⁴ Kombrink, S.⁵ Cernocky, J.⁶

19
- 84978952442
- arXiv preprint arXiv:1508. 06615
- Y. Kim, Y. Jernite, D. Sontag, and A. M. Rush, "Character-aware neural language models, " arXiv preprint arXiv:1508. 06615, 2015.
- (2015) Character-aware Neural Language Models
- Kim, Y.¹ Jernite, Y.² Sontag, D.³ Rush, A.M.⁴

20
- 85019122005
- arXiv preprint arXiv:1602. 00357
- T. Pham, T. Tran, D. Phung, and S. Venkatesh, "Deepcare: A deep dynamic memory model for predictive medicine, " arXiv preprint arXiv:1602. 00357, 2016.
- (2016) Deepcare: A Deep Dynamic Memory Model for Predictive Medicine
- Pham, T.¹ Tran, T.² Phung, D.³ Venkatesh, S.⁴

21
- 84998851843
- arXiv preprint arXiv:1511. 08400
- D. Krueger and R. Memisevic, "Regularizing RNNs by Stabilizing Activations, " arXiv preprint arXiv:1511. 08400, 2015.
- (2015) Regularizing RNNs by Stabilizing Activations
- Krueger, D.¹ Memisevic, R.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.