SCOPUS Information Search Platform

4th International Conference on Learning Representations, ICLR 2016 - Conference Track Proceedings

Volume , Issue , 2016, Pages

All you need is a good init

a CZECH TECHNICAL UNIVERSITY IN PRAGUE (Czech Republic)

Author keywords

[No Author keywords available]

Indexed keywords

ACTIVATION FUNCTIONS; COMPLEX SCHEMES; INNER PRODUCT; ORTHONORMAL; SIMPLE METHOD; STATE OF THE ART; TEST ACCURACY; WEIGHT INITIALIZATION;

DEEP LEARNING;

EID: 85083951894 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (189)

References (29)

1
- 84864073449
- Greedy layer-wise training of deep networks
- Schölkopf, B., Platt, J.C., and Hoffman, T. (eds), MIT Press
- Bengio, Yoshua, Lamblin, Pascal, Popovici, Dan, and Larochelle, Hugo. Greedy layer-wise training of deep networks. In Schölkopf, B., Platt, J.C., and Hoffman, T. (eds.), Advances in Neural Information Processing Systems 19, pp. 153–160. MIT Press, 2007. URL http://papers.nips.cc/paper/3048-greedy-layer-wise-training-of-deep-networks.pdf.
- (2007) Advances in Neural Information Processing Systems , vol.19 , pp. 153-160
- Bengio, Y.¹ Lamblin, P.² Popovici, D.³ Larochelle, H.⁴

2
- 84998841407
- arXiv e-prints, November
- Chang, J.-R. and Chen, Y.-S. Batch-normalized Maxout Network in Network. ArXiv e-prints, November 2015. URL http://arxiv.org/abs/1511.02583.
- (2015) Batch-Normalized Maxout Network in Network
- Chang, J.-R.¹ Chen, Y.-S.²

3
- 84989350175
- Dieleman, Sander. Classifying plankton with deep neural networks, 2015. URL http://benanne.github.io/2015/03/17/plankton.html.
- (2015) Classifying Plankton with Deep Neural Networks
- Dieleman, S.¹

4
- 79951563340
- Understanding the difficulty of training deep feedforward neural networks
- Society for Artificial Intelligence and Statistics
- Glorot, Xavier and Bengio, Yoshua. Understanding the difficulty of training deep feedforward neural networks. In Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS10). Society for Artificial Intelligence and Statistics, 2010.
- (2010) Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS10)
- Glorot, X.¹ Bengio, Y.²

5
- 84862294866
- Deep sparse rectifier neural networks
- Gordon, Geoffrey J. and Dunson, David B. (eds), Journal of Machine Learning Research Workshop and Conference Proceedings
- Glorot, Xavier, Bordes, Antoine, and Bengio, Yoshua. Deep sparse rectifier neural networks. In Gordon, Geoffrey J. and Dunson, David B. (eds.), Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS-11), volume 15, pp. 315–323. Journal of Machine Learning Research - Workshop and Conference Proceedings, 2011. URL http://www.jmlr.org/proceedings/papers/v15/glorot11a/glorot11a.pdf.
- (2011) Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS-11) , vol.15 , pp. 315-323
- Glorot, X.¹ Bordes, A.² Bengio, Y.³

6
- 84897543523
- Maxout networks
- Atlanta, GA, USA, 16-21 June 2013
- Goodfellow, Ian J., Warde-Farley, David, Mirza, Mehdi, Courville, Aaron C., and Bengio, Yoshua. Maxout networks. In Proceedings of the 30th International Conference on Machine Learning, ICML 2013, Atlanta, GA, USA, 16-21 June 2013, pp. 1319–1327, 2013. URL http://jmlr.org/proceedings/papers/v28/goodfellow13.html.
- (2013) Proceedings of the 30th International Conference on Machine Learning, ICML 2013 , pp. 1319-1327
- Goodfellow, I.J.¹ Warde-Farley, D.² Mirza, M.³ Courville, A.C.⁴ Bengio, Y.⁵

7
- 84959913792
- arXiv e-prints, December
- Graham, Ben. Fractional Max-Pooling. ArXiv e-prints, December 2014a. URL http://arxiv.org/abs/1412.6071.
- (2014) Fractional Max-Pooling
- Graham, B.¹

8
- 85070939086
- Graham, Ben. Train you very own deep convolutional network, 2014b. URL https://www.kaggle.com/c/cifar-10/forums/t/10493/train-you-very-own-deep-convolutional-network.
- (2014) Train You Very Own Deep Convolutional Network
- Graham, B.¹

9
- 84965172354
- arXiv e-prints, September
- Graham, Ben. Spatially-sparse convolutional neural networks. ArXiv e-prints, September 2014c.
- (2014) Spatially-Sparse Convolutional Neural Networks
- Graham, B.¹

10
- 84958589374
- arXiv e-prints, December
- He, K., Zhang, X., Ren, S., and Sun, J. Deep Residual Learning for Image Recognition. ArXiv e-prints, December 2015. URL http://arxiv.org/abs/1512.03385.
- (2015) Deep Residual Learning for Image Recognition
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

11
- 84973911419
- Delving deep into rectifiers: Surpassing human-level performance on imagenet classification
- He, Kaiming, Zhang, Xiangyu, Ren, Shaoqing, and Sun, Jian. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In International Conference on Computer Vision (ICCV), 2015. URL http://arxiv.org/abs/1502.01852.
- (2015) International Conference on Computer Vision (ICCV)
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

12
- 84945906645
- Distilling the knowledge in a neural network
- Hinton, Geoffrey, Vinyals, Oriol, and Dean, Jeff. Distilling the Knowledge in a Neural Network. In Proceedings of Deep Learning and Representation Learning Workshop: NIPS 2014, 2014.
- (2014) Proceedings of Deep Learning and Representation Learning Workshop: NIPS 2014
- Hinton, G.¹ Vinyals, O.² Dean, J.³

13
- 84969584486
- Batch normalization: Accelerating deep network training by reducing internal covariate shift
- Blei, David and Bach, Francis (eds), JMLR Workshop and Conference Proceedings
- Ioffe, Sergey and Szegedy, Christian. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In Blei, David and Bach, Francis (eds.), Proceedings of the 32nd International Conference on Machine Learning (ICML-15), pp. 448–456. JMLR Workshop and Conference Proceedings, 2015. URL http://jmlr.org/proceedings/papers/v37/ioffe15.pdf.
- (2015) Proceedings of the 32nd International Conference on Machine Learning (ICML-15) , pp. 448-456
- Ioffe, S.¹ Szegedy, C.²

14
- 84913555165
- arXiv preprint
- Jia, Yangqing, Shelhamer, Evan, Donahue, Jeff, Karayev, Sergey, Long, Jonathan, Girshick, Ross, Guadarrama, Sergio, and Darrell, Trevor. Caffe: Convolutional architecture for fast feature embedding. arXiv preprint arXiv:1408.5093, 2014.
- (2014) Caffe: Convolutional Architecture for Fast Feature Embedding
- Jia, Y.¹ Shelhamer, E.² Donahue, J.³ Karayev, S.⁴ Long, J.⁵ Girshick, R.⁶ Guadarrama, S.⁷ Darrell, T.⁸

15
- 77956002520
- Master’s thesis
- Krizhevsky, Alex. Learning Multiple Layers of Features from Tiny Images. Master’s thesis, 2009. URL http://www.cs.toronto.edu/~kriz/learning-features-2009-TR.pdf.
- (2009) Learning Multiple Layers of Features from Tiny Images
- Krizhevsky, A.¹

16
- 84876231242
- Imagenet classification with deep convolutional neural networks
- Pereira, F., Burges, C.J.C., Bottou, L., and Weinberger, K.Q. (eds), Curran Associates, Inc
- Krizhevsky, Alex, Sutskever, Ilya, and Hinton, Geoffrey E. Imagenet classification with deep convolutional neural networks. In Pereira, F., Burges, C.J.C., Bottou, L., and Weinberger, K.Q. (eds.), Advances in Neural Information Processing Systems 25, pp. 1097–1105. Curran Associates, Inc., 2012. URL http://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf.
- (2012) Advances in Neural Information Processing Systems , vol.25 , pp. 1097-1105
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

17
- 0032203257
- Gradient-based learning applied to document recognition
- Nov
- Lecun, Y., Bottou, L., Bengio, Y., and Haffner, P. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278–2324, Nov 1998. ISSN 0018-9219. doi: 10.1109/5.726791.
- (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
- Lecun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

18
- 85009928594
- Deeply-supervised nets
- San Diego, California, USA, May 9-12, 2015
- Lee, Chen-Yu, Xie, Saining, Gallagher, Patrick W., Zhang, Zhengyou, and Tu, Zhuowen. Deeply-supervised nets. In Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2015, San Diego, California, USA, May 9-12, 2015, 2015. URL http://jmlr.org/proceedings/papers/v38/lee15a.html.
- (2015) Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2015
- Lee, C.-Y.¹ Xie, S.² Gallagher, P.W.³ Zhang, Z.⁴ Tu, Z.⁵

19
- 84893676344
- Rectifier nonlinearities improve neural network acoustic models
- Maas, Andrew L, Hannun, Awni Y, and Ng, Andrew Y. Rectifier nonlinearities improve neural network acoustic models. Proc. ICML, 30, 2013.
- (2013) Proc. ICML , vol.30
- Maas, A.L.¹ Hannun, A.Y.² Ng, A.Y.³

20
- 85083953559
- Fitnets: Hints for thin deep nets
- May
- Romero, Adriana, Ballas, Nicolas, Kahou, Samira Ebrahimi, Chassang, Antoine, Gatta, Carlo, and Bengio, Yoshua. Fitnets: Hints for thin deep nets. In Proceedings of ICLR, May 2015. URL http://arxiv.org/abs/1412.6550.
- (2015) Proceedings of ICLR
- Romero, A.¹ Ballas, N.² Kahou, S.E.³ Chassang, A.⁴ Gatta, C.⁵ Bengio, Y.⁶

21
- 84947041871
- ImageNet large scale visual recognition challenge
- April
- Russakovsky, Olga, Deng, Jia, Su, Hao, Krause, Jonathan, Satheesh, Sanjeev, Ma, Sean, Huang, Zhiheng, Karpathy, Andrej, Khosla, Aditya, Bernstein, Michael, Berg, Alexander C., and Fei-Fei, Li. ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision (IJCV), pp. 1–42, April 2015. doi: 10.1007/s11263-015-0816-y.
- (2015) International Journal of Computer Vision (IJCV) , pp. 1-42
- Russakovsky, O.¹ Deng, J.² Su, H.³ Krause, J.⁴ Satheesh, S.⁵ Ma, S.⁶ Huang, Z.⁷ Karpathy, A.⁸ Khosla, A.⁹ Bernstein, M.¹⁰ Berg, A.C.¹¹ Fei-Fei, L.¹²

22
- 85083950783
- Exact solutions to the nonlinear dynamics of learning in deep linear neural networks
- Saxe, Andrew M., McClelland, James L., and Ganguli, Surya. Exact solutions to the nonlinear dynamics of learning in deep linear neural networks. In Proceedings of ICLR, 2014. URL http://arxiv.org/abs/1312.6120.
- (2014) Proceedings of ICLR
- Saxe, A.M.¹ McClelland, J.L.² Ganguli, S.³

23
- 84994202024
- arXiv e-prints, September
- Sercu, T., Puhrsch, C., Kingsbury, B., and LeCun, Y. Very Deep Multilingual Convolutional Neural Networks for LVCSR. ArXiv e-prints, September 2015. URL http://arxiv.org/abs/1509.08967.
- (2015) Very Deep Multilingual Convolutional Neural Networks for LVCSR
- Sercu, T.¹ Puhrsch, C.² Kingsbury, B.³ LeCun, Y.⁴

24
- 84925410541
- Very deep convolutional networks for large-scale image recognition
- May
- Simonyan, Karen and Zisserman, Andrew. Very deep convolutional networks for large-scale image recognition. In Proceedings of ICLR, May 2015. URL http://arxiv.org/abs/1409.1556.
- (2015) Proceedings of ICLR
- Simonyan, K.¹ Zisserman, A.²

25
- 84962006941
- Striving for simplicity: The all convolutional net
- December
- Springenberg, J. T., Dosovitskiy, A., Brox, T., and Riedmiller, M. Striving for Simplicity: The All Convolutional Net. In Proceedings of ICLR Workshop, December 2014. URL http://arxiv.org/abs/1412.6806.
- (2014) Proceedings of ICLR Workshop
- Springenberg, J.T.¹ Dosovitskiy, A.² Brox, T.³ Riedmiller, M.⁴

26
- 84965164720
- Training very deep networks
- Srivastava, Rupesh Kumar, Greff, Klaus, and Schmidhuber, Jürgen. Training Very Deep Networks. In Proceedings of NIPS, 2015. URL http://arxiv.org/abs/1507.06228.
- (2015) Proceedings of NIPS
- Srivastava, R.K.¹ Greff, K.² Schmidhuber, J.³

27
- 84965135330
- arXiv e-prints, December
- Sussillo, David and Abbott, L. F. Random Walk Initialization for Training Very Deep Feedforward Networks. ArXiv e-prints, December 2014. URL http://arxiv.org/abs/1412.6558.
- (2014) Random Walk Initialization for Training Very Deep Feedforward Networks
- Sussillo, D.¹ Abbott, L.F.²

28
- 84937522268
- Going deeper with convolutions
- Szegedy, Christian, Liu, Wei, Jia, Yangqing, Sermanet, Pierre, Reed, Scott, Anguelov, Dragomir, Erhan, Dumitru, Vanhoucke, Vincent, and Rabinovich, Andrew. Going deeper with convolutions. In CVPR 2015, 2015. URL http://arxiv.org/abs/1409.4842.
- (2015) CVPR 2015
- Szegedy, C.¹ Liu, W.² Jia, Y.³ Sermanet, P.⁴ Reed, S.⁵ Anguelov, D.⁶ Erhan, D.⁷ Vanhoucke, V.⁸ Rabinovich, A.⁹

29
- 84893343292
- Lecture 6.5: RMSProp – Divide the gradient by a running average of its recent magnitude
- Tieleman, Tijmen and Hinton, Geoffrey. Lecture 6.5: RMSProp – Divide the gradient by a running average of its recent magnitude. In COURSERA: Neural Networks for Machine Learning. 2012.
- (2012) COURSERA: Neural Networks for Machine Learning
- Tieleman, T.¹ Hinton, G.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.