SCOPUS Information Retrieval Platform

4th International Conference on Learning Representations, ICLR 2016 - Conference Track Proceedings

Volume , Issue , 2016, Pages

Fast and accurate deep network learning by exponential linear units (ELUs)

(3) Clevert, Djork-Arné a Unterthiner, Thomas a Hochreiter, Sepp a

a JOHANNES KEPLER UNIVERSITY LINZ (Austria)

Author keywords

[No Author keywords available]

Indexed keywords

CHEMICAL ACTIVATION;

ACTIVATION FUNCTIONS; CLASSIFICATION ACCURACY; CLASSIFICATION ERRORS; GENERALIZATION PERFORMANCE; NATURAL GRADIENT; NEGATIVE VALUES; NETWORK LEARNING; VANISHING GRADIENT;

DEEP NEURAL NETWORKS;

EID: 85083953568 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (2083)

References (35)

1
- 0000396062
- Natural gradient works efficiently in learning
- Amari, S.-I. Natural gradient works efficiently in learning. Neural Computation, 10(2):251–276, 1998.
- (1998) Neural Computation , vol.10 , Issue.2 , pp. 251-276
- Amari, S.-I.¹

2
- 84965180108
- Rectified factor networks
- Cortes, C., Lawrence, N. D., Lee, D. D., Sugiyama, M., and Garnett, R. (eds), Curran Associates, Inc
- Clevert, D.-A., Unterthiner, T., Mayr, A., and Hochreiter, S. Rectified factor networks. In Cortes, C., Lawrence, N. D., Lee, D. D., Sugiyama, M., and Garnett, R. (eds.), Advances in Neural Information Processing Systems 28. Curran Associates, Inc., 2015.
- (2015) Advances in Neural Information Processing Systems , vol.28
- Clevert, D.-A.¹ Unterthiner, T.² Mayr, A.³ Hochreiter, S.⁴

3
- 84965130201
- Natural neural networks
- Desjardins, G., Simonyan, K., Pascanu, R., and Kavukcuoglu, K. Natural neural networks. CoRR, abs/1507.00210, 2015. URL http://arxiv.org/abs/1507.00210.
- (2015) CoRR
- Desjardins, G.¹ Simonyan, K.² Pascanu, R.³ Kavukcuoglu, K.⁴

4
- 84862294866
- Deep sparse rectifier neural networks
- Gordon, G., Dunson, D., and Dudík, M. (eds)
- Glorot, X., Bordes, A., and Bengio, Y. Deep sparse rectifier neural networks. In Gordon, G., Dunson, D., and Dudík, M. (eds.), JMLR W&CP: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2011), volume 15, pp. 315–323, 2011.
- (2011) JMLR W&CP: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2011) , vol.15 , pp. 315-323
- Glorot, X.¹ Bordes, A.² Bengio, Y.³

5
- 84892421248
- Maxout networks
- Goodfellow, I. J., Warde-Farley, D., Mirza, M., Courville, A., and Bengio, Y. Maxout networks. ArXiv e-prints, 2013.
- (2013) ArXiv e-prints
- Goodfellow, I.J.¹ Warde-Farley, D.² Mirza, M.³ Courville, A.⁴ Bengio, Y.⁵

6
- 84978059147
- Fractional max-pooling
- Graham, Benjamin. Fractional max-pooling. CoRR, abs/1412.6071, 2014. URL http://arxiv.org/abs/1412.6071.
- (2014) CoRR
- Graham, B.¹

7
- 84994894307
- Scaling up natural gradient by sparsely factorizing the inverse Fisher matrix
- Proceedings of the 32nd International Conference on Machine Learning (ICML15)
- Grosse, R. and Salakhutdinov, R. Scaling up natural gradient by sparsely factorizing the inverse Fisher matrix. Journal of Machine Learning Research, 37:2304–2313, 2015. URL http://jmlr.org/proceedings/papers/v37/grosse15.pdf. Proceedings of the 32nd International Conference on Machine Learning (ICML15).
- (2015) Journal of Machine Learning Research , vol.37 , pp. 2304-2313
- Grosse, R.¹ Salakhutdinov, R.²

8
- 84973911419
- Delving deep into rectifiers: Surpassing human-level performance on imagenet classification
- He, K., Zhang, X., Ren, S., and Sun, J. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In IEEE International Conference on Computer Vision (ICCV), 2015.
- (2015) IEEE International Conference on Computer Vision (ICCV)
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

9
- 0042276525
- The vanishing gradient problem during learning recurrent neural nets and problem solutions
- Hochreiter, S. The vanishing gradient problem during learning recurrent neural nets and problem solutions. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 6(2):107–116, 1998.
- (1998) International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems , vol.6 , Issue.2 , pp. 107-116
- Hochreiter, S.¹

10
- 0033114102
- Feature extraction through LOCOCODE
- Hochreiter, S. and Schmidhuber, J. Feature extraction through LOCOCODE. Neural Computation, 11(3):679–714, 1999.
- (1999) Neural Computation , vol.11 , Issue.3 , pp. 679-714
- Hochreiter, S.¹ Schmidhuber, J.²

11
- 0041914606
- Gradient flow in recurrent nets: The difficulty of learning long-term dependencies
- Kremer and Kolen (eds), IEEE Press
- Hochreiter, S., Bengio, Y., Frasconi, P., and Schmidhuber, J. Gradient flow in recurrent nets: the difficulty of learning long-term dependencies. In Kremer and Kolen (eds.), A Field Guide to Dynamical Recurrent Neural Networks. IEEE Press, 2001.
- (2001) A Field Guide to Dynamical Recurrent Neural Networks
- Hochreiter, S.¹ Bengio, Y.² Frasconi, P.³ Schmidhuber, J.⁴

12
- 84969584486
- Batch normalization: Accelerating deep network training by reducing internal covariate shift
- Proceedings of the 32nd International Conference on Machine Learning (ICML15)
- Ioffe, S. and Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. Journal of Machine Learning Research, 37:448–456, 2015. URL http://jmlr.org/proceedings/papers/v37/ioffe15.pdf. Proceedings of the 32nd International Conference on Machine Learning (ICML15).
- (2015) Journal of Machine Learning Research , vol.37 , pp. 448-456
- Ioffe, S.¹ Szegedy, C.²

13
- 85021667706
- Learning Semantic Image Representations at a Large Scale
- Jia, Yangqing. Learning Semantic Image Representations at a Large Scale. PhD thesis, EECS Department, University of California, Berkeley, May 2014. URL http://www.eecs.berkeley.edu/Pubs/TechRpts/2014/EECS-2014-93.html.
- (2014) PhD thesis, EECS Department, University of California, Berkeley
- Jia, Y.¹

14
- 84876231242
- ImageNet classification with deep convolutional neural networks
- Pereira, F., Burges, C. J. C., Bottou, L., and Weinberger, K. Q. (eds), Curran Associates, Inc
- Krizhevsky, A., Sutskever, I., and Hinton, G. E. ImageNet classification with deep convolutional neural networks. In Pereira, F., Burges, C. J. C., Bottou, L., and Weinberger, K. Q. (eds.), Advances in Neural Information Processing Systems 25, pp. 1097–1105. Curran Associates, Inc., 2012.
- (2012) Advances in Neural Information Processing Systems , vol.25 , pp. 1097-1105
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

15
- 85029747845
- Iterative weighted least squares algorithms for neural networks classifiers
- Springer
- Kurita, T. Iterative weighted least squares algorithms for neural networks classifiers. In Proceedings of the Third Workshop on Algorithmic Learning Theory (ALT92), volume 743 of Lecture Notes in Computer Science, pp. 77–86. Springer, 1993.
- (1993) Proceedings of the Third Workshop on Algorithmic Learning Theory (ALT92), Volume 743 of Lecture Notes in Computer Science , pp. 77-86
- Kurita, T.¹

16
- 0000044667
- Eigenvalues of covariance matrices: Application to neural-network learning
- LeCun, Y., Kanter, I., and Solla, S. A. Eigenvalues of covariance matrices: Application to neural-network learning. Physical Review Letters, 66(18):2396–2399, 1991.
- (1991) Physical Review Letters , vol.66 , Issue.18 , pp. 2396-2399
- LeCun, Y.¹ Kanter, I.² Solla, S.A.³

17
- 84872543023
- Efficient backprop
- Orr, G. B. and Müller, K.-R. (eds), Springer
- LeCun, Y., Bottou, L., Orr, G. B., and Müller, K.-R. Efficient backprop. In Orr, G. B. and Müller, K.-R. (eds.), Neural Networks: Tricks of the Trade, volume 1524 of Lecture Notes in Computer Science, pp. 9–50. Springer, 1998.
- (1998) Neural Networks: Tricks of the Trade, Volume 1524 of Lecture Notes in Computer Science , pp. 9-50
- LeCun, Y.¹ Bottou, L.² Orr, G.B.³ Müller, K.-R.⁴

18
- 85009928594
- Deeply-supervised nets
- Lee, Chen-Yu, Xie, Saining, Gallagher, Patrick W., Zhang, Zhengyou, and Tu, Zhuowen. Deeply-supervised nets. In AISTATS, 2015.
- (2015) AISTATS
- Lee, C.-Y.¹ Xie, S.² Gallagher, P.W.³ Zhang, Z.⁴ Tu, Z.⁵

19
- 85162000799
- Topmoumoute online natural gradient algorithm
- Platt, J. C., Koller, D., Singer, Y., and Roweis, S. T. (eds)
- LeRoux, N., Manzagol, P.-A., and Bengio, Y. Topmoumoute online natural gradient algorithm. In Platt, J. C., Koller, D., Singer, Y., and Roweis, S. T. (eds.), Advances in Neural Information Processing Systems 20 (NIPS), pp. 849–856, 2008.
- (2008) Advances in Neural Information Processing Systems 20 (NIPS) , pp. 849-856
- LeRoux, N.¹ Manzagol, P.-A.² Bengio, Y.³

20
- 84908678178
- Network in network
- Lin, Min, Chen, Qiang, and Yan, Shuicheng. Network in network. CoRR, abs/1312.4400, 2013. URL http://arxiv.org/abs/1312.4400.
- (2013) CoRR
- Lin, M.¹ Chen, Q.² Yan, S.³

21
- 84893676344
- Rectifier nonlinearities improve neural network acoustic models
- Maas, A. L., Hannun, A. Y., and Ng, A. Y. Rectifier nonlinearities improve neural network acoustic models. In Proceedings of the 30th International Conference on Machine Learning (ICML13), 2013.
- (2013) Proceedings of the 30th International Conference on Machine Learning (ICML13)
- Maas, A.L.¹ Hannun, A.Y.² Ng, A.Y.³

22
- 77956541496
- Deep learning via Hessian-free optimization
- Fürnkranz, J. and Joachims, T. (eds)
- Martens, J. Deep learning via Hessian-free optimization. In Fürnkranz, J. and Joachims, T. (eds.), Proceedings of the 27th International Conference on Machine Learning (ICML10), pp. 735–742, 2010.
- (2010) Proceedings of the 27th International Conference on Machine Learning (ICML10) , pp. 735-742
- Martens, J.¹

23
- 84987943069
- DeepTox: Toxicity prediction using deep learning
- Mayr, A., Klambauer, G., Unterthiner, T., and Hochreiter, S. DeepTox: Toxicity prediction using deep learning. Front. Environ. Sci., 3(80), 2015. doi: 10.3389/fenvs.2015.00080. URL http://journal.frontiersin.org/article/10.3389/fenvs.2015.00080.
- (2015) Front. Environ. Sci. , vol.3 , Issue.80
- Mayr, A.¹ Klambauer, G.² Unterthiner, T.³ Hochreiter, S.⁴

24
- 77956509090
- Rectified linear units improve restricted Boltzmann machines
- Fürnkranz, J. and Joachims, T. (eds)
- Nair, V. and Hinton, G. E. Rectified linear units improve restricted Boltzmann machines. In Fürnkranz, J. and Joachims, T. (eds.), Proceedings of the 27th International Conference on Machine Learning (ICML10), pp. 807–814, 2010.
- (2010) Proceedings of the 27th International Conference on Machine Learning (ICML10) , pp. 807-814
- Nair, V.¹ Hinton, G.E.²

25
- 85070998699
- Riemannian metrics for neural networks I: Feedforward networks
- Ollivier, Y. Riemannian metrics for neural networks I: feedforward networks. CoRR, abs/1303.0818, 2013. URL http://arxiv.org/abs/1303.0818.
- (2013) CoRR
- Ollivier, Y.¹

26
- 85083950291
- Revisiting natural gradient for deep networks
- Pascanu, R. and Bengio, Y. Revisiting natural gradient for deep networks. In International Conference on Learning Representations 2014, 2014. URL http://arxiv.org/abs/1301.3584. arXiv:1301.3584.
- (2014) International Conference on Learning Representations 2014
- Pascanu, R.¹ Bengio, Y.²

27
- 84893409634
- Deep learning made easier by linear transformations in perceptrons
- Lawrence, N. D. and Girolami, M. A. (eds)
- Raiko, T., Valpola, H., and LeCun, Y. Deep learning made easier by linear transformations in perceptrons. In Lawrence, N. D. and Girolami, M. A. (eds.), Proceedings of the 15th International Conference on Artificial Intelligence and Statistics (AISTATS12), volume 22, pp. 924–932, 2012.
- (2012) Proceedings of the 15th International Conference on Artificial Intelligence and Statistics (AISTATS12) , vol.22 , pp. 924-932
- Raiko, T.¹ Valpola, H.² LeCun, Y.³

28
- 0038231917
- Centering neural network gradient factor
- Orr, G. B. and Müller, K.-R. (eds), Springer
- Schraudolph, N. N. Centering neural network gradient factor. In Orr, G. B. and Müller, K.-R. (eds.), Neural Networks: Tricks of the Trade, volume 1524 of Lecture Notes in Computer Science, pp. 207–226. Springer, 1998.
- (1998) Neural Networks: Tricks of the Trade, Volume 1524 of Lecture Notes in Computer Science , pp. 207-226
- Schraudolph, N.N.¹

29
- 0033561855
- A fast, compact approximation of the exponential function
- Schraudolph, Nicol N. A fast, compact approximation of the exponential function. Neural Computation, 11:853–862, 1999.
- (1999) Neural Computation , vol.11 , pp. 853-862
- Schraudolph, N.N.¹

30
- 84962006941
- Striving for simplicity: The all convolutional net
- Springenberg, Jost Tobias, Dosovitskiy, Alexey, Brox, Thomas, and Riedmiller, Martin A. Striving for simplicity: The all convolutional net. CoRR, abs/1412.6806, 2014. URL http://arxiv.org/abs/1412.6806.
- (2014) CoRR
- Springenberg, J.T.¹ Dosovitskiy, A.² Brox, T.³ Riedmiller, M.A.⁴

31
- 84973388607
- Training very deep networks
- Srivastava, Rupesh Kumar, Greff, Klaus, and Schmidhuber, Jürgen. Training very deep networks. CoRR, abs/1507.06228, 2015. URL http://arxiv.org/abs/1507.06228.
- (2015) CoRR
- Srivastava, R.K.¹ Greff, K.² Schmidhuber, J.³

32
- 85070976738
- Toxicity prediction using deep learning
- Unterthiner, T., Mayr, A., Klambauer, G., and Hochreiter, S. Toxicity prediction using deep learning. CoRR, abs/1503.01445, 2015. URL http://arxiv.org/abs/1503.01445.
- (2015) CoRR
- Unterthiner, T.¹ Mayr, A.² Klambauer, G.³ Hochreiter, S.⁴

33
- 84867614640
- Krylov subspace descent for deep learning
- Vinyals, O. and Povey, D. Krylov subspace descent for deep learning. In AISTATS, 2012. URL http://arxiv.org/pdf/1111.4259v1. arXiv:1111.4259.
- (2012) AISTATS
- Vinyals, O.¹ Povey, D.²

34
- 85013858782
- Empirical evaluation of rectified activations in convolutional network
- Xu, B., Wang, N., Chen, T., and Li, M. Empirical evaluation of rectified activations in convolutional network. CoRR, abs/1505.00853, 2015. URL http://arxiv.org/abs/1505.00853.
- (2015) CoRR
- Xu, B.¹ Wang, N.² Chen, T.³ Li, M.⁴

35
- 0032533046
- Complexity issues in natural gradient descent method for training multilayer perceptrons
- Yang, H. H. and Amari, S.-I. Complexity issues in natural gradient descent method for training multilayer perceptrons. Neural Computation, 10(8), 1998.
- (1998) Neural Computation , vol.10 , Issue.8
- Yang, H.H.¹ Amari, S.-I.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.