[1] Shun-ichi Amari. Natural gradient works efficiently in learning. Neural Computation, 1998.
[2] Jimmy Ba and Rich Caruana. Do deep nets really need to be deep? In NIPS, 2014.
[3] Amir Beck and Marc Teboulle. Mirror descent and nonlinear projected subgradient methods for convex optimization. Oper. Res. Lett., 2003.
[5] John Duchi, Elad Hazan, and Yoram Singer. Adaptive subgradient methods for online learning and stochastic optimization. JMLR, 2011.
[6] Xavier Glorot and Yoshua Bengio. Understanding the difficulty of training deep feedforward neural networks. In AISTATS, May 2010.
[7] Sergey Ioffe and Christian Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In ICML, 2015.
[8] James Martens and Roger Grosse. Optimizing neural networks with Kronecker-factored approximate curvature. In ICML, June 2015.
[10] Yann LeCun, Léon Bottou, Genevieve B. Orr, and Klaus-Robert Müller. Efficient backprop. In Neural Networks: Tricks of the Trade, Lecture Notes in Computer Science LNCS 1524. Springer Verlag, 1998.
[11] Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, pages 2278-2324, 1998.
[12] James Martens. Deep learning via Hessian-free optimization. In ICML, June 2010.
[13] G. Montavon and K.-R. Müller. Deep Boltzmann machines and the centering trick. In K.-R. Müller, G. Montavon, and G. B. Orr, editors, Neural Networks: Tricks of the Trade. Springer, 2013.
[15] Razvan Pascanu and Yoshua Bengio. Revisiting natural gradient for deep networks. In ICLR, 2014.
[16] Daniel Povey, Xiaohui Zhang, and Sanjeev Khudanpur. Parallel training of deep neural networks with natural gradient and parameter averaging. In ICLR Workshop, 2015.
[17] Tapani Raiko, Harri Valpola, and Yann LeCun. Deep learning made easier by linear transformations in perceptrons. In AISTATS, 2012.
[19] Roger B. Grosse and Ruslan Salakhutdinov. Scaling up natural gradient by sparsely factorizing the inverse Fisher matrix. In ICML, June 2015.
[20] Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, and Li Fei-Fei. ImageNet large scale visual recognition challenge. International Journal of Computer Vision (IJCV), 2015.
[24] Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 2014.
[25] Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. Going deeper with convolutions. arXiv, 2014.
[28] Tommi Vatanen, Tapani Raiko, Harri Valpola, and Yann LeCun. Pushing stochastic gradient towards second-order methods - backpropagation learning with transformations in nonlinearities. In ICONIP, 2013.