SCOPUS 정보 검색 플랫폼

Journal of Machine Learning Research

Volumn 17, Issue , 2016, Pages

Knowledge matters: Importance of prior information for optimization

(2) Gülçehre, Çǧlar a Bengio, Yoshua a

a UNIVERSITÉ DE MONTRÉAL (Canada)

Author keywords

Curriculum learning; Deep learning; Evolution of culture; Neural networks; Optimization; Training with hints

Indexed keywords

ARTIFICIAL INTELLIGENCE; CURRICULA; LEARNING SYSTEMS; NETWORK ARCHITECTURE; NEURAL NETWORKS; OPTIMIZATION;

CULTURAL LEARNING; DEEP LEARNING; GENERALIZATION ERROR; INTERMEDIATE CONCEPT; INTERMEDIATE LEVEL; POSITIVE EVIDENCE; PRIOR INFORMATION; SUPERVISED NEURAL NETWORKS;

LEARNING ALGORITHMS;

EID: 84962440920 PISSN: 15324435 EISSN: 15337928 Source Type: Journal
DOI: None Document Type: Article

Times cited : (128)

References (54)

1
- 0037288370
- Recent advances in hierarchical reinforcement learning
- Andrew G Barto and Sridhar Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems, 13(4):341-379, 2003.
- (2003) Discrete Event Dynamic Systems , vol.13 , Issue.4 , pp. 341-379
- Barto, A.G.¹ Mahadevan, S.²

2
- 77952766987
- A User's guide to support vector machines
- Asa Ben-Hur and Jason Weston. A user's guide to support vector machines. Methods in Molecular Biology, 609:223-239, 2010.
- (2010) Methods in Molecular Biology , vol.609 , pp. 223-239
- Ben-Hur, A.¹ Weston, J.²

3
- 69349090197
- Learning deep architectures for AI
- Also published as a book. Now Publishers, 2009
- Yoshua Bengio. Learning deep architectures for AI. Foundations and Trends in Machine Learning, 2(1):1-127, 2009. Also published as a book. Now Publishers, 2009.
- (2009) Foundations and Trends in Machine Learning , vol.2 , Issue.1 , pp. 1-127
- Bengio, Y.¹

4
- 84883188507
- Evolving culture vs local minima
- number also as ArXiv 1203.2990v1, T. Kowaliw, N. Bredeche & R. Doursat, eds. Springer-Verlag, March
- Yoshua Bengio. Evolving culture vs local minima. In Growing Adaptive Machines: Integrating Development and Learning in Artificial Neural Networks, number also as ArXiv 1203.2990v1, pages T. Kowaliw, N. Bredeche & R. Doursat, eds. Springer-Verlag, March 2013a. URL http://arxiv.org/abs/1203.2990.
- (2013) Growing Adaptive Machines: Integrating Development and Learning in Artificial Neural Networks
- Bengio, Y.¹

5
- 84872560515
- Practical recommendations for gradient-based training of deep architectures
- K.-R. Müller, G. Montavon, and G. B. Orr, editors, Springer
- Yoshua Bengio. Practical recommendations for gradient-based training of deep architectures. In K.-R. Müller, G. Montavon, and G. B. Orr, editors, Neural Networks: Tricks of the Trade. Springer, 2013b.
- (2013) Neural Networks: Tricks of the Trade
- Bengio, Y.¹

6
- 84864073449
- Greedy layer-wise training of deep networks
- Bernhard Schölkopf, John Platt, and Thomas Hoffman, editors, MIT Press
- Yoshua Bengio, Pascal Lamblin, Dan Popovici, and Hugo Larochelle. Greedy layer-wise training of deep networks. In Bernhard Schölkopf, John Platt, and Thomas Hoffman, editors, Ad-vances in Neural Information Processing Systems 19 (NIPS'06), pages 153-160. MIT Press, 2007.
- (2007) Ad-vances in Neural Information Processing Systems 19 (NIPS'06) , pp. 153-160
- Bengio, Y.¹ Lamblin, P.² Popovici, D.³ Larochelle, H.⁴

7
- 71149116544
- Curriculum learning
- Léon Bottou and Michael Littman, editors, ACM
- Yoshua Bengio, Jerome Louradour, Ronan Collobert, and Jason Weston. Curriculum learning. In Léon Bottou and Michael Littman, editors, Proceedings of the Twenty-sixth International Conference on Machine Learning (ICML'09). ACM, 2009.
- (2009) Proceedings of the Twenty-sixth International Conference on Machine Learning (ICML'09)
- Bengio, Y.¹ Louradour, J.² Collobert, R.³ Weston, J.⁴

8
- 84881233012
- Technical Report arXiv:1206.5538, U. Montreal
- Yoshua Bengio, Aaron Courville, and Pascal Vincent. Representation learning: A review and new perspectives. Technical Report arXiv:1206.5538, U. Montreal, 2012. URL http://arxiv.org/abs/1206.5538.
- (2012) Representation Learning: A Review and New Perspectives
- Bengio, Y.¹ Courville, A.² Vincent, P.³

9
- 84871391768
- Unsupervised feature learning and deep learning: A review and new perspectives
- Yoshua Bengio, Aaron Courville, and Pascal Vincent. Unsupervised feature learning and deep learning: A review and new perspectives. IEEE Trans. Pattern Analysis and Machine Intel-ligence (PAMI), 2013.
- (2013) IEEE Trans. Pattern Analysis and Machine Intel-ligence (PAMI)
- Bengio, Y.¹ Courville, A.² Vincent, P.³

10
- 84857819132
- Theano: A CPU and GPU math expression compiler
- James Bergstra, Olivier Breuleux, Frédéric Bastien, Pascal Lamblin, Razvan Pascanu, Guillaume Desjardins, Joseph Turian, David Warde-Farley, and Yoshua Bengio. Theano: a CPU and GPU math expression compiler. In Proceedings of the Python for Scientific Computing Conference (SciPy), 2010.
- (2010) Proceedings of the Python for Scientific Computing Conference (SciPy)
- Bergstra, J.¹ Breuleux, O.² Bastien, F.³ Lamblin, P.⁴ Pascanu, R.⁵ Desjardins, G.⁶ Turian, J.⁷ Warde-Farley, D.⁸ Bengio, Y.⁹

11
- 84904136037
- Large-scale machine learning with stochastic gradient descent
- Springer
- Léon Bottou. Large-scale machine learning with stochastic gradient descent. In Proceedings of COMPSTAT'2010, pages 177-186. Springer, 2010.
- (2010) Proceedings of COMPSTAT'2010 , pp. 177-186
- Bottou, L.¹

12
- 0035478854
- Random forests
- Leo Breiman. Random forests. Machine Learning, 45(1):5-32, 2001.
- (2001) Machine Learning , vol.45 , Issue.1 , pp. 5-32
- Breiman, L.¹

13
- 0003802343
- Classification and regression trees
- Leo Breiman, Jerome Friedman, Charles J. Stone, and Richard A. Olshen. classification and regression trees. Belmont, Calif.: Wadsworth, 1984.
- (1984) Belmont, Calif.: Wadsworth
- Breiman, L.¹ Friedman, J.² Stone, C.J.³ Olshen, R.A.⁴

14
- 84954310140
- The loss surface of multilayer networks
- Anna Choromanska, Mikael Henaff, Michael Mathieu, Gérard Ben Arous, and Yann LeCun. The loss surface of multilayer networks. AISTATS 2015, Proceedings of the Eighteenth In-ternational Conference on Artificial Intelligence and Statistics, pages 192-204, 2015.
- (2015) AISTATS 2015, Proceedings of the Eighteenth In-ternational Conference on Artificial Intelligence and Statistics , pp. 192-204
- Choromanska, A.¹ Henaff, M.² Mathieu, M.³ Arous, G.B.⁴ LeCun, Y.⁵

15
- 78649669320
- Deep big simple neural nets for handwritten digit recognition
- Dan C. Ciresan, Ueli Meier, Luca M. Gambardella, and Jürgen Schmidhuber. Deep big simple neural nets for handwritten digit recognition. Neural Computation, 22:1-14, 2010.
- (2010) Neural Computation , vol.22 , pp. 1-14
- Ciresan, D.C.¹ Meier, U.² Gambardella, L.M.³ Schmidhuber, J.⁴

16
- 84883197820
- Technical Report arXiv:1301.3583, Universite de Montreal
- Yann Dauphin and Yoshua Bengio. Big neural networks waste capacity. Technical Report arXiv:1301.3583, Universite de Montreal, 2013.
- (2013) Big Neural Networks Waste Capacity
- Dauphin, Y.¹ Bengio, Y.²

17
- 84928534967
- Identifying and attacking the saddle point problem in high-dimensional non-convex optimization
- Yann Dauphin, Razvan Pascanu, Caglar Gulcehre, Kyunghyun Cho, Surya Ganguli, and Yoshua Bengio. Identifying and attacking the saddle point problem in high-dimensional non-convex optimization. In NIPS'2014, 2014.
- (2014) NIPS'2014
- Dauphin, Y.¹ Pascanu, R.² Gulcehre, C.³ Cho, K.⁴ Ganguli, S.⁵ Bengio, Y.⁶

18
- 0004149207
- Oxford University Press
- Richard Dawkins. The Selfish Gene. Oxford University Press, 1976.
- (1976) The Selfish Gene
- Dawkins, R.¹

19
- 80052250414
- Adaptive subgradient methods for online learning and stochastic optimization
- John Duchi, Elad Hazan, and Yoram Singer. Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research, 2011.
- (2011) Journal of Machine Learning Research
- Duchi, J.¹ Hazan, E.² Singer, Y.³

20
- 77949522811
- Why does unsupervised pre-training help deep learning?
- JML (-1)
- Dumitru Erhan, Yoshua Bengio, Aaron Courville, Pierre-Antoine Manzagol, Pascal Vincent, and Samy Bengio. Why does unsupervised pre-training help deep learning? In Journal of Machine Learning Research JML (-1), pages 625-660.
- Journal of Machine Learning Research , pp. 625-660
- Erhan, D.¹ Bengio, Y.² Courville, A.³ Manzagol, P.-A.⁴ Vincent, P.⁵ Bengio, S.⁶

21
- 80555140075
- Scikit-learn: Machine learning in python
- Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel et al Fabian Pedregosa, Gal Varoquaux. Scikit-learn: Machine learning in python. The Journal of Machine Learning Research, 12:2825-2830, 2011.
- (2011) The Journal of Machine Learning Research , vol.12 , pp. 2825-2830
- Gramfort, A.¹ Michel, V.² Thirion, B.³ Grisel, O.⁴ Blondel, M.⁵ Pedregosa, F.⁶ Varoquaux, G.⁷

22
- 80055083194
- Comparing machines and humans on a visual categorization test
- Franois Fleuret, Ting Li, Charles Dubout, Emma K. Wampler, Steven Yantis, and Donald Geman. Comparing machines and humans on a visual categorization test. Proceedings of the National Academy of Sciences, 108(43):17621-17625, 2011.
- (2011) Proceedings of the National Academy of Sciences , vol.108 , Issue.43 , pp. 17621-17625
- Fleuret, F.¹ Li, T.² Dubout, C.³ Wampler, E.K.⁴ Yantis, S.⁵ Geman, D.⁶

23
- 84862277874
- Understanding the difficulty of training deep feedforward neural networks
- May
- Xavier Glorot and Yoshua Bengio. Understanding the difficulty of training deep feedforward neural networks. In JMLR W&CP: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2010), volume 9, pages 249-256, May 2010.
- (2010) JMLR W&CP: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2010) , vol.9 , pp. 249-256
- Glorot, X.¹ Bengio, Y.²

24
- 84862294866
- Deep sparse rectifier neural networks
- April
- Xavier Glorot, Antoine Bordes, and Yoshua Bengio. Deep sparse rectifier neural networks. In JMLR W&CP: Proceedings of the Fourteenth International Conference on Artificial Intelli-gence and Statistics (AISTATS 2011), April 2011.
- (2011) JMLR W&CP: Proceedings of the Fourteenth International Conference on Artificial Intelli-gence and Statistics (AISTATS 2011)
- Glorot, X.¹ Bordes, A.² Bengio, Y.³

25
- 84897543523
- Maxout networks
- Ian J. Goodfellow, David Warde-Farley, Mehdi Mirza, Aaron Courville, and Yoshua Bengio. Maxout networks. In ICML'2013, 2013.
- (2013) ICML'2013
- Goodfellow, I.J.¹ Warde-Farley, D.² Mirza, M.³ Courville, A.⁴ Bengio, Y.⁵

26
- 84905275608
- arXiv preprint arXiv:1311.1780
- Caglar Gulcehre, Kyunghyun Cho, Razvan Pascanu, and Yoshua Bengio. Learned-norm pooling for deep neural networks. arXiv preprint arXiv:1311.1780, 2013.
- (2013) Learned-norm Pooling for Deep Neural Networks
- Gulcehre, C.¹ Cho, K.² Pascanu, R.³ Bengio, Y.⁴

27
- 0037772374
- The evolution of cultural evolution
- Joseph Henrich and Richard McElreath. The evolution of cultural evolution. Evolutionary Anthropology: Issues, News, and Reviews, 12(3):123-135, 2003.
- (2003) Evolutionary Anthropology: Issues, News, and Reviews , vol.12 , Issue.3 , pp. 123-135
- Henrich, J.¹ McElreath, R.²

28
- 33745805403
- A fast learning algorithm for deep belief nets
- Geoffrey E. Hinton, Simon Osindero, and Yee Whye Teh. A fast learning algorithm for deep belief nets. Neural Computation, 18:1527-1554, 2006.
- (2006) Neural Computation , vol.18 , pp. 1527-1554
- Hinton, G.E.¹ Osindero, S.² Teh, Y.W.³

29
- 84867720412
- Technical report, arXiv:1207.0580
- Geoffrey E. Hinton, Nitish Srivastava, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. Improving neural networks by preventing co-adaptation of feature detectors. Technical report, arXiv:1207.0580, 2012.
- (2012) Improving Neural Networks by Preventing Co-adaptation of Feature Detectors
- Hinton, G.E.¹ Srivastava, N.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

30
- 4944228528
- Chih-Wei Hsu, Chih-Chung Chang, and Chih-Jen Lin. A practical guide to support vector classification, 2003.
- (2003) A Practical Guide to Support Vector Classification
- Hsu, C.-W.¹ Chang, C.-C.² Lin, C.-J.³

31
- 77953183471
- What is the best multi-stage architecture for object recognition?
- IEEE
- Kevin Jarrett, Koray Kavukcuoglu, Marc'Aurelio Ranzato, and Yann LeCun. What is the best multi-stage architecture for object recognition? In Proc. International Conference on Computer Vision (ICCV'09), pages 2146-2153. IEEE, 2009.
- (2009) Proc. International Conference on Computer Vision (ICCV'09) , pp. 2146-2153
- Jarrett, K.¹ Kavukcuoglu, K.² Ranzato, M.³ LeCun, Y.⁴

32
- 85162562509
- How do humans teach: On curriculum learning and teaching dimension
- Faisal Khan, Xiaojin Zhu, and Bilge Mutlu. How do humans teach: On curriculum learning and teaching dimension. In Advances in Neural Information Processing Systems 24 (NIPS'11), pages 1449-1457, 2011.
- (2011) Advances in Neural Information Processing Systems 24 (NIPS'11) , pp. 1449-1457
- Khan, F.¹ Zhu, X.² Mutlu, B.³

33
- 84876231242
- ImageNet classification with deep convolutional neural networks
- Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton. ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems 25 (NIPS'2012). 2012.
- (2012) Advances in Neural Information Processing Systems 25 (NIPS'2012)
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.³

34
- 59649113160
- Flexible shaping: How learning in small steps helps
- Kai A. Krueger and Peter Dayan. Flexible shaping: how learning in small steps helps. Cognition, 110:380-394, 2009.
- (2009) Cognition , vol.110 , pp. 380-394
- Krueger, K.A.¹ Dayan, P.²

35
- 84962397829
- The adviceptron: Giving advice to the perceptron
- Gautam Kunapuli, Kristin P. Bennett, Richard Maclin, and Jude W. Shavlik. The adviceptron: Giving advice to the perceptron. Proceedings of the Conference on Artificial Neural Networks In Engineering (ANNIE 2010), 2010.
- (2010) Proceedings of the Conference on Artificial Neural Networks in Engineering (ANNIE 2010)
- Kunapuli, G.¹ Bennett, K.P.² Maclin, R.³ Shavlik, J.W.⁴

36
- 59449087310
- Exploring strategies for training deep neural networks
- Hugo Larochelle, Yoshua Bengio, Jerome Louradour, and Pascal Lamblin. Exploring strategies for training deep neural networks. Journal of Machine Learning Research, 10:1-40, 2009.
- (2009) Journal of Machine Learning Research , vol.10 , pp. 1-40
- Larochelle, H.¹ Bengio, Y.² Louradour, J.³ Lamblin, P.⁴

37
- 0032203257
- Gradient-based learning applied to document recognition
- Yann LeCun, Leon Bottou, Yoshua Bengio, and Patrick Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324, 1998.
- (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
- LeCun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

38
- 0003682772
- Department of Computer Science, Laboratory for Computer Science Research, Rutgers Univ
- Tom M. Mitchell. The Need for Biases in Learning Generalizations. Department of Computer Science, Laboratory for Computer Science Research, Rutgers Univ., 1980.
- (1980) The Need for Biases in Learning Generalizations
- Mitchell, T.M.¹

39
- 0001187959
- Explanation-based neural network learning for robot control
- Tom M. Mitchell and Sebastian B. Thrun. Explanation-based neural network learning for robot control. Advances in Neural information processing systems, pages 287-287, 1993.
- (1993) Advances in Neural Information Processing Systems , pp. 287
- Mitchell, T.M.¹ Thrun, S.B.²

40
- 84979128113
- Universal grammar
- Richard Montague. Universal grammar. Theoria, 36(3):373-398, 1970.
- (1970) Theoria , vol.36 , Issue.3 , pp. 373-398
- Montague, R.¹

41
- 77956509090
- Rectified linear units improve restricted boltzmann machines
- Vinod Nair and Geoffrey E Hinton. Rectified linear units improve restricted boltzmann machines. In Proceedings of the 27th International Conference on Machine Learning (ICML-10), pages 807-814, 2010.
- (2010) Proceedings of the 27th International Conference on Machine Learning (ICML-10) , pp. 807-814
- Nair, V.¹ Hinton, G.E.²

42
- 84899945949
- Joseph O'Sullivan. Integrating initialization bias and search bias in neural network learning, 1996.
- (1996) Integrating Initialization Bias and Search Bias in Neural Network Learning
- O'Sullivan, J.¹

43
- 12344258158
- A day of great illumination: B. F. Skinner's discovery of shaping
- Gail B. Peterson. A day of great illumination: B. F. Skinner's discovery of shaping. Journal of the Experimental Analysis of Behavior, 82(3):317-328, 2004.
- (2004) Journal of the Experimental Analysis of Behavior , vol.82 , Issue.3 , pp. 317-328
- Peterson, G.B.¹

44
- 80053460450
- Contractive auto-encoders: Explicit invariance during feature extraction
- June
- Salah Rifai, Pascal Vincent, Xavier Muller, Xavier Glorot, and Yoshua Bengio. Contractive auto-encoders: Explicit invariance during feature extraction. In Proceedings of the Twenty-eight International Conference on Machine Learning (ICML'11), June 2011.
- (2011) Proceedings of the Twenty-eight International Conference on Machine Learning (ICML'11)
- Rifai, S.¹ Vincent, P.² Muller, X.³ Glorot, X.⁴ Bengio, Y.⁵

45
- 84867136416
- A generative process for sampling contractive auto-encoders
- ACM
- Salah Rifai, Yoshua Bengio, Yann Dauphin, and Pascal Vincent. A generative process for sampling contractive auto-encoders. In Proceedings of the Twenty-nine International Conference on Machine Learning (ICML'12). ACM, 2012. URL http://icml.cc/discuss/2012/590.html.
- (2012) Proceedings of the Twenty-nine International Conference on Machine Learning (ICML'12)
- Rifai, S.¹ Bengio, Y.² Dauphin, Y.³ Vincent, P.⁴

46
- 84862286946
- Deep boltzmann machines
- Ruslan Salakhutdinov and Geoffrey E Hinton. Deep boltzmann machines. In International Conference on Artificial Intelligence and Statistics, pages 448-455, 2009.
- (2009) International Conference on Artificial Intelligence and Statistics , pp. 448-455
- Salakhutdinov, R.¹ Hinton, G.E.²

47
- 85050787417
- Reinforcement today
- Burrhus F. Skinner. Reinforcement today. American Psychologist, 13:94-99, 1958.
- (1958) American Psychologist , vol.13 , pp. 94-99
- Skinner, B.F.¹

48
- 0242263943
- A system for incremental learning based on algorithmic probability
- Citeseer
- Ray J. Solomonoff. A system for incremental learning based on algorithmic probability. In Proceedings of the Sixth Israeli Conference on Artificial Intelligence, Computer Vision and Pattern Recognition, pages 515-527. Citeseer, 1989.
- (1989) Proceedings of the Sixth Israeli Conference on Artificial Intelligence, Computer Vision and Pattern Recognition , pp. 515-527
- Solomonoff, R.J.¹

49
- 84893343292
- Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude
- Tijmen Tieleman and Geoffrey Hinton. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for Machine Learning, 4, 2012.
- (2012) COURSERA: Neural Networks for Machine Learning , vol.4
- Tieleman, T.¹ Hinton, G.²

50
- 0028529307
- Knowledge-based artificial neural networks
- Geoffrey G. Towell and Jude W. Shavlik. Knowledge-based Artificial neural networks. Artificial intelligence, 70(1):119-165, 1994.
- (1994) Artificial Intelligence , vol.70 , Issue.1 , pp. 119-165
- Towell, G.G.¹ Shavlik, J.W.²

51
- 79551480483
- Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion
- JML (-1)
- Pascal Vincent, Hugo Larochelle, Isabelle Lajoie, Yoshua Bengio, and Pierre-Antoine Manzagol. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. In Journal of Machine Learning Research JML (-1), pages 3371-3408.
- Journal of Machine Learning Research , pp. 3371-3408
- Vincent, P.¹ Larochelle, H.² Lajoie, I.³ Bengio, Y.⁴ Manzagol, P.-A.⁵

52
- 35248831179
- Captcha: Using hard ai problems for security
- Springer
- Luis Von Ahn, Manuel Blum, Nicholas J Hopper, and John Langford. Captcha: Using hard ai problems for security. In Advances in Cryptology EUROCRYPT 2003, pages 294-311. Springer, 2003.
- (2003) Advances in Cryptology EUROCRYPT 2003 , pp. 294-311
- Von Ahn, L.¹ Blum, M.² Hopper, N.J.³ Langford, J.⁴

53
- 56449119888
- Deep learning via semi-supervised embedding
- William W. Cohen, Andrew McCallum, and Sam T. Roweis, editors, New York, NY, USA, ACM
- Jason Weston, Frédéric Ratle, and Ronan Collobert. Deep learning via semi-supervised embedding. In William W. Cohen, Andrew McCallum, and Sam T. Roweis, editors, Proceedings of the Twenty-fifth International Conference on Machine Learning (ICML'08), pages 1168-1175, New York, NY, USA, 2008. ACM. ISBN 978-1-60558-205-4. doi: 10.1145/1390156.1390303.
- (2008) Proceedings of the Twenty-fifth International Conference on Machine Learning (ICML'08) , pp. 1168-1175
- Weston, J.¹ Ratle, F.² Collobert, R.³

54
- 84893382981
- arXiv preprint arXiv:1212.5701
- Matthew D Zeiler. Adadelta: An adaptive learning rate method. arXiv preprint arXiv:1212.5701, 2012.
- (2012) Adadelta: An Adaptive Learning Rate Method
- Zeiler, M.D.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.