SCOPUS Information Retrieval Platform

5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings

Volume , Issue , 2017, Pages

Snapshot ensembles: Train 1, get M for free

(6) Huang, Gao (a); Li, Yixuan (a); Pleiss, Geoff (a); Liu, Zhuang (b); Hopcroft, John E. (a); Weinberger, Kilian Q. (a)

a Cornell University (United States)

b Tsinghua University (China)

Author keywords

[No Author keywords available]

Indexed keywords

CONTRADICTORY GOALS; INDIVIDUAL NETWORK; MODEL AVERAGING; MODEL PARAMETERS; MULTIPLE NEURAL NETWORKS; NETWORK ENSEMBLE; RAPID CONVERGENCE; STATE OF THE ART; NETWORK ARCHITECTURE

EID: 85088231359 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (411)

References (43)

1
- 84904136037
- Large-scale machine learning with stochastic gradient descent
- Léon Bottou. Large-scale machine learning with stochastic gradient descent. In COMPSTAT. 2010.
- (2010) COMPSTAT
- Bottou, L.¹

2
- 85051496108
- Model compression
- Cristian Buciluǎ, Rich Caruana, and Alexandru Niculescu-Mizil. Model compression. In KDD, 2006.
- (2006) KDD
- Buciluǎ, C.¹ Caruana, R.² Niculescu-Mizil, A.³

3
- 14344255621
- Ensemble selection from libraries of models
- Rich Caruana, Alexandru Niculescu-Mizil, Geoff Crew, and Alex Ksikes. Ensemble selection from libraries of models. In ICML, 2004.
- (2004) ICML
- Caruana, R.¹ Niculescu-Mizil, A.² Crew, G.³ Ksikes, A.⁴

4
- 84888340666
- Torch7: A matlab-like environment for machine learning
- Ronan Collobert, Koray Kavukcuoglu, and Clément Farabet. Torch7: A matlab-like environment for machine learning. In BigLearn, NIPS Workshop, 2011.
- (2011) BigLearn, NIPS Workshop
- Collobert, R.¹ Kavukcuoglu, K.² Farabet, C.³

5
- 84928534967
- Identifying and attacking the saddle point problem in high-dimensional non-convex optimization
- Yann N Dauphin, Razvan Pascanu, Caglar Gulcehre, Kyunghyun Cho, Surya Ganguli, and Yoshua Bengio. Identifying and attacking the saddle point problem in high-dimensional non-convex optimization. In NIPS, 2014.
- (2014) NIPS
- Dauphin, Y.N.¹ Pascanu, R.² Gulcehre, C.³ Cho, K.⁴ Ganguli, S.⁵ Bengio, Y.⁶

6
- 85198028989
- ImageNet: A large-scale hierarchical image database
- Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. In CVPR, 2009.
- (2009) CVPR
- Deng, J.¹ Dong, W.² Socher, R.³ Li, L.-J.⁴ Li, K.⁵ Fei-Fei, L.⁶

7
- 80052250414
- Adaptive subgradient methods for online learning and stochastic optimization
- John Duchi, Elad Hazan, and Yoram Singer. Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research, 12(Jul):2121-2159, 2011.
- (2011) Journal of Machine Learning Research , vol.12 , pp. 2121-2159
- Duchi, J.¹ Hazan, E.² Singer, Y.³

8
- 84897543523
- Maxout networks
- Ian J Goodfellow, David Warde-Farley, Mehdi Mirza, Aaron Courville, and Yoshua Bengio. Maxout networks. In ICML, 2013.
- (2013) ICML
- Goodfellow, I.J.¹ Warde-Farley, D.² Mirza, M.³ Courville, A.⁴ Bengio, Y.⁵

9
- 84973372740
- Qualitatively characterizing neural network optimization problems
- Ian J Goodfellow, Oriol Vinyals, and Andrew M Saxe. Qualitatively characterizing neural network optimization problems. arXiv preprint arXiv:1412.6544, 2014.
- (2014) arXiv preprint arXiv:1412.6544
- Goodfellow, I.J.¹ Vinyals, O.² Saxe, A.M.³

10
- 0025507176
- Neural network ensembles
- Lars Kai Hansen and Peter Salamon. Neural network ensembles. IEEE transactions on pattern analysis and machine intelligence, 12:993-1001, 1990.
- (1990) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.12 , pp. 993-1001
- Hansen, L.K.¹ Salamon, P.²

11
- 84986274465
- Deep residual learning for image recognition
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In CVPR, 2016a.
- (2016) CVPR
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

12
- 84990068011
- Identity mappings in deep residual networks
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Identity mappings in deep residual networks. In ECCV, 2016b.
- (2016) ECCV
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

13
- 84959176782
- Distilling the knowledge in a neural network
- Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531, 2015.
- (2015) arXiv preprint arXiv:1503.02531
- Hinton, G.¹ Vinyals, O.² Dean, J.³

14
- 85013999932
- Densely connected convolutional networks
- Gao Huang, Zhuang Liu, and Kilian Q Weinberger. Densely connected convolutional networks. arXiv preprint arXiv:1608.06993, 2016a.
- (2016) arXiv preprint arXiv:1608.06993
- Huang, G.¹ Liu, Z.² Weinberger, K.Q.³

15
- 84984824417
- Deep networks with stochastic depth
- Gao Huang, Yu Sun, Zhuang Liu, Daniel Sedra, and Kilian Weinberger. Deep networks with stochastic depth. In ECCV, 2016b.
- (2016) ECCV
- Huang, G.¹ Sun, Y.² Liu, Z.³ Sedra, D.⁴ Weinberger, K.⁵

16
- 84964923476
- Batch normalization: Accelerating deep network training by reducing internal covariate shift
- Sergey Ioffe and Christian Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In ICML, 2015.
- (2015) ICML
- Ioffe, S.¹ Szegedy, C.²

17
- 84941211770
- On using very large target vocabulary for neural machine translation
- Sébastien Jean, Kyunghyun Cho, Roland Memisevic, and Yoshua Bengio. On using very large target vocabulary for neural machine translation. arXiv preprint arXiv:1412.2007, 2014.
- (2014) arXiv preprint arXiv:1412.2007
- Jean, S.¹ Cho, K.² Memisevic, R.³ Bengio, Y.⁴

18
- 84975325896
- Deep learning without poor local minima
- Kenji Kawaguchi. Deep learning without poor local minima. arXiv preprint arXiv:1605.07110, 2016.
- (2016) arXiv preprint arXiv:1605.07110
- Kawaguchi, K.¹

19
- 85015249548
- On large-batch training for deep learning: Generalization gap and sharp minima
- Nitish Shirish Keskar, Dheevatsa Mudigere, Jorge Nocedal, Mikhail Smelyanskiy, and Ping Tak Peter Tang. On large-batch training for deep learning: Generalization gap and sharp minima. arXiv preprint arXiv:1609.04836, 2016.
- (2016) arXiv preprint arXiv:1609.04836
- Keskar, N.S.¹ Mudigere, D.² Nocedal, J.³ Smelyanskiy, M.⁴ Tang, P.T.P.⁵

20
- 84941620184
- Adam: A method for stochastic optimization
- Diederik Kingma and Jimmy Ba. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
- (2014) arXiv preprint arXiv:1412.6980
- Kingma, D.¹ Ba, J.²

21
- 77956002520
- Alex Krizhevsky and Geoffrey Hinton. Learning multiple layers of features from tiny images. 2009.
- (2009) Learning Multiple Layers of Features from Tiny Images
- Krizhevsky, A.¹ Hinton, G.²

22
- 84876231242
- Imagenet classification with deep convolutional neural networks
- Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012.
- (2012) NIPS
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

23
- 0000749354
- Neural network ensembles, cross validation, and active learning
- Anders Krogh, Jesper Vedelsby, et al. Neural network ensembles, cross validation, and active learning. In NIPS, volume 7, 1995.
- (1995) NIPS , vol.7
- Krogh, A.¹ Vedelsby, J.²

24
- 85018911798
- Zoneout: Regularizing RNNs by randomly preserving hidden activations
- David Krueger, Tegan Maharaj, János Kramár, Mohammad Pezeshki, Nicolas Ballas, Nan Rosemary Ke, Anirudh Goyal, Yoshua Bengio, Hugo Larochelle, Aaron Courville, et al. Zoneout: Regularizing RNNs by randomly preserving hidden activations. arXiv preprint arXiv:1606.01305, 2016.
- (2016) arXiv preprint arXiv:1606.01305
- Krueger, D.¹ Maharaj, T.² Kramár, J.³ Pezeshki, M.⁴ Ballas, N.⁵ Ke, N.R.⁶ Goyal, A.⁷ Bengio, Y.⁸ Larochelle, H.⁹ Courville, A.¹⁰

25
- 85044205527
- Temporal ensembling for semi-supervised learning
- Samuli Laine and Timo Aila. Temporal ensembling for semi-supervised learning. arXiv preprint arXiv:1610.02242, 2016.
- (2016) arXiv preprint arXiv:1610.02242
- Laine, S.¹ Aila, T.²

26
- 85011282525
- FractalNet: Ultra-deep neural networks without residuals
- Gustav Larsson, Michael Maire, and Gregory Shakhnarovich. FractalNet: Ultra-deep neural networks without residuals. arXiv preprint arXiv:1605.07648, 2016.
- (2016) arXiv preprint arXiv:1605.07648
- Larsson, G.¹ Maire, M.² Shakhnarovich, G.³

27
- 85009928594
- Deeply-supervised nets
- Chen-Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, and Zhuowen Tu. Deeply-supervised nets. In AISTATS, 2015.
- (2015) AISTATS
- Lee, C.-Y.¹ Xie, S.² Gallagher, P.³ Zhang, Z.⁴ Tu, Z.⁵

28
- 84939241380
- Network in network
- Min Lin, Qiang Chen, and Shuicheng Yan. Network in network. arXiv preprint arXiv:1312.4400, 2013.
- (2013) arXiv preprint arXiv:1312.4400
- Lin, M.¹ Chen, Q.² Yan, S.³

29
- 85010905792
- SGDR: Stochastic gradient descent with restarts
- Ilya Loshchilov and Frank Hutter. SGDR: Stochastic gradient descent with restarts. arXiv preprint arXiv:1608.03983, 2016.
- (2016) arXiv preprint arXiv:1608.03983
- Loshchilov, I.¹ Hutter, F.²

30
- 85057385638
- Mohammad Moghimi, Mohammad Saberian, Jian Yang, Li-Jia Li, Nuno Vasconcelos, and Serge Belongie. Boosted convolutional neural networks. 2016.
- (2016) Boosted Convolutional Neural Networks
- Moghimi, M.¹ Saberian, M.² Yang, J.³ Li, L.-J.⁴ Vasconcelos, N.⁵ Belongie, S.⁶

31
- 84865114495
- Reading digits in natural images with unsupervised feature learning
- Yuval Netzer, Tao Wang, Adam Coates, Alessandro Bissacco, Bo Wu, and Andrew Y Ng. Reading digits in natural images with unsupervised feature learning. In NIPS Workshop on Deep Learning and Unsupervised Feature Learning, 2011.
- (2011) NIPS Workshop on Deep Learning and Unsupervised Feature Learning
- Netzer, Y.¹ Wang, T.² Coates, A.³ Bissacco, A.⁴ Wu, B.⁵ Ng, A.Y.⁶

32
- 84964544562
- FitNets: Hints for thin deep nets
- Adriana Romero, Nicolas Ballas, Samira Ebrahimi Kahou, Antoine Chassang, Carlo Gatta, and Yoshua Bengio. FitNets: Hints for thin deep nets. arXiv preprint arXiv:1412.6550, 2014.
- (2014) arXiv preprint arXiv:1412.6550
- Romero, A.¹ Ballas, N.² Kahou, S.E.³ Chassang, A.⁴ Gatta, C.⁵ Bengio, Y.⁶

33
- 84992639772
- Edinburgh neural machine translation systems for WMT 16
- Rico Sennrich, Barry Haddow, and Alexandra Birch. Edinburgh neural machine translation systems for WMT 16. arXiv preprint arXiv:1606.02891, 2016.
- (2016) arXiv preprint arXiv:1606.02891
- Sennrich, R.¹ Haddow, B.² Birch, A.³

34
- 84874575248
- Convolutional neural networks applied to house numbers digit classification
- Pierre Sermanet, Soumith Chintala, and Yann LeCun. Convolutional neural networks applied to house numbers digit classification. In ICPR, 2012.
- (2012) ICPR
- Sermanet, P.¹ Chintala, S.² LeCun, Y.³

35
- 85044317583
- Swapout: Learning an ensemble of deep architectures
- Saurabh Singh, Derek Hoiem, and David Forsyth. Swapout: Learning an ensemble of deep architectures. arXiv preprint arXiv:1605.06465, 2016.
- (2016) arXiv preprint arXiv:1605.06465
- Singh, S.¹ Hoiem, D.² Forsyth, D.³

36
- 85013935577
- No more pesky learning rate guessing games
- Leslie N. Smith. No more pesky learning rate guessing games. CoRR, abs/1506.01186, 2016. URL http://arxiv.org/abs/1506.01186.
- (2016) CoRR, abs/1506.01186
- Smith, L.N.¹

37
- 84962006941
- Striving for simplicity: The all convolutional net
- Jost Tobias Springenberg, Alexey Dosovitskiy, Thomas Brox, and Martin Riedmiller. Striving for simplicity: The all convolutional net. arXiv preprint arXiv:1412.6806, 2014.
- (2014) arXiv preprint arXiv:1412.6806
- Springenberg, J.T.¹ Dosovitskiy, A.² Brox, T.³ Riedmiller, M.⁴

38
- 84904163933
- Dropout: A simple way to prevent neural networks from overfitting
- Nitish Srivastava, Geoffrey E Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. Dropout: a simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15(1):1929-1958, 2014.
- (2014) Journal of Machine Learning Research , vol.15 , Issue.1 , pp. 1929-1958
- Srivastava, N.¹ Hinton, G.E.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

39
- 84965156812
- Highway networks
- Rupesh Kumar Srivastava, Klaus Greff, and Jürgen Schmidhuber. Highway networks. arXiv preprint arXiv:1505.00387, 2015.
- (2015) arXiv preprint arXiv:1505.00387
- Srivastava, R.K.¹ Greff, K.² Schmidhuber, J.³

40
- 0032120796
- Fast committee learning: Preliminary results
- A Swann and N Allinson. Fast committee learning: Preliminary results. Electronics Letters, 34(14): 1408-1410, 1998.
- (1998) Electronics Letters , vol.34 , Issue.14 , pp. 1408-1410
- Swann, A.¹ Allinson, N.²

41
- 84897550107
- Regularization of neural networks using dropconnect
- Li Wan, Matthew Zeiler, Sixin Zhang, Yann LeCun, and Rob Fergus. Regularization of neural networks using dropconnect. In ICML, 2013.
- (2013) ICML
- Wan, L.¹ Zeiler, M.² Zhang, S.³ LeCun, Y.⁴ Fergus, R.⁵

42
- 84923852371
- Horizontal and vertical ensemble with deep representation for classification
- Jingjing Xie, Bing Xu, and Zhang Chuang. Horizontal and vertical ensemble with deep representation for classification. arXiv preprint arXiv:1306.2759, 2013.
- (2013) arXiv preprint arXiv:1306.2759
- Xie, J.¹ Xu, B.² Chuang, Z.³

43
- 84990053656
- Wide residual networks
- Sergey Zagoruyko and Nikos Komodakis. Wide residual networks. arXiv preprint arXiv:1605.07146, 2016.
- (2016) arXiv preprint arXiv:1605.07146
- Zagoruyko, S.¹ Komodakis, N.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.