SCOPUS 정보 검색 플랫폼

4th International Conference on Learning Representations, ICLR 2016 - Conference Track Proceedings

Volumn , Issue , 2016, Pages

Net2Net: Accelerating learning via knowledge transfer

(3) Chen, Tianqi a Goodfellow, Ian a Shlens, Jonathon a

a GOOGLE INC (United States)

Author keywords

[No Author keywords available]

Indexed keywords

KNOWLEDGE MANAGEMENT;

DESIGN PROCESS; KNOWLEDGE TRANSFER; KNOWLEDGE TRANSFER MECHANISMS; PRE-TRAINING; REAL-WORLD; STATE OF THE ART; WORK-FLOWS;

NEURAL NETWORKS;

EID: 85083953532 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (308)

References (20)

1
- 84958264664
- Software available from tensorflow.org
- Abadi, Martín, Agarwal, Ashish, Barham, Paul, Brevdo, Eugene, Chen, Zhifeng, Citro, Craig, Corrado, Greg S., Davis, Andy, Dean, Jeffrey, Devin, Matthieu, Ghemawat, Sanjay, Goodfellow, Ian, Harp, Andrew, Irving, Geoffrey, Isard, Michael, Jia, Yangqing, Jozefowicz, Rafal, Kaiser, Lukasz, Kudlur, Manjunath, Leven-berg, Josh, Mané, Dan, Monga, Rajat, Moore, Sherry, Murray, Derek, Olah, Chris, Schuster, Mike, Shlens, Jonathon, Steiner, Benoit, Sutskever, Ilya, Talwar, Kunal, Tucker, Paul, Vanhoucke, Vincent, Vasudevan, Vijay, Viégas, Fernanda, Vinyals, Oriol, Warden, Pete, Wattenberg, Martin, Wicke, Martin, Yu, Yuan, and Zheng, Xiaoqiang. TensorFlow: Large-scale machine learning on heterogeneous systems, 2015. URL http://tensorflow.org/. Software available from tensorflow.org.
- (2015) TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems
- Abadi, M.¹ Agarwal, A.² Barham, P.³ Brevdo, E.⁴ Chen, Z.⁵ Citro, C.⁶ Corrado, G.S.⁷ Davis, A.⁸ Dean, J.⁹ Devin, M.¹⁰ Ghemawat, S.¹¹ Goodfellow, I.¹² Harp, A.¹³ Irving, G.¹⁴ Isard, M.¹⁵ Jia, Y.¹⁶ Jozefowicz, R.¹⁷ Kaiser, L.¹⁸ Kudlur, M.¹⁹ Levenberg, J.²⁰ more..

2
- 33749545215
- Model compression
- Buciluǎ, Cristian, Caruana, Rich, and Niculescu-Mizil, Alexandru. Model compression. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 535–541. ACM, 2006.
- (2006) Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , pp. 535-541
- Buciluǎ, C.¹ Caruana, R.² Niculescu-Mizil, A.³

3
- 0000155950
- Denver, CO, Morgan Kaufmann, San Mateo
- Fahlman, Scott E. and Lebiere, Christian. The cascade-correlation learning architecture. pp. 524–532, Denver, CO, 1990. Morgan Kaufmann, San Mateo.
- (1990) The Cascade-Correlation Learning Architecture , pp. 524-532
- Fahlman, S.E.¹ Lebiere, C.²

4
- 84872555593
- Deep sparse rectifier neural networks
- Glorot, X., Bordes, A., and Bengio, Y. Deep sparse rectifier neural networks. In AISTATS’2011, 2011.
- (2011) AISTATS’2011
- Glorot, X.¹ Bordes, A.² Bengio, Y.³

5
- 84897543523
- Maxout networks
- Goodfellow, Ian J., Warde-Farley, David, Mirza, Mehdi, Courville, Aaron, and Bengio, Yoshua. Maxout networks. In ICML’2013, 2013.
- (2013) ICML’2013
- Goodfellow, I.J.¹ Warde-Farley, D.² Mirza, M.³ Courville, A.⁴ Bengio, Y.⁵

6
- 46249099599
- Knowledge transfer in deep convolutional neural nets
- Gutstein, Steven, Fuentes, Olac, and Freudenthal, Eric. Knowledge transfer in deep convolutional neural nets. International Journal on Artificial Intelligence Tools, 17(03):555–567, 2008.
- (2008) International Journal on Artificial Intelligence Tools , vol.17 , Issue.3 , pp. 555-567
- Gutstein, S.¹ Fuentes, O.² Freudenthal, E.³

7
- 84959176782
- arXiv preprint
- Hinton, Geoffrey, Vinyals, Oriol, and Dean, Jeff. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531, 2015.
- (2015) Distilling the Knowledge in A Neural Network
- Hinton, G.¹ Vinyals, O.² Dean, J.³

8
- 33745805403
- A fast learning algorithm for deep belief nets
- Hinton, Geoffrey E., Osindero, Simon, and Teh, Yee Whye. A fast learning algorithm for deep belief nets. Neural Computation, 18:1527–1554, 2006.
- (2006) Neural Computation , vol.18 , pp. 1527-1554
- Hinton, G.E.¹ Osindero, S.² Teh, Y.W.³

9
- 84964923476
- Ioffe, Sergey and Szegedy, Christian. Batch normalization: Accelerating deep network training by reducing internal covariate shift. 2015.
- (2015) Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
- Ioffe, S.¹ Szegedy, C.²

10
- 77953183471
- What is the best multi-stage architecture for object recognition?
- Jarrett, Kevin, Kavukcuoglu, Koray, Ranzato, Marc’Aurelio, and LeCun, Yann. What is the best multi-stage architecture for object recognition? In Proc. International Conference on Computer Vision (ICCV’09), pp. 2146–2153. IEEE, 2009.
- (2009) Proc. International Conference on Computer Vision (ICCV’09) , pp. 2146-2153
- Jarrett, K.¹ Kavukcuoglu, K.² Ranzato, M.³ LeCun, Y.⁴

11
- 85070990006
- Technical report, (unpublished)
- Mahayri, Amjad, Ballas, Nicolas, and Courville, Aaron. FitNets and batch normalization. Technical report, (unpublished), 2015.
- (2015) FitNets and Batch Normalization
- Mahayri, A.¹ Ballas, N.² Courville, A.³

12
- 84949887093
- Never-ending learning
- Mitchell, T., Cohen, W., Hruschka, E., Talukdar, P., Betteridge, J., Carlson, A., Dalvi, B., Gardner, M., Kisiel, B., Krishnamurthy, J., Lao, N., Mazaitis, K., Mohamed, T., Nakashole, N., Platanios, E., Ritter, A., Samadi, M., Settles, B., Wang, R., Wijaya, D., Gupta, A., Chen, X., Saparov, A., Greaves, M., and Welling, J. Never-ending learning. In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI-15), 2015.
- (2015) Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI-15)
- Mitchell, T.¹ Cohen, W.² Hruschka, E.³ Talukdar, P.⁴ Betteridge, J.⁵ Carlson, A.⁶ Dalvi, B.⁷ Gardner, M.⁸ Kisiel, B.⁹ Krishnamurthy, J.¹⁰ Lao, N.¹¹ Mazaitis, K.¹² Mohamed, T.¹³ Nakashole, N.¹⁴ Platanios, E.¹⁵ Ritter, A.¹⁶ Samadi, M.¹⁷ Settles, B.¹⁸ Wang, R.¹⁹ Wijaya, D.²⁰ more..

13
- 84964544562
- Technical Report arXiv
- Romero, Adriana, Ballas, Nicolas, Ebrahimi Kahou, Samira, Chassang, Antoine, Gatta, Carlo, and Bengio, Yoshua. FitNets: Hints for thin deep nets. Technical Report Arxiv report 1412.6550, arXiv, 2014.
- (2014) FitNets: Hints for Thin Deep Nets
- Romero, A.¹ Ballas, N.² Ebrahimi Kahou, S.³ Chassang, A.⁴ Gatta, C.⁵ Bengio, Y.⁶

14
- 84883265722
- Lifelong machine learning systems: Beyond learning algorithms
- Silver, DL, Yang, Q, and Li, L. Lifelong machine learning systems: Beyond learning algorithms. In AAAI Spring Symposium-Technical Report, 2013.
- (2013) AAAI Spring Symposium-Technical Report
- Silver, D.L.¹ Yang, Q.² Li, L.³

15
- 85083953063
- Very deep convolutional networks for large-scale image recognition
- Simonyan, Karen and Zisserman, Andrew. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.
- (2015) ICLR
- Simonyan, K.¹ Zisserman, A.²

16
- 84893795422
- Parsing with compositional vector grammars
- Socher, Richard, Bauer, John, Manning, Christopher D., and Ng, Andrew Y. Parsing with compositional vector grammars. In In Proceedings of the ACL conference, 2013.
- (2013) Proceedings of the ACL Conference
- Socher, R.¹ Bauer, J.² Manning, C.D.³ Ng, A.Y.⁴

17
- 84904163933
- Dropout: A simple way to prevent neural networks from overfitting
- Srivastava, Nitish, Hinton, Geoffrey, Krizhevsky, Alex, Sutskever, Ilya, and Salakhutdinov, Ruslan. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15: 1929–1958, 2014. URL http://jmlr.org/papers/v15/srivastava14a.html.
- (2014) Journal of Machine Learning Research , vol.15 , pp. 1929-1958
- Srivastava, N.¹ Hinton, G.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

18
- 84964983441
- Technical report
- Szegedy, Christian, Liu, Wei, Jia, Yangqing, Sermanet, Pierre, Reed, Scott, Anguelov, Dragomir, Erhan, Du-mitru, Vanhoucke, Vincent, and Rabinovich, Andrew. Going deeper with convolutions. Technical report, arXiv:1409.4842, 2014.
- (2014) Going Deeper with Convolutions
- Szegedy, C.¹ Liu, W.² Jia, Y.³ Sermanet, P.⁴ Reed, S.⁵ Anguelov, D.⁶ Erhan, D.-M.⁷ Vanhoucke, V.⁸ Rabinovich, A.⁹

19
- 2342641167
- Technical Report CMU-CS-95-208, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, November
- Thrun, Sebastian. Lifelong learning: A case study. Technical Report CMU-CS-95-208, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, November 1995.
- (1995) Lifelong Learning: A Case Study
- Thrun, S.¹

20
- 84893343292
- Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude
- Tieleman, T and Hinton, G. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for Machine Learning, 4, 2012.
- (2012) COURSERA: Neural Networks for Machine Learning , vol.4
- Tieleman, T.¹ Hinton, G.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.