SCOPUS Information Search Platform

35th International Conference on Machine Learning, ICML 2018

Volume 2, Issue , 2018, Pages 883-893

Understanding and simplifying one-shot architecture search

(5) Bender, Gabriel (a); Kindermans, Pieter Jan (a); Zoph, Barret (a); Vasudevan, Vijay (a); Le, Quoc (a)

(a) GOOGLE INC (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPLEX NETWORKS; REINFORCEMENT LEARNING;

COMPLEX SEARCH SPACES; DIFFERENT ARCHITECTURES; EXISTING ARCHITECTURES; EXPERIMENTAL ANALYSIS; ORDERS OF MAGNITUDE; SEARCH METHOD;

NETWORK ARCHITECTURE;

EID: 85057226704 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (400)

References (33)

1
- 85075670920
- Tensorflow: A system for large-scale machine learning
- Berkeley, CA, USA, USENIX Association
- Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., Kudlur, M., Levenberg, J., Monga, R., Moore, S., Murray, D. G., Steiner, B., Tucker, P., Vasudevan, V., Warden, P., Wicke, M., Yu, Y., and Zheng, X. Tensorflow: A system for large-scale machine learning. In Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation, OSDI'16, pp. 265-283, Berkeley, CA, USA, 2016. USENIX Association. ISBN 978-1-931971-33-1.
- (2016) Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation, OSDI'16 , pp. 265-283
- Abadi, M.¹ Barham, P.² Chen, J.³ Chen, Z.⁴ Davis, A.⁵ Dean, J.⁶ Devin, M.⁷ Ghemawat, S.⁸ Irving, G.⁹ Isard, M.¹⁰ Kudlur, M.¹¹ Levenberg, J.¹² Monga, R.¹³ Moore, S.¹⁴ Murray, D.G.¹⁵ Steiner, B.¹⁶ Tucker, P.¹⁷ Vasudevan, V.¹⁸ Warden, P.¹⁹ Wicke, M.²⁰ Yu, Y.²¹ Zheng, X.²² more..

2
- 85019172761
- Learning to learn by gradient descent by gradient descent
- Andrychowicz, M., Denil, M., Gomez, S., Hoffman, M. W., Pfau, D., Schaul, T., and de Freitas, N. Learning to learn by gradient descent by gradient descent. In Advances in Neural Information Processing Systems, pp. 3981-3989, 2016.
- (2016) Advances in Neural Information Processing Systems , pp. 3981-3989
- Andrychowicz, M.¹ Denil, M.² Gomez, S.³ Hoffman, M.W.⁴ Pfau, D.⁵ Schaul, T.⁶ De Freitas, N.⁷

3
- 85020546778
- arXiv preprint arXiv:1611.02167
- Baker, B., Gupta, O., Naik, N., and Raskar, R. Designing neural network architectures using reinforcement learning. arXiv preprint arXiv:1611.02167, 2016.
- (2016) Designing Neural Network Architectures Using Reinforcement Learning
- Baker, B.¹ Gupta, O.² Naik, N.³ Raskar, R.⁴

4
- 70450190492
- Evolving memory cell structures for sequence learning
- Springer
- Bayer, J., Wierstra, D., Togelius, J., and Schmidhuber, J. Evolving memory cell structures for sequence learning. In International Conference on Artificial Neural Networks, pp. 755-764. Springer, 2009.
- (2009) International Conference on Artificial Neural Networks , pp. 755-764
- Bayer, J.¹ Wierstra, D.² Togelius, J.³ Schmidhuber, J.⁴

5
- 85057266292
- arXiv preprint arXiv:1709.07417
- Bello, I., Zoph, B., Vasudevan, V., and Le, Q. V. Neural optimizer search with reinforcement learning. arXiv preprint arXiv:1709.07417, 2017.
- (2017) Neural Optimizer Search with Reinforcement Learning
- Bello, I.¹ Zoph, B.² Vasudevan, V.³ Le, Q.V.⁴

6
- 84857855190
- Random search for hyperparameter optimization
- Feb
- Bergstra, J. and Bengio, Y. Random search for hyperparameter optimization. Journal of Machine Learning Research, 13(Feb):281-305, 2012.
- (2012) Journal of Machine Learning Research , vol.13 , pp. 281-305
- Bergstra, J.¹ Bengio, Y.²

7
- 84897558007
- Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures
- Bergstra, J., Yamins, D., and Cox, D. Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. In International Conference on Machine Learning, pp. 115-123, 2013.
- (2013) International Conference on Machine Learning , pp. 115-123
- Bergstra, J.¹ Yamins, D.² Cox, D.³

8
- 85162384813
- Algorithms for hyper-parameter optimization
- Bergstra, J. S., Bardenet, R., Bengio, Y., and Kegl, B. Algorithms for hyper-parameter optimization. In Advances in neural information processing systems, pp. 2546-2554, 2011.
- (2011) Advances in Neural Information Processing Systems , pp. 2546-2554
- Bergstra, J.S.¹ Bardenet, R.² Bengio, Y.³ Kegl, B.⁴

9
- 85050949672
- arXiv preprint arXiv:1708.05344
- Brock, A., Lim, T., Ritchie, J. M., and Weston, N. SMASH: One-shot model architecture search through hypernetworks. arXiv preprint arXiv:1708.05344, 2017.
- (2017) SMASH: One-shot Model Architecture Search through Hypernetworks.
- Brock, A.¹ Lim, T.² Ritchie, J.M.³ Weston, N.⁴

10
- 85055115822
- arXiv preprint arXiv:1707.04873
- Cai, H., Chen, T., Zhang, W., Yu, Y., and Wang, J. Reinforcement learning for architecture search by network transformation. arXiv preprint arXiv:1707.04873, 2017.
- (2017) Reinforcement Learning for Architecture Search by Network Transformation
- Cai, H.¹ Chen, T.² Zhang, W.³ Yu, Y.⁴ Wang, J.⁵

11
- 85055095297
- arXiv preprint arXiv:1711.04528
- Elsken, T., Metzen, J.-H., and Hutter, F. Simple and efficient architecture search for convolutional neural networks. arXiv preprint arXiv:1711.04528, 2017.
- (2017) Simple and Efficient Architecture Search for Convolutional Neural Networks
- Elsken, T.¹ Metzen, J.-H.² Hutter, F.³

12
- 85057261648
- Morphnet: Fast & simple resource-constrained structure learning of deep networks
- abs/1711.06798
- Gordon, A., Eban, E., Nachum, O., Chen, B., Yang, T., and Choi, E. Morphnet: Fast & simple resource-constrained structure learning of deep networks. CoRR, abs/1711.06798, 2017. URL http://arxiv.org/abs/1711.06798.
- (2017) CoRR
- Gordon, A.¹ Eban, E.² Nachum, O.³ Chen, B.⁴ Yang, T.⁵ Choi, E.⁶

13
- 84958985283
- Learning to learn using gradient descent
- Springer
- Hochreiter, S., Younger, A. S., and Conwell, P. R. Learning to learn using gradient descent. In International Conference on Artificial Neural Networks, pp. 87-94. Springer, 2001.
- (2001) International Conference on Artificial Neural Networks , pp. 87-94
- Hochreiter, S.¹ Younger, A.S.² Conwell, P.R.³

14
- 85046996830
- Train longer, generalize better: Closing the generalization gap in large batch training of neural networks
- Hoffer, E., Hubara, I., and Soudry, D. Train longer, generalize better: closing the generalization gap in large batch training of neural networks. In Advances in Neural Information Processing Systems, pp. 1729-1739, 2017.
- (2017) Advances in Neural Information Processing Systems , pp. 1729-1739
- Hoffer, E.¹ Hubara, I.² Soudry, D.³

15
- 85030212949
- arXiv preprint arXiv:1704.04861
- Howard, A. G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861, 2017.
- (2017) Mobilenets: Efficient Convolutional Neural Networks for Mobile Vision Applications
- Howard, A.G.¹ Zhu, M.² Chen, B.³ Kalenichenko, D.⁴ Wang, W.⁵ Weyand, T.⁶ Andreetto, M.⁷ Adam, H.⁸

16
- 85050622039
- arXiv preprint arXiv:1711.09846
- Jaderberg, M., Dalibard, V., Osindero, S., Czarnecki, W. M., Donahue, J., Razavi, A., Vinyals, O., Green, T., Dunning, I., Simonyan, K., et al. Population based training of neural networks. arXiv preprint arXiv:1711.09846, 2017.
- (2017) Population Based Training of Neural Networks
- Jaderberg, M.¹ Dalibard, V.² Osindero, S.³ Czarnecki, W.M.⁴ Donahue, J.⁵ Razavi, A.⁶ Vinyals, O.⁷ Green, T.⁸ Dunning, I.⁹ Simonyan, K.¹⁰

17
- 85010821099
- An empirical exploration of recurrent network architectures
- Jozefowicz, R., Zaremba, W., and Sutskever, I. An empirical exploration of recurrent network architectures. In International Conference on Machine Learning, pp. 2342-2350.
- International Conference on Machine Learning , pp. 2342-2350
- Jozefowicz, R.¹ Zaremba, W.² Sutskever, I.³

18
- 85055111162
- arXiv preprint arXiv:1712.00559
- Liu, C., Zoph, B., Shlens, J., Hua, W., Li, L.-J., Fei-Fei, L., Yuille, A., Huang, J., and Murphy, K. Progressive neural architecture search. arXiv preprint arXiv:1712.00559, 2017a.
- (2017) Progressive Neural Architecture Search
- Liu, C.¹ Zoph, B.² Shlens, J.³ Hua, W.⁴ Li, L.-J.⁵ Fei-Fei, L.⁶ Yuille, A.⁷ Huang, J.⁸ Murphy, K.⁹

19
- 85050612902
- arXiv preprint arXiv:1711.00436
- Liu, H., Simonyan, K., Vinyals, O., Fernando, C., and Kavukcuoglu, K. Hierarchical representations for efficient architecture search. arXiv preprint arXiv:1711.00436, 2017b.
- (2017) Hierarchical Representations for Efficient Architecture Search
- Liu, H.¹ Simonyan, K.² Vinyals, O.³ Fernando, C.⁴ Kavukcuoglu, K.⁵

20
- 85020496584
- arXiv preprint arXiv:1703.00548
- Miikkulainen, R., Liang, J., Meyerson, E., Rawal, A., Fink, D., Francon, O., Raju, B., Navruzyan, A., Duffy, N., and Hodjat, B. Evolving deep neural networks. arXiv preprint arXiv:1703.00548, 2017.
- (2017) Evolving Deep Neural Networks.
- Miikkulainen, R.¹ Liang, J.² Meyerson, E.³ Rawal, A.⁴ Fink, D.⁵ Francon, O.⁶ Raju, B.⁷ Navruzyan, A.⁸ Duffy, N.⁹ Hodjat, B.¹⁰

21
- 85083952791
- Faster discovery of neural architectures by searching for paths in a large model
- Pham, H., Guan, M. Y., Zoph, B., Le, Q. V., and Dean, J. Faster discovery of neural architectures by searching for paths in a large model. International Conference on Learning Representations, 2018. URL https://openreview.net/forum?id=ByQZjx-0-.
- (2018) International Conference on Learning Representations
- Pham, H.¹ Guan, M.Y.² Zoph, B.³ Le, Q.V.⁴ Dean, J.⁵

22
- 85045081113
- arXiv preprint arXiv:1710.05941
- Ramachandran, P., Zoph, B., and Le, Q. V. Searching for activation functions. arXiv preprint arXiv:1710.05941, 2017.
- (2017) Searching for Activation Functions
- Ramachandran, P.¹ Zoph, B.² Le, Q.V.³

23
- 85048592974
- arXiv preprint arXiv:1703.01041
- Real, E., Moore, S., Selle, A., Saxena, S., Suematsu, Y. L., Le, Q., and Kurakin, A. Large-scale evolution of image classifiers. arXiv preprint arXiv:1703.01041, 2017.
- (2017) Large-scale Evolution of Image Classifiers
- Real, E.¹ Moore, S.² Selle, A.³ Saxena, S.⁴ Suematsu, Y.L.⁵ Le, Q.⁶ Kurakin, A.⁷

24
- 0008006333
- PhD thesis, Technische Universität München
- Schmidhuber, J. Evolutionary principles in self-referential learning, or on learning how to learn: The meta-meta- ... hook. PhD thesis, Technische Universität München, 1987.
- (1987) Evolutionary Principles in Self-referential Learning, or on Learning How to Learn: The Meta-meta- ... Hook
- Schmidhuber, J.¹

25
- 84869201485
- Practical Bayesian optimization of machine learning algorithms
- Snoek, J., Larochelle, H., and Adams, R. P. Practical Bayesian optimization of machine learning algorithms. In Advances in neural information processing systems, pp. 2951-2959, 2012.
- (2012) Advances in Neural Information Processing Systems , pp. 2951-2959
- Snoek, J.¹ Larochelle, H.² Adams, R.P.³

26
- 84970022032
- Scalable Bayesian optimization using deep neural networks
- Snoek, J., Rippel, O., Swersky, K., Kiros, R., Satish, N., Sundaram, N., Patwary, M., Prabhat, M., and Adams, R. Scalable Bayesian optimization using deep neural networks. In International Conference on Machine Learning, pp. 2171-2180, 2015.
- (2015) International Conference on Machine Learning , pp. 2171-2180
- Snoek, J.¹ Rippel, O.² Swersky, K.³ Kiros, R.⁴ Satish, N.⁵ Sundaram, N.⁶ Patwary, M.⁷ Prabhat, M.⁸ Adams, R.⁹

27
- 0036594106
- Evolving neural networks through augmenting topologies
- Stanley, K. O. and Miikkulainen, R. Evolving neural networks through augmenting topologies. Evolutionary computation, 10(2):99-127, 2002.
- (2002) Evolutionary Computation , vol.10 , Issue.2 , pp. 99-127
- Stanley, K.O.¹ Miikkulainen, R.²

28
- 85057243256
- Learning to learn
- Thrun, S. and Pratt, L. Learning to learn. Springer Science & Business Media, 2012.
- (2012) Springer Science & Business Media
- Thrun, S.¹ Pratt, L.²

29
- 85051516459
- arXiv preprint arXiv:1703.04813
- Wichrowska, O., Maheswaranathan, N., Hoffman, M. W., Colmenarejo, S. G., Denil, M., de Freitas, N., and Sohl-Dickstein, J. Learned optimizers that scale and generalize. arXiv preprint arXiv:1703.04813, 2017.
- (2017) Learned Optimizers that Scale and Generalize
- Wichrowska, O.¹ Maheswaranathan, N.² Hoffman, M.W.³ Colmenarejo, S.G.⁴ Denil, M.⁵ De Freitas, N.⁶ Sohl-Dickstein, J.⁷

30
- 84941874233
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Springer
- Williams, R. J. Simple statistical gradient-following algorithms for connectionist reinforcement learning. In Reinforcement Learning, pp. 5-32. Springer, 1992.
- (1992) Reinforcement Learning , pp. 5-32
- Williams, R.J.¹

31
- 85040676995
- arXiv preprint arXiv:1703.01513
- Xie, L. and Yuille, A. Genetic CNN. arXiv preprint arXiv:1703.01513, 2017.
- (2017) Genetic CNN
- Xie, L.¹ Yuille, A.²

32
- 85020101955
- Neural architecture search with reinforcement learning
- Zoph, B. and Le, Q. V. Neural architecture search with reinforcement learning. In International Conference on Learning Representations, 2016.
- (2016) International Conference on Learning Representations
- Zoph, B.¹ Le, Q.V.²

33
- 85048802871
- arXiv preprint arXiv:1707.07012
- Zoph, B., Vasudevan, V., Shlens, J., and Le, Q. V. Learning transferable architectures for scalable image recognition. arXiv preprint arXiv:1707.07012, 2017.
- (2017) Learning Transferable Architectures for Scalable Image Recognition
- Zoph, B.¹ Vasudevan, V.² Shlens, J.³ Le, Q.V.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.