SCOPUS 정보 검색 플랫폼

5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings

Volumn , Issue , 2017, Pages

Metacontrol for adaptive imagination-based optimization

(6) Hamrick, Jessica B a Ballard, Andrew J a Pascanu, Razvan a Vinyals, Oriol a Heess, Nicolas a Battaglia, Peter W a

a DEEPMIND (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

BUDGET CONTROL; DECISION MAKING; MACHINE LEARNING;

COMPUTATIONAL BUDGET; COMPUTATIONAL RESOURCES; DECISION-MAKING PROBLEM; INTERACTION NETWORKS; MODEL-BASED REINFORCEMENT LEARNING; OPTIMIZATION PROCEDURES; REINFORCEMENT LEARNING AGENT; STATE TRANSITION MODELS;

REINFORCEMENT LEARNING;

EID: 85087796046 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (59)

References (12)

1
- 84958264664
- Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, et al. TensorFlow: Large-scale machine learning on heterogeneous systems, 2015. URL http://tensorflow.org/. Software available from tensorflow.org.
- (2015) TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems
- Abadi, M.¹ Agarwal, A.² Barham, P.³ Brevdo, E.⁴ Chen, Z.⁵ Citro, C.⁶ Corrado, G.S.⁷ Davis, A.⁸ Dean, J.⁹ Devin, M.¹⁰

2
- 85018882182
- Interaction networks for learning about objects, relations and physics
- Peter Battaglia, Razvan Pascanu, Matthew Lai, Danilo Jimenez Rezende, and Koray Kavukcuoglu. Interaction networks for learning about objects, relations and physics. Advances in Neural Information Processing Systems, 2016.
- (2016) Advances in Neural Information Processing Systems
- Battaglia, P.¹ Pascanu, R.² Lai, M.³ Rezende, D.J.⁴ Kavukcuoglu, K.⁵

3
- 85016060752
- Alex Graves. Adaptive computation time for recurrent neural networks. arXiv:1603.08983, 2016.
- (2016) Adaptive Computation Time for Recurrent Neural Networks
- Graves, A.¹

4
- 84886054445
- Selecting computations: Theory and applications
- Nicholas Hay, Stuart J. Russell, David Tolpin, and Solomon Eyal Shimony. Selecting computations: Theory and applications. Proceedings of the 28th Conference on Uncertainty in Artificial Intelligence, 2012.
- (2012) Proceedings of the 28th Conference on Uncertainty in Artificial Intelligence
- Hay, N.¹ Russell, S.J.² Tolpin, D.³ Shimony, S.E.⁴

5
- 84965103751
- Learning continuous control policies by stochastic value gradients
- Nicolas Heess, Gregory Wayne, David Silver, Tim Lillicrap, Tom Erez, and Yuval Tassa. Learning continuous control policies by stochastic value gradients. Advances in Neural Information Processing Systems, 2015.
- (2015) Advances in Neural Information Processing Systems
- Heess, N.¹ Wayne, G.² Silver, D.³ Lillicrap, T.⁴ Erez, T.⁵ Tassa, Y.⁶

6
- 84941620184
- Diederik Kingma and Jimmy Ba. Adam: A method for stochastic optimization. arXiv:1412.6980, 2014.
- (2014) Adam: A Method for Stochastic Optimization
- Kingma, D.¹ Ba, J.²

7
- 84999036937
- Asynchronous methods for deep reinforcement learning
- Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. Asynchronous methods for deep reinforcement learning. Proceedings of the 33rd International Conference on Machine Learning, 2016.
- (2016) Proceedings of the 33rd International Conference on Machine Learning
- Mnih, V.¹ Badia, A.P.² Mirza, M.³ Graves, A.⁴ Lillicrap, T.P.⁵ Harley, T.⁶ Silver, D.⁷ Kavukcuoglu, K.⁸

8
- 84892982833
- On the difficulty of training recurrent neural networks
- Razvan Pascanu, Tomas Mikolov, and Yoshua Bengio. On the difficulty of training recurrent neural networks. Proceedings of the 27st International Conference on Machine Learning, pp. 1310-1318, 2013.
- (2013) Proceedings of the 27st International Conference on Machine Learning , pp. 1310-1318
- Pascanu, R.¹ Mikolov, T.² Bengio, Y.³

9
- 0026155868
- Principles of metareasoning
- Stuart Russell and Eric Wefald. Principles of metareasoning. Artificial Intelligence, 49(1):361 - 395, 1991.
- (1991) Artificial Intelligence , vol.49 , Issue.1 , pp. 361-395
- Russell, S.¹ Wefald, E.²

10
- 85018927054
- Aäron van den Oord, Nal Kalchbrenner, Oriol Vinyals, Lasse Espeholt, Alex Graves, and Koray Kavukcuoglu. Conditional image generation with PixelCNN decoders. arXiv:1606.05328, 2016.
- (2016) Conditional Image Generation with PixelCNN Decoders
- Van Den Oord, A.¹ Kalchbrenner, N.² Vinyals, O.³ Espeholt, L.⁴ Graves, A.⁵ Kavukcuoglu, K.⁶

11
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Ronald J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8(3-4):229-256, 1992.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 229-256
- Williams, R.J.¹

12
- 0041154467
- Function optimization using connectionist reinforcement learning algorithms
- Ronald J. Williams and Jing Peng. Function optimization using connectionist reinforcement learning algorithms. Connection Science, 3(3):241-268, 1991.
- (1991) Connection Science , vol.3 , Issue.3 , pp. 241-268
- Williams, R.J.¹ Peng, J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.