2. Bellemare, M., Veness, J., and Talvitie, E. (2014). Skip context tree switching. In Proceedings of the 31st International Conference on Machine Learning, pages 1458-1466.
3. Bellemare, M. G., Naddaf, Y., Veness, J., and Bowling, M. (2013). The Arcade Learning Environment: An evaluation platform for general agents. Journal of Artificial Intelligence Research, 47:253-279.
6. Houthooft, R., Chen, X., Duan, Y., Schulman, J., De Turck, F., and Abbeel, P. (2016). Variational information maximizing exploration.
9. Leike, J., Lattimore, T., Orseau, L., and Hutter, M. (2016). Thompson sampling is asymptotically optimal in general environments. In Proceedings of the Conference on Uncertainty in Artificial Intelligence.
10. Lopes, M., Lang, T., Toussaint, M., and Oudeyer, P.-Y. (2012). Exploration in model-based reinforcement learning by empirically estimating learning progress. In Advances in Neural Information Processing Systems 25.
12. Mnih, V., Badia, A. P., Mirza, M., Graves, A., Lillicrap, T. P., Harley, T., Silver, D., and Kavukcuoglu, K. (2016). Asynchronous methods for deep reinforcement learning. In Proceedings of the International Conference on Machine Learning.
13. Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A. A., Veness, J., Bellemare, M. G., Graves, A., Riedmiller, M., Fidjeland, A. K., Ostrovski, G., et al. (2015). Human-level control through deep reinforcement learning. Nature, 518(7540):529-533.
14. Mohamed, S. and Rezende, D. J. (2015). Variational information maximisation for intrinsically motivated reinforcement learning. In Advances in Neural Information Processing Systems 28.
17. Oudeyer, P.-Y., Kaplan, F., and Hafner, V. V. (2007). Intrinsic motivation systems for autonomous mental development. IEEE Transactions on Evolutionary Computation, 11(2):265-286.
22. Strehl, A. L. and Littman, M. L. (2008). An analysis of model-based interval estimation for Markov decision processes. Journal of Computer and System Sciences, 74(8):1309-1331.