-
2
-
-
84879976780
-
The arcade learning environment: An evaluation platform for general agents
-
06
-
M. G. Bellemare, Y. Naddaf, J. Veness, and M. Bowling. The arcade learning environment: An evaluation platform for general agents. Journal of Artificial Intelligence Research, 47: 253-279, 06 2013.
-
(2013)
Journal of Artificial Intelligence Research
, vol.47
, pp. 253-279
-
-
Bellemare, M.G.1
Naddaf, Y.2
Veness, J.3
Bowling, M.4
-
3
-
-
84926078662
-
-
Cambridge University Press, New York, NY, USA
-
Nicolo Cesa-Bianchi and Gabor Lugosi. Prediction, Learning, and Games. Cambridge University Press, New York, NY, USA, 2006. ISBN 0521841089.
-
(2006)
Prediction, Learning, and Games
-
-
Cesa-Bianchi, N.1
Lugosi, G.2
-
4
-
-
84858980829
-
Reasoning, learning, and creativity: Frontal lobe function and human decision-making
-
03
-
Anne Collins and Etienne Koechlin. Reasoning, learning, and creativity: Frontal lobe function and human decision-making. PLoS Biol, 10(3): 1-16, 03 2012.
-
(2012)
PLoS Biol
, vol.10
, Issue.3
, pp. 1-16
-
-
Collins, A.1
Koechlin, E.2
-
5
-
-
84881089217
-
Cognitive control over learning: Creating, clustering and generalizing task-set structure
-
Anne G.E. Collins and Michael J. Frank. Cognitive Control over Learning: Creating, Clustering and Generalizing Task-Set Structure. Psychological review, 120.1: 190-229, 2013.
-
(2013)
Psychological Review
, vol.120
, Issue.1
, pp. 190-229
-
-
Collins, A.G.E.1
Frank, M.J.2
-
6
-
-
84903295903
-
Foundations of human reasoning in the prefrontal cortex
-
Maël Donoso, Anne G. E. Collins, and Etienne Koechlin. Foundations of human reasoning in the prefrontal cortex. Science, 344(6191): 1481-1486, 2014. doi: 10.1126/science.1252254.
-
(2014)
Science
, vol.344
, Issue.6191
, pp. 1481-1486
-
-
Donoso, M.1
Collins, A.G.E.2
Koechlin, E.3
-
7
-
-
0036618011
-
Multiple model-based reinforcement learning
-
Kenji Doya and Kazuyuki Samejima. Multiple model-based reinforcement learning. Neural Computation, 14: 1347-1369, 2002.
-
(2002)
Neural Computation
, vol.14
, pp. 1347-1369
-
-
Doya, K.1
Samejima, K.2
-
8
-
-
80052250414
-
Adaptive subgradient methods for online learning and stochastic optimization
-
07
-
John Duchi, Elad Hazan, and Yoram Singer. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization. Journal of Machine Learning Research (JMLR), 12: 2121-2159, 07 2011.
-
(2011)
Journal of Machine Learning Research (JMLR)
, vol.12
, pp. 2121-2159
-
-
Duchi, J.1
Hazan, E.2
Singer, Y.3
-
9
-
-
84969749373
-
MADE: Masked autoencoder for distribution estimation
-
Mathieu Germain, Karol Gregor, Iain Murray, and Hugo Larochelle. MADE: masked autoencoder for distribution estimation. In Proceedings of the 32nd International Conference on Machine Learning, JMLR W&CP, Volume 37, pages 881-889, 2015.
-
(2015)
Proceedings of the 32nd International Conference on Machine Learning, JMLR W&CP
, vol.37
, pp. 881-889
-
-
Germain, M.1
Gregor, K.2
Murray, I.3
Larochelle, H.4
-
10
-
-
84867973521
-
Efficient tracking of large classes of experts
-
A. György, T. Linder, and G. Lugosi. Efficient tracking of large classes of experts. IEEE Transactions on Information Theory, 58(11): 6709-6725, 2011.
-
(2011)
IEEE Transactions on Information Theory
, vol.58
, Issue.11
, pp. 6709-6725
-
-
György, A.1
Linder, T.2
Lugosi, G.3
-
12
-
-
34548243292
-
On universal prediction and Bayesian confirmation
-
Marcus Hutter. On universal prediction and Bayesian confirmation. Theoretical Computer Science, 384(1): 33-48, 2007.
-
(2007)
Theoretical Computer Science
, vol.384
, Issue.1
, pp. 33-48
-
-
Hutter, M.1
-
14
-
-
84887453552
-
Concentration and confidence for discrete Bayesian sequence predictors
-
Sanjay Jain, Rémi Munos, Frank Stephan, and Thomas Zeugmann, editors Springer
-
Tor Lattimore, Marcus Hutter, and Peter Sunehag. Concentration and confidence for discrete bayesian sequence predictors. In Sanjay Jain, Rémi Munos, Frank Stephan, and Thomas Zeugmann, editors, Proceedings of the 24th International Conference on Algorithmic Learning Theory, pages 324-338. Springer, 2013.
-
(2013)
Proceedings of the 24th International Conference on Algorithmic Learning Theory
, pp. 324-338
-
-
Lattimore, T.1
Hutter, M.2
Sunehag, P.3
-
15
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Nov
-
Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11): 2278-2324, Nov 1998.
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
Lecun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
16
-
-
84924051598
-
Human-level control through deep reinforcement learning
-
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, and Demis Hassabis. Human-level control through deep reinforcement learning. Nature, 518, 2015.
-
(2015)
Nature
, pp. 518
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Rusu, A.A.4
Veness, J.5
Bellemare, M.G.6
Graves, A.7
Riedmiller, M.8
Fidjeland, A.K.9
Ostrovski, G.10
Petersen, S.11
Beattie, C.12
Sadik, A.13
Antonoglou, I.14
King, H.15
Kumaran, D.16
Wierstra, D.17
Legg, S.18
Hassabis, D.19
-
17
-
-
84881051130
-
Partition tree weighting
-
March
-
J. Veness, M. White, M. Bowling, and A. Gyorgy. Partition tree weighting. In Data Compression Conference (DCC), pages 321-330, March 2013.
-
(2013)
Data Compression Conference (DCC)
, pp. 321-330
-
-
Veness, J.1
White, M.2
Bowling, M.3
Gyorgy, A.4
-
18
-
-
84960105649
-
Compress and control
-
January 25-30, 2015, Austin, Texas, USA
-
Joel Veness, Marc G. Bellemare, Marcus Hutter, Alvin Chua, and Guillaume Desjardins. Compress and control. In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, January 25-30, 2015, Austin, Texas, USA., pages 3016-3023, 2015.
-
(2015)
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence
, pp. 3016-3023
-
-
Veness, J.1
Bellemare, M.G.2
Hutter, M.3
Chua, A.4
Desjardins, G.5
-
19
-
-
0022026217
-
Random sampling with a reservoir
-
March
-
Jeffrey S. Vitter. Random sampling with a reservoir. ACM Trans. Math. Softw., 11(1): 37-57, March 1985. ISSN 0098-3500. doi: 10.1145/3147.3165.
-
(1985)
ACM Trans. Math. Softw.
, vol.11
, Issue.1
, pp. 37-57
-
-
Vitter, J.S.1
-
20
-
-
0030653133
-
Live-and-die coding for binary piecewise i.i.d. Sources
-
1997. Proceedings., 1997 IEEE International Symposium on jun-4 jul
-
F. Willems and M. Krom. Live-and-die coding for binary piecewise i.i.d. sources. In Information Theory. 1997. Proceedings., 1997 IEEE International Symposium on, page 68, jun-4 jul 1997.
-
(1997)
Information Theory
, pp. 68
-
-
Willems, F.1
Krom, M.2
-
21
-
-
0001153192
-
Coding for a binary independent piecewise-identically-distributed source
-
Frans M. J. Willems. Coding for a binary independent piecewise-identically-distributed source. IEEE Transactions on Information Theory, 42: 2210-2217, 1996.
-
(1996)
IEEE Transactions on Information Theory
, vol.42
, pp. 2210-2217
-
-
Willems, F.M.J.1
|