-
3
-
-
84866006400
-
Multi-timescale nexting in a reinforcement learning robot
-
Modayil, J., White, A., Sutton, R. S. (2012). Multi-timescale nexting in a reinforcement learning robot. In From Animals to Animals 12, 299-309.
-
(2012)
From Animals to Animals 12
, pp. 299-309
-
-
Modayil, J.1
White, A.2
Sutton, R.S.3
-
4
-
-
34047267520
-
Intrinsic motivation systems for autonomous mental development
-
Oudeyer, P. Y., Kaplan, F., Hafner, V. (2007). Intrinsic Motivation Systems for Autonomous Mental Development. In IEEE Transactions on Evolutionary Computation 11, 265-286
-
(2007)
IEEE Transactions on Evolutionary Computation 11
, pp. 265-286
-
-
Oudeyer, P.Y.1
Kaplan, F.2
Hafner, V.3
-
5
-
-
50849094213
-
Evolving internal reinforcers for an intrinsically motivated reinforcement-learning robot
-
Schembri, M., Mirolli, M., Baldassarre, G. (2007). Evolving internal reinforcers for an intrinsically motivated reinforcement-learning robot. In Development and Learning, 282-287.
-
(2007)
Development and Learning
, pp. 282-287
-
-
Schembri, M.1
Mirolli, M.2
Baldassarre, G.3
-
8
-
-
84899031920
-
Intrinsically motivated reinforcement learning
-
Singh S., Barto, A. G., Chentanez, N. (2005). Intrinsically motivated reinforcement learning. In Advances in Neural Information Processing Systems 17, 1281-1288.
-
(2005)
Advances in Neural Information Processing Systems
, vol.17
, pp. 1281-1288
-
-
Singh, S.1
Barto, A.G.2
Chentanez, N.3
-
10
-
-
71149099079
-
Fast gradient-descent methods for temporal-difference learning with linear function approximation
-
Sutton, R. S., and Maei, H. R., Precup, D., Bhatnagar, S., Silver, D., Szepesvári, Cs., Wiewiora, E. (2009). Fast gradient-descent methods for temporal-difference learning with linear function approximation. In Proceedings of the 26th International Conference on Machine Learning.
-
(2009)
Proceedings of the 26th International Conference on Machine Learning
-
-
Sutton, R.S.1
Maei, H.R.2
Precup, D.3
Bhatnagar, S.4
Silver, D.5
Szepesvári, Cs.6
Wiewiora, E.7
-
11
-
-
84899464022
-
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
-
Sutton, R. S., Modayil, J., Delp, M., Degris, T., and Pilarski, P. M., White, A., Precup, D. (2011). Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction. In Proceedings of thelOth International Conference on Autonomous Agents and Multiagent Systems.
-
(2011)
Proceedings of ThelOth International Conference on Autonomous Agents and Multiagent Systems
-
-
Sutton, R.S.1
Modayil, J.2
Delp, M.3
Degris, T.4
Pilarski, P.M.5
White, A.6
Precup, D.7
-
12
-
-
84872849054
-
Scaling life-long off-policy learning
-
White, A., Modayil, J., Sutton, R. S. (2012). Scaling life-long off-policy learning. In Development and Learning and Epigenetic Robotics, 1-6.
-
(2012)
Development and Learning and Epigenetic Robotics
, pp. 1-6
-
-
White, A.1
Modayil, J.2
Sutton, R.S.3
|