-
4
-
-
78049390740
-
Policy search for motor primitives in robotics
-
Kober, J., Peters, J. (2011). Policy search for motor primitives in robotics. Machine Learning 84:171-203.
-
(2011)
Machine Learning
, vol.84
, pp. 171-203
-
-
Kober, J.1
Peters, J.2
-
5
-
-
84898982129
-
Predictive representations of state
-
Littman, M. L., Sutton, R. S., Singh, S. (2002). Predictive representations of state. In Advances in Neural Information Processing Systems 14, 1555-1561.
-
(2002)
Advances in Neural Information Processing Systems
, vol.14
, pp. 1555-1561
-
-
Littman, M.L.1
Sutton, R.S.2
Singh, S.3
-
7
-
-
84866006400
-
Multi-timescale Nexting in a Reinforcement Learning Robot
-
Proceedings of the 12th International Simulation of Conference on Adaptive Behavior
-
Modayil, J., White, A., Sutton, R. S. (2012). Multi-timescale Nexting in a Reinforcement Learning Robot. In Proceedings of the 12th International Simulation of Conference on Adaptive Behavior, LNAI 7426, 299-309
-
(2012)
LNAI
, vol.7426
, pp. 299-309
-
-
Modayil, J.1
White, A.2
Sutton, R.S.3
-
8
-
-
34047267520
-
Intrinsic Motivation Systems for Autonomous Mental Development
-
Oudeyer, P. Y., Kaplan, F., Hafner, V. (2007). Intrinsic Motivation Systems for Autonomous Mental Development. In IEEE Transactions on Evolutionary Computation 11, 265-286
-
(2007)
IEEE Transactions on Evolutionary Computation
, vol.11
, pp. 265-286
-
-
Oudeyer, P.Y.1
Kaplan, F.2
Hafner, V.3
-
9
-
-
84867427247
-
Dynamic Switching and Real-time Machine Learning for Improved Human Control of Assistive Biomedical Robots
-
Pilarski, P.M., Dawson, M.R., Degris, T.,Carey, J.P., Sutton, R.S. (2012). Dynamic Switching and Real-time Machine Learning for Improved Human Control of Assistive Biomedical Robots. In Proceedings of the 4th IEEE International Conference on Biomedical Robotics and Biomechatronics, 296-302.
-
(2012)
Proceedings of the 4th IEEE International Conference on Biomedical Robotics and Biomechatronics
, pp. 296-302
-
-
Pilarski, P.M.1
Dawson, M.R.2
Degris, T.3
Carey, J.P.4
Sutton, R.S.5
-
11
-
-
84899031920
-
Intrinsically motivated reinforcement learning
-
Singh S., Barto, A. G., Chentanez, N. (2005). Intrinsically motivated reinforcement learning. In Advances in Neural Information Processing Systems 17, 1281-1288.
-
(2005)
Advances in Neural Information Processing Systems
, vol.17
, pp. 1281-1288
-
-
Singh, S.1
Barto, A.G.2
Chentanez, N.3
-
13
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
Sutton, R. S., Precup D., Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. In Artificial Intelligence 112:181-211.
-
(1999)
Artificial Intelligence
, vol.112
, pp. 181-211
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.3
-
15
-
-
71149099079
-
Fast gradient-descent methods for temporal-difference learning with linear function approximation
-
Sutton, R. S., Maei, H. R., Precup, D., Bhatnagar, S., Silver, D., Szepesvári, Cs., Wiewiora, E. (2009). Fast gradient-descent methods for temporal-difference learning with linear function approximation. In Proceedings of the 26th International Conference on Machine Learning.
-
(2009)
Proceedings of the 26th International Conference on Machine Learning
-
-
Sutton, R.S.1
Maei, H.R.2
Precup, D.3
Bhatnagar, S.4
Silver, D.5
Szepesvári, Cs.6
Wiewiora, E.7
-
16
-
-
84899464022
-
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
-
Sutton, R. S., Modayil, J., Delp, M., Degris, T., Pilarski, P. M., White, A., Precup, D. (2011). Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction. In Proceedings of the10th International Conference on Autonomous Agents and Multiagent Systems.
-
(2011)
Proceedings of The10th International Conference on Autonomous Agents and Multiagent Systems
-
-
Sutton, R.S.1
Modayil, J.2
Delp, M.3
Degris, T.4
Pilarski, P.M.5
White, A.6
Precup, D.7
|