-
2
-
-
21844465127
-
Tree-based batch mode reinforcement learning
-
D. Ernst, P. Geurts, and L. Wehenkel. Tree-based batch mode reinforcement learning. J. Mach. Learn. Res., 6:503-556, 2005.
-
(2005)
J. Mach. Learn. Res
, vol.6
, pp. 503-556
-
-
Ernst, D.1
Geurts, P.2
Wehenkel, L.3
-
5
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
L.-J. Lin. Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning, 8:293-321, 1992.
-
(1992)
Machine Learning
, vol.8
, pp. 293-321
-
-
Lin, L.-J.1
-
6
-
-
50249100331
-
Users manual: RoboCup soccer server -for soccer server version 7.07 and later
-
August
-
M.Chen, E.Foroughi, F.Heintz, Z.Huang, S.Kapetanakis, K.Kostiadis, J.Kummeneje, I.Noda, O.Obst, P.Riley, T.Steffens, Y.Waug, and X.Yin. Users manual: RoboCup soccer server -for soccer server version 7.07 and later. The RoboCup Federation, August 2002.
-
(2002)
The RoboCup Federation
-
-
Chen, M.1
Foroughi, E.2
Heintz, F.3
Huang, Z.4
Kapetanakis, S.5
Kostiadis, K.6
Kummeneje, J.7
Noda, I.8
Obst, O.9
Riley, P.10
Steffens, T.11
Waug, Y.12
Yin, X.13
-
7
-
-
84898980684
-
Autonomous helicopter flight via reinforcement learning
-
S. Thrun, L. Saul, and B. Schölkopf, editors, MIT Press, Cambridge, MA
-
A. Y. Ng, H. J. Kim, M. I. Jordan, and S. Sastry. Autonomous helicopter flight via reinforcement learning. In S. Thrun, L. Saul, and B. Schölkopf, editors, Advances in Neural Information Processing Systems 16. MIT Press, Cambridge, MA, 2004.
-
(2004)
Advances in Neural Information Processing Systems 16
-
-
Ng, A.Y.1
Kim, H.J.2
Jordan, M.I.3
Sastry, S.4
-
8
-
-
33646398129
-
Neural fitted q iteration - first experiences with a data efficient neural reinforcement learning method
-
J. Gama, R. Camacho, P. Brazdil, A. Jorge, and L. Torgo, editors, ECML, of, Springer
-
M. Riedmiller. Neural fitted q iteration - first experiences with a data efficient neural reinforcement learning method. In J. Gama, R. Camacho, P. Brazdil, A. Jorge, and L. Torgo, editors, ECML, volume 3720 of Lecture Notes in Computer Science, pages 317-328. Springer, 2005.
-
(2005)
Lecture Notes in Computer Science
, vol.3720
, pp. 317-328
-
-
Riedmiller, M.1
-
9
-
-
0003636089
-
On-line Q-learning using connectionist systems
-
Cambridge University Engineering Department
-
G. A. Rummery and M. Niranjan. On-line Q-learning using connectionist systems. Technical Report CUED/F-INFENG/TR 166, Cambridge University Engineering Department, 1994.
-
(1994)
Technical Report CUED/F-INFENG/TR
, vol.166
-
-
Rummery, G.A.1
Niranjan, M.2
-
10
-
-
37249034293
-
Keepaway soccer: From machine learning testbed to benchmark
-
P. Stone, G. Kuhlmann, M. E. Taylor, and Y. Liu. Keepaway soccer: From machine learning testbed to benchmark. R.oboCup-2005: Robot Soccer World Cup IX, 4020:93-105, 2006.
-
(2006)
R.oboCup-2005: Robot Soccer World Cup IX
, vol.4020
, pp. 93-105
-
-
Stone, P.1
Kuhlmann, G.2
Taylor, M.E.3
Liu, Y.4
-
11
-
-
27544506565
-
Reinforcement learning for RoboCup-soccer keepaway
-
P. Stone, R. S. Sutton, and G. Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive. Behavior, 13(3):165-188, 2005.
-
(2005)
Adaptive. Behavior
, vol.13
, Issue.3
, pp. 165-188
-
-
Stone, P.1
Sutton, R.S.2
Kuhlmann, G.3
-
13
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
R. S. Sutton, D. Precup, and S. P. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1-2):181-211, 1999.
-
(1999)
Artificial Intelligence
, vol.112
, Issue.1-2
, pp. 181-211
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.P.3
-
15
-
-
27544473171
-
Behavior transfer for value-function-based reinforcement learning
-
F. Dignum, V. Dignum, S. Koenig, S. Kraus, M. P. Singh, and M. Wooldridge, editors, New York, NY, July, ACM Press
-
M. E. Taylor and P. Stone. Behavior transfer for value-function-based reinforcement learning. In F. Dignum, V. Dignum, S. Koenig, S. Kraus, M. P. Singh, and M. Wooldridge, editors, The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems, pages 53-59, New York, NY, July 2005. ACM Press.
-
(2005)
The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems
, pp. 53-59
-
-
Taylor, M.E.1
Stone, P.2
|