-
3
-
-
0141727204
-
Evolving team Darwin United
-
Asada, M., and Kitano, H., eds. Berlin: Springer Verlag
-
Andre, D., and Teller, A. 1999. Evolving team Darwin United. In Asada, M., and Kitano, H., eds., RoboCup-98: Robot Soccer World Cup II. Berlin: Springer Verlag.
-
(1999)
RoboCup-98: Robot Soccer World Cup II
-
-
Andre, D.1
Teller, A.2
-
4
-
-
0006221144
-
Vision-based behavior acquisition for a shooting robot by using a reinforcement learning
-
Asada, M.; Noda, S.; Tawaratsumida, S.; and Hosoda, K. 1994. Vision-based behavior acquisition for a shooting robot by using a reinforcement learning. In Proc. of IAPR/IEEE Workshop on Visual Behaviors-1994, 112-118.
-
(1994)
Proc. of IAPR/IEEE Workshop on Visual Behaviors-1994
, pp. 112-118
-
-
Asada, M.1
Noda, S.2
Tawaratsumida, S.3
Hosoda, K.4
-
5
-
-
24844475430
-
Robot Shaping: Developing Situated Agents through Learning
-
International Computer Science Institute, Berkeley, CA
-
Colombetti, M., and Dorigo, M. 1993. Robot Shaping: Developing Situated Agents through Learning. Technical Report TR-92-040, International Computer Science Institute, Berkeley, CA.
-
(1993)
Technical Report
, vol.TR-92-040
-
-
Colombetti, M.1
Dorigo, M.2
-
6
-
-
0003259931
-
Improving elevator performance using reinforcement learning
-
Touretzky, D. S.; Mozer, M. C.; and Hasselmo, M. E., eds. Cambridge, MA: MIT Press
-
Crites, R. H., and Barto, A. G. 1996. Improving elevator performance using reinforcement learning. In Touretzky, D. S.; Mozer, M. C.; and Hasselmo, M. E., eds., Advances in Neural Information Processing Systems 8. Cambridge, MA: MIT Press.
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
-
-
Crites, R.H.1
Barto, A.G.2
-
8
-
-
0043247546
-
Accelerating reinforcement learning by composing solutions of automatically identified subtasks
-
Drummond, C. 2002. Accelerating reinforcement learning by composing solutions of automatically identified subtasks. Journal of Artificial Intelligence Research 16:59-104.
-
(2002)
Journal of Artificial Intelligence Research
, vol.16
, pp. 59-104
-
-
Drummond, C.1
-
10
-
-
22944468731
-
Approximate policy iteration with a policy language bias
-
Thrun, S.; Saul, L.; and Schölkopf, B., eds. Cambridge, MA: MIT Press
-
Fern, A.; Yoon, S.; and Givan, R. 2004. Approximate policy iteration with a policy language bias. In Thrun, S.; Saul, L.; and Schölkopf, B., eds., Advances in Neural Information Processing Systems 16. Cambridge, MA: MIT Press.
-
(2004)
Advances in Neural Information Processing Systems
, vol.16
-
-
Fern, A.1
Yoon, S.2
Givan, R.3
-
17
-
-
0003229379
-
Karlsruhe brainstormers - A reinforcement learning approach to robotic soccer
-
Stone, P.; Balch, T.; and Kraetzschmar, G., eds. Berlin: Springer Verlag
-
Riedmiller, M.; Merke, A.; Meier, D.; Hoffman, A.; Sinner, A.; Thate, O.; and Ehrmann, R. 2001. Karlsruhe brainstormers - a reinforcement learning approach to robotic soccer. In Stone, P.; Balch, T.; and Kraetzschmar, G., eds., RoboCup-2000: Robot Soccer World Cup IV. Berlin: Springer Verlag.
-
(2001)
RoboCup-2000: Robot Soccer World Cup IV
-
-
Riedmiller, M.1
Merke, A.2
Meier, D.3
Hoffman, A.4
Sinner, A.5
Thate, O.6
Ehrmann, R.7
-
19
-
-
0001027894
-
Transfer of learning by composing solutions of elemental sequential tasks
-
Singh, S. P. 1992. Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning 8:323-339.
-
(1992)
Machine Learning
, vol.8
, pp. 323-339
-
-
Singh, S.P.1
-
21
-
-
84944901151
-
The CMUnited-99 champion simulator team
-
Veloso, M.; Pagello, E.; and Kitano, H., eds. Berlin: Springer
-
Stone, P.; Riley, P.; and Veloso, M. 2000. The CMUnited-99 champion simulator team. In Veloso, M.; Pagello, E.; and Kitano, H., eds., RoboCup-99: Robot Soccer World Cup III. Berlin: Springer. 35-48.
-
(2000)
RoboCup-99: Robot Soccer World Cup III
, pp. 35-48
-
-
Stone, P.1
Riley, P.2
Veloso, M.3
-
22
-
-
27544506565
-
Reinforcement learning for RoboCup-soccer keepaway
-
To appear
-
Stone, P.; Sutton, R. S.; and Kuhlmann, G. 2005. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior. To appear.
-
(2005)
Adaptive Behavior
-
-
Stone, P.1
Sutton, R.S.2
Kuhlmann, G.3
-
25
-
-
0000985504
-
TD-Gammon, a self-teaching backgammon program, achieves master-level play
-
Tesauro, G. 1994. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation 6(2):215-219.
-
(1994)
Neural Computation
, vol.6
, Issue.2
, pp. 215-219
-
-
Tesauro, G.1