-
1
-
-
0030149709
-
Purposive behavior acquisition for a real robot by vision-based reinforcement learning
-
M. Asada, S. Tawaratsumida, and K. Hosoda: Purposive Behavior Acquisition for a Real Robot by Vision-Based Reinforcement Learning. Machine Learning Journal, 23:279-303, 1996.
-
(1996)
Machine Learning Journal
, vol.23
, pp. 279-303
-
-
Asada, M.1
Tawaratsumida, S.2
Hosoda, K.3
-
2
-
-
0026829832
-
Numerical potential field techniques for robot path planning
-
March/April
-
J. Barraquand, B. Langlois, and J. Latombe: Numerical potential field techniques for robot path planning. IEEE Transactions on Systems, Man, Cybernetics, 22(2):224-241, March/April 1992.
-
(1992)
IEEE Transactions on Systems, Man, Cybernetics
, vol.22
, Issue.2
, pp. 224-241
-
-
Barraquand, J.1
Langlois, B.2
Latombe, J.3
-
4
-
-
0001133021
-
Generalization in reinforcement learning: Safely approximating the value function
-
In Tesauro, G., D. S. Touretzky, and T. K. Leen (eds.); MIT Press
-
J. Boyan and A. Moore: Generalization in Reinforcement Learning: Safely approximating the value function. In Tesauro, G., D. S. Touretzky, and T. K. Leen (eds.), Advances in Neural Information Processing Systems 7 (NIPS). MIT Press, 1995.
-
(1995)
Advances in Neural Information Processing Systems 7 (NIPS)
-
-
Boyan, J.1
Moore, A.2
-
5
-
-
0011934925
-
M-ROSE: A multi robot simulation environment for learning cooperative behavior
-
In H. Asama, T. Arai, T. Fukuda, and T. Hasegawa (eds.):; Springer Verlag
-
S. Buck, M. Beetz, and T. Schmitt: M-ROSE: A Multi Robot Simulation Environment for Learning Cooperative Behavior. In H. Asama, T. Arai, T. Fukuda, and T. Hasegawa (eds.): Distributed Autonomous Robotic Systems 5, Springer Verlag, 2002.
-
(2002)
Distributed Autonomous Robotic Systems
, vol.5
-
-
Buck, S.1
Beetz, M.2
Schmitt, T.3
-
6
-
-
0033629916
-
Reinforcement learning in continuous time and space
-
K. Doya: Reinforcement Learning In Continuous Time and Space. Neural Computation, 12, 219-245, 2000.
-
(2000)
Neural Computation
, vol.12
, pp. 219-245
-
-
Doya, K.1
-
9
-
-
0003932121
-
Reinforcement learning with selective perception and hidden state
-
PhD thesis
-
A. McCallum: Reinforcement Learning with Selective Perception and Hidden State. PhD thesis, 1995.
-
(1995)
-
-
McCallum, A.1
-
10
-
-
0029514510
-
The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces
-
A. Moore and C. Atkeson: The Parti-game Algorithm for Variable Resolution Reinforcement Learning in Multidimensional State-spaces. Machine Learning Journal, 21(3):199-233, 1995.
-
(1995)
Machine Learning Journal
, vol.21
, Issue.3
, pp. 199-233
-
-
Moore, A.1
Atkeson, C.2
-
16
-
-
0004102479
-
Reinforcement learning: An introduction
-
MIT Press, Cambridge, MA
-
R.S. Sutton and A.G. Barto: Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
-
(1998)
-
-
Sutton, R.S.1
Barto, A.G.2
-
18
-
-
0003270924
-
Issues in using function approximation for reinforcement learning
-
In M. Mozer, P. Smolensky, D. Touretzky, J. Elman, and A. Weigend, editors; Hillsdale, NJ
-
S. Thrun and A. Schwartz: Issues in Using Function Approximation for Reinforcement Learning. In M. Mozer, P. Smolensky, D. Touretzky, J. Elman, and A. Weigend, editors, Proceedings of the Connectionist Models Summer School, pp. 255-263, Hillsdale, NJ, 1993.
-
(1993)
Proceedings of the Connectionist Models Summer School
, pp. 255-263
-
-
Thrun, S.1
Schwartz, A.2
-
20
-
-
0002011091
-
A menu of designs for reinforcement learning over time
-
W.T. Miller, R.S. Sutton, and P.J. Werbos, editors; MIT Press, MA, USA
-
P. Werbos: A menu of designs for reinforcement learning over time. In Neural Networks for Control, W.T. Miller, R.S. Sutton, and P.J. Werbos, editors, pp. 67-95, MIT Press, MA, USA, 1990.
-
(1990)
Neural Networks for Control
, pp. 67-95
-
-
Werbos, P.1
|