-
2
-
-
0002355083
-
Connectionist learning for control
-
W.T. Miller, R.S. Sutton, P.J. Werbos (Eds.), MIT Press, Cambridge, MA
-
A.G. Barto, Connectionist learning for control, in: W.T. Miller, R.S. Sutton, P.J. Werbos (Eds.), Neural Networks for Control, MIT Press, Cambridge, MA, 1990, pp. 5-58.
-
(1990)
Neural Networks for Control
, pp. 5-58
-
-
Barto, A.G.1
-
4
-
-
0004255998
-
-
Edward Arnold, Paris
-
G.M. Clarke, D. Cooke, A Basic Course in Statistics, 3rd ed., Edward Arnold, Paris, 1992.
-
(1992)
A Basic Course in Statistics, 3rd Ed.
-
-
Clarke, G.M.1
Cooke, D.2
-
7
-
-
84885587394
-
Benchmarks for mobile robotics?
-
Manchester University, School of Computer Science, Available as Technical Report UMCS-97-9-1
-
J. Hallam, G. Hayes, Benchmarks for mobile robotics? in: Towards Intelligent Mobile Robots: Scientific methods in mobile robotics, Manchester University, School of Computer Science, Available as Technical Report UMCS-97-9-1, 1997.
-
(1997)
Towards Intelligent Mobile Robots: Scientific Methods in Mobile Robotics
-
-
Hallam, J.1
Hayes, G.2
-
8
-
-
0346389056
-
Robolearn 97: An international workshop on evaluating robot learning
-
Department of Computer Science, State University of New York at Buffalo, April
-
H. Hexmoor, Robolearn 97: An international workshop on evaluating robot learning, Technical Report TR 97-03, Department of Computer Science, State University of New York at Buffalo, April 1997.
-
(1997)
Technical Report TR 97-03
-
-
Hexmoor, H.1
-
9
-
-
0346389059
-
-
Unpublished Masters Thesis, University of Edinburgh, Department of Artificial Intelligence, September
-
J. Hoar, Reinforcement learning applied to a real robot task, Unpublished Masters Thesis, University of Edinburgh, Department of Artificial Intelligence, September 1996.
-
(1996)
Reinforcement Learning Applied to a Real Robot Task
-
-
Hoar, J.1
-
10
-
-
0004280606
-
-
Ph.D. Thesis, Department of Computer Science, Stanford
-
L.P. Kaelbling, Learning in embedded systems, Ph.D. Thesis, Department of Computer Science, Stanford, 1990.
-
(1990)
Learning in Embedded Systems
-
-
Kaelbling, L.P.1
-
12
-
-
84957798150
-
Evaluation of learning performance of situated embodied agents
-
Morgan Kaufmann, Los Altos, CA
-
M. Mataric, Evaluation of learning performance of situated embodied agents, in: Proceedings of the Third European Conference on Artificial Life, Morgan Kaufmann, Los Altos, CA, 1995, pp. 579-589.
-
(1995)
Proceedings of the Third European Conference on Artificial Life
, pp. 579-589
-
-
Mataric, M.1
-
13
-
-
85152551400
-
Incremental multi-step Q-learning
-
W.W. Cohen, H. Hirsh (Eds.)
-
J. Peng, R.J. Williams, Incremental multi-step Q-learning, in: W.W. Cohen, H. Hirsh (Eds.), Machine Learning: Proceedings of the 11th International Conference, 1994, pp. 226-232.
-
(1994)
Machine Learning: Proceedings of the 11th International Conference
, pp. 226-232
-
-
Peng, J.1
Williams, R.J.2
-
14
-
-
0029753630
-
Reinforcement learning with replacing eligibility traces
-
S. Singh, R. Sutton, Reinforcement learning with replacing eligibility traces, Machine Learning 22 (1996) 123-158.
-
(1996)
Machine Learning
, vol.22
, pp. 123-158
-
-
Singh, S.1
Sutton, R.2
-
15
-
-
0003617454
-
-
Ph.D. Thesis, University of Massachusetts, School of Computer and Information Sciences
-
R.S. Sutton, Temporal credit assignment in reinforcement learning, Ph.D. Thesis, University of Massachusetts, School of Computer and Information Sciences, 1984.
-
(1984)
Temporal Credit Assignment in Reinforcement Learning
-
-
Sutton, R.S.1
-
16
-
-
0004102479
-
-
MIT Press/Bradford Books, Cambridge, MA
-
R.S. Sutton, A.G. Barto, Reinforcement Learning: An Introduction, MIT Press/Bradford Books, Cambridge, MA, 1998.
-
(1998)
Reinforcement Learning: An Introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
17
-
-
0004049893
-
-
Thesis, University of Cambridge, King's College, Cambridge, UK, May
-
C.J.C.H. Watkins, Learning from delayed rewards, Thesis, University of Cambridge, King's College, Cambridge, UK, May 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.J.C.H.1
-
18
-
-
0346389058
-
Learning to perceive and act
-
Department of Computer Science, University of Rochester, June
-
S.D. Whitehead, D. Ballard, Learning to perceive and act, Technical Report TR-331 (revised), Department of Computer Science, University of Rochester, June 1990.
-
(1990)
Technical Report TR-331 (Revised)
-
-
Whitehead, S.D.1
Ballard, D.2
-
19
-
-
0029250080
-
Reinforcement learning in non-Markov decision processes
-
S. Whitehead, L.-J. Lin, Reinforcement learning in non-Markov decision processes, Artificial Intelligence 73 (1995) 271-306.
-
(1995)
Artificial Intelligence
, vol.73
, pp. 271-306
-
-
Whitehead, S.1
Lin, L.-J.2
-
20
-
-
0347019040
-
Investigating the behaviour of Q(λ)
-
Department of Artificial Intelligence, Edinburgh University, January Presented at the IEE Colloquia on Self Learning Robots, February 12, 1996, London
-
J. Wyatt, G. Hayes, J. Hallam, Investigating the behaviour of Q(λ), Technical Report 783, Department of Artificial Intelligence, Edinburgh University, January 1996, Presented at the IEE Colloquia on Self Learning Robots, February 12, 1996, London.
-
(1996)
Technical Report
, vol.783
-
-
Wyatt, J.1
Hayes, G.2
Hallam, J.3
|