-
1
-
-
85152198941
-
Multi-Agent Reinforcement Learning: Independent vs. Cooperative Agents
-
Amherst, MA
-
M. Tan. Multi-Agent Reinforcement Learning: Independent vs. Cooperative Agents. Proc. of the Tenth International Conference on Machine Learning, Amherst, MA, 330-337, 1993
-
(1993)
Proc. of the Tenth International Conference on Machine Learning
, pp. 330-337
-
-
Tan, M.1
-
2
-
-
0000580224
-
A Temporal-Difference Model of Classical Conditioning
-
R. S. Sutton and A. G. Barto. A Temporal-Difference Model of Classical Conditioning. Tech Report GTE Labs. TR 87-509.2, 1987
-
(1987)
Tech Report GTE Labs
, vol.2
, pp. 87-509
-
-
Sutton, R.S.1
Barto, A.G.2
-
4
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
Kluwer Academic publishers
-
L.-J. Lin. Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning 8: 293-321, Kluwer Academic publishers, 1992
-
(1992)
Machine Learning
, vol.8
, pp. 293-321
-
-
Lin, L.-J.1
-
5
-
-
34249833101
-
Technical note: Q-learning
-
Kluwer Academic publishers
-
C. J. C. H. Watkins, P. D. Dayan. Technical note: Q-learning. Machine Learning 8, 3: 279-292, Kluwer Academic publishers, 1992
-
(1992)
Machine Learning 8
, vol.3
, pp. 279-292
-
-
Watkins, C.J.C.H.1
Dayan, P.D.2
-
6
-
-
0003812851
-
A study of cooperative mechanisms for faster reinforcement learning
-
Computer Science Department, University of Rochester
-
S. D. Whitehead, D. H. Ballard. A study of cooperative mechanisms for faster reinforcement learning. TR 365, Computer Science Department, University of Rochester, 1991
-
(1991)
TR 365
-
-
Whitehead, S.D.1
Ballard, D.H.2
-
8
-
-
23144457147
-
Teaching by shaping
-
Workshop on Learning by Induction vs. Learning by Demonstration, Tahoe City, CA, USA
-
C. Baroglio. Teaching by shaping. Proc. of ICML-95. Workshop on Learning by Induction vs. Learning by Demonstration, Tahoe City, CA, USA, 1995
-
(1995)
Proc. of ICML-95
-
-
Baroglio, C.1
-
9
-
-
0038849321
-
-
Gerhard Weiß and Sandip Sen, editors, Adaptation and Learning in Multiagent Systems, Springer Verlag, Berlin
-
J. A. Clouse. Learning from an automated training agent. Gerhard Weiß and Sandip Sen, editors, Adaptation and Learning in Multiagent Systems, Springer Verlag, Berlin, 1996
-
(1996)
Learning from an automated training agent
-
-
Clouse, J.A.1
-
15
-
-
0029732210
-
Creating advicetaking reinforcement learners
-
R. Maclin, J. Shavlik. Creating advicetaking reinforcement learners. Machine Learning 22: 251-281, 1997
-
(1997)
Machine Learning
, vol.22
, pp. 251-281
-
-
Maclin, R.1
Shavlik, J.2
-
16
-
-
0002797521
-
Learning in behaviour-based multi-robot systems: Policies, models and other agents
-
Elsvier
-
M. J. Mataric. Learning in behaviour-based multi-robot systems: Policies, models and other agents. Journal of Cognitive Systems Research 2: 81-93, Elsvier, 2001
-
(2001)
Journal of Cognitive Systems Research
, vol.2
, pp. 81-93
-
-
Mataric, M.J.1
-
17
-
-
0003226481
-
Primitive-based movement classification for humanoid imitation
-
Cambridge, MA, MIT
-
O. C. Jenkins, M. J. Mataric, S. Weber. Primitive-based movement classification for humanoid imitation. Proc. of the First International Conference on Humanoid Robotics (IEEE-RAS), Cambridge, MA, MIT, 2000
-
(2000)
Proc. of the First International Conference on Humanoid Robotics (IEEE-RAS)
-
-
Jenkins, O.C.1
Mataric, M.J.2
Weber, S.3
-
19
-
-
0003200022
-
Sensory-motor primitives as a basis for imitation: Linking perception to action and biology to robotics
-
C. Nehaniv & K. Dautenhahn (Eds.), MIT Press
-
M. J. Mataric. Sensory-motor primitives as a basis for imitation: Linking perception to action and biology to robotics. C. Nehaniv & K. Dautenhahn (Eds.), Imitation in animals and artifacts, MIT Press, 2001
-
(2001)
Imitation in animals and artifacts
-
-
Mataric, M.J.1
-
23
-
-
0000646059
-
Learning internal representations by error propagation
-
Foundations, Cambridge MA: MIT Press
-
D. E. Rumelhart, G. E. Hinton, R. J. Wlliams. Learning internal representations by error propagation. Parallel Distributed Processing: Exploration in the Microstructure of Cognition, vol. 1: Foundations, 318-362, Cambridge MA: MIT Press, 1986
-
(1986)
Parallel Distributed Processing: Exploration in the Microstructure of Cognition
, vol.1
, pp. 318-362
-
-
Rumelhart, D.E.1
Hinton, G.E.2
Wlliams, R.J.3
-
25
-
-
0033362601
-
Evolving artificial neural networks
-
X. Yao. Evolving artificial neural networks. Proceedings of the IEEE, 87(9), 1423-1447, 1999
-
(1999)
Proceedings of the IEEE
, vol.87
, Issue.9
, pp. 1423-1447
-
-
Yao, X.1
-
28
-
-
23144445904
-
The Improvement and Comparison of different Algorithms for Optimizing Neural Networks on the MasPar {MP}-2
-
ICSC Academic Press, Ed.M. Heiss
-
W. Erhard, T. Fink, M. M. Gutzmann, C. Rahn, A. Doering, M. Galicki, The Improvement and Comparison of different Algorithms for Optimizing Neural Networks on the MasPar {MP}-2. Neural Computation {NC}'98, ICSC Academic Press, Ed.M. Heiss, 617-623, 1998
-
(1998)
Neural Computation {NC}'98
, pp. 617-623
-
-
Erhard, W.1
Fink, T.2
Gutzmann, M.M.3
Rahn, C.4
Doering, A.5
Galicki, M.6
-
29
-
-
23144451497
-
SA-Prop: Optimization of Multilayer Perceptron Parameters using Simulated Annealing
-
P.A. Castillo, J. González, J.J. Merelo, V. Rivas, G. Romero, A. Prieto. SA-Prop: Optimization of Multilayer Perceptron Parameters using Simulated Annealing. Proc. of IWANN99, 1999
-
(1999)
Proc. of IWANN99
-
-
Castillo, P.A.1
González, J.2
Merelo, J.J.3
Rivas, V.4
Romero, G.5
Prieto, A.6
-
32
-
-
1842538384
-
-
Masters Thesis, Department of Computer Science, Colorado State University
-
T. Thorpe. Vehicle Traffic Light Control Using SARSA. Masters Thesis, Department of Computer Science, Colorado State University, 1997
-
(1997)
Vehicle Traffic Light Control Using SARSA
-
-
Thorpe, T.1
-
35
-
-
26444479778
-
-
Science, Vol, May
-
S. Kirkpatrick, C. D. Gelatt, M. P. Vecchi. Optimization by simulated Annealing. Science, Vol. 220: 671-680, May 1983
-
(1983)
Optimization by simulated Annealing
, vol.220
, pp. 671-680
-
-
Kirkpatrick, S.1
Gelatt, C.D.2
Vecchi, M.P.3
-
37
-
-
85132026293
-
Integrated architectures for learning planning and reacting based on approximating dynamic programming
-
Morgan-Kaufman
-
R. S. Sutton. Integrated architectures for learning planning and reacting based on approximating dynamic programming. Proc. of the Seventh International Conference on Machine Learning, 216-224, Morgan-Kaufman.
-
Proc. of the Seventh International Conference on Machine Learning
, pp. 216-224
-
-
Sutton, R.S.1
|