-
1
-
-
0003812851
-
-
Computer Science Dept., University of Rochester, NY
-
S.D. Whitehead, A Study of Cooperative Mechanisms for Faster Reinforcement Learning, TR-365, Computer Science Dept., University of Rochester, NY, 1991.
-
(1991)
A Study of Cooperative Mechanisms for Faster Reinforcement Learning, TR-365
-
-
Whitehead, S.D.1
-
2
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
R.S. Sutton, "Learning to predict by the methods of temporal differences," Mach. Learn., 3, 9-44 (1988).
-
(1988)
Mach. Learn.
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
3
-
-
0003617454
-
-
Ph.D. Thesis, Department of Computer and Information Science, University of Massachusetts, Amherst, MA
-
R.S. Sutton, "Temporal credit assignment in reinforcement learning," Ph.D. Thesis, Department of Computer and Information Science, University of Massachusetts, Amherst, MA, 1984.
-
(1984)
Temporal Credit Assignment in Reinforcement Learning
-
-
Sutton, R.S.1
-
4
-
-
0002201501
-
Learning and sequential decision making
-
M. Gabriel and J. W. Moore, Eds. MIT Press, Bradford Books, Cambridge, MA
-
A.G. Barto, R.S. Sutton, and C.J.C.H. Watkins, "Learning and sequential decision making," in Learning and Computational Neuroscience: Foundations of Adaptive Networks, M. Gabriel and J. W. Moore, Eds. MIT Press, Bradford Books, Cambridge, MA, 1990.
-
(1990)
Learning and Computational Neuroscience: Foundations of Adaptive Networks
-
-
Barto, A.G.1
Sutton, R.S.2
Watkins, C.J.C.H.3
-
5
-
-
0024735689
-
Classifier systems and genetic algorithms
-
L. Booker, D.E. Goldberg, and J.H. Holland, "Classifier systems and genetic algorithms," Artif. Intell., 40, 235-282 (1989).
-
(1989)
Artif. Intell.
, vol.40
, pp. 235-282
-
-
Booker, L.1
Goldberg, D.E.2
Holland, J.H.3
-
6
-
-
0004049895
-
-
Ph.D. Dissertation, Psychology Department, University of Cambridge, England
-
C.J.C.H. Watkins, "Learning from delayed rewards," Ph.D. Dissertation, Psychology Department, University of Cambridge, England, 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.J.C.H.1
-
8
-
-
0003411271
-
Efficient Exploration in Reinforcement Learning
-
Carnegie Mellon University, Pittsburgh, PA
-
S.B. Thrun, Efficient Exploration in Reinforcement Learning, Technical Report CMU-CS-92-102, Carnegie Mellon University, Pittsburgh, PA, 1992.
-
(1992)
Technical Report CMU-CS-92-102
-
-
Thrun, S.B.1
-
9
-
-
0003782780
-
-
MIT Press, Bradford Books, Cambridge, MA
-
M. Dorigo and M. Colombetti, Robot Shaping: An Experiment in Behavior Engineering, MIT Press, Bradford Books, Cambridge, MA, 1997.
-
(1997)
Robot Shaping: An Experiment in Behavior Engineering
-
-
Dorigo, M.1
Colombetti, M.2
-
10
-
-
0029326107
-
ALECSYS and the autonoMouse: Learning to control a real robot by distributed classifier systems
-
M. Dorigo, "ALECSYS and the autonoMouse: Learning to control a real robot by distributed classifier systems," Mach. Learn., 19, 209-240 (1995).
-
(1995)
Mach. Learn.
, vol.19
, pp. 209-240
-
-
Dorigo, M.1
-
11
-
-
0028739953
-
Robot shaping: Developing autonomous agents through learning
-
M. Dorigo and M. Colombetti, "Robot shaping: Developing autonomous agents through learning," Artif. Intell., 71, 321-370 (1994).
-
(1994)
Artif. Intell.
, vol.71
, pp. 321-370
-
-
Dorigo, M.1
Colombetti, M.2
-
12
-
-
0001963114
-
The role of the trainer in reinforcement learning
-
New Brunswick, NJ
-
M. Dorigo and M. Colombetti, "The role of the trainer in reinforcement learning," Proceedings of the MLC-COLT '94 Workshop on Robot Learning, New Brunswick, NJ, 1994, pp. 37-45.
-
(1994)
Proceedings of the MLC-COLT '94 Workshop on Robot Learning
, pp. 37-45
-
-
Dorigo, M.1
Colombetti, M.2
-
13
-
-
85132026293
-
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
-
Morgan Kaufmann, San Mateo, CA
-
R.S. Sutton, "Integrated architectures for learning, planning, and reacting based on approximating dynamic programming," Proceedings of the Seventh International Conference on Machine Learning, Morgan Kaufmann, San Mateo, CA, 1990, pp. 216-224.
-
(1990)
Proceedings of the Seventh International Conference on Machine Learning
, pp. 216-224
-
-
Sutton, R.S.1
-
14
-
-
0003673017
-
-
Ph.D. Thesis, Carnegie Mellon University, Pittsburgh, PA
-
L.-J. Lin, "Reinforcement learning for robots using neural networks," Ph.D. Thesis, Carnegie Mellon University, Pittsburgh, PA, 1993.
-
(1993)
Reinforcement Learning for Robots Using Neural Networks
-
-
Lin, L.-J.1
-
15
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
L.-J. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching," Mach. Learn., 8, 293-322 (1992).
-
(1992)
Mach. Learn.
, vol.8
, pp. 293-322
-
-
Lin, L.-J.1
-
17
-
-
0030149709
-
Purposive behavior acquisition for a real robot by vision-based reinforcement learning
-
M. Asada, S. Noda, S. Tawaratsumida, and K. Hosoda, "Purposive behavior acquisition for a real robot by vision-based reinforcement learning," Mach. Learn., 23, 279-303 (1996).
-
(1996)
Mach. Learn.
, vol.23
, pp. 279-303
-
-
Asada, M.1
Noda, S.2
Tawaratsumida, S.3
Hosoda, K.4
-
18
-
-
0026880130
-
Automatic programming of behavior-based robots using reinforcement learning
-
S. Mahadevan and J. Connell, "Automatic programming of behavior-based robots using reinforcement learning," Artif. Intell., 55, 311-365 (1992).
-
(1992)
Artif. Intell.
, vol.55
, pp. 311-365
-
-
Mahadevan, S.1
Connell, J.2
-
19
-
-
0007908166
-
Experiments with reinforcement learning in problems with continuous state and action spaces
-
Department of Computer Science, University of Massachusetts, Amherst, MA
-
J.C. Santamaria, R.S. Sutton, and A. Ram, "Experiments with reinforcement learning in problems with continuous state and action spaces," Technical Report UM-CS-1996-088, Department of Computer Science, University of Massachusetts, Amherst, MA, 1996.
-
(1996)
Technical Report UM-CS-1996-088
-
-
Santamaria, J.C.1
Sutton, R.S.2
Ram, A.3
-
20
-
-
0028730301
-
Fuzzy Q-learning and dynamical fuzzy Q-learning
-
IEEE Press, Piscataway, NJ
-
P.-Y. Glorennec, "Fuzzy Q-learning and dynamical fuzzy Q-learning," Proceedings of the Third IEEE International Conference on Fuzzy Systems, IEEE Press, Piscataway, NJ, 1994, pp. 474-479.
-
(1994)
Proceedings of the Third IEEE International Conference on Fuzzy Systems
, pp. 474-479
-
-
Glorennec, P.-Y.1
-
21
-
-
0030395785
-
Refining linear fuzzy rules by reinforcement learning
-
IEEE Press, Piscataway, NJ
-
H.R. Berenji, P.S. Khedkar, and A. Malkani, "Refining linear fuzzy rules by reinforcement learning," Proceedings of the Fifth IEEE International Conference on Fuzzy Systems, IEEE Press, Piscataway, NJ, 1996, pp. 1750-1756.
-
(1996)
Proceedings of the Fifth IEEE International Conference on Fuzzy Systems
, pp. 1750-1756
-
-
Berenji, H.R.1
Khedkar, P.S.2
Malkani, A.3
-
22
-
-
0023540098
-
Manual training techniques of autonomous systems based on artificial neural networks
-
IEEE Press, Piscataway, NJ
-
J.F. Shepanski and S.A. Macy, "Manual training techniques of autonomous systems based on artificial neural networks," Proceedings of the IEEE First Annual International Conference on Neural Networks, IEEE Press, Piscataway, NJ, 1987, pp. 697-704.
-
(1987)
Proceedings of the IEEE First Annual International Conference on Neural Networks
, pp. 697-704
-
-
Shepanski, J.F.1
Macy, S.A.2
-
23
-
-
0345843391
-
Achieving rapid adaptations in robots by means of external tuition
-
MIT Press, Cambridge, MA
-
U. Nehmzow and B. McGonigle, "Achieving rapid adaptations in robots by means of external tuition," Proceedings of From Animals to Animats, Third International Conference on Simulation of Adaptive Behaviour (SAB94), MIT Press, Cambridge, MA, 1994, pp. 301-308.
-
(1994)
Proceedings of From Animals to Animats, Third International Conference on Simulation of Adaptive Behaviour (SAB94)
, pp. 301-308
-
-
Nehmzow, U.1
McGonigle, B.2