-
1
-
-
0000500817
-
Interactions between learning and evolution
-
C.G. Langton et al., eds., SFI Studies Sc. Complexity, Addison-Wesley, Reading, MA
-
D. Ackley and M. Littman, Interactions between learning and evolution, in: C.G. Langton et al., eds., Artificial Life II, SFI Studies Sc. Complexity, Vol X (Addison-Wesley, Reading, MA, 1991) 487-509.
-
(1991)
Artificial Life II
, vol.10
, pp. 487-509
-
-
Ackley, D.1
Littman, M.2
-
2
-
-
0016556021
-
A new approach to manipulator control: The cerebellar model articulation controller (CMAC)
-
J.S. Albus, A new approach to manipulator control: The cerebellar model articulation controller (CMAC), Transactions of the ASME (1975).
-
(1975)
Transactions of the ASME
-
-
Albus, J.S.1
-
4
-
-
0020970738
-
Neuronlike adaptive elements that can solve difficult learning control problems
-
A.G. Barto, R.S. Sutton and C.W. Anderson, Neuronlike adaptive elements that can solve difficult learning control problems, IEEE Transactions on Systems, Man and Cybernetics SMC-13 (1983) 834-846.
-
(1983)
IEEE Transactions on Systems, Man and Cybernetics
, vol.SMC-13
, pp. 834-846
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
-
6
-
-
30244470118
-
The ART of adaptive pattern recognition by a self-roganizing neural network
-
G.A. Carpenter and S. Grossberg, The ART of adaptive pattern recognition by a self-roganizing neural network, Proceedings of the IEEE (1988).
-
(1988)
Proceedings of the IEEE
-
-
Carpenter, G.A.1
Grossberg, S.2
-
7
-
-
0030167564
-
Behavior analysis and training - A methodology for behavior engineering
-
M. Dorigo, ed.
-
M. Colombetti, M. Dorigo and G. Borghi, Behavior analysis and training - A methodology for behavior engineering, in: M. Dorigo, ed., Special Issue on Learning Autonomous Robots, IEEE Transactions on Systems, Man and Cybernetics SMC-part B 26 (3) (1996) 365-380.
-
(1996)
Special Issue on Learning Autonomous Robots, IEEE Transactions on Systems, Man and Cybernetics SMC-part B
, vol.26
, Issue.3
, pp. 365-380
-
-
Colombetti, M.1
Dorigo, M.2
Borghi, G.3
-
8
-
-
0030171602
-
Rapid, safe and incremental learning of navigation strategies
-
M. Dorigo, ed.
-
J. del R. Millàn, Rapid, safe and incremental learning of navigation strategies, in: M. Dorigo, ed., Special Issue on Learning Autonomous Robots, IEEE Transactions on Systems, Man and Cybernetics, SMC-part B 26 (3) (1996) 408-420.
-
(1996)
Special Issue on Learning Autonomous Robots, IEEE Transactions on Systems, Man and Cybernetics, SMC-part B
, vol.26
, Issue.3
, pp. 408-420
-
-
Millàn, J.D.R.1
-
9
-
-
0028739953
-
Robot shaping: Developing autonomous agents through learning
-
M. Dorigo and M. Colombetti, Robot shaping: Developing autonomous agents through learning, Artificial Intelligence 71 (2) (1994) 321-370.
-
(1994)
Artificial Intelligence
, vol.71
, Issue.2
, pp. 321-370
-
-
Dorigo, M.1
Colombetti, M.2
-
10
-
-
30244434915
-
Extending the adaptive heuristic critic and Q-learning: From facts to implications
-
I. Alexander and J. Taylor, eds., Elsevier, Amsterdam
-
O. Holland and M. Snaith, Extending the adaptive heuristic critic and Q-learning: From facts to implications, in: I. Alexander and J. Taylor, eds., Artificial Neural Networks, Vol. 2 (Elsevier, Amsterdam, 1992) 599-602.
-
(1992)
Artificial Neural Networks
, vol.2
, pp. 599-602
-
-
Holland, O.1
Snaith, M.2
-
14
-
-
0000123778
-
Self-improving reative agents based on reinforcement learning, planning and teaching
-
L.J. Lin, Self-improving reative agents based on reinforcement learning, planning and teaching, Machine Learning 8 (1992) 293-321.
-
(1992)
Machine Learning
, vol.8
, pp. 293-321
-
-
Lin, L.J.1
-
15
-
-
0003673017
-
-
Ph.D. Thesis, in: C.G. Langton et al., eds., Carnegie Mellon University, Pittsburgh, CMU-CS-93-103
-
L.-J. Lin, Reinforcement learning for robots using neural networks, Ph.D. Thesis, in: C.G. Langton et al., eds., Carnegie Mellon University, Pittsburgh, CMU-CS-93-103, 1993.
-
(1993)
Reinforcement Learning for Robots Using Neural Networks
-
-
Lin, L.-J.1
-
16
-
-
0026880130
-
Automatic programming of behavior-based robots using reinforcement learning
-
S. Mahadevan and J. Connell, Automatic programming of behavior-based robots using reinforcement learning, Artificial Intelligence 55 (2) (1991) 311-365.
-
(1991)
Artificial Intelligence
, vol.55
, Issue.2
, pp. 311-365
-
-
Mahadevan, S.1
Connell, J.2
-
17
-
-
84957895797
-
Reward function for accelerated learning
-
W.W. Cohen and H. Hirsh, eds., Morgan Kaufman, LOS Altos, CA
-
M. Mataric, Reward function for accelerated learning, in: W.W. Cohen and H. Hirsh, eds., Proc. of 11th Int. Conf. on Machine Learning (Morgan Kaufman, LOS Altos, CA, 1994).
-
(1994)
Proc. of 11th Int. Conf. on Machine Learning
-
-
Mataric, M.1
-
18
-
-
84957622922
-
Using transitional proximity for faster reinforcement learning
-
Morgan Kaufman, LOS Altos, CA
-
R.A. McCallum, Using transitional proximity for faster reinforcement learning, Proc. 9th Int. Conf. on Machine Learning (Morgan Kaufman, LOS Altos, CA, 1992).
-
(1992)
Proc. 9th Int. Conf. on Machine Learning
-
-
McCallum, R.A.1
-
20
-
-
0000646059
-
Learning internal representations by error propagation
-
D. Rumelhart and J. Mc Clelland, eds., MIT Press, Cambridge, MA
-
D. Rumelhart, G. Hinton and R. Williams, Learning internal representations by error propagation, in: D. Rumelhart and J. Mc Clelland, eds., Parallel Distributed Processing, Vol. 1 (MIT Press, Cambridge, MA, 1986) 318-362.
-
(1986)
Parallel Distributed Processing
, vol.1
, pp. 318-362
-
-
Rumelhart, D.1
Hinton, G.2
Williams, R.3
-
21
-
-
0009283096
-
Reinforcement learning and neural reinforcement learning
-
M. Verleysen, ed., D-Facto Publication, Brussels
-
S. Sehad and C. Touzet, Reinforcement learning and neural reinforcement learning, in: M. Verleysen, ed., ESANN 94 (D-Facto Publication, Brussels, 1994).
-
(1994)
ESANN 94
-
-
Sehad, S.1
Touzet, C.2
-
22
-
-
0011200414
-
Reinforcement learning architectures for a animats
-
J.-A. Meyer and S.W. Wilson, eds., MIT Press, Cambridge, MA
-
R.S. Sutton, Reinforcement learning architectures for a animats, in: J.-A. Meyer and S.W. Wilson, eds., Proc. 1st Int. Conf. on Simulation of Adaptive Behavior, From Animals to Animats (MIT Press, Cambridge, MA, 1991) 288-296.
-
(1991)
Proc. 1st Int. Conf. on Simulation of Adaptive Behavior, From Animals to Animats
, pp. 288-296
-
-
Sutton, R.S.1
-
23
-
-
30244464358
-
The connectionist sequential machine: A general model of sequential networks
-
Canberra P. Leong and M. Jabri, eds., Sydney University Electrical Engineering NSW 2006
-
C. Touzet and N. Giambiasi, The connectionist sequential machine: A general model of sequential networks, in: Canberra P. Leong and M. Jabri, eds., Australian Conf. on Neural Networks (Sydney University Electrical Engineering NSW 2006 (1992).
-
(1992)
Australian Conf. on Neural Networks
-
-
Touzet, C.1
Giambiasi, N.2
-
24
-
-
30244440305
-
Application of connectionist models to fuzzy inference systems
-
B. Fronhöfer and G. Wrightson, eds., Lectures Notes in Artifical Intelligence, Springer, Berlin
-
C. Touzet and N. Giambiasi, Application of connectionist models to fuzzy inference systems, in: B. Fronhöfer and G. Wrightson, eds., Parallelization in Inference Systems, Lectures Notes in Artifical Intelligence, Vol. 590 (Springer, Berlin, 1992).
-
(1992)
Parallelization in Inference Systems
, vol.590
-
-
Touzet, C.1
Giambiasi, N.2
-
25
-
-
30244514264
-
Improving reinforcement learning of obstacle avoidance behavior with forbidden sequences of actions
-
Cancun, Mexico
-
C. Touzet, S. Sehad and N. Giambiasi, Improving reinforcement learning of obstacle avoidance behavior with forbidden sequences of actions, Int. Conf. on Robotics and Manufacturing, Cancun, Mexico (1995).
-
(1995)
Int. Conf. on Robotics and Manufacturing
-
-
Touzet, C.1
Sehad, S.2
Giambiasi, N.3
-
26
-
-
0003117086
-
An imitation of life
-
W.G. Walter, An imitation of life, Scientific American 182 (5) (1950) 42-45.
-
(1950)
Scientific American
, vol.182
, Issue.5
, pp. 42-45
-
-
Walter, W.G.1
-
27
-
-
0004049893
-
-
Ph.D. Thesis, King's College, Cambridge, UK
-
C.J.C.H. Watkins, Learning from delayed rewards, Ph.D. Thesis, King's College, Cambridge, UK, 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.J.C.H.1
|