-
1
-
-
0030149709
-
Purposive behavior acquisition for a real robot by vision-based reinforcement learning
-
M. Asada, S. Noda, S. Tawaratsumida, K. Hosoda, Purposive behavior acquisition for a real robot by vision-based reinforcement learning, Machine Learning 23 (2-3) (1996) 279-303.
-
(1996)
Machine Learning
, vol.23
, Issue.2-3
, pp. 279-303
-
-
Asada, M.1
Noda, S.2
Tawaratsumida, S.3
Hosoda, K.4
-
2
-
-
0004142826
-
-
Oxford University Press, Oxford
-
S.A. Barnett, Modern Ethology, Oxford University Press, Oxford, 1981.
-
(1981)
Modern Ethology
-
-
Barnett, S.A.1
-
3
-
-
0010922732
-
Empirically derived adaptive elements and networks simulate associative learning
-
Lawrence Erlbaum, Hillsdale, NJ
-
D.A. Baxter, D.V. Buonomano, J.L. Raymond, D.G. Cook, F.M. Kuenzi, T.J. Carew, J.H. Byrne, Empirically derived adaptive elements and networks simulate associative learning, in: Neural Network Models of Conditioning and Action, Lawrence Erlbaum, Hillsdale, NJ, 1991, pp. 13-52.
-
(1991)
Neural Network Models of Conditioning and Action
, pp. 13-52
-
-
Baxter, D.A.1
Buonomano, D.V.2
Raymond, J.L.3
Cook, D.G.4
Kuenzi, F.M.5
Carew, T.J.6
Byrne, J.H.7
-
8
-
-
0028025030
-
A novel automated touchscreen procedure for assessing learning in the rat using computer graphic stimuli
-
T.J. Bussey, J.L. Muir, T.W. Robbins, A novel automated touchscreen procedure for assessing learning in the rat using computer graphic stimuli, Neuroscience Research Communications 15 (2) (1994) 103-109.
-
(1994)
Neuroscience Research Communications
, vol.15
, Issue.2
, pp. 103-109
-
-
Bussey, T.J.1
Muir, J.L.2
Robbins, T.W.3
-
10
-
-
85152521744
-
A teaching method for reinforcement learning
-
Morgan Kaufmann, Los Altos, CA
-
J.A. Clouse, P.E. Utgoff, A teaching method for reinforcement learning, in: Proceedings of the Ninth Conference on Machine Learning, Morgan Kaufmann, Los Altos, CA, 1992.
-
(1992)
Proceedings of the Ninth Conference on Machine Learning
-
-
Clouse, J.A.1
Utgoff, P.E.2
-
11
-
-
0030167564
-
Behavior analysis and training: A methodology for behavior engineering
-
M. Colombetti, M. Dorigo, G. Borghi, Behavior analysis and training: A methodology for behavior engineering, IEEE Transactions on Systems, Man, and Cybernetics - Part B 26 (3) (1996) 365-380.
-
(1996)
IEEE Transactions on Systems, Man, and Cybernetics - Part B
, vol.26
, Issue.3
, pp. 365-380
-
-
Colombetti, M.1
Dorigo, M.2
Borghi, G.3
-
12
-
-
0010786206
-
Instrumental conditioning
-
N.J. Mackintosh (Ed.), Academic Press, Orlando, FL
-
A. Dickinson, Instrumental conditioning, in: N.J. Mackintosh (Ed.), Handbook of Perception and Cognition, vol. 9, Academic Press, Orlando, FL, 1995.
-
(1995)
Handbook of Perception and Cognition
, vol.9
-
-
Dickinson, A.1
-
14
-
-
0028739953
-
Robot shaping: Developing autonomous agents through learning
-
M. Dorigo, M. Colombetti, Robot shaping: Developing autonomous agents through learning, Artificial Intelligence 70 (2) (1994) 321-370.
-
(1994)
Artificial Intelligence
, vol.70
, Issue.2
, pp. 321-370
-
-
Dorigo, M.1
Colombetti, M.2
-
17
-
-
0003182781
-
A multistrategy learning scheme for agent knowledge acquisition
-
D. Gordon, D. Subramanian, A multistrategy learning scheme for agent knowledge acquisition, Informatica 17 (1994) 331-346.
-
(1994)
Informatica
, vol.17
, pp. 331-346
-
-
Gordon, D.1
Subramanian, D.2
-
18
-
-
0027375966
-
Hippocampal cell firing correlates of delayed-match-to-sample performance in the rat
-
R.E. Hampson, C.J. Heyser, S.A. Deadwyler, Hippocampal cell firing correlates of delayed-match-to-sample performance in the rat, Behavioral Neuroscience 107 (5) (1993) 715-739.
-
(1993)
Behavioral Neuroscience
, vol.107
, Issue.5
, pp. 715-739
-
-
Hampson, R.E.1
Heyser, C.J.2
Deadwyler, S.A.3
-
22
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning, and teaching
-
L.-J. Lin, Self-improving reactive agents based on reinforcement learning, planning, and teaching, Machine Learning 8 (1992) 293-321.
-
(1992)
Machine Learning
, vol.8
, pp. 293-321
-
-
Lin, L.-J.1
-
23
-
-
0029732210
-
Creating advice-taking reinforcement learners
-
R. Maclin, J.W. Shavlik, Creating advice-taking reinforcement learners, Machine Learning 22 (1-3) (1996) 251-281.
-
(1996)
Machine Learning
, vol.22
, Issue.1-3
, pp. 251-281
-
-
Maclin, R.1
Shavlik, J.W.2
-
24
-
-
0010862056
-
Learning efficient reactive behavioral sequences from basic reflexes in a goal-directed autonomous robot
-
MIT Press, Cambridge, MA
-
J. del R. Millán, Learning efficient reactive behavioral sequences from basic reflexes in a goal-directed autonomous robot, in: From Animals to Animats 3: Proceedings of the Third International Conference on Simulation of Adaptive Behavior, MIT Press, Cambridge, MA, 1994, pp. 266-274.
-
(1994)
From Animals to Animats 3: Proceedings of the Third International Conference on Simulation of Adaptive Behavior
, pp. 266-274
-
-
Millán, J.D.R.1
-
25
-
-
0030171602
-
Rapid, safe, and incremental learning of navigation strategies
-
J. del R. Millán, Rapid, safe, and incremental learning of navigation strategies, IEEE Transactions on Systems, Man, and Cybernetics - Part B 26 (3) (1996) 408-420.
-
(1996)
IEEE Transactions on Systems, Man, and Cybernetics - Part B
, vol.26
, Issue.3
, pp. 408-420
-
-
Millán, J.D.R.1
-
26
-
-
0019089514
-
A model for Pavlovian learning: Variations in effectiveness of conditioned but not unconditioned stimuli
-
J.M. Pearce, G. Hall, A model for Pavlovian learning: Variations in effectiveness of conditioned but not unconditioned stimuli, Psychological Review 87 (6) (1980) 532-552.
-
(1980)
Psychological Review
, vol.87
, Issue.6
, pp. 532-552
-
-
Pearce, J.M.1
Hall, G.2
-
27
-
-
0023795385
-
Escalation of feline predation along a gradient from avoidance through play to killing
-
S.M. Pellis, D.P. O'Brien, V.C. Pellis, P. Teitelbaum, D.L. Wolgin, S. Kennedy, Escalation of feline predation along a gradient from avoidance through play to killing, Behavioral Neuroscience 102 (5) (1988) 760-777.
-
(1988)
Behavioral Neuroscience
, vol.102
, Issue.5
, pp. 760-777
-
-
Pellis, S.M.1
O'Brien, D.P.2
Pellis, V.C.3
Teitelbaum, P.4
Wolgin, D.L.5
Kennedy, S.6
-
28
-
-
30244452051
-
Robot shaping - Principles, methods, and architectures
-
S. Perkins, G. Hayes, Robot shaping - principles, methods, and architectures, in: Workshop on Learning in Robots and Animals, AISB'96, 1996.
-
(1996)
Workshop on Learning in Robots and Animals, AISB'96
-
-
Perkins, S.1
Hayes, G.2
-
30
-
-
0026923467
-
A learning rule based on empirically derived activity-dependent neuromodulation supports operant conditioning in a small network
-
J.L. Raymond, D.A. Baxter, D.V. Buonomano, J.H. Byrne, A learning rule based on empirically derived activity-dependent neuromodulation supports operant conditioning in a small network, Neural Networks 5 (5) (1992) 789-803.
-
(1992)
Neural Networks
, vol.5
, Issue.5
, pp. 789-803
-
-
Raymond, J.L.1
Baxter, D.A.2
Buonomano, D.V.3
Byrne, J.H.4
-
31
-
-
0002109138
-
A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
-
A.H. Black, W.F. Prokasy (Eds.), Appleton-Century-Crofts, New York
-
R.A. Rescorla, A.R. Wagner, A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement, in: A.H. Black, W.F. Prokasy (Eds.), Classical Conditioning II: Theory and Research, Appleton-Century-Crofts, New York, 1972.
-
(1972)
Classical Conditioning II: Theory and Research
-
-
Rescorla, R.A.1
Wagner, A.R.2
-
32
-
-
0004183870
-
-
Scott, Foresman, Glenview, IL
-
G.S. Reynolds, A Primer of Operant Conditioning, Scott, Foresman, Glenview, IL, 1968.
-
(1968)
A Primer of Operant Conditioning
-
-
Reynolds, G.S.1
-
35
-
-
0030705553
-
A modular architecture for office delivery robots
-
February
-
R. Simmons, R. Goodwin, K. Haigh, S. Koenig, J. O'Sullivan, A modular architecture for office delivery robots, in: The Proceedings of the First International Conference on Autonomous Agents, February 1997.
-
(1997)
The Proceedings of the First International Conference on Autonomous Agents
-
-
Simmons, R.1
Goodwin, R.2
Haigh, K.3
Koenig, S.4
O'Sullivan, J.5
-
36
-
-
0001027894
-
Transfer of learning across sequential tasks
-
S.P. Singh, Transfer of learning across sequential tasks, Machine Learning 8 (1992) 323-339.
-
(1992)
Machine Learning
, vol.8
, pp. 323-339
-
-
Singh, S.P.1
-
37
-
-
0019537951
-
Toward a modern theory of adaptive networks: Expectation and prediction
-
R.S. Sutton, A.G. Barto, Toward a modern theory of adaptive networks: Expectation and prediction, Psychological Review 88 (1981) 135-170.
-
(1981)
Psychological Review
, vol.88
, pp. 135-170
-
-
Sutton, R.S.1
Barto, A.G.2
-
39
-
-
0004049893
-
-
Ph.D. thesis, Cambridge University, Cambridge, UK
-
C.J.C.H. Watkins, Learning from Delayed Rewards, Ph.D. thesis, Cambridge University, Cambridge, UK, 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.J.C.H.1