-
1
-
-
58149455097
-
The role of frustrative nonreward in continuous reward situations
-
Amsel, A. (1958). The role of frustrative nonreward in continuous reward situations. Psychological Bulletin, 55, 102-119.
-
(1958)
Psychological Bulletin
, vol.55
, pp. 102-119
-
-
Amsel, A.1
-
2
-
-
0025888177
-
Visual learning, adaptive expectations, and behavioral conditioning of the mobile robot Mavin
-
Baloch, A., & Waxman, A. (1991). Visual learning, adaptive expectations, and behavioral conditioning of the mobile robot Mavin. Neural Networks, 4, 271-302.
-
(1991)
Neural Networks
, vol.4
, pp. 271-302
-
-
Baloch, A.1
Waxman, A.2
-
3
-
-
0020970738
-
Neuronlike elements that can solve difficult learning problems
-
Barto, A. G., Sutton, R. S., & Anderson, C. W. (1983). Neuronlike elements that can solve difficult learning problems. IEEE Transactions on Systems, Man, and Cybernetics, 13, 835-846.
-
(1983)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.13
, pp. 835-846
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
-
5
-
-
0003165497
-
Application of biological learning theories to mobile robot avoidance and approach behaviors
-
Chang, C., & Gaudiano, P. (1998). Application of biological learning theories to mobile robot avoidance and approach behaviors. Journal of Complex Systems, 1, 79-114.
-
(1998)
Journal of Complex Systems
, vol.1
, pp. 79-114
-
-
Chang, C.1
Gaudiano, P.2
-
6
-
-
0003259931
-
Improving elevator performance using reinforcement learning
-
D. Touretzky, M. Mozer, & M. Hasselmo (Eds.)
-
Crites, R. H., & Barto, A. G. (1996). Improving elevator performance using reinforcement learning. In D. Touretzky, M. Mozer, & M. Hasselmo (Eds.), Neural Information Processing Systems, Vol. 8.
-
(1996)
Neural Information Processing Systems
, vol.8
-
-
Crites, R.H.1
Barto, A.G.2
-
7
-
-
0001655080
-
A mathematical model of reward and aversive nonreward: Its application in over 30 appetitive learning situations
-
Daly, H. B., & Daly, J. T. (1982). A mathematical model of reward and aversive nonreward: Its application in over 30 appetitive learning situations. Journal of Experimental Psychology: General, 111, 441-480.
-
(1982)
Journal of Experimental Psychology: General
, vol.111
, pp. 441-480
-
-
Daly, H.B.1
Daly, J.T.2
-
8
-
-
0040111240
-
DMOD - A mathematical model of reward and aversive nonreward in appetitive learning situations: Program and instruction manual
-
Daly, H. B., & Daly, J. T. (1984). DMOD - A mathematical model of reward and aversive nonreward in appetitive learning situations: Program and instruction manual. Behavior Research Methods, Instruments, & Computers, 16, 38-52.
-
(1984)
Behavior Research Methods, Instruments, & Computers
, vol.16
, pp. 38-52
-
-
Daly, H.B.1
Daly, J.T.2
-
9
-
-
0002697876
-
ARBIB: An autonomous robot based on inspiration from biology
-
Damper, R. I., French, R. L. B., & Scutt, T. W. (2000). ARBIB: An autonomous robot based on inspiration from biology. Robotics and Autonomous Systems, 31(4), 247-274.
-
(2000)
Robotics and Autonomous Systems
, vol.31
, Issue.4
, pp. 247-274
-
-
Damper, R.I.1
French, R.L.B.2
Scutt, T.W.3
-
10
-
-
0032004808
-
Animats and what they can tell us
-
Dean, J. (1998). Animats and what they can tell us. Trends in Cognitive Sciences, 2(2), 60-67.
-
(1998)
Trends in Cognitive Sciences
, vol.2
, Issue.2
, pp. 60-67
-
-
Dean, J.1
-
11
-
-
0027634299
-
A selectionist approach to reinforcement
-
Donahoe, J. W., Burgos, J. E., & Palmer, D. C. (1993). A selectionist approach to reinforcement. Journal of the Experimental Analysis of Behavior, 60, 17-40.
-
(1993)
Journal of the Experimental Analysis of Behavior
, vol.60
, pp. 17-40
-
-
Donahoe, J.W.1
Burgos, J.E.2
Palmer, D.C.3
-
13
-
-
0031088259
-
The S-R issue: Its status in behavior analysis and in Donahoe & Palmer's Learning and Complex Behavior
-
Donahoe, J. W., Palmer, D. C., & Burgos, J. E. (1997). The S-R issue: Its status in behavior analysis and in Donahoe & Palmer's Learning and Complex Behavior. Journal of the Experimental Analysis of Behavior, 67, 193-211.
-
(1997)
Journal of the Experimental Analysis of Behavior
, vol.67
, pp. 193-211
-
-
Donahoe, J.W.1
Palmer, D.C.2
Burgos, J.E.3
-
15
-
-
0000292303
-
Circuitry of primate prefrontal cortex and regulation of behavior by representational memory
-
F. Plum (Ed.). Bethesda, MD: American Physiological Society
-
Goldman-Rakic, P. S. (1987). Circuitry of primate prefrontal cortex and regulation of behavior by representational memory. In F. Plum (Ed.), Handbook of Physiology: The Nervous System (pp. 373-417). Bethesda, MD: American Physiological Society.
-
(1987)
Handbook of Physiology: The Nervous System
, pp. 373-417
-
-
Goldman-Rakic, P.S.1
-
17
-
-
85153938292
-
Reinforcement learning algorithm for partially observable Markov decision problems
-
G. Tesauro, D. Touretzky, & T. Leen (Eds.). Cambridge, MA: MIT Press
-
Jaakola, T., Singh, S. P., & Jordan, M. I. (1995). Reinforcement learning algorithm for partially observable Markov decision problems. In G. Tesauro, D. Touretzky, & T. Leen (Eds.), Advances in neural information processing systems, Vol. 7 (pp. 345-352). Cambridge, MA: MIT Press.
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
, pp. 345-352
-
-
Jaakola, T.1
Singh, S.P.2
Jordan, M.I.3
-
18
-
-
0029679044
-
Reinforcement learning: A survey
-
Kaelbling, L. P., Littman, M. L., & Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 237-285.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
19
-
-
0032073263
-
Planning and acting in partially observable stochastic domains
-
Kaelbling, L. P., Littman, M. L., & Cassandra, A. R. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101, 99-134.
-
(1998)
Artificial Intelligence
, vol.101
, pp. 99-134
-
-
Kaelbling, L.P.1
Littman, M.L.2
Cassandra, A.R.3
-
23
-
-
0002109138
-
A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
-
A. H. Black & W. F. Prokasy (Eds.). New York: Appleton-Century-Crofts
-
Rescorla, R. A., & Wagner, A. R. (1972), A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In A. H. Black & W. F. Prokasy (Eds.), Classical conditioning II: Current research and theory. New York: Appleton-Century-Crofts.
-
(1972)
Classical Conditioning II: Current Research and Theory
-
-
Rescorla, R.A.1
Wagner, A.R.2
-
24
-
-
0031446796
-
Escape, avoidance and imitation: A neural network approach
-
Schmajuk, N., & Zanutto, B. S. (1997). Escape, avoidance and imitation: A neural network approach. Adaptive Behavior, 6, 63-129.
-
(1997)
Adaptive Behavior
, vol.6
, pp. 63-129
-
-
Schmajuk, N.1
Zanutto, B.S.2
-
25
-
-
0030896968
-
A neural substrate of prediction and reward
-
Schultz, W., Dayan P., & Montague, R. (1997). A neural substrate of prediction and reward. Science, 275, 1593-1598.
-
(1997)
Science
, vol.275
, pp. 1593-1598
-
-
Schultz, W.1
Dayan, P.2
Montague, R.3
-
26
-
-
2142812536
-
Learning without state-estimation in partially observable Markovian decision processes
-
W. W. Cohen & H. Hirsh (Eds.). San Francisco, CA: Morgan Kaufmann
-
Singh, S. P, Jaakola, T., & Jordan, M. I. (1994). Learning without state-estimation in partially observable Markovian decision processes. In W. W. Cohen & H. Hirsh (Eds.), Proceedings of the Eleventh International Conference on Machine Learning (pp. 284-292). San Francisco, CA: Morgan Kaufmann.
-
(1994)
Proceedings of the Eleventh International Conference on Machine Learning
, pp. 284-292
-
-
Singh, S.P.1
Jaakola, T.2
Jordan, M.I.3
-
31
-
-
0031498145
-
Operant conditioning in skinnerbots
-
Touretzky, D. S., & Saksida, M. L. (1997). Operant conditioning in skinnerbots. Adaptive Behavior, 5(3/4), 219-247.
-
(1997)
Adaptive Behavior
, vol.5
, Issue.3-4
, pp. 219-247
-
-
Touretzky, D.S.1
Saksida, M.L.2
-
32
-
-
0032191778
-
A bottom up approach towards the acquisition and expression of sequential representations applied to a behaving real-world device: Distributed adaptive control III
-
Verschure, P. F., & Voegtlin, T. (1998). A bottom up approach towards the acquisition and expression of sequential representations applied to a behaving real-world device: Distributed adaptive control III. Neural Networks, 11, 1531-1549.
-
(1998)
Neural Networks
, vol.11
, pp. 1531-1549
-
-
Verschure, P.F.1
Voegtlin, T.2
-
33
-
-
0035811464
-
Dopamine responses comply with basic assumptions of formal learning theory
-
Waelti, P., Dickinson, A., & Schultz, W. (2001). Dopamine responses comply with basic assumptions of formal learning theory. Nature, 412, 43-48.
-
(2001)
Nature
, vol.412
, pp. 43-48
-
-
Waelti, P.1
Dickinson, A.2
Schultz, W.3
-
34
-
-
0004049895
-
-
Ph.D. dissertation, Psychology Department, Cambridge University
-
Watkins, C. J. C. H. (1989). Learning with delayed rewards. Ph.D. dissertation, Psychology Department, Cambridge University.
-
(1989)
Learning with Delayed Rewards
-
-
Watkins, C.J.C.H.1
-
35
-
-
0003857155
-
A neural network model of aversive behavior
-
M. H. Hamza (Ed.). Zürich; IASTED/ACTA Press
-
Zanutto, B. S., & Lew, S. (2000). A neural network model of aversive behavior. In M. H. Hamza (Ed.), Proceedings of the LASTED Neural Networks NN'2000 (pp. 118-123). Zürich; IASTED/ACTA Press.
-
(2000)
Proceedings of the LASTED Neural Networks NN'2000
, pp. 118-123
-
-
Zanutto, B.S.1
Lew, S.2
|