SCOPUS 정보 검색 플랫폼

Artificial Life

Volumn 10, Issue 1, 2004, Pages 65-81

Learning Obstacle Avoidance with an Operant Behavior Model

(2) Gutnisky, D A a,b Zanutto, B S a,b

a UNIVERSIDAD DE BUENOS AIRES (Argentina)

b Facultad de Ciencias Naturales e Instituto M Lillo (Argentina)

Author keywords

Animals; Artificial neural networks; Neural networks; Operant learning; Q Learning; Reinforcement learning

Indexed keywords

ALGORITHMS; BEHAVIORAL RESEARCH; MATHEMATICAL MODELS; NEURAL NETWORKS; NEUROLOGY; ROBOT LEARNING; ROBOTS; ROBUSTNESS (CONTROL SYSTEMS);

ANIMATS; OPERANT LEARNING; Q-LEARNING; REINFORCEMENT LEARNING;

ARTIFICIAL INTELLIGENCE;

ARTICLE; ARTIFICIAL NEURAL NETWORK; AVOIDANCE BEHAVIOR; INSTRUMENTAL CONDITIONING; PHYSIOLOGY; PSYCHOLOGICAL MODEL;

AVOIDANCE LEARNING; CONDITIONING, OPERANT; MODELS, PSYCHOLOGICAL; NEURAL NETWORKS (COMPUTER);

EID: 1542334432 PISSN: 10645462 EISSN: None Source Type: Journal
DOI: 10.1162/106454604322875913 Document Type: Article

Times cited : (18)

References (35)

1
- 58149455097
- The role of frustrative nonreward in continuous reward situations
- Amsel, A. (1958). The role of frustrative nonreward in continuous reward situations. Psychological Bulletin, 55, 102-119.
- (1958) Psychological Bulletin , vol.55 , pp. 102-119
- Amsel, A.¹

2
- 0025888177
- Visual learning, adaptive expectations, and behavioral conditioning of the mobile robot Mavin
- Baloch, A., & Waxman, A. (1991). Visual learning, adaptive expectations, and behavioral conditioning of the mobile robot Mavin. Neural Networks, 4, 271-302.
- (1991) Neural Networks , vol.4 , pp. 271-302
- Baloch, A.¹ Waxman, A.²

3
- 0020970738
- Neuronlike elements that can solve difficult learning problems
- Barto, A. G., Sutton, R. S., & Anderson, C. W. (1983). Neuronlike elements that can solve difficult learning problems. IEEE Transactions on Systems, Man, and Cybernetics, 13, 835-846.
- (1983) IEEE Transactions on Systems, Man, and Cybernetics , vol.13 , pp. 835-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

4
- 0027373552
- Prefrontal connections of medial motor areas in the rhesus monkey
- Bates, J. F., & Goldman-Rakic, P. S. (1993). Prefrontal connections of medial motor areas in the rhesus monkey. Journal of Comparative Neurolobiology, 336, 211-228.
- (1993) Journal of Comparative Neurolobiology , vol.336 , pp. 211-228
- Bates, J.F.¹ Goldman-Rakic, P.S.²

5
- 0003165497
- Application of biological learning theories to mobile robot avoidance and approach behaviors
- Chang, C., & Gaudiano, P. (1998). Application of biological learning theories to mobile robot avoidance and approach behaviors. Journal of Complex Systems, 1, 79-114.
- (1998) Journal of Complex Systems , vol.1 , pp. 79-114
- Chang, C.¹ Gaudiano, P.²

6
- 0003259931
- Improving elevator performance using reinforcement learning
- D. Touretzky, M. Mozer, & M. Hasselmo (Eds.)
- Crites, R. H., & Barto, A. G. (1996). Improving elevator performance using reinforcement learning. In D. Touretzky, M. Mozer, & M. Hasselmo (Eds.), Neural Information Processing Systems, Vol. 8.
- (1996) Neural Information Processing Systems , vol.8
- Crites, R.H.¹ Barto, A.G.²

7
- 0001655080
- A mathematical model of reward and aversive nonreward: Its application in over 30 appetitive learning situations
- Daly, H. B., & Daly, J. T. (1982). A mathematical model of reward and aversive nonreward: Its application in over 30 appetitive learning situations. Journal of Experimental Psychology: General, 111, 441-480.
- (1982) Journal of Experimental Psychology: General , vol.111 , pp. 441-480
- Daly, H.B.¹ Daly, J.T.²

8
- 0040111240
- DMOD - A mathematical model of reward and aversive nonreward in appetitive learning situations: Program and instruction manual
- Daly, H. B., & Daly, J. T. (1984). DMOD - A mathematical model of reward and aversive nonreward in appetitive learning situations: Program and instruction manual. Behavior Research Methods, Instruments, & Computers, 16, 38-52.
- (1984) Behavior Research Methods, Instruments, & Computers , vol.16 , pp. 38-52
- Daly, H.B.¹ Daly, J.T.²

9
- 0002697876
- ARBIB: An autonomous robot based on inspiration from biology
- Damper, R. I., French, R. L. B., & Scutt, T. W. (2000). ARBIB: An autonomous robot based on inspiration from biology. Robotics and Autonomous Systems, 31(4), 247-274.
- (2000) Robotics and Autonomous Systems , vol.31 , Issue.4 , pp. 247-274
- Damper, R.I.¹ French, R.L.B.² Scutt, T.W.³

10
- 0032004808
- Animats and what they can tell us
- Dean, J. (1998). Animats and what they can tell us. Trends in Cognitive Sciences, 2(2), 60-67.
- (1998) Trends in Cognitive Sciences , vol.2 , Issue.2 , pp. 60-67
- Dean, J.¹

11
- 0027634299
- A selectionist approach to reinforcement
- Donahoe, J. W., Burgos, J. E., & Palmer, D. C. (1993). A selectionist approach to reinforcement. Journal of the Experimental Analysis of Behavior, 60, 17-40.
- (1993) Journal of the Experimental Analysis of Behavior , vol.60 , pp. 17-40
- Donahoe, J.W.¹ Burgos, J.E.² Palmer, D.C.³

12
- 0003397519
- Boston: Allyn & Bacon
- Donahoe, J. W., & Palmer, D. C. (1994). Learning and complex behavior. Boston: Allyn & Bacon.
- (1994) Learning and Complex Behavior
- Donahoe, J.W.¹ Palmer, D.C.²

13
- 0031088259
- The S-R issue: Its status in behavior analysis and in Donahoe & Palmer's Learning and Complex Behavior
- Donahoe, J. W., Palmer, D. C., & Burgos, J. E. (1997). The S-R issue: Its status in behavior analysis and in Donahoe & Palmer's Learning and Complex Behavior. Journal of the Experimental Analysis of Behavior, 67, 193-211.
- (1997) Journal of the Experimental Analysis of Behavior , vol.67 , pp. 193-211
- Donahoe, J.W.¹ Palmer, D.C.² Burgos, J.E.³

14
- 0030675539
- Adaptive obstacle avoidance with a neural network for operant conditioning: Experiments with real robots
- Monterey, CA
- Gaudiano, P., & Chang, C. (1997). Adaptive obstacle avoidance with a neural network for operant conditioning: Experiments with real robots. In Proceedings of the 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation (CIRA) (pp. 13-18). Monterey, CA.
- (1997) Proceedings of the 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation (CIRA) , pp. 13-18
- Gaudiano, P.¹ Chang, C.²

15
- 0000292303
- Circuitry of primate prefrontal cortex and regulation of behavior by representational memory
- F. Plum (Ed.). Bethesda, MD: American Physiological Society
- Goldman-Rakic, P. S. (1987). Circuitry of primate prefrontal cortex and regulation of behavior by representational memory. In F. Plum (Ed.), Handbook of Physiology: The Nervous System (pp. 373-417). Bethesda, MD: American Physiological Society.
- (1987) Handbook of Physiology: The Nervous System , pp. 373-417
- Goldman-Rakic, P.S.¹

16
- 0003409088
- New York: Wiley
- Hebb, D. O. (1949). The organization of behavior; A neuropsychological theory. New York: Wiley.
- (1949) The Organization of Behavior; A Neuropsychological Theory
- Hebb, D.O.¹

17
- 85153938292
- Reinforcement learning algorithm for partially observable Markov decision problems
- G. Tesauro, D. Touretzky, & T. Leen (Eds.). Cambridge, MA: MIT Press
- Jaakola, T., Singh, S. P., & Jordan, M. I. (1995). Reinforcement learning algorithm for partially observable Markov decision problems. In G. Tesauro, D. Touretzky, & T. Leen (Eds.), Advances in neural information processing systems, Vol. 7 (pp. 345-352). Cambridge, MA: MIT Press.
- (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 345-352
- Jaakola, T.¹ Singh, S.P.² Jordan, M.I.³

18
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L. P., Littman, M. L., & Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

19
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling, L. P., Littman, M. L., & Cassandra, A. R. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101, 99-134.
- (1998) Artificial Intelligence , vol.101 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

20
- 0034856684
- Role of unconditioned stimulus prediction in the operant learning: A neural network model
- Lew, S. E., Wedemeyer, C., & Zanutto, B. S. (2001). Role of unconditioned stimulus prediction in the operant learning: A neural network model. In Proceedings of IEEE Conference on Neural Networks (pp. 331-336).
- (2001) Proceedings of IEEE Conference on Neural Networks , pp. 331-336
- Lew, S.E.¹ Wedemeyer, C.² Zanutto, B.S.³

21
- 85158146654
- Automatic programming of behavior-based robots using reinforcement learning
- Anaheim, CA
- Mahadevan, S., & Connell, J. (1991). Automatic programming of behavior-based robots using reinforcement learning. In Proceedings of the Ninth National Conference on Artificial Intelligence, Anaheim, CA.
- (1991) Proceedings of the Ninth National Conference on Artificial Intelligence
- Mahadevan, S.¹ Connell, J.²

22
- 0003649697
- Cambridge, MA: Bradford Books, MIT Press
- McFarland, D., & Bösser, T. (1993). Intelligent behavior in animals and robots. Cambridge, MA: Bradford Books, MIT Press.
- (1993) Intelligent Behavior in Animals and Robots
- McFarland, D.¹ Bösser, T.²

23
- 0002109138
- A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
- A. H. Black & W. F. Prokasy (Eds.). New York: Appleton-Century-Crofts
- Rescorla, R. A., & Wagner, A. R. (1972), A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In A. H. Black & W. F. Prokasy (Eds.), Classical conditioning II: Current research and theory. New York: Appleton-Century-Crofts.
- (1972) Classical Conditioning II: Current Research and Theory
- Rescorla, R.A.¹ Wagner, A.R.²

24
- 0031446796
- Escape, avoidance and imitation: A neural network approach
- Schmajuk, N., & Zanutto, B. S. (1997). Escape, avoidance and imitation: A neural network approach. Adaptive Behavior, 6, 63-129.
- (1997) Adaptive Behavior , vol.6 , pp. 63-129
- Schmajuk, N.¹ Zanutto, B.S.²

25
- 0030896968
- A neural substrate of prediction and reward
- Schultz, W., Dayan P., & Montague, R. (1997). A neural substrate of prediction and reward. Science, 275, 1593-1598.
- (1997) Science , vol.275 , pp. 1593-1598
- Schultz, W.¹ Dayan, P.² Montague, R.³

26
- 2142812536
- Learning without state-estimation in partially observable Markovian decision processes
- W. W. Cohen & H. Hirsh (Eds.). San Francisco, CA: Morgan Kaufmann
- Singh, S. P, Jaakola, T., & Jordan, M. I. (1994). Learning without state-estimation in partially observable Markovian decision processes. In W. W. Cohen & H. Hirsh (Eds.), Proceedings of the Eleventh International Conference on Machine Learning (pp. 284-292). San Francisco, CA: Morgan Kaufmann.
- (1994) Proceedings of the Eleventh International Conference on Machine Learning , pp. 284-292
- Singh, S.P.¹ Jaakola, T.² Jordan, M.I.³

27
- 0011200414
- Reinforcement learning architectures for animats
- J.-A. Meyer & S. W. Wilson (Eds.). Cambridge, MA: MIT Press
- Sutton, R. S. (1991). Reinforcement learning architectures for animats, In J.-A. Meyer & S. W. Wilson (Eds.), From animals to animals: Proceedings of the First International Conference on Simulation of Adaptative Behavior (pp. 288-296). Cambridge, MA: MIT Press.
- (1991) From Animals to Animals: Proceedings of the First International Conference on Simulation of Adaptative Behavior , pp. 288-296
- Sutton, R.S.¹

28
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

29
- 0003392753
- Cambridge, UK: Cambridge University Press
- Staddon, J. E. R. (1983). Adaptive behavior and learning. Cambridge, UK: Cambridge University Press.
- (1983) Adaptive Behavior and Learning
- Staddon, J.E.R.¹

30
- 0002746315
- Towards a theory of emergent functionality
- J. A. Meyer & S. W. Wilson (Eds.). Cambridge, MA: Bradford
- Steels, L. (1991). Towards a theory of emergent functionality. In J. A. Meyer & S. W. Wilson (Eds.), From animals to animats: Proceedings of the First International Conference on Simulation of Adaptive Behavior (pp. 451-461). Cambridge, MA: Bradford.
- (1991) From Animals to Animats: Proceedings of the First International Conference on Simulation of Adaptive Behavior , pp. 451-461
- Steels, L.¹

31
- 0031498145
- Operant conditioning in skinnerbots
- Touretzky, D. S., & Saksida, M. L. (1997). Operant conditioning in skinnerbots. Adaptive Behavior, 5(3/4), 219-247.
- (1997) Adaptive Behavior , vol.5 , Issue.3-4 , pp. 219-247
- Touretzky, D.S.¹ Saksida, M.L.²

32
- 0032191778
- A bottom up approach towards the acquisition and expression of sequential representations applied to a behaving real-world device: Distributed adaptive control III
- Verschure, P. F., & Voegtlin, T. (1998). A bottom up approach towards the acquisition and expression of sequential representations applied to a behaving real-world device: Distributed adaptive control III. Neural Networks, 11, 1531-1549.
- (1998) Neural Networks , vol.11 , pp. 1531-1549
- Verschure, P.F.¹ Voegtlin, T.²

33
- 0035811464
- Dopamine responses comply with basic assumptions of formal learning theory
- Waelti, P., Dickinson, A., & Schultz, W. (2001). Dopamine responses comply with basic assumptions of formal learning theory. Nature, 412, 43-48.
- (2001) Nature , vol.412 , pp. 43-48
- Waelti, P.¹ Dickinson, A.² Schultz, W.³

34
- 0004049895
- Ph.D. dissertation, Psychology Department, Cambridge University
- Watkins, C. J. C. H. (1989). Learning with delayed rewards. Ph.D. dissertation, Psychology Department, Cambridge University.
- (1989) Learning with Delayed Rewards
- Watkins, C.J.C.H.¹

35
- 0003857155
- A neural network model of aversive behavior
- M. H. Hamza (Ed.). Zürich; IASTED/ACTA Press
- Zanutto, B. S., & Lew, S. (2000). A neural network model of aversive behavior. In M. H. Hamza (Ed.), Proceedings of the LASTED Neural Networks NN'2000 (pp. 118-123). Zürich; IASTED/ACTA Press.
- (2000) Proceedings of the LASTED Neural Networks NN'2000 , pp. 118-123
- Zanutto, B.S.¹ Lew, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.