메뉴 건너뛰기




Volumn 10, Issue 1, 2004, Pages 65-81

Learning Obstacle Avoidance with an Operant Behavior Model

Author keywords

Animals; Artificial neural networks; Neural networks; Operant learning; Q Learning; Reinforcement learning

Indexed keywords

ALGORITHMS; BEHAVIORAL RESEARCH; MATHEMATICAL MODELS; NEURAL NETWORKS; NEUROLOGY; ROBOT LEARNING; ROBOTS; ROBUSTNESS (CONTROL SYSTEMS);

EID: 1542334432     PISSN: 10645462     EISSN: None     Source Type: Journal    
DOI: 10.1162/106454604322875913     Document Type: Article
Times cited : (18)

References (35)
  • 1
    • 58149455097 scopus 로고
    • The role of frustrative nonreward in continuous reward situations
    • Amsel, A. (1958). The role of frustrative nonreward in continuous reward situations. Psychological Bulletin, 55, 102-119.
    • (1958) Psychological Bulletin , vol.55 , pp. 102-119
    • Amsel, A.1
  • 2
    • 0025888177 scopus 로고
    • Visual learning, adaptive expectations, and behavioral conditioning of the mobile robot Mavin
    • Baloch, A., & Waxman, A. (1991). Visual learning, adaptive expectations, and behavioral conditioning of the mobile robot Mavin. Neural Networks, 4, 271-302.
    • (1991) Neural Networks , vol.4 , pp. 271-302
    • Baloch, A.1    Waxman, A.2
  • 4
    • 0027373552 scopus 로고
    • Prefrontal connections of medial motor areas in the rhesus monkey
    • Bates, J. F., & Goldman-Rakic, P. S. (1993). Prefrontal connections of medial motor areas in the rhesus monkey. Journal of Comparative Neurolobiology, 336, 211-228.
    • (1993) Journal of Comparative Neurolobiology , vol.336 , pp. 211-228
    • Bates, J.F.1    Goldman-Rakic, P.S.2
  • 5
    • 0003165497 scopus 로고    scopus 로고
    • Application of biological learning theories to mobile robot avoidance and approach behaviors
    • Chang, C., & Gaudiano, P. (1998). Application of biological learning theories to mobile robot avoidance and approach behaviors. Journal of Complex Systems, 1, 79-114.
    • (1998) Journal of Complex Systems , vol.1 , pp. 79-114
    • Chang, C.1    Gaudiano, P.2
  • 6
    • 0003259931 scopus 로고    scopus 로고
    • Improving elevator performance using reinforcement learning
    • D. Touretzky, M. Mozer, & M. Hasselmo (Eds.)
    • Crites, R. H., & Barto, A. G. (1996). Improving elevator performance using reinforcement learning. In D. Touretzky, M. Mozer, & M. Hasselmo (Eds.), Neural Information Processing Systems, Vol. 8.
    • (1996) Neural Information Processing Systems , vol.8
    • Crites, R.H.1    Barto, A.G.2
  • 7
    • 0001655080 scopus 로고
    • A mathematical model of reward and aversive nonreward: Its application in over 30 appetitive learning situations
    • Daly, H. B., & Daly, J. T. (1982). A mathematical model of reward and aversive nonreward: Its application in over 30 appetitive learning situations. Journal of Experimental Psychology: General, 111, 441-480.
    • (1982) Journal of Experimental Psychology: General , vol.111 , pp. 441-480
    • Daly, H.B.1    Daly, J.T.2
  • 8
    • 0040111240 scopus 로고
    • DMOD - A mathematical model of reward and aversive nonreward in appetitive learning situations: Program and instruction manual
    • Daly, H. B., & Daly, J. T. (1984). DMOD - A mathematical model of reward and aversive nonreward in appetitive learning situations: Program and instruction manual. Behavior Research Methods, Instruments, & Computers, 16, 38-52.
    • (1984) Behavior Research Methods, Instruments, & Computers , vol.16 , pp. 38-52
    • Daly, H.B.1    Daly, J.T.2
  • 9
    • 0002697876 scopus 로고    scopus 로고
    • ARBIB: An autonomous robot based on inspiration from biology
    • Damper, R. I., French, R. L. B., & Scutt, T. W. (2000). ARBIB: An autonomous robot based on inspiration from biology. Robotics and Autonomous Systems, 31(4), 247-274.
    • (2000) Robotics and Autonomous Systems , vol.31 , Issue.4 , pp. 247-274
    • Damper, R.I.1    French, R.L.B.2    Scutt, T.W.3
  • 10
    • 0032004808 scopus 로고    scopus 로고
    • Animats and what they can tell us
    • Dean, J. (1998). Animats and what they can tell us. Trends in Cognitive Sciences, 2(2), 60-67.
    • (1998) Trends in Cognitive Sciences , vol.2 , Issue.2 , pp. 60-67
    • Dean, J.1
  • 13
    • 0031088259 scopus 로고    scopus 로고
    • The S-R issue: Its status in behavior analysis and in Donahoe & Palmer's Learning and Complex Behavior
    • Donahoe, J. W., Palmer, D. C., & Burgos, J. E. (1997). The S-R issue: Its status in behavior analysis and in Donahoe & Palmer's Learning and Complex Behavior. Journal of the Experimental Analysis of Behavior, 67, 193-211.
    • (1997) Journal of the Experimental Analysis of Behavior , vol.67 , pp. 193-211
    • Donahoe, J.W.1    Palmer, D.C.2    Burgos, J.E.3
  • 15
    • 0000292303 scopus 로고
    • Circuitry of primate prefrontal cortex and regulation of behavior by representational memory
    • F. Plum (Ed.). Bethesda, MD: American Physiological Society
    • Goldman-Rakic, P. S. (1987). Circuitry of primate prefrontal cortex and regulation of behavior by representational memory. In F. Plum (Ed.), Handbook of Physiology: The Nervous System (pp. 373-417). Bethesda, MD: American Physiological Society.
    • (1987) Handbook of Physiology: The Nervous System , pp. 373-417
    • Goldman-Rakic, P.S.1
  • 17
    • 85153938292 scopus 로고
    • Reinforcement learning algorithm for partially observable Markov decision problems
    • G. Tesauro, D. Touretzky, & T. Leen (Eds.). Cambridge, MA: MIT Press
    • Jaakola, T., Singh, S. P., & Jordan, M. I. (1995). Reinforcement learning algorithm for partially observable Markov decision problems. In G. Tesauro, D. Touretzky, & T. Leen (Eds.), Advances in neural information processing systems, Vol. 7 (pp. 345-352). Cambridge, MA: MIT Press.
    • (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 345-352
    • Jaakola, T.1    Singh, S.P.2    Jordan, M.I.3
  • 19
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • Kaelbling, L. P., Littman, M. L., & Cassandra, A. R. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101, 99-134.
    • (1998) Artificial Intelligence , vol.101 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 23
    • 0002109138 scopus 로고
    • A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
    • A. H. Black & W. F. Prokasy (Eds.). New York: Appleton-Century-Crofts
    • Rescorla, R. A., & Wagner, A. R. (1972), A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In A. H. Black & W. F. Prokasy (Eds.), Classical conditioning II: Current research and theory. New York: Appleton-Century-Crofts.
    • (1972) Classical Conditioning II: Current Research and Theory
    • Rescorla, R.A.1    Wagner, A.R.2
  • 24
    • 0031446796 scopus 로고    scopus 로고
    • Escape, avoidance and imitation: A neural network approach
    • Schmajuk, N., & Zanutto, B. S. (1997). Escape, avoidance and imitation: A neural network approach. Adaptive Behavior, 6, 63-129.
    • (1997) Adaptive Behavior , vol.6 , pp. 63-129
    • Schmajuk, N.1    Zanutto, B.S.2
  • 25
    • 0030896968 scopus 로고    scopus 로고
    • A neural substrate of prediction and reward
    • Schultz, W., Dayan P., & Montague, R. (1997). A neural substrate of prediction and reward. Science, 275, 1593-1598.
    • (1997) Science , vol.275 , pp. 1593-1598
    • Schultz, W.1    Dayan, P.2    Montague, R.3
  • 26
    • 2142812536 scopus 로고
    • Learning without state-estimation in partially observable Markovian decision processes
    • W. W. Cohen & H. Hirsh (Eds.). San Francisco, CA: Morgan Kaufmann
    • Singh, S. P, Jaakola, T., & Jordan, M. I. (1994). Learning without state-estimation in partially observable Markovian decision processes. In W. W. Cohen & H. Hirsh (Eds.), Proceedings of the Eleventh International Conference on Machine Learning (pp. 284-292). San Francisco, CA: Morgan Kaufmann.
    • (1994) Proceedings of the Eleventh International Conference on Machine Learning , pp. 284-292
    • Singh, S.P.1    Jaakola, T.2    Jordan, M.I.3
  • 31
    • 0031498145 scopus 로고    scopus 로고
    • Operant conditioning in skinnerbots
    • Touretzky, D. S., & Saksida, M. L. (1997). Operant conditioning in skinnerbots. Adaptive Behavior, 5(3/4), 219-247.
    • (1997) Adaptive Behavior , vol.5 , Issue.3-4 , pp. 219-247
    • Touretzky, D.S.1    Saksida, M.L.2
  • 32
    • 0032191778 scopus 로고    scopus 로고
    • A bottom up approach towards the acquisition and expression of sequential representations applied to a behaving real-world device: Distributed adaptive control III
    • Verschure, P. F., & Voegtlin, T. (1998). A bottom up approach towards the acquisition and expression of sequential representations applied to a behaving real-world device: Distributed adaptive control III. Neural Networks, 11, 1531-1549.
    • (1998) Neural Networks , vol.11 , pp. 1531-1549
    • Verschure, P.F.1    Voegtlin, T.2
  • 33
    • 0035811464 scopus 로고    scopus 로고
    • Dopamine responses comply with basic assumptions of formal learning theory
    • Waelti, P., Dickinson, A., & Schultz, W. (2001). Dopamine responses comply with basic assumptions of formal learning theory. Nature, 412, 43-48.
    • (2001) Nature , vol.412 , pp. 43-48
    • Waelti, P.1    Dickinson, A.2    Schultz, W.3
  • 34
    • 0004049895 scopus 로고
    • Ph.D. dissertation, Psychology Department, Cambridge University
    • Watkins, C. J. C. H. (1989). Learning with delayed rewards. Ph.D. dissertation, Psychology Department, Cambridge University.
    • (1989) Learning with Delayed Rewards
    • Watkins, C.J.C.H.1
  • 35
    • 0003857155 scopus 로고    scopus 로고
    • A neural network model of aversive behavior
    • M. H. Hamza (Ed.). Zürich; IASTED/ACTA Press
    • Zanutto, B. S., & Lew, S. (2000). A neural network model of aversive behavior. In M. H. Hamza (Ed.), Proceedings of the LASTED Neural Networks NN'2000 (pp. 118-123). Zürich; IASTED/ACTA Press.
    • (2000) Proceedings of the LASTED Neural Networks NN'2000 , pp. 118-123
    • Zanutto, B.S.1    Lew, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.