메뉴 건너뛰기




Volumn 22, Issue 3-4, 1997, Pages 231-249

Shaping robot behavior using principles from instrumental conditioning

Author keywords

Autonomous mobile robots; Instrumental learning; Operant conditioning; Reinforcement learning; Shaping

Indexed keywords

COMPUTATION THEORY; COMPUTER SIMULATION; LEARNING ALGORITHMS; MATHEMATICAL MODELS;

EID: 0031336564     PISSN: 09218890     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0921-8890(97)00041-9     Document Type: Article
Times cited : (73)

References (39)
  • 1
    • 0030149709 scopus 로고    scopus 로고
    • Purposive behavior acquisition for a real robot by vision-based reinforcement learning
    • M. Asada, S. Noda, S. Tawaratsumida, K. Hosoda, Purposive behavior acquisition for a real robot by vision-based reinforcement learning, Machine Learning 23 (2-3) (1996) 279-303.
    • (1996) Machine Learning , vol.23 , Issue.2-3 , pp. 279-303
    • Asada, M.1    Noda, S.2    Tawaratsumida, S.3    Hosoda, K.4
  • 2
    • 0004142826 scopus 로고
    • Oxford University Press, Oxford
    • S.A. Barnett, Modern Ethology, Oxford University Press, Oxford, 1981.
    • (1981) Modern Ethology
    • Barnett, S.A.1
  • 8
    • 0028025030 scopus 로고
    • A novel automated touchscreen procedure for assessing learning in the rat using computer graphic stimuli
    • T.J. Bussey, J.L. Muir, T.W. Robbins, A novel automated touchscreen procedure for assessing learning in the rat using computer graphic stimuli, Neuroscience Research Communications 15 (2) (1994) 103-109.
    • (1994) Neuroscience Research Communications , vol.15 , Issue.2 , pp. 103-109
    • Bussey, T.J.1    Muir, J.L.2    Robbins, T.W.3
  • 12
    • 0010786206 scopus 로고
    • Instrumental conditioning
    • N.J. Mackintosh (Ed.), Academic Press, Orlando, FL
    • A. Dickinson, Instrumental conditioning, in: N.J. Mackintosh (Ed.), Handbook of Perception and Cognition, vol. 9, Academic Press, Orlando, FL, 1995.
    • (1995) Handbook of Perception and Cognition , vol.9
    • Dickinson, A.1
  • 14
    • 0028739953 scopus 로고
    • Robot shaping: Developing autonomous agents through learning
    • M. Dorigo, M. Colombetti, Robot shaping: Developing autonomous agents through learning, Artificial Intelligence 70 (2) (1994) 321-370.
    • (1994) Artificial Intelligence , vol.70 , Issue.2 , pp. 321-370
    • Dorigo, M.1    Colombetti, M.2
  • 17
    • 0003182781 scopus 로고
    • A multistrategy learning scheme for agent knowledge acquisition
    • D. Gordon, D. Subramanian, A multistrategy learning scheme for agent knowledge acquisition, Informatica 17 (1994) 331-346.
    • (1994) Informatica , vol.17 , pp. 331-346
    • Gordon, D.1    Subramanian, D.2
  • 18
    • 0027375966 scopus 로고
    • Hippocampal cell firing correlates of delayed-match-to-sample performance in the rat
    • R.E. Hampson, C.J. Heyser, S.A. Deadwyler, Hippocampal cell firing correlates of delayed-match-to-sample performance in the rat, Behavioral Neuroscience 107 (5) (1993) 715-739.
    • (1993) Behavioral Neuroscience , vol.107 , Issue.5 , pp. 715-739
    • Hampson, R.E.1    Heyser, C.J.2    Deadwyler, S.A.3
  • 22
    • 0000123778 scopus 로고
    • Self-improving reactive agents based on reinforcement learning, planning, and teaching
    • L.-J. Lin, Self-improving reactive agents based on reinforcement learning, planning, and teaching, Machine Learning 8 (1992) 293-321.
    • (1992) Machine Learning , vol.8 , pp. 293-321
    • Lin, L.-J.1
  • 23
    • 0029732210 scopus 로고    scopus 로고
    • Creating advice-taking reinforcement learners
    • R. Maclin, J.W. Shavlik, Creating advice-taking reinforcement learners, Machine Learning 22 (1-3) (1996) 251-281.
    • (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 251-281
    • Maclin, R.1    Shavlik, J.W.2
  • 26
    • 0019089514 scopus 로고
    • A model for Pavlovian learning: Variations in effectiveness of conditioned but not unconditioned stimuli
    • J.M. Pearce, G. Hall, A model for Pavlovian learning: Variations in effectiveness of conditioned but not unconditioned stimuli, Psychological Review 87 (6) (1980) 532-552.
    • (1980) Psychological Review , vol.87 , Issue.6 , pp. 532-552
    • Pearce, J.M.1    Hall, G.2
  • 30
    • 0026923467 scopus 로고
    • A learning rule based on empirically derived activity-dependent neuromodulation supports operant conditioning in a small network
    • J.L. Raymond, D.A. Baxter, D.V. Buonomano, J.H. Byrne, A learning rule based on empirically derived activity-dependent neuromodulation supports operant conditioning in a small network, Neural Networks 5 (5) (1992) 789-803.
    • (1992) Neural Networks , vol.5 , Issue.5 , pp. 789-803
    • Raymond, J.L.1    Baxter, D.A.2    Buonomano, D.V.3    Byrne, J.H.4
  • 31
    • 0002109138 scopus 로고
    • A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
    • A.H. Black, W.F. Prokasy (Eds.), Appleton-Century-Crofts, New York
    • R.A. Rescorla, A.R. Wagner, A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement, in: A.H. Black, W.F. Prokasy (Eds.), Classical Conditioning II: Theory and Research, Appleton-Century-Crofts, New York, 1972.
    • (1972) Classical Conditioning II: Theory and Research
    • Rescorla, R.A.1    Wagner, A.R.2
  • 36
    • 0001027894 scopus 로고
    • Transfer of learning across sequential tasks
    • S.P. Singh, Transfer of learning across sequential tasks, Machine Learning 8 (1992) 323-339.
    • (1992) Machine Learning , vol.8 , pp. 323-339
    • Singh, S.P.1
  • 37
    • 0019537951 scopus 로고
    • Toward a modern theory of adaptive networks: Expectation and prediction
    • R.S. Sutton, A.G. Barto, Toward a modern theory of adaptive networks: Expectation and prediction, Psychological Review 88 (1981) 135-170.
    • (1981) Psychological Review , vol.88 , pp. 135-170
    • Sutton, R.S.1    Barto, A.G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.