SCOPUS 정보 검색 플랫폼

Robotics and Autonomous Systems

Volumn 22, Issue 3-4, 1997, Pages 231-249

Shaping robot behavior using principles from instrumental conditioning

(3) Saksida, Lisa M a,c Raymond, Scott M b Touretzky, David S b,c

a CARNEGIE MELLON UNIVERSITY (United States)

b Carnegie Mellon University (United States)

c CARNEGIE MELLON UNIVERSITY (United States)

Author keywords

Autonomous mobile robots; Instrumental learning; Operant conditioning; Reinforcement learning; Shaping

Indexed keywords

COMPUTATION THEORY; COMPUTER SIMULATION; LEARNING ALGORITHMS; MATHEMATICAL MODELS;

INSTRUMENTAL LEARNING; REINFORCEMENT LEARNING;

MOBILE ROBOTS;

EID: 0031336564 PISSN: 09218890 EISSN: None Source Type: Journal
DOI: 10.1016/S0921-8890(97)00041-9 Document Type: Article

Times cited : (73)

References (39)

1
- 0030149709
- Purposive behavior acquisition for a real robot by vision-based reinforcement learning
- M. Asada, S. Noda, S. Tawaratsumida, K. Hosoda, Purposive behavior acquisition for a real robot by vision-based reinforcement learning, Machine Learning 23 (2-3) (1996) 279-303.
- (1996) Machine Learning , vol.23 , Issue.2-3 , pp. 279-303
- Asada, M.¹ Noda, S.² Tawaratsumida, S.³ Hosoda, K.⁴

2
- 0004142826
- Oxford University Press, Oxford
- S.A. Barnett, Modern Ethology, Oxford University Press, Oxford, 1981.
- (1981) Modern Ethology
- Barnett, S.A.¹

3
- 0010922732
- Empirically derived adaptive elements and networks simulate associative learning
- Lawrence Erlbaum, Hillsdale, NJ
- D.A. Baxter, D.V. Bounomano, J.L. Raymond, D.G. Cook, F.M. Kuenzi, T.J. Carew, J.H. Byrne, Empirically derived adaptive elements and networks simulate associative learning, in: Neural Network Models of Conditioning and Action, Lawrence Erlbaum, Hillsdale, NJ, 1991, pp. 13-52.
- (1991) Neural Network Models of Conditioning and Action , pp. 13-52
- Baxter, D.A.¹ Bounomano, D.V.² Raymond, J.L.³ Cook, D.G.⁴ Kuenzi, F.M.⁵ Carew, T.J.⁶ Byrne, J.H.⁷

4
- 0000384373
- Action - Selection in hamsterdam: Lessons from ethology
- Brighton
- B. Blumberg, Action - selection in hamsterdam: Lessons from ethology, in: Proceedings of the Third International Conference on the Simulation of Adaptive Behavior, Brighton, 1994.
- (1994) Proceedings of the Third International Conference on the Simulation of Adaptive Behavior
- Blumberg, B.¹

5
- 0003274657
- No bad dogs: Ethological lessons for learning in hamsterdam
- B.M. Blumberg, P.M. Todd, Pattie Maes, No bad dogs: Ethological lessons for learning in hamsterdam, in: Proceedings of the Fourth International Conference on the Simulation of Adaptive Behavior, 1996.
- (1996) Proceedings of the Fourth International Conference on the Simulation of Adaptive Behavior
- Blumberg, B.M.¹ Todd, P.M.² Maes, P.³

6
- 0000696066
- The misbehavior of organisms
- K. Breland, M. Breland, The misbehavior of organisms, American Psychologist 16 (1961) 681-684.
- (1961) American Psychologist , vol.16 , pp. 681-684
- Breland, K.¹ Breland, M.²

7
- 0014234914
- Auto-shaping of the pigeon's keypeck
- P.L. Brown, H.M. Jenkins, Auto-shaping of the pigeon's keypeck, Journal of the Experimental Analysis of Behavior 11 (1968) 1-8.
- (1968) Journal of the Experimental Analysis of Behavior , vol.11 , pp. 1-8
- Brown, P.L.¹ Jenkins, H.M.²

8
- 0028025030
- A novel automated touchscreen procedure for assessing learning in the rat using computer graphic stimuli
- T.J. Bussey, J.L. Muir, T.W. Robbins, A novel automated touchscreen procedure for assessing learning in the rat using computer graphic stimuli, Neuroscience Research Communications 15 (2) (1994) 103-109.
- (1994) Neuroscience Research Communications , vol.15 , Issue.2 , pp. 103-109
- Bussey, T.J.¹ Muir, J.L.² Robbins, T.W.³

9
- 30244475580
- Santa Rosa, CA
- CCI. The CCI Program. Canine Companions for Independence, Santa Rosa, CA, 1995. Informational page available at http://grunt.berkeley.edu/cci/cci.html.
- (1995) The CCI Program. Canine Companions for Independence

10
- 85152521744
- A teaching method for reinforcement learning
- Morgan Kaufmann, Los Altos, CA
- J.A. Clouse, P.E. Utgoff, A teaching method for reinforcement learning, in: Proceedings of the Ninth Conference on Machine Learning, Morgan Kaufmann, Los Altos, CA, 1992.
- (1992) Proceedings of the Ninth Conference on Machine Learning
- Clouse, J.A.¹ Utgoff, P.E.²

11
- 0030167564
- Behavior analysis and training: A methodology for behavior engineering
- M. Colombetti, M. Dorigo, G. Borghi, Behavior analysis and training: A methodology for behavior engineering, IEEE Transactions on Systems, Man, and Cybernetics -Part B 26 (3) (1996) 365-380.
- (1996) IEEE Transactions on Systems, Man, and Cybernetics - Part B , vol.26 , Issue.3 , pp. 365-380
- Colombetti, M.¹ Dorigo, M.² Borghi, G.³

12
- 0010786206
- Instrumental conditioning
- N.J. Mackintosh (Ed.), Academic Press, Orlando, FL
- A. Dickinson, Instrumental conditioning, in: N.J. Mackintosh (Ed.), Handbook of Perception and Cognition, vol. 9, Academic Press, Orlando, FL, 1995.
- (1995) Handbook of Perception and Cognition , vol.9
- Dickinson, A.¹

13
- 0002692217
- Actions and habits: The development of behavioral autonomy
- A. Dickinson, Actions and habits: The development of behavioral autonomy, Philosophical Transactions of the Royal Society of London, Series B 308 (1985) 67-78.
- (1985) Philosophical Transactions of the Royal Society of London, Series B , vol.308 , pp. 67-78
- Dickinson, A.¹

14
- 0028739953
- Robot shaping: Developing autonomous agents through learning
- M. Dorigo, M. Colombetti, Robot shaping: Developing autonomous agents through learning, Artificial Intelligence 70 (2) (1994) 321-370.
- (1994) Artificial Intelligence , vol.70 , Issue.2 , pp. 321-370
- Dorigo, M.¹ Colombetti, M.²

15
- 0003977430
- MIT Press, Cambridge, MA
- G.L. Drescher, Made-Up Minds, MIT Press, Cambridge, MA, 1991.
- (1991) Made-up Minds
- Drescher, G.L.¹

16
- 0038921118
- Lawrence Erlbaum, Hillsdale, NJ
- C.R. Gallistel, The Organization of Action, Lawrence Erlbaum, Hillsdale, NJ, 1980.
- (1980) The Organization of Action
- Gallistel, C.R.¹

17
- 0003182781
- A multistrategy learning scheme for agent knowledge acquisition
- D. Gordon, D. Subramanian, A multistrategy learning scheme for agent knowledge acquisition, Informatica 17 (1994) 331-346.
- (1994) Informatica , vol.17 , pp. 331-346
- Gordon, D.¹ Subramanian, D.²

18
- 0027375966
- Hippocampal cell firing correlates of delayed-match-to-sample performance in the rat
- R.E. Hampson, C.J. Heyser, S.A. Deadwyler, Hippocampal cell firing correlates of delayed-match-to-sample performance in the rat, Behavioral Neuroscience 107 (5) (1993) 715-739.
- (1993) Behavioral Neuroscience , vol.107 , Issue.5 , pp. 715-739
- Hampson, R.E.¹ Heyser, C.J.² Deadwyler, S.A.³

19
- 0004031241
- The Psychonomic Society, Austin
- E. Hearst, H.M. Jenkins, Sign tracking: The Stimulus-Reinforcer Relation and Directed Action, The Psychonomic Society, Austin, 1975.
- (1975) Sign Tracking: The Stimulus-reinforcer Relation and Directed Action
- Hearst, E.¹ Jenkins, H.M.²

20
- 0015889868
- The form of the autoshaped response with food or water reinforcers
- H.M. Jenkins, B.R. Moore, The form of the autoshaped response with food or water reinforcers, Journal of the Experimental Analysis of Behavior 20 (1973) 163-181.
- (1973) Journal of the Experimental Analysis of Behavior , vol.20 , pp. 163-181
- Jenkins, H.M.¹ Moore, B.R.²

21
- 0029679044
- Reinforcement learning: A survey
- L.P. Kaelbling, M.L. Littman, A.W. Moore, Reinforcement learning: A survey, Journal of Artificial Intelligence Research 4 (1996) 237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

22
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning, and teaching
- L.-J. Lin, Self-improving reactive agents based on reinforcement learning, planning, and teaching, Machine Learning 8 (1992) 293-321.
- (1992) Machine Learning , vol.8 , pp. 293-321
- Lin, L.-J.¹

23
- 0029732210
- Creating advice-taking reinforcement learners
- R. Maclin, J.W. Shavlik, Creating advice-taking reinforcement learners, Machine Learning 22 (1-3) (1996) 251-281.
- (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 251-281
- Maclin, R.¹ Shavlik, J.W.²

24
- 0010862056
- Learning efficient reactive behavioral sequences from basic reflexes in a goal-directed autonomous robot
- MIT Press, Cambridge, MA
- J. del R. Millán, Learning efficient reactive behavioral sequences from basic reflexes in a goal-directed autonomous robot, in: From Animals to Animates 3: Proceedings of the Third International Conference on Simulation of Adaptive Behavior, MIT Press, Cambridge, MA, 1994, pp. 266-274.
- (1994) From Animals to Animates 3: Proceedings of the Third International Conference on Simulation of Adaptive Behavior , pp. 266-274
- Millán, J.D.R.¹

25
- 0030171602
- Rapid, safe, and incremental learning of navigation strategies
- J. del R. Millán, Rapid, safe, and incremental learning of navigation strategies, IEEE Transactions on Systems, Man, and Cybernetics - Part B 26 (3) (1996) 408-420.
- (1996) IEEE Transactions on Systems, Man, and Cybernetics - Part B , vol.26 , Issue.3 , pp. 408-420
- Millán, J.D.R.¹

26
- 0019089514
- A model for Pavlovian learning: Variations in effectiveness of conditioned but not unconditioned stimuli
- J.M. Pearce, G. Hall, A model for Pavlovian learning: Variations in effectiveness of conditioned but not unconditioned stimuli, Psychological Review 87 (6) (1980) 532-552.
- (1980) Psychological Review , vol.87 , Issue.6 , pp. 532-552
- Pearce, J.M.¹ Hall, G.²

27
- 0023795385
- Escalation of feline predation along a gradient from avoidance through play to killing
- S.M. Pellis, D.P. O'Brien, V.C. Pellis, P. Teitelbaum, D.L. Wolgin, S. Kennedy, Escalation of feline predation along a gradient from avoidance through play to killing, Behavioral Neuroscience 102 (5) (1988) 760-777.
- (1988) Behavioral Neuroscience , vol.102 , Issue.5 , pp. 760-777
- Pellis, S.M.¹ O'Brien, D.P.² Pellis, V.C.³ Teitelbaum, P.⁴ Wolgin, D.L.⁵ Kennedy, S.⁶

28
- 30244452051
- Robot shaping - Principles, methods, and architectures
- S. Perkins, G. Hayes, Robot shaping - principles, methods, and architectures, in: Workshop on Learning in Robots and Animals, AISB'96, 1996.
- (1996) Workshop on Learning in Robots and Animals, AISB'96
- Perkins, S.¹ Hayes, G.²

29
- 0003605046
- Harper and Row, New York
- K. Pryor, Lads Before the Wind, Harper and Row, New York, 1975.
- (1975) Lads Before the Wind
- Pryor, K.¹

30
- 0026923467
- A learning rule based on empirically derived activity-dependent neuromodulation supports operant conditioning in a small network
- J.L. Raymond, D.A. Baxter, D.V. Buonomano, J.H. Byrne, A learning rule based on empirically derived activity-dependent neuromodulation supports operant conditioning in a small network, Neural Networks 5 (5) (1992) 789-803.
- (1992) Neural Networks , vol.5 , Issue.5 , pp. 789-803
- Raymond, J.L.¹ Baxter, D.A.² Buonomano, D.V.³ Byrne, J.H.⁴

31
- 0002109138
- A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
- A.H. Black, W.F. Prokasy (Eds.), Appleton-Century-Crofts, New York
- R.A. Rescorla, A.R. Wagner, A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement, in: A.H. Black, W.F. Prokasy (Eds.), Classical Conditioning II: Theory and Research, Appleton-Century-Crofts, New York, 1972.
- (1972) Classical Conditioning II: Theory and Research
- Rescorla, R.A.¹ Wagner, A.R.²

32
- 0004183870
- Scott, Foresman, Glenview, IL
- G.S. Reynolds, A Primer of Operant Conditioning, Scott, Foresman, Glenview, IL, 1968.
- (1968) A Primer of Operant Conditioning
- Reynolds, G.S.¹

33
- 0003952786
- Spartan, New York
- F. Rosenblatt, Principles of Neurodynamics, Spartan, New York, 1962.
- (1962) Principles of Neurodynamics
- Rosenblatt, F.¹

34
- 0028377745
- Structured control for autonomous robots
- R. Simmons, Structured control for autonomous robots, IEEE Transactions on Robotics and Automation 10 (1) (1994) 34-43.
- (1994) IEEE Transactions on Robotics and Automation , vol.10 , Issue.1 , pp. 34-43
- Simmons, R.¹

35
- 0030705553
- A modular architecture for office delivery robots
- February
- R. Simmons, R. Goodwin, K. Haigh, S. Koenig, J. O'Sullivan, A modular architecture for office delivery robots, in: The Proceedings of the First International Conference on Autonomous Agents, February 1997.
- (1997) The Proceedings of the First International Conference on Autonomous Agents
- Simmons, R.¹ Goodwin, R.² Haigh, K.³ Koenig, S.⁴ O'Sullivan, J.⁵

36
- 0001027894
- Transfer of learning across sequential tasks
- S.P. Singh, Transfer of learning across sequential tasks, Machine Learning 8 (1992) 323-339.
- (1992) Machine Learning , vol.8 , pp. 323-339
- Singh, S.P.¹

37
- 0019537951
- Toward a modern theory of adaptive networks: Expectation and prediction
- R.S. Sutton, A.G. Barto, Toward a modern theory of adaptive networks: Expectation and prediction, Psychological Review 88 (1981) 135-170.
- (1981) Psychological Review , vol.88 , pp. 135-170
- Sutton, R.S.¹ Barto, A.G.²

38
- 0008861422
- Two kinds of training information for evaluation function learning
- AAAI Press
- P. Utgoff, J. Clouse, Two kinds of training information for evaluation function learning, in: Proceedings of the Ninth National Conference on Artificial Intelligence (AAAI-91), AAAI Press, 1991.
- (1991) Proceedings of the Ninth National Conference on Artificial Intelligence (AAAI-91)
- Utgoff, P.¹ Clouse, J.²

39
- 0004049893
- Ph.D. thesis, Cambridge University, Cambridge, UK
- C.J.C.H. Watkins, Learning from Delayed Rewards, Ph.D. thesis, Cambridge University, Cambridge, UK, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.