SCOPUS 정보 검색 플랫폼

Intrinsically Motivated Learning in Natural and Artificial Systems

Volumn 9783642323751, Issue , 2013, Pages 17-47

Intrinsic motivation and reinforcement learning

(1) Barto, Andrew G a,b

a UNIVERSITY OF MASSACHUSETTS (United States)

b INSTITUTE OF COGNITIVE SCIENCES AND TECHNOLOGIES (Italy)

Author keywords

[No Author keywords available]

Indexed keywords

MOTIVATION; ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS; REINFORCEMENT LEARNING;

EXTRINSIC MOTIVATION; HIGH GRADES; INPUT CHANNELS; INTRINSIC MOTIVATION; LEARN+; LEARNING FRAMEWORKS; MACHINE LEARNING SYSTEMS; MACHINE-LEARNING; REINFORCEMENT LEARNING AGENT; REINFORCEMENT LEARNINGS;

REINFORCEMENT LEARNING; EDUCATION;

EID: 84929046579 PISSN: None EISSN: None Source Type: Book
DOI: 10.1007/978-3-642-32375-1_2 Document Type: Chapter

Times cited : (253)

References (108)

1
- 0000500817
- Interactions between learning and evolution
- Langton, C., Taylor, C., Farmer, C., Rasmussen, S. (eds.) Addison-Wesley, Reading
- Ackley, D.H., Littman, M.: Interactions between learning and evolution. In: Langton, C., Taylor, C., Farmer, C., Rasmussen, S. (eds.) Artificial Life II (Proceedings Volume X in the Santa Fe Institute Studies in the Sciences of Complexity, pp. 487-509. Addison-Wesley, Reading (1991)
- (1991) Artificial Life II (Proceedings Volume X in the Santa Fe Institute Studies in the Sciences of Complexity, Pp , pp. 487-509
- Ackley, D.H.¹ Littman, M.²

2
- 9644288181
- Learning invariant sensorimotor behaviors: A developmental approach to imitation mechanisms
- Andry, P., Gaussier, P., Nadel, J., Hirsbrunner, B.: Learning invariant sensorimotor behaviors: A developmental approach to imitation mechanisms. Adap. Behav. 12, 117-140 (2004)
- (2004) Adap. Behav. , vol.12 , pp. 117-140
- Andry, P.¹ Gaussier, P.² Nadel, J.³ Hirsbrunner, B.⁴

3
- 0004300391
- Brooks/Cole, Monterey
- Arkes, H.R., Garske, J.P.: Psychological Theories of Motivation. Brooks/Cole, Monterey (1982)
- (1982) Psychological Theories of Motivation
- Arkes, H.R.¹ Garske, J.P.²

4
- 78651520536
- Intrinsically motivated goal exploration for active motor learning in robots: A case study
- Taipei, Taiwan
- Baranes, A., Oudeyer, P.-Y.: Intrinsically motivated goal exploration for active motor learning in robots: A case study. In: Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010), Taipei, Taiwan 2010
- (2010) Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)
- Baranes, A.¹ Oudeyer, P.-Y.²

5
- 0037288370
- Recent advances in hierarchical reinforcement learning
- Barto, A.G., Mahadevan, S.: Recent advances in hierarchical reinforcement learning. Discr. Event Dynam. Syst. Theory Appl. 13, 341-379 (2003)
- (2003) Discr. Event Dynam. Syst. Theory Appl. , vol.13 , pp. 341-379
- Barto, A.G.¹ Mahadevan, S.²

6
- 33749651693
- Intrinsically motivated learning of hierarchical collections of skills
- La Jolla, CA
- Barto, A.G., Singh, S., Chentanez, N.: Intrinsically motivated learning of hierarchical collections of skills. In: Proceedings of the International Conference on Developmental Learning (ICDL), La Jolla, CA 2004
- (2004) Proceedings of the International Conference on Developmental Learning (ICDL)
- Barto, A.G.¹ Singh, S.² Chentanez, N.³

7
- 0020970738
- Neuronlike elements that can solve difficult learn-ingcontrol problems
- Barto, A.G., Sutton, R.S., Anderson, C.W.: Neuronlike elements that can solve difficult learn-ingcontrol problems. 13, 835-846 (1983). IEEE Trans. Sys. Man, Cybern.
- (1983) IEEE Trans. Sys. Man, Cybern , vol.13 , pp. 835-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

8
- 0004140522
- Reprinted MIT, Cambridge
- Reprinted in J.A. Anderson and E. Rosenfeld (eds.), Neurocomputing: Foundations of Research, pp. 535-549, MIT, Cambridge (1988)
- (1988) Neurocomputing: Foundations of Research , pp. 535-549
- Anderson, J.A.¹ Rosenfeld, E.²

9
- 85193186785
- Motivation
- 2nd edn. Prentice-Hall, Englewood Cliffs
- Beck, R.C.: Motivation. Theories and Principles, 2nd edn. Prentice-Hall, Englewood Cliffs (1983)
- (1983) Theories and Principles
- Beck, R.C.¹

10
- 0347919795
- A theory of human curiosity
- Berlyne, D.E.: A theory of human curiosity. Br. J. Psychol. 45, 180-191 (1954)
- (1954) Br. J. Psychol. , vol.45 , pp. 180-191
- Berlyne, D.E.¹

11
- 0003722409
- McGraw-Hill, New York
- Berlyne, D.E.: Conflict, Arousal., Curiosity. McGraw-Hill, New York (1960)
- (1960) Conflict, Arousal., Curiosity
- Berlyne, D.E.¹

12
- 0013931617
- Curiosity and exploration
- Berlyne, D.E.: Curiosity and exploration. Science 143, 25-33 (1966)
- (1966) Science , vol.143 , pp. 25-33
- Berlyne, D.E.¹

13
- 0004098526
- Aesthetics and psychobiology
- New York
- Berlyne, D.E.: Aesthetics and Psychobiology. Appleton-Century-Crofts, New York (1971)
- (1971) Appleton-Century-Crofts
- Berlyne, D.E.¹

14
- 0003487482
- Neuro-dynamic programming
- Belmont
- Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-Dynamic Programming. Athena Scientific, Belmont (1996)
- (1996) Athena Scientific
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

15
- 0018053135
- How adaptive behavior is produced: A perceptual-motivational alternative to response reinforcement
- Bindra, D.: How adaptive behavior is produced: A perceptual-motivational alternative to response reinforcement. Behav. Brain Sci. 1, 41-91 (1978)
- (1978) Behav. Brain Sci. , vol.1 , pp. 41-91
- Bindra, D.¹

16
- 15444375039
- Tutelage and collaboration for humanoid robots
- Breazeal, C., Brooks, A., Gray, J., Hoffman, G., Lieberman, J., Lee, H., Lockerd, A., Mulanda, D.: Tutelage and collaboration for humanoid robots. Int. J. Human. Robot. 1 (2004)
- (2004) Int. J. Human. Robot. , vol.1
- Breazeal, C.¹ Brooks, A.² Gray, J.³ Hoffman, G.⁴ Lieberman, J.⁵ Lee, H.⁶ Lockerd, A.⁷ Mulanda, D.⁸

17
- 0004113802
- Technical report
- Bush, V.: Science the endless frontier: Areport to the president. Technical report (1945)
- (1945) Science the Endless Frontier: Areport to the President
- Bush, V.¹

18
- 40949147745
- A comprehensive survey of multi-agent reinforcement learning
- Busoniu, L., Babuska, R., Schutter, B.D.: A comprehensive survey of multi-agent reinforcement learning. IEEE Trans. Syst. Man Cybern. C Appl. Rev. 38(2), 156-172 (2008)
- (2008) IEEE Trans. Syst. Man Cybern. C Appl. Rev. , vol.38 , Issue.2 , pp. 156-172
- Busoniu, L.¹ Babuska, R.² Schutter, B.D.³

19
- 0004224763
- W.W. Norton, New York
- Cannon, W.B.: The Wisdom of the Body. W.W. Norton, New York (1932)
- (1932) The Wisdom of the Body
- Cannon, W.B.¹

20
- 0004394065
- Generalization of pattern recognition in a self-organizing system
- Western Joint Computer Conference, Los Angeles, CA ACM, New York
- Clark, W.A., Farley, B.G.: Generalization of pattern recognition in a self-organizing system. In: AFIPS' 55 (Western) Proceedings of the March 1-3, 1955, Western Joint Computer Conference, Los Angeles, CA, pp. 86-91, ACM, New York (1955)
- (1955) AFIPS' 55 (Western) Proceedings of the March 1-3, 1955 , pp. 86-91
- Clark, W.A.¹ Farley, B.G.²

21
- 0004014239
- Wiley, New York
- Cofer, C.N., Appley, M.H.: Motivation: Theory and Research. Wiley, New York (1964)
- (1964) Motivation: Theory and Research
- Cofer, C.N.¹ Appley, M.H.²

22
- 33645986626
- Valency for adaptive homeostatic agents: Relating evolution and learning
- Capcarrere, M.S., Freitas, A.A., Bentley, P.J., Johnson, C.G., Timmis, J. (eds.) Canterbury, UK LNAI Springer, Berlin
- Damoulas, T., Cos-Aguilera, I., Hayes, G.M., Taylor, T.: Valency for adaptive homeostatic agents: Relating evolution and learning. In: Capcarrere, M.S., Freitas, A.A., Bentley, P.J., Johnson, C.G., Timmis, J. (eds.) Advances in Artificial Life: 8th European Conference, ECAL 2005. Canterbury, UK LNAI Vol. 3630, pp. 936-945. Springer, Berlin (2005)
- (2005) Advances in Artificial Life: 8th European Conference, ECAL 2005 , vol.3630 , pp. 936-945
- Damoulas, T.¹ Cos-Aguilera, I.² Hayes, G.M.³ Taylor, T.⁴

23
- 53849112833
- The cognitive neuroscience of motivation and learning
- Daw, N.D., Shohamy, D.: The cognitive neuroscience of motivation and learning. Soc. Cogn. 26(5), 593-620 (2008)
- (2008) Soc. Cogn. , vol.26 , Issue.5 , pp. 593-620
- Daw, N.D.¹ Shohamy, D.²

24
- 0011072195
- Motivated reinforcement learning
- Dietterich, T.G., Becker, S., Ghahramani, Z. (eds.) MIT, Cambridge
- Dayan, P.: Motivated reinforcement learning. In: Dietterich, T.G., Becker, S., Ghahramani, Z. (eds.) Advances in Neural Information Processing Systems 14: Proceedings of the 2001 Conference, pp. 11-18. MIT, Cambridge (2001)
- (2001) Advances in Neural Information Processing Systems 14: Proceedings of the 2001 Conference , pp. 11-18
- Dayan, P.¹

25
- 0003631043
- Intrinsic motivation and self-determination in human behavior
- New York
- Deci, E.L., Ryan, R.M.: Intrinsic Motivation and Self-Determination in Human Behavior. Plenum, New York (1985)
- (1985) Plenum
- Deci, E.L.¹ Ryan, R.M.²

26
- 0002618994
- Analysis of exploratory, manipulatory, and curiosity behaviors
- Dember, W.N., Earl, R.W.: Analysis of exploratory, manipulatory, and curiosity behaviors. Psychol. Rev. 64, 91-96 (1957)
- (1957) Psychol. Rev. , vol.64 , pp. 91-96
- Dember, W.N.¹ Earl, R.W.²

27
- 0008210536
- Response by rats to differential stimulus complexity
- Dember, W.N., Earl, R.W., Paradise, N.: Response by rats to differential stimulus complexity. J. Comp. Physiol. Psychol. 50, 514-518 (1957)
- (1957) J. Comp. Physiol. Psychol. , vol.50 , pp. 514-518
- Dember, W.N.¹ Earl, R.W.² Paradise, N.³

28
- 0043250430
- The role of leaning in the operation of motivational systems
- Gallistel, R. (ed.) 3rd edn. Learning, Motivation, and Emotion Wiley, New York
- Dickinson, A., Balleine, B.: The role of leaning in the operation of motivational systems. In: Gallistel, R. (ed.) Handbook of Experimental Psychology, 3rd edn. Learning, Motivation, and Emotion, pp. 497-533. Wiley, New York (2002)
- (2002) Handbook of Experimental Psychology , pp. 497-533
- Dickinson, A.¹ Balleine, B.²

29
- 55949119833
- Co-evolution of shaping rewards and metaparameters in reinforcement learning
- Elfwing, S., Uchibe, E., Doya, K., Christensen, H.I.: Co-evolution of shaping rewards and metaparameters in reinforcement learning. Adap. Behav. 16, 400-412 (2008)
- (2008) Adap. Behav. , vol.16 , pp. 400-412
- Elfwing, S.¹ Uchibe, E.² Doya, K.³ Christensen, H.I.⁴

30
- 0038809077
- Instinct and motivation as explanations of complex behavior
- Pfaff, D.W. (ed.) Springer, New York
- Epstein, A.: Instinct and motivation as explanations of complex behavior. In: Pfaff, D.W. (ed.) The Physiological Mechanisms of Motivation. Springer, New York (1982)
- (1982) The Physiological Mechanisms of Motivation
- Epstein, A.¹

31
- 77952091673
- Action and behavior: A free-energy formulation
- Pubished online February 11, 2020
- Friston, K.J., Daunizeau, J., Kilner, J., Kiebel, S.J.: Action and behavior: A free-energy formulation. Biol. Cybern. (2010). Pubished online February 11, 2020
- (2010) Biol. Cybern.
- Friston, K.J.¹ Daunizeau, J.² Kilner, J.³ Kiebel, S.J.⁴

32
- 0004425698
- D. Appleton, New York
- Groos, K.: The Play of Man. D. Appleton, New York (1901)
- (1901) The Play of Man
- Groos, K.¹

33
- 0009762657
- Learning and satiation of response in intrinsically motivated complex puzzle performance by monkeys
- Harlow, H.F.: Learning and satiation of response in intrinsically motivated complex puzzle performance by monkeys. J. Comp. Physiol. Psychol. 43, 289-294 (1950)
- (1950) J. Comp. Physiol. Psychol. , vol.43 , pp. 289-294
- Harlow, H.F.¹

34
- 0000371369
- Learning motivated by a manipulation drive
- Harlow, H.F., Harlow, M.K., Meyer, D.R.: Learning motivated by a manipulation drive. J. Exp. Psychol. 40, 228-234 (1950)
- (1950) J. Exp. Psychol. , vol.40 , pp. 228-234
- Harlow, H.F.¹ Harlow, M.K.² Meyer, D.R.³

35
- 84929056927
- Intrinsically motivated affordance discovery and modeling
- Baldassarre, G., Mirolli, M. (eds.) Springer, Berlin this volume
- Hart, S., Grupen, R.: Intrinsically motivated affordance discovery and modeling. In: Baldassarre, G., Mirolli, M. (eds.) Intrinsically Motivated Learning in Natural and Artificial Systems. Springer, Berlin (2012, this volume)
- (2012) Intrinsically Motivated Learning in Natural and Artificial Systems
- Hart, S.¹ Grupen, R.²

36
- 0004230131
- Wiley, New York
- Hebb, D.O.: The Organization of Behavior. Wiley, New York (1949)
- (1949) The Organization of Behavior
- Hebb, D.O.¹

37
- 0002138279
- Instinct and ego during infancy
- Hendrick, I.: Instinct and ego during infancy. Psychoanal. Quart. 11, 33-58 (1942)
- (1942) Psychoanal. Quart. , vol.11 , pp. 33-58
- Hendrick, I.¹

38
- 68349127099
- Modulated exploratory dynamics can shape self-organized behavior
- Hesse, F., Der, R., Herrmann, M., Michael, J.: Modulated exploratory dynamics can shape self-organized behavior. Adv. Complex Syst. 12(2), 273-292 (2009)
- (2009) Adv. Complex Syst. , vol.12 , Issue.2 , pp. 273-292
- Hesse, F.¹ Der, R.² Herrmann, M.³ Michael, J.⁴

39
- 0004129335
- D. Appleton-Century, New York
- Hull, C.L.: Principles of Behavior. D. Appleton-Century, New York (1943)
- (1943) Principles of Behavior
- Hull, C.L.¹

40
- 0004271760
- Yale University Press, New Haven
- Hull, C.L.: Essentials of Behavior. Yale University Press, New Haven (1951)
- (1951) Essentials of Behavior
- Hull, C.L.¹

41
- 0003752169
- Yale University Press, New Haven
- Hull, C.L.: A Behavior System: An Introduction to Behavior Theory Concerning the Individual Organism. Yale University Press, New Haven (1952)
- (1952) A Behavior System: An Introduction to Behavior Theory Concerning the Individual Organism
- Hull, C.L.¹

42
- 0004177606
- Appleton-Century-Crofts, Inc., New York
- Kimble, G.A.: Hilgard and Marquis' Conditioning and Learning. Appleton-Century-Crofts, Inc., New York (1961)
- (1961) Hilgard and Marquis' Conditioning and Learning
- Kimble, G.A.¹

43
- 85193189757
- Motivation
- McGraw-Hill, New York
- Klein, S.B.: Motivation. Biosocial Approaches. McGraw-Hill, New York (1982)
- (1982) Biosocial Approaches
- Klein, S.B.¹

44
- 0003900353
- Brain function and adaptive systems - A heterostatic theory
- Air Force Cambridge Research Laboratories, Bedford. A summary appears in Proceedings of the International Conference on Systems, Man, and Cybernetics, 1974, IEEE Systems, Man, and Cybernetics Society, Dallas
- Klopf, A.H.: Brain function and adaptive systems - A heterostatic theory. Technical report AFCRL-72-0164, Air Force Cambridge Research Laboratories, Bedford. A summary appears in Proceedings of the International Conference on Systems, Man, and Cybernetics, 1974, IEEE Systems, Man, and Cybernetics Society, Dallas (1972)
- (1972) Technical Report AFCRL-72-0164
- Klopf, A.H.¹

45
- 0003607885
- Hemisphere, Washington
- Klopf, A.H.: The Hedonistic Neuron: A Theory of Memory, Learning, and Intelligence. Hemisphere, Washington (1982)
- (1982) The Hedonistic Neuron: A Theory of Memory, Learning, and Intelligence
- Klopf, A.H.¹

46
- 0011633220
- Ph.D. Thesis, Stanford University
- Lenat, D.B.: AM: An artificial intelligence approach to discovery in mathematics. Ph.D. Thesis, Stanford University (1976)
- (1976) AM: An Artificial Intelligence Approach to Discovery in Mathematics
- Lenat, D.B.¹

47
- 80053135557
- Viking, New York
- Linden, D.J.: The Compass of Pleasure: How Our Brains Make Fatty Foods, Orgasm, Exercise, Marijuana, Generosity, Vodka, Learning, and Gambling Feel So Good. Viking, New York (2011)
- (2011) The Compass of Pleasure: How our Brains Make Fatty Foods, Orgasm, Exercise, Marijuana, Generosity, Vodka, Learning, and Gambling Feel So Good
- Linden, D.J.¹

48
- 33747270089
- Adaptation in constant utility nonstationary environments
- San Diego, CA
- Littman, M.L., Ackley, D.H.: Adaptation in constant utility nonstationary environments. In: Proceedings of the Fourth International Conference on Genetic Algorithms, San Diego, CA pp. 136-142 (1991)
- (1991) Proceedings of the Fourth International Conference on Genetic Algorithms , pp. 136-142
- Littman, M.L.¹ Ackley, D.H.²

49
- 1842783360
- Developmental robotics: A survey
- Lungarella, M., Metta, G., Pfeiffer, R., Sandini, G.: Developmental robotics: A survey. Connect. Sci. 15, 151-190 (2003)
- (2003) Connect. Sci. , vol.15 , pp. 151-190
- Lungarella, M.¹ Metta, G.² Pfeiffer, R.³ Sandini, G.⁴

50
- 0004282622
- Oxford University Press, New York
- Mackintosh, N.J.: Conditioning and Associative Learning. Oxford University Press, New York (1983)
- (1983) Conditioning and Associative Learning
- Mackintosh, N.J.¹

51
- 0003649697
- MIT, Cambridge
- McFarland, D., Bösser, T.: Intelligent Behavior in Animals and Robots. MIT, Cambridge (1993)
- (1993) Intelligent Behavior in Animals and Robots
- McFarland, D.¹ Bösser, T.²

52
- 0003799456
- Academic, New York
- Mendel, J.M., Fu, K.S. (eds.): Adaptive, Learning, and Pattern Recognition Systems: Theory and Applications. Academic, New York (1970)
- (1970) Adaptive, Learning, and Pattern Recognition Systems: Theory and Applications
- Mendel, J.M.¹ Fu, K.S.²

53
- 77956759998
- Reinforcement learning control and pattern recognition systems
- Mendel, J.M., Fu, K.S. (eds.) Academic, New York
- Mendel, J.M., McLaren, R.W.: Reinforcement learning control and pattern recognition systems. In: Mendel, J.M., Fu, K.S. (eds.) Adaptive, Learning and Pattern Recognition Systems: Theory and Applications, pp. 287-318. Academic, New York (1970)
- (1970) Adaptive, Learning and Pattern Recognition Systems: Theory and Applications , pp. 287-318
- Mendel, J.M.¹ McLaren, R.W.²

54
- 0000827179
- BOXES: An experiment in adaptive control
- Dale, E., Michie, D. (eds.) Oliver and Boyd, Edinburgh
- Michie, D., Chambers, R.A.: BOXES: An experiment in adaptive control. In: Dale, E., Michie, D. (eds.) Machine Intelligence 2, pp. 137-152. Oliver and Boyd, Edinburgh (1968)
- (1968) Machine Intelligence , vol.2 , pp. 137-152
- Michie, D.¹ Chambers, R.A.²

55
- 0013500961
- Ph.D. Thesis, Princeton University
- Minsky, M.L.: Theory of neural-analog reinforcement systems and its application to the brain-model problem. Ph.D. Thesis, Princeton University (1954)
- (1954) Theory of Neural-analog Reinforcement Systems and its Application to the Brain-model Problem
- Minsky, M.L.¹

56
- 84937350040
- Steps toward artificial intelligence
- Minsky, M.L.: Steps toward artificial intelligence. Proc. Inst. Radio Eng. 49, 8-30 (1961).
- (1961) Proc. Inst. Radio Eng. , vol.49 , pp. 8-30
- Minsky, M.L.¹

57
- 0004242550
- Reprinted McGraw-Hill, New York
- Reprinted in E.A. Feigenbaum and J. Feldman (eds.) Computers and Thought, pp. 406-450. McGraw-Hill, New York (1963)
- (1963) Computers and Thought , pp. 406-450
- Feigenbaum, E.A.¹ Feldman, J.²

58
- 49649154025
- Shifts in deprivations level: Different effects depending on the amount of preshift training
- Mollenauer, S.O.: Shifts in deprivations level: Different effects depending on the amount of preshift training. Learn. Motiv. 2, 58-66 (1971)
- (1971) Learn. Motiv. , vol.2 , pp. 58-66
- Mollenauer, S.O.¹

59
- 0003891507
- Prentice Hall, Englewood Cliffs
- Narendra, K., Thathachar, M.A.L.: Learning Automata: An Introduction. Prentice Hall, Englewood Cliffs (1989)
- (1989) Learning Automata: An Introduction
- Narendra, K.¹ Thathachar, M.A.L.²

60
- 33645367848
- Positive reinforcement produced by electrical stimulation of septal areas and other regions of rat brain
- Olds, J., Milner, P.: Positive reinforcement produced by electrical stimulation of septal areas and other regions of rat brain. J. Comp. Physiol. Psychol. 47, 419-427 (1954)
- (1954) J. Comp. Physiol. Psychol. , vol.47 , pp. 419-427
- Olds, J.¹ Milner, P.²

61
- 84891105730
- What is intrinsic motivation? A typology of computational approaches
- Oudeyer, P.-Y., Kaplan, F.: What is intrinsic motivation? A typology of computational approaches. Front. Neurorobot. 1:6, doi: 10.3389/neuro.12.006.2007 (2007)
- (2007) Front. Neurorobot , vol.1 , pp. 6
- Oudeyer, P.-Y.¹ Kaplan, F.²

62
- 34047267520
- Intrinsic motivation systems for autonomous mental development
- Oudeyer, P.-Y., Kaplan, F., Hafner, V.: Intrinsic motivation systems for autonomous mental development. IEEE Trans. Evol. Comput. 11, 265-286 (2007)
- (2007) IEEE Trans. Evol. Comput. , vol.11 , pp. 265-286
- Oudeyer, P.-Y.¹ Kaplan, F.² Hafner, V.³

63
- 0040259473
- Wadsworth Publishing Company, Belmont
- Petri, H.L.: Motivation: Theory and Research. Wadsworth Publishing Company, Belmont (1981)
- (1981) Motivation: Theory and Research
- Petri, H.L.¹

64
- 0004238774
- Norton, New York
- Piaget, J.: The Origins of Intelligence in Children. Norton, New York (1952)
- (1952) The Origins of Intelligence in Children
- Piaget, J.¹

65
- 0003959340
- MIT, Cambridge
- Picard, R.W.: Affective Computing. MIT, Cambridge (1997)
- (1997) Affective Computing
- Picard, R.W.¹

66
- 85193185735
- Lund University Cognitive Studies Lund University, Lund
- Prince, C.G., Demiris, Y., Marom, Y., Kozima, H., Balkenius, C. (eds.): Proceedings of the Second International Workshop on Epigenetic Robotics: Modeling Cognitive Development in Robotic Systems. Lund University Cognitive Studies, Vol. 94. Lund University, Lund (2001)
- (2001) Proceedings of the Second International Workshop on Epigenetic Robotics: Modeling Cognitive Development in Robotic Systems , vol.94
- Prince, C.G.¹ Demiris, Y.² Marom, Y.³ Kozima, H.⁴ Balkenius, C.⁵

67
- 0002109138
- A theory of pavlovian conditioning: Variationsin the effectiveness of reinforcement and nonreinforcement
- Black, A.H., Prokasy, W.F. (eds.) Appleton-Century-Crofts, New York
- Rescorla, R.A., Wagner, A.R.: A theory of Pavlovian conditioning: Variationsin the effectiveness of reinforcement and nonreinforcement. In: Black, A.H., Prokasy, W.F. (eds.) Classical Conditioning, Vol. II, pp. 64-99. Appleton-Century-Crofts, New York (1972)
- (1972) Classical Conditioning , vol.2 , pp. 64-99
- Rescorla, R.A.¹ Wagner, A.R.²

68
- 0003952786
- Spartan Books, Washington
- Rosenblatt, F.: Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms. Spartan Books, Washington (1962)
- (1962) Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms
- Rosenblatt, F.¹

69
- 0022471098
- Learning representations by back-propagating errors
- Rumelhart, D., Hintont, G., Williams, R.: Learning representations by back-propagating errors. Nature 323 (6088), 533-536 (1986)
- (1986) Nature , vol.323 , Issue.6088 , pp. 533-536
- Rumelhart, D.¹ Hintont, G.² Williams, R.³

70
- 0002209063
- Intrinsic and extrinsic motivations: Classic definitions and new directions
- Ryan, R.M., Deci, E.L.: Intrinsic and extrinsic motivations: Classic definitions and new directions. Contemp. Educ. Psychol. 25, 54-67 (2000)
- (2000) Contemp. Educ. Psychol. , vol.25 , pp. 54-67
- Ryan, R.M.¹ Deci, E.L.²

71
- 0035314842
- Introduction to the evolution of preferences
- Samuelson, L.: Introduction to the evolution of preferences. J. Econ. Theory 97, 225-230 (2001)
- (2001) J. Econ. Theory , vol.97 , pp. 225-230
- Samuelson, L.¹

72
- 33746245586
- Information, evolution, and utility
- Samuelson, L., Swinkels, J.: Information, evolution, and utility. Theor. Econ. 1, 119-142 (2006)
- (2006) Theor. Econ. , vol.1 , pp. 119-142
- Samuelson, L.¹ Swinkels, J.²

73
- 0034345039
- Artificial motives: A review of motivation in artificial creatures
- Savage, T.: Artificial motives: A review of motivation in artificial creatures. Connect. Sci. 12, 211-277 (2000)
- (2000) Connect. Sci. , vol.12 , pp. 211-277
- Savage, T.¹

74
- 50849094213
- Evolving internal reinforcers for an intrinsically motivated reinforcement-learning robot
- Imperial College, London
- Schembri, M., Mirolli, M., Baldassarre, G.: Evolving internal reinforcers for an intrinsically motivated reinforcement-learning robot. In: Proceedings of the 6th International Conference on Development and Learning (ICDL2007), Imperial College, London 2007
- (2007) Proceedings of the 6th International Conference on Development and Learning (ICDL2007)
- Schembri, M.¹ Mirolli, M.² Baldassarre, G.³

75
- 0344252216
- Technical report FKI-149-91, Institut für Informatik, Technische Universität München
- Schmidhuber, J.: Adaptive confidence and adaptive curiosity. Technical report FKI-149-91, Institut für Informatik, Technische Universität München (1991a)
- (1991) Adaptive Confidence and Adaptive Curiosity
- Schmidhuber, J.¹

76
- 2442467081
- A possibility for implementing curiosity and boredom in model-building neural controllers
- MIT, Cambridge
- Schmidhuber, J.: A possibility for implementing curiosity and boredom in model-building neural controllers. In: From Animals to Animats: Proceedings of the First International Conference on Simulation of Adaptive Behavior, pp. 222-227. MIT, Cambridge (1991b)
- (1991) From Animals to Animats: Proceedings of the First International Conference on Simulation of Adaptive Behavior , pp. 222-227
- Schmidhuber, J.¹

77
- 0010820937
- What's interesting?
- IDSIA, Lugano
- Schmidhuber, J.: What's interesting? Technical report TR-35-97. IDSIA, Lugano (1997)
- (1997) Technical Report TR-35-97
- Schmidhuber, J.¹

78
- 84901391337
- Artificial curiosity based on discovering novel algorithmic predictability through coevolution
- IEEE
- Schmidhuber, J.: Artificial curiosity based on discovering novel algorithmic predictability through coevolution. In: Proceedings of the Congress on Evolutionary Computation, Vol. 3, pp. 1612-1618. IEEE (1999)
- (1999) Proceedings of the Congress on Evolutionary Computation , vol.3 , pp. 1612-1618
- Schmidhuber, J.¹

79
- 70349309538
- Driven by compression progress: A simple principle explains essential aspects of subjective beauty, novelty, surprise, interestingness, attention, curiosity, creativity, art, science, music, jokes
- Pezzulo, G., Butz, M.V., Sigaud, O., Baldassarre, G. (eds.) From Psychological Theories to Artificial Cognitive Systems Springer, Berlin
- Schmidhuber, J.: Driven by compression progress: A simple principle explains essential aspects of subjective beauty, novelty, surprise, interestingness, attention, curiosity, creativity, art, science, music, jokes. In: Pezzulo, G., Butz, M.V., Sigaud, O., Baldassarre, G. (eds.) Anticipatory Behavior in Adaptive Learning Systems. From Psychological Theories to Artificial Cognitive Systems, pp. 48-76. Springer, Berlin (2009)
- (2009) Anticipatory Behavior in Adaptive Learning Systems , pp. 48-76
- Schmidhuber, J.¹

80
- 0031867046
- Predictive reward signal of dopamine neurons
- Schultz, W.: Predictive reward signal of dopamine neurons. J. Neurophysiol. 80(1), 1-27 (1998)
- (1998) J. Neurophysiol. , vol.80 , Issue.1 , pp. 1-27
- Schultz, W.¹

81
- 44249126945
- Reward
- Schultz, W.: Reward. Scholarpedia 2(3), 1652 (2007a)
- (2007) Scholarpedia , vol.2 , Issue.3 , pp. 1652
- Schultz, W.¹

82
- 44249115494
- Reward signals
- Schultz, W.: Reward signals. Scholarpedia 2(6), 2184 (2007b)
- (2007) Scholarpedia , vol.2 , Issue.6 , pp. 2184
- Schultz, W.¹

83
- 52149094695
- Learning novel domains through curiosity and conjecture
- Sridharan, N.S. (ed.) Detroit, MI Morgan Kaufmann, San Francisco
- Scott, P.D., Markovitch, S.: Learning novel domains through curiosity and conjecture. In: Sridharan, N.S. (ed.) Proceedings of the 11th International Joint Conference on Artificial Intelligence, Detroit, MI pp. 669-674. Morgan Kaufmann, San Francisco (1989)
- (1989) Proceedings of the 11th International Joint Conference on Artificial Intelligence , pp. 669-674
- Scott, P.D.¹ Markovitch, S.²

84
- 68949137209
- Active learning literature survey
- Computer Sciences, University of Wisconsin-Madison, Madison
- Settles, B.: Active learning literature survey. Technical Report 1648, Computer Sciences, University of Wisconsin-Madison, Madison (2009)
- (2009) Technical Report 1648
- Settles, B.¹

85
- 84899031920
- Intrinsically motivated reinforcement learning
- MIT, Cambridge
- Singh, S., Barto, A.G., Chentanez, N.: Intrinsically motivated reinforcement learning. In: Advances in Neural Information Processing Systems 17: Proceedings of the 2004 Conference. MIT, Cambridge (2005)
- (2005) Advances in Neural Information Processing Systems 17: Proceedings of the 2004 Conference
- Singh, S.¹ Barto, A.G.² Chentanez, N.³

86
- 77955909363
- Where do rewards come from?
- Taatgen, N., van Rijn, H. (eds.) Cognitive Science Society
- Singh, S., Lewis, R.L., Barto, A.G.: Where do rewards come from? In: Taatgen, N., van Rijn, H. (eds.) Proceedings of the 31st Annual Conference of the Cognitive Science Society, Amsterdam pp. 2601-2606. Cognitive Science Society (2009)
- (2009) Proceedings of the 31st Annual Conference of the Cognitive Science Society, Amsterdam , pp. 2601-2606
- Singh, S.¹ Lewis, R.L.² Barto, A.G.³

87
- 79953822184
- Intrinsically motivated reinforcement learning: An evolutionary perspective
- Special issue on Active Learning and Intrinsically Motivated Exploration in Robots: Advances and Challenges
- Singh, S., Lewis, R.L., Barto, A.G., Sorg, J.: Intrinsically motivated reinforcement learning: An evolutionary perspective. IEEE Trans. Auton. Mental Dev. 2(2), 70-82 (2010). Special issue on Active Learning and Intrinsically Motivated Exploration in Robots: Advances and Challenges
- (2010) IEEE Trans. Auton. Mental Dev. , vol.2 , Issue.2 , pp. 70-82
- Singh, S.¹ Lewis, R.L.² Barto, A.G.³ Sorg, J.⁴

88
- 54249130303
- Evolution of valence systems in an unstable environment
- Osaka, M. Asada, J.C. Hallam, J.-A. Meyer (Eds.)
- Snel, M., Hayes, G.M.: Evolution of valence systems in an unstable environment. In: Proceedings of the 10th International Conference on Simulation of Adaptive Behavior: From Animals to Animats, Osaka, M. Asada, J.C. Hallam, J.-A. Meyer (Eds.) pp. 12-21 (2008)
- (2008) Proceedings of the 10th International Conference on Simulation of Adaptive Behavior: From Animals to Animats , pp. 12-21
- Snel, M.¹ Hayes, G.M.²

89
- 77956525933
- Internal rewards mitigate agent boundedness
- Fürnkranz, J., Joachims, T. (eds.)
- Sorg, J., Singh, S., Lewis, R.L.: Internal rewards mitigate agent boundedness. In: Fürnkranz, J., Joachims, T. (eds.) Proceedings of the 27th International Conference on Machine Learning, Haifa, Israel, Omnipress pp. 1007-1014 (2010)
- (2010) Proceedings of the 27th International Conference on Machine Learning, Haifa, Israel, Omnipress , pp. 1007-1014
- Sorg, J.¹ Singh, S.² Lewis, R.L.³

90
- 0011200414
- Reinforcement learning architectures for animats
- J.-A. Meyer, S.W. Wilson (Eds.) MIT, Cambridge
- Sutton, R.S.: Reinforcement learning architectures for animats. In: From Animals to Animats: Proceedings of the First International Conference on Simulation of Adaptive Behavior, J.-A. Meyer, S.W. Wilson (Eds.) pp. 288-296. MIT, Cambridge (1991)
- (1991) From Animals to Animats: Proceedings of the First International Conference on Simulation of Adaptive Behavior , pp. 288-296
- Sutton, R.S.¹

91
- 0004102479
- MIT, Cambridge
- Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT, Cambridge (1998)
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

92
- 0033170372
- Between mdps and semi-mdps: A framework for temporal abstraction inreinforcement learning
- Sutton, R.S., Precup, D., Singh, S.: Between mdps and semi-mdps: A framework for temporal abstraction inreinforcement learning. Artif. Intell. 112, 181-211 (1999)
- (1999) Artif. Intell. , vol.112 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.³

93
- 0000985504
- TD - Gammon, a self-teaching backgammon program, achieves master-level play
- Tesauro, G.J.: TD - gammon, a self-teaching backgammon program, achieves master-level play. Neural Comput. 6(2), 215-219 (1994)
- (1994) Neural Comput. , vol.6 , Issue.2 , pp. 215-219
- Tesauro, G.J.¹

94
- 36348961529
- Transparency and socially guided machine learning
- Bloomington, IN
- Thomaz, A.L., Breazeal, C.: Transparency and socially guided machine learning. In: Proceedings of the 5th International Conference on Developmental Learning (ICDL) Bloomington, IN (2006)
- (2006) Proceedings of the 5th International Conference on Developmental Learning (ICDL)
- Thomaz, A.L.¹ Breazeal, C.²

95
- 33745835458
- Experiments in socially guided machine learning: Understanding how humans teach
- Salt Lake City, UT
- Thomaz, A.L., Hoffman, G., Breazeal, C.: Experiments in socially guided machine learning: Understanding how humans teach. In: Proceedings of the 1st Annual conference on HumanRobot Interaction (HRI) Salt Lake City, UT (2006)
- (2006) Proceedings of the 1st Annual Conference on HumanRobot Interaction (HRI)
- Thomaz, A.L.¹ Hoffman, G.² Breazeal, C.³

96
- 0003998491
- Hafner, Darien
- Thorndike, E.L.: Animal Intelligence. Hafner, Darien (1911)
- (1911) Animal Intelligence
- Thorndike, E.L.¹

97
- 0003940621
- Cambridge University Press, Cambridge (1911)
- Toates, F.M. (1911): Motivational Systems. Cambridge University Press, Cambridge (1911)
- (1911) Motivational Systems
- Toates, F.M.¹

98
- 0003649763
- Naiburg, New York
- Tolman, E.C.: Purposive Behavior in Animals and Men. Naiburg, New York (1932)
- (1932) Purposive Behavior in Animals and Men
- Tolman, E.C.¹

99
- 0012526681
- MIT, Cambridge
- Trappl, R., Petta, P., Payr, S. (eds.): Emotions in Humans and Artifacts. MIT, Cambridge (1997)
- (1997) Emotions in Humans and Artifacts
- Trappl, R.¹ Petta, P.² Payr, S.³

100
- 56949096913
- Finding intrinsic rewards by embodied evolution and constrained reinforcement learning
- Uchibe, E., Doya, K.: Finding intrinsic rewards by embodied evolution and constrained reinforcement learning. Neural Netw. 21(10), 1447-1455 (2008)
- (2008) Neural Netw. , vol.21 , Issue.10 , pp. 1447-1455
- Uchibe, E.¹ Doya, K.²

101
- 0000562031
- A heuristic approach to reinforcement learning control systems
- Waltz, M.D., Fu, K.S.: A heuristic approach to reinforcement learning control systems. IEEE Transactions on Automatic Control 10, 390-398 (1965)
- (1965) IEEE Transactions on Automatic Control , vol.10 , pp. 390-398
- Waltz, M.D.¹ Fu, K.S.²

102
- 0035951444
- Autonomous mental development by robots and animals
- Weng, J., McClelland, J., Pentland, A., Sporns, O., Stockman, I., Sur, M., Thelen, E.: Autonomous mental development by robots and animals. Science 291, 599-600 (2001)
- (2001) Science , vol.291 , pp. 599-600
- Weng, J.¹ McClelland, J.² Pentland, A.³ Sporns, O.⁴ Stockman, I.⁵ Sur, M.⁶ Thelen, E.⁷

103
- 0023169119
- Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research
- Werbos, P.J.: Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research. IEEE Trans. Sys. Man Cybern. 17, 7-20 (1987)
- (1987) IEEE Trans. Sys. Man Cybern. , vol.17 , pp. 7-20
- Werbos, P.J.¹

104
- 33749411161
- Motivation reconsidered: The concept of competence
- White, R.W.: Motivation reconsidered: The concept of competence. Psychol. Rev. 66, 297-333 (1959)
- (1959) Psychol. Rev. , vol.66 , pp. 297-333
- White, R.W.¹

105
- 0015667648
- Punish/reward: Learning with a critic in adaptive thresh-oldsystems
- Widrow, B., Gupta, N.K., Maitra, S.: Punish/reward: Learning with a critic in adaptive thresh-oldsystems. IEEE Trans. Sys. Man Cybern. 3, 455-465 (1973)
- (1973) IEEE Trans. Sys. Man Cybern. , vol.3 , pp. 455-465
- Widrow, B.¹ Gupta, N.K.² Maitra, S.³

106
- 0002278965
- Adaptive switching circuits
- Institute of Radio Engineers, New York
- Widrow, B., Hoff, M.E.: Adaptive switching circuits. In: 1960 WESCON Convention Record Part IV, pp. 96-104. Institute of Radio Engineers, New York (1960).
- (1960) 1960 WESCON Convention Record Part IV , pp. 96-104
- Widrow, B.¹ Hoff, M.E.²

107
- 0004140522
- Reprinted MIT, Cambridge
- Reprinted in J.A. Anderson and E. Rosenfeld, Neurocomputing: Foundations of Research, pp. 126-134. MIT, Cambridge (1988)
- (1988) Neurocomputing: Foundations of Research , pp. 126-134
- Anderson, J.A.¹ Rosenfeld, E.²

108
- 0013865891
- Hedonic organization and regulation of behavior
- Young, P.T.: Hedonic organization and regulation of behavior. Psychol. Rev. 73, 59-86 (1966)
- (1966) Psychol. Rev. , vol.73 , pp. 59-86
- Young, P.T.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.