메뉴 건너뛰기




Volumn 9783642323751, Issue , 2013, Pages 17-47

Intrinsic motivation and reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

MOTIVATION; ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS; REINFORCEMENT LEARNING;

EID: 84929046579     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1007/978-3-642-32375-1_2     Document Type: Chapter
Times cited : (253)

References (108)
  • 2
    • 9644288181 scopus 로고    scopus 로고
    • Learning invariant sensorimotor behaviors: A developmental approach to imitation mechanisms
    • Andry, P., Gaussier, P., Nadel, J., Hirsbrunner, B.: Learning invariant sensorimotor behaviors: A developmental approach to imitation mechanisms. Adap. Behav. 12, 117-140 (2004)
    • (2004) Adap. Behav. , vol.12 , pp. 117-140
    • Andry, P.1    Gaussier, P.2    Nadel, J.3    Hirsbrunner, B.4
  • 7
    • 0020970738 scopus 로고
    • Neuronlike elements that can solve difficult learn-ingcontrol problems
    • Barto, A.G., Sutton, R.S., Anderson, C.W.: Neuronlike elements that can solve difficult learn-ingcontrol problems. 13, 835-846 (1983). IEEE Trans. Sys. Man, Cybern.
    • (1983) IEEE Trans. Sys. Man, Cybern , vol.13 , pp. 835-846
    • Barto, A.G.1    Sutton, R.S.2    Anderson, C.W.3
  • 9
    • 85193186785 scopus 로고
    • Motivation
    • 2nd edn. Prentice-Hall, Englewood Cliffs
    • Beck, R.C.: Motivation. Theories and Principles, 2nd edn. Prentice-Hall, Englewood Cliffs (1983)
    • (1983) Theories and Principles
    • Beck, R.C.1
  • 10
    • 0347919795 scopus 로고
    • A theory of human curiosity
    • Berlyne, D.E.: A theory of human curiosity. Br. J. Psychol. 45, 180-191 (1954)
    • (1954) Br. J. Psychol. , vol.45 , pp. 180-191
    • Berlyne, D.E.1
  • 12
    • 0013931617 scopus 로고
    • Curiosity and exploration
    • Berlyne, D.E.: Curiosity and exploration. Science 143, 25-33 (1966)
    • (1966) Science , vol.143 , pp. 25-33
    • Berlyne, D.E.1
  • 15
    • 0018053135 scopus 로고
    • How adaptive behavior is produced: A perceptual-motivational alternative to response reinforcement
    • Bindra, D.: How adaptive behavior is produced: A perceptual-motivational alternative to response reinforcement. Behav. Brain Sci. 1, 41-91 (1978)
    • (1978) Behav. Brain Sci. , vol.1 , pp. 41-91
    • Bindra, D.1
  • 20
    • 0004394065 scopus 로고
    • Generalization of pattern recognition in a self-organizing system
    • Western Joint Computer Conference, Los Angeles, CA ACM, New York
    • Clark, W.A., Farley, B.G.: Generalization of pattern recognition in a self-organizing system. In: AFIPS' 55 (Western) Proceedings of the March 1-3, 1955, Western Joint Computer Conference, Los Angeles, CA, pp. 86-91, ACM, New York (1955)
    • (1955) AFIPS' 55 (Western) Proceedings of the March 1-3, 1955 , pp. 86-91
    • Clark, W.A.1    Farley, B.G.2
  • 22
    • 33645986626 scopus 로고    scopus 로고
    • Valency for adaptive homeostatic agents: Relating evolution and learning
    • Capcarrere, M.S., Freitas, A.A., Bentley, P.J., Johnson, C.G., Timmis, J. (eds.) Canterbury, UK LNAI Springer, Berlin
    • Damoulas, T., Cos-Aguilera, I., Hayes, G.M., Taylor, T.: Valency for adaptive homeostatic agents: Relating evolution and learning. In: Capcarrere, M.S., Freitas, A.A., Bentley, P.J., Johnson, C.G., Timmis, J. (eds.) Advances in Artificial Life: 8th European Conference, ECAL 2005. Canterbury, UK LNAI Vol. 3630, pp. 936-945. Springer, Berlin (2005)
    • (2005) Advances in Artificial Life: 8th European Conference, ECAL 2005 , vol.3630 , pp. 936-945
    • Damoulas, T.1    Cos-Aguilera, I.2    Hayes, G.M.3    Taylor, T.4
  • 23
    • 53849112833 scopus 로고    scopus 로고
    • The cognitive neuroscience of motivation and learning
    • Daw, N.D., Shohamy, D.: The cognitive neuroscience of motivation and learning. Soc. Cogn. 26(5), 593-620 (2008)
    • (2008) Soc. Cogn. , vol.26 , Issue.5 , pp. 593-620
    • Daw, N.D.1    Shohamy, D.2
  • 25
    • 0003631043 scopus 로고
    • Intrinsic motivation and self-determination in human behavior
    • New York
    • Deci, E.L., Ryan, R.M.: Intrinsic Motivation and Self-Determination in Human Behavior. Plenum, New York (1985)
    • (1985) Plenum
    • Deci, E.L.1    Ryan, R.M.2
  • 26
    • 0002618994 scopus 로고
    • Analysis of exploratory, manipulatory, and curiosity behaviors
    • Dember, W.N., Earl, R.W.: Analysis of exploratory, manipulatory, and curiosity behaviors. Psychol. Rev. 64, 91-96 (1957)
    • (1957) Psychol. Rev. , vol.64 , pp. 91-96
    • Dember, W.N.1    Earl, R.W.2
  • 28
    • 0043250430 scopus 로고    scopus 로고
    • The role of leaning in the operation of motivational systems
    • Gallistel, R. (ed.) 3rd edn. Learning, Motivation, and Emotion Wiley, New York
    • Dickinson, A., Balleine, B.: The role of leaning in the operation of motivational systems. In: Gallistel, R. (ed.) Handbook of Experimental Psychology, 3rd edn. Learning, Motivation, and Emotion, pp. 497-533. Wiley, New York (2002)
    • (2002) Handbook of Experimental Psychology , pp. 497-533
    • Dickinson, A.1    Balleine, B.2
  • 29
    • 55949119833 scopus 로고    scopus 로고
    • Co-evolution of shaping rewards and metaparameters in reinforcement learning
    • Elfwing, S., Uchibe, E., Doya, K., Christensen, H.I.: Co-evolution of shaping rewards and metaparameters in reinforcement learning. Adap. Behav. 16, 400-412 (2008)
    • (2008) Adap. Behav. , vol.16 , pp. 400-412
    • Elfwing, S.1    Uchibe, E.2    Doya, K.3    Christensen, H.I.4
  • 30
    • 0038809077 scopus 로고
    • Instinct and motivation as explanations of complex behavior
    • Pfaff, D.W. (ed.) Springer, New York
    • Epstein, A.: Instinct and motivation as explanations of complex behavior. In: Pfaff, D.W. (ed.) The Physiological Mechanisms of Motivation. Springer, New York (1982)
    • (1982) The Physiological Mechanisms of Motivation
    • Epstein, A.1
  • 31
    • 77952091673 scopus 로고    scopus 로고
    • Action and behavior: A free-energy formulation
    • Pubished online February 11, 2020
    • Friston, K.J., Daunizeau, J., Kilner, J., Kiebel, S.J.: Action and behavior: A free-energy formulation. Biol. Cybern. (2010). Pubished online February 11, 2020
    • (2010) Biol. Cybern.
    • Friston, K.J.1    Daunizeau, J.2    Kilner, J.3    Kiebel, S.J.4
  • 33
    • 0009762657 scopus 로고
    • Learning and satiation of response in intrinsically motivated complex puzzle performance by monkeys
    • Harlow, H.F.: Learning and satiation of response in intrinsically motivated complex puzzle performance by monkeys. J. Comp. Physiol. Psychol. 43, 289-294 (1950)
    • (1950) J. Comp. Physiol. Psychol. , vol.43 , pp. 289-294
    • Harlow, H.F.1
  • 34
  • 35
    • 84929056927 scopus 로고    scopus 로고
    • Intrinsically motivated affordance discovery and modeling
    • Baldassarre, G., Mirolli, M. (eds.) Springer, Berlin this volume
    • Hart, S., Grupen, R.: Intrinsically motivated affordance discovery and modeling. In: Baldassarre, G., Mirolli, M. (eds.) Intrinsically Motivated Learning in Natural and Artificial Systems. Springer, Berlin (2012, this volume)
    • (2012) Intrinsically Motivated Learning in Natural and Artificial Systems
    • Hart, S.1    Grupen, R.2
  • 37
    • 0002138279 scopus 로고
    • Instinct and ego during infancy
    • Hendrick, I.: Instinct and ego during infancy. Psychoanal. Quart. 11, 33-58 (1942)
    • (1942) Psychoanal. Quart. , vol.11 , pp. 33-58
    • Hendrick, I.1
  • 38
    • 68349127099 scopus 로고    scopus 로고
    • Modulated exploratory dynamics can shape self-organized behavior
    • Hesse, F., Der, R., Herrmann, M., Michael, J.: Modulated exploratory dynamics can shape self-organized behavior. Adv. Complex Syst. 12(2), 273-292 (2009)
    • (2009) Adv. Complex Syst. , vol.12 , Issue.2 , pp. 273-292
    • Hesse, F.1    Der, R.2    Herrmann, M.3    Michael, J.4
  • 43
  • 44
    • 0003900353 scopus 로고
    • Brain function and adaptive systems - A heterostatic theory
    • Air Force Cambridge Research Laboratories, Bedford. A summary appears in Proceedings of the International Conference on Systems, Man, and Cybernetics, 1974, IEEE Systems, Man, and Cybernetics Society, Dallas
    • Klopf, A.H.: Brain function and adaptive systems - A heterostatic theory. Technical report AFCRL-72-0164, Air Force Cambridge Research Laboratories, Bedford. A summary appears in Proceedings of the International Conference on Systems, Man, and Cybernetics, 1974, IEEE Systems, Man, and Cybernetics Society, Dallas (1972)
    • (1972) Technical Report AFCRL-72-0164
    • Klopf, A.H.1
  • 54
    • 0000827179 scopus 로고
    • BOXES: An experiment in adaptive control
    • Dale, E., Michie, D. (eds.) Oliver and Boyd, Edinburgh
    • Michie, D., Chambers, R.A.: BOXES: An experiment in adaptive control. In: Dale, E., Michie, D. (eds.) Machine Intelligence 2, pp. 137-152. Oliver and Boyd, Edinburgh (1968)
    • (1968) Machine Intelligence , vol.2 , pp. 137-152
    • Michie, D.1    Chambers, R.A.2
  • 56
    • 84937350040 scopus 로고
    • Steps toward artificial intelligence
    • Minsky, M.L.: Steps toward artificial intelligence. Proc. Inst. Radio Eng. 49, 8-30 (1961).
    • (1961) Proc. Inst. Radio Eng. , vol.49 , pp. 8-30
    • Minsky, M.L.1
  • 58
    • 49649154025 scopus 로고
    • Shifts in deprivations level: Different effects depending on the amount of preshift training
    • Mollenauer, S.O.: Shifts in deprivations level: Different effects depending on the amount of preshift training. Learn. Motiv. 2, 58-66 (1971)
    • (1971) Learn. Motiv. , vol.2 , pp. 58-66
    • Mollenauer, S.O.1
  • 60
    • 33645367848 scopus 로고
    • Positive reinforcement produced by electrical stimulation of septal areas and other regions of rat brain
    • Olds, J., Milner, P.: Positive reinforcement produced by electrical stimulation of septal areas and other regions of rat brain. J. Comp. Physiol. Psychol. 47, 419-427 (1954)
    • (1954) J. Comp. Physiol. Psychol. , vol.47 , pp. 419-427
    • Olds, J.1    Milner, P.2
  • 61
    • 84891105730 scopus 로고    scopus 로고
    • What is intrinsic motivation? A typology of computational approaches
    • Oudeyer, P.-Y., Kaplan, F.: What is intrinsic motivation? A typology of computational approaches. Front. Neurorobot. 1:6, doi: 10.3389/neuro.12.006.2007 (2007)
    • (2007) Front. Neurorobot , vol.1 , pp. 6
    • Oudeyer, P.-Y.1    Kaplan, F.2
  • 62
    • 34047267520 scopus 로고    scopus 로고
    • Intrinsic motivation systems for autonomous mental development
    • Oudeyer, P.-Y., Kaplan, F., Hafner, V.: Intrinsic motivation systems for autonomous mental development. IEEE Trans. Evol. Comput. 11, 265-286 (2007)
    • (2007) IEEE Trans. Evol. Comput. , vol.11 , pp. 265-286
    • Oudeyer, P.-Y.1    Kaplan, F.2    Hafner, V.3
  • 67
    • 0002109138 scopus 로고
    • A theory of pavlovian conditioning: Variationsin the effectiveness of reinforcement and nonreinforcement
    • Black, A.H., Prokasy, W.F. (eds.) Appleton-Century-Crofts, New York
    • Rescorla, R.A., Wagner, A.R.: A theory of Pavlovian conditioning: Variationsin the effectiveness of reinforcement and nonreinforcement. In: Black, A.H., Prokasy, W.F. (eds.) Classical Conditioning, Vol. II, pp. 64-99. Appleton-Century-Crofts, New York (1972)
    • (1972) Classical Conditioning , vol.2 , pp. 64-99
    • Rescorla, R.A.1    Wagner, A.R.2
  • 69
    • 0022471098 scopus 로고
    • Learning representations by back-propagating errors
    • Rumelhart, D., Hintont, G., Williams, R.: Learning representations by back-propagating errors. Nature 323 (6088), 533-536 (1986)
    • (1986) Nature , vol.323 , Issue.6088 , pp. 533-536
    • Rumelhart, D.1    Hintont, G.2    Williams, R.3
  • 70
    • 0002209063 scopus 로고    scopus 로고
    • Intrinsic and extrinsic motivations: Classic definitions and new directions
    • Ryan, R.M., Deci, E.L.: Intrinsic and extrinsic motivations: Classic definitions and new directions. Contemp. Educ. Psychol. 25, 54-67 (2000)
    • (2000) Contemp. Educ. Psychol. , vol.25 , pp. 54-67
    • Ryan, R.M.1    Deci, E.L.2
  • 71
    • 0035314842 scopus 로고    scopus 로고
    • Introduction to the evolution of preferences
    • Samuelson, L.: Introduction to the evolution of preferences. J. Econ. Theory 97, 225-230 (2001)
    • (2001) J. Econ. Theory , vol.97 , pp. 225-230
    • Samuelson, L.1
  • 72
    • 33746245586 scopus 로고    scopus 로고
    • Information, evolution, and utility
    • Samuelson, L., Swinkels, J.: Information, evolution, and utility. Theor. Econ. 1, 119-142 (2006)
    • (2006) Theor. Econ. , vol.1 , pp. 119-142
    • Samuelson, L.1    Swinkels, J.2
  • 73
    • 0034345039 scopus 로고    scopus 로고
    • Artificial motives: A review of motivation in artificial creatures
    • Savage, T.: Artificial motives: A review of motivation in artificial creatures. Connect. Sci. 12, 211-277 (2000)
    • (2000) Connect. Sci. , vol.12 , pp. 211-277
    • Savage, T.1
  • 75
    • 0344252216 scopus 로고
    • Technical report FKI-149-91, Institut für Informatik, Technische Universität München
    • Schmidhuber, J.: Adaptive confidence and adaptive curiosity. Technical report FKI-149-91, Institut für Informatik, Technische Universität München (1991a)
    • (1991) Adaptive Confidence and Adaptive Curiosity
    • Schmidhuber, J.1
  • 78
    • 84901391337 scopus 로고    scopus 로고
    • Artificial curiosity based on discovering novel algorithmic predictability through coevolution
    • IEEE
    • Schmidhuber, J.: Artificial curiosity based on discovering novel algorithmic predictability through coevolution. In: Proceedings of the Congress on Evolutionary Computation, Vol. 3, pp. 1612-1618. IEEE (1999)
    • (1999) Proceedings of the Congress on Evolutionary Computation , vol.3 , pp. 1612-1618
    • Schmidhuber, J.1
  • 79
    • 70349309538 scopus 로고    scopus 로고
    • Driven by compression progress: A simple principle explains essential aspects of subjective beauty, novelty, surprise, interestingness, attention, curiosity, creativity, art, science, music, jokes
    • Pezzulo, G., Butz, M.V., Sigaud, O., Baldassarre, G. (eds.) From Psychological Theories to Artificial Cognitive Systems Springer, Berlin
    • Schmidhuber, J.: Driven by compression progress: A simple principle explains essential aspects of subjective beauty, novelty, surprise, interestingness, attention, curiosity, creativity, art, science, music, jokes. In: Pezzulo, G., Butz, M.V., Sigaud, O., Baldassarre, G. (eds.) Anticipatory Behavior in Adaptive Learning Systems. From Psychological Theories to Artificial Cognitive Systems, pp. 48-76. Springer, Berlin (2009)
    • (2009) Anticipatory Behavior in Adaptive Learning Systems , pp. 48-76
    • Schmidhuber, J.1
  • 80
    • 0031867046 scopus 로고    scopus 로고
    • Predictive reward signal of dopamine neurons
    • Schultz, W.: Predictive reward signal of dopamine neurons. J. Neurophysiol. 80(1), 1-27 (1998)
    • (1998) J. Neurophysiol. , vol.80 , Issue.1 , pp. 1-27
    • Schultz, W.1
  • 81
    • 44249126945 scopus 로고    scopus 로고
    • Reward
    • Schultz, W.: Reward. Scholarpedia 2(3), 1652 (2007a)
    • (2007) Scholarpedia , vol.2 , Issue.3 , pp. 1652
    • Schultz, W.1
  • 82
    • 44249115494 scopus 로고    scopus 로고
    • Reward signals
    • Schultz, W.: Reward signals. Scholarpedia 2(6), 2184 (2007b)
    • (2007) Scholarpedia , vol.2 , Issue.6 , pp. 2184
    • Schultz, W.1
  • 84
    • 68949137209 scopus 로고    scopus 로고
    • Active learning literature survey
    • Computer Sciences, University of Wisconsin-Madison, Madison
    • Settles, B.: Active learning literature survey. Technical Report 1648, Computer Sciences, University of Wisconsin-Madison, Madison (2009)
    • (2009) Technical Report 1648
    • Settles, B.1
  • 87
    • 79953822184 scopus 로고    scopus 로고
    • Intrinsically motivated reinforcement learning: An evolutionary perspective
    • Special issue on Active Learning and Intrinsically Motivated Exploration in Robots: Advances and Challenges
    • Singh, S., Lewis, R.L., Barto, A.G., Sorg, J.: Intrinsically motivated reinforcement learning: An evolutionary perspective. IEEE Trans. Auton. Mental Dev. 2(2), 70-82 (2010). Special issue on Active Learning and Intrinsically Motivated Exploration in Robots: Advances and Challenges
    • (2010) IEEE Trans. Auton. Mental Dev. , vol.2 , Issue.2 , pp. 70-82
    • Singh, S.1    Lewis, R.L.2    Barto, A.G.3    Sorg, J.4
  • 92
    • 0033170372 scopus 로고    scopus 로고
    • Between mdps and semi-mdps: A framework for temporal abstraction inreinforcement learning
    • Sutton, R.S., Precup, D., Singh, S.: Between mdps and semi-mdps: A framework for temporal abstraction inreinforcement learning. Artif. Intell. 112, 181-211 (1999)
    • (1999) Artif. Intell. , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 93
    • 0000985504 scopus 로고
    • TD - Gammon, a self-teaching backgammon program, achieves master-level play
    • Tesauro, G.J.: TD - gammon, a self-teaching backgammon program, achieves master-level play. Neural Comput. 6(2), 215-219 (1994)
    • (1994) Neural Comput. , vol.6 , Issue.2 , pp. 215-219
    • Tesauro, G.J.1
  • 97
    • 0003940621 scopus 로고
    • Cambridge University Press, Cambridge (1911)
    • Toates, F.M. (1911): Motivational Systems. Cambridge University Press, Cambridge (1911)
    • (1911) Motivational Systems
    • Toates, F.M.1
  • 100
    • 56949096913 scopus 로고    scopus 로고
    • Finding intrinsic rewards by embodied evolution and constrained reinforcement learning
    • Uchibe, E., Doya, K.: Finding intrinsic rewards by embodied evolution and constrained reinforcement learning. Neural Netw. 21(10), 1447-1455 (2008)
    • (2008) Neural Netw. , vol.21 , Issue.10 , pp. 1447-1455
    • Uchibe, E.1    Doya, K.2
  • 101
    • 0000562031 scopus 로고
    • A heuristic approach to reinforcement learning control systems
    • Waltz, M.D., Fu, K.S.: A heuristic approach to reinforcement learning control systems. IEEE Transactions on Automatic Control 10, 390-398 (1965)
    • (1965) IEEE Transactions on Automatic Control , vol.10 , pp. 390-398
    • Waltz, M.D.1    Fu, K.S.2
  • 103
    • 0023169119 scopus 로고
    • Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research
    • Werbos, P.J.: Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research. IEEE Trans. Sys. Man Cybern. 17, 7-20 (1987)
    • (1987) IEEE Trans. Sys. Man Cybern. , vol.17 , pp. 7-20
    • Werbos, P.J.1
  • 104
    • 33749411161 scopus 로고
    • Motivation reconsidered: The concept of competence
    • White, R.W.: Motivation reconsidered: The concept of competence. Psychol. Rev. 66, 297-333 (1959)
    • (1959) Psychol. Rev. , vol.66 , pp. 297-333
    • White, R.W.1
  • 105
    • 0015667648 scopus 로고
    • Punish/reward: Learning with a critic in adaptive thresh-oldsystems
    • Widrow, B., Gupta, N.K., Maitra, S.: Punish/reward: Learning with a critic in adaptive thresh-oldsystems. IEEE Trans. Sys. Man Cybern. 3, 455-465 (1973)
    • (1973) IEEE Trans. Sys. Man Cybern. , vol.3 , pp. 455-465
    • Widrow, B.1    Gupta, N.K.2    Maitra, S.3
  • 106
    • 0002278965 scopus 로고
    • Adaptive switching circuits
    • Institute of Radio Engineers, New York
    • Widrow, B., Hoff, M.E.: Adaptive switching circuits. In: 1960 WESCON Convention Record Part IV, pp. 96-104. Institute of Radio Engineers, New York (1960).
    • (1960) 1960 WESCON Convention Record Part IV , pp. 96-104
    • Widrow, B.1    Hoff, M.E.2
  • 108
    • 0013865891 scopus 로고
    • Hedonic organization and regulation of behavior
    • Young, P.T.: Hedonic organization and regulation of behavior. Psychol. Rev. 73, 59-86 (1966)
    • (1966) Psychol. Rev. , vol.73 , pp. 59-86
    • Young, P.T.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.