Volume 39, 2013, Pages 40-51

Phasic dopamine as a prediction error of intrinsic and extrinsic reinforcements driving both action acquisition and reward maximization: A simulated robotic study

Author keywords

Actor critic; Computational model; Intrinsic motivation; Phasic dopamine; Reinforcement learning; TD learning

Indexed keywords

ACTOR CRITIC; COMPUTATIONAL MODEL; INTRINSIC MOTIVATION; PHASIC DOPAMINE; TD-LEARNING

EID: 84872777521     PISSN: 08936080     EISSN: 18792782     Source Type: Journal    
DOI: 10.1016/j.neunet.2012.12.012     Document Type: Article
Times cited: 30

References (104)
  • 1
    • 80054971108
    • What are intrinsic motivations? A biological perspective
    • IEEE, New York, A. Cangelosi, J. Triesch, I. Fasel, K. Rohlfing, F. Nori, P.Y. Oudeyer, M. Schlesinger, Y. Nagai (Eds.)
    • Baldassarre G. What are intrinsic motivations? A biological perspective. Proceedings of the international conference on development and learning and epigenetic robotics 2011, E1-E8. IEEE, New York. A. Cangelosi, J. Triesch, I. Fasel, K. Rohlfing, F. Nori, P.Y. Oudeyer, M. Schlesinger, Y. Nagai (Eds.).
    • (2011) Proceedings of the international conference on development and learning and epigenetic robotics
    • Baldassarre, G.1
  • 3
    • 0000541213
    • Adaptive critics and the basal ganglia
    • MIT Press, Cambridge, MA, J. Houk, J. Davis, D. Beiser (Eds.)
    • Barto A. Adaptive critics and the basal ganglia. Models of information processing in the basal ganglia 1995, 215-232. MIT Press, Cambridge, MA. J. Houk, J. Davis, D. Beiser (Eds.).
    • (1995) Models of information processing in the basal ganglia , pp. 215-232
    • Barto, A.1
  • 7
    • 21544435722
    • Midbrain dopamine neurons encode a quantitative reward prediction error signal
    • Bayer H.M., Glimcher P.W. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 2005, 47:129-141.
    • (2005) Neuron , vol.47 , pp. 129-141
    • Bayer, H.M.1    Glimcher, P.W.2
  • 9
    • 33847634405
    • The debate over dopamine's role in reward: the case for incentive salience
    • Berridge K. The debate over dopamine's role in reward: the case for incentive salience. Psychopharmacology 2007, 191:391-431.
    • (2007) Psychopharmacology , vol.191 , pp. 391-431
    • Berridge, K.1
  • 10
    • 34247469020
    • Dopamine neuron systems in the brain: an update
    • Bjorklund J., Dunnett S. Dopamine neuron systems in the brain: an update. Trends in Neurosciences 2007, 30:194-202.
    • (2007) Trends in Neurosciences , vol.30 , pp. 194-202
    • Bjorklund, J.1    Dunnett, S.2
  • 11
    • 0011879293
    • Discrimination learning by rhesus monkeys to visual-exploration motivation
    • Butler R.A. Discrimination learning by rhesus monkeys to visual-exploration motivation. Journal of Comparative and Physiological Psychology 1953, 46:95-98.
    • (1953) Journal of Comparative and Physiological Psychology , vol.46 , pp. 95-98
    • Butler, R.A.1
  • 12
    • 19544380277
    • Discrimination learning and learning sets to visual exploration incentives
    • Butler R.A., Harlow H.F. Discrimination learning and learning sets to visual exploration incentives. The Journal of Genetic Psychology 1957, 57:257-264.
    • (1957) The Journal of Genetic Psychology , vol.57 , pp. 257-264
    • Butler, R.A.1    Harlow, H.F.2
  • 13
  • 14
    • 84933876635
    • Scopolamine, amphetamine and light-reinforced responding
    • Carlton P.L. Scopolamine, amphetamine and light-reinforced responding. Psychonomic Science 1966, 5:347-348.
    • (1966) Psychonomic Science , vol.5 , pp. 347-348
    • Carlton, P.L.1
  • 15
    • 0018891034
    • Sensory stimuli alter the discharge rate of dopamine (DA) neurons: evidence for two functional types of DA cells in the substantia nigra
    • Chiodo L.A., Antelman S.M., Caggiula A.R., Lineberry C.G. Sensory stimuli alter the discharge rate of dopamine (DA) neurons: evidence for two functional types of DA cells in the substantia nigra. Brain Research 1980, 189:544-549.
    • (1980) Brain Research , vol.189 , pp. 544-549
    • Chiodo, L.A.1    Antelman, S.M.2    Caggiula, A.R.3    Lineberry, C.G.4
  • 17
    • 28044450875
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw N.D., Niv Y., Dayan P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nature Neuroscience 2005, 8:1704-1711.
    • (2005) Nature Neuroscience , vol.8 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 18
    • 0030260201
    • Exploration bonuses and dual control
    • Dayan P., Sejnowski T. Exploration bonuses and dual control. Machine Learning 1996, 25:5-22.
    • (1996) Machine Learning , vol.25 , pp. 5-22
    • Dayan, P.1    Sejnowski, T.2
  • 21
    • 0033629916
    • Reinforcement learning in continuous time and space
    • Doya K. Reinforcement learning in continuous time and space. Neural Computation 2000, 12:219-245.
    • (2000) Neural Computation , vol.12 , pp. 219-245
    • Doya, K.1
  • 22
    • 34547679813
    • Reinforcement learning: computational theory and biological mechanisms
    • Doya K. Reinforcement learning: computational theory and biological mechanisms. HFSP Journal 2007, 1:30-40.
    • (2007) HFSP Journal , vol.1 , pp. 30-40
    • Doya, K.1
  • 24
    • 84872773796
    • Instrumental conditioning driven by neutral stimuli: a model tested with simulated robotic rat
    • In Schlesinger, M., Balkenius, L.B.C. (Eds.) Proceedings of the eighth international conference on epigenetic robotics
    • Fiore, V., Mannella, F., Mirolli, M., Gurney, K., & Baldassarre, G. (2008). Instrumental conditioning driven by neutral stimuli: a model tested with simulated robotic rat. In Schlesinger, M., Balkenius, L.B.C. (Eds.) Proceedings of the eighth international conference on epigenetic robotics (pp. 13-20).
    • (2008) , pp. 13-20
    • Fiore, V.1    Mannella, F.2    Mirolli, M.3    Gurney, K.4    Baldassarre, G.5
  • 25
    • 48149101941
    • The temporal precision of reward prediction in dopamine neurons
    • Fiorillo C.D., Newsome W.T., Schultz W. The temporal precision of reward prediction in dopamine neurons. Nature Neuroscience 2008, 11:966-973.
    • (2008) Nature Neuroscience , vol.11 , pp. 966-973
    • Fiorillo, C.D.1    Newsome, W.T.2    Schultz, W.3
  • 26
    • 12544251990
    • Dynamic dopamine modulation in the basal ganglia: a neurocomputational account of cognitive deficits in medicated and nonmedicated parkinsonism
    • Frank M.J. Dynamic dopamine modulation in the basal ganglia: a neurocomputational account of cognitive deficits in medicated and nonmedicated parkinsonism. Journal of Cognitive Neuroscience 2005, 17:51-72.
    • (2005) Journal of Cognitive Neuroscience , vol.17 , pp. 51-72
    • Frank, M.J.1
  • 27
    • 80053152388
    • Understanding dopamine and reinforcement learning: the dopamine reward prediction error hypothesis
    • Glimcher P. Understanding dopamine and reinforcement learning: the dopamine reward prediction error hypothesis. Proceedings of the National Academy of Sciences of the United States of America 2011, 108(Suppl 3):15647-15654.
    • (2011) Proceedings of the National Academy of Sciences of the United States of America , vol.108 , Issue.SUPPL. 3 , pp. 15647-15654
    • Glimcher, P.1
  • 28
    • 0018150198
    • Response-contingent sensory change in a causally structured environment
    • Glow P., Winefield A. Response-contingent sensory change in a causally structured environment. Animal Learning & Behavior 1978, 6:1-18. 10.3758/BF03211996.
    • (1978) Animal Learning & Behavior , vol.6 , pp. 1-18
    • Glow, P.1    Winefield, A.2
  • 29
    • 34247517910
    • Regulation of firing of dopaminergic neurons and control of goal-directed behaviors
    • Grace A., Floresco S., Goto Y., Lodge D. Regulation of firing of dopaminergic neurons and control of goal-directed behaviors. Trends in Neurosciences 2007, 30:220-227.
    • (2007) Trends in Neurosciences , vol.30 , pp. 220-227
    • Grace, A.1    Floresco, S.2    Goto, Y.3    Lodge, D.4
  • 30
    • 61349116180
    • The role of the basal ganglia in learning and memory: neuropsychological studies
    • Grahn J.A., Parkinson J.A., Owen A.M. The role of the basal ganglia in learning and memory: neuropsychological studies. Behavioural Brain Research 2009, 199:53-60.
    • (2009) Behavioural Brain Research , vol.199 , pp. 53-60
    • Grahn, J.A.1    Parkinson, J.A.2    Owen, A.M.3
  • 31
    • 28044450643
    • The basal ganglia: learning new tricks and loving it
    • Graybiel A.M. The basal ganglia: learning new tricks and loving it. Current Opinion in Neurobiology 2005, 15:638-644.
    • (2005) Current Opinion in Neurobiology , vol.15 , pp. 638-644
    • Graybiel, A.M.1
  • 32
    • 48249091260
    • Habits, rituals, and the evaluative brain
    • Graybiel A. Habits, rituals, and the evaluative brain. Annual Review of Neuroscience 2008, 31:359-387.
    • (2008) Annual Review of Neuroscience , vol.31 , pp. 359-387
    • Graybiel, A.1
  • 33
    • 0009762657
    • Learning and satiation of response in intrinsically motivated complex puzzle performance by monkeys
    • Harlow H.F. Learning and satiation of response in intrinsically motivated complex puzzle performance by monkeys. Journal of Comparative and Physiological Psychology 1950, 43:289-294.
    • (1950) Journal of Comparative and Physiological Psychology , vol.43 , pp. 289-294
    • Harlow, H.F.1
  • 34
    • 33644688754
    • Dopamine neurons report an error in the temporal prediction of reward during learning
    • Hollerman J.R., Schultz W. Dopamine neurons report an error in the temporal prediction of reward during learning. Nature Neuroscience 1998, 1:304-309.
    • (1998) Nature Neuroscience , vol.1 , pp. 304-309
    • Hollerman, J.R.1    Schultz, W.2
  • 35
    • 0034061668
    • Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events
    • Horvitz J.C. Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events. Neuroscience 2000, 96:651-656.
    • (2000) Neuroscience , vol.96 , pp. 651-656
    • Horvitz, J.C.1
  • 36
    • 0030757872
    • Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat
    • Horvitz J.C., Stewart T., Jacobs B.L. Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat. Brain Research 1997, 759:251-258.
    • (1997) Brain Research , vol.759 , pp. 251-258
    • Horvitz, J.C.1    Stewart, T.2    Jacobs, B.L.3
  • 37
    • 0002861883
    • A model of how the basal ganglia generate and use neural signals that predict reinforcement
    • MIT Press, Cambridge, MA, J. Houk, J. Davis, D. Beiser (Eds.)
    • Houk J., Adams J., Barto A. A model of how the basal ganglia generate and use neural signals that predict reinforcement. Models of information processing in the basal ganglia 1995, 249-270. MIT Press, Cambridge, MA. J. Houk, J. Davis, D. Beiser (Eds.).
    • (1995) Models of information processing in the basal ganglia , pp. 249-270
    • Houk, J.1    Adams, J.2    Barto, A.3
  • 38
    • 23144448134
    • Novelty and reinforcement learning in the value system of developmental robots
    • Lund University Cognitive Studies, Lund, C. Prince, Y. Demiris, Y. Marom, H. Kozima, C. Balkenius (Eds.)
    • Huang X., Weng J. Novelty and reinforcement learning in the value system of developmental robots. Proceedings of the second international workshop epigenetic robotics: modeling cognitive development in robotic systems 2002, 47-55. Lund University Cognitive Studies, Lund. C. Prince, Y. Demiris, Y. Marom, H. Kozima, C. Balkenius (Eds.).
    • (2002) Proceedings of the second international workshop epigenetic robotics: modeling cognitive development in robotic systems , pp. 47-55
    • Huang, X.1    Weng, J.2
  • 40
    • 0036592026
    • Actor-critic models of the basal ganglia: new anatomical and computational perspectives
    • Joel D., Niv Y., Ruppin E. Actor-critic models of the basal ganglia: new anatomical and computational perspectives. Neural Networks 2002, 15:535-547.
    • (2002) Neural Networks , vol.15 , pp. 535-547
    • Joel, D.1    Niv, Y.2    Ruppin, E.3
  • 41
    • 0036592029
    • Dopamine: generalization and bonuses
    • Kakade S., Dayan P. Dopamine: generalization and bonuses. Neural Networks 2002, 15:549-559.
    • (2002) Neural Networks , vol.15 , pp. 549-559
    • Kakade, S.1    Dayan, P.2
  • 42
    • 1842793184
    • Motivational principles for visual know-how development
    • Lund University Cognitive Studies, Lund, C. Prince, L. Berthouze, H. Kozima, D. Bullock, G. Stojanov, C. Balkenius (Eds.)
    • Kaplan F., Oudeyer P. Motivational principles for visual know-how development. Proceedings of the third international workshop on epigenetic robotics 2003, 73-80. Lund University Cognitive Studies, Lund. C. Prince, L. Berthouze, H. Kozima, D. Bullock, G. Stojanov, C. Balkenius (Eds.).
    • (2003) Proceedings of the third international workshop on epigenetic robotics , pp. 73-80
    • Kaplan, F.1    Oudeyer, P.2
  • 43
    • 21344442393
    • Actor-critic models of reinforcement learning in the basal ganglia: from natural to artificial rats
    • Khamassi M., Lacheze L., Girard B., Berthoz A., Guillot A. Actor-critic models of reinforcement learning in the basal ganglia: from natural to artificial rats. Adaptive Behavior 2005, 13:131-148.
    • (2005) Adaptive Behavior , vol.13 , pp. 131-148
    • Khamassi, M.1    Lacheze, L.2    Girard, B.3    Berthoz, A.4    Guillot, A.5
  • 44
    • 0342520116
    • Learning when the onset of illumination is used as reinforcing stimulus
    • Kish G.B. Learning when the onset of illumination is used as reinforcing stimulus. Journal of Comparative and Physiological Psychology 1955, 48:261-264.
    • (1955) Journal of Comparative and Physiological Psychology , vol.48 , pp. 261-264
    • Kish, G.B.1
  • 46
    • 0026320854
    • Responses of monkey midbrain dopamine neurons during delayed alternation performance
    • Ljungberg T., Apicella P., Schultz W. Responses of monkey midbrain dopamine neurons during delayed alternation performance. Brain Research 1991, 567:337-341.
    • (1991) Brain Research , vol.567 , pp. 337-341
    • Ljungberg, T.1    Apicella, P.2    Schultz, W.3
  • 47
    • 0026505520
    • Responses of monkey dopamine neurons during learning of behavioral reactions
    • Ljungberg T., Apicella P., Schultz W. Responses of monkey dopamine neurons during learning of behavioral reactions. Journal of Neurophysiology 1992, 67:145-163.
    • (1992) Journal of Neurophysiology , vol.67 , pp. 145-163
    • Ljungberg, T.1    Apicella, P.2    Schultz, W.3
  • 49
    • 84906736869
    • Functions and mechanisms of intrinsic motivations: the knowledge vs. competence distinction
    • Springer-Verlag, Berlin, G. Baldassarre, M. Mirolli (Eds.)
    • Mirolli M., Baldassarre G. Functions and mechanisms of intrinsic motivations: the knowledge vs. competence distinction. Intrinsically motivated learning in natural and artificial systems 2013, Springer-Verlag, Berlin. G. Baldassarre, M. Mirolli (Eds.).
    • (2013) Intrinsically motivated learning in natural and artificial systems
    • Mirolli, M.1    Baldassarre, G.2
  • 50
    • 7244240565
    • Computational roles for dopamine in behavioural control
    • Montague P.R., Hyman S.E., Cohen J.D. Computational roles for dopamine in behavioural control. Nature 2004, 431:760-767.
    • (2004) Nature , vol.431 , pp. 760-767
    • Montague, P.R.1    Hyman, S.E.2    Cohen, J.D.3
  • 51
    • 0009763199
    • The role of the exploratory drive in learning
    • Montgomery K. The role of the exploratory drive in learning. Journal of Comparative Psychology 1954, 47:60-64.
    • (1954) Journal of Comparative Psychology , vol.47 , pp. 60-64
    • Montgomery, K.1
  • 52
    • 3242673464
    • Coincident but distinct messages of midbrain dopamine and striatal tonically active neurons
    • Morris G., Arkadir D., Nevet A., Vaadia E., Bergman H. Coincident but distinct messages of midbrain dopamine and striatal tonically active neurons. Neuron 2004, 43:133-143.
    • (2004) Neuron , vol.43 , pp. 133-143
    • Morris, G.1    Arkadir, D.2    Nevet, A.3    Vaadia, E.4    Bergman, H.5
  • 53
    • 0141596576
    • Policy invariance under reward transformations: theory and application to reward shaping
    • Morgan Kaufmann Publisher Inc., San Francisco, CA, USA
    • Ng A.Y., Harada D., Russell S. Policy invariance under reward transformations: theory and application to reward shaping. Proceedings of the 16th international conference of machine learning 1999, 278-287. Morgan Kaufmann Publisher Inc., San Francisco, CA, USA.
    • (1999) Proceedings of the 16th international conference of machine learning , pp. 278-287
    • Ng, A.Y.1    Harada, D.2    Russell, S.3
  • 56
    • 33751184634
    • The short-latency dopamine signal: a role in discovering novel actions?
    • Redgrave P., Gurney K. The short-latency dopamine signal: a role in discovering novel actions? Nature Reviews Neuroscience 2006, 7:967-975.
    • (2006) Nature Reviews Neuroscience , vol.7 , pp. 967-975
    • Redgrave, P.1    Gurney, K.2
  • 59
    • 0033119561
    • Is the short-latency dopamine response too short to signal reward error?
    • Redgrave P., Prescott T.J., Gurney K. Is the short-latency dopamine response too short to signal reward error? Trends in Neurosciences 1999, 22:146-151.
    • (1999) Trends in Neurosciences , vol.22 , pp. 146-151
    • Redgrave, P.1    Prescott, T.J.2    Gurney, K.3
  • 60
    • 81255157804
    • Functional properties of the basal ganglia's re-entrant loop architecture: selection and reinforcement
    • Redgrave P., Vautrelle N., Reynolds J.N.J. Functional properties of the basal ganglia's re-entrant loop architecture: selection and reinforcement. Neuroscience 2011.
    • (2011) Neuroscience
    • Redgrave, P.1    Vautrelle, N.2    Reynolds, J.N.J.3
  • 61
    • 0030023561
    • Intrinsic reinforcing properties of putatively neutral stimuli in an instrumental two-lever discrimination task
    • Reed P., Mitchell C., Nokes T. Intrinsic reinforcing properties of putatively neutral stimuli in an instrumental two-lever discrimination task. Animal Learning & Behavior 1996, 24:38-45.
    • (1996) Animal Learning & Behavior , vol.24 , pp. 38-45
    • Reed, P.1    Mitchell, C.2    Nokes, T.3
  • 62
    • 0035817882
    • A cellular mechanism of reward-related learning
    • Reynolds J.N., Hyland B.I., Wickens J.R. A cellular mechanism of reward-related learning. Nature 2001, 413:67-70.
    • (2001) Nature , vol.413 , pp. 67-70
    • Reynolds, J.N.1    Hyland, B.I.2    Wickens, J.R.3
  • 63
    • 0036592025
    • Dopamine-dependent plasticity of corticostriatal synapses
    • Reynolds J.N.J., Wickens J.R. Dopamine-dependent plasticity of corticostriatal synapses. Neural Networks 2002, 15:507-521.
    • (2002) Neural Networks , vol.15 , pp. 507-521
    • Reynolds, J.N.J.1    Wickens, J.R.2
  • 64
    • 0000106924
    • Functions of dopamine in the dorsal and ventral striatum
    • Robbins T., Everitt B. Functions of dopamine in the dorsal and ventral striatum. Seminars in Neuroscience 1992, 4:119-128.
    • (1992) Seminars in Neuroscience , vol.4 , pp. 119-128
    • Robbins, T.1    Everitt, B.2
  • 66
    • 33644772767
    • Local dopamine production in the dorsal striatum restores goal-directed behavior in dopamine-deficient mice
    • Robinson S., Sotak B., During M., Palmiter R. Local dopamine production in the dorsal striatum restores goal-directed behavior in dopamine-deficient mice. Behavioral Neuroscience 2006, 120:196-200.
    • (2006) Behavioral Neuroscience , vol.120 , pp. 196-200
    • Robinson, S.1    Sotak, B.2    During, M.3    Palmiter, R.4
  • 67
    • 13444292185
    • Somatotopy in the basal ganglia: experimental and clinical evidence for segregated sensorimotor channels
    • Romanelli P., Esposito V., Schaal D.W., Heit G. Somatotopy in the basal ganglia: experimental and clinical evidence for segregated sensorimotor channels. Brain Research Reviews 2005, 48:112-128.
    • (2005) Brain Research Reviews , vol.48 , pp. 112-128
    • Romanelli, P.1    Esposito, V.2    Schaal, D.W.3    Heit, G.4
  • 68
    • 0025247726
    • Dopamine neurons of the monkey midbrain: contingencies of responses to active touch during self-initiated arm movements
    • Romo R., Schultz W. Dopamine neurons of the monkey midbrain: contingencies of responses to active touch during self-initiated arm movements. Journal of Neurophysiology 1990, 63:592-606.
    • (1990) Journal of Neurophysiology , vol.63 , pp. 592-606
    • Romo, R.1    Schultz, W.2
  • 69
    • 0016262607
    • Some effects of short-term immediate prior exposure to light change on responding for light change
    • Russell A., Glow P. Some effects of short-term immediate prior exposure to light change on responding for light change. Animal Learning & Behavior 1974, 2:262-266. 10.3758/BF03199191.
    • (1974) Animal Learning & Behavior , vol.2 , pp. 262-266
    • Russell, A.1    Glow, P.2
  • 70
    • 0002209063
    • Intrinsic and extrinsic motivations: classic definitions and new directions
    • Ryan, Deci Intrinsic and extrinsic motivations: classic definitions and new directions. Contemporary Educational Psychology 2000, 25:54-67.
    • (2000) Contemporary Educational Psychology , vol.25 , pp. 54-67
    • Ryan1    Deci2
  • 71
    • 27944492732
    • Beetles, boxes and brain cells: neural mechanisms underlying valuation and learning
    • Salzman C.D., Belova M.A., Paton J.J. Beetles, boxes and brain cells: neural mechanisms underlying valuation and learning. Current Opinion in Neurobiology 2005, 15:721-729.
    • (2005) Current Opinion in Neurobiology , vol.15 , pp. 721-729
    • Salzman, C.D.1    Belova, M.A.2    Paton, J.J.3
  • 72
    • 84872845059
    • Intrinsic motivation mechanisms for competence acquisition
    • In Proceedings of ICDL-Epirob 2012, San Diego.
    • Santucci, V., Baldassarre, G., & Mirolli, M. (2012). Intrinsic motivation mechanisms for competence acquisition. In Proceedings of ICDL-Epirob 2012, San Diego.
    • (2012)
    • Santucci, V.1    Baldassarre, G.2    Mirolli, M.3
  • 73
    • 38049093767
    • Evolution and learning in an intrinsically motivated reinforcement learning robot
    • Springer, Berlin, F.A. y Costa, L. Rocha, E. Costa, I. Harvey, A. Coutinho (Eds.)
    • Schembri M., Mirolli M., Baldassarre G. Evolution and learning in an intrinsically motivated reinforcement learning robot. Advances in artificial life 2007, 294-333. Springer, Berlin. F.A. y Costa, L. Rocha, E. Costa, I. Harvey, A. Coutinho (Eds.).
    • (2007) Advances in artificial life , pp. 294-333
    • Schembri, M.1    Mirolli, M.2    Baldassarre, G.3
  • 74
    • 79958838807
    • Evolving childhood's length and learning parameters in an intrinsically motivated reinforcement learning robot
    • Lund University Cognitive Studies, Lund, L. Berthouze, G. Dhristiopher, M. Littman, H. Kozima, C. Balkenius (Eds.)
    • Schembri M., Mirolli M., Baldassarre G. Evolving childhood's length and learning parameters in an intrinsically motivated reinforcement learning robot. Proceedings of the seventh international conference on epigenetic robotics 2007, 141-148. Lund University Cognitive Studies, Lund. L. Berthouze, G. Dhristiopher, M. Littman, H. Kozima, C. Balkenius (Eds.).
    • (2007) Proceedings of the seventh international conference on epigenetic robotics , pp. 141-148
    • Schembri, M.1    Mirolli, M.2    Baldassarre, G.3
  • 75
    • 84872764045
    • Evolving internal reinforcers for an intrinsically motivated reinforcement-learning robot
    • Imperial College, London, Y. Demiris, D. Mareschal, B. Scassellati, J. Weng (Eds.)
    • Schembri M., Mirolli M., Baldassarre G. Evolving internal reinforcers for an intrinsically motivated reinforcement-learning robot. Proceedings of the 6th international conference on development and learning 2007, E1-E6. Imperial College, London. Y. Demiris, D. Mareschal, B. Scassellati, J. Weng (Eds.).
    • (2007) Proceedings of the 6th international conference on development and learning
    • Schembri, M.1    Mirolli, M.2    Baldassarre, G.3
  • 76
    • 2442467081
    • A possibility for implementing curiosity and boredom in model-building neural controllers
    • MIT Press/Bradford Books, Cambridge, Massachusetts/London, England, J. Meyer, S. Wilson (Eds.)
    • Schmidhuber J. A possibility for implementing curiosity and boredom in model-building neural controllers. Proceedings of the international conference on simulation of adaptive behavior: from animals to animats 1991, 222-227. MIT Press/Bradford Books, Cambridge, Massachusetts/London, England. J. Meyer, S. Wilson (Eds.).
    • (1991) Proceedings of the international conference on simulation of adaptive behavior: from animals to animats , pp. 222-227
    • Schmidhuber, J.1
  • 78
    • 0031867046
    • Predictive reward signal of dopamine neurons
    • Schultz W. Predictive reward signal of dopamine neurons. Journal of Neurophysiology 1998, 80:1-27.
    • (1998) Journal of Neurophysiology , vol.80 , pp. 1-27
    • Schultz, W.1
  • 79
    • 0037057755
    • Getting formal with dopamine and reward
    • Schultz W. Getting formal with dopamine and reward. Neuron 2002, 36:241-263.
    • (2002) Neuron , vol.36 , pp. 241-263
    • Schultz, W.1
  • 80
    • 32444439058
    • Behavioral theories and the neurophysiology of reward
    • Schultz W. Behavioral theories and the neurophysiology of reward. Annual Review of Psychology 2006, 57:87-115.
    • (2006) Annual Review of Psychology , vol.57 , pp. 87-115
    • Schultz, W.1
  • 81
    • 34547659151
    • Multiple dopamine functions at different time scales
    • Schultz W. Multiple dopamine functions at different time scales. Annual Review of Neuroscience 2007, 30:259-288.
    • (2007) Annual Review of Neuroscience , vol.30 , pp. 259-288
    • Schultz, W.1
  • 82
    • 0027468102
    • Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task
    • Schultz W., Apicella P., Ljungberg T. Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task. Journal of Neuroscience 1993, 13:900-913.
    • (1993) Journal of Neuroscience , vol.13 , pp. 900-913
    • Schultz, W.1    Apicella, P.2    Ljungberg, T.3
  • 83
    • 0030896968
    • A neural substrate of prediction and reward
    • Schultz W., Dayan P., Montague P.R. A neural substrate of prediction and reward. Science 1997, 275:1593-1599.
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 85
    • 0020558952
    • Response of dopaminergic neurons in cat to auditory stimuli presented across the sleep-waking cycle
    • Steinfels G.F., Heym J., Strecker R.E., Jacobs B.L. Response of dopaminergic neurons in cat to auditory stimuli presented across the sleep-waking cycle. Brain Research 1983, 277:150-154.
    • (1983) Brain Research , vol.277 , pp. 150-154
    • Steinfels, G.F.1    Heym, J.2    Strecker, R.E.3    Jacobs, B.L.4
  • 87
    • 0022359445
    • Substantia nigra dopaminergic unit activity in behaving cats: effect of arousal on spontaneous discharge and sensory evoked activity
    • Strecker R.E., Jacobs B.L. Substantia nigra dopaminergic unit activity in behaving cats: effect of arousal on spontaneous discharge and sensory evoked activity. Brain Research 1985, 361:339-350.
    • (1985) Brain Research , vol.361 , pp. 339-350
    • Strecker, R.E.1    Jacobs, B.L.2
  • 88
    • 17844396920
    • Choosing the greater of two goods: neural currencies for valuation and decision making
    • Sugrue L.P., Corrado G.S., Newsome W.T. Choosing the greater of two goods: neural currencies for valuation and decision making. Nature Reviews Neuroscience 2005, 6:363-375.
    • (2005) Nature Reviews Neuroscience , vol.6 , pp. 363-375
    • Sugrue, L.P.1    Corrado, G.S.2    Newsome, W.T.3
  • 89
    • 0036592034
    • TD models of reward predictive responses in dopamine neurons
    • Suri R.E. TD models of reward predictive responses in dopamine neurons. Neural Networks 2002, 15:523-533.
    • (2002) Neural Networks , vol.15 , pp. 523-533
    • Suri, R.E.1
  • 90
    • 33847202724
    • Learning to predict by the methods of temporal differences
    • Sutton R. Learning to predict by the methods of temporal differences. Machine Learning 1988, 3:9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.1
  • 91
    • 85132026293
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • Morgan Kaufmann
    • Sutton R. Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. Proceedings of the seventh international conference on machine learning 1990, 216-224. Morgan Kaufmann.
    • (1990) Proceedings of the seventh international conference on machine learning , pp. 216-224
    • Sutton, R.1
  • 93
    • 0033170372
    • Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning
    • Sutton R., Precup D., Singh S. Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning. Artificial Intelligence 1999, 112:181-211.
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.1    Precup, D.2    Singh, S.3
  • 95
    • 14844349975
    • Adaptive coding of reward value by dopamine neurons
    • Tobler P.N., Fiorillo C.D., Schultz W. Adaptive coding of reward value by dopamine neurons. Science 2005, 307:1642-1645.
    • (2005) Science , vol.307 , pp. 1642-1645
    • Tobler, P.N.1    Fiorillo, C.D.2    Schultz, W.3
  • 96
    • 56949096913
    • Finding intrinsic rewards by embodied evolution and constrained reinforcement learning
    • Uchibe E., Doya K. Finding intrinsic rewards by embodied evolution and constrained reinforcement learning. Neural Networks 2008, 21:1447-1455.
    • (2008) Neural Networks , vol.21 , pp. 1447-1455
    • Uchibe, E.1    Doya, K.2
  • 97
    • 8444231365
    • Dopamine: the salient issue
    • Ungless M. Dopamine: the salient issue. Trends in Neurosciences 2004, 27:702-706.
    • (2004) Trends in Neurosciences , vol.27 , pp. 702-706
    • Ungless, M.1
  • 98
    • 0035811464
    • Dopamine responses comply with basic assumptions of formal learning theory
    • Waelti P., Dickinson A., Schultz W. Dopamine responses comply with basic assumptions of formal learning theory. Nature 2001, 412:43-48.
    • (2001) Nature , vol.412 , pp. 43-48
    • Waelti, P.1    Dickinson, A.2    Schultz, W.3
  • 99
    • 33749411161
    • Motivation reconsidered: the concept of competence
    • White R. Motivation reconsidered: the concept of competence. Psychological Review 1959, 66:297-333.
    • (1959) Psychological Review , vol.66 , pp. 297-333
    • White, R.1
  • 100
    • 61349087915
    • Synaptic plasticity in the basal ganglia
    • Wickens J.R. Synaptic plasticity in the basal ganglia. Behavioural Brain Research 2009, 199:119-128.
    • (2009) Behavioural Brain Research , vol.199 , pp. 119-128
    • Wickens, J.R.1
  • 101
    • 58149415695
    • Response contingent illumination change as a reinforcer in the rat
    • Williams D., Lowe G. Response contingent illumination change as a reinforcer in the rat. Animal Behaviour 1972, 20:259-262.
    • (1972) Animal Behaviour , vol.20 , pp. 259-262
    • Williams, D.1    Lowe, G.2
  • 102
    • 2642519680
    • Dopamine, learning and motivation
    • Wise R. Dopamine, learning and motivation. Nature Reviews Neuroscience 2004, 5:483-494.
    • (2004) Nature Reviews Neuroscience , vol.5 , pp. 483-494
    • Wise, R.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.