메뉴 건너뛰기




Volumn 140, Issue 2, 2014, Pages 466-486

Navigating complex decision spaces: Problems and paradigms in sequential choice

Author keywords

Reinforcement learning; Sequential choice; Temporal credit assignment

Indexed keywords

DOPAMINE;

EID: 84894423717     PISSN: 00332909     EISSN: None     Source Type: Journal    
DOI: 10.1037/a0033455     Document Type: Article
Times cited : (28)

References (210)
  • 1
    • 33646833114 scopus 로고    scopus 로고
    • Prediction error as a linear function of reward probability is coded in human nucleus accumbens
    • Abler, B., Walter, H., Erk, S., Kammerer, H., & Spitzer, M. (2006). Prediction error as a linear function of reward probability is coded in human nucleus accumbens. NeuroImage, 31, 790-795
    • (2006) NeuroImage , vol.31 , pp. 790-795
    • Abler, B.1    Walter, H.2    Erk, S.3    Kammerer, H.4    Spitzer, M.5
  • 4
    • 23944453331 scopus 로고    scopus 로고
    • Tracing problem solving in real time: fMRI analysis of the subject-paced tower of Hanoi
    • Anderson, J. R., Albert, M. V., & Fincham, J. M. (2005). Tracing problem solving in real time: fMRI analysis of the subject-paced tower of Hanoi. Journal of Cognitive Neuroscience, 17, 1261-1274
    • (2005) Journal of Cognitive Neuroscience , vol.17 , pp. 1261-1274
    • Anderson, J.R.1    Albert, M.V.2    Fincham, J.M.3
  • 5
    • 0033929121 scopus 로고    scopus 로고
    • Task-specific neural activity in the primate prefrontal cortex
    • Asaad, W. F., Rainer, G., & Miller, E. K. (2000). Task-specific neural activity in the primate prefrontal cortex. Journal of Neurophysiology, 84, 451-459
    • (2000) Journal of Neurophysiology , vol.84 , pp. 451-459
    • Asaad, W.F.1    Rainer, G.2    Miller, E.K.3
  • 6
    • 67651120119 scopus 로고    scopus 로고
    • Which way do I go? Neural activation in response to feedback and spatial processing in a virtual T-Maze
    • Baker, T. E., & Holroyd, C. B. (2009). Which way do I go? Neural activation in response to feedback and spatial processing in a virtual T-Maze. Cerebral Cortex, 19, 1708-1722
    • (2009) Cerebral Cortex , vol.19 , pp. 1708-1722
    • Baker, T.E.1    Holroyd, C.B.2
  • 7
    • 28444472936 scopus 로고    scopus 로고
    • Neural bases of food-seeking: Affect, arousal and reward in corticostriatolimbic circuits
    • Balleine, B. W. (2005). Neural bases of food-seeking: Affect, arousal and reward in corticostriatolimbic circuits. Physiology & Behavior, 86, 717- 730
    • (2005) Physiology & Behavior , vol.86 , pp. 717-730
    • Balleine, B.W.1
  • 9
    • 72049125602 scopus 로고    scopus 로고
    • Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action
    • Balleine, B. W., & O'doherty, J. P. (2010). Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology, 35, 48-69
    • (2010) Neuropsychopharmacology , vol.35 , pp. 48-69
    • Balleine, B.W.1    O'doherty, J.P.2
  • 12
    • 33847634405 scopus 로고    scopus 로고
    • The debate over dopamine's role in reward: The case for incentive salience
    • Berridge, K. C. (2007). The debate over dopamine's role in reward: The case for incentive salience. Psychopharmacology, 191, 391-431
    • (2007) Psychopharmacology , vol.191 , pp. 391-431
    • Berridge, K.C.1
  • 13
    • 0000040523 scopus 로고
    • The effect of the introduction of reward upon the maze performance of rats
    • Blodgett, H. C. (1929). The effect of the introduction of reward upon the maze performance of rats. University of California Publications in Psychology, 4, 113-134.
    • (1929) University of California Publications in Psychology , vol.4 , pp. 113-134
    • Blodgett, H.C.1
  • 14
    • 34248999741 scopus 로고    scopus 로고
    • Short-term memory traces for action bias in human reinforcement learning
    • Bogacz, R., Mcclure, S. M., Li, J., Cohen, J. D., & Montague, P. R. (2007). Short-term memory traces for action bias in human reinforcement learning. Brain Research, 1153, 111-121
    • (2007) Brain Research , vol.1153 , pp. 111-121
    • Bogacz, R.1    Mcclure, S.M.2    Li, J.3    Cohen, J.D.4    Montague, P.R.5
  • 15
    • 37549047252 scopus 로고    scopus 로고
    • Conflict monitoring and decision making: Reconciling two perspectives on anterior cingulate function
    • Botvinick, M. (2007). Conflict monitoring and decision making: Reconciling two perspectives on anterior cingulate function. Cognitive, Affective & Behavioral Neuroscience, 7, 356-366
    • (2007) Cognitive, Affective & Behavioral Neuroscience , vol.7 , pp. 356-366
    • Botvinick, M.1
  • 17
    • 70350566799 scopus 로고    scopus 로고
    • Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective
    • Botvinick, M. M., Niv, Y., & Barto, A. C. (2009). Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective. Cognition, 113, 262-280
    • (2009) Cognition , vol.113 , pp. 262-280
    • Botvinick, M.M.1    Niv, Y.2    Barto, A.C.3
  • 18
    • 77952979246 scopus 로고    scopus 로고
    • Human medial orbitofrontal cortex is recruited during experience of imagined and real rewards
    • Bray, S., Shimojo, S., & O'Doherty, J. P. (2010). Human medial orbitofrontal cortex is recruited during experience of imagined and real rewards. Journal of Neurophysiology, 103, 2506-2512
    • (2010) Journal of Neurophysiology , vol.103 , pp. 2506-2512
    • Bray, S.1    Shimojo, S.2    O'doherty, J.P.3
  • 20
    • 13844309349 scopus 로고    scopus 로고
    • Learned predictions of error likelihood in the anterior cingulate cortex
    • Brown, J. W., & Braver, T. S. (2005). Learned predictions of error likelihood in the anterior cingulate cortex. Science, 307, 1118-1121
    • (2005) Science , vol.307 , pp. 1118-1121
    • Brown, J.W.1    Braver, T.S.2
  • 21
    • 58149439823 scopus 로고
    • Differential errors in animal mazes
    • Buel, J. (1935). Differential errors in animal mazes. Psychological Bulletin, 32, 67-99
    • (1935) Psychological Bulletin , vol.32 , pp. 67-99
    • Buel, J.1
  • 22
    • 14844315691 scopus 로고    scopus 로고
    • How we use rules to select actions: A review of evidence from cognitive neuroscience
    • Bunge, S. A. (2004). How we use rules to select actions: A review of evidence from cognitive neuroscience. Cognitive, Affective & Behavioral Neuroscience, 4, 564-579
    • (2004) Cognitive, Affective & Behavioral Neuroscience , vol.4 , pp. 564-579
    • Bunge, S.A.1
  • 23
    • 0030023405 scopus 로고    scopus 로고
    • Conservation of hippocampal memory function in rats and humans
    • Bunsey, M., & Eichenbaum, H. (1996). Conservation of hippocampal memory function in rats and humans. Nature, 379, 255-257
    • (1996) Nature , vol.379 , pp. 255-257
    • Bunsey, M.1    Eichenbaum, H.2
  • 24
    • 67349227786 scopus 로고    scopus 로고
    • Theoretical tools for understanding and aiding dynamic decision making
    • Busemeyer, J. R., & Pleskac, T. J. (2009). Theoretical tools for understanding and aiding dynamic decision making. Journal of Mathematical Psychology, 53, 126-138
    • (2009) Journal of Mathematical Psychology , vol.53 , pp. 126-138
    • Busemeyer, J.R.1    Pleskac, T.J.2
  • 26
    • 0034801578 scopus 로고    scopus 로고
    • The role of ventral and orbital prefrontal cortex in conditional visuomotor learning and strategy use in Rhesus monkeys (Macaca mulatta)
    • Bussey, T. J., Wise, S. P., & Murray, E. A. (2001). The role of ventral and orbital prefrontal cortex in conditional visuomotor learning and strategy use in Rhesus monkeys (Macaca mulatta). Behavioral Neuroscience, 115, 971-982
    • (2001) Behavioral Neuroscience , vol.115 , pp. 971-982
    • Bussey, T.J.1    Wise, S.P.2    Murray, E.A.3
  • 27
    • 33644480565 scopus 로고
    • The effect of different amounts of alternating partial reinforcement on resistance to extinction
    • Capaldi, E. J. (1957). The effect of different amounts of alternating partial reinforcement on resistance to extinction. The American Journal of Psychology, 70, 451-452
    • (1957) The American Journal of Psychology , vol.70 , pp. 451-452
    • Capaldi, E.J.1
  • 29
    • 0032076255 scopus 로고    scopus 로고
    • Anterior cingulate cortex, error detection, and the online monitoring of performance
    • Carter, C. S., Braver, T. S., Barch, D. M., Botvinick, M. M., Noll, D., & Cohen, J. D. (1998). Anterior cingulate cortex, error detection, and the online monitoring of performance. Science, 280, 747-749
    • (1998) Science , vol.280 , pp. 747-749
    • Carter, C.S.1    Braver, T.S.2    Barch, D.M.3    Botvinick, M.M.4    Noll, D.5    Cohen, J.D.6
  • 30
    • 33846225079 scopus 로고    scopus 로고
    • Reinforcement learning signals predict future decisions
    • Cohen, M. X., & Ranganath, C. (2007). Reinforcement learning signals predict future decisions. The Journal of Neuroscience, 27, 371-378
    • (2007) The Journal of Neuroscience , vol.27 , pp. 371-378
    • Cohen, M.X.1    Ranganath, C.2
  • 31
    • 0003459801 scopus 로고
    • Memory, amnesia, and the hippocampal system
    • Cambridge, MA: MIT Press
    • Cohen, N. J., & Eichenbaum, H. (1993). Memory, amnesia, and the hippocampal system. Cambridge, MA: MIT Press.
    • (1993)
    • Cohen, N.J.1    Eichenbaum, H.2
  • 33
    • 79952746011 scopus 로고    scopus 로고
    • Model-based influences on humans' choices and striatal prediction errors
    • Daw, N. D., Gershman, S. J., Seymour, B., Dayan, P., & Dolan, R. J. (2011). Model-based influences on humans' choices and striatal prediction errors. Neuron, 69, 1204-1215
    • (2011) Neuron , vol.69 , pp. 1204-1215
    • Daw, N.D.1    Gershman, S.J.2    Seymour, B.3    Dayan, P.4    Dolan, R.J.5
  • 34
    • 28044450875 scopus 로고    scopus 로고
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw, N. D., Niv, Y., & Dayan, P. (2005). Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nature Neuroscience, 8, 1704-1711
    • (2005) Nature Neuroscience , vol.8 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 35
    • 33745223257 scopus 로고    scopus 로고
    • Cortical substrates for exploratory decisions in humans
    • Daw, N. D., O'doherty, J. P., Dayan, P., Seymour, B., & Dolan, R. J. (2006). Cortical substrates for exploratory decisions in humans. Nature, 441, 876-879
    • (2006) Nature , vol.441 , pp. 876-879
    • Daw, N.D.1    O'doherty, J.P.2    Dayan, P.3    Seymour, B.4    Dolan, R.J.5
  • 36
    • 84859315924 scopus 로고    scopus 로고
    • Instrumental vigour in punishment and reward
    • Dayan, P. (2012). Instrumental vigour in punishment and reward. European Journal of Neuroscience, 35, 1152-1168
    • (2012) European Journal of Neuroscience , vol.35 , pp. 1152-1168
    • Dayan, P.1
  • 38
    • 52049107354 scopus 로고    scopus 로고
    • Reinforcement learning: The good, the bad and the ugly
    • Dayan, P., & Niv, Y. (2008). Reinforcement learning: The good, the bad and the ugly. Current Opinion in Neurobiology, 18, 185-196
    • (2008) Current Opinion in Neurobiology , vol.18 , pp. 185-196
    • Dayan, P.1    Niv, Y.2
  • 39
    • 0004217226 scopus 로고
    • (2nd ed.) The Hague, the Netherlands: Mouton. (Original work published 1946)
    • de Groot, A. D. (1978). Thought and choice in chess (2nd ed.). The Hague, the Netherlands: Mouton. (Original work published 1946).
    • (1978) Thought and choice in chess
    • de Groot, A.D.1
  • 42
    • 0002278788 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • Dietterich, T. G. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13, 227-303.
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 44
    • 85006547536 scopus 로고    scopus 로고
    • A comparison of human and agent reinforcement learning
    • L. Carlson, C. Hoelscher, & T. F. Shipley (Eds.), Austin, TX: Cognitive Sciences Society
    • Doshi-Velez, F., & Ghahramani, Z. (2011). A comparison of human and agent reinforcement learning. In L. Carlson, C. Hoelscher, & T. F. Shipley (Eds.), Proceedings of the 33rd annual conference of the Cognitive Science Society (pp. 2703-2708). Austin, TX: Cognitive Sciences Society.
    • (2011) Proceedings of the 33rd annual conference of the Cognitive Science Society , pp. 2703-2708
    • Doshi-Velez, F.1    Ghahramani, Z.2
  • 45
    • 0033213819 scopus 로고    scopus 로고
    • What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? Neural Networks
    • Doya, K. (1999). What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? Neural Networks, 12, 961-974.
    • (1999) , vol.12 , pp. 961-974
    • Doya, K.1
  • 46
    • 0002337786 scopus 로고    scopus 로고
    • Metalearning, neuromodulation, and emotion
    • G. Hatano, N. Okada, & H. Tanabe (Eds.), Amsterdam, the Netherlands: Elsevier Science
    • Doya, K. (2000). Metalearning, neuromodulation, and emotion. In G. Hatano, N. Okada, & H. Tanabe (Eds.), Affective minds (pp. 101-104). Amsterdam, the Netherlands: Elsevier Science.
    • (2000) Affective minds (pp. 101-104)
    • Doya, K.1
  • 48
    • 37349015901 scopus 로고    scopus 로고
    • Error-related negativities elicited by monetary loss and cues that predict loss
    • Dunning, J. P., & Hajcak, G. (2007). Error-related negativities elicited by monetary loss and cues that predict loss. NeuroReport, 18, 1875-1878.
    • (2007) NeuroReport , vol.18 , pp. 1875-1878
    • Dunning, J.P.1    Hajcak, G.2
  • 49
    • 1542316149 scopus 로고    scopus 로고
    • Instrumental responding for rewards is associated with enhanced neuronal response in subcortical reward systems
    • Elliott, R., Newman, J. L., Longe, O. A., & Deakin, J. F. W. (2004). Instrumental responding for rewards is associated with enhanced neuronal response in subcortical reward systems. NeuroImage, 21, 984- 990.
    • (2004) NeuroImage , vol.21 , pp. 984-990
    • Elliott, R.1    Newman, J.L.2    Longe, O.A.3    Deakin, J.F.W.4
  • 51
    • 78649604962 scopus 로고    scopus 로고
    • Evidence for model-based action planning in a sequential finger movement task
    • Fermin, A., Yoshida, T., Ito, M., Yoshimoto, J., & Doya, K. (2010). Evidence for model-based action planning in a sequential finger movement task. Journal of Motor Behavior, 42, 371-379.
    • (2010) Journal of Motor Behavior , vol.42 , pp. 371-379
    • Fermin, A.1    Yoshida, T.2    Ito, M.3    Yoshimoto, J.4    Doya, K.5
  • 52
    • 0037459319 scopus 로고    scopus 로고
    • Discrete coding of reward probability and uncertainty by dopamine neurons
    • Fiorillo, C. D., Tobler, P. N., & Schultz, W. (2003). Discrete coding of reward probability and uncertainty by dopamine neurons. Science, 299, 1898-1902.
    • (2003) Science , vol.299 , pp. 1898-1902
    • Fiorillo, C.D.1    Tobler, P.N.2    Schultz, W.3
  • 53
    • 33645458694 scopus 로고    scopus 로고
    • Reverse replay of behavioural sequences in hippocampal place cells during the awake state
    • Foster, D. J., & Wilson, M. A. (2006). Reverse replay of behavioural sequences in hippocampal place cells during the awake state. Nature, 440, 680-683.
    • (2006) Nature , vol.440 , pp. 680-683
    • Foster, D.J.1    Wilson, M.A.2
  • 54
    • 33744550336 scopus 로고    scopus 로고
    • Anatomy of a decision: Striatoorbitofrontal interactions in reinforcement learning, decision making, and reversal
    • Frank, M. J., & Claus, E. D. (2006). Anatomy of a decision: Striatoorbitofrontal interactions in reinforcement learning, decision making, and reversal. Psychological Review, 113, 300-326.
    • (2006) Psychological Review , vol.113 , pp. 300-326
    • Frank, M.J.1    Claus, E.D.2
  • 56
    • 33745108748 scopus 로고    scopus 로고
    • From recurrent choice to skill learning: A reinforcement-learning model
    • Fu, W. T., & Anderson, J. R. (2006). From recurrent choice to skill learning: A reinforcement-learning model. Journal of Experimental Psychology: General, 135, 184-206.
    • (2006) Journal of Experimental Psychology: General , vol.135 , pp. 184-206
    • Fu, W.T.1    Anderson, J.R.2
  • 58
    • 40949160181 scopus 로고    scopus 로고
    • Solving the credit assignment problem: Explicit and implicit learning of action sequences with probabilistic outcomes
    • Fu, W. T., & Anderson, J. R. (2008b). Solving the credit assignment problem: Explicit and implicit learning of action sequences with probabilistic outcomes. Psychological Research, 72, 321-330.
    • (2008) Psychological Research , vol.72 , pp. 321-330
    • Fu, W.T.1    Anderson, J.R.2
  • 59
    • 10244241691 scopus 로고    scopus 로고
    • Resolving the paradox of the active user: Stable suboptimal performance in interactive tasks
    • Fu, W. T., & Gray, W. D. (2004). Resolving the paradox of the active user: Stable suboptimal performance in interactive tasks. Cognitive Science, [28,] 901-935.
    • (2004) Cognitive Science , vol.28 , pp. 901-935
    • Fu, W.T.1    Gray, W.D.2
  • 60
    • 0003966168 scopus 로고    scopus 로고
    • The prefrontal cortex: Anatomy, physiology, and neuropsychology of the frontal lobe
    • Philadelphia, PA: Lippincott- Raven
    • Fuster, J. M. (1997). The prefrontal cortex: Anatomy, physiology, and neuropsychology of the frontal lobe. Philadelphia, PA: Lippincott- Raven.
    • (1997)
    • Fuster, J.M.1
  • 64
    • 84870955069 scopus 로고    scopus 로고
    • Exploring a latent cause theory of classical conditioning
    • Gershman, S. J., & Niv, Y. (2012). Exploring a latent cause theory of classical conditioning. Learning & Behavior, 40, 255-268.
    • (2012) Learning & Behavior , vol.40 , pp. 255-268
    • Gershman, S.J.1    Niv, Y.2
  • 65
    • 77953260848 scopus 로고    scopus 로고
    • States versus rewards: Dissociable neural prediction error signals underlying modelbased and model-free reinforcement learning
    • Gläscher, J., Daw, N., Dayan, P., & O'doherty, J. P. (2010). States versus rewards: Dissociable neural prediction error signals underlying modelbased and model-free reinforcement learning. Neuron, 66, 585-595.
    • (2010) Neuron , vol.66 , pp. 585-595
    • Gläscher, J.1    Daw, N.2    Dayan, P.3    O'doherty, J.P.4
  • 66
    • 58449113882 scopus 로고    scopus 로고
    • Determining a role for ventromedial prefrontal cortex in encoding action-based value signals during reward-related decision making
    • Gläscher, J., Hampton, A. N., & O'doherty, J. P. (2008). Determining a role for ventromedial prefrontal cortex in encoding action-based value signals during reward-related decision making. Cerebral Cortex, 19, 483-495.
    • (2008) Cerebral Cortex , vol.19 , pp. 483-495
    • Gläscher, J.1    Hampton, A.N.2    O'doherty, J.P.3
  • 67
    • 33746347681 scopus 로고    scopus 로고
    • The soft constraints hypothesis: A rational analysis approach to resource allocation for interactive behavior
    • Gray, W. D., Sims, C. R., Fu, W. T., & Schoelles, M. J. (2006). The soft constraints hypothesis: A rational analysis approach to resource allocation for interactive behavior. Psychological Review, 113, 461-482.
    • (2006) Psychological Review , vol.113 , pp. 461-482
    • Gray, W.D.1    Sims, C.R.2    Fu, W.T.3    Schoelles, M.J.4
  • 68
    • 70350572378 scopus 로고    scopus 로고
    • Short-term gains, long-term pains: How cues about state aid learning in dynamic environments
    • Gureckis, T. M., & Love, B. C. (2009). Short-term gains, long-term pains: How cues about state aid learning in dynamic environments. Cognition, 113, 293-313.
    • (2009) Cognition , vol.113 , pp. 293-313
    • Gureckis, T.M.1    Love, B.C.2
  • 69
    • 0034654526 scopus 로고    scopus 로고
    • Striatonigrostriatal pathways in primates form an ascending spiral from the shell to the dorsolateral striatum
    • Haber, S. N., Fudge, J. L., & McFarland, N. R. (2000). Striatonigrostriatal pathways in primates form an ascending spiral from the shell to the dorsolateral striatum. Journal of Neuroscience, 20, 2369-2382.
    • (2000) Journal of Neuroscience , vol.20 , pp. 2369-2382
    • Haber, S.N.1    Fudge, J.L.2    McFarland, N.R.3
  • 70
    • 0025264150 scopus 로고
    • Topographic organization of the ventral striatal efferent projections in the rhesus monkey: An anterograde tracing study
    • Haber, S. N., Lynd-Balta, E., Klein, C., & Groenewegen, H. J. (1990). Topographic organization of the ventral striatal efferent projections in the rhesus monkey: An anterograde tracing study. Journal of Comparative Neurology, 293, 282-298.
    • (1990) Journal of Comparative Neurology , vol.293 , pp. 282-298
    • Haber, S.N.1    Lynd-Balta, E.2    Klein, C.3    Groenewegen, H.J.4
  • 71
    • 33748188120 scopus 로고    scopus 로고
    • The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans
    • Hampton, A. N., Bossaerts, P., & O'doherty, J. P. (2006). The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans. The Journal of Neuroscience, 26, 8360- 8367.
    • (2006) The Journal of Neuroscience , vol.26 , pp. 8360-8367
    • Hampton, A.N.1    Bossaerts, P.2    O'doherty, J.P.3
  • 75
    • 0036644494 scopus 로고    scopus 로고
    • Decision biases and persistent illicit drug use: An experimental study of distributed choice and addiction
    • Heyman, G. M., & Dunn, B. (2002). Decision biases and persistent illicit drug use: An experimental study of distributed choice and addiction. Drug and Alcohol Dependence, 67, 193-203.
    • (2002) Drug and Alcohol Dependence , vol.67 , pp. 193-203
    • Heyman, G.M.1    Dunn, B.2
  • 76
    • 0016564510 scopus 로고
    • The effect of two ways of devaluing the unconditioned stimulus after first- and second-order appetitive conditioning
    • Holland, P. C., & Rescorla, R. (1975). The effect of two ways of devaluing the unconditioned stimulus after first- and second-order appetitive conditioning. Journal of Experimental Psychology: Animal Behavior Processes, 1, 355-363.
    • (1975) Journal of Experimental Psychology: Animal Behavior Processes , vol.1 , pp. 355-363
    • Holland, P.C.1    Rescorla, R.2
  • 77
    • 85047670409 scopus 로고    scopus 로고
    • The neural basis of human error processing: Reinforcement learning, dopamine, and the error-relatednegativity
    • Holroyd, C. B., & Coles, M. G. H. (2002). The neural basis of human error processing: Reinforcement learning, dopamine, and the error-relatednegativity. Psychological Review, 109, 679-709.
    • (2002) Psychological Review , vol.109 , pp. 679-709
    • Holroyd, C.B.1    Coles, M.G.H.2
  • 78
    • 79952906182 scopus 로고    scopus 로고
    • Reward positivity elicited by predictive cues
    • Holroyd, C. B., Krigolson, O. E., & Lee, S. (2011). Reward positivity elicited by predictive cues. NeuroReport, 22, 249-252.
    • (2011) NeuroReport , vol.22 , pp. 249-252
    • Holroyd, C.B.1    Krigolson, O.E.2    Lee, S.3
  • 79
    • 0034077644 scopus 로고    scopus 로고
    • Neuronal activity in the primate prefrontal cortex in the process of motor selection based on two behavioral rules
    • Hoshi, E., Shima, K., & Tanji, J. (2000). Neuronal activity in the primate prefrontal cortex in the process of motor selection based on two behavioral rules. Journal of Neurophysiology, 83, 2355-2373.
    • (2000) Journal of Neurophysiology , vol.83 , pp. 2355-2373
    • Hoshi, E.1    Shima, K.2    Tanji, J.3
  • 80
    • 0003090736 scopus 로고
    • The goal gradient hypothesis and maze learning
    • Hull, C. L.. (1932) The goal gradient hypothesis and maze learning. Psychological Review, 39, 25-43.
    • (1932) Psychological Review , vol.39 , pp. 25-43
    • Hull, C.L.1
  • 82
    • 84859371025 scopus 로고    scopus 로고
    • Bonsai trees in your head: How the Pavlovian system sculpts goal-directed choices by pruning decision trees
    • Huys, Q. J. M., Eshel, N., O'Nions, E., Sheridan, L., Dayan, P., & Roiser, J. P.. (2012). Bonsai trees in your head: How the Pavlovian system sculpts goal-directed choices by pruning decision trees. PLOS Computational Biology, 8, e1002410.
    • (2012) PLOS Computational Biology , vol.8
    • Huys, Q.J.M.1    Eshel, N.2    O'Nions, E.3    Sheridan, L.4    Dayan, P.5    Roiser, J.P.6
  • 83
    • 0035489031 scopus 로고    scopus 로고
    • Addiction and the brain: The neurobiology of compulsion and its persistence
    • Hyman, S. E., & Malenka, R. C.. (2001). Addiction and the brain: The neurobiology of compulsion and its persistence. Nature Reviews Neuroscience, 2, 695-703.
    • (2001) Nature Reviews Neuroscience , vol.2 , pp. 695-703
    • Hyman, S.E.1    Malenka, R.C.2
  • 84
    • 67449138590 scopus 로고    scopus 로고
    • Brain mechanisms for predictive control by switching internal models: Implications for higher-order cognitive functions
    • Imamizu, H., & Kawato, M.. (2009). Brain mechanisms for predictive control by switching internal models: Implications for higher-order cognitive functions. Psychological Research, 73, 527-544.
    • (2009) Psychological Research , vol.73 , pp. 527-544
    • Imamizu, H.1    Kawato, M.2
  • 85
    • 41149173145 scopus 로고    scopus 로고
    • Control of mental activities by internal models in the cerebellum
    • Ito, M. (2008). Control of mental activities by internal models in the cerebellum. Nature Reviews Neuroscience, 9, 304-313.
    • (2008) Nature Reviews Neuroscience , vol.9 , pp. 304-313
    • Ito, M.1
  • 86
    • 4444358622 scopus 로고    scopus 로고
    • Bilateral orbital prefrontal cortex lesions in rhesus monkeys disrupt choices guided by both reward value and reward contingency
    • Izquierdo, A. D., Suda, R. K., & Murray, E. A.. (2004). Bilateral orbital prefrontal cortex lesions in rhesus monkeys disrupt choices guided by both reward value and reward contingency. The Journal of Neuroscience, 24, 7540-7548.
    • (2004) The Journal of Neuroscience , vol.24 , pp. 7540-7548
    • Izquierdo, A.D.1    Suda, R.K.2    Murray, E.A.3
  • 87
    • 0003891643 scopus 로고
    • New York, NY: Dover. (Original work published 1890)
    • James, W.. (1950). The principles of psychology. New York, NY: Dover. (Original work published 1890).
    • (1950) The principles of psychology
    • James, W.1
  • 88
    • 84857974114 scopus 로고    scopus 로고
    • When, what, and how much to reward in reinforcement learning-based models of cognition
    • Janssen, C. P., & Gray, W. D.. (2012). When, what, and how much to reward in reinforcement learning-based models of cognition. Cognitive Science, 36, 333-358.
    • (2012) Cognitive Science , vol.36 , pp. 333-358
    • Janssen, C.P.1    Gray, W.D.2
  • 89
    • 0036592026 scopus 로고    scopus 로고
    • Actor-critic models of the basal ganglia: New anatomical and computational perspectives
    • Joel, D., Niv, Y., & Ruppin, E.. (2002). Actor-critic models of the basal ganglia: New anatomical and computational perspectives. Neural Networks, 15, 535-547.
    • (2002) Neural Networks , vol.15 , pp. 535-547
    • Joel, D.1    Niv, Y.2    Ruppin, E.3
  • 90
    • 36048937548 scopus 로고    scopus 로고
    • Neural ensembles in CA3 transiently encode paths forward of the animal at a decision point
    • Johnson, A., & Redish, A. D.. (2007). Neural ensembles in CA3 transiently encode paths forward of the animal at a decision point. The Journal of Neuroscience, 27, 12176 -12189.
    • (2007) The Journal of Neuroscience , vol.27
    • Johnson, A.1    Redish, A.D.2
  • 91
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • Kaelbling, L. P., Littman, M. L., & Cassandra, A. R.. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101, 99-134.
    • (1998) Artificial Intelligence , vol.101 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 94
    • 79958143780 scopus 로고    scopus 로고
    • Speed/accuracy trade-off between the habitual and the goal-directed processes
    • Keramati, M., Dezfouli, A., & Piray, P.. (2011). Speed/accuracy trade-off between the habitual and the goal-directed processes. PLOS Computational Biology, 7, e1002055.
    • (2011) PLOS Computational Biology , vol.7
    • Keramati, M.1    Dezfouli, A.2    Piray, P.3
  • 96
    • 0037382264 scopus 로고    scopus 로고
    • Coordination of actions and habits in the medial prefrontal cortex of rats
    • Killcross, S., & Coutureau, E. (2003). Coordination of actions and habits in the medial prefrontal cortex of rats. Cerebral Cortex, 13, 400-408.
    • (2003) Cerebral Cortex , vol.13 , pp. 400-408
    • Killcross, S.1    Coutureau, E.2
  • 98
    • 50349093022 scopus 로고    scopus 로고
    • Influences of reward delays on responses of dopamine neurons
    • Kobayashi, S., & Schultz, W. (2008). Influences of reward delays on responses of dopamine neurons. The Journal of Neuroscience, 28, 7837- 7846.
    • (2008) The Journal of Neuroscience , vol.28 , pp. 7837-7846
    • Kobayashi, S.1    Schultz, W.2
  • 99
    • 80052700236 scopus 로고    scopus 로고
    • This ought to be good: Brain activity accompanying positive and negative expectations and outcomes
    • Liao, Y., Gramann, K., Feng, W., Deák, G. O., & Li, H. (2011). This ought to be good: Brain activity accompanying positive and negative expectations and outcomes. Psychophysiology, 48, 1412-1419.
    • (2011) Psychophysiology , vol.48 , pp. 1412-1419
    • Liao, Y.1    Gramann, K.2    Feng, W.3    Deák, G.O.4    Li, H.5
  • 100
    • 79951839136 scopus 로고    scopus 로고
    • Neural correlates of instrumental contingency learning: Differential effects of action-reward conjunction and disjunction
    • Liljeholm, M., Tricomi, E., O'doherty, J. P., & Balleine, B. W. (2011). Neural correlates of instrumental contingency learning: Differential effects of action-reward conjunction and disjunction. The Journal of Neuroscience, 31, 2474-2480.
    • (2011) The Journal of Neuroscience , vol.31 , pp. 2474-2480
    • Liljeholm, M.1    Tricomi, E.2    O'doherty, J.P.3    Balleine, B.W.4
  • 101
    • 0000123778 scopus 로고
    • Self-improving reactive agents based on reinforcement learning, planning and teaching
    • Lin, L. J. (1992). Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning, 8, 293-321.
    • (1992) Machine Learning , vol.8 , pp. 293-321
    • Lin, L.J.1
  • 102
    • 0012327484 scopus 로고    scopus 로고
    • Using eligibility traces to find the best memoryless policy in partially observable Markov decision processes
    • J. W. Shavlik (Ed.), San Francisco, CA: Morgan Kaufmann
    • Loch, J., & Singh, S. (1998). Using eligibility traces to find the best memoryless policy in partially observable Markov decision processes. In J. W. Shavlik (Ed.), Proceedings of the fifteenth international conference on machine learning (pp. 323-331). San Francisco, CA: Morgan Kaufmann.
    • (1998) Proceedings of the fifteenth international conference on machine learning , pp. 32333
    • Loch, J.1    Singh, S.2
  • 104
    • 0000603047 scopus 로고
    • The choice axiom after twenty years
    • Luce, R. D. (1977). The choice axiom after twenty years. Journal of Mathematical Psychology, 15, 215-233.
    • (1977) Journal of Mathematical Psychology , vol.15 , pp. 215-233
    • Luce, R.D.1
  • 105
    • 0030789031 scopus 로고    scopus 로고
    • Impulsive and self-control choices in opioid-dependent patients and non-drug-using control participants: Drug and monetary rewards
    • Madden, G. J., Petry, N. M., Badger, G. J., & Bickel, W. K. (1997). Impulsive and self-control choices in opioid-dependent patients and non-drug-using control participants: Drug and monetary rewards. Experimental and Clinical Psychopharmacology, 5, 256-262.
    • (1997) Experimental and Clinical Psychopharmacology , vol.5 , pp. 256-262
    • Madden, G.J.1    Petry, N.M.2    Badger, G.J.3    Bickel, W.K.4
  • 106
    • 77953156256 scopus 로고    scopus 로고
    • Fear conditioning and social groups: Statistics, not genetics
    • Maia, T. V. (2009). Fear conditioning and social groups: Statistics, not genetics. Cognitive Science, 33, 1232-1251.
    • (2009) Cognitive Science , vol.33 , pp. 1232-1251
    • Maia, T.V.1
  • 107
    • 77949897253 scopus 로고    scopus 로고
    • Two-factor theory, the actor-critic model, and conditioned avoidance
    • Maia, T. V. (2010). Two-factor theory, the actor-critic model, and conditioned avoidance. Learning & Behavior, 38, 50-67.
    • (2010) Learning & Behavior , vol.38 , pp. 50-67
    • Maia, T.V.1
  • 108
    • 79251569290 scopus 로고    scopus 로고
    • From reinforcement learning models to psychiatric and neurological disorders
    • Maia, T. V., & Frank, M. J. (2011). From reinforcement learning models to psychiatric and neurological disorders. Nature Neuroscience, 14, 154-162.
    • (2011) Nature Neuroscience , vol.14 , pp. 154-162
    • Maia, T.V.1    Frank, M.J.2
  • 109
    • 33644820167 scopus 로고    scopus 로고
    • Prefrontal cell activities related to monkeys' success and failure in adapting to rule changes in a Wisconsin Card Sorting test analog
    • Mansouri, F. A., Matsumoto, K., & Tanaka, K. (2006). Prefrontal cell activities related to monkeys' success and failure in adapting to rule changes in a Wisconsin Card Sorting test analog. The Journal of Neuroscience, 26, 2745-2756.
    • (2006) The Journal of Neuroscience , vol.26 , pp. 2745-2756
    • Mansouri, F.A.1    Matsumoto, K.2    Tanaka, K.3
  • 110
    • 0001657237 scopus 로고
    • Instance-based utile distinctions for reinforcement learning with hidden states
    • A. Prieditis & S. J. Russell (Eds.), San Francisco, CA: Morgan Kaufmann
    • Mccallum, R. A. (1995). Instance-based utile distinctions for reinforcement learning with hidden states. In A. Prieditis & S. J. Russell (Eds.), The proceedings of the twelfth international machine learning conference (pp. 387-395). San Francisco, CA: Morgan Kaufmann.
    • (1995) The proceedings of the twelfth international machine learning conference , pp. 387395
    • Mccallum, R.A.1
  • 111
    • 0037650217 scopus 로고    scopus 로고
    • Temporal prediction errors in a passive learning task activate human striatum
    • Mcclure, S. M., Berns, G. S., & Montague, P. R. (2003). Temporal prediction errors in a passive learning task activate human striatum. Neuron, 38, 339-346.
    • (2003) Neuron , vol.38 , pp. 339-346
    • Mcclure, S.M.1    Berns, G.S.2    Montague, P.R.3
  • 112
    • 79951823576 scopus 로고    scopus 로고
    • Ventral striatum and orbitofrontal cortex are both required for model-based, but not model-free, reinforcement learning
    • McDannald, M. A., Lucantonio, F., Burke, K. A., Niv, Y., & Schoenbaum, G. (2011). Ventral striatum and orbitofrontal cortex are both required for model-based, but not model-free, reinforcement learning. The Journal of Neuroscience, 31, 2700-2705.
    • (2011) The Journal of Neuroscience , vol.31 , pp. 2700-2705
    • McDannald, M.A.1    Lucantonio, F.2    Burke, K.A.3    Niv, Y.4    Schoenbaum, G.5
  • 114
    • 0031436055 scopus 로고    scopus 로고
    • Event-related brain potentials following incorrect feedback in a time-estimation task: Evidence for a "generic" neural system for error detection
    • Miltner, W. H. R., Braun, C. H., & Coles, M. G. H. (1997). Event-related brain potentials following incorrect feedback in a time-estimation task: Evidence for a "generic" neural system for error detection. Journal of Cognitive Neuroscience, 9, 788-798.
    • (1997) Journal of Cognitive Neuroscience , vol.9 , pp. 788-798
    • Miltner, W.H.R.1    Braun, C.H.2    Coles, M.G.H.3
  • 115
    • 0002936464 scopus 로고
    • Steps toward artificial intelligence
    • E. A. Feigenbaum & J. Feldman (Eds.), New York, NY: McGraw-Hill
    • Minsky, M. (1963). Steps toward artificial intelligence. In E. A. Feigenbaum & J. Feldman (Eds.), Computers and thought (pp. 406-450). New York, NY: McGraw-Hill.
    • (1963) Computers and thought , pp. 406450
    • Minsky, M.1
  • 116
    • 0033662350 scopus 로고    scopus 로고
    • Effects of central 5-hydroxytrptamine depletion on sensitivity to delayed and probabilistic reinforcement
    • Mobini, S., Chiang, T. J., Ho, M. Y., Bradshaw, C. M., & Szabadi, E. (2000). Effects of central 5-hydroxytrptamine depletion on sensitivity to delayed and probabilistic reinforcement. Psychopharmacology, 152, 390-397.
    • (2000) Psychopharmacology , vol.152 , pp. 390-397
    • Mobini, S.1    Chiang, T.J.2    Ho, M.Y.3    Bradshaw, C.M.4    Szabadi, E.5
  • 117
    • 0037057753 scopus 로고    scopus 로고
    • Neural economics and the biological substrates of valuation
    • Montague, P. R., & Berns, G. S. (2002). Neural economics and the biological substrates of valuation. Neuron, 36, 265-284.
    • (2002) Neuron , vol.36 , pp. 265-284
    • Montague, P.R.1    Berns, G.S.2
  • 118
    • 0029981543 scopus 로고    scopus 로고
    • A framework for mesencephalic dopamine systems based on predictive Hebbian learning
    • Montague, P. R., Dayan, P., & Sejnowski, T. J. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. The Journal of Neuroscience, 16, 1936-1947.
    • (1996) The Journal of Neuroscience , vol.16 , pp. 1936-1947
    • Montague, P.R.1    Dayan, P.2    Sejnowski, T.J.3
  • 119
    • 7244240565 scopus 로고    scopus 로고
    • Computational roles for dopamine in behavioural control
    • Montague, P. R., Hyman, S. E., & Cohen, J. D. (2004). Computational roles for dopamine in behavioural control. Nature, 431, 760-767.
    • (2004) Nature , vol.431 , pp. 760-767
    • Montague, P.R.1    Hyman, S.E.2    Cohen, J.D.3
  • 121
    • 33745978411 scopus 로고    scopus 로고
    • A comparison of abstract rules in the prefrontal cortex, premotor cortex, inferior temporal cortex, and striatum
    • Muhammad, R., Wallis, J. D., & Miller, E. K. (2006). A comparison of abstract rules in the prefrontal cortex, premotor cortex, inferior temporal cortex, and striatum. Journal of Cognitive Neuroscience, 18, 974-989.
    • (2006) Journal of Cognitive Neuroscience , vol.18 , pp. 974-989
    • Muhammad, R.1    Wallis, J.D.2    Miller, E.K.3
  • 122
    • 33646431689 scopus 로고    scopus 로고
    • Activity in the lateral prefrontal cortex reflects multiple steps of future events in action plans
    • Mushiake, H., Saito, N., Sakamoto, K., Itoyama, Y., & Tanji, J. (2006). Activity in the lateral prefrontal cortex reflects multiple steps of future events in action plans. Neuron, 50, 631-641.
    • (2006) Neuron , vol.50 , pp. 631-641
    • Mushiake, H.1    Saito, N.2    Sakamoto, K.3    Itoyama, Y.4    Tanji, J.5
  • 124
    • 67349283062 scopus 로고    scopus 로고
    • Reinforcement learning in the brain
    • Niv, Y. (2009). Reinforcement learning in the brain. Journal of Mathematical Psychology, 53, 139-154.
    • (2009) Journal of Mathematical Psychology , vol.53 , pp. 139-154
    • Niv, Y.1
  • 125
    • 0037987978 scopus 로고    scopus 로고
    • Temporal difference models and reward-related learning in the human brain
    • O'doherty, J. P., Dayan, P., Friston, K., Critchley, H., & Dolan, R. J. (2003). Temporal difference models and reward-related learning in the human brain. Neuron, 38, 329 -337.
    • (2003) Neuron , vol.38
    • O'doherty, J.P.1    Dayan, P.2    Friston, K.3    Critchley, H.4    Dolan, R.J.5
  • 126
    • 1942520195 scopus 로고    scopus 로고
    • Dissociable roles of ventral and dorsal striatum in instrumental conditioning
    • O'doherty, J. P., Dayan, P., Schultz, J., Deichmann, R., Friston, K., & Dolan, R. J. (2004). Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science, 304, 452-454.
    • (2004) Science , vol.304 , pp. 452-454
    • O'doherty, J.P.1    Dayan, P.2    Schultz, J.3    Deichmann, R.4    Friston, K.5    Dolan, R.J.6
  • 127
    • 34447643062 scopus 로고    scopus 로고
    • Model-based fMRI and its application to reward learning and decision making
    • O'doherty, J. P., Hampton, A., & Kim, H. (2007). Model-based fMRI and its application to reward learning and decision making. Annals of the New York Academy of Science, 1104, 35-53.
    • (2007) Annals of the New York Academy of Science , vol.1104 , pp. 35-53
    • O'doherty, J.P.1    Hampton, A.2    Kim, H.3
  • 129
    • 33644927837 scopus 로고    scopus 로고
    • Making working memory work: A computational model of learning in prefrontal cortex and basal ganglia
    • O'Reilly, R. C., & Frank, M. J. (2006). Making working memory work: A computational model of learning in prefrontal cortex and basal ganglia. Neural Computation, 18, 283-328.
    • (2006) Neural Computation , vol.18 , pp. 283-328
    • O'Reilly, R.C.1    Frank, M.J.2
  • 130
    • 23944507547 scopus 로고    scopus 로고
    • Lesions of medial prefrontal cortex disrupt the acquisition but not the expression of goal-directed learning
    • Ostlund, S. B., & Balleine, B. W. (2005). Lesions of medial prefrontal cortex disrupt the acquisition but not the expression of goal-directed learning. The Journal of Neuroscience, 25, 7763-7770.
    • (2005) The Journal of Neuroscience , vol.25 , pp. 7763-7770
    • Ostlund, S.B.1    Balleine, B.W.2
  • 132
    • 0030722121 scopus 로고    scopus 로고
    • Cognitive planning in humans: Neuropsychological, neuroanatomical and neuropharmacological perspectives
    • Owen, A. M. (1997). Cognitive planning in humans: Neuropsychological, neuroanatomical and neuropharmacological perspectives. Progress in Neurobiology, 53, 431-450.
    • (1997) Progress in Neurobiology , vol.53 , pp. 431-450
    • Owen, A.M.1
  • 134
    • 0036308524 scopus 로고    scopus 로고
    • Learning and memory functions of the basal ganglia
    • Packard, M. G., & Knowlton, B. J. (2002). Learning and memory functions of the basal ganglia. Annual Review of Neuroscience, 25, 563-593.
    • (2002) Annual Review of Neuroscience , vol.25 , pp. 563-593
    • Packard, M.G.1    Knowlton, B.J.2
  • 135
    • 0036159133 scopus 로고    scopus 로고
    • Activity in human ventral striatum locked to errors of reward prediction
    • Pagnoni, G., Zink, C. F., Montague, P. R., & Berns, G. S. (2002). Activity in human ventral striatum locked to errors of reward prediction. Nature Neuroscience, 5, 97-98.
    • (2002) Nature Neuroscience , vol.5 , pp. 97-98
    • Pagnoni, G.1    Zink, C.F.2    Montague, P.R.3    Berns, G.S.4
  • 136
    • 21544455210 scopus 로고    scopus 로고
    • Dopamine cells respond to predicted events during classical conditioning: Evidence for eligibility traces in the reward learning network
    • Pan, W. X., Schmidt, R., Wickens, J. R., & Hyland, B. I. (2005). Dopamine cells respond to predicted events during classical conditioning: Evidence for eligibility traces in the reward learning network. The Journal of Neuroscience, 25, 6235-6242.
    • (2005) The Journal of Neuroscience , vol.25 , pp. 6235-6242
    • Pan, W.X.1    Schmidt, R.2    Wickens, J.R.3    Hyland, B.I.4
  • 138
    • 34548651404 scopus 로고    scopus 로고
    • Orbitofrontal cortex encodes willingness to pay in everyday economic transactions
    • Plassmann, H., O'doherty, J., & Rangel, A. (2007). Orbitofrontal cortex encodes willingness to pay in everyday economic transactions. The Journal of Neuroscience, 27, 9984-9988.
    • (2007) The Journal of Neuroscience , vol.27 , pp. 9984-9988
    • Plassmann, H.1    O'doherty, J.2    Rangel, A.3
  • 139
    • 0002253315 scopus 로고
    • Psychophysiology of N200/N400: A review and classification scheme
    • J. R. Jennings, P. K. Ackles, & M. G. H. Coles (Eds.), London, England: Jessica Kingsley
    • Pritchard, W. S., Shappell, S. A., & Brandt, M. E. (1991). Psychophysiology of N200/N400: A review and classification scheme. In J. R. Jennings, P. K. Ackles, & M. G. H. Coles (Eds.), Advances in psychophysiology (Vol. 4, pp. 43-106). London, England: Jessica Kingsley.
    • (1991) Advances in psychophysiology , vol.4 , pp. 43-106
    • Pritchard, W.S.1    Shappell, S.A.2    Brandt, M.E.3
  • 140
    • 0029018495 scopus 로고
    • Self-control: Beyond commitment
    • Rachlin, H. (1995). Self-control: Beyond commitment. Behavioral and Brain Sciences, 18, 109-159.
    • (1995) Behavioral and Brain Sciences , vol.18 , pp. 109-159
    • Rachlin, H.1
  • 141
    • 45749098894 scopus 로고    scopus 로고
    • A framework for studying the neurobiology of value-based decision making
    • Rangel, A., Camerer, C., & Montague, P. R. (2008). A framework for studying the neurobiology of value-based decision making. Nature Reviews Neuroscience, 9, 545-556.
    • (2008) Nature Reviews Neuroscience , vol.9 , pp. 545-556
    • Rangel, A.1    Camerer, C.2    Montague, P.R.3
  • 142
    • 79960241771 scopus 로고    scopus 로고
    • Decision making under uncertainty: A neural model based on partially observable Markov decision processes
    • Rao, R. P. N. (2010). Decision making under uncertainty: A neural model based on partially observable Markov decision processes. Frontiers in Computational Neuroscience, 4, 146.
    • (2010) Frontiers in Computational Neuroscience , vol.4 , pp. 146
    • Rao, R.P.N.1
  • 144
    • 48349092693 scopus 로고    scopus 로고
    • A unified framework for addiction: Vulnerabilities in the decision process
    • Redish, A. D., Jensen, S., & Johnson, A. (2008). A unified framework for addiction: Vulnerabilities in the decision process. Behavioral and Brain Sciences, 31, 415-437.
    • (2008) Behavioral and Brain Sciences , vol.31 , pp. 415-437
    • Redish, A.D.1    Jensen, S.2    Johnson, A.3
  • 145
    • 34548837994 scopus 로고    scopus 로고
    • Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling
    • Redish, A. D., Jensen, S., Johnson, A., & Kurth-Nelson, Z. (2007). Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling. Psychological Review, 114, 784-805.
    • (2007) Psychological Review , vol.114 , pp. 784-805
    • Redish, A.D.1    Jensen, S.2    Johnson, A.3    Kurth-Nelson, Z.4
  • 146
    • 0002109138 scopus 로고
    • A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
    • A. H. Black & W. F. Prokasy (Eds.), New York, NY: Appleton-Century-Crofts
    • Rescorla, R. A., & Wagner, A. R. (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In A. H. Black & W. F. Prokasy (Eds.), Classical conditioning II: Current research and theory (pp. 64-99). New York, NY: Appleton-Century-Crofts.
    • (1972) Classical conditioning II: Current research and theory , pp. 6499
    • Rescorla, R.A.1    Wagner, A.R.2
  • 147
    • 33751168257 scopus 로고    scopus 로고
    • A review of delay-discounting research with humans: Relations to drug use and gambling
    • Reynolds, B. (2006). A review of delay-discounting research with humans: Relations to drug use and gambling. Behavioural Pharmacology, 17, 651-667.
    • (2006) Behavioural Pharmacology , vol.17 , pp. 651-667
    • Reynolds, B.1
  • 148
    • 0035817882 scopus 로고    scopus 로고
    • A cellular mechanism of reward-related learning
    • Reynolds, J. N. J., Hyland, B. I., & Wickens, J. R. (2001). A cellular mechanism of reward-related learning. Nature, 413, 67-70.
    • (2001) Nature , vol.413 , pp. 67-70
    • Reynolds, J.N.J.1    Hyland, B.I.2    Wickens, J.R.3
  • 149
    • 0036592025 scopus 로고    scopus 로고
    • Dopamine-dependent plasticity of corticostriatal synapses
    • Reynolds, J. N. J., & Wickens, J. R. (2002). Dopamine-dependent plasticity of corticostriatal synapses. Neural Networks, 15, 507-521
    • (2002) Neural Networks , vol.15 , pp. 507-521
    • Reynolds, J.N.J.1    Wickens, J.R.2
  • 152
    • 36448968271 scopus 로고    scopus 로고
    • Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards
    • Roesch, M. R., Calu, D. J., & Schoenbaum, G. (2007). Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nature Neuroscience, 10, 1615-1624.
    • (2007) Nature Neuroscience , vol.10 , pp. 1615-1624
    • Roesch, M.R.1    Calu, D.J.2    Schoenbaum, G.3
  • 154
    • 77957728784 scopus 로고    scopus 로고
    • Testing the reward prediction error hypothesis with an axiomatic model
    • Rutledge, R. B., Dean, M., Caplin, A., & Glimcher, P. W. (2010). Testing the reward prediction error hypothesis with an axiomatic model. The Journal of Neuroscience, 30, 13525-13536.
    • (2010) The Journal of Neuroscience , vol.30 , pp. 13525-13536
    • Rutledge, R.B.1    Dean, M.2    Caplin, A.3    Glimcher, P.W.4
  • 155
    • 25144449580 scopus 로고    scopus 로고
    • Representation of immediate and final behavioral goals in the monkey prefrontal cortex during an instructed delay period
    • Saito, N., Mushiake, H., Sakamoto, K., Itoyama, Y., & Tanji, J. (2005). Representation of immediate and final behavioral goals in the monkey prefrontal cortex during an instructed delay period. Cerebral Cortex, 15, 1535-1546.
    • (2005) Cerebral Cortex , vol.15 , pp. 1535-1546
    • Saito, N.1    Mushiake, H.2    Sakamoto, K.3    Itoyama, Y.4    Tanji, J.5
  • 156
    • 0003297918 scopus 로고
    • Some studies in machine learning using the game of checkers
    • In E. A. Feigenbaum & J. Feldman (Eds.) New York, NY: McGraw-Hill. (Reprinted from 1959, IBM Journal of Research and Development, 3, pp. 211-229)
    • Samuel, A. L. (1995). Some studies in machine learning using the game of checkers. In E. A. Feigenbaum & J. Feldman (Eds.), Computers and thought (pp. 71-105). New York, NY: McGraw-Hill. (Reprinted from 1959, IBM Journal of Research and Development, 3, pp. 211-229)
    • (1995) Computers and thought , pp. 71-105
    • Samuel, A.L.1
  • 157
    • 0242440823 scopus 로고    scopus 로고
    • Correlated coding of motivation and outcome of decision by dopamine neurons
    • Satoh, T., Nakai, S., Sato, T., & Kimura, M. (2003). Correlated coding of motivation and outcome of decision by dopamine neurons. The Journal of Neuroscience, 23, 9913-9923.
    • (2003) The Journal of Neuroscience , vol.23 , pp. 9913-9923
    • Satoh, T.1    Nakai, S.2    Sato, T.3    Kimura, M.4
  • 158
    • 34548013298 scopus 로고    scopus 로고
    • Remembering the past to imagine the future: The prospective brain
    • Schacter, D. L., Addis, D. R., & Buckner, R. L. (2007). Remembering the past to imagine the future: The prospective brain. Nature Reviews Neuroscience, 8, 657-661.
    • (2007) Nature Reviews Neuroscience , vol.8 , pp. 657-661
    • Schacter, D.L.1    Addis, D.R.2    Buckner, R.L.3
  • 159
    • 0031867046 scopus 로고    scopus 로고
    • Predictive reward signal of dopamine neurons
    • Schultz, W. (1998). Predictive reward signal of dopamine neurons. Journal of Neurophysiology, 80, 1-27.
    • (1998) Journal of Neurophysiology , vol.80 , pp. 1-27
    • Schultz, W.1
  • 160
    • 0027468102 scopus 로고
    • Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task
    • Schultz, W., Apicella, P., & Ljungberg, T. (1993). Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task. The Journal of Neuroscience, 13, 900-913.
    • (1993) The Journal of Neuroscience , vol.13 , pp. 900-913
    • Schultz, W.1    Apicella, P.2    Ljungberg, T.3
  • 161
    • 0030896968 scopus 로고    scopus 로고
    • A neural substrate of prediction and reward
    • Schultz, W., Dayan, P., & Montague, P. R. (1997). A neural substrate of prediction and reward. Science, 275, 1593-1599.
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 164
    • 79955709936 scopus 로고    scopus 로고
    • Neural correlates of forward planning in a spatial decision task in humans
    • Simon, D. A., & Daw, N. D. (2011). Neural correlates of forward planning in a spatial decision task in humans. The Journal of Neuroscience, 31, 5526-5539.
    • (2011) The Journal of Neuroscience , vol.31 , pp. 5526-5539
    • Simon, D.A.1    Daw, N.D.2
  • 165
    • 84881118309 scopus 로고    scopus 로고
    • Melioration as rational choice: Sequential decision making in uncertain environments
    • Sims, C. R., Neth, H., Jacobs, R. A., & Gray, W. D. (2013). Melioration as rational choice: Sequential decision making in uncertain environments. Psychological Review, 120, 139-154.
    • (2013) Psychological Review , vol.120 , pp. 139-154
    • Sims, C.R.1    Neth, H.2    Jacobs, R.A.3    Gray, W.D.4
  • 166
    • 0029753630 scopus 로고    scopus 로고
    • Reinforcement learning with replacing eligibility traces
    • Singh, S. P., & Sutton, R. S. (1996). Reinforcement learning with replacing eligibility traces. Machine Learning, 22, 123-158.
    • (1996) Machine Learning , vol.22 , pp. 123-158
    • Singh, S.P.1    Sutton, R.S.2
  • 167
    • 84894466619 scopus 로고
    • The behavior of organisms: An experimental analysis
    • Skinner, B. F. (1938). The behavior of organisms: An experimental analysis. Oxford, England: Appleton-Century.
    • (1938) Oxford, England , pp. 61-84
    • Skinner, B.F.1
  • 168
    • 33646230819 scopus 로고    scopus 로고
    • Dopamine, prediction error and associative learning: A model-based account
    • Smith, A., Li, M., Becker, S., & Kapur, S. (2006). Dopamine, prediction error and associative learning: A model-based account. Network: Computation in Neural Systems, 17, 61- 84.
    • (2006) Network: Computation in Neural Systems , vol.17 , pp. 61-84
    • Smith, A.1    Li, M.2    Becker, S.3    Kapur, S.4
  • 169
    • 84859737036 scopus 로고    scopus 로고
    • Goal-directed decision making as probabilistic inference: A computational framework and potential neural correlates
    • Solway, A., & Botvinick, M. M. (2012). Goal-directed decision making as probabilistic inference: A computational framework and potential neural correlates. Psychological Review, 119, 120-154.
    • (2012) Psychological Review , vol.119 , pp. 120-154
    • Solway, A.1    Botvinick, M.M.2
  • 170
    • 33745078823 scopus 로고
    • The order of eliminating blinds in maze learning by the rat
    • Spence, K. W. (1932). The order of eliminating blinds in maze learning by the rat. Journal of Comparative Psychology, 14, 9-27.
    • (1932) Journal of Comparative Psychology , vol.14 , pp. 9-27
    • Spence, K.W.1
  • 173
    • 77953152738 scopus 로고    scopus 로고
    • Conditional routing of information to the cortex: A model of the basal ganglia's role in cognitive coordination
    • Stocco, A., Lebiere, C., & Anderson, J. R. (2010). Conditional routing of information to the cortex: A model of the basal ganglia's role in cognitive coordination. Psychological Review, 117, 541-574.
    • (2010) Psychological Review , vol.117 , pp. 541-574
    • Stocco, A.1    Lebiere, C.2    Anderson, J.R.3
  • 174
    • 84864680808 scopus 로고    scopus 로고
    • The cerebellum and cognition: Evidence from functional imaging studies
    • Stoodley, C. J. (2012). The cerebellum and cognition: Evidence from functional imaging studies. The Cerebellum, 11, 352-365.
    • (2012) The Cerebellum , vol.11 , pp. 352-365
    • Stoodley, C.J.1
  • 176
    • 0002995053 scopus 로고
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • B. W. Porter & R. J. Mooney (Eds.), San Francisco, CA: Morgan Kaufmann
    • Sutton, R. S. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In B. W. Porter & R. J. Mooney (Eds.), Proceedings of the seventh international conference on machine learning (pp. 216-224). San Francisco, CA: Morgan Kaufmann.
    • (1990) Proceedings of the seventh international conference on machine learning , pp. 216224
    • Sutton, R.S.1
  • 180
    • 48549088919 scopus 로고    scopus 로고
    • Calculating consequences: Brain systems that encode the causal effects of actions
    • Tanaka, S. C., Balleine, B. W., & O'doherty, J. P. (2008). Calculating consequences: Brain systems that encode the causal effects of actions. The Journal of Neuroscience, 28, 6750-6755.
    • (2008) The Journal of Neuroscience , vol.28 , pp. 6750-6755
    • Tanaka, S.C.1    Balleine, B.W.2    O'doherty, J.P.3
  • 182
    • 0003033145 scopus 로고
    • A critical review of latent learning and related experiments
    • Thistlethwaite, D. (1951). A critical review of latent learning and related experiments. Psychological Bulletin, 48, 97-129.
    • (1951) Psychological Bulletin , vol.48 , pp. 97-129
    • Thistlethwaite, D.1
  • 183
    • 0002210775 scopus 로고
    • The role of exploration in learning control
    • D. A. White & D. A. Sofge (Eds.), Florence, KY: Van Nostrand Reinhold
    • Thrun, S. B. (1992). The role of exploration in learning control. In D. A. White & D. A. Sofge (Eds.), Handbook of intelligent control: Neural, fuzzy and adaptive approaches (pp. 527-554). Florence, KY: Van Nostrand Reinhold.
    • (1992) Handbook of intelligent control: Neural, fuzzy and adaptive approaches , pp. 527554
    • Thrun, S.B.1
  • 184
    • 14844349975 scopus 로고    scopus 로고
    • Adaptive coding of reward value by dopamine neurons
    • Tobler, P. N., Fiorillo, C. D., & Schultz, W. (2005). Adaptive coding of reward value by dopamine neurons. Science, 307, 1642-1645.
    • (2005) Science , vol.307 , pp. 1642-1645
    • Tobler, P.N.1    Fiorillo, C.D.2    Schultz, W.3
  • 185
    • 33644806981 scopus 로고    scopus 로고
    • Human neural learning depends on reward prediction errors in the blocking paradigm
    • Tobler, P. N., O'doherty, J. P., Dolan, R. J., & Schultz, W. (2005). Human neural learning depends on reward prediction errors in the blocking paradigm. Journal of Neurophysiology, 95, 301-310.
    • (2005) Journal of Neurophysiology , vol.95 , pp. 301-310
    • Tobler, P.N.1    O'doherty, J.P.2    Dolan, R.J.3    Schultz, W.4
  • 186
    • 77549088095 scopus 로고    scopus 로고
    • Learning to use working memory in partially observable environments through dopaminergic reinforcement
    • D. Koller, D. Schuurmans, Y. Bengio, & L. Bottou (Eds.), Cambridge, MA: MIT Press
    • Todd, M. T., Niv, Y., & Cohen, J. D. (2009). Learning to use working memory in partially observable environments through dopaminergic reinforcement. In D. Koller, D. Schuurmans, Y. Bengio, & L. Bottou (Eds.), Advances in neural information processing systems (pp. 1689- 1696). Cambridge, MA: MIT Press.
    • (2009) Advances in neural information processing systems
    • Todd, M.T.1    Niv, Y.2    Cohen, J.D.3
  • 189
    • 66449119919 scopus 로고    scopus 로고
    • A specific role for posterior dorsolateral striatum in human habit learning
    • Tricomi, E., Balleine, B. W., & O'doherty, J. P. (2009). A specific role for posterior dorsolateral striatum in human habit learning. European Journal of Neuroscience, 29, 2225-2232.
    • (2009) European Journal of Neuroscience , vol.29 , pp. 2225-2232
    • Tricomi, E.1    Balleine, B.W.2    O'doherty, J.P.3
  • 190
    • 1642534402 scopus 로고    scopus 로고
    • Modulation of caudate activity by action contingency
    • Tricomi, E. M., Delgado, M. R., & Fiez, J. A. (2004). Modulation of caudate activity by action contingency. Neuron, 41, 281-292.
    • (2004) Neuron , vol.41 , pp. 281-292
    • Tricomi, E.M.1    Delgado, M.R.2    Fiez, J.A.3
  • 192
    • 34247147767 scopus 로고    scopus 로고
    • Determining the neural substrates of goal-directed learning in the human brain
    • Valentin, V. V., Dickinson, A., & O'doherty, J. P. (2007). Determining the neural substrates of goal-directed learning in the human brain. The Journal of Neuroscience, 27, 4019-4026
    • (2007) The Journal of Neuroscience , vol.27 , pp. 4019-4026
    • Valentin, V.V.1    Dickinson, A.2    O'doherty, J.P.3
  • 193
  • 194
    • 0037092472 scopus 로고    scopus 로고
    • The timing of action-monitoring processes in the anterior cingulate cortex
    • van Veen, V., & Carter, C. S. (2002). The timing of action-monitoring processes in the anterior cingulate cortex. Journal of Cognitive Neuroscience, 14, 593-602.
    • (2002) Journal of Cognitive Neuroscience , vol.14 , pp. 593-602
    • van Veen, V.1    Carter, C.S.2
  • 195
    • 0035811464 scopus 로고    scopus 로고
    • Dopamine responses comply with basic assumptions of formal learning theory
    • Waelti, P., Dickinson, A., & Schultz, W. (2001). Dopamine responses comply with basic assumptions of formal learning theory. Nature, 412, 43-48.
    • (2001) Nature , vol.412 , pp. 43-48
    • Waelti, P.1    Dickinson, A.2    Schultz, W.3
  • 196
    • 63149124215 scopus 로고    scopus 로고
    • The strategic nature of changing your mind
    • Walsh, M. M., & Anderson, J. R. (2009). The strategic nature of changing your mind. Cognitive Psychology, 58, 416-440.
    • (2009) Cognitive Psychology , vol.58 , pp. 416-440
    • Walsh, M.M.1    Anderson, J.R.2
  • 197
    • 80051661786 scopus 로고    scopus 로고
    • Learning from delayed feedback: Neural responses in temporal credit assignment
    • Walsh, M. M., & Anderson, J. R. (2011a). Learning from delayed feedback: Neural responses in temporal credit assignment. Cognitive, Affective & Behavioral Neuroscience, 11, 131-143.
    • (2011) Cognitive, Affective & Behavioral Neuroscience , vol.11 , pp. 131-143
    • Walsh, M.M.1    Anderson, J.R.2
  • 199
    • 84864813064 scopus 로고    scopus 로고
    • Learning from experience: Event-related potential correlates of reward processing, neural adaptation, and behavioral choice
    • Walsh, M. M., & Anderson, J. R. (2012). Learning from experience: Event-related potential correlates of reward processing, neural adaptation, and behavioral choice. Neuroscience and Biobehavioral Reviews, 36, 1870-1884.
    • (2012) Neuroscience and Biobehavioral Reviews , vol.36 , pp. 1870-1884
    • Walsh, M.M.1    Anderson, J.R.2
  • 200
    • 84875959090 scopus 로고    scopus 로고
    • The importance of action history in decision making and reinforcement learning
    • R. L. Lewis, T. A. Polk, & J. E. Laird (Eds.), Ann Arbor, MI
    • Wang, Y., & Laird, J. E. (2007). The importance of action history in decision making and reinforcement learning. In R. L. Lewis, T. A. Polk, & J. E. Laird (Eds.), Proceedings of the eighth international conference on cognitive modeling (pp. 85-90). Ann Arbor, MI.
    • (2007) Proceedings of the eighth international conference on cognitive modeling (pp. 85-90)
    • Wang, Y.1    Laird, J.E.2
  • 201
    • 0008573205 scopus 로고    scopus 로고
    • When more means less: Factors affecting human self-control in a local versus global choice paradigm
    • Warry, C. J., Remington, B., & Sonuga-Barke, E. J. S. (1999). When more means less: Factors affecting human self-control in a local versus global choice paradigm. Learning and Motivation, 30, 53-73.
    • (1999) Learning and Motivation , vol.30 , pp. 53-73
    • Warry, C.J.1    Remington, B.2    Sonuga-Barke, E.J.S.3
  • 202
    • 84860166687 scopus 로고    scopus 로고
    • Phasic mesolimbic dopamine signaling precedes and predicts performance of a self-initiated action sequence task
    • Wassum, K. M., Ostlund, S. B., & Maidment, N. T. (2012). Phasic mesolimbic dopamine signaling precedes and predicts performance of a self-initiated action sequence task. Biological Psychiatry, 71, 846-854.
    • (2012) Biological Psychiatry , vol.71 , pp. 846-854
    • Wassum, K.M.1    Ostlund, S.B.2    Maidment, N.T.3
  • 203
    • 0033006462 scopus 로고    scopus 로고
    • Rule-dependent neuronal activity in the prefrontal cortex. Experimental
    • White, I. M., & Wise, S. P. (1999). Rule-dependent neuronal activity in the prefrontal cortex. Experimental Brain Research, 126, 315-335.
    • (1999) Brain Research , vol.126 , pp. 315-335
    • White, I.M.1    Wise, S.P.2
  • 204
    • 0029655991 scopus 로고    scopus 로고
    • Dopamine reverses the depression of rat corticostriatal synapses which normally follows high-frequency stimulation of cortex in vitro
    • Wickens, J. R., Begg, A. J., & Arbuthnott, G. W. (1996). Dopamine reverses the depression of rat corticostriatal synapses which normally follows high-frequency stimulation of cortex in vitro. Neuroscience, 70, 1-5.
    • (1996) Neuroscience , vol.70 , pp. 1-5
    • Wickens, J.R.1    Begg, A.J.2    Arbuthnott, G.W.3
  • 205
    • 84860307045 scopus 로고    scopus 로고
    • Mapping value based planning and extensively trained choice in the human brain
    • Wunderlich, K., Dayan, P., & Dolan, R. J. (2012). Mapping value based planning and extensively trained choice in the human brain. Nature Neuroscience, 15, 786-791.
    • (2012) Nature Neuroscience , vol.15 , pp. 786-791
    • Wunderlich, K.1    Dayan, P.2    Dolan, R.J.3
  • 207
    • 1442274999 scopus 로고    scopus 로고
    • Melioration and the transition from touch-typing training to everyday use
    • Yechiam, E., Erev, I., Yehene, V., & Gopher, D. (2003). Melioration and the transition from touch-typing training to everyday use. Human Factors, 45, 671-684.
    • (2003) Human Factors , vol.45 , pp. 671-684
    • Yechiam, E.1    Erev, I.2    Yehene, V.3    Gopher, D.4
  • 208
    • 3042570744 scopus 로고    scopus 로고
    • The neural basis of error detection: Conflict monitoring and the error-related negativity
    • Yeung, N., Botvinick, M. M., & Cohen, J. D. (2004). The neural basis of error detection: Conflict monitoring and the error-related negativity. Psychological Review, 111, 931-959.
    • (2004) Psychological Review , vol.111 , pp. 931-959
    • Yeung, N.1    Botvinick, M.M.2    Cohen, J.D.3
  • 209
    • 1642580578 scopus 로고    scopus 로고
    • Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning
    • Yin, H. H., Knowlton, B. J., & Balleine, B. W. (2004). Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. European Journal of Neuroscience, 19, 181-189.
    • (2004) European Journal of Neuroscience , vol.19 , pp. 181-189
    • Yin, H.H.1    Knowlton, B.J.2    Balleine, B.W.3
  • 210
    • 33646853495 scopus 로고    scopus 로고
    • Resolution of uncertainty in prefrontal cortex
    • Yoshida, W., & Ishii, S. (2006). Resolution of uncertainty in prefrontal cortex. Neuron, 50, 781-789.
    • (2006) Neuron , vol.50 , pp. 781-789
    • Yoshida, W.1    Ishii, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.