2012, Pages 33-52

Models of Value and Choice

Author keywords

Model based control; Model free control; Motivation; Pavlovian control; Reinforcement learning; Utility

EID: 84882459170     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1016/B978-0-12-381431-9.00002-4     Document Type: Chapter
Times cited : (9)

References (89)
  • 1
    • 0004245883
    • Cambridge University Press, Cambridge, UK
    • Ainslie G. Breakdown of will 2001, Cambridge University Press, Cambridge, UK.
    • (2001) Breakdown of will
    • Ainslie, G.1
  • 2
    • 0025321039
    • Functional architecture of basal ganglia circuits: Neural substrates of parallel processing
    • Alexander G.E., Crutcher M.D. Functional architecture of basal ganglia circuits: Neural substrates of parallel processing. Trends in Neurosciences 1990, 13(7):266-271.
    • (1990) Trends in Neurosciences , vol.13 , Issue.7 , pp. 266-271
    • Alexander, G.E.1    Crutcher, M.D.2
  • 3
    • 28444472936
    • Neural bases of food-seeking: Affect, arousal and reward in corticostriatolimbic circuits
    • Balleine B.W. Neural bases of food-seeking: Affect, arousal and reward in corticostriatolimbic circuits. Physiology & Behavior 2005, 86(5):717-730.
    • (2005) Physiology & Behavior , vol.86 , Issue.5 , pp. 717-730
    • Balleine, B.W.1
  • 4
    • 72049125602
    • Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action
    • Balleine B.W., O'Doherty J.P. Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology 2010, 35(1):48-69.
    • (2010) Neuropsychopharmacology , vol.35 , Issue.1 , pp. 48-69
    • Balleine, B.W.1    O'Doherty, J.P.2
  • 5
    • 0000541213
    • Adaptive critics and the basal ganglia
    • MIT Press, Cambridge MA, J. Houk, J. Davis, D. Beiser (Eds.)
    • Barto A. Adaptive critics and the basal ganglia. Models of information processing in the Basal Ganglia 1995, 215-232. MIT Press, Cambridge MA. J. Houk, J. Davis, D. Beiser (Eds.).
    • (1995) Models of information processing in the Basal Ganglia , pp. 215-232
    • Barto, A.1
  • 7
    • 33749651693
    • Intrinsically motivated learning of hierarchical collections of skills
    • Proceedings of international conference of developmental learning, San Diego, CA.
    • Barto, A., Singh, S., & Chentanez, N. (2004). Intrinsically motivated learning of hierarchical collections of skills. In Proceedings of international conference of developmental learning, San Diego, CA.
    • (2004)
    • Barto, A.1    Singh, S.2    Chentanez, N.3
  • 8
    • 2442701355
    • Motivation concepts in behavioral neuroscience
    • Berridge K.C. Motivation concepts in behavioral neuroscience. Physiology & Behavior 2004, 81:179-209.
    • (2004) Physiology & Behavior , vol.81 , pp. 179-209
    • Berridge, K.C.1
  • 9
    • 33847634405
    • The debate over dopamine's role in reward: The case for incentive salience
    • Berridge K.C. The debate over dopamine's role in reward: The case for incentive salience. Psychopharmacology (Berl) 2007, 191(3):391-431.
    • (2007) Psychopharmacology (Berl) , vol.191 , Issue.3 , pp. 391-431
    • Berridge, K.C.1
  • 12
    • 33750347385
    • The physics of optimal decision making: A formal analysis of models of performance in two-alternative forced-choice tasks
    • Bogacz R., Brown E., Moehlis J., Holmes P., Cohen J.D. The physics of optimal decision making: A formal analysis of models of performance in two-alternative forced-choice tasks. Psychological Review 2006, 113(4):700-765.
    • (2006) Psychological Review , vol.113 , Issue.4 , pp. 700-765
    • Bogacz, R.1    Brown, E.2    Moehlis, J.3    Holmes, P.4    Cohen, J.D.5
  • 13
    • 58149417523
    • Species-specific defense reactions and avoidance learning
    • Bolles R.C. Species-specific defense reactions and avoidance learning. Psychological Review 1970, 77:32-48.
    • (1970) Psychological Review , vol.77 , pp. 32-48
    • Bolles, R.C.1
  • 14
    • 78649651245
    • Opponency revisited: Competition and cooperation between dopamine and serotonin
    • Boureau Y.-L., Dayan P. Opponency revisited: Competition and cooperation between dopamine and serotonin. Neuropsychopharmacology 2011, 36:74-97.
    • (2011) Neuropsychopharmacology , vol.36 , pp. 74-97
    • Boureau, Y.-L.1    Dayan, P.2
  • 15
    • 77956971930
    • Pavlovian processes in consumer choice: The physical presence of a good increases willingness-to-pay
    • Bushong B., King L., Camerer C., Rangel A. Pavlovian processes in consumer choice: The physical presence of a good increases willingness-to-pay. American Economic Review 2010, 100:1-18.
    • (2010) American Economic Review , vol.100 , pp. 1-18
    • Bushong, B.1    King, L.2    Camerer, C.3    Rangel, A.4
  • 17
    • 34247842923
    • Western scrub-jays anticipate future needs independently of their current motivational state
    • Correia S.P.C., Dickinson A., Clayton N.S. Western scrub-jays anticipate future needs independently of their current motivational state. Current Biology 2007, 17(10):856-861.
    • (2007) Current Biology , vol.17 , Issue.10 , pp. 856-861
    • Correia, S.P.C.1    Dickinson, A.2    Clayton, N.S.3
  • 18
    • 79952746011
    • Model-based influences on humans' choices and striatal prediction errors
    • Daw N.D., Gershman S.J., Seymour B., Dayan P., Dolan R.J. Model-based influences on humans' choices and striatal prediction errors. Neuron 2011, 69(6):1204-1215.
    • (2011) Neuron , vol.69 , Issue.6 , pp. 1204-1215
    • Daw, N.D.1    Gershman, S.J.2    Seymour, B.3    Dayan, P.4    Dolan, R.J.5
  • 19
    • 28044450875
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw N.D., Niv Y., Dayan P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nature Neuroscience 2005, 8(12):1704-1711.
    • (2005) Nature Neuroscience , vol.8 , Issue.12 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 20
    • 33749055062
    • The misbehavior of value and the discipline of the will
    • Dayan P., Niv Y., Seymour B., Daw N.D. The misbehavior of value and the discipline of the will. Neural Networks 2006, 19(8):1153-1160.
    • (2006) Neural Networks , vol.19 , Issue.8 , pp. 1153-1160
    • Dayan, P.1    Niv, Y.2    Seymour, B.3    Daw, N.D.4
  • 24
    • 84882564459
    • Values and actions in aversion
    • Academic Press, New York, NY, P. Glimcher, C. Camerer, R. Poldrack, E. Fehr (Eds.)
    • Dayan P., Seymour B. Values and actions in aversion. Neuroeconomics: Decision making and the brain 2008, 175-191. Academic Press, New York, NY. P. Glimcher, C. Camerer, R. Poldrack, E. Fehr (Eds.).
    • (2008) Neuroeconomics: Decision making and the brain , pp. 175-191
    • Dayan, P.1    Seymour, B.2
  • 26
    • 33746896990
    • Frames, biases, and rational decision-making in the human brain
    • De Martino B., Kumaran D., Seymour B., Dolan R.J. Frames, biases, and rational decision-making in the human brain. Science 2006, 313(5787):684-687.
    • (2006) Science , vol.313 , Issue.5787 , pp. 684-687
    • De Martino, B.1    Kumaran, D.2    Seymour, B.3    Dolan, R.J.4
  • 27
    • 0031619316
    • Bayesian Q-learning
    • Proceedings of the fifteenth National/tenth Conference on Artificial intelligence/Innovative Applications of Artificial Intelligence Table of Contents, Menlo Park, CA: American Association for Artificial Intelligence.
    • Dearden, R., Friedman, N., & Russell, S. (1998). Bayesian Q-learning. In Proceedings of the fifteenth National/tenth Conference on Artificial intelligence/Innovative Applications of Artificial Intelligence Table of Contents, pp. 761-768. Menlo Park, CA: American Association for Artificial Intelligence.
    • (1998) , pp. 761-768
    • Dearden, R.1    Friedman, N.2    Russell, S.3
  • 29
    • 0043250430
    • The role of learning in the operation of motivational systems
    • Wiley, New York, NY
    • Dickinson A., Balleine B. The role of learning in the operation of motivational systems. Stevens' handbook of experimental psychology 2002, Vol. 3:497-534. Wiley, New York, NY.
    • (2002) Stevens' handbook of experimental psychology , vol.3 , pp. 497-534
    • Dickinson, A.1    Balleine, B.2
  • 31
    • 49049085303
    • Mesolimbic dopamine in desire and dread: Enabling motivation to be generated by localized glutamate disruptions in nucleus accumbens
    • Faure A., Reynolds S.M., Richard J.M., Berridge K.C. Mesolimbic dopamine in desire and dread: Enabling motivation to be generated by localized glutamate disruptions in nucleus accumbens. Journal of Neuroscience 2008, 28(28):7184-7192.
    • (2008) Journal of Neuroscience , vol.28 , Issue.28 , pp. 7184-7192
    • Faure, A.1    Reynolds, S.M.2    Richard, J.M.3    Berridge, K.C.4
  • 32
    • 33645458694
    • Reverse replay of behavioural sequences in hippocampal place cells during the awake state
    • Foster D.J., Wilson M.A. Reverse replay of behavioural sequences in hippocampal place cells during the awake state. Nature 2006, 440(7084):680-683.
    • (2006) Nature , vol.440 , Issue.7084 , pp. 680-683
    • Foster, D.J.1    Wilson, M.A.2
  • 33
    • 10344250993
    • By carrot or by stick: Cognitive reinforcement learning in parkinsonism
    • Frank M.J., Seeberger L.C., O'Reilly R.C. By carrot or by stick: Cognitive reinforcement learning in parkinsonism. Science 2004, 306(5703):1940-1943.
    • (2004) Science , vol.306 , Issue.5703 , pp. 1940-1943
    • Frank, M.J.1    Seeberger, L.C.2    O'Reilly, R.C.3
  • 34
    • 33744550336
    • Anatomy of a decision: Striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal
    • Frank M.J., Claus E.D. Anatomy of a decision: Striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal. Psychological Review 2006, 113(2):300-326.
    • (2006) Psychological Review , vol.113 , Issue.2 , pp. 300-326
    • Frank, M.J.1    Claus, E.D.2
  • 36
    • 77953260848
    • States versus rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning
    • Gläscher J., Daw N., Dayan P., O'Doherty J.P. States versus rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 2010, 66(4):585-595.
    • (2010) Neuron , vol.66 , Issue.4 , pp. 585-595
    • Gläscher, J.1    Daw, N.2    Dayan, P.3    O'Doherty, J.P.4
  • 37
    • 77649151242
    • Hippocampal replay is not a simple function of experience
    • Gupta A.S., van der Meer M.A.A., Touretzky D.S., Redish A.D. Hippocampal replay is not a simple function of experience. Neuron 2010, 65(5):695-705.
    • (2010) Neuron , vol.65 , Issue.5 , pp. 695-705
    • Gupta, A.S.1    van der Meer, M.A.A.2    Touretzky, D.S.3    Redish, A.D.4
  • 39
    • 0022979089
    • An approach through the looking-glass
    • Hershberger W.A. An approach through the looking-glass. Animal Learning & Behavior 1986, 14:443-451.
    • (1986) Animal Learning & Behavior , vol.14 , pp. 443-451
    • Hershberger, W.A.1
  • 40
    • 4043119771
    • Decisions from experience and the effect of rare events in risky choice
    • Hertwig R., Barron G., Weber E.U., Erev I. Decisions from experience and the effect of rare events in risky choice. Psychological Science 2004, 15(8):534-539.
    • (2004) Psychological Science , vol.15 , Issue.8 , pp. 534-539
    • Hertwig, R.1    Barron, G.2    Weber, E.U.3    Erev, I.4
  • 41
    • 70449671239
    • The description-experience gap in risky choice
    • Hertwig R., Erev I. The description-experience gap in risky choice. Trends in Cognitive Sciences 2009, 13:517-523.
    • (2009) Trends in Cognitive Sciences , vol.13 , pp. 517-523
    • Hertwig, R.1    Erev, I.2
  • 42
    • 1842853951
    • Relations between Pavlovian-instrumental transfer and reinforcer devaluation
    • Holland P.C. Relations between Pavlovian-instrumental transfer and reinforcer devaluation. Journal of Experimental Psychology. Animal Behavior Processes 2004, 30(2):104-117.
    • (2004) Journal of Experimental Psychology. Animal Behavior Processes , vol.30 , Issue.2 , pp. 104-117
    • Holland, P.C.1
  • 43
    • 70350570499
    • A Bayesian formulation of behavioral control
    • Huys Q.J.M., Dayan P. A Bayesian formulation of behavioral control. Cognition 2009, 113:314-328.
    • (2009) Cognition , vol.113 , pp. 314-328
    • Huys, Q.J.M.1    Dayan, P.2
  • 44
    • 0242267471
    • A perspective on judgment and choice: Mapping bounded rationality
    • Kahneman D. A perspective on judgment and choice: Mapping bounded rationality. American Psychologist 2003, 58(9):697-720.
    • (2003) American Psychologist , vol.58 , Issue.9 , pp. 697-720
    • Kahneman, D.1
  • 45
    • 33846565849
    • Frames and brains: Elicitation and control of response tendencies
    • Kahneman D., Frederick S. Frames and brains: Elicitation and control of response tendencies. Trends in Cognitive Sciences 2007, 11:45-46.
    • (2007) Trends in Cognitive Sciences , vol.11 , pp. 45-46
    • Kahneman, D.1    Frederick, S.2
  • 46
    • 0037382264
    • Coordination of actions and habits in the medial prefrontal cortex of rats
    • Killcross S., Coutureau E. Coordination of actions and habits in the medial prefrontal cortex of rats. Cerebral Cortex 2003, 13(4):400-408.
    • (2003) Cerebral Cortex , vol.13 , Issue.4 , pp. 400-408
    • Killcross, S.1    Coutureau, E.2
  • 49
    • 0029981543
    • A framework for mesencephalic dopamine systems based on predictive hebbian learning
    • Montague P.R., Dayan P., Sejnowski T.J. A framework for mesencephalic dopamine systems based on predictive hebbian learning. Journal of Neuroscience 1996, 16(5):1936-1947.
    • (1996) Journal of Neuroscience , vol.16 , Issue.5 , pp. 1936-1947
    • Montague, P.R.1    Dayan, P.2    Sejnowski, T.J.3
  • 52
    • 67349283062
    • Reinforcement learning in the brain
    • Niv Y. Reinforcement learning in the brain. Journal of Mathematical Psychology 2009, 53(3):139-154.
    • (2009) Journal of Mathematical Psychology , vol.53 , Issue.3 , pp. 139-154
    • Niv, Y.1
  • 53
    • 33847675011
    • Tonic dopamine: Opportunity costs and the control of response vigor
    • Niv Y., Daw N.D., Joel D., Dayan P. Tonic dopamine: Opportunity costs and the control of response vigor. Psychopharmacology (Berl) 2007, 191(3):507-520.
    • (2007) Psychopharmacology (Berl) , vol.191 , Issue.3 , pp. 507-520
    • Niv, Y.1    Daw, N.D.2    Joel, D.3    Dayan, P.4
  • 54
    • 9644310472
    • Reward representations and reward-related learning in the human brain: Insights from neuroimaging
    • O'Doherty J.P. Reward representations and reward-related learning in the human brain: Insights from neuroimaging. Current Opinion in Neurobiology 2004, 14(6):769-776.
    • (2004) Current Opinion in Neurobiology , vol.14 , Issue.6 , pp. 769-776
    • O'Doherty, J.P.1
  • 55
    • 37549066620
    • Lights, camembert, action! The role of human orbitofrontal cortex in encoding stimuli, rewards, and choices
    • O'Doherty J.P. Lights, camembert, action! The role of human orbitofrontal cortex in encoding stimuli, rewards, and choices. Annals of the New York Academy of Sciences, USA 2007, 1121:254-272.
    • (2007) Annals of the New York Academy of Sciences, USA , vol.1121 , pp. 254-272
    • O'Doherty, J.P.1
  • 58
    • 45349095604
    • Opioid reward "liking" and "wanting" in the nucleus accumbens
    • Peciña S. Opioid reward "liking" and "wanting" in the nucleus accumbens. Physiology & Behavior 2008, 94(5):675-680.
    • (2008) Physiology & Behavior , vol.94 , Issue.5 , pp. 675-680
    • Peciña, S.1
  • 59
    • 30744457109
    • Hedonic hot spot in nucleus accumbens shell: Where do mu-opioids cause increased hedonic impact of sweetness?
    • Peciña S., Berridge K.C. Hedonic hot spot in nucleus accumbens shell: Where do mu-opioids cause increased hedonic impact of sweetness?. Journal of Neuroscience 2005, 25(50):11777-11786.
    • (2005) Journal of Neuroscience , vol.25 , Issue.50 , pp. 11777-11786
    • Peciña, S.1    Berridge, K.C.2
  • 61
    • 0002109138
    • A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and non-reinforcement
    • Appleton-Century-Crofts, New York, NY, A.H. Black, W.F. Prokasy (Eds.)
    • Rescorla R.A., Wagner A.R. A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and non-reinforcement. Classical conditioning II: Current theory and research 1972, 64-99. Appleton-Century-Crofts, New York, NY. A.H. Black, W.F. Prokasy (Eds.).
    • (1972) Classical conditioning II: Current theory and research , pp. 64-99
    • Rescorla, R.A.1    Wagner, A.R.2
  • 62
    • 0035341482
    • Fear and feeding in the nucleus accumbens shell: Rostrocaudal segregation of GABA-elicited defensive behavior versus eating behavior
    • Reynolds S.M., Berridge K.C. Fear and feeding in the nucleus accumbens shell: Rostrocaudal segregation of GABA-elicited defensive behavior versus eating behavior. Journal of Neuroscience 2001, 21(9):3261-3270.
    • (2001) Journal of Neuroscience , vol.21 , Issue.9 , pp. 3261-3270
    • Reynolds, S.M.1    Berridge, K.C.2
  • 63
    • 0037104732
    • Positive and negative motivation in nucleus accumbens shell: Bivalent rostrocaudal gradients for GABA-elicited eating, taste "liking"/"disliking" reactions, place preference/avoidance, and fear
    • Reynolds S.M., Berridge K.C. Positive and negative motivation in nucleus accumbens shell: Bivalent rostrocaudal gradients for GABA-elicited eating, taste "liking"/"disliking" reactions, place preference/avoidance, and fear. Journal of Neuroscience 2002, 22(16):7308-7320.
    • (2002) Journal of Neuroscience , vol.22 , Issue.16 , pp. 7308-7320
    • Reynolds, S.M.1    Berridge, K.C.2
  • 64
    • 36448968271
    • Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards
    • Roesch M.R., Calu D.J., Schoenbaum G. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nature Neuroscience 2007, 10(12):1615-1624.
    • (2007) Nature Neuroscience , vol.10 , Issue.12 , pp. 1615-1624
    • Roesch, M.R.1    Calu, D.J.2    Schoenbaum, G.3
  • 66
    • 28144449057
    • Representation of action-specific reward values in the striatum
    • Samejima K., Ueda Y., Doya K., Kimura M. Representation of action-specific reward values in the striatum. Science 2005, 310(5752):1337-1340.
    • (2005) Science , vol.310 , Issue.5752 , pp. 1337-1340
    • Samejima, K.1    Ueda, Y.2    Doya, K.3    Kimura, M.4
  • 67
    • 0001201756
    • Some studies in machine learning using the game of checkers
    • Samuel A. Some studies in machine learning using the game of checkers. IBM Journal of Research and Development 1959, 3:210-229.
    • (1959) IBM Journal of Research and Development , vol.3 , pp. 210-229
    • Samuel, A.1
  • 68
    • 0037057755
    • Getting formal with dopamine and reward
    • Schultz W. Getting formal with dopamine and reward. Neuron 2002, 36(2):241-263.
    • (2002) Neuron , vol.36 , Issue.2 , pp. 241-263
    • Schultz, W.1
  • 69
    • 0002193484
    • Relation between classical conditioning and instrumental learning
    • Appleton-Century-Crofts, New York, NY, W. Prokasy (Ed.)
    • Sheffield F. Relation between classical conditioning and instrumental learning. Classical conditioning 1965, 302-322. Appleton-Century-Crofts, New York, NY. W. Prokasy (Ed.).
    • (1965) Classical conditioning , pp. 302-322
    • Sheffield, F.1
  • 71
    • 77955909363
    • Where do rewards come from?
    • Proceedings of the thirty-first Annual Conference of the Cognitive Science Society, Amsterdam, The Netherlands.
    • Singh, S., Lewis, R., & Barto, A. (2009). Where do rewards come from? In Proceedings of the thirty-first Annual Conference of the Cognitive Science Society (pp. 2601-2606). Amsterdam, The Netherlands.
    • (2009) , pp. 2601-2606
    • Singh, S.1    Lewis, R.2    Barto, A.3
  • 72
    • 53149107120
    • Striatal and extrastriatal dopamine in the basal ganglia: An overview of its anatomical organization in normal and parkinsonian brains
    • Smith Y., Villalba R. Striatal and extrastriatal dopamine in the basal ganglia: An overview of its anatomical organization in normal and parkinsonian brains. Movement Disorders 2008, 23(Suppl 3):S534-S547.
    • (2008) Movement Disorders , vol.23 , Issue.SUPPL.3 , pp. S534-S547
    • Smith, Y.1    Villalba, R.2
  • 74
    • 0032930935
    • A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task
    • Suri R.E., Schultz W. A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task. Neuroscience 1999, 91(3):871-890.
    • (1999) Neuroscience , vol.91 , Issue.3 , pp. 871-890
    • Suri, R.E.1    Schultz, W.2
  • 75
    • 33847202724
    • Learning to predict by the methods of temporal differences
    • Sutton R. Learning to predict by the methods of temporal differences. Machine Learning 1988, 3(1):9-44.
    • (1988) Machine Learning , vol.3 , Issue.1 , pp. 9-44
    • Sutton, R.1
  • 76
    • 85132026293
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • Proceedings of the seventh international conference on machine learning
    • Sutton, R. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. Proceedings of the seventh international conference on machine learning, 216-224.
    • (1990) , pp. 216-224
    • Sutton, R.1
  • 79
    • 0001461525
    • There is more than one kind of learning
    • Tolman E. There is more than one kind of learning. Psychological Review 1949, 56:144-155.
    • (1949) Psychological Review , vol.56 , pp. 144-155
    • Tolman, E.1
  • 80
    • 66449119919
    • A specific role for posterior dorsolateral striatum in human habit learning
    • Tricomi E., Balleine B.W., O'Doherty J.P. A specific role for posterior dorsolateral striatum in human habit learning. The European Journal of Neuroscience 2009, 29(11):2225-2232.
    • (2009) The European Journal of Neuroscience , vol.29 , Issue.11 , pp. 2225-2232
    • Tricomi, E.1    Balleine, B.W.2    O'Doherty, J.P.3
  • 82
    • 85047685362
    • The time course of perceptual choice: The leaky, competing accumulator model
    • Usher M., McClelland J.L. The time course of perceptual choice: The leaky, competing accumulator model. Psychological Review 2001, 108(3):550-592.
    • (2001) Psychological Review , vol.108 , Issue.3 , pp. 550-592
    • Usher, M.1    McClelland, J.L.2
  • 83
    • 34247147767
    • Determining the neural substrates of goal-directed learning in the human brain
    • Valentin V.V., Dickinson A., O'Doherty J.P. Determining the neural substrates of goal-directed learning in the human brain. Journal of Neuroscience 2007, 27(15):4019-4026.
    • (2007) Journal of Neuroscience , vol.27 , Issue.15 , pp. 4019-4026
    • Valentin, V.V.1    Dickinson, A.2    O'Doherty, J.P.3
  • 85
    • 84882553319
    • Learning from delayed rewards. PhD thesis, Cambridge, UK: University of Cambridge.
    • Watkins, C. (1989). Learning from delayed rewards. PhD thesis, Cambridge, UK: University of Cambridge.
    • (1989)
    • Watkins, C.1
  • 86
    • 1942443226
    • Predicting risk sensitivity in humans and lower animals: Risk as variance or coefficient of variation
    • Weber E.U., Shafir S., Blais A.-R. Predicting risk sensitivity in humans and lower animals: Risk as variance or coefficient of variation. Psychological Review 2004, 111(2):430-445.
    • (2004) Psychological Review , vol.111 , Issue.2 , pp. 430-445
    • Weber, E.U.1    Shafir, S.2    Blais, A.-R.3
  • 87
    • 0002278965
    • Adaptive switching circuits. In Western Electric Show and Convention Record
    • New York, NY.
    • Widrow, B., & Hoff, M. (1960). Adaptive switching circuits. In Western Electric Show and Convention Record (Vol. 4, pp. 96-104). New York, NY.
    • (1960) , vol.4 , pp. 96-104
    • Widrow, B.1    Hoff, M.2
  • 88
    • 84989993724
    • Auto-maintenance in the pigeon: Sustained pecking despite contingent non-reinforcement
    • Williams D.R., Williams H. Auto-maintenance in the pigeon: Sustained pecking despite contingent non-reinforcement. Journal of the Experimental Analysis of Behavior 1969, 12(4):511-520.
    • (1969) Journal of the Experimental Analysis of Behavior , vol.12 , Issue.4 , pp. 511-520
    • Williams, D.R.1    Williams, H.2
  • 89
    • 45249097567
    • Striatal activity underlies novelty-based choice in humans
    • Wittmann B.C., Daw N.D., Seymour B., Dolan R.J. Striatal activity underlies novelty-based choice in humans. Neuron 2008, 58(6):967-973.
    • (2008) Neuron , vol.58 , Issue.6 , pp. 967-973
    • Wittmann, B.C.1    Daw, N.D.2    Seymour, B.3    Dolan, R.J.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.