메뉴 건너뛰기




Volumn 22, Issue 6, 2012, Pages 1075-1081

The ubiquity of model-based reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

ANIMAL; BIOLOGICAL MODEL; BRAIN; DECISION MAKING; HUMAN; PHYSIOLOGY; REINFORCEMENT; REVIEW; REWARD;

EID: 84872761547     PISSN: 09594388     EISSN: 18736882     Source Type: Journal    
DOI: 10.1016/j.conb.2012.08.003     Document Type: Review
Times cited : (290)

References (60)
  • 1
    • 0000541213 scopus 로고
    • Adaptive critics and the basal ganglia
    • MIT Press, Cambridge, MA, (Chapter xii), J.C. Houk, J.L. Davis, D.G. Beiser (Eds.)
    • Barto A.G. Adaptive critics and the basal ganglia. Models of Information Processing in the Basal Ganglia 1995, 215-232. MIT Press, Cambridge, MA, (Chapter xii). J.C. Houk, J.L. Davis, D.G. Beiser (Eds.).
    • (1995) Models of Information Processing in the Basal Ganglia , pp. 215-232
    • Barto, A.G.1
  • 2
    • 0029981543 scopus 로고    scopus 로고
    • A framework for mesencephalic dopamine systems based on predictive Hebbian learning
    • Montague P.R., Dayan P., Sejnowski T.J. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J Neurosci 1996, 6:1936-1947.
    • (1996) J Neurosci , vol.6 , pp. 1936-1947
    • Montague, P.R.1    Dayan, P.2    Sejnowski, T.J.3
  • 3
    • 0002621983 scopus 로고
    • Animal intelligence: an experimental study of the associative processes in animals
    • Thorndike E.L. Animal intelligence: an experimental study of the associative processes in animals. Psychol Rev Monogr Suppl 1898, 2:1-8.
    • (1898) Psychol Rev Monogr Suppl , vol.2 , pp. 1-8
    • Thorndike, E.L.1
  • 4
    • 58149442669 scopus 로고
    • Cognitive maps in rats and men
    • Tolman E.C. Cognitive maps in rats and men. Psychol Rev 1948, 55:189-208.
    • (1948) Psychol Rev , vol.55 , pp. 189-208
    • Tolman, E.C.1
  • 5
    • 28044450875 scopus 로고    scopus 로고
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw N.D., Niv Y., Dayan P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat Neurosci 2005, 8:1704-1711. 10.1038/nn1560.
    • (2005) Nat Neurosci , vol.8 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 6
    • 0033213819 scopus 로고    scopus 로고
    • What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?
    • Doya K. What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?. Neural Netw 1999, 12:961-974. 10.1016/S0893-6080(99)00046-5.
    • (1999) Neural Netw , vol.12 , pp. 961-974
    • Doya, K.1
  • 7
    • 84859737036 scopus 로고    scopus 로고
    • Goal-directed decision making as probabilistic inference: a computational framework and potential neural correlates
    • Solway A., Botvinick M.M. Goal-directed decision making as probabilistic inference: a computational framework and potential neural correlates. Psychol Rev 2012, 119:120-154. 10.1037/a0026435.
    • (2012) Psychol Rev , vol.119 , pp. 120-154
    • Solway, A.1    Botvinick, M.M.2
  • 8
    • 72049125602 scopus 로고    scopus 로고
    • Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action
    • Balleine B.W., O'Doherty J.P. Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology 2009, 35:48-69. 10.1038/npp.2009.131.
    • (2009) Neuropsychopharmacology , vol.35 , pp. 48-69
    • Balleine, B.W.1    O'Doherty, J.P.2
  • 9
    • 79958143780 scopus 로고    scopus 로고
    • Speed/accuracy trade-off between the habitual and the goal-directed processes
    • Keramati M., Dezfouli A., Piray P. Speed/accuracy trade-off between the habitual and the goal-directed processes. PLoS Comput Biol 2011, 7:e1002055. 10.1371/journal.pcbi.1002055.
    • (2011) PLoS Comput Biol , vol.7 , pp. e1002055
    • Keramati, M.1    Dezfouli, A.2    Piray, P.3
  • 10
    • 84946268134 scopus 로고
    • Variations in the sensitivity of instrumental responding to reinforcer devaluation
    • Adams C.D. Variations in the sensitivity of instrumental responding to reinforcer devaluation. Q J Exp Psychol B 1982, 34:77-98. 10.1080/14640748208400878.
    • (1982) Q J Exp Psychol B , vol.34 , pp. 77-98
    • Adams, C.D.1
  • 11
    • 80054024942 scopus 로고    scopus 로고
    • Instrumental uncertainty as a determinant of behavior under interval schedules of reinforcement
    • Derusso A.L., Fan D., Gupta J., Shelest O., Costa R.M., Yin H.H. Instrumental uncertainty as a determinant of behavior under interval schedules of reinforcement. Front Integr Neurosci 2010, 4. 10.3389/fnint.2010.00017.
    • (2010) Front Integr Neurosci , vol.4
    • Derusso, A.L.1    Fan, D.2    Gupta, J.3    Shelest, O.4    Costa, R.M.5    Yin, H.H.6
  • 12
    • 85162381627 scopus 로고    scopus 로고
    • Environmental statistics and the trade-off between model-based and TD learning in humans
    • J. Shawe-Taylor, R. Zemel, P. Bartlett, F. Pereira, K. Weinberger (Eds.)
    • Simon D.A., Daw N.D. Environmental statistics and the trade-off between model-based and TD learning in humans. Advances in Neural Information Processing Systems, vol 24 2011, 127-135. J. Shawe-Taylor, R. Zemel, P. Bartlett, F. Pereira, K. Weinberger (Eds.).
    • (2011) Advances in Neural Information Processing Systems, vol 24 , pp. 127-135
    • Simon, D.A.1    Daw, N.D.2
  • 13
    • 80052143971 scopus 로고    scopus 로고
    • Separate encoding of model-based and model-free valuations in the human brain
    • Beierholm U.R., Anen C., Quartz S., Bossaerts P. Separate encoding of model-based and model-free valuations in the human brain. Neuroimage 2011, 58:955-962. 10.1016/j.neuroimage.2011.06.071.
    • (2011) Neuroimage , vol.58 , pp. 955-962
    • Beierholm, U.R.1    Anen, C.2    Quartz, S.3    Bossaerts, P.4
  • 15
    • 77953773184 scopus 로고    scopus 로고
    • Socially evaluated cold pressor stress after instrumental learning favors habits over goal-directed action
    • Schwabe L., Wolf O.T. Socially evaluated cold pressor stress after instrumental learning favors habits over goal-directed action. Psychoneuroendocrinology 2010, 35:977-986. 10.1016/j.psyneuen.2009.12.010.
    • (2010) Psychoneuroendocrinology , vol.35 , pp. 977-986
    • Schwabe, L.1    Wolf, O.T.2
  • 16
    • 79952701788 scopus 로고    scopus 로고
    • Stress-induced modulation of instrumental behavior: from goal-directed to habitual control of action
    • Schwabe L., Wolf O.T. Stress-induced modulation of instrumental behavior: from goal-directed to habitual control of action. Behav Brain Res 2011, 219:321-328. 10.1016/j.bbr.2010.12.038.
    • (2011) Behav Brain Res , vol.219 , pp. 321-328
    • Schwabe, L.1    Wolf, O.T.2
  • 17
    • 66449119919 scopus 로고    scopus 로고
    • A specific role for posterior dorsolateral striatum in human habit learning
    • Tricomi E., Balleine B.W., O'Doherty J.P. A specific role for posterior dorsolateral striatum in human habit learning. Eur J Neurosci 2009, 29:2225-2232. 10.1111/j.1460-9568.2009.06796.x.
    • (2009) Eur J Neurosci , vol.29 , pp. 2225-2232
    • Tricomi, E.1    Balleine, B.W.2    O'Doherty, J.P.3
  • 18
    • 1642580578 scopus 로고    scopus 로고
    • Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning
    • Yin H.H., Knowlton B.J., Balleine B.W. Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. Eur J Neurosci 2004, 19:181-189. 10.1111/j.1460-9568.2004.03095.x.
    • (2004) Eur J Neurosci , vol.19 , pp. 181-189
    • Yin, H.H.1    Knowlton, B.J.2    Balleine, B.W.3
  • 19
  • 21
    • 23244452169 scopus 로고    scopus 로고
    • The role of the dorsomedial striatum in instrumental conditioning
    • Yin H.H., Ostlund S.B., Knowlton B.J., Balleine B.W. The role of the dorsomedial striatum in instrumental conditioning. Eur J Neurosci 2005, 22:513-523. 10.1111/j.1460-9568.2005.04218.x.
    • (2005) Eur J Neurosci , vol.22 , pp. 513-523
    • Yin, H.H.1    Ostlund, S.B.2    Knowlton, B.J.3    Balleine, B.W.4
  • 22
    • 77953260848 scopus 로고    scopus 로고
    • States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning
    • Glascher J., Daw N., Dayan P., O'Doherty J.P. States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 2010, 66:585-595. 10.1016/j.neuron.2010.04.016.
    • (2010) Neuron , vol.66 , pp. 585-595
    • Glascher, J.1    Daw, N.2    Dayan, P.3    O'Doherty, J.P.4
  • 23
    • 72049086941 scopus 로고    scopus 로고
    • The reward circuit: linking primate anatomy and human imaging
    • Haber S.N., Knutson B. The reward circuit: linking primate anatomy and human imaging. Neuropsychopharmacology 2009, 35:4-26. 10.1038/npp.2009.129.
    • (2009) Neuropsychopharmacology , vol.35 , pp. 4-26
    • Haber, S.N.1    Knutson, B.2
  • 24
    • 67349098495 scopus 로고    scopus 로고
    • Two types of dopamine neuron distinctly convey positive and negative motivational signals
    • Matsumoto M., Hikosaka O. Two types of dopamine neuron distinctly convey positive and negative motivational signals. Nature 2009, 459:837-841. 10.1038/nature08028.
    • (2009) Nature , vol.459 , pp. 837-841
    • Matsumoto, M.1    Hikosaka, O.2
  • 25
    • 63849268432 scopus 로고    scopus 로고
    • Phasic excitation of dopamine neurons in ventral VTA by noxious stimuli
    • Brischoux F., Chakraborty S., Brierley D.I., Ungless M.A. Phasic excitation of dopamine neurons in ventral VTA by noxious stimuli. Proc Natl Acad Sci U S A 2009, 106:4894-4899. 10.1073/pnas.0811507106.
    • (2009) Proc Natl Acad Sci U S A , vol.106 , pp. 4894-4899
    • Brischoux, F.1    Chakraborty, S.2    Brierley, D.I.3    Ungless, M.A.4
  • 26
    • 84862766564 scopus 로고    scopus 로고
    • Are you or aren't you? Challenges associated with physiologically identifying dopamine neurons
    • Ungless M.A., Grace A.A. Are you or aren't you? Challenges associated with physiologically identifying dopamine neurons. Trends Neurosci 2012, 35:422-430. 10.1016/j.tins.2012.02.003.
    • (2012) Trends Neurosci , vol.35 , pp. 422-430
    • Ungless, M.A.1    Grace, A.A.2
  • 27
    • 78449244354 scopus 로고    scopus 로고
    • Shift from goal-directed to habitual cocaine seeking after prolonged experience in rats
    • Zapata A., Minney V.L., Shippenberg T.S. Shift from goal-directed to habitual cocaine seeking after prolonged experience in rats. J Neurosci 2010, 30:15457-15463. 10.1523/JNEUROSCI.4072-10.2010.
    • (2010) J Neurosci , vol.30 , pp. 15457-15463
    • Zapata, A.1    Minney, V.L.2    Shippenberg, T.S.3
  • 28
    • 15244346900 scopus 로고    scopus 로고
    • Lesion to the nigrostriatal dopamine system disrupts stimulus-response habit formation
    • Faure A., Haberland U., Coné F., Massioui N.E. Lesion to the nigrostriatal dopamine system disrupts stimulus-response habit formation. J Neurosci 2005, 25:2771-2780. 10.1523/JNEUROSCI.3894-04.2005.
    • (2005) J Neurosci , vol.25 , pp. 2771-2780
    • Faure, A.1    Haberland, U.2    Coné, F.3    Massioui, N.E.4
  • 29
    • 84155183278 scopus 로고    scopus 로고
    • NMDA receptors in dopaminergic neurons are crucial for habit learning
    • Wang L.P., Li F., Wang D., Xie K., Wang D., Shen X., Tsien J.Z. NMDA receptors in dopaminergic neurons are crucial for habit learning. Neuron 2011, 72:1055-1066. 10.1016/j.neuron.2011.10.019.
    • (2011) Neuron , vol.72 , pp. 1055-1066
    • Wang, L.P.1    Li, F.2    Wang, D.3    Xie, K.4    Wang, D.5    Shen, X.6    Tsien, J.Z.7
  • 30
    • 0033913868 scopus 로고    scopus 로고
    • Dissociation of Pavlovian and instrumental incentive learning under dopamine antagonists
    • Dickinson A., Smith J., Mirenowicz J. Dissociation of Pavlovian and instrumental incentive learning under dopamine antagonists. Behav Neurosci 2000, 114:468-483. 10.1037/0735-7044.114.3.468.
    • (2000) Behav Neurosci , vol.114 , pp. 468-483
    • Dickinson, A.1    Smith, J.2    Mirenowicz, J.3
  • 31
    • 77958461569 scopus 로고    scopus 로고
    • Habitual versus goal-directed action control in Parkinson disease
    • de Wit S., Barker R.A., Dickinson A.D., Cools R. Habitual versus goal-directed action control in Parkinson disease. J Cogn Neurosci 2011, 23:1218-1229. 10.1162/jocn.2010.21514.
    • (2011) J Cogn Neurosci , vol.23 , pp. 1218-1229
    • de Wit, S.1    Barker, R.A.2    Dickinson, A.D.3    Cools, R.4
  • 33
    • 79952746011 scopus 로고    scopus 로고
    • Model-based influences on humans' choices and striatal prediction errors
    • Daw N.D., Gershman S.J., Seymour B., Dayan P., Dolan R.J. Model-based influences on humans' choices and striatal prediction errors. Neuron 2011, 69:1204-1215. 10.1016/j.neuron.2011.02.027.
    • (2011) Neuron , vol.69 , pp. 1204-1215
    • Daw, N.D.1    Gershman, S.J.2    Seymour, B.3    Dayan, P.4    Dolan, R.J.5
  • 34
    • 79955709936 scopus 로고    scopus 로고
    • Neural correlates of forward planning in a spatial decision task in humans
    • Simon D.A., Daw N.D. Neural correlates of forward planning in a spatial decision task in humans. J Neurosci 2011, 31:5526-5539. 10.1523/JNEUROSCI.4647-10.2011.
    • (2011) J Neurosci , vol.31 , pp. 5526-5539
    • Simon, D.A.1    Daw, N.D.2
  • 35
    • 77954510325 scopus 로고    scopus 로고
    • Triple dissociation of information processing in dorsal striatum, ventral striatum, and hippocampus on a learned spatial decision task
    • van der Meer M.A.A., Johnson A., Schmitzer-Torbert N.C., Redish A.D. Triple dissociation of information processing in dorsal striatum, ventral striatum, and hippocampus on a learned spatial decision task. Neuron 2010, 67:25-32. 10.1016/j.neuron.2010.06.023.
    • (2010) Neuron , vol.67 , pp. 25-32
    • van der Meer, M.A.A.1    Johnson, A.2    Schmitzer-Torbert, N.C.3    Redish, A.D.4
  • 36
    • 79960218850 scopus 로고    scopus 로고
    • Expectancies in decision making, reinforcement learning, and ventral striatum
    • van der Meer M.A.A., Redish A.D. Expectancies in decision making, reinforcement learning, and ventral striatum. Front Neurosci 2010, 4:6. 10.3389/neuro.01.006.2010.
    • (2010) Front Neurosci , vol.4 , pp. 6
    • van der Meer, M.A.A.1    Redish, A.D.2
  • 37
    • 84860307045 scopus 로고    scopus 로고
    • Mapping value based planning and extensively trained choice in the human brain
    • Wunderlich K., Dayan P., Dolan R.J. Mapping value based planning and extensively trained choice in the human brain. Nat Neurosci 2012, 15:786-791. 10.1038/nn.3068.
    • (2012) Nat Neurosci , vol.15 , pp. 786-791
    • Wunderlich, K.1    Dayan, P.2    Dolan, R.J.3
  • 38
    • 78649604962 scopus 로고    scopus 로고
    • Evidence for model-based action planning in a sequential finger movement task
    • Fermin A., Yoshida T., Ito M., Yoshimoto J., Doya K. Evidence for model-based action planning in a sequential finger movement task. J Mot Behav 2010, 42:371-379. 10.1080/00222895.2010.526467.
    • (2010) J Mot Behav , vol.42 , pp. 371-379
    • Fermin, A.1    Yoshida, T.2    Ito, M.3    Yoshimoto, J.4    Doya, K.5
  • 39
    • 44349104251 scopus 로고    scopus 로고
    • Reward prediction based on stimulus categorization in primate lateral prefrontal cortex
    • Pan X., Sawa K., Tsuda I., Tsukada M., Sakagami M. Reward prediction based on stimulus categorization in primate lateral prefrontal cortex. Nat Neurosci 2008, 11:703-712. 10.1038/nn.2128.
    • (2008) Nat Neurosci , vol.11 , pp. 703-712
    • Pan, X.1    Sawa, K.2    Tsuda, I.3    Tsukada, M.4    Sakagami, M.5
  • 40
    • 33748188120 scopus 로고    scopus 로고
    • The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans
    • Hampton A.N., Bossaerts P., O'Doherty J.P. The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans. J Neurosci 2006, 26:8360-8367. 10.1523/JNEUROSCI.1010-06.2006.
    • (2006) J Neurosci , vol.26 , pp. 8360-8367
    • Hampton, A.N.1    Bossaerts, P.2    O'Doherty, J.P.3
  • 41
    • 77955937588 scopus 로고    scopus 로고
    • A pallidus-habenula-dopamine pathway signals inferred stimulus values
    • Bromberg-Martin E.S., Matsumoto M., Hong S., Hikosaka O. A pallidus-habenula-dopamine pathway signals inferred stimulus values. J Neurophysiol 2010, 104:1068-1076. 10.1152/jn.00158.2010.
    • (2010) J Neurophysiol , vol.104 , pp. 1068-1076
    • Bromberg-Martin, E.S.1    Matsumoto, M.2    Hong, S.3    Hikosaka, O.4
  • 42
    • 79956220050 scopus 로고    scopus 로고
    • Distributed coding of actual and hypothetical outcomes in the orbital and dorsolateral prefrontal cortex
    • Abe H., Lee D. Distributed coding of actual and hypothetical outcomes in the orbital and dorsolateral prefrontal cortex. Neuron 2011, 70:731-741. 10.1016/j.neuron.2011.03.026.
    • (2011) Neuron , vol.70 , pp. 731-741
    • Abe, H.1    Lee, D.2
  • 43
    • 80053095198 scopus 로고    scopus 로고
    • Hedging your bets by learning reward correlations in the human brain
    • Wunderlich K., Symmonds M., Bossaerts P., Dolan R.J. Hedging your bets by learning reward correlations in the human brain. Neuron 2011, 71:1141-1152. 10.1016/j.neuron.2011.07.025.
    • (2011) Neuron , vol.71 , pp. 1141-1152
    • Wunderlich, K.1    Symmonds, M.2    Bossaerts, P.3    Dolan, R.J.4
  • 44
    • 84859339117 scopus 로고    scopus 로고
    • Generalization of value in reinforcement learning by humans
    • Wimmer G.E., Daw N.D., Shohamy D. Generalization of value in reinforcement learning by humans. Eur J Neurosci 2012, 35:1092-1104. 10.1111/j.1460-9568.2012.08017.x.
    • (2012) Eur J Neurosci , vol.35 , pp. 1092-1104
    • Wimmer, G.E.1    Daw, N.D.2    Shohamy, D.3
  • 45
    • 34247147767 scopus 로고    scopus 로고
    • Determining the neural substrates of goal-directed learning in the human brain
    • Valentin V.V., Dickinson A., O'Doherty J.P. Determining the neural substrates of goal-directed learning in the human brain. J Neurosci 2007, 27:4019-4026. 10.1523/JNEUROSCI.0564-07.2007.
    • (2007) J Neurosci , vol.27 , pp. 4019-4026
    • Valentin, V.V.1    Dickinson, A.2    O'Doherty, J.P.3
  • 46
    • 70349123547 scopus 로고    scopus 로고
    • Differential engagement of the ventromedial prefrontal cortex by goal-directed and habitual behavior toward food pictures in humans
    • de Wit S., Corlett P.R., Aitken M.R., Dickinson A., Fletcher P.C. Differential engagement of the ventromedial prefrontal cortex by goal-directed and habitual behavior toward food pictures in humans. J Neurosci 2009, 29:11330-11338. 10.1523/JNEUROSCI.1639-09.2009.
    • (2009) J Neurosci , vol.29 , pp. 11330-11338
    • de Wit, S.1    Corlett, P.R.2    Aitken, M.R.3    Dickinson, A.4    Fletcher, P.C.5
  • 47
    • 79951823576 scopus 로고    scopus 로고
    • Ventral striatum and orbitofrontal cortex are both required for model-based, but not model-free, reinforcement learning
    • McDannald M.A., Lucantonio F., Burke K.A., Niv Y., Schoenbaum G. Ventral striatum and orbitofrontal cortex are both required for model-based, but not model-free, reinforcement learning. J Neurosci 2011, 31:2700-2705. 10.1523/JNEUROSCI.5499-10.2011.
    • (2011) J Neurosci , vol.31 , pp. 2700-2705
    • McDannald, M.A.1    Lucantonio, F.2    Burke, K.A.3    Niv, Y.4    Schoenbaum, G.5
  • 48
    • 33646566317 scopus 로고    scopus 로고
    • Neurons in the orbitofrontal cortex encode economic value
    • Padoa-Schioppa C., Assad J.A. Neurons in the orbitofrontal cortex encode economic value. Nature 2006, 441:223-226. 10.1038/nature04676.
    • (2006) Nature , vol.441 , pp. 223-226
    • Padoa-Schioppa, C.1    Assad, J.A.2
  • 49
    • 84655163444 scopus 로고    scopus 로고
    • Cross-species studies of orbitofrontal cortex and value-based decision-making
    • Wallis J.D. Cross-species studies of orbitofrontal cortex and value-based decision-making. Nat Neurosci 2012, 15:13-19. 10.1038/nn.2956.
    • (2012) Nat Neurosci , vol.15 , pp. 13-19
    • Wallis, J.D.1
  • 51
    • 84859297479 scopus 로고    scopus 로고
    • Dissociating hippocampal and striatal contributions to sequential prediction learning
    • Bornstein A.M., Daw N.D. Dissociating hippocampal and striatal contributions to sequential prediction learning. Eur J Neurosci 2012, 35:1011-1023. 10.1111/j.1460-9568.2011.07920.x.
    • (2012) Eur J Neurosci , vol.35 , pp. 1011-1023
    • Bornstein, A.M.1    Daw, N.D.2
  • 52
    • 84857211334 scopus 로고    scopus 로고
    • Mechanisms of hierarchical reinforcement learning in corticostriatal circuits. 1: computational analysis
    • Frank M.J., Badre D. Mechanisms of hierarchical reinforcement learning in corticostriatal circuits. 1: computational analysis. Cereb Cortex 2012, 22:509-526. 10.1093/cercor/bhr114.
    • (2012) Cereb Cortex , vol.22 , pp. 509-526
    • Frank, M.J.1    Badre, D.2
  • 53
    • 0001158047 scopus 로고
    • Improving generalization for temporal difference learning: the successor representation
    • Dayan P. Improving generalization for temporal difference learning: the successor representation. Neural Comput 1993, 5:613-624. 10.1162/neco.1993.5.4.613.
    • (1993) Neural Comput , vol.5 , pp. 613-624
    • Dayan, P.1
  • 54
    • 79955758129 scopus 로고    scopus 로고
    • Dopaminergic genes predict individual differences in susceptibility to confirmation bias
    • Doll B.B., Hutchison K.E., Frank M.J. Dopaminergic genes predict individual differences in susceptibility to confirmation bias. J Neurosci 2011, 31:6188-6198. 10.1523/JNEUROSCI.6486-10.2011.
    • (2011) J Neurosci , vol.31 , pp. 6188-6198
    • Doll, B.B.1    Hutchison, K.E.2    Frank, M.J.3
  • 55
    • 79959771766 scopus 로고    scopus 로고
    • The neural basis of following advice
    • Biele G., Rieskamp J., Krugel L.K., Heekeren H.R. The neural basis of following advice. PLoS Biol 2011, 9:e1001089. 10.1371/journal.pbio.1001089.
    • (2011) PLoS Biol , vol.9 , pp. e1001089
    • Biele, G.1    Rieskamp, J.2    Krugel, L.K.3    Heekeren, H.R.4
  • 56
    • 70449715719 scopus 로고    scopus 로고
    • Instructional control of reinforcement learning: a behavioral and neurocomputational investigation
    • Doll B.B., Jacobs W.J., Sanfey A.G., Frank M.J. Instructional control of reinforcement learning: a behavioral and neurocomputational investigation. Brain Res 2009, 1299:74-94. 10.1016/j.brainres.2009.07.007.
    • (2009) Brain Res , vol.1299 , pp. 74-94
    • Doll, B.B.1    Jacobs, W.J.2    Sanfey, A.G.3    Frank, M.J.4
  • 58
    • 85132026293 scopus 로고
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • GTE Laboratories Incorporated, Morgan Kaufmann
    • Sutton R.S. Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. Proceedings of the Seventh International Conference on Machine Learning 1990, 216-224. GTE Laboratories Incorporated, Morgan Kaufmann.
    • (1990) Proceedings of the Seventh International Conference on Machine Learning , pp. 216-224
    • Sutton, R.S.1
  • 60
    • 84859371025 scopus 로고    scopus 로고
    • Bonsai trees in your head: how the Pavlovian system sculpts goal-directed choices by pruning decision trees
    • Huys Q.J.M., Eshel N., O'Nions E., Sheridan L., Dayan P., Roiser J.P. Bonsai trees in your head: how the Pavlovian system sculpts goal-directed choices by pruning decision trees. PLoS Comput Biol 2012, 8:e1002410. 10.1371/journal.pcbi.1002410.
    • (2012) PLoS Comput Biol , vol.8 , pp. e1002410
    • Huys, Q.J.M.1    Eshel, N.2    O'Nions, E.3    Sheridan, L.4    Dayan, P.5    Roiser, J.P.6


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.