메뉴 건너뛰기




Volumn 113, Issue 3, 2009, Pages 259-261

Reinforcement learning and higher level cognition: Introduction to special issue

Author keywords

Reinforcement learning

Indexed keywords

DOPAMINE;

EID: 70350566659     PISSN: 00100277     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.cognition.2009.09.005     Document Type: Editorial
Times cited : (18)

References (23)
  • 1
    • 42749096312 scopus 로고    scopus 로고
    • Cognitive control, hierarchy, and the rostro-caudal organization of the frontal lobes
    • Badre D. Cognitive control, hierarchy, and the rostro-caudal organization of the frontal lobes. Trends in cognitive sciences 12 5 (2008) 193-200
    • (2008) Trends in cognitive sciences , vol.12 , Issue.5 , pp. 193-200
    • Badre, D.1
  • 2
    • 70350569456 scopus 로고    scopus 로고
    • Action understanding as inverse planning
    • Baker C.L., Tenenbaum J.B., and Saxe R.B. Action understanding as inverse planning. Cognition 113 3 (2009) 329-349
    • (2009) Cognition , vol.113 , Issue.3 , pp. 329-349
    • Baker, C.L.1    Tenenbaum, J.B.2    Saxe, R.B.3
  • 3
    • 0001398415 scopus 로고
    • Instrumental performance following reinforcer devaluation depends upon incentive learning
    • Balleine B., and Dickinson A. Instrumental performance following reinforcer devaluation depends upon incentive learning. The Quarterly Journal of Experimental Psychology Section B 43 3 (1991) 279-296
    • (1991) The Quarterly Journal of Experimental Psychology Section B , vol.43 , Issue.3 , pp. 279-296
    • Balleine, B.1    Dickinson, A.2
  • 4
    • 70350566799 scopus 로고    scopus 로고
    • Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective
    • Botvinick M., Niv Y., and Barto A.C. Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective. Cognition 113 3 (2009) 262-280
    • (2009) Cognition , vol.113 , Issue.3 , pp. 262-280
    • Botvinick, M.1    Niv, Y.2    Barto, A.C.3
  • 5
    • 70350565076 scopus 로고    scopus 로고
    • Rational and mechanistic perspectives on reinforcement learning
    • Chater N. Rational and mechanistic perspectives on reinforcement learning. Cognition 113 3 (2009) 350-364
    • (2009) Cognition , vol.113 , Issue.3 , pp. 350-364
    • Chater, N.1
  • 6
    • 28044450875 scopus 로고    scopus 로고
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw N.D., Niv Y., and Dayan P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nature Neuroscience 8 12 (2005) 1704-1711
    • (2005) Nature Neuroscience , vol.8 , Issue.12 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 7
    • 33745223257 scopus 로고    scopus 로고
    • Cortical substrates for exploratory decisions in humans
    • Daw N.D., O'Doherty J.P., Dayan P., Seymour B., and Dolan R.J. Cortical substrates for exploratory decisions in humans. Nature 441 7095 (2006) 876-879
    • (2006) Nature , vol.441 , Issue.7095 , pp. 876-879
    • Daw, N.D.1    O'Doherty, J.P.2    Dayan, P.3    Seymour, B.4    Dolan, R.J.5
  • 9
    • 10344250993 scopus 로고    scopus 로고
    • By carrot or by stick: Cognitive reinforcement learning in Parkinsonism
    • Frank M.J., Seeberger L.C., and O'Reilly R.C. By carrot or by stick: Cognitive reinforcement learning in Parkinsonism. Science 306 (2004) 1940-1943
    • (2004) Science , vol.306 , pp. 1940-1943
    • Frank, M.J.1    Seeberger, L.C.2    O'Reilly, R.C.3
  • 10
    • 70350572378 scopus 로고    scopus 로고
    • Short-term gains, long-term pains: How cues about state aid learning in dynamic environments
    • Gureckis T.M., and Love B.C. Short-term gains, long-term pains: How cues about state aid learning in dynamic environments. Cognition 113 3 (2009) 293-313
    • (2009) Cognition , vol.113 , Issue.3 , pp. 293-313
    • Gureckis, T.M.1    Love, B.C.2
  • 11
    • 70350570499 scopus 로고    scopus 로고
    • A Bayesian formulation of behavioral control
    • Huys Q.J., and Dayan P. A Bayesian formulation of behavioral control. Cognition 113 3 (2009) 314-328
    • (2009) Cognition , vol.113 , Issue.3 , pp. 314-328
    • Huys, Q.J.1    Dayan, P.2
  • 13
    • 0029981543 scopus 로고    scopus 로고
    • A framework for mesencephalic dopamine systems based on predictive hebbian learning
    • Montague P.R., Dayan P., and Sejnowski T.J. A framework for mesencephalic dopamine systems based on predictive hebbian learning. The Journal of Neuroscience 16 (1997) 1936-1947
    • (1997) The Journal of Neuroscience , vol.16 , pp. 1936-1947
    • Montague, P.R.1    Dayan, P.2    Sejnowski, T.J.3
  • 14
    • 58149379028 scopus 로고    scopus 로고
    • A role for dopamine in temporal decision making and reward maximization in Parkinsonism
    • Moustafa A.A., Cohen M.X., Sherman S.J., and Frank M.J. A role for dopamine in temporal decision making and reward maximization in Parkinsonism. Journal of Neuroscience 28 47 (2008) 12294-12304
    • (2008) Journal of Neuroscience , vol.28 , Issue.47 , pp. 12294-12304
    • Moustafa, A.A.1    Cohen, M.X.2    Sherman, S.J.3    Frank, M.J.4
  • 15
    • 33847675011 scopus 로고    scopus 로고
    • Tonic dopamine: Opportunity costs and the control of response vigor
    • Niv Y., Daw N.D., Joel D., and Dayan P. Tonic dopamine: Opportunity costs and the control of response vigor. Psychopharmacology 191 3 (2007) 507-520
    • (2007) Psychopharmacology , vol.191 , Issue.3 , pp. 507-520
    • Niv, Y.1    Daw, N.D.2    Joel, D.3    Dayan, P.4
  • 16
    • 0037987978 scopus 로고    scopus 로고
    • Temporal difference models and reward-related learning in the human brain
    • O'Doherty J.P., Dayan P., Friston K., Critchley H., and Dolan R.J. Temporal difference models and reward-related learning in the human brain. Neuron 38 (2003) 329-337
    • (2003) Neuron , vol.38 , pp. 329-337
    • O'Doherty, J.P.1    Dayan, P.2    Friston, K.3    Critchley, H.4    Dolan, R.J.5
  • 17
    • 33644927837 scopus 로고    scopus 로고
    • Making working memory work: A computational model of learning in the prefrontal cortex and basal ganglia
    • O'Reilly R.C., and Frank M.J. Making working memory work: A computational model of learning in the prefrontal cortex and basal ganglia. Neural Computation 18 (2006) 283-328
    • (2006) Neural Computation , vol.18 , pp. 283-328
    • O'Reilly, R.C.1    Frank, M.J.2
  • 18
    • 33748302924 scopus 로고    scopus 로고
    • Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans
    • Pessiglione M., Seymour B., Flandin G., Dolan R.J., and Frith C.D. Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature 442 7106 (2006) 1042-1045
    • (2006) Nature , vol.442 , Issue.7106 , pp. 1042-1045
    • Pessiglione, M.1    Seymour, B.2    Flandin, G.3    Dolan, R.J.4    Frith, C.D.5
  • 19
    • 10344225664 scopus 로고    scopus 로고
    • Neuroscience. Addiction as a computational process gone awry
    • Redish A.D. Neuroscience. Addiction as a computational process gone awry. Science 306 5703 (2004) 1944-1946
    • (2004) Science , vol.306 , Issue.5703 , pp. 1944-1946
    • Redish, A.D.1
  • 20
    • 70350574601 scopus 로고    scopus 로고
    • Developing pfc representations using reinforcement learning
    • Reynolds J.R., and O'Reilly R.C. Developing pfc representations using reinforcement learning. Cognition 113 3 (2009) 281-292
    • (2009) Cognition , vol.113 , Issue.3 , pp. 281-292
    • Reynolds, J.R.1    O'Reilly, R.C.2
  • 22
    • 0033170372 scopus 로고    scopus 로고
    • Between mdps and semi-mdps: A framework for temporal abstraction in reinforcement learning
    • Sutton R., Precup D., and Singh S. Between mdps and semi-mdps: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112 1-2 (1999) 181-211
    • (1999) Artificial Intelligence , vol.112 , Issue.1-2 , pp. 181-211
    • Sutton, R.1    Precup, D.2    Singh, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.