Volume 107, Issue 6, 2013, Pages 711-719

Adaptive properties of differential learning rates for positive and negative outcomes

Author keywords

Basal ganglia; Decision making; Meta learning; Reinforcement learning; Reward prediction error

Indexed keywords

ASYMMETRIC LEARNING; BASAL GANGLIA; COGNITIVE SCIENCE; METALEARNING; PREDICTION ERRORS; REWARD-PREDICTION ERROR; SUB-OPTIMAL CHOICES; SUPPORT LEARNING

EID: 84890950743     PISSN: 03401200     EISSN: 14320770     Source Type: Journal    
DOI: 10.1007/s00422-013-0571-5     Document Type: Article
Times cited: 71

References (41)
  • 1
    • 34548295327
    • Learning the value of information in an uncertain world
    • 17676057 10.1038/nn1954 1:CAS:528:DC%2BD2sXps1Sgurg%3D
    • Behrens TEJ, Woolrich MW, Walton ME, Rushworth MFS (2007) Learning the value of information in an uncertain world. Nat Neurosci 10(9):1214-1221
    • (2007) Nat Neurosci , vol.10 , Issue.9 , pp. 1214-1221
    • Behrens, T.E.J.1    Woolrich, M.W.2    Walton, M.E.3    Rushworth, M.F.S.4
  • 2
    • 78649966665
    • Dopamine in motivational control: Rewarding, aversive, and alerting
    • 21144997 10.1016/j.neuron.2010.11.022 1:CAS:528:DC%2BC3cXhsFGhu73O
    • Bromberg-Martin ES, Matsumoto M, Hikosaka O (2010) Dopamine in motivational control: rewarding, aversive, and alerting. Neuron 68(5):815-834
    • (2010) Neuron , vol.68 , Issue.5 , pp. 815-834
    • Bromberg-Martin, E.S.1    Matsumoto, M.2    Hikosaka, O.3
  • 3
    • 79955770782
    • Social stress reactivity alters reward and punishment learning
    • 20453038 10.1093/scan/nsq041
    • Cavanagh JF, Frank MJ (2011) Social stress reactivity alters reward and punishment learning. Soc Cogn Affect Neurosci 6(3):311-320
    • (2011) Soc Cogn Affect Neurosci , vol.6 , Issue.3 , pp. 311-320
    • Cavanagh, J.F.1    Frank, M.J.2
  • 4
    • 77951786435
    • Gambling severity predicts midbrain response to near-miss outcomes
    • 20445043 10.1523/JNEUROSCI.5758-09.2010 1:CAS:528:DC%2BC3cXovVWmsL4%3D
    • Chase HW, Clark L (2010) Gambling severity predicts midbrain response to near-miss outcomes. J Neurosci 30(18):6180-6187
    • (2010) J Neurosci , vol.30 , Issue.18 , pp. 6180-6187
    • Chase, H.W.1    Clark, L.2
  • 5
    • 60749098586
    • Neurobiological studies of risk assessment: A comparison of expected utility and mean-variance approaches
    • 19033235 10.3758/CABN.8.4.363
    • D'Acremont M, Bossaerts P (2008) Neurobiological studies of risk assessment: a comparison of expected utility and mean-variance approaches. Cogn Affect Behav Neurosci 8(4):363-374
    • (2008) Cogn Affect Behav Neurosci , vol.8 , Issue.4 , pp. 363-374
    • D'Acremont, M.1    Bossaerts, P.2
  • 6
    • 34548837994
    • Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling
    • 17638506 10.1037/0033-295X.114.3.784
    • Redish AD, Jensen S, Johnson A, Kurth-Nelson Z (2007) Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling. Psychol Rev 114(3):784-805
    • (2007) Psychol Rev , vol.114 , Issue.3 , pp. 784-805
    • Redish, A.D.1    Jensen, S.2    Johnson, A.3    Kurth-Nelson, Z.4
  • 7
    • 33745223257
    • Cortical substrates for exploratory decisions in humans
    • 16778890 10.1038/nature04766 1:CAS:528:DC%2BD28XlvVGltrw%3D
    • Daw ND, O'Doherty JP, Dayan P, Seymour B, Dolan RJ (2006) Cortical substrates for exploratory decisions in humans. Nature 441(7095):876-879
    • (2006) Nature , vol.441 , Issue.7095 , pp. 876-879
    • Daw, N.D.1    O'Doherty, J.P.2    Dayan, P.3    Seymour, B.4    Dolan, R.J.5
  • 8
    • 79952746011
    • Model-based influences on humans' choices and striatal prediction errors
    • 21435563 10.1016/j.neuron.2011.02.027 1:CAS:528:DC%2BC3MXjvFejsLY%3D
    • Daw ND, Gershman SJ, Seymour B, Dayan P, Dolan RJ (2011) Model-based influences on humans' choices and striatal prediction errors. Neuron 69(6):1204-1215
    • (2011) Neuron , vol.69 , Issue.6 , pp. 1204-1215
    • Daw, N.D.1    Gershman, S.J.2    Seymour, B.3    Dayan, P.4    Dolan, R.J.5
  • 9
    • 52049107354
    • Reinforcement learning: The good, the bad and the ugly
    • 18708140 10.1016/j.conb.2008.08.003 1:CAS:528:DC%2BD1cXhtFCku7rF
    • Dayan P, Niv Y (2008) Reinforcement learning: the good, the bad and the ugly. Curr Opin Neurobiol 18(2):185-196
    • (2008) Curr Opin Neurobiol , vol.18 , Issue.2 , pp. 185-196
    • Dayan, P.1    Niv, Y.2
  • 10
    • 70449715719
    • Instructional control of reinforcement learning: A behavioral and neurocomputational investigation
    • 19595993 10.1016/j.brainres.2009.07.007 1:CAS:528:DC%2BD1MXht1OnsLzK
    • Doll BB, Jacobs WJ, Sanfey AG, Frank MJ (2009) Instructional control of reinforcement learning: a behavioral and neurocomputational investigation. Brain Res 1299:74-94
    • (2009) Brain Res , vol.1299 , pp. 74-94
    • Doll, B.B.1    Jacobs, W.J.2    Sanfey, A.G.3    Frank, M.J.4
  • 11
    • 0036592023
    • Metalearning and neuromodulation
    • 12371507 10.1016/S0893-6080(02)00044-8
    • Doya K (2002) Metalearning and neuromodulation. Neural Netw 15(4-6):495-506
    • (2002) Neural Netw , vol.15 , Issue.4-6 , pp. 495-506
    • Doya, K.1
  • 12
    • 0036618011
    • Multiple model-based reinforcement learning
    • 12020450 10.1162/089976602753712972
    • Doya K, Samejima K, Katagiri K, Kawato M (2002) Multiple model-based reinforcement learning. Neural Comput 14(6):1347-1369
    • (2002) Neural Comput , vol.14 , Issue.6 , pp. 1347-1369
    • Doya, K.1    Samejima, K.2    Katagiri, K.3    Kawato, M.4
  • 13
    • 84881062266
    • Two dimensions of value: Dopamine neurons represent reward but not aversiveness
    • 23908236 10.1126/science.1238699 1:CAS:528:DC%2BC3sXhtFyjsrzL
    • Fiorillo CD (2013) Two dimensions of value: dopamine neurons represent reward but not aversiveness. Science 341(6145):546-549
    • (2013) Science , vol.341 , Issue.6145 , pp. 546-549
    • Fiorillo, C.D.1
  • 14
    • 10344250993
    • By carrot or by stick: Cognitive reinforcement learning in parkinsonism
    • 15528409 10.1126/science.1102941 1:CAS:528:DC%2BD2cXhtVCqtbvK
    • Frank MJ, Seeberger LC, O'Reilly RC (2004) By carrot or by stick: cognitive reinforcement learning in parkinsonism. Science 306(5703):1940-1943
    • (2004) Science , vol.306 , Issue.5703 , pp. 1940-1943
    • Frank, M.J.1    Seeberger, L.C.2    O'Reilly, R.C.3
  • 15
    • 36048939114
    • Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning
    • 17913879 10.1073/pnas.0706111104 1:CAS:528:DC%2BD2sXhtF2gsr%2FE
    • Frank MJ, Moustafa AA, Haughey HM, Curran T, Hutchison KE (2007) Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning. Proc Natl Acad Sci 104(41):16311-16316
    • (2007) Proc Natl Acad Sci , vol.104 , Issue.41 , pp. 16311-16316
    • Frank, M.J.1    Moustafa, A.A.2    Haughey, H.M.3    Curran, T.4    Hutchison, K.E.5
  • 16
    • 68149138772
    • Prefrontal and striatal dopaminergic genes predict individual differences in exploration and exploitation
    • 19620978 10.1038/nn.2342 1:CAS:528:DC%2BD1MXoslymtLY%3D
    • Frank MJ, Doll BB, Oas-Terpstra J, Moreno F (2009) Prefrontal and striatal dopaminergic genes predict individual differences in exploration and exploitation. Nat Neurosci 12(8):1062-1068
    • (2009) Nat Neurosci , vol.12 , Issue.8 , pp. 1062-1068
    • Frank, M.J.1    Doll, B.B.2    Oas-Terpstra, J.3    Moreno, F.4
  • 17
    • 0025572196
    • D1 and D2 dopamine receptor-regulated gene expression of striatonigral and striatopallidal neurons
    • 2147780 10.1126/science.2147780 1:CAS:528:DyaK3MXjs1Gjsg%3D%3D
    • Gerfen CR, Engber TM, Mahan LC, Susel Z, Chase TN, Monsma FJ Jr, Sibley DR (1990) D1 and D2 dopamine receptor-regulated gene expression of striatonigral and striatopallidal neurons. Science 250:1429-1432
    • (1990) Science , vol.250 , pp. 1429-1432
    • Gerfen, C.R.1    Engber, T.M.2    Mahan, L.C.3    Susel, Z.4    Chase, T.N.5    Monsma, Jr.F.J.6    Sibley, D.R.7
  • 18
    • 77952541839
    • Learning latent structure: Carving nature at its joints
    • 20227271 10.1016/j.conb.2010.02.008 1:CAS:528:DC%2BC3cXlsFCis74%3D
    • Gershman SJ, Niv Y (2010) Learning latent structure: carving nature at its joints. Curr Opin Neurobiol 20(2):251-256
    • (2010) Curr Opin Neurobiol , vol.20 , Issue.2 , pp. 251-256
    • Gershman, S.J.1    Niv, Y.2
  • 19
    • 84856102932
    • Dopamine system dysregulation by the hippocampus: Implications for the pathophysiology and treatment of schizophrenia
    • 21621548 10.1016/j.neuropharm.2011.05.011 1:CAS:528:DC%2BC38XhtFWks7k%3D
    • Grace AA (2012) Dopamine system dysregulation by the hippocampus: implications for the pathophysiology and treatment of schizophrenia. Neuropharmacology 62(3):1342-1348
    • (2012) Neuropharmacology , vol.62 , Issue.3 , pp. 1342-1348
    • Grace, A.A.1
  • 20
    • 84862201490
    • Dopaminergic control of the exploration-exploitation trade-off via the basal ganglia
    • 22347155
    • Humphries MD, Khamassi M, Gurney K (2012) Dopaminergic control of the exploration-exploitation trade-off via the basal ganglia. Front Neurosci 6(February):9
    • (2012) Front Neurosci , vol.6 , pp. 9
    • Humphries, M.D.1    Khamassi, M.2    Gurney, K.3
  • 21
    • 0000125532
    • Prospect theory: An analysis of decision under risk
    • Kahneman D, Tversky A (1979) Prospect theory: an analysis of decision under risk. Econometrica 47(2):263-292
    • (1979) Econometrica , vol.47 , Issue.2 , pp. 263-292
    • Kahneman, D.1    Tversky, A.2
  • 22
    • 84859019561
    • Robot cognitive control with a neurophysiologically inspired reinforcement learning model
    • Khamassi M, Lallée S, Enel P, Procyk E, Dominey PF (2011) Robot cognitive control with a neurophysiologically inspired reinforcement learning model. Front Neurorobot 5(July):1
    • (2011) Front Neurorobot , vol.5 , pp. 1
    • Khamassi, M.1    Lallée, S.2    Enel, P.3    Procyk, E.4    Dominey, P.F.5
  • 23
    • 84872378856
    • Medial prefrontal cortex and the adaptive regulation of reinforcement learning parameters
    • 23317844 10.1016/B978-0-444-62604-2.00022-8
    • Khamassi M, Enel P, Dominey PF, Procyk E (2013) Medial prefrontal cortex and the adaptive regulation of reinforcement learning parameters. Prog Brain Res 202:441-464
    • (2013) Prog Brain Res , vol.202 , pp. 441-464
    • Khamassi, M.1    Enel, P.2    Dominey, P.F.3    Procyk, E.4
  • 24
    • 84861545384
    • Distinct roles for direct and indirect pathway striatal neurons in reinforcement
    • Kravitz AV, Tye LD, Kreitzer AC (2012) Distinct roles for direct and indirect pathway striatal neurons in reinforcement. Nat Neurosci 15:816-818
    • (2012) Nat Neurosci , vol.15 , pp. 816-818
    • Kravitz, A.V.1    Tye, L.D.2    Kreitzer, A.C.3
  • 25
    • 70449382577
    • Temporal-difference reinforcement learning with distributed representations
    • 19841749 10.1371/journal.pone.0007362
    • Kurth-Nelson Z, Redish AD (2009) Temporal-difference reinforcement learning with distributed representations. PLoS One 4(10):e7362
    • (2009) PLoS One , vol.4 , Issue.10 , pp. e7362
    • Kurth-Nelson, Z.1    Redish, A.D.2
  • 26
    • 79251569290
    • From reinforcement learning models to psychiatric and neurological disorders
    • 21270784 10.1038/nn.2723 1:CAS:528:DC%2BC3MXhtFShs70%3D
    • Maia TV, Frank MJ (2011) From reinforcement learning models to psychiatric and neurological disorders. Nat Neurosci 14(2):154-162
    • (2011) Nat Neurosci , vol.14 , Issue.2 , pp. 154-162
    • Maia, T.V.1    Frank, M.J.2
  • 27
    • 0036832952
    • Risk-sensitive reinforcement learning
    • 10.1023/A:1017940631555
    • Mihatsch O, Neuneier R (2002) Risk-sensitive reinforcement learning. Mach Learn 49:267-290
    • (2002) Mach Learn , vol.49 , pp. 267-290
    • Mihatsch, O.1    Neuneier, R.2
  • 28
    • 26444446315
    • Dopamine, uncertainty and TD learning
    • 15953384 10.1186/1744-9081-1-6
    • Niv Y, Duff MO, Dayan P (2005) Dopamine, uncertainty and TD learning. Behav Brain Funct 1:6
    • (2005) Behav Brain Funct , vol.1 , pp. 6
    • Niv, Y.1    Duff, M.O.2    Dayan, P.3
  • 29
    • 33847675011
    • Tonic dopamine: Opportunity costs and the control of response vigor
    • 17031711 10.1007/s00213-006-0502-4 1:CAS:528:DC%2BD2sXitlCqtbo%3D
    • Niv Y, Daw ND, Joel D, Dayan P (2007) Tonic dopamine: opportunity costs and the control of response vigor. Psychopharmacology 191(3):507-520
    • (2007) Psychopharmacology , vol.191 , Issue.3 , pp. 507-520
    • Niv, Y.1    Daw, N.D.2    Joel, D.3    Dayan, P.4
  • 30
    • 34447643062
    • Model-based fMRI and its application to reward learning and decision making
    • 17416921 10.1196/annals.1390.022
    • O'Doherty JP, Hampton A, Kim H (2007) Model-based fMRI and its application to reward learning and decision making. Ann NY Acad Sci 1104:35-53
    • (2007) Ann NY Acad Sci , vol.1104 , pp. 35-53
    • O'Doherty, J.P.1    Hampton, A.2    Kim, H.3
  • 31
    • 10344225664
    • Addiction as a computational process gone awry
    • 15591205 10.1126/science.1102384 1:CAS:528:DC%2BD2cXhtVCqtbvL
    • Redish AD (2004) Addiction as a computational process gone awry. Science 306(5703):1944-1947
    • (2004) Science , vol.306 , Issue.5703 , pp. 1944-1947
    • Redish, A.D.1
  • 32
    • 32444439058
    • Behavioral theories and the neurophysiology of reward
    • 16318590 10.1146/annurev.psych.56.091103.070229
    • Schultz W (2006) Behavioral theories and the neurophysiology of reward. Annu Rev Psychol 57:87-115
    • (2006) Annu Rev Psychol , vol.57 , pp. 87-115
    • Schultz, W.1
  • 33
    • 0037258402
    • Meta-learning in reinforcement learning
    • 12576101 10.1016/S0893-6080(02)00228-9
    • Schweighofer N, Doya K (2003) Meta-learning in reinforcement learning. Neural Netw 16(1):5-9
    • (2003) Neural Netw , vol.16 , Issue.1 , pp. 5-9
    • Schweighofer, N.1    Doya, K.2
  • 34
    • 82955247133
    • The optimism bias
    • Sharot T (2011) The optimism bias. Curr Biol 21(23):R941-R945
    • (2011) Curr Biol , vol.21 , Issue.23 , pp. R941-R945
    • Sharot, T.1
  • 35
    • 80054966343
    • How unrealistic optimism is maintained in the face of reality
    • 21983684 10.1038/nn.2949 1:CAS:528:DC%2BC3MXht12hsrzM
    • Sharot T, Korn CW, Dolan RJ (2011) How unrealistic optimism is maintained in the face of reality. Nat Neurosci 14(11):1475-1479
    • (2011) Nat Neurosci , vol.14 , Issue.11 , pp. 1475-1479
    • Sharot, T.1    Korn, C.W.2    Dolan, R.J.3
  • 36
    • 84880660982
    • The expected value of control: An integrative theory of anterior cingulate cortex function
    • 23889930 10.1016/j.neuron.2013.07.007 1:CAS:528:DC%2BC3sXhtFOktbfK
    • Shenhav A, Botvinick MM, Cohen JD (2013) The expected value of control: an integrative theory of anterior cingulate cortex function. Neuron 79(2):217-240
    • (2013) Neuron , vol.79 , Issue.2 , pp. 217-240
    • Shenhav, A.1    Botvinick, M.M.2    Cohen, J.D.3
  • 39
    • 84859344211
    • Information processing in decision-making systems
    • 22492194 10.1177/1073858411435128
    • van der Meer M, Kurth-Nelson Z, Redish AD (2012) Information processing in decision-making systems. Neuroscientist 18(4):342-359
    • (2012) Neuroscientist , vol.18 , Issue.4 , pp. 342-359
    • Van Der Meer, M.1    Kurth-Nelson, Z.2    Redish, A.D.3
  • 41
    • 36148934509
    • Adaptive behavior: Humans act as bayesian learners
    • 18029257 10.1016/j.cub.2007.09.007 1:CAS:528:DC%2BD2sXhtlWktb%2FM
    • Yu AJ (2007) Adaptive behavior: humans act as bayesian learners. Curr Biol 17(22):R977-R980
    • (2007) Curr Biol , vol.17 , Issue.22 , pp. R977-R980
    • Yu, A.J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.