SCOPUS 정보 검색 플랫폼

Biological Cybernetics

Volumn 107, Issue 6, 2013, Pages 711-719

Adaptive properties of differential learning rates for positive and negative outcomes

(2) Cazé, Romain D a Van Der Meer, Matthijs A A b

a IMPERIAL COLLEGE LONDON (United Kingdom)

b UNIVERSITY OF WATERLOO (Canada)

Author keywords

Basal ganglia; Decision making; Meta learning; Reinforcement learning; Reward prediction error

Indexed keywords

ASYMMETRIC LEARNING; BASAL GANGLIA; COGNITIVE SCIENCE; METALEARNING; PREDICTION ERRORS; REWARD-PREDICTION ERROR; SUB-OPTIMAL CHOICES; SUPPORT LEARNING;

DECISION MAKING; ERRORS; LEARNING ALGORITHMS; REINFORCEMENT LEARNING;

FORECASTING;

ADAPTATION; ALGORITHM; ARTICLE; BIOLOGICAL MODEL; DECISION MAKING; HUMAN; LEARNING; PHYSIOLOGY; PREDICTIVE VALUE; REINFORCEMENT;

ADAPTATION, PHYSIOLOGICAL; ALGORITHMS; CHOICE BEHAVIOR; HUMANS; LEARNING; MODELS, BIOLOGICAL; PREDICTIVE VALUE OF TESTS; REINFORCEMENT (PSYCHOLOGY);

EID: 84890950743 PISSN: 03401200 EISSN: 14320770 Source Type: Journal
DOI: 10.1007/s00422-013-0571-5 Document Type: Article

Times cited : (71)

References (41)

1
- 34548295327
- Learning the value of information in an uncertain world
- 17676057 10.1038/nn1954 1:CAS:528:DC%2BD2sXps1Sgurg%3D
- Behrens TEJ, Woolrich MW, Walton ME, Rushworth MFS (2007) Learning the value of information in an uncertain world. Nat Neurosci 10(9):1214-1221
- (2007) Nat Neurosci , vol.10 , Issue.9 , pp. 1214-1221
- Behrens, T.E.J.¹ Woolrich, M.W.² Walton, M.E.³ Rushworth, M.F.S.⁴

2
- 78649966665
- Dopamine in motivational control: Rewarding, aversive, and alerting
- 21144997 10.1016/j.neuron.2010.11.022 1:CAS:528:DC%2BC3cXhsFGhu73O
- Bromberg-Martin ES, Matsumoto M, Hikosaka O (2010) Dopamine in motivational control: rewarding, aversive, and alerting. Neuron 68(5):815-834
- (2010) Neuron , vol.68 , Issue.5 , pp. 815-834
- Bromberg-Martin, E.S.¹ Matsumoto, M.² Hikosaka, O.³

3
- 79955770782
- Social stress reactivity alters reward and punishment learning
- 20453038 10.1093/scan/nsq041
- Cavanagh JF, Frank MJ (2011) Social stress reactivity alters reward and punishment learning. Soc Cogn Affect Neurosci 6(3):311-320
- (2011) Soc Cogn Affect Neurosci , vol.6 , Issue.3 , pp. 311-320
- Cavanagh, J.F.¹ Frank, M.J.²

4
- 77951786435
- Gambling severity predicts midbrain response to near-miss outcomes
- 20445043 10.1523/JNEUROSCI.5758-09.2010 1:CAS:528:DC%2BC3cXovVWmsL4%3D
- Chase HW, Clark L (2010) Gambling severity predicts midbrain response to near-miss outcomes. J Neurosci 30(18):6180-6187
- (2010) J Neurosci , vol.30 , Issue.18 , pp. 6180-6187
- Chase, H.W.¹ Clark, L.²

5
- 60749098586
- Neurobiological studies of risk assessment: A comparison of expected utility and mean-variance approaches
- 19033235 10.3758/CABN.8.4.363
- D'Acremont M, Bossaerts P (2008) Neurobiological studies of risk assessment: a comparison of expected utility and mean-variance approaches. Cogn Affect Behav Neurosci 8(4):363-374
- (2008) Cogn Affect Behav Neurosci , vol.8 , Issue.4 , pp. 363-374
- D'Acremont, M.¹ Bossaerts, P.²

6
- 34548837994
- Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling
- 17638506 10.1037/0033-295X.114.3.784
- Redish AD, Jensen S, Johnson A, Kurth-Nelson Z (2007) Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling. Psychol Rev 114(3):784-805
- (2007) Psychol Rev , vol.114 , Issue.3 , pp. 784-805
- Redish, A.D.¹ Jensen, S.² Johnson, A.³ Kurth-Nelson, Z.⁴

7
- 33745223257
- Cortical substrates for exploratory decisions in humans
- 16778890 10.1038/nature04766 1:CAS:528:DC%2BD28XlvVGltrw%3D
- Daw ND, O'Doherty JP, Dayan P, Seymour B, Dolan RJ (2006) Cortical substrates for exploratory decisions in humans. Nature 441(7095):876-879
- (2006) Nature , vol.441 , Issue.7095 , pp. 876-879
- Daw, N.D.¹ O'Doherty, J.P.² Dayan, P.³ Seymour, B.⁴ Dolan, R.J.⁵

8
- 79952746011
- Model-based influences on humans' choices and striatal prediction errors
- 21435563 10.1016/j.neuron.2011.02.027 1:CAS:528:DC%2BC3MXjvFejsLY%3D
- Daw ND, Gershman SJ, Seymour B, Dayan P, Dolan RJ (2011) Model-based influences on humans' choices and striatal prediction errors. Neuron 69(6):1204-1215
- (2011) Neuron , vol.69 , Issue.6 , pp. 1204-1215
- Daw, N.D.¹ Gershman, S.J.² Seymour, B.³ Dayan, P.⁴ Dolan, R.J.⁵

9
- 52049107354
- Reinforcement learning: The good, the bad and the ugly
- 18708140 10.1016/j.conb.2008.08.003 1:CAS:528:DC%2BD1cXhtFCku7rF
- Dayan P, Niv Y (2008) Reinforcement learning: the good, the bad and the ugly. Curr Opin Neurobiol 18(2):185-196
- (2008) Curr Opin Neurobiol , vol.18 , Issue.2 , pp. 185-196
- Dayan, P.¹ Niv, Y.²

10
- 70449715719
- Instructional control of reinforcement learning: A behavioral and neurocomputational investigation
- 19595993 10.1016/j.brainres.2009.07.007 1:CAS:528:DC%2BD1MXht1OnsLzK
- Doll BB, Jacobs WJ, Sanfey AG, Frank MJ (2009) Instructional control of reinforcement learning: a behavioral and neurocomputational investigation. Brain Res 1299:74-94
- (2009) Brain Res , vol.1299 , pp. 74-94
- Doll, B.B.¹ Jacobs, W.J.² Sanfey, A.G.³ Frank, M.J.⁴

11
- 0036592023
- Metalearning and neuromodulation
- 12371507 10.1016/S0893-6080(02)00044-8
- Doya K (2002) Metalearning and neuromodulation. Neural Netw 15(4-6):495-506
- (2002) Neural Netw , vol.15 , Issue.4-6 , pp. 495-506
- Doya, K.¹

12
- 0036618011
- Multiple model-based reinforcement learning
- 12020450 10.1162/089976602753712972
- Doya K, Samejima K, Katagiri K, Kawato M (2002) Multiple model-based reinforcement learning. Neural Comput 14(6):1347-1369
- (2002) Neural Comput , vol.14 , Issue.6 , pp. 1347-1369
- Doya, K.¹ Samejima, K.² Katagiri, K.³ Kawato, M.⁴

13
- 84881062266
- Two dimensions of value: Dopamine neurons represent reward but not aversiveness
- 23908236 10.1126/science.1238699 1:CAS:528:DC%2BC3sXhtFyjsrzL
- Fiorillo CD (2013) Two dimensions of value: dopamine neurons represent reward but not aversiveness. Science 341(6145):546-549
- (2013) Science , vol.341 , Issue.6145 , pp. 546-549
- Fiorillo, C.D.¹

14
- 10344250993
- By carrot or by stick: Cognitive reinforcement learning in parkinsonism
- 15528409 10.1126/science.1102941 1:CAS:528:DC%2BD2cXhtVCqtbvK
- Frank MJ, Seeberger LC, O'reilly RC (2004) By carrot or by stick: cognitive reinforcement learning in parkinsonism. Science 306(5703):1940-1943
- (2004) Science , vol.306 , Issue.5703 , pp. 1940-1943
- Frank, M.J.¹ Seeberger, L.C.² O'Reilly, R.C.³

15
- 36048939114
- Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning
- 17913879 10.1073/pnas.0706111104 1:CAS:528:DC%2BD2sXhtF2gsr%2FE
- Frank MJ, Moustafa AA, Haughey HM, Curran T, Hutchison KE (2007) Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning. Proc Natl Acad Sci 104(41):16311-16316
- (2007) Proc Natl Acad Sci , vol.104 , Issue.41 , pp. 16311-16316
- Frank, M.J.¹ Moustafa, A.A.² Haughey, H.M.³ Curran, T.⁴ Hutchison, K.E.⁵

16
- 68149138772
- Prefrontal and striatal dopaminergic genes predict individual differences in exploration and exploitation
- 19620978 10.1038/nn.2342 1:CAS:528:DC%2BD1MXoslymtLY%3D
- Frank MJ, Doll BB, Oas-Terpstra J, Moreno F (2009) Prefrontal and striatal dopaminergic genes predict individual differences in exploration and exploitation. Nat Neurosci 12(8):1062-1068
- (2009) Nat Neurosci , vol.12 , Issue.8 , pp. 1062-1068
- Frank, M.J.¹ Doll, B.B.² Oas-Terpstra, J.³ Moreno, F.⁴

17
- 0025572196
- $$\text{ D }-1$$ D 1 and $$\ text{ D }-2$$ D 2 dopamine receptor-regulated gene expression of striatonigral and striatopallidal neurons
- 2147780 10.1126/science.2147780 1:CAS:528:DyaK3MXjs1Gjsg%3D%3D
- Gerfen CR, Engber TM, Mahan LC, Susel Z, Chase TN, Monsma FJ Jr, Sibley DR (1990) $$\text{ D }-1$$ D 1 and $$\text{ D }-2$$ D 2 dopamine receptor-regulated gene expression of striatonigral and striatopallidal neurons. Science 250:1429-1432
- (1990) Science , vol.250 , pp. 1429-1432
- Gerfen, C.R.¹ Engber, T.M.² Mahan, L.C.³ Susel, Z.⁴ Chase, T.N.⁵ Monsma, Jr.F.J.⁶ Sibley, D.R.⁷

18
- 77952541839
- Learning latent structure: Carving nature at its joints
- 20227271 10.1016/j.conb.2010.02.008 1:CAS:528:DC%2BC3cXlsFCis74%3D
- Gershman SJ, Niv Y (2010) Learning latent structure: carving nature at its joints. Curr Opin Neurobiol 20(2):251-256
- (2010) Curr Opin Neurobiol , vol.20 , Issue.2 , pp. 251-256
- Gershman, S.J.¹ Niv, Y.²

19
- 84856102932
- Dopamine system dysregulation by the hippocampus: Implications for the pathophysiology and treatment of schizophrenia
- 21621548 10.1016/j.neuropharm.2011.05.011 1:CAS:528:DC%2BC38XhtFWks7k%3D
- Grace AA (2012) Dopamine system dysregulation by the hippocampus: implications for the pathophysiology and treatment of schizophrenia. Neuropharmacology 62(3):1342-1348
- (2012) Neuropharmacology , vol.62 , Issue.3 , pp. 1342-1348
- Grace, A.A.¹

20
- 84862201490
- Dopaminergic control of the exploration-exploitation trade-off via the basal ganglia
- FEBRUARY 22347155
- Humphries MD, Khamassi M, Gurney K (2012) Dopaminergic control of the exploration-exploitation trade-off via the basal ganglia. Front Neurosci 6(February):9
- (2012) Front Neurosci , vol.6 , pp. 9
- Humphries, M.D.¹ Khamassi, M.² Gurney, K.³

21
- 0000125532
- Prospect theory: An analysis of decision under risk
- Kahneman D, Tversky A (1979) Prospect theory: an analysis of decision under risk. Econ J Econ Soc 47(2):263-292
- (1979) Econ J Econ Soc , vol.47 , Issue.2 , pp. 263-292
- Kahneman, D.¹ Tversky, A.²

22
- 84859019561
- Robot cognitive control with a neurophysiologically inspired reinforcement learning model
- JULY
- Khamassi M, Lallée S, Enel P, Procyk E, Dominey PF (2011) Robot cognitive control with a neurophysiologically inspired reinforcement learning model. Front Neurorobotic 5(July):1
- (2011) Front Neurorobotic , vol.5 , pp. 1
- Khamassi, M.¹ Lallée, S.² Enel, P.³ Procyk, E.⁴ Dominey, P.F.⁵

23
- 84872378856
- Medial prefrontal cortex and the adaptive regulation of reinforcement learning parameters
- 23317844 10.1016/B978-0-444-62604-2.00022-8
- Khamassi M, Enel P, Dominey PF, Procyk E (2013) Medial prefrontal cortex and the adaptive regulation of reinforcement learning parameters. Prog Brain Res 202:441-464
- (2013) Prog Brain Res , vol.202 , pp. 441-464
- Khamassi, M.¹ Enel, P.² Dominey, P.F.³ Procyk, E.⁴

24
- 84861545384
- Distinct roles for direct and indirect pathway striatal neurons in reinforcement
- Kravitz AV, Tye LD, Kreitzer AC (2012) Distinct roles for direct and indirect pathway striatal neurons in reinforcement. Nat Neurosci 15:816-818
- (2012) Nat Neurosci , vol.15 , pp. 816-818
- Kravitz, A.V.¹ Tye, L.D.² Kreitzer, A.C.³

25
- 70449382577
- Temporal-difference reinforcement learning with distributed representations
- 19841749 10.1371/journal.pone.0007362
- Kurth-Nelson Z, Redish AD (2009) Temporal-difference reinforcement learning with distributed representations. PLoS One 4(10):e7362
- (2009) PLoS One , vol.4 , Issue.10 , pp. 7362
- Kurth-Nelson, Z.¹ Redish, A.D.²

26
- 79251569290
- From reinforcement learning models to psychiatric and neurological disorders
- 21270784 10.1038/nn.2723 1:CAS:528:DC%2BC3MXhtFShs70%3D
- Maia TV, Frank MJ (2011) From reinforcement learning models to psychiatric and neurological disorders. Nat Neurosci 14(2):154-162
- (2011) Nat Neurosci , vol.14 , Issue.2 , pp. 154-162
- Maia, T.V.¹ Frank, M.J.²

27
- 0036832952
- Risk-sensitive reinforcement learning
- 10.1023/A:1017940631555
- Mihatsch O, Neuneier R (2002) Risk-sensitive reinforcement learning. Mach Learn 49:267-290
- (2002) Mach Learn , vol.49 , pp. 267-290
- Mihatsch, O.¹ Neuneier, R.²

28
- 26444446315
- Dopamine, uncertainty and TD learning
- 15953384 10.1186/1744-9081-1-6
- Niv Y, Duff MO, Dayan P (2005) Dopamine, uncertainty and TD learning. Behav Brain Funct 1:6
- (2005) Behav Brain Funct , vol.1 , pp. 6
- Niv, Y.¹ Duff, M.O.² Dayan, P.³

29
- 33847675011
- Tonic dopamine: Opportunity costs and the control of response vigor
- 17031711 10.1007/s00213-006-0502-4 1:CAS:528:DC%2BD2sXitlCqtbo%3D
- Niv Y, Daw ND, Joel D, Dayan P (2007) Tonic dopamine: opportunity costs and the control of response vigor. Psychopharmacology 191(3):507-520
- (2007) Psychopharmacology , vol.191 , Issue.3 , pp. 507-520
- Niv, Y.¹ Daw, N.D.² Joel, D.³ Dayan, P.⁴

30
- 34447643062
- Model-based fMRI and its application to reward learning and decision making
- 17416921 10.1196/annals.1390.022
- O'Doherty JP, Hampton A, Kim H (2007) Model-based fMRI and its application to reward learning and decision making. Ann NY Acad Sci 1104:35-53
- (2007) Ann NY Acad Sci , vol.1104 , pp. 35-53
- O'Doherty, J.P.¹ Hampton, A.² Kim, H.³

31
- 10344225664
- Addiction as a computational process gone awry
- 15591205 10.1126/science.1102384 1:CAS:528:DC%2BD2cXhtVCqtbvL
- Redish AD (2004) Addiction as a computational process gone awry. Science 306(5703):1944-1947
- (2004) Science , vol.306 , Issue.5703 , pp. 1944-1947
- Redish, A.D.¹

32
- 32444439058
- Behavioral theories and the neurophysiology of reward
- 16318590 10.1146/annurev.psych.56.091103.070229
- Schultz W (2006) Behavioral theories and the neurophysiology of reward. Annu Rev Psychol 57:87-115
- (2006) Annu Rev Psychol , vol.57 , pp. 87-115
- Schultz, W.¹

33
- 0037258402
- Meta-learning in reinforcement learning
- 12576101 10.1016/S0893-6080(02)00228-9
- Schweighofer N, Doya K (2003) Meta-learning in reinforcement learning. Neural Netw 16(1):5-9
- (2003) Neural Netw , vol.16 , Issue.1 , pp. 5-9
- Schweighofer, N.¹ Doya, K.²

34
- 82955247133
- The optimism bias
- Sharot T (2011) The optimism bias. Curr Biol 21(23):R941-R945
- (2011) Curr Biol , vol.21 , Issue.23
- Sharot, T.¹

35
- 80054966343
- How unrealistic optimism is maintained in the face of reality
- 21983684 10.1038/nn.2949 1:CAS:528:DC%2BC3MXht12hsrzM
- Sharot T, Korn CW, Dolan RJ (2011) How unrealistic optimism is maintained in the face of reality. Nat Neurosci 14(11):1475-1479
- (2011) Nat Neurosci , vol.14 , Issue.11 , pp. 1475-1479
- Sharot, T.¹ Korn, C.W.² Dolan, R.J.³

36
- 84880660982
- The expected value of control: An integrative theory of anterior cingulate cortex function
- 23889930 10.1016/j.neuron.2013.07.007 1:CAS:528:DC%2BC3sXhtFOktbfK
- Shenhav A, Botvinick MM, Cohen JD (2013) The expected value of control: an integrative theory of anterior cingulate cortex function. Neuron 79(2):217-240
- (2013) Neuron , vol.79 , Issue.2 , pp. 217-240
- Shenhav, A.¹ Botvinick, M.M.² Cohen, J.D.³

37
- 0003617454
- Doctoral Dissertation, UMass Amherst
- Sutton RS (1984) Temporal credit assignment in reinforcement learning. Doctoral Dissertation, UMass Amherst
- (1984) Temporal Credit Assignment in Reinforcement Learning
- Sutton, R.S.¹

38
- 0004102479
- MIT Press Cambridge, MA
- Sutton RS, Barto AG (1998) Reinforcement learning: an introduction. MIT Press, Cambridge, MA
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

39
- 84859344211
- Information processing in decision-making systems
- 22492194 10.1177/1073858411435128
- van der Meer M, Kurth-Nelson Z, Redish AD (2012) Information processing in decision-making systems. Neuroscientist 18(4):342-359
- (2012) Neuroscientist , vol.18 , Issue.4 , pp. 342-359
- Van Der Meer, M.¹ Kurth-Nelson, Z.² Redish, A.D.³

40
- 0004049893
- PhD thesis
- Watkins C (1989) Learning from delayed rewards. PhD thesis
- (1989) Learning from delayed rewards
- Watkins, C.¹

41
- 36148934509
- Adaptive behavior: Humans act as bayesian learners
- 18029257 10.1016/j.cub.2007.09.007 1:CAS:528:DC%2BD2sXhtlWktb%2FM
- Yu AJ (2007) Adaptive behavior: humans act as bayesian learners. Curr Biol 17(22):R977-R980
- (2007) Curr Biol , vol.17 , Issue.22
- Yu, A.J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.