메뉴 건너뛰기




Volumn 22, Issue 5, 2015, Pages 1320-1327

Do learning rates adapt to the distribution of rewards?

Author keywords

Decision making; Multi armed bandit; Reinforcement learning

Indexed keywords

ADULT; FEMALE; HIGH RISK BEHAVIOR; HUMAN; LEARNING; MALE; MOTIVATION; REINFORCEMENT; REWARD;

EID: 84942370022     PISSN: 10699384     EISSN: 15315320     Source Type: Journal    
DOI: 10.3758/s13423-014-0790-3     Document Type: Article
Times cited : (82)

References (26)
  • 2
    • 77955458973 scopus 로고    scopus 로고
    • Multiple timescales of memory in lateral habenula and dopamine neurons
    • PID: 20696385
    • Bromberg-Martin, E.S., Matsumoto, M., Nakahara, H., Hikosaka, O. (2010). Multiple timescales of memory in lateral habenula and dopamine neurons. Neuron, 67, 499–510.
    • (2010) Neuron , vol.67 , pp. 499-510
    • Bromberg-Martin, E.S.1    Matsumoto, M.2    Nakahara, H.3    Hikosaka, O.4
  • 3
    • 84890950743 scopus 로고    scopus 로고
    • Adaptive properties of differential learning rates for positive and negative outcomes
    • PID: 24085507
    • Cazé, R.D., & van der Meer, M.A. (2013). Adaptive properties of differential learning rates for positive and negative outcomes. Biological Cybernetics, 107, 711–719.
    • (2013) Biological Cybernetics , vol.107 , pp. 711-719
    • Cazé, R.D.1    van der Meer, M.A.2
  • 5
    • 84874841717 scopus 로고    scopus 로고
    • Evaluating Amazon’s Mechanical Turk as a tool for experimental behavioral research
    • PID: 23516406
    • Crump, M.J., McDonnell, J.V., Gureckis, T.M. (2013). Evaluating Amazon’s Mechanical Turk as a tool for experimental behavioral research. PLoS One, 8, e57410.
    • (2013) PLoS One , vol.8 , pp. 57410
    • Crump, M.J.1    McDonnell, J.V.2    Gureckis, T.M.3
  • 6
    • 0036592008 scopus 로고    scopus 로고
    • Opponent interactions between serotonin and dopamine
    • PID: 12371515
    • Daw, N.D., Kakade, S., Dayan, P. (2002). Opponent interactions between serotonin and dopamine. Neural Networks, 15, 603–616.
    • (2002) Neural Networks , vol.15 , pp. 603-616
    • Daw, N.D.1    Kakade, S.2    Dayan, P.3
  • 7
    • 33745223257 scopus 로고    scopus 로고
    • Cortical substrates for exploratory decisions in humans
    • PID: 16778890
    • Daw, N.D., O’Doherty, J.P., Dayan, P., Seymour, B., Dolan, R.J. (2006). Cortical substrates for exploratory decisions in humans. Nature, 441, 876–879.
    • (2006) Nature , vol.441 , pp. 876-879
    • Daw, N.D.1    O’Doherty, J.P.2    Dayan, P.3    Seymour, B.4    Dolan, R.J.5
  • 9
    • 0036592023 scopus 로고    scopus 로고
    • Metalearning and neuromodulation
    • PID: 12371507
    • Doya, K. (2002). Metalearning and neuromodulation. Neural Networks, 15, 495–506.
    • (2002) Neural Networks , vol.15 , pp. 495-506
    • Doya, K.1
  • 10
    • 68149138772 scopus 로고    scopus 로고
    • Prefrontal and striatal dopaminergic genes predict individual differences in exploration and exploitation
    • PID: 19620978
    • Frank, M.J., Doll, B.B., Oas-Terpstra, J., Moreno, F. (2009). Prefrontal and striatal dopaminergic genes predict individual differences in exploration and exploitation. Nature Neuroscience, 12, 1062–1068.
    • (2009) Nature Neuroscience , vol.12 , pp. 1062-1068
    • Frank, M.J.1    Doll, B.B.2    Oas-Terpstra, J.3    Moreno, F.4
  • 12
    • 10344250993 scopus 로고    scopus 로고
    • By carrot or by stick: Cognitive reinforcement learning in Parkinsonism
    • PID: 15528409
    • Frank, M.J., Seeberger, L.C., O’Reilly, R.C. (2004). By carrot or by stick: Cognitive reinforcement learning in Parkinsonism. Science, 306, 1940–1943.
    • (2004) Science , vol.306 , pp. 1940-1943
    • Frank, M.J.1    Seeberger, L.C.2    O’Reilly, R.C.3
  • 13
    • 34547350589 scopus 로고    scopus 로고
    • Information Theory, Inference and Learning Algorithms
    • MacKay, D.J. (2003). Information Theory, Inference and Learning Algorithms. Cambridge University Press.
    • (2003) Cambridge University Press
    • MacKay, D.J.1
  • 14
    • 33746358592 scopus 로고
    • A theory of attention: Variations in the associability of stimuli with reinforcement
    • Mackintosh, N.J. (1975). A theory of attention: Variations in the associability of stimuli with reinforcement. Psychological Review, 82, 276–298.
    • (1975) Psychological Review , vol.82 , pp. 276-298
    • Mackintosh, N.J.1
  • 15
    • 0036832952 scopus 로고    scopus 로고
    • Risk-sensitive reinforcement learning
    • Mihatsch, O., & Neuneier, R. (2002). Risk-sensitive reinforcement learning. Machine Learning, 49, 267–290.
    • (2002) Machine Learning , vol.49 , pp. 267-290
    • Mihatsch, O.1    Neuneier, R.2
  • 16
    • 84855688852 scopus 로고    scopus 로고
    • Neural prediction errors reveal a risk-sensitive reinforcement-learning process in the human brain
    • PID: 22238090
    • Niv, Y., Edlund, J.A., Dayan, P., O’Doherty, J.P. (2012). Neural prediction errors reveal a risk-sensitive reinforcement-learning process in the human brain. The Journal of Neuroscience, 32, 551– 562.
    • (2012) The Journal of Neuroscience , vol.32 , pp. 551-562
    • Niv, Y.1    Edlund, J.A.2    Dayan, P.3    O’Doherty, J.P.4
  • 17
  • 18
    • 0019089514 scopus 로고
    • A model for Pavlovian learning: Variations in the effectiveness of conditioned but not of unconditioned stimuli
    • PID: 7443916
    • Pearce, J.M., & Hall, G. (1980). A model for Pavlovian learning: Variations in the effectiveness of conditioned but not of unconditioned stimuli. Psychological Review, 87, 532–552.
    • (1980) Psychological Review , vol.87 , pp. 532-552
    • Pearce, J.M.1    Hall, G.2
  • 19
    • 0002109138 scopus 로고
    • A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
    • Appleton-Century-Crofts, New York
    • Rescorla, R.A., & Wagner, A.R. (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In: Black, A., & Prokasy, W. (Eds.), Classical conditioning II: Current research and theory. Appleton-Century-Crofts, New York, (pp. 64–99).
    • (1972) Classical conditioning II: Current research and theory , pp. 64-99
    • Rescorla, R.A.1    Wagner, A.R.2    Black, A.3    Prokasy, W.4
  • 20
    • 84942380631 scopus 로고    scopus 로고
    • Casella, G: Monte Carlo statistical methods. Springer
    • Robert, C.P., & Casella, G (2004). Monte Carlo statistical methods. Springer.
    • (2004)
    • Robert, C.P.1
  • 21
    • 72849112662 scopus 로고    scopus 로고
    • Dopaminergic drugs modulate learning rates and perseveration in Parkinson’s patients in a dynamic foraging task
    • PID: 19955362
    • Rutledge, R.B., Lazzaro, S.C., Lau, B., Myers, C.E., Gluck, M.A., Glimcher, P.W. (2009). Dopaminergic drugs modulate learning rates and perseveration in Parkinson’s patients in a dynamic foraging task. The Journal of Neuroscience, 29, 15104–15114.
    • (2009) The Journal of Neuroscience , vol.29 , pp. 15104-15114
    • Rutledge, R.B.1    Lazzaro, S.C.2    Lau, B.3    Myers, C.E.4    Gluck, M.A.5    Glimcher, P.W.6
  • 22
    • 0000120766 scopus 로고
    • Estimating the dimension of a model
    • Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6, 461–464.
    • (1978) The Annals of Statistics , vol.6 , pp. 461-464
    • Schwarz, G.1
  • 25
    • 84942380632 scopus 로고    scopus 로고
    • Reinforcement learning: An introduction. MIT Press
    • Sutton, R.S., & Barto, A.G. (1998). Reinforcement learning: An introduction. MIT Press.
    • (1998) & Barto, A.G
    • Sutton, R.S.1
  • 26
    • 34548504092 scopus 로고    scopus 로고
    • Selective reinforcement learning deficits in schizophrenia support predictions from computational models of striatal-cortical dysfunction
    • PID: 17300757
    • Waltz, J.A., Frank, M.J., Robinson, B.M., Gold, J.M. (2007). Selective reinforcement learning deficits in schizophrenia support predictions from computational models of striatal-cortical dysfunction. Biological Psychiatry, 62, 756–764.
    • (2007) Biological Psychiatry , vol.62 , pp. 756-764
    • Waltz, J.A.1    Frank, M.J.2    Robinson, B.M.3    Gold, J.M.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.