메뉴 건너뛰기




Volumn , Issue , 2011, Pages

Environmental statistics and the trade-off between model-based and TD learning in humans

Author keywords

[No Author keywords available]

Indexed keywords

ECONOMIC AND SOCIAL EFFECTS; LEARNING SYSTEMS;

EID: 85162381627     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (22)

References (24)
  • 1
    • 84882523833 scopus 로고    scopus 로고
    • Multiple forms of value learning and the function of dopamine
    • Paul W. Glimcher, Colin F. Camerer, Ernst Fehr, and Russell A. Poldrack, editors, chapter 24. Academic Press, London
    • Bernard W. Balleine, Nathaniel D. Daw, and John P. O'Doherty. Multiple forms of value learning and the function of dopamine. In Paul W. Glimcher, Colin F. Camerer, Ernst Fehr, and Russell A. Poldrack, editors, Neuroeconomics: Decision Making and the Brain, chapter 24, pages 367-387. Academic Press, London, 2008.
    • (2008) Neuroeconomics: Decision Making and the Brain , pp. 367-387
    • Balleine, B.W.1    Daw, N.D.2    O'doherty, J.P.3
  • 2
    • 27644568988 scopus 로고    scopus 로고
    • Decision making, impulse control and loss of willpower to resist drugs: A neurocognitive perspective
    • Antoine Bechara. Decision making, impulse control and loss of willpower to resist drugs: a neurocognitive perspective. Nat Neurosci, 8(11):1458-63, 2005.
    • (2005) Nat Neurosci , vol.8 , Issue.11 , pp. 1458-1463
    • Bechara, A.1
  • 3
    • 0344625375 scopus 로고    scopus 로고
    • The interaction of cognitive and stimulus-response processes in the control of behaviour
    • Frederick Toates. The interaction of cognitive and stimulus-response processes in the control of behaviour. Neuroscience & Biobehavioral Reviews, 22(1):59-83, 1997.
    • (1997) Neuroscience & Biobehavioral Reviews , vol.22 , Issue.1 , pp. 59-83
    • Toates, F.1
  • 4
    • 67349170462 scopus 로고    scopus 로고
    • Goal-directed control and its antipodes
    • Peter Dayan. Goal-directed control and its antipodes. Neural Netw, 22:213-219, 2009.
    • (2009) Neural Netw , vol.22 , pp. 213-219
    • Dayan, P.1
  • 5
    • 0011627410 scopus 로고
    • Feedback and task predictability as determinants of performance in multiple cue probability learning tasks
    • Neal Schmitt, Bryan W. Coyle, and Larry King. Feedback and task predictability as determinants of performance in multiple cue probability learning tasks. Organ Behav Hum Perform, 16(2):388-402, 1976.
    • (1976) Organ Behav Hum Perform , vol.16 , Issue.2 , pp. 388-402
    • Schmitt, N.1    Coyle, B.W.2    King, L.3
  • 6
    • 0001284732 scopus 로고
    • Task information and performance in probabilistic inference tasks
    • Berndt Brehmer and Jan Kuylenstierna. Task information and performance in probabilistic inference tasks. Organ Behav Hum Perform, 22:445-464, 1978.
    • (1978) Organ Behav Hum Perform , vol.22 , pp. 445-464
    • Brehmer, B.1    Kuylenstierna, J.2
  • 7
    • 0028467447 scopus 로고
    • Probabilistic classification learning in amnesia
    • B J Knowlton, L R Squire, and M A Gluck. Probabilistic classification learning in amnesia. Learn Mem, 1(2):106-120, 1994.
    • (1994) Learn Mem , vol.1 , Issue.2 , pp. 106-120
    • Knowlton, B.J.1    Squire, L.R.2    Gluck, M.A.3
  • 8
    • 2442612549 scopus 로고    scopus 로고
    • Dissociating explicit and procedural-learning based systems of perceptual category learning
    • W. Todd Maddox and F. Gregory Ashby. Dissociating explicit and procedural-learning based systems of perceptual category learning. Behavioural Processes, 66(3):309-332, 2004.
    • (2004) Behavioural Processes , vol.66 , Issue.3 , pp. 309-332
    • Maddox, W.T.1    Ashby, F.G.2
  • 9
    • 0942278855 scopus 로고    scopus 로고
    • Category number impacts rule-based but not information-integration category learning: Further evidence for dissociable categorylearning systems
    • W. Todd Maddox, J. Vincent Filoteo, Kelli D. Hejl, and A. David Ing. Category number impacts rule-based but not information-integration category learning: Further evidence for dissociable categorylearning systems. J Exp Psychol Learn Mem Cogn, 30(1):227-245, 2004.
    • (2004) J Exp Psychol Learn Mem Cogn , vol.30 , Issue.1 , pp. 227-245
    • Maddox, W.T.1    Filoteo, J.V.2    Hejl, K.D.3    Ing, A.D.4
  • 11
    • 0031801210 scopus 로고    scopus 로고
    • Goal-directed instrumental action: Contingency and incentive learning and their cortical substrates
    • Bernard W. Balleine and Anthony Dickinson. Goal-directed instrumental action: contingency and incentive learning and their cortical substrates. Neuropharmacology, 37(4-5):407-419, 1998.
    • (1998) Neuropharmacology , vol.37 , Issue.4-5 , pp. 407-419
    • Balleine, B.W.1    Dickinson, A.2
  • 12
    • 0033213819 scopus 로고    scopus 로고
    • What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?
    • Kenji Doya. What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? Neural Netw, 12(7-8):961-974, 1999.
    • (1999) Neural Netw , vol.12 , Issue.7-8 , pp. 961-974
    • Doya, K.1
  • 13
    • 28044450875 scopus 로고    scopus 로고
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Nathaniel D. Daw, Yael Niv, and Peter Dayan. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat Neurosci, 8(12):1704-1711, 2005.
    • (2005) Nat Neurosci , vol.8 , Issue.12 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 15
    • 1942520195 scopus 로고    scopus 로고
    • Dissociable roles of ventral and dorsal striatum in instrumental conditioning
    • John P. O'Doherty, Peter Dayan, Johannes Schultz, Ralf Deichmann, Karl Friston, and Raymond J. Dolan. Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science, 304(5669):452-454, 2004.
    • (2004) Science , vol.304 , Issue.5669 , pp. 452-454
    • O'doherty, J.P.1    Dayan, P.2    Schultz, J.3    Deichmann, R.4    Friston, K.5    Dolan, R.J.6
  • 16
    • 27844567151 scopus 로고    scopus 로고
    • Hippocampal replay contributes to within session learning in a temporal difference reinforcement learning model
    • Adam Johnson and A. David Redish. Hippocampal replay contributes to within session learning in a temporal difference reinforcement learning model. Neural Netw, 18(9):1163-1171, 2005.
    • (2005) Neural Netw , vol.18 , Issue.9 , pp. 1163-1171
    • Johnson, A.1    Redish, A.D.2
  • 17
    • 85162020429 scopus 로고    scopus 로고
    • Hippocampal contributions to control: The third way
    • J. C. Platt, D. Koller, Y. Singer, and S. Roweis, editors. MIT Press, Cambridge, MA
    • Máté Lengyel and Peter Dayan. Hippocampal contributions to control: The third way. In J.C. Platt, D. Koller, Y. Singer, and S. Roweis, editors, Advances in Neural Information Processing Systems 20, pages 889-896. MIT Press, Cambridge, MA, 2008.
    • (2008) Advances in Neural Information Processing Systems , vol.20 , pp. 889-896
    • Lengyel, M.1    Dayan, P.2
  • 18
    • 79958143780 scopus 로고    scopus 로고
    • Speed/accuracy trade-off between the habitual and the goal-directed processes
    • Mehdi Keramati, Amir Dezfouli, and Payam Piray. Speed/accuracy trade-off between the habitual and the goal-directed processes. PLoS Comput Biol, 7(5):e1002055, 2011.
    • (2011) PLoS Comput Biol , vol.7 , Issue.5
    • Keramati, M.1    Dezfouli, A.2    Piray, P.3
  • 19
    • 84899026236 scopus 로고    scopus 로고
    • Finite-sample convergence rates for q-learning and indirect algorithms
    • Michael S. Kearns, Sara A. Solla, and David A. Cohn, editors11. MIT Press, Cambridge, MA
    • Michael Kearns and Satinder Singh. Finite-sample convergence rates for q-learning and indirect algorithms. In Michael S. Kearns, Sara A. Solla, and David A. Cohn, editors, Advances in Neural Information Processing Systems 11, volume 11, pages 996-1002. MIT Press, Cambridge, MA, 1999.
    • (1999) Advances in Neural Information Processing Systems , vol.11 , pp. 996-1002
    • Kearns, M.1    Singh, S.2
  • 20
    • 85024429815 scopus 로고
    • A new approach to linear filtering and prediction problems
    • R. E. Kalman. A new approach to linear filtering and prediction problems. J Basic Eng, 82(1):35-45, 1960.
    • (1960) J Basic Eng , vol.82 , Issue.1 , pp. 35-45
    • Kalman, R.E.1
  • 21
    • 79952746011 scopus 로고    scopus 로고
    • Model-based influences on humans' choices and striatal prediction errors
    • Nathaniel D Daw, S. J. Gershman, B. Seymour, P. Dayan, and R. J. Dolan. Model-based influences on humans' choices and striatal prediction errors. Neuron, 69(6):1204-1215, 2011.
    • (2011) Neuron , vol.69 , Issue.6 , pp. 1204-1215
    • Daw, N.D.1    Gershman, S.J.2    Seymour, B.3    Dayan, P.4    Dolan, R.J.5
  • 22
    • 33644782012 scopus 로고    scopus 로고
    • Dynamic response-by-response models of matching behavior in rhesus monkeys
    • Brian Lau and Paul W Glimcher. Dynamic response-by-response models of matching behavior in rhesus monkeys. J Exp Anal Behav, 84(3):555-579, 2005.
    • (2005) J Exp Anal Behav , vol.84 , Issue.3 , pp. 555-579
    • Lau, B.1    Glimcher, P.W.2
  • 24
    • 33749061026 scopus 로고    scopus 로고
    • Brain mechanism of reward prediction under predictable and unpredictable environmental dynamics
    • Saori C Tanaka, Kazuyuki Samejima, Go Okada, Kazutaka Ueda, Yasumasa Okamoto, Shigeto Yamawaki, and Kenji Doya. Brain mechanism of reward prediction under predictable and unpredictable environmental dynamics. Neural Netw, 19(8):1233-1241, 2006.
    • (2006) Neural Netw , vol.19 , Issue.8 , pp. 1233-1241
    • Tanaka, S.C.1    Samejima, K.2    Okada, G.3    Ueda, K.4    Okamoto, Y.5    Yamawaki, S.6    Doya, K.7


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.