SCOPUS 정보 검색 플랫폼

Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011

Volumn , Issue , 2011, Pages

Environmental statistics and the trade-off between model-based and TD learning in humans

(2) Simon, Dylan A a Daw, Nathaniel D a

a NEW YORK UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ECONOMIC AND SOCIAL EFFECTS; LEARNING SYSTEMS;

ENVIRONMENTAL STATISTICS; HIGH VOLATILITY; HIGH-LOW; MODEL FREE; MODEL-BASED OPC; MODELING-BASED LEARNING; RELATIVE PERFORMANCE; STATISTICAL EFFICIENCY; TD-LEARNING; TRADE OFF;

DECISION MAKING;

EID: 85162381627 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (22)

References (24)

1
- 84882523833
- Multiple forms of value learning and the function of dopamine
- Paul W. Glimcher, Colin F. Camerer, Ernst Fehr, and Russell A. Poldrack, editors, chapter 24. Academic Press, London
- Bernard W. Balleine, Nathaniel D. Daw, and John P. O'Doherty. Multiple forms of value learning and the function of dopamine. In Paul W. Glimcher, Colin F. Camerer, Ernst Fehr, and Russell A. Poldrack, editors, Neuroeconomics: Decision Making and the Brain, chapter 24, pages 367-387. Academic Press, London, 2008.
- (2008) Neuroeconomics: Decision Making and the Brain , pp. 367-387
- Balleine, B.W.¹ Daw, N.D.² O'doherty, J.P.³

2
- 27644568988
- Decision making, impulse control and loss of willpower to resist drugs: A neurocognitive perspective
- Antoine Bechara. Decision making, impulse control and loss of willpower to resist drugs: a neurocognitive perspective. Nat Neurosci, 8(11):1458-63, 2005.
- (2005) Nat Neurosci , vol.8 , Issue.11 , pp. 1458-1463
- Bechara, A.¹

3
- 0344625375
- The interaction of cognitive and stimulus-response processes in the control of behaviour
- Frederick Toates. The interaction of cognitive and stimulus-response processes in the control of behaviour. Neuroscience & Biobehavioral Reviews, 22(1):59-83, 1997.
- (1997) Neuroscience & Biobehavioral Reviews , vol.22 , Issue.1 , pp. 59-83
- Toates, F.¹

4
- 67349170462
- Goal-directed control and its antipodes
- Peter Dayan. Goal-directed control and its antipodes. Neural Netw, 22:213-219, 2009.
- (2009) Neural Netw , vol.22 , pp. 213-219
- Dayan, P.¹

5
- 0011627410
- Feedback and task predictability as determinants of performance in multiple cue probability learning tasks
- Neal Schmitt, Bryan W. Coyle, and Larry King. Feedback and task predictability as determinants of performance in multiple cue probability learning tasks. Organ Behav Hum Perform, 16(2):388-402, 1976.
- (1976) Organ Behav Hum Perform , vol.16 , Issue.2 , pp. 388-402
- Schmitt, N.¹ Coyle, B.W.² King, L.³

6
- 0001284732
- Task information and performance in probabilistic inference tasks
- Berndt Brehmer and Jan Kuylenstierna. Task information and performance in probabilistic inference tasks. Organ Behav Hum Perform, 22:445-464, 1978.
- (1978) Organ Behav Hum Perform , vol.22 , pp. 445-464
- Brehmer, B.¹ Kuylenstierna, J.²

7
- 0028467447
- Probabilistic classification learning in amnesia
- B J Knowlton, L R Squire, and M A Gluck. Probabilistic classification learning in amnesia. Learn Mem, 1(2):106-120, 1994.
- (1994) Learn Mem , vol.1 , Issue.2 , pp. 106-120
- Knowlton, B.J.¹ Squire, L.R.² Gluck, M.A.³

8
- 2442612549
- Dissociating explicit and procedural-learning based systems of perceptual category learning
- W. Todd Maddox and F. Gregory Ashby. Dissociating explicit and procedural-learning based systems of perceptual category learning. Behavioural Processes, 66(3):309-332, 2004.
- (2004) Behavioural Processes , vol.66 , Issue.3 , pp. 309-332
- Maddox, W.T.¹ Ashby, F.G.²

9
- 0942278855
- Category number impacts rule-based but not information-integration category learning: Further evidence for dissociable categorylearning systems
- W. Todd Maddox, J. Vincent Filoteo, Kelli D. Hejl, and A. David Ing. Category number impacts rule-based but not information-integration category learning: Further evidence for dissociable categorylearning systems. J Exp Psychol Learn Mem Cogn, 30(1):227-245, 2004.
- (2004) J Exp Psychol Learn Mem Cogn , vol.30 , Issue.1 , pp. 227-245
- Maddox, W.T.¹ Filoteo, J.V.² Hejl, K.D.³ Ing, A.D.⁴

10
- 0035969560
- Interactive memory systems in the human brain
- R. A. Poldrack, J. Clark, E. J. Paré-Blagoev, D. Shohamy, J. Creso Moyano, C. Myers, and M. A. Gluck. Interactive memory systems in the human brain. Nature, 414(6863):546-550, 2001.
- (2001) Nature , vol.414 , Issue.6863 , pp. 546-550
- Poldrack, R.A.¹ Clark, J.² Paré-Blagoev, E.J.³ Shohamy, D.⁴ Creso Moyano, J.⁵ Myers, C.⁶ Gluck, M.A.⁷

11
- 0031801210
- Goal-directed instrumental action: Contingency and incentive learning and their cortical substrates
- Bernard W. Balleine and Anthony Dickinson. Goal-directed instrumental action: contingency and incentive learning and their cortical substrates. Neuropharmacology, 37(4-5):407-419, 1998.
- (1998) Neuropharmacology , vol.37 , Issue.4-5 , pp. 407-419
- Balleine, B.W.¹ Dickinson, A.²

12
- 0033213819
- What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?
- Kenji Doya. What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? Neural Netw, 12(7-8):961-974, 1999.
- (1999) Neural Netw , vol.12 , Issue.7-8 , pp. 961-974
- Doya, K.¹

13
- 28044450875
- Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
- Nathaniel D. Daw, Yael Niv, and Peter Dayan. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat Neurosci, 8(12):1704-1711, 2005.
- (2005) Nat Neurosci , vol.8 , Issue.12 , pp. 1704-1711
- Daw, N.D.¹ Niv, Y.² Dayan, P.³

14
- 2942617032
- Temporal difference models describe higher-order learning in humans
- Ben Seymour, John P. O'Doherty, Peter Dayan, Martin Koltzenburg, Anthony K. Jones, Raymond J. Dolan, Karl J. Friston, and Richard S. Frackowiak. Temporal difference models describe higher-order learning in humans. Nature, 429(6992):664-667, 2004.
- (2004) Nature , vol.429 , Issue.6992 , pp. 664-667
- Seymour, B.¹ O'doherty, J.P.² Dayan, P.³ Koltzenburg, M.⁴ Jones, A.K.⁵ Dolan, R.J.⁶ Friston, K.J.⁷ Frackowiak, R.S.⁸

15
- 1942520195
- Dissociable roles of ventral and dorsal striatum in instrumental conditioning
- John P. O'Doherty, Peter Dayan, Johannes Schultz, Ralf Deichmann, Karl Friston, and Raymond J. Dolan. Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science, 304(5669):452-454, 2004.
- (2004) Science , vol.304 , Issue.5669 , pp. 452-454
- O'doherty, J.P.¹ Dayan, P.² Schultz, J.³ Deichmann, R.⁴ Friston, K.⁵ Dolan, R.J.⁶

16
- 27844567151
- Hippocampal replay contributes to within session learning in a temporal difference reinforcement learning model
- Adam Johnson and A. David Redish. Hippocampal replay contributes to within session learning in a temporal difference reinforcement learning model. Neural Netw, 18(9):1163-1171, 2005.
- (2005) Neural Netw , vol.18 , Issue.9 , pp. 1163-1171
- Johnson, A.¹ Redish, A.D.²

17
- 85162020429
- Hippocampal contributions to control: The third way
- J. C. Platt, D. Koller, Y. Singer, and S. Roweis, editors. MIT Press, Cambridge, MA
- Máté Lengyel and Peter Dayan. Hippocampal contributions to control: The third way. In J.C. Platt, D. Koller, Y. Singer, and S. Roweis, editors, Advances in Neural Information Processing Systems 20, pages 889-896. MIT Press, Cambridge, MA, 2008.
- (2008) Advances in Neural Information Processing Systems , vol.20 , pp. 889-896
- Lengyel, M.¹ Dayan, P.²

18
- 79958143780
- Speed/accuracy trade-off between the habitual and the goal-directed processes
- Mehdi Keramati, Amir Dezfouli, and Payam Piray. Speed/accuracy trade-off between the habitual and the goal-directed processes. PLoS Comput Biol, 7(5):e1002055, 2011.
- (2011) PLoS Comput Biol , vol.7 , Issue.5
- Keramati, M.¹ Dezfouli, A.² Piray, P.³

19
- 84899026236
- Finite-sample convergence rates for q-learning and indirect algorithms
- Michael S. Kearns, Sara A. Solla, and David A. Cohn, editors11. MIT Press, Cambridge, MA
- Michael Kearns and Satinder Singh. Finite-sample convergence rates for q-learning and indirect algorithms. In Michael S. Kearns, Sara A. Solla, and David A. Cohn, editors, Advances in Neural Information Processing Systems 11, volume 11, pages 996-1002. MIT Press, Cambridge, MA, 1999.
- (1999) Advances in Neural Information Processing Systems , vol.11 , pp. 996-1002
- Kearns, M.¹ Singh, S.²

20
- 85024429815
- A new approach to linear filtering and prediction problems
- R. E. Kalman. A new approach to linear filtering and prediction problems. J Basic Eng, 82(1):35-45, 1960.
- (1960) J Basic Eng , vol.82 , Issue.1 , pp. 35-45
- Kalman, R.E.¹

21
- 79952746011
- Model-based influences on humans' choices and striatal prediction errors
- Nathaniel D Daw, S. J. Gershman, B. Seymour, P. Dayan, and R. J. Dolan. Model-based influences on humans' choices and striatal prediction errors. Neuron, 69(6):1204-1215, 2011.
- (2011) Neuron , vol.69 , Issue.6 , pp. 1204-1215
- Daw, N.D.¹ Gershman, S.J.² Seymour, B.³ Dayan, P.⁴ Dolan, R.J.⁵

22
- 33644782012
- Dynamic response-by-response models of matching behavior in rhesus monkeys
- Brian Lau and Paul W Glimcher. Dynamic response-by-response models of matching behavior in rhesus monkeys. J Exp Anal Behav, 84(3):555-579, 2005.
- (2005) J Exp Anal Behav , vol.84 , Issue.3 , pp. 555-579
- Lau, B.¹ Glimcher, P.W.²

23
- 84897149362
- R package version 0.999375-39
- Douglas Bates, Martin Maechler, and Ben Bolker. lme4: Linear mixed-effects models using S4 classes, 2011. R package version 0.999375-39.
- (2011) Lme4: Linear Mixed-effects Models Using S4 Classes
- Bates, D.¹ Maechler, M.² Bolker, B.³

24
- 33749061026
- Brain mechanism of reward prediction under predictable and unpredictable environmental dynamics
- Saori C Tanaka, Kazuyuki Samejima, Go Okada, Kazutaka Ueda, Yasumasa Okamoto, Shigeto Yamawaki, and Kenji Doya. Brain mechanism of reward prediction under predictable and unpredictable environmental dynamics. Neural Netw, 19(8):1233-1241, 2006.
- (2006) Neural Netw , vol.19 , Issue.8 , pp. 1233-1241
- Tanaka, S.C.¹ Samejima, K.² Okada, G.³ Ueda, K.⁴ Okamoto, Y.⁵ Yamawaki, S.⁶ Doya, K.⁷

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.