SCOPUS 정보 검색 플랫폼

PLoS Computational Biology

Volumn 6, Issue 12, 2010, Pages

Structure learning in human sequential decision-making

(2) Acuña, Daniel E a Schrater, Paul a,b

a University of Minnesota (United States)

b UNIVERSITY OF MINNESOTA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BEHAVIORAL RESEARCH; REINFORCEMENT LEARNING;

BAYESIAN REINFORCEMENT LEARNING; DECISION TASK; GENERATIVE MODEL; HUMAN FACES; LEARNING AGENTS; LEARNING PROBLEM; SEQUENTIAL DECISION MAKING; SEQUENTIAL DECISIONS; STRUCTURE-LEARNING; SUB-OPTIMAL PERFORMANCE;

DECISION MAKING;

ARTICLE; BAYESIAN LEARNING; BEHAVIOR CHANGE; CONTROLLED STUDY; DECISION MAKING; EXPERIENTIAL LEARNING; EXPLORATORY BEHAVIOR; HUMAN; HUMAN EXPERIMENT; LEARNING ALGORITHM; LEARNING ENVIRONMENT; LEARNING STYLE; MATHEMATICAL COMPUTING; MATHEMATICAL MODEL; MENTAL TASK; QUALITATIVE ANALYSIS; QUANTITATIVE ANALYSIS; REINFORCEMENT; REWARD; TASK PERFORMANCE; ALGORITHM; BAYES THEOREM; LEARNING; PHYSIOLOGY; THEORETICAL MODEL;

ALGORITHMS; BAYES THEOREM; DECISION MAKING; HUMANS; LEARNING; MODELS, THEORETICAL; REWARD; TASK PERFORMANCE AND ANALYSIS;

EID: 78651226963 PISSN: 1553734X EISSN: 15537358 Source Type: Journal
DOI: 10.1371/journal.pcbi.1001003 Document Type: Article

Times cited : (39)

References (45)

1
- 0004870746
- A problem in the sequential design of experiments
- Bellman RE (1956) A problem in the sequential design of experiments. Sankhyā 16: 221-229.
- (1956) Sankhyā , vol.16 , pp. 221-229
- Bellman, R.E.¹

2
- 84891584370
- Chichester [West Sussex]; New York: Wiley
- Gittins JC (1989) Multi-armed bandit allocation indices. Chichester [West Sussex]; New York: Wiley.
- (1989) Multi-armed Bandit Allocation Indices
- Gittins, J.C.¹

3
- 0001043843
- Restless bandits: Activity allocation in a changing world
- Whittle P (1988) Restless bandits: activity allocation in a changing world. J Appl Probab 25: 287-298.
- (1988) J Appl Probab , vol.25 , pp. 287-298
- Whittle, P.¹

4
- 33745223257
- Cortical substrates for exploratory decisions in humans
- Daw ND, O'Doherty JP, Dayan P, Seymour B, Dolan RJ (2006) Cortical substrates for exploratory decisions in humans. Nature 441: 876-879.
- (2006) Nature , vol.441 , pp. 876-879
- Daw, N.D.¹ O'Doherty, J.P.² Dayan, P.³ Seymour, B.⁴ Dolan, R.J.⁵

5
- 84864921697
- Modeling human performance in restless bandits with particle filters
- Available
- Yi MS, Steyvers M, Lee M (2009) Modeling human performance in restless bandits with particle filters. The Journal of Problem Solving 2: Available: http://docs.lib.purdue.edu/jps/vol2/iss2/5/.
- (2009) The Journal of Problem Solving 2
- Yi, M.S.¹ Steyvers, M.² Lee, M.³

6
- 84858789760
- Sequential effects: Superstition or rational behavior?
- Cambridge, MA: MIT Press
- Yu AJ, Cohen JD (2009) Sequential effects: Superstition or rational behavior? In: Advances in Neural Information Processing Systems, 21. Cambridge, MA: MIT Press. pp 1873-1880.
- (2009) Advances in Neural Information Processing Systems, 21 , pp. 1873-1880
- Yu, A.J.¹ Cohen, J.D.²

7
- 57049112212
- When does reward maximization lead to matching law?
- Sakai Y, Fukai T (2008) When does reward maximization lead to matching law? PLoS One 3: e3795.
- (2008) PLoS One , vol.3
- Sakai, Y.¹ Fukai, T.²

8
- 37749023538
- The actor-critic learning is behind the matching law: Matching vs. optimal behaviors
- Sakai Y, Fukai T (2008) The actor-critic learning is behind the matching law: Matching vs. optimal behaviors. Neural Comput 20: 227-251.
- (2008) Neural Comput , vol.20 , pp. 227-251
- Sakai, Y.¹ Fukai, T.²

9
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling L, Littman M, Cassandra A (1998) Planning and acting in partially observable stochastic domains. Artif Intell 101: 99-134.
- (1998) Artif Intell , vol.101 , pp. 99-134
- Kaelbling, L.¹ Littman, M.² Cassandra, A.³

10
- 0031619316
- Bayesian Q-learning
- Dearden R, Friedman N, Russell S (1998) Bayesian Q-learning. In: Fifteenth National Conf. on Artificial Intelligence (AAAI). pp 761-768.
- (1998) Fifteenth National Conf. on Artificial Intelligence (AAAI) , pp. 761-768
- Dearden, R.¹ Friedman, N.² Russell, S.³

11
- 14344258433
- A bayesian framework for reinforcement learning
- Strens MJA (2000) A bayesian framework for reinforcement learning. In: Proceedings of the Seventeenth International Conference on Machine Learning Morgan Kaufmann Publishers Inc. pp 943-950.
- (2000) Proceedings of the Seventeenth International Conference on Machine Learning Morgan Kaufmann Publishers Inc , pp. 943-950
- Strens, M.J.A.¹

12
- 33749251297
- An analytic solution to discrete bayesian reinforcement learning
- Pittsburgh, Penn
- Poupart P, Vlassis N, Hoey J, Regan K (2006) An analytic solution to discrete bayesian reinforcement learning. In: 23rd International Conference on Machine Learning. Pittsburgh, Penn. pp 697-704.
- (2006) 23rd International Conference on Machine Learning , pp. 697-704
- Poupart, P.¹ Vlassis, N.² Hoey, J.³ Regan, K.⁴

13
- 84898969519
- Structure learning in human causal induction
- Cambridge, MA: MIT Press
- Tenenbaum JB, Griffiths TL (2001) Structure learning in human causal induction. In: Advances in Neural Information Processing Systems 13. Cambridge, MA: MIT Press. pp 59-65.
- (2001) Advances in Neural Information Processing Systems 13 , pp. 59-65
- Tenenbaum, J.B.¹ Griffiths, T.L.²

14
- 34249761849
- Learning bayesian networks: The combination of knowledge and statistical data
- Heckerman D, Geiger D, Chickering DM (1995) Learning bayesian networks: The combination of knowledge and statistical data. Mach Learn 20: 197-243.
- (1995) Mach Learn , vol.20 , pp. 197-243
- Heckerman, D.¹ Geiger, D.² Chickering, D.M.³

15
- 0344080086
- Upper Saddle River, NJ: Pearson Prentice Hall
- Neapolitan RE (2004) Learning Bayesian networks. Upper Saddle River, NJ: Pearson Prentice Hall.
- (2004) Learning Bayesian Networks
- Neapolitan, R.E.¹

16
- 33746260413
- Theory-based bayesian models of inductive learning and reasoning
- Tenenbaum JB, Griffiths TL, Kemp C (2006) Theory-based bayesian models of inductive learning and reasoning. Trends Cogn Sci 10: 309-318.
- (2006) Trends Cogn Sci , vol.10 , pp. 309-318
- Tenenbaum, J.B.¹ Griffiths, T.L.² Kemp, C.³

17
- 0003787146
- Princeton: Princeton University Press
- Bellman RE (1957) Dynamic programming. Princeton: Princeton University Press.
- (1957) Dynamic Programming
- Bellman, R.E.¹

18
- 0002955623
- A dynamic allocation index for the sequential design of experiments
- In: Gani J, Sarkadi K, Vincze I, eds., Amsterdam: North-Holland Pub. Co
- Gittins JC, Jones DM (1974) A dynamic allocation index for the sequential design of experiments. In: Gani J, Sarkadi K, Vincze I, eds. Progress in statistics. Amsterdam: North-Holland Pub. Co. pp 241-266.
- (1974) Progress in Statistics , pp. 241-266
- Gittins, J.C.¹ Jones, D.M.²

19
- 34249833101
- Technical note: Q-learning
- Watkins C, Dayan P (1992) Technical note: Q-learning. Mach Learn 8: 279-292.
- (1992) Mach Learn , vol.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

20
- 0004102479
- MIT Press
- Sutton RS, Barto AG (1998) Reinforcement learning: An introduction MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

21
- 0030896968
- A neural substrate of prediction and reward
- Schultz W, Dayan P, Montague P (1997) A neural substrate of prediction and reward. Science 275: 1593-1599.
- (1997) Science , vol.275 , pp. 1593-1599
- Schultz, W.¹ Dayan, P.² Montague, P.³

22
- 0031867046
- Predictive reward signal of dopamine neurons
- Schultz W (1998) Predictive reward signal of dopamine neurons. J Neurophysiol 80: 1-27.
- (1998) J Neurophysiol , vol.80 , pp. 1-27
- Schultz, W.¹

23
- 0004012196
- Chapman & Hall/CRC
- Gelman A, Carlin JB, Stern HS, Rubin DB (2003) Bayesian Data Analysis Chapman & Hall/CRC.
- (2003) Bayesian Data Analysis
- Gelman, A.¹ Carlin, J.B.² Stern, H.S.³ Rubin, D.B.⁴

24
- 0008803714
- Sequential choice under ambiguity: Intuitive solutions to the armed-bandit problem
- Meyer RJ, Shi Y (1995) Sequential choice under ambiguity: Intuitive solutions to the armed-bandit problem. Manage Sci 41: 817-834.
- (1995) Manage Sci , vol.41 , pp. 817-834
- Meyer, R.J.¹ Shi, Y.²

25
- 0031287072
- An experimental analysis of the bandit problem
- Banks J, Olson M, Porter D (1997) An experimental analysis of the bandit problem. Econ Theory 10: 55-77.
- (1997) Econ Theory , vol.10 , pp. 55-77
- Banks, J.¹ Olson, M.² Porter, D.³

26
- 27644576071
- Ph.D. thesis, California Institute of Technology, Pasadena, CA
- Anderson C (2001) Behavioral Models of Strategies in Multi-Armed Bandit Problems. Ph.D. thesis, California Institute of Technology, Pasadena, CA.
- (2001) Behavioral Models of Strategies in Multi-Armed Bandit Problems
- Anderson, C.¹

27
- 61549113484
- Simple models of discrete choice and their performance in bandit experiments
- Gans N, Knox G, Croson R (2007) Simple models of discrete choice and their performance in bandit experiments. Manuf Serv Oper Manag 9: 383-408.
- (2007) Manuf Serv Oper Manag , vol.9 , pp. 383-408
- Gans, N.¹ Knox, G.² Croson, R.³

28
- 0010186317
- Reward probability, amount, and information as determiners of sequential two-alternative decisions
- Edwards W (1956) Reward probability, amount, and information as determiners of sequential two-alternative decisions. J Exp Psychol 52: 177-88.
- (1956) J Exp Psychol , vol.52 , pp. 177-188
- Edwards, W.¹

29
- 0001515225
- Probability learning in 1000 trials
- Edwards W (1961) Probability learning in 1000 trials. J Exp Psychol 62: 385-394.
- (1961) J Exp Psychol , vol.62 , pp. 385-394
- Edwards, W.¹

30
- 0342748193
- Supplementary report: The utility of correctly predicting infrequent events
- Brackbill Y, Bravos A (1962) Supplementary report: The utility of correctly predicting infrequent events. J Exp Psychol 64: 648-649.
- (1962) J Exp Psychol , vol.64 , pp. 648-649
- Brackbill, Y.¹ Bravos, A.²

31
- 26744464265
- Ph.D. Dissertation. Chapel Hill, NC: University of North Carolina, Chapel Hill
- Horowitz AD (1973) Experimental Study of the Two-Armed Bandit Problem. Ph.D. Dissertation. Chapel Hill, NC: University of North Carolina, Chapel Hill.
- (1973) Experimental Study of the Two-Armed Bandit Problem
- Horowitz, A.D.¹

32
- 77952541839
- Learning latent structure: Carving nature at its joints
- Gershman SJ, Niv Y (2010) Learning latent structure: carving nature at its joints. Curr Opin Neurobiol 20: 251-256.
- (2010) Curr Opin Neurobiol , vol.20 , pp. 251-256
- Gershman, S.J.¹ Niv, Y.²

33
- 84898993037
- Model uncertainty in classical conditioning
- Cambridge, MA: MIT Press
- Courville AC, Daw ND, Gordon GJ, Touretzky DS (2004) Model uncertainty in classical conditioning. In: Advances in Neural Information Processing Systems 16. Cambridge, MA: MIT Press. pp 977-986.
- (2004) Advances in Neural Information Processing Systems 16 , pp. 977-986
- Courville, A.C.¹ Daw, N.D.² Gordon, G.J.³ Touretzky, D.S.⁴

34
- 70350336831
- Structure learning in action
- Braun DA, Mehring C, Wolpert DM (2009) Structure learning in action. Behav Brain Res 206: 157-165.
- (2009) Behav Brain Res , vol.206 , pp. 157-165
- Braun, D.A.¹ Mehring, C.² Wolpert, D.M.³

35
- 77951576301
- Bayesian modeling of human sequential decisionmaking on the multi-armed bandit problem
- In: Sloutsky V, Love B, McRae K, eds., AustinTX: Cognitive Science Society
- Acuna D, Schrater P (2008) Bayesian modeling of human sequential decisionmaking on the multi-armed bandit problem. In: Sloutsky V, Love B, McRae K, eds. 30th Annual Conference of the Cognitive Science Society. AustinTX: Cognitive Science Society. pp 2065-2070.
- (2008) 30th Annual Conference of the Cognitive Science Society , pp. 2065-2070
- Acuna, D.¹ Schrater, P.²

36
- 33745910265
- A hierarchical Bayesian model of human decision-making on an optimal stopping problem
- Lee MD (2006) A hierarchical Bayesian model of human decision-making on an optimal stopping problem. Cogn Sci 30: 1-26.
- (2006) Cogn Sci , vol.30 , pp. 1-26
- Lee, M.D.¹

37
- 34548295327
- Learning the value of information in an uncertain world
- Behrens TEJ, Woolrich MW, Walton ME, Rushworth MFS (2007) Learning the value of information in an uncertain world. Nat Neurosci 10: 1214-1221.
- (2007) Nat Neurosci , vol.10 , pp. 1214-1221
- Behrens, T.E.J.¹ Woolrich, M.W.² Walton, M.E.³ Rushworth, M.F.S.⁴

38
- 84864032307
- Prediction and change detection
- Steyvers M, Brown S (2006) Prediction and change detection. In: NIPS 2006. pp 1281-1288.
- (2006) NIPS 2006 , pp. 1281-1288
- Steyvers, M.¹ Brown, S.²

39
- 0004309536
- Wiley New York
- Anderson J (2000) Learning and memory. Wiley New York.
- (2000) Learning and Memory
- Anderson, J.¹

40
- 0038829878
- Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria
- Erev I, Roth AE (1998) Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria. Am Econ Rev 88: 848-881.
- (1998) Am Econ Rev , vol.88 , pp. 848-881
- Erev, I.¹ Roth, A.E.²

41
- 33646230819
- Dopamine, prediction error and associative learning: A model-based account
- Smith A, Li M, Becker S, Kapur S (2006) Dopamine, prediction error and associative learning: A model-based account. Network 17: 61-84.
- (2006) Network , vol.17 , pp. 61-84
- Smith, A.¹ Li, M.² Becker, S.³ Kapur, S.⁴

42
- 40849087850
- Integrating hippocampus and striatum in decision-making
- Johnson A, van der Meer M, Redish A (2007) Integrating hippocampus and striatum in decision-making. Curr Opin Neurobiol 17: 692-697.
- (2007) Curr Opin Neurobiol , vol.17 , pp. 692-697
- Johnson, A.¹ van der Meer, M.² Redish, A.³

43
- 67349268975
- A bayesian analysis of human decision-making on bandit problems
- Steyvers M, Lee MD, Wagenmakers E (2009) A bayesian analysis of human decision-making on bandit problems. J Math Psychol 53: 168-179.
- (2009) J Math Psychol , vol.53 , pp. 168-179
- Steyvers, M.¹ Lee, M.D.² Wagenmakers, E.³

44
- 78651233055
- Cambridge, MA: MIT Press
- Howard R (1960) Dynamic Programming. Cambridge, MA: MIT Press.
- (1960) Dynamic Programming
- Howard, R.¹

45
- 0004003001
- Academic Press
- Fel'dbaum A (1965) Optimal Control Systems Academic Press.
- (1965) Optimal Control Systems
- Fel'dbaum, A.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.